CSVLogger is setting `val_*` metrics to `nan` despite no validation data being provided. #21025

innat · 2025-03-13T17:15:56Z

Basically two issue.

import tensorflow as tf

import os
import numpy as np
import keras
from keras import layers
from keras import ops
keras.__version__ # 3.8.0

inputs = keras.Input(shape=(784,), name="digits")
x = layers.Dense(64, activation="relu", name="dense_1")(inputs)
x = layers.Dense(64, activation="relu", name="dense_2")(x)
outputs = layers.Dense(10, activation="softmax", name="predictions")(x)
model = keras.Model(inputs=inputs, outputs=outputs)

(x_train, y_train), (x_test, y_test) = keras.datasets.mnist.load_data()
# Preprocess the data (these are NumPy arrays)
x_train = x_train.reshape(60000, 784).astype("float32") / 255
x_test = x_test.reshape(10000, 784).astype("float32") / 255
y_train = y_train.astype("float32")
y_test = y_test.astype("float32")
x_val = x_train[-10000:]
y_val = y_train[-10000:]
x_train = x_train[:-10000]
y_train = y_train[:-10000]

model.compile(
    optimizer=keras.optimizers.RMSprop(),  # Optimizer
    # Loss function to minimize
    loss=keras.losses.SparseCategoricalCrossentropy(),
    # List of metrics to monitor
    metrics=[keras.metrics.SparseCategoricalAccuracy()],
)

print("Fit model on training data")
csv_logger = keras.callbacks.CSVLogger('training.csv')

history = model.fit(
    x_train,
    y_train,
    batch_size=64,
    epochs=2,
    callbacks=[csv_logger]
)
import pandas as pd
history = pd.read_csv('training.csv')
history.head()

The recorded scores for training data in csv file are also wrong.

Fit model on training data
Epoch 1/2
782/782 ━━━━━━━━━━━━━━━━━━━━ 3s 3ms/step - loss: 0.5810 - sparse_categorical_accuracy: 0.8389
Epoch 2/2
782/782 ━━━━━━━━━━━━━━━━━━━━ 2s 3ms/step - loss: 0.1661 - sparse_categorical_accuracy: 0.9494

Training log says, for first epoch acc: 0.83 but in csv, it is .90. Also with the loss scores.

The text was updated successfully, but these errors were encountered:

dhantule · 2025-03-17T11:20:24Z

Hi @innat, Thanks for reporting this.

The mismatch in scores could be because the scores in the training log are updated after each batch.

history.history scores match the scores in the CSV, the loss and metrics you get from history.history are averages over epoch. I've provided validation data and run your code in this gist and it seems to work.

github-actions · 2025-04-01T02:14:48Z

This issue is stale because it has been open for 14 days with no activity. It will be closed if no further activity occurs. Thank you.

github-actions bot assigned sachinprasadhs Mar 13, 2025

dhantule added the type:Bug label Mar 17, 2025

dhantule added the stat:awaiting response from contributor label Mar 17, 2025

github-actions bot added the stale label Apr 1, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CSVLogger is setting `val_*` metrics to `nan` despite no validation data being provided. #21025

CSVLogger is setting `val_*` metrics to `nan` despite no validation data being provided. #21025

innat commented Mar 13, 2025

dhantule commented Mar 17, 2025

github-actions bot commented Apr 1, 2025

CSVLogger is setting val_* metrics to nan despite no validation data being provided. #21025

CSVLogger is setting val_* metrics to nan despite no validation data being provided. #21025

Comments

innat commented Mar 13, 2025

dhantule commented Mar 17, 2025

github-actions bot commented Apr 1, 2025

CSVLogger is setting `val_*` metrics to `nan` despite no validation data being provided. #21025

CSVLogger is setting `val_*` metrics to `nan` despite no validation data being provided. #21025