Hey!
I have trained a neural network with the TensorFlow framework on the well-known fashion_mnist dataset. I use RMSProp as the optimizer, and I also tried the Adam optimizer, but every time the cross-validation error keeps oscillating and, on average, it increases.
Any idea where I might be going wrong, or what improvements I can make?
It would shed more light on the problem to know how your training error is behaving while your validation error is oscillating or increasing. But it sounds like this may be a form of overfitting, so regularization might be one other thing to try.
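In case it helps, here is a minimal sketch of what adding regularization could look like in Keras. The layer sizes, dropout rate, and L2 factor below are illustrative guesses, not tuned values:

```python
import tensorflow as tf

# Illustrative model for fashion_mnist (28x28 grayscale images, 10 classes).
# The dropout rate (0.3) and L2 factor (1e-4) are starting points to tune,
# not recommendations.
model = tf.keras.Sequential([
    tf.keras.Input(shape=(28, 28)),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(
        128, activation="relu",
        kernel_regularizer=tf.keras.regularizers.l2(1e-4)),
    tf.keras.layers.Dropout(0.3),  # randomly zero 30% of activations in training
    tf.keras.layers.Dense(10, activation="softmax"),
])
model.compile(optimizer="rmsprop",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
```

Both dropout and the L2 penalty discourage the network from memorizing the training set, which is one common way to reduce the gap between training and validation error.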
Hey!
I am curious to know how you figured out that regularization would solve this issue. As far as I know, we use regularization when our model overfits the data, which results in poor generalization…
Hi,
I think, looking at your graph, we can’t call it overfitting yet.
I advise you to increase the batch size first.
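For what it’s worth, batch size is just an argument to `model.fit`. Here is a self-contained sketch using synthetic stand-in data with fashion_mnist’s shape; the tiny model and the value 128 are purely illustrative:

```python
import numpy as np
import tensorflow as tf

# Synthetic stand-in data with fashion_mnist's shape, just to show the knob;
# with the real dataset you would pass your actual training arrays instead.
x = np.random.rand(512, 28, 28).astype("float32")
y = np.random.randint(0, 10, size=512)

model = tf.keras.Sequential([
    tf.keras.Input(shape=(28, 28)),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(10, activation="softmax"),
])
model.compile(optimizer="rmsprop", loss="sparse_categorical_crossentropy")

# A larger batch averages the gradient over more examples per update, which
# usually smooths the loss curves; 128 is an illustrative value, not a rule.
history = model.fit(x, y, batch_size=128, epochs=1, verbose=0)
```

The intuition is that each update is computed from a larger sample, so the gradient estimate is less noisy and the loss curves tend to oscillate less.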
Thank you.
I have not tried to train a model on that particular dataset, but take a look at how the accuracy evolves after 8 epochs (or 8 thousand, or whatever that number means on your graphs): the training accuracy is much higher than the validation accuracy. That’s the definition of overfitting, isn’t it? Now there are two interesting follow-up questions:
- The question you asked above: does the strange divergence of the validation accuracy between epochs 3 and 5 and between 8 and 10 mean something pathological is going on?
- What do we do about the overfitting?
Maybe an easier question is: what happens if you continue for a few more epochs? Maybe you get another spell where the training and validation accuracy converge, as they did between epochs 5 and 7.
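One way to keep training for more epochs without risk is an `EarlyStopping` callback with some patience, so training stops only if the validation metric keeps failing to improve. The patience value here is illustrative:

```python
import tensorflow as tf

# Stop only after val_accuracy has failed to improve for 5 consecutive
# epochs, and roll back to the weights of the best epoch seen; patience=5
# is an illustrative value.
early_stop = tf.keras.callbacks.EarlyStopping(
    monitor="val_accuracy", mode="max", patience=5,
    restore_best_weights=True)
# Pass it via model.fit(..., callbacks=[early_stop]) and you can safely
# run many more epochs.
```

That way a temporary dip like the one between epochs 8 and 10 doesn’t cost you anything: if accuracy recovers, training continues; if not, you keep the best weights.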
This is an experimental science. I don’t claim to know the answer to question 1 as a general matter. Do you get similar behavior with both RMSprop and Adam? Have you tried any other model architectures?
I would not worry too much about the dips in validation accuracy, given the vertical axis scaling. A couple of percent variation is not likely significant.
But it’s difficult to say without knowing the size of the data sets.
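As a rough sanity check on that point: accuracy measured on n examples behaves like a binomial proportion, so its standard error is about sqrt(p(1−p)/n). With fashion_mnist’s standard 10,000-image test split and an assumed ~90% accuracy:

```python
import math

# Standard error of an accuracy estimate from n validation examples,
# modeling each prediction as an independent Bernoulli trial.
def accuracy_std_error(p, n):
    return math.sqrt(p * (1 - p) / n)

# fashion_mnist's standard test split has 10,000 images; at ~90% accuracy:
se = accuracy_std_error(0.90, 10_000)
print(f"std error: {se:.3f}, ±2 sigma: ±{2 * se:.1%}")  # std error: 0.003, ±2 sigma: ±0.6%
```

So on a 10,000-example validation set, swings of well under a percent are expected noise; on a much smaller set the band is correspondingly wider.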
Looking at those curves, if this happens only at a few particular epochs and then stabilizes, I would agree with Tom above that it’s not a major issue. Why? Because the validation set includes data the model has not seen, and when a lot of those data are concentrated in a particular pass, it’s natural to see a drop in performance for that pass. Of course, for a model to be robust it needs regularization as well, as Paul suggests.