Course 2 Week 1 accuracy barely 90%

Ioana_T · November 22, 2022, 10:15am

Hello, I’m struggling to reach 95% training accuracy. The validation accuracy is always above 80% and seems to increase 1:1 with the training accuracy. I tried many architectures: 3 Conv layers, 5 Conv layers, more Dense layers, or a single Dense layer.

I’m using the Adam optimizer with a batch size of 32 (also tried 64) for both the training and validation generators. I’ve also used augmentation parameters to improve the accuracy, but I’m stuck at a maximum of 90% training accuracy.

Could someone take a look at my notebook, please? Thanks.

balaji.ambresh · November 22, 2022, 10:33am

Batch size of 32 is a good setting if you don’t know how to tune the learning rate based on batch size. See this paper for more details.
As far as the conv layers are concerned, check that you are:

Following every conv2d layer with a maxpool2d layer
Increasing the number of conv2d filters as you go deeper in the network (as powers of 2 with 8 being the fewest number of filters in the network).

Please click my name and send me your notebook as a message attachment.

Jean-Pierre_Gergie · November 23, 2022, 12:05pm

I am facing the same issue. Did you find a solution?

Ioana_T · November 23, 2022, 12:41pm

Not yet. I’m still trying out architectures, and I’ve increased the epochs to 30 to give it more time to train.

balaji.ambresh · November 23, 2022, 1:19pm

@Ioana_T

There’s no need to shuffle or augment the validation set. Leaving it as is works fine, since we want the measurement to be the same across different training epochs.
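A minimal sketch of that split, assuming the `ImageDataGenerator` workflow used in the course notebooks. The directory arguments, the 150x150 target size, the binary class mode, and every augmentation value below are illustrative assumptions, not settings taken from the notebook under discussion.

```python
# Augmentation settings apply to the training data only (values are illustrative).
TRAIN_KWARGS = dict(
    rescale=1.0 / 255,
    rotation_range=40,
    width_shift_range=0.2,
    height_shift_range=0.2,
    shear_range=0.2,
    zoom_range=0.2,
    horizontal_flip=True,
    fill_mode="nearest",
)

# Validation images are only rescaled, so every epoch is scored on the same inputs.
VAL_KWARGS = dict(rescale=1.0 / 255)


def make_generators(train_dir, val_dir, batch_size=32, target_size=(150, 150)):
    """Return (train_generator, validation_generator) for two image folders."""
    # Imported here so the settings above can be inspected without TensorFlow.
    from tensorflow.keras.preprocessing.image import ImageDataGenerator

    train_gen = ImageDataGenerator(**TRAIN_KWARGS).flow_from_directory(
        train_dir,
        target_size=target_size,
        batch_size=batch_size,
        class_mode="binary",  # assumes a two-class dataset
        shuffle=True,
    )
    val_gen = ImageDataGenerator(**VAL_KWARGS).flow_from_directory(
        val_dir,
        target_size=target_size,
        batch_size=batch_size,
        class_mode="binary",
        shuffle=False,  # fixed order, no augmentation
    )
    return train_gen, val_gen
```

Both generators share the same `rescale` factor, so the only difference between them is the augmentation and the shuffling.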

Regarding your architecture search, start with, say, 32 filters and then gradually increase the number of filters (in your writeup, you’ve used 18). While there is no formula to pick the number of filters, powers of 2 that are >= 8 usually work well. As far as the number of Dense layers is concerned, you can have more than 1 Dense layer as well.
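One way to read this advice is sketched below. The function names, the 150x150x3 input shape, the 3x3 kernels, the 512-unit Dense layer, and the single sigmoid output (which assumes a two-class problem) are all assumptions made for illustration. The power-of-2 check encodes the rule of thumb from this thread and is not a Keras requirement.

```python
def filter_schedule(first=32, blocks=3):
    """Filter counts that double with each block, e.g. 32, 64, 128."""
    # Rule of thumb from the thread: a power of 2 that is >= 8.
    if first < 8 or first & (first - 1):
        raise ValueError("first should be a power of 2 that is >= 8")
    return [first * 2**i for i in range(blocks)]


def build_model(input_shape=(150, 150, 3), first=32, blocks=3, dense_units=(512,)):
    """Stack Conv2D + MaxPooling2D blocks, then one or more Dense layers."""
    import tensorflow as tf  # imported lazily so filter_schedule works without it

    layers = [tf.keras.Input(shape=input_shape)]
    for filters in filter_schedule(first, blocks):
        # Every conv layer is followed by a max-pooling layer.
        layers.append(tf.keras.layers.Conv2D(filters, (3, 3), activation="relu"))
        layers.append(tf.keras.layers.MaxPooling2D(2, 2))
    layers.append(tf.keras.layers.Flatten())
    for units in dense_units:
        layers.append(tf.keras.layers.Dense(units, activation="relu"))
    # Single sigmoid unit: assumes binary classification.
    layers.append(tf.keras.layers.Dense(1, activation="sigmoid"))
    return tf.keras.Sequential(layers)
```

Passing `dense_units=(512, 128)` would give two hidden Dense layers instead of one, and `blocks=5` would give the deeper variant mentioned earlier in the thread.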

Jozsef_Vass · December 6, 2022, 10:23am

If you made the model overly complex, reduce the learning rate to 0.0001 (Adam in my case). The gradients are very sensitive if you stack up a lot of convolution and dense layers. This way I achieved 99.3% training accuracy and 85% validation accuracy. It is kind of forcing the model to overfit extremely…

balaji.ambresh · December 6, 2022, 10:48am

What’s your batch size?

Jozsef_Vass · December 6, 2022, 11:32am

I used 100 for training and 10 for validation. But the LR made the most difference for me.

balaji.ambresh · December 6, 2022, 12:10pm

Your learning rate of 1e-4 looks small for such a batch size. The default learning rate for Adam is 1e-3. This setting goes well with a batch size of ~32. A bigger batch size can get better results from a higher learning rate. You might want to try learning rates like 2e-3, 3e-3, etc.
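As a rough starting point for such a search, the sketch below scales the learning rate in proportion to the batch size. This linear scaling heuristic is an assumption layered on top of the advice above, not something the post specifies, and the helper name is hypothetical. The values it produces are candidates to try, not guaranteed improvements.

```python
def scaled_learning_rate(batch_size, base_lr=1e-3, base_batch_size=32):
    """Scale the learning rate linearly with the batch size.

    The defaults pair Adam's default rate (1e-3) with a batch size of 32.
    """
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    return base_lr * batch_size / base_batch_size


# Candidate learning rates for the batch sizes mentioned in this thread.
for bs in (32, 64, 100):
    print(bs, scaled_learning_rate(bs))
```

The result can be passed to the optimizer, for example `tf.keras.optimizers.Adam(learning_rate=scaled_learning_rate(100))`.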
