Training loss and validation loss

emmanuel_Obolo · July 13, 2023, 10:05am

Dear Community I am working on landmark classification problem but during my training of using a convolutional neural network from scratch by using about 8 layers of CNN in the backbone and 4 layers of MLP in the head i.e 3 hidden layers and 1 output layer during my training my training loss drops till it get stuck at 3.9 my validation loss starts from 3.9 and get stucked there I am using sgd optimizer but it seems that i got trapped at a local minimum any ideas to improve this network

balaji.ambresh · July 13, 2023, 10:07am

Please check your loss function and number of neurons in the output layer.
Do check deep learning specialization for lectures on optimizers and the related assignment.

emmanuel_Obolo · July 13, 2023, 10:12am

ok I used nn.CrossEntropyloss and I know it does a softmax first before doing the loss so I made my network not calculate a softmax in the output layer of the MLP only do the Linear sum and this is not from any of the speciialization course It is a self project sir

balaji.ambresh · July 13, 2023, 10:44am

No worries. Tips of model optimization and ways to approach data augmentation are applicable to the problem and is independent pytorch or tensorflow.

You are correct about the model requiring to predict just logits for CrossEntropyLoss.

Unfortunately, it’s impossible for me to repeat courses 2 and 3 of deep learning specialization. Please see the lectures for pointers on dealing with bias and variance issues.

emmanuel_Obolo · July 13, 2023, 10:57am

so what do you suggest sir
that could help me in my problem

balaji.ambresh · July 13, 2023, 11:49am

I already gave you my sugggestion:

Topic		Replies	Views
Help concerning the 8th exercise Neural Networks and Deep Learning coursera-platform	1	505	July 13, 2023
Convergence Problem while training Convolutional Neural Networks coursera-platform	1	525	August 3, 2021
Loss values do not change AI Discussions	11	62	May 4, 2023
Assignment: model does not coverge Advanced Computer Vision with TensorFlow week-module-4	2	549	December 17, 2021
How to improve Train Accuracy Convolutional Neural Networks in TensorFlow week-module-1	7	734	February 13, 2022

Training loss and validation loss

Related topics