Regularization Dropout Programming Assignment: How to interpret it when test accuracy is higher than training accuracy

Hi,

I just finished the regularization programming assignment. One interesting thing is that when using dropout, the training accuracy is ~92% while the test accuracy is ~95%. My naive gut intuition is that test accuracy should theoretically be <= train accuracy. If test accuracy > train accuracy, it might just be a lucky train/test split where the training set happens to contain more noise. Is that intuition correct? How do I interpret it when test accuracy > train accuracy?

Welcome any thoughts and discussion!

Yeah, I think your intuition is correct. It's just that the test split happens to contain a lot of familiar, easy data of the kind seen in the training phase.


If the train and test data come from the same distribution, this behavior becomes increasingly unlikely as both data sets grow. Put the other way around: the smaller a data set, the more its measured accuracy can deviate from the true mean, which can occasionally push test accuracy above train accuracy. You can systematically reshuffle the train/test split and see whether the result persists.
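A minimal sketch of that reshuffling check (I'm using a toy make_moons dataset and logistic regression as hypothetical stand-ins for the assignment's data and model):

```python
import numpy as np
from sklearn.datasets import make_moons
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Stand-in data; swap in the assignment's X, y here.
X, y = make_moons(n_samples=300, noise=0.2, random_state=0)

gaps = []
for seed in range(50):
    # Reshuffle the train/test split with a different seed each time.
    X_tr, X_te, y_tr, y_te = train_test_split(
        X, y, test_size=0.2, random_state=seed
    )
    clf = LogisticRegression().fit(X_tr, y_tr)
    gaps.append(clf.score(X_te, y_te) - clf.score(X_tr, y_tr))

gaps = np.array(gaps)
# If test > train were just a lucky split, the gap should be
# positive only occasionally, not systematically.
print(f"mean gap: {gaps.mean():+.3f}, "
      f"splits with test > train: {(gaps > 0).mean():.0%}")
```

If the gap stays positive across most reshuffles, a lucky split is probably not the whole story.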

However, a different thought, though that is only my guess: strictly speaking, because of dropout, the network used during training is, on average, smaller than the one used at test time, since each forward pass drops a fraction of the units. So it might not be THAT surprising that the larger test-time network performs better, as it uses many more units than the thinned one seen during training. Even though training finds the correct weights for all of the units, the thinned sub-networks sampled by dropout will, on average, perform worse, even on the training data, than the full network used at test time.
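To make that train/test asymmetry concrete, here is a minimal sketch of inverted dropout (as I understand the assignment implements it); the keep_prob name follows the course convention, and the shapes are just illustrative:

```python
import numpy as np

np.random.seed(1)

def dropout_forward(a, keep_prob, train):
    # Inverted dropout applied to a layer's activations `a`.
    if not train:
        return a  # test time: the full network, every unit active
    mask = np.random.rand(*a.shape) < keep_prob
    # Scale the surviving units up by 1/keep_prob so the expected
    # activation at training time matches the test-time activation.
    return a * mask / keep_prob

a = np.random.randn(4, 3)
print(dropout_forward(a, keep_prob=0.8, train=True))   # thinned sub-network
print(dropout_forward(a, keep_prob=0.8, train=False))  # full network
```

So any train accuracy measured with train=True is the accuracy of randomly thinned sub-networks, while test accuracy is that of the full network.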

If anything, worse performance on the training set might be an indicator that the data are not being overfitted, and therefore a sign of the quality of the NN. After all, the size reduction from dropout and overfitting have opposite effects on training performance, no?
