An issue with the dropout regularization implementation in the Week 1 lecture

It’s great that the discussion was useful. Thanks for confirming!

Another follow-up question that has come up before is whether the fact that dropout works in a given case implies that there is a smaller network we could have started with and trained without dropout, and that would have achieved the same “Goldilocks” balance between fitting the training data well and still generalizing to the test data. I don’t definitively know the answer, but it seems likely that this is true. The practical problem is that searching for that smaller architecture is far more expensive than simply applying dropout or another form of regularization to the larger network. Here’s a thread from a while ago that discusses this point in more detail.
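
For concreteness, here is a minimal sketch of how inverted dropout is typically implemented in a forward pass with numpy. The function name, the `keep_prob` value, and the layer shapes are illustrative assumptions on my part, not the exact code from the course notebook:

```python
import numpy as np

def forward_with_dropout(a_prev, W, b, keep_prob=0.8):
    """One hidden-layer forward step with inverted dropout (illustrative sketch).

    a_prev    : activations from the previous layer, shape (n_prev, m)
    W, b      : layer parameters
    keep_prob : probability of keeping each unit (example value, not prescribed)
    """
    z = W @ a_prev + b
    a = np.maximum(0, z)  # ReLU activation

    # Inverted dropout: zero out each unit with probability (1 - keep_prob),
    # then scale the survivors by 1/keep_prob so the expected activation
    # is unchanged.
    d = (np.random.rand(*a.shape) < keep_prob).astype(a.dtype)
    a = (a * d) / keep_prob
    return a, d  # the mask d is kept so backprop can zero the same units

# Hypothetical usage with small random shapes
a_prev = np.random.randn(4, 5)   # 4 units, 5 examples
W = np.random.randn(3, 4)
b = np.zeros((3, 1))
a, mask = forward_with_dropout(a_prev, W, b, keep_prob=0.8)
```

At test time the mask is simply omitted; because of the 1/keep_prob scaling during training, no extra rescaling is needed at inference.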