Question about dropout regularization

This is an interesting question that has come up before. In the limit of Stochastic GD, there is exactly one sample in each minibatch, so each sample is trivially handled differently w.r.t. dropout. The way Prof Ng has us implement it preserves that behavior even when the batch size is greater than 1: the dropout mask has the same shape as the activation matrix, so each sample (each column) gets its own independently drawn mask. Here's an earlier thread which discusses this point and even shows some experimental results with both methods.
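To make the difference concrete, here's a minimal NumPy sketch (my own illustration, not code from the course notebook or that thread) contrasting the per-sample mask with the alternative of sharing one mask across the whole minibatch:

```python
import numpy as np

rng = np.random.default_rng(0)
keep_prob = 0.8

# Activations for one layer: 4 hidden units, minibatch of 5 samples.
A = rng.standard_normal((4, 5))

# Course-style mask: same shape as A, so each column (sample) draws
# its own dropout pattern, just as it would with batch size 1.
D_per_sample = (rng.random(A.shape) < keep_prob).astype(A.dtype)
A_dropped = (A * D_per_sample) / keep_prob  # inverted dropout scaling

# The alternative being discussed: one mask column broadcast across
# the minibatch, so every sample drops the same hidden units.
D_shared = (rng.random((A.shape[0], 1)) < keep_prob).astype(A.dtype)
A_dropped_shared = (A * D_shared) / keep_prob
```

With the per-sample mask, the dropout noise is independent across the columns of the minibatch, which is exactly the Stochastic GD behavior described above.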