Hi all, I have a question regarding the frequency with which dropout is applied during forward and backward propagation. In lectures, I was under the impression that dropout is applied to every training example separately. In other words, there’s a random selection (pattern) of the network for eve…

Dropout Frequency

nramon June 28, 2021, 9:51am 2

Your understanding seems correct to me. For each training example in a mini-batch, you sample a thinned network by dropping out units (source).

If that were not the case, then instead of an Nl x m mask matrix (Nl units in layer l, m examples in the mini-batch) you’d sample an Nl x 1 vector and broadcast it along the m dimension, right?
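To make the contrast concrete, here is a minimal NumPy sketch of the two masking schemes. The layer size, batch size, `keep_prob` value, and variable names are all assumptions chosen for illustration, not the assignment's actual code. It uses inverted dropout, dividing by `keep_prob` so the expected activation stays the same.

```python
import numpy as np

rng = np.random.default_rng(0)

n_l, m = 4, 5      # assumed: 4 units in layer l, 5 examples in the mini-batch
keep_prob = 0.8    # assumed keep probability

# Stand-in activations for layer l, one column per training example.
A = rng.standard_normal((n_l, m))

# Per-example dropout: an independent keep/drop decision for every unit
# and every example, so each column gets its own thinned network.
D_per_example = rng.random((n_l, m)) < keep_prob
A_per_example = A * D_per_example / keep_prob

# Shared pattern (the alternative): sample one n_l x 1 column and let
# broadcasting reuse it for all m examples.
d_shared = rng.random((n_l, 1)) < keep_prob
D_shared = np.broadcast_to(d_shared, (n_l, m))
A_shared = A * d_shared / keep_prob
```

In the first scheme the columns of the mask are sampled independently, so they will generally differ from one example to the next. In the second, every column of `D_shared` is identical by construction, which is exactly the "same pattern for the whole mini-batch" case.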

I hope you’re enjoying the course.

