During the Week 1 lecture series, where Professor Andrew introduces the concept of regularization, he talks about using “Dropout” as a regularization technique. Now, while implementing dropout at any layer “l”, we first calculate a dropout matrix for that layer and then multiply it with A[l] (activ…

Doubt about the implementation of inverted dropout


nramon May 10, 2021, 9:59am 3

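To be more specific, at training time you’re multiplying the activations elementwise by a mask of independent Bernoulli random variables, each of which has expected value exactly keep_probs. That multiplication shrinks the expected value of the activations by a factor of keep_probs, so you divide by keep_probs to bring it back to its original value.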
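A minimal NumPy sketch of that compensation, in case it helps. The function name `inverted_dropout`, the `rng` argument, and the array shape are my own illustrative choices, not taken from the course notebook; `keep_prob` plays the role of keep_probs above.

```python
import numpy as np

def inverted_dropout(a, keep_prob, rng):
    """Apply inverted dropout to activations `a` (training time only)."""
    # Bernoulli mask: each entry is 1 with probability keep_prob, else 0.
    d = (rng.random(a.shape) < keep_prob).astype(a.dtype)
    # Zero out the dropped units, then rescale the survivors by 1 / keep_prob
    # so that the expected value of the output matches the input.
    return a * d / keep_prob, d

rng = np.random.default_rng(0)      # seeded only to make the demo repeatable
a = np.ones((1000, 1000))           # stand-in for A[l]
a_drop, d = inverted_dropout(a, keep_prob=0.8, rng=rng)

print(d.mean())       # fraction of units kept, close to keep_prob
print(a_drop.mean())  # mean activation after rescaling, close to a.mean()
```

Without the division, the mean of `a * d` would sit near 0.8 instead of 1.0, and the next layer would see smaller inputs at training time than at test time, when no mask is applied.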

Let me know if that helped.


