Dropout: Why divide by keep prob?

shamus · July 10, 2021, 2:06pm

Hi, I am struggling to understand the reason behind doing the 4th step of dropout as follows.

Divide 𝐴[1] by keep_prob. By doing this you are assuring that the result of the cost will still have the same expected value as without drop-out. (This technique is also called inverted dropout.)

I understand what the 4th step is but can u elaborate on the 2nd sentence above? How do we tie it back to the cost function?

paulinpaloalto · July 10, 2021, 11:19pm

Please see this recent thread for a pretty thorough discussion of this issue.

Topic		Replies	Views
Inverted Dropout step Improving Deep Neural Networks: Hyperparameter tun	2	625	February 12, 2023
Inverted dropout Intuition? Improving Deep Neural Networks: Hyperparameter tun	3	671	May 24, 2022
Doubt related to Inverted Dropout Technique Improving Deep Neural Networks: Hyperparameter tun	2	820	February 16, 2023
Regularization by Inverted Dropout Improving Deep Neural Networks: Hyperparameter tun	1	687	August 12, 2021
Inverted Dropout Improving Deep Neural Networks: Hyperparameter tun	22	1783	July 27, 2023

Dropout: Why divide by keep prob?

Related topics