Week 1, Dropout Regularization

In the lesson, after reducing the activation matrix a3 by 20%, it is divided by the keep_probs value to scale it back up. I don’t understand what’s happening here.

How does a3 /= keep_probs work?

Dropout accepts two inputs: one is, of course, the input tensor, and the other is a "training" flag that tells the layer whether the network is running in training mode or in inference (prediction) mode. The reason there are two modes is that dropout is active during training but is turned off at prediction time. To get outputs on the same scale at prediction time as at training time, the amount of signal flowing through the network should match what it was during training, when dropout was active. If keep_probs = 0.8, then the amount of signal flowing through the network at training time is 0.8 times what it is at prediction time.
So there are two ways to make them match: reduce the flow at prediction time by multiplying by 0.8, or increase the flow at training time by dividing by 0.8. The latter is the "inverted dropout" that Andrew introduced, and it is what you are asking about; see the sketch below.
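
Here is a minimal NumPy sketch of that idea, in the spirit of the course's manual implementation. The variable names a3 and keep_probs come from the question; the shape of a3 and the use of the sum of absolute activations as a rough measure of "flow" are just illustrative assumptions.

```python
import numpy as np

np.random.seed(1)
keep_probs = 0.8
a3 = np.random.randn(5, 4)          # stand-in activations for layer 3 (shape is arbitrary)

# Training time, inverted dropout:
d3 = np.random.rand(*a3.shape) < keep_probs  # keep each unit with probability 0.8
a3_train = (a3 * d3) / keep_probs            # zero out ~20% of units, scale survivors by 1/0.8

# Without the division, the training-time "flow" would be only ~0.8x the
# inference-time flow; dividing by keep_probs restores the match on average:
print(np.abs(a3).sum())        # inference-time flow (no dropout applied)
print(np.abs(a3_train).sum())  # training-time flow, roughly the same in expectation
```

The alternative (plain, non-inverted dropout) would skip the division at training time and instead multiply the activations by keep_probs at prediction time; inverted dropout is preferred because it leaves the prediction-time code untouched.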


Thanks! I will look up graph theory now

Here’s a thread from a while back that discusses this in more detail and shows examples of the effect of the inverted dropout on the L2 Norm of the activation outputs.


Thank you! That’s helpful