Doubt regarding D1 and D2 in dropout regularization in assignment 2 of week 1

Mihir09 · July 31, 2021, 1:24pm

In the exercise, it is mentioned that I should initialize D1 in the following way
D1 = np.random.rand(A1.shape[0],A1.shape[1])
D1 = (D1<keep_prob).astype(int)

But my doubt is that what if I initialize D1 in following way
D1 = np.random.rand(A1.shape[0],A1.shape[1])
D1 = (D1>(1-keep_prob)).astype(int)

Will it make any differnece to my model??

paulinpaloalto · November 29, 2021, 1:02am

This is an interesting question! The point is that those two implementations have the same statistical behavior in terms of how many nodes are zeroed, but the actual nodes that get zeroed are different, right? But it turns out that the test cases here are written to expect that you use the first method.

Since all the behavior of dropout is fundamentally statistical, either of the implementations will have the same overall effect in actual use for training a model. But only the first one will pass the grader for this assignment.

Topic		Replies	Views
Week1 - Programming Assignment: Regularization - dropout code Improving Deep Neural Networks: Hyperparameter tun coursera-platform	3	712	April 1, 2022
[C2W1] Dropout Regularization - Lecture issue Improving Deep Neural Networks: Hyperparameter tun coursera-platform	2	557	January 11, 2022
Implementation keep_prob in dropout Improving Deep Neural Networks: Hyperparameter tun coursera-platform	4	739	July 4, 2021
Course 2 week 1 PA 2 Improving Deep Neural Networks: Hyperparameter tun coursera-platform	1	618	November 14, 2021
Implementing dropout regularization Improving Deep Neural Networks: Hyperparameter tun coursera-platform	3	641	May 14, 2022

Doubt regarding D1 and D2 in dropout regularization in assignment 2 of week 1

Related topics