Hello, I’ve been having a rough time implementing forward_propagation_with_dropout: my implementation doesn’t pass Test 2. I made sure that I used the correct random function (np.random.rand). Please help.
Did you check to make sure you did that at both of the layers with dropout? And did you remember to divide the output by keep_prob? Also note that they tell us to make the mask such that it does not treat each sample the same, right? In other words, you don’t generate a column of random values and then duplicate it: you create the random matrix directly in the final shape that you actually want.
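To make that concrete, here is a minimal sketch of the inverted-dropout pattern for one layer (the names A1, D1, and keep_prob match the assignment’s convention; the activation values here are just placeholders, not the graded function):

```python
import numpy as np

# Sketch of inverted dropout for one layer (illustration only).
# A1 has shape (units, samples); keep_prob is the probability of
# KEEPING a neuron.
A1 = np.arange(15, dtype=float).reshape(3, 5)   # placeholder activations
keep_prob = 0.7

D1 = np.random.rand(A1.shape[0], A1.shape[1])   # uniform randoms, full shape:
                                                # each sample gets its own mask
D1 = (D1 < keep_prob).astype(int)               # 1 = keep, 0 = drop
A1 = A1 * D1                                    # zero out the dropped neurons
A1 = A1 / keep_prob                             # rescale so the expected
                                                # activation is unchanged
```

The same three steps are then repeated at layer 2 with D2 and A2.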
Thanks for your quick reply. I worked my problem out. It was just that I built the mask with D1 > keep_prob, which keeps each neuron with probability 1 - keep_prob, whereas the lecture uses D1 < keep_prob, which keeps each neuron with probability keep_prob.
That’s great to hear that you found the solution under your own power! Thanks for confirming.
Hi @anteii, I’ve run into the same problem you encountered. Can you tell me how you rectified the error?
Hello, @madhav2381. After you sample the random matrix D1, you should convert its entries to 0 or 1. As I wrote earlier, my mistake was converting it with the formula D1 > keep_prob; I solved it by using the correct formula, which is D1 < keep_prob. And just in case, check whether you are using the right NumPy function for sampling the random values.
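Concretely, the two comparisons keep complementary fractions of the neurons, which is easy to verify (illustration only):

```python
import numpy as np

np.random.seed(1)
keep_prob = 0.7
D1 = np.random.rand(2, 4)

print((D1 < keep_prob).astype(int))   # keeps each neuron with prob. keep_prob
print((D1 > keep_prob).astype(int))   # complement: keeps with prob. 1 - keep_prob
```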
@anteii Thanks! I had used the wrong NumPy function for creating the D1 and D2 masks.
Yes, that’s the other common mistake: using “randn” instead of “rand”.
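The reason it matters: rand draws uniformly from [0, 1), so (D1 < keep_prob) is true with probability exactly keep_prob, while randn draws from a standard normal, so the kept fraction comes out wrong. A quick sanity check (illustration only, with keep_prob = 0.5 to make the gap obvious):

```python
import numpy as np

np.random.seed(0)
keep_prob = 0.5

uniform = np.random.rand(100000)    # values uniform in [0, 1)
normal  = np.random.randn(100000)   # standard normal: mean 0, std 1

print((uniform < keep_prob).mean())   # ~0.50, as intended
print((normal  < keep_prob).mean())   # ~0.69 (the normal CDF at 0.5): wrong
```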
I’m getting the same A3 as you, but I’m using (D1 < keep_prob). Did you have an additional error?
{moderator edit - solution code removed}
This is my code… I cannot find the problem. Any hints?
OK, solved it! It was randn, as @paulinpaloalto mentioned, haha.
I followed the advice listed above, and I still get an error. I’m using np.random.rand, not “randn”, comparing D1 < keep_prob, and dividing A1 by keep_prob. Is there anything else you can suggest, @paulinpaloalto? Here’s the output:
That all sounds correct to me. The only other thing I can think of is that you also need to apply dropout at layer 2. If you’ve done that in a similar way, then we’ll probably need to graduate to the “in case of emergency, break glass” method. Check your DMs.
One other thing I can think of is that just typing new code into a function cell and then calling the function again does nothing: it just runs the old code again. You actually need to press “Shift-Enter” to execute the function cell itself in order to get the new code added to the runtime image. So maybe your code is actually correct, but that’s not what you are really running.
The other way to make sure things are consistent is to do “Kernel → Restart and Clear Output” followed by “Cell → Run All”.
The other error is to set the dropout “mask” to be a column vector, so that all samples are treated the same in each minibatch. The elementwise multiply uses “broadcasting” in that case. But it turns out that is not what they intend here. The random mask should be the size of the activation matrix, which means that each sample is handled differently.
It is a legitimate question which approach makes more sense. There have been some interesting discussions of that in the past with some actual experimental results which seem to show that (at least in the particular case here) the two methods are pretty much equivalent.
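For the record, here is what the shape difference looks like in NumPy (a sketch; the (3, 5) shape is just an example):

```python
import numpy as np

np.random.seed(1)
keep_prob = 0.7
A = np.ones((3, 5))                         # 3 units, 5 samples (placeholder)

D_full = np.random.rand(3, 5) < keep_prob   # intended: one random value per
                                            # unit per sample
D_col  = np.random.rand(3, 1) < keep_prob   # column mask: broadcast across
                                            # samples, so every sample drops
                                            # the same neurons

print(A * D_full / keep_prob)   # columns differ from one another
print(A * D_col  / keep_prob)   # all five columns identical
```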
Hi all,
I believe I have followed the advice above and I am also getting an error.
I have done: making sure I use rand, not randn,
making sure that Dx < keep_prob,
and dividing Ax by keep_prob.
The code looks the same to me in both layers.
The code and output are below; is there anything else I am missing?
{moderator edit - solution code removed}
Hmmm, I would say that your code looks correct, but with essentially the same code, I get different results on that test cell:
A3 = [[0.36974721 0.00305176 0.04565099 0.49683389 0.36974721]]
All tests passed.
One theory would be that you changed the code, but did not actually press “Shift-Enter” on the function cell itself. If you just call it again from the test cell without actually executing the function cell again, it just runs the old code. Or try “Kernel → Restart and Clear Output” followed by “Cell → Run All” and see if you still fail the test.
Hey paulinpaloalto,
Thank you so much for your swift reply. I have tried restarting the kernel, to no avail.
I also tried deleting my Jupyter notebook, getting the “updated version”, and rewriting my code, and I now get a different set of results, which is still incorrect:
A3 = [[0.36974721 0.01459639 0.09212225 0.49683389 0.36974721]]
1 Tests passed
1 Tests failed
That looks closer to your numbers. I don’t believe I have changed “np.random.seed(1)”, but could that be causing the issue?
If you changed the seed, I would expect you to get a different value, but it’s also set to 1 in the standard template code and in my code. So there must be something else going on. It’s probably time for the “in case of emergency, break glass” method. Please check your DMs.
OK, we worked this out: it turns out that there is one place on the LHS of an assignment statement where the variable name is misspelled. It was intended to be A1, but it was actually Al (A followed by a lowercase letter ell). It’s really, really hard to see with the default font that the notebooks use.
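For future readers, the failure mode looks roughly like this (an illustration with placeholder values, not the actual notebook code):

```python
import numpy as np

A1 = np.array([[1.0, 2.0, 3.0]])   # placeholder activations
D1 = np.array([[1, 0, 1]])         # placeholder dropout mask
keep_prob = 0.7

Al = A1 * D1 / keep_prob   # typo: 'Al' (lowercase ell), intended 'A1' (one)

# A1 itself is untouched, so the next layer silently consumes the
# UNmasked activations while the masked values sit stranded in 'Al':
print(A1)   # [[1. 2. 3.]]
print(Al)   # [[1.42857143 0.         4.28571429]]
```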