Week 1 - initialize_parameters_he_test error?

user179 · January 17, 2022, 4:59pm

Hello,
I’m stuck on initialize_parameters_he. My formula seems right: I’m using randn and multiplying by the provided sqrt formula, and my first-layer weights seem to be initialized correctly.
Here’s my output.

[Image: Screen Shot 2022-01-17 at 11.52.26 AM]

Here’s the expected output.
[Image: Screen Shot 2022-01-17 at 11.52.29 AM]

I don’t see how W1 can be right but not W2?

It seems like W2 weights are divided by 2 from what the actual results should be?

user179 · January 17, 2022, 6:23pm

Looking through public_tests.py, it seems the test expects a different layer_dims than the exercise description ([2, 4, 1] vs [3, 1, 2])?

paulinpaloalto · January 17, 2022, 6:58pm

We are writing general code here, right? So there can be test cases which have different values as inputs in order to test your logic.

If you have hard-coded any of the values, that will fail.
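To illustrate the point about general code, here is a minimal sketch that derives parameter shapes from whatever layers_dims is passed in. The helper name expected_shapes is hypothetical, and the shape convention (W for layer l has shape (layers_dims[l], layers_dims[l-1]), b has shape (layers_dims[l], 1)) is assumed from the course notation rather than taken from the notebook or the test file.

```python
def expected_shapes(layers_dims):
    """Map each parameter name to the shape implied by layers_dims.

    Hypothetical helper: nothing here is hard-coded to one network size.
    """
    shapes = {}
    for l in range(1, len(layers_dims)):
        # Rows come from the current layer, columns from the previous one.
        shapes[f"W{l}"] = (layers_dims[l], layers_dims[l - 1])
        shapes[f"b{l}"] = (layers_dims[l], 1)
    return shapes

# The two layer lists mentioned in this thread give different shapes.
shapes_a = expected_shapes([2, 4, 1])
shapes_b = expected_shapes([3, 1, 2])
print(shapes_a)
print(shapes_b)
```

Code that only works for one of these inputs is effectively hard-coded, even if no literal numbers appear in it.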

But your values for W2 come out to be 2x what they should be. Seems like that’s a pretty good clue where to look for the error. How can they be right for W1 and wrong for W2? Your formula is probably wrong in a couple of ways, involving how you handle the square root, the factor of 2, or which layer dimension you use as the input to the calculation. Somehow your mistakes happen to cancel each other out in the layer 1 case.

user179 · January 17, 2022, 7:05pm

I didn’t hard code any of the values.

[Removed the source code]
Training and test set accuracy looks good as well:
[Image: Screen Shot 2022-01-17 at 2.04.48 PM]

Let me know if I should delete this post (since it includes code I wrote).

paulinpaloalto · January 17, 2022, 7:08pm

Interesting. Yes, I’d say that code looks correct. Are you sure you pressed “Shift-Enter” on the code cell before you called the function again? Just typing new code and then calling the function again runs the old code. You can easily demonstrate to yourself that this is how it works.

paulinpaloalto · January 17, 2022, 7:09pm

You don’t have to delete the post, but it would be a nice thing to edit it to remove the source code. Thanks!

user179 · January 17, 2022, 7:12pm

Indeed I did. I changed the code around to check (comparing output vs expected output: W1 is the same, W2 isn’t). Is it possible the test is broken or has been changed?

paulinpaloalto · January 17, 2022, 7:17pm

It works fine for me.

paulinpaloalto · January 17, 2022, 7:21pm

Oh, no, we just aren’t looking at the code carefully enough. Isn’t that always the way?

Look more carefully at what’s under the square root there and notice that 1 and l look pretty similar.
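For anyone reading later, here is a small sketch of why that kind of slip can leave W1 right and W2 wrong. It assumes the mistake was indexing with the digit 1 where the loop variable l was intended (the original code was removed from the thread, so this is a guess at its form), and it only computes the He scaling factor sqrt(2 / previous layer size), not the full initialization. The helper name he_scales is hypothetical.

```python
import numpy as np

def he_scales(layers_dims, typo=False):
    """Return the He scaling factor sqrt(2 / fan_in) for each layer."""
    scales = []
    for l in range(1, len(layers_dims)):
        # With the typo, the index is the digit 1 minus 1, i.e. always 0,
        # so every layer is scaled by the size of the input layer.
        fan_in = layers_dims[1 - 1] if typo else layers_dims[l - 1]
        scales.append(float(np.sqrt(2.0 / fan_in)))
    return scales

correct = he_scales([2, 4, 1])
buggy = he_scales([2, 4, 1], typo=True)
print(correct)
print(buggy)
```

For layer 1 the two indices coincide (l - 1 and 1 - 1 are both 0), so the first entries match and only the later layers differ.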

user179 · January 17, 2022, 7:25pm

OMG…so embarrassing…That was it! Thank you! I was close to figuring it out: when I removed the sqrt initialization, my W1 weights didn’t change, but W2 was still wrong.

paulinpaloalto · January 17, 2022, 7:30pm

As they say in my country, “D’oh!” But don’t feel bad: I’ve done similar things a thousand times in my programming career (so far). It’s so easy to look at a piece of code and not really “see” it.

paulinpaloalto · January 17, 2022, 7:34pm

Actually there’s another useful lesson there about Python indexing. If you’ve done a lot of Python programming, maybe you already knew that, but it’s perfectly valid to use negative array indices. It just counts backward from the end of the array: myArray[-1] gives you the last element, myArray[-2] the second to last and so forth.
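A quick sketch of that behavior, using a plain list and a NumPy array (the variable names here are just for illustration):

```python
import numpy as np

layers_dims = [2, 4, 1]

last = layers_dims[-1]            # last element
second_to_last = layers_dims[-2]  # second to last element

# NumPy arrays follow the same convention.
arr = np.array(layers_dims)
arr_last = int(arr[-1])

print(last, second_to_last, arr_last)
```

This is also why a wrong index can fail silently: an index that accidentally goes negative still returns a value instead of raising an error.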
