Question on initializing the parameters

Mohammad_Omar_Adde · October 16, 2023, 8:46am

W = np.random.randn(n_y, n_x) * 0.01
b = np.zeros((n_y, 1))

Why does n_y come before n_x, and why is the shape of b (n_y, 1)? Is there a reason?

Kic · October 16, 2023, 9:51am

The shape of the weight matrix, W, reflects the network structure: n_x is the number of units in the input layer, X, and n_y is the number of units in the output layer. The network diagram should show this arrangement.

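n_y comes before n_x because W maps the input to the output: in the usual convention the layer computes W·X + b, so W needs n_y rows (one per output unit) and n_x columns (one per input unit). When initialising b, the bias vector, np.zeros() is called with the shape of the array as its argument; the shape (n_y, 1) makes b a column vector with one entry per output unit.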
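The shapes can be checked directly in NumPy. This is only a sketch: it assumes the layer computes Z = W X + b with X of shape (n_x, m), and the sizes n_x, n_y, m and the names X and Z are illustrative, not taken from the notebook.

```python
import numpy as np

# Illustrative sizes (assumed): 3 input units, 2 output units, 5 examples.
n_x, n_y, m = 3, 2, 5

X = np.random.randn(n_x, m)            # one column per example
W = np.random.randn(n_y, n_x) * 0.01   # one row per output unit
b = np.zeros((n_y, 1))                 # column vector, one entry per output unit

# (n_y, n_x) @ (n_x, m) gives (n_y, m); b is broadcast across the m columns.
Z = W @ X + b
print(W.shape, b.shape, Z.shape)
```

If W were created as (n_x, n_y) instead, the product W @ X would raise a shape error whenever n_x and n_y differ.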

Mohammad_Omar_Adde · October 16, 2023, 9:58am

Thanks for the clarification.

Mohammad_Omar_Adde · October 16, 2023, 10:05am

Hi @Kic
Sorry, one more question: why 0.01? Is there a reason? I think the instructor didn't mention it during the lecture.

Kic · October 16, 2023, 4:10pm

Hi @Mohammad_Omar_Adde

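Multiplying the output of random.randn() by 0.01 scales the values down, so the initial weights are small (a standard deviation of about 0.01 instead of about 1). It doesn't change the essence of the data: the values are still random and centred on zero.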
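A quick way to see the effect of the multiplier is to compare the spread of the values before and after scaling. This is a minimal sketch; the array size and the seed are arbitrary choices for illustration, not values from the course.

```python
import numpy as np

np.random.seed(0)  # arbitrary seed, only so the run is repeatable

raw = np.random.randn(1000, 1000)  # standard normal: mean ~0, std ~1
scaled = raw * 0.01                # same values, 100 times smaller

print("raw std:   ", raw.std())
print("scaled std:", scaled.std())

# Scaling by a positive constant keeps every sign and the relative ordering.
print("signs unchanged:", np.array_equal(np.sign(raw), np.sign(scaled)))
```

The scaled standard deviation is 0.01 times the raw one, which is the sense in which the multiplier only shrinks the values.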

TMosh · October 16, 2023, 5:19pm

There is some theoretical basis for selecting the range of the random initial values. It’s complicated, as it depends on the size of the NN, the number of layers, the numbers of units, etc. It’s an area of some research.

In practice you just try values between 0 and +1, or -1 and +1, and see how it goes. You may need to adjust the multiplier depending on your specific model.

Mohammad_Omar_Adde · October 16, 2023, 5:38pm

I really appreciate you as mentors, and your clear answers.

I tried running the notebook on my local machine, where I have the data set. Partway through, my results no longer match the instructor's, even though I use the same seed as the instructor. The course is Calculus, Week 3, notebook 1 (linear regression).

TMosh · October 16, 2023, 6:21pm

Please post a screen capture image that shows the results you mentioned.
