How should we think when initializing w & b?

I seem to be having a problem while initializing the parameters w and b in the model function. Could you give me any tips on how to think about their shapes? Shouldn’t we just be considering the equation transpose of w times x, plus b?

Hi @Kutay_Eroglu. Welcome to the specialization. When creating a new topic on Discourse, it is a great help to the community if you post the week, assignment number, and exercise number in your topic heading. Example: “W3, A1, Ex 3: How should we think …”.

Also, please post a snapshot of the “traceback” (i.e., the error log) generated by your code. Do not post the code directly from the function that you are trying to complete, as that would be a violation of the course honor code.

Thanks!


Here is an example of another learner’s post just today:

Week 4 Exercise 5 - L_model_forward shape challenges


Hi @Kutay_Eroglu!

Yes, you are right for linear layers. For convolutional layers, check this post.

If you are using other types of layers or more complex ones, the output size will differ.

I’m guessing this question is about the C1 W2 Logistic Regression assignment. There the linear activation formula is:

Z = w^T \cdot X + b

In that case b is a scalar value and w is a column vector with dimensions n_x x 1, where n_x is the number of “features” or elements in each input sample. Each column of X is one input sample, so n_x is the number of rows in the X matrix.
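To make the shape bookkeeping concrete, here is a minimal NumPy sketch of how those dimensions line up, assuming a simple zero initialization (the names `init_params`, `n_x`, and the toy sizes are just for this illustration, not taken from the assignment):

```python
import numpy as np

def init_params(n_x):
    # Zero initialization: w is a column vector of shape (n_x, 1), b is a scalar
    w = np.zeros((n_x, 1))
    b = 0.0
    return w, b

# Toy example: 4 features per sample, 3 samples, so X has shape (4, 3)
X = np.random.randn(4, 3)
w, b = init_params(X.shape[0])   # n_x is the number of rows of X

Z = np.dot(w.T, X) + b           # w.T has shape (1, n_x), so Z has shape (1, 3)
print(Z.shape)                   # (1, 3): one linear output per input sample
```

Note that the scalar b is simply broadcast across all the columns of the (1, m) result, which is why it does not need a shape of its own in this case.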

All this was discussed in the lectures and also in the notebook. Please read the material in the notebook again carefully if you still have questions.
