Week 3: matrix W

echerny · August 15, 2021, 12:13am

From my understanding, the input layer takes the 3 features of one individual from X, so the input layer represents a single training example. These feature values go into a linear model, wx + b, in layer 1 and then through the sigmoid function, which makes each node a logistic regression model. What I don't understand is: if the first node of layer 1 already has its vector w and has calculated the prediction (y hat) for training example x given that w, why do we need the other 3 nodes in layer 1? We have already calculated y hat. And once the network has all of these y hats, how does it consolidate them into a single y hat in layer 2?

Thank you

carlosrl · August 16, 2021, 5:18am

Hi @echerny
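The idea behind having multiple neurons in a DNN is that each neuron learns a different feature of the input, because each one has its own weight vector w and bias b. So the more neurons we have, the more features the model can learn. The outputs of the layer 1 nodes are not final predictions; layer 2 takes those 4 values as its own inputs and combines them, with its own weights and bias, into the single y hat. But adding neurons has a limit, since a larger network overfits more easily. Depending on the network architecture, we may add dropout, which randomly deactivates some neurons during training, to reduce overfitting.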
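To make this concrete, here is a minimal sketch of the forward pass for one example through a network with 3 inputs, 4 hidden nodes, and 1 output. It assumes sigmoid in both layers, as in the question (the course may use a different hidden activation), and the input values, random weights, and the helper name `dense` are made up for illustration.

```python
import math
import random

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def dense(inputs, weights, biases):
    """One layer: each row of `weights` is the weight vector of one neuron."""
    return [
        sigmoid(sum(w * x for w, x in zip(row, inputs)) + b)
        for row, b in zip(weights, biases)
    ]

random.seed(0)
x = [0.5, -1.2, 3.0]  # one training example with 3 features (made-up values)

# Layer 1: 4 neurons, each with its own 3 weights, so W1 has shape 4 x 3.
W1 = [[random.uniform(-1, 1) for _ in range(3)] for _ in range(4)]
b1 = [0.0] * 4

# Layer 2: 1 neuron with 4 weights, one per hidden activation, so W2 is 1 x 4.
W2 = [[random.uniform(-1, 1) for _ in range(4)]]
b2 = [0.0]

a1 = dense(x, W1, b1)         # 4 intermediate activations, not y hat
y_hat = dense(a1, W2, b2)[0]  # the single prediction, computed from a1

print("hidden activations:", a1)
print("y_hat:", y_hat)
```

Each row of `W1` is the vector w of one hidden node, which is why the layer's parameters form a matrix W rather than a single vector. Because the rows differ, the 4 nodes produce 4 different activations, and the output node runs one more logistic regression on those 4 values instead of on the raw features.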
