Advanced Learning Algorithms: Neural Network Concept Question

Michael_McCandless · February 5, 2024, 12:02pm

I’m in week 1 of the class, where the example for T-shirt demand based on affordability, quality, etc. is used. A simple neural network is created, with an hidden layer accepting a vector of 4 values, and outputting a vector of 3 values. This is fed to the output layer.

I am confused: how is it that each neuron in the hidden layer - which accepts the same 4 values - output a different value? I get it that the output of each neuron is based on W and B, so am unclear how W and B would be different for each neuron since each neuron gets the same 4 inputs.

Thanks.

lukmanaj · February 5, 2024, 12:25pm

Hi @Michael_McCandless. Welcome to the community.
You posted this in AI discussions. I have moved it to the right course section. The mentors will attend to your question.

Michael_McCandless · February 5, 2024, 12:57pm

Thank you.

TMosh · February 5, 2024, 6:33pm

There are two aspects of NN implementation that make this possible:

The hidden layer activations always include some non-linear function.
The individual weights that feed into each hidden layer unit are randomly initialized to small random values.

This second point is critical - it’s called “symmetry breaking”, and sets each hidden layer unit on a separate path toward convergence.

The magic is that it actually works extremely well. The theoretical background is complicated and I personally I do not worry about it. You can probably find some papers about it online if you want further details.

Michael_McCandless · February 5, 2024, 6:53pm

Thank you. I agree that it feels like “magic” going on with NN, but you clearly answered my question.

Will we encounter - in advanced algorithms course - any discussion on how to determine number of hidden layers, and number of units within each hidden layer? I suspect those choices affect the success of the “magic” so would be good to know.

TMosh · February 5, 2024, 7:07pm

It’s based on guided experimentation. I believe it’s covered during MLS.

Essentially, there is a compromise between getting good-enough results, and creating a model that is too complex or difficult to train considering your specific goals in the project.

The “guided experimentation” comes in deciding what “good-enough” is for a specific situation.

paulinpaloalto · February 5, 2024, 8:24pm

Yes, you’re right that choices of that type are important in whether the effort succeeds or not. As Tom says, it is based on guided experimentation and on previous experience and comparing to other known solutions to similar problems. I have not taken MLS, so I don’t know how much they discuss such issues about design choices. The topic of how to approach making these design choices in a systematic way is covered in some detail in DLS, particularly in Course 2 and Course 3. A reasonable path would be to complete MLS and then take DLS to go “deeper” (pun fully intended).

Topic		Replies	Views
Neural Networks Hidden Layers AI Discussions	1	203	August 26, 2022
Ask a question about neural networks Advanced Learning Algorithms week-1	3	281	March 20, 2024
Question in intro video Advanced Learning Algorithms week-1	4	15	December 20, 2024
Open question: Regarding the working of a layer in the neural network Advanced Learning Algorithms week-1	3	487	March 6, 2023
Neural Network intuition Advanced Learning Algorithms week-1	2	283	December 1, 2023

Advanced Learning Algorithms: Neural Network Concept Question

Related topics