It is a good question: the reason is that the initial values of the weights are different for every node. This is called "Symmetry Breaking," and Prof Ng does talk about it in the lectures. You're right that if you started out with all the weights the same, then every hidden unit would produce the same output, back propagation would compute the same gradient for each of them, and they would remain identical after every update — so you effectively end up with only one real neuron.
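You can see this directly in a tiny sketch. The network below (sigmoid hidden layer, linear output, squared-error loss) is a hypothetical example, not the course's code: with a constant initialization, every row of the hidden-layer gradient is identical, so the hidden units stay clones of each other; with random initialization the rows differ from the very first step.

```python
import numpy as np

def grad_W1(W1, W2, x, y):
    """One forward/backward pass of a tiny 2-layer net.
    Returns dLoss/dW1, whose rows are the per-hidden-unit gradients."""
    z1 = W1 @ x                        # hidden pre-activations
    a1 = 1.0 / (1.0 + np.exp(-z1))     # sigmoid activations
    err = W2 @ a1 - y                  # linear output, squared-error residual
    dz1 = (W2.T @ err) * a1 * (1 - a1) # backprop through the hidden layer
    return dz1 @ x.T

x = np.array([[1.0], [2.0]])
y = np.array([[1.0]])

# Symmetric init: every hidden unit starts with the same weights
W1_sym = np.full((3, 2), 0.5)
W2_sym = np.full((1, 3), 0.5)
g_sym = grad_W1(W1_sym, W2_sym, x, y)
print(g_sym)   # all three rows are identical

# Random init breaks the symmetry: each unit gets a different gradient
rng = np.random.default_rng(0)
W1_rnd = rng.normal(scale=0.01, size=(3, 2))
W2_rnd = rng.normal(scale=0.01, size=(1, 3))
g_rnd = grad_W1(W1_rnd, W2_rnd, x, y)
print(g_rnd)   # rows differ, so the units evolve differently
```

Since the symmetric gradient rows are equal, a gradient-descent update keeps the rows of `W1_sym` equal too, and by induction they stay equal forever — that is the symmetry random initialization is breaking.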
Here's a thread that discusses Symmetry Breaking in more detail.