How do we get diff weights and bias for a dense layer

gauravv · August 2, 2022, 2:43am

I am a bit confused about the per-layer W and b in TensorFlow.
When training a TensorFlow model, how are the weights and biases learnt if there are multiple units? Considering that each unit in the first layer gets the same input vector X, why doesn’t each unit eventually determine the same weights if the input W is also the same? How does TensorFlow determine the features for each unit?
Taking the coffee roasting example, what would be the features for the input layer’s units?

rmwkwok · August 2, 2022, 3:46am

Hello @gauravv,

In short, the neurons in a layer learn different things because they are different at the beginning. When building a model with TensorFlow, we need to initialize the weights for each neuron to some values, and by default they are initialized randomly, so it is essentially impossible for any two weights to share the same value. This initial diversity lets the weights follow different learning paths during gradient descent and end up learning different things.

Above is how I would explain this without maths, but if you want some simple maths and an example to persuade yourself, you may read this.
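
As a small sketch of the same idea, here is a tiny network written in plain Python (not TensorFlow's actual implementation). The function names `sigmoid` and `train`, the toy data, the learning rate and the starting values are all made up for illustration. It trains two hidden units twice: once starting from identical weights and once starting from random weights.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def train(W1, b1, w2, b2, data, lr=0.5, epochs=200):
    """Train a toy network: 2 inputs -> 2 sigmoid hidden units -> 1 sigmoid output.

    Uses per-example gradient descent on the binary cross-entropy loss.
    Returns the hidden layer's weights and biases after training.
    """
    W1 = [row[:] for row in W1]  # copy so the caller's lists are not modified
    b1 = b1[:]
    w2 = w2[:]
    for _ in range(epochs):
        for x, y in data:
            # Forward pass: both hidden units see the same input x.
            a1 = [sigmoid(sum(w * xi for w, xi in zip(W1[j], x)) + b1[j])
                  for j in range(2)]
            a2 = sigmoid(sum(w2[j] * a1[j] for j in range(2)) + b2)
            # Backward pass (gradients computed before any update).
            dz2 = a2 - y
            dz1 = [dz2 * w2[j] * a1[j] * (1.0 - a1[j]) for j in range(2)]
            # Gradient descent updates.
            for j in range(2):
                w2[j] -= lr * dz2 * a1[j]
                for i in range(2):
                    W1[j][i] -= lr * dz1[j] * x[i]
                b1[j] -= lr * dz1[j]
            b2 -= lr * dz2
    return W1, b1


# Invented toy data (XOR-like), only for this demonstration.
data = [([0.0, 0.0], 0.0), ([0.0, 1.0], 1.0),
        ([1.0, 0.0], 1.0), ([1.0, 1.0], 0.0)]

# Case 1: both hidden units start with exactly the same weights.
W1_same, b1_same = train([[0.5, 0.5], [0.5, 0.5]], [0.0, 0.0],
                         [0.5, 0.5], 0.0, data)

# Case 2: hidden units start with random weights (seeded for repeatability).
random.seed(0)
W1_init = [[random.uniform(-1, 1) for _ in range(2)] for _ in range(2)]
w2_init = [random.uniform(-1, 1) for _ in range(2)]
W1_rand, b1_rand = train(W1_init, [0.0, 0.0], w2_init, 0.0, data)

print("identical start:", W1_same, b1_same)
print("random start:   ", W1_rand, b1_rand)
```

With the identical start, both units receive the same gradient at every step, so they stay copies of each other no matter how long you train. With the random start, their gradients differ from the first step onward, so they end up with different weights.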

Cheers,
Raymond

gauravv · August 3, 2022, 2:59am

Thank you Raymond. This was really helpful.

rmwkwok · August 3, 2022, 3:03am

You are welcome @gauravv. It’s a great question.

Raymond
