We’ve seen that initializing both the weights and the biases to zero makes every neuron in a layer take the same value. Can we instead initialize the biases to some non-zero values and the weights to zero? I think that since the biases are non-zero, the neurons in a layer would no longer be symmetric.
Hi @rae and welcome to Discourse. The input to a layer is written as: z = Wx + b. If only b is non-zero, you would still get a symmetric response in each layer. The reason is that the output from each layer will be constant, which could be non-zero, but still the same within a layer. This would result in uniform weight updates and sub-optimal training.
Hi @yanivh, by symmetric response and constant output, do you mean symmetric and constant across all observations, or across all neurons in the layer?
For example, if W is a zero matrix and b is [1, 2]^T, then the two neurons in the layer output z = [1, 2]^T for every observation, which is not symmetric across the neurons.
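Here’s a quick NumPy sketch of that example (my own toy numbers, not from an assignment):

```python
import numpy as np

# Layer with 2 neurons, 3 input features, 4 observations (columns of X).
W = np.zeros((2, 3))              # weights initialized to zero
b = np.array([[1.0], [2.0]])      # bias initialized to [1, 2]^T

X = np.random.randn(3, 4)         # 4 arbitrary observations
Z = W @ X + b                     # broadcasting adds b to every column

print(Z)
# Every column is [1, 2]^T: constant across observations,
# but the two neurons differ, so symmetry is broken within the layer.
```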
@rae, you are correct. In your example, symmetry is broken within the layer. In fact, you can think of b as the weights associated with a constant input of 1 (it could have been represented as the first column of W, with 1 added as the first element of x).
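As a rough illustration of that augmentation (assuming the z = Wx + b convention, so b becomes an extra column of W and x gets a constant 1 prepended):

```python
import numpy as np

W = np.random.randn(2, 3)
b = np.random.randn(2, 1)
x = np.random.randn(3, 1)

z = W @ x + b                              # usual form with a separate bias

W_aug = np.hstack([b, W])                  # b becomes the first column of W
x_aug = np.vstack([np.ones((1, 1)), x])    # prepend the constant input 1
z_aug = W_aug @ x_aug                      # same result, no separate bias term

print(np.allclose(z, z_aug))               # True
```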
So, I take back what I wrote before. Initializing b as non-zero would have a similar effect to initializing W itself as non-zero. I wouldn’t practice DL this way, but theoretically it should work. You can try this yourself in one of the assignments in the course.
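If you want to see why it works before trying the assignment, here is a rough sketch with my own toy two-layer network (not the course’s code): with zero weights and distinct biases, the hidden activations already differ per neuron, so the gradients for W2 differ from the very first step and symmetry stays broken.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

np.random.seed(0)
m = 8                                   # number of observations
X = np.random.randn(3, m)
Y = np.ones((1, m))                     # all labels 1, just for simplicity

# Zero weights, non-zero (distinct) biases in the hidden layer.
W1, b1 = np.zeros((2, 3)), np.array([[0.5], [-0.5]])
W2, b2 = np.zeros((1, 2)), np.zeros((1, 1))

# Forward pass
A1 = sigmoid(W1 @ X + b1)               # the two rows differ because b1 differs
A2 = sigmoid(W2 @ A1 + b2)

# Backward pass (binary cross-entropy)
dZ2 = A2 - Y
dW2 = dZ2 @ A1.T / m                    # the two entries differ, so W2's
print(dW2)                              # weights get different updates at step 1
dZ1 = (W2.T @ dZ2) * A1 * (1 - A1)      # zero on this step (W2 is still zero),
dW1 = dZ1 @ X.T / m                     # but becomes non-zero once W2 updates
```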