Clarification on Zero Initialization in Neural Network Linear Regression

It looks like you already found that thread, but I’ll add the link for anyone else who lands on this one. It works through the math that hackyon mentions, showing that zero initialization does not prevent learning in the logistic regression case. You can do the analogous derivation for linear regression; the cost function is different, of course, but the argument is the same. Once we get to real neural networks, though, we’ll need symmetry breaking, as hackyon mentioned, and that thread shows that too.
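
For anyone who wants the linear regression version spelled out, here is a sketch of that analogous derivation (my notation, which may differ from the linked thread’s). With cost

$$J(w, b) = \frac{1}{2m} \sum_{i=1}^{m} \left( w^\top x^{(i)} + b - y^{(i)} \right)^2,$$

the gradient at the zero initialization $w = 0$, $b = 0$ is

$$\left.\frac{\partial J}{\partial w_j}\right|_{w=0,\, b=0} = \frac{1}{m} \sum_{i=1}^{m} \left( 0 - y^{(i)} \right) x_j^{(i)} = -\frac{1}{m} \sum_{i=1}^{m} y^{(i)} x_j^{(i)},$$

which is nonzero for any feature correlated with the labels, so gradient descent moves off zero on the very first step. Each $w_j$ also gets its own gradient, so there is no symmetry to break in the first place.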
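
Hidden layers are where zero initialization goes wrong. Here is a minimal NumPy sketch (my own toy example, not code from the linked thread) showing both cases: a zero-initialized logistic regression that trains fine, and a zero-initialized one-hidden-layer network whose hidden units receive identical gradients at every step and therefore never differentiate:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy binary classification problem.
X = rng.normal(size=(200, 3))
y = (X @ np.array([1.5, -2.0, 0.5]) > 0).astype(float)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

lr = 0.5

# --- Logistic regression, zero-initialized: learns fine ---
w, b = np.zeros(3), 0.0
for _ in range(1000):
    p = sigmoid(X @ w + b)
    w -= lr * X.T @ (p - y) / len(y)   # gradient is nonzero even at step 1
    b -= lr * np.mean(p - y)
acc = np.mean((sigmoid(X @ w + b) > 0.5) == y)
print(f"zero-init logistic regression accuracy: {acc:.2f}")

# --- One hidden layer (4 sigmoid units), all weights zero ---
W1, b1 = np.zeros((3, 4)), np.zeros(4)
W2, b2 = np.zeros((4, 1)), 0.0
for _ in range(1000):
    h = sigmoid(X @ W1 + b1)            # every hidden column is identical
    p = sigmoid(h @ W2 + b2).ravel()
    dz2 = (p - y)[:, None] / len(y)     # output-layer error
    dz1 = (dz2 @ W2.T) * h * (1 - h)    # identical for every hidden unit
    W2 -= lr * h.T @ dz2
    b2 -= lr * dz2.sum()
    W1 -= lr * X.T @ dz1
    b1 -= lr * dz1.sum(axis=0)

# The weights do move, but the hidden units remain exact copies of each other.
print("W1 columns all identical:", np.allclose(W1, W1[:, [0]]))
print("W2 rows all identical:   ", np.allclose(W2, W2[0]))
```

The columns of W1 stay exact copies of each other, so the four hidden units collapse into one effective unit; that is the symmetry that random initialization is there to break.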
