Week 1 increasing number of iterations for big randomly initialize value of W does not give better results

realnoob · August 20, 2022, 5:44am

In the second case of the lab where we randomly initialize big values for W, I understand that the cost is high because the values for activation functions in each layer are near 0 or 1.

But when I try running the model for more iterations (up to 100000), why the cost does not continue to decrease and we get beautiful values of W as in the third case when we use he initialization?

gent.spah · August 20, 2022, 10:29am

Maybe the model gets stuck at a local optima!

realnoob · August 21, 2022, 2:50pm

does it mean the result i get is specific for this case only and in other cases where i use big randomly initialized w, the cost can be as low compared to He or Xavier initialization?

gent.spah · August 21, 2022, 6:23pm

Im not sure but maybe initializing with those methods is probably better.

Topic		Replies	Views
Week 4 - initialize_parameters_deep - w initialisation redefined for Exercise 2 Neural Networks and Deep Learning coursera-platform	5	653	July 2, 2022
Week 1 - Programming Assignment 1 Improving Deep Neural Networks: Hyperparameter tun coursera-platform	1	611	April 10, 2022
He initialization Improving Deep Neural Networks: Hyperparameter tun coursera-platform	1	465	May 29, 2023
Week2 Programming Assignment 1 - random weight initialization Improving Deep Neural Networks: Hyperparameter tun coursera-platform	3	517	October 20, 2022
DLS Week 1 Initialization: HE Initialization Improving Deep Neural Networks: Hyperparameter tun coursera-platform	2	585	August 29, 2021

Week 1 increasing number of iterations for big randomly initialize value of W does not give better results

Related topics