Just wondering: for gradient descent with multiple-feature logistic regression, as in the lecture by Prof Ng, to work out w1 (i.e., j=1) we start with some value, e.g. w1 = 1, and then use gradient descent to converge. However, while we are converging on w1, what values do we use for the other weights in order to calculate f_w,b(x^(i))? Do we start with w = 1 for all the weights and then update w_j for j = 1…n simultaneously?
We can start with any values for the weights; even giving them all the same value is fine. Initialization is not much of an issue for logistic regression.
When we get into neural networks, symmetry breaking becomes necessary, so we initialize the weights of different neurons with different random values. You will see this in Course 2.
But for now, you can go ahead and initialize in any way you choose.
Yes, Prof. Ng recommends a simultaneous update of all the weights (and b) on each iteration: compute every partial derivative using the current parameter values, then update all parameters at once.
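Here is a minimal NumPy sketch of what that looks like. This is not the course's exact code; the function names (`sigmoid`, `gradient_descent`) and the hyperparameters `alpha` and `num_iters` are just illustrative choices:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def gradient_descent(X, y, alpha=0.1, num_iters=1000):
    """Logistic regression by batch gradient descent.
    X: (m, n) feature matrix, y: (m,) labels in {0, 1}."""
    m, n = X.shape
    w = np.zeros(n)  # any starting values work here; zeros, ones, or random are all fine
    b = 0.0
    for _ in range(num_iters):
        # f_wb is computed from the CURRENT values of ALL the weights at once
        f_wb = sigmoid(X @ w + b)      # shape (m,)
        err = f_wb - y                 # shape (m,)
        dj_dw = (X.T @ err) / m        # gradient w.r.t. every w_j, j = 1..n
        dj_db = err.mean()
        # simultaneous update: every w_j and b are updated in the same step,
        # all using gradients computed from the same (old) parameter values
        w -= alpha * dj_dw
        b -= alpha * dj_db
    return w, b
```

Note that all the gradients are computed first, and only then are `w` and `b` overwritten, so no weight's update ever "sees" another weight's new value within the same iteration. That is exactly the simultaneous update from the lecture.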