Gradient Descent question

Amit_Misra1 · August 13, 2023, 11:39pm

2 Questions:

1) How do you use the initial values for gradient descent?
2) How come the equation for gradient descent always adjusts “w” by subtracting the derivative term?

Thanks in advance.

TMosh · August 13, 2023, 11:51pm

The initial values are the starting point for the weights. Gradient descent starts from there and proceeds “downhill” toward the minimum cost.

Gradient descent is intended to be used with convex cost functions. Those have the characteristic that the gradient is negative if the weight is too low and positive if the weight is too high, so subtracting the gradient always moves the weight toward the minimum.
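
Here is a minimal sketch of the update rule w = w - alpha * dJ/dw on a one-parameter convex cost. The cost J(w) = (w - 3)^2, the learning rate alpha = 0.1, and the function names are all assumptions made up for illustration; they are not from the course. The sketch checks the sign of the derivative on each side of the minimum and shows that subtracting it moves w toward the minimum from either side.

```python
def cost(w):
    # Hypothetical convex cost with its minimum at w = 3.
    return (w - 3.0) ** 2


def dcost_dw(w):
    # Derivative of the cost above with respect to w.
    return 2.0 * (w - 3.0)


def step(w, alpha=0.1):
    # One gradient descent update: subtract alpha times the derivative.
    return w - alpha * dcost_dw(w)


w_low, w_high = 0.0, 6.0  # one start below the minimum, one above

# Below the minimum the derivative is negative, so subtracting it raises w.
print(dcost_dw(w_low) < 0, step(w_low) > w_low)

# Above the minimum the derivative is positive, so subtracting it lowers w.
print(dcost_dw(w_high) > 0, step(w_high) < w_high)

# Either way, the cost after the step is lower than before.
print(cost(step(w_low)) < cost(w_low), cost(step(w_high)) < cost(w_high))
```

So the minus sign does not always lower w; it moves w in whichever direction reduces the cost.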

Amit_Misra1 · August 14, 2023, 12:03am

Thanks TMosh. How do you select those initial values for w and b?

TMosh · August 14, 2023, 12:15am

Generally, just set them all to zero. That is as good a set of initial values as any, since we have no clue what the final values might be until training is complete.
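
As a rough sketch of why the starting point matters little for a convex cost, the code below runs gradient descent for single-feature linear regression with a squared-error cost, once from all zeros and once from an arbitrary start. The toy data (generated from y = 2x + 1), the learning rate, the iteration count, and the function name are all assumptions chosen for illustration.

```python
def gradient_descent(xs, ys, w=0.0, b=0.0, alpha=0.1, num_iters=5000):
    # Batch gradient descent for the model f(x) = w * x + b with the
    # squared-error cost J = (1 / (2m)) * sum((f(x_i) - y_i) ** 2).
    m = len(xs)
    for _ in range(num_iters):
        errors = [w * x + b - y for x, y in zip(xs, ys)]
        dj_dw = sum(e * x for e, x in zip(errors, xs)) / m
        dj_db = sum(errors) / m
        # Update both parameters using gradients from the same pass.
        w -= alpha * dj_dw
        b -= alpha * dj_db
    return w, b


# Toy data lying exactly on the line y = 2x + 1 (made up for this example).
xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.0, 5.0, 7.0, 9.0]

w_zero, b_zero = gradient_descent(xs, ys)                    # start at w = 0, b = 0
w_other, b_other = gradient_descent(xs, ys, w=5.0, b=-4.0)   # arbitrary start

print(round(w_zero, 4), round(b_zero, 4))
print(round(w_other, 4), round(b_other, 4))
```

Both runs should end up at essentially the same w and b, close to 2 and 1, which is why zeros are a reasonable default here.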
