In the gradient descent video, Andrew says at 1:45 that the height of the surface represents the value of “J, b” at a certain point. Shouldn’t he have said “w, b” instead?
Hi @paul2048 and welcome to Discourse. The cost function J is a function of the weights w and bias b. You want to find a combination of w and b that minimizes J. This is what is explained in the video you referenced.
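
To make this concrete, here is a minimal sketch in Python. Each point on the floor of the plot is a pair (w, b), and the height of the surface above that point is the value J(w, b). The toy data, the squared-error form of the cost, the learning rate `alpha`, and the function names are all assumptions chosen for illustration, not taken from the video.

```python
import numpy as np

# Hypothetical toy data generated from y = 2x + 1 (an assumption for illustration).
x = np.array([0.0, 1.0, 2.0, 3.0])
y = 2.0 * x + 1.0

def cost(w, b):
    """Squared-error cost J(w, b): the height of the surface above the point (w, b)."""
    residuals = w * x + b - y
    return np.mean(residuals ** 2) / 2.0

def gradient(w, b):
    """Partial derivatives of J with respect to w and b."""
    residuals = w * x + b - y
    return np.mean(residuals * x), np.mean(residuals)

def gradient_descent(w=0.0, b=0.0, alpha=0.1, steps=2000):
    """Repeatedly step downhill on the surface, starting from (w, b)."""
    for _ in range(steps):
        dw, db = gradient(w, b)
        w, b = w - alpha * dw, b - alpha * db
    return w, b

w_min, b_min = gradient_descent()
print("J at the start (0, 0):", cost(0.0, 0.0))
print("minimizing (w, b):", (w_min, b_min), "with J =", cost(w_min, b_min))
```

So w and b are the inputs (where you stand on the plot), and J is the output (how high the surface is at that spot). Gradient descent changes w and b in order to lower J.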