In the gradient descent video, Andrew says at 1:45 that the height of the surface represents the value of “J, b” at a certain point. Shouldn’t he have said “w, b” instead?

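Hi @paul2048 and welcome to Discourse. The cost function `J` is a function of the weights `w` and bias `b`. On the surface plot, `w` and `b` are the horizontal axes, and the height of the surface at a point (`w`, `b`) is the value of `J` there. You want to find the combination of `w` and `b` that minimizes `J`, i.e. the lowest point on the surface. This is what is explained in the video you referenced.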
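If it helps to see this concretely, here is a minimal sketch in plain Python. It assumes a one-feature linear model with a squared-error cost, which may not match the exact setup in the video; the toy data and the names `cost` and `gradient_descent` are made up for illustration. The point is only that `w` and `b` are the inputs, and `J` is the height you get back.

```python
# Hypothetical toy data lying exactly on y = 2x + 1 (not from the course).
xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.0, 5.0, 7.0, 9.0]


def cost(w, b):
    """Squared-error cost J(w, b): the 'height' of the surface at the point (w, b)."""
    m = len(xs)
    return sum((w * x + b - y) ** 2 for x, y in zip(xs, ys)) / (2 * m)


def gradient_descent(w, b, alpha=0.05, steps=2000):
    """Repeatedly step (w, b) downhill on the surface J(w, b)."""
    m = len(xs)
    for _ in range(steps):
        errors = [w * x + b - y for x, y in zip(xs, ys)]
        dj_dw = sum(e * x for e, x in zip(errors, xs)) / m  # partial derivative of J w.r.t. w
        dj_db = sum(errors) / m                             # partial derivative of J w.r.t. b
        w, b = w - alpha * dj_dw, b - alpha * dj_db         # simultaneous update
    return w, b


w_fit, b_fit = gradient_descent(0.0, 0.0)
print("J at the start:", cost(0.0, 0.0))
print("J after descent:", cost(w_fit, b_fit))
print("w, b after descent:", w_fit, b_fit)
```

Each call to `cost(w, b)` returns one height on the surface. Gradient descent only ever changes `w` and `b`; `J` is what gets smaller as a result.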

