Visualising the cost function

The contour plot of J has concentric ovals. The minimum is at the centre of the ovals, and the smallest oval is closest to our goal. Should the centre be a dot that is also part of the contour plot of J? Or does linear regression have multiple solutions for w and b (which would mean different lines f(x)) that correspond to the smallest oval?

Hi @Adeel_Khan1,
The center should be a dot, just as you initially thought. For basic linear regression with the squared-error cost, the cost function is convex, so there is only one global minimum (the overall minimum cost). The smallest oval is the one closest to that global minimum, and the center is the global minimum itself. So only one line (one pair of w and b) corresponds to the global minimum.
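
To make this concrete, here is a minimal sketch (my own illustration, not from the course; it assumes NumPy/Matplotlib and a made-up toy dataset where y = 2x) that evaluates J(w, b) on a grid, draws the contour plot, and marks the single centre of the ovals, which is the global minimum:

```python
import numpy as np
import matplotlib.pyplot as plt

# Toy dataset (made up for illustration): exactly y = 2x, so the minimum is at w=2, b=0
x = np.array([1.0, 2.0, 3.0, 4.0])
y = np.array([2.0, 4.0, 6.0, 8.0])

def cost(w, b, x, y):
    """Mean squared error cost J(w, b) for the model f(x) = w*x + b."""
    m = len(x)
    return np.sum((w * x + b - y) ** 2) / (2 * m)

# Evaluate J over a grid of (w, b) values
w_vals = np.linspace(0, 4, 100)
b_vals = np.linspace(-4, 4, 100)
W, B = np.meshgrid(w_vals, b_vals)
J = np.array([[cost(w, b, x, y) for w in w_vals] for b in b_vals])

# Contour plot: concentric ovals around the single global minimum
plt.contour(W, B, J, levels=30)
plt.plot(2.0, 0.0, 'rx')  # mark the minimum explicitly as a point
plt.xlabel('w')
plt.ylabel('b')
plt.title('Contours of J(w, b)')
plt.show()
```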

Thanks for the reply, which helped me understand that there is only one global minimum. Could you please also tell me why a dot is not marked on the contour plot of J if that's where the minimum lies? Is it just a convention, or is there a reason behind it?

I think the focus of the plot is to show the shape of the cost function and how we can optimize it, without necessarily worrying about the exact final solution (the minimum).

Got it! Thanks @lukmanaj

I have another related question, please. Would finding the centre of this contour plot of J locate the minimum and give us the corresponding values of w and b? And would that be a viable method for finding the minimum?

Reading the minimum off a contour plot only works when there are exactly two parameters to visualise (w and b); real models have many more weights and biases, so there is no plot whose centre we could simply find. In the context of machine learning models, directly solving equations for the weights (w) and biases (b) is also often not feasible: it can be computationally intensive, leading to inefficiencies and scalability issues.

Instead, we commonly employ gradient descent, a more efficient algorithm for such tasks. The weights and biases are initially set either randomly or to zero (as seen in simple linear regression models) and are then refined by iteratively updating them based on the cost function, which measures the model's error. Gradient descent repeats these updates to minimize the cost. The optimal weights and biases are reached when further adjustments no longer significantly reduce the cost, indicating that the lowest possible error, i.e. the minimum of the cost function, has been reached.
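
As a rough sketch of that process (not the course's exact implementation; the toy data, learning rate, and iteration count are arbitrary choices for illustration), batch gradient descent for linear regression could look like this:

```python
import numpy as np

# Toy dataset (made up for illustration): exactly y = 2x
x = np.array([1.0, 2.0, 3.0, 4.0])
y = np.array([2.0, 4.0, 6.0, 8.0])

def gradients(w, b, x, y):
    """Partial derivatives of the mean squared error cost J(w, b)."""
    m = len(x)
    err = w * x + b - y
    dj_dw = np.sum(err * x) / m
    dj_db = np.sum(err) / m
    return dj_dw, dj_db

# Initialise w and b to zero, as in the simple linear regression case
w, b = 0.0, 0.0
alpha = 0.01              # learning rate (arbitrary choice)
for _ in range(10_000):
    dj_dw, dj_db = gradients(w, b, x, y)
    # Simultaneous update: take a step downhill on the cost surface
    w -= alpha * dj_dw
    b -= alpha * dj_db

print(f"w = {w:.3f}, b = {b:.3f}")  # should approach w ≈ 2, b ≈ 0
```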

Thank you, I hope this will be useful in many ways.
I have just finished my first week of machine learning and am waiting for the second week.
