week-module-1
I am watching "Supervised Machine Learning: Regression and Classification > Module 1 > Cost function formula"
and have a question: why does the cost function take the square of the error?
If the goal is just to make the error always positive, why not take the absolute value instead?
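For reference, the cost function from the video, as I understand it, is:

$$J(w,b) = \frac{1}{2m}\sum_{i=1}^{m}\left(f_{w,b}\big(x^{(i)}\big) - y^{(i)}\right)^2$$

and my question is about the squaring of the term $f_{w,b}(x^{(i)}) - y^{(i)}$.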
That is a valid method, and it is sometimes used (a cost built from absolute values is called the mean absolute error).
Its drawback appears when we compute the partial derivatives of the cost (i.e., the gradients) for gradient descent: the absolute value's derivative is undefined at 0, while the squared error is smooth everywhere.
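To see the difference concretely, here is a minimal NumPy sketch (not from the course; the variable names are my own) that evaluates both losses and their derivatives at a few error values:

```python
import numpy as np

# Minimal sketch (not from the course): compare squared error and absolute
# error, and their derivatives with respect to the error e = prediction - target.
errors = np.array([-2.0, -0.5, 0.0, 0.5, 2.0])

squared_loss = errors ** 2    # derivative: 2 * e, defined and smooth everywhere
abs_loss = np.abs(errors)     # derivative: sign(e), undefined at e = 0

d_squared = 2 * errors        # shrinks smoothly to 0 as the error shrinks
d_abs = np.sign(errors)       # always -1 or +1; np.sign(0) returns 0, but
                              # mathematically there is no single slope at 0

for e, ds, da in zip(errors, d_squared, d_abs):
    print(f"error={e:+.1f}  d(squared)/de={ds:+.1f}  d(abs)/de={da:+.1f}")
```

Near the minimum the squared loss's gradient fades smoothly to zero, so gradient descent settles; the absolute loss's gradient keeps magnitude 1 right up to the minimum and then flips sign, which can make plain gradient descent overshoot back and forth.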
Thank you for your answer! So it can actually be used. I am not very familiar with derivatives, so I can't easily follow "the absolute value's derivative at 0 is undefined", but thanks anyway! I am now watching the next video, "Cost function intuition".
If the error is small (like 0.1), squaring it makes it even smaller (0.01), but if it’s large (like 10), it becomes much larger (100). This helps the model focus more on fixing big errors.
It is important to focus on big errors: if your target value is 2 and your model predicts 1.98, the prediction is almost perfect, but if your target value is 10 and your model predicts 7, the model is way off.
Then during gradient descent, when we take the derivative, large errors produce larger gradients, so the model updates its parameters more aggressively for them. In this way, larger errors are weighted more heavily than smaller ones.
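As a rough illustration (hypothetical numbers, reusing the targets from the example above), here is how the gradient of the squared error scales with the size of the error:

```python
import numpy as np

# Minimal sketch (hypothetical numbers): the gradient of the squared error
# 0.5 * (prediction - target)^2 with respect to the prediction is simply
# (prediction - target), i.e. the error itself.
targets = np.array([2.0, 10.0])
predictions = np.array([1.98, 7.0])

errors = predictions - targets   # about -0.02 and exactly -3.0
gradients = errors               # d/dprediction of 0.5 * error^2 = error

print(gradients)                           # [-0.02 -3.  ]
print(abs(gradients[1] / gradients[0]))    # roughly 150: the big error
                                           # dominates the update
```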