Since the input training examples x^{(i)} are listed as rows of discrete values, one per row, shouldn't the plot of the logistic regression loss function also be a plot of discrete values of L, one for each x^{(i)}, rather than a plot of a continuously-valued function against a continuously-valued input?
Hi @ai_is_cool
Can you walk us through the steps of the reasoning that support your conclusion?
Hi @Kic,
I should have said in my previous post that the loss function is plotted against w_j and b, which do take on only a limited set of values, since each weight parameter w_j and the bias b are updated once per iteration.
However, it is useful to see the “shape” of the loss function for ANY value of input.
Thanks.
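To make that concrete, here is a minimal sketch that evaluates the logistic-regression cost J on a dense grid of w values while holding b fixed; the toy data, the fixed bias, and the grid range are all my own illustrative assumptions, not values from the course. The resulting curve is smooth even though the training examples themselves are discrete rows:

```python
import numpy as np

# Hypothetical toy data (e.g. tumor size vs. malignant label), for illustration only.
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([0, 0, 0, 1, 1])

def cost(w, b):
    """Logistic-regression cost J(w, b) averaged over all m examples."""
    f = 1.0 / (1.0 + np.exp(-(w * x + b)))   # sigmoid of the linear model
    f = np.clip(f, 1e-12, 1 - 1e-12)         # avoid log(0) at extreme w values
    return np.mean(-y * np.log(f) - (1 - y) * np.log(1 - f))

b_fixed = -3.0                                # hold b constant, vary w
w_grid = np.linspace(-2.0, 4.0, 200)          # dense grid of w values
J_grid = [cost(w, b_fixed) for w in w_grid]
# Plotting J_grid against w_grid shows a smooth curve, not discrete points.
```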
You might do something like the example below to find the shape analytically. In that example, three pairs of (w, J) are enough to solve for all of the coefficients. Alternatively, we can derive the coefficients analytically in terms of x^{(i)} and y^{(i)} and compute them, or we can interpolate. Linear interpolation certainly introduces error, because it only gives us a piecewise-linear approximation.
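As one possible reading of the "3 pairs of (w, J)" idea: if we assume a squared-error cost for a single-feature linear model, J(w) is exactly quadratic in w, so it has three coefficients and three sample pairs pin it down exactly. The data, sample points, and function names below are hypothetical:

```python
import numpy as np

# Toy data; squared-error cost J(w) = (1/2m) * sum((w*x - y)^2) is quadratic in w,
# so three (w, J) pairs determine its coefficients exactly.
x = np.array([1.0, 2.0, 3.0])
y = np.array([1.0, 2.0, 3.0])

def J(w):
    return np.mean((w * x - y) ** 2) / 2.0

w_samples = np.array([0.0, 1.0, 2.0])          # any three distinct w values
J_samples = np.array([J(w) for w in w_samples])

# Solve for a, b, c in J(w) = a*w^2 + b*w + c via a 3x3 linear system.
A = np.vstack([w_samples**2, w_samples, np.ones_like(w_samples)]).T
a, b, c = np.linalg.solve(A, J_samples)
print(a, b, c)                                  # recovers the exact quadratic

# Linear interpolation between the same samples would instead give a
# piecewise-linear approximation, with error between the sample points.
```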
Hi @ai_is_cool,
If you are referring to the cancer tumor size, age, etc. as the input training examples seen in the following screenshot, then please be aware that this data set is for demonstration purposes and just happens to contain whole numbers. In the real world, the input training data can be represented as floats, or as integers for discrete whole numbers.
Let’s have a look at the loss function from this screenshot:
The loss for each data point (example) is not discrete, because the output of f(x) is a probability value, which is a float. The loss curve is plotted from the loss of each data point.
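As a quick illustration (with made-up probabilities f(x), not values from the screenshot), the per-example logistic loss -y log(f) - (1-y) log(1-f) is a continuous float for every example:

```python
import numpy as np

# Hypothetical predictions f(x) for five examples; f(x) is a probability (a float),
# so the loss of each example is also a continuous value.
f = np.array([0.10, 0.35, 0.62, 0.80, 0.95])
y = np.array([0, 0, 1, 1, 1])

loss = -y * np.log(f) - (1 - y) * np.log(1 - f)  # per-example logistic loss
print(loss)  # approximately [0.105, 0.431, 0.478, 0.223, 0.051]
```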
During training, the model goes through a number of iterations over the whole training set, so we use the cost curve rather than the loss curve to get an idea of how the model is doing. If the cost keeps decreasing, we know the model is on track to find the w and b at which the cost is at its minimum.
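A rough sketch of what recording that cost curve looks like in code, assuming plain batch gradient descent on toy data (the data, learning rate, and iteration count are all illustrative choices of mine):

```python
import numpy as np

# Minimal gradient-descent sketch that records the cost at every iteration
# so it can be plotted as a cost curve afterwards.
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([0, 0, 0, 1, 1])
w, b, alpha = 0.0, 0.0, 0.1

cost_history = []
for _ in range(100):
    f = 1.0 / (1.0 + np.exp(-(w * x + b)))   # predictions for all examples
    cost_history.append(np.mean(-y * np.log(f) - (1 - y) * np.log(1 - f)))
    w -= alpha * np.mean((f - y) * x)        # gradient of J w.r.t. w
    b -= alpha * np.mean(f - y)              # gradient of J w.r.t. b
# A steadily decreasing cost_history indicates training is on track.
```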