Derivative of "Simplified Cost Function"

In the workbook C1_W3_Lab06_Gradient_Descent_Soln, the instructor explains that w_j and b are updated like this:
w_j = w_j - \alpha \frac{\partial J(\mathbf{w},b)}{\partial w_j}

b = b - \alpha \frac{\partial J(\mathbf{w},b)}{\partial b}

The corresponding code looks pretty much like the code that computes the gradients for linear regression, but with an extra call to the sigmoid function. Note that the original linear regression code assumes that J(w,b) is the squared error (least squares) cost.

    m, n = X.shape                                    #number of examples, features
    dj_dw = np.zeros((n,))                            #(n,)
    dj_db = 0.                                        #scalar
    for i in range(m):
        f_wb_i = sigmoid(np.dot(X[i],w) + b)          #(n,)(n,)=scalar
        err_i  = f_wb_i  - y[i]                       #scalar
        for j in range(n):
            dj_dw[j] = dj_dw[j] + err_i * X[i,j]      #scalar
        dj_db = dj_db + err_i                         #scalar
    dj_dw = dj_dw/m                                   #(n,)
    dj_db = dj_db/m                                   #scalar
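
For context, here is a minimal, self-contained sketch of how those gradients feed the update rules quoted at the top. The function names and signatures below (sigmoid, compute_gradient_logistic, gradient_descent) are illustrative assumptions rather than the exact helpers from the lab:

    import numpy as np

    def sigmoid(z):
        # logistic function, applied elementwise
        return 1.0 / (1.0 + np.exp(-z))

    def compute_gradient_logistic(X, y, w, b):
        # same computation as the loop quoted above, wrapped in a function
        m, n = X.shape
        dj_dw = np.zeros((n,))
        dj_db = 0.0
        for i in range(m):
            err_i = sigmoid(np.dot(X[i], w) + b) - y[i]
            dj_dw = dj_dw + err_i * X[i]
            dj_db = dj_db + err_i
        return dj_db / m, dj_dw / m

    def gradient_descent(X, y, w_init, b_init, alpha, num_iters):
        # repeatedly applies w_j = w_j - alpha * dJ/dw_j and b = b - alpha * dJ/db
        w, b = w_init.copy(), b_init
        for _ in range(num_iters):
            dj_db, dj_dw = compute_gradient_logistic(X, y, w, b)
            w = w - alpha * dj_dw
            b = b - alpha * dj_db
        return w, b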

However, the compute_cost_logistic function uses a different cost function, which is shown in the video "Simplified Cost Function for Logistic Regression". The video does not explain what \frac{\partial{J(w,b)}}{\partial{w}} and \frac{\partial{J(w,b)}}{\partial{b}} look like in this case.
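
For reference, the cost function from that video is the average cross-entropy (log) loss. In the course's notation, with f_{\mathbf{w},b} denoting the sigmoid model output, it is:

f_{\mathbf{w},b}(\mathbf{x}^{(i)}) = \sigma(\mathbf{w} \cdot \mathbf{x}^{(i)} + b) = \frac{1}{1 + e^{-(\mathbf{w} \cdot \mathbf{x}^{(i)} + b)}}

J(\mathbf{w},b) = -\frac{1}{m} \sum_{i=1}^{m} \left[ y^{(i)} \log\left(f_{\mathbf{w},b}(\mathbf{x}^{(i)})\right) + \left(1 - y^{(i)}\right) \log\left(1 - f_{\mathbf{w},b}(\mathbf{x}^{(i)})\right) \right]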

Here is the derivative \frac{\partial{J(w,b)}}{\partial{w}} as I worked it out; it is not equivalent to the code listed above:

Please explain why the example code does not use the actual derivative. What am I missing?

Hi @Arne_Schirmacher

Please refer to the following post for the derivation steps, which show that the gradients for linear regression and logistic regression do indeed look the same.

In particular, if you look at the second table, you will see clearly that the denominator term which appears in the logistic regression gradient because of the log loss is canceled out by the derivative of the sigmoid function. After that cancellation, the gradient expressions for linear regression and logistic regression no longer look any different.
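
To sketch that cancellation for a single training example (this is a summary, not the exact table from the linked post), write z = \mathbf{w} \cdot \mathbf{x} + b, f = \sigma(z), and the loss L = -y \log(f) - (1-y) \log(1-f). The chain rule gives:

\frac{\partial L}{\partial f} = -\frac{y}{f} + \frac{1-y}{1-f} = \frac{f - y}{f(1-f)}

\frac{\partial f}{\partial z} = \sigma(z)\left(1 - \sigma(z)\right) = f(1-f)

\frac{\partial L}{\partial z} = \frac{\partial L}{\partial f} \cdot \frac{\partial f}{\partial z} = \frac{f - y}{f(1-f)} \cdot f(1-f) = f - y

\frac{\partial L}{\partial w_j} = (f - y)\,x_j, \qquad \frac{\partial L}{\partial b} = f - y

Averaging these over the m training examples gives exactly the err_i * X[i,j] and err_i accumulations in the code quoted earlier, so the gradient of the logistic (cross-entropy) cost ends up looking the same as the linear regression gradient, just with the sigmoid inside the prediction.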


Cheers,
Raymond