Where is the 1/m in the `compute_gradient` function in "Optional Lab - Gradient Descent"?

My question is about the "Optional Lab - Gradient Descent" in the Machine Learning Specialization.

In the lab, the function `compute_gradient` is supposed to compute the gradient. The formulas are given as

\begin{align}
\frac{\partial J(w,b)}{\partial w} &= \frac{1}{m} \sum\limits_{i = 0}^{m-1} (f_{w,b}(x^{(i)}) - y^{(i)})x^{(i)} \tag{4}\\
\frac{\partial J(w,b)}{\partial b} &= \frac{1}{m} \sum\limits_{i = 0}^{m-1} (f_{w,b}(x^{(i)}) - y^{(i)}) \tag{5}
\end{align}

The code is defined as follows:

def compute_gradient(x, y, w, b): 
    """
    Computes the gradient for linear regression 
    Args:
      x (ndarray (m,)): Data, m examples 
      y (ndarray (m,)): target values
      w,b (scalar)    : model parameters  
    Returns
      dj_dw (scalar): The gradient of the cost w.r.t. the parameter w
      dj_db (scalar): The gradient of the cost w.r.t. the parameter b     
     """
    
    # Number of training examples
    m = x.shape[0]    
    dj_dw = 0
    dj_db = 0
    
    for i in range(m):  
        f_wb = w * x[i] + b # linear regression
        dj_dw_i = (f_wb - y[i]) * x[i] 
        dj_db_i = f_wb - y[i] 
        dj_db += dj_db_i
        dj_dw += dj_dw_i 
    dj_dw = dj_dw / m # what's happening here?
    dj_db = dj_db / m # what's happening here?
        
    return dj_dw, dj_db

Where is the 1/m from formulas (4) and (5)? The code implementation looks different from the formulas, and I'm confused about this step. Thanks for any help!

The 1/m in the gradients comes from the 1/(2m) scaling factor that is part of the squared-error cost function: differentiating the squared term brings down a factor of 2, which cancels the 2 in the denominator and leaves 1/m.
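
Written out, using the squared-error cost from the same lab:

\begin{align}
J(w,b) &= \frac{1}{2m} \sum\limits_{i = 0}^{m-1} (f_{w,b}(x^{(i)}) - y^{(i)})^2 \\
\frac{\partial J(w,b)}{\partial w} &= \frac{1}{2m} \sum\limits_{i = 0}^{m-1} 2\,(f_{w,b}(x^{(i)}) - y^{(i)})\,x^{(i)} = \frac{1}{m} \sum\limits_{i = 0}^{m-1} (f_{w,b}(x^{(i)}) - y^{(i)})\,x^{(i)}
\end{align}

The 2 from the chain rule cancels the 2 in 1/(2m), which is why only 1/m remains in formulas (4) and (5).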

Hello @MrBlobby1999,

Here it is: the two lines after the loop, `dj_dw = dj_dw / m` and `dj_db = dj_db / m`. Dividing the accumulated sums by m is the same as multiplying each sum by 1/m, so the code implements the formulas exactly. :wink:
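
If you want to convince yourself numerically, here is a quick sketch (the data values below are made up, just for the comparison): a vectorized version that applies the 1/m up front via np.mean gives the same numbers as the loop-then-divide version from the lab.

import numpy as np

def compute_gradient_vectorized(x, y, w, b):
    # error term (f_wb - y) for all examples at once
    err = (w * x + b) - y
    # np.mean performs the sum and the 1/m in a single step
    dj_dw = np.mean(err * x)
    dj_db = np.mean(err)
    return dj_dw, dj_db

# made-up data, only for comparing the two implementations
x_tmp = np.array([1.0, 2.0, 3.0])
y_tmp = np.array([2.0, 2.5, 3.5])
print(compute_gradient(x_tmp, y_tmp, 0.5, 1.0))             # loop version from the lab
print(compute_gradient_vectorized(x_tmp, y_tmp, 0.5, 1.0))  # same result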

Cheers,
Raymond


I get it now. Thanks @rmwkwok @TMosh
