Hi,
I’ve submitted the C1_W2_Linear_Regression lab and would like to share with you some thoughts about the approach I’ve taken.
As we learnt in weeks 1 and 2, the predictions of the linear regression model can be calculated with a for loop as:
import numpy as np

m = x.shape[0]
f_wb = np.zeros(m)  # preallocate the predictions array
for i in range(m):
    f_wb[i] = w * x[i] + b
Or, more efficiently, with the NumPy vectorised function np.dot as:
f_wb = np.dot(x, w) + b
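As a quick sanity check, here is a small sketch (the values of x, w and b are made up purely for illustration) showing that both versions produce the same predictions:

# Made-up example data: 3 training examples, scalar parameters
x = np.array([1.0, 2.0, 3.0])
w = 2.0
b = 0.5

# For-loop version
m = x.shape[0]
f_wb_loop = np.zeros(m)
for i in range(m):
    f_wb_loop[i] = w * x[i] + b

# Vectorised version
f_wb_vec = np.dot(x, w) + b

print(np.allclose(f_wb_loop, f_wb_vec))  # prints True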
The cost function, or mean squared error (MSE), is calculated as:
cost = 0
for i in range(m):
    f_wb = np.dot(x[i], w) + b   # prediction for example i
    cost += (f_wb - y[i]) ** 2
total_cost = cost / (2 * m)
By using another vectorised function, np.sum, we no longer need to calculate the summation in a for loop, which makes the code more efficient and easier to write and read.
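For example, reusing the made-up x, w, b and m from the check above and a made-up target vector y, a vectorised cost could look like this:

y = np.array([2.4, 4.6, 6.4])  # made-up targets, purely for illustration

# Predictions for all m examples at once
f_wb = np.dot(x, w) + b
# np.sum replaces the summation loop over the squared errors
total_cost = np.sum((f_wb - y) ** 2) / (2 * m)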
Lastly, the gradients for gradient descent are calculated as:
m, n = x.shape
dj_dw = np.zeros(n)  # gradient with respect to each weight
dj_db = 0.0          # gradient with respect to the bias
for i in range(m):
    err = (np.dot(x[i], w) + b) - y[i]
    for j in range(n):
        dj_dw[j] += err * x[i, j]
    dj_db += err
dj_dw = dj_dw / m
dj_db = dj_db / m
Let’s see how we can get rid of each for loop:
- The for loop for the gradient of b, for i in range(m): dj_db += err, is replaced by the vectorised sum, as mentioned previously.
- The nested for loop for the gradient of w, for i in range(m): for j in range(n): dj_dw[j] += err * x[i, j], is replaced by the dot product of the transpose of \mathbf{x} and the error \epsilon.
By following these steps, we can calculate the gradients with just a few vectorised operations, as sketched below.
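Here is a minimal sketch of the fully vectorised gradients, assuming x is now the (m, n) feature matrix, y the target vector, and w, b the current parameters:

m, n = x.shape

# Error vector for all m examples at once
err = np.dot(x, w) + b - y    # shape (m,)

# The nested loop becomes a dot product with the transpose of x,
# and the summation for b becomes np.sum
dj_dw = np.dot(x.T, err) / m  # shape (n,)
dj_db = np.sum(err) / m       # scalar

These produce exactly the same values as the double for loop, just computed in a single pass by NumPy.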
I’ve done more than the exercise asked for, but I’ve learnt a lot of new things along the way.