C2_W4_assignment Gradient descent question

PZ2004 · September 21, 2023, 8:32pm

Dear AI experts:
I passed the assignment. But I have a question about the gradient descent part of the assignment code.

In the assignment the codes (pasted below without answers) seems to be calculating one gradient descent and updating the W1 and W2 for each batch of data x,y. As new batches of data comes in, the process is repeated as driven by the for loop: “for x, y in get_batches(data, word2Ind, V, C, batch_size):”.

I wonder if through out the entire training data, the beginning of the data is very different from the end of the data, e.g. in contents, formats, syntax, etc, for example beginning are all novels and endings are all poems, would w1 and w2 migrate with the data change without true convergence? My previous understanding of training/model converging, if I recall correctly, is to do a round of gradient descent for the entire dataset, then do it again and again to reach a true global minimum. Am I not understanding the process correctly? Can you elaborate?

for x, y in get_batches(data, word2Ind, V, C, batch_size):
### START CODE HERE (Replace instances of ‘None’ with your own code) ###
# get z and h
z, h =

    # get yhat
    yhat = 
    
    # get cost
    cost = 
    if ( (iters+1) % 10 == 0):
        print(f"iters: {iters + 1} cost: {cost:.6f}")
        
    # get gradients
    grad_W1, grad_W2, grad_b1, grad_b2 = 
    
    # update weights and biases
    W1 = 
    W2 = 
    b1 = 
    b2 = 

    ### END CODE HERE ###
    iters +=1 
    if iters == num_iters: 
        break
    if iters % 100 == 0:
        alpha *= 0.66

TMosh · September 21, 2023, 10:38pm

Which course are you attending? You posted in the “General Discussions” forum.

You can move your thread to the correct forum by using the “pencil” icon in the thread title.

elirod · September 21, 2023, 11:07pm

Hi @PZ2004

Welcome to the community.

Don’t forget to post your queries on the right category. This is the only way mentors be aware of your issue and support you.

Don’t forget to Check the guidelines as well.

Best regards

Topic		Replies	Views
C2_W4_Exercise 5 - gradient_descent NLP with Probabilistic Models week-4	8	21	August 30, 2024
Week 3: Exercise 3 Supervised ML: Regression and Classification week-3	3	567	July 13, 2022
C1_W1_lab05: Linear regression code questions Supervised ML: Regression and Classification week-1	4	631	September 21, 2022
Doubts in ML program for gradient descent AI Discussions ai-question	5	121	April 15, 2024
C2_W4 assignemment Exercice 5:gradient descent / gradient_descent() function problem NLP with Probabilistic Models week-4	1	15	March 30, 2025

C2_W4_assignment Gradient descent question

Related topics