Hey guys,
I have a small question about this assignment, just out of curiosity. In the L_model_backward function, we use the linear_activation_backward function to calculate the gradients and store them in a dictionary, grads, which we then pass to the update_parameters function to update the parameters.
Instead of this approach, why don't we update the parameters directly inside the L_model_backward function? That way we wouldn't have to store the gradients in a dictionary at all (which could substantially reduce memory overhead), and we would also avoid iterating over all the layers a second time (which could save some computation time). A rough sketch of what I mean is below.
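This is only an illustrative sketch, not the actual assignment code: the real functions take more arguments and handle the caches differently, and the function name and learning_rate parameter here are just mine. It only shows the control flow of fusing the gradient computation and the parameter update into one backward loop.

```python
import numpy as np

def L_model_backward_and_update(AL, Y, caches, parameters, learning_rate):
    """Hypothetical fused version: compute each layer's gradients and apply
    the update immediately, instead of collecting them in a grads dict."""
    L = len(caches)  # number of layers
    # derivative of the cross-entropy cost with respect to AL
    dA = -(np.divide(Y, AL) - np.divide(1 - Y, 1 - AL))

    for l in reversed(range(1, L + 1)):
        # gradients for layer l (what linear_activation_backward already does)
        dA, dW, db = linear_activation_backward(
            dA, caches[l - 1],
            activation="sigmoid" if l == L else "relu")
        # ...and update this layer's parameters right away
        parameters["W" + str(l)] -= learning_rate * dW
        parameters["b" + str(l)] -= learning_rate * db

    return parameters
```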
Is this done in the assignment to make the code more modular and easier to understand, or is there something I am missing that prevents us from updating the parameters at the same time as we calculate the gradients?
Hi, @Elemento. I am not a coding-efficiency expert, but what you say makes sense. And you answered the question yourself. The assignments are written with the educational task paramount. We want learners to see the techniques broken down into their logical components so that they can be examined (i.e. understood) separately and as part of the whole, i.e. the “modularity” to which you refer. Of course, the code should be (and is, hopefully) as clean as is reasonable given that constraint. And that means it will not always be “Pythonic.” For example, you haven’t seen any list or dict comprehensions, have you?
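To give a sense of that trade-off, here is a small, made-up illustration (the layer sizes are arbitrary and this is not taken from any assignment): the explicit loop spells out every step, while the comprehension is shorter but harder to follow when you are still learning the algorithm.

```python
import numpy as np

layer_dims = [5, 4, 3]  # illustrative layer sizes, not from the assignment

# Explicit-loop style, as used in the assignments -- easy to step through:
parameters = {}
for l in range(1, len(layer_dims)):
    parameters["W" + str(l)] = np.random.randn(layer_dims[l], layer_dims[l - 1]) * 0.01
    parameters["b" + str(l)] = np.zeros((layer_dims[l], 1))

# A more "Pythonic" dict-comprehension version of the same initialization:
parameters = {
    key: (np.random.randn(layer_dims[l], layer_dims[l - 1]) * 0.01
          if key.startswith("W")
          else np.zeros((layer_dims[l], 1)))
    for l in range(1, len(layer_dims))
    for key in ("W" + str(l), "b" + str(l))
}
```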
Good insights from both contributors on this thread. My experience with this material is that the initial exercises go to some trouble to expose the details of each step and computation. Next, we learn that you can leverage vectorized (matrix-native) NumPy operations to do away with explicit loops. Then, when using frameworks like TensorFlow, Keras, or PyTorch, you’ll find that the machinery of the matrix math is provided for you in highly optimized code that also natively supports distributed computing. The explicit for-loops, caching, separation of forward and backward propagation, etc. all go away or get completely encapsulated. The code you would use for a production deep learning project looks substantially different from what you see early in these classes.
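Just to illustrate that last point, here is a toy Keras sketch (the layer sizes and random data are made up, not from any assignment). The forward pass, backward pass, gradient storage, and parameter updates are all hidden behind compile() and fit():

```python
import numpy as np
import tensorflow as tf

# Toy data: 100 examples with 20 features, binary labels
X = np.random.rand(100, 20).astype("float32")
Y = np.random.randint(0, 2, size=(100, 1))

model = tf.keras.Sequential([
    tf.keras.Input(shape=(20,)),
    tf.keras.layers.Dense(7, activation="relu"),
    tf.keras.layers.Dense(5, activation="relu"),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])

# No explicit L_model_backward, grads dictionary, or update_parameters loop:
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
model.fit(X, Y, epochs=10, batch_size=32)
```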