This might be a silly question, but why do we need a for loop over the layers? Why don't we just write down all the equations for each layer like in the slide?
How many layers are there in your network? That’s the point. If you knew that you always had a fixed number, then you could just write them out explicitly. Of course if the number is > 3, that’s going to be more code than just writing a loop. But here are a couple of points to keep in mind:
- Networks typically have way more than 3 or 4 layers. What we are seeing here in Course 1 are essentially simple "toy" examples. You'll see more realistic models when you get further along in Course 2 and Course 4.
- The number of layers is not fixed: different problems require different network architectures. Our goal is to write general code that works in all cases: we can just pass the number of layers and the number of neurons in each layer as arguments to the function, and we don't have to rewrite the core forward and backward propagation code each time (see the sketch after this list).
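To make that concrete, here is a minimal sketch of what the loop-based forward pass looks like for an arbitrary number of layers. The function name, the `parameters` dictionary layout, and the choice of ReLU hidden layers with a sigmoid output are illustrative assumptions on my part, not the exact code from the assignment:

```python
import numpy as np

def relu(Z):
    return np.maximum(0, Z)

def sigmoid(Z):
    return 1 / (1 + np.exp(-Z))

def forward_pass(X, parameters):
    """Forward propagation for any number of layers.

    parameters holds "W1", "b1", ..., "WL", "bL"; hidden layers
    use ReLU and the output layer uses sigmoid (an assumption here).
    """
    L = len(parameters) // 2   # two arrays (W, b) per layer
    A = X
    for l in range(1, L):      # hidden layers 1 .. L-1
        Z = parameters["W" + str(l)] @ A + parameters["b" + str(l)]
        A = relu(Z)
    ZL = parameters["W" + str(L)] @ A + parameters["b" + str(L)]
    return sigmoid(ZL)

# Example: a 4-layer network with a 12,288-dimensional input,
# 7 units in each hidden layer, and 1 output unit.
layer_dims = [12288, 7, 7, 7, 1]
parameters = {}
for l in range(1, len(layer_dims)):
    parameters["W" + str(l)] = np.random.randn(layer_dims[l], layer_dims[l - 1]) * 0.01
    parameters["b" + str(l)] = np.zeros((layer_dims[l], 1))

X = np.random.randn(12288, 5)              # 5 examples stacked as columns
print(forward_pass(X, parameters).shape)   # (1, 5)
```

Nothing in the loop itself depends on the depth: to change the architecture you only change `layer_dims`, not the propagation code.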
Hi, @Marios_Constantinou. Not silly at all – fundamental, really! If I understand your question correctly, you are asking why we don't combine all of the individual layers into one mega-function by repeated substitution. Grab a pencil and your paper pad and go ahead. At the end of the exercise you'll have an expression for that function; let's call it \Lambda(X) (for large!). Then you can just minimize the cost function over all of the parameters in this function using some numerical optimizer (based on gradient descent).
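For concreteness, that repeated substitution gives something like the following (using the course's notation, with g^{[l]} the activation function of layer l):

$$
\Lambda(X) \;=\; A^{[L]} \;=\; g^{[L]}\!\Big(W^{[L]}\, g^{[L-1]}\big(\cdots\, g^{[1]}\!\big(W^{[1]} X + b^{[1]}\big) \cdots\big) + b^{[L]}\Big)
$$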
Now make up some numbers for the dimensions of X, the number of layers L, the number of nodes in each layer, etc., and compute the number of parameters of this function, i.e. the number of elements in all of the combined W^{[l]}'s and b^{[l]}'s. (E.g. 4 layers, 7 nodes per hidden layer, with X a 12,288-dimensional vector.) Huge number, right? (For a realistically-sized network, at least.) Now find a computer that can do the required optimization before 2050. Expensive! In the lingo of mathematics and computer science, you have encountered the "curse of dimensionality." There be dragons there!
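To put a number on that toy example (assuming a single output unit, which isn't specified above): with layer sizes n^{[0]} = 12288, n^{[1]} = n^{[2]} = n^{[3]} = 7, n^{[4]} = 1, the count is

$$
\begin{aligned}
\#\text{params} &= \sum_{l=1}^{4}\big(n^{[l]}\,n^{[l-1]} + n^{[l]}\big)\\
&= (7\cdot 12288 + 7) + (7\cdot 7 + 7) + (7\cdot 7 + 7) + (1\cdot 7 + 1)\\
&= 86023 + 56 + 56 + 8 \;=\; 86143.
\end{aligned}
$$

Even this toy network has tens of thousands of parameters, and the count grows quickly once the hidden layers have realistic widths.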
And now (for the second time today) I get to say that forward/backward propagation is the fundamental technology that makes deep learning possible.
And if I misunderstood your question: I just noticed that @paulinpaloalto weighed in in the meantime!
Oh yeah, I took the video example too literally. What you said makes sense, thanks!
Yeah, @paulinpaloalto covered my question perfectly, but your explanation was also insightful because I never thought of it that way. Thank you for the response!