Hello
When we train a NN with Dense layers, how exactly does it work?
I.e. in each iteration:
a) do we minimize the cost function J for each layer separately, one layer after another? I.e., minimize J for Layer 1, then minimize J for Layer 2, etc.?
or
b) do we minimize the cost function J across all units and layers in each pass? I.e., calculate the cost for all units and layers with one set of weights, then in the second iteration, calculate the cost with updated sets of weights for each layer and see if J is improving, etc.?
I think the answer is (b), but for some reason it’s not clear to me.
Thank you!
Hello @Svetlana_Verthein,
If I understand you correctly, my answer is (b) too.
Once the forward pass is completed, we can compute the gradients of all weights in all layers and units. In practice, of course, we don’t compute all the gradients simultaneously. The more efficient way is to first compute the gradients of the weights in the L-th layer, then the gradients of the weights in the (L-1)-th layer, then the (L-2)-th layer, and so on.
Then we apply all the gradients to update all the weights in all layers and units, and this completes the backward pass.
In short, one round of gradient descent consists of a forward pass and a backward pass. By the end of the backward pass, every weight will have been updated ONCE.
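If it helps to see this concretely, here is a minimal NumPy sketch of one such round for a tiny 2-layer dense network (the layer sizes, data, and MSE cost are made up purely for illustration): the forward pass computes a single cost J for the whole network, the backward pass computes gradients starting from the last layer and working backward, and only then is every weight updated once.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical tiny setup: 4 examples, 3 features, one hidden layer of 5 units.
X = rng.normal(size=(4, 3))
y = rng.normal(size=(4, 1))
W1, b1 = rng.normal(size=(3, 5)), np.zeros(5)
W2, b2 = rng.normal(size=(5, 1)), np.zeros(1)
lr = 0.01

# ---- Forward pass: run through ALL layers with the current weights ----
Z1 = X @ W1 + b1
A1 = np.maximum(Z1, 0)          # ReLU activation
Z2 = A1 @ W2 + b2               # linear output layer
J = np.mean((Z2 - y) ** 2)      # ONE cost J for the whole network

# ---- Backward pass: gradients from the last layer back to the first ----
dZ2 = 2 * (Z2 - y) / len(X)     # dJ/dZ2 for the MSE cost
dW2 = A1.T @ dZ2                # gradients for layer 2 first...
db2 = dZ2.sum(axis=0)
dA1 = dZ2 @ W2.T
dZ1 = dA1 * (Z1 > 0)            # ReLU derivative
dW1 = X.T @ dZ1                 # ...then gradients for layer 1
db1 = dZ1.sum(axis=0)

# ---- Update: every weight in every layer moves once per iteration ----
W2 -= lr * dW2; b2 -= lr * db2
W1 -= lr * dW1; b1 -= lr * db1
```

Note that J is never minimized per layer: one cost is computed for the whole network, and a single update step touches all layers at once.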
Any follow-ups?
Cheers,
Raymond
I see. I haven’t gotten to backward passes in the lectures yet, but I think I understand what you are saying.
Thank you!