Regarding mini-batch gradient descent, here is my understanding of how the exponentially weighted average is involved when beta = 0.9. Is it correct, sir?
On the first epoch, during the 15th mini-batch iteration, we are basically averaging the gradients over the last 10 iterations, from the 5th to the 15th mini-batch, and then updating the weights? Am I correct, sir?
Hi @Anbu,
Apologies for the delayed response. The statement that an exponentially weighted moving average takes the last 1 / (1 - beta) observations into account is an approximation. So, based on that approximation, your understanding is correct. Also, I am assuming that when you refer to the average from the 5th to the 15th iteration, you mean the exponentially weighted moving average, not a simple average.
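To make the approximation concrete, here is a minimal sketch (my own illustration, not course code) of the weight an EWMA with beta = 0.9 assigns to each past gradient; the iteration numbers are just the ones from your example:

```python
beta = 0.9
T = 15  # current mini-batch iteration, as in your example

# Weight that the EWMA at iteration T assigns to the gradient from iteration t:
# (1 - beta) * beta ** (T - t)
for t in range(T, 0, -1):
    weight = (1 - beta) * beta ** (T - t)
    print(f"gradient from iteration {t:2d} carries weight {weight:.4f}")

# 0.9 ** 10 ≈ 0.35 ≈ 1/e, the usual cutoff behind the
# "last 1 / (1 - beta) observations" rule of thumb.
```

Since beta ** 10 has decayed to roughly 1/e, gradients older than about 10 iterations contribute very little, which is what justifies treating it as an average over the last 10 observations.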
Also, I would like to add that, as you might have seen in the lecture videos, applying Exponentially Weighted Moving Averages in mini-batch gradient descent is exactly what we learnt as Gradient Descent with Momentum. I hope this helps.
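In case it helps to see that connection in code, here is a minimal sketch of gradient descent with momentum on a single parameter, assuming a toy quadratic loss L(w) = (w - 3)^2 chosen purely for illustration:

```python
# Toy example: minimize L(w) = (w - 3) ** 2 with momentum.
beta = 0.9           # EWMA / momentum coefficient
learning_rate = 0.1
w = 0.0              # parameter being optimized
v = 0.0              # "velocity": EWMA of past gradients

for t in range(200):
    grad = 2 * (w - 3)                  # gradient of the toy loss
    v = beta * v + (1 - beta) * grad    # exponentially weighted moving average
    w = w - learning_rate * v           # update with the smoothed gradient

print(f"w after 200 steps: {w:.4f}")    # converges toward the minimum at w = 3
```

The velocity v here is exactly the exponentially weighted moving average of the gradients, which is why the two topics are the same idea.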