RMSProp formula clarification

yuyang187 · September 25, 2024, 11:02pm

For RMSProp, the lecture notes say:
W = W - learning_rate * dW / sqrt(SdW + epsilon)
The lab says:
W = W - learning_rate * dW / (sqrt(SdW) + epsilon)

Are both acceptable to use in practice?

TMosh · September 25, 2024, 11:12pm

The goal of adding epsilon is to avoid a division-by-zero math error.
The lecture notes are incorrect.
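
To make the two placements of epsilon concrete, here is a minimal NumPy sketch of a single RMSProp step. It is an illustration only, not the lab's code. The function name `rmsprop_step`, the `eps_inside_sqrt` flag, the default hyperparameter values, and the moving-average update of SdW with `beta` are all assumptions added here; the thread itself only quotes the W update.

```python
import numpy as np

def rmsprop_step(W, dW, SdW, learning_rate=0.001, beta=0.9,
                 epsilon=1e-8, eps_inside_sqrt=False):
    """One RMSProp update (hypothetical helper, not from the lab)."""
    # Assumed: exponentially weighted average of the squared gradient.
    SdW = beta * SdW + (1 - beta) * dW ** 2

    if eps_inside_sqrt:
        # Form quoted from the lecture notes: sqrt(SdW + epsilon)
        denom = np.sqrt(SdW + epsilon)
    else:
        # Form quoted from the lab: sqrt(SdW) + epsilon
        denom = np.sqrt(SdW) + epsilon

    W = W - learning_rate * dW / denom
    return W, SdW

# The first gradient component is zero, so SdW stays at zero there.
W = np.array([1.0, 2.0])
dW = np.array([0.0, 0.5])
SdW = np.zeros_like(W)

W_lab, S_lab = rmsprop_step(W, dW, SdW, eps_inside_sqrt=False)
W_notes, S_notes = rmsprop_step(W, dW, SdW, eps_inside_sqrt=True)
print(W_lab, W_notes)
```

When SdW is exactly zero, the denominator is epsilon in the lab's form and sqrt(epsilon) in the form from the notes, so neither divides by zero in this sketch.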

yuyang187 · September 26, 2024, 4:39am

Thanks for the clarification.
