Hi there,
Adam combines RMSProp and Momentum, both of which maintain exponentially weighted moving averages (one of the gradients, one of the squared gradients). At initialisation, these running averages are usually set to 0, so the early estimates are biased towards zero because the averaging parameters (beta1, beta2) are close to 1. This bias is the reason for the correction, see also this source.
So the purpose of bias correction in exponential filtering is to improve the smoothing of the early values; see also this plot (green line: with bias correction): https://napsterinblue.github.io/notes/stats/techniques/ewma/ewma_39_0.png
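A minimal sketch of this effect (the constant signal of 1.0 and beta = 0.9 are illustrative values, not from any particular source): with the average initialised to zero, the raw estimate starts far below the true value, while the bias-corrected estimate recovers it immediately.

```python
# Exponentially weighted moving average of a constant signal of 1.0,
# with and without bias correction. Values are illustrative.
beta = 0.9
v = 0.0  # running average initialised to zero, as in Adam
for t in range(1, 6):
    g = 1.0  # constant "gradient"
    v = beta * v + (1 - beta) * g
    v_corrected = v / (1 - beta ** t)  # bias correction
    # v starts at 0.1 and creeps towards 1.0; v_corrected is exactly 1.0
    print(t, round(v, 5), round(v_corrected, 5))
```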
In classic Momentum or RMSProp we do not have this bias correction, but I agree with you: depending on how the exponentially weighted moving averages are initialised, it would be possible in general and it might make sense, see also:
# Momentum
v_dW = beta1 * v_dW + (1 - beta1) * dW
v_db = beta1 * v_db + (1 - beta1) * db
v_dW_corrected = v_dW / (1 - beta1 ** t)
v_db_corrected = v_db / (1 - beta1 ** t)
# RMSprop
s_dW = beta2 * s_dW + (1 - beta2) * (dW ** 2)
s_db = beta2 * s_db + (1 - beta2) * (db ** 2)
s_dW_corrected = s_dW / (1 - beta2 ** t)
s_db_corrected = s_db / (1 - beta2 ** t)
# Combine
W = W - alpha * (v_dW_corrected / (sqrt(s_dW_corrected) + epsilon))
b = b - alpha * (v_db_corrected / (sqrt(s_db_corrected) + epsilon))
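To see the combined update run end to end, here is a runnable sketch on a single scalar parameter with a toy loss W ** 2 (so dW = 2 * W); the loss and hyperparameter values are my own illustrative choices, not from the source above:

```python
import math

# Adam-style update on a scalar parameter, minimising the toy loss W ** 2.
# Hyperparameter values are illustrative.
alpha, beta1, beta2, epsilon = 0.1, 0.9, 0.999, 1e-8

W = 1.0
v_dW = 0.0  # momentum average, initialised to zero
s_dW = 0.0  # RMSprop average, initialised to zero

history = [W]
for t in range(1, 201):
    dW = 2.0 * W                                  # gradient of W ** 2
    v_dW = beta1 * v_dW + (1 - beta1) * dW        # momentum update
    s_dW = beta2 * s_dW + (1 - beta2) * dW ** 2   # RMSprop update
    v_corrected = v_dW / (1 - beta1 ** t)         # bias correction
    s_corrected = s_dW / (1 - beta2 ** t)
    W = W - alpha * v_corrected / (math.sqrt(s_corrected) + epsilon)
    history.append(W)

# Thanks to bias correction, the very first step already has
# magnitude ~alpha instead of being scaled down by (1 - beta1).
print(history[1])
print(W)  # after many steps, W has moved towards the minimiser 0
```

Note that without the corrections, the first step would be shrunk by the zero initialisation of v_dW and s_dW; with them, the initial step size is governed by alpha alone.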
Source: Momentum, RMSprop, and Adam Optimization for Gradient Descent