The Adam bias correction can be interpreted as a modification of the learning rate; see also this paper, where an alternative is proposed and discussed:
The originally stated goal of the bias-correction factor was at least partially to reduce the initial learning rate in early steps, before the moving averages had been well initialized (Kingma and Ba, 2017; Mann, 2019).
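To make the "learning-rate modification" view concrete, here is a minimal NumPy sketch of my own (not taken from the linked paper or thread). It shows the standard reformulation from Kingma and Ba: dividing the moment estimates by the bias-correction factors is algebraically the same as scaling the step size by sqrt(1 - beta2^t) / (1 - beta1^t) at step t (exactly so when eps = 0).

```python
import numpy as np

def adam_step_bias_corrected(theta, g, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999, eps=0.0):
    # Textbook Adam step with explicit bias correction of both moments.
    m = beta1 * m + (1 - beta1) * g
    v = beta2 * v + (1 - beta2) * g**2
    m_hat = m / (1 - beta1**t)          # bias-corrected first moment
    v_hat = v / (1 - beta2**t)          # bias-corrected second moment
    theta = theta - lr * m_hat / (np.sqrt(v_hat) + eps)
    return theta, m, v

def adam_step_rescaled_lr(theta, g, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999, eps=0.0):
    # Same update, but the correction factors are folded into a per-step learning rate.
    m = beta1 * m + (1 - beta1) * g
    v = beta2 * v + (1 - beta2) * g**2
    lr_t = lr * np.sqrt(1 - beta2**t) / (1 - beta1**t)   # shrinks the effective step early on
    theta = theta - lr_t * m / (np.sqrt(v) + eps)
    return theta, m, v

# Quick check that both forms produce the same first update (eps = 0 for exact equality).
rng = np.random.default_rng(0)
theta = rng.normal(size=5)
m = np.zeros(5)
v = np.zeros(5)
g = rng.normal(size=5)
a, _, _ = adam_step_bias_corrected(theta.copy(), g, m.copy(), v.copy(), t=1)
b, _, _ = adam_step_rescaled_lr(theta.copy(), g, m.copy(), v.copy(), t=1)
print(np.allclose(a, b))   # True
```

Seen this way, the correction acts like a built-in warmup: lr_t starts small while the moving averages are still dominated by their zero initialization and approaches the nominal lr as t grows.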
This thread is also relevant to your question; feel free to take a look: Why not always use Adam optimizer - #4 by Christian_Simonis
Best
Christian