Adam Optimization

Isn’t the Adam optimization algorithm just a modified version of gradient descent with dynamic learning-rate adjustment? Or am I missing something here?

Adam uses momentum and an adaptive learning rate.

Hello Dhawal,

Adam is gradient descent-based. As for whether it adjusts the “learning rate”, that really depends on how you define the learning rate in this context. In plain gradient descent, the weight update is a learning rate \alpha times the gradient. Adam also has an \alpha, and it stays fixed, but what gets multiplied by \alpha is no longer the raw gradient: it is a function of running estimates of the gradient’s first and second moments (the “momentums”). So the effective step size adapts per parameter even though \alpha itself never changes. You can check out the paper (Kingma & Ba, “Adam: A Method for Stochastic Optimization”) for the complete algorithm and see exactly what’s happening there.
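To make the distinction concrete, here is a minimal NumPy sketch of the standard Adam update from the paper. Note that `alpha` never changes; the per-parameter step is shaped by the bias-corrected moment estimates `m_hat` and `v_hat` (the function name `adam_step` and the toy quadratic example are my own choices, not from the thread):

```python
import numpy as np

def adam_step(theta, grad, m, v, t,
              alpha=0.001, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update. theta: parameters, grad: gradient at theta,
    m, v: running first/second moment estimates, t: 1-based step count."""
    m = beta1 * m + (1 - beta1) * grad        # first moment (momentum)
    v = beta2 * v + (1 - beta2) * grad ** 2   # second moment (uncentered variance)
    m_hat = m / (1 - beta1 ** t)              # bias correction for zero init
    v_hat = v / (1 - beta2 ** t)
    # alpha is fixed; the adaptive part is m_hat / sqrt(v_hat)
    theta = theta - alpha * m_hat / (np.sqrt(v_hat) + eps)
    return theta, m, v

# toy usage: minimize f(x) = x^2 starting from x = 5
theta = np.array([5.0])
m = np.zeros_like(theta)
v = np.zeros_like(theta)
for t in range(1, 2001):
    grad = 2 * theta                          # gradient of x^2
    theta, m, v = adam_step(theta, grad, m, v, t, alpha=0.1)
```

Contrast this with vanilla gradient descent, where the update would simply be `theta = theta - alpha * grad`; there the only knob is `alpha`, while in Adam the ratio `m_hat / sqrt(v_hat)` rescales each coordinate on every step.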