Hello @Tommy_Lee,
Cheers,
Raymond
PS: Do you see that we effectively have two learning rates composed of two hyperparameters, alpha and beta? They are just two different ways of composing these two learning rates.