From Course 2, Week 2, Why can’t decay rate be a function of mini-batch size and beta?
The decay rates shared in the lectures are widely used ones. If you find a heurestic that works well, go for it. Remember that learning rate should go down as training progresses.
1 Like