Hi, I had a doubt: if SGD is not that efficient, why do we see it (I have seen it) in every machine learning curriculum? Is there some specialized use?
Hi there! I think it's important to discuss stochastic gradient descent so that students understand the optimization process and the pros/cons of each variant, i.e. batch, mini-batch, and stochastic. In my own experiments I've had to resort to stochastic gradient descent because I would run out of memory very fast if I used more than one sample.
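To make the three variants concrete, here is a minimal sketch (not from this thread) of least-squares regression fit with gradient descent, where a single `batch_size` parameter selects the variant: 1 gives stochastic, the full dataset size gives batch, and anything in between gives mini-batch. The function name and parameters are illustrative, not from any library:

```python
import numpy as np

def fit(X, y, batch_size=1, lr=0.01, epochs=50, seed=0):
    """Gradient descent on mean squared error.

    batch_size=1      -> stochastic gradient descent
    batch_size=len(X) -> (full) batch gradient descent
    otherwise         -> mini-batch gradient descent
    Only one batch is materialized per update, which is why small
    batch sizes help when memory is tight.
    """
    rng = np.random.default_rng(seed)
    w = np.zeros(X.shape[1])
    n = len(X)
    for _ in range(epochs):
        idx = rng.permutation(n)          # reshuffle each epoch
        for start in range(0, n, batch_size):
            batch = idx[start:start + batch_size]
            Xb, yb = X[batch], y[batch]
            grad = 2 * Xb.T @ (Xb @ w - yb) / len(Xb)  # MSE gradient
            w -= lr * grad
    return w

# Toy data: y = 3*x0 - 2*x1, no noise
rng = np.random.default_rng(42)
X = rng.normal(size=(200, 2))
y = X @ np.array([3.0, -2.0])

w_sgd = fit(X, y, batch_size=1)    # one sample per update
w_mini = fit(X, y, batch_size=32)  # mini-batch
print(w_sgd, w_mini)
```

Both variants recover weights close to `[3, -2]` here; the trade-off is that SGD does many cheap, noisy updates while larger batches do fewer, smoother ones.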