Gradient Descent [Logistic Regression]

I want to understand what happens when we train on m training examples. Does forward propagation followed by backward propagation happen only once, or does this cycle keep repeating until we get the desired w and b for one particular training example, after which we move on to the second training example and do the same?

Also, does one epoch mean training on all m training examples once?

Forward propagation followed by backward propagation happens once per iteration, and each iteration processes all of the examples together (not one example at a time).

Yes…

Can you explain in detail?

This was all explained in the lectures, but here’s my summary:

Here in Course 1, we do “full batch” gradient descent. That means we do a number of iterations of the following process (sketched in code right after this list):

  1. Compute forward propagation on all training samples with the current weights. This is done in a vectorized way for efficiency.
  2. Do backward propagation on all samples to compute the gradients, which are averaged over all the samples.
  3. Apply the computed gradients to update the weights.
  4. Go to 1) again and repeat for the full number of iterations.
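
To make that loop concrete, here is a minimal NumPy sketch of full-batch gradient descent for logistic regression. It assumes the Course 1 conventions that X has shape (n_x, m) and Y has shape (1, m); names such as `num_iterations` and `learning_rate` are illustrative defaults, not anything specified in this thread.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train(X, Y, num_iterations=1000, learning_rate=0.01):
    n_x, m = X.shape
    w = np.zeros((n_x, 1))
    b = 0.0
    for i in range(num_iterations):
        # 1) forward propagation on all m samples at once (vectorized)
        A = sigmoid(np.dot(w.T, X) + b)      # shape (1, m)
        # 2) backward propagation: gradients averaged over all m samples
        dZ = A - Y
        dw = np.dot(X, dZ.T) / m             # shape (n_x, 1)
        db = np.sum(dZ) / m
        # 3) update the parameters with the averaged gradients
        w = w - learning_rate * dw
        b = b - learning_rate * db
        # 4) loop back to step 1 for the next iteration
    return w, b
```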

Steps 1) to 3) are called one “epoch” of training. Later in Course 2 we will learn a more sophisticated technique called “minibatch gradient descent” where we break up the full m training samples into “minibatches” and iterate through those in each “epoch”.
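
As a rough sketch only (not the actual Course 2 implementation), minibatch gradient descent reorganizes the same work: one epoch still visits all m samples, but the parameters are updated once per minibatch rather than once per full pass. `batch_size` here is an arbitrary illustrative choice.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def run_one_epoch(X, Y, w, b, learning_rate=0.01, batch_size=64):
    m = X.shape[1]
    permutation = np.random.permutation(m)    # shuffle the samples each epoch
    X, Y = X[:, permutation], Y[:, permutation]
    for start in range(0, m, batch_size):
        Xb = X[:, start:start + batch_size]   # one minibatch of samples
        Yb = Y[:, start:start + batch_size]
        mb = Xb.shape[1]
        A = sigmoid(np.dot(w.T, Xb) + b)      # forward prop on the minibatch
        dZ = A - Yb                           # backward prop on the minibatch
        w = w - learning_rate * np.dot(Xb, dZ.T) / mb
        b = b - learning_rate * np.sum(dZ) / mb   # update after each minibatch
    return w, b
```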

If this still doesn’t make sense, my suggestion would be to watch the lectures again with the above in mind. Prof Ng covers everything I said above in the lectures, other than the “minibatch” issue, which he discusses in Course 2.
