Hello,

In the Course 2 Week 2 assignment, Optimization Methods, the code for computing the cost is “cost += compute_cost(a, Y)” for both Batch Gradient Descent and Stochastic Gradient Descent. I don’t understand why it is cost = cost + compute_cost(a, Y). Is the code incorrect?

Thanks for your answer.

For batch gradient descent, there is no need to accumulate the cost as the pseudocode does.

For batch gradient descent, we use all our examples each time, so one iteration is one epoch. We could divide the cost by m to get an average cost per training example.
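A minimal sketch of this, using an assumed toy logistic-regression setup rather than the assignment's actual helper functions: all m examples are used in one vectorized forward pass, so the cost is computed once per iteration (no accumulation), and dividing by m gives the average cost per example.

```python
import numpy as np

# Toy data (assumed for illustration only).
np.random.seed(0)
m = 8                                   # number of training examples
X = np.random.randn(2, m)               # features, shape (n_x, m)
Y = (np.random.rand(1, m) > 0.5) * 1.0  # labels, shape (1, m)
w = np.zeros((2, 1))
b = 0.0
lr = 0.1
costs = []

for epoch in range(100):
    # Forward pass over ALL m examples at once: one iteration == one epoch.
    a = 1 / (1 + np.exp(-(w.T @ X + b)))
    # Total cross-entropy cost, divided by m for the average cost per example.
    cost = -np.sum(Y * np.log(a) + (1 - Y) * np.log(1 - a)) / m
    costs.append(cost)
    # Backward pass and parameter update.
    dz = a - Y
    w -= lr * (X @ dz.T) / m
    b -= lr * np.sum(dz) / m
```

The key point is that `cost` is assigned fresh each epoch; there is no `cost +=` because nothing is being summed across iterations.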

For stochastic gradient descent, we use one training example, so to traverse all our examples, we need m iterations. It might make sense to accumulate costs per example to calculate an average cost per example for one epoch, i.e., division by m in the outer loop.
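A sketch of that version, again with an assumed toy logistic-regression setup rather than the assignment's helpers: the inner loop visits one example at a time, the per-example costs are accumulated across the epoch, and the division by m happens once in the outer loop.

```python
import numpy as np

# Toy data (assumed for illustration only).
np.random.seed(0)
m = 8                                   # number of training examples
X = np.random.randn(2, m)               # features, shape (n_x, m)
Y = (np.random.rand(1, m) > 0.5) * 1.0  # labels, shape (1, m)
w = np.zeros((2, 1))
b = 0.0
lr = 0.1
epoch_costs = []

for epoch in range(100):
    cost_total = 0.0
    for j in range(m):                  # one example per inner iteration
        x_j = X[:, j:j + 1]
        y_j = Y[:, j:j + 1]
        a = 1 / (1 + np.exp(-(w.T @ x_j + b)))
        # Accumulate the per-example cost across the epoch ...
        cost_total += (-(y_j * np.log(a) + (1 - y_j) * np.log(1 - a))).item()
        # Update parameters from this single example.
        dz = a - y_j
        w -= lr * (x_j @ dz.T)
        b -= lr * dz.item()
    # ... then divide by m in the outer loop: average cost per example.
    epoch_costs.append(cost_total / m)
```

So the `cost += ...` line only makes sense in this stochastic setting, where m inner iterations together make up one epoch and the running total is averaged afterwards.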


Thank you for your answer!
