What they say is correct if you think carefully about what is being said. We only divide by the total number of samples at the end of one full pass of training (all the minibatches). But the function we are writing here computes the cost for one minibatch, so we only take the sum. The higher-level logic keeps a running sum across all the minibatches and then computes the average once the pass is finished. You can’t compute the average at the minibatch level, because the math doesn’t work out unless all the minibatches are the same size, and they won’t be if the minibatch size does not evenly divide the total number of training examples. The average of the per-minibatch averages weights each minibatch equally instead of each sample equally, so it does not equal the overall average.
If you were paying close attention, you’ll recognize that this is exactly how it worked when we first implemented minibatch gradient descent in the previous assignment (C2 W2 A1 Optimization). It’s the same idea here, but now we’re doing it in TF instead of straight numpy.
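Here’s a small sketch (not the assignment code) that shows the difference numerically. The per-example losses and the split into minibatches of sizes 4, 4, and 2 are made up for illustration; the point is that summing per minibatch and dividing by the total sample count at the end gives the true mean, while averaging the per-minibatch averages does not when the sizes differ.

```python
import tensorflow as tf

# Toy per-example losses: 10 examples split into minibatches of size 4, 4, and 2
# (the last minibatch is smaller because 4 does not evenly divide 10).
losses = tf.constant([1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 9.0, 10.0])
minibatches = [losses[0:4], losses[4:8], losses[8:10]]
m = 10  # total number of training examples in the full pass

# What the training loop does: accumulate the per-minibatch SUMS,
# then divide by the total number of examples once, at the end of the pass.
running_sum = tf.add_n([tf.reduce_sum(mb) for mb in minibatches])
epoch_cost = running_sum / m                      # 55 / 10 = 5.5  (the true mean)

# What you must NOT do: average the per-minibatch averages.
avg_of_avgs = tf.reduce_mean(
    tf.stack([tf.reduce_mean(mb) for mb in minibatches])
)                                                 # (2.5 + 6.5 + 9.5) / 3 ≈ 6.167

print(epoch_cost.numpy(), avg_of_avgs.numpy())    # 5.5 vs. 6.1666...
```

That mismatch is exactly why the minibatch-level function only returns a sum: the single division by the total number of samples happens once, in the outer loop, after the whole pass.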