Course2_week2_assignment

Lilit.Ghalachyan · May 18, 2021, 7:41am

In model function in the end we count average of cost, but I don’t understand why? Previously we used only cost, why in this case do we use average of it?

paulinpaloalto · June 28, 2021, 7:04pm

The cost is always the average of the loss values across all the samples in the batch (training set). The reason that the way it is computed looks a little different here is that we are doing “minibatch” gradient descent, which means we are splitting up the training set into minibatches and then need to add up the costs for all the minibatches and then divide by the total number of samples to get the usual meaning of the J value (loss averaged across all the samples).

You can check the compute_cost implementation in opt_utils_v1a.py to see that they are not doing the average there, just the sum over the minibatch.

Topic		Replies	Views
Course 2 Week 3: compute cost solution is wrong? Improving Deep Neural Networks: Hyperparameter tun	2	518	November 3, 2022
Why take cost average in Gradient Descent? Improving Deep Neural Networks: Hyperparameter tun	2	532	April 28, 2022
Gradient descent cost aggregation Improving Deep Neural Networks: Hyperparameter tun	2	524	December 14, 2021
C2 W2 / Epoch cost / Exercise 6 Improving Deep Neural Networks: Hyperparameter tun	1	503	March 5, 2022
Confused about Mini-Batch Gradient Descent Improving Deep Neural Networks: Hyperparameter tun	3	556	May 9, 2022

Course2_week2_assignment

Related topics