Regularization Programming Assignment / Cost function

Nicolas_Hirel · August 26, 2022, 11:47am

The cost function in the Regularization programming assignment (in program : reg_utils.py) uses the following formula:
logprobs = np.multiply(-np.log(a3),Y) + np.multiply(-np.log(1 - a3), 1 - Y)
cost = 1./m * np.nansum(logprobs)

The cost function we’ve used in previous assignements calculated cost this way:
cost = (1./m) * (-np.dot(Y,np.log(AL).T) - np.dot(1-Y, np.log(1-AL).T))

Why is there a difference?

In fact both formula produce different cost results.

For example in the Football assignment part 4. ( non regularized model), the cost after 20,000 iterations, using the first formula gives a cost=0.13851642423234922.
whe using the second formula, the cost is = 29.226965514041307

The strange thing is that both formula eventually produce the same Test set and Train set accuracy. So the actual outcome seems to be the same, except for the cost value.

But I’m really wondering why the cost formula is different.

anybody could help on this?

PS: I’m aware that the cost function is not the central topic of assignment 2 (“football”) and that it is calculated not in the notebook but in another file, but i’m still very curious… many thanks.

Elemento · August 26, 2022, 1:38pm

Hey @Nicolas_Hirel,
I don’t think the formulae are any different. In fact, I tried out both the formulae, and they gave the exact same results. I included the below function definition, just above the definition of the model function in the notebook. You can return either cost1 or cost2, and you will find that in both the cases, you will get the exact same result, even the loss values.

def compute_cost(a3, Y):
    """
    Implement the cost function
    
    Arguments:
    a3 -- post-activation, output of forward propagation
    Y -- "true" labels vector, same shape as a3
    
    Returns:
    cost - value of the cost function
    """
    m = Y.shape[1]
    
    ### Used in `reg_utils.py`
    logprobs = np.multiply(-np.log(a3),Y) + np.multiply(-np.log(1 - a3), 1 - Y)
    cost1 = 1./m * np.nansum(logprobs)
    
    ### Used in previous assignments
    cost2 = np.ravel((1./m) * (-np.dot(Y,np.log(a3).T) - np.dot(1-Y, np.log(1-a3).T)))
    
    return cost2

Let me know if this helps.

Cheers,
Elemento

Nicolas_Hirel · August 26, 2022, 2:22pm

Elemento

You’re absolutely right and I was wrong. Although the formulas are different, they indeed generate the same result (cost) as mentionned in your post. The result for the difference was in fact a problem in my code (formula) which i was able to fix thanks to yours.

Again, thank you so much for your precious help! (second time in 1 week!!).

Elemento · August 26, 2022, 2:26pm

Hey @Nicolas_Hirel,
I am glad I could help

Cheers,
Elemento

Topic		Replies	Views
Course 1 Week 3 program mini assignment Neural Networks and Deep Learning coursera-platform	8	588	June 12, 2021
Week 3 - Exercise 5 - compute_cost Neural Networks and Deep Learning coursera-platform	12	1281	September 17, 2023
C1_W3_Logistic_Regression_Potential problem Supervised ML: Regression and Classification week-module-3	3	674	June 24, 2022
Stuck on C1_W3_Logistic_Regression assigment: expected value a little different from mine Supervised ML: Regression and Classification week-module-3	6	641	August 28, 2022
Regularization Programming assignment Improving Deep Neural Networks: Hyperparameter tun coursera-platform	1	508	November 4, 2022

Regularization Programming Assignment / Cost function

Related topics