W4_A 2_Ex-1_two_layer_model

Pilar · October 25, 2022, 7:56pm

I’m strugling trying to understand the cost function used in this exercise. We can use the the compute_cost function given in the assignment and this of course works very well. However I tried to use my own calculated function based on the previous assignment weeks. I’m not sure if I can put here the codes for both versions of the cost function, but I checked the one dnn_app_utils_v3.py code, and it looks very different to the one we could intuitively derive from the equation:

Could someone please explain why the compute_cost function given in the assignment ( i.e in dnn_app_utils_v3.py) doesn’t have a summation and the log of A2 is transposed? Thank you!

AbdElRhaman_Fakhry · October 25, 2022, 9:09pm

Hi

if I understand your question correctly …when he use A2.T and doesn’t have a summation. he use Matrix multiplication as when use matrix multiply (Vectorization expression) it multiply every element as element wise and it calculate the sum of the result

I hope I answer you question,
please feel free to ask any question,
Thanks,
Abdelrahman

paulinpaloalto · October 25, 2022, 9:13pm

There are (at least) two ways to compute each of the two terms of the cost:

You can use elementwise multiply, followed by sum.
You can do both the multiply and the sum in a single operation by using a dot product (np.dot). But to get the dimensions to work for a dot product, you need to first transpose the second argument.

If you don’t understand how matrix multiply works (dot product style), you should go take a basic linear algebra class first. Understanding the basic operations of linear algebra is a prerequisite for this course.

Here’s another thread that talks more about what dot product means.

Pilar · October 27, 2022, 8:22pm

Thank you both for your answers. I understand both cases but I posted the question because there must be something I’m missing since I decided to compute the cost with the summation and elementwise multiplication instead but it never passes the fourth test. Let me know if I should delete the code, but I have tried this:

{moderator edit - solution code removed}

What is it that I’m doing wrong here?

Thank you

paulinpaloalto · October 27, 2022, 8:47pm

That code looks correct to me. Are you sure you did “Shift-Enter” on the actual code cell after you made your most recent change? Just calling the function again without doing that runs the old code.

The other possibility is that the AL or m values are not correct for some reason.

If it is still failing, please show us the output of the tests that you are seeing.

Pilar · October 27, 2022, 8:58pm

I’m sure I have run the correct cells in this case, but I keep getting the same error (see below). When I run the code with the “default” function (the one in dnn_app_utils_v3.py), then it passes all the tests:

paulinpaloalto · October 27, 2022, 11:51pm

Ah, ok, the problem is that in this exercise, you should not be computing the cost directly yourself: you should simply be calling the function compute_cost. It’s also a mistake to “hand copy” over your version of that function from the Step by Step exercise: they provide you with a version of that function, so you should just call it. It turns out that in Step by Step, they would let you return the cost from compute_cost either as a scalar float or as a 1 x 1 array, but here they force it to be a 1 x 1 array for some reason.

Pilar · October 28, 2022, 5:57pm

Thank you. I now know then that the cost I was calculating was correct

Topic		Replies	Views
Course 1 Week 2 Logistic Regression Cost function Neural Networks and Deep Learning	4	717	September 28, 2021
W2_A2_Gradient Descent Neural Networks and Deep Learning	9	596	November 6, 2022
Week 2 Programming Assignment Exercise 5 - cost function Neural Networks and Deep Learning	3	657	May 24, 2023
I'm new, need help in exercise 5 of week 2 Neural Networks and Deep Learning	2	608	August 26, 2021
Week 2, Exercise 5, computing cost using np.dot Neural Networks and Deep Learning	29	2474	May 22, 2024

W4_A 2_Ex-1_two_layer_model

Related topics