Course 1 Week 2 Logistic Regression Cost function

Barb · September 28, 2021, 2:19am

Hey everyone!

Something I can’t wrap my head around:

In the comments when coding the cost function, it is said: compute cost using np.dot

cost = -(1/m)*np.sum(np.dot(Y,np.log(A))+np.dot((1-Y),np.log(1-A)))

However, when I tried it, the dot product failed because both A and Y have shape (1, 3).
Then I used element-wise multiplication instead, and that passed all the tests.

cost = -(1/m)*np.sum(Y*np.log(A)+(1-Y)*np.log(1-A))

Can someone give me a hint?

Thanks!

paulinpaloalto · September 28, 2021, 2:57am

You need to understand how dot product multiplication works. When both arguments are row vectors of the same shape, the second one has to be transposed so that the inner dimensions match for a “dot product”. Here’s a thread that shows examples.
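As a minimal sketch of the shape issue (the values of `Y` and `A` below are made up for illustration and are not from the assignment), this is what happens with and without the transpose:

```python
import numpy as np

# Hypothetical values, chosen only to reproduce the (1, 3) shapes from the question.
Y = np.array([[1.0, 0.0, 1.0]])   # labels, shape (1, 3)
A = np.array([[0.8, 0.1, 0.9]])   # activations, shape (1, 3)

try:
    # (1, 3) dot (1, 3): the inner dimensions (3 and 1) do not match.
    np.dot(Y, np.log(A))
    mismatch = False
except ValueError:
    mismatch = True

# (1, 3) dot (3, 1): the inner dimensions match, and the result has shape (1, 1).
result = np.dot(Y, np.log(A).T)
print(mismatch, result.shape)
```

The transpose does not change which numbers get multiplied together; it only lines them up so that each label is paired with its own activation.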

Barb · September 28, 2021, 3:08am

Yes, I understand how the dimensions have to match for the dot product to be mathematically doable.

What I don’t understand is the vectorization of the J formula:

Therefore, I guess I should be using the dot product in my sigmoid. But then why is it allowed to transpose the second matrix in the dot product, other than because it doesn’t work otherwise?

paulinpaloalto · September 28, 2021, 3:25am

The point is that what you show is a mathematical formula. So what does that formula mean in terms of what actually happens? It is the sum of the products of the corresponding elements of two vectors, right? Well, what is the dot product of two vectors?
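To make that concrete, and assuming the formula in question is the usual logistic regression cost in the course's notation, with $Y$ and $A$ as $1 \times m$ row vectors:

$$J = -\frac{1}{m}\sum_{i=1}^{m}\left[\, y^{(i)}\log a^{(i)} + \left(1-y^{(i)}\right)\log\left(1-a^{(i)}\right) \right]$$

Each half of that sum is a sum of products of corresponding elements, which is exactly what a row vector times a column vector computes:

$$\sum_{i=1}^{m} y^{(i)}\log a^{(i)} = Y\,(\log A)^{T}, \qquad \sum_{i=1}^{m} \left(1-y^{(i)}\right)\log\left(1-a^{(i)}\right) = (1-Y)\,\left(\log(1-A)\right)^{T}$$

So the transpose is not an extra mathematical step. It is just how a sum over $i$ is expressed as a matrix product when both vectors are stored as rows.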

Of course, there is usually more than one correct way to translate a mathematical formula into linear algebra operations and then into Python code. Your implementation with elementwise multiply, followed by a sum, is perfectly correct. It’s just that it’s less efficient because it is two separate vector operations. The dot product can do both operations (multiply and sum) in one vector operation.
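Here is a small sketch comparing the two versions. The sample values are invented for illustration, and the variable names `cost_elementwise` and `cost_dot` are mine, not from the assignment:

```python
import numpy as np

# Hypothetical labels and activations, both of shape (1, m) with m = 3.
Y = np.array([[1.0, 0.0, 1.0]])
A = np.array([[0.8, 0.1, 0.9]])
m = Y.shape[1]

# Version 1: elementwise multiply, then an explicit sum.
cost_elementwise = -(1 / m) * np.sum(Y * np.log(A) + (1 - Y) * np.log(1 - A))

# Version 2: the dot product multiplies and sums in one step.
# Transposing the second argument gives (1, m) dot (m, 1) -> (1, 1).
cost_dot = -(1 / m) * (np.dot(Y, np.log(A).T) + np.dot(1 - Y, np.log(1 - A).T))
cost_dot = float(np.squeeze(cost_dot))   # collapse the (1, 1) array to a scalar

print(np.isclose(cost_elementwise, cost_dot))
```

Note that with the dot product version no `np.sum` is needed, but the result comes back as a (1, 1) array, so it has to be squeezed down to a scalar.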

Barb · September 28, 2021, 6:44am

@paulinpaloalto thanks for the clarification.

I also realize now that element-wise multiplication doesn’t automatically include the sum.
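A quick check of that, as a sketch with made-up values: the element-wise product keeps the (1, 3) shape, and only `np.sum` reduces it to a single number.

```python
import numpy as np

# Hypothetical values with the same (1, 3) shape as in the question.
Y = np.array([[1.0, 0.0, 1.0]])
A = np.array([[0.8, 0.1, 0.9]])

products = Y * np.log(A)      # element-wise: still one entry per example
total = np.sum(products)      # the sum is a separate step

print(products.shape, np.ndim(total))
```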

Thanks again !
