How broadcasting makes a difference in W2 E5

Hi all,

When I tried:

cost = - "sum "(“matmul”( Y, “log”(A) ) + “matmul” (( 1 - Y ), “log” ( 1-A ))) / m

I get a ValueError that says:

ValueError: shapes (1,3) and (1,3) not aligned: 3 (dim 1) != 1 (dim 0)

I tried transposing Y to Y.T to make it (3,1) × (1,3), but that gives me the wrong answer.

I don’t understand how “keepdims” and broadcasting correct this issue:
cost = - "sum "(“matmul”( Y, “log”(A) ) + “matmul” (( 1 - Y ), “log” ( 1-A ))) / m, axis = 1, keepdims = True

Don’t do a dot product. It’s element-wise multiplication. Use * or tensorflow.math.multiply.

This is Course 1, so we don’t know about TensorFlow yet, only numpy. You can use a dot product, but you have to get the transpose in the right place. As you saw, transposing the first argument is wrong. Try transposing the second one; then you have 1 × m dotted with m × 1, which gives a 1 × 1 output, right?
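Here is a sketch of that fix, reusing the toy Y, A, and m from the first post (the intermediate names are just for illustration):

log_probs_1 = np.dot(Y, np.log(A).T)          # (1, m) dot (m, 1) -> (1, 1)
log_probs_0 = np.dot(1 - Y, np.log(1 - A).T)  # (1, m) dot (m, 1) -> (1, 1)
cost = -(log_probs_1 + log_probs_0) / m       # still (1, 1), no np.sum needed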

Then you don’t need the sum. That’s why dot product is better here: it does both multiply and sum in one operation, although np.multiply followed by np.sum gives the same answer.
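You can sanity-check that equivalence with the same toy values:

cost_dot = -(np.dot(Y, np.log(A).T) + np.dot(1 - Y, np.log(1 - A).T)) / m
cost_mul = -np.sum(np.multiply(Y, np.log(A)) + np.multiply(1 - Y, np.log(1 - A))) / m
print(np.allclose(cost_dot, cost_mul))        # True: dot = elementwise multiply + sum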

Note that np.matmul is equivalent to np.dot: both are “dot product” style matrix multiply. Prof Ng always uses np.dot for some reason, but they are interchangeable for our purposes here. Whereas np.multiply and * both do “elementwise” multiply.
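A small demo of those two pairs (this holds for 2-D arrays like ours; np.matmul and np.dot can differ for higher-dimensional inputs, but not here):

a = np.array([[1., 2., 3.]])
b = np.array([[4., 5., 6.]])
print(np.allclose(np.matmul(a, b.T), np.dot(a, b.T)))  # True: same matrix multiply
print(np.allclose(a * b, np.multiply(a, b)))           # True: same elementwise multiply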

I understand that the dot product is a sum after multiplication.
It is just very confusing to tell the difference between matrix multiplication and a normal dot product with the np.dot() function.

So this “Y · log(A)” part is meant to produce a scalar, right? I thought we were doing some matmul operation here.

Yes, please read my previous reply again more carefully. I think I explained everything there. You can also read the documentation for numpy matmul and numpy dot.