I am able to get db2 and dW2 correct. dW1 and db1 are incorrect, perhaps due to incorrect value of dZ1. I used (1 - np.power(A1, 2)) as guided. However, I am unable to get correct value. Please guide.
Are you sure you implement the full formula of dZ1? You just mentioned the second part but there is also W2 and dZ2, right?
Yes, I implemented. I used np.dot to multiply W2 and dZ2.
Send me your code in a private message. Click my name and message.
Check the formula of dZ1 again. W2 or dW2?
Thanks for the help. I really got confused. The issue is solved.
@saifkhanengr , Excuse me, I have a similar problem.
I computed dZ1 this way: dZ1 = np.dot(np.dot(W2.T, dZ2), (1 - np.dot(A1.T, A1)))
Cause using (1 - np.power(A1, 2)) gives an error of shapes: â(4,3) (4,3) not alignedâ
Despite I wrote the suggested formula it gives me âAssertionError: Wrong values for dW1ââŚHow can I resolve?
This is incorrect. The * operator implies the element-wise multiplication. So, using dot
is incorrect. Also, np.power
should not give any error once you use * operator.
It works. If I can ask you, how many years are needed to learn all these kind of âpython secretsâ? (Iâm new to this programming language.)
Itâs more a matter of understanding how matrix algebra works, not python specifically.
When the equation says to compute the sum of the product of two vectors, you can use a dot product, or you can use sum() and an element-wise multiplication. Theyâll give exactly the same results.
But when the implementation involves matrices instead of vectors, then a matrix dot product will produce results that involve all of the products of the off-axis elements of both matrices. This may or may not be what the equation is trying to calculate, depending on the exact situation.
The essential difference is that if the end result is a scalar (such as the cost value), then a dot product may not be the correct choice.
Note that you already need to know python before you start here. This course is not structured as an âintro to pythonâ.
As a footnote to Tomâs points about this being a matter of understanding the linear algebra, itâs also helpful to be aware of the notational conventions that Prof Ng uses here when he is writing formulas involving multiplication of vectors and matrices. Hereâs a thread which mostly restates what Tom said, but also gives some explanation of the notation that is worth a look.