Curious question for dropout

John11 · November 28, 2022, 1:16am

Might be a dumb question, but I am just curious.

So, we know that when implementing dropout we use direct multiplication such as A1*D1 (for example).

But then we have np.dot(), so why are we not using np.dot in this situation?

I am using Python for the first time for this course. Are there any specifications regarding this?

rmwkwok · November 28, 2022, 1:26am

A1 and D1 are supposed to have the same shape, so A1 * D1 is element-wise multiplication, whereas np.dot(A1, D1) is matrix multiplication when they are 2-D. These are two different operations, and only one of them is what we need. We want element-wise multiplication because our objective is to zero out the dropped units.

D1 acts as a mask. It has the same shape as A1 because D1 tells you which elements of A1 to zero out, so the elements of D1 and the elements of A1 correspond one-to-one. That’s why you want element-wise multiplication.
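To make the difference concrete, here is a minimal sketch. The small matrices are made-up values chosen only for illustration; A1 and D1 are the names from the question, while `A_wide` and `D_wide` are hypothetical names for a non-square example.

```python
import numpy as np

# Toy activation matrix and 0/1 mask of the same shape (made-up values).
A1 = np.array([[1.0, 2.0],
               [3.0, 4.0]])
D1 = np.array([[1, 0],
               [1, 1]])

# Element-wise: each entry of A1 is multiplied by the matching entry of D1,
# so exactly the masked position (row 0, column 1) becomes zero.
elementwise = A1 * D1        # [[1., 0.], [3., 4.]]

# Matrix product: rows of A1 are combined with columns of D1.
# Nothing is zeroed out; the values get mixed together instead.
matrix_product = np.dot(A1, D1)   # [[3., 2.], [7., 4.]]

# For non-square matrices of the same shape, np.dot does not even run,
# because the inner dimensions (4 and 3) do not match.
A_wide = np.ones((3, 4))
D_wide = np.ones((3, 4))
try:
    np.dot(A_wide, D_wide)
    dot_raised = False
except ValueError:
    dot_raised = True
```

Only the element-wise result keeps the shape of A1 and zeroes the positions that the mask marks.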

Cheers,
Raymond

paulinpaloalto · November 28, 2022, 1:30am

np.dot and elementwise multiplication are completely different operations. You have to use the one that is appropriate for what you are doing. In the case of dropout, we are randomly zeroing different elements in each column of the activation matrix, so computing a Boolean “mask” the same shape as the activation and then doing an elementwise multiply does exactly what we need. np.dot involves taking the dot product of one row of the first operand with one column of the second operand. How would that be useful for “zapping” neurons as we need to do in the dropout case?

Of course, dropout is one step in forward propagation and forward propagation does involve dot products. Everything always goes back to the math: what are you trying to do mathematically and then what linear algebra operations express what it is that the math formula says.
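As a rough sketch of how the two operations can sit side by side in one forward step: the function name `forward_layer_with_dropout`, the ReLU activation, the shapes, and the `keep_prob` values below are all assumptions made for illustration, not the course's code. The final division by `keep_prob` is the common "inverted dropout" scaling, which this thread does not discuss and is included here only as an assumption.

```python
import numpy as np

def forward_layer_with_dropout(X, W1, b1, keep_prob, rng):
    """Hypothetical single-layer forward step followed by dropout."""
    # Linear step: a matrix product, so np.dot is the right tool here.
    Z1 = np.dot(W1, X) + b1
    # Assumed ReLU activation.
    A1 = np.maximum(0, Z1)
    # Boolean mask with the same shape as A1; True means "keep this unit".
    D1 = rng.random(A1.shape) < keep_prob
    # Dropout step: element-wise multiply zeroes the dropped units.
    A1 = A1 * D1
    # Assumed inverted-dropout scaling to keep the expected activation unchanged.
    A1 = A1 / keep_prob
    return A1, D1

rng = np.random.default_rng(1)
X = rng.standard_normal((5, 10))    # 5 features, 10 examples (made-up sizes)
W1 = rng.standard_normal((4, 5))    # 4 hidden units
b1 = np.zeros((4, 1))

A1, D1 = forward_layer_with_dropout(X, W1, b1, keep_prob=0.5, rng=rng)

# With keep_prob=1.0 the mask keeps everything, so dropout is a no-op.
A1_full, D1_full = forward_layer_with_dropout(X, W1, b1, keep_prob=1.0, rng=rng)
```

The dot product changes the shape from (5, 10) to (4, 10), while the dropout multiply leaves the shape alone and only zeroes entries.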

John11 · November 28, 2022, 1:33am

I see, thanks for the response.

paulinpaloalto · November 28, 2022, 1:35am

Here’s another thread that discusses when to use np.dot and *.
