Element-wise multiplication or dot product in backpropagation

paulinpaloalto · December 27, 2023, 4:06pm

The Chain Rule deals with the composition of functions, so how the derivatives are handled depends on what the functions are. In some cases they involve dot products (the linear step of a layer, e.g. Z1 = W1 \cdot X + b1) and in some cases they are “elementwise” operations, e.g. the activation functions. So for example \frac {\partial A1}{\partial Z1} is just the derivative of the layer 1 activation function, which was applied elementwise, so that factor enters the chain rule as an elementwise product.
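To make that concrete, here is a minimal NumPy sketch of one layer, not the course's assignment code. The layer sizes, the choice of tanh as the activation, and the upstream gradient `dA1` are all assumptions made up for illustration; in a real network `dA1` would come from the layers after this one, and the 1/m averaging factor from the cost is left out.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: 3 input features, 4 hidden units, 5 examples.
n_x, n_h, m = 3, 4, 5
X = rng.standard_normal((n_x, m))
W1 = rng.standard_normal((n_h, n_x))
b1 = np.zeros((n_h, 1))

# Forward pass: the linear step is a dot product,
# the activation is applied elementwise.
Z1 = np.dot(W1, X) + b1          # shape (n_h, m)
A1 = np.tanh(Z1)                 # shape (n_h, m)

# Assumed upstream gradient dL/dA1 (stand-in for the later layers).
dA1 = rng.standard_normal(A1.shape)

# Backward through the activation: elementwise multiply,
# because tanh was applied elementwise. tanh'(z) = 1 - tanh(z)^2.
dZ1 = dA1 * (1 - A1 ** 2)        # shape (n_h, m)

# Backward through the linear step: dot products,
# because Z1 was built with a dot product.
dW1 = np.dot(dZ1, X.T)                     # shape (n_h, n_x), same as W1
db1 = np.sum(dZ1, axis=1, keepdims=True)   # shape (n_h, 1), same as b1
dX = np.dot(W1.T, dZ1)                     # shape (n_x, m), same as X
```

A useful sanity check is that each gradient has the same shape as the quantity it is the gradient of. If `*` and `np.dot` get swapped, the shapes usually stop matching.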

This is beyond the scope of this course, since Prof Ng does not really cover the underlying calculus. Here’s a thread with lots of links to supplementary material about the mathematics of backpropagation.
