dJ/dA Gradient: Same First Input or Not for All Activations

Hi Sir,

For dJ/dA[L], the derivative of the cost function with respect to A[L] (the activations of the final layer), we have the formula below. We obtained this formula by plugging the sigmoid function into the cost function. Will the formula be the same for every activation function we plug into the cost function, or could the equation below come out differently for different activation functions after deriving? If it is different, where can I find the derived equations for all the activations covered in the lecture video? Please kindly help with this.

dA[L] = -(Y / A[L] - (1 - Y) / (1 - A[L]))
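For context, here is a minimal NumPy sketch of how this quantity is typically computed, assuming the cost J is the binary cross-entropy, J = -(1/m) * sum(Y*log(A[L]) + (1-Y)*log(1-A[L])); the array names AL and Y are illustrative, not from any particular assignment.

```python
import numpy as np

# Example activations of the final layer and the corresponding labels
# (assumed shapes: (1, m), as in a binary-classification output layer).
AL = np.array([[0.8, 0.3, 0.6]])
Y = np.array([[1.0, 0.0, 1.0]])

# Derivative of the binary cross-entropy cost with respect to AL.
# Note this step differentiates the cost, not the activation:
# dJ/dAL = -(Y/AL - (1-Y)/(1-AL))
dAL = -(np.divide(Y, AL) - np.divide(1 - Y, 1 - AL))
print(dAL)
```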

Dear Mentor, can someone please help answer this? Does the above formula remain the same for computing dAL whatever activation function is used in the output layer,

or does the above formula work only when the sigmoid function is used in the output layer?

Dear Mentor, can you please help with this question?

@afofonjka
@suki
@roannalun
@sjfischer
@petrifast
@yanivh
@aimr