In $dZ^{[l]} = dA^{[l]} * g^{[l]\prime}(Z^{[l]})$, $g'$ is the derivative of $g$ (the activation function)
Why is it $g'$ in $dZ^{[l]} = dA^{[l]} * g^{[l]\prime}(Z^{[l]})$ and not $dg$?
$dZ$ is $dL/dZ$. Is it because the term is not $dL/dg$?
As I mentioned on your previous thread about this, this is beyond the scope of this course. I gave links on the other thread that cover the derivations.
Suppose $g^{[l]}$ is ReLU or sigmoid. What do you mean by $dg = \displaystyle \frac{\partial L}{\partial g}$?
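For anyone following the thread, the notation question can be checked numerically. In the formula quoted above, a `d` prefix marks a gradient of the loss (so `dZ` is $dL/dZ$ and `dA` is $dL/dA$), while $g'$ is the ordinary derivative of the activation with respect to its own input. The sketch below is only an illustration under stated assumptions: a single scalar unit, a sigmoid activation, and a made-up squared-error loss that does not come from the course. It compares `dA * g'(Z)` against a finite-difference estimate of $dL/dZ$.

```python
import math


def sigmoid(z):
    """Activation g(z)."""
    return 1.0 / (1.0 + math.exp(-z))


def sigmoid_prime(z):
    """Ordinary derivative g'(z) = g(z) * (1 - g(z)); it does not involve the loss."""
    s = sigmoid(z)
    return s * (1.0 - s)


def loss(a, y):
    """Hypothetical toy loss for illustration only: L = 0.5 * (a - y)^2."""
    return 0.5 * (a - y) ** 2


# Arbitrary example values (assumptions, not from the thread).
z, y = 0.3, 1.0

a = sigmoid(z)
dA = a - y                    # dL/dA for the toy loss above
dZ = dA * sigmoid_prime(z)    # chain rule: dL/dZ = dL/dA * g'(Z)

# Central finite-difference estimate of dL/dZ as an independent check.
eps = 1e-6
dZ_numeric = (loss(sigmoid(z + eps), y) - loss(sigmoid(z - eps), y)) / (2 * eps)

print("chain rule dZ:", dZ)
print("numeric dZ:   ", dZ_numeric)
```

The two printed values should agree to several decimal places. The loss dependence enters only through `dA`; the factor `sigmoid_prime(z)` is the same whatever loss is used, which is why it is written $g'$ rather than with a `d` prefix.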