Derivation of the gradients of W^[2] and b^[2] in the 1-hidden-neuron network

Check out this YouTube guide by Eddy Shyu and this explanation of the chain rule.
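
In case it helps while you go through those, here is a rough sketch of how the chain rule gives both gradients. This thread doesn't state the setup, so I'm assuming a single training example, a sigmoid output unit, and the binary cross-entropy loss; the symbols a^[1], z^[2], a^[2], y, and L are my own labels for the hidden activation, output pre-activation, output activation, label, and loss. If your network uses a different activation or loss, the middle factors change.

```latex
% Assumed forward pass (one hidden neuron, so W^{[2]} and a^{[1]} are scalars):
%   z^{[2]} = W^{[2]} a^{[1]} + b^{[2]},   a^{[2]} = \sigma(z^{[2]})
% Assumed loss (binary cross-entropy, single example):
%   L = -\bigl( y \log a^{[2]} + (1 - y) \log(1 - a^{[2]}) \bigr)

\begin{align*}
\frac{\partial L}{\partial a^{[2]}} &= -\frac{y}{a^{[2]}} + \frac{1 - y}{1 - a^{[2]}} \\
\frac{\partial a^{[2]}}{\partial z^{[2]}} &= a^{[2]} \bigl( 1 - a^{[2]} \bigr) \\
\frac{\partial L}{\partial z^{[2]}}
  &= \frac{\partial L}{\partial a^{[2]}} \cdot \frac{\partial a^{[2]}}{\partial z^{[2]}}
   = -y \bigl( 1 - a^{[2]} \bigr) + (1 - y)\, a^{[2]}
   = a^{[2]} - y \\
\frac{\partial L}{\partial W^{[2]}}
  &= \frac{\partial L}{\partial z^{[2]}} \cdot \frac{\partial z^{[2]}}{\partial W^{[2]}}
   = \bigl( a^{[2]} - y \bigr)\, a^{[1]} \\
\frac{\partial L}{\partial b^{[2]}}
  &= \frac{\partial L}{\partial z^{[2]}} \cdot \frac{\partial z^{[2]}}{\partial b^{[2]}}
   = a^{[2]} - y
\end{align*}
```

The only difference between the two results is the last factor: z^[2] depends on W^[2] through a^[1], and on b^[2] with a coefficient of 1.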
