I’ve been trying to derive the gradients for batch normalisation since I want to implement it on my own. I’ve successfully found the gradients with respect to beta and gamma, but I can’t figure out the gradients for the weights, the biases (I’m not removing them just yet), or dA for the previous layer (the layer preceding this one, which will use it for its own backprop step). The main problem I’m facing is working out the derivatives of the mean and the standard deviation with respect to Z (the linear combination of the weights and bias).
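To make the notation concrete, I’m using the standard mini-batch setup, with m examples in the batch and epsilon for numerical stability:

$$\mu = \frac{1}{m}\sum_{i=1}^{m} Z^{(i)}, \qquad \sigma^2 = \frac{1}{m}\sum_{i=1}^{m}\left(Z^{(i)} - \mu\right)^2, \qquad Z^{(i)}_{\text{norm}} = \frac{Z^{(i)} - \mu}{\sqrt{\sigma^2 + \epsilon}}, \qquad \tilde{Z}^{(i)} = \gamma\, Z^{(i)}_{\text{norm}} + \beta$$

so what I’m stuck on is how $\partial\mu/\partial Z^{(i)}$ and $\partial\sigma/\partial Z^{(i)}$ enter into $\partial L/\partial Z^{(i)}$.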
Thanks! I was able to figure it out using this very blog post. Interestingly, only the computation for dZ changes in the entire process. dW and db are still calculated in the same way. Pretty amazing how these things work out in the end.
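In case it helps anyone else, here is a minimal numpy sketch of that modified step, assuming the forward pass cached Z_norm, gamma and the inverse standard deviation (the function and variable names are mine, not from the course notebooks):

```python
import numpy as np

def batchnorm_backward(dZ_tilde, cache):
    """Backprop through batch norm applied to Z (before the activation).

    dZ_tilde : gradient of the loss w.r.t. gamma * Z_norm + beta, shape (n_units, m)
    cache    : (Z_norm, gamma, inv_std) saved in the forward pass, where
               Z_norm = (Z - mu) * inv_std and inv_std = 1 / sqrt(var + eps)
    """
    Z_norm, gamma, inv_std = cache
    m = dZ_tilde.shape[1]

    # Parameter gradients (the part worked out in the post)
    dgamma = np.sum(dZ_tilde * Z_norm, axis=1, keepdims=True)
    dbeta = np.sum(dZ_tilde, axis=1, keepdims=True)

    # Gradient w.r.t. the normalised values
    dZ_norm = dZ_tilde * gamma

    # The only step that changes: the chain rule through Z_norm, the mean
    # and the variance, collapsed into one compact expression.
    dZ = (inv_std / m) * (
        m * dZ_norm
        - np.sum(dZ_norm, axis=1, keepdims=True)
        - Z_norm * np.sum(dZ_norm * Z_norm, axis=1, keepdims=True)
    )

    # dW and db are then computed from dZ exactly as before,
    # e.g. dW = (1/m) * dZ @ A_prev.T
    return dZ, dgamma, dbeta
```

The three terms inside the parentheses are the chain-rule contributions through Z_norm itself, through the mean, and through the variance, which is why nothing else in the backward pass needs to change.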
I’m trying to do something similar and add batch normalization to the L-layer neural network that we created in the first course. I would appreciate it if you could share how you changed the dZ computation.