How to initialize the gamma and beta parameters for batch norm

jaylodha · May 13, 2021, 7:58am

While talking about batch norm, Professor Andrew introduces two new parameters namely - gamma and beta, that basically allow us to control the mean and variance of the intermediary inputs (Z[l] values for any layer “l”). Now, while discussing the implementation of the same using Gradient Descent, Professor Andrew highlights the fact that these aforementioned parameters can be updated in the similar way as the weights of the neural network. Now, my question is how do we initialize these values?

nramon · May 13, 2021, 10:25am

Hi, @jaylodha.

TensorFlow initializes gamma to 1 and beta to 0 by default, but you can specify a different initializer for any of them. What works best may be problem specific.

jaylodha · May 15, 2021, 6:01pm

Hey, thanks a lot for the reply.

Topic		Replies	Views
Initializing Batch Norm Parameters (gamma & beta) Improving Deep Neural Networks: Hyperparameter tun coursera-platform	3	561	April 22, 2022
Doubt regarding the dimensions of the parameters gamma and beta of Batch Norm Improving Deep Neural Networks: Hyperparameter tun coursera-platform	1	535	May 27, 2022
Learning beta and gamma in Batch Norm Improving Deep Neural Networks: Hyperparameter tun coursera-platform	1	628	September 4, 2022
C2 W3 Normalizing Activations in a Network Improving Deep Neural Networks: Hyperparameter tun coursera-platform	1	420	August 28, 2023
Batch Normalization Questions Improving Deep Neural Networks: Hyperparameter tun coursera-platform	2	415	September 15, 2023

How to initialize the gamma and beta parameters for batch norm

Related topics