C1W2 weights_init(): why are we initializing BatchNorm2d?

Great class. Really well done!

I am confused.

In C1W1 we used nn.BatchNorm1d. We did not need to initialize it or any of the nn.Linear() layers.

My guess is that nn.Linear() must randomly initialize itself. Initialization is not mentioned in the documentation (Linear — PyTorch master documentation).
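For what it's worth, a quick check (just my own sketch, not from the assignment) seems to confirm that freshly constructed layers already come with initialized parameters:

from torch import nn

# A brand-new Linear layer already has small random weights, so PyTorch
# must have initialized them during construction.
lin = nn.Linear(4, 2)
print(lin.weight)
print(lin.bias)

# BatchNorm1d also comes with its scale/shift parameters pre-initialized.
bn = nn.BatchNorm1d(2)
print(bn.weight, bn.bias)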

Why do we use initialization in DCGAN?

# You initialize the weights to the normal distribution
# with mean 0 and standard deviation 0.02
def weights_init(m):
    if isinstance(m, nn.Conv2d) or isinstance(m, nn.ConvTranspose2d):
        torch.nn.init.normal_(m.weight, 0.0, 0.02)
    if isinstance(m, nn.BatchNorm2d):
        torch.nn.init.normal_(m.weight, 0.0, 0.02)
        torch.nn.init.constant_(m.bias, 0)
gen = gen.apply(weights_init)
disc = disc.apply(weights_init)

Kind regards

Andy

I believe this is only because they want to use a different initialization from the default. Notice that when they do it explicitly, they are using the Normal distribution (the DCGAN paper calls for weights drawn from a zero-centered Normal with standard deviation 0.02), whereas I think the PyTorch default for these layers is based on the Uniform distribution. There are lots of other possibilities for initialization algorithms, and sometimes you need to tweak it.
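As a rough illustration (my own sketch, not from the notebook), you can compare the out-of-the-box parameters with an explicit Normal(0, 0.02) re-initialization like the one above:

import torch
from torch import nn

# Fresh layers using PyTorch's defaults: a bounded uniform-style draw for the
# conv weights and (in recent versions) constant ones/zeros for BatchNorm.
conv = nn.Conv2d(3, 8, kernel_size=3)
bn = nn.BatchNorm2d(8)
print(conv.weight.min().item(), conv.weight.max().item())  # small, bounded values
print(bn.weight[:4], bn.bias[:4])                          # typically ones and zeros

# Re-initialize the same way weights_init does above.
torch.nn.init.normal_(conv.weight, 0.0, 0.02)
torch.nn.init.normal_(bn.weight, 0.0, 0.02)
torch.nn.init.constant_(bn.bias, 0)
print(conv.weight.std().item())  # now roughly 0.02
print(bn.weight[:4])             # now small values centered at 0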

Here’s a thread that was the top hit for “pytorch default weight initialization”.

And the initialization actually was mentioned on the PyTorch Master Documentation page for Linear if you paged down a bit:
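As I read it, the page says the weight and bias of nn.Linear are both drawn from U(-sqrt(k), sqrt(k)) with k = 1/in_features. Here is a small check (my own sketch) of that documented bound:

import math
from torch import nn

# Per the nn.Linear docs: weight and bias are sampled from
# U(-sqrt(k), sqrt(k)) where k = 1 / in_features.
in_features = 64
lin = nn.Linear(in_features, 10)
bound = 1.0 / math.sqrt(in_features)
print(lin.weight.abs().max().item() <= bound)  # expected: True
print(lin.bias.abs().max().item() <= bound)    # expected: True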
