C3W2B_Assignment: size mismatch for final.weight?

Qi_Yu · May 18, 2022, 2:32am

I tried to load the pretrained model but got the following error:

RuntimeError: Error(s) in loading state_dict for Discriminator:
size mismatch for final.weight: copying a param with shape torch.Size([1, 128, 1, 1]) from checkpoint, the shape in current model is torch.Size([8, 128, 1, 1]).
size mismatch for final.bias: copying a param with shape torch.Size([1]) from checkpoint, the shape in current model is torch.Size([8]).

Is the notebook checkpoint outdated?

Wendy · May 18, 2022, 8:02pm

Hi @Qi_Yu,
I just tried to confirm and it is working fine for me. The size your error says it is loading from the checkpoint also looks right (matches what I’m seeing). BUT the size for your current model, torch.Size([8, 128, 1, 1]), does NOT match.

Please check your implementation for your line of code in Discriminator that sets self.final. You can run the unit test in the cell immediately following the Discriminator cell to test your discriminator implementation.
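To narrow down which layer is wrong before calling load_state_dict, it can help to compare the checkpoint's tensor shapes against the model's own. The following is a minimal sketch, not the assignment's code: TinyDisc is a hypothetical stand-in with only a final layer, find_shape_mismatches is a helper name made up for illustration, and the channel counts are chosen only to reproduce the shapes in the error above.

```python
from torch import nn


def find_shape_mismatches(model, state_dict):
    """Return (name, checkpoint_shape, model_shape) for each mismatched tensor."""
    model_state = model.state_dict()
    mismatches = []
    for name, ckpt_tensor in state_dict.items():
        if name in model_state and model_state[name].shape != ckpt_tensor.shape:
            mismatches.append(
                (name, tuple(ckpt_tensor.shape), tuple(model_state[name].shape))
            )
    return mismatches


class TinyDisc(nn.Module):
    """Hypothetical stand-in for the Discriminator; only the final layer is modelled."""

    def __init__(self, out_channels):
        super().__init__()
        self.final = nn.Conv2d(128, out_channels, kernel_size=1)


# Plays the role of the pretrained checkpoint (one output channel).
checkpoint_state = TinyDisc(out_channels=1).state_dict()
# Plays the role of the mis-sized model from the error message.
current_model = TinyDisc(out_channels=8)

mismatches = find_shape_mismatches(current_model, checkpoint_state)
for name, ckpt_shape, model_shape in mismatches:
    print(f"{name}: checkpoint {ckpt_shape} vs model {model_shape}")
```

With a real checkpoint, the dictionary returned by torch.load (or the relevant sub-dictionary inside it) would take the place of checkpoint_state.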

Qi_Yu · May 19, 2022, 2:33am

Thank you! Solved. But the unit test for the Discriminator was not robust enough to catch my error.

Wendy · May 19, 2022, 6:36pm

@Qi_Yu, congrats on finding the issue!

If you don’t mind, could you DM me the line you entered that caused the error but passed the unit test? I can pass it along to the development team to see if they want to try to enhance the unit tests to cover this case. Of course, it’s not always practical to cover every case with unit tests, but it’d be worth taking a look.

Shatru · July 1, 2024, 11:51pm

I ran into this same issue. The unit test would catch it if it initialized the Discriminator with hidden_channels left at its default value.
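A rough sketch of why a test can miss this kind of mistake, under loudly stated assumptions: suppose (hypothetically, the thread does not show the actual line) that the bug was passing hidden_channels as the final layer's output channel count, and that the test built the model with hidden_channels=1. SketchDisc and its buggy flag are made up for illustration, and the factor of 16 is chosen only so that the default reproduces the 128 input channels in the error message.

```python
from torch import nn


class SketchDisc(nn.Module):
    """Hypothetical stand-in; only the final layer is modelled."""

    def __init__(self, hidden_channels=8, buggy=False):
        super().__init__()
        # Assumed bug: reusing hidden_channels where a single output channel is expected.
        out_channels = hidden_channels if buggy else 1
        self.final = nn.Conv2d(hidden_channels * 16, out_channels, kernel_size=1)


def final_shape(model):
    """Shape of the final layer's weight as a plain tuple."""
    return tuple(model.final.weight.shape)


# With hidden_channels=1 the buggy and correct layers have identical shapes,
# so a shape-based check cannot tell them apart.
masked = final_shape(SketchDisc(1, buggy=True)) == final_shape(SketchDisc(1, buggy=False))

# With the default hidden_channels the shapes differ and the bug is visible.
caught = final_shape(SketchDisc(buggy=True)) != final_shape(SketchDisc(buggy=False))

print(masked, caught)
```

Under these assumptions, a test that builds the model with the default hidden_channels would expose the mismatch, which is the point of the suggestion above.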

Nydia · July 3, 2024, 12:58pm

Is your problem solved? That is a good suggestion; I will report it.
