Pix2Pix Assignment issues

Barb · September 1, 2021, 2:00am

Hi everyone,

I’ve encountered 2 problems while working on this assignment.

The first one is the Dead Kernel issue, as described here: Dead Kernel warning in Pix2Pix Assignment - #3 by 28utkarsh. I fixed it the same way, by setting pretrain to false.

The second one is: RuntimeError: expected device cpu but got device cuda:0

I get this error when I try to train the Pix2Pix model. If I change the device to cpu, the error disappears, but the training seems to freeze from the beginning.

What should I do?

Barb · September 1, 2021, 2:33am

Follow up: still submitted the code and it worked.

shreyasvedpathak · September 1, 2021, 3:24am

Hello @Barb,

It’s great that you were able to tackle your first problem. Concerning your second issue, when you switch to CPU, the training takes a long time because:

GANs are computationally expensive
CPUs are slow, right?

Perhaps it is not freezing as you claim, but rather training at a very slow rate.

Since the assignment you’re referring to doesn’t need you to submit any files, your assignment was accepted as long as the code was valid (logically, syntactically, etc.).

Hope it helps

Barb · September 1, 2021, 3:30am

I saw a red broken-chain icon next to the cell, so I assumed it had frozen.

Thanks for the update

28utkarsh · September 1, 2021, 4:00am

Hi @Barb

Firstly, thanks for raising this question, because it gave me an idea for solving this issue.

Just add the parameter map_location = device while you are loading the pre-trained checkpoint. Also, make sure that you have set the device to cuda in the training preparation step. Check Line#16 for setting the map_location parameter.
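As a rough sketch of what that could look like (this is not the notebook's actual cell: the tiny gen model, the "gen" checkpoint key, and the in-memory buffer standing in for the checkpoint file are all made up for illustration):

```python
import io

import torch
from torch import nn

# Use the GPU when one is available, otherwise fall back to the CPU.
device = "cuda" if torch.cuda.is_available() else "cpu"

# Hypothetical stand-in for the assignment's generator.
gen = nn.Linear(4, 2)

# Write a checkpoint to an in-memory buffer so the sketch needs no file.
# In the notebook this would be the pre-trained checkpoint file instead.
buffer = io.BytesIO()
torch.save({"gen": gen.state_dict()}, buffer)
buffer.seek(0)

# map_location remaps every stored tensor onto the chosen device at load time.
loaded_state = torch.load(buffer, map_location=torch.device(device))
gen.load_state_dict(loaded_state["gen"])
gen = gen.to(device)

print({name: t.device for name, t in loaded_state["gen"].items()})
```

Without map_location, tensors are restored to the device they were saved from, which can conflict with the device the rest of the training code uses.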

Secondly, I agree with the point of @shreyasvedpathak regarding the slow rate of training of a model using CPU.

Let us know if you face any other problem.

Barb · September 1, 2021, 4:21am

Hi @28utkarsh

I just switched the device back to cuda, and the second error still appears when using cuda.

Here is a screenshot of the error

28utkarsh · September 1, 2021, 4:48am

Hi @Barb

While calculating the adversarial loss, the ones tensor is created on the CPU by the statement torch.ones(pred.shape). You need to send it to the required device while training the model.

So, replace that adversarial loss calculation statement with the following statement:

adv_loss = adv_criterion(pred, torch.ones(pred.shape).to(device))
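Here is a small self-contained sketch of that fix. The random pred tensor is only a stand-in for the discriminator output, and nn.BCEWithLogitsLoss is assumed as the adv_criterion; the notebook may define these differently. The torch.ones_like variant is an alternative I am suggesting, since it creates the target on the same device as pred automatically.

```python
import torch
from torch import nn

device = "cuda" if torch.cuda.is_available() else "cpu"

# Assumed criterion; the assignment may use a different loss.
adv_criterion = nn.BCEWithLogitsLoss()

# Stand-in for the discriminator's prediction, already on the training device.
pred = torch.randn(2, 1, 4, 4, device=device)

# torch.ones(pred.shape) is created on the CPU, so move it explicitly.
adv_loss = adv_criterion(pred, torch.ones(pred.shape).to(device))

# Alternative: ones_like inherits the shape, dtype, and device of pred.
adv_loss_alt = adv_criterion(pred, torch.ones_like(pred))

print(adv_loss.item(), adv_loss_alt.item())
```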

Barb · September 1, 2021, 5:26am

It’s working now. I had to use the .to(device) also on the prediction.
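For anyone hitting the same thing: one possible reason the prediction ends up on the CPU is that the model or its input batch was never moved to the device. This is only a guess at the cause, shown as a sketch; the small disc network and the random images batch are invented stand-ins, not the assignment's code.

```python
import torch
from torch import nn

device = "cuda" if torch.cuda.is_available() else "cpu"

# Hypothetical stand-in for the discriminator, moved to the training device.
disc = nn.Sequential(nn.Conv2d(3, 1, kernel_size=3, padding=1)).to(device)

# A freshly created batch lives on the CPU by default.
images = torch.randn(2, 3, 8, 8)

# Moving the input first means the output is produced on the same device.
pred = disc(images.to(device))
target = torch.ones_like(pred)

# Both tensors now share a device, so a loss between them will not raise.
same_device = pred.device == target.device
print(pred.device, target.device, same_device)
```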

28utkarsh · September 1, 2021, 5:35am

Congratulations @Barb, on completing your assignment successfully.
