You cannot currently connect to a GPU due to usage limits in Colab

Gee-Veepio · September 15, 2021, 5:25am

I can’t complete the final assignment of the course because I have run out of GPU time.

I haven’t done anything excessive; just run the labs and completed assignments for this course.

Any idea how long I have to wait before I get more GPU time?

Gee-Veepio · September 15, 2021, 10:38am

I had to run this for 4.5 hours with no acceleration but lost everything because I got back too late and it timed out.

I went to bed and woke up and now, after 17 hours, I still have no GPU acceleration.

I’ve started another 4.5-hour run. Hopefully, I’ll catch this one before it times out.

I guess the moral of the story is don’t burn through the course too quickly because Google might revoke your GPU privileges. One of the warning signs seems to be that Google Colab starts asking you whether you are a robot.

EDIT: GPU access was restored during my second run at this. So I restarted it with GPU and completed the assignment.

To answer my original question: it took about 18 hours for my GPU privileges to come back.

user11 · October 23, 2021, 3:23pm

how long does it take to train 80ish epochs on gpu?

Gee-Veepio · October 23, 2021, 8:00pm

I just ran it and was getting about 40 seconds per epoch.

ai_curious · November 20, 2021, 6:20pm

Love to know how many epochs it took to pass the grader. I, too, have been put in GPU jail and am hoping to get to completion without it. Right now it is taking about 4 minutes per epoch

Gee-Veepio · November 20, 2021, 8:03pm

I passed with 60 epochs.

ai_curious · November 20, 2021, 8:31pm

Appreciate the prompt reply. My non-GPU run is at 35 heading to 60. Hope the network is correct and I don’t have to rerun!

Gee-Veepio · November 20, 2021, 8:52pm

I feel your pain. Good luck!

balaji.ambresh · November 22, 2021, 12:38pm

@ai_curious

If you simulate a validation set with higher metrics, odds are good that you can stop training just about at the right time (could be below 60). Here’s the outline:

At the end of each epoch, use the generator to create a fresh set of images using new noise of the same shape you trained the gan on.
Use the discriminator to predict the confidence level of real image
Suppose you have a threshold of 65% and want atleast 50% of the datapoints to be identified as real, stop training when this happens.
For submission, pick the images that fooled the discriminator in descending order of discriminator prediction in step 3.
You might want to save your weights to continue training.

It would be good to keep in mind that higher cutoffs could require more epochs. Hope this helps.

ai_curious · November 22, 2021, 1:48pm

Now that I have completed the Specialization I may go back and tinker with some of the code to do things more scientifically. Your point about collecting discriminator confidence is a good one - certainly a better approach than eyeballing it, which I realized after the first submission doesn’t work very well at all. Luckily the grader provided that information for me in the feedback, and i was able to meet the threshold and finish the course and Specialization within the 1 week trial period. In a commercial situation I would no doubt be more careful about resource utilization and implement an early stopping rule, but for these programming assignments was typically minimizing a different parameter. Cheers

Topic		Replies	Views
How long will the assignment training take? Generative Deep Learning with TensorFlow week-module-4	4	399	December 14, 2023
Tip for satisfying the GANS with Hands grader Generative Deep Learning with TensorFlow week-module-4	2	721	December 14, 2022
C4W3 assignment - training model take a very long time and then disconnected due to gpu limit Generative Deep Learning with TensorFlow week-module-3	2	485	December 25, 2023
C4 assign wk 3 -epochs Generative Deep Learning with TensorFlow	8	535	January 3, 2024
Finish 50 epochs need 13.89 hr Advanced Computer Vision with TensorFlow week-module-1	21	818	December 24, 2023

You cannot currently connect to a GPU due to usage limits in Colab

Related topics