As of yesterday, I’ve consistently been getting a ‘Kernel Restarting/The kernel appears to have died. It will restart automatically.’ message when attempting to do the training (section 4) of the U-Net assignment. My tests in code cells above are all passing.
I’ve read other postings here and this looks like a common infrastructure issue (versus an issue with my TF code) although I’m not seeing recent messages for this assignment.
Is there anything I can do? Restarting the kernel is not helping.
I started from a fresh copy of the notebook, implemented the functions, passed all the unit tests, and I got “the kernel has died” in Section “4 - Train the model”.
I don’t think I have any debug output. I removed print statements from the block in 2.1 but I’m still getting the kernel issue. My training never gets past the first epoch.
I tried getting the latest version (Help | Get Latest Version) but this didn’t help. I’d like to try and reset my lab to the original state, but I don’t see a checkpoint in File | Revert to Checkpoint
This is also happening with me during training the model. I think, as @vorpalsnark mentioned, it is a memory issue. Perhaps lots of learners doing this assignment and not clearing the memory. Or a server error.
The working theory is that Coursera’s GPU array is starved for resources. They get GPU service from AWS, and so many people are eating GPU cycles playing with chat-bots that GPU time is in short supply.
I’ve heard some students have had success with (edited) running their notebook during off-peak hours.
But you can submit your assignment without running any code. Grader don’t need to see the output. Try submitting your assignment, if you completed all the exercises.
That’s exactly what i did, i had some weird error though which i had to fix first because it didn’t want to compile my code. I was afraid that it might be because of the kernel dying. Thank you for your reply:)
I got rid of it by rewriting my assignment. I believe the error was on my side. It couldn’t interpret one of my cells.
Error:
Cell #8. Can’t compile the student’s code. Error: AssertionError(‘Error in test’)
FYI, this is still occurring. I got the “kernel has died” issue, but I was able to submit and pass the assignment. I guess I will have to wait til off-peak hours to see the results!
p.s. I’m not getting the second error that some others reported.