Hi Guys,
Overall I enjoy courses from DeepLearning.AI, but somehow my first hands-on exercise hasn’t been such a great experience.
Instruction overall are easy to follow, but difficult to find help.
- First I had an issues with selecting right instance type (size) - ( [Troubleshooting] Getting “ResourceLimitExceeded” when selecting
ml.m5.2xlarge
- Course Q&A / Generative AI with Large Language Models - DeepLearning.AI . To be fair, this problem is mentioned at the Coursera page, but it’d be good to mentioned if it’s even worth trying to do the lab if you have this problem or just give a link where to report the problem and wait. - It seemed that default seized instance (2 CPU and 4GB, forgot the name and instance is blocked now). However, faces another issue during the lab - which was related to dependent libraries when trying to load the FLANT5 model :
RuntimeError: Failed to import transformers.models.t5.modeling_t5 because of the following error (look up to see its traceback): No module named ‘torch._C’
This issues has been reported several times, but doesn’t seems to have a clear resolution:
2.1 Assignment 1; torch.c runtime error - Generative AI with Large Language Models / GenAI with LLMs Resources - DeepLearning.AI - No resolution, suggestion to report a problem by support and learner idled.
2.2 Module Not Found Error: No module named ‘torch._C’ - Course Q&A / Generative AI with Large Language Models - DeepLearning.AI - long conversation, seems learner managed to Auto-magically resolve the problem through quite persistent struggle, however got the account blocked in the end - not clear resolution. (unless you consider “have you tried to turn it on/off” a resolution).
2.3 Getting “No module named ‘torch._C’” error for Lab 1 - Course Q&A / Generative AI with Large Language Models - DeepLearning.AI - suggestion to try another image, learner didn’t come back.
Somewhat suggestion 3 was somewhat I could try, however seems like since that time there were changes in images, and instead of DataScience image now we have DataScience2.0 and DataScience3.0. I have tried both, DataScience2.0 is Python 3.8 and errored out even sooner with when trying to lead the data set. At this stage my AWS learning account expired and I got Account deactivated.
So I had to go and submit the GenAI with LLMs Lab Issue Report (google.com) issue form (I actually did it twice because there is no notification, my application has been received or not and not clear how to follow up).
I’ve ended up spending ~2h hours trying to configure the environment for the lab, instead of actually learning.
Outcome: So I suppose I’ll wait till my AWS lab/learning account will be re-activated and hope that this time I won’t face the issues above.
I might not get the issue when I’ll get my account, but if I may ask DeepLearning.AI for some support. it’d be great to:
- Have some sort of notification or status for Google form report (to know that your problem is being taken care of).
- if possible please have a look a the unresolve torch._C problem, I’m sure it’ll help other people who might be as frustrated as I was.
- It might be helpful to mention full library requirements for the exercise/Lab somewhere if I wish to do it in my own environment.
Thank you in advance.