I get the following error when running the script. I really need this working in order to continue, please respond quickly.
CalledProcessError Traceback (most recent call last)
in
3
4 # import the prepare_data.py module
----> 5 import prepare_data
6
7 # reload the module if it has been previously loaded
Did you complete these steps before running the current cell?
Open the file src/prepare_data.py. Go through the comments to understand its content.
Find and review the convert_to_bert_input_ids() function, which contains the RoBERTa tokenizer configuration.
Complete method encode_plus of the RoBERTa tokenizer. Pass the max_seq_length as a value for the argument max_length. It defines a pad to a maximum length specified.
Save the file src/prepare_data.py (with the menu command File → Save Python File).
Thanks for reaching out. Yes I did. The stupid thing is that it got stuck on importing the script to run it on an example. However once I run it as a script on the sklearn container it all worked. So somewhere locally in the course environment something is/was not working properly.
I am continuously facing the same error in Exercise 2
#######################################################################################################
Please check that the function ‘convert_to_bert_input_ids’ in the file src/prepare_data.py is complete.
#######################################################################################################
Once Sagemaker has loaded, click “Data Science” in the top right hand side. Then there should be a dropdown menu that says something like change kernel, change it to “Data Science 2.0”, sometimes you need retry it, but once it is loaded it solves the dependency issues