C2-W1 - prepare_data.py gives errors

Sacha_van_Weeren · July 28, 2022, 7:07pm

I get the following error when running the script. I really need this working in order to continue, please respond quickly.

CalledProcessError Traceback (most recent call last)
in
3
4 # import the prepare_data.py module
----> 5 import prepare_data
6
7 # reload the module if it has been previously loaded

~/src/prepare_data.py in
26 subprocess.check_call([sys.executable, “-m”, “conda”, “install”, “-c”, “pytorch”, “pytorch==1.6.0”, “-y”])
27
—> 28 subprocess.check_call([sys.executable, “-m”, “conda”, “install”, “-c”, “conda-forge”, “transformers==3.5.1”, “-y”])
29 from transformers import RobertaTokenizer
30

/opt/conda/lib/python3.7/subprocess.py in check_call(*popenargs, **kwargs)
361 if cmd is None:
362 cmd = popenargs[0]
→ 363 raise CalledProcessError(retcode, cmd)
364 return 0
365

CalledProcessError: Command ‘[’/opt/conda/bin/python’, ‘-m’, ‘conda’, ‘install’, ‘-c’, ‘conda-forge’, ‘transformers==3.5.1’, ‘-y’]’ died with <Signals.SIGKILL: 9>.

Mubsi · July 29, 2022, 11:46am

Hi @PDS_Mentors,

Can one of you help here ?

Thanks,
Mubsi

bisht · August 1, 2022, 5:51am

Hello Sacha ,

Did you complete these steps before running the current cell?

Open the file src/prepare_data.py. Go through the comments to understand its content.
Find and review the convert_to_bert_input_ids() function, which contains the RoBERTa tokenizer configuration.
Complete method encode_plus of the RoBERTa tokenizer. Pass the max_seq_length as a value for the argument max_length. It defines a pad to a maximum length specified.
Save the file src/prepare_data.py (with the menu command File → Save Python File).

Sacha_van_Weeren · August 2, 2022, 7:28am

Thanks for reaching out. Yes I did. The stupid thing is that it got stuck on importing the script to run it on an example. However once I run it as a script on the sklearn container it all worked. So somewhere locally in the course environment something is/was not working properly.

Praveen_Juyal · August 29, 2022, 2:56pm

I am continuously facing the same error in Exercise 2

#######################################################################################################
Please check that the function ‘convert_to_bert_input_ids’ in the file src/prepare_data.py is complete.
#######################################################################################################

Sacha_van_Weeren · August 30, 2022, 6:02am

did you set the max_length in the script?

Adam_Ezzat · January 4, 2023, 3:44pm

Dealing with the same problem here, curious if there was a way to fix this issue?

Bach_Nguyen1 · March 18, 2023, 1:56am

It has not been fixed.

Ajmal · March 22, 2023, 9:09am

Did someone manage to resolve this since i am stuck with same getting errors, i have done needed changes on python code in src as well

graham_broughton · March 23, 2023, 1:04am

Once Sagemaker has loaded, click “Data Science” in the top right hand side. Then there should be a dropdown menu that says something like change kernel, change it to “Data Science 2.0”, sometimes you need retry it, but once it is loaded it solves the dependency issues

Topic		Replies	Views
C21 Build, Train, and Deploy ML Pipelines using BERT Exercise Errors Build, Train, and Deploy ML Pipelines using BERT	1	564	December 6, 2022
C2_W1_Assignment 1 Exercise 2 Build, Train, and Deploy ML Pipelines using BERT week-1	0	310	February 2, 2024
C2-W1 Lab: CalledProcessError on Exercise 2 Build, Train, and Deploy ML Pipelines using BERT	2	554	December 8, 2022
C2W1: Please check that the function 'convert_to_bert_input_ids' in the file src/prepare_data.py is complete. Error Build, Train, and Deploy ML Pipelines using BERT	2	609	March 20, 2022
C2W1 assingment - prepare_data issue (exercise 2) Build, Train, and Deploy ML Pipelines using BERT	11	628	March 26, 2023

C2-W1 - prepare_data.py gives errors

Related topics