Hello.
In trying to run the cell:
results = [generate_with_single_input(query, top_p = 0.8, max_tokens = 500 + random.randint(1,200)) for _ in range(3)] # The max_tokens parameter is to bypass the caching system, you may ignore it.
for i,result in enumerate(results):
print(f"Call number {i+1}:\nResponse: {result[‘content’]}")
I get an error message:
InvalidRequestError: Error code: 400 - {"message": "Unable to access non-serverless model meta-llama/Llama-3.2-3B-Instruct-Turbo. Please visit https://api.together.ai/models/meta-llama/Llama-3.2-3B-Instruct-Turbo to create and start a new dedicated endpoint for the model.", "type_": "invalid_request_error", "code": "model_not_available"}
This likely means that the Python library does not have permissions to access the LLM?
Some pointers will be appreciated.
Thank you,
Anjan
@Anjan1970
Labs were recently updated. so before you start working on any labs, click on the 3 dots left top corner, then click to Restore original version, so you have the updated notebook to work upon where you wouldn’t encounter this error.
Regards
Dr. Deepti
@Anjan1970
Recent staff update
together.ai has deprecated a lot of models in the past days resulting in concurrent issues with labs of RAG courses.
Lab assignment 3 to 5 and all ungraded labs are updated now.
You need to Restore original version, to get the updated notebook.
I also would recommend to first delete the old file, and restore original version, so any cache issue is not encountered when running codes again.
Courses, labs are created with multiple channels, so labs gets affected when any changes.made at one end and can take multiple attempts to resolve issue.
Let us know if you still continue to have issue
Regards
Dr. Deepti