Another bug - APIError: Error code: 422 "Tokens must be <= 8193"

I’m trying to take this course but I ran into another bug, I think.

First, I got stuck on this error:

. . .
InvalidRequestError: Error code: 400 

See this other post: "Is this course broken?"

I got past this first error by replacing the client.chat.completions.create() call with the following:

output = client.chat.completions.create(
    model="meta-llama/Llama-3-70b-chat-hf",
    messages=[
        {"role": "system", "content": system_prompt},
        {"role": "user", "content": world_prompt}
    ],
    tools=[],
    tool_choice="auto"
)

Note that two new parameters have been added: tools=[] and tool_choice="auto".
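For reference, the reply can then be read from the response object in the usual OpenAI-style way (the Together client mirrors this interface, so treat the field names below as an assumption rather than something from the course notebook):

# Assumed OpenAI-style response shape from the Together client;
# adjust the field names if your client version differs.
print(output.choices[0].message.content)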

But now, in lesson L2_Interactive AI Applications I get this other error:

. . .
APIError: Error code: 422 - {"message": "Input validation error: `inputs` tokens + `max_new_tokens` must be <= 8193. Given: 6376 `inputs` tokens and 2048 `max_new_tokens`", "type_": "invalid_request_error", "param": null, "code": null}

It seems to me that the limit on the number of tokens that can be passed to the model has changed.

Does anyone have an idea how to fix it?
Thanks

Hello,
You are getting the token-count error because the 70B Llama 3 chat model on Together has an 8192-token context window (prompt + reply), but our prompt alone is nearly 6376 tokens. Because the Together Python client injects a default max_tokens=2048 (reported as max_new_tokens) when we don't set it explicitly, the server sees:
6376 input + 2048 requested = 8424 > 8192
We can resolve the 422 error by explicitly setting max_tokens. I set it to 256, which is more than enough for our use.
(Note: the token counts are given in the error message.)
Hope this helps!
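If you want to double-check the arithmetic before calling the API, a rough sketch like the one below counts the prompt tokens locally with the Hugging Face tokenizer and caps max_tokens so prompt + reply stay under 8192. This is only an estimate and assumes you have access to the gated meta-llama tokenizer; Together's server-side count may differ slightly.

# Rough local estimate of the prompt size (assumes access to the gated
# meta-llama tokenizer on Hugging Face; Together's count may differ slightly).
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Meta-Llama-3-70B-Instruct")

messages = [
    {"role": "system", "content": system_prompt},
    {"role": "user", "content": world_prompt},
]

# apply_chat_template returns the token ids the chat template would send as `inputs`
prompt_tokens = len(tokenizer.apply_chat_template(messages, add_generation_prompt=True))

# Keep prompt + reply within the 8192-token context window
max_tokens = min(256, 8192 - prompt_tokens)
print(prompt_tokens, max_tokens)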


Thank you RakshaC, that explanation worked perfectly. For others' reference, I am posting the updated code that worked for me. This is for the video "Interactive AI Applications"; it is the third code box underneath "Generating an Initial Start."

model_output = client.chat.completions.create(
    model="meta-llama/Llama-3-70b-chat-hf",
    temperature=1.0,
    max_tokens=256,
    messages=[
        {"role": "system", "content": system_prompt},
        {"role": "user", "content": world_info + '\nYour Start:'},
    ],
    tools=[],
    tool_choice="auto")


Thank you for reporting! Please take into account that this issue is related to updates on the Together server.

As @adam616 just shared, we need to explicitly add extra parameters when requesting the model output.

I just ran the whole Lesson 1 notebook with this code updated and everything works fine!

In the upcoming weeks, I’ll be updating the notebooks and that will be reflected in the platform!

Thank you again for reporting this!

Thank you for your support! Indeed, it is a matter of the max_tokens and the model!

I’m using this updated code:

model_output = client.chat.completions.create(
    # model="meta-llama/Llama-3-70b-chat-hf",
    model="meta-llama/Llama-3.3-70B-Instruct-Turbo",
    temperature=1.0,
    messages=[
        {"role": "system", "content": system_prompt},
        {"role": "user", "content": world_info + '\nYour Start:'}
    ],
    max_tokens=1800  # update
)

This follows the recommendation from the documentation.

Notebooks are being reviewed and will be updated with these fixes in the platform!