Hello fellows,
I am working on a Digital School System that uses the pre-trained Llama-3.1-8B-Instruct model, but I am experiencing response cut-offs: when the teacher or student interface sends a prompt to generate tasks, the model returns incomplete responses. I would appreciate it if someone could help or recommend something.
Hi @mkt11
If you are using the LangChain framework, make sure to increase the value of `max_new_tokens` passed in `model_kwargs`, so you can get complete answers from the LLM.
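To make this concrete, here is a minimal sketch of the request body a Llama-3.1 SageMaker endpoint typically expects. The `"inputs"`/`"parameters"` schema assumes the Hugging Face TGI/LMI serving container; the key names may differ for other containers, so treat this as an illustration rather than the exact API.

```python
import json

def build_payload(prompt: str, max_new_tokens: int = 1024) -> str:
    """Build the JSON request body for a Llama-style SageMaker endpoint.

    Assumes the TGI/LMI payload convention ({"inputs": ..., "parameters": ...});
    check your container's docs if responses still come back truncated.
    """
    body = {
        "inputs": prompt,
        "parameters": {
            # Raise this value if answers are being cut off mid-sentence.
            "max_new_tokens": max_new_tokens,
            "temperature": 0.6,
            "top_p": 0.9,
        },
    }
    return json.dumps(body)

payload = build_payload("Generate five algebra practice tasks.", max_new_tokens=2048)
```

The same `parameters` dict is what you would pass as `model_kwargs` when wrapping the endpoint with LangChain.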
You can also take a look at this link.
Hope it helps! Feel free to ask if you need further assistance.
Thank you for the feedback.
I am using S3 to store the user chat history and a Lambda function to retrieve the chats and send them, along with the user's new prompt (via API Gateway), to the SageMaker endpoint (I don't think it is a good solution, but please advise accordingly).
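For reference, here is a rough sketch of what that Lambda side might look like. Everything here is hypothetical: `fetch_history` stands in for the S3 read (normally a `boto3` `get_object` call), and it is injected as a parameter only to keep the sketch self-contained; the real handler would also invoke the SageMaker endpoint instead of returning the assembled request.

```python
import json

def assemble_request(history: list[dict], new_prompt: str, max_turns: int = 6) -> dict:
    """Combine stored chat history with the user's new prompt.

    Only the most recent `max_turns` messages are kept so the context
    stays within the model's window. The role/content message format
    mirrors the usual chat schema; adjust to your endpoint's contract.
    """
    recent = history[-max_turns:]
    messages = recent + [{"role": "user", "content": new_prompt}]
    return {"inputs": messages, "parameters": {"max_new_tokens": 1024}}

def lambda_handler(event, context, fetch_history=lambda user_id: []):
    """Hypothetical API Gateway -> Lambda entry point.

    `fetch_history` would normally read the user's chats from S3;
    it is a plain callable here so the sketch runs without AWS access.
    """
    body = json.loads(event["body"])
    history = fetch_history(body["user_id"])
    request = assemble_request(history, body["prompt"])
    # A real handler would now call sagemaker-runtime invoke_endpoint
    # with this request and return the model's answer to the client.
    return {"statusCode": 200, "body": json.dumps(request)}
```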
Thanks for the reminder!
You can use other methods to remember user chats (e.g. send only the last k turns of the conversation, or save a running summary of the conversation instead of the whole history).
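The "last k turns" idea can be sketched in a few lines; the summarization alternative would replace the trimmed-off prefix with a short summary message instead of dropping it. The helper name and sample history below are made up for illustration.

```python
def window_history(history: list[dict], k: int) -> list[dict]:
    """Keep only the last k (user, assistant) exchanges, i.e. 2*k messages."""
    return history[-2 * k:]

history = [
    {"role": "user", "content": "What is 2+2?"},
    {"role": "assistant", "content": "4"},
    {"role": "user", "content": "And 3+3?"},
    {"role": "assistant", "content": "6"},
    {"role": "user", "content": "Now make a quiz."},
    {"role": "assistant", "content": "Sure, here is a quiz..."},
]

# Keep only the last 2 exchanges to bound the prompt size.
trimmed = window_history(history, k=2)
```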
By the way, you can watch the LangChain short courses on the DLAI platform to gain an overall understanding of how to solve problems with LLMs.
Thank you
I will check out the course.
You’re welcome! Feel free to ask if you need help.