Hi there,
I am at the training step, specifically trying the base model, and I have attached the response here.
I am not sure why I am getting a blank answer; the model is not outputting anything, not even an illogical answer.
The Lesson was: Training; “Try the base model”
Well, it seems the output is off, which is expected since I am trying the base model.
However, I then fine-tuned for 100_epochs/10,010_max_steps (took ~6 hrs) and it seems to be overfitting. But do you know why it shows 4 outputs as ###Answer and not just one plain statement?
The attention mask and the pad token id were not set. As a consequence, you may observe unexpected behavior. Please pass your input's `attention_mask` to obtain reliable results.
Setting `pad_token_id` to `eos_token_id`:0 for open-end generation.
### Question:
Who is the maintainer of the LLM model?
###Answer:
The maintainer of the LLM model is Aeala.
###Answer:The maintainer of the LLM model is Aeala.
###Answer:The maintainer of the LLM model is Aeala.
###Answer:The maintainer of the LLM model is Aeala.
###Answer:The maintainer of the LLM model
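From what I understand, the warning means that because `pad_token_id` was set to `eos_token_id` (0), the model cannot tell padding apart from real tokens unless an explicit `attention_mask` is passed. A toy sketch of what that mask looks like for a right-padded batch (the token ids here are made up, not from my run):

```python
# Toy illustration of padding a batch and building the attention mask
# (token ids and pad id are made up for the example).
pad_id = 0
batch = [[5, 9, 12], [7, 3]]
max_len = max(len(seq) for seq in batch)

# Right-pad every sequence to the same length with pad_id.
input_ids = [seq + [pad_id] * (max_len - len(seq)) for seq in batch]
# 1 marks a real token, 0 marks padding the model should ignore.
attention_mask = [[1] * len(seq) + [0] * (max_len - len(seq)) for seq in batch]

print(input_ids)       # [[5, 9, 12], [7, 3, 0]]
print(attention_mask)  # [[1, 1, 1], [1, 1, 0]]
```

Tokenizers normally return this mask for you; the point is just that `generate` needs it to know which positions to ignore.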
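As a workaround for the repeated blocks, I was thinking of truncating the generated text at the point where the ###Answer marker repeats, something like this sketch (the marker string is just taken from the prompt format above):

```python
def trim_at_stop(text: str, stop: str = "###Answer:") -> str:
    # Keep the first answer; any later occurrences of the stop
    # marker are run-on generations and get cut off.
    first = text.find(stop)
    if first == -1:
        return text.strip()
    # Look for the next marker after the first answer begins.
    nxt = text.find(stop, first + len(stop))
    if nxt == -1:
        return text.strip()
    return text[:nxt].strip()

raw = (
    "### Question:\nWho is the maintainer of the LLM model?\n"
    "###Answer:\nThe maintainer of the LLM model is Aeala.\n"
    "###Answer:The maintainer of the LLM model is Aeala.\n"
)
print(trim_at_stop(raw))  # ends after the first answer
```

Of course this only hides the symptom; the repetition itself presumably comes from generation continuing until the token limit instead of stopping at an EOS token.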