Fine-tuning an LLM on non-Q&A and unlabeled dataset

mohsenimani · September 30, 2023, 7:05pm

In the course, it was mentioned that we can fine-tune an LLM on a non-Q&A dataset. To do this, we need to convert the data to a Q&A dataset using different techniques, such as using another model.

How can we fine-tune an LLM on a dataset of documents (which are not in Q&A format and are unlabeled)?

Topic		Replies	Views
Data Preparation for a Text Transfer Task with supervised non-parallel data Finetuning Large Language Models	0	72	September 15, 2023
How to fine-tuning from a stack of PDFs which are not in Q&A format? Finetuning Large Language Models	6	3527	May 10, 2024
How to create dataset on a specific topic to fine tune llm? Finetuning Large Language Models	0	190	November 27, 2023
What if there's no data in question-answer format? Finetuning Large Language Models	4	115	October 19, 2023
What dataset to use for the fine tuning of a pretrained llm? NLP with Attention Models how-to-forum	11	264	June 21, 2024

Fine-tuning an LLM on non-Q&A and unlabeled dataset

Related topics