Adding prompt instruction concatenation as a part of the Kubeflow pipeline

carlesonielfa · January 18, 2024, 10:22am

In the data preparation step, we added an instruction to every question to form the prompt for the model. My question is: could this step be incorporated into the Kubeflow pipeline used in the orchestration/automation step?

One advantage would be that the performance of different instructions could be evaluated. Another is that the training data file would be smaller, since the instruction would no longer be repeated in every example.
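To make the idea concrete, here is a minimal sketch of the concatenation as a standalone function that a pipeline step could call. It is only an illustration under my own assumptions: the function name `add_instruction`, the `question` and `input_text` field names, and the blank-line separator are hypothetical, not the course's actual code.

```python
import json


def add_instruction(jsonl_lines, instruction,
                    question_key="question", prompt_key="input_text"):
    """Yield JSONL records with the instruction prepended to each question.

    The field names are hypothetical; adjust them to the real dataset schema.
    """
    for line in jsonl_lines:
        record = json.loads(line)
        # Remove the raw question and store the full prompt in its place.
        question = record.pop(question_key)
        record[prompt_key] = f"{instruction}\n\n{question}"
        yield json.dumps(record)


# The stored data holds only the raw questions, so several
# instructions can be tried without regenerating the file.
raw = ['{"question": "How do I sort a list?", "output_text": "Use sorted()."}']
for instruction in ("Answer the question:", "Answer concisely:"):
    for prompt_line in add_instruction(raw, instruction):
        print(prompt_line)
```

Since the instruction is just a parameter, the same function could presumably be wrapped as a Kubeflow component and the instruction exposed as a pipeline parameter, which is what would make comparing instructions across runs possible.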

Can anyone think of any disadvantages? Let’s discuss!

Mubsi · January 19, 2024, 3:19am

Hi @carlesonielfa,

That’s actually the best practice. But as you can understand, for the purposes of teaching, it was done differently in the course.

Thank you for sharing that!
Mubsi
