Help - Transfer Learning or Continual Learning using transaction data

We are trying to build a domain-specific LLM, along the lines of how an LLM was specialized for protein sequences in the ProLLaMA use case (https://arxiv.org/pdf/2112.08654).

We want to do continual pretraining of a base pretrained LLM on sequences that would occur in customer transactions, and then at a later stage do fine-tuning using instruction prompts.

My questions are about the first step, where we need to produce a specialized pretrained model via continual learning.
How should we approach data preparation for this? How can I set up the training data? The instruction prompts we would use for fine-tuning seem to be more language-based: we give an instruction plus an input, and the model produces the output that completes the instruction.
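
To make the contrast concrete, here are hypothetical examples of the two formats I mean (the field names are just my guess at a typical instruction schema, not our actual data):

```python
# Hypothetical examples of the two data formats in question.

# Stage 1: continual-pretraining corpus -- plain text, one document per
# customer, trained with the ordinary next-token (causal LM) objective.
pretraining_doc = "customer1: merchant_a merchant_b merchant_c"

# Stage 2: instruction fine-tuning -- an instruction/input/output triple,
# as in typical instruction-tuning datasets.
instruction_example = {
    "instruction": "Given a customer's transaction history, predict the next merchant.",
    "input": "merchant_a merchant_b merchant_c",
    "output": "merchant_d",
}
```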

I’m unsure whether to give sequences as one full history per customer, like
customer1 - merchant a, merchant b, merchant c …
customer2 - merchant d, merchant c, merchant x

Or whether it should be more in line with how LLMs are trained on next-token prediction, with every prefix expanded, like
merchant a,
merchant a, merchant b
merchant a, merchant b, merchant c
merchant d,
merchant d, merchant c

Or is this not a correct approach at all?
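
To make the first option concrete, here is roughly what I had in mind: a minimal sketch assuming Hugging Face transformers/datasets, where the base model ("gpt2") and merchant tokens are placeholders. If I understand causal-LM training correctly, the shifted next-token loss already covers every prefix of a sequence in a single pass, which is why I'm unsure whether the manual prefix expansion in the second option is even needed:

```python
# Minimal sketch of option 1, assuming Hugging Face transformers/datasets.
# "gpt2" and the merchant tokens are placeholders, not our real data.
from datasets import Dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer,
                          TrainingArguments)

tokenizer = AutoTokenizer.from_pretrained("gpt2")
tokenizer.pad_token = tokenizer.eos_token  # gpt2 has no pad token by default
model = AutoModelForCausalLM.from_pretrained("gpt2")

# One document per customer: the full transaction history as a text sequence.
corpus = [
    "merchant_a merchant_b merchant_c",
    "merchant_d merchant_c merchant_x",
]
dataset = Dataset.from_dict({"text": corpus}).map(
    lambda batch: tokenizer(batch["text"], truncation=True, max_length=128),
    batched=True,
    remove_columns=["text"],
)

# mlm=False gives the standard causal-LM (next-token) loss, so every prefix
# of each sequence contributes to training without manual expansion.
collator = DataCollatorForLanguageModeling(tokenizer, mlm=False)
trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="ckpt", per_device_train_batch_size=2),
    train_dataset=dataset,
    data_collator=collator,
)
trainer.train()
```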

Please let me know if I need to give any other details on this.

As far as I have understood, when working with LLMs the basic principle is input-output-input-output… and so on. There needs to be a two-way conversation with the model, preferably with inputs that are as clear and concise as possible at each stage (however small), and then you can create an automated chain of interactions and finally receive the output!
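
For example, a rough sketch of such a chain, assuming the chat-style text-generation pipeline in transformers (the model name and prompts are placeholders):

```python
# Hypothetical input-output chain: each model output is fed back as context
# for the next, more specific input. Model name and prompts are placeholders.
from transformers import pipeline

chat = pipeline("text-generation", model="Qwen/Qwen2.5-0.5B-Instruct")

messages = [{"role": "user",
             "content": "Summarize this transaction history: merchant_a, merchant_b"}]
# The pipeline returns the conversation with the model's reply appended.
messages = chat(messages, max_new_tokens=64)[0]["generated_text"]

# Chain the next instruction on top of the previous output.
messages.append({"role": "user",
                 "content": "Based on that, which merchant is likely next?"})
messages = chat(messages, max_new_tokens=32)[0]["generated_text"]
print(messages[-1]["content"])  # final output of the chain
```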