I am interested in whether we can use PEFT/LoRA methods to adapt a given model to a new domain. The main motivation is to avoid the full pre-training cost of domain adaptation (as with BloombergGPT).
Can somebody share thoughts on this? I would be grateful.
Challenges I see:
a) We will need to reuse the embedding mapping and tokenization of the original model, but there may be domain-specific words that are not represented in it.
Is there a way to "freeze" the embeddings and tokens of the words the original model was trained on, and add new domain-specific ones? As I imagine it, the new words would be mapped into the existing embedding space, somewhat like in soft prompting.
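Something along these lines is what I have in mind, as a rough, untested sketch (the base model name and the domain tokens are just placeholders):

```python
# Rough sketch: extend the tokenizer with domain-specific tokens, resize the
# embedding matrix, initialize the new rows from the sub-word embeddings the
# original tokenizer would have produced, and freeze the original rows so
# only the new-token embeddings receive gradient updates.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder base model
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

new_tokens = ["myocarditis", "electrocardiogram"]  # hypothetical domain terms
old_vocab_size = len(tokenizer)

# Record how the *original* tokenizer splits each new word, before adding them.
sub_ids = {t: tokenizer.encode(t, add_special_tokens=False) for t in new_tokens}

tokenizer.add_tokens(new_tokens)
model.resize_token_embeddings(len(tokenizer))
embedding = model.get_input_embeddings()

# Map each new token into the existing embedding space (mean of its old
# sub-word embeddings) -- the "soft-prompt-like" initialization mentioned above.
with torch.no_grad():
    for tok in new_tokens:
        new_id = tokenizer.convert_tokens_to_ids(tok)
        embedding.weight[new_id] = embedding.weight[sub_ids[tok]].mean(dim=0)

# "Freeze" the original vocabulary: zero out its gradients so only the
# newly added rows are updated during training.
def _mask_old_rows(grad):
    grad = grad.clone()
    grad[:old_vocab_size] = 0
    return grad

embedding.weight.register_hook(_mask_old_rows)
```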
Excellent question, and the first thing one stumbles on when working in a specific industry such as biomedical or manufacturing. I have a similar query above. The Hugging Face library has made fine-tuning the tokenizer and the model quite simple (one might fine-tune only the model, as that works best in many cases). But I am not sure the latest OpenAI model is available in the Hugging Face repo; I only see openai-gpt, and I don't think that is the latest, so in such a scenario how do we fine-tune the model? Secondly, as rightly asked, how do we update the tokenizer with PEFT/LoRA? It would be good if someone could explain. We did this with SLMs (small language models) like BART a few years back, but have not attempted it with LLMs. Thanks.
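For the PEFT/LoRA side, I imagine something like the sketch below: LoRA adapters on the attention projections, with `modules_to_save` keeping the (resized) embedding matrix and LM head fully trainable so the new token rows can learn. The module names (`c_attn`, `wte`, `lm_head`) are GPT-2-specific placeholders and would need to change for other architectures, and in practice the vocabulary extension above would be applied to `model` first:

```python
# Hedged sketch of combining an extended vocabulary with LoRA via PEFT.
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

model = AutoModelForCausalLM.from_pretrained("gpt2")  # placeholder base model

lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["c_attn"],           # attention projection in GPT-2 blocks
    modules_to_save=["wte", "lm_head"],  # train extended embeddings + head
    task_type="CAUSAL_LM",
)

peft_model = get_peft_model(model, lora_config)
peft_model.print_trainable_parameters()
```

Note that `modules_to_save` makes the whole embedding trainable, old rows included; if only the newly added rows should move, the gradient-masking trick from the earlier sketch would still be needed on top.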