Fine-tuning using PEFT/LORA for GPT-3 - how?

Anirban_K · July 10, 2023, 10:32am

Here we are shown how to do PEFT and LORA with google/Flan-T5 model using the HuggingFace Transformers lib, which makes it easy to fine-tune a model using TrainingArguments and Trainer.train paradigm. But how do we fine-tune GPT3 with PEFT / LORA since I dnt see this being supported by HuggingFace lib. I know we can instruction fine-tune GPT3, but that is not the same as PEFT, which is updating model parameters. I see this model in HuggingFace models repo -openai-gpt, but this not quite the same as gpt3, I believe. Please explain.

Juan_Olano · July 10, 2023, 3:17pm

Hi!

OpenAI’s GPT3 is available for fine-tuning only via their API fine-tuning. OpenAI’s models are not available to be downloaded at this point. You may want to see if your task can be properly implemented by using the OpenAI’s fine-tuning exposed API.

Aiko · July 12, 2023, 10:37am

If you would like to do something like this on a model that rivals GPT-3 however, you do have that option. There’s quite a few LLAMA type models releasing these days that you could try to train instead, like Alpaca and OpenLLAMA perhaps?

Juan_Olano · July 13, 2023, 12:48am

Other good options are: Bloom, Falcon, Mosaic. You may want to check them out. They are very highly rated in leaderboards.

Topic		Replies	Views
PEFT/LoRA used for domain addaptation Generative AI with Large Language Models week-2	1	489	July 10, 2023
Help in model training strategies (PEFT/LORA + RAG) AI Discussions ai-discussions , project	0	28	November 2, 2024
Week 2 Lab: Training configuration of the PEFT model Generative AI with Large Language Models ai-discussions	3	48	November 21, 2024
Can I replace the GPT with a none OpenAI something open source Building and Evaluating Advanced RAG Applications	4	326	January 8, 2024
PEFT fine-tuning on Flan-t5-base model does not change inference results Generative AI with Large Language Models week-2	2	338	March 14, 2024

Fine-tuning using PEFT/LORA for GPT-3 - how?

Related topics