Hello, I am reading the original Prompt Tuning paper, which uses T5 (an encoder-decoder model) for its experiments. I was wondering whether the method works well across all kinds of Transformer models. Specifically, does it also work for autoregressive, decoder-only models such as GPT-4 and LLaMA?
Additionally, I am curious how much data is sufficient for prompt tuning.
Prompt tuning works with any Transformer architecture, including decoder-only autoregressive models: the learned soft-prompt embeddings are simply prepended to the sequence of input token embeddings, so nothing about the method is specific to encoder-decoder models. One caveat: you need access to the model's embedding layer and gradients, so you can apply it to open-weight models like LLaMA but not to closed, API-only models such as GPT-4. How well it works still depends on the model's scale and pretraining data; the original paper found that prompt tuning only becomes competitive with full fine-tuning at larger model sizes.
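To make the mechanism concrete, here is a minimal, self-contained PyTorch sketch (a toy stand-in for a real decoder-only model, not any library's actual implementation): the pretrained backbone is frozen, and the only trainable parameters are the soft-prompt embeddings prepended to the input.

```python
import torch
import torch.nn as nn

class SoftPromptModel(nn.Module):
    """Toy Transformer LM with a trainable soft prompt; base weights frozen.
    Hypothetical stand-in for a real pretrained model like GPT-2 or LLaMA."""

    def __init__(self, vocab=100, d=32, n_prompt=8):
        super().__init__()
        self.embed = nn.Embedding(vocab, d)
        layer = nn.TransformerEncoderLayer(d_model=d, nhead=4, batch_first=True)
        self.backbone = nn.TransformerEncoder(layer, num_layers=2)
        self.head = nn.Linear(d, vocab)
        # Freeze the "pretrained" weights: only parameters registered after
        # this loop (the soft prompt) will receive gradients.
        for p in self.parameters():
            p.requires_grad = False
        # n_prompt learned "virtual tokens" in embedding space.
        self.soft_prompt = nn.Parameter(torch.randn(n_prompt, d) * 0.02)

    def forward(self, input_ids):
        b = input_ids.size(0)
        tok = self.embed(input_ids)                             # (b, t, d)
        prompt = self.soft_prompt.unsqueeze(0).expand(b, -1, -1)
        x = torch.cat([prompt, tok], dim=1)                     # prepend prompt
        h = self.backbone(x)
        # Return logits only for the real token positions.
        return self.head(h[:, self.soft_prompt.size(0):])
```

Training then optimizes `model.soft_prompt` alone (e.g. `torch.optim.Adam([model.soft_prompt])`) with a normal language-modeling loss. In practice you would not write this by hand: the Hugging Face PEFT library provides `PromptTuningConfig` and `get_peft_model`, which do the same prepending for GPT-style and LLaMA-style checkpoints.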
How much data is sufficient? It depends on the task, the model's scale, and how close the task is to what the model already learned during pretraining. If the prompt is steering the model toward capabilities it already acquired, a few hundred to a few thousand labeled examples are often enough, since only the prompt embeddings are trained (typically thousands of parameters rather than billions). For tasks far outside the pretraining distribution, prompt tuning needs substantially more data and may still underperform other fine-tuning methods.