Hi,
I’m wondering why, in the lab, we save and reload the tokenizer after training. Does PEFT change the tokenizer’s weights? I thought it only affected the query/value projections.
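For context, the pattern I mean looks roughly like this (a minimal sketch assuming the Hugging Face transformers API; the model name and output directory are placeholders, not the exact lab code):

```python
from transformers import AutoTokenizer

# Load the tokenizer that matches the base model (name is a placeholder).
tokenizer = AutoTokenizer.from_pretrained("google/flan-t5-base")

# ... PEFT training happens here ...

# Save the tokenizer next to the fine-tuned weights, then reload it.
tokenizer.save_pretrained("./peft-checkpoint")
tokenizer = AutoTokenizer.from_pretrained("./peft-checkpoint")
```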
Hello @Mordokkai
Parameter-Efficient Fine-Tuning (PEFT) methods enable efficient adaptation of large pretrained models to downstream applications by fine-tuning only a small number of (extra) model parameters instead of all of the model’s parameters.
We reload the weights of the saved model, freeze the parts we do not want to update, and fine-tune only the remaining parameters to get a better LLM. This significantly decreases computational and storage costs.
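For example, with the Hugging Face peft library the setup looks roughly like this (a sketch, not the exact lab code; the model name, hyperparameters, and target module names are illustrative and depend on the architecture):

```python
from transformers import AutoModelForSeq2SeqLM
from peft import LoraConfig, TaskType, get_peft_model

# Load the full pretrained model (name is illustrative).
model = AutoModelForSeq2SeqLM.from_pretrained("google/flan-t5-base")

# LoRA freezes the base model and injects small trainable adapter
# matrices into the attention query/value projections only.
lora_config = LoraConfig(
    r=8,
    lora_alpha=32,
    target_modules=["q", "v"],  # T5 calls its query/value projections "q" and "v"
    lora_dropout=0.05,
    task_type=TaskType.SEQ_2_SEQ_LM,
)
peft_model = get_peft_model(model, lora_config)

# Only a tiny fraction of the parameters is trainable.
peft_model.print_trainable_parameters()
```

With a setup like this, print_trainable_parameters() typically reports well under 1% of the weights as trainable.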
Regards
DP
Hi @Deepti_Prasad,
Thanks for your answer, but my question was specifically about the tokenizer. In the lab we use LoRA to fine-tune, but I thought LoRA only touched the attention part of the LLM (the query/value projections); I didn’t know it would also involve the tokenizer.
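A quick way to check which parameters LoRA actually trains (a sketch assuming a flan-t5-style setup; the model name and hyperparameters are illustrative):

```python
from transformers import AutoModelForSeq2SeqLM
from peft import LoraConfig, TaskType, get_peft_model

model = AutoModelForSeq2SeqLM.from_pretrained("google/flan-t5-base")
peft_model = get_peft_model(
    model,
    LoraConfig(r=8, lora_alpha=32, target_modules=["q", "v"],
               task_type=TaskType.SEQ_2_SEQ_LM),
)

# Every parameter that receives gradients is a LoRA adapter matrix on an
# attention projection; the tokenizer is a separate object with no
# trainable weights at all.
for name, param in peft_model.named_parameters():
    if param.requires_grad:
        print(name)
```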
It is all related: what if the PEFT setup does not use the tokenizer’s full vocabulary, and fine-tunes the LLM with only selected tokens?