Fine-Tune FLAN-T5 with Reinforcement Learning (PPO) and PEFT to Generate Less-Toxic Summaries week #3

Ydvshri1412 · January 23, 2025, 10:35pm

When I am fine tuning the peft model to generate less toxic summarie I am getting error in the code. I tried to change the code little bit but still getting error as ‘AutoModelForSeq2SeqLMWithValueHead’ object has no attribute ‘base_model_prefix’. BelowI am adding the SS of the error

gent.spah · January 24, 2025, 7:22am

Hello, have you added comments and additional coding lines to the Lab? Because I don’t see these over there! From what I see, the parameters inside PPOconfig and PPOTrainer are not set up properly, plus there is no value_model argument in the PPOTrainer:

Topic		Replies	Views
PEFT model inference Generative AI with Large Language Models week-module-2	6	930	March 3, 2025
PEFT fine-tuning on Flan-t5-base model does not change inference results Generative AI with Large Language Models week-module-2	2	371	March 14, 2024
Learn Found. GenAI - w3 - Error on "import necessary component" Generative AI with Large Language Models project	6	27	April 18, 2025
Reinforcement learning made my lab model MORE toxic Generative AI with Large Language Models lab-help	2	54	February 17, 2025
2.3 Evaluate Toxicity - Fine-Tune FLAN-T5 to Generate More-Positive Summaries Generative AI with Large Language Models week-module-3	1	479	July 1, 2023

Fine-Tune FLAN-T5 with Reinforcement Learning (PPO) and PEFT to Generate Less-Toxic Summaries week #3

Related topics