How to train the lab to match the loaded model

baj · January 15, 2024, 5:37am

I’m on the Week 2 lab, Fine-Tune a Generative AI Model for Dialogue Summarization.

The lab actually only trains for a single weight update, with max_steps=1. I’m trying to get my training result to match that of the loaded model. I’ve raised max_steps to 10, but the ROUGE metrics still don’t come close to those of the loaded model.

Has anyone had any luck training the model to reach the performance of the loaded model?

If so, would you mind sharing the parameters you used?

gent.spah · January 15, 2024, 6:49am

The loaded model may have been trained on a much larger dataset!
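For a rough sense of scale, here is a back-of-the-envelope sketch of how little data a run with a small max_steps actually sees. It assumes max_steps counts optimizer updates, as in the Hugging Face Trainer, so each step consumes batch size × gradient accumulation steps × number of devices examples. The batch size and dataset size below are made-up placeholders, not the lab's actual values, so plug in your own.

```python
# Rough estimate of how much data a training run sees for a given max_steps.
# NOTE: the batch size and dataset size used below are hypothetical
# placeholders, not the lab's real configuration.

def examples_seen(max_steps: int, batch_size: int,
                  grad_accum_steps: int = 1, num_devices: int = 1) -> int:
    """Examples consumed after max_steps optimizer updates."""
    return max_steps * batch_size * grad_accum_steps * num_devices


def epochs_covered(max_steps: int, batch_size: int, dataset_size: int,
                   grad_accum_steps: int = 1, num_devices: int = 1) -> float:
    """Fraction of one pass over the dataset covered by the run."""
    seen = examples_seen(max_steps, batch_size, grad_accum_steps, num_devices)
    return seen / dataset_size


hypothetical_batch_size = 8         # assumption, check your TrainingArguments
hypothetical_dataset_size = 10_000  # assumption, check len(train_dataset)

for steps in (1, 10, 1_000):
    seen = examples_seen(steps, hypothetical_batch_size)
    frac = epochs_covered(steps, hypothetical_batch_size,
                          hypothetical_dataset_size)
    print(f"max_steps={steps}: {seen} examples, {frac:.4f} epochs")
```

With placeholder numbers like these, 10 steps cover well under 1% of a single epoch, so a large gap to a fully trained checkpoint would not be surprising.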

baj · January 15, 2024, 10:58pm

Thank you!
