DeepLearning.AI
Why use KL divergence in PPO?
gent.spah
July 16, 2024, 8:34am
Hello, please also check this post, if you may: