Short Course Q&A Reinforcement Learning from Human Feedback
Topic | Replies | Views | Activity | |
---|---|---|---|---|
How to get the Base LLM to generate multiple completions
|
1 | 140 | August 27, 2024 | |
Slides of "Reinforcement Learning From Human Feedback"
|
2 | 23 | July 16, 2024 | |
Why do we need to have the dataset from same distribution?
|
1 | 65 | June 2, 2024 | |
RLHF: Video Not Loading
|
5 | 134 | May 3, 2024 | |
Colab setup
|
0 | 147 | January 30, 2024 | |
Alternative to using VertexAI?
|
1 | 171 | January 25, 2024 | |
Where are the actual datasets used in the course?
|
3 | 164 | January 11, 2024 | |
Path to service account key file
|
3 | 150 | January 6, 2024 | |
Regarding Video Resolution
|
1 | 180 | December 21, 2023 | |
What is the criteria for completing the course?
|
7 | 241 | December 21, 2023 |