Short Course Q&A Reinforcement Learning from Human Feedback
Topic | Replies | Views | Activity
---|---|---|---
L3_tune_llm | 0 | 8 | September 25, 2024
How to get the Base LLM to generate multiple completions | 1 | 166 | August 27, 2024
Slides of "Reinforcement Learning From Human Feedback" | 2 | 30 | July 16, 2024
Why do we need to have the dataset from same distribution? | 1 | 66 | June 2, 2024
RLHF: Video Not Loading | 5 | 150 | May 3, 2024
Colab setup | 0 | 149 | January 30, 2024
Alternative to using VertexAI? | 1 | 174 | January 25, 2024
Where are the actual datasets used in the course? | 3 | 172 | January 11, 2024
Path to service account key file | 3 | 153 | January 6, 2024
Regarding Video Resolution | 1 | 183 | December 21, 2023
What is the criteria for completing the course? | 7 | 244 | December 21, 2023