Short Course Q&A Reinforcement Learning from Human Feedback
| Topic | Replies | Views | Activity |
|---|---|---|---|
| Regarding Video Resolution | 1 | 214 | December 21, 2023 |
| L3_tune_llm | 0 | 27 | September 25, 2024 |
| How to get the Base LLM to generate multiple completions | 1 | 282 | August 27, 2024 |
| Slides of "Reinforcement Learning From Human Feedback" | 2 | 53 | July 16, 2024 |
| Why do we need to have the dataset from same distribution? | 1 | 72 | June 2, 2024 |
| RLHF: Video Not Loading | 5 | 364 | May 3, 2024 |
| Colab setup | 0 | 159 | January 30, 2024 |
| Alternative to using VertexAI? | 1 | 229 | January 25, 2024 |
| Where are the actual datasets used in the course? | 3 | 211 | January 11, 2024 |
| Path to service account key file | 3 | 168 | January 6, 2024 |
| What is the criteria for completing the course? | 7 | 260 | December 21, 2023 |