Short Course Q&A Reinforcement Learning from Human Feedback
Topic | Replies | Views | Activity | |
---|---|---|---|---|
Regarding Video Resolution
|
![]() ![]() |
1 | 203 | December 21, 2023 |
L3_tune_llm
|
![]() |
0 | 11 | September 25, 2024 |
How to get the Base LLM to generate multiple completions
|
![]() ![]() |
1 | 228 | August 27, 2024 |
Slides of "Reinforcement Learning From Human Feedback"
|
![]() ![]() |
2 | 38 | July 16, 2024 |
Why do we need to have the dataset from same distribution?
|
![]() ![]() |
1 | 68 | June 2, 2024 |
RLHF: Video Not Loading
|
![]() ![]() ![]() ![]() ![]() |
5 | 231 | May 3, 2024 |
Colab setup
|
![]() |
0 | 152 | January 30, 2024 |
Alternative to using VertexAI?
|
![]() ![]() |
1 | 181 | January 25, 2024 |
Where are the actual datasets used in the course?
|
![]() ![]() |
3 | 189 | January 11, 2024 |
Path to service account key file
|
![]() ![]() |
3 | 157 | January 6, 2024 |
What is the criteria for completing the course?
|
![]() ![]() ![]() ![]() |
7 | 246 | December 21, 2023 |