Short Course Q&A Reinforcement Learning from Human Feedback
Topic | Replies | Views | Activity | |
---|---|---|---|---|
L3_tune_llm
|
![]() |
0 | 8 | September 25, 2024 |
How to get the Base LLM to generate multiple completions
|
![]() ![]() |
1 | 192 | August 27, 2024 |
Slides of "Reinforcement Learning From Human Feedback"
|
![]() ![]() |
2 | 33 | July 16, 2024 |
Why do we need to have the dataset from same distribution?
|
![]() ![]() |
1 | 66 | June 2, 2024 |
RLHF: Video Not Loading
|
![]() ![]() ![]() ![]() ![]() |
5 | 169 | May 3, 2024 |
Colab setup
|
![]() |
0 | 149 | January 30, 2024 |
Alternative to using VertexAI?
|
![]() ![]() |
1 | 175 | January 25, 2024 |
Where are the actual datasets used in the course?
|
![]() ![]() |
3 | 174 | January 11, 2024 |
Path to service account key file
|
![]() ![]() |
3 | 154 | January 6, 2024 |
Regarding Video Resolution
|
![]() ![]() |
1 | 185 | December 21, 2023 |
What is the criteria for completing the course?
|
![]() ![]() ![]() ![]() |
7 | 244 | December 21, 2023 |