Short Course Q&A Pretraining LLMs
Topic | Replies | Views | Activity | |
---|---|---|---|---|
Is it possible to access the entire `models` folder and all Jupyter Notebook files?
|
![]() ![]() ![]() ![]() |
7 | 72 | December 30, 2024 |
Packing the data with max sequence length
|
![]() ![]() |
3 | 23 | December 19, 2024 |
Data packing
|
![]() |
0 | 21 | September 26, 2024 |
Why does the model start repeating the same sentences after some N number of token outputs?
|
![]() ![]() |
3 | 271 | September 25, 2024 |
Number of tokens mentioned in the first video to train a 248M model
|
![]() |
0 | 16 | September 12, 2024 |
L2_language_model.bin not found
|
![]() ![]() ![]() |
3 | 50 | August 29, 2024 |
Korean subtitles for Pretraining LLMs courses
|
![]() |
0 | 27 | July 31, 2024 |
About packaging data: NEXT SENTECE
|
![]() |
0 | 41 | July 19, 2024 |