Short Course Q&A Pretraining LLMs
| Topic | Replies | Views | Activity |
|---|---|---|---|
| DatasetNotFoundError: | 3 | 61 | November 1, 2025 |
| Is it possible to access the entire `models` folder and all Jupyter Notebook files? | 7 | 139 | December 30, 2024 |
| Packing the data with max sequence length | 3 | 57 | December 19, 2024 |
| Data packing | 0 | 26 | September 26, 2024 |
| Why does the model start repeating the same sentences after some N number of token outputs? | 3 | 1004 | September 25, 2024 |
| Number of tokens mentioned in the first video to train a 248M model | 0 | 22 | September 12, 2024 |
| L2_language_model.bin not found | 3 | 62 | August 29, 2024 |
| Korean subtitles for Pretraining LLMs courses | 0 | 40 | July 31, 2024 |
| About packaging data: NEXT SENTECE | 0 | 42 | July 19, 2024 |