yuchenj commited on
Commit
5a3a1fa
1 Parent(s): 888278d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -5,4 +5,4 @@ datasets:
5
  ---
6
  This is a GPT-2 (350M) model trained in llm.c for 100B tokens with WSD (Warmup-Stable-Decay) learning rate schedule on FineWeb-EDU.
7
 
8
- A lot more detailed info and observations are here: https://x.com/Yuchenj_UW/status/1816181774374109250
 
5
  ---
6
  This is a GPT-2 (350M) model trained in llm.c for 100B tokens with WSD (Warmup-Stable-Decay) learning rate schedule on FineWeb-EDU.
7
 
8
+ A lot more detailed info and observations are here: https://x.com/Yuchenj_UW/status/1816508452518482319