AIGym committed on
Commit
5b34f5a
1 Parent(s): 6eafe5b

Update README.md

  1. README.md +5 -1
README.md CHANGED
@@ -5,7 +5,11 @@ license: apache-2.0
 
  # Model Card for Model ID
 
- <!-- Provide a quick summary of what the model is/does. -->
+ This tiny model is an 81-million-parameter GPT-2-based model, trained from scratch on a 3060 Ti GPU. It uses the GPT-2 tokenizer from the GPT-2 repo here on Hugging Face.
+
+ We are training our own tokenizer from scratch and, once that is complete, will release a version 2 of this model trained on even more datasets.
+
+ This model is in float32 but will shortly be converted to float16 and bfloat16.
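
The planned precision change can be sketched with a little arithmetic. The block below is a rough illustration only: it uses the 81 million parameter figure from the README, the function names are hypothetical, and the size estimate covers raw weights alone (a real checkpoint also carries metadata and possibly buffers). The bfloat16 helper truncates the low 16 bits of a float32, which is a simplification; real conversion routines usually round to nearest instead.

```python
import struct

# Parameter count as stated in the README; everything else here is illustrative.
N_PARAMS = 81_000_000

# Storage width of each dtype in bytes.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "bfloat16": 2}


def weights_size_mb(n_params, dtype):
    """Approximate size of the raw weights in megabytes (10**6 bytes)."""
    return n_params * BYTES_PER_PARAM[dtype] / 1e6


def float32_to_bfloat16_bits(x):
    """Return the 16-bit bfloat16 pattern of x by dropping the low 16 bits
    of its float32 encoding (truncation, not round-to-nearest)."""
    (bits,) = struct.unpack("<I", struct.pack("<f", x))
    return bits >> 16


def bfloat16_bits_to_float(b):
    """Expand a 16-bit bfloat16 pattern back to a Python float."""
    (x,) = struct.unpack("<f", struct.pack("<I", b << 16))
    return x


if __name__ == "__main__":
    for dtype in BYTES_PER_PARAM:
        print(dtype, weights_size_mb(N_PARAMS, dtype), "MB")
    # bfloat16 keeps the float32 exponent range but only 7 mantissa bits.
    print(bfloat16_bits_to_float(float32_to_bfloat16_bits(3.14159)))
```

Under these assumptions, moving from float32 to either 16-bit format halves the weight storage. bfloat16 keeps the float32 exponent range at the cost of mantissa precision, while float16 makes the opposite trade.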