Josephgflowers
/

tinyllama-730M-test

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Josephgflowers commited on Feb 15

Commit

29b2c22

•

1 Parent(s): 71195ef

Update README.md

Files changed (1) hide show

README.md +11 -0

README.md CHANGED Viewed

@@ -1,5 +1,16 @@
 ---
 license: mit
 ---
 I cut my TinyLlama 1.1B cinder v 2 down from 22 layers to 14. At 14 there was no coherent text but there were emerging ideas of a response. 1000 steps on step-by-step dataset.
 6000 on Reason-with-cinder. The loss was still over 1 and the learning rate was still over 4. This model needs significat training. I am putting it up as a base model that

 ---
 license: mit
+widget:
+  - text: >
+      <|system|>
+      You are a helpful assistant</s>
+      <|user|>
+      Tell me about yourself, what is your name?</s>
+      <|assistant|>
 ---
 I cut my TinyLlama 1.1B cinder v 2 down from 22 layers to 14. At 14 there was no coherent text but there were emerging ideas of a response. 1000 steps on step-by-step dataset.
 6000 on Reason-with-cinder. The loss was still over 1 and the learning rate was still over 4. This model needs significat training. I am putting it up as a base model that