Text Generation
Transformers
PyTorch
Safetensors
English
stripedhyena
custom_code
Zymrael commited on
Commit
9dd09a5
1 Parent(s): d0579e2

chore: add info on dtypes

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -31,3 +31,5 @@ StripedHyena is a hybrid architecture composed of multi-head, grouped-query atte
31
  ### Disclaimer
32
 
33
  To use StripedHyena outside of the playground, you will need to install custom kernels. Please follow the instructions from the [standalone repository](https://github.com/togethercomputer/stripedhyena).
 
 
 
31
  ### Disclaimer
32
 
33
  To use StripedHyena outside of the playground, you will need to install custom kernels. Please follow the instructions from the [standalone repository](https://github.com/togethercomputer/stripedhyena).
34
+
35
+ StripedHyena is a mixed precision model. Make sure to keep your `poles` and `residues` in `float32` precision, especially for longer prompts or training.