Sharded version of This model. Use the tokenizer from there
from transformers import LlamaTokenizer, AutoModelForCausalLM
tokenizer = LlamaTokenizer.from_pretrained("NousResearch/Nous-Hermes-13b")
model = AutoModelForCausalLM.from_pretrained("simsim314/Hermes-13b-hf-shards")
- Downloads last month
- 635
Inference API (serverless) is not available, repository is disabled.