Safetensors
llama

the config class and config.json uses DeepseekConfig, not v2

#5
No description provided.

It has been fixed on main branch, by changing to "LlamaForCausalLM"

Cannot merge
This branch has merge conflicts in the following files:
  • modeling_deepseek.py

Sign up or log in to comment