Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
deepseek-ai
/
deepseek-moe-16b-base
like
83
Follow
DeepSeek
793
Text Generation
Transformers
Safetensors
deepseek
custom_code
arxiv:
2401.06066
License:
deepseek
Model card
Files
Files and versions
Community
7
Train
Use this model
7c0fdaa
deepseek-moe-16b-base
3 contributors
History:
4 commits
zwd973-deepseek
initial commit
7c0fdaa
10 months ago
.gitattributes
1.52 kB
initial commit
10 months ago
README.md
1.93 kB
update readme
10 months ago
config.json
1.07 kB
initial commit
10 months ago
configuration_deepseek.py
10.2 kB
initial commit
10 months ago
generation_config.json
121 Bytes
initial commit
10 months ago
model-00001-of-00007.safetensors
5 GB
LFS
initial commit
10 months ago
model-00002-of-00007.safetensors
5 GB
LFS
initial commit
10 months ago
model-00003-of-00007.safetensors
5 GB
LFS
initial commit
10 months ago
model-00004-of-00007.safetensors
5 GB
LFS
initial commit
10 months ago
model-00005-of-00007.safetensors
5 GB
LFS
initial commit
10 months ago
model-00006-of-00007.safetensors
5 GB
LFS
initial commit
10 months ago
model-00007-of-00007.safetensors
2.77 GB
LFS
initial commit
10 months ago
model.safetensors.index.json
490 kB
initial commit
10 months ago
modeling_deepseek.py
72.7 kB
initial commit
10 months ago
tokenizer.json
4.61 MB
initial commit
10 months ago
tokenizer_config.json
793 Bytes
initial commit
10 months ago