adamo1139
/

Experimental-DeepSeek-V2-Coder-Lite-JUMP-alpha1-GGUF

Inference Endpoints

Model card Files Files and versions Community

Experimental-DeepSeek-V2-Coder-Lite-JUMP-alpha1-GGUF / README.md

adamo1139's picture

Update README.md

5266a64 verified 2 months ago

|

history blame contribute delete

532 Bytes

	---
	license: other
	license_name: deepseek-license
	license_link: LICENSE
	---
	DeepSeek-Coder-V2-Lite-Base finetuned for 0.25 epochs on adamo1139/ise-uiuc_Magicoder-Evol-Instruct-110K-ShareGPT via llama-factory at 3000ctx with qlora, rank 32 and alpha 32.

	Prompt format is ChatML but ChatML-specific tokens are not in the tokenizer, so it's sometimes spilling random tokens. Definitely something to fix in the next version.

	It's an early WIP, unless you are dying to try DeepSeek-Coder-V2-Lite finetunes I suggest you don't use it :)