Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference status
Reset Inference status
Warm
Cold
Frozen
Misc
Reset Misc
dpo
Inference Endpoints
AutoTrain Compatible
text-generation-inference
custom_code
4-bit precision
Eval Results
Merge
8-bit precision
Mixture of Experts
Carbon Emissions
Misc with no match
text-embeddings-inference
Apply filters
Models
3,620
Full-text search
Edit filters
Sort: Trending
Active filters:
dpo
Clear all
v000000/Qwen2.5-14B-Gutenberg-Instruct-Slerpeno
Text Generation
•
Updated
Sep 30
•
2.52k
•
4
v000000/Qwen2.5-14B-Gutenberg-1e-Delta
Text Generation
•
Updated
Sep 30
•
2.51k
•
4
QuantFactory/Qwen2.5-Lumen-14B-GGUF
Text Generation
•
Updated
Sep 21
•
367
•
3
mradermacher/Qwen2.5-Lumen-14B-GGUF
Updated
Sep 22
•
749
•
4
tanliboy/lambda-qwen2.5-32b-dpo-test
Text Generation
•
Updated
Sep 22
•
2.49k
•
4
mradermacher/Qwen2.5-Lumen-14B-i1-GGUF
Updated
Sep 22
•
1.72k
•
6
trl-lib/Qwen2-0.5B-DPO
Text Generation
•
Updated
Sep 27
•
184
•
3
mradermacher/SauerkrautLM-7b-LaserChat-GGUF
Updated
Oct 1
•
226
•
1
mradermacher/SauerkrautLM-7b-LaserChat-i1-GGUF
Updated
Oct 1
•
374
•
1
HumanLLMs/Human-Like-Qwen2.5-7B-Instruct
Updated
Oct 7
•
43
•
2
HumanLLMs/Human-Like-Mistral-Nemo-Instruct-2407
Updated
Oct 9
•
53
•
3
bartowski/Humanish-LLama3-8B-Instruct-GGUF
Text Generation
•
Updated
Oct 7
•
9.79k
•
2
bartowski/Humanish-Mistral-Nemo-Instruct-2407-GGUF
Text Generation
•
Updated
Oct 7
•
754
•
1
bartowski/Humanish-Qwen2.5-7B-Instruct-GGUF
Text Generation
•
Updated
Oct 7
•
1.03k
•
1
CultriX/Qwen2.5-14B-Wernicke-DPO
Text Generation
•
Updated
16 days ago
•
120
•
2
mradermacher/Qwen2.5-14B-Wernicke-DPO-GGUF
Updated
16 days ago
•
4.8k
•
1
mradermacher/Qwen2.5-14B-Wernicke-DPO-i1-GGUF
Updated
16 days ago
•
1.23k
•
3
mradermacher/mistral-7b-dpo-constitutional-ai-GGUF
Updated
10 days ago
•
341
•
1
lyogavin/Anima33B-DPO-Belle-1k
Text Generation
•
Updated
Jul 2, 2023
•
1
lyogavin/Anima33B-DPO-Belle-1k-merged
Text Generation
•
Updated
Jul 2, 2023
•
8
•
12
daekeun-ml/Llama-2-ko-DPO-13B
Text Generation
•
Updated
Oct 31, 2023
•
4.26k
•
19
lewtun/zephyr-7b-dpo-full
Text Generation
•
Updated
Jan 5
•
6
alignment-handbook/zephyr-7b-dpo-qlora
Updated
Jan 9
•
57
•
9
argilla/notus-7b-v1-lora
Text Generation
•
Updated
Dec 4, 2023
•
11
•
7
argilla/notus-7b-v1-lora-adapter
Text Generation
•
Updated
Dec 4, 2023
•
3
ContextualAI/archangel_sft_pythia1-4b
Text Generation
•
Updated
Jan 11
•
14
ContextualAI/archangel_sft_pythia2-8b
Text Generation
•
Updated
Jan 11
•
24
•
1
ContextualAI/archangel_sft_pythia6-9b
Text Generation
•
Updated
Jan 11
•
24
ContextualAI/archangel_sft_pythia12-0b
Text Generation
•
Updated
Jan 11
•
18
ContextualAI/archangel_sft_llama7b
Text Generation
•
Updated
Jan 11
•
632
•
1
Previous
1
2
3
4
...
100
Next