Edit Models filters

Inference status

Misc

Inference Endpoints

Misc with no match

AutoTrain Compatible

text-generation-inference

4-bit precision

text-embeddings-inference

8-bit precision

Carbon Emissions

Mixture of Experts

Models

62

Full-text search

Active filters: Quantized

VPTQ-community/Qwen2.5-72B-Instruct-v16-k65536-65536-woft

Updated Oct 13 • 26 • 4

VPTQ-community/Qwen2.5-72B-Instruct-v16-k65536-32768-woft

Updated Oct 13 • 46 • 3

VPTQ-community/Meta-Llama-3.1-70B-Instruct-v8-k65536-0-woft

Updated Oct 13 • 149 • 2

VPTQ-community/Meta-Llama-3.1-405B-Instruct-v16-k65536-1024-woft

Updated Oct 13 • 12 • 1

VPTQ-community/Meta-Llama-3.1-405B-Instruct-v8-k4096-0-woft

Updated Oct 13 • 4 • 1

VPTQ-community/Meta-Llama-3.1-405B-Instruct-v16-k65536-64-woft

Updated Oct 13 • 45 • 3

VPTQ-community/Meta-Llama-3.1-405B-Instruct-v16-k32768-32768-woft

Updated Oct 13 • 11 • 1

VPTQ-community/Meta-Llama-3.1-405B-Instruct-v16-k65536-128-woft

Updated Oct 13 • 3 • 1

VPTQ-community/Qwen2.5-72B-Instruct-v8-k65536-4-woft

Updated Oct 13 • 3 • 2

VPTQ-community/Qwen2.5-72B-Instruct-v8-k65536-0-woft

Updated Oct 13 • 52 • 2

VPTQ-community/Qwen2.5-72B-Instruct-v8-k512-512-woft

Updated Oct 13 • 8 • 1

VPTQ-community/Qwen2.5-72B-Instruct-v8-k1024-512-woft

Updated Oct 13 • 19 • 2

VPTQ-community/Meta-Llama-3.1-405B-Instruct-v16-k65536-256-woft

Updated Oct 13 • 7 • 1

VPTQ-community/Qwen2.5-72B-Instruct-v8-k65536-256-woft

Updated Oct 13 • 31 • 4

VPTQ-community/Meta-Llama-3.1-8B-Instruct-v12-k65536-4096-woft

Updated Oct 13 • 276 • 2

VPTQ-community/Qwen2.5-72B-Instruct-v8-k65536-65536-woft

Updated Oct 13 • 43 • 1

VPTQ-community/Qwen2.5-32B-Instruct-v16-k65536-65536-woft

Updated Oct 13 • 20 • 1

VPTQ-community/Meta-Llama-3.1-405B-Instruct-v16-k65536-65536-woft

Updated Oct 13 • 11 • 3

VPTQ-community/Mistral-Large-Instruct-2407-v16-k65536-16384-woft

Updated 27 days ago • 13 • 2

VPTQ-community/Mistral-Large-Instruct-2407-v16-k65536-65536-woft

Updated 27 days ago • 76 • 1

VPTQ-community/Mistral-Large-Instruct-2407-v8-k65536-65536-woft

Updated 27 days ago • 29 • 1

VPTQ-community/Meta-Llama-3.1-405B-Instruct-v8-k65536-256-woft

Updated 24 days ago • 18 • 1

VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v8-k65536-65536-woft

Updated 24 days ago • 111 • 3

VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v16-k65536-256-woft

Updated 24 days ago • 74 • 1

ABX-AI/WizardLM-2-7B-GGUF-IQ-Imatrix

Updated Apr 15 • 993 • 21

erdiari/turkish-quantized

Updated Jun 5 • 23 • 1

VPTQ-community/Meta-Llama-3.1-70B-Instruct-v16-k65536-32768-woft

Updated Oct 13 • 7

VPTQ-community/Meta-Llama-3.1-8B-Instruct-v8-k65536-65536-woft

Updated Oct 13 • 69

VPTQ-community/Meta-Llama-3.1-8B-Instruct-v8-k65536-4096-woft

Updated Oct 13 • 18

VPTQ-community/Meta-Llama-3.1-8B-Instruct-v8-k65536-256-woft

Updated Oct 13 • 52