Edit Models filters

Inference status

Misc

Inference Endpoints

4-bit precision

AutoTrain Compatible

text-generation-inference

Misc with no match

text-embeddings-inference

8-bit precision

Carbon Emissions

Mixture of Experts

Models

62

Full-text search

Active filters: Quantization

VPTQ-community/Meta-Llama-3.1-8B-Instruct-v8-k65536-256-woft

Updated Oct 13 • 52

VPTQ-community/Meta-Llama-3.1-70B-Instruct-v16-k65536-65536-woft

Updated Oct 13 • 64

VPTQ-community/Meta-Llama-3.1-70B-Instruct-v8-k65536-256-woft

Updated Oct 13 • 41 • 1

VPTQ-community/Qwen2.5-7B-Instruct-v8-k65536-256-woft

Updated Oct 13 • 3

VPTQ-community/Meta-Llama-3.1-70B-Instruct-v8-k32768-0-woft

Updated Oct 13 • 124 • 1

VPTQ-community/Meta-Llama-3.1-70B-Instruct-v8-k65536-65536-woft

Updated Oct 13 • 76 • 2

VPTQ-community/Meta-Llama-3.1-70B-Instruct-v8-k16384-0-woft

Updated Oct 13 • 19 • 2

VPTQ-community/Qwen2.5-72B-Instruct-v8-k65536-4-woft-duplicated

Updated Oct 13 • 9 • 1

VPTQ-community/Qwen2.5-14B-Instruct-v8-k256-256-woft

Updated Oct 13 • 18

VPTQ-community/Qwen2.5-14B-Instruct-v16-k65536-65536-woft

Updated Oct 13 • 26

VPTQ-community/Qwen2.5-14B-Instruct-v8-k65536-256-woft

Updated Oct 13 • 6

VPTQ-community/Qwen2.5-14B-Instruct-v8-k65536-0-woft

Updated Oct 13 • 28

VPTQ-community/Qwen2.5-14B-Instruct-v8-k65536-65536-woft

Updated Oct 13 • 104

VPTQ-community/Qwen2.5-7B-Instruct-v16-k65536-65536-woft

Updated Oct 13 • 40 • 1

VPTQ-community/Qwen2.5-7B-Instruct-v8-k65536-65536-woft

Updated Oct 13 • 8

VPTQ-community/Qwen2.5-7B-Instruct-v8-k256-256-woft

Updated Oct 13 • 24

VPTQ-community/Qwen2.5-7B-Instruct-v8-k65536-0-woft

Updated Oct 13 • 124

VPTQ-community/Qwen2.5-32B-Instruct-v8-k65536-0-woft

Updated Oct 13 • 108

VPTQ-community/Qwen2.5-32B-Instruct-v8-k65536-65536-woft

Updated Oct 13 • 22 • 1

VPTQ-community/Qwen2.5-32B-Instruct-v8-k256-256-woft

Updated Oct 13 • 10

VPTQ-community/Qwen2.5-32B-Instruct-v8-k65536-256-woft

Updated Oct 13 • 29 • 2

VPTQ-community/Mistral-Large-Instruct-2407-v8-k65536-0-woft

Updated 28 days ago • 9

VPTQ-community/Mistral-Large-Instruct-2407-v16-k65536-256-woft

Updated 28 days ago

VPTQ-community/Mistral-Large-Instruct-2407-v8-k65536-256-woft

Updated 28 days ago • 11

VPTQ-community/Mistral-Large-Instruct-2407-v16-k65536-1024-woft

Updated 28 days ago • 7

VPTQ-community/Mistral-Large-Instruct-2407-v16-k65536-4096-woft

Updated 28 days ago • 16

VPTQ-community/Meta-Llama-3.1-405B-Instruct-v8-k65536-65536-woft

Updated 25 days ago • 39

VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v8-k65536-256-woft

Updated 24 days ago • 46

VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v16-k65536-65536-woft

Updated 24 days ago • 34

VPTQ-community/Llama-3.1-Nemotron-70B-Instruct-HF-v16-k65536-1024-woft

Updated 24 days ago • 18