Models: 295

Active filters: fp8

nm-testing/granite-3b-code-base-FP8

Text Generation • Updated Jun 12 • 12

fr00000/dolp-fp8

Text Generation • Updated Jun 13 • 3

neuralmagic/Qwen2-0.5B-Instruct-FP8

Text Generation • Updated Jul 18 • 1.09k • 2

nm-testing/opt-125m-fp8-static-kv

Text Generation • Updated Jun 14 • 6

neuralmagic/Qwen2-1.5B-Instruct-FP8

Text Generation • Updated Jul 18 • 33

neuralmagic/Qwen2-7B-Instruct-FP8

Text Generation • Updated Jul 18 • 1.33k • 1

anyisalin/L3-70B-Euryale-v2.1-FP8

Text Generation • Updated Jun 18 • 393

nm-testing/Qwen2-0.5B-Instruct-FP8-KV

Text Generation • Updated Jun 18 • 4

kuotient/llama3-instrucTrans-enko-8b-FP8

Text Generation • Updated Jun 20 • 7 • 2

nm-testing/SparseLlama-3-8B-pruned_50.2of4-FP8

Text Generation • Updated Jun 25 • 18

FlorianJc/Hermes-2-Pro-Mistral-7B-vllm-fp8

Text Generation • Updated Jul 17 • 8

FlorianJc/openchat-3.6-8b-20240522-vllm-fp8

Text Generation • Updated Jul 17 • 18

FlorianJc/Llama3-ChatQA-1.5-8B-vllm-fp8

Text Generation • Updated Jul 17 • 12

TechxGenus/Codestral-22B-v0.1-FP8

Text Generation • Updated Jun 21 • 460

Model-SafeTensors/Meta-Llama-3-70B-FP8-Dynamic

Text Generation • Updated Jun 23 • 1.6k

Model-SafeTensors/Qwen-Qwen2-72B-FP8-Dynamic

Text Generation • Updated Sep 4 • 2.01k

Rallio67/magnum-72B-FP8

Text Generation • Updated Jun 26 • 11

nm-testing/Qwen2-1.5B-Instruct-FP8-KV

Text Generation • Updated Jun 26 • 6

neuralmagic/Meta-Llama-3-70B-Instruct-FP8-KV

Text Generation • Updated Jun 26 • 803 • 2

neuralmagic/Mistral-7B-Instruct-v0.3-FP8

Text Generation • Updated Jul 18 • 1.88k • 2

neuralmagic/Llama-2-7b-chat-hf-FP8

Text Generation • Updated Jul 18 • 657

neuralmagic/Phi-3-mini-128k-instruct-FP8

Text Generation • Updated Oct 9 • 444

neuralmagic/Phi-3-medium-128k-instruct-FP8

Text Generation • Updated Oct 9 • 875 • 5

Rallio67/llama-3-70B-test-FP8

Text Generation • Updated Jun 27 • 6

Rallio67/llama-3-70B-actions-FP8

Text Generation • Updated Jun 27 • 6

nerdylive/Meta-Llama-3-8B-Instruct-FP8

Text Generation • Updated Jul 1 • 3

nm-testing/Qwen2-57B-A14B-Instruct-FP8-KV

Text Generation • Updated Jul 3 • 6

nm-testing/dbrx-instruct-FP8

Text Generation • Updated Jul 3 • 8

tranhoangnguyen03/Gemma-2-9B-It-SPPO-Iter3_Q8

Text Generation • Updated Jul 7 • 12

Model-SafeTensors/mistralai-Mixtral-8x7B-v0.1

Text Generation • Updated Jul 7 • 14