Update README.md
README.md CHANGED

@@ -70,7 +70,7 @@ We know that some people want to explore smaller models or don't have the resour
 - Quanto - [4bit](https://huggingface.co/speakleash/Bielik-11B-v2.2-Instruct-Quanto-4bit), [8bit](https://huggingface.co/speakleash/Bielik-11B-v2.2-Instruct-Quanto-8bit)
 - [FP8](https://huggingface.co/speakleash/Bielik-11B-v2.2-Instruct-FP8) (vLLM, SGLang - Ada Lovelace, Hopper optimized)
 - [INT8 W8A8](https://huggingface.co/speakleash/Bielik-11B-v2.2-Instruct-W8A8) (vLLM INT8 quantization: Weights=8bits and Activations=8bits)
-- [GGUF - experimental - IQ imatrix IQ2_XXS, IQ3_XXS, IQ4_XS and calibrated Q4_K_M, Q5_K_M, Q6_K, Q8_0](https://huggingface.co/speakleash/Bielik-11B-v2.2-Instruct-GGUF-IQ-Imatrix)
+- [GGUF - experimental - IQ imatrix IQ1_M, IQ2_XXS, IQ3_XXS, IQ4_XS and calibrated Q4_K_M, Q5_K_M, Q6_K, Q8_0](https://huggingface.co/speakleash/Bielik-11B-v2.2-Instruct-GGUF-IQ-Imatrix)
 
 Please note that quantized models may offer lower quality of generated answers compared to full-sized variants.
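As a rough illustration of why these lower-bit variants help users without large GPUs, weight-only memory scales linearly with bits per parameter. A minimal sketch (plain arithmetic, not from the repository; the ~11B parameter count is inferred from the model name, and real checkpoints add overhead for quantization scales, activations, and KV cache):

```python
def approx_weight_gb(n_params: float, bits_per_param: float) -> float:
    """Rough weight-only memory estimate in GB: parameters * bits / 8.

    Ignores quantization scales/zero-points, activations, and KV cache,
    so actual memory use is higher.
    """
    return n_params * bits_per_param / 8 / 1e9

N = 11e9  # ~11 billion parameters, inferred from "Bielik-11B"
print(f"BF16/FP16: {approx_weight_gb(N, 16):.1f} GB")  # ~22 GB
print(f"INT8/FP8 : {approx_weight_gb(N, 8):.1f} GB")   # ~11 GB
print(f"4-bit    : {approx_weight_gb(N, 4):.1f} GB")   # ~5.5 GB
```

This is why the 8-bit and 4-bit checkpoints above can fit on a single consumer GPU while the full-precision weights generally cannot.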