Transformers
GGUF
English
Inference Endpoints
imatrix
mradermacher commited on
Commit
cf3456a
1 Parent(s): c79e6e1

auto-patch README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -47,9 +47,9 @@ more details, including on how to concatenate multi-part files.
47
  | [GGUF](https://huggingface.co/mradermacher/armaGPT-i1-GGUF/resolve/main/armaGPT.i1-Q3_K_M.gguf) | i1-Q3_K_M | 4.5 | IQ3_S probably better |
48
  | [GGUF](https://huggingface.co/mradermacher/armaGPT-i1-GGUF/resolve/main/armaGPT.i1-Q3_K_L.gguf) | i1-Q3_K_L | 4.8 | IQ3_M probably better |
49
  | [GGUF](https://huggingface.co/mradermacher/armaGPT-i1-GGUF/resolve/main/armaGPT.i1-IQ4_XS.gguf) | i1-IQ4_XS | 4.9 | |
50
- | [GGUF](https://huggingface.co/mradermacher/armaGPT-i1-GGUF/resolve/main/armaGPT.i1-Q4_0_4_4.gguf) | i1-Q4_0_4_4 | 5.1 | |
51
- | [GGUF](https://huggingface.co/mradermacher/armaGPT-i1-GGUF/resolve/main/armaGPT.i1-Q4_0_4_8.gguf) | i1-Q4_0_4_8 | 5.1 | |
52
- | [GGUF](https://huggingface.co/mradermacher/armaGPT-i1-GGUF/resolve/main/armaGPT.i1-Q4_0_8_8.gguf) | i1-Q4_0_8_8 | 5.1 | |
53
  | [GGUF](https://huggingface.co/mradermacher/armaGPT-i1-GGUF/resolve/main/armaGPT.i1-Q4_0.gguf) | i1-Q4_0 | 5.1 | fast, low quality |
54
  | [GGUF](https://huggingface.co/mradermacher/armaGPT-i1-GGUF/resolve/main/armaGPT.i1-Q4_K_S.gguf) | i1-Q4_K_S | 5.1 | optimal size/speed/quality |
55
  | [GGUF](https://huggingface.co/mradermacher/armaGPT-i1-GGUF/resolve/main/armaGPT.i1-Q4_K_M.gguf) | i1-Q4_K_M | 5.4 | fast, recommended |
 
47
  | [GGUF](https://huggingface.co/mradermacher/armaGPT-i1-GGUF/resolve/main/armaGPT.i1-Q3_K_M.gguf) | i1-Q3_K_M | 4.5 | IQ3_S probably better |
48
  | [GGUF](https://huggingface.co/mradermacher/armaGPT-i1-GGUF/resolve/main/armaGPT.i1-Q3_K_L.gguf) | i1-Q3_K_L | 4.8 | IQ3_M probably better |
49
  | [GGUF](https://huggingface.co/mradermacher/armaGPT-i1-GGUF/resolve/main/armaGPT.i1-IQ4_XS.gguf) | i1-IQ4_XS | 4.9 | |
50
+ | [GGUF](https://huggingface.co/mradermacher/armaGPT-i1-GGUF/resolve/main/armaGPT.i1-Q4_0_4_4.gguf) | i1-Q4_0_4_4 | 5.1 | fast on arm, low quality |
51
+ | [GGUF](https://huggingface.co/mradermacher/armaGPT-i1-GGUF/resolve/main/armaGPT.i1-Q4_0_4_8.gguf) | i1-Q4_0_4_8 | 5.1 | fast on arm+i8mm, low quality |
52
+ | [GGUF](https://huggingface.co/mradermacher/armaGPT-i1-GGUF/resolve/main/armaGPT.i1-Q4_0_8_8.gguf) | i1-Q4_0_8_8 | 5.1 | fast on arm+sve, low quality |
53
  | [GGUF](https://huggingface.co/mradermacher/armaGPT-i1-GGUF/resolve/main/armaGPT.i1-Q4_0.gguf) | i1-Q4_0 | 5.1 | fast, low quality |
54
  | [GGUF](https://huggingface.co/mradermacher/armaGPT-i1-GGUF/resolve/main/armaGPT.i1-Q4_K_S.gguf) | i1-Q4_K_S | 5.1 | optimal size/speed/quality |
55
  | [GGUF](https://huggingface.co/mradermacher/armaGPT-i1-GGUF/resolve/main/armaGPT.i1-Q4_K_M.gguf) | i1-Q4_K_M | 5.4 | fast, recommended |