Update README.md
Browse files
README.md
CHANGED
@@ -5,7 +5,7 @@ license: cc-by-nc-4.0
|
|
5 |
# Noromaid-v0.1-mixtral-8x7b-v3 4bpw
|
6 |
Exllama quant of [NeverSleep/Noromaid-v0.1-mixtral-8x7b-v3](https://huggingface.co/NeverSleep/Noromaid-v0.1-mixtral-8x7b-v3)
|
7 |
|
8 |
-
You probably want to use the [3.5bpw](https://huggingface.co/Kooten/Noromaid-v0.1-mixtral-8x7b-v3-3.5bpw-exl2) version, it lands very close to 24gb of vram with
|
9 |
|
10 |
Other BPW's [2.7bpw](https://huggingface.co/Kooten/Noromaid-v0.1-mixtral-8x7b-v3-2.7bpw-exl2), [3.0bpw](https://huggingface.co/Kooten/Noromaid-v0.1-mixtral-8x7b-v3-3bpw-exl2), [3.5bpw](https://huggingface.co/Kooten/Noromaid-v0.1-mixtral-8x7b-v3-3.5bpw-exl2), [4.0bpw](https://huggingface.co/Kooten/Noromaid-v0.1-mixtral-8x7b-v3-4bpw-exl2)
|
11 |
|
|
|
5 |
# Noromaid-v0.1-mixtral-8x7b-v3 4bpw
|
6 |
Exllama quant of [NeverSleep/Noromaid-v0.1-mixtral-8x7b-v3](https://huggingface.co/NeverSleep/Noromaid-v0.1-mixtral-8x7b-v3)
|
7 |
|
8 |
+
You probably want to use the [3.5bpw](https://huggingface.co/Kooten/Noromaid-v0.1-mixtral-8x7b-v3-3.5bpw-exl2) version, it lands very close to 24gb of vram with 8bit cache enabled. GGUF is probably preferable if you have less than 24gb.
|
9 |
|
10 |
Other BPW's [2.7bpw](https://huggingface.co/Kooten/Noromaid-v0.1-mixtral-8x7b-v3-2.7bpw-exl2), [3.0bpw](https://huggingface.co/Kooten/Noromaid-v0.1-mixtral-8x7b-v3-3bpw-exl2), [3.5bpw](https://huggingface.co/Kooten/Noromaid-v0.1-mixtral-8x7b-v3-3.5bpw-exl2), [4.0bpw](https://huggingface.co/Kooten/Noromaid-v0.1-mixtral-8x7b-v3-4bpw-exl2)
|
11 |
|