Mikko Juola
commited on
Commit
•
47653af
1
Parent(s):
6ceb0e6
Upload Q4_K_M and Q6_K, also add a README.md
Browse files- .gitattributes +2 -0
- README.md +21 -0
- aurelian-alpha0.1_Q4_K_M.gguf +3 -0
- aurelian-alpha0.1_Q6_K.gguf-split-a +3 -0
- aurelian-alpha0.1_Q6_K.gguf-split-b +3 -0
.gitattributes
CHANGED
@@ -33,3 +33,5 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
36 |
+
*.gguf filter=lfs diff=lfs merge=lfs -text
|
37 |
+
*.gguf-split* filter=lfs diff=lfs merge=lfs -text
|
README.md
ADDED
@@ -0,0 +1,21 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
This repository contains `.gguf` files for:
|
2 |
+
|
3 |
+
https://huggingface.co/grimulkan/aurelian-alpha0.1-70b-rope8-32K-fp16
|
4 |
+
|
5 |
+
Made with `llama.cpp` commit e18f7345a300920e234f732077bda660cc6cda9c
|
6 |
+
|
7 |
+
IMPORTANT: Linear Rope Scaling = 8 (IMPORTANT: use a factor of 8 even if you are not using the full 32K context length). The setting typically defaults to 1, so you need to change it.
|
8 |
+
|
9 |
+
# md5sums
|
10 |
+
|
11 |
+
* `aurelian-alpha0.1_Q4_K_M.gguf` 27ba8b8dc99776cc48d667d1766f8771
|
12 |
+
* `aurelian-alpha0.1_Q6_K.gguf` ab36ed3f2cfd2f833cb814304a5cbe50
|
13 |
+
|
14 |
+
The `aurelian-alpha0.1_Q6_K.gguf` is just barely over 50G, HuggingFace's file
|
15 |
+
limit, so it is in two parts.
|
16 |
+
|
17 |
+
On a UNIX-like system, you can use `cat` to piece it together:
|
18 |
+
|
19 |
+
```shell
|
20 |
+
cat aurelian-alpha0.1_Q6_K.gguf-split-a aurelian-alpha0.1_Q6_K.gguf-split-b > aurelian-alpha0.1_Q6_K.gguf
|
21 |
+
```
|
aurelian-alpha0.1_Q4_K_M.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:294417779406e249915395646d4ce64fb5af3019db21b6870791107ccdc56d43
|
3 |
+
size 41422910528
|
aurelian-alpha0.1_Q6_K.gguf-split-a
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:e58c7d93f12e760a9113022a812855f73aa2659034ebb0f2960d618e02e705c5
|
3 |
+
size 30064771072
|
aurelian-alpha0.1_Q6_K.gguf-split-b
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:d83c9d01cacae7b4be0831729fe8936667357093be9cb3f017572edfbaef9b5e
|
3 |
+
size 26522596416
|