Upload folder using huggingface_hub (#2)
Browse files- d97ea4a1d42c3368b208cf06cec623e9f5fb03938a279c38612d1acde7244a27 (0e434f8898d259cf5836d06671bc7e941ec53fcb)
- 07314d8a2b5323b9d045348b779f0f5c93de26113e9320c6a2c92a80725521ac (6b3d00fa8783496d807e420762b9cabacc027423)
- 4325ec2cd0579937e24513e09138db858b55a686e157f87f163f9ec0797ebefe (e8c42a358c4af9c87659d59c1c472d338b210933)
- c27c599ce30688e70283a911239259359932f5e4c2bfce54c4267fb92906764b (2e83ccf003a390777a5b71816f0a35181447979e)
- 038dbb9704e27e05422b0e62878c4cba5cc4bcf528c1c3ef6e2170de97836f79 (b439b66edbed67a7991bb4eb8b45e71a236c169c)
- 09c69d43845de7bc4d3f1bbb8910d69b03b2444e206c2a086ca7b8b28e406038 (7f4a88afd9edd165749ee48011c4b98970ae6ca0)
- d59c868477712071137f1d10f3aeac4411d8bdc386437159bffb1e377c8b2402 (ce9cc479e71841c6654e1c5a5c1152cc4533a97d)
- f2de7ffd39aec8dddf94b1b9c86f45f93b738fff04db543ca05c64ce4ef1af44 (fac5d05b65e33b59f82bcaa3619bf063fb8596e4)
- 72058eebd438f4bc08ee3197c32530097a37e5db2fe8eb29dd3e0b283ed81117 (8261002ffbb547e1e83aeb9b5f0bdf138a246d82)
- 8690d7a32304278be77265f5e545764276b5ea5e42338cef35ae46bb1808d0ac (08861ce0f034800c7c62ab38ebb546addfe89f4f)
- 43c4c03e63f03f320b5c319adbe9d22673fb4731b88e3eb34c3fbf71d604b5b6 (7647d2f06b5fc87eeb9cb1ade0013ec4ad16442a)
- e4d076c506c9ff0d713a316209d4031be46387e5687bbd6ef0489e9d05dd29b4 (5dd09138046c66513b8eea6c555b22b55128c72c)
- d133f4fd86e7d8452b2b226b1e753a8f9b42d3c4d2753b44fd03e1701660a7c3 (ea3c2edffcf84c02f869cfc439b4ce359df39d04)
- f16f70c3113e86a025e87e792028f8ec80c6a909ecbcf0d090c0d535cff39b59 (81078b7e7505fc7f699bf548d7d7844081d89fe0)
- a7adc8824d32bab8aaac7cd4240b665218a36001e62491637265db20bc94f685 (919c87e2e42cca1463486a70173d298d6938ea09)
- 907797f952e6f222bd6736c71da95add21423af790573aa20f5d3b9b04348db5 (4914a5d870bd89419d6c1e5cc529ce9f23cb627c)
- .gitattributes +16 -0
- Mistral-Small-Instruct-2409-GGUF_imatrix.dat +3 -0
- Mistral-Small-Instruct-2409.IQ1_M.gguf +3 -0
- Mistral-Small-Instruct-2409.IQ1_S.gguf +3 -0
- Mistral-Small-Instruct-2409.IQ2_XS.gguf +3 -0
- Mistral-Small-Instruct-2409.IQ3_XS.gguf +3 -0
- Mistral-Small-Instruct-2409.IQ4_XS.gguf +3 -0
- Mistral-Small-Instruct-2409.Q2_K.gguf +3 -0
- Mistral-Small-Instruct-2409.Q3_K_L.gguf +3 -0
- Mistral-Small-Instruct-2409.Q3_K_M.gguf +3 -0
- Mistral-Small-Instruct-2409.Q3_K_S.gguf +3 -0
- Mistral-Small-Instruct-2409.Q4_K_M.gguf +3 -0
- Mistral-Small-Instruct-2409.Q4_K_S.gguf +3 -0
- Mistral-Small-Instruct-2409.Q5_K_M.gguf +3 -0
- Mistral-Small-Instruct-2409.Q5_K_S.gguf +3 -0
- Mistral-Small-Instruct-2409.Q6_K.gguf +3 -0
- Mistral-Small-Instruct-2409.Q8_0.gguf +3 -0
- README.md +46 -0
@@ -46,3 +46,19 @@ Llama-3.1-Nemotron-70B-Instruct-HF.Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -te
|
|
46 |
Llama-3.1-Nemotron-70B-Instruct-HF.Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
47 |
Llama-3.1-Nemotron-70B-Instruct-HF.Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
48 |
Llama-3.1-Nemotron-70B-Instruct-HF.Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
46 |
Llama-3.1-Nemotron-70B-Instruct-HF.Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
47 |
Llama-3.1-Nemotron-70B-Instruct-HF.Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
48 |
Llama-3.1-Nemotron-70B-Instruct-HF.Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
49 |
+
Mistral-Small-Instruct-2409.IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text
|
50 |
+
Mistral-Small-Instruct-2409.IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text
|
51 |
+
Mistral-Small-Instruct-2409.IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text
|
52 |
+
Mistral-Small-Instruct-2409.IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text
|
53 |
+
Mistral-Small-Instruct-2409.IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
|
54 |
+
Mistral-Small-Instruct-2409.Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
|
55 |
+
Mistral-Small-Instruct-2409.Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
|
56 |
+
Mistral-Small-Instruct-2409.Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
57 |
+
Mistral-Small-Instruct-2409.Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
58 |
+
Mistral-Small-Instruct-2409.Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
59 |
+
Mistral-Small-Instruct-2409.Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
60 |
+
Mistral-Small-Instruct-2409.Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
61 |
+
Mistral-Small-Instruct-2409.Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
62 |
+
Mistral-Small-Instruct-2409.Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
|
63 |
+
Mistral-Small-Instruct-2409.Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
|
64 |
+
Mistral-Small-Instruct-2409-GGUF_imatrix.dat filter=lfs diff=lfs merge=lfs -text
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f47eb7bdf4e9698356ed29f4ec02f8c023cbcc84bbb05e955ce60ac74f0acf86
|
3 |
+
size 11940554
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:50dda2e14f4849ef0e4ddfbe610cae2b7cde724f004bd0c3e9abe82e15dafd66
|
3 |
+
size 5267138688
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:41fbc1a93b2303d06cae1df8f4bf77ec07bc62c34992bbdad4ea4f1de835eec3
|
3 |
+
size 4829489280
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:8b74d332ab2d788d5fda974e0cefce22152e38b848e10e5818982d63a040b747
|
3 |
+
size 6646147200
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:e5ba302e41d2d4513aa18ec51072e4c5f1e5eb17ee5ecdb2d59736e51adc4529
|
3 |
+
size 9176098944
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:4d5aa87cd99bce27472a819eb91e67fcfbb48723564bbb2f65a14566cd60da32
|
3 |
+
size 11935295616
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:12e92e39ff67b753916fdbb001eb6f110c04016e1dc534ce31490a2cd7635884
|
3 |
+
size 8272095360
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:fd268dd00803f8a99e2b8ebfc1cff8387fc9244b212b3896b59523e21e84af25
|
3 |
+
size 11730430080
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5283b47015764c884cad9543450999bcc9a0329bc221af1951cebf9bfdb19bfb
|
3 |
+
size 10756827264
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:b9e1c0a5c5e74d4e28a23e7d87358a07fcedd373c1ed93a8d41eaaf372a7da54
|
3 |
+
size 9641273472
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:ebb16dd5fbc43503bda45718f460480e35c800b37bb9e1a4863c7b8acd0d6f7a
|
3 |
+
size 13341239424
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:e43f178daccefd625441dcf90dc4a18cd020dce0478f7171b6adf5d5194190ee
|
3 |
+
size 12660385920
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:ac958261fb195ffaffcf73f6dabec440e39a5d16a1274eb5cbe92af0cbc49c79
|
3 |
+
size 15722555520
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:8fe6ffa97b222a379d74093d93606827e4c9087a190adc3b0227e075b50eee98
|
3 |
+
size 15324817536
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:2a5f32c232f744817df2e066e44d99d0437b2ded200182ecfd38cca4fd76dee8
|
3 |
+
size 18252703872
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:b6f80a53d949e3a0f3c5f5dd2a6f70ebe0a014b604d6dfc6526a67a85e1bc817
|
3 |
+
size 23640549504
|
@@ -0,0 +1,46 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
tags:
|
3 |
+
- quantized
|
4 |
+
- 2-bit
|
5 |
+
- 3-bit
|
6 |
+
- 4-bit
|
7 |
+
- 5-bit
|
8 |
+
- 6-bit
|
9 |
+
- 8-bit
|
10 |
+
- GGUF
|
11 |
+
- text-generation
|
12 |
+
- text-generation
|
13 |
+
model_name: Mistral-Small-Instruct-2409-GGUF
|
14 |
+
base_model: mistralai/Mistral-Small-Instruct-2409
|
15 |
+
inference: false
|
16 |
+
model_creator: mistralai
|
17 |
+
pipeline_tag: text-generation
|
18 |
+
quantized_by: MaziyarPanahi
|
19 |
+
---
|
20 |
+
# [MaziyarPanahi/Mistral-Small-Instruct-2409-GGUF](https://huggingface.co/MaziyarPanahi/Mistral-Small-Instruct-2409-GGUF)
|
21 |
+
- Model creator: [mistralai](https://huggingface.co/mistralai)
|
22 |
+
- Original model: [mistralai/Mistral-Small-Instruct-2409](https://huggingface.co/mistralai/Mistral-Small-Instruct-2409)
|
23 |
+
|
24 |
+
## Description
|
25 |
+
[MaziyarPanahi/Mistral-Small-Instruct-2409-GGUF](https://huggingface.co/MaziyarPanahi/Mistral-Small-Instruct-2409-GGUF) contains GGUF format model files for [mistralai/Mistral-Small-Instruct-2409](https://huggingface.co/mistralai/Mistral-Small-Instruct-2409).
|
26 |
+
|
27 |
+
### About GGUF
|
28 |
+
|
29 |
+
GGUF is a new format introduced by the llama.cpp team on August 21st 2023. It is a replacement for GGML, which is no longer supported by llama.cpp.
|
30 |
+
|
31 |
+
Here is an incomplete list of clients and libraries that are known to support GGUF:
|
32 |
+
|
33 |
+
* [llama.cpp](https://github.com/ggerganov/llama.cpp). The source project for GGUF. Offers a CLI and a server option.
|
34 |
+
* [llama-cpp-python](https://github.com/abetlen/llama-cpp-python), a Python library with GPU accel, LangChain support, and OpenAI-compatible API server.
|
35 |
+
* [LM Studio](https://lmstudio.ai/), an easy-to-use and powerful local GUI for Windows and macOS (Silicon), with GPU acceleration. Linux available, in beta as of 27/11/2023.
|
36 |
+
* [text-generation-webui](https://github.com/oobabooga/text-generation-webui), the most widely used web UI, with many features and powerful extensions. Supports GPU acceleration.
|
37 |
+
* [KoboldCpp](https://github.com/LostRuins/koboldcpp), a fully featured web UI, with GPU accel across all platforms and GPU architectures. Especially good for story telling.
|
38 |
+
* [GPT4All](https://gpt4all.io/index.html), a free and open source local running GUI, supporting Windows, Linux and macOS with full GPU accel.
|
39 |
+
* [LoLLMS Web UI](https://github.com/ParisNeo/lollms-webui), a great web UI with many interesting and unique features, including a full model library for easy model selection.
|
40 |
+
* [Faraday.dev](https://faraday.dev/), an attractive and easy to use character-based chat GUI for Windows and macOS (both Silicon and Intel), with GPU acceleration.
|
41 |
+
* [candle](https://github.com/huggingface/candle), a Rust ML framework with a focus on performance, including GPU support, and ease of use.
|
42 |
+
* [ctransformers](https://github.com/marella/ctransformers), a Python library with GPU accel, LangChain support, and OpenAI-compatible AI server. Note, as of time of writing (November 27th 2023), ctransformers has not been updated in a long time and does not support many recent models.
|
43 |
+
|
44 |
+
## Special thanks
|
45 |
+
|
46 |
+
🙏 Special thanks to [Georgi Gerganov](https://github.com/ggerganov) and the whole team working on [llama.cpp](https://github.com/ggerganov/llama.cpp/) for making all of this possible.
|