Will you release q6_k version too?
#1 opened by Hoioi
Could you please release a q6_k version too?
The current GGML version is based on this project: https://github.com/li-plus/chatglm.cpp. The supported quantizations are "f32", "f16", "q8_0", "q4_0", "q4_1", "q5_0", "q5_1"; q6_k is not available yet.
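As a minimal sketch of what "supported" means here: the quantization type passed to the converter must be one of the listed names, so a q6_k request can be rejected up front. The check below is illustrative (the variable names and the commented-out conversion command with its model path are assumptions, not part of this repo):

```shell
# Quantization types listed as supported by the chatglm.cpp-based GGML build
SUPPORTED="f32 f16 q8_0 q4_0 q4_1 q5_0 q5_1"
QTYPE="q6_k"

# Check the requested type against the supported list
case " $SUPPORTED " in
  *" $QTYPE "*) echo "$QTYPE: supported" ;;
  *)            echo "$QTYPE: not supported" ;;
esac

# Hypothetical conversion invocation (model name and output path are placeholders):
# python3 chatglm_cpp/convert.py -i THUDM/chatglm-6b -t q4_0 -o chatglm-ggml.bin
```

Running this with `QTYPE="q6_k"` prints `q6_k: not supported`, matching the reply above; swapping in `q4_0` would pass the check.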
We are going to upload the remaining quantization versions soon. Stay tuned :)
Thank you so much. I'm waiting for them to add support for other quantizations.
Hoioi changed discussion status to closed