Feature request of quantization of starchat-alpha

by delas - opened May 13, 2023

May 13, 2023

Hi!

Firstly, thank you for your work implementing starcoder for ggml. Working fine with the 4 bit quantized version.

I would like to make a feature request related with the implementation of the 4 bit quantize version of starchat-alpha for enabling the usage with ggml.

Best Regards,

Jordi

mirek190

May 14, 2023

Hello

How to run that starcoder ggml? I tried with llama.cpp and koboldcpp ... no working

Lynxpda

May 14, 2023

This comment has been hidden

mirek190

May 14, 2023

Thank you .
Is working so bad for you like for me?

I'm asking for c++ Fibonacci code and getting always for python.... and other ask also is giving me python code....

delas

May 14, 2023

I suppose starcoder is not optimized for chat interaction but for autocompletion, that's why the finetuned variation starchat-alpha probably is more useful for interaction purposes.

mirek190

May 14, 2023

Oh I see . Thanks .

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment