Re-quantize and re-upload model

by mtasic85 - opened Sep 14

Discussion

mtasic85

Sep 14

@Lyte llama.cpp fixed issue with BOS/EOS tokens, and all models need to be re-quantized and re-uploaded.

Please check https://github.com/ggerganov/llama.cpp/issues/9315

Lyte

Owner Sep 14

okay thanks for letting me know, I'll get to it asap!

Lyte

Owner Sep 16

•

edited Sep 16

@mtasic85 done using latest llama.cpp same as the other repo. feel free to try quantizing them yourself I've included the notebook i use to do this, it's the same one that i used for both RWKV models.

Lyte changed discussion status to closed Sep 16

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment