INST problem?
#2
by Slayery - opened
This model also spams INST like all the others that have traces of NeuralTrix, doesn't it? (I mean the GGUF versions; the issue doesn't seem to be present in the non-quantized models.)
I will try to quantize it today and see how it performs.
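For reference, this is roughly the workflow I'd use, a minimal sketch assuming a local llama.cpp checkout (the model directory and output file names are just placeholders):

```python
# Rough sketch of the GGUF quantization workflow, assuming a local clone of
# llama.cpp; paths, script locations, and the model directory are placeholders.
import subprocess

MODEL_DIR = "OGNO-7B"              # local HF snapshot of the model (assumption)
F16_GGUF = "ogno-7b-f16.gguf"      # intermediate full-precision GGUF
Q5_GGUF = "ogno-7b-Q5_K_M.gguf"    # quantized output

# 1) Convert the HF checkpoint to an f16 GGUF file.
subprocess.run(
    ["python", "llama.cpp/convert.py", MODEL_DIR,
     "--outfile", F16_GGUF, "--outtype", "f16"],
    check=True,
)

# 2) Quantize the f16 GGUF down to Q5_K_M.
subprocess.run(
    ["llama.cpp/quantize", F16_GGUF, Q5_GGUF, "Q5_K_M"],
    check=True,
)
```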
I quantized the model and it definitely has this problem. I think we should focus on finding the model at the root of the issue.
Here is the quantized version: paulml/OGNO-7B-GGUF
I can also reproduce the INST loop error with the Q5_K_M quantization type generated with llama.cpp version b2257.
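For anyone who wants to check on their side, this is roughly how I trigger it, a sketch using the llama-cpp-python bindings (the model path and prompt are placeholders):

```python
# Minimal repro sketch for the [INST] loop, assuming a Q5_K_M GGUF file and
# the llama-cpp-python bindings; model path and prompt are placeholders.
from llama_cpp import Llama

llm = Llama(model_path="ogno-7b-Q5_K_M.gguf", n_ctx=2048)

# Mistral-style instruction prompt; an affected quant tends to keep emitting
# [INST]/[/INST] markers in its continuation instead of stopping cleanly.
out = llm("[INST] What is the capital of France? [/INST]", max_tokens=128)
print(out["choices"][0]["text"])
```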