Re-quantize and re-upload model
#1, opened by mtasic85
@bartowski llama.cpp fixed an issue with BOS/EOS tokens, so all models need to be re-quantized and re-uploaded.
Please check https://github.com/ggerganov/llama.cpp/issues/9315
Additionally, please quantize the RWKV 6 1b6, 3b, and 14b models :)
Will do, thanks for the info!
bartowski changed discussion status to closed