Please open mouth kiss the homies.
#1 by snombler - opened
llama.cpp's tokenization handling in the past two months is perhaps equally criminal
Not wrong! But until someone else wants to support split loading, it's all we've really got, sadly. Also, thanks for all your contributions.
tbh exl2 simply produces better outputs.
I am graciously willing to accept 3090s to run exl2s for anyone who has them to spare. I'll need enough to run at least 64k context.
I only see a 2-bit exl2 but a 4KM gguf. We've got different definitions of "before".
It's just proof that bullying works.