Token Speeds for Q5_K_M?
#3 opened 5 months ago
by
Dreifort
Thanks a lot for GGUF variant.
#2 opened 10 months ago
by
Flanua
How much context can this model produce normally?
2
#1 opened 12 months ago
by
Anonimus12345678902