Can it be quantified into a 4-bit version and support ollama, because ollama is used by many people now, and it is more convenient to deploy and supports many interfaces. Please consider it, thank you!
· Sign up or log in to comment