Details about GPTQ, GGUF, HF, GGML, AWQ, fp16, and the Uncensored variant of each
Can you tell me the difference between all of these model types:
GPTQ, GGUF, HF, GGML, AWQ, fp16
Uncensored GPTQ, Uncensored GGUF, Uncensored HF, Uncensored fp16, Uncensored AWQ, Uncensored GGML
Which one is better and faster, the Uncensored one or the one without Uncensored? And which model is fast and good on a Google Colab T4 GPU?
Uncensored just means the model will try to answer NSFW questions instead of refusing them. It's not a format.
GGUF is the format for llama.cpp, best for CPU or Mac.
GPTQ with ExLlama will be the fastest format, and
AWQ should be the highest quality.
fp16 is the original, unquantized model.
GGML is an older format, similar to GGUF.
HF is the format used by Hugging Face Transformers, which is usually fp16.
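To see why the quantized formats above matter on a 16 GB T4, here's a rough sketch of what group-wise 4-bit quantization (the idea behind GPTQ, AWQ, and 4-bit GGUF) does to storage. This is an illustration in NumPy, not any of those libraries' actual code; the group size of 128 and the symmetric int4 scheme are assumptions for the sketch:

```python
import numpy as np

def quantize_4bit(weights, group_size=128):
    """Symmetric 4-bit group-wise quantization: one fp16 scale per group."""
    w = weights.reshape(-1, group_size)
    # Scale each group so its largest weight maps to the int4 value 7.
    scale = np.abs(w).max(axis=1, keepdims=True) / 7
    q = np.clip(np.round(w / scale), -8, 7).astype(np.int8)
    return q, scale.astype(np.float16)

def dequantize(q, scale):
    return (q.astype(np.float32) * scale).reshape(-1)

rng = np.random.default_rng(0)
w = rng.normal(0, 0.02, size=4096).astype(np.float32)  # toy weight tensor
q, scale = quantize_4bit(w)
w_hat = dequantize(q, scale)

# 4-bit storage: half a byte per weight, plus one fp16 scale per 128 weights.
fp16_bytes = w.size * 2
int4_bytes = w.size // 2 + scale.size * 2
print(fp16_bytes, int4_bytes)  # 8192 vs 2112, roughly 3.9x smaller
print(np.abs(w - w_hat).max())  # reconstruction error stays small
```

At that ratio, a 7B-parameter model drops from roughly 14 GB in fp16 to under 4 GB in 4-bit, which is why a 4-bit GPTQ model fits comfortably in a T4's 16 GB while the fp16/HF version does not.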
Thank you for your response and explanation, @johnwick123forevr