Transformers
GGUF
Not-For-All-Audiences
Inference Endpoints
conversational

QuantFactory/L3.1-8B-sunfall-stheno-v0.6.1-GGUF

This is a quantized version of crestf411/L3.1-8B-sunfall-stheno-v0.6.1, created using llama.cpp.
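Since the files in this repository are GGUF files produced by llama.cpp, a quick sanity check on a download is to read the fixed-size header at the start of the file. The following is a minimal sketch, not part of the original card. It assumes GGUF version 2 or later in little-endian byte order, where the header is the 4-byte magic `GGUF`, a 32-bit version, a 64-bit tensor count, and a 64-bit metadata key/value count. The function name `read_gguf_header` is made up for illustration.

```python
import struct


def read_gguf_header(path):
    """Read the fixed-size GGUF header (assumes GGUF v2+ and little-endian layout)."""
    with open(path, "rb") as f:
        magic = f.read(4)
        if magic != b"GGUF":
            raise ValueError(f"not a GGUF file (magic was {magic!r})")
        # uint32 version, then uint64 tensor count and uint64 metadata KV count
        (version,) = struct.unpack("<I", f.read(4))
        tensor_count, kv_count = struct.unpack("<QQ", f.read(16))
    return {
        "version": version,
        "tensor_count": tensor_count,
        "metadata_kv_count": kv_count,
    }
```

Calling `read_gguf_header` on the path of a downloaded quantization file returns the version and counts as a dictionary, or raises `ValueError` if the file does not start with the GGUF magic (for example, a truncated or mislabeled download).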

Original Model Card

Sunfall (2024-07-31) v0.6.1 on top of https://huggingface.co/Sao10K/Llama-3.1-8B-Stheno-v3.4

See https://huggingface.co/crestf411/L3.1-8B-sunfall-v0.6.1-dpo for details on usage.

GGUF
Model size: 8.03B params
Architecture: llama
Available quantizations: 2-bit, 3-bit, 4-bit, 5-bit, 6-bit, 8-bit
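To get a rough feel for what the bit widths above mean on disk, the parameter count can be multiplied by the bits per weight. This is a back-of-the-envelope sketch, not a figure from the original card. It assumes every one of the 8.03B parameters is stored at exactly the nominal bit width. Real llama.cpp quantization types store scales alongside the weights and keep some tensors at higher precision, so actual files are typically somewhat larger than these estimates.

```python
PARAMS = 8.03e9  # parameter count stated on the model card


def approx_size_gb(bits_per_weight, params=PARAMS):
    """Idealized size in GB (10^9 bytes) if every weight used exactly this many bits."""
    return params * bits_per_weight / 8 / 1e9


if __name__ == "__main__":
    # Bit widths listed on the model card
    for bits in (2, 3, 4, 5, 6, 8):
        print(f"{bits}-bit: ~{approx_size_gb(bits):.2f} GB")
```

Under this idealization the 4-bit variant comes to about 4 GB and the 8-bit variant to about 8 GB, which is a useful lower bound when choosing a quantization for a given amount of RAM or VRAM.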

