Update README.md
README.md CHANGED
@@ -17,8 +17,6 @@ The model is based on a "zeroed" passthrough merge of [Llama-3-15B-Instruct-zero
 
 This was primarily an experiment to see how a passthrough merge will respond to further finetuning, though this was done on a small dataset.
 
-The goal was to make a "mid" sized model like Meta has released in the past and the merge method was inspired by [mlabonne's Llama-3-120B](https://huggingface.co/mlabonne/Meta-Llama-3-120B-Instruct).
-
 The model was finetuned on **8192 context length** and is likely reliable using RoPE up to 32k.
 
 Further finetuning this model or finetuning the [base model](https://huggingface.co/elinas/Llama-3-15B-Instruct-zeroed) on more samples is encouraged.
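
For context on the technique the card describes, below is a minimal sketch of a "zeroed" passthrough self-merge in plain `transformers`. It assumes the commonly used recipe of duplicating a slice of decoder layers and zeroing the duplicates' output projections so the enlarged model initially behaves like the original; the donor checkpoint and the layer range are illustrative assumptions, not the exact recipe behind Llama-3-15B-Instruct-zeroed.

```python
import copy

import torch
from torch import nn
from transformers import AutoModelForCausalLM

# Sketch of a "zeroed" passthrough self-merge (assumed recipe, not the exact
# one used for Llama-3-15B-Instruct-zeroed): duplicate a slice of decoder
# layers, then zero the duplicates' output projections so the deeper model
# starts out functionally identical to the original.
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Meta-Llama-3-8B-Instruct",  # assumed 8B donor checkpoint
    torch_dtype=torch.bfloat16,
)
decoder = model.model

# Illustrative slice; the actual 15B recipe likely duplicates more layers.
duplicates = [copy.deepcopy(layer) for layer in decoder.layers[8:24]]
for layer in duplicates:
    # Zeroed o_proj/down_proj make each duplicated block a no-op through the
    # residual stream until finetuning updates it.
    nn.init.zeros_(layer.self_attn.o_proj.weight)
    nn.init.zeros_(layer.mlp.down_proj.weight)

decoder.layers = nn.ModuleList(
    list(decoder.layers[:24]) + duplicates + list(decoder.layers[24:])
)
# Keep per-layer bookkeeping (KV-cache indexing) consistent with the new depth.
for idx, layer in enumerate(decoder.layers):
    layer.self_attn.layer_idx = idx
model.config.num_hidden_layers = len(decoder.layers)
```

The zeroing step is what makes further finetuning well-behaved: the merged model's loss starts at the donor's level instead of spiking from randomly interacting duplicated blocks.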
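Since the card states the model was finetuned at 8192 context but is likely reliable with RoPE up to 32k, here is a minimal loading sketch assuming the standard `transformers` `rope_scaling` override. The dynamic-NTK type and factor of 4 (8192 × 4 = 32768) are assumptions, and the repo id shown is the base model linked above; substitute this finetune's own id.

```python
from transformers import AutoModelForCausalLM

# Minimal sketch, assuming the standard `rope_scaling` config override.
# Trained context is 8192, so a factor of 4 stretches RoPE toward ~32k.
# Repo id and scaling type/factor are assumptions, not from the model card.
model = AutoModelForCausalLM.from_pretrained(
    "elinas/Llama-3-15B-Instruct-zeroed",
    rope_scaling={"type": "dynamic", "factor": 4.0},
)
```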