Update README.md
README.md
CHANGED
@@ -11,7 +11,7 @@ library_name: transformers
 
 This is the 12th in a series of models designed to replicate the prose quality of the Claude 3 models, specifically Sonnet and Opus.
 
-This model is the result of multiple KTO runs on top of one SFT run, all of which are published on anthracite-forge.
+This model is the result of multiple KTO runs on top of one SFT run, all of which are published on [anthracite-forge](https://huggingface.co/anthracite-forge).
 
 ## Methodology
 
@@ -19,7 +19,7 @@ R1 (SFT) was fine-tuned on top of `IntervitensInc/gemma-2-27b-chatml` which is c
 
 We have experimented with various SFT and KTO re-runs, ratios and merge methods and this was our winner, including what was liked most from each model.
 
-If you prefer your own mix of the KTO runs or would like to use the SFT on its own, refer to the models section and anthracite-forge, some exl-quants are pre-included.
+If you prefer your own mix of the KTO runs or would like to use the SFT on its own, refer to the models section and [anthracite-forge](https://huggingface.co/anthracite-forge), some exl-quants are pre-included.
 
 ## Models
 