Update README.md
README.md CHANGED
@@ -12,6 +12,8 @@ datasets:

# Humanish-RP-Llama-3.1-8B

+![image/webp](https://cdn-uploads.huggingface.co/production/uploads/5fad8602b8423e1d80b8a965/VPwtjS3BtjEEEq7ck4kAQ.webp)
+
A DPO-tuned Llama-3.1 that behaves more "humanish", i.e., avoids the typical AI-assistant slop. It also works for role-play (RP). To achieve this, the model was fine-tuned on a series of datasets:
* General conversations from Claude Opus
* `Undi95/Weyaxi-humanish-dpo-project-noemoji`, to make the model react like a human, rejecting assistant-like or overly neutral responses.
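
For readers of the updated card, here is a minimal inference sketch with `transformers`. The repository id below is a placeholder (the card does not state the full Hub path), and the dtype and sampling settings are illustrative assumptions, not the author's recommended configuration:

```python
# Minimal inference sketch; not the author's reference code.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Hypothetical repo id: replace with the model's actual Hugging Face Hub path.
model_id = "your-namespace/Humanish-RP-Llama-3.1-8B"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # assumed dtype; fp16 also works on most GPUs
    device_map="auto",
)

# Build the prompt with the model's own (Llama-3.1-style) chat template.
messages = [{"role": "user", "content": "Hey, how was your weekend?"}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(
    input_ids,
    max_new_tokens=256,
    do_sample=True,
    temperature=0.8,  # arbitrary sampling settings for illustration
)
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```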