HiroseKoichi
commited on
Commit
•
470c09b
1
Parent(s):
6aac349
Update README.md
Browse files
README.md
CHANGED
@@ -17,6 +17,10 @@ Changes in V3:
|
|
17 |
- Removed `opus-v1.2-llama-3-8b-instruct-run3.5-epoch2.5` and added `Einstein-v6.1-Llama3-8B`
|
18 |
- Swapped `Llama-3-Soliloquy-8B-v2` for `L3-8B-Stheno-v3.2`
|
19 |
|
|
|
|
|
|
|
|
|
20 |
# Details
|
21 |
- **License**: [llama3](https://llama.meta.com/llama3/license/)
|
22 |
- **Instruct Format**: [llama-3](https://llama.meta.com/docs/model-cards-and-prompt-formats/meta-llama-3/)
|
|
|
17 |
- Removed `opus-v1.2-llama-3-8b-instruct-run3.5-epoch2.5` and added `Einstein-v6.1-Llama3-8B`
|
18 |
- Swapped `Llama-3-Soliloquy-8B-v2` for `L3-8B-Stheno-v3.2`
|
19 |
|
20 |
+
I was clearly wrong when I said V2 would be difficult to improve on, because V3 is significantly better in just about every aspect. Stheno-v3.2 fixed all of the issues present in Stheno-v3.1, making it my favorite roleplay model and the best base model for llama-3 MoE merges.
|
21 |
+
|
22 |
+
The one thing I do want to improve on is finding a better conversational model than Meta-Llama-3-8B-Instruct; it's good for that use case, but I'm sure there's a better one out there. I tried using llama-3-cat-8b-instruct-v1, but it absolutely tanked the model's situational awareness and kept making blatantly contradictory statements.
|
23 |
+
|
24 |
# Details
|
25 |
- **License**: [llama3](https://llama.meta.com/llama3/license/)
|
26 |
- **Instruct Format**: [llama-3](https://llama.meta.com/docs/model-cards-and-prompt-formats/meta-llama-3/)
|