HiroseKoichi committed
Commit 3aaba32
1 Parent(s): 2e57eec

Create README.md

Files changed (1)
  1. README.md +94 -0
README.md ADDED
@@ -0,0 +1,94 @@
+ ---
+ license: llama3
+ library_name: transformers
+ tags:
+ - nsfw
+ - not-for-all-audiences
+ - llama-3
+ - text-generation-inference
+ - moe
+ - mergekit
+ - merge
+ ---
+
+ # Llama-Salad-4x8B-V3
+ Changes in V3:
+ - Uses `L3-8B-Stheno-v3.2` as the base model instead of `Meta-Llama-3-8B-Instruct`
+ - Removed `opus-v1.2-llama-3-8b-instruct-run3.5-epoch2.5` and added `Einstein-v6.1-Llama3-8B`
+ - Swapped `Llama-3-Soliloquy-8B-v2` for `L3-8B-Stheno-v3.2`
+
+ # Details
+ - **License**: [llama3](https://llama.meta.com/llama3/license/)
+ - **Instruct Format**: [llama-3](https://llama.meta.com/docs/model-cards-and-prompt-formats/meta-llama-3/)
+ - **Context Size**: 8K
+
+ ## Models Used
+ - [Meta-Llama-3-8B-Instruct](https://huggingface.co/NousResearch/Meta-Llama-3-8B-Instruct)
+ - [llama-3-cat-8b-instruct-v1](https://huggingface.co/TheSkullery/llama-3-cat-8b-instruct-v1)
+ - [Llama-3-8B-Synthia-v3.5](https://huggingface.co/migtissera/Llama-3-8B-Synthia-v3.5)
+ - [Einstein-v6.1-Llama3-8B](https://huggingface.co/Weyaxi/Einstein-v6.1-Llama3-8B)
+
+ ## Merge Config
+ ```yaml
+ base_model: Sao10K/L3-8B-Stheno-v3.2
+ gate_mode: hidden
+ dtype: bfloat16
+ experts_per_token: 2
+ experts:
+   - source_model: NousResearch/Meta-Llama-3-8B-Instruct
+     positive_prompts:
+       - "chat"
+       - "conversation"
+   - source_model: Weyaxi/Einstein-v6.1-Llama3-8B
+     positive_prompts:
+       - "science"
+       - "physics"
+       - "chemistry"
+       - "biology"
+       - "math"
+       - "step-by-step"
+       - "logical reasoning"
+       - "multilingual"
+       - "translation"
+       - "language translation"
+       - "foreign language"
+     negative_prompts:
+       - "programming language"
+   - source_model: migtissera/Llama-3-8B-Synthia-v3.5
+     positive_prompts:
+       - "summarize"
+       - "paraphrase"
+       - "list"
+       - "explain"
+       - "define"
+       - "analyze"
+       - "rephrase"
+       - "elaborate"
+       - "programming language"
+       - "JavaScript"
+       - "Python programming language"
+       - "Rust programming language"
+       - "C++ programming language"
+       - "GO programming language"
+       - "Ruby programming language"
+       - "Haskell programming language"
+       - "SQL query language"
+       - "CSS markup styling language"
+       - "code"
+   - source_model: Sao10K/L3-8B-Stheno-v3.2
+     positive_prompts:
+       - "characters"
+       - "scene"
+       - "roleplay"
+       - "erotic roleplay"
+       - "sexual fetish"
+       - "NSFW"
+       - "creative writing"
+       - "storytelling"
+       - "narration"
+       - "narrative setting"
+       - "narrative plot"
+       - "narrative exposition"
+       - "narrative theme"
+       - "narrative climax"
+ ```
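
Reviewer note: as a rough illustration of what `experts_per_token: 2` in the config above implies at inference time, here is a minimal sketch of top-2 routing over the four experts. It assumes Mixtral-style gating (softmax over the router logits, keep the two largest, renormalize), which is an assumption about the merged model's architecture and is not stated in the README. The function name `route_top_k` and the logit values are hypothetical and exist only for illustration.

```python
import math

# Experts in the order they appear in the merge config.
EXPERTS = [
    "NousResearch/Meta-Llama-3-8B-Instruct",
    "Weyaxi/Einstein-v6.1-Llama3-8B",
    "migtissera/Llama-3-8B-Synthia-v3.5",
    "Sao10K/L3-8B-Stheno-v3.2",
]


def route_top_k(router_logits, k=2):
    """Return (expert_index, weight) pairs for the k highest-scoring experts.

    Assumed scheme: softmax over all logits, keep the top k,
    then renormalize the kept weights so they sum to 1.
    """
    # Softmax with the max subtracted for numerical stability.
    peak = max(router_logits)
    exps = [math.exp(x - peak) for x in router_logits]
    total = sum(exps)
    probs = [e / total for e in exps]

    # Indices of the k most probable experts, highest first.
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]

    # Renormalize over the selected experts only.
    kept = sum(probs[i] for i in top)
    return [(i, probs[i] / kept) for i in top]


# Hypothetical router logits for a single token (not taken from the model).
example_logits = [0.1, 2.0, 1.0, -0.5]
for index, weight in route_top_k(example_logits, k=2):
    print(f"{EXPERTS[index]}: {weight:.3f}")
```

With these made-up logits the token would be handled by the Einstein and Synthia experts, and their two outputs would be blended using the renormalized weights. In the real model the logits come from a learned or prompt-initialized gate per layer, so the selection varies by token and by layer.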