kamilmuratyilmaz committed on
Commit
336fe06
1 Parent(s): 80d5cb0

Update README.md

  1. README.md +89 -13
README.md CHANGED
@@ -16,28 +16,104 @@ model-index:
   results:
   - task:
       type: text-generation
+      name: Text Generation
+    dataset:
+      name: AI2 Reasoning Challenge (25-Shot)
+      type: ai2_arc
+      config: ARC-Challenge
+      split: test
+      args:
+        num_few_shot: 25
     metrics:
-    - name: ARC (25-Shot)
-      type: arc_challenge (25-Shot)
+    - type: acc_norm
       value: 60.07
-    - name: Hellaswag (10-Shot)
-      type: Hellaswag (10-Shot)
+      name: normalized accuracy
+    source:
+      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=cypienai/cymist-2-v02-SFT
+      name: Open LLM Leaderboard
+  - task:
+      type: text-generation
+      name: Text Generation
+    dataset:
+      name: HellaSwag (10-Shot)
+      type: hellaswag
+      split: validation
+      args:
+        num_few_shot: 10
+    metrics:
+    - type: acc_norm
       value: 83.43
-    - name: TruthfulQA
-      type: TruthfulQA
+      name: normalized accuracy
+    source:
+      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=cypienai/cymist-2-v02-SFT
+      name: Open LLM Leaderboard
+  - task:
+      type: text-generation
+      name: Text Generation
+    dataset:
+      name: TruthfulQA (0-shot)
+      type: truthful_qa
+      config: multiple_choice
+      split: validation
+      args:
+        num_few_shot: 0
+    metrics:
+    - type: mc2
       value: 52.06
-    - name: GSM8K (5-Shot)
-      type: GSM8K (5-Shot)
+    source:
+      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=cypienai/cymist-2-v02-SFT
+      name: Open LLM Leaderboard
+  - task:
+      type: text-generation
+      name: Text Generation
+    dataset:
+      name: GSM8k (5-shot)
+      type: gsm8k
+      config: main
+      split: test
+      args:
+        num_few_shot: 5
+    metrics:
+    - type: acc
       value: 38.97
-    - name: Winogrande (5-Shot)
-      type: Winogrande (5-Shot)
+      name: accuracy
+    source:
+      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=cypienai/cymist-2-v02-SFT
+      name: Open LLM Leaderboard
+  - task:
+      type: text-generation
+      name: Text Generation
+    dataset:
+      name: Winogrande (5-shot)
+      type: winogrande
+      config: winogrande_xl
+      split: validation
+      args:
+        num_few_shot: 5
+    metrics:
+    - type: acc
       value: 78.61
-    - name: MMLU (5-Shot)
-      type: MMLU (5-Shot)
+      name: accuracy
+    source:
+      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=cypienai/cymist-2-v02-SFT
+      name: Open LLM Leaderboard
+  - task:
+      type: text-generation
+      name: Text Generation
+    dataset:
+      name: MMLU (5-Shot)
+      type: cais/mmlu
+      config: all
+      split: test
+      args:
+        num_few_shot: 5
+    metrics:
+    - type: acc
       value: 60.07
+      name: accuracy
     source:
+      url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=cypienai/cymist-2-v02-SFT
       name: Open LLM Leaderboard
-      url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard
 ---
 # Model Card for Cymist2-v0.2-SFT
 ### Model Description