- Downloads last month
- 63
Inference API (serverless) is not available, repository is disabled.
Evaluation results
- normalized accuracy on AI2 Reasoning Challenge (25-Shot)test set Open LLM Leaderboard34.560
- normalized accuracy on HellaSwag (10-Shot)validation set Open LLM Leaderboard58.240
- accuracy on MMLU (5-Shot)test set Open LLM Leaderboard25.790
- mc2 on TruthfulQA (0-shot)validation set Open LLM Leaderboard39.930
- accuracy on Winogrande (5-shot)validation set Open LLM Leaderboard63.930
- accuracy on GSM8k (5-shot)test set Open LLM Leaderboard4.850