nicholasKluge
/

Aira-OPT-125M

@@ -15,8 +15,6 @@ tags:
 - assistant
 pipeline_tag: text-generation
 widget:
-- text: "What is your name?<|endofinstruction|>"
-  example_title: Greetings
 - text: "Can you explain what is Machine Learning?<|endofinstruction|>"
   example_title: Machine Learning
 - text: "Do you know anything about virtue ethics?<|endofinstruction|>"
@@ -107,14 +105,14 @@ The model will output something like:
 ## Evaluation
-| Model (OPT)                                                         | Average   | [ARC](https://arxiv.org/abs/1803.05457) | [TruthfulQA](https://arxiv.org/abs/2109.07958) | [ToxiGen](https://arxiv.org/abs/2203.09509) |   |   |
-|---------------------------------------------------------------------|-----------|-----------------------------------------|------------------------------------------------|---------------------------------------------|---|---|
-| [Aira-OPT-125M](https://huggingface.co/nicholasKluge/Aira-OPT-125M) | **43.34** | **24.65**                               | **49.11**                                      | **56.27**                                   |   |   |
-| OPT-125M                                                            | 40.29     | 22.78                                   | 42.88                                          | 55.21                                       |   |   |
-| [Aira-OPT-350M](https://huggingface.co/nicholasKluge/Aira-OPT-350M) | **41.56** | **25.00**                               | **42.13**                                      | **57.55**                                   |   |   |
-| OPT-350M                                                            | 40.62     | 23.97                                   | 41.00                                          | 56.91                                       |   |   |
-| [Aira-OPT-1B3](https://huggingface.co/nicholasKluge/Aira-OPT-1B3)   | **43.90** | 28.41                                   | **46.59**                                      | **56.70**                                   |   |   |
-| OPT-1.3b                                                            | 40.91     | **29.69**                               | 38.68                                          | 54.36                                       |   |   |
 * Evaluations were performed using the [Language Model Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness) (by [EleutherAI](https://www.eleuther.ai/)).

 - assistant
 pipeline_tag: text-generation
 widget:
 - text: "Can you explain what is Machine Learning?<|endofinstruction|>"
   example_title: Machine Learning
 - text: "Do you know anything about virtue ethics?<|endofinstruction|>"
 ## Evaluation
+| Model (OPT)                                                         | Average   | [ARC](https://arxiv.org/abs/1803.05457) | [TruthfulQA](https://arxiv.org/abs/2109.07958) | [ToxiGen](https://arxiv.org/abs/2203.09509) |
+|---------------------------------------------------------------------|-----------|-----------------------------------------|------------------------------------------------|---------------------------------------------|
+| [Aira-OPT-125M](https://huggingface.co/nicholasKluge/Aira-OPT-125M) | **43.34** | **24.65**                               | **49.11**                                      | **56.27**                                   |
+| OPT-125M                                                            | 40.29     | 22.78                                   | 42.88                                          | 55.21                                       |
+| [Aira-OPT-350M](https://huggingface.co/nicholasKluge/Aira-OPT-350M) | **41.56** | **25.00**                               | **42.13**                                      | **57.55**                                   |
+| OPT-350M                                                            | 40.62     | 23.97                                   | 41.00                                          | 56.91                                       |
+| [Aira-OPT-1B3](https://huggingface.co/nicholasKluge/Aira-OPT-1B3)   | **43.90** | 28.41                                   | **46.59**                                      | **56.70**                                   |
+| OPT-1.3b                                                            | 40.91     | **29.69**                               | 38.68                                          | 54.36                                       |
 * Evaluations were performed using the [Language Model Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness) (by [EleutherAI](https://www.eleuther.ai/)).