Update README.md
README.md CHANGED
@@ -115,20 +115,23 @@ cd auto-round/examples/language-modeling
python3 eval_042/evluation.py --model_name "Intel/Meta-Llama-3.1-70B-Instruct-int4-inc" --eval_bs 16 --tasks lambada_openai,hellaswag,piqa,winogrande,truthfulqa_mc1,openbookqa,boolq,arc_easy,arc_challenge,mmlu,gsm8k --trust_remote_code
```

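The `eval_042` wrapper builds on lm-evaluation-harness (the `042` suffix presumably refers to version 0.4.2). As a rough, unofficial sketch (not part of this card), the same task list can also be run through the harness's Python API; the names below follow lm-eval 0.4.x and should be checked against your installed version:

```python
# Illustrative only: run the same benchmark suite via the lm-eval 0.4.x Python API.
# Multi-GPU sharding and dtype options are backend-specific and omitted here.
from lm_eval import simple_evaluate

results = simple_evaluate(
    model="hf",
    model_args=(
        "pretrained=Intel/Meta-Llama-3.1-70B-Instruct-int4-inc,"
        "trust_remote_code=True"
    ),
    tasks=[
        "lambada_openai", "hellaswag", "piqa", "winogrande",
        "truthfulqa_mc1", "openbookqa", "boolq",
        "arc_easy", "arc_challenge", "mmlu", "gsm8k",
    ],
    batch_size=16,
)

# Print per-task metric dictionaries as reported by the harness.
for task, metrics in results["results"].items():
    print(task, metrics)
```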

| Metric                       | BF16   | INT4 (iters=200) | INT4 (iters=1000) |
|:-----------------------------|:-------|:-----------------|:------------------|
| avg                          | 0.7182 | 0.7119           | 0.7165            |
| mmlu                         | 0.8221 | 0.8136           | 0.8145            |
| lambada_openai               | 0.7566 | 0.7448           | 0.7565            |
| hellaswag                    | 0.6522 | 0.6474           | 0.6492            |
| winogrande                   | 0.7901 | 0.7845           | 0.8090            |
| piqa                         | 0.8308 | 0.8286           | 0.8270            |
| truthfulqa_mc1               | 0.4064 | 0.4002           | 0.4051            |
| openbookqa                   | 0.3720 | 0.3720           | 0.3760            |
| boolq                        | 0.8777 | 0.8780           | 0.8768            |
| arc_easy                     | 0.8674 | 0.8590           | 0.8565            |
| arc_challenge                | 0.6246 | 0.6109           | 0.6160            |
| gsm8k (5-shot, strict match) | 0.8999 | 0.8923           | 0.8954            |

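The `avg` row matches the plain arithmetic mean of the eleven per-task scores. The snippet below (illustrative, not part of the original card) recomputes those averages and shows that both INT4 variants stay within roughly 1% of BF16 overall:

```python
# Recompute the "avg" row of the table above and compare INT4 against BF16.
# Scores are copied from the table; order: mmlu, lambada_openai, hellaswag,
# winogrande, piqa, truthfulqa_mc1, openbookqa, boolq, arc_easy,
# arc_challenge, gsm8k.
scores = {
    "BF16":              [0.8221, 0.7566, 0.6522, 0.7901, 0.8308, 0.4064,
                          0.3720, 0.8777, 0.8674, 0.6246, 0.8999],
    "INT4 (iters=200)":  [0.8136, 0.7448, 0.6474, 0.7845, 0.8286, 0.4002,
                          0.3720, 0.8780, 0.8590, 0.6109, 0.8923],
    "INT4 (iters=1000)": [0.8145, 0.7565, 0.6492, 0.8090, 0.8270, 0.4051,
                          0.3760, 0.8768, 0.8565, 0.6160, 0.8954],
}

avgs = {name: sum(vals) / len(vals) for name, vals in scores.items()}
for name, avg in avgs.items():
    print(f"{name}: avg={avg:.4f} ({avg / avgs['BF16']:.2%} of BF16)")
# avg values reproduce the table: 0.7182, 0.7119, 0.7165
```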

## Ethical Considerations and Limitations