lmzheng commited on
Commit
80f3688
1 Parent(s): e785283

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -41,6 +41,8 @@ See more details in the "Training Details of Vicuna Models" section in the appen
41
 
42
  ## Evaluation
43
 
 
 
44
  Vicuna is evaluated with standard benchmarks, human preference, and LLM-as-a-judge. See more details in this [paper](https://arxiv.org/pdf/2306.05685.pdf) and [leaderboard](https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard).
45
 
46
  ## Difference between different versions of Vicuna
 
41
 
42
  ## Evaluation
43
 
44
+ ![Evaluation Results](https://github.com/lm-sys/lm-sys.github.io/blob/main/public/images/webdata/vicuna_v1.5_eval.png?raw=true)
45
+
46
  Vicuna is evaluated with standard benchmarks, human preference, and LLM-as-a-judge. See more details in this [paper](https://arxiv.org/pdf/2306.05685.pdf) and [leaderboard](https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard).
47
 
48
  ## Difference between different versions of Vicuna