Spaces:
Running
Running
metadata
title: README
emoji: π
colorFrom: blue
colorTo: gray
sdk: static
pinned: false
π€ Demo | π€ Paper | π arXiv | GitHub
We are a team from AI2, UCSB, and UWaterloo, UCSC, UWM and we are working on benchmarking vision language models.
Team Member: Yujie Lu, Dongfu Jiang, Yingzi Ma, Jing Gu, Michael Saxon
Advisor: Bill Yuchen Lin, Wenhu Chen, Yejin Choi, William Yang Wang
Compare VLMs at WildVision-Arena and WildVision-Bench.
More chat and vote data will be updated reguarly. Eval script is released here WildVision-Bench
Contact: Bill Yuchen Lin ([email protected]) and Yujie Lu ([email protected])
Citation: If you found this huggingface space useful, please consider cite us:
@misc{lu2024wildvision,
title={WildVision: Evaluating Vision-Language Models in the Wild with Human Preferences},
author={Yujie Lu and Dongfu Jiang and Wenhu Chen and William Yang Wang and Yejin Choi and Bill Yuchen Lin},
year={2024},
eprint={2406.11069},
archivePrefix={arXiv},
primaryClass={id='cs.CV' full_name='Computer Vision and Pattern Recognition' is_active=True alt_name=None in_archive='cs' is_general=False description='Covers image processing, computer vision, pattern recognition, and scene understanding. Roughly includes material in ACM Subject Classes I.2.10, I.4, and I.5.'}
}