title: README
emoji: 🌍
colorFrom: blue
colorTo: gray
sdk: static
pinned: false

🤗 Demo | 🤗 Paper | 📖 arXiv | GitHub

We are a team from AI2, UCSB, UWaterloo, UCSC, and UWM working on benchmarking vision-language models.

Team Members: Yujie Lu, Dongfu Jiang, Yingzi Ma, Jing Gu, Michael Saxon

Advisors: Bill Yuchen Lin, Wenhu Chen, Yejin Choi, William Yang Wang

Compare VLMs at WildVision-Arena and WildVision-Bench.

More chat and vote data will be released regularly. The evaluation script is available at WildVision-Bench.

Contact: Bill Yuchen Lin ([email protected]) and Yujie Lu ([email protected])

Citation: If you find this Hugging Face Space useful, please consider citing us:

@misc{lu2024wildvision,
      title={WildVision: Evaluating Vision-Language Models in the Wild with Human Preferences}, 
      author={Yujie Lu and Dongfu Jiang and Wenhu Chen and William Yang Wang and Yejin Choi and Bill Yuchen Lin},
      year={2024},
      eprint={2406.11069},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}