CogVLM2: Visual Language Models for Image and Video Understanding Paper • 2408.16500 • Published 22 days ago • 55
Learning to Move Like Professional Counter-Strike Players Paper • 2408.13934 • Published 26 days ago • 21
Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published 29 days ago • 109
Towards a Unified View of Preference Learning for Large Language Models: A Survey Paper • 2409.02795 • Published 16 days ago • 70
Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale Paper • 2409.08264 • Published 8 days ago • 39