Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale Paper • 2409.08264 • Published Sep 12 • 43
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions Paper • 2409.15278 • Published Sep 23 • 22
Agent S: An Open Agentic Framework that Uses Computers Like a Human Paper • 2410.08164 • Published about 1 month ago • 24