learning - a xieyuquan Collection

xieyuquan 's Collections

rlhf

arch

dpo

learning

updated 1 day ago

Law of Vision Representation in MLLMs

Paper • 2408.16357 • Published 22 days ago • 92
CogVLM2: Visual Language Models for Image and Video Understanding

Paper • 2408.16500 • Published 22 days ago • 55
Learning to Move Like Professional Counter-Strike Players

Paper • 2408.13934 • Published 26 days ago • 21
Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published 29 days ago • 109
Towards a Unified View of Preference Learning for Large Language Models: A Survey

Paper • 2409.02795 • Published 16 days ago • 70
Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale

Paper • 2409.08264 • Published 8 days ago • 39
Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published 1 day ago • 75