MLLM - a kaufmane Collection

MLLM

updated Mar 18

MoAI: Mixture of All Intelligence for Large Language and Vision Models

Paper • 2403.07508 • Published Mar 12 • 75

VideoAgent: Long-form Video Understanding with Large Language Model as Agent

Paper • 2403.10517 • Published Mar 15 • 31