Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages Paper • 2410.16153 • Published 20 days ago • 42
Multimodal Models Collection Multimodal models with leading performance. • 14 items • Updated 19 days ago • 15
Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws Paper • 2401.00448 • Published Dec 31, 2023 • 28
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI Paper • 2311.16502 • Published Nov 27, 2023 • 35
Benchmarking Sequential Visual Input Reasoning and Prediction in Multimodal Large Language Models Paper • 2310.13473 • Published Oct 20, 2023 • 1
DETR Doesn't Need Multi-Scale or Locality Design Paper • 2308.01904 • Published Aug 3, 2023 • 7