Collections
Collections including paper arxiv:2401.04695
- TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones
  Paper • 2312.16862 • Published • 30
- Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action
  Paper • 2312.17172 • Published • 26
- Towards Truly Zero-shot Compositional Visual Reasoning with LLMs as Programmers
  Paper • 2401.01974 • Published • 5
- From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations
  Paper • 2401.01885 • Published • 27

- GAIA: a benchmark for General AI Assistants
  Paper • 2311.12983 • Published • 183
- Rank-without-GPT: Building GPT-Independent Listwise Rerankers on Open-Source Large Language Models
  Paper • 2312.02969 • Published • 12
- Axiomatic Preference Modeling for Longform Question Answering
  Paper • 2312.02206 • Published • 7
- Alignment for Honesty
  Paper • 2312.07000 • Published • 11

- GPQA: A Graduate-Level Google-Proof Q&A Benchmark
  Paper • 2311.12022 • Published • 25
- GAIA: a benchmark for General AI Assistants
  Paper • 2311.12983 • Published • 183
- gorilla-llm/APIBench
  Updated • 142 • 63
- Purple Llama CyberSecEval: A Secure Coding Benchmark for Language Models
  Paper • 2312.04724 • Published • 20