Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated Sep 26 • 269
JudgeBench: A Benchmark for Evaluating LLM-based Judges Paper • 2410.12784 • Published 25 days ago • 42
Language Models can Self-Lengthen to Generate Long Texts Paper • 2410.23933 • Published 10 days ago • 15
Zero-shot Model-based Reinforcement Learning using Large Language Models Paper • 2410.11711 • Published 26 days ago • 8
CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution Paper • 2410.16256 • Published 20 days ago • 58
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree Paper • 2410.16268 • Published 20 days ago • 65
Improve Vision Language Model Chain-of-thought Reasoning Paper • 2410.16198 • Published 20 days ago • 17
Scaling Diffusion Language Models via Adaptation from Autoregressive Models Paper • 2410.17891 • Published 18 days ago • 15
Why Does the Effective Context Length of LLMs Fall Short? Paper • 2410.18745 • Published 17 days ago • 16
Unleashing Reasoning Capability of LLMs via Scalable Question Synthesis from Scratch Paper • 2410.18693 • Published 17 days ago • 40
FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality Paper • 2410.19355 • Published 16 days ago • 20
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning Paper • 2410.02884 • Published Oct 3 • 48
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 8 items • Updated 3 days ago • 89
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 8 items • Updated 6 days ago • 160
Eliminating Oversaturation and Artifacts of High Guidance Scales in Diffusion Models Paper • 2410.02416 • Published Oct 3 • 25
OmniBooth: Learning Latent Control for Image Synthesis with Multi-modal Instruction Paper • 2410.04932 • Published Oct 7 • 9
LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations Paper • 2410.02707 • Published Oct 3 • 48