Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models • Paper • 2411.04996 • Published 3 days ago • 34
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models • Paper • 2411.04905 • Published 3 days ago • 82
From Medprompt to o1: Exploration of Run-Time Strategies for Medical Challenge Problems and Beyond • Paper • 2411.03590 • Published 4 days ago • 9
Language Models can Self-Lengthen to Generate Long Texts • Paper • 2410.23933 • Published 10 days ago • 15
Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders • Paper • 2410.22366 • Published 13 days ago • 71
LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding • Paper • 2410.17434 • Published 19 days ago • 24
EMMA: End-to-End Multimodal Model for Autonomous Driving • Paper • 2410.23262 • Published 11 days ago • 2
LongReward: Improving Long-context Large Language Models with AI Feedback • Paper • 2410.21252 • Published 13 days ago • 16
Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback • Paper • 2410.19133 • Published 17 days ago • 11
A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs • Paper • 2410.18779 • Published 17 days ago • 1
Improve Vision Language Model Chain-of-thought Reasoning • Paper • 2410.16198 • Published 20 days ago • 17
FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors • Paper • 2410.16271 • Published 20 days ago • 80
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree • Paper • 2410.16268 • Published 20 days ago • 65
VibeCheck: Discover and Quantify Qualitative Differences in Large Language Models • Paper • 2410.12851 • Published about 1 month ago • 1