OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models Paper β’ 2411.04905 β’ Published 3 days ago β’ 82
RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning Paper β’ 2410.02089 β’ Published Oct 2 β’ 10
PhysDreamer: Physics-Based Interaction with 3D Objects via Video Generation Paper β’ 2404.13026 β’ Published Apr 19 β’ 23
AutoTrain: No-code training for state-of-the-art models Paper β’ 2410.15735 β’ Published 20 days ago β’ 55
view article Article Transformers.js v3: WebGPU support, new models & tasks, and more⦠19 days ago ⒠58
LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding Paper β’ 2404.16710 β’ Published Apr 25 β’ 73
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers Paper β’ 2408.06195 β’ Published Aug 12 β’ 61
Gemma-APS Release Collection Gemma models for text-to-propositions segmentation. The models are distilled from fine-tuned Gemini Pro model applied to multi-domain synthetic data. β’ 3 items β’ Updated 26 days ago β’ 19
StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization Paper β’ 2410.08815 β’ Published 30 days ago β’ 41
EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions Paper β’ 2409.18042 β’ Published Sep 26 β’ 36
MaskedMimic: Unified Physics-Based Character Control Through Masked Motion Inpainting Paper β’ 2409.14393 β’ Published Sep 22 β’ 7
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 β’ 22 items β’ Updated 1 day ago β’ 92
Imagine yourself: Tuning-Free Personalized Image Generation Paper β’ 2409.13346 β’ Published Sep 20 β’ 67