OS-ATLAS: A Foundation Action Model for Generalist GUI Agents Paper • 2410.23218 • Published 11 days ago • 43
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 8 items • Updated 3 days ago • 89
MarDini: Masked Autoregressive Diffusion for Video Generation at Scale Paper • 2410.20280 • Published 14 days ago • 21
CogVLM2 Collection This collection hosts the repos of THUDM's CogVLM2 releases • 8 items • Updated Aug 18 • 18
LoLCATS Collection Linearizing LLMs with high quality and efficiency. We linearize the full Llama 3.1 model family -- 8b, 70b, 405b -- for the first time! • 4 items • Updated 27 days ago • 14
based Collection These language model checkpoints are trained at the 360M and 1.3Bn parameter scales for up to 50Bn tokens on the Pile corpus, for research purposes. • 15 items • Updated 23 days ago • 9
Animate-X: Universal Character Image Animation with Enhanced Motion Representation Paper • 2410.10306 • Published 27 days ago • 50
Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention Paper • 2410.10774 • Published 27 days ago • 23
Loradex Highlights Collection This collection features awesome opensource LoRAs trained by members of the Glif Community during Loradex Early Access! • 14 items • Updated 23 days ago • 18
Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations Paper • 2410.10792 • Published 27 days ago • 26
African History Collection A collection of data on the history of mankind • 5 items • Updated about 20 hours ago • 1
VideoGuide: Improving Video Diffusion Models without Training Through a Teacher's Guide Paper • 2410.04364 • Published Oct 6 • 26
Vript Collection A large-scale video-text dataset of high-resolution videos annotated with dense and detailed captions. • 9 items • Updated 25 days ago • 3
STELLA 3 Collection All the finished tools made for Project STELLA 3.0 • 12 items • Updated Mar 10 • 1