DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos Paper • 2409.02095 • Published 17 days ago • 32
General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model Paper • 2409.01704 • Published 17 days ago • 72
CDM: A Reliable Metric for Fair and Accurate Formula Recognition Evaluation Paper • 2409.03643 • Published 15 days ago • 18
Evaluating Multiview Object Consistency in Humans and Image Models Paper • 2409.05862 • Published 11 days ago • 8
LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation Paper • 2409.06703 • Published 10 days ago • 2
Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models Paper • 2409.07452 • Published 9 days ago • 18
Instant Facial Gaussians Translator for Relightable and Interactable Facial Rendering Paper • 2409.07441 • Published 9 days ago • 9
InstantDrag: Improving Interactivity in Drag-based Image Editing Paper • 2409.08857 • Published 7 days ago • 24