Eni Grand's picture

Eni Grand

Enigrand

·

AI & ML interests

None yet

Organizations

Enigrand's activity

upvoted a paper about 2 hours ago

SVDQunat: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

Paper • 2411.05007 • Published 3 days ago • 14

upvoted a paper about 12 hours ago

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Paper • 2411.04905 • Published 3 days ago • 82

upvoted a paper 4 days ago

How Far is Video Generation from World Model: A Physical Law Perspective

Paper • 2411.02385 • Published 6 days ago • 27

upvoted a paper 5 days ago

Randomized Autoregressive Visual Generation

Paper • 2411.00776 • Published 9 days ago • 17

upvoted a paper 8 days ago

NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks

Paper • 2410.20650 • Published 13 days ago • 14

upvoted a collection 19 days ago

Stable Diffusion 3.5

6 items • Updated 12 days ago • 83

upvoted a paper 22 days ago

Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens

Paper • 2410.13863 • Published 24 days ago • 35

upvoted a paper 23 days ago

Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation

Paper • 2410.13848 • Published 24 days ago • 27

upvoted a paper 26 days ago

Tree of Problems: Improving structured problem solving with compositionality

Paper • 2410.06634 • Published Oct 9 • 8

upvoted 2 papers about 1 month ago

Pixtral 12B

Paper • 2410.07073 • Published Oct 9 • 59

Differential Transformer

Paper • 2410.05258 • Published Oct 7 • 165

upvoted a collection about 1 month ago

AuraFlow

AuraFlow v0.x series, to date the largest (6.8B) and highest fidelity (0.7+ on GenEval) open sourced text to image model. • 3 items • Updated Sep 6 • 5

upvoted 3 papers about 1 month ago

Not All LLM Reasoners Are Created Equal

Paper • 2410.01748 • Published Oct 2 • 27

From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging

Paper • 2410.01215 • Published Oct 2 • 30

TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices

Paper • 2410.00531 • Published Oct 1 • 28

upvoted an article about 1 month ago

Article

A failed experiment: Infini-Attention, and why we should keep trying?

Aug 14

• 48

upvoted 2 collections about 1 month ago

Qwen2-VL

Vision-language model series based on Qwen2 • 15 items • Updated Sep 18 • 149

RDNet

DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs [ECCV 2024] • 9 items • Updated 25 days ago • 3

upvoted 2 papers about 2 months ago

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25 • 59

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25 • 99