Xuefei Ning's picture

2 8 2

Xuefei Ning

Foxfi

·

http://nics-effalg.com/

walkerning

AI & ML interests

Efficient Deep Learning

Organizations

Foxfi's activity

upvoted a paper 2 days ago

A Survey on Efficient Inference for Large Language Models

Paper • 2404.14294 • Published Apr 22 • 2

upvoted a collection 2 days ago

Papers from the NICS-EFFALG Team

8 items • Updated 2 days ago • 2

upvoted a paper 4 months ago

MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization

Paper • 2405.17873 • Published May 28 • 2

upvoted 4 papers 5 months ago

Can LLMs Learn by Teaching? A Preliminary Study

Paper • 2406.14629 • Published Jun 20 • 17

MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression

Paper • 2406.14909 • Published Jun 21 • 13

ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation

Paper • 2406.02540 • Published Jun 4 • 2

DiTFastAttn: Attention Compression for Diffusion Transformer Models

Paper • 2406.08552 • Published Jun 12 • 22

upvoted a paper over 1 year ago

Skeleton-of-Thought: Large Language Models Can Do Parallel Decoding

Paper • 2307.15337 • Published Jul 28, 2023 • 36