xxx

wasd003

AI & ML interests

None yet

Organizations

None yet

wasd003's activity

upvoted 2 papers 5 months ago

PowerInfer-2: Fast Large Language Model Inference on a Smartphone

Paper • 2406.06282 • Published Jun 10 • 36

Turbo Sparse: Achieving LLM SOTA Performance with Minimal Activated Parameters

Paper • 2406.05955 • Published Jun 10 • 22