Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models • Paper • 2408.15518 • Published 23 days ago • 41
MobileQuant: Mobile-friendly Quantization for On-device Language Models • Paper • 2408.13933 • Published 26 days ago • 13