Building a great multi-lingual teacher with sparsely-gated mixture of experts for speech recognition Paper • 2112.05820 • Published Dec 10, 2021 • 2
SpeechMoE2: Mixture-of-Experts Model with Improved Routing Paper • 2111.11831 • Published Nov 23, 2021 • 2