HuBERT
A collection of checkpoints from the release of HuBERT, a speech encoder that learns representations from unlabelled audio data.
Paper • 2106.07447 • Published • 2
Note: The HuBERT paper, published in IEEE/ACM Transactions on Audio, Speech, and Language Processing, Volume 29.
facebook/hubert-base-ls960
Feature Extraction • Updated • 930k • 47
Note: The "base" HuBERT model, pre-trained on 960 hours of LibriSpeech audio. It has not been fine-tuned for ASR.
facebook/hubert-large-ll60k
Feature Extraction • Updated • 40.4k • 24
Note: The "large" HuBERT model, pre-trained on 60k hours of Libri-Light audio (derived from LibriVox).
facebook/hubert-large-ls960-ft
Automatic Speech Recognition • Updated • 975k • 59
Note: A version of hubert-large-ll60k fine-tuned on 960 hours of LibriSpeech ASR data.
facebook/hubert-xlarge-ll60k
Feature Extraction • Updated • 1.65k • 5
Note: The "extra-large" HuBERT model, pre-trained on 60k hours of Libri-Light audio (derived from LibriVox).
facebook/hubert-xlarge-ls960-ft
Automatic Speech Recognition • Updated • 6.87k • 11
Note: A version of hubert-xlarge-ll60k fine-tuned on 960 hours of LibriSpeech ASR data. This is the most performant HuBERT checkpoint in the release, achieving a WER of 1.8%/2.9% on the LibriSpeech test-clean/test-other subsets, respectively.
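
As a rough illustration of how the fine-tuned checkpoints in this collection might be used and scored, the sketch below transcribes a waveform with a CTC checkpoint and computes word error rate (WER) against a reference transcript. It assumes the `transformers` and `torch` packages are installed and that the input is mono audio sampled at 16 kHz. The helper names `transcribe` and `word_error_rate` are hypothetical and not part of the release, and the WER function is a plain word-level edit distance, not the evaluation script behind the reported numbers.

```python
# Minimal sketch: transcribe with a fine-tuned HuBERT checkpoint and score WER.
# `transcribe` and `word_error_rate` are illustrative helpers (assumptions).

ASR_CHECKPOINT = "facebook/hubert-large-ls960-ft"


def transcribe(waveform, checkpoint=ASR_CHECKPOINT, sampling_rate=16_000):
    """Return the greedy CTC transcription of a 1-D float waveform.

    Assumes `waveform` is mono audio at `sampling_rate` Hz (16 kHz here).
    """
    # Imported lazily so the WER helper below works without these packages.
    import torch
    from transformers import HubertForCTC, Wav2Vec2Processor

    processor = Wav2Vec2Processor.from_pretrained(checkpoint)
    model = HubertForCTC.from_pretrained(checkpoint)

    inputs = processor(waveform, sampling_rate=sampling_rate, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits

    # Greedy decoding: pick the most likely token per frame, then let the
    # processor collapse repeats and blanks into text.
    predicted_ids = torch.argmax(logits, dim=-1)
    return processor.batch_decode(predicted_ids)[0]


def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Scoring only; calling `transcribe` would download the checkpoint.
    print(word_error_rate("the cat sat on the mat", "the cat sat on a mat"))
```

When comparing a model transcription with a reference, it is usually worth normalising case and punctuation on both sides first, since the scoring above treats any differing strings as different words. The pre-trained checkpoints tagged "Feature Extraction" have no CTC head, so they would be loaded with `HubertModel` to obtain hidden states instead of text.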