HuBERT - a facebook Collection

updated Jan 16

A collection of checkpoints from the release of HuBERT, a speech encoder that learns representations from unlabelled audio data.

HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units

Paper • 2106.07447 • Published Jun 14, 2021 • 2

Note: The HuBERT paper, published in IEEE/ACM Transactions on Audio, Speech, and Language Processing, Volume 29.
facebook/hubert-base-ls960

Feature Extraction • Updated Nov 5, 2021 • 930k • 47

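Note: The "base" HuBERT model, pre-trained on 960 hours of LibriSpeech audio. It has no ASR head or tokenizer, so it is used for feature extraction or as a starting point for fine-tuning.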
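As a rough illustration of how a "Feature Extraction" checkpoint such as this one might be used, the sketch below loads it with the `transformers` library and returns frame-level hidden states. The helper name `extract_features`, the constants, and the explicit 16 kHz check are assumptions made for this example and are not part of the release; the sketch also assumes the repository ships a preprocessor config that `Wav2Vec2FeatureExtractor.from_pretrained` can read.

```python
FEATURE_MODEL_ID = "facebook/hubert-base-ls960"
EXPECTED_SAMPLING_RATE = 16_000  # assumption: model cards describe 16 kHz input audio


def extract_features(waveform, sampling_rate=EXPECTED_SAMPLING_RATE,
                     model_id=FEATURE_MODEL_ID):
    """Return last-layer hidden states with shape (1, frames, hidden_size).

    `waveform` is a 1-D sequence of float samples (mono audio).
    """
    if sampling_rate != EXPECTED_SAMPLING_RATE:
        raise ValueError(
            f"expected {EXPECTED_SAMPLING_RATE} Hz audio, got {sampling_rate} Hz; "
            "resample before calling this helper"
        )

    # Heavy dependencies are imported lazily so the module itself stays light.
    import torch
    from transformers import HubertModel, Wav2Vec2FeatureExtractor

    feature_extractor = Wav2Vec2FeatureExtractor.from_pretrained(model_id)
    model = HubertModel.from_pretrained(model_id)
    model.eval()

    # Normalise and batch the raw samples into model inputs.
    inputs = feature_extractor(
        waveform, sampling_rate=sampling_rate, return_tensors="pt"
    )
    with torch.no_grad():
        outputs = model(**inputs)
    return outputs.last_hidden_state
```

Calling `extract_features(samples)` on one second of audio downloads the checkpoint on first use and returns a tensor of per-frame embeddings that a downstream classifier could consume.
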
facebook/hubert-large-ll60k

Feature Extraction • Updated Nov 5, 2021 • 40.4k • 24

Note: The "large" HuBERT model, pre-trained on 60k hours of Libri-Light audio (derived from LibriVox).
facebook/hubert-large-ls960-ft

Automatic Speech Recognition • Updated May 24, 2022 • 975k • 59

Note: hubert-large-ll60k fine-tuned on 960 hours of LibriSpeech ASR data.
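The fine-tuned checkpoints carry a CTC head, so they can transcribe audio directly. The following is a minimal sketch of greedy CTC decoding with `transformers`, not a definitive recipe; the helper name `transcribe`, the constants, and the 16 kHz check are assumptions introduced for illustration.

```python
ASR_MODEL_ID = "facebook/hubert-large-ls960-ft"
ASR_SAMPLING_RATE = 16_000  # assumption: model cards describe 16 kHz input audio


def transcribe(waveform, sampling_rate=ASR_SAMPLING_RATE, model_id=ASR_MODEL_ID):
    """Return a greedy CTC transcription of a mono waveform (1-D float samples)."""
    if sampling_rate != ASR_SAMPLING_RATE:
        raise ValueError(
            f"expected {ASR_SAMPLING_RATE} Hz audio, got {sampling_rate} Hz; "
            "resample before calling this helper"
        )

    # Heavy dependencies are imported lazily so the module itself stays light.
    import torch
    from transformers import AutoProcessor, HubertForCTC

    processor = AutoProcessor.from_pretrained(model_id)
    model = HubertForCTC.from_pretrained(model_id)
    model.eval()

    inputs = processor(waveform, sampling_rate=sampling_rate, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits

    # Greedy decoding: pick the most likely token per frame, then let the
    # processor collapse repeats and remove CTC blank tokens.
    predicted_ids = torch.argmax(logits, dim=-1)
    return processor.batch_decode(predicted_ids)[0]
```

Passing `model_id="facebook/hubert-xlarge-ls960-ft"` should select the larger fine-tuned checkpoint in the same way, at a higher memory cost.
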
facebook/hubert-xlarge-ll60k

Feature Extraction • Updated Oct 20, 2021 • 1.65k • 5

Note: The "extra-large" HuBERT model, pre-trained on 60k hours of Libri-Light audio (derived from LibriVox).
facebook/hubert-xlarge-ls960-ft

Automatic Speech Recognition • Updated Jun 27, 2023 • 6.87k • 11

Note: hubert-xlarge-ll60k fine-tuned on 960 hours of LibriSpeech ASR data. This is the best-performing HuBERT checkpoint in the release, achieving a word error rate (WER) of 1.8% and 2.9% on the LibriSpeech test-clean and test-other subsets, respectively.
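For readers unfamiliar with the metric, WER is the word-level edit distance between a reference transcript and a hypothesis, divided by the number of reference words. The sketch below computes it for a single utterance; the function name `word_error_rate` is hypothetical, and the whitespace tokenisation is an assumption. Reported benchmark figures are typically aggregated over a whole test set after text normalisation, so this is an illustration of the definition rather than a reproduction of the evaluation.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref_words = reference.split()
    hyp_words = hypothesis.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")

    # previous[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    previous = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        current = [i] + [0] * len(hyp_words)
        for j, hyp_word in enumerate(hyp_words, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            current[j] = min(
                previous[j] + 1,                      # deletion
                current[j - 1] + 1,                   # insertion
                previous[j - 1] + substitution_cost,  # match or substitution
            )
        previous = current
    return previous[-1] / len(ref_words)


# One substitution ("sat" -> "sit") and one deletion ("the"):
# 2 errors over 6 reference words.
example_wer = word_error_rate("the cat sat on the mat", "the cat sit on mat")
```

A WER of 1.8% therefore corresponds to roughly 18 word errors per 1,000 reference words.
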