reazonspeech-espnet-v1

reazonspeech-espnet-v1 is an ESPnet model trained for Japanese automatic speech recognition (ASR).

This model was trained on 15,000 hours of ReazonSpeech corpus.
Make sure that your audio file is sampled at 16khz when using this model.

For more details, please visit the official project page.

Downloads last month: 5

Inference Examples

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Dallyana
/

daya24mile_asr

reazonspeech-espnet-v1

Dataset used to train Dallyana/daya24mile_asr