khanhld committed on
Commit 17cbb76
1 Parent(s): 552b1c8

update readme

Files changed (1):
  1. README.md +11 -9
README.md CHANGED

```diff
@@ -52,18 +52,20 @@ model-index:
 # Vietnamese Speech Recognition using Wav2vec 2.0
 ### Table of contents
 1. [Model Description](#description)
-2. [Benchmark Result](#benchmark)
-3. [Example Usage](#example)
-4. [Evaluation](#evaluation)
-5. [Contact](#contact)
+2. [Implementation](#implementation)
+3. [Benchmark Result](#benchmark)
+4. [Example Usage](#example)
+5. [Evaluation](#evaluation)
+6. [Contact](#contact)
 
 <a name = "description" ></a>
 ### Model Description
-Fine-tuned the Wav2vec2-based model on about 160 hours of Vietnamese speech dataset from different resources including [VIOS](https://huggingface.co/datasets/vivos), [COMMON VOICE](https://huggingface.co/datasets/mozilla-foundation/common_voice_8_0), [FOSD](https://data.mendeley.com/datasets/k9sxg2twv4/4) and [VLSP 100h](https://drive.google.com/file/d/1vUSxdORDxk-ePUt-bUVDahpoXiqKchMx/view). We have not yet incorporated the Language Model into our ASR system but still gained a promising result.
-<br>
-We also provide code for Pre-training and Fine-tuning the Wav2vec2 model (not available for now but will release soon). If you wish to train on your dataset, check it out here:
-- [Pretrain](https://github.com/khanld/ASR-Wav2vec-Pretrain)
-- [Finetune](https://github.com/khanld/ASR-Wa2vec-Finetune)
+Fine-tuned the Wav2vec2-based model on about 160 hours of Vietnamese speech dataset from different resources, including [VIOS](https://huggingface.co/datasets/vivos), [COMMON VOICE](https://huggingface.co/datasets/mozilla-foundation/common_voice_8_0), [FOSD](https://data.mendeley.com/datasets/k9sxg2twv4/4) and [VLSP 100h](https://drive.google.com/file/d/1vUSxdORDxk-ePUt-bUVDahpoXiqKchMx/view). We have not yet incorporated the Language Model into our ASR system but still gained a promising result.
+<a name = "implementation" ></a>
+### Implementation
+We also provide code for Pre-training and Fine-tuning the Wav2vec2 model. If you wish to train on your dataset, check it out here:
+- [Pre-train code](https://github.com/khanld/ASR-Wav2vec-Pretrain) (not available for now but will release soon)
+- [Fine-tune code](https://github.com/khanld/ASR-Wa2vec-Finetune)
 </br>
 
 <a name = "benchmark" ></a>
```
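The README notes that no language model has been incorporated into the ASR system yet. In that setting, a wav2vec 2.0 CTC model's frame-level predictions are typically reduced to text by greedy CTC decoding: collapse consecutive repeated tokens, then drop the blank token. A minimal sketch of that step, where the function name, blank id, and toy vocabulary are illustrative assumptions rather than code from the repo:

```python
def ctc_greedy_decode(token_ids, blank_id=0):
    """Collapse consecutive repeats, then remove CTC blank tokens."""
    out = []
    prev = None
    for t in token_ids:
        # Emit a token only when it differs from the previous frame
        # and is not the blank symbol.
        if t != prev and t != blank_id:
            out.append(t)
        prev = t
    return out

# Toy vocabulary for illustration only (a real checkpoint defines its own).
vocab = {1: "x", 2: "i", 3: "n", 4: " ", 5: "c", 6: "h", 7: "à", 8: "o"}

# Simulated per-frame argmax ids, with repeats and blanks (id 0).
frame_ids = [1, 1, 0, 2, 3, 3, 0, 4, 5, 6, 7, 7, 0, 8]
text = "".join(vocab[i] for i in ctc_greedy_decode(frame_ids))
print(text)  # xin chào
```

Adding an external language model would replace this greedy step with beam search that rescores candidate transcripts, which is the improvement the README hints at.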