update model card README.md
Browse files
README.md
CHANGED
@@ -1,48 +1,32 @@
|
|
1 |
---
|
2 |
-
language:
|
3 |
-
- fr
|
4 |
-
license: apache-2.0
|
5 |
tags:
|
6 |
-
- automatic-speech-recognition
|
7 |
-
- mozilla-foundation/common_voice_8_0
|
8 |
- generated_from_trainer
|
9 |
-
- robust-speech-event
|
10 |
model-index:
|
11 |
-
- name:
|
12 |
-
results:
|
13 |
-
- task:
|
14 |
-
name: Automatic Speech Recognition
|
15 |
-
type: automatic-speech-recognition
|
16 |
-
dataset:
|
17 |
-
name: Common Voice 8
|
18 |
-
type: mozilla-foundation/common_voice_8_0
|
19 |
-
args: fr
|
20 |
-
metrics:
|
21 |
-
- name: Test WER
|
22 |
-
type: wer
|
23 |
-
value: 18.33
|
24 |
-
- name: Test CER
|
25 |
-
type: cer
|
26 |
-
value: 5.60
|
27 |
-
- task:
|
28 |
-
name: Automatic Speech Recognition
|
29 |
-
type: automatic-speech-recognition
|
30 |
-
dataset:
|
31 |
-
name: Robust Speech Event - Dev Data
|
32 |
-
type: speech-recognition-community-v2/dev_data
|
33 |
-
args: fr
|
34 |
-
metrics:
|
35 |
-
- name: Test WER
|
36 |
-
type: wer
|
37 |
-
value: 60.25
|
38 |
-
- name: Test CER
|
39 |
-
type: cer
|
40 |
-
value: 15.68
|
41 |
---
|
42 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
43 |
## Model description
|
44 |
|
45 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
46 |
|
47 |
## Training procedure
|
48 |
|
@@ -58,7 +42,7 @@ The following hyperparameters were used during training:
|
|
58 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
59 |
- lr_scheduler_type: linear
|
60 |
- lr_scheduler_warmup_steps: 2000
|
61 |
-
- num_epochs:
|
62 |
- mixed_precision_training: Native AMP
|
63 |
|
64 |
### Training results
|
@@ -78,11 +62,14 @@ The following hyperparameters were used during training:
|
|
78 |
| 0.8156 | 3.15 | 11000 | inf | 0.1936 |
|
79 |
| 0.8015 | 3.44 | 12000 | inf | 0.1880 |
|
80 |
| 0.774 | 3.73 | 13000 | inf | 0.1834 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
81 |
|
82 |
-
It achieves the best result on the validation set on STEP 13000:
|
83 |
-
- Wer: 0.1834
|
84 |
-
|
85 |
-
Some problem occurs when calculating the validation loss.
|
86 |
|
87 |
### Framework versions
|
88 |
|
@@ -90,16 +77,3 @@ Some problem occurs when calculating the validation loss.
|
|
90 |
- Pytorch 1.10.2+cu102
|
91 |
- Datasets 1.18.3.dev0
|
92 |
- Tokenizers 0.11.0
|
93 |
-
|
94 |
-
### Evaluation Commands
|
95 |
-
1. To evaluate on `mozilla-foundation/common_voice_8` with split `test`
|
96 |
-
|
97 |
-
```bash
|
98 |
-
python eval.py --model_id Plim/xls-r-1b-cv_8-fr --dataset mozilla-foundation/common_voice_8_0 --config fr --split test
|
99 |
-
```
|
100 |
-
|
101 |
-
2. To evaluate on `speech-recognition-community-v2/dev_data`
|
102 |
-
|
103 |
-
```bash
|
104 |
-
python eval.py --model_id Plim/xls-r-1b-cv_8-fr --dataset speech-recognition-community-v2/dev_data --config fr --split validation --chunk_length_s 5.0 --stride_length_s 1.0
|
105 |
-
```
|
|
|
1 |
---
|
|
|
|
|
|
|
2 |
tags:
|
|
|
|
|
3 |
- generated_from_trainer
|
|
|
4 |
model-index:
|
5 |
+
- name: ''
|
6 |
+
results: []
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
7 |
---
|
8 |
|
9 |
+
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
10 |
+
should probably proofread and complete it, then remove this comment. -->
|
11 |
+
|
12 |
+
#
|
13 |
+
|
14 |
+
This model is a fine-tuned version of [./checkpoint-13000](https://huggingface.co/./checkpoint-13000) on the None dataset.
|
15 |
+
It achieves the following results on the evaluation set:
|
16 |
+
- Loss: inf
|
17 |
+
- Wer: 0.1854
|
18 |
+
|
19 |
## Model description
|
20 |
|
21 |
+
More information needed
|
22 |
+
|
23 |
+
## Intended uses & limitations
|
24 |
+
|
25 |
+
More information needed
|
26 |
+
|
27 |
+
## Training and evaluation data
|
28 |
+
|
29 |
+
More information needed
|
30 |
|
31 |
## Training procedure
|
32 |
|
|
|
42 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
43 |
- lr_scheduler_type: linear
|
44 |
- lr_scheduler_warmup_steps: 2000
|
45 |
+
- num_epochs: 6.0
|
46 |
- mixed_precision_training: Native AMP
|
47 |
|
48 |
### Training results
|
|
|
62 |
| 0.8156 | 3.15 | 11000 | inf | 0.1936 |
|
63 |
| 0.8015 | 3.44 | 12000 | inf | 0.1880 |
|
64 |
| 0.774 | 3.73 | 13000 | inf | 0.1834 |
|
65 |
+
| 0.8372 | 4.01 | 14000 | inf | 0.1934 |
|
66 |
+
| 0.8075 | 4.3 | 15000 | inf | 0.1923 |
|
67 |
+
| 0.8069 | 4.59 | 16000 | inf | 0.1877 |
|
68 |
+
| 0.8064 | 4.87 | 17000 | inf | 0.1955 |
|
69 |
+
| 0.801 | 5.16 | 18000 | inf | 0.1891 |
|
70 |
+
| 0.8022 | 5.45 | 19000 | inf | 0.1895 |
|
71 |
+
| 0.792 | 5.73 | 20000 | inf | 0.1854 |
|
72 |
|
|
|
|
|
|
|
|
|
73 |
|
74 |
### Framework versions
|
75 |
|
|
|
77 |
- Pytorch 1.10.2+cu102
|
78 |
- Datasets 1.18.3.dev0
|
79 |
- Tokenizers 0.11.0
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
wandb/run-20220206_201634-uhiy9e2t/files/output.log
CHANGED
@@ -11355,3 +11355,17 @@ Configuration saved in ./preprocessor_config.json
|
|
11355 |
Saving model checkpoint to ./
|
11356 |
Configuration saved in ./config.json
|
11357 |
Model weights saved in ./pytorch_model.bin
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
11355 |
Saving model checkpoint to ./
|
11356 |
Configuration saved in ./config.json
|
11357 |
Model weights saved in ./pytorch_model.bin
|
11358 |
+
Configuration saved in ./preprocessor_config.json
|
11359 |
+
Several commits (2) will be pushed upstream.
|
11360 |
+
The progress bars may be unreliable.
|
11361 |
+
02/08/2022 03:24:48 - WARNING - huggingface_hub.repository - Several commits (2) will be pushed upstream.
|
11362 |
+
02/08/2022 03:24:48 - WARNING - huggingface_hub.repository - The progress bars may be unreliable.
|
11363 |
+
|
11364 |
+
|
11365 |
+
|
11366 |
+
Upload file wandb/run-20220206_201634-uhiy9e2t/run-uhiy9e2t.wandb: 98%|βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ | 114M/117M [00:06<00:00, 21.0MB/s]
|
11367 |
+
02/08/2022 03:24:58 - WARNING - huggingface_hub.repository - To https://huggingface.co/Plim/xls-r-1b-cv_8-fr
|
11368 |
+
Upload file wandb/run-20220206_201634-uhiy9e2t/run-uhiy9e2t.wandb: 98%|βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ | 114M/117M [00:06<00:00, 21.0MB/s]To https://huggingface.co/Plim/xls-r-1b-cv_8-fr
|
11369 |
+
89ae304..5c2523c main -> main
|
11370 |
+
Upload file wandb/run-20220206_201634-uhiy9e2t/run-uhiy9e2t.wandb: 100%|ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ| 117M/117M [00:08<00:00, 15.1MB/s]
|
11371 |
+
Dropping the following result as it does not have all the necessary fields:
|
wandb/run-20220206_201634-uhiy9e2t/logs/debug-internal.log
CHANGED
@@ -32484,3 +32484,15 @@ requests.exceptions.HTTPError: 500 Server Error: Internal Server Error for url:
|
|
32484 |
2022-02-08 03:24:36,740 DEBUG HandlerThread:10249 [handler.py:handle_request():131] handle_request: stop_status
|
32485 |
2022-02-08 03:24:36,741 DEBUG SenderThread:10249 [sender.py:send_request():249] send_request: stop_status
|
32486 |
2022-02-08 03:24:39,519 DEBUG SenderThread:10249 [sender.py:send():235] send: stats
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
32484 |
2022-02-08 03:24:36,740 DEBUG HandlerThread:10249 [handler.py:handle_request():131] handle_request: stop_status
|
32485 |
2022-02-08 03:24:36,741 DEBUG SenderThread:10249 [sender.py:send_request():249] send_request: stop_status
|
32486 |
2022-02-08 03:24:39,519 DEBUG SenderThread:10249 [sender.py:send():235] send: stats
|
32487 |
+
2022-02-08 03:24:51,012 INFO Thread-8 :10249 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-1b-cv_8-fr/wandb/run-20220206_201634-uhiy9e2t/files/output.log
|
32488 |
+
2022-02-08 03:24:51,911 DEBUG HandlerThread:10249 [handler.py:handle_request():131] handle_request: stop_status
|
32489 |
+
2022-02-08 03:24:51,912 DEBUG SenderThread:10249 [sender.py:send_request():249] send_request: stop_status
|
32490 |
+
2022-02-08 03:24:53,013 INFO Thread-8 :10249 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-1b-cv_8-fr/wandb/run-20220206_201634-uhiy9e2t/files/output.log
|
32491 |
+
2022-02-08 03:24:55,014 INFO Thread-8 :10249 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-1b-cv_8-fr/wandb/run-20220206_201634-uhiy9e2t/files/output.log
|
32492 |
+
2022-02-08 03:24:57,016 INFO Thread-8 :10249 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-1b-cv_8-fr/wandb/run-20220206_201634-uhiy9e2t/files/output.log
|
32493 |
+
2022-02-08 03:24:59,017 INFO Thread-8 :10249 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-1b-cv_8-fr/wandb/run-20220206_201634-uhiy9e2t/files/output.log
|
32494 |
+
2022-02-08 03:25:01,018 INFO Thread-8 :10249 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-1b-cv_8-fr/wandb/run-20220206_201634-uhiy9e2t/files/output.log
|
32495 |
+
2022-02-08 03:25:05,021 INFO Thread-8 :10249 [dir_watcher.py:_on_file_modified():230] file/dir modified: /workspace/xls-r-1b-cv_8-fr/wandb/run-20220206_201634-uhiy9e2t/files/output.log
|
32496 |
+
2022-02-08 03:25:07,248 DEBUG HandlerThread:10249 [handler.py:handle_request():131] handle_request: stop_status
|
32497 |
+
2022-02-08 03:25:07,248 DEBUG SenderThread:10249 [sender.py:send_request():249] send_request: stop_status
|
32498 |
+
2022-02-08 03:25:09,751 DEBUG SenderThread:10249 [sender.py:send():235] send: stats
|
wandb/run-20220206_201634-uhiy9e2t/run-uhiy9e2t.wandb
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:160673feed9ea3ddc328247490fe7d2950cd358fb40b4b827edc8123c80ab0af
|
3 |
+
size 123076744
|