MooBai commited on
Commit
c632e6b
1 Parent(s): c80cc38

Training in progress epoch 0

Browse files
README.md CHANGED
@@ -15,8 +15,8 @@ probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [KoichiYasuoka/roberta-classical-chinese-base-char](https://huggingface.co/KoichiYasuoka/roberta-classical-chinese-base-char) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Train Loss: 1.7947
19
- - Validation Loss: 0.9816
20
  - Epoch: 0
21
 
22
  ## Model description
@@ -43,7 +43,7 @@ The following hyperparameters were used during training:
43
 
44
  | Train Loss | Validation Loss | Epoch |
45
  |:----------:|:---------------:|:-----:|
46
- | 1.7947 | 0.9816 | 0 |
47
 
48
 
49
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [KoichiYasuoka/roberta-classical-chinese-base-char](https://huggingface.co/KoichiYasuoka/roberta-classical-chinese-base-char) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Train Loss: 0.7456
19
+ - Validation Loss: 0.0584
20
  - Epoch: 0
21
 
22
  ## Model description
 
43
 
44
  | Train Loss | Validation Loss | Epoch |
45
  |:----------:|:---------------:|:-----:|
46
+ | 0.7456 | 0.0584 | 0 |
47
 
48
 
49
  ### Framework versions
config.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "_name_or_path": "KoichiYasuoka/roberta-classical-chinese-base-char",
3
  "architectures": [
4
- "RobertaForMaskedLM"
5
  ],
6
  "attention_probs_dropout_prob": 0.1,
7
  "bos_token_id": 0,
 
1
  {
2
  "_name_or_path": "KoichiYasuoka/roberta-classical-chinese-base-char",
3
  "architectures": [
4
+ "RobertaForCausalLM"
5
  ],
6
  "attention_probs_dropout_prob": 0.1,
7
  "bos_token_id": 0,
generation_config.json ADDED
@@ -0,0 +1,7 @@
 
 
 
 
 
 
 
 
1
+ {
2
+ "_from_model_config": true,
3
+ "bos_token_id": 0,
4
+ "eos_token_id": 2,
5
+ "pad_token_id": 1,
6
+ "transformers_version": "4.38.2"
7
+ }
logs/train/events.out.tfevents.1713373181.d16d92757244.2533.0.v2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c7e8fdc5880eed67e5882b2c646221c82795ffad94381fdc69546847935aaff5
3
+ size 2393306
logs/train/events.out.tfevents.1713373844.d16d92757244.9304.0.v2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:80b237f373d99486ca8aca60921c3f90612cf6986adfe4d0aedbf9273b13fb90
3
+ size 2393372
logs/validation/events.out.tfevents.1713374414.d16d92757244.9304.1.v2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:50d403c7e92e383e00c6fb3e0ad536c46e714d7c9b552c7ac9b68d076e6db70b
3
+ size 232
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c3626d98357e1edb5213028e9af1e46124a5ae261f847e57bdd7e8696e34255f
3
  size 507845000
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ceb893f4e2499917dbe7e48d72d42df383c19241010b81624f189d042aba4856
3
  size 507845000