vedantjumle
/

bert-2

@@ -15,10 +15,10 @@ probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [bert-large-uncased](https://huggingface.co/bert-large-uncased) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 0.0717
-- Validation Loss: 0.4388
-- Train Accuracy: 0.9067
-- Epoch: 27
 ## Model description
@@ -44,34 +44,7 @@ The following hyperparameters were used during training:
 | Train Loss | Validation Loss | Train Accuracy | Epoch |
 |:----------:|:---------------:|:--------------:|:-----:|
-| 4.9809     | 4.7828          | 0.0333         | 0     |
-| 4.5685     | 4.3200          | 0.2233         | 1     |
-| 3.9808     | 3.6775          | 0.5833         | 2     |
-| 3.3170     | 3.0567          | 0.72           | 3     |
-| 2.7260     | 2.5223          | 0.7967         | 4     |
-| 2.1996     | 2.0628          | 0.8167         | 5     |
-| 1.7561     | 1.7042          | 0.84           | 6     |
-| 1.4166     | 1.4208          | 0.86           | 7     |
-| 1.1157     | 1.1895          | 0.8733         | 8     |
-| 0.8749     | 0.9915          | 0.8933         | 9     |
-| 0.6874     | 0.8773          | 0.8833         | 10    |
-| 0.5538     | 0.7759          | 0.88           | 11    |
-| 0.4475     | 0.7086          | 0.8833         | 12    |
-| 0.3682     | 0.6501          | 0.8867         | 13    |
-| 0.3097     | 0.6114          | 0.8933         | 14    |
-| 0.2609     | 0.5694          | 0.9            | 15    |
-| 0.2240     | 0.5430          | 0.9            | 16    |
-| 0.1915     | 0.5198          | 0.8967         | 17    |
-| 0.1681     | 0.5091          | 0.9067         | 18    |
-| 0.1482     | 0.4936          | 0.8967         | 19    |
-| 0.1339     | 0.4791          | 0.9067         | 20    |
-| 0.1202     | 0.4821          | 0.8933         | 21    |
-| 0.1090     | 0.4700          | 0.89           | 22    |
-| 0.0987     | 0.4585          | 0.8933         | 23    |
-| 0.0900     | 0.4553          | 0.9            | 24    |
-| 0.0843     | 0.4459          | 0.9033         | 25    |
-| 0.0768     | 0.4447          | 0.9033         | 26    |
-| 0.0717     | 0.4388          | 0.9067         | 27    |
 ### Framework versions

 This model is a fine-tuned version of [bert-large-uncased](https://huggingface.co/bert-large-uncased) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Train Loss: 4.9762
+- Validation Loss: 4.7210
+- Train Accuracy: 0.0767
+- Epoch: 0
 ## Model description
 | Train Loss | Validation Loss | Train Accuracy | Epoch |
 |:----------:|:---------------:|:--------------:|:-----:|
+| 4.9762     | 4.7210          | 0.0767         | 0     |
 ### Framework versions

config.json CHANGED Viewed

@@ -4,10 +4,10 @@
     "BertForSequenceClassification"
   ],
   "attention_probs_dropout_prob": 0.1,
-  "classifier_dropout": null,
   "gradient_checkpointing": false,
   "hidden_act": "gelu",
-  "hidden_dropout_prob": 0.1,
   "hidden_size": 1024,
   "id2label": {
     "0": "LABEL_0",

     "BertForSequenceClassification"
   ],
   "attention_probs_dropout_prob": 0.1,
+  "classifier_dropout": 0.2,
   "gradient_checkpointing": false,
   "hidden_act": "gelu",
+  "hidden_dropout_prob": 0.0,
   "hidden_size": 1024,
   "id2label": {
     "0": "LABEL_0",

tf_model.h5 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:26c227f18c8ab30e289031233b49efb2547edec2900ecf91e2a06344953952e2
 size 1341734528

 version https://git-lfs.github.com/spec/v1
+oid sha256:109bc563c3d61ee5d58f29995d7c8506991d78efc579ce4159f0a1ae0cf444bd
 size 1341734528