Hanish2007 committed on
Commit
a67bafb
·
verified ·
1 Parent(s): bd9e268

End of training

Browse files
Files changed (5) hide show
  1. README.md +26 -15
  2. config.json +1 -1
  3. model.safetensors +1 -1
  4. tokenizer_config.json +1 -1
  5. training_args.bin +2 -2
README.md CHANGED
@@ -1,4 +1,5 @@
1
  ---
 
2
  license: apache-2.0
3
  base_model: distilbert/distilbert-base-uncased
4
  tags:
@@ -17,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 1.3497
21
  - Accuracy: 0.6355
22
 
23
  ## Model description
@@ -41,29 +42,39 @@ The following hyperparameters were used during training:
41
  - train_batch_size: 32
42
  - eval_batch_size: 32
43
  - seed: 42
44
- - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: linear
46
- - num_epochs: 10
47
 
48
  ### Training results
49
 
50
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
51
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
52
- | 3.319 | 1.0 | 65 | 2.7462 | 0.4230 |
53
- | 2.6272 | 2.0 | 130 | 2.0795 | 0.5010 |
54
- | 2.137 | 3.0 | 195 | 1.6683 | 0.5770 |
55
- | 1.469 | 4.0 | 260 | 1.4721 | 0.6101 |
56
- | 1.2405 | 5.0 | 325 | 1.3497 | 0.6355 |
57
- | 1.1023 | 6.0 | 390 | 1.2936 | 0.6335 |
58
- | 0.9206 | 7.0 | 455 | 1.2855 | 0.6316 |
59
- | 0.8374 | 8.0 | 520 | 1.2579 | 0.6355 |
60
- | 0.794 | 9.0 | 585 | 1.2525 | 0.6335 |
61
- | 0.7388 | 10.0 | 650 | 1.2478 | 0.6316 |
 
 
 
 
 
 
 
 
 
 
62
 
63
 
64
  ### Framework versions
65
 
66
- - Transformers 4.44.0
67
  - Pytorch 2.4.0
68
  - Datasets 2.20.0
69
- - Tokenizers 0.19.1
 
1
  ---
2
+ library_name: transformers
3
  license: apache-2.0
4
  base_model: distilbert/distilbert-base-uncased
5
  tags:
 
18
 
19
  This model is a fine-tuned version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 1.2600
22
  - Accuracy: 0.6355
23
 
24
  ## Model description
 
42
  - train_batch_size: 32
43
  - eval_batch_size: 32
44
  - seed: 42
45
+ - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
46
  - lr_scheduler_type: linear
47
+ - num_epochs: 20
48
 
49
  ### Training results
50
 
51
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
52
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
53
+ | 3.3759 | 1.0 | 65 | 2.8220 | 0.4152 |
54
+ | 2.7146 | 2.0 | 130 | 2.0778 | 0.5127 |
55
+ | 2.1537 | 3.0 | 195 | 1.6326 | 0.6121 |
56
+ | 1.4176 | 4.0 | 260 | 1.4415 | 0.6062 |
57
+ | 1.1537 | 5.0 | 325 | 1.2944 | 0.6316 |
58
+ | 1.0093 | 6.0 | 390 | 1.2600 | 0.6355 |
59
+ | 0.7806 | 7.0 | 455 | 1.2770 | 0.6199 |
60
+ | 0.6639 | 8.0 | 520 | 1.2654 | 0.6296 |
61
+ | 0.5922 | 9.0 | 585 | 1.2733 | 0.6296 |
62
+ | 0.4659 | 10.0 | 650 | 1.3403 | 0.6179 |
63
+ | 0.3928 | 11.0 | 715 | 1.3584 | 0.6179 |
64
+ | 0.3347 | 12.0 | 780 | 1.3825 | 0.6179 |
65
+ | 0.3175 | 13.0 | 845 | 1.4199 | 0.6101 |
66
+ | 0.2582 | 14.0 | 910 | 1.4277 | 0.6179 |
67
+ | 0.2097 | 15.0 | 975 | 1.4421 | 0.6179 |
68
+ | 0.2308 | 16.0 | 1040 | 1.4636 | 0.6101 |
69
+ | 0.1753 | 17.0 | 1105 | 1.4857 | 0.6199 |
70
+ | 0.1632 | 18.0 | 1170 | 1.4894 | 0.6277 |
71
+ | 0.1564 | 19.0 | 1235 | 1.5043 | 0.6160 |
72
+ | 0.1494 | 20.0 | 1300 | 1.5040 | 0.6179 |
73
 
74
 
75
  ### Framework versions
76
 
77
+ - Transformers 4.46.3
78
  - Pytorch 2.4.0
79
  - Datasets 2.20.0
80
+ - Tokenizers 0.20.3
config.json CHANGED
@@ -96,6 +96,6 @@
96
  "sinusoidal_pos_embds": false,
97
  "tie_weights_": true,
98
  "torch_dtype": "float32",
99
- "transformers_version": "4.44.0",
100
  "vocab_size": 30522
101
  }
 
96
  "sinusoidal_pos_embds": false,
97
  "tie_weights_": true,
98
  "torch_dtype": "float32",
99
+ "transformers_version": "4.46.3",
100
  "vocab_size": 30522
101
  }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:72c8c1cf03c6864b2956e5235f34b02bcd3ea51f808ba94f5e30e78c84d468bb
3
  size 267937160
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f077b2f4176aaa251c8e69019c07d05e68074219b085310a8bda208411beb20c
3
  size 267937160
tokenizer_config.json CHANGED
@@ -41,7 +41,7 @@
41
  "special": true
42
  }
43
  },
44
- "clean_up_tokenization_spaces": true,
45
  "cls_token": "[CLS]",
46
  "do_lower_case": true,
47
  "mask_token": "[MASK]",
 
41
  "special": true
42
  }
43
  },
44
+ "clean_up_tokenization_spaces": false,
45
  "cls_token": "[CLS]",
46
  "do_lower_case": true,
47
  "mask_token": "[MASK]",
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:9e9a68dcb35447df75f0612c9054154ae4966d21ea11f108d3320c9e29bf8278
3
- size 5112
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2c138e5f4cf55dcf5dfd35b0ebfd4dca743f0d212a315a64a5b90924b8f7f72d
3
+ size 5240