End of training

Files changed (4)

README.md CHANGED

@@ -9,7 +9,20 @@ metrics:
 - accuracy
 model-index:
 - name: opt-125m-finetuned-mnli
-  results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -19,8 +32,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/opt-125m](https://huggingface.co/facebook/opt-125m) on the glue dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.2083
-- Accuracy: 0.3321
 ## Model description
@@ -39,7 +52,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 2e-05
 - train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
@@ -51,11 +64,11 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
-| 1.2018        | 1.0   | 1    | 1.2117          | 0.3312   |
-| 1.237         | 2.0   | 2    | 1.2083          | 0.3321   |
-| 1.2323        | 3.0   | 3    | 1.2057          | 0.3317   |
-| 1.1903        | 4.0   | 4    | 1.2041          | 0.3317   |
-| 1.1866        | 5.0   | 5    | 1.2032          | 0.3317   |
 ### Framework versions

 - accuracy
 model-index:
 - name: opt-125m-finetuned-mnli
+  results:
+  - task:
+      name: Text Classification
+      type: text-classification
+    dataset:
+      name: glue
+      type: glue
+      config: mnli
+      split: validation_matched
+      args: mnli
+    metrics:
+    - name: Accuracy
+      type: accuracy
+      value: 0.3543555781966378
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [facebook/opt-125m](https://huggingface.co/facebook/opt-125m) on the glue dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.8177
+- Accuracy: 0.3544
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.0002
 - train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
+| 1.0072        | 1.0   | 1    | 2.3236          | 0.3325   |
+| 0.4544        | 2.0   | 2    | 1.8177          | 0.3544   |
+| 0.0899        | 3.0   | 3    | 1.7630          | 0.3319   |
+| 0.0474        | 4.0   | 4    | 1.7078          | 0.3446   |
+| 0.0048        | 5.0   | 5    | 1.7089          | 0.3435   |
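
A minimal sketch, assuming the summary lines of the updated card (Loss 1.8177, Accuracy 0.3544) were taken from the epoch with the highest validation accuracy. The card does not state how that row was selected, so the selection rule here is an assumption that merely agrees with the numbers shown. The helper name `best_by_accuracy` is hypothetical; the rows are copied from the table above.

```python
# Rows copied from the updated results table:
# (training loss, epoch, step, validation loss, accuracy)
rows = [
    (1.0072, 1.0, 1, 2.3236, 0.3325),
    (0.4544, 2.0, 2, 1.8177, 0.3544),
    (0.0899, 3.0, 3, 1.7630, 0.3319),
    (0.0474, 4.0, 4, 1.7078, 0.3446),
    (0.0048, 5.0, 5, 1.7089, 0.3435),
]


def best_by_accuracy(table):
    """Return the row with the highest validation accuracy (assumed selection rule)."""
    return max(table, key=lambda row: row[4])


best = best_by_accuracy(rows)

# Gap between validation and training loss per epoch; a widening gap is the
# usual sign that the model is fitting the training data without generalizing.
gaps = [val_loss - train_loss for train_loss, _, _, val_loss, _ in rows]

print("best epoch:", best[1], "val loss:", best[3], "accuracy:", best[4])
print("first gap:", gaps[0], "last gap:", gaps[-1])
```

Under that assumption the selected row is epoch 2, which matches both the summary lines and the rounded `value` in the `model-index` metadata. MNLI has three labels, so every accuracy in the table sits close to the one-in-three rate expected from guessing.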
 ### Framework versions

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1683fc1582ef6c1d96784e2017664770d64de9ece34dc0d5829d5666c80f4bcb
 size 500988904

 version https://git-lfs.github.com/spec/v1
+oid sha256:5c5f9b1a765527cd6683f47500f3cd23bf883846bd19e7bbc73d071c6f734224
 size 500988904

runs/Dec03_01-32-17_e58bb28c2f23/events.out.tfevents.1701567150.e58bb28c2f23.15759.2 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d6272e1dab4ef88a1c1ce0eb00c9cd3707208bf6412c3122c5d2ed4e5b30ecf2
-size 6952

 version https://git-lfs.github.com/spec/v1
+oid sha256:a79b4dd104bb057a165aabe424bfad1191cde6f60d0de54cbf954715d8a09c17
+size 7300

runs/Dec03_01-32-17_e58bb28c2f23/events.out.tfevents.1701567585.e58bb28c2f23.15759.3 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:720b4d52055816f8dbcfa264f8e62d182dd0cb5e4902709556dc00aa4a2c804c
+size 405
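
The binary files in this commit appear only as Git LFS pointer files, each with a `version`, `oid` and `size` line. As a rough sketch, such a pointer could be parsed as follows; `parse_lfs_pointer` is a hypothetical helper rather than part of any library, and the sample text is the new `model.safetensors` pointer from this commit.

```python
def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line is "<key> <value>", separated by the first space.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:5c5f9b1a765527cd6683f47500f3cd23bf883846bd19e7bbc73d071c6f734224
size 500988904
"""

info = parse_lfs_pointer(pointer)
print(info["algo"], info["size"])
```

For `model.safetensors` the `oid` changes while `size` stays at 500988904 bytes, which is what one would expect when the weight values are updated but the tensor shapes are not.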