velaa committed on
Commit 8438d63 · 1 Parent(s): 4ebb068

End of training

README.md CHANGED
@@ -9,7 +9,20 @@ metrics:
 - accuracy
 model-index:
 - name: opt-125m-finetuned-mnli
-  results: []
+  results:
+  - task:
+      name: Text Classification
+      type: text-classification
+    dataset:
+      name: glue
+      type: glue
+      config: mnli
+      split: validation_matched
+      args: mnli
+    metrics:
+    - name: Accuracy
+      type: accuracy
+      value: 0.3543555781966378
 ---
 
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -19,8 +32,8 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [facebook/opt-125m](https://huggingface.co/facebook/opt-125m) on the glue dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.2083
-- Accuracy: 0.3321
+- Loss: 1.8177
+- Accuracy: 0.3544
 
 ## Model description
 
@@ -39,7 +52,7 @@ More information needed
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
-- learning_rate: 2e-05
+- learning_rate: 0.0002
 - train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
@@ -51,11 +64,11 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
-| 1.2018 | 1.0 | 1 | 1.2117 | 0.3312 |
-| 1.237 | 2.0 | 2 | 1.2083 | 0.3321 |
-| 1.2323 | 3.0 | 3 | 1.2057 | 0.3317 |
-| 1.1903 | 4.0 | 4 | 1.2041 | 0.3317 |
-| 1.1866 | 5.0 | 5 | 1.2032 | 0.3317 |
+| 1.0072 | 1.0 | 1 | 2.3236 | 0.3325 |
+| 0.4544 | 2.0 | 2 | 1.8177 | 0.3544 |
+| 0.0899 | 3.0 | 3 | 1.7630 | 0.3319 |
+| 0.0474 | 4.0 | 4 | 1.7078 | 0.3446 |
+| 0.0048 | 5.0 | 5 | 1.7089 | 0.3435 |
 
 
 ### Framework versions
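The summary metrics in the updated card (Loss 1.8177, Accuracy 0.3544) equal the epoch 2 row of the new results table, which is also the row with the highest accuracy. The following is a minimal sketch that checks this from the table values alone. The helper name `best_by_accuracy` is hypothetical, and the sketch does not claim to show how the Trainer actually selected the reported checkpoint.

```python
# Rows copied from the updated results table in the README diff:
# (training_loss, epoch, step, validation_loss, accuracy)
rows = [
    (1.0072, 1.0, 1, 2.3236, 0.3325),
    (0.4544, 2.0, 2, 1.8177, 0.3544),
    (0.0899, 3.0, 3, 1.7630, 0.3319),
    (0.0474, 4.0, 4, 1.7078, 0.3446),
    (0.0048, 5.0, 5, 1.7089, 0.3435),
]


def best_by_accuracy(table):
    """Return the row whose last column (accuracy) is largest."""
    return max(table, key=lambda row: row[-1])


best = best_by_accuracy(rows)
print(f"epoch={best[1]} val_loss={best[3]} accuracy={best[4]}")
```

The selected row can then be compared by eye with the `Loss` and `Accuracy` lines in the card summary.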
model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1683fc1582ef6c1d96784e2017664770d64de9ece34dc0d5829d5666c80f4bcb
+oid sha256:5c5f9b1a765527cd6683f47500f3cd23bf883846bd19e7bbc73d071c6f734224
 size 500988904
runs/Dec03_01-32-17_e58bb28c2f23/events.out.tfevents.1701567150.e58bb28c2f23.15759.2 CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d6272e1dab4ef88a1c1ce0eb00c9cd3707208bf6412c3122c5d2ed4e5b30ecf2
-size 6952
+oid sha256:a79b4dd104bb057a165aabe424bfad1191cde6f60d0de54cbf954715d8a09c17
+size 7300
runs/Dec03_01-32-17_e58bb28c2f23/events.out.tfevents.1701567585.e58bb28c2f23.15759.3 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:720b4d52055816f8dbcfa264f8e62d182dd0cb5e4902709556dc00aa4a2c804c
+size 405
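
The `model.safetensors` and `events.out.tfevents.*` entries in this commit are Git LFS pointer files, so the diff shows only a spec version, an object ID, and a byte size rather than the binary contents. The following is a minimal sketch of reading such a pointer, assuming the usual one-`key value`-pair-per-line layout. `parse_lfs_pointer` is a hypothetical helper, not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text):
    """Parse Git LFS pointer text into a small dict.

    Assumes each non-empty line has the form 'key value'.
    """
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form '<hash-algorithm>:<hex-digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real object in bytes
    }


# Pointer content of the newly added events file in this commit.
pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:720b4d52055816f8dbcfa264f8e62d182dd0cb5e4902709556dc00aa4a2c804c
size 405
"""

info = parse_lfs_pointer(pointer)
print(info["hash_algo"], info["size"])
```

Comparing two parsed pointers shows whether the underlying object changed: for `model.safetensors` the digest differs while the size stays at 500988904 bytes.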