KasaiDanto commited on
Commit
a0a1920
·
verified ·
1 Parent(s): 7c66f16

End of training

Browse files
README.md CHANGED
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  This model is a fine-tuned version of [google/gemma-2-2b-it](https://huggingface.co/google/gemma-2-2b-it) on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 1.8785
20
 
21
  ## Model description
22
 
@@ -49,8 +49,8 @@ The following hyperparameters were used during training:
49
 
50
  | Training Loss | Epoch | Step | Validation Loss |
51
  |:-------------:|:------:|:----:|:---------------:|
52
- | 2.0279 | 0.9999 | 3912 | 1.9928 |
53
- | 1.8365 | 1.9999 | 7824 | 1.8785 |
54
 
55
 
56
  ### Framework versions
 
16
 
17
  This model is a fine-tuned version of [google/gemma-2-2b-it](https://huggingface.co/google/gemma-2-2b-it) on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 1.8732
20
 
21
  ## Model description
22
 
 
49
 
50
  | Training Loss | Epoch | Step | Validation Loss |
51
  |:-------------:|:------:|:----:|:---------------:|
52
+ | 2.0276 | 0.9999 | 3912 | 1.9911 |
53
+ | 1.8334 | 1.9999 | 7824 | 1.8732 |
54
 
55
 
56
  ### Framework versions
adapter_config.json CHANGED
@@ -23,13 +23,13 @@
23
  "rank_pattern": {},
24
  "revision": null,
25
  "target_modules": [
26
- "o_proj",
27
- "k_proj",
28
  "up_proj",
29
- "q_proj",
30
  "v_proj",
31
- "down_proj",
32
- "gate_proj"
 
 
33
  ],
34
  "task_type": "CAUSAL_LM",
35
  "use_dora": false,
 
23
  "rank_pattern": {},
24
  "revision": null,
25
  "target_modules": [
26
+ "down_proj",
 
27
  "up_proj",
 
28
  "v_proj",
29
+ "k_proj",
30
+ "q_proj",
31
+ "gate_proj",
32
+ "o_proj"
33
  ],
34
  "task_type": "CAUSAL_LM",
35
  "use_dora": false,
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:81074065d77991800b5b0a08d6783de402a32d8780a3a451ade3cd8d6f55402b
3
  size 41581360
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:93ae1ff098a64082417ab6b5d277e8501c08a57543f4070ef335e4265ccfe289
3
  size 41581360
runs/Dec08_15-43-41_dbca40d3e117/events.out.tfevents.1733672623.dbca40d3e117.23.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2de9b4ed4713f1244941d7c0365a56d5c68338a37c58de7204966dd273ed2c27
3
+ size 9734
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:405eba1f683dc1d69866e1c523b97c83fcbd8e4c693884b09e9278a7b5dcb6b5
3
  size 5240
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b3ba67acdc936c36a0a3bac6d559b199d330a30b4a08c7f6c947c4e0ac5c2e9f
3
  size 5240