sedrickkeh committed on
Commit
66ef69b
·
verified ·
1 Parent(s): 89e4cbc

Model save

Files changed (1)
  1. README.md +5 -6
README.md CHANGED
@@ -4,7 +4,6 @@ license: apache-2.0
  base_model: Qwen/Qwen2.5-7B
  tags:
  - llama-factory
- - full
  - generated_from_trainer
  model-index:
  - name: oh-dcft-v3.1-claude-3-5-haiku-20241022-qwen
@@ -16,9 +15,9 @@ should probably proofread and complete it, then remove this comment. -->

  # oh-dcft-v3.1-claude-3-5-haiku-20241022-qwen

- This model is a fine-tuned version of [Qwen/Qwen2.5-7B](https://huggingface.co/Qwen/Qwen2.5-7B) on the mlfoundations-dev/oh-dcft-v3.1-claude-3-5-haiku-20241022 dataset.
+ This model is a fine-tuned version of [Qwen/Qwen2.5-7B](https://huggingface.co/Qwen/Qwen2.5-7B) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.4279
+ - Loss: 0.4277

  ## Model description

@@ -54,9 +53,9 @@ The following hyperparameters were used during training:

  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-----:|:----:|:---------------:|
- | 0.4292 | 1.0 | 1746 | 0.4309 |
- | 0.3757 | 2.0 | 3492 | 0.4209 |
- | 0.3268 | 3.0 | 5238 | 0.4279 |
+ | 0.429 | 1.0 | 1746 | 0.4307 |
+ | 0.3755 | 2.0 | 3492 | 0.4207 |
+ | 0.3266 | 3.0 | 5238 | 0.4277 |


  ### Framework versions