rngusry commited on
Commit
dd2c0ed
·
verified ·
1 Parent(s): f9ea24f

End of training

Browse files
Files changed (2) hide show
  1. README.md +6 -2
  2. config.json +1 -1
README.md CHANGED
@@ -3,11 +3,15 @@ library_name: transformers
3
  license: llama3.2
4
  base_model: meta-llama/Llama-3.2-1B-Instruct
5
  tags:
 
 
 
 
6
  - trl
7
  - sft
8
  - generated_from_trainer
9
  datasets:
10
- - generator
11
  model-index:
12
  - name: llama3.2-1b-instruct-hh-sft
13
  results: []
@@ -18,7 +22,7 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  # llama3.2-1b-instruct-hh-sft
20
 
21
- This model is a fine-tuned version of [meta-llama/Llama-3.2-1B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct) on the generator dataset.
22
 
23
  ## Model description
24
 
 
3
  license: llama3.2
4
  base_model: meta-llama/Llama-3.2-1B-Instruct
5
  tags:
6
+ - alignment-handbook
7
+ - trl
8
+ - sft
9
+ - generated_from_trainer
10
  - trl
11
  - sft
12
  - generated_from_trainer
13
  datasets:
14
+ - VMware/open-instruct-v1-oasst-dolly-hhrlhf
15
  model-index:
16
  - name: llama3.2-1b-instruct-hh-sft
17
  results: []
 
22
 
23
  # llama3.2-1b-instruct-hh-sft
24
 
25
+ This model is a fine-tuned version of [meta-llama/Llama-3.2-1B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct) on the VMware/open-instruct-v1-oasst-dolly-hhrlhf dataset.
26
 
27
  ## Model description
28
 
config.json CHANGED
@@ -35,6 +35,6 @@
35
  "tie_word_embeddings": true,
36
  "torch_dtype": "bfloat16",
37
  "transformers_version": "4.48.1",
38
- "use_cache": false,
39
  "vocab_size": 128256
40
  }
 
35
  "tie_word_embeddings": true,
36
  "torch_dtype": "bfloat16",
37
  "transformers_version": "4.48.1",
38
+ "use_cache": true,
39
  "vocab_size": 128256
40
  }