Update README.md
README.md
CHANGED
@@ -129,11 +129,11 @@ We also performed further SFT over V1 and further DPO over V1 and we'll release
 
 ### Changes
 
-1. SFT over V1 with `Replete-AI/code_bagel_hermes-2.5` at 1.0e-4 till 5.0e-5
-2. DPO with: 1.0e-4 to min_lr 5.0e-5
+1. SFT over V1 with `Replete-AI/code_bagel_hermes-2.5` at 1.0e-4 till 5.0e-5 for 1 epoch
+2. DPO with: 1.0e-4 to min_lr 5.0e-5 for 1 epoch
 * `mlabonne/orpo-dpo-mix-40k`
 * `jondurbin/py-dpo-v0.1`
-
+
 # Evaluations
 ## [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
 Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_fblgit__UNA-ThePitbull-21.4B-v2)
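
Reviewer note: the changed lines now give a learning-rate range (1.0e-4 down to a minimum of 5.0e-5) and a duration (1 epoch), but they do not name the decay shape or the number of optimizer steps. The sketch below only illustrates what such a schedule could look like. The cosine shape, the absence of warmup, the function name `lr_at_step`, and the value of `steps_per_epoch` are all assumptions, not details taken from the README.

```python
import math


def lr_at_step(step: int, total_steps: int,
               max_lr: float = 1.0e-4, min_lr: float = 5.0e-5) -> float:
    """Learning rate at `step`, decaying from max_lr to min_lr.

    Assumption: cosine decay with no warmup. The README states only the
    two endpoints (1.0e-4 and 5.0e-5), not the curve between them.
    """
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    # Fraction of training completed, clamped to [0, 1].
    progress = min(max(step / total_steps, 0.0), 1.0)
    # Cosine factor goes from 1.0 at the start to 0.0 at the end.
    cosine = 0.5 * (1.0 + math.cos(math.pi * progress))
    return min_lr + (max_lr - min_lr) * cosine


# Hypothetical step count for the single epoch; the real value depends on
# dataset size and batch size, which the README does not state.
steps_per_epoch = 1000
schedule = [lr_at_step(s, steps_per_epoch) for s in range(steps_per_epoch + 1)]
```

If the actual runs used a linear decay or a warmup phase, only the body of `lr_at_step` would change; the endpoints stay at the values given in the README.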