fblgit
/

UNA-ThePitbull-21.4B-v2

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

fblgit commited on May 31, 2024

Commit

f12aac9

·

verified ·

1 Parent(s): 4c9a3c5

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -129,11 +129,11 @@ We also performed further SFT over V1 and further DPO over V1 and we'll release
 ### Changes
-1. SFT over V1 with `Replete-AI/code_bagel_hermes-2.5` at 1.0e-4 till 5.0e-5
-2. DPO with: 1.0e-4 to min_lr 5.0e-5
 * `mlabonne/orpo-dpo-mix-40k`
 * `jondurbin/py-dpo-v0.1`
 # Evaluations
 ## [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
 Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_fblgit__UNA-ThePitbull-21.4B-v2)

 ### Changes
+1. SFT over V1 with `Replete-AI/code_bagel_hermes-2.5` at 1.0e-4 till 5.0e-5 for 1 epoch
+2. DPO with: 1.0e-4 to min_lr 5.0e-5 for 1 epoch
 * `mlabonne/orpo-dpo-mix-40k`
 * `jondurbin/py-dpo-v0.1`
 # Evaluations
 ## [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
 Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_fblgit__UNA-ThePitbull-21.4B-v2)