AIR-hl/Qwen2.5-1.5B-DPO at main

Qwen2.5-1.5B-DPO / .ipynb_checkpoints

1 contributor

History: 2 commits

updata beta=0.01 version

01ef97e verified about 1 month ago

trainer_state-checkpoint.json

22.7 kB

updata beta=0.01 version about 1 month ago