Safetensors
English
llama
alignment-handbook
trl
dpo
Generated from Trainer
Llama-3.1-8B-Magpie-Align-v0.2 / model.safetensors.index.json

Commit History

Training in progress, step 100
f92c0a2
verified

flydust commited on