Mistral-7B-v0.1-sft-hhrlhf-dpo / last-checkpoint /model-00002-of-00003.safetensors

Commit History

Training in progress, epoch 1, checkpoint
1fba992
verified

AmberYifan commited on