qwen-r1-aha-moment / all_results.json
Chris126's picture
End of training
f63e5f3 verified
raw
history blame contribute delete
203 Bytes
{
"total_flos": 0.0,
"train_loss": 1.3050906145387637e-09,
"train_runtime": 4363.1554,
"train_samples": 45000,
"train_samples_per_second": 0.103,
"train_steps_per_second": 0.103
}