llama-3.2-3B-GRPO-GSM325 / pytorch_model-00001-of-00002.bin

Commit History

Trained with Unsloth
eff2c65
verified

Rauhan commited on