DeepSeek-R1-Distill-Qwen-7B-GRPO_Math_lowlr / model.safetensors.index.json

Commit History

Model save
5cc553e
verified

Dongwei commited on