Rauhan
/

llama-3.2-3B-GRPO-GSM325

Text Generation

reinforcement-learning

mathematical-reasoning

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Resources

View closed (1)

Adding `safetensors` variant of this model

#1 opened 5 days ago by