Running 496 496 Scaling test-time compute 📈 Enhance math problem solving by scaling test-time compute
Running on CPU Upgrade 67 67 La Leaderboard 🌸 Evaluate open LLMs in the languages of LATAM and Spain.