This is the Llemma-7b model used in the paper "An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models". It is based on Llemma-7b and was further fine-tuned on MetaMath with a special step format for reward. Each step starts with "Step" and ends with "ки" (written with Unicode escapes as "\u043a\u0438").
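To make the step format concrete, here is a minimal sketch of how generated text could be split into individual steps using the "Step" prefix and the "\u043a\u0438" end marker described above. The helper name `split_steps` and the sample solution are hypothetical and written by hand for illustration; they are not taken from the paper or from actual model output.

```python
import re
from typing import List

# End-of-step marker from the model card: the two Cyrillic letters "\u043a\u0438".
STEP_END = "\u043a\u0438"

# Non-greedy match from the word "Step" up to the next end marker.
STEP_PATTERN = re.compile(r"Step.*?" + re.escape(STEP_END), flags=re.DOTALL)


def split_steps(solution: str) -> List[str]:
    """Return the steps of a solution, without the trailing end marker."""
    steps = []
    for match in STEP_PATTERN.finditer(solution):
        # Drop the end marker and surrounding whitespace, keep the step text.
        steps.append(match.group(0)[: -len(STEP_END)].strip())
    return steps


# Hypothetical solution text (an assumption, not real model output).
example = (
    "Step 1: 2 + 3 = 5 \u043a\u0438\n"
    "Step 2: 5 * 4 = 20. The answer is: 20 \u043a\u0438"
)
steps = split_steps(example)
for step in steps:
    print(step)
```

Splitting on the end marker rather than on newlines keeps multi-line steps intact, which may matter if a step-level scorer is applied to each step separately.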

Model size: 6.74B params (Safetensors)

Tensor type: BF16

Collection including tkitsers/Llemma-metamath-7b: Inference Scaling Laws Llemma Models (Inference Scaling Laws: An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models), 3 items, updated Oct 22, 2024