SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model
Paper
•
2502.02737
•
Published
•
60
Building better datasets together
<|begin▁of▁sentence|>User:
, let the model generate the rest.