Demystifying Long Chain-of-Thought Reasoning in LLMs Paper • 2502.03373 • Published about 23 hours ago • 18
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 2 days ago • 62