SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 7 days ago • 153
math-extraction-multilingual/HuggingFaceTB_SmolLM2-1.7B-Instruct Viewer • Updated 12 days ago • 6.32k • 21
math-extraction-multilingual/HuggingFaceTB_SmolLM2-1.7B-Instruct Viewer • Updated 12 days ago • 6.32k • 21
math-extraction-multilingual/meta-llama_Llama-3.2-3B-Instruct Viewer • Updated 18 days ago • 5.15k • 61
math-extraction-multilingual/meta-llama_Llama-3.2-3B-Instruct Viewer • Updated 18 days ago • 5.15k • 61
math-extraction-multilingual/meta-llama_Llama-3.1-8B-Instruct Viewer • Updated 18 days ago • 5.15k • 59
math-extraction-multilingual/meta-llama_Llama-3.1-8B-Instruct Viewer • Updated 18 days ago • 5.15k • 59
math-extraction-multilingual/meta-llama_Llama-3.2-1B-Instruct Viewer • Updated 18 days ago • 5.15k • 60
math-extraction-multilingual/meta-llama_Llama-3.2-1B-Instruct Viewer • Updated 18 days ago • 5.15k • 60
Towards Best Practices for Open Datasets for LLM Training Paper • 2501.08365 • Published 28 days ago • 54