vansin
vansin
AI & ML interests
None yet
Recent Activity
liked
a Space
5 days ago
SmartFlowAI/HuggingFaceWeeklyPaper
upvoted
an
article
9 days ago
State of open video generation models in Diffusers
new activity
10 days ago
deepseek-ai/DeepSeek-R1:How to deploy DeepSeek-R1 witn LMDeploy ?
Organizations
vansin's activity
Post
1251
Amazing !!!! test Post
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1678589663024-640d3eaa3623f6a56dde856d.jpeg)
reacted to
loubnabnl's
post with 🔥
2 months ago
Post
2241
Making SmolLM2 reproducible: open-sourcing our training & evaluation toolkit 🛠️ https://github.com/huggingface/smollm/
- Pre-training code with nanotron
- Evaluation suite with lighteval
- Synthetic data generation using distilabel (powers our new SFT dataset HuggingFaceTB/smoltalk)
- Post-training scripts with TRL & the alignment handbook
- On-device tools with llama.cpp for summarization, rewriting & agents
Apache 2.0 licensed. V2 pre-training data mix coming soon!
Which other tools should we add next?
- Pre-training code with nanotron
- Evaluation suite with lighteval
- Synthetic data generation using distilabel (powers our new SFT dataset HuggingFaceTB/smoltalk)
- Post-training scripts with TRL & the alignment handbook
- On-device tools with llama.cpp for summarization, rewriting & agents
Apache 2.0 licensed. V2 pre-training data mix coming soon!
Which other tools should we add next?