Loubna Ben Allal's picture

Loubna Ben Allal

loubnabnl

·

https://loubnabnl.github.io/

AI & ML interests

SmolLMs, ML for code, data

Recent Activity

updated a dataset about 1 hour ago

HuggingFaceTB/smol-smoltalk

updated a model about 1 hour ago

HuggingFaceTB/SmolLM2-135M

updated a model about 1 hour ago

HuggingFaceTB/SmolLM2-360M

View all activity

Organizations

loubnabnl's activity

upvoted a paper about 2 hours ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 1 day ago • 32

upvoted an article 9 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

9 days ago

• 646

upvoted an article 14 days ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

14 days ago

• 119

upvoted a paper 22 days ago

StarCoder: may the source be with you!

Paper • 2305.06161 • Published May 9, 2023 • 29

upvoted an article 22 days ago

Article

Diving into MiniMax01 405B MoE

By

•

22 days ago

• 17

upvoted an article 29 days ago

Article

Fine-tune a SmolLM on domain-specific synthetic data from a LLM

By

•

Jan 3

• 32

upvoted a collection 2 months ago

SmolVLM

State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct • 5 items • Updated Dec 22, 2024 • 32

upvoted an article 6 months ago

Article

The 5 Most Under-Rated Tools on Hugging Face

Aug 22, 2024

• 85

upvoted a collection 6 months ago

💻 Local SmolLMs

SmolLM models in MLC, ONNX and GGUF format for local applications + in-browser demos • 14 items • Updated Dec 22, 2024 • 48

upvoted an article 7 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

• 305

upvoted 2 papers 8 months ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25, 2024 • 91

Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations

Paper • 2405.18392 • Published May 28, 2024 • 12

upvoted 2 collections 11 months ago

Leaderboards and benchmarks ✨

Cool leaderboard spaces collection for models across modalities! Text, vision, audio, ... • 90 items • Updated 1 day ago • 93

ZeroGPU Spaces

ZeroGPU Spaces made by the community • 17 items • Updated Jun 6, 2024 • 233

upvoted a paper 11 months ago

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29, 2024 • 137

upvoted a collection 11 months ago

💫 StarCoder2

StarCoder2 models and datasets! • 8 items • Updated Mar 1, 2024 • 83

upvoted a paper over 1 year ago

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 122