Leandro von Werra's picture

Leandro von Werra

lvwerra

·

https://github.com/lvwerra

AI & ML interests

NLP and RL

Recent Activity

authored a paper about 2 hours ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

upvoted a paper about 2 hours ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

upvoted an article 1 day ago

Open-source DeepResearch – Freeing our search agents

View all activity

Organizations

lvwerra's activity

liked a Space 2 days ago

DABstep Leaderboard

DABstep Reasoning Benchmark Leaderboard

liked a model 17 days ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 5 days ago • 1.54M • • 7.2k

liked 2 Spaces about 2 months ago

Jupyter Agent

Create and run Jupyter notebooks interactively

Scaling test-time compute

Enhance math problem solving by scaling test-time compute

liked 2 datasets 2 months ago

microsoft/RedStone

Updated Dec 5, 2024 • 94 • 30

ylecun/mnist

Viewer • Updated Aug 8, 2024 • 70k • 32.4k • 154

liked a Space 3 months ago

Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks

Evaluate multilingual models using FineTasks

liked a model 3 months ago

HuggingFaceTB/SmolLM2-1.7B-Instruct

Text Generation • Updated about 2 hours ago • 95.8k • • 497

liked 2 Spaces 4 months ago

CinePileLeaderboard

Video-LLM evaluations on CinePile's evaluation split.

TxT360: Trillion Extracted Text

Explore a large, deduplicated dataset for LLM training

liked a dataset 5 months ago

HuggingFaceFV/finevideo

Viewer • Updated Dec 16, 2024 • 39.5k • 3.02k • 292

liked 2 models 6 months ago

meta-llama/Llama-3.1-8B-Instruct

Text Generation • Updated Sep 25, 2024 • 5.78M • • 3.58k

google/gemma-2-2b

Text Generation • Updated Aug 7, 2024 • 238k • 490

liked 2 Spaces 8 months ago

BigCodeBench Leaderboard

Explore and analyze code evaluation data

FineWeb: decanting the web for the finest text data at scale

Generate high-quality web text data for LLM training

liked a dataset 9 months ago

tomg-group-umd/cinepile

Viewer • Updated Oct 23, 2024 • 608k • 177 • 77

liked a model 9 months ago

bigcode/starcoder2-15b-instruct-v0.1

Text Generation • Updated Nov 3, 2024 • 1.23k • 101

liked a model 10 months ago

bigcode/starcoder2-15b

Text Generation • Updated Jun 5, 2024 • 29.8k • • 581

liked a dataset 10 months ago

HuggingFaceFW/fineweb

Viewer • Updated 6 days ago • 25B • 485k • 1.86k

liked a model 10 months ago

mistral-community/Mixtral-8x22B-v0.1

Text Generation • Updated Jul 1, 2024 • 4.45k • 674