Leandro von Werra's picture

Leandro von Werra

lvwerra

·

https://github.com/lvwerra

AI & ML interests

NLP and RL

Recent Activity

authored a paper about 2 hours ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

upvoted a paper about 3 hours ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

upvoted an article 1 day ago

Open-source DeepResearch – Freeing our search agents

View all activity

Organizations

lvwerra's activity

upvoted a paper about 3 hours ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 1 day ago • 37

upvoted an article 1 day ago

Article

Open-source DeepResearch – Freeing our search agents

3 days ago

• 623

upvoted an article 2 days ago

Article

DABStep: Data Agent Benchmark for Multi-step Reasoning

3 days ago

• 26

upvoted an article 4 days ago

Article

Open-R1: Update #1

By

and 7 others •

5 days ago

• 235

upvoted an article 9 days ago

Article

Welcome to Inference Providers on the Hub 🔥

10 days ago

• 257

upvoted a paper 9 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 15 days ago • 298

upvoted an article 10 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

10 days ago

• 646

upvoted a paper 20 days ago

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published 23 days ago • 53

upvoted a collection 2 months ago

🤖 Agents

21 items • Updated Dec 31, 2024 • 114

upvoted a paper 3 months ago

SelfCodeAlign: Self-Alignment for Code Generation

Paper • 2410.24198 • Published Oct 31, 2024 • 23

upvoted an article 4 months ago

Article

FineVideo: behind the scenes

Sep 23, 2024

• 29

upvoted 2 papers 5 months ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18, 2024 • 140

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22, 2024 • 125

upvoted 3 articles 6 months ago

Article

Tool Use, Unified

Aug 12, 2024

• 72

Article

A failed experiment: Infini-Attention, and why we should keep trying?

Aug 14, 2024

• 57

Article

XetHub is joining Hugging Face!

Aug 8, 2024

• 81

upvoted 4 articles 7 months ago

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Jul 23, 2024

• 226

Article

Docmatix - a huge dataset for Document Visual Question Answering

Jul 18, 2024

• 72

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

• 305

Article

Our Transformers Code Agent beats the GAIA benchmark!

Jul 1, 2024

• 58