Thomas Wolf PRO

thomwolf

https://thomwolf.io

AI & ML interests

NLP and open-source :-)

Recent Activity

authored a paper about 1 hour ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

upvoted a paper about 5 hours ago

Demystifying Long Chain-of-Thought Reasoning in LLMs

upvoted a paper about 5 hours ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

thomwolf's activity

upvoted 2 papers about 5 hours ago

Demystifying Long Chain-of-Thought Reasoning in LLMs

Paper • 2502.03373 • Published about 22 hours ago • 18

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 1 day ago • 60

upvoted an article about 6 hours ago

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

Article • 3 days ago • 67

upvoted an article 2 days ago

Open-source DeepResearch – Freeing our search agents

Article • 3 days ago • 629

upvoted an article 4 days ago

Open-R1: Update #1

Article • By … and 7 others • 5 days ago • 235

upvoted a collection 6 days ago

Tulu 3 Models

All models released with Tulu 3 -- state of the art open post-training recipes. • 10 items • Updated 8 days ago • 86

upvoted 2 articles 9 days ago

Welcome to Inference Providers on the Hub 🔥

Article • 10 days ago • 258

Open-R1: a fully open reproduction of DeepSeek-R1

Article • 10 days ago • 646

upvoted 2 articles 22 days ago

MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era

Article • 22 days ago • 40

Diving into MiniMax01 405B MoE

Article • 22 days ago • 17

upvoted a paper about 1 month ago

DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning

Paper • 2406.11896 • Published Jun 14, 2024 • 20

upvoted an article about 1 month ago

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

Article • Jan 2 • 39

upvoted a collection about 1 month ago

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated 29 days ago • 551

upvoted 2 papers about 1 month ago

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 106

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

Paper • 2412.04454 • Published Dec 5, 2024 • 60

upvoted a collection about 1 month ago

Falcon3

Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated 29 days ago • 80

upvoted an article about 1 month ago

FineWeb2-C: Help Build Better Language Models in Your Language

Article • By … and 5 others • Dec 23, 2024 • 18

upvoted a collection about 2 months ago

TabuLa-8B

Training, eval suite, and model from the paper "Large Scale Transfer Learning for Tabular Data via Language Modeling" https://arxiv.org/abs/2406.12031 • 4 items • Updated Jun 19, 2024 • 11

upvoted 2 papers about 2 months ago

LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment

Paper • 2412.04814 • Published Dec 6, 2024 • 45

Solving Quantitative Reasoning Problems with Language Models

Paper • 2206.14858 • Published Jun 29, 2022 • 1