11 43 60

Agustín Piqueres Lajarín

plaguss

plaguss

AI & ML interests

None yet

Recent Activity

authored a paper about 3 hours ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

upvoted a paper about 7 hours ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

updated a dataset 2 days ago

plaguss/lcb_code_generation_lite

View all activity

Organizations

plaguss's activity

upvoted a paper about 7 hours ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 2 days ago • 65

upvoted an article 3 days ago

Article

FuseO1-Preview: System-II Reasoning Fusion of LLMs

and 4 others •

17 days ago

• 12

upvoted an article 4 days ago

Article

Open-R1: Update #1

and 7 others •

5 days ago

• 237

upvoted an article 9 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

10 days ago

• 648

upvoted a paper 19 days ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published 24 days ago • 89

upvoted an article 23 days ago

Article

Python Is All You Need? Introducing Dria-Agent-α

and 1 other •

27 days ago

• 22

upvoted a collection 29 days ago

Scaling Test-Time Compute with Open Models

Collection

Models and datasets used in our blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute • 10 items • Updated Jan 6 • 23

upvoted an article about 1 month ago

Article

Process Reinforcement through Implicit Rewards

and 1 other •

Jan 3

• 22

upvoted 2 papers about 2 months ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 79

Free Process Rewards without Process Labels

Paper • 2412.01981 • Published Dec 2, 2024 • 31

upvoted a paper 2 months ago

Solving math word problems with process- and outcome-based feedback

Paper • 2211.14275 • Published Nov 25, 2022 • 8

upvoted a collection 2 months ago

SmolVLM

Collection

State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct • 5 items • Updated Dec 22, 2024 • 32

upvoted 2 articles 3 months ago

Article

Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK

and 1 other •

Nov 21, 2024

• 35

Article

Halo: Open Source Health Tracking with Wearables

•

Nov 19, 2024

• 105

upvoted a paper 3 months ago

Aligning Large Language Models via Self-Steering Optimization

Paper • 2410.17131 • Published Oct 22, 2024 • 22

upvoted an article 3 months ago

Article

Releasing the largest multilingual open pretraining dataset

and 2 others •

Nov 13, 2024

• 98

upvoted 3 articles 4 months ago

Article

Releasing Outlines-core 0.1.0: structured generation in Rust and Python

Oct 22, 2024

• 44

Article

How to build a custom text classifier without days of human labeling

and 4 others •

Oct 17, 2024

• 55

Article

How to optimize your data labelling project with custom interfaces

and 9 others •

Oct 16, 2024

• 18

upvoted a paper 4 months ago

DSBench: How Far Are Data Science Agents to Becoming Data Science Experts?

Paper • 2409.07703 • Published Sep 12, 2024 • 67