Daniel van Strien's picture

Daniel van Strien PRO

davanstrien

·

https://danielvanstrien.xyz/

AI & ML interests

Machine Learning Librarian

Recent Activity

liked a Space 43 minutes ago

UNESCO/nllb

upvoted a paper about 2 hours ago

CondAmbigQA: A Benchmark and Dataset for Conditional Ambiguous Question Answering

liked a dataset about 2 hours ago

Apocalypse-AGI-DAO/CondAmbigQA

View all activity

Organizations

davanstrien's activity

upvoted 2 papers about 2 hours ago

CondAmbigQA: A Benchmark and Dataset for Conditional Ambiguous Question Answering

Paper • 2502.01523 • Published 3 days ago • 1

ScholaWrite: A Dataset of End-to-End Scholarly Writing Process

Paper • 2502.02904 • Published 1 day ago • 1

upvoted a collection about 3 hours ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 6 items • Updated 6 days ago • 34

upvoted 2 papers about 22 hours ago

Template-Driven LLM-Paraphrased Framework for Tabular Math Word Problem Generation

Paper • 2412.15594 • Published Dec 20, 2024 • 1

s1: Simple test-time scaling

Paper • 2501.19393 • Published 6 days ago • 87

upvoted 3 papers 2 days ago

MM-IQ: Benchmarking Human-Like Abstraction and Reasoning in Multimodal Models

Paper • 2502.00698 • Published 4 days ago • 20

FENICE: Factuality Evaluation of summarization based on Natural language Inference and Claim Extraction

Paper • 2403.02270 • Published Mar 4, 2024 • 3

ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning

Paper • 2502.01100 • Published 3 days ago • 12

upvoted an article 4 days ago

Article

Open-R1: Update #1

By

and 7 others •

4 days ago

• 235

upvoted 2 articles 6 days ago

Article

Replicating DeepSeek R1 for Information Extraction

By

•

6 days ago

• 28

Article

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

By

•

6 days ago

• 28

upvoted a paper 6 days ago

WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training

Paper • 2501.18511 • Published 7 days ago • 17

upvoted a collection 6 days ago

WildChat-50m

All model responses associated with the WildChat-50m paper. • 55 items • Updated 8 days ago • 6

upvoted an article 8 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

10 days ago

• 646

upvoted an article 9 days ago

Article

Welcome to Inference Providers on the Hub 🔥

10 days ago

• 256

upvoted a collection 10 days ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 3 items • Updated 10 days ago • 320

upvoted a collection 11 days ago

Qwen2.5-1M

The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 11 days ago • 97

upvoted an article 14 days ago

Article

Explore, Curate and Vector Search Any Hugging Face Dataset with Nomic Atlas

By

and 4 others •

14 days ago

• 29

upvoted a paper 14 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 15 days ago • 298

upvoted an article 16 days ago

Article

Exploring Synthetic Data Generation with DataDreamer

By

•

16 days ago

• 6