CondAmbigQA: A Benchmark and Dataset for Conditional Ambiguous Question Answering Paper • 2502.01523 • Published 3 days ago • 1
ScholaWrite: A Dataset of End-to-End Scholarly Writing Process Paper • 2502.02904 • Published 1 day ago • 1
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 6 items • Updated 6 days ago • 34
Template-Driven LLM-Paraphrased Framework for Tabular Math Word Problem Generation Paper • 2412.15594 • Published Dec 20, 2024 • 1
MM-IQ: Benchmarking Human-Like Abstraction and Reasoning in Multimodal Models Paper • 2502.00698 • Published 4 days ago • 20
FENICE: Factuality Evaluation of summarization based on Natural language Inference and Claim Extraction Paper • 2403.02270 • Published Mar 4, 2024 • 3
ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning Paper • 2502.01100 • Published 3 days ago • 12
view article Article Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial By open-r1 • 6 days ago • 28
WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training Paper • 2501.18511 • Published 7 days ago • 17
WildChat-50m Collection All model responses associated with the WildChat-50m paper. • 55 items • Updated 8 days ago • 6
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated 10 days ago • 320
Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 11 days ago • 97
view article Article Explore, Curate and Vector Search Any Hugging Face Dataset with Nomic Atlas By MaxNomic and 4 others • 14 days ago • 29
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 15 days ago • 298
view article Article Exploring Synthetic Data Generation with DataDreamer By asoria • 16 days ago • 6