view article Article Mastering Long Contexts in LLMs with KVPress By nvidia and 1 other • 14 days ago • 59
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate Paper • 2501.17703 • Published 8 days ago • 50
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models Paper • 2502.01061 • Published 3 days ago • 149
The Differences Between Direct Alignment Algorithms are a Blur Paper • 2502.01237 • Published 3 days ago • 104
view article Article **How biased is Whisper ? Evaluating Whisper Models for Robustness to Diverse English Accents** By Steveeeeeeen • 8 days ago • 13
view article Article 🅰️ℹ️ 1️⃣0️⃣1️⃣ The Keys to Prompt Optimization By Kseniase and 1 other • 8 days ago • 4
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. • 10 items • Updated 8 days ago • 86
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated 10 days ago • 320
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 15 days ago • 298
OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking Paper • 2501.09751 • Published 21 days ago • 47