Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning Paper • 2502.03275 • Published about 22 hours ago • 2 • 1
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 1 day ago • 37 • 1
ACECODER: Acing Coder RL via Automated Test-Case Synthesis Paper • 2502.01718 • Published 3 days ago • 21 • 2
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models Paper • 2502.01061 • Published 3 days ago • 149 • 16
ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning Paper • 2502.01100 • Published 3 days ago • 12 • 2
The Jumping Reasoning Curve? Tracking the Evolution of Reasoning Performance in GPT-[n] and o-[n] Models on Multimodal Puzzles Paper • 2502.01081 • Published 3 days ago • 9 • 2
Improving Transformer World Models for Data-Efficient RL Paper • 2502.01591 • Published 3 days ago • 8 • 2
MatAnyone: Stable Video Matting with Consistent Memory Propagation Paper • 2501.14677 • Published 13 days ago • 26 • 2
Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming Paper • 2501.18837 • Published 7 days ago • 7 • 5
Trading Inference-Time Compute for Adversarial Robustness Paper • 2501.18841 • Published 6 days ago • 3 • 2
Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs Paper • 2501.18585 • Published 7 days ago • 49 • 10
Large Language Models Think Too Fast To Explore Effectively Paper • 2501.18009 • Published 8 days ago • 22 • 3
Atla Selene Mini: A General Purpose Evaluation Model Paper • 2501.17195 • Published 10 days ago • 30 • 4
Early External Safety Testing of OpenAI's o3-mini: Insights from the Pre-Deployment Evaluation Paper • 2501.17749 • Published 8 days ago • 12 • 2
TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models Paper • 2501.16937 • Published 9 days ago • 4 • 2
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published 9 days ago • 100 • 6
Optimizing Large Language Model Training Using FP4 Quantization Paper • 2501.17116 • Published 9 days ago • 32 • 2