Submitted by chujiezheng 79 ProcessBench: Identifying Process Errors in Mathematical Reasoning · 9 authors 6
Submitted by Shibo-UCSD 77 Training Large Language Models to Reason in a Continuous Latent Space · 7 authors 7
Submitted by avanturist 71 Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation · 5 authors 2
Submitted by nicolas-dufour 20 Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation · 4 authors 2
Submitted by tttoaster 15 Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation · 4 authors 2
Submitted by LooperXX 15 Exploring Multi-Grained Concept Annotations for Multimodal Large Language Models · 6 authors 2
Submitted by xinlongwang 12 You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale · 7 authors 3
Submitted by Hidir 8 MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance · 4 authors 2
Submitted by mikonvergence 7 Global and Dense Embeddings of Earth: Major TOM Floating in the Latent Space · 3 authors 2
Submitted by AntoineGuedon 6 MAtCha Gaussians: Atlas of Charts for High-Quality Geometry and Photorealism From Sparse Views · 4 authors 2
Submitted by huangsiteng 6 CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction · 8 authors 2
Submitted by mkhalifa 4 If You Can't Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs · 9 authors 2