Submitted by SiyuanH 50 EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation · 10 authors 3
Submitted by akhaliq 42 VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction · 15 authors 2
Submitted by KAB1314 18 SDPO: Segment-Level Direct Preference Optimization for Social Agents · 10 authors 2
Submitted by xujz0703 18 VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation · 21 authors 2
Submitted by Franck-Dernoncourt 13 LUSIFER: Language Universal Space Integration for Enhanced Multilingual Embeddings with Large Language Models · 6 authors 2
Submitted by obiwan96 6 BoxingGym: Benchmarking Progress in Automated Experimental Design and Model Discovery · 7 authors 2