Submitted by Ziqi 35 Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models · 5 authors 2
Submitted by dongguanting 33 RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation · 7 authors 4
Submitted by CCCCCC 18 SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models · 10 authors 2
Submitted by Xxlbigbrother 13 GaussianProperty: Integrating Physical Properties to 3D Gaussians with LMMs · 11 authors 2
Submitted by deepcs233 12 VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping · 10 authors 2
Submitted by lizb6626 12 IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and Illuminations · 6 authors 2
Submitted by XiaokunSun 11 StrandHead: Text to Strand-Disentangled 3D Head Avatars Using Hair Geometric Priors · 5 authors 2
Submitted by shihan96 10 SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator · 10 authors 5
Submitted by emrys-hong 9 Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning · 7 authors 2
Submitted by ozbro 7 SplineGS: Robust Motion-Adaptive Spline for Real-Time Dynamic 3D Gaussians from Monocular Video · 6 authors 3
Submitted by BrandonLiu 7 DynamicScaler: Seamless and Scalable Video Generation for Panoramic Scenes · 4 authors 2
Submitted by JingzeShi 7 Wonderful Matrices: Combining for a More Efficient and Effective Foundation Model Architecture · 2 authors 2
Submitted by jimmyyhwu 5 TidyBot++: An Open-Source Holonomic Mobile Manipulator for Robot Learning · 9 authors 2
Submitted by thuhsy 5 MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes · 8 authors 2
Submitted by BoZhang 4 GeoX: Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training · 15 authors 2
Submitted by csferrazza 4 MaxInfoRL: Boosting exploration in reinforcement learning through information gain maximization · 5 authors 2
Submitted by prateekv 4 Whisper-GPT: A Hybrid Representation Audio Large Language Model · 1 authors 2
Submitted by dustalov 2 Reliable, Reproducible, and Really Fast Leaderboards with Evalica · 1 authors 2
Submitted by Andron00e 2 Just a Simple Transformation is Enough for Data Protection in Vertical Federated Learning · 4 authors 2
Submitted by nmhkahn 1 Nearly Zero-Cost Protection Against Mimicry by Personalized Diffusion Models · 5 authors 2
Submitted by jianlanluo 1 RLDG: Robotic Generalist Policy Distillation via Reinforcement Learning · 4 authors 2