Submitted by akhaliq 118 The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery · 6 authors 10
Submitted by akhaliq 53 ControlNeXt: Powerful and Efficient Control for Image and Video Generation · 6 authors 8
Submitted by akhaliq 37 CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer · 19 authors 6
Submitted by akhaliq 18 FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework · 4 authors 2
Submitted by akhaliq 16 VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents · 30 authors 3
Submitted by akhaliq 14 HeadGAP: Few-shot 3D Head Avatar via Generalizable Gaussian Priors · 12 authors 2
Submitted by akhaliq 14 UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization · 3 authors 5
Submitted by mamaj92 9 Your Context Is Not an Array: Unveiling Random Access Limitations in Transformers · 3 authors 2
Submitted by akhaliq 9 Body Transformer: Leveraging Robot Embodiment for Policy Learning · 5 authors 2