Submitted by akhaliq 37 VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models · 3 authors 3
Submitted by akhaliq 31 The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning · 8 authors 4
Submitted by akhaliq 21 VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence · 10 authors 5
Submitted by akhaliq 17 LivePhoto: Real Image Animation with Text-guided Motion Control · 7 authors 3
Submitted by akhaliq 13 Rank-without-GPT: Building GPT-Independent Listwise Rerankers on Open-Source Large Language Models · 5 authors
Submitted by akhaliq 13 GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis · 7 authors 1
Submitted by akhaliq 12 LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models · 11 authors
Submitted by akhaliq 11 Fine-grained Controllable Video Generation via Object Appearance and Context · 7 authors
Submitted by akhaliq 9 StableDreamer: Taming Noisy Score Distillation Sampling for Text-to-3D · 10 authors 3
Submitted by akhaliq 9 Generative Rendering: Controllable 4D-Guided Video Generation with 2D Diffusion Models · 6 authors 2
Submitted by akhaliq 8 GPT4Point: A Unified Framework for Point-Language Understanding and Generation · 8 authors
Submitted by akhaliq 7 VideoRF: Rendering Dynamic Radiance Fields as 2D Feature Video Streams · 8 authors 3
Submitted by akhaliq 4 Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training · 9 authors 1
Submitted by akhaliq 4 Using Large Language Models to Accelerate Communication for Users with Severe Motor Impairments · 16 authors 1
Submitted by akhaliq 4 TextGenSHAP: Scalable Post-hoc Explanations in Text Generation with Long Documents · 6 authors 1