Animate Your Motion: Turning Still Images into Dynamic Videos Paper • 2403.10179 • Published Mar 15, 2024 • 3
Making Flow-Matching-Based Zero-Shot Text-to-Speech Laugh as You Like Paper • 2402.07383 • Published Feb 12, 2024 • 15
Temporal Preference Optimization Collection Temporal Preference Optimization for Long-form Video Understanding • 3 items • Updated 23 days ago • 4
Textoon: Generating Vivid 2D Cartoon Characters from Text Descriptions Paper • 2501.10020 • Published 25 days ago • 22
view article Article MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era By MiniMax-AI • 27 days ago • 40
EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss Paper • 2402.05008 • Published Feb 7, 2024 • 22
MangaNinja: Line Art Colorization with Precise Reference Following Paper • 2501.08332 • Published 28 days ago • 56
Apollo: An Exploration of Video Understanding in Large Multimodal Models Paper • 2412.10360 • Published Dec 13, 2024 • 139