VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models Paper β’ 2502.02492 β’ Published 2 days ago β’ 37
Running on Zero 1.47k 1.47k Chat With Janus-Pro-7B π A unified multimodal understanding and generation model.
Negative Token Merging: Image-based Adversarial Feature Guidance Paper β’ 2412.01339 β’ Published Dec 2, 2024 β’ 22
PaliGemma 2: A Family of Versatile VLMs for Transfer Paper β’ 2412.03555 β’ Published Dec 4, 2024 β’ 126