Text-to-Video
Virtual try-on for clothes on a person
Engage in multi-modal conversations with images and videos