T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward Feedback

4-step Text-to-video Generation

With the style of low-poly game art, A majestic, white horse gallops gracefully across a moonlit beach. medium shot of Christine, a beautiful 25-year-old brunette resembling Selena Gomez, anxiously looking up as she walks down a New York street, cinematic style a cartoon pig playing his guitar, Andrew Warhol style
a dog wearing vr goggles on a boat Pikachu snowboarding a girl floating underwater

Model description πŸš€

This repository contains unet_lora.pt that can turn VideoCrafter2 into our T2V-Turbo (VC2). Our T2V-Turbo (VC2) can achieve both fast and high-quality T2V generation. On VBench, the 4-step generation from our T2V-Turbo (VC2) even outperform proprietary systems, including Gen-2 and Pika. Please refer to our GitHub repo for detailed instructions.

Misuse, Malicious Use and Excessive Use πŸ“–

Our model is meant for research purposes.

  • It is prohibited to generate content that is demeaning or harmful to people or their environment, culture, religion, etc.
  • Prohibited for pornographic, violent and bloody content generation.
  • Prohibited for error and false information generation.
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.

Spaces using jiachenli-ucsb/T2V-Turbo-VC2 2

Collection including jiachenli-ucsb/T2V-Turbo-VC2