This is a text-to-video model for diffusers, fine-tuned from the ModelScope text-to-video model to produce anime-style output.
It was trained at 448x384 resolution.
The usage is the same as with the original modelscope model.
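
Since the usage matches the original ModelScope model, loading it through diffusers might look like the sketch below. This is an illustration, not a confirmed recipe: `MODEL_ID` is a hypothetical placeholder for this repository's ID, the step and frame counts are arbitrary defaults, and reading 448x384 as width x height is an assumption.

```python
# Hypothetical placeholder: replace with this repository's actual model ID.
MODEL_ID = "your-namespace/your-anime-text2video-model"

# Assumption: "448x384" in the model card means width x height.
TRAIN_WIDTH = 448
TRAIN_HEIGHT = 384


def build_generation_kwargs(prompt, num_frames=16, num_inference_steps=25):
    """Collect pipeline arguments that match the training resolution."""
    return {
        "prompt": prompt,
        "width": TRAIN_WIDTH,
        "height": TRAIN_HEIGHT,
        "num_frames": num_frames,
        "num_inference_steps": num_inference_steps,
    }


def generate(prompt, model_id=MODEL_ID):
    """Generate a clip and return the path of the exported video file."""
    # Imported here so the helper above works without torch or diffusers.
    import torch
    from diffusers import DiffusionPipeline
    from diffusers.utils import export_to_video

    pipe = DiffusionPipeline.from_pretrained(model_id, torch_dtype=torch.float16)
    pipe.enable_model_cpu_offload()  # lowers GPU memory use; requires accelerate

    result = pipe(**build_generation_kwargs(prompt))
    # Recent diffusers releases return a batch of videos, so the first clip is
    # frames[0]; older releases return the frame list directly as result.frames.
    return export_to_video(result.frames[0])


# Example call (needs a GPU and a real model ID):
# print(generate("1girl, walking in a forest, anime style"))
```

Generating at the training resolution is a reasonable starting point; other sizes may work, but the model card does not say how they behave.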

The only major difference from version 0.1 is the resolution.
