This is a text-to-video model for diffusers, fine-tuned from the ModelScope text-to-video model to produce anime-style output.
It was trained at 448x384 resolution.
Usage is the same as for the original ModelScope model.
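
Because usage follows the original ModelScope model, loading through diffusers might look like the minimal sketch below. It rests on several assumptions not stated in this card: `MODEL_ID` is a hypothetical placeholder for this model's repository ID, "448x384" is read as width x height, and the `num_frames` and `num_inference_steps` defaults are illustrative only. The helper names `build_call_kwargs` and `generate` are made up for this example, and the exact shape of the returned frames can differ between diffusers versions.

```python
# Assumption: "448x384" in this card is read as width x height.
TRAIN_WIDTH = 448
TRAIN_HEIGHT = 384

# Hypothetical placeholder: replace with this model's actual repository ID.
MODEL_ID = "your-namespace/your-anime-text2video-model"


def build_call_kwargs(prompt, num_frames=16, num_inference_steps=25):
    """Collect pipeline call arguments that match the training resolution."""
    return {
        "prompt": prompt,
        "width": TRAIN_WIDTH,
        "height": TRAIN_HEIGHT,
        "num_frames": num_frames,  # illustrative default, not from the card
        "num_inference_steps": num_inference_steps,  # illustrative default
    }


def generate(prompt, model_id=MODEL_ID):
    """Load the pipeline and run it; needs diffusers, torch, and a CUDA GPU."""
    # Imported lazily so build_call_kwargs works without these packages.
    import torch
    from diffusers import DiffusionPipeline

    pipe = DiffusionPipeline.from_pretrained(model_id, torch_dtype=torch.float16)
    pipe = pipe.to("cuda")
    # The layout of .frames depends on the installed diffusers version.
    return pipe(**build_call_kwargs(prompt)).frames


if __name__ == "__main__":
    # Only prints the call arguments; call generate(...) to run the model.
    print(build_call_kwargs("an anime girl walking in the rain"))
```

Generating at the training resolution is a reasonable starting point, since output at other sizes may drift from what the model saw during fine-tuning.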

The only significant difference from version 0.1 is the resolution.
