FitDiT is a high-fidelity virtual try-on model.
Video super-resolution with a text-to-video model
Generate custom images using LoRA models
Generate audio from text with tuning options
Generate depth maps from images
Generate a short video from an image