Analyze and reconstruct videos using mask-based models
Transcribe speech to text
Load and display a Hugging Face Space