State-of-the-art open-vocabulary image segmentation ⚡️
Generate text from audio recordings
Transcribe or translate audio and YouTube videos