This model is a fine-tuned version of SpeechT5 for the Ukrainian language, using the Common Voice dataset.
Usage:
!pip install git+https://github.com/huggingface/transformers
from transformers import SpeechT5Processor, SpeechT5ForTextToSpeech, SpeechT5ForSpeechToSpeech, SpeechT5HifiGan
processor = SpeechT5Processor.from_pretrained("microsoft/speecht5_tts")
model = SpeechT5ForTextToSpeech.from_pretrained("tenebrisu/speecht5_tts_common_voice_uk")
vocoder = SpeechT5HifiGan.from_pretrained("microsoft/speecht5_hifigan")
speaker_embeddings = 2 * torch.rand((1, 512)) - 1
text = """ pryvit yak spravy """
inputs = processor(text=text, return_tensors="pt")
speech = model.generate_speech(inputs["input_ids"], speaker_embeddings, output_cross_attentions=True)
waveform = vocoder.forward(speech[0])
Transliteration Table:
To support the transliteration of Ukrainian characters, the following table is used:
Ukrainian | Transliteration |
---|---|
Є | je |
І | i |
Ї | ji |
А | a |
Б | b |
В | v |
Г | h |
Д | d |
Е | e |
Ж | zh |
З | z |
И | y |
Й | j |
К | k |
Л | l |
М | m |
Н | n |
О | o |
П | p |
Р | r |
С | s |
Т | t |
У | u |
Ф | f |
Х | x |
Ц | c |
Ч | ch |
Ш | sh |
Щ | shch |
Ь | q |
Ю | ju |
Я | ja |
а | a |
б | b |
в | v |
г | h |
д | d |
е | e |
ж | zh |
з | z |
и | y |
й | j |
к | k |
л | l |
м | m |
н | n |
о | o |
п | p |
р | r |
с | s |
т | t |
у | u |
ф | f |
х | x |
ц | c |
ч | ch |
ш | sh |
щ | shch |
ь | q |
ю | ju |
я | ja |
є | je |
і | i |
ї | ji |
Ґ | g |
ґ | g |
- Downloads last month
- 99
Inference Providers
NEW
This model is not currently available via any of the supported third-party Inference Providers, and
the model is not deployed on the HF Inference API.
Model tree for tenebrisu/speecht5_tts_common_voice_uk
Base model
microsoft/speecht5_tts