Intro
The hoyoTTS model is a speech synthesis model designed to reproduce the voices of characters from Genshin Impact and Honkai: Star Rail. It combines the language understanding of BERT with the speech synthesis of VITS to produce natural, fluent speech. Given text input, it generates audio that matches a character's vocal characteristics, for a more immersive gaming experience. v1 has been optimized for naturalness, emotional expression, and synthesis accuracy, bringing the generated voices closer to the original characters' voices.
Demo
https://huggingface.co/spaces/Genius-Society/hoyoTTS
Usage
from modelscope import snapshot_download
# Download the model repository and get the path of the local copy
model_dir = snapshot_download('Genius-Society/hoyoTTS')
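Since snapshot_download returns the path of a local directory, it can help to check what was actually fetched before wiring the model into an application. The following is a minimal sketch, not part of the hoyoTTS project: the helper name list_model_files is hypothetical, and it makes no assumption about which files the repository contains.

```python
from pathlib import Path


def list_model_files(model_dir):
    """Return the relative paths of all files under model_dir, sorted.

    Hypothetical helper for inspecting a downloaded snapshot; it only
    walks the directory and does not load or validate the model.
    """
    root = Path(model_dir)
    return sorted(
        p.relative_to(root).as_posix()
        for p in root.rglob("*")
        if p.is_file()
    )
```

Calling list_model_files(model_dir) with the model_dir obtained above returns a list of paths relative to the download directory, which can be printed one per line.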
Maintenance
git clone [email protected]:Genius-Society/hoyoTTS
cd hoyoTTS
Dataset
https://huggingface.co/datasets/Genius-Society/hoyoTTS
Mirror
https://www.modelscope.cn/models/Genius-Society/hoyoTTS
Reference
[1] https://www.modelscope.cn/studios/erythrocyte/Bert-VITS2_Genshin_TTS