Intro

The hoyoTTS Model is an artificial intelligence speech synthesis model designed to simulate the voice of characters in Genshin and Star Rail. The model is based on advanced deep learning technology, combining the language understanding capability of BERT and the speech synthesis technology of VITS. It realizes highly natural and smooth speech output. Based on the text input, it can generate voices that match the voice characteristics of the characters, providing an immersive gaming experience. v1 has been optimized in terms of voice naturalness, emotional expression, and the accuracy of voice synthesis, making the character voices closer to the original character's voice characteristics, and bringing players a more realistic and pleasurable auditory enjoyment.

Demo

https://huggingface.co/spaces/Genius-Society/hoyoTTS

Usage

from modelscope import snapshot_download
model_dir = snapshot_download('Genius-Society/hoyoTTS')

Maintenance

git clone [email protected]:Genius-Society/hoyoTTS
cd hoyoTTS

Dataset

https://huggingface.co/datasets/Genius-Society/hoyoTTS

Mirror

https://www.modelscope.cn/models/Genius-Society/hoyoTTS

Reference

[1] https://www.modelscope.cn/studios/erythrocyte/Bert-VITS2_Genshin_TTS

Genius-Society
/

hoyoTTS