arxiv:2501.06282
zhihaodu
zhihaodu
AI & ML interests
Audio Generation, Audio Understanding, Speech Enhancement
Recent Activity
new activity
about 13 hours ago
TTS-AGI/TTS-Arena:How does TTS Arena work across different models?
new activity
17 days ago
FunAudioLLM/CosyVoice2-0.5B:Portuguese Language of Portugal
upvoted
a
paper
20 days ago
MinMo: A Multimodal Large Language Model for Seamless Voice Interaction