BigVGAN Collection BigVGAN is a universal neural vocoder that generates audio waveform using mel spectrogram as input. β’ 11 items β’ Updated 21 days ago β’ 11
FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs Paper β’ 2407.04051 β’ Published Jul 4, 2024 β’ 36
Restarting on CPU Upgrade 107 107 Open Chinese LLM Leaderboard π Display and filter LLM benchmark results
Standard-format-preference-dataset Collection We collect the open-source datasets and process them into the standard format. β’ 14 items β’ Updated May 8, 2024 β’ 24
Salesforce/xgen-mm-phi3-mini-instruct-r-v1 Image-Text-to-Text β’ Updated 4 days ago β’ 1.26k β’ 186