Submitted by akhaliq 52 Seed-Music: A Unified Framework for High Quality and Controlled Music Generation · 38 authors 3
Submitted by iofu728 41 RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval · 14 authors 2
Submitted by emanuelevivoli 24 One missing piece in Vision and Language: A Survey on Comics Understanding · 6 authors 2
Submitted by ZCODE0 15 Ferret: Federated Full-Parameter Tuning at Scale for Large Language Models · 5 authors 2
Submitted by Sreyan88 12 ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds · 6 authors 2
Submitted by amanchadha 8 Guiding Vision-Language Model Selection for Visual Question-Answering Across Tasks, Domains, and Knowledge Types · 3 authors 2
Submitted by Swtheking 6 Policy Filtration in RLHF to Fine-Tune LLM for Code Generation · 2 authors 2
Submitted by dek924 4 Towards Predicting Temporal Changes in a Patient's Chest X-ray Images based on Electronic Health Records · 4 authors 2
Submitted by IAMJB 3 LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study · 3 authors 1
Submitted by beeformer 2 beeFormer: Bridging the Gap Between Semantic and Interaction Similarity in Recommender Systems · 3 authors 2