Submitted by alexchen4ai 48 Octo-planner: On-device Language Model for Planner-Action Agents · 4 authors 5
Submitted by zwcolin 29 CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs · 13 authors 2
Submitted by BestWishYsh 20 ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation · 10 authors 3
Submitted by kamanphoebe 16 A Closer Look into Mixture-of-Experts in Large Language Models · 5 authors 2
Submitted by yuchenlin 13 WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs · 8 authors 1
Submitted by jiho283 13 EHRCon: Dataset for Checking Consistency between Unstructured Notes and Structured Tables in Electronic Health Records · 9 authors 7
Submitted by haoningwu 12 MatchTime: Towards Automatic Soccer Game Commentary Generation · 5 authors 4
Submitted by Zhiqiang007 11 Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models · 8 authors 1
Submitted by roeiherz 9 Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning · 6 authors 1
Submitted by liweijiang 9 WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models · 11 authors 1
Submitted by lastweek 4 MemServe: Context Caching for Disaggregated LLM Serving with Elastic Memory Pool · 11 authors 1