Submitted by akhaliq 62 DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence · 40 authors 3
Submitted by yulunliu 50 Depth Anywhere: Enhancing 360 Monocular Depth Estimation via Perspective Distillation and Unlabeled Data Augmentation · 2 authors 2
Submitted by akhaliq 32 ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools · 56 authors 2
Submitted by akhaliq 30 VoCo-LLaMA: Towards Vision Compression with Large Language Models · 6 authors 10
Submitted by bdqnghi 27 AgileCoder: Dynamic Collaborative Agents for Software Development based on Agile Methodology · 4 authors 2
Submitted by samyadeepbasu 21 From RAGs to rich parameters: Probing how language models utilize external knowledge over parametric information for factual queries · 9 authors 2
Submitted by zhihz0535 19 Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning · 7 authors 1
Submitted by hughesthe1st 17 RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content · 9 authors 1
Submitted by rimahazra 16 Safety Arithmetic: A Framework for Test-time Safety Alignment of Language Models by Steering Parameters and Activations · 4 authors 4
Submitted by rimahazra 15 SafeInfer: Context Adaptive Decoding Time Safety Alignment for Large Language Models · 6 authors 3
Submitted by tennant 15 Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning · 4 authors 5
Submitted by akhaliq 14 OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI · 28 authors 2
Submitted by paulpanwang 12 HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors · 9 authors 1
Submitted by davanstrien 9 Large Scale Transfer Learning for Tabular Data via Language Modeling · 3 authors 1
Submitted by rezashkv 8 Not All Prompts Are Made Equal: Prompt-based Pruning of Text-to-Image Diffusion Models · 4 authors 1
Submitted by shanchen 8 Language Models are Surprisingly Fragile to Drug Names in Biomedical Benchmarks · 10 authors 1
Submitted by mega 7 Estimating Knowledge in Large Language Models Without Generating a Single Token · 2 authors 1
Submitted by dongwonjo 7 Mixture of Scales: Memory-Efficient Token-Adaptive Binarization for Large Language Models · 4 authors 1
Submitted by jiachenli-ucsb 7 BPO: Supercharging Online Preference Learning by Adhering to the Proximity of Behavior LLM · 4 authors 1
Submitted by Timmli 6 From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline · 8 authors 1
Submitted by jinggu 5 VIA: A Spatiotemporal Video Adaptation Framework for Global and Local Video Editing · 7 authors 1
Submitted by Keven16 4 Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization · 5 authors 2
Submitted by chenfengx 4 Immiscible Diffusion: Accelerating Diffusion Training with Noise Assignment · 6 authors 1
Submitted by amanchadha 4 Hierarchical Prompting Taxonomy: A Universal Evaluation Framework for Large Language Models · 5 authors 1
Submitted by akhaliq 4 JEN-1 DreamStyler: Customized Musical Concept Learning via Pivotal Parameters Tuning · 4 authors 2