SliderSpace: Decomposing the Visual Capabilities of Diffusion Models Paper • 2502.01639 • Published 3 days ago • 22
SafeRAG: Benchmarking Security in Retrieval-Augmented Generation of Large Language Model Paper • 2501.18636 • Published 9 days ago • 25
Temporal Preference Optimization for Long-Form Video Understanding Paper • 2501.13919 • Published 14 days ago • 21
RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques Paper • 2501.14492 • Published 13 days ago • 29
Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models Paper • 2501.12370 • Published 16 days ago • 10
iFormer: Integrating ConvNet and Transformer for Mobile Application Paper • 2501.15369 • Published 12 days ago • 10
ARWKV: Pretrain is not what we need, an RNN-Attention-Based Language Model Born from Transformer Paper • 2501.15570 • Published 11 days ago • 23
Emilia: A Large-Scale, Extensive, Multilingual, and Diverse Dataset for Speech Generation Paper • 2501.15907 • Published 10 days ago • 15
Towards General-Purpose Model-Free Reinforcement Learning Paper • 2501.16142 • Published 10 days ago • 24