Submitted by akhaliq:
MADLAD-400: A Multilingual And Document-Level Large Audited Dataset · 11 authors
When Less is More: Investigating Data Pruning for Pretraining LLMs at Scale · 6 authors
Natural Language Supervision for General-Purpose Audio Representations · 3 authors
Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs · 6 authors
FIAT: Fusing learning paradigms with Instruction-Accelerated Tuning · 3 authors