-
Self-Rewarding Language Models
Paper ā¢ 2401.10020 ā¢ Published ā¢ 146 -
ReFT: Reasoning with Reinforced Fine-Tuning
Paper ā¢ 2401.08967 ā¢ Published ā¢ 30 -
Tuning Language Models by Proxy
Paper ā¢ 2401.08565 ā¢ Published ā¢ 22 -
TrustLLM: Trustworthiness in Large Language Models
Paper ā¢ 2401.05561 ā¢ Published ā¢ 69
Collections
Discover the best community collections!
Collections including paper arxiv:2402.03300
-
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
Paper ā¢ 2401.02954 ā¢ Published ā¢ 43 -
Qwen Technical Report
Paper ā¢ 2309.16609 ā¢ Published ā¢ 35 -
GPT-4 Technical Report
Paper ā¢ 2303.08774 ā¢ Published ā¢ 5 -
Gemini: A Family of Highly Capable Multimodal Models
Paper ā¢ 2312.11805 ā¢ Published ā¢ 44
-
YAYI 2: Multilingual Open-Source Large Language Models
Paper ā¢ 2312.14862 ā¢ Published ā¢ 14 -
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling
Paper ā¢ 2312.15166 ā¢ Published ā¢ 57 -
TrustLLM: Trustworthiness in Large Language Models
Paper ā¢ 2401.05561 ā¢ Published ā¢ 69 -
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Paper ā¢ 2401.06066 ā¢ Published ā¢ 47
-
openchat/openchat-3.5-1210
Text Generation ā¢ Updated ā¢ 2.23k ā¢ 274 -
MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts
Paper ā¢ 2401.04081 ā¢ Published ā¢ 70 -
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Paper ā¢ 2402.03300 ā¢ Published ā¢ 88 -
Babelscape/rebel-large
Text2Text Generation ā¢ Updated ā¢ 22.3k ā¢ 214
-
A Picture is Worth More Than 77 Text Tokens: Evaluating CLIP-Style Models on Dense Captions
Paper ā¢ 2312.08578 ā¢ Published ā¢ 17 -
ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks
Paper ā¢ 2312.08583 ā¢ Published ā¢ 9 -
Vision-Language Models as a Source of Rewards
Paper ā¢ 2312.09187 ā¢ Published ā¢ 12 -
StemGen: A music generation model that listens
Paper ā¢ 2312.08723 ā¢ Published ā¢ 48
-
Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer
Paper ā¢ 2311.06720 ā¢ Published ā¢ 8 -
System 2 Attention (is something you might need too)
Paper ā¢ 2311.11829 ā¢ Published ā¢ 40 -
TinyGSM: achieving >80% on GSM8k with small language models
Paper ā¢ 2312.09241 ā¢ Published ā¢ 38 -
ReFT: Reasoning with Reinforced Fine-Tuning
Paper ā¢ 2401.08967 ā¢ Published ā¢ 30
-
From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning
Paper ā¢ 2308.12032 ā¢ Published ā¢ 1 -
Know thy corpus! Robust methods for digital curation of Web corpora
Paper ā¢ 2003.06389 ā¢ Published ā¢ 1 -
Self-Alignment with Instruction Backtranslation
Paper ā¢ 2308.06259 ā¢ Published ā¢ 42 -
The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation
Paper ā¢ 2305.06156 ā¢ Published ā¢ 2
-
Attention Is All You Need
Paper ā¢ 1706.03762 ā¢ Published ā¢ 49 -
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning
Paper ā¢ 2307.08691 ā¢ Published ā¢ 8 -
Mixtral of Experts
Paper ā¢ 2401.04088 ā¢ Published ā¢ 157 -
Mistral 7B
Paper ā¢ 2310.06825 ā¢ Published ā¢ 46
-
KwaiYiiMath: Technical Report
Paper ā¢ 2310.07488 ā¢ Published ā¢ 2 -
Forward-Backward Reasoning in Large Language Models for Mathematical Verification
Paper ā¢ 2308.07758 ā¢ Published ā¢ 4 -
Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning
Paper ā¢ 2309.10814 ā¢ Published ā¢ 3 -
MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning
Paper ā¢ 2310.03731 ā¢ Published ā¢ 29