LLM Reasoning Papers Collection Papers to improve reasoning capabilities of LLMs • 20 items • Updated 22 days ago • 113
Reasoning Datasets Collection Distilled synthetic Reasoning datasets • 7 items • Updated 4 days ago • 45
Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch Paper • 2501.18512 • Published 7 days ago • 25
IndicBERT v2 Collection IndicBERT v2 is a multilingual BERT model pretrained on IndicCorp v2, an Indic monolingual corpus of 20.9 billion tokens, covering 24 consitutionally • 4 items • Updated Oct 15, 2024 • 3
IndicLLMSuite Collection Largest Collections of Pretraining and Instruction Finetuning datasets for 22 Indic languages. • 4 items • Updated Nov 5, 2024 • 15
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 6 items • Updated 6 days ago • 34