Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models Paper • 2402.14207 • Published Feb 22, 2024 • 6
view article Article 🚀 Deploying OLMo-7B with Text Generation Inference (TGI) on Hugging Face Spaces By ariG23498 • 4 days ago • 5
LLM Reasoning Papers Collection Papers to improve reasoning capabilities of LLMs • 20 items • Updated 22 days ago • 113
Reasoning Datasets Collection Distilled synthetic Reasoning datasets • 7 items • Updated 4 days ago • 45
SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer Paper • 2501.18427 • Published 7 days ago • 16
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 6 items • Updated 6 days ago • 34
Large Language Models Think Too Fast To Explore Effectively Paper • 2501.18009 • Published 8 days ago • 22
view article Article Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial By open-r1 • 6 days ago • 29
view article Article PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs By samuellimabraz • 13 days ago • 12
view article Article KV Caching Explained: Optimizing Transformer Inference Efficiency By not-lain • 8 days ago • 23
view article Article The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about... By srinivasbilla • 17 days ago • 56