Demystifying Long Chain-of-Thought Reasoning in LLMs Paper β’ 2502.03373 β’ Published about 22 hours ago β’ 18
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper β’ 2502.02737 β’ Published 1 day ago β’ 60
view article Article Ο0 and Ο0-FAST: Vision-Language-Action Models for General Robot Control 3 days ago β’ 67
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. β’ 10 items β’ Updated 8 days ago β’ 86
view article Article MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era By MiniMax-AI β’ 22 days ago β’ 40
DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning Paper β’ 2406.11896 β’ Published Jun 14, 2024 β’ 20
view article Article πΊπ¦ββ¬ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram β’ Jan 2 β’ 39
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. β’ 26 items β’ Updated 29 days ago β’ 551
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction Paper β’ 2412.04454 β’ Published Dec 5, 2024 β’ 60
Falcon3 Collection Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. β’ 40 items β’ Updated 29 days ago β’ 80
view article Article FineWeb2-C: Help Build Better Language Models in Your Language By davanstrien and 5 others β’ Dec 23, 2024 β’ 18
TabuLa-8B Collection Training, eval suite, and model from the paper "Large Scale Transfer Learning for Tabular Data via Language Modeling" https://arxiv.org/abs/2406.12031 β’ 4 items β’ Updated Jun 19, 2024 β’ 11
LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment Paper β’ 2412.04814 β’ Published Dec 6, 2024 β’ 45
Solving Quantitative Reasoning Problems with Language Models Paper β’ 2206.14858 β’ Published Jun 29, 2022 β’ 1