Reasoning Datasets Collection Distilled synthetic Reasoning datasets • 7 items • Updated 4 days ago • 44
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated 10 days ago • 320
Article Introducing the Synthetic Data Generator - Build Datasets with Natural Language Dec 16, 2024 • 95
Search-o1: Agentic Search-Enhanced Large Reasoning Models Paper • 2501.05366 • Published 28 days ago • 90
Article Train 400x faster Static Embedding Models with Sentence Transformers 23 days ago • 135
InternVL2.5-MPO Collection Enhancing the Reasoning Ability of MLLMs via Mixed Preference Optimization • 16 items • Updated 8 days ago • 26
Article Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳 By merve • Aug 25, 2023 • 27
NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data Paper • 2402.15343 • Published Feb 23, 2024 • 13
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 132
Qwen2.5-Math Collection Math-specific model series based on Qwen2.5 • 11 items • Updated 23 days ago • 68
OLMo 2 Collection Artifacts for the second set of OLMo models. • 22 items • Updated about 1 month ago • 81
Common Models Collection The first generation of models pretrained on Common Corpus. • 5 items • Updated Dec 5, 2024 • 29