PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated Dec 13, 2024 • 134
BhasaAnuvaad Collection A Speech Translation Dataset for 13 Indian Languages • 11 items • Updated 21 days ago • 14
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated about 5 hours ago • 214
Article Fine-tuning LLMs to 1.58bit: extreme quantization made easy Sep 18, 2024 • 216
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second Paper • 2410.02073 • Published Oct 2, 2024 • 40
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated about 1 month ago • 294
🎯DART-Math Collection Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving [NeurIPS 2024] @ https://github.com/hkust-nlp/dart-math • 20 items • Updated Sep 26, 2024 • 6
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 566
Article ColPali: Efficient Document Retrieval with Vision Language Models By manu • Jul 5, 2024 • 196
Parler-TTS: fully open-source high-quality TTS Collection If you want to find out more about how these models were trained and even fine-tune them yourself, check out the Parler-TTS repository on GitHub. • 8 items • Updated Dec 2, 2024 • 50
ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving Paper • 2309.17452 • Published Sep 29, 2023 • 3
AIMO Progress Prize Collection Models and datasets used in the winning solution to the AIMO 1st Progress Prize • 7 items • Updated Jul 19, 2024 • 11