Article Fine-tune ModernBERT for RAG with Synthetic Data By sdiazlor and 2 others • 17 days ago • 33
Explaining Large Language Models Decisions Using Shapley Values Paper • 2404.01332 • Published Mar 29, 2024 • 1
Preference Leakage: A Contamination Problem in LLM-as-a-judge Paper • 2502.01534 • Published 3 days ago • 34
Model2Vec base models Collection These are the MinishLab Model2Vec base models. Load them and use them with model2vec (https://github.com/MinishLab/model2vec) or sentence-transformers • 9 items • Updated 8 days ago • 9
POTION Collection These are the flagship POTION models. Load them and use them with model2vec (https://github.com/MinishLab/model2vec) or sentence-transformers • 5 items • Updated 3 days ago • 10
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 15 days ago • 301
WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training Paper • 2501.18511 • Published 7 days ago • 17
Reasoning Datasets Collection Distilled synthetic Reasoning datasets • 7 items • Updated 4 days ago • 44
CritiqueFineTuning Collection The dataset and models for CritiqueFineTuning • 4 items • Updated 5 days ago • 1
Tulu 3 Models Collection All models released with Tulu 3 -- state-of-the-art open post-training recipes. • 10 items • Updated 8 days ago • 86
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate Paper • 2501.17703 • Published 8 days ago • 50
Article Use Models from the Hugging Face Hub in LM Studio By yagilb • Nov 28, 2024 • 136
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass Paper • 2501.13928 • Published 14 days ago • 16