Article • KV Caching Explained: Optimizing Transformer Inference Efficiency • By not-lain • 7 days ago • 23
Article • SmolVLM Grows Smaller – Introducing the 250M & 500M Models! • 15 days ago • 119
Article • Train 400x faster Static Embedding Models with Sentence Transformers • 23 days ago • 134
Article • Fine-tune ModernBERT for RAG with Synthetic Data • By sdiazlor and 2 others • 17 days ago • 33
Towards Best Practices for Open Datasets for LLM Training • Paper • 2501.08365 • Published 23 days ago • 53
Article • Crowd-sourced Open Preference Dataset for Text-to-Image Generation • By RapidataAI and 4 others • 30 days ago • 18
Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models • Paper • 2412.09645 • Published Dec 10, 2024 • 35
RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation • Paper • 2412.11919 • Published Dec 16, 2024 • 33
Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation • Paper • 2412.03304 • Published Dec 4, 2024 • 17
Article • 🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs • By wolfram • Dec 4, 2024 • 76
Article • Use Models from the Hugging Face Hub in LM Studio • By yagilb • Nov 28, 2024 • 136
Article • To what extent are we responsible for our content and how to create safer Spaces? • By davidberenstein1957 • Aug 30, 2024 • 4
Article • Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK • By davidberenstein1957 and 1 other • Nov 21, 2024 • 35
Article • How to optimize your data labelling project with custom interfaces • By burtenshaw and 9 others • Oct 16, 2024 • 18
Article • How to build a custom text classifier without days of human labeling • By sdiazlor and 4 others • Oct 17, 2024 • 55
Article • Model2Vec: Distill a Small Fast Model from any Sentence Transformer • By Pringled and 1 other • Oct 14, 2024 • 68