Hibiki fr-en Collection Hibiki is a model for streaming speech translation , which can run on device! See https://github.com/kyutai-labs/hibiki. • 5 items • Updated 5 days ago • 44
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published 14 days ago • 101
Reasoning Datasets Collection Distilled synthetic Reasoning datasets • 7 items • Updated 9 days ago • 50
Eagle 2 Collection Eagle 2 is a family of frontier vision-language models with vision-centric design. The model supports 4K HD input, long-context video, and grounding. • 9 items • Updated 19 days ago • 31
view article Article Train 400x faster Static Embedding Models with Sentence Transformers 28 days ago • 142
Deepthink and Reasoning Collection Best for Deepthink and Reasoning • 14 items • Updated 18 days ago • 16
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 134
Bamba Collection Collection of Bamba - hybrid Mamba2 model architecture based models trained on open data • 8 items • Updated Dec 18, 2024 • 17
Falcon3 Collection Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated Jan 8 • 80