MiniMax-01: Scaling Foundation Models with Lightning Attention Paper โข 2501.08313 โข Published 23 days ago โข 272
DeepSeek R1 (All Versions) Collection DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. โข 29 items โข Updated 1 day ago โข 150