MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published 23 days ago • 272
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 645
HGRN2 Collection HGRN2: Gated Linear RNNs with State Expansion • 2 items • Updated Jun 25, 2024 • 2
HGRN2 Collection HGRN2: Gated Linear RNNs with State Expansion • 2 items • Updated Jun 25, 2024 • 2
TransNormerLLM Collection TransNormerLLM: A Faster and Better Large Language Model with Improved TransNormer • 11 items • Updated Jun 25, 2024 • 3
Scaling Laws for Linear Complexity Language Models Paper • 2406.16690 • Published Jun 24, 2024 • 23
HGRN2 Collection HGRN2: Gated Linear RNNs with State Expansion • 2 items • Updated Jun 25, 2024 • 2
HGRN2 Collection HGRN2: Gated Linear RNNs with State Expansion • 2 items • Updated Jun 25, 2024 • 2