Zyphra

company

https://www.zyphra.com/

ZyphraAI

AI & ML interests

None defined yet.

Recent Activity

pglo updated a model about 22 hours ago

Zyphra/Zamba2-1.2B-instruct

pglo updated a model 1 day ago

Zyphra/Zamba2-2.7B

pglo updated a model 1 day ago

Zyphra/Zamba2-1.2B

Zyphra's activity

pglo

updated a model about 22 hours ago

Zyphra/Zamba2-1.2B-instruct

Updated about 22 hours ago • 110 • 24

pglo

updated 5 models 1 day ago

BerenMillidge

authored 5 papers 6 months ago

BlackMamba: Mixture of Experts for State-Space Models

Paper • 2402.01771 • Published Feb 1, 2024 • 24

A Theoretical Framework for Inference Learning

Paper • 2206.00164 • Published Jun 1, 2022

A Stable, Fast, and Fully Automatic Learning Algorithm for Predictive Coding Networks

Paper • 2212.00720 • Published Nov 16, 2022

Zyda: A 1.3T Dataset for Open Language Modeling

Paper • 2406.01981 • Published Jun 4, 2024 • 3

Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters

Paper • 2408.04093 • Published Aug 7, 2024 • 4

jpilault

authored a paper 6 months ago

Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters

Paper • 2408.04093 • Published Aug 7, 2024 • 4

v2shyam

authored a paper 6 months ago

Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters

Paper • 2408.04093 • Published Aug 7, 2024 • 4

BerenMillidge

authored a paper 8 months ago

Zamba: A Compact 7B SSM Hybrid Model

Paper • 2405.16712 • Published May 26, 2024 • 23

jpilault

authored a paper 8 months ago

Zamba: A Compact 7B SSM Hybrid Model

Paper • 2405.16712 • Published May 26, 2024 • 23

yury-zyphra

authored a paper 8 months ago

Zamba: A Compact 7B SSM Hybrid Model

Paper • 2405.16712 • Published May 26, 2024 • 23

qanthony-z

authored a paper 8 months ago

Zamba: A Compact 7B SSM Hybrid Model

Paper • 2405.16712 • Published May 26, 2024 • 23

pglo

authored a paper 8 months ago

Zamba: A Compact 7B SSM Hybrid Model

Paper • 2405.16712 • Published May 26, 2024 • 23

pglo

authored a paper 11 months ago

The Unreasonable Ineffectiveness of the Deeper Layers

Paper • 2403.17887 • Published Mar 26, 2024 • 79

qanthony-z

authored a paper 12 months ago

Exploiting Inter-Layer Expert Affinity for Accelerating Mixture-of-Experts Model Inference

Paper • 2401.08383 • Published Jan 16, 2024 • 1

Team members 17
