Dan Busbridge
dbusbridge
2 followers · 1 following
danbusbridge
dbusbridge
AI & ML interests
Deep learning, optimization, self-supervised learning, representation learning, large language modeling, equivariance, geometric deep learning, attention mechanisms, transformers
Recent Activity
Authored a paper 9 days ago: Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models
Authored a paper 5 months ago: Theory, Analysis, and Best Practices for Sigmoid Self-Attention
Commented on a paper over 1 year ago: How to Scale Your EMA
Organizations
dbusbridge's activity
Commented on 2 papers over 1 year ago
How to Scale Your EMA • Paper 2307.13813 • Published Jul 25, 2023 • 9 • 4