Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
3
Yeyun Gong
yegong
Follow
21world's profile picture
1 follower
ยท
1 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
11 days ago
Optimizing Large Language Model Training Using FP4 Quantization
authored
a paper
18 days ago
Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
upvoted
a
paper
18 days ago
Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
View all activity
Organizations
None yet
Papers
6
arxiv:
2501.13629
arxiv:
2410.15748
arxiv:
2405.07526
arxiv:
2404.07965
Expand 6 papers
models
None public yet
datasets
None public yet