Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
140.7
TFLOPS
3
4
15
Fused Ion
fusedion
Follow
21world's profile picture
1 follower
ยท
21 following
AI & ML interests
None yet
Recent Activity
reacted
to
schuler
's
post
with ๐
about 7 hours ago
๐ข New Research Alert: Making Language Models Smaller & Smarter! Thrilled to share the latest technical report demonstrating how to reduce language model parameters by 77% while maintaining performance. The secret? Grouped pointwise convolutions. Yes. We brought a method from computer vision to the transformers arena. ๐ Key Findings: โข 77% parameter reduction. โข Maintained model capabilities. โข Improved generalization. Paper: https://www.researchgate.net/publication/388835829_SAVING_77_OF_THE_PARAMETERS_IN_LARGE_LANGUAGE_MODELS_TECHNICAL_REPORT Code: https://github.com/joaopauloschuler/less-parameters-llm
reacted
to
schuler
's
post
with ๐ฅ
1 day ago
๐ข New Research Alert: Making Language Models Smaller & Smarter! Thrilled to share the latest technical report demonstrating how to reduce language model parameters by 77% while maintaining performance. The secret? Grouped pointwise convolutions. Yes. We brought a method from computer vision to the transformers arena. ๐ Key Findings: โข 77% parameter reduction. โข Maintained model capabilities. โข Improved generalization. Paper: https://www.researchgate.net/publication/388835829_SAVING_77_OF_THE_PARAMETERS_IN_LARGE_LANGUAGE_MODELS_TECHNICAL_REPORT Code: https://github.com/joaopauloschuler/less-parameters-llm
new
activity
8 days ago
DAMO-NLP-SG/VideoLLaMA3-7B:
Max video length
View all activity
Organizations
None yet
spaces
1
Runtime error
Psmathur-orca Mini 13b
๐
models
None public yet
datasets
None public yet