dumball's picture

dumball

archit11

AI & ML interests

small language models, looking for work please reachout [email protected]

Recent Activity

Organizations

Literally Me FRFR Research Society's profile picture Blog-explorers's profile picture ZeroGPU Explorers's profile picture IndiaBuild's profile picture Hugging Face Discord Community's profile picture

archit11's activity

upvoted an article 1 day ago
view article
Article

The case for specialized pre-training: ultra-fast foundation models for dedicated tasks

By Pclanglais β€’
β€’ 29
New activity in ubermenchh/SmolLM2-DPO 5 days ago

details pls

1
#1 opened 5 days ago by
archit11
upvoted an article 6 days ago
view article
Article

How to deploy and fine-tune DeepSeek models on AWS

β€’ 34
upvoted an article 8 days ago
view article
Article

Can we create pedagogically valuable multi-turn synthetic datasets from Cosmopedia?

By davanstrien β€’
β€’ 8
upvoted an article 20 days ago
view article
Article

Train 400x faster Static Embedding Models with Sentence Transformers

β€’ 136