Aritra Roy Gosthipaty PRO

ariG23498

AI & ML interests

Deep Representation Learning

Organizations

Hugging Face, Google, Notebooks-explorers, PyTorch Image Models, Keras, Hugging Test Lab, Hugging Face Fellows, Probing ViTs, TrystAI, PyImageSearch, Keras Dreambooth Event, Hugging Face OSS Metrics, Blog-explorers, ZeroGPU Explorers, kotol, gg-hf, MLX Community, IBM Granite, Open Generative Fill, Social Post Explorers, Hugging Face Discord Community, nltpt, nltpt-q, qrias, Hugging Face Science, open/ acc, wut?, LLM from Scratch

ariG23498's activity

upvoted 2 articles 7 days ago

Mixture of Experts Explained

302

KV Caching Explained: Optimizing Transformer Inference Efficiency

By not-lain
23
upvoted an article 9 days ago

Welcome to Inference Providers on the Hub 🔥

258
upvoted an article 9 days ago

Open-R1: a fully open reproduction of DeepSeek-R1

646
upvoted an article 14 days ago

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

119
upvoted an article 14 days ago

Mastering Long Contexts in LLMs with KVPress

By nvidia and 1 other
59
upvoted an article 15 days ago

Unlocking Longer Generation with Key-Value Cache Quantization

41
upvoted an article 16 days ago

Yay! Organizations can now publish blog Articles

By huggingface and 3 others
30
upvoted 2 articles 21 days ago

Timm ❤️ Transformers: Use any timm model with transformers

39

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

63
upvoted an article 22 days ago

MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era

40
upvoted an article about 1 month ago

Announcing NVIDIA Cosmos World Foundation Models

By mingyuliutw and 1 other
23