9 88 587

Anthonny OLIME

Citaman

Citaman

AI & ML interests

None yet

Recent Activity

upvoted an article 4 days ago

Open-R1: Update #1

updated a collection 7 days ago

Keep in Mind's Model

updated a collection 9 days ago

omni models

View all activity

Organizations

Citaman's activity

upvoted an article 4 days ago

Article

Open-R1: Update #1

and 7 others •

5 days ago

• 237

updated a collection 7 days ago

Keep in Mind's Model

Collection

50 items • Updated 7 days ago

updated a collection 9 days ago

omni models

Collection

2 items • Updated 9 days ago

upvoted a paper 9 days ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published 17 days ago • 63

upvoted an article 9 days ago

Article

Welcome to Inference Providers on the Hub 🔥

10 days ago

• 261

liked 5 models 9 days ago

liked 3 models 10 days ago

Qwen/Qwen2.5-VL-72B-Instruct

Image-Text-to-Text • Updated 10 days ago • 34.5k • 222

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • Updated about 14 hours ago • 307k • 328

Qwen/Qwen2.5-VL-3B-Instruct

Image-Text-to-Text • Updated 2 days ago • 97.6k • 161

replied to AdinaY's post 10 days ago

fewer hour short for https://huggingface.co/deepseek-ai/Janus-Pro-1B

liked 4 models 10 days ago

unsloth/DeepSeek-R1-GGUF

Text Generation • Updated 7 days ago • 405k • 585

deepseek-ai/Janus-Pro-7B

Any-to-Any • Updated 5 days ago • 223k • 2.66k

deepseek-ai/Janus-Pro-1B

Any-to-Any • Updated 5 days ago • 64.9k • 336

THUDM/glm-4-9b-chat-1m-hf

Text Generation • Updated 11 days ago • 75 • 7

liked a dataset 10 days ago

THUDM/T1

Viewer • Updated 17 days ago • 10k • 32 • 3

upvoted a paper 10 days ago

Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling

Paper • 2501.11651 • Published 17 days ago • 1