13 222 900

Reza Sayar PRO

Reza2kn

AI & ML interests

None yet

Recent Activity

upvoted a paper less than a minute ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

liked a model about 15 hours ago

lerobot/pi0

liked a Space about 15 hours ago

Remsky/Kokoro-TTS-Zero

View all activity

Organizations

Reza2kn's activity

upvoted a paper less than a minute ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 2 days ago • 70

upvoted an article about 15 hours ago

Article

SmolVLM - small yet mighty Vision Language Model

Nov 26, 2024

• 181

upvoted a collection about 15 hours ago

SmolVLM 256M & 500M

Collection

Collection for models & demos for even smoller SmolVLM release • 12 items • Updated 14 days ago • 65

upvoted a paper about 15 hours ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published 3 days ago • 53

upvoted 3 papers about 16 hours ago

AIN: The Arabic INclusive Large Multimodal Model

Paper • 2502.00094 • Published 6 days ago • 15

Preference Leakage: A Contamination Problem in LLM-as-a-judge

Paper • 2502.01534 • Published 3 days ago • 34

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published 4 days ago • 152

upvoted 2 papers 1 day ago

DeepRAG: Thinking to Retrieval Step by Step for Large Language Models

Paper • 2502.01142 • Published 3 days ago • 15

PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models

Paper • 2502.01584 • Published 3 days ago • 7

upvoted an article 1 day ago

Article

DABStep: Data Agent Benchmark for Multi-step Reasoning

3 days ago

• 26

upvoted a paper 2 days ago

A Study on the Performance of U-Net Modifications in Retroperitoneal Tumor Segmentation

Paper • 2502.00314 • Published 6 days ago • 3

upvoted 2 articles 2 days ago

Article

Smol but Mighty: Can Small Models Reason well? 🤔

•

2 days ago

• 6

Article

Open-source DeepResearch – Freeing our search agents

3 days ago

• 648

upvoted an article 4 days ago

Article

Open-R1: Update #1

and 7 others •

5 days ago

• 237

upvoted a paper 5 days ago

o3-mini vs DeepSeek-R1: Which One is Safer?

Paper • 2501.18438 • Published 7 days ago • 21

upvoted 2 articles 5 days ago

Article

LLM Dataset Formats 101: A No‐BS Guide for Hugging Face Devs

•

6 days ago

• 4

Article

The AI tools for Art Newsletter - Issue 1

7 days ago

• 44

upvoted a collection 6 days ago

WildChat-50m

Collection

All model responses associated with the WildChat-50m paper. • 55 items • Updated 8 days ago • 6

upvoted an article 7 days ago

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

•

8 days ago

• 23

upvoted a paper 7 days ago

Atla Selene Mini: A General Purpose Evaluation Model

Paper • 2501.17195 • Published 10 days ago • 30