40 419 563

Sugato Ray PRO

sugatoray

https://linkedin.com/in/sugatoray

AI & ML interests

None yet

Recent Activity

upvoted an article about 17 hours ago

Fine-tune ModernBERT for RAG with Synthetic Data

upvoted a paper 1 day ago

Explaining Large Language Models Decisions Using Shapley Values

updated a collection 1 day ago

Papers

View all activity

Organizations

sugatoray's activity

upvoted an article about 17 hours ago

Article

Fine-tune ModernBERT for RAG with Synthetic Data

and 2 others •

17 days ago

• 33

upvoted 2 papers 1 day ago

Explaining Large Language Models Decisions Using Shapley Values

Paper • 2404.01332 • Published Mar 29, 2024 • 1

Preference Leakage: A Contamination Problem in LLM-as-a-judge

Paper • 2502.01534 • Published 3 days ago • 34

upvoted an article 1 day ago

Article

Open-source DeepResearch – Freeing our search agents

3 days ago

• 637

upvoted a paper 2 days ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published 6 days ago • 88

upvoted 4 collections 2 days ago

upvoted 2 papers 3 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 15 days ago • 301

WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training

Paper • 2501.18511 • Published 7 days ago • 17

upvoted a collection 4 days ago

Reasoning Datasets

Collection

Distilled synthetic Reasoning datasets • 7 items • Updated 4 days ago • 44

upvoted a collection 5 days ago

CritiqueFineTuning

Collection

The dataset and models for CritiqueFineTuning • 4 items • Updated 5 days ago • 1

upvoted a collection 6 days ago

Tulu 3 Models

Collection

All models released with Tulu 3 -- state of the art open post-training recipes. • 10 items • Updated 8 days ago • 86

upvoted a paper 7 days ago

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Paper • 2501.17703 • Published 8 days ago • 50

upvoted 2 articles 8 days ago

Article

Use Models from the Hugging Face Hub in LM Studio

•

Nov 28, 2024

• 136

Article

Open-R1: a fully open reproduction of DeepSeek-R1

10 days ago

• 647

upvoted an article 9 days ago

Article

Welcome to Inference Providers on the Hub 🔥

10 days ago

• 259

upvoted 2 papers 10 days ago

Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass

Paper • 2501.13928 • Published 14 days ago • 16

Autonomy-of-Experts Models

Paper • 2501.13074 • Published 15 days ago • 40