Victor Mustar PRO

victor · victormustar

AI & ML interests

Building the UX of this website

Recent Activity

liked a Space about 2 hours ago

nvidia/radio

upvoted an article about 2 hours ago

Mastering Long Contexts in LLMs with KVPress

replied to their post about 3 hours ago

Hey everyone, we've given the https://hf.co/spaces page a fresh update!

Smart Search: Now just type what you want to do (like "make a viral meme" or "generate music") and our search gets it.

New Categories: Check out the cool new filter bar with icons to help you pick a category fast.

Redesigned Space Cards: Reworked a bit to really show off the app descriptions, so you know what each Space does at a glance.

Random Prompt: Need ideas? Hit the dice button for a burst of inspiration.

We’d love to hear what you think. Drop us some feedback plz!

victor's activity

upvoted an article about 2 hours ago

Article

Mastering Long Contexts in LLMs with KVPress

By

and 1 other •

14 days ago

• 59

upvoted 4 papers 1 day ago

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Paper • 2501.17703 • Published 8 days ago • 50

s1: Simple test-time scaling

Paper • 2501.19393 • Published 6 days ago • 88

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published 3 days ago • 149

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published 3 days ago • 104

upvoted an article 4 days ago

Article

Open-R1: Update #1

By

and 7 others •

5 days ago

• 235

upvoted a paper 6 days ago

GuardReasoner: Towards Reasoning-based LLM Safeguards

Paper • 2501.18492 • Published 7 days ago • 78

upvoted 3 articles 6 days ago

Article

How biased is Whisper ? Evaluating Whisper Models for Robustness to Diverse English Accents

8 days ago

• 13

Article

🅰️ℹ️ 1️⃣0️⃣1️⃣ The Keys to Prompt Optimization

By

and 1 other •

8 days ago

• 4

Article

Anthropic CEO: is DeepSeek-R1 a revolution in AI?

7 days ago

• 5

upvoted a collection 6 days ago

R1 Multilingual

5 items • Updated 6 days ago • 7

upvoted a collection 7 days ago

Tulu 3 Models

All models released with Tulu 3 -- state of the art open post-training recipes. • 10 items • Updated 8 days ago • 86

upvoted a paper 9 days ago

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published 11 days ago • 54

upvoted 2 articles 9 days ago

Article

Welcome to Inference Providers on the Hub 🔥

10 days ago

• 257

Article

Open-R1: a fully open reproduction of DeepSeek-R1

10 days ago

• 646

upvoted a collection 10 days ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 3 items • Updated 10 days ago • 320

upvoted a paper 10 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 15 days ago • 298

upvoted a collection 14 days ago

DeepSeek-R1

8 items • Updated 16 days ago • 416

upvoted an article 19 days ago

Article

Gradio spaces are the perfect agent tools!

20 days ago

• 13

upvoted a paper 20 days ago

OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking

Paper • 2501.09751 • Published 21 days ago • 47