poterliu

AI & ML interests

None yet

Recent Activity

upvoted a collection 7 days ago

Deepseek Papers

liked a Space 8 days ago

Qwen/Qwen2.5-Max-Demo

liked a Space 10 days ago

akhaliq/anychat

Organizations

None yet

poterliu's activity

upvoted a collection 7 days ago

Deepseek Papers

Deepseek papers collection • 15 items • Updated 2 days ago • 46

liked a Space 8 days ago

Qwen2.5 Max Demo

Send messages for chatbot responses

liked a Space 10 days ago

Anychat

upvoted a collection 10 days ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 3 items • Updated 11 days ago • 322

liked 2 models 10 days ago

tencent/Hunyuan3D-2

Image-to-3D • Updated 4 days ago • 41k • 799

deepseek-ai/Janus-Pro-7B

Any-to-Any • Updated 5 days ago • 223k • 2.67k

upvoted a collection 11 days ago

DeepSeek-R1

8 items • Updated 17 days ago • 422

reacted to lewtun's post with 🤗 11 days ago

Post • 9886

We are reproducing the full DeepSeek R1 data and training pipeline so everybody can use their recipe. Instead of doing it in secret we can do it together in the open!

🧪 Step 1: replicate the R1-Distill models by distilling a high-quality reasoning corpus from DeepSeek-R1.

🧠 Step 2: replicate the pure RL pipeline that DeepSeek used to create R1-Zero. This will involve curating new, large-scale datasets for math, reasoning, and code.

🔥 Step 3: show we can go from base model -> SFT -> RL via multi-stage training.

Follow along: https://github.com/huggingface/open-r1

5 replies

liked a model 11 days ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-7B

Text Generation • Updated 6 days ago • 331k • 352

reacted to onekq's post with 👍 11 days ago

Post • 2271

So 🐋DeepSeek🐋 hits the mainstream media. But it has been a star in our little cult for at least 6 months. Its meteoric success is not overnight, but two years in the making.

To learn their history, just look at their 🤗 repo https://huggingface.co/deepseek-ai

* End of 2023, they launched their first model (pretrained by themselves), following the Llama 2 architecture
* June 2024, v2 (MoE architecture) surpassed Gemini 1.5, but was behind Mistral
* September, v2.5 surpassed GPT 4o mini
* December, v3 surpassed GPT 4o
* Now R1 surpassed o1

Most importantly, if you think DeepSeek's success is singular and unrivaled, that's WRONG. The following models are also near or equal to the o1 bar.

* Minimax-01
* Kimi k1.5
* Doubao 1.5 pro

1 reply

upvoted a paper 11 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 15 days ago • 301

liked a Space 11 days ago

Hunyuan3D-2.0

Text-to-3D and Image-to-3D Generation

liked a dataset 11 days ago

cais/hle

Viewer • Updated 14 days ago • 3k • 3.13k • 180

liked 3 models 17 days ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Text Generation • Updated 6 days ago • 415k • 901

deepseek-ai/DeepSeek-R1-Zero

Text Generation • Updated 6 days ago • 25.7k • 725

deepseek-ai/DeepSeek-R1

Text Generation • Updated 6 days ago • 1.54M • 7.27k

liked 2 models 22 days ago

MiniMaxAI/MiniMax-VL-01

Image-Text-to-Text • Updated 12 days ago • 2.14k • 229

MiniMaxAI/MiniMax-Text-01

Text Generation • Updated 21 days ago • 6.48k • 501

upvoted a paper 22 days ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 23 days ago • 272

liked a model about 1 month ago

Qwen/Qwen2.5-72B

Text Generation • Updated Sep 25, 2024 • 28.3k • 57