zhangwenbin

ExceedZhang

AI & ML interests

None yet

Organizations

None yet

ExceedZhang's activity

upvoted a paper 3 days ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published 6 days ago • 88

updated a model 4 days ago

ExceedZhang/DeepSeek-R1-Distill-Qwen-14B-W4A16-G128

Updated 4 days ago • 13

upvoted a paper 7 days ago

TradExpert: Revolutionizing Trading with Mixture of Expert LLMs

Paper • 2411.00782 • Published Oct 16, 2024 • 1

upvoted 2 papers 8 days ago

Humanity's Last Exam

Paper • 2501.14249 • Published 14 days ago • 54

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published 12 days ago • 54

upvoted 2 articles 10 days ago

We now support VLMs in smolagents!

Article • 14 days ago • 71

Open-R1: a fully open reproduction of DeepSeek-R1

Article • 10 days ago • 649

liked 2 models 10 days ago

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • Updated about 15 hours ago • 307k • 328

Qwen/Qwen2.5-VL-72B-Instruct

Image-Text-to-Text • Updated 10 days ago • 34.5k • 223

upvoted a paper 14 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 15 days ago • 301

published a model 16 days ago

ExceedZhang/DeepSeek-R1-Distill-Qwen-14B-W4A16-G128

Updated 4 days ago • 13

liked a model 17 days ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Text Generation • Updated 6 days ago • 415k • 901

upvoted a paper 22 days ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 23 days ago • 272

liked a model 22 days ago

microsoft/phi-4

Text Generation • Updated 2 days ago • 474k • 1.68k

upvoted a paper 27 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 29 days ago • 253

liked a model 28 days ago

ICTNLP/llava-mini-llama-3.1-8b

Image-Text-to-Text • Updated 25 days ago • 7.19k • 42

upvoted 2 papers 29 days ago

VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Paper • 2501.01957 • Published Jan 3 • 42

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4 • 90

liked 2 models about 1 month ago

nvidia/Cosmos-1.0-Diffusion-7B-Text2World

Updated 28 days ago • 225k • 199

VITA-MLLM/VITA-1.5

Video-Text-to-Text • Updated 22 days ago • 1.17k • 33