Ganqu Cui

ganqu

cgq15

AI & ML interests

None yet

ganqu's activity

authored a paper 4 days ago

UltraIF: Advancing Instruction Following from the Wild

Paper • 2502.04153 • Published 5 days ago • 20

upvoted a paper 5 days ago

UltraIF: Advancing Instruction Following from the Wild

Paper • 2502.04153 • Published 5 days ago • 20

authored a paper 7 days ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published 8 days ago • 53

upvoted a paper 8 days ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published 8 days ago • 53

liked a model 27 days ago

internlm/internlm3-8b-instruct

Text Generation • Updated about 9 hours ago • 36.2k • 194

published an article about 1 month ago

Process Reinforcement through Implicit Rewards

Article • ganqu and 1 other • Jan 3 • 23

updated a Space about 1 month ago

README

🏃

liked a dataset about 1 month ago

PRIME-RL/Eurus-2-RL-Data

Viewer • Updated 8 days ago • 483k • 747 • 25

liked 2 models about 1 month ago

PRIME-RL/Eurus-2-7B-PRIME

Text Generation • Updated 8 days ago • 1.96k • 59

PRIME-RL/EurusPRM-Stage2

Updated 8 days ago • 787 • 6

updated a model about 1 month ago

PRIME-RL/Eurus-2-7B-PRIME

Text Generation • Updated 8 days ago • 1.96k • 59

authored 3 papers 2 months ago

RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness

Paper • 2405.17220 • Published May 27, 2024 • 1

UltraMedical: Building Specialized Generalists in Biomedicine

Paper • 2406.03949 • Published Jun 6, 2024

Free Process Rewards without Process Labels

Paper • 2412.01981 • Published Dec 2, 2024 • 32

upvoted a paper 2 months ago

Free Process Rewards without Process Labels

Paper • 2412.01981 • Published Dec 2, 2024 • 32

updated a dataset 4 months ago

ganqu/openbackdoor

Preview • Updated Oct 23, 2024 • 126

authored 4 papers 10 months ago

Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models

Paper • 2403.08281 • Published Mar 13, 2024