zhu's picture

3 14 1

zhu

xuekai

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding

upvoted an article 21 days ago

Putting RL back in RLHF

upvoted an article 30 days ago

Process Reinforcement through Implicit Rewards

View all activity

Organizations

Papers 2

arxiv:2412.14689

arxiv:2305.13888

models

None public yet

datasets 1

xuekai/pad_train

Viewer • Updated Mar 21, 2024 • 184k • 12