zhu xuekai
0 followers · 1 following
AI & ML interests
None yet
Recent Activity
upvoted a paper, 6 days ago
MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding
upvoted an article, 21 days ago
Putting RL back in RLHF
upvoted an article, 30 days ago
Process Reinforcement through Implicit Rewards
Organizations
Papers (2)
arxiv:2412.14689
arxiv:2305.13888
models
None public yet
datasets (1)
xuekai/pad_train • Updated Mar 21, 2024 • 184k • 12