arxiv:2502.01456
Yuchen Fan
yuchenFan
Recent Activity
authored a paper 2 days ago
Process Reinforcement through Implicit Rewards
upvoted a paper 2 days ago
Process Reinforcement through Implicit Rewards
upvoted a paper 6 days ago
MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding
Organizations
Papers: 1
Models: 6
yuchenFan/1212_ce_qwen_math_part_qwen_llama_full_llama_70b
Feature Extraction • Updated • 4
yuchenFan/no_type_ce_qwen_math_part_qwen_llama_full_llama_70b_latest_ce
Updated
yuchenFan/1214_qwen_dedup_top8_ce_old_new_math_syn_olymiads_beta005_lr5e-7
Text Generation • Updated • 5
yuchenFan/1212_qwen_dedup_top8_old_math_new_math_syn_olympiads
Text Generation • Updated • 4
yuchenFan/no-type-dpo-1127-qwen-original
Text Generation • Updated • 10
yuchenFan/llama-3.1-8b-unfilter-22k-7k-11-26
Text Generation • Updated • 5