arxiv:2410.13824
Yuxiao Qu PRO
CohenQu
AI & ML interests
None yet
Recent Activity
updated
a model
about 9 hours ago
CohenQu/DeepSeek-R1-Distill-Qwen-7B-GRPO
published
a model
about 18 hours ago
CohenQu/DeepSeek-R1-Distill-Qwen-7B-GRPO
Organizations
Papers
1
models
26
CohenQu/DeepSeek-R1-Distill-Qwen-7B-GRPO
Updated
CohenQu/implicit_rank_200000
Text Generation
•
Updated
•
7
CohenQu/implicit_rank_100000
Text Generation
•
Updated
•
11
CohenQu/implicit_rank_50000
Text Generation
•
Updated
•
7
CohenQu/implicit_rank_25000
Text Generation
•
Updated
•
5
CohenQu/implicit_color_200000
Text Generation
•
Updated
•
7
CohenQu/implicit_color_100000
Text Generation
•
Updated
•
5
CohenQu/implicit_color_50000
Text Generation
•
Updated
•
5
CohenQu/implicit_color_25000
Text Generation
•
Updated
•
5
CohenQu/implicit_200000
Text Generation
•
Updated
•
2