RL4Reasoning

community

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

PeterV09 updated a model 23 days ago

RL4Reasoning/dart-math-prop2diff-v1-1e-5

PeterV09 published a model 23 days ago

RL4Reasoning/dart-math-prop2diff-v1-1e-5

PeterV09 updated a model 27 days ago

RL4Reasoning/dart-math-prop2diff-v1

View all activity

RL4Reasoning's activity

PeterV09

updated a model 23 days ago

RL4Reasoning/dart-math-prop2diff-v1-1e-5

Updated 23 days ago • 5

PeterV09

published a model 23 days ago

RL4Reasoning/dart-math-prop2diff-v1-1e-5

Updated 23 days ago • 5

PeterV09

updated a model 27 days ago

RL4Reasoning/dart-math-prop2diff-v1

Updated 27 days ago • 17 • 1

PeterV09

published a model 27 days ago

RL4Reasoning/dart-math-prop2diff-v1

Updated 27 days ago • 17 • 1

yuzhen17

authored a paper about 1 month ago

B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

Paper • 2412.17256 • Published Dec 23, 2024 • 46

PeterV09

authored 2 papers 7 months ago

Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist

Paper • 2407.08733 • Published Jul 11, 2024 • 21

What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning

Paper • 2312.15685 • Published Dec 25, 2023 • 16

yuzhen17

authored a paper 10 months ago

Compression Represents Intelligence Linearly

Paper • 2404.09937 • Published Apr 15, 2024 • 27

yuzhen17

authored a paper about 1 year ago

C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models

Paper • 2305.08322 • Published May 15, 2023

AI & ML interests

Recent Activity

Team members 2

RL4Reasoning's activity