RLHFlow

university

AI & ML interests

Workflow of Reinforcement Learning from Human Feedback (RLHF). Blog: https://rlhflow.github.io/

Recent Activity

Min-Li updated a collection 1 day ago: Decision-Tree Reward Models
Min-Li updated a dataset 1 day ago: RLHFlow/LLM-Preferences-HelpSteer2
Min-Li published a dataset 1 day ago: RLHFlow/LLM-Preferences-HelpSteer2
