Skills, datasets, etc for DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback
Zaid Khan
codezakh
·
AI & ML interests
None yet
Recent Activity
updated
a dataset
about 16 hours ago
mlfoundations-dev/dpo_from_multiple_samples_shortest_numina_aime
published
a dataset
about 16 hours ago
mlfoundations-dev/dpo_from_multiple_samples_shortest_numina_aime
updated
a dataset
about 23 hours ago
mlfoundations-dev/dpo_from_stratos_judged_annotated_rejected_responses