mlfoundations-dev/dpo_from_multiple_samples_shortest_numina_aime Text Generation • Updated about 12 hours ago • 2
mlfoundations-dev/dpo_from_stratos_judged_annotated_rejected_responses Text Generation • Updated about 18 hours ago • 6
mlfoundations-dev/multiple_samples_majority_consensus_pick_one_numina_aime_math_verify Text Generation • Updated 1 day ago • 6
mlfoundations-dev/multiple_samples_majority_consensus_numina_aime_math_verify Text Generation • Updated 1 day ago • 2