Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
nbeerbower
's Collections
abliteration loras
DPO
bruphin
flammen
llama 3 experiments
Nemo
DPO
updated
19 days ago
Various useful datasets with preference optimization
Upvote
3
jondurbin/gutenberg-dpo-v0.1
Viewer
•
Updated
Jan 12, 2024
•
918
•
1.37k
•
132
nbeerbower/gutenberg2-dpo
Viewer
•
Updated
Nov 16, 2024
•
293
•
117
•
19
jondurbin/truthy-dpo-v0.1
Viewer
•
Updated
Jan 11, 2024
•
1.02k
•
453
•
132
kyujinpy/orca_math_dpo
Viewer
•
Updated
Apr 12, 2024
•
15.3k
•
83
•
18
antiven0m/physical-reasoning-dpo
Viewer
•
Updated
Mar 23, 2024
•
899
•
110
•
10
flammenai/MahouMix-v1
Viewer
•
Updated
May 30, 2024
•
267
•
58
•
4
flammenai/Date-DPO-NoAsterisks
Viewer
•
Updated
Sep 18, 2024
•
330
•
60
•
4
nbeerbower/Arkhaios-DPO
Viewer
•
Updated
Nov 12, 2024
•
222
•
159
•
8
nbeerbower/Purpura-DPO
Viewer
•
Updated
Nov 12, 2024
•
230
•
121
•
7
nbeerbower/Schule-DPO
Viewer
•
Updated
Nov 16, 2024
•
34
•
114
•
1
HumanLLMs/Human-Like-DPO-Dataset
Viewer
•
Updated
30 days ago
•
10.9k
•
3.26k
•
192
nbeerbower/gutenberg-moderne-dpo
Viewer
•
Updated
Nov 17, 2024
•
346
•
123
•
2
nbeerbower/reddit-dpo
Viewer
•
Updated
10 days ago
•
76.9k
•
190
•
1
Atsunori/HelpSteer2-DPO
Viewer
•
Updated
Jul 11, 2024
•
7.59k
•
114
•
6
abacusai/MetaMath_DPO_FewShot
Viewer
•
Updated
Feb 26, 2024
•
395k
•
178
•
26
nbeerbower/GreatFirewall-DPO
Viewer
•
Updated
20 days ago
•
492
•
159
•
4
Upvote
3
Share collection
View history
Collection guide
Browse collections