Hugging Face
Models: 490
Sort: Trending
Active filters: DPO
Nagi-ovo/Llama-3-8B-DPO • Text Generation • Updated Jan 6 • 23
Novaciano/TinyLlama-1b_DPO_Roleplay_NSFW-GGUF • Updated Jan 2 • 111
tensorblock/Hermes-2-Theta-Llama-3-8B-32k-GGUF • Updated Jan 2 • 66
mradermacher/Llama3-OpenBioLLM-8B-GGUF • Updated Jan 5 • 74
mradermacher/Llama3-OpenBioLLM-8B-i1-GGUF • Updated Jan 5 • 414
MilyaShams/SmolLM2-DPO-FT-smoltalk • Text Generation • Updated about 1 month ago • 6
MilyaShams/SmolLM2-DPO-FT-Instruct • Text Generation • Updated about 1 month ago • 8
Avibhi/Gemma2-2B-HindiTranslation-DPO • Updated 27 days ago
JHuel/Mistral-Nemo-Instruct-2407_DPO_qlora • Reinforcement Learning • Updated 15 days ago
govindrhf/aaditya-Llama3-OpenBioLLM-70B • Updated 2 days ago