Edit Models filters

Misc

Inference Endpoints

AutoTrain Compatible

text-generation-inference

4-bit precision

Mixture of Experts

8-bit precision

Misc with no match

text-embeddings-inference

Carbon Emissions

Models

490

Full-text search

Active filters: DPO

Nagi-ovo/Llama-3-8B-DPO

Text Generation • Updated Jan 6 • 23

Novaciano/TinyLlama-1b_DPO_Roleplay_NSFW-GGUF

Updated Jan 2 • 111

tensorblock/Hermes-2-Theta-Llama-3-8B-32k-GGUF

Updated Jan 2 • 66

mradermacher/Llama3-OpenBioLLM-8B-GGUF

Updated Jan 5 • 74

mradermacher/Llama3-OpenBioLLM-8B-i1-GGUF

Updated Jan 5 • 414

MilyaShams/SmolLM2-DPO-FT-smoltalk

Text Generation • Updated about 1 month ago • 6

MilyaShams/SmolLM2-DPO-FT-Instruct

Text Generation • Updated about 1 month ago • 8

Avibhi/Gemma2-2B-HindiTranslation-DPO

Updated 27 days ago

JHuel/Mistral-Nemo-Instruct-2407_DPO_qlora

Reinforcement Learning • Updated 15 days ago

govindrhf/aaditya-Llama3-OpenBioLLM-70B

Updated 2 days ago