Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
michaelnguyen11
/
TwinLlama-3.2-3B-DPO
like
0
Text Generation
Transformers
Safetensors
llama
unsloth
trl
dpo
conversational
text-generation-inference
Inference Endpoints
arxiv:
1910.09700
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
TwinLlama-3.2-3B-DPO
Commit History
Trained with Unsloth
aebca92
verified
michaelnguyen11
commited on
Dec 24, 2024
Trained with Unsloth
ffe65db
verified
michaelnguyen11
commited on
Dec 23, 2024
Upload tokenizer
38f6ec2
verified
michaelnguyen11
commited on
Dec 23, 2024
initial commit
9c2d787
verified
michaelnguyen11
commited on
Dec 23, 2024