---
license: apache-2.0
datasets:
- Intel/orca_dpo_pairs
language:
- en
- es
---

# Barcenas Tiny 1.1b DPO

It is a model based on the famous TinyLlama/TinyLlama-1.1B-Chat-v1.0, trained with DPO (Direct Preference Optimization) using the Intel/orca_dpo_pairs dataset.
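
The card does not include the training script, but a minimal sketch of DPO fine-tuning with the TRL library might look like the following. The hyperparameters, output path, and the `to_dpo_format` helper are assumptions for illustration, not the author's actual code:

```python
def to_dpo_format(row):
    """Map an Intel/orca_dpo_pairs row (system, question, chosen, rejected)
    into the (prompt, chosen, rejected) triple that DPO training expects."""
    prompt = f"{row['system']}\n{row['question']}".strip()
    return {"prompt": prompt, "chosen": row["chosen"], "rejected": row["rejected"]}

def train():
    # Heavy dependencies are imported here; requires
    # `pip install trl datasets transformers`. Call train() to launch.
    from datasets import load_dataset
    from transformers import AutoModelForCausalLM, AutoTokenizer
    from trl import DPOConfig, DPOTrainer

    base = "TinyLlama/TinyLlama-1.1B-Chat-v1.0"
    tokenizer = AutoTokenizer.from_pretrained(base)
    model = AutoModelForCausalLM.from_pretrained(base)

    dataset = load_dataset("Intel/orca_dpo_pairs", split="train").map(to_dpo_format)

    args = DPOConfig(
        output_dir="barcenas-tiny-1.1b-dpo",  # assumed path
        beta=0.1,                             # assumed DPO temperature
        per_device_train_batch_size=4,
        num_train_epochs=1,
    )
    trainer = DPOTrainer(
        model=model,
        args=args,
        train_dataset=dataset,
        processing_class=tokenizer,  # older TRL versions call this `tokenizer`
    )
    trainer.train()
```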

With this preference-based reinforcement training we hope to substantially improve the Tiny model: better responses in a small package that remains accessible to most people.

Many thanks to Maxime Labonne (mlabonne) for his tutorial on how to train an LLM using DPO; without his tutorial this model would not have been possible.

Made with ❤️ in Guadalupe, Nuevo Leon, Mexico 🇲🇽
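
As a usage sketch, the model could be loaded with the standard `transformers` text-generation pipeline. TinyLlama-1.1B-Chat uses Zephyr-style chat tags, rendered manually below; `format_chat` is a hypothetical helper and the repo id is a placeholder, since the card does not state it:

```python
def format_chat(messages):
    """Hypothetical helper: render a chat in the Zephyr-style format used by
    TinyLlama-1.1B-Chat (<|system|>/<|user|>/<|assistant|> tags)."""
    text = ""
    for m in messages:
        text += f"<|{m['role']}|>\n{m['content']}</s>\n"
    return text + "<|assistant|>\n"

def generate(model_id, messages, max_new_tokens=256):
    # Requires `pip install transformers torch`. model_id should be this
    # model's Hugging Face repo id (placeholder, not stated in the card).
    from transformers import pipeline
    pipe = pipeline("text-generation", model=model_id)
    prompt = format_chat(messages)
    return pipe(prompt, max_new_tokens=max_new_tokens)[0]["generated_text"]
```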