Locutusque
/

UltraQwen-7B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Locutusque commited on Jan 21, 2024

Commit

dc8cc0f

·

verified ·

1 Parent(s): 3f5d110

Create README.md

Files changed (1) hide show

README.md +23 -0

README.md ADDED Viewed

	@@ -0,0 +1,23 @@

+---
+datasets:
+- HuggingFaceH4/ultrachat_200k
+language:
+- en
+license: other
+base_model: Qwen/Qwen-7B
+---
+# Model description
+The model was trained on about 100,000 examples of the HuggingFaceH4/ultrachat_200k dataset, with plans to release more checkpoints later on.
+This model has not been aligned with DPO. In the future, different repositories will be released that contain versions of this model aligned with DPO, using various datasets.
+# Evaluation
+Upon personal testing, the model demonstrates excellent performance in mathematics, history, and coding tasks. This model will also be submitted to the Open LLM Leaderboard.
+# Recommended inference parameters
+temperature=0.2, top_p=0.14, top_k=12, repetition_penalty=1.1
+# License
+Please make sure to read the Qwen licensing agreement before using this model.