afrideva commited on
Commit
bfbd6a7
·
1 Parent(s): b6d47dc

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +42 -0
README.md ADDED
@@ -0,0 +1,42 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: Yhyu13/phi-2-sft-dpo-gpt4_en-ep1
3
+ inference: false
4
+ license: other
5
+ license_link: https://huggingface.co/microsoft/phi-2/resolve/main/LICENSE
6
+ license_name: microsoft-research-license
7
+ model_creator: Yhyu13
8
+ model_name: phi-2-sft-dpo-gpt4_en-ep1
9
+ pipeline_tag: text-generation
10
+ quantized_by: afrideva
11
+ tags:
12
+ - gguf
13
+ - ggml
14
+ - quantized
15
+ - q2_k
16
+ - q3_k_m
17
+ - q4_k_m
18
+ - q5_k_m
19
+ - q6_k
20
+ - q8_0
21
+ ---
22
+ # Yhyu13/phi-2-sft-dpo-gpt4_en-ep1-GGUF
23
+
24
+ Quantized GGUF model files for [phi-2-sft-dpo-gpt4_en-ep1](https://huggingface.co/Yhyu13/phi-2-sft-dpo-gpt4_en-ep1) from [Yhyu13](https://huggingface.co/Yhyu13)
25
+
26
+
27
+ | Name | Quant method | Size |
28
+ | ---- | ---- | ---- |
29
+ | [phi-2-sft-dpo-gpt4_en-ep1.fp16.gguf](https://huggingface.co/afrideva/phi-2-sft-dpo-gpt4_en-ep1-GGUF/resolve/main/phi-2-sft-dpo-gpt4_en-ep1.fp16.gguf) | fp16 | 5.56 GB |
30
+ | [phi-2-sft-dpo-gpt4_en-ep1.q2_k.gguf](https://huggingface.co/afrideva/phi-2-sft-dpo-gpt4_en-ep1-GGUF/resolve/main/phi-2-sft-dpo-gpt4_en-ep1.q2_k.gguf) | q2_k | 1.17 GB |
31
+ | [phi-2-sft-dpo-gpt4_en-ep1.q3_k_m.gguf](https://huggingface.co/afrideva/phi-2-sft-dpo-gpt4_en-ep1-GGUF/resolve/main/phi-2-sft-dpo-gpt4_en-ep1.q3_k_m.gguf) | q3_k_m | 1.48 GB |
32
+ | [phi-2-sft-dpo-gpt4_en-ep1.q4_k_m.gguf](https://huggingface.co/afrideva/phi-2-sft-dpo-gpt4_en-ep1-GGUF/resolve/main/phi-2-sft-dpo-gpt4_en-ep1.q4_k_m.gguf) | q4_k_m | 1.79 GB |
33
+ | [phi-2-sft-dpo-gpt4_en-ep1.q5_k_m.gguf](https://huggingface.co/afrideva/phi-2-sft-dpo-gpt4_en-ep1-GGUF/resolve/main/phi-2-sft-dpo-gpt4_en-ep1.q5_k_m.gguf) | q5_k_m | 2.07 GB |
34
+ | [phi-2-sft-dpo-gpt4_en-ep1.q6_k.gguf](https://huggingface.co/afrideva/phi-2-sft-dpo-gpt4_en-ep1-GGUF/resolve/main/phi-2-sft-dpo-gpt4_en-ep1.q6_k.gguf) | q6_k | 2.29 GB |
35
+ | [phi-2-sft-dpo-gpt4_en-ep1.q8_0.gguf](https://huggingface.co/afrideva/phi-2-sft-dpo-gpt4_en-ep1-GGUF/resolve/main/phi-2-sft-dpo-gpt4_en-ep1.q8_0.gguf) | q8_0 | 2.96 GB |
36
+
37
+
38
+
39
+ ## Original Model Card:
40
+ This is the merged model for LoRA https://huggingface.co/Yhyu13/phi-2-sft-dpo-gpt4_en-ep1-lora
41
+
42
+ This model is a dpo improvement to this base model https://huggingface.co/Yhyu13/phi-2-sft-alpaca_gpt4_en-ep1 who achieve better than text-davinci-003 on AlpcaEval judged by ChatGPT.