lmsys/vicuna-7b-v1.1

Text Generation · Transformers · PyTorch · llama · text-generation-inference


lmzheng committed on Jul 13, 2023

Commit f238a00

1 Parent(s): dae87ac

Update README.md


Files changed (1)

README.md +12 -65


@@ -1,64 +1,13 @@
 ---
-license: other
 ---
-<!-- header start -->
-<div style="width: 100%;">
-    <img src="https://i.imgur.com/EBdldam.jpg" alt="TheBlokeAI" style="width: 100%; min-width: 400px; display: block; margin: auto;">
-</div>
-<div style="display: flex; justify-content: space-between; width: 100%;">
-    <div style="display: flex; flex-direction: column; align-items: flex-start;">
-        <p><a href="https://discord.gg/Jq4vkcDakD">Chat & support: my new Discord server</a></p>
-    </div>
-    <div style="display: flex; flex-direction: column; align-items: flex-end;">
-        <p><a href="https://www.patreon.com/TheBlokeAI">Want to contribute? TheBloke's Patreon page</a></p>
-    </div>
-</div>
-<!-- header end -->
-# Vicuna 7B 1.1 HF
-This is an HF version of the [Vicuna 7B 1.1 model](https://huggingface.co/lmsys/vicuna-7b-delta-v1.1).
-It was created by merging the deltas provided in the above repo with the original Llama 7B model, [using the code provided on their Github page](https://github.com/lm-sys/FastChat#vicuna-weights).
-## My Vicuna 1.1 model repositories
-I have the following Vicuna 1.1 repositories available:
-**13B models:**
-* [Unquantized 13B 1.1 model for GPU - HF format](https://huggingface.co/TheBloke/vicuna-13B-1.1-HF)
-* [GPTQ quantized 4bit 13B 1.1 for GPU - `safetensors` and `pt` formats](https://huggingface.co/TheBloke/vicuna-13B-1.1-GPTQ-4bit-128g)
-* [2, 3, 4, 5, 6 and 8-bit GGML models for CPU inference](https://huggingface.co/TheBloke/vicuna-13B-1.1-GGML)
-**7B models:**
-* [Unquantized 7B 1.1 model for GPU - HF format](https://huggingface.co/TheBloke/vicuna-7B-1.1-HF)
-* [GPTQ quantized 4bit 7B 1.1 for GPU - `safetensors` and `pt` formats](https://huggingface.co/TheBloke/vicuna-7B-1.1-GPTQ-4bit-128g)
-* [2, 3, 4, 5, 6 and 8-bit GGML models for CPU inference](https://huggingface.co/TheBloke/vicuna-7B-1.1-GGML)
-<!-- footer start -->
-## Discord
-For further support, and discussions on these models and AI in general, join us at:
-[TheBloke AI's Discord server](https://discord.gg/Jq4vkcDakD)
-## Thanks, and how to contribute.
-Thanks to the [chirper.ai](https://chirper.ai) team!
-I've had a lot of people ask if they can contribute. I enjoy providing models and helping people, and would love to be able to spend even more time doing it, as well as expanding into new projects like fine tuning/training.
-If you're able and willing to contribute it will be most gratefully received and will help me to keep providing more models, and to start work on new AI projects.
-Donaters will get priority support on any and all AI/LLM/model questions and requests, access to a private Discord room, plus other benefits.
-* Patreon: https://patreon.com/TheBlokeAI
-* Ko-Fi: https://ko-fi.com/TheBlokeAI
-**Patreon special mentions**: Aemon Algiz, Dmitriy Samsonov, Nathan LeClaire, Trenton Dambrowitz, Mano Prime, David Flickinger, vamX, Nikolai Manek, senxiiz, Khalefa Al-Ahmad, Illia Dulskyi, Jonathan Leane, Talal Aujan, V. Lukas, Joseph William Delisle, Pyrater, Oscar Rangel, Lone Striker, Luke Pendergrass, Eugene Pentland, Sebastain Graf, Johann-Peter Hartman.
-Thank you to all my generous patrons and donaters!
-<!-- footer end -->
 # Vicuna Model Card
@@ -75,14 +24,12 @@ Vicuna was trained between March 2023 and April 2023.
 The Vicuna team with members from UC Berkeley, CMU, Stanford, and UC San Diego.
 **Paper or resources for more information:**
-https://vicuna.lmsys.org/
-**License:**
-Apache License 2.0
 **Where to send questions or comments about the model:**
 https://github.com/lm-sys/FastChat/issues
 ## Intended use
 **Primary intended uses:**
 The primary use of Vicuna is research on large language models and chatbots.
@@ -94,8 +41,8 @@ The primary intended users of the model are researchers and hobbyists in natural
 70K conversations collected from ShareGPT.com.
 ## Evaluation dataset
-A preliminary evaluation of the model quality is conducted by creating a set of 80 diverse questions and utilizing GPT-4 to judge the model outputs. See https://vicuna.lmsys.org/ for more details.
-## Major updates of weights v1.1
-- Refactor the tokenization and separator. In Vicuna v1.1, the separator has been changed from `"###"` to the EOS token `"</s>"`. This change makes it easier to determine the generation stop criteria and enables better compatibility with other libraries.
-- Fix the supervised fine-tuning loss computation for better model quality.

 ---
+inference: false
 ---
+**NOTE: New version available**
+Please check out a newer version of the weights [here](https://huggingface.co/lmsys/vicuna-7b-v1.3).
+If you still want to use this old version, please see the compatibility and difference between different versions [here](https://github.com/lm-sys/FastChat/blob/main/docs/vicuna_weights_version.md).
+<br>
+<br>
 # Vicuna Model Card
 The Vicuna team with members from UC Berkeley, CMU, Stanford, and UC San Diego.
 **Paper or resources for more information:**
+https://lmsys.org/blog/2023-03-30-vicuna/
 **Where to send questions or comments about the model:**
 https://github.com/lm-sys/FastChat/issues
 ## Intended use
 **Primary intended uses:**
 The primary use of Vicuna is research on large language models and chatbots.
 70K conversations collected from ShareGPT.com.
 ## Evaluation dataset
+A preliminary evaluation of the model quality is conducted by creating a set of 80 diverse questions and utilizing GPT-4 to judge the model outputs.
+See https://lmsys.org/blog/2023-03-30-vicuna/ for more details.
+## Acknowledgement
+Special thanks to [@TheBloke](https://huggingface.co/TheBloke) for hosting this merged version of weights earlier.