Commit 6012fb6
Parent(s): 53f2ad3
Update README.md

README.md
CHANGED

We evaluated model_51 on a wide range of tasks using [Language Model Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness).

Here are the results on metrics used by [HuggingFaceH4 Open LLM Leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)

|**Task**|**Value**|
|:------:|:--------:|
|*ARC*|0.6843|
|*HellaSwag*|0.8671|
|*MMLU*|0.6931|
|*TruthfulQA*|0.5718|
|*Winogrande*|0.8177|
|*GSM8K*|0.3237|
|*DROP*|0.5843|
|**Total Average**|**0.6488**|
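
As a rough guide to reproducing one of these numbers, here is a minimal sketch using a recent `lm-eval` release. The `simple_evaluate` call and the 25-shot ARC setting are assumptions based on current harness releases, not details from this card.

```python
# Minimal sketch: score one leaderboard task with lm-evaluation-harness.
# Assumes a recent lm-eval release; the 25-shot ARC setting mirrors the
# Open LLM Leaderboard convention and is an assumption, not from this card.
from lm_eval import simple_evaluate

results = simple_evaluate(
    model="hf",
    model_args="pretrained=pankajmathur/model_51,dtype=float16",
    tasks=["arc_challenge"],
    num_fewshot=25,
)
print(results["results"]["arc_challenge"])
```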

### Prompt Format

```
### System:
…
Tell me about Orcas.

```
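
For clarity, here is a small sketch that assembles a prompt in this format. Only `### System:` and the example question are shown verbatim above; the `### User:`/`### Assistant:` labels and the system text are assumptions about the elided middle of the block.

```python
# Sketch: build a prompt in the format above. The "### User:" and
# "### Assistant:" labels and the system text are assumed; only
# "### System:" and the example question appear verbatim in this card.
system = "You are a helpful assistant."   # assumed system message
instruction = "Tell me about Orcas."      # example question from the card

prompt = f"### System:\n{system}\n\n### User:\n{instruction}\n\n### Assistant:\n"
print(prompt)
```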

#### OobaBooga Instructions:

This model requires up to 45GB of GPU VRAM in 4-bit, so it can be loaded directly on a single RTX 6000/L40/A40/A100/H100 GPU, or on two RTX 4090/L4/A10/RTX 3090/RTX A5000 GPUs.
So, if you have access to a machine with 45GB of GPU VRAM and have the [OobaBooga Web UI](https://github.com/oobabooga/text-generation-webui) installed on it, you can download this model by using the HF repo link directly on the OobaBooga Web UI "Model" tab/page and selecting the **load-in-4bit** option there.

![model_load_screenshot](https://huggingface.co/pankajmathur/model_101/resolve/main/oobabooga_model_load_screenshot.png)

After that, go to the Default tab/page on the OobaBooga Web UI, **copy-paste the above prompt format into Input**, and enjoy!

![default_input_screenshot](https://huggingface.co/pankajmathur/model_101/resolve/main/default_input_screenshot.png)

<br>

#### Code Instructions:

Below is a code example showing how to use this model:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline

# Load the tokenizer and the model; 4-bit quantization plus device_map="auto"
# spreads the weights across the available GPU(s).
tokenizer = AutoTokenizer.from_pretrained("pankajmathur/model_51")
model = AutoModelForCausalLM.from_pretrained(
    "pankajmathur/model_51",
    torch_dtype=torch.float16,
    load_in_4bit=True,
    low_cpu_mem_usage=True,
    device_map="auto"
)
```
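
The snippet above stops after loading the model and never uses the `pipeline` import. A minimal generation sketch continuing from it could look like this; the prompt text and sampling settings are illustrative assumptions, not from this card.

```python
# Sketch: generate with the model and tokenizer loaded above, using the
# pipeline import from the snippet. Prompt and sampling settings are
# illustrative assumptions.
generator = pipeline("text-generation", model=model, tokenizer=tokenizer)

prompt = (
    "### System:\nYou are a helpful assistant.\n\n"  # assumed system message
    "### User:\nTell me about Orcas.\n\n"
    "### Assistant:\n"
)

output = generator(prompt, max_new_tokens=256, do_sample=True, top_p=0.9)
print(output[0]["generated_text"])
```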