wnma3mz commited on
Commit
5748991
·
verified ·
1 Parent(s): 8b751cb

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +28 -0
README.md ADDED
@@ -0,0 +1,28 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ license_name: deepseek
4
+ license_link: LICENSE
5
+ pipeline_tag: any-to-any
6
+ library_name: transformers
7
+ tags:
8
+ - muiltimodal
9
+ ---
10
+
11
+ This model is derived from https://huggingface.co/deepseek-ai/Janus-Pro-1B and the main modifications are as follows
12
+
13
+ - bin files are updated to safetensors
14
+ - Add chat_template
15
+
16
+ `4bit` refers to quantifying the LLM part to 4 bits.
17
+
18
+ `LM` means that it contains only the language model part.
19
+
20
+ ## Quick Start
21
+
22
+ In Macos (Apple silicon), use [mlx](https://github.com/ml-explore/mlx) framework https://github.com/wnma3mz/tLLM
23
+
24
+ ```bash
25
+ tllm.server --model_path $MODEL_PATH --hostname localhost --is_local --client_size 1
26
+ ```
27
+
28
+ `$MODEL_PATH` like `wnma3mz/Janus-Pro-1B-4bit`