Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Misc
Reset Misc
multimodal
Inference Endpoints
text-generation-inference
AutoTrain Compatible
custom_code
4-bit precision
Merge
Eval Results
8-bit precision
Mixture of Experts
Misc with no match
text-embeddings-inference
Carbon Emissions
Apply filters
Models
518
Full-text search
Edit filters
Sort: Trending
Active filters:
multimodal
Clear all
erax-ai/EraX-VL-7B-V2.0-Preview
Visual Question Answering
•
Updated
16 days ago
•
593
•
18
OpenGVLab/VideoChat-Flash-Qwen2_5-2B_res448
Video-Text-to-Text
•
Updated
17 days ago
•
1.81k
•
11
OpenGVLab/VideoChat-Flash-Qwen2-7B_res224
Video-Text-to-Text
•
Updated
17 days ago
•
901
•
3
OpenGVLab/VideoChat-Flash-Qwen2-7B_res448
Video-Text-to-Text
•
Updated
17 days ago
•
978
•
7
osunlp/UGround-V1-72B
Image-Text-to-Text
•
Updated
15 days ago
•
328
•
3
Minthy/ToriiGate-v0.4-2B
Image-Text-to-Text
•
Updated
19 days ago
•
164
•
6
bytedance-research/UI-TARS-2B-SFT
Image-Text-to-Text
•
Updated
12 days ago
•
7.69k
•
13
bytedance-research/UI-TARS-72B-SFT
Image-Text-to-Text
•
Updated
12 days ago
•
290
•
10
mradermacher/UI-TARS-2B-SFT-i1-GGUF
Updated
16 days ago
•
616
•
1
mradermacher/UI-TARS-7B-DPO-GGUF
Updated
16 days ago
•
3.84k
•
7
mradermacher/UI-TARS-7B-DPO-i1-GGUF
Updated
16 days ago
•
1.75k
•
3
OpenGVLab/InternVL_2_5_HiCo_R16
Video-Text-to-Text
•
Updated
14 days ago
•
109
•
2
bartowski/UI-TARS-72B-DPO-GGUF
Image-Text-to-Text
•
Updated
14 days ago
•
1.56k
•
2
bartowski/UI-TARS-7B-DPO-GGUF
Image-Text-to-Text
•
Updated
14 days ago
•
2.85k
•
3
lmstudio-community/UI-TARS-2B-SFT-GGUF
Image-Text-to-Text
•
Updated
14 days ago
•
597
•
2
bartowski/UI-TARS-2B-SFT-GGUF
Image-Text-to-Text
•
Updated
14 days ago
•
1.23k
•
1
bartowski/UI-TARS-7B-SFT-GGUF
Image-Text-to-Text
•
Updated
14 days ago
•
1.53k
•
1
lmstudio-community/UI-TARS-72B-DPO-GGUF
Image-Text-to-Text
•
Updated
14 days ago
•
1.53k
•
1
3ib0n/Qwen2-VL-2B-rkllm
Image-Text-to-Text
•
Updated
14 days ago
•
1
vincentamato/ARIA
Updated
13 days ago
•
1
Sci-fi-vy/Llama-3.2-11B-Vision-Instruct-finetuned
Image-Text-to-Text
•
Updated
11 days ago
•
6
•
1
mlx-community/Qwen2.5-VL-3B-Instruct-4bit
Image-Text-to-Text
•
Updated
8 days ago
•
270
•
1
mlx-community/Qwen2.5-VL-3B-Instruct-8bit
Image-Text-to-Text
•
Updated
8 days ago
•
254
•
4
mlx-community/Qwen2.5-VL-7B-Instruct-3bit
Image-Text-to-Text
•
Updated
8 days ago
•
66
•
1
mlx-community/Qwen2.5-VL-7B-Instruct-bf16
Image-Text-to-Text
•
Updated
8 days ago
•
294
•
2
mlx-community/Qwen2.5-VL-72B-Instruct-4bit
Image-Text-to-Text
•
Updated
8 days ago
•
302
•
2
jarvisvasu/Qwen2.5-VL-3B-Instruct-4bit
Image-Text-to-Text
•
Updated
8 days ago
•
198
•
2
mlx-community/Qwen2.5-VL-72B-Instruct-3bit
Image-Text-to-Text
•
Updated
8 days ago
•
102
•
2
mlx-community/Qwen2.5-VL-72B-Instruct-6bit
Image-Text-to-Text
•
Updated
8 days ago
•
62
•
1
unsloth/Qwen2.5-VL-7B-Instruct
Image-Text-to-Text
•
Updated
6 days ago
•
607
•
1
Previous
1
...
3
4
5
6
7
...
18
Next