![](https://cdn-avatars.huggingface.co/v1/production/uploads/651e96991b97c9f33d26bde6/e4VK7uW5sTeCYupD0s_ob.png)
HuggingFaceTB/SmolVLM-Instruct
Image-Text-to-Text
•
Updated
•
92.8k
•
368
State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct
Generate text responses using images and text prompts