HuggingFaceTB/SmolVLM-Instruct

Reduce memory for inference

#28

by ilikeprivacy - opened 2 days ago

base: refs/heads/main ← from: refs/pr/28

2 days ago

Remove gradient calculation

Reduce memory for inference (7aeef19d)

Xenova

Hugging Face TB Research org 2 days ago

Hi there! .generate already uses torch.no_grad, so this is not necessary. See https://github.com/huggingface/transformers/blob/bc9a6d8302334ae08d505437ab3f361af777956c/src/transformers/generation/utils.py#L1879 for more information.
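
To illustrate the point above, here is a minimal sketch that uses only PyTorch. `generate_like` is a hypothetical stand-in for `.generate`, not the transformers implementation; it assumes the function is wrapped in torch.no_grad in decorator form. Adding an outer `with torch.no_grad():` block around such a call changes nothing, because gradient tracking is already disabled inside it.

```python
import torch


# Hypothetical stand-in for a model's `.generate` method (an assumption for
# illustration): the decorator disables gradient tracking for the whole call.
@torch.no_grad()
def generate_like(weight: torch.Tensor, inputs: torch.Tensor) -> torch.Tensor:
    return inputs @ weight


# A parameter-like tensor that would normally record operations for autograd.
weight = torch.randn(4, 4, requires_grad=True)
inputs = torch.randn(2, 4)

# Call without any outer context manager.
plain = generate_like(weight, inputs)

# Call with the extra wrapper that the pull request proposed adding.
with torch.no_grad():
    wrapped = generate_like(weight, inputs)

# The same computation outside no_grad, for comparison: this one is tracked.
tracked = inputs @ weight

print(plain.requires_grad, wrapped.requires_grad, tracked.requires_grad)
```

In both calls to `generate_like` the result carries no autograd graph, so the extra wrapper saves no additional memory. Only the computation done outside of torch.no_grad is tracked.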

Xenova changed pull request status to closed 2 days ago
