metadata
license: apache-2.0
datasets:
- Tuwhy/MIS_Train
base_model:
- Qwen/Qwen2-VL-7B-Instruct
pipeline_tag: image-text-to-text
tags:
- safety
- fine-tuning
- multi-image
- mllm
Rethinking Bottlenecks in Safety Fine-Tuning of Vision Language Models
Our paper, code, data, models can be found at MIS.
Description
Qwen2-VL-7B-Instruct model fine-tuned on MIS training set.
MIRgae
Here is example pipeline of MIS training set and MIRage safety CoT label construction.
You can fine-tune Qwen2-VL series using LlamaFactory.