Checkpoint of Multimodal Variational Auto-encoder based Audio-Visual Segmentation [ICCV2023].