LI WENTONG's picture

2 4 2

LI WENTONG

sunshine-lwt

·

https://cslwt.github.io/

AI & ML interests

Computer Vision, Multimodal AI

Recent Activity

authored a paper about 1 month ago

VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM

upvoted a paper about 1 month ago

VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM

liked a model 7 months ago

meta-llama/Llama-3.1-8B-Instruct

View all activity

Organizations

sunshine-lwt's activity

upvoted a paper about 1 month ago

VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM

Paper • 2501.00599 • Published Dec 31, 2024 • 41

upvoted a collection 7 months ago

TokenPacker

Official model collection for the paper "TokenPacker: Efficient Visual Projector for Multimodal LLM" • 7 items • Updated Jul 25, 2024 • 4

upvoted a paper 7 months ago

TokenPacker: Efficient Visual Projector for Multimodal LLM

Paper • 2407.02392 • Published Jul 2, 2024 • 21

upvoted a paper about 1 year ago

Osprey: Pixel Understanding with Visual Instruction Tuning

Paper • 2312.10032 • Published Dec 15, 2023 • 3