LI WENTONG's picture

2 4 2

LI WENTONG

sunshine-lwt

·

https://cslwt.github.io/

AI & ML interests

Computer Vision, Multimodal AI

Recent Activity

authored a paper about 1 month ago

VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM

upvoted a paper about 1 month ago

VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM

liked a model 7 months ago

meta-llama/Llama-3.1-8B-Instruct

View all activity

Organizations

sunshine-lwt's activity

authored a paper about 1 month ago

VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM

Paper • 2501.00599 • Published Dec 31, 2024 • 41

upvoted a paper about 1 month ago

VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM

Paper • 2501.00599 • Published Dec 31, 2024 • 41

liked a model 7 months ago

meta-llama/Llama-3.1-8B-Instruct

Text Generation • Updated Sep 25, 2024 • 5.78M • • 3.58k

updated 7 models 7 months ago

sunshine-lwt/TokenPacker-HD-13b-16patch-36token

Text Generation • Updated Jul 26, 2024 • 2

sunshine-lwt/TokenPacker-HD-13b-16patch-64token

Text Generation • Updated Jul 25, 2024 • 3

sunshine-lwt/TokenPacker-HD-13b-16patch-144token

Text Generation • Updated Jul 25, 2024 • 6

sunshine-lwt/TokenPacker-HD-13b-9patch-144token

Text Generation • Updated Jul 25, 2024 • 2

sunshine-lwt/TokenPacker-HD-7b-9patch-144token

Text Generation • Updated Jul 25, 2024 • 6 • 1

sunshine-lwt/TokenPacker-13b-144token

Text Generation • Updated Jul 25, 2024 • 4

sunshine-lwt/TokenPacker-7b-144token

Text Generation • Updated Jul 25, 2024 • 7

updated a collection 7 months ago

TokenPacker

Official model collection for the paper "TokenPacker: Efficient Visual Projector for Multimodal LLM" • 7 items • Updated Jul 25, 2024 • 4

upvoted a collection 7 months ago

TokenPacker

Official model collection for the paper "TokenPacker: Efficient Visual Projector for Multimodal LLM" • 7 items • Updated Jul 25, 2024 • 4

updated a collection 7 months ago

TokenPacker

Official model collection for the paper "TokenPacker: Efficient Visual Projector for Multimodal LLM" • 7 items • Updated Jul 25, 2024 • 4