Vlm - a ZhitongGao Collection

Vlm

updated about 19 hours ago

MM-IQ: Benchmarking Human-Like Abstraction and Reasoning in Multimodal Models

Paper • 2502.00698 • Published 4 days ago • 21

DeepRAG: Thinking to Retrieval Step by Step for Large Language Models

Paper • 2502.01142 • Published 3 days ago • 15

ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning

Paper • 2502.01100 • Published 3 days ago • 12

The Jumping Reasoning Curve? Tracking the Evolution of Reasoning Performance in GPT-[n] and o-[n] Models on Multimodal Puzzles

Paper • 2502.01081 • Published 3 days ago • 9

Improved Training Technique for Latent Consistency Models

Paper • 2502.01441 • Published 3 days ago • 7

PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models

Paper • 2502.01584 • Published 3 days ago • 7

COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for Fine-Grained Understanding and Generation

Paper • 2502.02589 • Published 2 days ago • 7