-
MM-IQ: Benchmarking Human-Like Abstraction and Reasoning in Multimodal Models
Paper • 2502.00698 • Published • 21 -
DeepRAG: Thinking to Retrieval Step by Step for Large Language Models
Paper • 2502.01142 • Published • 15 -
ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning
Paper • 2502.01100 • Published • 12 -
The Jumping Reasoning Curve? Tracking the Evolution of Reasoning Performance in GPT-[n] and o-[n] Models on Multimodal Puzzles
Paper • 2502.01081 • Published • 9
Zhitong Gao
ZhitongGao
AI & ML interests
None yet
Recent Activity
updated
a collection
about 21 hours ago
Vlm
updated
a collection
2 days ago
Vlm
updated
a collection
2 days ago
Vlm
Organizations
None yet
Collections
1
datasets
None public yet