Demystifying Long Chain-of-Thought Reasoning in LLMs Paper β’ 2502.03373 β’ Published about 23 hours ago β’ 18
MetaOcc: Surround-View 4D Radar and Camera Fusion Framework for 3D Occupancy Prediction with Dual Training Strategies Paper β’ 2501.15384 β’ Published 12 days ago
ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning Paper β’ 2502.01100 β’ Published 3 days ago β’ 12
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate Paper β’ 2501.17703 β’ Published 8 days ago β’ 50
Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos Paper β’ 2501.13826 β’ Published 14 days ago β’ 22
UI-TARS: Pioneering Automated GUI Interaction with Native Agents Paper β’ 2501.12326 β’ Published 16 days ago β’ 48
ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models Paper β’ 2406.20015 β’ Published Jun 28, 2024 β’ 1
HoLLMwood: Unleashing the Creativity of Large Language Models in Screenwriting via Role Playing Paper β’ 2406.11683 β’ Published Jun 17, 2024
HoLLMwood: Unleashing the Creativity of Large Language Models in Screenwriting via Role Playing Paper β’ 2406.11683 β’ Published Jun 17, 2024
Data-Efficient Massive Tool Retrieval: A Reinforcement Learning Approach for Query-Tool Alignment with Language Models Paper β’ 2410.03212 β’ Published Oct 4, 2024
Data-Efficient Massive Tool Retrieval: A Reinforcement Learning Approach for Query-Tool Alignment with Language Models Paper β’ 2410.03212 β’ Published Oct 4, 2024
Chain-of-Reasoning: Towards Unified Mathematical Reasoning in Large Language Models via a Multi-Paradigm Perspective Paper β’ 2501.11110 β’ Published 18 days ago β’ 2
Chain-of-Reasoning: Towards Unified Mathematical Reasoning in Large Language Models via a Multi-Paradigm Perspective Paper β’ 2501.11110 β’ Published 18 days ago β’ 2
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey Paper β’ 2412.18619 β’ Published Dec 16, 2024 β’ 54
xCoT: Cross-lingual Instruction Tuning for Cross-lingual Chain-of-Thought Reasoning Paper β’ 2401.07037 β’ Published Jan 13, 2024 β’ 2
Emulated Disalignment: Safety Alignment for Large Language Models May Backfire! Paper β’ 2402.12343 β’ Published Feb 19, 2024
m3P: Towards Multimodal Multilingual Translation with Multimodal Prompt Paper β’ 2403.17556 β’ Published Mar 26, 2024 β’ 1
The Fine Line: Navigating Large Language Model Pretraining with Down-streaming Capability Analysis Paper β’ 2404.01204 β’ Published Apr 1, 2024
Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model Paper β’ 2404.04167 β’ Published Apr 5, 2024 β’ 13