Fine-Grained Guidance for Retrievers: Leveraging LLMs' Feedback in Retrieval-Augmented Generation Paper • 2411.03957 • Published Nov 6, 2024
One Model, Multiple Modalities: A Sparsely Activated Approach for Text, Sound, Image, Video and Code Paper • 2205.06126 • Published May 12, 2022 • 1
Leveraging Print Debugging to Improve Code Generation in Large Language Models Paper • 2401.05319 • Published Jan 10, 2024 • 1
Lawformer: A Pre-trained Language Model for Chinese Legal Long Documents Paper • 2105.03887 • Published May 9, 2021
InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection Paper • 2501.04575 • Published Jan 8 • 23
InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks Paper • 2401.05507 • Published Jan 10, 2024 • 1
On the Multi-turn Instruction Following for Conversational Web Agents Paper • 2402.15057 • Published Feb 23, 2024
Ask-before-Plan: Proactive Language Agents for Real-World Planning Paper • 2406.12639 • Published Jun 18, 2024
MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks Paper • 2410.10563 • Published Oct 14, 2024 • 38
WebCanvas: Benchmarking Web Agents in Online Environments Paper • 2406.12373 • Published Jun 18, 2024
MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark Paper • 2406.01574 • Published Jun 3, 2024 • 45
MantisScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation Paper • 2406.15252 • Published Jun 21, 2024 • 16
LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs Paper • 2406.15319 • Published Jun 21, 2024 • 64
Investigating Answerability of LLMs for Long-Form Question Answering Paper • 2309.08210 • Published Sep 15, 2023 • 13
DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection for Conversational AI Paper • 2307.10172 • Published Jul 19, 2023 • 12