Aymeric Roucher's picture

Aymeric Roucher

m-ric

AI & ML interests

Leading Agents at Hugging Face šŸ¤—

Recent Activity

Organizations

Hugging Face's profile picture Atmos Bank's profile picture Hugging Test Lab's profile picture Tools's profile picture HuggingFaceM4's profile picture lecocqassociate's profile picture huggingPartyParis's profile picture Supreme's profile picture FactSet's profile picture Propulse Lab's profile picture Leaderboard Organization's profile picture FactSet's profile picture CGIAR's profile picture Aperture Laboratories's profile picture AI Energy Score's profile picture C&A's profile picture Social Post Explorers's profile picture Dev Mode Explorers's profile picture Agent Collab's profile picture SLLHF's profile picture Data Agents's profile picture Hugging Face Party @ PyTorch Conference's profile picture Nerdy Face's profile picture Hugging Face Science's profile picture Agents Leaderboard's profile picture Smolagents Benchmark's profile picture Hugging Face Agents Course's profile picture

Posts 88

view post
Post
4892
Introducing š—¼š—½š—²š—» š——š—²š—²š—½-š—„š—²š˜€š—²š—®š—暝—°š—µ by Hugging Face! šŸ’„

OpenAI's latest agentic app Deep Research seems really good... But it's closed, as usual.

ā±ļø So with a team of cracked colleagues, we set ourselves a 24hours deadline to replicate and open-source Deep Research! ā±ļø

āž”ļø We built open-Deep-Research, an entirely open agent that can: navigate the web autonomously, scroll and search through pages, download and manipulate files, run calculation on data...

We aimed for the best performance: are the agent's answers really rigorous?

On GAIA benchmark, Deep Research had 67% accuracy on the validation set.
āž”ļø open Deep Research is at 55% (powered by o1), it is:
- the best pass@1 solution submitted
- the best open solution šŸ’ŖšŸ’Ŗ

And it's only getting started ! Please jump in, drop PRs, and let's bring it to the top !

Read the blog post šŸ‘‰ https://huggingface.co/blog/open-deep-research

Articles 9

Article
608

Open-source DeepResearch ā€“ Freeing our search agents