Post
4892
Introducing š¼š½š²š» šš²š²š½-š„š²šš²š®šæš°šµ by Hugging Face! š„
OpenAI's latest agentic app Deep Research seems really good... But it's closed, as usual.
ā±ļø So with a team of cracked colleagues, we set ourselves a 24hours deadline to replicate and open-source Deep Research! ā±ļø
ā”ļø We built open-Deep-Research, an entirely open agent that can: navigate the web autonomously, scroll and search through pages, download and manipulate files, run calculation on data...
We aimed for the best performance: are the agent's answers really rigorous?
On GAIA benchmark, Deep Research had 67% accuracy on the validation set.
ā”ļø open Deep Research is at 55% (powered by o1), it is:
- the best pass@1 solution submitted
- the best open solution šŖšŖ
And it's only getting started ! Please jump in, drop PRs, and let's bring it to the top !
Read the blog post š https://huggingface.co/blog/open-deep-research
OpenAI's latest agentic app Deep Research seems really good... But it's closed, as usual.
ā±ļø So with a team of cracked colleagues, we set ourselves a 24hours deadline to replicate and open-source Deep Research! ā±ļø
ā”ļø We built open-Deep-Research, an entirely open agent that can: navigate the web autonomously, scroll and search through pages, download and manipulate files, run calculation on data...
We aimed for the best performance: are the agent's answers really rigorous?
On GAIA benchmark, Deep Research had 67% accuracy on the validation set.
ā”ļø open Deep Research is at 55% (powered by o1), it is:
- the best pass@1 solution submitted
- the best open solution šŖšŖ
And it's only getting started ! Please jump in, drop PRs, and let's bring it to the top !
Read the blog post š https://huggingface.co/blog/open-deep-research