Clémentine Fourrier

clefourrier

AI & ML interests

None yet

Recent Activity

updated a dataset about 9 hours ago
demo-leaderboard-backend/requests
updated a collection about 23 hours ago
Leaderboards and benchmarks ✨
View all activity

Organizations

Hugging Face's profile picture Long Range Graph Benchmark's profile picture Evaluation datasets's profile picture BigScience: LMs for Historical Texts's profile picture HuggingFaceBR4's profile picture Cohere For AI's profile picture Open Graph Benchmark's profile picture HuggingFaceGECLM's profile picture Huggingface Projects's profile picture Pretrained Graph Transformers's profile picture Graph Datasets's profile picture BigCode's profile picture Hugging Face H4's profile picture InternLM's profile picture Vectara's profile picture GAIA's profile picture Hugging Face Smol Cluster's profile picture plfe's profile picture Open LLM Leaderboard's profile picture Qwen's profile picture Secure Learning Lab's profile picture Open Life Science AI's profile picture LLM360's profile picture TTS Eval (OLD)'s profile picture Leaderboard Organization's profile picture Bias Leaderboard Development's profile picture hallucinations-leaderboard's profile picture Demo Leaderboard's profile picture Demo leaderboard with an integrated backend's profile picture gg-hf's profile picture Clinical & Biomedical ML Leaderboards's profile picture AIM-Harvard's profile picture Women on Hugging Face's profile picture LMLLO2's profile picture Lighthouz AI's profile picture Open Arabic LLM Leaderboard's profile picture mx-test's profile picture LeaderboardsOnTheHub's profile picture HuggingFaceFW's profile picture IBM Granite's profile picture HF-contamination-detection's profile picture TTS AGI's profile picture Leader Board Test Org's profile picture Social Post Explorers's profile picture hsramall's profile picture Open RL Leaderboard's profile picture The Fin AI's profile picture Open Hebrew LLM's Leaderboard's profile picture La Leaderboard's profile picture gg-tt's profile picture HuggingFaceEval's profile picture HP Inc.'s profile picture Novel Challenge's profile picture Open LLM Leaderboard Archive's profile picture LLHF's profile picture SLLHF's profile picture lbhf's profile picture Inception's profile picture nltpt's profile picture Lighteval testing org's profile picture CléMax's profile picture Hugging Face Science's profile picture test_org's profile picture Coordination Nationale pour l'IA's profile picture LeMaterial's profile picture open-llm-leaderboard-react's profile picture Prompt Leaderboard's profile picture wut?'s profile picture Your Bench's profile picture leaderboard explorer's profile picture Open R1's profile picture

Posts 16

view post
Post
5775
In a basic chatbots, errors are annoyances. In medical LLMs, errors can have life-threatening consequences 🩸

It's therefore vital to benchmark/follow advances in medical LLMs before even thinking about deployment.

This is why a small research team introduced a medical LLM leaderboard, to get reproducible and comparable results between LLMs, and allow everyone to follow advances in the field.

openlifescienceai/open_medical_llm_leaderboard

Congrats to @aaditya and @pminervini !
Learn more in the blog: https://huggingface.co/blog/leaderboard-medicalllm

Articles 32

Article
603

Open-source DeepResearch – Freeing our search agents

datasets

None public yet