LeaderboardExplorer
Filter and display leaderboards based on selected criteria
Track, rank and evaluate open LLMs and chatbots
Select and filter benchmarks for text embedding tasks
Request evaluation results for a speech model
Explore hardware performance for language models
Submit code models for evaluation on benchmarks
Generate animated avatars from images
View and submit LLM evaluations
View and submit machine learning model evaluations
Analyze images to detect and label objects
Evaluate LLM cybersecurity risks
Display model benchmark results
Compare model answers to questions
View LLM Performance Leaderboard
Explore benchmark results for QA and long doc models
VLMEvalKit Evaluation Results Collection
Explore and analyze RewardBench leaderboard data
Explore and analyze code evaluation data
Display and filter multimodal model leaderboard results
Teach, test, evaluate language models with MTEB Arena
Visualize LLM progress with interactive filters
Compare AI models by voting on responses
Blind vote on HF TTS models!