Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Duplicated from
nan/leaderboard
AIR-Bench
/
leaderboard
like
66
Running
on
CPU Upgrade
App
Files
Files
Community
28
Fetching metadata from the HF Docker repository...
ca2a141
leaderboard
/
src
4 contributors
History:
54 commits
hanhainebula
Modify the commands of evaluating
ca2a141
verified
10 months ago
display
fix: fix the bug in duplicated columns
10 months ago
about.py
Safe
4.42 kB
Modify the commands of evaluating
10 months ago
benchmarks.py
Safe
4.41 kB
Add msmarco for qa task
10 months ago
envs.py
Safe
754 Bytes
chore: clean up
10 months ago
read_evals.py
Safe
8.39 kB
Fix check when loading results file
10 months ago
utils.py
Safe
9.54 kB
fix: fix the bug in the annoymous checkbox
10 months ago