Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
walterShen
's Collections
Code LMs Evaluation
Code LMs Benchmark
Prompt Engineering
World Model
Synthetic Data
Code LMs
Agent
HAI4Code
Code LMs Benchmark
updated
Mar 6, 2024
Upvote
1
Running
1.11k
1.11k
Big Code Models Leaderboard
π
Submit code models for evaluation on benchmarks
Running
430
430
Can Ai Code Results
π
Generate animated avatars from images
openai/openai_humaneval
Viewer
β’
Updated
Jan 4, 2024
β’
164
β’
77.4k
β’
269
google-research-datasets/mbpp
Viewer
β’
Updated
Jan 4, 2024
β’
1.4k
β’
67.2k
β’
158
nuprl/MultiPL-E
Viewer
β’
Updated
17 days ago
β’
12.7k
β’
264k
β’
46
evalplus/mbppplus
Viewer
β’
Updated
Apr 17, 2024
β’
378
β’
38.1k
β’
8
BAAI/TACO
Updated
Jun 19, 2024
β’
2.15k
β’
91
princeton-nlp/SWE-bench
Viewer
β’
Updated
Oct 24, 2024
β’
21.5k
β’
19.8k
β’
94
codeparrot/apps
Viewer
β’
Updated
Oct 20, 2022
β’
20k
β’
5.66k
β’
153
cruxeval-org/cruxeval
Viewer
β’
Updated
Jan 23, 2024
β’
800
β’
1.36k
β’
14
tianyang/repobench_python_v1.1
Viewer
β’
Updated
Feb 27, 2024
β’
23.6k
β’
344
β’
7
SciPhi/textbooks-are-all-you-need-lite
Viewer
β’
Updated
Sep 30, 2023
β’
682k
β’
351
β’
179
nampdn-ai/tiny-codes
Viewer
β’
Updated
Sep 30, 2023
β’
1.63M
β’
364
β’
237
allenai/math_qa
Updated
Jan 18, 2024
β’
33k
β’
95
deepmind/code_contests
Viewer
β’
Updated
Jun 11, 2023
β’
4.04k
β’
8.41k
β’
140
FudanSELab/ClassEval
Viewer
β’
Updated
Aug 26, 2024
β’
100
β’
510
β’
8
ML4SE2023-G1-WizardCoder/ML4SE23_G1_MBCPP-SCoT
Viewer
β’
Updated
Oct 25, 2023
β’
870
β’
51
Muennighoff/quixbugs
Viewer
β’
Updated
Mar 26, 2023
β’
40
β’
93
bigcode/humanevalpack
Updated
May 1, 2024
β’
15.6k
β’
77
NTU-NLP-sg/xCodeEval
Updated
Jun 6, 2024
β’
77.3k
β’
40
JetBrains-Research/commit-chronicle
Viewer
β’
Updated
Oct 5, 2023
β’
10.9M
β’
3.15k
β’
7
tianyang/repobench_java_v1.1
Viewer
β’
Updated
Feb 27, 2024
β’
26.1k
β’
280
zijwang/CrossCodeEval
Updated
Oct 19, 2023
β’
31
Upvote
1
Share collection
View history
Collection guide
Browse collections