Merge branch 'main' of https://huggingface.co/spaces/CONDA-Workshop/Data-Contamination-Report into pr/9 a44e89a OSainz commited on Apr 25, 2024
Code contamination in HumanEval and MBPP (#12) ffb0d75 verified OSainz AmeyaPrabhu commited on Apr 25, 2024
Add model-based results for MedNLI, RadNLI for GPT-3.5 and GPT-4 (#8) d57b460 verified Iker j-chim commited on Apr 23, 2024
Add data from "An Open-Source Data Contamination Report for Large Language Models" (#5) 619ed3b verified Iker vishaal27 commited on Apr 23, 2024
Contamination results updated based on ``https://arxiv.org/abs/2311.06233`` 36cae97 verified shahriargolchin commited on Apr 22, 2024
Import data from LM Contamination Index (#7) e1c863c verified Iker borgr OSainz commited on Apr 19, 2024
Add data from "Documenting Large Webtext Corpora: A Case Study on the Colossal Clean Crawled Corpus" (#6) 935e79b verified Iker vishaal27 commited on Apr 18, 2024