Resubmitting a model to use `chat_template` doesn't re-evaluate, but does change `chat_template` column

#1066
by xzuyn - opened

I tried to re-evaluate rombodawg/Rombos-LLM-V2.5-Qwen-72b so that it can be properly compared to Qwen/Qwen2.5-72B-Instruct, but it didn't re-evaluate, and now the leaderboard incorrectly states that the model was evaluated using the chat template. Can this be fixed, and can the model be properly resubmitted for evaluation using the `chat_template`?

[Screenshots attached: Screenshot_from_2024-12-19_03-59-13.png, Screenshot_from_2024-12-19_19-36-34.png]

This looks like the potential problem I mentioned (or guessed might happen) here; sorry, I may not have been clear about it at the time, but maybe that problem really is a thing?
Or is this due to the recent UI and internal code changes to the leaderboard?

Open LLM Leaderboard org

Yep, this looks like a bug we'll need to fix. It's normal that you can't re-evaluate by resubmitting the same model (that's to avoid spamming the leaderboard, as indicated in the FAQ), but it's an issue that the request file changes.
If one of you has some time to take a look, that would be great.
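For anyone who wants to dig in, here is a minimal sketch of the kind of guard that seems to be missing, assuming a hypothetical submission handler that writes one JSON request file per model; the function, field, and file names are illustrative, not the leaderboard's actual code:

```python
# Minimal sketch of the suspected bug, under the assumption that submissions
# are tracked as one JSON request file per model. All names here are
# hypothetical, not the leaderboard's real implementation.
import json
from pathlib import Path


def submit_model(model_id: str, use_chat_template: bool, requests_dir: Path) -> str:
    request_path = requests_dir / f"{model_id.replace('/', '_')}_request.json"

    new_request = {
        "model": model_id,
        "use_chat_template": use_chat_template,
        "status": "PENDING",
    }

    if request_path.exists():
        # Duplicate submission: don't queue a new evaluation (avoids spamming
        # the leaderboard), and don't overwrite the existing request file
        # either. Overwriting it here is what would make old results appear
        # to have been run with the new chat_template setting.
        return "Model already submitted; resubmission ignored."

    # First submission: record the request and queue the evaluation.
    request_path.write_text(json.dumps(new_request, indent=2))
    return "Submission queued for evaluation."
```

The point of the sketch is just that the duplicate-submission check should return before the request file is rewritten, so a rejected resubmission can't silently flip the `chat_template` flag on an existing evaluation.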
