BAAI
/

Incorrect Weights in Model Repositories

#2
by francescortu - opened

Hi there! 😊

I've noticed that the models seem to have the wrong weights, specifically the weights for Emu3-Chat. It appears that the weights in the following repositories are all the same:

BAAI/Emu3-Gen-hf
BAAI/Emu3-Chat-hf
BAAI/Emu3-Chat

I suspect there was an error during the migration from the old repository to the new *-hf versions.

Thanks in advance for looking into this!

@francescortu hey! Thanks for reporting this. The hf weights should be mirrors of original weight and thus it is expected that Chat and Chat-hf share same weight with slightly different state dict keys

For the Gen-hf model, do you mean that the values of weights are same as for Chat-hf and is it causing bad generation quality?

Yes, exactly. The BAAI/Emu3-Gen-hf model has the same weights as the BAAI/Emu3-Chat-hf model—not just identical state dict keys (which is expected), but the actual weight tensors are the same. I suspect there was an error during the conversion, where the input used was from Emu3-Chat instead of Emu3-Gen

Very weird, oke, I can verify that quite later. Might have been an error indeed since the Gen-hf was updated recently

Sign up or log in to comment