Incorrect Weights in Model Repositories

by francescortu - opened 1 day ago

1 day ago

Hi there! 😊

I've noticed that the models seem to have the wrong weights, specifically the weights for Emu3-Chat. It appears that the weights in the following repositories are all the same:

BAAI/Emu3-Gen-hf
BAAI/Emu3-Chat-hf
BAAI/Emu3-Chat

I suspect there was an error during the migration from the old repository to the new *-hf versions.

Thanks in advance for looking into this!

RaushanTurganbay

1 day ago

@francescortu hey! Thanks for reporting this. The hf weights should be mirrors of original weight and thus it is expected that Chat and Chat-hf share same weight with slightly different state dict keys

For the Gen-hf model, do you mean that the values of weights are same as for Chat-hf and is it causing bad generation quality?

francescortu

1 day ago

Yes, exactly. The BAAI/Emu3-Gen-hf model has the same weights as the BAAI/Emu3-Chat-hf model—not just identical state dict keys (which is expected), but the actual weight tensors are the same. I suspect there was an error during the conversion, where the input used was from Emu3-Chat instead of Emu3-Gen

RaushanTurganbay

about 4 hours ago

Very weird, oke, I can verify that quite later. Might have been an error indeed since the Gen-hf was updated recently

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment