lm_head.weight is missing
#3 opened by DeepMount00
The final lm_head.weight is missing from the checkpoint. Why?
They've seemingly uploaded the underlying Qwen2Model instead of the Qwen2ForCausalLM.
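A minimal sketch of why this is harmless in practice, assuming a hypothetical repo id "org/model-name" (the actual repo for this thread isn't named here): loading the checkpoint through the causal-LM class re-ties lm_head.weight to the embedding matrix, so the "missing" tensor is reconstructed automatically.

```python
import torch
from transformers import AutoModelForCausalLM

# "org/model-name" is a hypothetical placeholder for this repo's id
model = AutoModelForCausalLM.from_pretrained("org/model-name")

# With tie_word_embeddings=True the two tensors share the same storage
embed = model.get_input_embeddings().weight    # model.embed_tokens.weight
head = model.get_output_embeddings().weight    # lm_head.weight
print(embed.data_ptr() == head.data_ptr())     # True: one shared weight
```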
yeah
This model sets tie_word_embeddings=True, which shares the weight between "model.embed_tokens" and "lm_head". The two tensors have the same shape, [vocab_size, hidden_size], so you can use "model.embed_tokens.weight" as "lm_head.weight" directly; the transpose only appears when you compute the logits yourself, as hidden_states @ embed_tokens.weight.T.
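A minimal sketch of that manual computation, again using the hypothetical repo id "org/model-name": load the base Qwen2Model (which has no lm_head), then project the final hidden states through the transposed embedding matrix to get logits.

```python
import torch
from transformers import AutoModel, AutoTokenizer

repo = "org/model-name"  # hypothetical placeholder for this repo's id
tok = AutoTokenizer.from_pretrained(repo)
base = AutoModel.from_pretrained(repo)  # Qwen2Model, no lm_head attached

inputs = tok("Hello", return_tensors="pt")
hidden = base(**inputs).last_hidden_state  # [batch, seq_len, hidden_size]

# The tied lm_head is just the embedding matrix, transposed in the matmul
W = base.get_input_embeddings().weight     # [vocab_size, hidden_size]
logits = hidden @ W.T                      # [batch, seq_len, vocab_size]
print(logits.shape)
```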