
MobileLLM safetensors seem to be missing model.embed_tokens.weight #34759

Open
2 of 4 tasks
avishaiElmakies opened this issue Nov 16, 2024 · 3 comments

@avishaiElmakies
Contributor

avishaiElmakies commented Nov 16, 2024

System Info

  • transformers version: 4.46.2
  • Platform: Linux-6.6.20-aufs-1-x86_64-with-glibc2.36
  • Python version: 3.11.2
  • Huggingface_hub version: 0.26.2
  • Safetensors version: 0.4.5
  • Accelerate version: 1.1.0
  • Accelerate config: not found
  • PyTorch version (GPU?): 2.5.1+cu124 (True)
  • Tensorflow version (GPU?): not installed (NA)
  • Flax version (CPU?/GPU?/TPU?): not installed (NA)
  • Jax version: not installed
  • JaxLib version: not installed
  • Using distributed or parallel set-up in script?: no
  • Using GPU in script?: no
  • GPU type: NVIDIA RTX A5000

Who can help?

@ArthurZucker

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

from transformers import AutoModelForCausalLM

mobilellm = AutoModelForCausalLM.from_pretrained("facebook/MobileLLM-125M", trust_remote_code=True)

will output: Some weights of MobileLLMForCausalLM were not initialized from the model checkpoint at facebook/MobileLLM-125M and are newly initialized: ['model.embed_tokens.weight']

and the embedding weights will be random. When loading with use_safetensors=False, everything works as expected.
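The warning above is the standard behavior of PyTorch-style partial loading: keys absent from the checkpoint are left at their random initialization and reported as missing. A minimal, self-contained sketch of this mechanism, using a toy module with hypothetical shapes rather than MobileLLM's real architecture:

```python
import torch
import torch.nn as nn

# Toy stand-in for the model (hypothetical names/shapes, not MobileLLM's real config).
class Tiny(nn.Module):
    def __init__(self):
        super().__init__()
        self.embed_tokens = nn.Embedding(8, 4)
        self.lm_head = nn.Linear(4, 8, bias=False)

model = Tiny()

# A checkpoint that lacks the embedding weight, mimicking the reported safetensors file.
ckpt = {"lm_head.weight": torch.zeros(8, 4)}
result = model.load_state_dict(ckpt, strict=False)

# The embedding stays randomly initialized and is reported as missing.
print(result.missing_keys)  # ['embed_tokens.weight']
```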

Expected behavior

Loading with safetensors should behave the same as loading without them.

@mayankagarwals
Contributor

Hi 👋
I am able to reproduce this; looking into it!

@mayankagarwals
Contributor

mayankagarwals commented Nov 17, 2024

Can you please provide the code snippet where you are not seeing any error (without using safetensors)? @avishaiElmakies

@avishaiElmakies
Contributor Author

avishaiElmakies commented Nov 17, 2024

There should be a single "error" about lm_head.weight, since the model uses weight tying for the embedding and output layers. Both safetensors and normal loading report this.

The problem is that when using safetensors, the embedding layer seems to be missing, which causes problems with both the embedding layer and the output layer.

Maybe I should have been clearer about that in the bug report (sorry about that).
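The tying described above can be sketched in plain PyTorch (a generic illustration with hypothetical sizes, not MobileLLM's actual code): the output head reuses the embedding's parameter tensor, so a single missing embed_tokens weight corrupts both layers.

```python
import torch
import torch.nn as nn

# Hypothetical sizes, for illustration only.
vocab_size, hidden = 8, 4
embed_tokens = nn.Embedding(vocab_size, hidden)
lm_head = nn.Linear(hidden, vocab_size, bias=False)

# Weight tying: the output head shares the embedding's parameter tensor.
lm_head.weight = embed_tokens.weight

# Both layers now point at the same storage, so if embed_tokens.weight is
# missing from a checkpoint, lm_head is broken as well.
print(lm_head.weight.data_ptr() == embed_tokens.weight.data_ptr())  # True
```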
