Hi Team,

When I load the Mixtral-based SFT MoE model `DAMO-NLP-SG/VideoLLaMA2-8x7B` using the inference code provided in the README.md, the following error is raised:
```
Traceback (most recent call last):
  File "/home/admin/apoorv/development/VideoLLaMA2/playground.py", line 33, in <module>
    inference()
  File "/home/admin/apoorv/development/VideoLLaMA2/playground.py", line 27, in inference
    model, processor, tokenizer = model_init(model_path)
  File "/home/admin/apoorv/development/VideoLLaMA2/videollama2/__init__.py", line 17, in model_init
    tokenizer, model, processor, context_len = load_pretrained_model(model_path, None, model_name, **kwargs)
  File "/home/admin/apoorv/development/VideoLLaMA2/videollama2/model/__init__.py", line 180, in load_pretrained_model
    model = Videollama2MixtralForCausalLM.from_pretrained(model_path, low_cpu_mem_usage=True, config=config, **kwargs)
  File "/home/admin/.conda/envs/vl2/lib/python3.10/site-packages/transformers/modeling_utils.py", line 3838, in from_pretrained
    ) = cls._load_pretrained_model(
  File "/home/admin/.conda/envs/vl2/lib/python3.10/site-packages/transformers/modeling_utils.py", line 4278, in _load_pretrained_model
    state_dict = load_state_dict(shard_file, is_quantized=is_quantized)
  File "/home/admin/.conda/envs/vl2/lib/python3.10/site-packages/transformers/modeling_utils.py", line 516, in load_state_dict
    with safe_open(checkpoint_file, framework="pt") as f:
safetensors_rust.SafetensorError: Error while deserializing header: InvalidHeaderDeserialization
```
I tried to find the reason for this and came across the following issue, where the root cause was that the weights were not saved correctly: the state dict contained empty dictionaries, which safetensors cannot serialize. huggingface/transformers#27397 (comment)
To solve this, some changes would need to be made on your end before saving the model checkpoint, so that the empty entries are removed prior to serialization.
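A minimal sketch of that pre-save fix, assuming (per the linked issue) that the corrupt shards come from empty-dict placeholder entries in the state dict; the function name and the usage lines are illustrative, not code from this repo:

```python
def drop_empty_entries(state_dict):
    """Return a copy of state_dict without empty-dict placeholder entries.

    safetensors can only serialize tensors, so an empty dict left in the
    state dict (e.g. by a frozen/uninitialized module) produces a shard
    that later fails with InvalidHeaderDeserialization on load.
    """
    return {
        key: value
        for key, value in state_dict.items()
        if not (isinstance(value, dict) and len(value) == 0)
    }

# Usage sketch before saving (names illustrative):
#   clean_sd = drop_empty_entries(model.state_dict())
#   model.save_pretrained(output_dir, state_dict=clean_sd)
```

An alternative workaround is re-saving with `safe_serialization=False`, which falls back to pickle-based `.bin` shards and sidesteps the safetensors restriction entirely.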