
1.78 - cannot load mixtral 8x7b anymore #1219

Open
IcePanther opened this issue Nov 17, 2024 · 6 comments
Comments

@IcePanther

IcePanther commented Nov 17, 2024

Hi,

After upgrading to 1.78 today, I can't load mixtral-based 8x7b models anymore.

Other models such as 30b/70b llama-type models work.

I get the same error whether I use Vulkan or CLBlast, and with different models that have different quantizations (one Q8_0, the other Q6_M).

The error reads:

llama_model_load: error loading model: missing tensor 'blk.0.ffn_down_exps.weight'
llama_load_model_from_file: failed to load model
Traceback (most recent call last):
  File "koboldcpp.py", line 4720, in <module>
    main(parser.parse_args(),start_server=True)
  File "koboldcpp.py", line 4344, in main
    loadok = load_model(modelname)
  File "koboldcpp.py", line 900, in load_model
    ret = handle.load_model(inputs)
OSError: exception: access violation reading 0x00000000000018A4
[17628] Failed to execute script 'koboldcpp' due to unhandled exception!

Previous versions of KoboldCPP worked with those same models without a problem.
After reverting, I can confirm 1.77 works.
Both are the "cu12" builds (I still use CUDA for smaller models).

System has 64 GB RAM, 16GB VRAM (3080Ti laptop), Windows 11

Thanks in advance,

@Conduitry

I'm also seeing this same error.

@LostRuins
Owner

Yes, unfortunately this is because of the backend refactor in ggerganov#10026.

See ggerganov#10244.

You can requantize the Mixtral model or use https://huggingface.co/mradermacher/Mixtral-8x7B-Instruct-v0.1-GGUF/

I will see if I can port back support for the old quants, but I cannot guarantee it.
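If you are unsure whether a particular GGUF file already uses the merged expert tensors (and will therefore load on 1.78), one quick check is to list its tensor names with the gguf-py package. This is a minimal sketch, assuming `pip install gguf`; the file path is a placeholder:

```python
# Minimal sketch: list the MoE FFN tensors in a GGUF file (assumes `pip install gguf`).
# Files converted after the refactor contain merged expert tensors such as
# blk.0.ffn_down_exps.weight; older conversions store the experts separately.
from gguf import GGUFReader

reader = GGUFReader("mixtral-8x7b-instruct.Q8_0.gguf")  # placeholder path

for tensor in reader.tensors:
    if any(part in tensor.name for part in ("ffn_down", "ffn_up", "ffn_gate")):
        print(tensor.name, list(tensor.shape))
```

If the output includes blk.*.ffn_down_exps.weight (and the matching "_exps" tensors for ffn_up and ffn_gate), the file is in the new layout; if not, it needs to be requantized or replaced with a freshly converted GGUF.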

@IcePanther
Author

Thanks for the info; I was unaware of this.

It seems that updated models are indeed available on HF. If these work, they will be the simplest solution. I'll report back once I have downloaded some and confirmed they work.

@Conduitry

The new quantizations are working for me with 1.78. Thank you!

@LostRuins
Owner

I have crafted an ugly hack because I hate losing backwards compatibility.

d5feaa8

Should work again in the next version.

@IcePanther
Author

Can confirm the new quantized models work for me too with 1.78.

I kept the old ones for now, to test whether the backwards-compatibility "ugly hack" works in the next version.
