Hi there,
I've been making LoRAs for GPTQ models in ooba-webui for a while, using the Transformers model loader to load the GPTQ model rather than the ExLlamaV2 loader, since the Training tab isn't compatible with models loaded by ExLlamaV2. I need to use a 4-bit GPTQ model, as the full-sized model doesn't fit on my 24 GB GPU.
Now, EXL2 is supposed to quantize better than GPTQ, I have an EXL2-quantized version of the model, and I've read elsewhere here that EXL2 supports LoRAs. But how do I actually train (not just apply) a LoRA on my GPU using the EXL2 model? I can't load the EXL2 model in ooba with the Transformers model loader, as that produces an error, and as far as I can tell axolotl doesn't support training on a model that is already in EXL2 format.
Should I just train a LoRA on the GPTQ version of the model and then apply the result to the EXL2 model, even though the two were quantized differently? Or have I misunderstood this process entirely?
Thanks for any advice.
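For reference, here is a minimal sketch of what that Transformers-loader training path amounts to outside the webui, using transformers + peft on a GPTQ checkpoint. This is just my rough understanding, not the webui's actual code; the model path, the toy dataset, and the hyperparameters are all placeholders.

```python
# Rough sketch: LoRA training on a GPTQ checkpoint with transformers + peft.
# Assumes optimum plus an auto-gptq/gptqmodel backend are installed so that
# from_pretrained can load the quantized weights. All names are placeholders.
from datasets import Dataset
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer, TrainingArguments)

model_id = "path/to/4bit-gptq-model"  # placeholder

tokenizer = AutoTokenizer.from_pretrained(model_id)
tokenizer.pad_token = tokenizer.eos_token  # Llama-style tokenizers ship without a pad token
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

# Freeze the quantized base weights and prepare the model for k-bit training.
model = prepare_model_for_kbit_training(model)

lora_config = LoraConfig(
    r=32,
    lora_alpha=64,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],  # Llama-style names; adjust per architecture
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)  # only the LoRA adapters receive gradients
model.print_trainable_parameters()

# Placeholder dataset: replace with your real training text.
texts = ["### Instruction: ...\n### Response: ..."]
dataset = Dataset.from_dict({"text": texts}).map(
    lambda batch: tokenizer(batch["text"], truncation=True, max_length=512),
    batched=True, remove_columns=["text"])

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="lora-out",
        per_device_train_batch_size=1,
        gradient_accumulation_steps=8,
        num_train_epochs=3,
        learning_rate=2e-4,
        fp16=True,
        logging_steps=10,
    ),
    train_dataset=dataset,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),  # causal-LM labels
)
trainer.train()
model.save_pretrained("lora-out")  # writes adapter_config.json + adapter weights
```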
-
Okay, so I tried this out in practice. I trained a LoRA on a GPTQ 4-bit quantized version of the model (loaded with the Transformers model loader), then made an 8-bit EXL2 quant of the original model and applied the LoRA to it in ooba-webui (loaded with the ExLlamaV2 loader). It did have some effect, as the Deterministic output of the chat changed, but the personality of the result was very different from what the 4-bit GPTQ model has when the same LoRA is applied. Seeing as this technique didn't seem to work, I'd still like to know how to train a LoRA directly on an EXL2 model.
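In case anyone wants to reproduce the comparison: as far as I understand, applying a LoRA in ooba with the ExLlamaV2 loader goes through ExLlamaV2's own LoRA support. Below is a rough sketch of doing the same thing directly with the exllamav2 Python API, based on its bundled examples; the paths are placeholders and the exact API may differ between exllamav2 versions.

```python
# Apply a (GPTQ-trained) LoRA adapter to an EXL2-quantized model and compare outputs.
# Paths are placeholders; the 8-bit EXL2 quant itself was made beforehand with
# exllamav2's convert.py, e.g.:
#   python convert.py -i <original_fp16_model_dir> -o <work_dir> -cf <exl2_8bpw_dir> -b 8.0
from exllamav2 import (ExLlamaV2, ExLlamaV2Cache, ExLlamaV2Config,
                       ExLlamaV2Lora, ExLlamaV2Tokenizer)
from exllamav2.generator import ExLlamaV2BaseGenerator, ExLlamaV2Sampler

config = ExLlamaV2Config()
config.model_dir = "<exl2_8bpw_dir>"  # placeholder
config.prepare()

model = ExLlamaV2(config)
model.load()
tokenizer = ExLlamaV2Tokenizer(config)
cache = ExLlamaV2Cache(model)
generator = ExLlamaV2BaseGenerator(model, cache, tokenizer)

# Load the adapter produced by the training run (adapter_config.json + weights).
lora = ExLlamaV2Lora.from_directory(model, "<lora_dir>")  # placeholder

settings = ExLlamaV2Sampler.Settings()
settings.top_k = 1  # greedy decoding, so the with/without comparison is deterministic

prompt = "..."  # placeholder
with_lora = generator.generate_simple(prompt, settings, 200, loras=[lora])
without_lora = generator.generate_simple(prompt, settings, 200)
print(with_lora)
print(without_lora)
```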