You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
found the bug "PLS remove model = model.merge_and_unload() and reuse the original 4bit base model instead of the fp16 base model". Details is in artidoro/qlora#254
the model in point 1 and point 2 shown below is diff, i've compared their respective generated text.. it's really different.
1.just aft 4bit training->gen = pipeline('text-generation', model=model, tokenizer=tokenizer, max_length=max_length)
2.model = PeftModel.from_pretrained(base_model, new_model)
model = model.merge_and_unload()
gen = pipeline('text-generation', model=model, tokenizer=tokenizer, max_length=max_length)
The text was updated successfully, but these errors were encountered: