I ran into a problem after pruning the MLP layers of qwen2.5-0.5b. The intermediate size of each layer after pruning is:
"0": 4864,
"1": 4864,
"2": 4864,
"3": 4864,
"4": 2816,
"5": 4864,
"6": 4864,
"7": 3840,
"8": 4864,
"9": 2816,
"10": 4864,
"11": 4864,
"12": 4864,
"13": 768,
"14": 4864,
"15": 4864,
"16": 4864,
"17": 768,
"18": 4864,
"19": 4864,
"20": 4864,
"21": 4864,
"22": 4864,
"23": 4864
However, in the config.json file, intermediate_size is a single global value fixed at 4864, so when I tested the pruned model with lm eval, I got this error: ValueError: Trying to set a tensor of shape torch.Size([896, 768]) in "weight" (which has shape torch.Size([896, 4864])), this looks incorrect. Is there any solution? Or is this a different issue?
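To make the mismatch concrete, here is a minimal sketch that recovers the per-layer intermediate size from the shapes of the saved weights and lists the layers that disagree with the global value in config.json. It assumes the checkpoint uses Qwen2-style parameter names (`model.layers.<i>.mlp.down_proj.weight`, with shape `[hidden_size, intermediate_size]`). The helper names and the toy shapes are hypothetical and only for illustration.

```python
import re

# Assumption: Qwen2-style parameter naming. Adjust the pattern if the
# pruned checkpoint uses different names.
DOWN_PROJ = re.compile(r"model\.layers\.(\d+)\.mlp\.down_proj\.weight$")


def per_layer_intermediate_sizes(shapes):
    """Return {layer_index: intermediate_size} read from down_proj shapes.

    `shapes` maps parameter name -> tensor shape, for example
    {name: tuple(t.shape) for name, t in state_dict.items()}.
    """
    sizes = {}
    for name, shape in shapes.items():
        match = DOWN_PROJ.search(name)
        if match:
            # down_proj.weight is laid out as [hidden_size, intermediate_size]
            sizes[int(match.group(1))] = shape[1]
    return dict(sorted(sizes.items()))


def mismatched_layers(sizes, config_intermediate_size):
    """Return the layers whose size differs from the global config value."""
    return {i: s for i, s in sizes.items() if s != config_intermediate_size}


# Toy shapes that mirror the error above (hidden_size = 896).
example_shapes = {
    "model.layers.12.mlp.down_proj.weight": (896, 4864),
    "model.layers.13.mlp.down_proj.weight": (896, 768),
    "model.layers.13.mlp.up_proj.weight": (768, 896),  # ignored by the pattern
}

sizes = per_layer_intermediate_sizes(example_shapes)
bad = mismatched_layers(sizes, 4864)
print(bad)
```

With such a map in hand, one possible approach (not confirmed here) is to build the model from the original config, replace the MLP projections of the listed layers with linear layers of the pruned size, and only then load the pruned weights.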