Very good work!
But after a brief reading of the VisionTransformerDiffPruning model code in vit_l2_3keep_senet.py, I was puzzled by the token pruning. Token pruning implies a reduction in the number of tokens (Figure 2 in the paper), yet I didn't find any reduction in the number of tokens in VisionTransformerDiffPruning. Instead, there is a mask distinguishing informative tokens from placeholders; a representative token is then obtained based on the mask and concatenated with x (x = torch.cat((x, represent_token), dim=1)). This is what confuses me: the number of tokens in the feature x is not reduced. Doesn't this hurt efficiency?
Maybe I misunderstood, and I hope you can give a detailed explanation.
I look forward to your reply.
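For context, here is a minimal PyTorch sketch (my own illustration, not code from this repo) of the distinction I mean: mask-based "soft" pruning leaves the token count unchanged, while "hard" pruning gathers the kept tokens and actually shrinks the sequence:

```python
import torch

# Soft pruning: uninformative tokens are suppressed via a mask,
# but the tensor shape (batch, tokens, dim) stays the same.
x = torch.randn(2, 197, 768)                    # (batch, tokens, dim)
mask = (torch.rand(2, 197) > 0.5).float()       # 1 = keep, 0 = placeholder
x_soft = x * mask.unsqueeze(-1)                 # still (2, 197, 768)

# Hard pruning: kept tokens are gathered, so the sequence length shrinks.
# (Shown for a single sample, since n_keep can differ per sample.)
keep_idx = mask[0].nonzero(as_tuple=True)[0]    # indices of kept tokens
x_hard = x[0:1].index_select(1, keep_idx)       # (1, n_keep, 768)

print(x_soft.shape, x_hard.shape)
```

In the soft case, every subsequent attention layer still processes all 197 tokens, so the FLOPs are unchanged unless the attention is also masked or the tokens are physically removed, which is the source of my question.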