
Add support for (vision) transformers, add options to set last-layer relevance #15

Draft
wants to merge 26 commits into main

Conversation

Maximilian-Stefan-Ernst
Contributor

Vision Transformers

Add explanations for (vision) transformer (ViT) models via the new package extension VisionTransformerExt, which depends on Metalhead.jl.

Adds the rules

  • SelfAttentionRule for MultiHeadSelfAttention layers
  • PositionalEmbeddingRule for ViPosEmbedding layers

Also adds support for some special layers of vision transformers by adding a ZeroRule method for each of them:

  • _flatten_spatial is a reshaping layer near the input
  • ClassTokens adds a class token to the model
  • SelectClassToken only retains the class token for the model prediction (I added this layer because Metalhead uses an anonymous function for this purpose, which we have to swap out for a "real" layer before explaining the model)

So far, no support was added for Flux.jl's built-in MultiHeadAttention layer, because it does not work nicely with Chains; see the sketch below. (Metalhead's MultiHeadSelfAttention layer is not limited to vision transformers, but can also be used to build "regular" transformer models as long as they use only self-attention, e.g. encoder-only models; something like BERT should be doable.)
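
Not part of this PR, but a minimal sketch of why Flux's built-in layer is awkward here, assuming current Flux semantics (the layer returns both the output and the attention scores):

```julia
using Flux

# Flux's built-in MultiHeadAttention returns a tuple (output, attention_scores),
# so it cannot be placed inside a Chain as-is:
mha = MultiHeadAttention(64; nheads = 4)
x = rand(Float32, 64, 16, 2)      # (embedding_dim, sequence_length, batch)
y, α = mha(x)                     # self-attention call returns two values

# A Chain expects each layer to map array -> array, so a wrapper would be needed:
Chain(xs -> first(mha(xs)))(x)    # works, but the wrapper is an anonymous function again
```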

In addition, the function prepare_vit can be used to prepare Metalhead's ViT (convert it to a Chain, add SelectClassToken layer).
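
A rough usage sketch, assuming the analyzer API of the Julia-XAI ecosystem (LRP, analyze) and that prepare_vit takes the Metalhead model as its only argument; the ViT configuration name is just an example:

```julia
using Flux, Metalhead, RelevancePropagation   # loading Metalhead activates the extension

vit = Metalhead.ViT(:tiny)            # Metalhead's vision transformer
model = prepare_vit(vit)              # convert to a Chain and insert the SelectClassToken layer

analyzer = LRP(model)
input = rand(Float32, 224, 224, 3, 1) # WHCN image batch
expl = analyze(input, analyzer)       # Explanation holding the input-space relevances
```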

Last layer relevance

Adds the keyword arguments normalize_output=true and R=nothing to LRP. If R is supplied, the relevances in the last layer are set to R. If normalize_output is false, the target neuron activation is not set to one, but remains the "raw" activation from the forward pass.
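
A hedged example of how the new options might look in use, assuming both keywords are accepted by the LRP constructor as described above (variable names are placeholders):

```julia
# Keep the "raw" target activation instead of normalizing it to one:
analyzer = LRP(model; normalize_output = false)

# Or set the last-layer relevances explicitly; R should match the model's output shape:
R = zeros(Float32, size(model(input)))
R[5, 1] = 1.0f0                    # e.g. put all relevance on output neuron 5 of sample 1
analyzer = LRP(model; R = R)
```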

Canonization

This PR already contains the changes from PR #14, since canonization of ViT models does not work properly without them; PR #14 should therefore be merged first.

ToDo

  • add tests
  • add docs

For documentation, I guess it would be nice to have an extra "Extensions" section in the docs, and have a small tutorial under "Extensions => Vision Transformer".


codecov bot commented Mar 16, 2024

Codecov Report

Attention: Patch coverage is 0% with 61 lines in your changes missing coverage. Please review.

Project coverage is 0.00%. Comparing base (7b2af98) to head (79b2d40).

Files                                          Patch %   Lines
ext/RelevancePropagationMetalheadExt/rules.jl  0.00%     51 Missing ⚠️
ext/RelevancePropagationMetalheadExt/utils.jl  0.00%     9 Missing ⚠️
src/rules.jl                                   0.00%     1 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main     #15       +/-   ##
==========================================
- Coverage   96.66%   0.00%   -96.67%     
==========================================
  Files          14      15        +1     
  Lines         660     698       +38     
==========================================
- Hits          638       0      -638     
- Misses         22     698      +676     


@Maximilian-Stefan-Ernst
Contributor Author

Maybe move prepare_vit to canonize

Resolved review threads (outdated): Project.toml, ext/VisionTransformerExt/VisionTransformerExt.jl, ext/VisionTransformerExt/utils.jl, src/canonize.jl, src/extensions.jl
Comment on lines 10 to 11
struct SelectClassToken end
Flux.@functor SelectClassToken
Member

XAIBase exports generic feature selectors.
Maybe these could be used here and extended for transformers?

Contributor Author

We can do that, but I don't get how these feature selectors are supposed to be used in a model / why there are no rules for them?

Contributor Author

Okay as we discussed, it does not really make sense to use the feature selectors. I think the remaining question is where you want to define new layers in the codebase - maybe an extra file src/layers.jl?

Member

> Okay as we discussed, it does not really make sense to use the feature selectors.

Sorry, it's been a while... Can you remind me what the exact issue was? 😅
I can vaguely remember it was something that should go in XAIBase.jl.

Similar to this: https://github.com/Julia-XAI/XAIBase.jl/blob/main/src/feature_selection.jl

Contributor Author

Yes, so vision transformers have a special token that is selected near the output, and all other tokens are discarded. Metalhead implements this through an anonymous function, so we can't use it for computing LRP. What I did was implement this simple Flux layer (and an associated rule) that is swapped in for the anonymous function. The problem with the feature selector is that we need an actual layer, so I think we decided that this is probably not the right place ^^
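
For context, a minimal sketch of what the swapped-in layer could look like; the struct and @functor lines are quoted from the diff above, while the forward pass and its indexing convention (class token stored first along the token dimension of an (embedding, tokens, batch) array) are an assumption:

```julia
struct SelectClassToken end
Flux.@functor SelectClassToken

# Keep only the class token, assumed to sit at index 1 of the token dimension:
(::SelectClassToken)(x::AbstractArray{<:Real,3}) = x[:, 1, :]
```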

Resolved review threads (outdated): src/extensions.jl, src/lrp.jl, test/test_canonize.jl