Replies: 1 comment 1 reply
-
By any chance, did you have SDPA enabled? The implementation currently returns the attention weights as the None type when SDPA is enabled. Add the |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I've tested code as in Multilingual_ASR.ipynb,
only got the following error while reading kv values in cross attention cache
How to solve the problem?
Beta Was this translation helpful? Give feedback.
All reactions