Replies: 1 comment 2 replies
-
You might also want to try using the matrix before/after softmax. |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi!
I was wondering if it's possible to directly measure similarity between any 2 frames (as a real number).
The idea is simple - since in the process of memory matching XMem takes top-k most similar frames to the query frame, it should be pretty straightforward to measure this "similarity" explicitly.
However, I'm stuck on implementation. The affinity matrix of size
[N*H*W x H*W]
implies that every single pixel in the output image/feature map is a linear combination of every single pixel in all theN
memory frames.So I have 2 questions:
A
would be of sizeHW x HW
, so what do the individual entries mean? IfA[i, j]
is a large positive number, does this mean that the pixeli
in image 1 is very similar to the pixelj
in image 2?Beta Was this translation helpful? Give feedback.
All reactions