Retrieval Metrics Indexes Parameter needs Continuous Indexes? #2757
Unanswered
tsteffek
asked this question in
Classification
Replies: 0 comments
Sadly, there doesn't seem to be a retrieval metrics category, so I've added it here.
I seem to have had a crucial misunderstanding of the retrieval metrics' indexes parameter. I'd like confirmation that I now have it right, and I want to raise awareness of this issue.
So I've always understood the indexes parameter as a way for torchmetrics to differentiate the queries on BATCH-LEVEL. As in, the following code snippets should be equal:
When in reality the first one is equal to:
It treats the first query of the second batch as the same query as the first query of the first batch, instead of acknowledging that it comes from a different batch and is therefore probably a different query.
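To make the effect concrete, here is a rough pure-Python model of the behaviour described above. It assumes the metric simply pools everything passed to `update` and only groups by index value when the result is computed. `PooledGroups` is a hypothetical stand-in written for illustration, not torchmetrics code, and the scores and labels are made up.

```python
from collections import defaultdict


class PooledGroups:
    """Hypothetical stand-in for a retrieval metric's accumulated state."""

    def __init__(self):
        self.indexes = []
        self.preds = []
        self.target = []

    def update(self, preds, target, indexes):
        # Every batch is appended to the same running lists.
        self.preds.extend(preds)
        self.target.extend(target)
        self.indexes.extend(indexes)

    def groups(self):
        # Grouping uses the index value only; batch boundaries are not stored.
        grouped = defaultdict(list)
        for idx, pred, tgt in zip(self.indexes, self.preds, self.target):
            grouped[idx].append((pred, tgt))
        return dict(grouped)


metric = PooledGroups()
metric.update([0.9, 0.1], [1, 0], [0, 0])  # batch 1: one query, labelled 0
metric.update([0.8, 0.3], [0, 1], [0, 0])  # batch 2: a different query, labelled 0 again
pooled = metric.groups()
```

Under this assumption `pooled` holds a single group with four documents, so the two queries are scored as if they were one.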
So the code I've used in my collator:
That only iterates on the batch level seems to have been wrong the whole time.
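One possible fix, sketched here under the assumption that indexes have to be unique across the whole evaluation run, is to shift each batch's local query ids by a running offset. `ContinuousIndexCollator` and its argument are hypothetical names for illustration and not my original collator.

```python
class ContinuousIndexCollator:
    """Hypothetical helper: turns per-batch query ids into run-wide unique ids."""

    def __init__(self):
        self.offset = 0

    def __call__(self, local_indexes):
        # local_indexes: one query id per document, restarting at 0 in every batch.
        if not local_indexes:
            return []
        shifted = [self.offset + i for i in local_indexes]
        # Advance past the largest id used so the next batch cannot collide.
        self.offset += max(local_indexes) + 1
        return shifted


collate_indexes = ContinuousIndexCollator()
batch1 = collate_indexes([0, 0, 1])
batch2 = collate_indexes([0, 1, 1])
```

Here `batch1` stays `[0, 0, 1]` and `batch2` becomes `[2, 3, 3]`. One caveat: if the collator runs inside several DataLoader worker processes, each worker keeps its own copy of the offset, so the shift would probably have to happen in the main process instead, or be derived from a dataset-level query id.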
This might be obvious to people who work more with retrieval, but it wasn't obvious to me from the docs, and I found absolutely zero discussions about it online. ChatGPT-4o also didn't get this one right, for whatever that's worth (Chat).
Could somebody confirm this find? And tell me whether I was stupid and this should have been obvious or whether some clarification should be added to the docs?