Skip to content

Doubt Regarding Retrieving Documents in Stemmed Version #937

Answered by lintool
souravsaha asked this question in Q&A
Discussion options

You must be logged in to vote

The Lucene "Analyzer" is responsible for handling stemming, stopword removal, etc.

This is where Anserini chooses its analyzer: https://github.com/castorini/anserini/blob/master/src/main/java/io/anserini/index/IndexCollection.java#L738

By default, we stem and remove stopwords.

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by lintool
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants