Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Zooma3 - Annotation Predictions #23

Open
olgavrou opened this issue Nov 8, 2017 · 0 comments
Open

Zooma3 - Annotation Predictions #23

olgavrou opened this issue Nov 8, 2017 · 0 comments
Assignees

Comments

@olgavrou
Copy link
Contributor

olgavrou commented Nov 8, 2017

Annotation prediction is constructed using the decorator pattern.
Prediction should be able to:
Search by property value
Search by property type and property value
For 1 and 2 additional option to boost certain sources
For 1 and 2 additional option to filter exclusively certain sources
For 1 and 2 additional option to boost certain topics
For 1 and 2 additional option to filter exclusively certain topics
For 3-6 should be able to rank topics or sources
If nothing found in Zooma should fall back to OLS strict search and give max GOOD confidence
If nothing returns should do a loose OLS search and return results with MEDIUM or LOW confidence

Prediction and score should be done as below (search on Solr annotation summaries):
Search
search and retrieve documents based on text search, boost most recent documents
Primary Scoring
normalise Solr score result and boost by:
source number
votes (number of documents in annotation summary)
Filter
boost sources or (if wanted exclusive sources) just filter out the unwanted ones
Secondary Scoring
Boost scores based on text similarity (Needleman-Wunsch Scorer)
Confidence can then be calculated based on cutoff scores and cutoff percentage.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants