Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Reddit search overly senstive to query parameters #166

Open
yewsiang opened this issue Jun 20, 2022 · 2 comments
Open

Reddit search overly senstive to query parameters #166

yewsiang opened this issue Jun 20, 2022 · 2 comments
Assignees

Comments

@yewsiang
Copy link
Collaborator

When clicking on the link suggested within this Reddit post, Crowdwise does not suggest the original Reddit link while on the PubMed page. This seems to be due to "?dopt=Abstract" being removed.

To investigate further.

@frizensami
Copy link
Collaborator

Reproducible on my end.

Separately, we'd expect the reddit post to be at least the first most relevant result. The reddit post is recommended under the title search but is not the first result under the default sort (highest comment count).

We may want to consider the default sort to be by "most relevant"

image image

@frizensami
Copy link
Collaborator

frizensami commented Jun 21, 2022

This is a tough problem to solve. Right now, we remove tracking parameters (also the fragment part of URLs) to normalize the URL (e.g., no gclid, etc), because those cases are clear.

However, each website will have its own unique usage of URL parameters (like ?dOpt=abstract being unnecessary).

Some ways out:

  • Include exact title matches (or almost close to) as exact match as well
  • and/or: have a "special cases" code section for all of these situations. This might get out of hand though, and each check will slow us down ever so slightly.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants