-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Datasets ? #2
Comments
Hi. The indexes have a few tens of GB, so we can send them on demand to people that are interested. Please contact us (e-mails in the paper) to see how can we do this transfer. |
Hi, Thanks for your answer. I still have the following questions in mind : |
AQUAINT, MSNBC and ACE04 datasets can be obtained from http://webdocs.cs.ualberta.ca/~denilson/data/deos14_ualberta_experiments.tgz . The rest is handled during evaluation by the method in eval/datasets/AQUAINT_MSNBC_ACE04.scala . The AIDA datasets are not public, one needs to get the license for them. Using this license, a text file with entity annotations is generated and this can be used with PBOH as shown here : eval/datasets/AIDA.scala . Just the annotations, without the full documents, can be obtained from here : https://www.mpi-inf.mpg.de/departments/databases-and-information-systems/research/yago-naga/aida/downloads/ |
Good afternoon,
And thanks again for making available the source code of your project.
Do you have an idea about when the corresponding datasets will be available ?
Thanks in advance,
Sammy Khalife
The text was updated successfully, but these errors were encountered: