Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature/peace corpora #1313

Merged
merged 42 commits into from
Nov 16, 2023
Merged

Feature/peace corpora #1313

merged 42 commits into from
Nov 16, 2023

Conversation

BeritJanssen
Copy link
Contributor

@BeritJanssen BeritJanssen commented Nov 15, 2023

Close #1288: Integrates the Peace Portal corpora. Also closes #1106 .

I decided to move the NLTK_DATA variable to the Django settings so it can be overridden more easily for tests (and no shifting around of existing data is necessary). Should this solution overlook some problems leading to the decision to place them in the es_settings module: the corresponding commit can be reverted.

backend/addcorpus/corpus.py Outdated Show resolved Hide resolved
backend/addcorpus/corpus.py Outdated Show resolved Hide resolved
backend/addcorpus/corpus.py Show resolved Hide resolved
backend/addcorpus/es_mappings.py Outdated Show resolved Hide resolved
backend/addcorpus/es_settings.py Outdated Show resolved Hide resolved
backend/corpora/peaceportal/FIJI/fiji.py Outdated Show resolved Hide resolved
backend/corpora/peaceportal/FIJI/fiji.py Outdated Show resolved Hide resolved
backend/corpora/peaceportal/iis.py Outdated Show resolved Hide resolved
backend/corpora/peaceportal/FIJI/fiji.py Show resolved Hide resolved
backend/corpora/peaceportal/peaceportal.py Show resolved Hide resolved
@lukavdplas
Copy link
Contributor

A general comment: I'd recommend adding a test like this one for the new corpus definitions:

https://github.com/UUDigitalHumanitieslab/I-analyzer/blob/82099d01c70989499fa48673b10f8097037cdf1c/backend/corpora/rechtspraak/tests/test_rechtspraak.py#L25-L29

@BeritJanssen BeritJanssen merged commit de7ab44 into develop Nov 16, 2023
1 check passed
@BeritJanssen BeritJanssen deleted the feature/peace-corpora branch November 16, 2023 14:02
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Integrate PeacePortal corpora in main repository Support analysers for multilingual corpora
2 participants