feature: add document nodes create at and modified at attributes #135
TODO:
PR description
Adding createdAt and modifiedAt attributes to document nodes (attribute names chosen to match the Follow the Money schema). These attributes are read from the ES document metadata.
While reading the ES metadata, we look for the DC terms created/modified metadata first, since it is the one used in datashare-api to define Document.getCreationDate() (tika_metadata_dcterms_modified). If these metadata are not found, we fall back to other Tika metadata.
The fallback order for the creation date is:
tika_metadata_dcterms_created_iso8601
tika_metadata_creation_date_iso8601
tika_metadata_date_iso8601
For the modification date, it is:
tika_metadata_dcterms_modified_iso8601
tika_metadata_last_modified_iso8601
tika_metadata_modified_iso8601
tika_metadata_last_save_date_iso8601
tika_metadata_pdf_docinfo_modified_iso8601
tika_metadata_date_iso8601
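The two fallback orders above can be sketched as a simple "first key present wins" lookup. This is only an illustration and not the code from this PR: the helper name first_date and the constants CREATED_AT_KEYS and MODIFIED_AT_KEYS are hypothetical, and the sketch assumes the metadata is available as a flat mapping from key to ISO 8601 string.

```python
from datetime import datetime
from typing import Mapping, Optional, Sequence

# Fallback orders copied from the PR description, highest priority first.
CREATED_AT_KEYS = (
    "tika_metadata_dcterms_created_iso8601",
    "tika_metadata_creation_date_iso8601",
    "tika_metadata_date_iso8601",
)
MODIFIED_AT_KEYS = (
    "tika_metadata_dcterms_modified_iso8601",
    "tika_metadata_last_modified_iso8601",
    "tika_metadata_modified_iso8601",
    "tika_metadata_last_save_date_iso8601",
    "tika_metadata_pdf_docinfo_modified_iso8601",
    "tika_metadata_date_iso8601",
)


def first_date(
    metadata: Mapping[str, str], keys: Sequence[str]
) -> Optional[datetime]:
    """Return the date stored under the first key of `keys` found in `metadata`.

    Hypothetical helper: assumes values are ISO 8601 strings.
    """
    for key in keys:
        value = metadata.get(key)
        if value:
            # datetime.fromisoformat only accepts a trailing "Z" from
            # Python 3.11 on, so normalize it to an explicit UTC offset.
            return datetime.fromisoformat(value.replace("Z", "+00:00"))
    return None
```

With such a helper, createdAt would be first_date(metadata, CREATED_AT_KEYS) and modifiedAt would be first_date(metadata, MODIFIED_AT_KEYS); a document with none of the keys gets no date at all.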
Changes
neo4j-app
Added createdAt and modifiedAt attributes to the Document nodes
Document nodes can be looked up and sorted by createdAt and modifiedAt dates