You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Traditionally, the ID and version fields for documents in Sinequa scraper generation within COSMOS worked based of the URL field. Going forward, we want to track versions of the documents in a more practical manner, given that re-indexation now needs to be set in place. We now want to maintain the versions of documents based on the scraped full text itself and this mapping needs to be changed during the config generation phase within COSMOS.
Implementation Considerations
Deliverable
Transition the ID and version mapping during config generation within COSMOS to use full text instead of URLs.
CarsonDavis
changed the title
Change config generation in COSMOS to have full text based version and ID fields
Update COSMOS Config Gen for Version and ID
Oct 2, 2024
Description
Traditionally, the ID and version fields for documents in Sinequa scraper generation within COSMOS worked based of the URL field. Going forward, we want to track versions of the documents in a more practical manner, given that re-indexation now needs to be set in place. We now want to maintain the versions of documents based on the scraped full text itself and this mapping needs to be changed during the config generation phase within COSMOS.
Implementation Considerations
Deliverable
Transition the ID and version mapping during config generation within COSMOS to use full text instead of URLs.
Dependencies
depends on https://github.com/NASA-IMPACT/sde-backend/issues/744
The text was updated successfully, but these errors were encountered: