Refactor rake tasks into harvester and indexer classes #132

thatbudakguy · 2023-01-11T00:26:56Z

This mostly moves logic out of rake tasks and into dedicated classes, which makes it easier to test. It also separates the harvesting portion from the indexing portion; this might make it easier to have the geoblacklight harvester inherit some of this behavior later. The denylist is one of the things the harvester class includes, which closes #119. I think the logic mentioned in #112 was already in place, so this closes #112 too.

It also adds logic to filter indexing based on your desired schema version (set SCHEMA_VERSION to Aardvark to index only aardvark records; the default is 1.0), which fixes #130.

I added a dependency on ruby-git to do the git pull/clone instead of just calling out to system(), which made things easier to test and might also allow for things to run on windows.

* Moves logic out of rake tasks and into tested classes * Adds logic to filter indexing based on desired schema version * Uses ruby-git to manipulate repositories

Refactor rake tasks into harvester and indexer classes

774d5ea

* Moves logic out of rake tasks and into tested classes * Adds logic to filter indexing based on desired schema version * Uses ruby-git to manipulate repositories

thatbudakguy marked this pull request as ready for review January 11, 2023 00:34

thatbudakguy closed this Jan 11, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor rake tasks into harvester and indexer classes #132

Refactor rake tasks into harvester and indexer classes #132

thatbudakguy commented Jan 11, 2023 •

edited

Loading

Refactor rake tasks into harvester and indexer classes #132

Refactor rake tasks into harvester and indexer classes #132

Conversation

thatbudakguy commented Jan 11, 2023 • edited Loading

thatbudakguy commented Jan 11, 2023 •

edited

Loading