Skip to content
Ethan Gruber edited this page Jun 5, 2017 · 7 revisions

The Ingest web service is located at /harvester/admin/ingest.

The service iterates through one or more OAI-PMH sets (including paginations via resumptionToken) to generate RDF/XML which conforms to the DPLA data model. If the ARK is provided, all triples associated with the ARK (with the dcterms:isPartOf predicate) will be purged. If the ARK is not provided, all triples associated with the OAI-PMH set will be purged (with the dcterms:relation predicate). After the SPARQL/Update is successfully posted, the RDF/XML will be posted to the triplestore to complete the ingestion process.

It accepts three request parameters:

  1. sets : required URL-encoded OAI-PMH set URL(s), separated by '|'
  2. repository : required must match the @mainagencycode in the finding aid
  3. target : required the target designates the target platform(s) for eventual aggregation. Accepted values are 'primo', 'dpla', or 'aw'. Any other value (or null) will designate CHOs for aggregation in both Primo and DPLA. The target is captured as a literal in doap:audience in the ore:Aggregation.
  4. rights : optional Accepts a key for a standardized rights statement that is converted into a Rightsstatements.org or Creative Commons URI upon publication.
  5. rightsText : optional A free-text rights statement may be encoded and passed with this parameter
  6. type : optional A DCMI Type (just the string) that is converted to a full URI on publication. E.g., InteractiveResource.
  7. format : optional A mime-type, e.g., image/jpeg
  8. genre : optional Getty AAT URI for a genre, mapped to all CHOs as edm:hasType
  9. language : optional three-letter language code
  10. ark : optional Must conform to the format, 'ark:/80444/xv43658', for example. If the ARK is not provided, then all CHOs which contain any ARK reference will be aggregated.

More Information

This workflow is controlled by xpl/controllers/ingest.xpl. See also Data Model

Clone this wiki locally