Skip to content

Latest commit

 

History

History
36 lines (31 loc) · 1.83 KB

5-reconciliation.md

File metadata and controls

36 lines (31 loc) · 1.83 KB
layout title nav
default
5-Reconciliation
true

Enhancing with Data from Other Sources

  • Vocabulary reconciliation is a process where automated systems use terms from unstandardized metadata to search controlled vocabularies and return URIs.
  • OpenRefine has built in tools to reconcile data with Wikidata
  • Other data services can be added

OpenRefine's Wikidata Service

  • Reconciling the university names

    • Reconcile > Start reconciling...
    • choose Wikidata Reconciliation for OpenRefine (en)
    • choose universities and Auto-match candidates with high confidence
    • matches some automatically, but often requires some manual review
  • Extract the Wikidata id

    • Edit column > Add column based on this column...
    • name column: wikidata_id
    • cell.recon.match.id

Adding more data based on extracted dataset

  • OpenRefine 2.8 added querying and extracting tools
    • Select matched from the judgement facet
    • Edit column > Add columns from reconciled values
    • Add located at street address, student count, country, official website, inception, mascot, Carnegie Classification of Institutions of Higher Education (etc.)
  • Geographic Coordinates for places
    • place of birth > Edit column > Add columns from reconciled values
    • Add coordinate location

Other data services