Skip to content

Documentation For Users

Tom Morris edited this page Dec 22, 2012 · 22 revisions

Documentation hub for users

Where to start?

Love it so much?

Feature Areas

Essential: These are the features you'd use 80% of the time when you use Refine. They help you clean up your data, extend it, and export it out for other tools to consume.

Reconciliation: You can use the "reconciliation" support in Refine to link text names in your data to database identifiers (also known as database keys, IDs). By connecting your data with other databases, you get more value out of your data.

Uploading to Freebase: If you reconcile against Freebase, you can then align your data with Freebase's schemas using Schema Alignment and load your data into Freebase. For more information on Freebase data loading, see the Refinery project.

Extensions: Add even more features to Refine by installing extensions.

Recipes

For complicated tasks that are commonly done, just follow recipes that other folks have discovered and written up. A recipe is just a sequence of steps, and tips and tricks.

There are Recipes (like removing duplicate row values, etc.)

and there are full recipes:

  • Geocoding - translating street addresses to lat/lng coordinates
  • [FetchingURLsFromWebServices Fetching URLs From Web Services] - grabbing from the Web more data related to the data you already have
  • [StrippingHTML Web Scraping & Extracting items from HTML pages]
  • Recipe Apache Logs

Reference

Expressions

  • Variables
  • OpenRefine Expression Language (GREL)
  • [GRELControls Controls]
  • [GRELFunctions Functions]
  • [GRELBooleanFunctions Boolean functions]
  • [GRELStringFunctions String functions]
  • [GRELArrayFunctions Array functions]
  • [GRELMathFunctions Math functions]
  • [GRELDateFunctions Date functions]
  • [GRELOtherFunctions Other functions]
  • Jython

External Sites

While there's a lot of information in this wiki, there's plenty more online too. Here are some notable sites,

And of course [https://www.google.com/search?q=%22google+refine%22 searching online].

Clone this wiki locally