-
-
Notifications
You must be signed in to change notification settings - Fork 46
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Explain when (not) to use OpenRefine #103
Comments
I think we should look at the Library Carpentry lesson on OpenRefine for clearer use cases in the introduction episode: splitting data elements into different columns, normalising date formats and maybe matching/enhancing. This would go instead of the Motivations section, which is currently written for potential instructors (I feel). Let's replace the Features and Getting help sections with How is OR different from spreadsheet applications? and When would you write a script instead of using OR?. Spreadsheets
Scripts
|
From #37:
|
Perhaps it's also useful to distinguish OR from using SQL with a relational database. SQL also allows selection of rows and creating derivative columns. The |
Remember to remove the mention of this issue in the Instructor note in the Introduction section when this issue is being resolved. See #183. |
I have taught the OpenRefine lesson a few times; most recently today. Even though I always try to explain when you could choose OpenRefine for a problem, and how to compare OpenRefine to spreadsheets and writing a script, students keep asking for more explanation and comparisons.
In our workshop the OpenRefine lesson is between Data organisation in spreadsheets and Introduction to R and that is also how I tried to frame OpenRefine: it shows your data like a spreadsheet application, but it has powers like a programming environment.
Seeing how I keep struggling to explain it well, even with years of experience with OR, we should probably improve the lesson materials.
It was suggested by helpers that referring back to my situating OR between spreadsheets and programming in the introduction later in the lesson might help, but the introduction episode should provide more context first.
The text was updated successfully, but these errors were encountered: