Library Juice Academy

Online Professional Development for Librarians

Using OpenRefine for Library Metadata

Instructor: Margaret Heller

Dates: Not currently scheduled

Credits: 1.5 CEUs

Price: $175

OpenRefine is a free, open-source tool that makes editing messy metadata easier through clustering, faceting, advanced find-and-replace scripting, and linked data reconciliation in a spreadsheet-like environment. Beyond cleaning up metadata, OpenRefine’s linked data and URL-building tools can extend metadata with information drawn from databases and API calls.
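To make the idea of clustering concrete, here is a minimal Python sketch of key-collision clustering, loosely modelled on the "fingerprint" approach that OpenRefine documents. It is an illustration under assumptions, not OpenRefine's own code: the function names `fingerprint` and `cluster` and the sample values are hypothetical, and the exact normalization steps in OpenRefine may differ.

```python
import re
import unicodedata
from collections import defaultdict


def fingerprint(value: str) -> str:
    """Reduce a string to a normalized key so near-duplicates collide."""
    text = value.strip().lower()
    # Decompose accented characters, then drop the combining marks.
    text = unicodedata.normalize("NFKD", text)
    text = "".join(ch for ch in text if not unicodedata.combining(ch))
    # Remove punctuation (anything that is not a word character or whitespace).
    text = re.sub(r"[^\w\s]", "", text)
    # Split into tokens, remove duplicates, and sort so word order is ignored.
    tokens = sorted(set(text.split()))
    return " ".join(tokens)


def cluster(values):
    """Group values by fingerprint; keep groups with more than one distinct spelling."""
    groups = defaultdict(list)
    for value in values:
        groups[fingerprint(value)].append(value)
    return [group for group in groups.values() if len(set(group)) > 1]


# Hypothetical sample data: three spellings of one name, plus an unrelated name.
names = ["Twain, Mark", "Mark Twain", "twain mark ", "Austen, Jane"]
clusters = cluster(names)
```

Here the three "Twain" variants share a key and end up in one cluster, while "Austen, Jane" stands alone and is not reported. In OpenRefine itself, a reviewer would then choose which spelling to keep for each cluster.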

This course introduces OpenRefine at a beginner level, starting with installation. It then covers how to use the standard features effectively and moves on to more advanced features, such as reconciliation against Library of Congress Subject Headings linked data and creating an API call.

Learning outcomes:

  • Students will be able to install and run OpenRefine on their computers.
  • Students will be able to edit a spreadsheet file with simple errors.
  • Students will understand the role of clustering, faceting, and searching in editing data.
  • Students will be able to construct a simple API call to search against their data.
  • Students will be able to install a simple reconciliation extension service.
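As a rough illustration of the outcome on constructing a simple API call, the Python sketch below builds one search URL per metadata value, which is the general idea behind fetching data by URL from a column. The endpoint `https://api.example.org/search`, the parameter names `q` and `format`, and the function `build_search_url` are all hypothetical placeholders rather than any real service or OpenRefine feature. The code only builds the URLs and sends no requests.

```python
from urllib.parse import urlencode

# Hypothetical endpoint; substitute the documented URL of a real service.
BASE_URL = "https://api.example.org/search"


def build_search_url(term: str) -> str:
    """Return a search URL for one cell value, with the value safely encoded."""
    # urlencode escapes spaces and punctuation so the value is valid in a URL.
    query = urlencode({"q": term.strip(), "format": "json"})
    return f"{BASE_URL}?{query}"


# Hypothetical column of titles taken from a metadata spreadsheet.
titles = ["Moby Dick", "Pride & Prejudice"]
urls = [build_search_url(title) for title in titles]
```

Encoding the value matters because titles often contain spaces, ampersands, and other characters that would otherwise break the query string.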

Note: Students may want to use the following book as background material for the course. It is widely available in libraries: Ruben Verborgh and Max De Wilde, Using OpenRefine (Birmingham: Packt Publishing, 2013) http://www.worldcat.org/oclc/889271264

Margaret Heller

Margaret Heller is Digital Services Librarian at Loyola University Chicago in Chicago, IL, where she manages the website and institutional repository. She also serves on the board of the Read/Write Library Chicago. She has presented and written on a wide variety of technical issues and scholarly communication in libraries. She is a regular contributor to the ACRL Tech Connect blog, on which she wrote a heavily used post on the basics of OpenRefine.
Margaret's Website.

Course Structure

This is an online class taught asynchronously, meaning that participants do the work on their own time as their schedules allow. The class does not meet at any particular time, although the instructor may set up optional synchronous chat sessions. Instruction includes readings and assignments in one-week segments. Class participation takes place in an online forum environment.

Payment Info

Please contact us to arrange a special session of this class for a group of seven or more, with a negotiable discount, or to be notified when it is next scheduled.