About this Event
When datasets and spreadsheets are created by multiple people or without clear guidelines, the end result often lacks standardization. New York City might be listed as “NYC” in one instance, “New York, NY” in another, and “New York City” somewhere else. Inconsistencies like these make the data cumbersome to analyze.
During this workshop, learn how to quickly and efficiently find such standardization issues and other errors, and how to correct them in bulk. Using OpenRefine, an open-source desktop application for data cleanup and transformation, you will also learn how to enhance existing datasets by adding new metadata fields.
You will discover how to import and export data with OpenRefine, how to develop effective workflows for cleaning multiple datasets, and how to use the application to explore datasets as a precursor to analytical work.
Librarian Paige Morgan, head of digital scholarship and publishing, will lead the workshop.
The workshop is part of the Software and Data Carpentries curriculum.
This workshop is open to the public. Registration is required as seating is limited.
Participants must bring their own laptops to this workshop. We encourage you to install the OpenRefine software in advance, or to arrive 30-45 minutes before the scheduled workshop time to install it with the assistance of the instructor.