In this chapter, we will go deeper into OpenRefine and review most of its basic functionalities intended for data fixing and analysis. We will cover the following topics, spread over six recipes:
Recipe 1 – sorting data
Recipe 2 – faceting data
Recipe 3 – detecting duplicates
Recipe 4 – applying a text filter
Recipe 5 – using simple cell transformations
Recipe 6 – removing matching rows
Even more so than in Chapter 1, Diving Into OpenRefine, the recipes are designed to allow readers to jump from one recipe to another in any way you like, depending on your needs and interests. Flowing reading of the chapter is also possible of course, but not mandatory at all.
Be warned that recipes are unequal in length; some are quite short and to the point, but others could not be constricted to one or two pages. Recipe 2 – faceting data, for instance, which covers the broad topic of faceting, runs over many pages and is divided into subrecipes.