In this chapter, we reviewed how important data preparation is for quickly blending and organizing data. The data preparation tools covered in this chapter are used in everyday analytics to develop efficient workflows. The Data Cleansing
tool is a one-stop shop for cleaning data and handling the nulls that are bound to exist in input files. Once the data has been prepared and cleansed, the next step is to decide which data to limit by using the Filter
tool. This can be used to limit data to what the customer needs or to optimize the workflow downstream. We then covered how to join multiple inputs and combine data using the Join, Join Multiple, and Union tools. The Join and Union tools are a powerful combination for creating inner and outer joins for analysis. The tools covered in this chapter give you the fundamentals to develop optimal workflows and support faster, more accurate business decisions.
In the upcoming chapter, you will learn...