-
Book Overview & Buying
-
Table Of Contents
Pig Design Patterns
By :
The data normalization design pattern discusses ways to perform normalization or standardization of data values.
Normalization of data implies fitting, adjusting, or scaling the data values measured on different scales to a notionally common range. As a simplistic example, joining datasets that consist of different units for distance measurement, such as kilometers and miles, can provide varying results when they are not normalized. Hence, they are normalized to bring them back to one common unit, such as kilometers or miles, so that the effect of different units of measurement is not felt by the analytics.
In Big Data, it is common to encounter varying values in the same data attribute while integrating multiple data sources.
Normalization performs data pre-processing and transformation by changing the original data and scaling it to bring it within a specified range (for example, in the range 0 to 1), and assigning equal weight for all attributes...
Change the font size
Change margin width
Change background colour