At the outset, we will briefly explain the concept of a histogram. Most of you might already know this from your lessons on basic statistics. However, we will reiterate this for the sake of completeness. Histogram is a form of data representation technique that relies on an aggregation of data points. The data is aggregated into a set of predefined bins that are represented along the x axis. The number of data points that fall within each of the bins makes up the corresponding counts on the y axis. For example, let's assume that our data looks something like the following:
D={2,7,1,5,6,9,14,11,8,10,13}
If we define three bins, namely Bin_1 (1 - 5)
, Bin_2 (6 - 10)
, and Bin_3 (11 - 15)
, then the histogram corresponding to our data would look something like this:
Bins |
Frequency |
|
3 |
|
5 |
|
3 |
What this histogram data tells us is that we have three values between 1 and 5, five between 6 and 10, and three again between...