In the previous chapters, we used various data mining algorithms to find the data that we were looking for. With Microsoft Excel becoming more and more powerful in terms of performing data analysis and having connectivity capabilities to many data sources, an addition of the data mining capabilities is only a logical enhancement to its capabilities.
The state of data today is aptly described by volume, variety, and velocity. Volume describes the quantity of data generated, variety describes the type of data that is generated from different sources, and velocity is the rate at which the data gets generated. The most common examples are social media platforms such as Facebook, Twitter, and so on, where we have posts and tweets from millions of people on a daily basis. These comprise of photos, audio files, video files, texts, and so on. These posts and tweets take some several terabytes of space in the data centers of Facebook and Twitter. Traditional...