Excel is the most popular data analysis tool used by business analysts and now HDInsight makes it easy to integrate Excel with Hadoop using Hive. In this section, we will see how to use Excel against the data that is in our Data Lake using Hive.
The prerequisites required are listed as follows:
Office 2013 Professional Plus, Office 365 Pro Plus, Excel 2013 Standalone, or Office 2010 Professional plus
Operating systems that are supported are Windows 7, Windows 8, Windows Server 2008 R2, or Windows Server 2012
The following are the steps to get your data into Excel and analyze it.
The first step is to download the Hive ODBC driver and set it up. Download the Hive ODBC driver from Microsoft Download Center based on your office version (2013 or 2010); the link for 2013 is http://www.microsoft.com/en-us/download/details.aspx?id=40886.
Once you download the driver MSI file to your local...