Book Image

HDInsight Essentials - Second Edition

By : Rajesh Nadipalli
Book Image

HDInsight Essentials - Second Edition

By: Rajesh Nadipalli

Overview of this book

Table of Contents (16 chapters)
HDInsight Essentials Second Edition
Credits
About the Author
About the Reviewers
www.PacktPub.com
Preface
Index

Analysis using Excel and Microsoft Hive ODBC driver


Excel is the most popular data analysis tool used by business analysts and now HDInsight makes it easy to integrate Excel with Hadoop using Hive. In this section, we will see how to use Excel against the data that is in our Data Lake using Hive.

Prerequisites

The prerequisites required are listed as follows:

  • Office 2013 Professional Plus, Office 365 Pro Plus, Excel 2013 Standalone, or Office 2010 Professional plus

  • Operating systems that are supported are Windows 7, Windows 8, Windows Server 2008 R2, or Windows Server 2012

The following are the steps to get your data into Excel and analyze it.

Step 1 – installing the Microsoft Hive ODBC driver

The first step is to download the Hive ODBC driver and set it up. Download the Hive ODBC driver from Microsoft Download Center based on your office version (2013 or 2010); the link for 2013 is http://www.microsoft.com/en-us/download/details.aspx?id=40886.

Once you download the driver MSI file to your local...