Historically, Hive was regarded as a good abstraction over MapReduce and a solid choice for batch-mode data extraction, but not as a good option for low-latency queries. That is changing as you read this book. Starting with Hive 0.13, you can run Hive on Apache Tez, which is faster and more efficient than traditional MapReduce. This allows business users to explore and interact with data in HDInsight using BI tools such as Excel.
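As a rough illustration of switching Hive from MapReduce to Tez, the sketch below prepends the session setting `hive.execution.engine` to a query before it is submitted. The helper name `with_engine` and the table name `weblogs` are hypothetical and used only for this example; how the resulting script is submitted (Hive CLI, ODBC, or another client) is left open.

```python
# Minimal sketch: build a HiveQL script that selects the execution engine.
# `with_engine` and the `weblogs` table are illustrative assumptions,
# not part of Hive or HDInsight.

# Engines discussed in the text: classic MapReduce ("mr") and Tez ("tez").
VALID_ENGINES = {"mr", "tez"}


def with_engine(query: str, engine: str = "tez") -> str:
    """Return a HiveQL script that sets the engine, then runs the query."""
    if engine not in VALID_ENGINES:
        raise ValueError(f"unsupported engine: {engine!r}")
    # Normalize the query so the script ends with exactly one semicolon.
    body = query.strip().rstrip(";")
    return f"SET hive.execution.engine={engine};\n{body};"


if __name__ == "__main__":
    print(with_engine("SELECT COUNT(*) FROM weblogs"))
```

Setting the engine per session this way makes it easy to run the same query under both engines and compare their latency.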
HDInsight Essentials - Second Edition