Apache Hive Essentials - Second Edition

By: Dayong Du

Overview of this book

In this book, we prepare you for your journey into big data by first introducing you to the background of the big data domain, along with the process of setting up and getting familiar with your Hive working environment. Next, the book guides you through discovering and transforming the value of big data with the help of examples. It also hones your skills in using the Hive language efficiently. Toward the end, the book focuses on advanced topics, such as performance, security, and extensions in Hive, which will guide you on exciting adventures on this worthwhile big data journey. By the end of the book, you will be familiar with Hive and able to work efficiently to find solutions to big data problems.

Preface

Who this book is for

What this book covers

To get the most out of this book

Get in touch

Overview of Big Data and Hive

A short history

Introducing big data

The relational and NoSQL databases versus Hadoop

Batch, real-time, and stream processing

Overview of the Hadoop ecosystem

Hive overview

Summary

Setting Up the Hive Environment

Installing Hive from Apache

Installing Hive from vendors

Using Hive in the cloud

Using the Hive command

Using the Hive IDE

Summary

Data Definition and Description

Understanding data types

Data type conversions

Data Definition Language

Database

Tables

Partitions

Buckets

Views

Summary

Data Correlation and Scope

Project data with SELECT

Filtering data with conditions

Linking data with JOIN

Combining data with UNION

Summary

Data Manipulation

Data exchanging with LOAD

Data exchange with INSERT

Data exchange with [EX|IM]PORT

Data sorting

Functions

Transactions and locks

Summary

Data Aggregation and Sampling

Basic aggregation

Enhanced aggregation

Aggregation condition

Window functions

Sampling

Summary

Performance Considerations

Performance utilities

Design optimization

Data optimization

Job optimization

Summary

Extensibility Considerations

User-defined functions

HPL/SQL

Streaming

SerDe

Summary

Security Considerations

Authentication

Authorization

Mask and encryption

Summary

Working with Other Tools

The JDBC/ODBC connector

NoSQL

The Hue/Ambari Hive view

HCatalog

Oozie

Spark

Hivemall

Summary

Other Books You May Enjoy

Leave a review - let other readers know what you think

Database

A database in Hive describes a collection of tables that are used for a similar purpose or belong to the same group. If no database is specified, the default database is used, with /user/hive/warehouse in HDFS as its root directory. This path is configurable through the hive.metastore.warehouse.dir property in hive-site.xml. Whenever a new database is created, Hive creates a new directory for it under /user/hive/warehouse. For example, the myhivebook database is located at /user/hive/warehouse/myhivebook.db. In addition, DATABASE has a name alias, SCHEMA, meaning they are the same thing in HQL. The following is the major DDL for database operations:

Create the database/schema if it doesn't exist:

      > CREATE DATABASE myhivebook;
      > CREATE SCHEMA IF NOT EXISTS myhivebook;

Create the database with the location, comments, and metadata...
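As a rough sketch, the directory layout described above could be checked from a Hive session like this. It assumes the myhivebook database from the example and a default warehouse configuration; the exact output depends on the cluster, so none is shown here.

```sql
-- Print the configured warehouse root for this session.
SET hive.metastore.warehouse.dir;

-- Create the database only if it is missing, then list all databases.
CREATE DATABASE IF NOT EXISTS myhivebook;
SHOW DATABASES;

-- Show the database's metadata, including its HDFS location.
DESCRIBE DATABASE myhivebook;

-- Switch to the database so unqualified table names resolve there.
USE myhivebook;
```

Because SCHEMA is an alias for DATABASE, the same statements should also work with SCHEMA in place of DATABASE.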
