Apache Hive Essentials

By: Dayong Du

Chapter 6. Data Aggregation and Sampling

This chapter explains how to aggregate and sample data in Hive. It first covers several aggregation functions, analytic functions that work with GROUP BY and PARTITION BY, and windowing clauses. It then introduces the different ways of sampling data in Hive.

In this chapter, we will cover the following topics:

  • Basic aggregation

  • Advanced aggregation

  • Aggregation condition

  • Analytic functions

  • Sampling