This chapter explains how to aggregate and sample data in Hive. It first covers several aggregation functions, analytic functions that work with GROUP BY and PARTITION BY, and windowing clauses. It then introduces different ways of sampling data in Hive.
In this chapter, we will cover the following topics:
Basic aggregation
Advanced aggregation
Aggregation condition
Analytic functions
Sampling