Grouping and aggregating with multiple columns and functions
It is possible to group and aggregate with multiple columns. The syntax is slightly different than it is for grouping and aggregating with a single column. As usual with any kind of grouping operation, it helps to identify the three components: the grouping columns, aggregating columns, and aggregating functions.
In this recipe, we showcase the flexibility of the .groupby
method by answering the following queries:
- Finding the number of canceled flights for every airline per weekday
- Finding the number and percentage of canceled and diverted flights for every airline per weekday
- For each origin and destination, finding the total number of flights, the number and percentage of canceled flights, and the average and variance of the airtime
How to do it…
- Read in the flights dataset, and answer the first query by defining the grouping columns (
AIRLINE
,WEEKDAY
), the aggregating...