Book Image

Learning pandas - Second Edition

By : Michael Heydt
Book Image

Learning pandas - Second Edition

By: Michael Heydt

Overview of this book

You will learn how to use pandas to perform data analysis in Python. You will start with an overview of data analysis and iteratively progress from modeling data, to accessing data from remote sources, performing numeric and statistical analysis, through indexing and performing aggregate analysis, and finally to visualizing statistical data and applying pandas to finance. With the knowledge you gain from this book, you will quickly learn pandas and how it can empower you in the exciting world of data manipulation, analysis and science.
Table of Contents (16 chapters)

Performing statistical processes on pandas objects

Descriptive statistics gives us the ability to understand numerous measures of data that describe a specific characteristic of the underlying data. Built into pandas are several classes of these descriptive statistical operations that can be applied to a Series or DataFrame.

Let's examine several facets of statistical analysis / techniques provided by pandas:

  • Summary descriptive statistics
  • Measuring central tendency: mean, median, and mode
  • Variance and standard deviation

Retrieving summary descriptive statistics

pandas objects provide the .describe() method, which returns a set of summary statistics of the object's data. When applied to a DataFrame, .describe(...