Learning pandas - Second Edition

By: Michael Heydt

Overview of this book

You will learn how to use pandas to perform data analysis in Python. You will start with an overview of data analysis and iteratively progress from modeling data, to accessing data from remote sources, performing numeric and statistical analysis, through indexing and performing aggregate analysis, and finally to visualizing statistical data and applying pandas to finance. With the knowledge you gain from this book, you will quickly learn pandas and how it can empower you in the exciting world of data manipulation, analysis and science.
Table of Contents (16 chapters)

Plotting basics with pandas

The pandas library itself does not perform data visualization. Instead, pandas integrates tightly with other robust visualization libraries that are part of the Python ecosystem. The most common of these integrations is with matplotlib. This chapter will, therefore, focus its examples on matplotlib, but we will also point you to other possible libraries to try on your own. Two of these are worth mentioning.
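As a minimal sketch of what this integration looks like in practice, the following calls the `.plot()` method on a pandas Series, which hands the actual drawing to matplotlib and returns a matplotlib Axes object. The random-walk data, the seed, and the variable names are illustrative assumptions and are not taken from the book; the non-interactive `Agg` backend is chosen only so that the sketch runs without a display.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend (assumption: no display available)
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd

# Illustrative data (not from the book): a seeded random walk of 100 steps
rng = np.random.default_rng(0)
walk = pd.Series(rng.standard_normal(100).cumsum(), name="random walk")

# Series.plot() delegates rendering to matplotlib and returns a matplotlib Axes
ax = walk.plot(title="Random walk")

# Because the result is an ordinary Axes, the usual matplotlib calls apply
ax.set_xlabel("step")
ax.set_ylabel("value")
```

In an interactive session you would typically follow this with `plt.show()`, or with `plt.savefig(...)` to write the figure to a file.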

Seaborn is another Python visualization library which is also based on matplotlib. It provides a high-level interface for rendering attractive statistical graphics. It has native support for NumPy and pandas data structures. The goal of Seaborn is to create matplotlib graphs that look a lot less scientific in nature. To learn about Seaborn, please visit the site at http://seaborn.pydata.org/index.html.

While both Seaborn and matplotlib...