Book Image

Learning pandas - Second Edition

By : Michael Heydt
Book Image

Learning pandas - Second Edition

By: Michael Heydt

Overview of this book

You will learn how to use pandas to perform data analysis in Python. You will start with an overview of data analysis and iteratively progress from modeling data, to accessing data from remote sources, performing numeric and statistical analysis, through indexing and performing aggregate analysis, and finally to visualizing statistical data and applying pandas to finance. With the knowledge you gain from this book, you will quickly learn pandas and how it can empower you in the exciting world of data manipulation, analysis and science.
Table of Contents (16 chapters)

Reading and writing HDF5 format files

HDF5 is a data model, library, and file format to store and manage data. It is commonly used in scientific computing environments. It supports an unlimited variety of data types, and is designed for flexible and efficient I/O, and for high volume and complex data.

HDF5 is portable and extensible, allowing applications to evolve in their use of HDF5. The HDF5 Technology Suite includes tools and applications to manage, manipulate, view, and analyze data in the HDF5 format. HDF5 is:

  • A versatile data model that can represent very complex data objects and a wide variety of metadata
  • A completely portable file format with no limit on the number or size of data objects in the collection
  • A software library that runs on a range of computational platforms, from laptops to massively parallel systems, and implements a high-level API with C, C++, Fortran...