Exploring the data
We will use the following Python modules in the next example:
- pandas: Python package for data analysis and data manipulation.
- NumPy: Python library that adds support for large, multi-dimensional arrays and matrices, along with a large collection of high-level mathematical functions that operate on these arrays.
- Statsmodels: Python package that provides a complement to SciPy for statistical computations, including descriptive statistics and estimation and inference for statistical models. It provides classes and functions for the estimation of many different statistical models.
- Seaborn and Matplotlib: Python packages for effective data visualization.
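Before starting, it can help to confirm that these packages are installed in the current environment. The following is a minimal sketch using only the standard library; the helper name `missing_packages` and the `REQUIRED` list are illustrative and not part of any of the packages above.

```python
from importlib.util import find_spec

# Import names of the packages listed above (the names used in `import` statements).
REQUIRED = ["pandas", "numpy", "statsmodels", "seaborn", "matplotlib"]


def missing_packages(names):
    """Return the names that cannot be found in the current environment."""
    # find_spec returns None when a top-level module is not installed.
    return [name for name in names if find_spec(name) is None]


missing = missing_packages(REQUIRED)
if missing:
    print("Missing packages:", ", ".join(missing))
else:
    print("All required packages are available.")
```

Once everything is available, the packages are conventionally imported under the aliases `pd` (pandas), `np` (NumPy), `sm` (`statsmodels.api`), `sns` (Seaborn), and `plt` (`matplotlib.pyplot`).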
We’ll get started using the following steps: