Book Image

Become a Python Data Analyst

By : Alvaro Fuentes
Book Image

Become a Python Data Analyst

By: Alvaro Fuentes

Overview of this book

Python is one of the most common and popular languages preferred by leading data analysts and statisticians for working with massive datasets and complex data visualizations. Become a Python Data Analyst introduces Python’s most essential tools and libraries necessary to work with the data analysis process, right from preparing data to performing simple statistical analyses and creating meaningful data visualizations. In this book, we will cover Python libraries such as NumPy, pandas, matplotlib, seaborn, SciPy, and scikit-learn, and apply them in practical data analysis and statistics examples. As you make your way through the chapters, you will learn to efficiently use the Jupyter Notebook to operate and manipulate data using NumPy and the pandas library. In the concluding chapters, you will gain experience in building simple predictive models and carrying out statistical computation and analysis using rich Python tools and proven data analysis techniques. By the end of this book, you will have hands-on experience performing data analysis with Python.
Table of Contents (8 chapters)

Introduction to SciPy

SciPy is a tool for doing scientific computing in Python. It is a Python-based ecosystem that is an open source software for math, science, and engineering. It contains various toolboxes dedicated to common issues in scientific computing. So, if you work in any scientific or engineering field, you will likely find the tools you need for doing scientific computing within the subpackages of SciPy. The following are the subpackages of SciPy:

  • scipy.io: This package provides a tool for dealing with file input/output
  • scipy.special: This package provides a tool for dealing with special functions
  • scipy.linalg: This package provides a tool for dealing with linear algebra operations
  • scipy.fftpack: This package provides a tool for dealing with fast Fourier transforms
  • scipy.stats: This package provides a tool for dealing with statistics and random numbers
  • scipy.interpolate...