Book Image

The Statistics and Calculus with Python Workshop

By : Peter Farrell, Alvaro Fuentes, Ajinkya Sudhir Kolhe, Quan Nguyen, Alexander Joseph Sarver, Marios Tsatsos
5 (1)
Book Image

The Statistics and Calculus with Python Workshop

5 (1)
By: Peter Farrell, Alvaro Fuentes, Ajinkya Sudhir Kolhe, Quan Nguyen, Alexander Joseph Sarver, Marios Tsatsos

Overview of this book

Are you looking to start developing artificial intelligence applications? Do you need a refresher on key mathematical concepts? Full of engaging practical exercises, The Statistics and Calculus with Python Workshop will show you how to apply your understanding of advanced mathematics in the context of Python. The book begins by giving you a high-level overview of the libraries you'll use while performing statistics with Python. As you progress, you'll perform various mathematical tasks using the Python programming language, such as solving algebraic functions with Python starting with basic functions, and then working through transformations and solving equations. Later chapters in the book will cover statistics and calculus concepts and how to use them to solve problems and gain useful insights. Finally, you'll study differential equations with an emphasis on numerical methods and learn about algorithms that directly calculate values of functions. By the end of this book, you’ll have learned how to apply essential statistics and calculus concepts to develop robust Python applications that solve business challenges.
Table of Contents (14 chapters)
Preface

2. Python's Main Tools for Statistics

Activity 2.01: Analyzing the Communities and Crime Dataset

Solution:

  1. Once the dataset has been downloaded, the libraries can be imported, and pandas can be used to read in the dataset in a new Jupyter notebook, as follows:
    import pandas as pd
    import numpy as np
    import matplotlib.pyplot as plt
    df = pd.read_csv('CommViolPredUnnormalizedData.txt')
    df.head()

    We are also printing out the first five rows of the dataset, which should be as follows:

    Figure 2.21: The first five rows of the dataset

  2. To print out the column names, we can simply iterate through df.columns in a for loop, like so:
    for column in df.columns:
        print(column)
  3. The total number of columns in the dataset can be computed using the len() function in Python:
    print(len(df.columns))
  4. To replace the special character '?' with np.nan objects, we can use the replace() method:
    df = df.replace('?', np.nan)
  5. To print out...