5 (1)

5 (1)

#### Overview of this book

Are you looking to start developing artificial intelligence applications? Do you need a refresher on key mathematical concepts? Full of engaging practical exercises, The Statistics and Calculus with Python Workshop will show you how to apply your understanding of advanced mathematics in the context of Python. The book begins by giving you a high-level overview of the libraries you'll use while performing statistics with Python. As you progress, you'll perform various mathematical tasks using the Python programming language, such as solving algebraic functions with Python starting with basic functions, and then working through transformations and solving equations. Later chapters in the book will cover statistics and calculus concepts and how to use them to solve problems and gain useful insights. Finally, you'll study differential equations with an emphasis on numerical methods and learn about algorithms that directly calculate values of functions. By the end of this book, you’ll have learned how to apply essential statistics and calculus concepts to develop robust Python applications that solve business challenges.
Preface
1. Fundamentals of Python
Free Chapter
2. Python's Main Tools for Statistics
3. Python's Statistical Toolbox
4. Functions and Algebra with Python
5. More Mathematics with Python
6. Matrices and Markov Chains with Python
7. Doing Basic Statistics with Python
8. Foundational Probability Concepts and Their Applications
9. Intermediate Statistics with Python
10. Foundational Calculus with Python
11. More Calculus with Python
12. Intermediate Calculus with Python

# 2. Python's Main Tools for Statistics

## Activity 2.01: Analyzing the Communities and Crime Dataset

Solution:

1. Once the dataset has been downloaded, the libraries can be imported, and pandas can be used to read in the dataset in a new Jupyter notebook, as follows:
```import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
2. To print out the column names, we can simply iterate through `df.columns` in a `for` loop, like so:
```for column in df.columns:
3. The total number of columns in the dataset can be computed using the `len()` function in Python:
`print(len(df.columns))`
4. To replace the special character `'?'` with `np.nan` objects, we can use the `replace()` method:
`df = df.replace('?', np.nan)`