
Bayesian Analysis with Python - Third Edition

By: Osvaldo Martin

Overview of this book

The third edition of Bayesian Analysis with Python serves as an introduction to the main concepts of applied Bayesian modeling using PyMC, a state-of-the-art probabilistic programming library, together with other libraries that support and facilitate modeling: ArviZ, for exploratory analysis of Bayesian models; Bambi, for flexible and easy hierarchical linear modeling; PreliZ, for prior elicitation; PyMC-BART, for flexible non-parametric regression; and Kulprit, for variable selection. This updated edition adds a brief, conceptual introduction to probability theory, new topics such as Bayesian additive regression trees (BART), and updated examples. Refined explanations, informed by feedback and experience from previous editions, underscore the book's emphasis on Bayesian statistics. You will explore various models, including hierarchical models, generalized linear models for regression and classification, mixture models, Gaussian processes, and BART, using synthetic and real datasets. By the end of this book, you will have a functional understanding of probabilistic modeling, enabling you to design and implement Bayesian models for your own data science challenges, and you will be well prepared to delve into more advanced material or specialized statistical modeling should the need arise.
Table of Contents (15 chapters)
Preface
Bibliography
Other Books You May Enjoy
Index

1.11 Exercises

We do not know whether the brain works in a Bayesian way, in an approximately Bayesian fashion, or perhaps via some (more or less) evolutionarily optimized heuristics. Nevertheless, we know that we learn by exposing ourselves to data, examples, and exercises… Well, you may say that humans never learn, given our record as a species on subjects such as wars or economic systems that prioritize profit over people's well-being... Anyway, I recommend you do the proposed exercises at the end of each chapter:

  1. Suppose you have a jar with 4 jelly beans: 2 are strawberry-flavored, 1 is blueberry-flavored, and 1 is cinnamon-flavored. You draw one jelly bean at random from the jar.

    1. What is the sample space for this experiment?

    2. We define event A as "the jelly bean drawn is strawberry-flavored" and event B as "the jelly bean drawn is not cinnamon-flavored." What are the probabilities of events A and B?

    3. Are events A and B mutually exclusive? Why or why not?
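
A quick sketch of this setup in Python (the outcome labels are invented; any labels that distinguish the four equally likely beans will do):

```python
# Four equally likely outcomes: label the two strawberry beans separately
sample_space = ["strawberry_1", "strawberry_2", "blueberry", "cinnamon"]

event_a = [o for o in sample_space if o.startswith("strawberry")]
event_b = [o for o in sample_space if o != "cinnamon"]

p_a = len(event_a) / len(sample_space)  # 2/4 = 0.5
p_b = len(event_b) / len(sample_space)  # 3/4 = 0.75

# Outcomes belonging to both A and B; a non-empty overlap means the two
# events can occur together
overlap = set(event_a) & set(event_b)
print(p_a, p_b, sorted(overlap))
```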

  2. Previously, we defined a Python function P to compute the probability of an event using the naive definition of probability. Generalize that function to compute the probability of events when they are not all equally likely. Use this new function to compute the probability of events A and B from the previous exercise. Hint: you can pass a third argument with the probability of each event.
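
One possible generalization is sketched below (the chapter's original P is not reproduced here; this version assumes events are sets of outcomes and that the optional third argument maps each outcome to its probability):

```python
def P(S, A, probs=None):
    """Probability of event A over sample space S.

    probs: optional dict mapping each outcome to its probability;
    when omitted, fall back to the naive (equally likely) definition.
    """
    A = set(A) & set(S)  # ignore outcomes outside the sample space
    if probs is None:
        return len(A) / len(set(S))
    return sum(probs[o] for o in A)

# Jelly-bean sample space from the previous exercise, with flavor probabilities
S = {"strawberry", "blueberry", "cinnamon"}
probs = {"strawberry": 0.5, "blueberry": 0.25, "cinnamon": 0.25}
p_a = P(S, {"strawberry"}, probs)               # 0.5
p_b = P(S, {"strawberry", "blueberry"}, probs)  # 0.75
```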

  3. Use PreliZ to explore different parameters for the BetaBinomial and Gaussian distributions. Use the methods plot_pdf, plot_cdf, and plot_interactive.
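
The exercise asks for PreliZ's plotting helpers; as a library-agnostic fallback (in case PreliZ is not at hand), the same quantities can be computed with scipy.stats — plot_pdf and plot_cdf draw essentially these arrays. The parameter values here are arbitrary choices for illustration:

```python
import numpy as np
from scipy import stats

# Beta-binomial with n=10 trials; a and b shape the underlying beta prior
bb = stats.betabinom(n=10, a=2, b=5)
xs = np.arange(0, 11)
pmf = bb.pmf(xs)  # the heights plot_pdf would draw
cdf = bb.cdf(xs)  # the steps plot_cdf would draw

# A Gaussian for comparison
g = stats.norm(loc=0, scale=1)
print(pmf.sum(), g.cdf(0))
```

Changing a, b, and n (or loc and scale) and re-plotting is the non-interactive analog of what plot_interactive lets you do with sliders.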

  4. We discussed the probability mass/density functions and the cumulative distribution function. But there are other ways to represent distributions, like the percentile point function (ppf). Using the plot_ppf method of PreliZ, plot the percentile point function for the BetaBinomial and Gaussian distributions. Can you explain how the ppf is related to the cdf and pmf/pdf?
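
As a hint for the last question, here is the relationship sketched with scipy.stats (whose ppf is the same quantity plot_ppf draws): the ppf is the inverse of the cdf.

```python
import numpy as np
from scipy import stats

g = stats.norm(0, 1)
q = 0.975
x = g.ppf(q)  # the value below which a fraction q of the probability mass lies

# Applying the cdf to the ppf's output recovers q: ppf inverts the cdf
print(x, np.isclose(g.cdf(x), q))
```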

  5. From the following expressions, which one corresponds to the probability of it being sunny given that it is the 9th of July of 1816?

    1. p(sunny)

    2. p(sunny|July)

    3. p(sunny|9th of July of 1816)

    4. p(9th of July of 1816|sunny)

    5. p(sunny, 9th of July of 1816) / p(9th of July of 1816)

  6. We showed that the probability of choosing a human at random and picking the Pope is not the same as the probability of the Pope being human. In the animated series Futurama, the (Space) Pope is a reptile. How does this change your previous calculations?

  7. Following the example in Figure 1.9, use PreliZ to compute the moments for the SkewNormal distribution for a different combination of parameters. Generate random samples of different sizes, like 10, 100, and 1,000, and see if you can recover the values of the first two moments (mean and variance) from the samples. What do you observe?
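
A sketch of this sampling experiment using scipy.stats.skewnorm in place of PreliZ (here a is the skewness parameter; the particular parameter values and seed are arbitrary choices):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(42)
dist = stats.skewnorm(a=4, loc=0, scale=1)
true_mean, true_var = dist.stats(moments="mv")

for n in (10, 100, 1000):
    sample = dist.rvs(size=n, random_state=rng)
    # sample estimates wobble at small n and settle near the true values
    print(n, float(true_mean), sample.mean(), float(true_var), sample.var())
```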

  8. Repeat the previous exercise for the Student's t-distribution. Try values of ν like 2, 3, and 500. What do you observe?

  9. In the following definition of a probabilistic model, identify the prior and the likelihood:

    Y ∼ Normal(μ, σ)
    μ ∼ Normal(0, 2)
    σ ∼ HalfNormal(0.75)
  10. In the previous model, how many parameters will the posterior have? Compare it with the model for the coin-flipping problem.

  11. Write Bayes’ theorem for the model in exercise 9.
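
One way to make Bayes' theorem for this model concrete is a grid approximation of the posterior; the data vector below is invented purely for illustration:

```python
import numpy as np
from scipy import stats

y = np.array([0.5, -0.2, 1.1, 0.3])  # hypothetical observations

# Grids over the two parameters of the model in exercise 9
mus = np.linspace(-3, 3, 200)
sigmas = np.linspace(0.01, 3, 200)
M, S = np.meshgrid(mus, sigmas)

# Numerator of Bayes' theorem: prior times likelihood (in log space)
log_prior = stats.norm(0, 2).logpdf(M) + stats.halfnorm(scale=0.75).logpdf(S)
log_like = stats.norm(M[..., None], S[..., None]).logpdf(y).sum(axis=-1)

post = np.exp(log_prior + log_like)
post /= post.sum()  # dividing by p(Y), the denominator, normalizes the grid
```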

  12. Let’s suppose that we have two coins; when we toss the first coin, half of the time it lands on tails and half of the time on heads. The other coin is a loaded coin that always lands on heads. If we take one of the coins at random and get a head, what is the probability that this coin is the unfair one?
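
Working through the arithmetic with Bayes' theorem (prior: each coin is picked with probability 1/2; likelihood: the probability of heads under each coin):

```python
# Prior: each coin is equally likely to be picked
p_fair, p_loaded = 0.5, 0.5
# Likelihood of observing heads under each coin
p_heads_fair, p_heads_loaded = 0.5, 1.0

# Total probability of heads, then Bayes' theorem for the loaded coin
p_heads = p_fair * p_heads_fair + p_loaded * p_heads_loaded
p_loaded_given_heads = p_loaded * p_heads_loaded / p_heads
print(p_loaded_given_heads)  # 2/3
```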

  13. Try re-plotting Figure 1.12 using other priors (beta_params) and other data (trials and data).
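
Assuming Figure 1.12 shows beta priors updated with binomial coin-flip data (the conjugate beta-binomial update), here is a sketch for experimenting with other beta_params, trials, and data:

```python
import numpy as np
from scipy import stats

beta_params = [(1, 1), (20, 20), (1, 4)]  # prior (alpha, beta) pairs to try
trials = 20
heads = 6  # hypothetical data: heads observed out of `trials` flips

x = np.linspace(0, 1, 200)
for a, b in beta_params:
    # Conjugacy: the posterior is again a beta distribution
    posterior = stats.beta(a + heads, b + trials - heads)
    # posterior.pdf(x) gives the curve to plot for this prior
    print(a, b, posterior.mean())
```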

  14. Read about the Cromwell rule on Wikipedia: https://en.wikipedia.org/wiki/Cromwell%27s_rule.

  15. Read about probabilities and the Dutch book on Wikipedia: https://en.wikipedia.org/wiki/Dutch_book.