Python for Finance Cookbook - Second Edition

By : Eryk Lewinson

5 (1)

Buy this Book

Python for Finance Cookbook - Second Edition

5 (1)

By: Eryk Lewinson

Buy this Book

Overview of this book

Python is one of the most popular programming languages in the financial industry, with a huge collection of accompanying libraries. In this new edition of the Python for Finance Cookbook, you will explore classical quantitative finance approaches to data modeling, such as GARCH, CAPM, factor models, as well as modern machine learning and deep learning solutions. You will use popular Python libraries that, in a few lines of code, provide the means to quickly process, analyze, and draw conclusions from financial data. In this new edition, more emphasis was put on exploratory data analysis to help you visualize and better understand financial data. While doing so, you will also learn how to use Streamlit to create elegant, interactive web applications to present the results of technical analyses. Using the recipes in this book, you will become proficient in financial data analysis, be it for personal or professional projects. You will also understand which potential issues to expect with such analyses and, more importantly, how to overcome them.

Preface

Who this book is for

What this book covers

To get the most out of this book

Get in touch

Free Chapter

Acquiring Financial Data

Getting data from Yahoo Finance

Getting data from Nasdaq Data Link

Getting data from Intrinio

Getting data from Alpha Vantage

Getting data from CoinGecko

Summary

Data Preprocessing

Converting prices to returns

Adjusting the returns for inflation

Changing the frequency of time series data

Different ways of imputing missing data

Converting currencies

Different ways of aggregating trade data

Summary

Visualizing Financial Time Series

Basic visualization of time series data

Visualizing seasonal patterns

Creating interactive visualizations

Creating a candlestick chart

Summary

Exploring Financial Time Series Data

Outlier detection using rolling statistics

Outlier detection with the Hampel filter

Detecting changepoints in time series

Detecting trends in time series

Detecting patterns in a time series using the Hurst exponent

Investigating stylized facts of asset returns

Summary

Technical Analysis and Building Interactive Dashboards

Calculating the most popular technical indicators

Downloading the technical indicators

Recognizing candlestick patterns

Building an interactive web app for technical analysis using Streamlit

Deploying the technical analysis app

Summary

Time Series Analysis and Forecasting

Time series decomposition

Testing for stationarity in time series

Correcting for stationarity in time series

Modeling time series with exponential smoothing methods

Modeling time series with ARIMA class models

Finding the best-fitting ARIMA model with auto-ARIMA

Summary

Machine Learning-Based Approaches to Time Series Forecasting

Validation methods for time series

Feature engineering for time series

Time series forecasting as reduced regression

Forecasting with Meta’s Prophet

AutoML for time series forecasting with PyCaret

Summary

Multi-Factor Models

Estimating the CAPM

Estimating the Fama-French three-factor model

Estimating the rolling three-factor model on a portfolio of assets

Estimating the four- and five-factor models

Estimating cross-sectional factor models using the Fama-MacBeth regression

Summary

Modeling Volatility with GARCH Class Models

Modeling stock returns’ volatility with ARCH models

Modeling stock returns’ volatility with GARCH models

Forecasting volatility using GARCH models

Multivariate volatility forecasting with the CCC-GARCH model

Forecasting the conditional covariance matrix using DCC-GARCH

Summary

Monte Carlo Simulations in Finance

Simulating stock price dynamics using a geometric Brownian motion

Pricing European options using simulations

Pricing American options with Least Squares Monte Carlo

Pricing American options using QuantLib

Pricing barrier options

Estimating Value-at-Risk using Monte Carlo

Summary

Asset Allocation

Evaluating an equally-weighted portfolio’s performance

Finding the efficient frontier using Monte Carlo simulations

Finding the efficient frontier using optimization with SciPy

Finding the efficient frontier using convex optimization with CVXPY

Finding the optimal portfolio with Hierarchical Risk Parity

Summary

Backtesting Trading Strategies

Vectorized backtesting with pandas

Event-driven backtesting with backtrader

Backtesting a long/short strategy based on the RSI

Backtesting a buy/sell strategy based on Bollinger bands

Backtesting a moving average crossover strategy using crypto data

Backtesting a mean-variance portfolio optimization

Summary

Applied Machine Learning: Identifying Credit Default

Loading data and managing data types

Exploratory data analysis

Splitting data into training and test sets

Identifying and dealing with missing values

Encoding categorical variables

Fitting a decision tree classifier

Organizing the project with pipelines

Tuning hyperparameters using grid searches and cross-validation

Summary

Advanced Concepts for Machine Learning Projects

Exploring ensemble classifiers

Exploring alternative approaches to encoding categorical features

Investigating different approaches to handling imbalanced data

Leveraging the wisdom of the crowds with stacked ensembles

Bayesian hyperparameter optimization

Investigating feature importance

Exploring feature selection techniques

Exploring explainable AI techniques

Summary

Deep Learning in Finance

Exploring fastai’s Tabular Learner

Exploring Google’s TabNet

Time series forecasting with Amazon’s DeepAR

Time series forecasting with NeuralProphet

Summary

Other Books You May Enjoy

Index

Customer Reviews

5 (1)

5 star

100%

4 star

3 star

2 star

1 star

What this book covers

Chapter 1, Acquiring Financial Data, covers a few of the most popular sources of high-quality financial data, including Yahoo Finance, Nasdaq Data Link, Intrinio, and Alpha Vantage. It focuses on leveraging dedicated Python libraries and processing data for further analysis.

Chapter 2, Data Preprocessing, describes various techniques used to preprocess data. It describes the crucial steps between obtaining the data and using it for building machine learning models or investigating trading strategies. As such, it covers topics such as converting prices to returns, adjusting them for inflation, imputing missing values, or aggregating trade data into various kinds of bars.

Chapter 3, Visualizing Financial Time Series, focuses on visualizing financial (and not only) time series data. By plotting the data, we can visually identify some patterns, such as trends, seasonality, and changepoints, which we can further confirm using statistical tests. The insights gathered at this point can lead to making better decisions while choosing the modeling approach.

Chapter 4, Exploring Financial Time Series Data, shows how to use various algorithms and statistical tests to automatically identify potential issues with time series data, such as the existence of outliers. Additionally, it covers analyzing data for the existence of trends or other patterns such as mean-reversion. Lastly, it explores the stylized facts of asset returns. Together, those concepts are crucial while working with financial data, as we want to make sure that the models/strategies we are building can accurately capture the dynamics of asset returns.

Chapter 5, Technical Analysis and Building Interactive Dashboards, explains the basics of technical analysis in Python by showing how to calculate some of the most popular indicators and automatically recognize patterns in candlestick data. It also demonstrates how to create a Streamlit-based web app, which enables us to visualize and inspect the predefined TA indicators in an interactive fashion.

Chapter 6, Time Series Analysis and Forecasting, introduces the basics of time series modeling. It starts by looking into the building blocks of time series and how to separate them using various decomposition methods. Then, it covers the concept of stationarity, how to test for it, and how to achieve it in case the original series is not stationary. Lastly, it shows how to use two of the most widely used statistical approaches to time series modeling—the exponential smoothing methods and ARIMA class models.

Chapter 7, Machine Learning-Based Approaches to Time Series Forecasting, starts by explaining different ways of validating time series models. Then, it provides an overview of feature engineering approaches. It also introduces a tool for automatic feature extraction which generates hundreds or thousands of features with a few lines of code. Furthermore, it explains the concept of reduced regression and how to use Meta’s popular Prophet algorithm. The chapter concludes with an introduction to one of the popular AutoML frameworks for time series forecasting.

Chapter 8, Multi-Factor Models, covers estimating various factor models, starting with the simplest one-factor model (CAPM) and then extending it to the more advanced three-, four-, and five-factor models.

Chapter 9, Modeling Volatility with GARCH Class Models, focuses on volatility and the concept of conditional heteroskedasticity. It shows how to use univariate and multivariate GARCH models, which are one of the most popular ways of modeling and forecasting volatility.

Chapter 10, Monte Carlo Simulations in Finance, explains how to use Monte Carlo methods for various tasks, such as simulating stock prices, pricing derivatives with no closed-form solution (American/Exotic options), or estimating the uncertainty of a portfolio (for example, by calculating Value-at-Risk and Expected Shortfall).

Chapter 11, Asset Allocation, starts by explaining the most basic asset allocation strategy, and on its basis, showing how to evaluate the performance of portfolios. Then it shows three different approaches to obtaining the efficient frontier. Lastly, it explores Hierarchical Risk Parity, which is a novel approach to asset allocation based on the combination of graph theory and machine learning.

Chapter 12, Backtesting Trading Strategies, presents how to run backtests of various trading strategies using two approaches (vectorized and event-driven) with the help of popular Python libraries. To do so, it uses a few examples of strategies built on the basis of popular technical indicators or mean-variance portfolio optimization.

Chapter 13, Applied Machine Learning: Identifying Credit Default, shows how to approach a real-life machine learning task of predicting loan defaults. It covers the entire scope of a machine learning project, from gathering and cleaning data to building and tuning a classifier. An important takeaway from this chapter is understanding the general approach to machine learning projects, which can then be applied to many different tasks, be it churn prediction or estimating the price of new real estate in a neighborhood.

Chapter 14, Advanced Concepts for Machine Learning Projects, continues from the workflow introduced in the preceding chapter and demonstrates possible extensions to the MVP stage of ML projects. It starts with presenting more advanced classifiers. Then, it covers alternative approaches to encoding categorical features and describes a few methods of dealing with imbalanced data.

Furthermore, it shows how to create stacked ensembles of ML models and leverage Bayesian hyperparameter tuning to improve upon exhaustive grid search. It also explores various approaches to calculating feature importance and using it to select the most informative predictors. Lastly, it touches upon the rapidly developing field of explainable AI.

Chapter 15, Deep Learning in Finance, describes how to apply some of the recent neural network architectures to two possible use cases in the financial domain—predicting credit card default (a classification task) and forecasting time series.

Python for Finance Cookbook - Second Edition

By : Eryk Lewinson

Python for Finance Cookbook - Second Edition

By: Eryk Lewinson

Overview of this book

Related Content you might be interested in

Current Title:

Python for Finance Cookbook - Second Edition

Machine Learning for Algorithmic Trading

Time Series Analysis with Python Cookbook

Deep Learning for Time Series Cookbook

What this book covers