Book Image

Healthcare Analytics Made Simple

By : Vikas (Vik) Kumar, Shameer Khader

Book Image

Healthcare Analytics Made Simple

By: Vikas (Vik) Kumar, Shameer Khader

Overview of this book

In recent years, machine learning technologies and analytics have been widely utilized across the healthcare sector. Healthcare Analytics Made Simple bridges the gap between practising doctors and data scientists. It equips the data scientists’ work with healthcare data and allows them to gain better insight from this data in order to improve healthcare outcomes. This book is a complete overview of machine learning for healthcare analytics, briefly describing the current healthcare landscape, machine learning algorithms, and Python and SQL programming languages. The step-by-step instructions teach you how to obtain real healthcare data and perform descriptive, predictive, and prescriptive analytics using popular Python packages such as pandas and scikit-learn. The latest research results in disease detection and healthcare image analysis are reviewed. By the end of this book, you will understand how to use Python for healthcare data analysis, how to import, collect, clean, and refine data from electronic health record (EHR) surveys, and how to make predictive models with this data through real-world algorithms and code examples.

Preface

Who this book is for

What this book covers

To get the most out of this book

Free Chapter

Introduction to Healthcare Analytics

Introduction to Healthcare Analytics

What is healthcare analytics?

Foundations of healthcare analytics

History of healthcare analytics

Examples of healthcare analytics

Exploring the software

Healthcare Foundations

Healthcare Foundations

Healthcare delivery in the US

Patient data – the journey from patient to computer

Standardized clinical codesets

Breaking down healthcare analytics

References and further reading

Machine Learning Foundations

Machine Learning Foundations

Model frameworks for medical decision making

Machine learning pipeline

References and further reading

Computing Foundations – Databases

Computing Foundations – Databases

Introduction to databases

Data engineering with SQL – an example case

Case details – predicting mortality for a cardiology practice

Starting an SQLite session

Data engineering, one table at a time with SQL

References and further reading

Computing Foundations – Introduction to Python

Computing Foundations – Introduction to Python

Variables and types

Data structures and containers

Programming in Python – an illustrative example

Introduction to pandas

Introduction to scikit-learn

Additional analytics libraries

Measuring Healthcare Quality

Measuring Healthcare Quality

Introduction to healthcare measures

US Medicare value-based programs

The Hospital Value-Based Purchasing (HVBP) program

The Hospital Readmission Reduction (HRR) program

The Hospital-Acquired Conditions (HAC) program

The End-Stage Renal Disease (ESRD) quality incentive program

The Skilled Nursing Facility Value-Based Program (SNFVBP)

The Home Health Value-Based Program (HHVBP)

The Merit-Based Incentive Payment System (MIPS)

Other value-based programs

Comparing dialysis facilities using Python

Comparing hospitals

Making Predictive Models in Healthcare

Making Predictive Models in Healthcare

Introduction to predictive analytics in healthcare

Our modeling task – predicting discharge statuses for ED patients

Obtaining the dataset

Starting a Jupyter session

Importing the dataset

Making the response variable

Splitting the data into train and test sets

Preprocessing the predictor variables

Final preprocessing steps

Building the models

Using the models to make predictions

Improving our models

References and further reading

Healthcare Predictive Models – A Review

Healthcare Predictive Models – A Review

Predictive healthcare analytics – state of the art

Overall cardiovascular risk

Congestive heart failure

Readmission prediction

Other conditions and events

References and further reading

The Future – Healthcare and Emerging Technologies

The Future – Healthcare and Emerging Technologies

Healthcare analytics and the internet

Healthcare and deep learning

Obstacles, ethical issues, and limitations

Conclusion of this book

References and further reading

Other Books You May Enjoy

Other Books You May Enjoy

Leave a review - let other readers know what you think

Customer Reviews

5 star

0

4 star

0

3 star

0

2 star

0

1 star

0

Splitting the data into train and test sets

Now that we have our response variable, the next step is to split the dataset into train and test sets. In data science, the training set is the data that is used to determine the model coefficients. In the training phase, the model takes into account the predictor variable values together with the response value to "discover" the rules and the weights that will guide the prediction of new data. The testing set is then used to measure our model performance, as we discussed in Chapter 3, Machine Learning Foundations. Typical splits use 70-80% for the training data and 20-30% for the testing data (unless the dataset is very large, in which case a smaller percentage can be allotted toward the testing set).

Some practitioners also have a validation set that is used to train model parameters, such as the tree size in the random...