Deep Learning for Genomics

By : Upendra Kumar Devisetty

Deep Learning for Genomics

By: Upendra Kumar Devisetty

Overview of this book

Deep learning has shown remarkable promise in the field of genomics; however, there is a lack of a skilled deep learning workforce in this discipline. This book will help researchers and data scientists to stand out from the rest of the crowd and solve real-world problems in genomics by developing the necessary skill set. Starting with an introduction to the essential concepts, this book highlights the power of deep learning in handling big data in genomics. First, you’ll learn about conventional genomics analysis, then transition to state-of-the-art machine learning-based genomics applications, and finally dive into deep learning approaches for genomics. The book covers all of the important deep learning algorithms commonly used by the research community and goes into the details of what they are, how they work, and their practical applications in genomics. The book dedicates an entire section to operationalizing deep learning models, which will provide the necessary hands-on tutorials for researchers and any deep learning practitioners to build, tune, interpret, deploy, evaluate, and monitor deep learning models from genomics big data sets. By the end of this book, you’ll have learned about the challenges, best practices, and pitfalls of deep learning for genomics.

Preface

Who is this book for?

What this book covers

To get the most out of this book

Download the example code files

Conventions used

Get in touch

Reviews

Share Your Thoughts

Download a free PDF copy of this book

Part 1 – Machine Learning in Genomics

Free Chapter

Chapter 1: Introducing Machine Learning for Genomics

What is machine learning?

Why machine learning for genomics?

Machine learning for genomics in life sciences and biotechnology

Summary

Chapter 2: Genomics Data Analysis

Technical requirements

What is a genome?

Genome sequencing

Analysis of genomic data

Introduction to Biopython for genomic data analysis

Chapter 3: Machine Learning Methods for Genomic Applications

Technical requirements

Genomics big data

Supervised and unsupervised ML

ML for genomics

An ML use case for genomics – Disease prediction

ML challenges in genomics

Summary

Part 2 – Deep Learning for Genomic Applications

Chapter 4: Deep Learning for Genomics

Understanding what deep learning is and how it works

Anatomy of deep neural networks

DNNs for genomics

Introducing deep learning algorithms and Python libraries

Summary

Chapter 5: Introducing Convolutional Neural Networks for Genomics

Introduction to CNNs

CNNs for genomics

Applications of CNNs in genomics

Summary

Chapter 6: Recurrent Neural Networks in Genomics

What are RNNs?

Introducing RNNs

Different RNN architectures

Applications and use cases of RNNs in genomics

Summary

Chapter 7: Unsupervised Deep Learning with Autoencoders

What is unsupervised DL?

Types of unsupervised DL

What are autoencoders?

Autoencoders for genomics

Summary

Chapter 8: GANs for Improving Models in Genomics

What are GANs?

Challenges working with genomics datasets

How can GANs help improve models?

Practical applications of GANs in genomics

Summary

Part 3 – Operationalizing models

Chapter 9: Building and Tuning Deep Learning Models

Technical requirements

Use case – Predicting the binding site location of the JunD TF

Summary

Chapter 10: Model Interpretability in Genomics

What is model interpretability?

Unlocking business value from model interpretability

Model interpretability methods in genomics

Use case – Model interpretability for genomics

Summary

Chapter 11: Model Deployment and Monitoring

Technical requirements

Introducing model deployment

Monitoring models using advanced tools

Summary

Chapter 12: Challenges, Pitfalls, and Best Practices for Deep Learning in Genomics

Deep learning challenges regarding genomics

Common pitfalls for applying deep learning to genomics

Best practices for applying deep learning to genomics

Summary

Index

Why subscribe?

Other Books You May Enjoy

Packt is searching for authors like you

Share Your Thoughts

Download a free PDF copy of this book

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Use case – Model interpretability for genomics

In this hands-on exercise section, we will build a similar convolutional NN (CNN) model that we built in Chapter 9, Building and Tuning Deep Learning Models, but unlike in Chapter 9, here we will use a simulated dataset of DNA sequences of length 50 bases (whereas in Chapter 9, we have DNA sequence of length 101 bases). In addition, the binding sites in this example are not just for Transcription Factors (TFs) but any protein. The labels are designated as 0 and 1, corresponding to positive and negative binding sites (0 = no binding site and 1 = binding site).

The goal of this is to train a CNN model to predict the DNA binding site of the protein and visualize it in the predictions. Since these are artificial sequences, we have injected the AAAGAGGAAGTT motif into the positive sequence, but don’t worry—the CNN doesn’t know that.

Data collection

For this hands-on tutorial, we will use the simulated data...

Deep Learning for Genomics

By : Upendra Kumar Devisetty

Deep Learning for Genomics

By: Upendra Kumar Devisetty

Overview of this book

Related Content you might be interested in

Current Title:

Deep Learning for Genomics

Applied Machine Learning for Healthcare and Life Sciences Using AWS

R Bioinformatics Cookbook

The Deep Learning Architect’s Handbook

Use case – Model interpretability for genomics

Data collection