Machine Learning in Biotechnology and Life Sciences

By : Saleh Alkhalifa

Machine Learning in Biotechnology and Life Sciences

By: Saleh Alkhalifa

Overview of this book

The booming fields of biotechnology and life sciences have seen drastic changes over the last few years. With competition growing in every corner, companies around the globe are looking to data-driven methods such as machine learning to optimize processes and reduce costs. This book helps lab scientists, engineers, and managers to develop a data scientist's mindset by taking a hands-on approach to learning about the applications of machine learning to increase productivity and efficiency in no time. You’ll start with a crash course in Python, SQL, and data science to develop and tune sophisticated models from scratch to automate processes and make predictions in the biotechnology and life sciences domain. As you advance, the book covers a number of advanced techniques in machine learning, deep learning, and natural language processing using real-world data. By the end of this machine learning book, you'll be able to build and deploy your own machine learning models to automate processes and make predictions using AWS and GCP.

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Download the color images

Conventions used

Get in touch

Share your thoughts

Section 1: Getting Started with Data

Free Chapter

Chapter 1: Introducing Machine Learning for Biotechnology

Understanding the biotechnology field

Combining biotechnology and machine learning

Exploring machine learning software

Summary

Chapter 2: Introducing Python and the Command Line

Technical requirements

Introducing the command line

Discovering the Python language

Tutorial – getting started in Python

Tutorial – working with Rdkit and BioPython

Summary

Chapter 3: Getting Started with SQL and Relational Databases

Technical requirements

Exploring relational databases

Tutorial – getting started with MySQL

Summary

Chapter 4: Visualizing Data with Python

Technical requirements

Exploring the six steps of data visualization

Commonly used visualization libraries

Tutorial – visualizing data in Python

Summary

Section 2: Developing and Training Models

Chapter 5: Understanding Machine Learning

Technical requirements

Understanding ML

Overfitting and underfitting

Developing an ML model

Summary

Chapter 6: Unsupervised Machine Learning

Introduction to UL

Understanding clustering algorithms

Understanding DR

Summary

Chapter 7: Supervised Machine Learning

Understanding supervised learning

Measuring success in supervised machine learning

Understanding classification in supervised machine learning

Understanding regression in supervised machine learning

Summary

Chapter 8: Understanding Deep Learning

Understanding the field of deep learning

Selecting an activation function

Measuring progress with loss

Tutorial – protein sequence classification via LSTMs using Keras and MLflow

Tutorial – anomaly detection in manufacturing using AWS Lookout for Vision

Summary

Chapter 9: Natural Language Processing

Introduction to NLP

Getting started with NLP using NLTK and SciPy

Working with structured data

Tutorial – clustering and topic modeling

Working with unstructured data

Tutorial – developing a scientific data search engine using transformers

Summary

Chapter 10: Exploring Time Series Analysis

Understanding time series data

Exploring the components of a time series dataset

Tutorial – forecasting demand using Prophet and LSTM

Summary

Section 3: Deploying Models to Users

Chapter 11: Deploying Models with Flask Applications

Understanding API frameworks

Working with Flask and Visual Studio Code

Using Flask as an API and web application

Tutorial – Deploying a pretrained model using Flask

Summary

Chapter 12: Deploying Applications to the Cloud

Exploring current cloud computing platforms

Understanding containers and images

Tutorial – deploying a container to AWS (Lightsail)

Tutorial – deploying an application to GCP (App Engine)

Tutorial – deploying an application's code to GitHub

Summary

Why subscribe?

Other Books You May Enjoy

Packt is searching for authors like you

Share your thoughts

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Chapter 1: Introducing Machine Learning for Biotechnology

How do I get started? This is a question that I have received far too frequently over my last few years as a data scientist and consultant operating in the technology/biotechnology sectors, and the answer to this question never really seemed to change from person to person. My recommendation was generally along the lines of learning Python and data science through online courses and following a few tutorials to get a sense of how things worked. What I found was that the vast majority of scientists and engineers that I have encountered, who are interested in learning data science, tend to get overwhelmed by the large volume of resources and documentation available on the internet. From Getting Started in Python courses to Comprehensive Machine Learning guides, the vast majority of those who ask the question How do I get started? often find themselves confused and demotivated just a few days into their journey. This is especially true for scientists or researchers in the lab who do not usually interact with code, algorithms, or predictive models. Using the Terminal command line for the first time can be unusual, uncomfortable, and – to a certain extent – terrifying to a new user.

This book exists to address this problem. This is a one-stop shop to give scientists, engineers, and everyone in-between a fast and efficient guide to getting started in the beautiful field of data science. If you are not a coder and do not intend to be, you have the option to read this book from cover to cover without ever using Python or any of the hands-on resources. You will still manage to walk away with a strong foundation and understanding of machine learning and its useful capabilities, and what it can bring to the table within your team. If you are a coder, you have the option to follow along on your personal computer and complete all the tutorials we will cover. All of the code within this book is inclusive, connected, and designed to be fully replicable on your device. In addition, all of the code in this book and its associated tutorials is available online for your convenience. The tutorials we will complete can be thought of as blueprints to a certain extent, in the sense that they can be recycled and applied to your data. So, depending on what your expectations of the phrase getting started are, you will be able to use this book effectively and efficiently, regardless of your intent to code. So, how do we plan on getting started?

Throughout this book, we will introduce concepts and tutorials that cater to problems and use cases that are commonly experienced in the technology and biotechnology sectors. Unlike many of the courses and tutorials available online, this book is well-connected, condensed, and chronological, thus offering you a fast and efficient way to get up to speed on data science. In under 400 pages, we will introduce the main concepts and ideas relating to Python, SQL, machine learning, deep learning, natural language processing, and time-series analysis. We will cover some popular approaches, best practices, and important information every data scientist should know. In addition to all of this, we will not only put on our data scientist hats to train and develop several powerful predictive models, but we will also put on our data engineer hats and deploy our models to the cloud using Amazon Web Services (AWS) and Google Cloud Platform (GCP). Whether you are planning to bring data science to your current team, train and deploy the models yourself, or start interviewing for data scientist positions, this book will equip you with the right tools and resources to start your new journey, starting with this first chapter. In the following sections, we will cover a few interesting topics to get us started:

Understanding the biotechnology field
Combining biotechnology and machine learning
Exploring machine learning software

With that in mind, let's look at some of the fun areas within the field of biotechnology that are ripe for exploration when it comes to machine learning.

Machine Learning in Biotechnology and Life Sciences

By : Saleh Alkhalifa

Machine Learning in Biotechnology and Life Sciences

By: Saleh Alkhalifa

Overview of this book

Related Content you might be interested in

Current Title:

Machine Learning in Biotechnology and Life Sciences

Hands-On Automated Machine Learning

Deep Learning for Genomics

Exploratory Data Analysis with Python Cookbook

Chapter 1: Introducing Machine Learning for Biotechnology