Machine Learning in Biotechnology and Life Sciences

By: Saleh Alkhalifa

Overview of this book

The booming fields of biotechnology and life sciences have seen drastic changes over the last few years. With competition growing in every corner, companies around the globe are looking to data-driven methods such as machine learning to optimize processes and reduce costs. This book helps lab scientists, engineers, and managers to develop a data scientist's mindset by taking a hands-on approach to learning about the applications of machine learning to increase productivity and efficiency in no time. You’ll start with a crash course in Python, SQL, and data science to develop and tune sophisticated models from scratch to automate processes and make predictions in the biotechnology and life sciences domain. As you advance, the book covers a number of advanced techniques in machine learning, deep learning, and natural language processing using real-world data. By the end of this machine learning book, you'll be able to build and deploy your own machine learning models to automate processes and make predictions using AWS and GCP.

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Download the color images

Conventions used

Get in touch

Share your thoughts

Section 1: Getting Started with Data

Chapter 1: Introducing Machine Learning for Biotechnology

Understanding the biotechnology field

Combining biotechnology and machine learning

Exploring machine learning software

Summary

Chapter 2: Introducing Python and the Command Line

Technical requirements

Introducing the command line

Discovering the Python language

Tutorial – getting started in Python

Tutorial – working with Rdkit and BioPython

Summary

Chapter 3: Getting Started with SQL and Relational Databases

Technical requirements

Exploring relational databases

Tutorial – getting started with MySQL

Summary

Chapter 4: Visualizing Data with Python

Technical requirements

Exploring the six steps of data visualization

Commonly used visualization libraries

Tutorial – visualizing data in Python

Summary

Section 2: Developing and Training Models

Chapter 5: Understanding Machine Learning

Technical requirements

Understanding ML

Overfitting and underfitting

Developing an ML model

Summary

Chapter 6: Unsupervised Machine Learning

Introduction to UL

Understanding clustering algorithms

Understanding DR

Summary

Chapter 7: Supervised Machine Learning

Understanding supervised learning

Measuring success in supervised machine learning

Understanding classification in supervised machine learning

Understanding regression in supervised machine learning

Summary

Chapter 8: Understanding Deep Learning

Understanding the field of deep learning

Selecting an activation function

Measuring progress with loss

Tutorial – protein sequence classification via LSTMs using Keras and MLflow

Tutorial – anomaly detection in manufacturing using AWS Lookout for Vision

Summary

Chapter 9: Natural Language Processing

Introduction to NLP

Getting started with NLP using NLTK and SciPy

Working with structured data

Tutorial – clustering and topic modeling

Working with unstructured data

Tutorial – developing a scientific data search engine using transformers

Summary

Chapter 10: Exploring Time Series Analysis

Understanding time series data

Exploring the components of a time series dataset

Tutorial – forecasting demand using Prophet and LSTM

Summary

Section 3: Deploying Models to Users

Chapter 11: Deploying Models with Flask Applications

Understanding API frameworks

Working with Flask and Visual Studio Code

Using Flask as an API and web application

Tutorial – Deploying a pretrained model using Flask

Summary

Chapter 12: Deploying Applications to the Cloud

Exploring current cloud computing platforms

Understanding containers and images

Tutorial – deploying a container to AWS (Lightsail)

Tutorial – deploying an application to GCP (App Engine)

Tutorial – deploying an application's code to GitHub

Summary



Working with unstructured data

In the previous section, we explored some of the most common tasks and processes conducted when handling text-based data. More often than not, the data you work with will not be structured, and it may not even be digital. Take, for example, a company that has decided to digitize all of its printed documents, or one that maintains a large repository of documents, none of which are structured or organized. For tasks such as these, we can rely on several AWS products to come to our rescue. We will explore two of the most useful NLP tools in the next few sections.

OCR using AWS Textract

In my opinion, one of the most useful tools available within AWS is an Optical Character Recognition (OCR) tool known as AWS Textract. The main idea behind this tool is to enable users to extract text, tables, and other useful items from images or static PDF documents using pre-built machine learning...
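To make the idea of extracting text from an image more concrete, here is a minimal sketch of how Textract might be called from Python through the boto3 SDK. It is an illustration under stated assumptions rather than the book's own tutorial code: the helper names (extract_lines, ocr_image), the default region, and the confidence threshold are hypothetical, and the live call assumes AWS credentials are already configured on your machine.

```python
from typing import Any, Dict, List


def extract_lines(response: Dict[str, Any], min_confidence: float = 0.0) -> List[str]:
    """Collect the text of every LINE block in a Textract-style response.

    Textract returns a flat list of "Blocks"; each block has a "BlockType"
    (for example PAGE, LINE, or WORD) and, for text blocks, a "Text" value
    and a "Confidence" score between 0 and 100.
    """
    lines = []
    for block in response.get("Blocks", []):
        if block.get("BlockType") != "LINE":
            continue  # skip PAGE and WORD blocks so text is not duplicated
        if block.get("Confidence", 0.0) >= min_confidence:
            lines.append(block.get("Text", ""))
    return lines


def ocr_image(path: str, region_name: str = "us-east-1") -> List[str]:
    """Send a local image to Textract and return the detected lines.

    Hypothetical helper: requires the boto3 package and valid AWS
    credentials. The region is an assumption; use your own.
    """
    import boto3  # imported here so extract_lines works without boto3 installed

    client = boto3.client("textract", region_name=region_name)
    with open(path, "rb") as handle:
        response = client.detect_document_text(Document={"Bytes": handle.read()})
    return extract_lines(response)
```

Keeping the response parsing in its own function means it can be checked against a saved or hand-written response without calling AWS. Something like ocr_image("scanned_page.png") would then return the recognized lines for a local file, where the filename is only a placeholder.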
