Book Image

Machine Learning in Biotechnology and Life Sciences

By : Saleh Alkhalifa
Book Image

Machine Learning in Biotechnology and Life Sciences

By: Saleh Alkhalifa

Overview of this book

The booming fields of biotechnology and life sciences have seen drastic changes over the last few years. With competition growing in every corner, companies around the globe are looking to data-driven methods such as machine learning to optimize processes and reduce costs. This book helps lab scientists, engineers, and managers to develop a data scientist's mindset by taking a hands-on approach to learning about the applications of machine learning to increase productivity and efficiency in no time. You’ll start with a crash course in Python, SQL, and data science to develop and tune sophisticated models from scratch to automate processes and make predictions in the biotechnology and life sciences domain. As you advance, the book covers a number of advanced techniques in machine learning, deep learning, and natural language processing using real-world data. By the end of this machine learning book, you'll be able to build and deploy your own machine learning models to automate processes and make predictions using AWS and GCP.
Table of Contents (17 chapters)
1
Section 1: Getting Started with Data
6
Section 2: Developing and Training Models
13
Section 3: Deploying Models to Users

Introduction to UL

We will define UL as a subset of ML in which models are trained without the existence of categories or labels. Unlike its supervised counterpart, UL relies on the development of models to capture patterns in the form of features to extract insights from the data. Let's now take a closer look at the two main categories of UL.

There exist many different methods and techniques that fall within the scope of UL. We can group these methods into two main categories: those with discrete data (clustering) and those with continuous data (DR). We can see a graphical representation of this here:

Figure 6.1 – The two types of UL

In each of these techniques, data is either grouped or transformed in order to determine labels or extract insights and representations without knowing the labels or categories of the data ahead of time. Take, for example, the breast cancer dataset we worked with in Chapter 5, Understanding Machine Learning, in...