Book Image

Machine Learning in Biotechnology and Life Sciences

By : Saleh Alkhalifa
Book Image

Machine Learning in Biotechnology and Life Sciences

By: Saleh Alkhalifa

Overview of this book

The booming fields of biotechnology and life sciences have seen drastic changes over the last few years. With competition growing in every corner, companies around the globe are looking to data-driven methods such as machine learning to optimize processes and reduce costs. This book helps lab scientists, engineers, and managers to develop a data scientist's mindset by taking a hands-on approach to learning about the applications of machine learning to increase productivity and efficiency in no time. You’ll start with a crash course in Python, SQL, and data science to develop and tune sophisticated models from scratch to automate processes and make predictions in the biotechnology and life sciences domain. As you advance, the book covers a number of advanced techniques in machine learning, deep learning, and natural language processing using real-world data. By the end of this machine learning book, you'll be able to build and deploy your own machine learning models to automate processes and make predictions using AWS and GCP.
Table of Contents (17 chapters)
1
Section 1: Getting Started with Data
6
Section 2: Developing and Training Models
13
Section 3: Deploying Models to Users

Overfitting and underfitting

Within the context of SML, we will prepare our models by fitting them with historical data. The process of fitting a model generally outputs a measure of how well the model generalizes to data that is similar to the data on which the model was trained. Using this output, usually in the form of precision, accuracy, and recall, we can determine whether the method we implemented or the parameters we changed had a positive impact on our model. If we revisit the definition of ML models that from earlier in this chapter, we specifically refer to them as models that learn or generalize from historical data. Models that are able to learn from historical data are referred to as well-fitted models, in the sense that they are able to perform accurately on new and unseen data.

There are instances in which models are underfitted. Underfitted models generally perform poorly on datasets, which means they have not learned to generalize well. These cases are generally...