Book Image

Comet for Data Science

By : Angelica Lo Duca
Book Image

Comet for Data Science

By: Angelica Lo Duca

Overview of this book

This book provides concepts and practical use cases which can be used to quickly build, monitor, and optimize data science projects. Using Comet, you will learn how to manage almost every step of the data science process from data collection through to creating, deploying, and monitoring a machine learning model. The book starts by explaining the features of Comet, along with exploratory data analysis and model evaluation in Comet. You’ll see how Comet gives you the freedom to choose from a selection of programming languages, depending on which is best suited to your needs. Next, you will focus on workspaces, projects, experiments, and models. You will also learn how to build a narrative from your data, using the features provided by Comet. Later, you will review the basic concepts behind DevOps and how to extend the GitLab DevOps platform with Comet, further enhancing your ability to deploy your data science projects. Finally, you will cover various use cases of Comet in machine learning, NLP, deep learning, and time series analysis, gaining hands-on experience with some of the most interesting and valuable data science techniques available. By the end of this book, you will be able to confidently build data science pipelines according to bespoke specifications and manage them through Comet.
Table of Contents (16 chapters)
1
Section 1 – Getting Started with Comet
5
Section 2 – A Deep Dive into Comet
10
Section 3 – Examples and Use Cases

Preface

A recent survey of machine learning professionals (https://www.comet.com/site/about-us/news-and-events/press-releases/comet-releases-new-survey-highlighting-ais-latest-challenges-too-much-friction-too-little-ml/) concluded that about 40%–60% of interviewed professionals abandoned their data science projects because they were not able to manage the full life cycle process of their data science projects. I’m a data science researcher, and before encountering Comet, I belonged to that 40%–60% of professionals who abandon their data science projects. In fact, during my working experience, I have abandoned many projects without concluding them because of the nature of research, where you test an idea and, if it does not work, you drop it.

Almost a year ago, I discovered Comet, a platform for model tracking and monitoring, and some wonderful people from its team, who opened my mind to the many features provided by Comet. I began to study it, with the hope of keeping my projects organized and moving them from early stages to production. I realized that I was able to conclude all the projects I implemented in Comet because of the simplicity of the platform.

Comet for Data Science is the result of my studies and tests, as well as the countless biweekly meetings with the Comet team. The book aims at helping you to learn how to manage a data science project workflow, from its early stages up to project deployment and reporting. In a single sentence, Comet for Data Science is written to help you to conclude your data science projects successfully.

By picking this book, you will look at the general concepts of data science from a Comet perspective, with the hope that you will increase your productivity. The book will take you through the journey of building a data science project and integrating it into Comet, including exploratory data analysis, model building and evaluation, report building, and, finally, moving the model to production. Throughout the book, you will implement many practical examples that you can use to better understand the described concepts, as well as starting points for your projects.

I hope that this book will add something to your knowledge, and – why not? – help you to become a better data scientist!

Happy reading!