Book Image

Principles of Strategic Data Science

By : Peter Prevos
Book Image

Principles of Strategic Data Science

By: Peter Prevos

Overview of this book

Mathematics and computer science form an integral part of data science, and understanding them is crucial for efficiently managing data. This book is designed to take you through the entire data science pipeline and help you join the dots between mathematics, programming, and business analysis. You’ll start by learning what data science is and how organizations can use it to revolutionize the way they use their data. The book then covers the criteria for the soundness of data products and demonstrates how to effectively visualize information. As you progress, you’ll discover the strategic aspects of data science by exploring the five-phase framework that enables you to enhance the value you extract from data. Toward the concluding chapters, you’ll understand the role of a data science manager in helping an organization take the data-driven approach. By the end of this book, you’ll have a good understanding of data science and how it can enable you to extract value from your data.
Table of Contents (6 chapters)

Best-Practice Data Science

This chapter discusses a normative framework for creating value with data. Inspired by the Roman architect Vitruvius, the products of data science need to be useful, sound and aesthetic.

Data science is useful when the data is converted into information. This information increases the knowledge of the professionals who use it. This knowledge improves the reality from which the data was extracted. The relationship between reality and data is critical.

Data science is sound when it delivers valid and reliable results and can be reviewed by other experts.

Validity is the extent to which the data describes the aspect of reality it is presumed to represent. In physical measurements, this aspect is governed by physics and chemistry. In measures of the social world, validity is complicated, because we can only record the external states of a human being, and not their state of mind.

The reliability of data and its analysis relates to the accuracy of the information. In physical...