Managing Data Science

By: Kirill Dubovikov

Overview of this book

Data science and machine learning can transform any organization and unlock new opportunities. However, employing the right management strategies is crucial to guide the solution from prototype to production. Traditional approaches often fail because they don't fully meet the conditions and requirements of current data science projects. In this book, you'll explore the right approach to data science project management, along with useful tips and best practices to guide you along the way. After understanding the practical applications of data science and artificial intelligence, you'll see how to incorporate them into your solutions. Next, you will go through the data science project life cycle, explore the common pitfalls encountered at each step, and learn how to avoid them. Any data science project requires a skilled team, and this book offers advice on hiring and growing a data science team for your organization. Later, you'll be shown how to efficiently manage and improve your data science projects through the use of DevOps and ModelOps. By the end of this book, you will be well versed in various data science solutions and have gained practical insights into tackling the different challenges that you'll encounter on a daily basis.

Storing data along with the code

As we saw previously, the code in a data science project can be structured as a set of pipelines that produce various artifacts: reports, models, and data. Different versions of the code produce different outputs, and data scientists often need to reproduce results or reuse artifacts from past versions of the pipelines.

This distinguishes data science projects from conventional software projects and creates a need to manage data versions along with the code: Data Version Control (DVC). In a software project, any past version can generally be reconstructed from the source code alone; in a data science project this is not sufficient, because reproducing a result also requires the data that the code consumed and produced. Let's see what problems arise when you try to track datasets using Git.
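To make the idea of versioning data along with the code more concrete, here is a minimal Python sketch. It assumes one possible approach: the dataset itself stays out of the code repository, and a small text "pointer" file holding a content hash is committed in its place. The function names (`file_sha256`, `write_pointer`, `matches_pointer`) and the `.pointer.json` format are hypothetical and chosen only for illustration; they are not the format of any particular tool.

```python
import hashlib
import json
from pathlib import Path


def file_sha256(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def write_pointer(data_path: Path) -> Path:
    """Write a small JSON pointer file next to the dataset.

    The pointer (a hypothetical format) is tiny and text-based, so it can
    be committed with the code while the dataset is stored elsewhere.
    """
    pointer = {
        "path": data_path.name,
        "sha256": file_sha256(data_path),
        "size": data_path.stat().st_size,
    }
    pointer_path = data_path.with_name(data_path.name + ".pointer.json")
    pointer_path.write_text(json.dumps(pointer, indent=2, sort_keys=True) + "\n")
    return pointer_path


def matches_pointer(data_path: Path, pointer_path: Path) -> bool:
    """Check whether the dataset on disk is the version the pointer records."""
    pointer = json.loads(pointer_path.read_text())
    return pointer["sha256"] == file_sha256(data_path)
```

With this arrangement, checking out an old commit restores the pointer file, and `matches_pointer` tells you whether the dataset currently on disk is the one that commit was run against. Storing and retrieving the actual data for each hash is left out of the sketch.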

Tracking and versioning data

...
