Sign In Start Free Trial
Account

Add to playlist

Create a Playlist

Modal Close icon
You need to login to use this feature.
  • Book Overview & Buying 15 Math Concepts Every Data Scientist Should Know
  • Table Of Contents Toc
  • Feedback & Rating feedback
15 Math Concepts Every Data Scientist Should Know

15 Math Concepts Every Data Scientist Should Know

By : David Hoyle
4.3 (6)
close
close
15 Math Concepts Every Data Scientist Should Know

15 Math Concepts Every Data Scientist Should Know

4.3 (6)
By: David Hoyle

Overview of this book

Data science combines the power of data with the rigor of scientific methodology, with mathematics providing the tools and frameworks for analysis, algorithm development, and deriving insights. As machine learning algorithms become increasingly complex, a solid grounding in math is crucial for data scientists. David Hoyle, with over 30 years of experience in statistical and mathematical modeling, brings unparalleled industrial expertise to this book, drawing from his work in building predictive models for the world's largest retailers. Encompassing 15 crucial concepts, this book covers a spectrum of mathematical techniques to help you understand a vast range of data science algorithms and applications. Starting with essential foundational concepts, such as random variables and probability distributions, you’ll learn why data varies, and explore matrices and linear algebra to transform that data. Building upon this foundation, the book spans general intermediate concepts, such as model complexity and network analysis, as well as advanced concepts such as kernel-based learning and information theory. Each concept is illustrated with Python code snippets demonstrating their practical application to solve problems. By the end of the book, you’ll have the confidence to apply key mathematical concepts to your data science challenges.
Table of Contents (21 chapters)
close
close
Lock Free Chapter
1
Part 1: Essential Concepts
7
Part 2: Intermediate Concepts
13
Part 3: Selected Advanced Concepts

Sampling from distributions

So far, we’ve learned a lot about random variables, probability distributions, and how to calculate some of the key characteristics of a distribution such as its mean and variance, and we’ve learned about some commonly occurring distributions. But so far, it doesn’t feel like we’ve learned much about data. We’ll now change that.

How datasets relate to random variables and probability distributions

We said at the beginning of this chapter that all data is random. This means when data is captured or generated, we are drawing or sampling values from some underlying probability distribution. This is illustrated schematically in Figure 2.10:

Figure 2.10: Diagram illustrating how real data is generated as samples from a population

Figure 2.10: Diagram illustrating how real data is generated as samples from a population

A sample is finite. It represents a snapshot or subset of the entirety of possible outcomes; for example, a subset of all users who might visit a website. But from...

Visually different images
CONTINUE READING
83
Tech Concepts
36
Programming languages
73
Tech Tools
Icon Unlimited access to the largest independent learning library in tech of over 8,000 expert-authored tech books and videos.
Icon Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.
Icon 50+ new titles added per month and exclusive early access to books as they are being written.
15 Math Concepts Every Data Scientist Should Know
notes
bookmark Notes and Bookmarks search Search in title playlist Add to playlist download Download options font-size Font size

Change the font size

margin-width Margin width

Change margin width

day-mode Day/Sepia/Night Modes

Change background colour

Close icon Search
Country selected

Close icon Your notes and bookmarks

Confirmation

Modal Close icon
claim successful

Buy this book with your credits?

Modal Close icon
Are you sure you want to buy this book with one of your credits?
Close
YES, BUY

Submit Your Feedback

Modal Close icon
Modal Close icon
Modal Close icon