The following is a very simple dataset with four features or variables: name, age, gender, and monthly income in dollars ($). A dataset may also be known as a set of observations. The term's features and variables are used interchangeably in most places and also in this book. Let's take a look at the following table:
<-----------------Features ---------->
Name |
Age |
Gender |
Monthly Income($) |
---|---|---|---|
Person A |
20 |
Male |
2000 |
Person B |
45 |
Female |
5000 |
Person C |
36 |
Male |
3000 |
Person D |
55 |
Male |
6500 |
Person E |
27 |
Female |
2800 |
Person F |
31 |
Male |
5900 |
Person G |
33 |
Male |
4800 |
Person X |
59 |
Female |
2400 |
Person Y |
42 |
Male |
7200 |
Person Z |
29 |
Female |
3100 |
Rows or records in the dataset are also known as examples in the machine learning context. There are 10 examples in this dataset. Here, the Name column represents the unit of observation—the unit described by the data that one analyzes. So, you may say that Person A has the features: 20, Male...