Chapter 6: Feature Engineering and Labeling
In the previous chapter, we learned how to clean our data and do basic statistical analysis. In this chapter, we will delve into two more types of actions we must perform before we can start our ML training. These two steps are the most important of all besides efficiently cleaning your dataset, and to be good at them, you will require a high amount of experience. This chapter will give you a basis to build upon.
In the first section, we will learn about feature engineering. We will understand the process, how to select predictive features from our dataset, and what methods exist to transform features from our dataset to make them usable for our ML algorithm.
In the second section, we will look at data labeling. Most ML algorithms fall into the category of supervised learning, which means they require labeled training data. We will look at some typical scenarios that require labels and learn how Azure Machine Learning can help with...