3.1 THE BANK MARKETING DATA SET
We will illustrate how to perform the first two phases of the Data Science Methodology using the bank_marketing_training and bank_marketing_test data sets. Readers may download these data sets from the book series web site: www.dataminingconsultant.com. These data sets are adapted from the bank-additional-full.txt data set¹ from the UCI Machine Learning Repository.² We use only four predictors (age, education, previous_outcome, and days_since_previous), plus the target, response. The data relate to a phone-based direct marketing campaign conducted by a bank in Portugal. The bank was interested in whether or not the people it contacted would subscribe to a term deposit account with the bank. The bank_marketing_training data set contains 26,874 records, while bank_marketing_test contains 10,255 records.
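Before going further, readers may wish to confirm that their downloaded copies match the description above. The following is a minimal Python sketch of such a check, not part of the original data sets' documentation. It assumes that each file is comma-separated with a header row, that the files sit in the working directory under the names used in the text with no extension, and that the column names are spelled as listed here; the helper summarize_csv is a hypothetical name introduced for illustration. Adjust any of these to match the files you actually download.

```python
import csv
from pathlib import Path

# Column names as listed in the text; their exact spelling in the
# downloaded files is an assumption.
EXPECTED_COLUMNS = [
    "age",
    "education",
    "previous_outcome",
    "days_since_previous",
    "response",
]


def summarize_csv(path, expected_columns=EXPECTED_COLUMNS):
    """Return (record_count, missing_columns) for a comma-separated
    file whose first row is a header. Hypothetical helper."""
    with open(path, newline="", encoding="utf-8") as f:
        reader = csv.DictReader(f)
        header = reader.fieldnames or []
        # Expected columns that do not appear in the header row.
        missing = [c for c in expected_columns if c not in header]
        # Count the data rows that follow the header.
        n_records = sum(1 for _ in reader)
    return n_records, missing


if __name__ == "__main__":
    # Assumed local file names, paired with the record counts
    # stated in the text.
    expected_counts = {
        "bank_marketing_training": 26874,
        "bank_marketing_test": 10255,
    }
    for name, expected in expected_counts.items():
        path = Path(name)
        if not path.exists():
            print(f"{name}: not found in the working directory")
            continue
        n, missing = summarize_csv(path)
        print(f"{name}: {n} records (expected {expected}); "
              f"missing columns: {missing}")
```

If the reported counts differ from those given above, or if any column is listed as missing, check the file's delimiter and header spelling first, since both are assumptions in this sketch.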