The KDD98
data set uses a YYMM date format, which is not one of the supported date formats in Modeler. In this recipe we will use Derive nodes to parse the existing date information and reassemble it into a supported format. In this recipe we will extract the month portion of information contained in a variable that combines the month and year in a string. The starting stream has already addressed the year information. We will modify the stream so that it also addresses the month information.
We will start with the Parsing Nonstandard Dates.str
stream, which uses the cup98lrn reduced vars2.txt
data set.
Open the
Parsing Nonstandard Dates.str
stream.Run a preview of the Derive node. Scroll to the far right of the table to see the new variable, and then edit the Derive node. The variable is the
Year_str
variable. Note that the original variable,DOB
, has the two-digit year on the left, and the two digits for the month on the right of a four...