Suppose you have collected several football statistics in plain files. Your files look like this:
Group|Date|Home Team |Results|Away Team|Notes Group 1|02/June|Italy|2-1|France| Group 1|02/June|Argentina|2-1|Hungary Group 1|06/June|Italy|3-1|Hungary Group 1|06/June|Argentina|2-1|France Group 1|10/June|France|3-1|Hungary Group 1|10/June|Italy|1-0|Argentina ------------------------------------------- World Cup 78 Group 1
You don't have one, but many files, all with the same structure. You now want to unify all the information in one single file. Let's begin by reading the files.
Create the folder named
pdi_files
. Inside it, create theinput
andoutput
subfolders.By using any text editor, type the file shown and save it under the name
group1.txt
in the folder namedinput
, which you just created. You can also download the file from Packt's official website.Start Spoon.
From the main menu select File | New Transformation.
Expand the...