Book Image

Pentaho Data Integration Quick Start Guide

By : María Carina Roldán
Book Image

Pentaho Data Integration Quick Start Guide

By: María Carina Roldán

Overview of this book

Pentaho Data Integration(PDI) is an intuitive and graphical environment packed with drag and drop design and powerful Extract-Transform-Load (ETL) capabilities. Given its power and flexibility, initial attempts to use the Pentaho Data Integration tool can be difficult or confusing. This book is the ideal solution. This book reduces your learning curve with PDI. It provides the guidance needed to make you productive, covering the main features of Pentaho Data Integration. It demonstrates the interactive features of the graphical designer, and takes you through the main ETL capabilities that the tool offers. By the end of the book, you will be able to use PDI for extracting, transforming, and loading the types of data you encounter on a daily basis.
Table of Contents (15 chapters)

Generating different kinds of files


After loading and transforming your data in many ways, you will want to send the results to a destination. In particular, you may need to save the data in a plain file. Doing so is a simple task. To illustrate how to do it, we will reuse a transformation that was developed in the Filtering Rows section of  Chapter 4, Transforming Data, and will send the data to a text file:

  1. From the code developed in the last chapter, open the transformation that filtered places with more than three bedrooms or that accommodated more than four people. Save the transformation with a different name.
  2. From the Output folder, drag and drop a Text file output step.
  3. Create a hop from the Filter rows step (or the Java Filter step, depending on the step that you used) to the new step. When asked for the kind of hop, select Main output of step.
  4. Double-click on the Text file output step.
    1. Under Filename, type the full path to the destination file. You can include variable names (for example...