Pentaho Data Integration Quick Start Guide

By: María Carina Roldán

Overview of this book

Pentaho Data Integration (PDI) is an intuitive, graphical environment that combines drag-and-drop design with powerful Extract-Transform-Load (ETL) capabilities. Given its power and flexibility, initial attempts to use the tool can be difficult or confusing. This book shortens the learning curve: it provides the guidance you need to become productive, covering the main features of PDI. It demonstrates the interactive features of the graphical designer and takes you through the main ETL capabilities that the tool offers. By the end of the book, you will be able to use PDI to extract, transform, and load the types of data you encounter on a daily basis.
Table of Contents (15 chapters)

Designing and running jobs

In this section, you will learn the basics of working with jobs. First, you will create and run a very simple job in order to get familiar with the process. Then, you will learn how to sequence tasks. Finally, you will be guided into the world of job entries.

Creating and running a simple job 

To create and run jobs, you use the same tool that is used for creating, previewing, and running transformations: Spoon. Let's look at how to use Spoon to work with jobs:

  1. Create a new job. You can do so in several ways: by selecting File | New | Job in the main menu, by clicking on the New file option and then Job in the main toolbar, or by pressing Ctrl + Alt + N.

With the preceding action, you will see a new job template, which looks like a transformation, except for the icon at the top of the work area. You will also see a different Design tree, which contains job entries instead of steps.

  2. From the General folder, drag a START entry to the canvas.


The START entry is...