Index
A
- Airbnb
- reference / Reading plain files
- Amazon S3, HDFS / Reading files from remote locations
C
- Community Edition (CE) / Installing PDI
D
- data
- obtaining, from plain files / Getting data from plain files
- obtaining, from relational databases / Getting data from relational databases, Getting data from a database
- obtaining, from other sources / Getting data from other sources
- transforming / Transforming data in different ways
- extracting, from existing fields / Extracting data from existing fields
- aggregating / Sorting and aggregating data, Aggregating data
- sorting / Sorting data
- searching / Looking up for data
- searching, in secondary stream / Looking for data in a secondary stream
- searching, in database / Looking up data in a database
- updating, in database tables / Inserting and updating data in database tables, Updating data
- inserting, in database tables / Inserting and updating data in database tables, Inserting data
- database
- data, searching / Looking up data in a database
- database tables
- data, inserting / Inserting and updating data in database tables
- data, updating / Inserting and updating data in database tables
- errors, handling / Handling errors
- datamart
- loading / Loading a datamart
- time dimension, loading / Loading a time dimension
- dimensions, loading / Loading other kinds of dimensions
- fact table, loading / Loading a fact table
- dimensions
- loading / Loading other kinds of dimensions
- loading, with combination lookup/update step / Loading a dimension with a combination lookup/update step
- loading, with dimension lookup/update step / Loading a dimension with a dimension lookup/update step
E
- execution of jobs
- and transformations, combining / Combining the execution of jobs and transformations
- extract, transform, and load (ETL) / Introducing PDI
F
- files
- types, generating / Generating different kinds of files
G
- Google Cloud Storage / Reading files from remote locations
- Google Drive
- plain files, reading from / Reading files from Google Drive
- Google Drive API
- reference / Reading files from Google Drive
- graphical designer tool
- configuring / Configuring the graphical designer tool
H
- Hadoop / Taking a tour of the job entries
J
- Java Messaging Service (JMS) / System information and Kettle variables
- job entries
- files, working with / Taking a tour of the job entries
- interacting, with databases / Taking a tour of the job entries
- execution flow, modification conditions / Taking a tour of the job entries
- big data, dealing / Taking a tour of the job entries
- emails, sending / Sending emails
- jobs
- designing / Designing and running jobs
- executing / Designing and running jobs, Creating and running a simple job
- creating / Creating and running a simple job
- results of execution / Understanding the results of execution
- tasks, sequencing / Sequencing tasks
- nesting / Nesting transformations and jobs
- executing, with Kitchen utility / Running jobs with the Kitchen utility
- JSON structures
- parsing / XML and JSON
K
- Kettle / Introducing PDI
- Kettle home directory / Understanding the Kettle home directory
- Kettle variables
- using / Defining and using Kettle variables
- defining / Defining and using Kettle variables
- named parameters, using / Using named parameters
- user-defined variables, creating / Creating user-defined Kettle variables
- Kitchen utility
- used, for job execution / Running jobs with the Kitchen utility
M
- metadata
- manipulating / Manipulating the metadata
N
- new fields
- creating, ways / More ways to create new fields
P
- pan utility
- used, for transformation execution / Running transformations with the Pan utility
- PDI jobs
- purpose / Understanding the purpose of PDI jobs
- examples / Understanding the purpose of PDI jobs
- Pentaho Data Integration (PDI)
- about / Introducing PDI
- installing / Installing PDI
- download link / Installing PDI
- plain files
- data, obtaining / Getting data from plain files, Reading plain files
- reading / Reading plain files
- reading, with visibility / Reading files with great versatility
- reading, from remote locations / Reading files from remote locations
- PostgreSQL
R
- relational databases
- data, obtaining / Getting data from relational databases, Getting data from a database
- connecting to / Connecting to a database and using the database explorer
- database explorer, using / Connecting to a database and using the database explorer
- rows
- filtering / Filtering rows
- filtering, upon conditions / Filtering rows upon conditions
S
- simple transformation
- creating / Creating a simple transformation
- single dataset
- different sources, combining into / Combining different sources into a single dataset
- two different datasets, combining / Combining two different datasets into a single dataset
- Spoon
- about / Configuring the graphical designer tool
- customizing / Configuring the graphical designer tool
- interface, exploring / Exploring the Spoon interface
- used, for working with jobs / Creating and running a simple job
- stream
- splitting, upon conditions / Splitting the stream upon conditions
- subjob / Nesting transformations and jobs
- system information
- obtaining / System information and Kettle variables
T
- transformation
- about / Designing, previewing, and running transformations
- designing / Designing and previewing a transformation
- previewing / Designing and previewing a transformation
- logging options / Understanding the logging options
- Step Metrics tab / Understanding the Step Metrics tab
- errors, dealing with / Dealing with errors while designing
- saving / Saving and running a transformation
- executing / Saving and running a transformation
- executing, with pan utility / Running transformations with the Pan utility
- executing, from job / Executing transformations from a job
V
- Virtual File System (VFS) / Reading files from remote locations
X
- XML files
- reading / XML and JSON
- XML Input Stream (StAX) / XML and JSON