Index
A
- access level property / How it works...
- action sequence / Executing a PDI transformation as part of a Pentaho process
- Add sequence step / How it works...
- usage avoiding, to enumerate rows / Avoiding using an Add sequence step to enumerate the rows
- Advanced tab / How it works...
- AgileBI
- data, visualizing with / Visualizing data with AgileBI, How to do it..., How it works...
- alternative notation
- for separator / Alternative notation for a separator
- Analytic Query step / How to do it..., How it works...
- Append streams step / How to do it..., How it works...
- ARFF Output plugin / Getting ready
- ARFF Output step / How it works...
- Argument tab / How it works...
- Attached Files tab / How it works...
- authors.txt file / Getting ready
- authors table, books data structure / Authors
- AWS Console
- security page, URL / Getting ready
- AWS S3 Instance
- data, reading from / Reading data from an AWS S3 Instance, How to do it..., How it works...
B
- Baseball Database
- URL / Getting ready
- Big Data wiki
- URL / Loading data into Hadoop
- Blocking step / Introduction
- Block this step until steps finish / Introduction
- books
- references, URL / Online
- books data structure
- about / Books data structure
- books table store / Books
- authors table store / Authors
- books table, books data structure / Books
- BSON (binary JSON) / Loading data into MongoDB
C
- @ character / How to do it...
- cache duration property / How it works...
- cache property / How it works...
- Carte / How it works...
- categories table, outdoor data structure / categories
- CDA
- URL / There's more...
- CDA API / How it works...
- CDA Editor / Generating files from the PUC with PDI and the CDA plugin
- CDA plugin
- used, for creating files from PUC / Generating files from the PUC with PDI and the CDA plugin, How to do it..., How it works...
- CDA Previewer / Generating files from the PUC with PDI and the CDA plugin
- CDF / Limiting the number of output rows, Populating a CDF dashboard with data coming from a PDI transformation
- CDF dashboard
- populating, with PDI transformation data / Populating a CDF dashboard with data coming from a PDI transformation, How to do it..., How it works..., There's more...
- cells
- value, retrieving in Excel file / Getting the value of specific cells in an Excel file, How to do it..., How it works..., There's more...
- searching for, in Excel file / Looking for a given cell
- Centers for Disease Control and Prevention
- URL / Getting ready
- checksums
- adding, to verify datasets / Adding checksums to verify datasets, How to do it...
- cities table, museums data structure / cities
- Column Configurations (opt) property / How it works...
- Column Configurations II (opt) property / How it works...
- Combination lookup/update step / Using the Combination lookup/update for looking up
- commands
- running, on another server / Running commands on another server, How to do it..., How it works...
- Community Acronym Generator (CAG) / Generating all possible pairs formed from two datasets
- Community Chart Framework (CCF) / How it works...
- Community Dashboard Editor (CDE) / Getting ready
- Community Dashboard Framework
- condition based
- stream, splitting into multiple streams / Splitting a stream into two or more streams based on a condition, How to do it..., How it works...
- Copy Files job entry / Copying or moving one or more files, How it works...
- Copy previous results to args? option / Accessing the copied rows from jobs, transformations, and other entries
- Copy previous results to parameters? option / Accessing the copied rows from jobs, transformations, and other entries
- Copy rows to result step / How it works..., How it works...
- Credit card validator / There's more...
- CSV / Reading files with fixed width fields
- CSV file / How it works...
- custom functionality
- programming / Programming custom functionality, How to do it..., How it works...
D
- Damerau-Levenshtein / There's more...
- data
- retrieving, from database / Getting data from a database, How it works..., There's more...
- retrieving from database, parameters used / Getting data from a database by providing parameters, How to do it..., How it works...
- retrieving from database, runtime built query used / Getting data from a database by running a query built at runtime, How to do it..., There's more...
- from table, deleting / Deleting data from a table, Getting ready, How to do it..., How it works...
- reading, from AWS S3 instance / Reading data from an AWS S3 Instance, How to do it..., How it works...
- loading, in Salesforce.com / Loading data into Salesforce.com, How to do it..., How it works...
- retrieving, from Salesforce.com / Getting data from Salesforce.com, How it works...
- loading, in Hadoop / Loading data into Hadoop, Getting ready, How to do it..., How it works...
- retrieving, from Hadoop / Getting data from Hadoop, How to do it..., How it works...
- loading, in HBase / Loading data into HBase, Getting ready, How to do it..., There's more...
- retrieving, from HBase / Getting data from HBase, How to do it..., How it works...
- loading, in MongoDB / Loading data into MongoDB, How to do it..., How it works...
- retrieving, from MongoDB / Getting data from MongoDB, How to do it...
- getting, from path / Getting data from a different path
- getting, selectively / Getting data selectively
- validating, at runtime / Validating data at runtime, How to do it..., There's more...
- processing into shared transformations, filter criteria used / Processing data into shared transformations via filter criteria and subtransformations, How to do it..., How it works...
- processing into shared transformations, sub-transformations used / Processing data into shared transformations via filter criteria and subtransformations, How to do it..., How it works...
- visualizing, with AgileBI / Visualizing data with AgileBI, How to do it..., There's more...
- visualizing, Instaview used / Using Instaview to analyze and visualize data
- analyzing, Instaview used / Using Instaview to analyze and visualize data
- visualizing, InstaView used / Getting ready, How to do it..., How it works...
- analyzing, InstaView used / Getting ready, How it works...
- reading, from SAS data file / Reading data from a SAS datafile, How to do it..., How it works...
- studying, via stream statistics / Studying data via stream statistics, How to do it..., How it works...
- dataAccessId parameter / How it works...
- data analytics
- about / Introduction
- database
- about / Introduction
- sample database / Sample databases
- Pentaho BI platform databases / Pentaho BI platform databases
- connecting to / Connecting to a database, How to do it..., How it works...
- similar database connections, avoiding / Avoiding creating the same database connection over and over again
- connection properties, advanced / Specifying advanced connection properties
- connecting to, not supported by Kettle / Connecting to a database not supported by Kettle
- connection, checking at runtime / Checking the database connection at runtime
- data, retrieving from / Getting data from a database, How to do it..., There's more...
- data retrieving from, parameters used / Getting data from a database by providing parameters, How to do it..., How it works...
- data retrieving from, runtime built query used / Getting data from a database by running a query built at runtime, How it works..., There's more...
- table from PDI (design time), creating / Creating or altering a database table from PDI (design time), How to do it..., How it works..., There's more...
- table from PDI (design time), altering / Creating or altering a database table from PDI (design time), How to do it..., How it works..., There's more...
- table from PDI (runtime), creating / Creating or altering a database table from PDI (runtime), How it works...
- table from PDI (runtime), altering / Creating or altering a database table from PDI (runtime), How to do it..., There's more...
- connection, changing at runtime / Changing the database connection at runtime, Getting ready, How to do it..., How it works..., There's more...
- values looking for, complex conditions used / Looking for values in a database with complex conditions, How to do it..., How it works..., There's more...
- values looking for, dynamic queries used / Looking for values in a database with dynamic queries, How to do it..., How it works...
- database design tasks
- repetitive database design tasks, performing from PDI / Performing repetitive database design tasks from PDI, How it works...
- Database join step / How it works..., There's more...
- Database lookup step / How it works..., Taking some action when the lookup fails, Taking some action when there are too many results, There's more...
- database metadata
- SQL queries, building via / Building SQL queries via database metadata, Getting ready, How to do It..., How it works...
- database table
- values, looking for / Looking for values in a database table, How to do it..., How it works...
- DataCleaner
- installation location / How to do it...
- documentation, URL / There's more...
- Data Grid step / How it works..., There's more...
- Data grid step
- using, to generate specific data / Using a Data grid step to generate specific data
- data profiling
- with DataCleaner / Data profiling with DataCleaner, Getting ready, How to do it..., How it works...
- datasets
- verifying, checksums added / Adding checksums to verify datasets, How to do it..., How it works...
- pairs creating, from multiple datasets / Generating all possible pairs formed from two datasets, How to do it..., There's more...
- data stream
- altering, with Select values / Altering a data stream with Select values, How to do it..., How it works...
- data structure
- books data structure / Books data structure
- museums data structure / museums data structure
- outdoor data structure / outdoor data structure
- steel wheels data structure / Steel Wheels data structure
- Lahman Baseball Database / Lahman Baseball Database
- data subsets
- working with / Working with subsets of your data
- data types
- about / About data types and formats
- date fields
- Delete file job entry / How it works...
- DELETE operation / How it works...
- Delete step / How it works...
- desc_product fieldname / Changing headers
- Detect empty stream step / There's more...
- developerforce
- URL / Getting ready
- document store / Loading data into MongoDB
- doQuery feature / How it works...
- Double Metaphone / There's more...
- DTD
- about / Validating an XML file against DTD definitions, How to do it...
- limitations / There's more...
- URL / There's more...
- DTD definitions
- XML files, validating against / Validating an XML file against DTD definitions, There's more...
- DTD Validator job entry
- XML files, validating against / How it works...
- Dummy steps
- about / How to do it...
- usage, avoiding / Avoiding the use of Dummy steps
- dynamic sheets
- Excel file, writing with / Writing an Excel file with a dynamic number of sheets, How to do it..., How it works...
- Dynamic SQL row step / How it works..., There's more...
E
- e-mails
- sending, with attached files / Sending e-mails with attached files, How to do it...
- logs, sending through / Sending logs through an e-mail
- sending, in transformation / Sending e-mails in a transformation
- Email job entry / Sending e-mails with attached files
- encoding
- about / About file format and encoding
- URL / There's more...
- Encr / How it works...
- entries
- copied rows, accessing / Accessing the copied rows from jobs, transformations, and other entries
- ETL Metadata Injection step / There's more...
- Excel file
- reading / Reading an Excel file, How to do it..., How it works...
- cells value, retrieiving / Getting the value of specific cells in an Excel file, How to do it..., How it works...
- labels. horizontally arranged / There's more...
- values. horizontally arranged / There's more...
- cell, searching for / Looking for a given cell
- writing, with multiple sheets / Writing an Excel file with several sheets, How to do it..., How it works...
- writing, with dynamic sheets / Writing an Excel file with a dynamic number of sheets, How to do it..., How it works...
- Excel output step / How it works...
- Excel Writer plugin step / Sending e-mails in a transformation
- Execute a transformation window / Sample transformations
- Extract, Transform, and Load (ETL) tool / Processing data into shared transformations via filter criteria and subtransformations
F
- Feedly
- URL / Generating an RSS Feed
- field
- fixed width fields, files reading with / Reading files with fixed width fields
- file name, using as / Using the name of a file (or part of it) as a field, How to do it..., How it works...
- fields
- XML data / XML data in a field
- specifying, path notation used / Specifying fields by using the Path notation, How to do it..., How it works...
- creating, with XML structure / Generating fields with XML structures
- Fields tab / How it works...
- files
- about / Introduction
- simple file, reading / Reading a simple file, How to do it..., How it works...
- format / About file format and encoding
- fields names, altering / Altering the names, order, or metadata of the fields coming from the file
- fields metadata, altering / Altering the names, order, or metadata of the fields coming from the file
- fields order, altering / Altering the names, order, or metadata of the fields coming from the file
- reading, with fixed width fields / Reading files with fixed width fields
- multiple files, reading at once / Reading several files at the same time, How to do it..., How it works...
- semi-structured files, reading / Reading semi-structured files, Getting ready, How to do it..., How it works...
- detail files / Master/detail files
- master files / Master/detail files
- log files / Logfiles
- with one field by row, reading / Reading files having one field per row, How to do it..., There's more...
- with fields occupying multiple rows, reading / Reading files with some fields occupying two or more rows, How to do it..., How it works...
- simple file, writing / Writing a simple file, How it works...
- semi-structured file, writing / Writing a semi-structured file, How to do it..., There's more...
- name, providing / Providing the name of a file (for reading or writing) dynamically, How to do it..., How it works...
- multiple files with similar structure, generating / Generating several files simultaneously with the same structure, but different names
- name, using as field / Using the name of a file (or part of it) as a field, How to do it..., How it works...
- Excel file, reading / Reading an Excel file, How to do it...
- single file, moving / Copying or moving one or more files, How to do it..., Moving files, Creating folders
- multiple files, moving / Copying or moving one or more files, How to do it..., Detecting the existence of the files before copying them, Creating folders
- existence detecting, before copying / Detecting the existence of the files before copying them
- multiple files deleting / Deleting one or more files, How to do it..., There's more...
- single file, deleting / Deleting one or more files, How to do it..., There's more...
- deleted files, identifying / Figuring out which files have been deleted
- retrieving, from remote server / Getting files from a remote server, How to do it..., There's more...
- specifying, to tansfer / Specifying files to transfer
- transferred, information retrieving / Getting information about the files being transferred
- putting, on remote server / Putting files on a remote server, How to do it...
- custom list, moving / Copying or moving a custom list of files, How to do it...
- custom list, copying / Copying or moving a custom list of files, How to do it...
- custom list, deleting / Deleting a custom list of files, How to do it..., How it works...
- comparing / Comparing files and folders, Getting ready, How to do it..., How it works...
- encrypting / Encrypting and decrypting files, How to do it..., How it works...
- decrypting / Encrypting and decrypting files, How to do it..., There's more...
- File Transfer Protocol (FTP) / Getting files from a remote server
- Filter rows step / Splitting a stream into two or more streams based on a condition
- findInfoRowSet() method / Looking up information with additional steps
- fixed width fields
- files, reading with / Reading files with fixed width fields
- folders
- creating / Creating folders
- comparing / Comparing files and folders, How to do it..., How it works..., Comparing folders
- format
- about / About data types and formats
- providing, to output fields / Giving the output fields a format
- Freenode
- IRC, URL / Online
- FTPS
- about / Access via FTPS
- URL / Access via FTPS
- FTP server
- connection, considerations / Some considerations about connecting to an FTP server
- FULL OUTER join / There's more...
- fuzzy match algorithm / How it works...
- Levenshtein / There's more...
- Damerau-Levenshtein / There's more...
- Needleman-Wunsch / There's more...
- Jaro / There's more...
- Jaro-Winkler / There's more...
- Pair letters Similarity / There's more...
- SoundEx / There's more...
- Refined SoundEx / There's more...
- Metaphone / There's more...
- Double Metaphone / There's more...
- Fuzzy match step / Looking for values by proximity, How it works...
G
- GENRE parameter / How it works...
- Get a file with FTP job entry / How it works..., Some considerations about connecting to an FTP server
- Get data from XML step / XML data in a field, How it works...
- Get fields button / Reading files with fixed width fields, XML data in a field
- Get Fields button / How to do it...
- Get files from result step / How it works...
- getRowFrom() method / Looking up information with additional steps
- Get rows from result step / Accessing the copied rows from jobs, transformations, and other entries
- Get System Info step
- about / Get System Info
- URL / Get System Info
- Get XML Data step / How it works...
- Gnu Privacy Guard (GnuPG)
- about / Encrypting and decrypting files
- URL / Getting ready
- Group by step / How it works...
H
- Hadoop
- data, loading in / Loading data into Hadoop, Getting ready, How to do it...
- versions, URL / Getting ready
- data, retrieving from / Getting data from Hadoop, How to do it...
- Hadoop File System (HDFS) / How it works...
- HBase
- data, loading in / Loading data into HBase, Getting ready, How to do it..., There's more...
- data, retrieving from / Getting data from HBase, How to do it...
- headers
- modifying / Changing headers
- hexadecimal notation / Alternative notation for a separator
- HMAC / There's more...
- Hortonworks sandbox environment
- URL / Getting ready
- HTML page
- creating, XML transformation used / Generating an HTML page using XML and XSL transformations, How to do it..., How it works..., There's more...
- creating, XSL transformation used / Generating an HTML page using XML and XSL transformations
- Hypersonic (HSQLDB)
I
- ${Internal.Transformation.Filename.Directory} variable / How to do it...
- Ian Varley
- URL / Getting ready
- Include unspecified field / There's more...
- Info steps / Looking up information with additional steps
- INNER join / There's more...
- Input and Outputs field / How it works...
- Instaview
- using, to analyze data / Using Instaview to analyze and visualize data, Getting ready, How it works...
- Internal.Job.Filename.Directory / How to do it...
- internet
- values, looking over / Looking for values over intranet or the Internet, How to do it...
- Internet Information Services (IIS) / Getting ready
- intranet
- values, looking over / Looking for values over intranet or the Internet, How to do it...
- invoice headers (INV) / Master/detail files
J
- /job / Job XML nodes
- /job/channel-log-table / Job XML nodes
- /job/connection / Job XML nodes
- /job/entries/entry / Job XML nodes
- /job/hops/hop / Job XML nodes
- /job/job-log-table / Job XML nodes
- /job/jobentry-log-table / Job XML nodes
- /job/notepads / Job XML nodes
- Janino library / Getting ready
- Jaro / There's more...
- Jaro-Winkler / There's more...
- Java language
- syntax, URL / There's more...
- Java Script Object Notation (JSON) / Loading data into MongoDB
- JavaScript step
- using, to control entries execution in jobs / Using the JavaScript step to control the execution of the entries in your job
- Job file location / How to do it...
- jobs
- multiple jobs, running in parallel / Processing multiple jobs or transformations in parallel, How to do it..., How it works...
- running, with Kitchen / How to do it..., How it works...
- running, by setting static arguments / Executing a job or a transformation by setting static arguments and parameters, How to do it..., How it works...
- running, by setting static parameters / Executing a job or a transformation by setting static arguments and parameters, How to do it..., How it works...
- executing, by setting arguments / Executing a job or a transformation from a job by setting arguments and parameters dynamically, How to do it..., How it works...
- executing, on conditions known at runtime / Executing a job or a transformation whose name is determined at runtime, How to do it..., How it works..., There's more...
- executing, for row in dataset / Executing part of a job once for every row in a dataset, How to do it..., There's more...
- copied rows, accessing / Accessing the copied rows from jobs, transformations, and other entries
- executing, for every file in files list / Executing a transformation or part of a job once for every file in a list of files
- part executing, multiple times on true condition / Executing part of a job several times until a condition is true, How to do it..., How it works...
- loops, implementing / Implementing loops in a job
- log files, isolating for / Isolating logfiles for different jobs or transformations
- Job XML nodes / Job XML nodes
- JSON files
- working with / Working with JSON files, How to do it..., There's more...
- reading dynamically / Reading JSON files dynamically
- writing / Writing JSON files
- Json Input step / How it works...
- junk dimension tables / There's more...
K
- Kettle / How it works...
- data type / Data type's equivalence
- kettle.properties file / How to do it...
- KettleComponent inputs / Supplying values for named parameters, variables and arguments
- Kettle Transformation File property / How it works...
- Kitchen
- jobs, running with / Launching jobs and transformations
- URL / How to do it...
- KJube / Sending e-mails in a transformation
L
- Lahman's Baseball Archive website
- URL / Getting ready, Getting ready
- Lahman Baseball Database
- about / Lahman Baseball Database
- LEFT OUTER join / There's more...
- Levenshtein / There's more...
- Linux commands
- URL / Getting ready
- Location field / How it works...
- log file
- custom log file, creating / Generating a custom logfile, How to do it..., How it works..., There's more...
- filtering / Filtering the logfile
- clean log file, creating / Creating a clean logfile
- isolating, for different jobs / Isolating logfiles for different jobs or transformations
- isolating, for different transformations / Isolating logfiles for different jobs or transformations
- log files / Logfiles
- logs
- sending, through e-mail / Sending logs through an e-mail
M
- Mail job entry / Avoiding zipping files
- Mail validator / There's more...
- Mail validator job entry / How to do it...
- Mapping (sub-transformation) step / Processing data into shared transformations via filter criteria and subtransformations
- Mapping input specification step / How it works...
- Mapping output specification step / How to do it...
- Marketplace
- plugins, managing with / Managing plugins with the Marketplace, How to do it..., There's more...
- Merge Join step / Joining two or more streams based on given conditions
- Merge Rows (diff) step / How it works...
- Meta-data tab / How it works...
- metadata injection
- using, to reuse transformations / Using Metadata Injection to re-use transformations, How to do it..., How it works..., There's more...
- Metaphone / There's more...
- Modified Java Script Value (MJSV) step / Scripting alternatives to the UDJC step
- Mondrian distribution
- URL / Sample databases
- Mondrian website
- URL / There's more...
- MongoDB
- data, loading in / Loading data into MongoDB, How to do it..., How it works...
- URL, for downloading / Getting ready
- data, retrieving from / Getting data from MongoDB, How to do it...
- documentation, URL / How it works...
- MongoDB Input step / How it works...
- multiple sheets
- Excel file, writing with / Writing an Excel file with several sheets, How to do it..., How it works...
- museums data structure
- about / museums data structure
- museums table / museums
- cities table / cities
- museums table, museums data structure / museums
- MySQL sample databases
- URL / Sample databases
N
- name
- providing, to files / How to do it...
- named parameter / Sample transformation – random list
- name property / How it works...
- Needleman-Wunsch / There's more...
- nested Filter rows steps
- usage, avoiding / Avoiding the use of nested Filter rows steps
- NewsBlur
- URL / Generating an RSS Feed
- numeric fields
O
- ${OUTPUT_FOLDER variable / Sample transformations
- OLAP (On-line Analytical Processing databases) / Creating a Pentaho report with data coming from PDI
- OpenOffice calc files / How it works...
- OpenPGP
- Open Pretty Good Privacy (OpenPGP) / Encrypting and decrypting files
- Oracle Java API documentation
- outdoor data structure
- about / outdoor data structure
- products table / products
- categories table / categories
- Outer join? option / How it works...
- output fields
- format, providing / Giving the output fields a format
- Output file location / How to do it...
- Output Options (opt) property / How it works...
- outputType parameter / How it works...
P
- Pair letters Similarity / There's more...
- Pan
- URL / How to do it...
- param + <name of param.> parameter / How it works...
- parameters property / How it works...
- Parameters tab / How it works...
- parent-child table
- parent_job.getVariable() function / How it works...
- parent_job.setVariable function / How it works...
- path
- data, getting from / Getting data from a different path
- specifying, time saving on / Saving time when specifying Path
- path notation
- used, for specifying fields / Specifying fields by using the Path notation, How to do it..., How it works...
- URL / There's more...
- on sharing, multiple nodes getting / Getting more than one node when the nodes share their Path notation
- PDI / Introduction
- about / Introduction
- Pentaho report, creating with data / Creating a Pentaho report with data coming from PDI, How to do it..., How it works..., There's more...
- Pentaho report, creating / Creating a Pentaho report directly from PDI, How to do it..., There's more...
- used, for creating files from PUC / Generating files from the PUC with PDI and the CDA plugin, How to do it..., How it works...
- PDI (design time)
- database table, altering / Creating or altering a database table from PDI (design time), How to do it..., How it works..., There's more...
- database table, creating / Creating or altering a database table from PDI (design time), How to do it..., There's more...
- PDI (runtime)
- database table, altering / Creating or altering a database table from PDI (runtime), How to do it..., There's more...
- database table, creating / Creating or altering a database table from PDI (runtime), How to do it..., There's more...
- PDI job
- executing, from Pentaho User Console / Executing a PDI job from the Pentaho User Console, How to do it..., How it works..., There's more...
- PDI jobs
- running, Pentaho BI Server configured / Configuring the Pentaho BI Server for running PDI jobs and transformations, How to do it..., There's more...
- PDI transformation data
- CDF dashboard, populating with / Populating a CDF dashboard with data coming from a PDI transformation, How to do it..., How it works..., There's more...
- PDI transformations
- running, Pentaho BI Server configured / Configuring the Pentaho BI Server for running PDI jobs and transformations, How to do it..., How it works...
- executing, as Pentaho process part / Executing a PDI transformation as part of a Pentaho process, How to do it..., How it works...
- location, specifying / Specifying the location of the transformation
- elements / Supplying values for named parameters, variables and arguments
- PEDI / Introduction
- Pentaho
- wiki page, URL / Introduction
- on forums, URL / There's more...
- Pentaho BI platform databases
- hibernate / Pentaho BI platform databases
- quartz / Pentaho BI platform databases
- Sampledata / Pentaho BI platform databases
- Pentaho BI Server
- configuring, to run PDI jobs / Configuring the Pentaho BI Server for running PDI jobs and transformations, How it works..., There's more...
- configuring, to run PDI transformations / Configuring the Pentaho BI Server for running PDI jobs and transformations, How to do it..., There's more...
- Pentaho BI Suite Community Edition (CE)
- URL / Introduction
- Pentaho Business Intelligence Suite
- about / Introduction
- Pentaho Community Tools
- URL / Online
- Pentaho Community Wiki
- URL / There's more..., Online
- Pentaho Design Studio
- URL / Getting ready
- Pentaho Feature and Bug-fix Tracker
- URL / Online
- Pentaho Forums
- URL / Online
- Pentaho report
- creating with data, from PDI / Creating a Pentaho report with data coming from PDI, How to do it..., How it works..., There's more...
- creating, from PDI / Creating a Pentaho report directly from PDI, How to do it..., There's more...
- Pentaho Report Designer
- URL / Getting ready
- Pentaho Reporting / There's more...
- Pentaho Reporting Engine / Creating a Pentaho report with data coming from PDI
- Pentaho Server log / Logfiles
- Pentaho User Console
- PDI jobs, executing / Executing a PDI job from the Pentaho User Console, Getting ready, How to do it..., How it works...
- Pentaho wiki / Introduction
- Pentaho wiki page
- URL / There's more...
- plugins
- managing, with MarketPlace / Managing plugins with the Marketplace, How to do it..., There's more...
- Prashant Rajus blog
- Preview document structure / How it works...
- previous_result.getNrLinesOutput() function / How it works...
- process flow
- creating / Creating a process flow, How to do it..., How it works..., There's more...
- products table, outdoor data structure / products
- proximity
- used, for looking values / Looking for values by proximity, How to do it..., How it works..., There's more...
- PUC
- about / There's more...
- files, creating with PDI / Generating files from the PUC with PDI and the CDA plugin, How to do it..., How it works...
- files, creating with CDA plugin / Generating files from the PUC with PDI and the CDA plugin, How to do it..., How it works...
Q
- query property / How it works...
R
- records
- inserting, alternative / Alternative solution if you just want to insert records
- reference stream / How it works...
- Refined SoundEx / There's more...
- Regular Expressions (RegExp) syntax / How to do it...
- remote server
- files, retrieving from / Getting files from a remote server, How to do it..., How it works...
- files, putting / Putting files on a remote server, How it works...
- Remove tab / How it works...
- Reservoir Sampling / How it works...
- result object / Using the JavaScript step to control the execution of the entries in your job
- RIGHT OUTER join / There's more...
- ROLAP (relational OLAP) / How it works...
- row denormalizer / There's more...
- Row flattener step / How it works...
- rows
- in table, inserting / Inserting or updating rows in a table, How to do it..., How it works...
- in table, updating / Inserting or updating rows in a table, How to do it..., How it works...
- updating, alternative / Alternative solution if you just want to update rows, Alternative way for inserting and updating
- inserting, alternative / Alternative way for inserting and updating
- inserting, on simple primary key generation / Inserting new rows where a simple primary key has to be generated, Getting ready, How to do it..., How it works...
- inserting on simple primary key generation, stored values based / Inserting new rows where the primary key has to be generated based on stored values, How to do it..., How it works..., There's more...
- of two streams, merging with same structure / Merging rows of two streams with the same or different structures, How to do it..., How it works..., There's more...
- of two streams, merging with different structure / Merging rows of two streams with the same or different structures, How to do it..., How it works..., There's more...
- new rows between existent rows, interspersing / Interspersing new rows between existent rows, How to do it..., How it works...
- processing differently, row number based / Processing rows differently based on the row number, How to do it..., There's more...
- specific rows, identifying / Identifying specific rows
- last row in stream, identifying / Identifying the last row in the stream
- enumerating, Add sequence step usage avoiding / Avoiding using an Add sequence step to enumerate the rows
- R package
- by Matt Shotwell, URL / How it works...
- RSS feed
- reading / Reading an RSS Feed, How it works...
- URL / Getting ready
- creating / Generating an RSS Feed, How to do it..., How it works
- RSS Output step / There's more...
- Ruby Scripting / Scripting alternatives to the UDJC step
- runtime
- data, validating at / Validating data at runtime, How to do it..., There's more...
- R_DATABASE / Database connections tables
- R_DATABASE_ATTRIBUTE / Database connections tables
- R_DATABASE_CONTYPE / Database connections tables
- R_DATABASE_TYPE / Database connections tables
- R_JOB / Job tables
- R_JOBENTRY / Job tables
- R_JOBENTRY_ATTRIBUTE / Job tables
- R_JOBENTRY_DATABASE / Database connections tables
- R_JOB_HOP / Job tables
- R_JOB_NOTE / Job tables
- R_STEP / Transformation tables
- R_STEP_ATTRIBUTE / Transformation tables
- R_STEP_DATABASE / Database connections tables
- R_STEP_TYPE / Transformation tables
- R_TRANSFORMATION / Transformation tables
- R_TRANS_HOP / Transformation tables
- R_TRANS_NOTE / Transformation tables
- R__JOBENTRY_TYPE / Job tables
S
- S3 CSV Input step / How it works...
- Salesforce.com
- data, loading in / Loading data into Salesforce.com, How to do it..., How it works...
- data, retrieving from / Getting data from Salesforce.com, How it works...
- docs, URL / How it works...
- Salesforce Input step / How it works...
- sample data
- creating, for testing purposes / Generating sample data for testing purposes, How to do it..., There's more...
- sas7bdat / Reading data from a SAS datafile
- SAS data file
- data, reading from / Reading data from a SAS datafile, How to do it..., How it works...
- SAS file
- reading, need for / Why read a SAS file?
- SAS Input
- about / Reading data from a SAS datafile
- SAS Input step / How it works...
- Sassy
- URL / How it works...
- Select & Alter tab / Altering the names, order, or metadata of the fields coming from the file, How it works...
- Select bucket button / How to do it...
- SELECT statement
- multiple execution / Executing the SELECT statement several times, each for a different set of parameters
- Select values
- used, for altering data stream / Altering a data stream with Select values, How to do it..., How it works...
- Select values step / How it works...
- semi-structured files
- separator
- alternative notation / Alternative notation for a separator
- serialize/de-serialize mechanism / Serializing/De-serializing data
- Set files in result step / Executing a transformation or part of a job once for every file in a list of files
- SFTP
- about / Access via SFTP
- URL / Access via SFTP
- shims
- about / Getting ready
- setting up in Kettle, URL / Getting ready
- for sandbox, URL / Getting ready
- Simple Object Access Protocol (SOAP) / Looking for values by using a web service
- Slawomir Chodnicki's blog
- URL / Avoiding unzipping files
- Sort rows step / How it works...
- SoundEx / There's more...
- sources
- values, looking for / Looking for values in a variety of sources, Getting ready, How to do it..., How it works...
- Split Fields step / How it works...
- Spoon
- built-in optimization tools, using / Using Spoon's built-in optimization tools, How to do it..., How it works...
- SQL queries
- building, via database metadata / Building SQL queries via database metadata, Getting ready, How to do It...
- Steel Wheels structure
- about / Steel Wheels data structure
- stream
- splitting into multiple streams, condition based / Splitting a stream into two or more streams based on a condition, How to do it..., How it works...
- Dummy steps usage, avoiding / Avoiding the use of Dummy steps
- Kettle variable value, comparing against / Comparing against the value of a Kettle variable
- nested Filter rows steps usage, avoiding / Avoiding the use of nested Filter rows steps
- complex conditions difficulties, avoiding / Overcoming the difficulties of complex conditions
- metadata, checking / Making sure that the metadata of the streams is the same
- rows, merging / Telling Kettle how to merge the rows of your streams
- multiple streams, differences creating / Comparing two streams and generating differences, How to do it..., How it works..., There's more...
- multiple streams differences, using / Using the differences to keep a table up-to-date
- middle, variables getting in / Getting variables in the middle of the stream
- output rows number, limiting / Limiting the number of output rows
- multiple stream joining, conditions based / Joining two or more streams based on given conditions, Getting ready, How to do it..., How it works..., There's more...
- steps, executing on empty stream / Executing steps even when your stream is empty, How to do it..., How it works...
- last row, identifying / Identifying the last row in the stream
- Stream Lookup
- Stream Lookup step / How it works..., Speeding up your transformation
- stream statistics
- used, for studying data / Studying data via stream statistics, Getting ready, How to do it..., How it works...
- sub-transformation
- transformation part, moving to / Moving part of a transformation to a subtransformation, How to do it..., How it works..., There's more...
- synchronize after merge / Synchronizing after merge
- Synchronize after merge step / How it works...
T
- /transformation / There's more...
- /transformation/connection / There's more...
- /transformation/info / There's more...
- /transformation/info/log / There's more...
- /transformation/notepads / There's more...
- /transformation/order/hop / There's more...
- /transformation/step / There's more...
- /transformation/step/field / There's more...
- /transformation/step/file / There's more...
- table
- rows, updating / Inserting or updating rows in a table, Getting ready, How to do it..., How it works...
- rows, inserting / Inserting or updating rows in a table, Getting ready, How to do it..., How it works...
- data, deleting from / Deleting data from a table, How to do it..., How it works...
- inserting / Inserting, deleting, or updating a table depending on a field, How to do it..., How it works..., Insert, update, and delete all-in-one
- deleting / Inserting, deleting, or updating a table depending on a field, How to do it..., How it works..., Insert, update, and delete all-in-one
- updating / Inserting, deleting, or updating a table depending on a field, How to do it..., How it works..., Insert, update, and delete all-in-one
- Table Input step / Getting data from a database by providing parameters
- Table Output step / Alternative way for inserting and updating
- Target steps tab / How it works...
- Text file input step / Reading a simple file, How to do it...
- Text file output step / How to do it...
- transformation
- sample / A sample transformation
- e-mails, sending in / Sending e-mails in a transformation
- log files, isolating for / Isolating logfiles for different jobs or transformations
- transformation file
- weather_np.ktr / A sample transformation
- transformation part
- moving, to sub-transformation / Moving part of a transformation to a subtransformation, How to do it..., How it works..., There's more...
- transformations
- multiple transformations, running in parallel / Processing multiple jobs or transformations in parallel, How to do it..., How it works...
- about / Introduction
- sample / Introduction
- running, by setting static arguments / Executing a job or a transformation by setting static arguments and parameters, How to do it..., How it works...
- running, by setting static parameters / Executing a job or a transformation by setting static arguments and parameters, How to do it..., How it works...
- executing, by setting arguments / Executing a job or a transformation from a job by setting arguments and parameters dynamically, How to do it..., How it works...
- executing, on conditions known at runtime / Executing a job or a transformation whose name is determined at runtime, How to do it..., How it works..., There's more...
- copied rows, accessing / Accessing the copied rows from jobs, transformations, and other entries
- executing, for row in dataset / Executing a transformation once for every row in a dataset
- executing, for every file in files list / Executing a transformation or part of a job once for every file in a list of files
- data, transferring between / Other means for transferring or sharing data between transformations
- data, sharing between / Other means for transferring or sharing data between transformations
- reusing, metadata injection used / Using Metadata Injection to re-use transformations, How to do it..., How it works...
- transformations (file-based)
- information, retrieving / Getting information about transformations and jobs (file-based), How to do it..., How it works..., There's more...
- transformations (file-based)
- Job XML nodes / Job XML nodes
- steps and entries information / Steps and entries information
- transformations (repository-based)
- information, getting on / Getting information about transformations and jobs (repository-based), How to do it..., There's more...
- Transformation tables / Transformation tables
- Job tables / Job tables
- Database connections tables / Database connections tables
- transformations, sample
- about / Sample transformations
- hello / Sample transformation – hello
- random list / Sample transformation – random list
- sequence / Sample transformation – sequence
- file list / Sample transformation – file list
U
- UDJC code / Generalizing your UDJC code
- UDJC step
- scripting alternatives / Scripting alternatives to the UDJC step
- UDJE
- about / How to do it...
- UDJE step / How to do it...
- Univariate Statistics step / How to do it..., How it works...
- Universally Unique Identifier (UUID) / There's more...
- Use Kettle Repository option / Specifying the location of the transformation
- User Defined Java Expression (UDJE) / How to do it...
V
- Valid in the parent job / How it works...
- Value Mapper step
- values
- looking for, in database table / Looking for values in a database table, How to do it..., How it works...
- looking for, in sources / Looking for values in a variety of sources, How to do it...
- looking for, proximity used / Looking for values by proximity, How to do it..., How it works..., There's more...
- looking for, web service used / Looking for values by using a web service, Getting ready, How to do it...
- looking for, over intranet / Looking for values over intranet or the Internet, How to do it..., How it works...
- looking for, over internet / Looking for values over intranet or the Internet, How to do it..., How it works...
- values, looking for
- in database, complex conditions used / Looking for values in a database with complex conditions, How to do it..., How it works..., There's more...
- in database, dynamic queries used / Looking for values in a database with dynamic queries, How to do it..., How it works...
- variables property / How it works...
W
- W3C (World Wide Web Consortium)
- URL / Introduction
- W3C recommendation
- URL / There's more...
- w3schools
- URL / Introduction
- Web details
- URL / How it works...
- web scraping / Looking for values over intranet or the Internet
- web service
- used, for looking values / Looking for values by using a web service, How to do it..., How it works..., There's more...
- Web Service Definition Language (WSDL) / How to do it...
- Weka
- about / Building a random data sample for Weka
- random data sample, building for / Building a random data sample for Weka, How to do it..., There's more...
- wiki, URL / How it works...
- on forums, URL / There's more...
- URL / There's more...
- Weka Scoring
- URL / There's more...
- well-formed document
X
- XML
- about / Introduction
- data, in field / XML data in a field
- file name, in field / XML file name in a field
- XML document
- XML files
- reading / Reading simple XML files, How to do it..., How it works...
- large XML files recommendations, URL / There's more...
- well-formed XML files, validating / Validating well-formed XML files, Getting ready, How to do it..., How it works...
- validating, against DTD definitions / Validating an XML file against DTD definitions, Getting ready, There's more...
- validating, against XSD schema / Validating an XML file against an XSD schema, How to do it..., There's more...
- XML Join step / How it works...
- XML output step / How it works...
- XML root structure
- creating / How to do it...
- XML structure
- used, for creating fields / Generating fields with XML structures
- complex XML structures, creating / Generating complex XML structures, How to do it..., How it works...
- creating / How to do it...
- XSD schema
- XML files, validating against / Validating an XML file against an XSD schema, Getting ready, How to do it..., How it works..., There's more...
- XSD Validation step / There's more...
- XSD Validator / There's more...
- XSD Validator step / There's more..., How it works...
- XSLT (Extensible Stylesheet Language Transformations)
Z
- ZIP files
- working with / Working with ZIP files, How to do it..., How it works...
- zipping files, avoiding / Avoiding zipping files
- unzipping files, avoiding / Avoiding unzipping files