Index
A
- action sequence
- administrative tasks
- getting rid of / Getting rid of administrative tasks
- sales datamart loading, automating / Time for action – automating the loading of the sales datamart, What just happened?
- work backup, creating automatically / Have a go hero – Creating a back up of your work automatically
- Agile BI
- about / Agile BI
- analytic query step
B
- basic calculation
- examination review, calculator step used / Time for action – reviewing examinations by using the Calculator step, What just happened?
- calculator step, using / Time for action – reviewing examinations by using the Calculator step
- data, sorting / What just happened?
- Dummy step / What just happened?
- Select values step, using / What just happened?
- fields, modifying / Adding or modifying fields by using different PDI steps
- field modification, PDI used / Adding or modifying fields by using different PDI steps
- basic modification
- Group by step / Group by step
- business keys to surrogate keys, sales fact table
- translating / Translating the business keys into surrogate keys, Obtaining the surrogate key for a Type I SCD, Obtaining the surrogate key for a Type II SCD, Obtaining the surrogate key for the Junk dimension, Obtaining the surrogate key for the Time dimension
- Type I SCD surrogate key, obtaining / Obtaining the surrogate key for a Type I SCD
- TypeII SCD surrogate key, obtaining / Obtaining the surrogate key for a Type II SCD
- junk dimension surrogate key, obtaining / Obtaining the surrogate key for the Junk dimension
- time dimension surrogate key, obtaining / Obtaining the surrogate key for the Time dimension
C
- calculator step used, basic calculation
- about / Time for action – reviewing examinations by using the Calculator step
- average, taking / Time for action – reviewing examinations by using the Calculator step
- preview / Time for action – reviewing examinations by using the Calculator step
- editing / Time for action – reviewing examinations by using the Calculator step
- Sort rows Step / Time for action – reviewing examinations by using the Calculator step
- Select Values step / Time for action – reviewing examinations by using the Calculator step
- final preview / Time for action – reviewing examinations by using the Calculator step
- examination, reviewing / What just happened?
- Carte
- cast function
- CDF
- about / PDI as a datasource
- change history, maintaining
- steps / Time for action – keeping a history of product changes with the Dimension lookup/update step
- Dimension lookup/update step, using / Time for action – keeping a history of product changes with the Dimension lookup/update step, What just happened?
- transformation, testing / Time for action – testing the transformation that keeps a historyof product changes
- Clone() functions
- cloud-computing
- cluster
- clusters
- code enrichment
- about / Time for accode enrichmentabouttion – calculating flexible scores by using variables
- transformations, modifying / Time for accode enrichmentabouttion – calculating flexible scores by using variables
- named parameters, using / Using named parameters
- Start Script / Using the special Start, Main, and End scripts
- Main script / Using the special Start, Main, and End scripts
- End Script / Using the special Start, Main, and End scripts
- transformation predefined constants, using / Using transformation predefined constants, code enrichmenttransformation predefined constants, usingHave a go hero – keeping the top 10 performances, finding the seven errors
- score calculation, Java code used / getProcessCount() functionaboutHave a go hero – calculating scores with Java code
- coding
- columns
- command line argument
- passing, to transformation / Time for action – calling the hello world transformation with fixed arguments and parameters
- named parameters, differentiating between / Deciding between the use of a command-line argument and a named parameter
- use, analyzing / Have a go hero – analysing the use of arguments and named parameters
- Community Edition
- complex lookups, data
- performing / Tcomplex lookups, dataperformingime for action – using a Database join step to create a list ofsuggested products to buy
- suggested products list, creating / Tcomplex lookups, dataperformingime for action – using a Database join step to create a list ofsuggested products to buy, What just happened?
- database to stream data, joining / Joining data from the database to the stream data by using a Database join step
- customers list, rebuilding / Introducomplex lookups, datacustomers list, rebuildingcing dimensional modeling
- constellation
- custom time dimension dataset
- generating, Kettle variables used / Generating a custom time dimension dataset by using Kettle variables
- creating / Time for action – creating the time dimension dataset, What just happened?
- Get Variables step / Getting variables
- variables, getting / Time for action – getting variables for setting the default starting date
- Get Variables step, using / Using the Get Variables step
D
- dashboard
- screenshot / PDI as a datasource
- data
- reading, from files / Reading data from files
- normalizing / Normalizing data, Time for action – enhancing the matches file by normalizing the dataset, What just happened?
- normalizing, Row Normalizer step used / Modifying the dataset with a Row Normalizer step
- data, database
- looking up / Looking up data in a database
- simple lookups, doing / Doing simple lookups, Time for action – using a Database lookup step to create a list of products to buy
- complex lookups, doing / Doing complex lookups
- data, reading
- from files / Reading data from files
- football match results, reading / Time for action – reading results of football matches from files, What judata, readingfootball match results, readingst happened?
- input files / Input sdata, readinginput filesteps
- input files, properties / Input sdata, readinginput filesteps, Readingdata, readinginput files, properties several files at once
- multiple files, reading at once / Readingdata, readinginput files, properties several files at once, Time fordata, readingmultiple files, reading at once action – reading all your files at a time using a single Text file input step and regular expressions
- multiple files reading, single Text file input step used / Time fordata, readingmultiple files, reading at once action – reading all your files at a time using a single Text file input step and regular expressions, Regular data, readingmultiple files reading, single Text file input step usedexpressions
- regular expressions / Regular data, readingmultiple files reading, single Text file input step usedexpressions
- reading files, troubleshooting / Troubleshooting reading files
- grids / Grids
- data, sending to database
- table list, loading / Time for action – loading a table with a list of manufacturers
- inserting, table output step used / Inserting new data into a database table with the Table output step
- data, updating / Inserting or updating data by using other PDI steps, Time for action – inserting new products or updating existent ones, Time for action – testing the update of existing products, What just happened?
- data, inserting / Inserting or updating data by using other PDI steps, Time for action – inserting new products or updating existent ones, Time for action – testing the update of existing products, What just happened?
- Insert/Update step, using / Inserting or updating data with the Insert/Update step, Have a go hero – populating a films database
- data, XML files
- obtaining / Getting data from XML files
- XPath, using / XPath
- node, selecting / XPath
- path expression, examples / XPath
- Get Data From XML input step, configuring / Configuring the Get data from XML step
- database connections
- database querying
- data, working with / Time for action – getting data about shipped orders, What just happened?
- data obtaining, table input step used / Getting data from the database with the Table input step
- SELECT statement, using / Using the SELECT statement for generating a new dataset
- data cleansing
- about / Cleansing data
- example / Cleansing data, Have a go hero – cleansing films data
- PDI step, using / Cleansing data
- data elimination, from database
- data manipulation
- basic calculation / Basic calculations
- filtering / Filtering
- datamart
- about / Introducomplex lookups, datacustomers list, rebuildingcing dimensional modeling, Exploring the sales datamart
- datawahehouse, difference / Exploring the sales datamart
- sales datamart / Exploring the sales datamart
- dataset
- rows, converting to columns / Converting rows to columns
- data, normalizing / Normalizing data
- modifying, Row Normalizer step used / Modifying the dataset with a Row Normalizer step
- custom time dimension dataset, generating / Generating a custom time dimension dataset by using Kettle variables
- data to files, transfering
- about / Sending data to files
- match results, sending / Time for action – sending the results of matches to a plain file, data to files, transferingmatch results, sendingWhat just happened?
- output files / Output files
- rowset / Rowset
- field / Rowset
- row / Rowset
- streams / Streams
- field, deleting / The Select values step
- field, selecting / The Select values step
- field metadata, changing / The Select values step
- data type, system information
- date field / Date fields
- numeric fields / Numeric fields
- transformation, executing / Time for action – running the examination transformation from a terminal window, What just happened?
- date formats, using / data type, system informationdate formats, usingGo for a hero – formatting 99.55
- number 99.55, formatting / data type, system informationdate formats, usingGo for a hero – formatting 99.55
- data validation
- need for / Validating data
- example / Validating data
- films, checking / Time for action – checking films file with the Data Validator
- simple validation rules, defining / Defining simple validation rules using the Data Validator
- datawahehouse
- datamart, difference / Exploring the sales datamart
- datawarehouse
- DDL
- example / A brief word about SQL
- about / A brief word about SQL
- degenerate dimension
- about / Exploring the sales datamart
- DER
- Design Studio
- dimensional modeling
- datawarehouse / Introducomplex lookups, datacustomers list, rebuildingcing dimensional modeling
- about / Introducomplex lookups, datacustomers list, rebuildingcing dimensional modeling, Loadingdimensional modelingabout dimensions with data
- fact table / Introducomplex lookups, datacustomers list, rebuildingcing dimensional modeling
- dimension tables / Introducomplex lookups, datacustomers list, rebuildingcing dimensional modeling
- star schema / Introducomplex lookups, datacustomers list, rebuildingcing dimensional modeling
- datamart / Introducomplex lookups, datacustomers list, rebuildingcing dimensional modeling
- dimensions
- dimensions, sales datamart
- Dimension tables
- dimension tables, with data
- loading / Time fodimension tables, with dataloadingr action – loading a region dimension with a Combination lookup/update step
- loading, combination lookup/update step used / Time fodimension tables, with dataloadingr action – loading a region dimension with a Combination lookup/update step, What just happened?, Time for action – testing the transformation that loads the region dimension, Descdimension tables, with dataloading, combination lookup/update step usedribing data with dimensions
- dimension data, describing / Descdimension tables, with dataloading, combination lookup/update step usedribing data with dimensions
- change history, maintaining / Keeping a history of changes
- DML
- about / A brief word about SQL
- example / A brief word about SQL
- dynamic clusters
E
- E4X
- Enterprise Console
- Enterprise features
- about / Enterprise features
- errors, capturing
- about / Capturing errors
- film age, calculating / Time for action – capturing errors while calculating the ageof a film
- error handling functionality, using / Using PDI error handling functionality
- transformation, aborting / Aborting a transformation, Time for action – aborting when there are too many errors, Aborting a transformation using the Abort step, errors, capturingtransformation, abortingerrors, capturingAbort step, usingFixing captured errors
- Abort step, using / Aborting a transformation using the Abort step, errors, capturingtransformation, abortingerrors, capturingAbort step, usingFixing captured errors
- captured errors, fixing / errors, capturingcaptured errors, fixingTime for action – treating errors that may appear, What just happened?
- rows, treating / Treating rows coming to the error stream, Have a go hero – capturing errors while seeing who wins
- PDI error handling functionality, activities / Pop quiz – PDI error handling
- exit code
- checking, under Windows / Checking the exit code
- checking, under Unix-based systems / Checking the exit code
F
- facts
- fact table
- field
- about / Rowset
- field modification, basic modification
- split fields / Adding or modifying fields by using different PDI steps
- add constants field / Adding or modifying fields by using different PDI steps
- replace in string field / Adding or modifying fields by using different PDI steps
- number range field / Adding or modifying fields by using different PDI steps
- Value Mapper / Adding or modifying fields by using different PDI steps
- User Defined Java Expression / Adding or modifying fields by using different PDI steps
- calculator step / The Calculator step
- Formula step / The Formula step, Time for action – reviewing examinations by using the Formula step, What just happened?
- student, listing / Have a go hero – listing students and their examinations results
- examples / Time for action – calculating World Cup statistics by grouping data, What just happened?
- file-based system
- migrating, to repository-based system / Migrating from a file-based system to a repository-based system and vice-versa
- file result
- filtering
- frequent words, counting / Time for action – counting frequent words by filtering, What just happened?
- rows, filter rows used / Filtering rows using the Filter rows step
- rows / Have a go hero – playing with filters
- word count, discarding commonly used / Have a go hero – counting words and discarding those that are commonly used
- spoken language, identifying / Time for action – finding out which language people speak , What just happened?
- Stream lookup step / The Stream lookup step
- filter rows step
- using, for filtering row / Filtering rows using the Filter rows step
- first transformation, Spoon
- hello world transformation, creating / Time for action – creating a hello world transformation, What just happened?
- Kettle engine, directing / Directing the Kettle engine with transformations
- interface, exploring / Exploring the Spoon interface
- structure, viewing / Viewing the transformation structure
- running / Running and previewing the transformation, Time for action – running and previewing the hello_world transformation , What just happened?
- previewing / Running and previewing the transformation, Time for action – running and previewing the hello_world transformation , What just happened?
- previewing, results in Execution Results window / Previewing the results in the Execution Results window
- flow-control oriented
- foreign keys (FK)
- formula step
G
- getProcessCount() function
- Get Variables step
- about / Using the Get Variables step
- grain
- grid shortcuts
- Group by step, basic modification
- about / Group by step
- fields, reviewing / Group by step
- preview / Group by step
- tasks / Have a go hero – calculating statistics for the examinations
H
- ${HELLOFOLDER}parameter
- about / What just happened?
- hash table algorithm
- about / The Stream lookup step
- hop
- HSQLDB
- Hybrid SCD
I
- id_junk_sales key
- id_manufacturer key
- id_region key
- installing
- PDI / Time for action – installing PDI, What just happened?
- MySQL / Installing MySQL
- MySQL, on Windows / Time for action – installing MySQL on Windows
- MySQL, on Ubuntu / Time for action – installing MySQL on Ubuntu, What just happened?
J
- JavaScript
- JavaScript code, inserting
- about / Inserting JavaScript code using the Modified Java Script Value step
- Transform Functions / Inserting JavaScript code using the Modified Java Script Value step
- Input fields branch / Inserting JavaScript code using the Modified Java Script Value step
- fields, adding / Adding fields, Have a go hero – adding and modifying fields to the contest data
- fields, modifying / Modifying fields, Have a go hero – adding and modifying fields to the contest data
- compatibility switch, turning on / Turning on the compatibility switch
- code, testing / Testing your code
- average calculations, testing / Time for action – testing the calculation of averages, What just happened?
- script, testing / Testing the script using the Test script button
- new average calculations, testing / Have a go hero – testing the new calculation of the average
- JavaScript step
- simple tasks, doing / Time for action – calculating scores with JavaScript
- scores, calculating / Time for action – calculating scores with JavaScript, What just happened?
- using, in PDI / Using the JavaScript language in PDI
- JavaScript code, inserting / Inserting JavaScript code using the Modified Java Script Value step
- jigsaw puzzle database
- populating / Preparing the environment, Time for action – populating the Jigsaw database
- exploring / Exploring the Jigsaw database model
- manufactures table / Exploring the Jigsaw database model
- products table / Exploring the Jigsaw database model
- buy_methods table / Exploring the Jigsaw database model
- payment_methods table / Exploring the Jigsaw database model
- countries table / Exploring the Jigsaw database model
- cities table / Exploring the Jigsaw database model
- customers table / Exploring the Jigsaw database model
- invoices table / Exploring the Jigsaw database model
- job
- hello world file, customizing / Time for action – customizing the hello world file with arguments and parameters
- flexible version, creating / Time for action – customizing the hello world file with arguments and parameters
- named parameters, using / Using named parameters in jobs, Have a go hero – backing up your work
- running, from repository / Running transformations and jobs from a repository
- designing, shortcuts / Designing transformations and jobs
- job, creating as process flow
- transformation, splitting / Time for action – splitting the generation of top scores by copying and getting rows
- data transfer, copy/get rows mechanism used / Transferring data between transformations by using the copy /get rows mechanism , job, creating as process flowdata transfer, copy/get rows mechanism usedHave a go hero – modifying the flow
- data flow, modifying / job, creating as process flowdata transfer, copy/get rows mechanism usedHave a go hero – modifying the flow
- job, running from repository
- steps / Running transformations and jobs from a repository
- command line options, specifying / Specifying command line options
- job, running from terminal window
- job entries
- abort job / Job entries
- create a folder / Job entries
- delete file / Job entries
- evaluate rows number in a table / Job entries
- File Exists / Job entries
- Job / Job entries
- mail / Job entries
- special entries / Job entries
- success / Job entries
- transformation / Job entries
- job entry
- job entry, executing
- sales report, sending / Time for action – sending a sales report and warning the administrator if something is wrong
- execution flow, modifying / Changing the flow of execution on the basis of conditions, Have a go hero – refining the sales report
- file result list, using / Creating and using a file results list
- file result list, creating / Creating and using a file results list
- job iteration
- about / Iterating jobs and transformations
- custom files, executing / Time for action – generating custom files by executing a transformation for every input row, What just happened?
- every input row, executing / Executing for each row, Have a go hero – processing several files at once, Have a go hero – building lists of products to buy, Have a go hero – e-mail students to let them know how they did
- jobs, nesting
- files, generating / transformation design, enhancingjobs, nestingTime for action – generating the files with top scores by nesting jobs, jobs, nestingfiles, generatingWhat just happened?
- job, running inside another job / Running a job inside another job with a job entry
- variable scope types / Understanding the scope of variables
- join
- junk dimension
- about / Exploring the sales datamart
K
- .ktr file jobs
- .ktr file transformations
- Kettle
- about / Pentaho Data Integration
- kettle.properties file
- Kettle 4.0, features
- Agile BI / Agile BI
- visual improvements / Visual improvements for designing transformations and jobs
- Enterprise features / Enterprise features
- Kettle Developer Support
- Kettle repository
- Kettle variables, XML files
- about / Kettle variables
- using / How and when you can use variables
- exploring / Have a go hero – exploring XML files
- work documentation / Have a go hero – documenting your work
- Kitchen
- running, options / Providing options when running Pan and Kitchen
- log details / Log details
- named parameters / Named parameters
- arguments / Arguments
- variables / Variables
- documentation / Variables
- KPIs
L
- LoadScriptFromTab() function
M
- mapping
- master
- mini-dimension
- mouse-over assistance
- working / Time for action – creating a hop with the mouse-over assistance
- toolbar, using / Using the mouse-over assistance toolbar
- MySQL
- installing / Installing MySQL
- installing, on Windows / Time for action – installing MySQL on Windows
- installing, on Ubuntu / Time for action – installing MySQL on Ubuntu, What just happened?
- MySQL installing
N
- named parameters
- using / Using named parameters
- passing, to transformation / Time for action – calling the hello world transformation with fixed arguments and parameters
- command line argument, differentiating between / Deciding between the use of a command-line argument and a named parameter
- use, analysing / Have a go hero – analysing the use of arguments and named parameters
O
- /option@value syntax
- about / Specifying command line options
- OLTP
- output files
- output steps / Output steps
P
- Pan
- running, options / Providing options when running Pan and Kitchen
- log details / Log details
- named parameters / Named parameters
- arguments / Arguments
- variables / Variables
- documentation / Variables
- partition schemas
- PDI
- and Pentaho BI Suite / Pentaho Data Integration and Pentaho BI Suite
- Kettle / Pentaho Data Integration
- PDI 2.3 / Pentaho Data Integration
- PDI 2.4 / Pentaho Data Integration
- PDI 2.5 / Pentaho Data Integration
- PDI 3.0 / Pentaho Data Integration
- PDI 3.1 / Pentaho Data Integration
- PDI 3.2 / Pentaho Data Integration
- PDI 4.0 / Pentaho Data Integration
- using, in real world scenarios / Using PDI in real world scenarios
- installing / Time for action – installing PDI, What just happened?
- graphic designer, launching / Launching the PDI graphical designer: Spoon, Time for action – starting and customizing Spoon, What just happened?
- Spoon / Time for action – starting and customizing Spoon
- job / Introducing PDI jobs
- best practices / PDI best practices
- features / Extending Kettle with plugins, Overcoming real world risks with some remote execution, Scaling out to overcome bigger risks
- Kettle plug-ins / Extending Kettle with plugins, Have a go hero – listing the top 10 students by using the Head plugin step
- real world risk, overcoming / Overcoming real world risks with some remote execution
- scaling up / Scaling out to overcome bigger risks
- scaling out / Scaling out to overcome bigger risks
- cluster / Scaling out to overcome bigger risks
- master / Scaling out to overcome bigger risks
- dynamic clusters / Scaling out to overcome bigger risks
- cloud-computing / Scaling out to overcome bigger risks
- integrating, with Pentaho BI suite / PDIPDI integrating, with Pentaho BI suitePDI integration, with Pentaho BI suiteaboutPentaho BI suite integration, with PDIabout as a process action
- PDI, using in real world scenarios
- data warehouse, loading / Loading datawarehouses or datamarts
- data mart, loading / Loading datawarehouses or datamarts
- data, integrating / Integrating data
- data, cleansing / Data cleansing
- information, migration / Migrating information
- data, exporting / Exporting data
- integrating, Pentaho BI used / Integrating PDI using Pentaho BI
- PDI best practices
- about / PDI best practices
- PDI elements
- database connections / Creating database connections, partitions, servers, and clusters
- partial schemas / Creating database connections, partitions, servers, and clusters
- slave servers / Creating database connections, partitions, servers, and clusters
- clusters / Creating database connections, partitions, servers, and clusters
- PDI Enterprise Edition
- PDI features
- browsing / Time for action – browsing new PDI features by copyinga dataset
- browsing, dataset copied / Time for action – browsing new PDI features by copyinga dataset, What just happened?
- PDI integration, with Pentaho BI suite
- about / PDIPDI integrating, with Pentaho BI suitePDI integration, with Pentaho BI suiteaboutPentaho BI suite integration, with PDIabout as a process action
- as process action / PDIPDI integrating, with Pentaho BI suitePDI integration, with Pentaho BI suiteaboutPentaho BI suite integration, with PDIabout as a process action
- as datasource / PDI as a datasource
- Pentaho suite / More about the Pentaho suite
- PDI job
- hello world job, creating / Time for action – creating a simple hello world job
- processes, executing / Executing processes with PDI jobs
- transformation job entry, using / Using the transformation job entry
- PDI options, stream merge
- choosing / PDI options for merging streams
- union, creating / PDI options for merging streams
- Bouchard’s rows / Time for action – giving priority to Bouchard by using Append Stream, What just happened?
- tasks, sorting / Have a go hero – sorting and merging all tasks, Have a go hero – trying to find missing countries
- tasks, merging / Have a go hero – sorting and merging all tasks, Have a go hero – trying to find missing countries
- PDI steps
- about / Summarizing the PDI steps that operate on sets of rows
- normalize benefits, verifying / Have a go hero – verifying the benefits of normalization
- films file, normalizing / Have a go hero – normalizing the Films file
- scores, calculating / Have a go hero – calculating scores by judge
- Pentaho BI Suite
- and PDI / Pentaho Data Integration and Pentaho BI Suite
- analysis engine / Pentaho Data Integration and Pentaho BI Suite
- reporting engine / Pentaho Data Integration and Pentaho BI Suite
- data mining / Pentaho Data Integration and Pentaho BI Suite
- dashboards / Pentaho Data Integration and Pentaho BI Suite
- data integration / Pentaho Data Integration and Pentaho BI Suite
- Pentaho BI Platform / Pentaho Data Integration and Pentaho BI Suite
- Pentaho BI suite integration, with PDI
- about / PDIPDI integrating, with Pentaho BI suitePDI integration, with Pentaho BI suiteaboutPentaho BI suite integration, with PDIabout as a process action, More about the Pentaho suite
- as process action / PDIPDI integrating, with Pentaho BI suitePDI integration, with Pentaho BI suiteaboutPentaho BI suite integration, with PDIabout as a process action
- as datasource / PDI as a datasource
- primary key (PK)
- process execution, PDI job
- about / Executing processes with PDI jobs
- job entry / Executing processes with PDI jobs
- hop / Executing processes with PDI jobs
- job running, Spoon used / Using Spoon to design and run jobs
- job design, comparing with job transformation / Using Spoon to design and run jobs
- putRow() functions
- puzzles fact table
R
- RDBMS
- records
- relational database
- repository
- creating / Creating a repository, Time for action – creating a PDI repository
- Kettle repository / Creating repositories to store your transformationand jobs
- details, storing / Creating repositories to store your transformationand jobs
- storage system, working with / Working with the repository storage system
- logging into / Time for action – logging into a repository, What just happened?
- logging into, credentials used / Logging into a repository by using credentials
- user accounts, defining / Defining repository user accounts
- restoring / Backing up and restoring a repository
- backing up / Backing up and restoring a repository
- shortcuts / Repositories
- repository-based system
- migrating, to file-based system / Migrating from a file-based system to a repository-based system and vice-versa
- repository explorer
- using, for content modification / Examining and modifying the contents of a repository with the Repository explorer
- using, for content examination / Examining and modifying the contents of a repository with the Repository explorer
- element, creating / Examining and modifying the contents of a repository with the Repository explorer
- element, opening / Examining and modifying the contents of a repository with the Repository explorer
- element, deleting / Examining and modifying the contents of a repository with the Repository explorer
- repository shortcuts
- Ctrl+R / Repositories
- Ctr+D / Repositories
- Ctrl+E / Repositories
- Ctrl+U / Repositories
- Rhino engine
- root-job
- row
- about / Rowset
- Row denormalizer
- rows, converting to columns
- about / Converting rows to columns
- films file, enhancing / Time for action – enhancing a films file by converting rows to columns
- Row denormalizer step, using / Converting row data to column data by using the Row denormalizer step
- data, aggregating / Aggregating data with a Row denormalizer step, Using Row denormalizer for aggregating data
- total scores, calculating / Time for action – calculating total scores by performances by country, What just happened?, Have a go hero – calculating scores by skill by continent
- rows, Stream split
- copying / Copying rows
- distributing / Distributing rows
- tasks, assigning / Time for action – assigning tasks by distributing, What just happened?
- rowset
- about / Rowset
S
- sales datamart
- dimensions / Exploring the sales datamart
- exploring / Exploring the sales datamart
- junk dimension / Exploring the sales datamart
- degenerate dimension / Exploring the sales datamart
- granularity level, determining / Deciding the level of granularity
- dimensions, loading / Loading the dimensions
- model / Extending the sales datamart model
- sales datamart model
- about / Extending the sales datamart model
- added dimensions / Extending the sales datamart model
- added dimensions, loading / Have a go hero – loading the dimensions for the puzzles star model
- sales fact table
- loading / Time for action – loading the sales fact table by looking up dimensions
- information obtaining, SQL queries used / Getting the information from the source with SQL queries
- business keys to surrogate keys, translating / Translating the business keys into surrogate keys
- scaling out
- scaling up
- SCD
- SELECT statement
- using / Using the SELECT statement for generating a new dataset
- parameters, using / Making flexible queries by using parameters, Time for action – getting orders in a range of dates by using parameters , What just happened?
- parameters, adding / Adding parameters to your queries
- Kettle variables, using / Making flexible queries by using Kettle variables, Time for action – getting orders in a range of dates by using variables
- Kettle variables, using in queries / Using Kettle variables in your queries
- Kettle variables, advantages / Using Kettle variables in your queries
- simple lookups, data
- performing / Time for action – using a Database lookup step to create a list of products to buy
- buyers product list, creating / Time for action – using a Database lookup step to create a list of products to buy
- database values, looking up / Looking up values in a database with the Database lookup step
- slave server
- slave servers
- sniff-testing feature
- split field to rows step
- Spoon
- starting / Time for action – starting and customizing Spoon
- launching / Time for action – starting and customizing Spoon, What just happened?
- about / Spoon
- options window preference, setting / Setting preferences in the Options window
- transformation, storing / Storing transformations and jobs in a repository
- repository method / Storing transformations and jobs in a repository
- jobs, storing / Storing transformations and jobs in a repository
- files method / Storing transformations and jobs in a repository
- method, choosing / Storing transformations and jobs in a repository
- first transformation, creating / Creating your first transformation
- Spoon shortcuts
- Ctrl+Alt+N / General shortcuts
- Ctrl+N / General shortcuts
- Ctrl+O / General shortcuts
- Ctrl+S / General shortcuts
- Ctrl+F4 / General shortcuts
- F9 / General shortcuts
- F10 / General shortcuts
- Shift+F10 / General shortcuts
- F11 / General shortcuts
- Ctrl+J / General shortcuts
- Ctrl+T / General shortcuts
- Ctrl+F / General shortcuts
- Ctrl+Alt+J / General shortcuts
- Ctrl+L / General shortcuts
- Ctrl+Alt+U / General shortcuts
- of job design / Designing transformations and jobs
- of transformation design / Designing transformations and jobs
- SQL
- about / A brief word about SQL
- DDL / A brief word about SQL
- DML / A brief word about SQL
- star schema
- Steel Wheels database
- about / Introducing the Steel Wheels sample database
- tables / Introducing the Steel Wheels sample database
- connecting to / Connecting to the Steel Wheels database
- connection, creating / Time for action – creating a connection with the Steel Wheels database
- connecting, with RDBMS / Connecting with Relational Database Management Systems
- sample database, exploring / Time for action – exploring the sample database
- SQL / A brief word about SQL
- configured database exploring, database explorer used / Exploring any configured database with the PDI Database explorer
- storage system, repository
- logging into / Time for action – logging into a repository, What just happened?
- logging into, credentials used / Logging into a repository by using credentials
- user accounts, using / Defining repository user accounts
- transfomation in folders, creating / Creating transformations and jobs in repository folders
- jobs in folders, creating / Creating transformations and jobs in repository folders
- tasks / Creating transformations and jobs in repository folders
- restoring / Backing up and restoring a repository
- backing up / Backing up and restoring a repository
- stream, merging
- about / Merging streams
- progress, gathering / Time for action – gathering progress and merging all together, What just happened?
- PDI options / PDI options for merging streams
- Stream lookup step, filtering
- using / The Stream lookup step
- word counting, precisely / Have a go hero – counting words more precisely
- streams
- splitting / Splitting streams
- splitting, based on condition / Time for actionstreamssplitting, based on condition – assigning tasks by filtering priorities with the Filter rows step
- merging / Merging streams
- Stream split
- PDI features,browsing / Splitting streams
- rows, copying / Copying rows
- rows, distributing / Distributing rows, Time for action – assigning tasks by distributing, What just happened?
- Stream split, based on condition
- tasks assignment, Filter rows step used / Time for actionstreamssplitting, based on condition – assigning tasks by filtering priorities with the Filter rows step
- task, assigning / What just happened?
- PDI, steps / PDI steps for splitting the stream based on conditions
- tasks assignment, Switch/Case step used / Time for action – assigning tasks by filtering priorities with the Switch/ Case step
- sum() function
- system information
- examination news file, updating / Time for action – updating a file with news about examinations, What just happened?
- Get System Info step / Getting information by using Get System Info step
- data type / Data types
T
- table
- time dimension
- transformation
- named parameters, using / Using named parameters and command-line arguments in transformations, What just happened?, Have a go hero – loading the time dimension from a job
- command line arguments, using / Using named parameters and command-line arguments in transformations, What just happened?, Have a go hero – loading the time dimension from a job
- running, from repository / Running transformations and jobs from a repository
- steps / Transformation steps
- designing, shortcuts / Designing transformations and jobs
- transformation, desigining
- mouse-over assistance / Experiencing the mouse-over assistance, Time for action – creating a hop with the mouse-over assistance
- mouse-over assistance toolbar, using / Using the mouse-over assistance toolbar
- sniff-testing feature / Experiencing the sniff-testing feature
- job drill-down feature / Experiencing the job drill-down feature
- enhanced notes editor / Experiencing even more visual changes
- color-coded logs / Experiencing even more visual changes
- revamped repository explorer / Experiencing even more visual changes
- transformation, enhancing
- transformation, running from repository
- steps / Running transformations and jobs from a repository
- command line options, specifying / Specifying command line options
- transformation design, enhancing
- example / Time for action – generating files with top scores, What just happened?
- subtransformation scores, calculating / Time for action – calculating the top scores with a subtransformation
- subtransformation / Creating and using subtransformations
- subtransformation, using / Creating and using subtransformations
- subtransformation, working / Creating and using subtransformations
- subtransformation, redefining / Have a go hero – refining the subtransformation
- job, creating as process flow / Creating a job as a process flow
- jobs, nesting / transformation design, enhancingjobs, nestingTime for action – generating the files with top scores by nesting jobs
- transformation steps
- abort / Transformation steps
- add constants / Transformation steps
- add sequence / Transformation steps
- append streams / Transformation steps
- calculator / Transformation steps
- combination / Transformation steps
- copy rows to result / Transformation steps
- data Validator / Transformation steps
- database join / Transformation steps
- database lookup / Transformation steps
- delay row / Transformation steps
- delete / Transformation steps
- dimension lookup/update / Transformation steps
- dummy / Transformation steps
- excel input / Transformation steps
- excel output / Transformation steps
- filter rows / Transformation steps
- fixed file input / Transformation steps
- formula / Transformation steps
- generate rows / Transformation steps
- get data from XML / Transformation steps
- get rows from result / Transformation steps
- Get System Info / Transformation steps
- Get Variables / Transformation steps
- Group by / Transformation steps
- If field value is null / Transformation steps
- Insert / Update / Transformation steps
- mapping (sub-transformation) / Transformation steps
- mapping input specification / Transformation steps
- mapping output specification / Transformation steps
- Modified Java Script Value / Transformation steps
- Number range / Transformation steps
- Regex Evaluation / Transformation steps
- Row denormaliser / Transformation steps
- Row Normaliser / Transformation steps
- select values / Transformation steps
- Set Variables / Transformation steps
- Sort rows / Transformation steps
- Split field to rows / Transformation steps
- Split Fields / Transformation steps
- stream lookup / Transformation steps
- Switch / Case / Transformation steps
- table input / Transformation steps
- table output / Transformation steps
- text file input / Transformation steps
- text file output / Transformation steps
- update / Transformation steps
- Value Mapper / Transformation steps
- trap detector
- about / PDI options for merging streams
- Type II SCDs
- using, to maintain entire history / Keeping an entire history of data with a Type II slowly changing dimension, Type II SCDsusing, to maintain entire historyType II SCDsaboutLoading Type II SCDs with the Dimension lookup/update step, Have a go hero – loading a Type II SCD dimension
- about / Keeping an entire history of data with a Type II slowly changing dimension, Type II SCDsusing, to maintain entire historyType II SCDsaboutLoading Type II SCDs with the Dimension lookup/update step
- loading, Dimension lookup/update step used / Type II SCDsusing, to maintain entire historyType II SCDsaboutLoading Type II SCDs with the Dimension lookup/update step, Have a go hero – keeping a history just for the theme of a product
- Type I SCD
- loading, with combination lookup/update step / LoadinSCDaboutg Type I SCD with a Combination lookup/update step, Have a go hero – adding regions to the Region Dimension
- regions, adding / Have a go hero – adding regions to the Region Dimension
- manufactures dimension, loading / Have a go hero – loading the manufacturers dimension, Have a go hero – loading a mini-dimension
U
- Ubuntu
- MySQL, installing / Time for action – installing MySQL on Ubuntu, What just happened?
- unexpected errors, avoiding
- data, validating / Tiunexpected errors, avoidingdata, validatingme for action – validating genres with a Regex Evaluation step, Validating data, Time for action – checking films file with the Data Validator
- genres field, validating / Tiunexpected errors, avoidingdata, validatingme for action – validating genres with a Regex Evaluation step
- data, cleansing / Cleansing data
- unstructured files
- reading / Time for action – changing a list of house descriptions with JavaScript
- previous rows, viewing / Looking at previous rows
- modifying / Have a go hero – enhancing the houses file
- contest files, modifying / Aunstructured filescontest files, modifyingvoiding coding by using purpose-built steps
- user accounts, repository
- defining / Defining repository user accounts
- read-only / Defining repository user accounts
- user / Defining repository user accounts
- administrator / Defining repository user accounts
- predefined user, admin / Defining repository user accounts
- predefined user, guest / Defining repository user accounts
W
- Windows
- MySQL, installing / Time for action – installing MySQL on Windows
X
- XML
- about / What is XML
- PDI transformations files / PDI transformation files
- XML files
- about / XML filesaboutTime for action – getting data from an XML file with information about countries
- basic country information, building / XML filesaboutTime for action – getting data from an XML file with information about countries
- data, obtaining / Getting data from XML files
- Kettle variables / Kettle variables