Oracle Data Integrator 11g Cookbook

Overview of this book

Oracle Data Integrator (ODI) is Oracle's strategic data integration platform for high-speed data transformation and movement between different systems. From high-volume batches, to SOA-enabled data services, to trickle-feed operations, ODI is a cutting-edge platform that offers heterogeneous connectivity, enterprise-level deployment, and strong administrative, diagnostic, and management capabilities.

"Oracle Data Integrator 11g Cookbook" will take you on a journey beyond your first steps with ODI to a new level of proficiency, lifting the cover on many of the internals of the product to help you better leverage its most advanced features.

The first part of this book focuses on the administrative tasks required for a successful deployment, then shows you how to best leverage Knowledge Modules, with explanations of their internals and specific examples. Next, we look into advanced coding techniques for interfaces, packages, and models, with a focus on XML. Finally, the book lifts the cover on web services and the ODI SDK, along with additional advanced techniques that may be unknown to many users.

Throughout "Oracle Data Integrator 11g Cookbook", the authors convey real-world advice and best practices learned from their extensive hands-on experience.

Processing a large number of files in parallel


If you have worked through the two previous recipes, you should already have in place a framework for detecting the presence of a variable number of files in a designated location. You will also have developed a method for processing each of those files as they appear in a prepared list (that is, a table). But consider a situation where there are hundreds or even thousands of files to be processed. Managing all of those files serially would likely prove to be a bottleneck.

In this recipe, we will enhance the file-processing framework by introducing a way to execute the most critical components in parallel.
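Within ODI, this parallelism is typically achieved by launching a scenario asynchronously for each file. Outside the tool, the underlying pattern — fanning work items from a prepared list out to a bounded pool of workers — can be sketched in Python; the file names and the worker body below are hypothetical stand-ins, not part of the recipe itself:

```python
from concurrent.futures import ThreadPoolExecutor

def process_file(name):
    # Placeholder for the per-file work; in ODI this would be an
    # asynchronous invocation of a file-processing scenario.
    return f"processed {name}"

# Hypothetical file list, standing in for the prepared table
files = [f"data_{i}.csv" for i in range(8)]

# Fan the files out to a bounded pool of workers; map() preserves
# the input order of the results.
with ThreadPoolExecutor(max_workers=4) as pool:
    results = list(pool.map(process_file, files))

print(len(results))  # → 8
```

Bounding the pool size matters: with hundreds or thousands of files, launching one unbounded thread (or session) per file would overwhelm the agent, so a fixed degree of parallelism is the usual compromise.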

Getting ready

We will start from where we left off in the previous recipe, building on what we already have: a table that contains the list of files to be processed.
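When several workers draw from the same list concurrently, each file must be claimed by exactly one of them, which is usually done with a status column on the file-list table. The sketch below illustrates the idea with an in-memory SQLite table; the table and column names (`file_list`, `file_name`, `status`) are hypothetical, not the ones used in the previous recipe:

```python
import sqlite3

# In-memory stand-in for the file-list table built in the
# previous recipe (names are illustrative only).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE file_list (file_name TEXT, status TEXT DEFAULT 'PENDING')"
)
conn.executemany(
    "INSERT INTO file_list (file_name) VALUES (?)",
    [("orders_1.dat",), ("orders_2.dat",), ("orders_3.dat",)],
)

# A worker claims the first pending file and marks it as running,
# so that concurrent workers never pick up the same row.
cur = conn.execute(
    "SELECT rowid, file_name FROM file_list "
    "WHERE status = 'PENDING' ORDER BY rowid LIMIT 1"
)
rowid, name = cur.fetchone()
conn.execute(
    "UPDATE file_list SET status = 'RUNNING' WHERE rowid = ?", (rowid,)
)

print(name)  # → orders_1.dat
```

On a real database the claim step would be made atomic (for example, an `UPDATE ... WHERE status = 'PENDING'` that returns the claimed row), but the status-column idea is the same.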

How to do it...

  1. Create a new package called ProcessFiles.

  2. Create a new variable called FILE_PARAM and set it as Alphanumeric. Drag-and-drop the variable in the...