Book Image

Learning Informatica PowerCenter 10.x - Second Edition

By : Rahul Malewar
Book Image

Learning Informatica PowerCenter 10.x - Second Edition

By: Rahul Malewar

Overview of this book

Informatica PowerCenter is an industry-leading ETL tool, known for its accelerated data extraction, transformation, and data management strategies. This book will be your quick guide to exploring Informatica PowerCenter’s powerful features such as working on sources, targets, transformations, performance optimization, scheduling, deploying for processing, and managing your data at speed. First, you’ll learn how to install and configure tools. You will learn to implement various data warehouse and ETL concepts, and use PowerCenter 10.x components to build mappings, tasks, workflows, and so on. You will come across features such as transformations, SCD, XML processing, partitioning, constraint-based loading, Incremental aggregation, and many more. Moreover, you’ll also learn to deliver powerful visualizations for data profiling using the advanced monitoring dashboard functionality offered by the new version. Using data transformation technique, performance tuning, and the many new advanced features, this book will help you understand and process data for training or production purposes. The step-by-step approach and adoption of real-time scenarios will guide you through effectively accessing all core functionalities offered by Informatica PowerCenter version 10.x.
Table of Contents (20 chapters)
Title Page
Credits
About the Author
Acknowledgement
About the Reviewer
www.PacktPub.com
Customer Feedback
Preface

Working with Sources


Any file or table from where we can extract the data in PowerCenter is referred to as the Source. You can import or create the source definition.

When you import the source definition in Designer, we import only the metadata, that is column names, data type, size, indexes, constraints, dependencies, and so on. Actual data never comes with the Source structure in Designer. The data flows through the mapping in a row-wise manner when we execute the workflow in Workflow Manager.

PowerCenter allows you to work on various types of sources as listed here:

  • Relational Database: PowerCenter supports all the Relations Databases, such as Oracle, Sybase, DB2, Microsoft SQL Server, SAP HANA, and Teradata.
  • File: This includes flat files (fixed width and delimited files), COBOL Copybook files, XML files, and Excel files.
  • High-end applications: Hyperion, PeopleSoft, TIBCO, Web Sphere MQ, and so on can also be used.
  • Mainframe: Additional features of Mainframe such as IBM DB2 OS/390, IBM DB2...