Book Image

Microsoft SQL Server 2012 Integration Services: An Expert Cookbook

Book Image

Microsoft SQL Server 2012 Integration Services: An Expert Cookbook

Overview of this book

SQL Server Integration Services (SSIS) is a leading tool in the data warehouse industry - used for performing extraction, transformation, and load operations. This book is aligned with the most common methodology associated with SSIS known as Extract Transform and Load (ETL); ETL is responsible for the extraction of data from several sources, their cleansing, customization, and loading into a central repository normally called Data Warehouse or Data Mart.Microsoft SQL Server 2012 Integration Services: An Expert Cookbook covers all the aspects of SSIS 2012 with lots of real-world scenarios to help readers understand usages of SSIS in every environment. Written by two SQL Server MVPs who have in-depth knowledge of SSIS having worked with it for many years.This book starts by creating simple data transfer packages with wizards and illustrates how to create more complex data transfer packages, troubleshoot packages, make robust SSIS packages, and how to boost the performance of data consolidation with SSIS. It then covers data flow transformations and advanced transformations for data cleansing, fuzzy and term extraction in detail. The book then dives deep into making a dynamic package with the help of expressions and variables, and performance tuning and consideration.
Table of Contents (23 chapters)
Microsoft SQL Server 2012 Integration Services: An Expert Cookbook
Credits
Foreword
About the Authors
About the Reviewers
www.PacktPub.com
Preface
Index

Merge Join Transform: performing different types of joins in data flow


If you are familiar with T-SQL you may wonder why this transform exists when we can query source data joined in the T-SQL? Remember that in an ETL we may have source data from different types of sources such as flat files or XML. This transformation provides all types of joins such as Inner, Outer, and Left join.

There are differences between these three types of joins, which are explored in this recipe. In this recipe we have two data sources, from an Excel file and a Flat File, and we want to combine them in rows (join them) with the Merge Join Transform. We need to fetch all records that exist in both files or exist in one of the files but not in the other one.

How to do it...

  1. Create a new Integration Services project and name it R08_Merge Join.

  2. Add a Data Flow Task in the Control Flow, then go to the Data Flow tab.

  3. Add an Excel Source, create a new Excel connection manager, and set the Excel file to C:\SSIS\Ch04_Data Flow...