Book Image

Getting Started with Talend Open Studio for Data Integration

By : Jonathan Bowen
Book Image

Getting Started with Talend Open Studio for Data Integration

By: Jonathan Bowen

Overview of this book

Talend Open Studio for Data Integration (TOS) is an open source graphical development environment for creating custom integrations between systems. It comes with over 600 pre-built connectors that make it quick and easy to connect databases, transform files, load data, move, copy and rename files and connect individual components in order to define complex integration processes. "Getting Started with Talend Open Studio for Data Integration" illustrates common uses and scenarios in a simple, practical manner and, building on knowledge as the book progresses, works towards more complex integration solutions. TOS is a code generator and so does a lot of the "heavy lifting"ù for you. As such, it is a suitable tool for experienced developers and non-developers alike. You'll start by learning how to construct some common integrations tasks ñ transforming files and extracting data from a database, for example. These building blocks form a "toolkit"ù of techniques that you will learn how to apply in many different situations. By the end of the book, once complex integrations will appear easy and you will be your organization's integration expert! Best of all, TOS makes integrating systems fun!
Table of Contents (22 chapters)
Getting Started with Talend Open Studio for Data Integration
Credits
Foreword
Foreword
About the Author
Acknowledgement
About the Reviewers
www.PacktPub.com
Preface
Index

Jobs as subjobs


Talend Open Studio allows developers to use jobs within jobs. This is a great technique to use if you have tasks that can be used over and over again in many integration jobs. Over time, developers can build up a library of standard tasks, and componentizing in this way makes your development quicker and easier and promotes reuse of code, which is always good practice.

For example, suppose you have developed an FTP job that lists the files in a directory, connects to a remote FTP server, and transfers the files before deleting the source files. This FTP job can be designed generically so that it can be used for similar purposes in other jobs. Let's see what this might look like.

Our first job is the generic FTP job as shown in the following screenshot:

Our second job takes a delimited file, processes the data using a tMap component, and then writes the data to an XML file.

We can now enhance this job by bringing in the generic FTP job. In the Repository window, click on the...