Book Image

Getting Started with Talend Open Studio for Data Integration

By : Jonathan Bowen
Book Image

Getting Started with Talend Open Studio for Data Integration

By: Jonathan Bowen

Overview of this book

Talend Open Studio for Data Integration (TOS) is an open source graphical development environment for creating custom integrations between systems. It comes with over 600 pre-built connectors that make it quick and easy to connect databases, transform files, load data, move, copy and rename files and connect individual components in order to define complex integration processes. "Getting Started with Talend Open Studio for Data Integration" illustrates common uses and scenarios in a simple, practical manner and, building on knowledge as the book progresses, works towards more complex integration solutions. TOS is a code generator and so does a lot of the "heavy lifting"ù for you. As such, it is a suitable tool for experienced developers and non-developers alike. You'll start by learning how to construct some common integrations tasks ñ transforming files and extracting data from a database, for example. These building blocks form a "toolkit"ù of techniques that you will learn how to apply in many different situations. By the end of the book, once complex integrations will appear easy and you will be your organization's integration expert! Best of all, TOS makes integrating systems fun!
Table of Contents (22 chapters)
Getting Started with Talend Open Studio for Data Integration
About the Author
About the Reviewers

Database to database transfer

In a lot of the examples we have seen so far in the book, we have used files of different sorts to be the conduit for the data. Specifically, data is extracted to a file or imported from a file. However, it is perfectly possible for data to be passed directly from one system to another, which often means from one database to another. We will look at an example of this database-to-database transfer in this section. The job we will build will transfer data from our example database to a second database named DB2DB (there is a script to create the database and an associated table in the ResourceFiles folder of this chapter). The new database contains a table named order_data containing the following fields:

  • order_date

  • order_id

  • line_id

  • order_status

  • product_id

  • product_name

  • brand

  • unit_price

  • quantity

  • extended_price

All of these fields are contained within our original database, with the exception of extended_price, which we will have to construct. (Extended price is commonly...