Book Image

Getting Started with Talend Open Studio for Data Integration

By : Jonathan Bowen
Book Image

Getting Started with Talend Open Studio for Data Integration

By: Jonathan Bowen

Overview of this book

Talend Open Studio for Data Integration (TOS) is an open source graphical development environment for creating custom integrations between systems. It comes with over 600 pre-built connectors that make it quick and easy to connect databases, transform files, load data, move, copy and rename files and connect individual components in order to define complex integration processes. "Getting Started with Talend Open Studio for Data Integration" illustrates common uses and scenarios in a simple, practical manner and, building on knowledge as the book progresses, works towards more complex integration solutions. TOS is a code generator and so does a lot of the "heavy lifting"ù for you. As such, it is a suitable tool for experienced developers and non-developers alike. You'll start by learning how to construct some common integrations tasks ñ transforming files and extracting data from a database, for example. These building blocks form a "toolkit"ù of techniques that you will learn how to apply in many different situations. By the end of the book, once complex integrations will appear easy and you will be your organization's integration expert! Best of all, TOS makes integrating systems fun!
Table of Contents (22 chapters)
Getting Started with Talend Open Studio for Data Integration
Credits
Foreword
Foreword
About the Author
Acknowledgement
About the Reviewers
www.PacktPub.com
Preface
Index

Other useful software


In order to follow the sample jobs throughout the book, you may wish to install some additional software.

Text editor

A decent text editor will be very useful to view CSV and XML files. There are hundreds of text editors—both free and paid-for—and here are some recommendations if you don't already have a favorite:

MySQL

Chapter 4, Working with Databases, focuses on using the Studio to extract from and insert data into a relational database system. The Studio supports many different database systems, but for the examples in this book, we have chosen to use MySQL.

MySQL is the most popular open source relational database and is used by many large-scale applications and websites. It is free to use and there are a number of tools you can use to administer databases. To follow the examples as they are, use MySQL. However, if you have another preferred database you wish to use, it should not be too difficult to modify the job examples to incorporate other database components instead of the illustrated MySQL components.

MySQL Community Server can be downloaded from the following URL:

http://dev.mysql.com/downloads/mysql/

Installation instructions for various operating systems can be found at the following URL:

http://dev.mysql.com/doc/refman/5.1/en/installing.html

Once you have installed the MySQL server, download and install the client tools, which you can use to administer the database, view data, and so on. The MySQL Workbench can be downloaded from http://www.mysql.com/downloads/workbench/ .

MySQL Workbench documentation, including installation instructions, can be found at http://dev.mysql.com/doc/workbench/en/ .

Readers who wish to use other database systems can find a full list of supported databases at http://www.talendforge.org/components/ .

The list includes Oracle, DB2, MS SQL, Postgres, SQLite, and Sybase, among others. TOS also supports the JBDC API to connect to, and a relational database that supports this protocol.