Book Image

Getting Started with Talend Open Studio for Data Integration

By : Jonathan Bowen
Book Image

Getting Started with Talend Open Studio for Data Integration

By: Jonathan Bowen

Overview of this book

Talend Open Studio for Data Integration (TOS) is an open source graphical development environment for creating custom integrations between systems. It comes with over 600 pre-built connectors that make it quick and easy to connect databases, transform files, load data, move, copy and rename files and connect individual components in order to define complex integration processes. "Getting Started with Talend Open Studio for Data Integration" illustrates common uses and scenarios in a simple, practical manner and, building on knowledge as the book progresses, works towards more complex integration solutions. TOS is a code generator and so does a lot of the "heavy lifting"ù for you. As such, it is a suitable tool for experienced developers and non-developers alike. You'll start by learning how to construct some common integrations tasks ñ transforming files and extracting data from a database, for example. These building blocks form a "toolkit"ù of techniques that you will learn how to apply in many different situations. By the end of the book, once complex integrations will appear easy and you will be your organization's integration expert! Best of all, TOS makes integrating systems fun!
Table of Contents (22 chapters)
Getting Started with Talend Open Studio for Data Integration
Credits
Foreword
Foreword
About the Author
Acknowledgement
About the Reviewers
www.PacktPub.com
Preface
Index

Extracting data from Excel files


Spreadsheets are a ubiquitous business tool in the modern world, and there is a vast amount of critical data that resides in these common desktop files. As the tools are so commonplace and easy to use, spreadsheets are often the tool of choice for storing and manipulating all kinds of data. In this section, we'll look at a couple of ways to pull data from a spreadsheet. One of the most-used features of spreadsheets is the sheets functionality, which is the ability to add another page within the spreadsheet file. Sheets within a file may be closely related (for example, each sheet represents sales data for a given month) or may be less closely related (for example, a customers spreadsheet may contain customer data, such as the first name, last name, and e-mail in sheet 1, and address data in sheet 2). Instead of taking spreadsheet data and converting it into the CSV format before transforming it, the Studio has Excel components that allow us to address multiple...