Getting Started with Talend Open Studio for Data Integration

By : Jonathan Bowen

Overview of this book

Talend Open Studio for Data Integration (TOS) is an open source graphical development environment for creating custom integrations between systems. It comes with over 600 pre-built connectors that make it quick and easy to connect databases, transform files, load data, move, copy and rename files and connect individual components in order to define complex integration processes. "Getting Started with Talend Open Studio for Data Integration" illustrates common uses and scenarios in a simple, practical manner and, building on knowledge as the book progresses, works towards more complex integration solutions. TOS is a code generator and so does a lot of the "heavy lifting" for you. As such, it is a suitable tool for experienced developers and non-developers alike. You'll start by learning how to construct some common integration tasks: transforming files and extracting data from a database, for example. These building blocks form a "toolkit" of techniques that you will learn how to apply in many different situations. By the end of the book, once-complex integrations will appear easy and you will be your organization's integration expert! Best of all, TOS makes integrating systems fun!
Table of Contents (22 chapters)
Getting Started with Talend Open Studio for Data Integration
Credits
Foreword
Foreword
About the Author
Acknowledgement
About the Reviewers
www.PacktPub.com
Preface
Index

Order status updates


Once the orders have been processed by the suppliers, they will send back an order status update to the website system, confirming fulfillment. Here are the key features of this process:

  1. The suppliers FTP files to a directory on the website server. All suppliers follow the same format, and the filename convention is [supplier code]_shipped_[timestamp].csv, for example fab_shipped_201206060900.csv.

  2. The suppliers can supply these files many times per day on a schedule that is not predetermined.

  3. Once processed, the output file needs to be moved to the website's order status import directory.

  4. The file required by the website is in XML format and is to be named orderstatus_update.xml.

  5. The website has a constraint that an item cannot be shipped more times than it was ordered. If data is presented to the website that attempts to ship an order line that has already been shipped, the import process to the website will fail, requiring manual intervention. For standard merchandise...
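
The filename convention in point 1 can be checked in plain Java before a file is processed. The following is a minimal sketch only, not the book's Talend job: the class name ShippedFileName is hypothetical, the supplier code is assumed to consist of letters and digits, and the timestamp is assumed to be yyyyMMddHHmm, inferred from the single example fab_shipped_201206060900.csv.

```java
import java.time.LocalDateTime;
import java.time.format.DateTimeFormatter;
import java.time.format.DateTimeParseException;
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper (not part of Talend): parses supplier order status filenames
// of the form [supplier code]_shipped_[timestamp].csv.
final class ShippedFileName {
    // Assumption: supplier code is alphanumeric and the timestamp is exactly 12 digits.
    private static final Pattern NAME =
            Pattern.compile("^([A-Za-z0-9]+)_shipped_(\\d{12})\\.csv$");
    // Assumption: timestamp layout is yyyyMMddHHmm, inferred from the example filename.
    private static final DateTimeFormatter STAMP =
            DateTimeFormatter.ofPattern("yyyyMMddHHmm");

    final String supplierCode;
    final LocalDateTime timestamp;

    private ShippedFileName(String supplierCode, LocalDateTime timestamp) {
        this.supplierCode = supplierCode;
        this.timestamp = timestamp;
    }

    // Returns the parsed parts, or an empty Optional if the name does not
    // follow the convention or the timestamp is not a valid date and time.
    static Optional<ShippedFileName> parse(String fileName) {
        Matcher m = NAME.matcher(fileName);
        if (!m.matches()) {
            return Optional.empty();
        }
        try {
            LocalDateTime ts = LocalDateTime.parse(m.group(2), STAMP);
            return Optional.of(new ShippedFileName(m.group(1), ts));
        } catch (DateTimeParseException e) {
            return Optional.empty();
        }
    }
}
```

Calling ShippedFileName.parse("fab_shipped_201206060900.csv") yields the supplier code fab and a timestamp of 6 June 2012, 09:00, while a name such as orderstatus_update.xml yields an empty result. Because suppliers can deliver several files per day, the parsed timestamp could also be used to process files in the order they were produced.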