Book Image

Talend Open Studio Cookbook

By : Rick Barton
Book Image

Talend Open Studio Cookbook

By: Rick Barton

Overview of this book

Data integration is a key component of an organization's technical strategy, yet historically the tools have been very expensive. Talend Open Studio is the world's leading open source data integration product and has played a huge part in making open source data integration a popular choice for businesses worldwide.This book is a welcome addition to the small but growing library of Talend Open Studio resources. From working with schemas to creating and validating test data, to scheduling your Talend code, you will get acquainted with the various Talend database handling techniques. Each recipe is designed to provide the key learning point in a short, simple and effective manner.This comprehensive guide provides practical exercises that cover all areas of the Talend development lifecycle including development, testing, debugging and deployment. The book delivers design patterns, hints, tips, and advice in a series of short and focused exercises that can be approached as a reference for more seasoned developers or as a series of useful learning tutorials for the beginner.The book covers the basics in terms of schema usage and mappings, along with dedicated sections that will allow you to get more from tMap, files, databases and XML. Geared towards the whole lifecycle, the Talend Open Studio Cookbook shows readers great ways to handle everyday tasks, and provides an insight into all areas of a development cycle including coding, testing, and debugging of code to provide start-to-finish coverage of the product.
Table of Contents (21 chapters)
Talend Open Studio Cookbook
Credits
About the Author
About the Reviewers
www.PacktPub.com
Preface
Common Type Conversions
Index

Using tXMLMap to read XML


This recipe shows how we can convert an XML record stored in a file into a format that is readable by tXMLMap, and how we can then read and process the data in the XML record.

Getting ready

Open the job jo_cook_ch09_0010_readXMLFile.

How to do it...

The first stage of this process is to convert the XML file into Java Document format for use by the downstream component.

  1. Drag a tFileInputXML component onto the canvas.

  2. Edit the schema and add a column named payload. Make it a type of Document, as shown in the screenshot:

  3. Open the tFileInputXML component and change the File name/Stream field to context.cookbookData+"/chapter9/chapter09_jo_0010_customerData.xml".

  4. Change the Loop Xpath query field to "/".

  5. Add an Xpath query of ".", and tick the box Get Nodes.

  6. Your tFileInputXML should look like the one shown in the next screenshot:

    Reading using tXMLMap

  7. Add a tXMLMap component to the canvas and link to the tFileInputXML component.

  8. Open the tXMLMap component and right-click on payload...