Now that we have covered channels and sinks, we will now cover some of the more common ways to get data into your Flume agents. As discussed in Chapter 1, Overview and Architecture, the source is the input point for the Flume agent. There are many sources available with the Flume distribution as well as many open source options available. Like most open source software, if you can't find what you need, you can always write your own by extending the org.apache.flume.source.AbstractSource
class. Since the primary focus of this book is ingesting files of logs into Hadoop, we'll cover a few of the more appropriate sources to accomplish this.
Apache Flume: Distributed Log Collection for Hadoop
By :
Apache Flume: Distributed Log Collection for Hadoop
By:
Overview of this book
Table of Contents (16 chapters)
Apache Flume: Distributed Log Collection for Hadoop Second Edition
Credits
About the Author
About the Reviewers
www.PacktPub.com
Preface
Free Chapter
Overview and Architecture
A Quick Start Guide to Flume
Sinks and Sink Processors
Sources and Channel Selectors
Interceptors, ETL, and Routing
Putting It All Together
Monitoring Flume
There Is No Spoon – the Realities of Real-time Distributed Data Collection
Index
Customer Reviews