Chapter 5. Sources and Channel Selectors

Now that we have covered channels and sinks, we turn to some of the more common ways to get data into your Flume agents. As discussed in Chapter 1, Overview and Architecture, the source is the input point for the Flume agent. Many sources ship with the Flume distribution, and many more are available as open source options. As with most open source software, if you can't find what you need, you can always write your own by extending the org.apache.flume.source.AbstractSource class. Since the primary focus of this book is ingesting files of logs into Hadoop, we'll cover a few of the sources best suited to that task.
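
As a rough illustration of how a custom source might be wired into an agent, the following configuration fragment is a minimal sketch. It assumes a hypothetical class named com.example.flume.MyCustomSource that extends org.apache.flume.source.AbstractSource and is already on the agent's classpath. The agent name (agent), the source name (s1), and the channel name (c1) are placeholders chosen for this example.

```properties
# Minimal sketch: one custom source feeding one memory channel.
# "agent", "s1", and "c1" are placeholder names chosen for this example.
agent.sources = s1
agent.channels = c1

# For a custom source, the type is the fully qualified class name.
# com.example.flume.MyCustomSource is hypothetical; substitute your own class.
agent.sources.s1.type = com.example.flume.MyCustomSource
agent.sources.s1.channels = c1

# A memory channel keeps the sketch short; a sink would still be needed
# to drain the channel in a working agent.
agent.channels.c1.type = memory
```

Built-in sources are configured the same way, except that the type property takes a short alias in place of a fully qualified class name.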

Apache Flume: Distributed Log Collection for Hadoop - Second Edition

By: Steven Hoffman
