-
Book Overview & Buying
-
Table Of Contents
Apache Flume: Distributed Log Collection for Hadoop - Second Edition - Second Edition
By :
Apache Flume: Distributed Log Collection for Hadoop - Second Edition
By:
Overview of this book
If you are a Hadoop programmer who wants to learn about Flume to be able to move datasets into Hadoop in a timely and replicable manner, then this book is ideal for you. No prior knowledge about Apache Flume is necessary, but a basic knowledge of Hadoop and the Hadoop File System (HDFS) is assumed.
Table of Contents (11 chapters)
Preface
1. Overview and Architecture
2. A Quick Start Guide to Flume
4. Sinks and Sink Processors
5. Sources and Channel Selectors
6. Interceptors, ETL, and Routing
7. Putting It All Together
8. Monitoring Flume
9. There Is No Spoon – the Realities of Real-time Distributed Data Collection
Index