Apache Kafka

By: Nishant Garg

Overview of this book

Message publishing is a mechanism for connecting heterogeneous applications by routing messages between them, for example through a message broker such as Apache Kafka. Such solutions deal with real-time volumes of information and route it to multiple consumers without letting the information producers know who the final consumers are.

Apache Kafka is a practical, hands-on guide providing you with a series of step-by-step practical implementations, which will help you take advantage of the real power behind Kafka and give you a strong grounding for using it in your publisher-subscriber-based architectures.

Apache Kafka takes you through a number of clear, practical implementations that will help you to take advantage of the power of Apache Kafka, quickly and painlessly. You will learn everything you need to know for setting up Kafka clusters. This book explains how Kafka's basic blocks, such as producers, brokers, and consumers, actually work and fit together. You will then explore additional settings and configuration changes to achieve ever more complex goals. Finally, you will learn how Kafka works with other tools such as Hadoop and Storm.

You will learn everything you need to know to work with Apache Kafka in the right format, as well as how to leverage its power of handling hundreds of megabytes of messages per second from multiple clients.

Integration with other tools


This section discusses community contributions that provide integration of Apache Kafka with other tools for various needs, such as logging, packaging, cloud integration, and Hadoop integration.

Camus (https://github.com/linkedin/camus) is another project from LinkedIn, which provides a pipeline from Kafka to HDFS. In this project, a single MapReduce job performs the following steps to load data into HDFS in a distributed manner (a simplified sketch of the overall Kafka-to-HDFS pattern follows the list):

  1. First, it discovers the latest topics and partition offsets from ZooKeeper.

  2. Each task in the MapReduce job fetches events from the Kafka broker and commits the pulled data along with the audit count to the output folders.

  3. After the completion of the job, final offsets are written to HDFS, which can be further consumed by subsequent MapReduce jobs.

  4. Information about the consumed messages is also updated in the Kafka cluster.
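
To make the Camus flow easier to picture, the following is a minimal, hypothetical Java sketch of the same Kafka-to-HDFS pattern. It is not Camus code: it uses the plain Kafka consumer API and Hadoop's FileSystem API instead of a MapReduce job, and the broker address, topic name, group ID, and HDFS path are placeholder assumptions (it also assumes recent kafka-clients and hadoop-client libraries on the classpath):

    // A minimal, hypothetical sketch of the Kafka-to-HDFS pattern that Camus
    // automates; it is not Camus code. It pulls messages with the plain Kafka
    // consumer API and appends them to a file through Hadoop's FileSystem API.
    // Broker address, topic, group ID, and HDFS path below are placeholders.
    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.FSDataOutputStream;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;
    import org.apache.kafka.clients.consumer.ConsumerRecord;
    import org.apache.kafka.clients.consumer.ConsumerRecords;
    import org.apache.kafka.clients.consumer.KafkaConsumer;

    import java.nio.charset.StandardCharsets;
    import java.time.Duration;
    import java.util.Collections;
    import java.util.Properties;

    public class KafkaToHdfsSketch {
        public static void main(String[] args) throws Exception {
            // Consumer configuration (placeholder broker and group ID).
            Properties props = new Properties();
            props.put("bootstrap.servers", "localhost:9092");
            props.put("group.id", "kafka-to-hdfs-sketch");
            props.put("key.deserializer",
                      "org.apache.kafka.common.serialization.StringDeserializer");
            props.put("value.deserializer",
                      "org.apache.kafka.common.serialization.StringDeserializer");
            props.put("enable.auto.commit", "false");
            props.put("auto.offset.reset", "earliest");

            // Target file on the default Hadoop filesystem (placeholder path).
            Configuration conf = new Configuration();
            FileSystem fs = FileSystem.get(conf);
            Path output = new Path("/tmp/kafka-dump/events.txt");

            try (KafkaConsumer<String, String> consumer = new KafkaConsumer<>(props);
                 FSDataOutputStream out = fs.create(output, true)) {
                consumer.subscribe(Collections.singletonList("events"));

                // Pull a few batches of messages and append each value to HDFS.
                for (int i = 0; i < 10; i++) {
                    ConsumerRecords<String, String> records =
                            consumer.poll(Duration.ofSeconds(1));
                    for (ConsumerRecord<String, String> record : records) {
                        out.write((record.value() + "\n")
                                .getBytes(StandardCharsets.UTF_8));
                    }
                    // Committing offsets after each batch is the rough analogue of
                    // Camus writing final offsets back after its MapReduce job.
                    consumer.commitSync();
                }
            }
            fs.close();
        }
    }

In Camus itself, the per-task output folders and the offsets written back to HDFS and Kafka play the role of the single output file and the commitSync() call in this sketch, but in a distributed, fault-tolerant way.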

Some other useful contributions are:

  • Automated deployment and configuration of Kafka and ZooKeeper...