Apache Kafka - Real-time Stream Processing (Master Class) [Video]

By: Prashant Kumar Pandey
3.5 (2)

Overview of this book

If you want to understand stream processing, this course is for you. Using Kafka, it helps you get to grips with real-time stream processing and apply that knowledge to learn Kafka programming techniques. The course uses the Kafka Streams library available in Apache Kafka 2.x, and all the source code and examples have been tested on the Apache Kafka 2.3 open-source distribution. You'll explore Confluent Platform functionalities such as Schema Registry and Avro Serdes using the Confluent Community Version. The course makes extensive use of IntelliJ IDEA as the preferred development IDE and Apache Maven as the build tool. You'll use Log4J2 for industry-standard logging in your application and JUnit5 for implementing unit test cases. The code bundle is available at https://github.com/PacktPublishing/Apache-Kafka---Real-time-Stream-Processing-Master-Class
Table of Contents (14 chapters)
Chapter 2
Introduction to Real-time Streams
Section 1
Emergence of Bigdata - A Quick Recap
This first lecture of the course covers some history: where data processing started and where it is going. You will learn about the Big Data problem and how it arose. The lesson introduces the three whitepapers published by Google that started the big data movement and led to the development of Hadoop. It then discusses some shortcomings of Hadoop's approach to processing large volumes of data.