Scalable Data Streaming with Amazon Kinesis

By: Tarik Makota, Brian Maguire, Danny Gagne, Rajeev Chakrabarti

Overview of this book

Amazon Kinesis is a collection of secure, serverless, durable, and highly available purpose-built data streaming services. These services provide APIs and client SDKs that enable you to produce and consume data at scale. Scalable Data Streaming with Amazon Kinesis begins with a quick overview of the core concepts of data streams, along with the essentials of the AWS Kinesis landscape. You'll then explore the requirements of the use case used throughout the book to help you get started, and cover the key pain points encountered in the data stream life cycle. As you advance, you'll get to grips with the architectural components of Kinesis, understand how they are configured to build data pipelines, and delve into the applications that connect to them for consumption and processing. You'll also build a Kinesis data pipeline from scratch and learn how to implement and apply practical solutions. Moving on, you'll learn how to configure Kinesis on a cloud platform. Finally, you'll learn how other AWS services can be integrated into Kinesis. These services include Amazon Redshift, Amazon DynamoDB, Amazon S3, Amazon Elasticsearch Service, and third-party applications such as Splunk. By the end of this AWS book, you'll be able to build and deploy your own Kinesis data pipelines with Kinesis Data Streams (KDS), Kinesis Data Firehose (KFH), Kinesis Video Streams (KVS), and Kinesis Data Analytics (KDA).
Table of Contents (13 chapters)

Section 1: Introduction to Data Streaming and Amazon Kinesis
Section 2: Deep Dive into Kinesis
Section 3: Integrations

Data pipelines with Amazon Kinesis Data Streams

Now that we have learned how to create streams, producers, and consumers, we will design a simple data pipeline for SmartCity. A data pipeline is a series of processing steps applied to data as it flows from a source to a target destination. These steps can include automation for copying, transforming, routing, and loading source data into destinations such as business systems, data lakes, and data warehouses. A data pipeline should support the required data throughput, reliability, and latency. A well-architected design prevents many of the common problems that occur when collecting and loading data, such as data corruption, bottlenecks, conflicts between sources, and duplicate entries.
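To make the first pipeline step concrete, here is a minimal producer sketch using the AWS SDK for Python (boto3). The stream name smartcity-events and the sensor payload fields are illustrative assumptions, not values from the book.

import json
import boto3

# Assumed stream name for illustration; create it beforehand, for example:
#   aws kinesis create-stream --stream-name smartcity-events --shard-count 1
STREAM_NAME = "smartcity-events"

kinesis = boto3.client("kinesis")

def put_sensor_reading(sensor_id: str, temperature_c: float) -> None:
    """Send one hypothetical SmartCity sensor reading to the stream."""
    record = {"sensor_id": sensor_id, "temperature_c": temperature_c}
    kinesis.put_record(
        StreamName=STREAM_NAME,
        Data=json.dumps(record).encode("utf-8"),
        # The partition key determines which shard receives the record;
        # keying by sensor ID keeps each sensor's readings in order.
        PartitionKey=sensor_id,
    )

if __name__ == "__main__":
    put_sensor_reading("sensor-042", 21.5)

Keying by sensor ID preserves per-sensor ordering while still spreading load across shards as the number of sensors grows.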

Data pipeline design (simple)

This first design demonstrates receiving data from a single source. The data source producer is the Amazon Kinesis Agent deployed in the SmartCity data center...
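The Kinesis Agent is configured through a JSON file (by default /etc/aws-kinesis/agent.json) that maps monitored log files to a destination stream. The following is a minimal sketch of such a configuration; the log file pattern and the stream name smartcity-events are illustrative assumptions.

{
  "cloudwatch.emitMetrics": true,
  "flows": [
    {
      "filePattern": "/var/log/smartcity/sensors.log*",
      "kinesisStream": "smartcity-events",
      "partitionKeyOption": "RANDOM"
    }
  ]
}

With a flow like this in place, the agent tails each file matching the pattern and sends new lines as records to the named stream, handling file rotation, checkpointing, and retries on failure.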