Book Image

Big Data Architect's Handbook

By : Syed Muhammad Fahad Akhtar
Book Image

Big Data Architect's Handbook

By: Syed Muhammad Fahad Akhtar

Overview of this book

The big data architects are the “masters” of data, and hold high value in today’s market. Handling big data, be it of good or bad quality, is not an easy task. The prime job for any big data architect is to build an end-to-end big data solution that integrates data from different sources and analyzes it to find useful, hidden insights. Big Data Architect’s Handbook takes you through developing a complete, end-to-end big data pipeline, which will lay the foundation for you and provide the necessary knowledge required to be an architect in big data. Right from understanding the design considerations to implementing a solid, efficient, and scalable data pipeline, this book walks you through all the essential aspects of big data. It also gives you an overview of how you can leverage the power of various big data tools such as Apache Hadoop and ElasticSearch in order to bring them together and build an efficient big data solution. By the end of this book, you will be able to build your own design system which integrates, maintains, visualizes, and monitors your data. In addition, you will have a smooth design flow in each process, putting insights in action.
Table of Contents (21 chapters)
Preface
Free Chapter
1
Why Big Data?
2
Big Data Environment Setup
3
Hadoop Ecosystem
4
NoSQL Database
5
Off-the-Shelf Commercial Tools
6
Containerization
7
Network Infrastructure
8
Cloud Infrastructure
9
Security and Monitoring
10
Frontend Architecture
11
Backend Architecture
12
Machine Learning
13
Artificial Intelligence
14
Elasticsearch
15
Structured Data
16
Unstructured Data
17
Data Visualization
18
Financial Trading System
19
Retail Recommendation System
20
Other Books You May Enjoy
NoSQL Database

Nowadays, there is so much hype about NoSQL databases, especially in the big data world. People seem to be discussing different aspects of NoSQL and how they can get the most out of it. Different types of questions come to their minds, such as what is it? How it is different from RDBMS? How do I select an appropriate framework and tool while architecting my project?

In this chapter, we will go through NoSQL and answer all of these questions to build a strong foundation. We will then cover the following NoSQL databases from practical aspects, which includes their installation, basic configuration, and most of the operations that we normally perform in a database. We will be mainly discussing the following topics:

  • What is NoSQL?
  • Benefits of NoSQL
  • Comparison of NoSQL and RDBMS
  • CAP theorem and ACID properties
  • Different data models in NoSQL
  • Apache Cassandra
  • MongoDB
  • ...