Sign In Start Free Trial

Book Overview & Buying
Table Of Contents

Big Data Architect???s Handbook

By : Akhtar

3 (2)

Big Data Architect???s Handbook

3 (2)

By: Akhtar

Overview of this book

The big data architects are the “masters” of data, and hold high value in today’s market. Handling big data, be it of good or bad quality, is not an easy task. The prime job for any big data architect is to build an end-to-end big data solution that integrates data from different sources and analyzes it to find useful, hidden insights. Big Data Architect’s Handbook takes you through developing a complete, end-to-end big data pipeline, which will lay the foundation for you and provide the necessary knowledge required to be an architect in big data. Right from understanding the design considerations to implementing a solid, efficient, and scalable data pipeline, this book walks you through all the essential aspects of big data. It also gives you an overview of how you can leverage the power of various big data tools such as Apache Hadoop and ElasticSearch in order to bring them together and build an efficient big data solution. By the end of this book, you will be able to build your own design system which integrates, maintains, visualizes, and monitors your data. In addition, you will have a smooth design flow in each process, putting insights in action.

Preface

Preface

Who this book is for

What this book covers

To get the most out of this book

Get in touch

Free Chapter

Why Big Data?

Why Big Data?

What is big data?

Characteristics of big data

Solution-based approach for data

Big data glossary

Summary

Big Data Environment Setup

Big Data Environment Setup

Oracle VM VirtualBox installation

Ubuntu installation

Hadoop prerequisite installation

Apache Hadoop installation

Summary

Hadoop Ecosystem

Hadoop Ecosystem

Apache Hadoop

Hadoop Distributed File System

Hadoop MapReduce

YARN

Apache Projects related to big data

Summary

NoSQL Database

NoSQL Database

What is NoSQL?

Apache Cassandra

The MongoDB database

Neo4j database

Summary

Off-the-Shelf Commercial Tools

Off-the-Shelf Commercial Tools

Microsoft Azure

Building a practical application

Summary

Containerization

Containerization

Virtualization

What is containerization?

Docker

Kubernetes

Summary

Network Infrastructure

Network Infrastructure

Network

Network connectivity

Network visualization

Summary

Cloud Infrastructure

Cloud Infrastructure

Companies moving to cloud

Design considerations

Summary

Security and Monitoring

Security and Monitoring

Simple Network Management Protocol

Netflow

Nagios

Security Onion

Wireshark

Summary

Frontend Architecture

Frontend Architecture

React JS

Redux

Summary

Backend Architecture

Backend Architecture

API

RESTful API

Redis

Summary

Machine Learning

Machine Learning

Machine learning

Types of algorithms

Supervised learning

Unsupervised learning

Decision tree classifiers

Summary

Artificial Intelligence

Artificial Intelligence

Artificial intelligence

Convolutional neural networks

Deep learning using TensorFlow

Object detection using YOLO

Summary

Elasticsearch

Elasticsearch

Installing Elasticsearch

Kibana

Security

Understanding queries – CRUD commands

Summary

Structured Data

Structured Data

Data analysis

HBase

Sqoop

Summary

Unstructured Data

Unstructured Data

Moving data into Hadoop

Converting images into text for analysis

Summary

Data Visualization

Data Visualization

Matplotlib

D3.js

Summary

Financial Trading System

Financial Trading System

What is algorithmic trading?

Algorithmic trading strategies

Building an Expert Advisor

Summary

Retail Recommendation System

Retail Recommendation System

Types of recommendation system

Commercial tools

Book recommendation system

Summary

Other Books You May Enjoy

Other Books You May Enjoy

Leave a review - let other readers know what you think

Preface

Big data architects are the masters of data and hold high value in today’s market. Handling big data, be it of good or bad quality, is not an easy task. The prime task before any big data architect is to build an end-to-end big data solution that integrates data from different sources and analyzes it to find useful, hidden insights.
Big Data Architect's Handbook takes you through developing a complete, end-to-end big data pipeline that will lay the foundation for you and provide the necessary knowledge required to be an architect in big data. Right from understanding the design considerations to implementing a solid, efficient, and scalable data pipeline, this book walks you through all the essential aspects of big data. It also gives you an overview of how you can leverage the power of various big data tools such as Apache Hadoop and Elasticsearch in order to bring them together and build an efficient big data solution.
By the end of this book, you will be able to build your own design system that integrates, maintains, visualizes, and monitors your data. In addition, you will have a smooth design flow in each process, putting insights in action.

CONTINUE READING

83

Tech Concepts

36

Programming languages

73

Tech Tools

Unlimited access to the largest independent learning library in tech of over 8,000 expert-authored tech books and videos.

Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.

50+ new titles added per month and exclusive early access to books as they are being written.

Big Data Architect???s Handbook

Search

Your notes and bookmarks