Hands-on DevOps

Book Image

Hands-on DevOps

By : Sricharan Vadapalli

Book Image

Hands-on DevOps

By: Sricharan Vadapalli

Overview of this book

<p>DevOps strategies have really become an important factor for big data environments.</p> <p>This book initially provides an introduction to big data, DevOps, and Cloud computing along with the need for DevOps strategies in big data environments. We move on to explore the adoption of DevOps frameworks and business scenarios. We then build a big data cluster, deploy it on the cloud, and explore DevOps activities such as CI/CD and containerization. Next, we cover big data concepts such as ETL for data sources, Hadoop clusters, and their applications. Towards the end of the book, we explore ERP applications useful for migrating to DevOps frameworks and examine a few case studies for migrating big data and prediction models.</p> <p>By the end of this book, you will have mastered implementing DevOps tools and strategies for your big data clusters.</p>

Title Page

Credits

About the Author

About the Author

About the Reviewers

About the Reviewers

www.PacktPub.com

www.PacktPub.com

Customer Feedback

Customer Feedback

Preface

Free Chapter

Introduction to DevOps

Introduction to DevOps

DevOps application - business scenarios

Business drivers for DevOps adoption to big data

Planning the DevOps strategy

Benefits of DevOps

Introduction to Big Data and Data Sciences

Introduction to Big Data and Data Sciences

In-memory technology

NoSQL databases

Data visualization

DevOps Framework

DevOps Framework

DevOps best practices

DevOps frameworks

Big Data Hadoop Ecosystems

Big Data Hadoop Ecosystems

Big data Hadoop ecosystems

Big data clusters

Hadoop big data cluster nodes

Commercial Hadoop distributions

Capacity planning for systems

Cloud Computing

Cloud Computing

Cloud computing technologies

Multi-tier cloud architecture model

Cloud architectures

Cloud offerings

Backup and recovery

Building Big Data Applications

Building Big Data Applications

Traditional enterprise architecture

Principles to build big data enterprise applications

Big data systems life cycle

Building enterprise applications with Spark

DevOps - Continuous Integration and Delivery

DevOps - Continuous Integration and Delivery

Best practices for CI/CD

Git (SCM) integration with Jenkins

Maven (Build) tool Integration with Jenkins

Building jobs with Jenkins

Source code review - Gerrit

Installation of Gerrit

Repository management

Testing with Jenkins

Continuous delivery- Build Pipeline

Jenkins features

DevOps Continuous Deployment

DevOps Continuous Deployment

Nagios monitoring tool for infrastructure

Integrated dashboards for network analysis, monitoring, and bandwidth

Containers, IoT, and Microservices

Containers, IoT, and Microservices

Container orchestration

Internet of Things (IoT)

DevOps for Digital Transformation

DevOps for Digital Transformation

Digital transformation

Big data and DevOps

Cloud migration - DevOps

Migration to microservices - DevOps

Apps modernization

Architecture migration approach

Best practices for architectural and implementation considerations

DevOps for data science

DevOps for authentication and security

DevOps for IoT systems

DevOps Adoption by ERP Systems

DevOps Adoption by ERP Systems

DevOps Periodic Table

DevOps Periodic Table

Business Intelligence Trends

Business Intelligence Trends

Testing Types and Levels

Testing Types and Levels

Java Platform SE 8

Java Platform SE 8

Customer Reviews

5 star

0

4 star

0

3 star

0

2 star

0

1 star

0

Hadoop big data cluster nodes

We will discuss the different types of nodes along with their role and usage in Hadoop Ecosystem:

NameNode: The NameNode is an important part of an HDFS file system. It keeps the directory tree of all files in the file system, and tracks across where the cluster data files are stored. The data for these files is not stored at all. Client applications communicate with NameNode whenever there is a need to locate a file, or when they want to modify a file. The modifications are stored by NameNode as a log that is appended to a native file system file edits. When a NameNode starts up, it reads the HDFS state from an image file, fsimage, and then applies the edits to the log file.
Secondary NameNode: Secondary NameNode's whole purpose is to have a checkpoint in HDFS. The Secondary NameNode is just a helper node for NameNode; it merges the fsimage and the edits log files periodically and keeps edits log size within a limit.
DataNode: A DataNode stores data in HDFS....