Mastering Mesos

Mastering Mesos

By : Dipa Dubhashi, Akhil Das

Buy this Book

Mastering Mesos

By: Dipa Dubhashi, Akhil Das

Buy this Book

Overview of this book

Apache Mesos is open source cluster management software that provides efficient resource isolations and resource sharing distributed applications or frameworks. This book will take you on a journey to enhance your knowledge from amateur to master level, showing you how to improve the efficiency, management, and development of Mesos clusters. The architecture is quite complex and this book will explore the difficulties and complexities of working with Mesos. We begin by introducing Mesos, explaining its architecture and functionality. Next, we provide a comprehensive overview of Mesos features and advanced topics such as high availability, fault tolerance, scaling, and efficiency. Furthermore, you will learn to set up multi-node Mesos clusters on private and public clouds. We will also introduce several Mesos-based scheduling and management frameworks or applications to enable the easy deployment, discovery, load balancing, and failure handling of long-running services. Next, you will find out how a Mesos cluster can be easily set up and monitored using the standard deployment and configuration management tools. This advanced guide will show you how to deploy important big data processing frameworks such as Hadoop, Spark, and Storm on Mesos and big data storage frameworks such as Cassandra, Elasticsearch, and Kafka.

Mastering Mesos

Credits

About the Authors

About the Reviewer

www.PacktPub.com

Preface

Free Chapter

Introducing Mesos

Introduction to the datacenter OS and architecture of Mesos

The architecture of Mesos

Introduction to frameworks

The attributes and resources of Mesos

Summary

Mesos Internals

Scaling and efficiency

Reservation

Mesos modules

High availability and fault tolerance

Reconciliation

Persistent Volumes

Summary

Getting Started with Mesos

Virtual Machine (VM) instances

Setting up a multi-node Mesos cluster on Amazon Web Services (AWS)

Setting up a multi-node Mesos cluster on Google Compute Engine (GCE)

Setting up a multi-node Mesos cluster on Microsoft Azure

Setting up a multi-node Mesos cluster on your private datacenter

Debugging and troubleshooting

Summary

Service Scheduling and Management Frameworks

Using Marathon to launch and manage long-running applications on Mesos

Multi-node Marathon cluster setup

Chronos as a cluster scheduler

Chronos plus Marathon

Introduction to Apache Aurora

Introduction to Singularity

Service discovery using Marathoner

Service discovery using Consul

Load balancing with HAProxy

Bamboo - Automatically configuring HAProxy for Mesos plus Marathon

Introduction to Netflix Fenzo

Introduction to PaaSTA

A comparative analysis of different Scheduling/Management frameworks

Summary

Mesos Cluster Deployment

Deploying and configuring a Mesos cluster using Ansible

Deploying and configuring Mesos cluster using Puppet

Deploying and configuring a Mesos cluster using SaltStack

Deploying and configuring a Mesos cluster using Chef

Deploying and configuring a Mesos cluster using Terraform

Deploying and configuring a Mesos cluster using Cloudformation

Creating test environments using Playa Mesos

Monitoring the Mesos cluster using Nagios

Monitoring the Mesos cluster using Satellite

Common deployment issues and solutions

Summary

Mesos Frameworks

Introduction to Mesos frameworks

Frameworks – Authentication, authorization, and access control

The Mesos API

Building a custom framework on Mesos

Summary

Mesos Containerizers

Containers

Docker

Mesos containerizer

Networking for Mesos-managed containers

Mesos Image Provisioner

Mesos fetcher

Deploying containerized apps using Docker and Mesos

Summary

Mesos Big Data Frameworks

Summary

Mesos Big Data Frameworks 2

Cassandra on Mesos

The Elasticsearch-Logstash-Kibana (ELK) stack on Mesos

Kafka on Mesos

Summary

Index

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Two-level scheduling

Mesos has a two-level scheduling mechanism to allocate resources to and launch tasks on different frameworks. In the first level, the master process that manages slave processes running on each node in the Mesos cluster determines the free resources available on each node, groups them, and offers them to different frameworks based on organizational policies, such as priority or fair sharing. Organizations have the ability to define their own sharing policies via a custom allocation module as well.

In the second level, each framework's scheduler component that is registered as a client with the master accepts or rejects the resource offer made depending on the framework's requirements. If the offer is accepted, the framework's scheduler sends information regarding the tasks that need to be executed and the number of resources that each task requires to the Mesos master. The master transfers the tasks to the corresponding slaves, which assign the necessary resources to the framework's executor component, which manages the execution of all the required tasks in containers. When the tasks are completed, the containers are dismantled, and the resources are freed up for use by other tasks.

The following diagram and explanation from the Apache Mesos documentation (http://mesos.apache.org/documentation/latest/architecture/) explains this concept in more detail:

Let's have a look at the pointers mentioned in the preceding diagram:

1: Slave 1 reports to the master that it has four CPUs and 4 GB of memory free. The master then invokes the allocation module, which tells it that Framework 1 should be offered all the available resources.
2: The master sends a resource offer describing these resources to Framework 1.
3: The framework's scheduler replies to the master with information about two tasks to run on the slave using two CPUs and 1 GB RAM for the first task and one CPU and 2 GB RAM for the second task.
4: The master sends the tasks to the slave, which allocates appropriate resources to the framework's executor, which in turn launches the two tasks. As one CPU and 1 GB of RAM are still free, the allocation module may now offer them to Framework 2. In addition, this resource offers process repeats when tasks finish and new resources become free.

Mesos also provides frameworks with the ability to reject resource offers. A framework can reject the offers that do not meet its requirements. This allows frameworks to support a wide variety of complex resource constraints while keeping Mesos simple at the same time. A policy called delay scheduling, in which frameworks wait for a finite time to get access to the nodes storing their input data, gives a fair level of data locality albeit with a slight latency tradeoff.

If the framework constraints are complex, it is possible that a framework might need to wait before it receives a suitable resource offer that meets its requirements. To tackle this, Mesos allows frameworks to set filters specifying the criteria that they will use to always reject certain resources. A framework can set a filter stating that it can run only on nodes with at least 32 GB of RAM space free, for example. This allows it to bypass the rejection process, minimizes communication overheads, and thus reduces overall latency.

Mastering Mesos

By : Dipa Dubhashi, Akhil Das

Mastering Mesos

By: Dipa Dubhashi, Akhil Das

Overview of this book

Related Content you might be interested in

Current Title:

Mastering Mesos

Two-level scheduling