Apache Mesos Essentials

Apache Mesos Essentials

By : Dharmesh Kakadia

Buy this Book

Apache Mesos Essentials

By: Dharmesh Kakadia

Buy this Book

Overview of this book

Apache Mesos is a cluster manager that provides efficient resource isolation and sharing across distributed applications, or frameworks. It allows developers to concurrently run the likes of Hadoop, Spark, Storm, and other applications on a dynamically shared pool of nodes. With Mesos, you have the power to manage a wide range of resources in a multi-tenant environment. Starting with the basics, this book will give you an insight into all the features that Mesos has to offer. You will first learn how to set up Mesos in various environments from data centers to the cloud. You will then learn how to implement self-managed Platform as a Service environment with Mesos using various service schedulers, such as Chronos, Aurora, and Marathon. You will then delve into the depths of Mesos fundamentals and learn how to build distributed applications using Mesos primitives. Finally, you will round things off by covering the operational aspects of Mesos including logging, monitoring, high availability, and recovery.

Apache Mesos Essentials

Credits

About the Author

About the Reviewers

www.PacktPub.com

Preface

Free Chapter

Running Mesos

Modern data centers

Cluster computing frameworks

Introducing Mesos

Why Mesos?

Single-node Mesos clusters

Running test frameworks

Mesos Web UI

Multi-node Mesos clusters

Mesos cluster on Amazon EC2

Running Mesos using Vagrant

The Mesos community

Summary

Running Hadoop on Mesos

An introduction to Hadoop

Hadoop on Mesos

Installing Hadoop on Mesos

An example Hadoop job

Advanced configuration for Hadoop on Mesos

Summary

Running Spark on Mesos

Introducing Spark

Spark job scheduling

Spark Standalone mode

Spark on Mesos

Tuning Spark on Mesos

Summary

Complex Data Analysis on Mesos

Complex data and the rise of the Lambda architecture

Storm

Spark Streaming

NoSQL on Mesos

Running Services on Mesos

Introduction to services

Marathon

Chronos

Aurora

Service discovery

Packaging

Summary

Understanding Mesos Internals

The Mesos architecture

Summary

Developing Frameworks on Mesos

The Mesos API

Developing a Mesos framework

Building our framework

Advanced topics

Developer resources

Summary

Administering Mesos

Upgrade

Index

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Spark Standalone mode

Spark standalone mode uses a simple built-in cluster manager. The Spark standalone mode cluster manager can also be run on a single machine, and it is great way to try out Spark (http://spark.apache.org/docs/latest/spark-standalone.html). In Standalone mode, Spark requires Java and Git installed on the system. Let's install the dependencies. On Ubuntu, the following command will install both Java and Git:

ubuntu@master:~$ sudo apt-get install openjdk-7-jdk git

The following are the steps to install Spark in standalone mode:

Download the latest Spark tarball from http://spark.apache.org/downloads.html. Spark packages are prebuilt compatible with a specific version of Hadoop and Mesos. At the time of writing, the latest version is 1.2, which is compatible with 0.18 version of Mesos and many versions of Hadoop, including the latest version, 2.4.

Extract the downloaded tar file and go to the following directory:

ubuntu@master:~$ tar –xzf spark-*.tar.gz
ubuntu@master:~$ cd...

Apache Mesos Essentials

By : Dharmesh Kakadia

Apache Mesos Essentials

By: Dharmesh Kakadia

Overview of this book

Related Content you might be interested in

Current Title:

Apache Mesos Essentials

Spark Standalone mode