Book Image

Apache Mesos Essentials

By : Dharmesh Kakadia
Book Image

Apache Mesos Essentials

By: Dharmesh Kakadia

Overview of this book

<p>Apache Mesos is a cluster manager that provides efficient resource isolation and sharing across distributed applications, or frameworks. It allows developers to concurrently run the likes of Hadoop, Spark, Storm, and other applications on a dynamically shared pool of nodes. With Mesos, you have the power to manage a wide range of resources in a multi-tenant environment.</p> <p>Starting with the basics, this book will give you an insight into all the features that Mesos has to offer. You will first learn how to set up Mesos in various environments from data centers to the cloud. You will then learn how to implement self-managed Platform as a Service environment with Mesos using various service schedulers, such as Chronos, Aurora, and Marathon. You will then delve into the depths of Mesos fundamentals and learn how to build distributed applications using Mesos primitives.</p> <p>Finally, you will round things off by covering the operational aspects of Mesos including logging, monitoring, high availability, and recovery.</p>
Table of Contents (15 chapters)
Apache Mesos Essentials
Credits
About the Author
About the Reviewers
www.PacktPub.com
Preface
Index

Complex data and the rise of the Lambda architecture


The explosive growth in big data is not only in terms of the volume being generated but also in terms of the variety and speed at which the results have to be generated in order to be meaningful. Thus, the velocity of the data and computation has forced developers toward real-time stream processing frameworks, and at the same time, the variety and unstructured nature of data has led to the NoSQL movement.

With the rise of the Internet of Things (IoT), sensors, social media, machine transactions, monitoring data, and so on are being produced at a very large scale and velocity. The insights provided by this data can be very valuable, but the analysis and the data itself do not make sense if results are produced with a delay, or analysis is done on the stale data. In the previous chapters, we looked at how large amounts of data can be processed using Hadoop and Spark. These traditional tools are very well suited for batch or offline analysis...