YARN Essentials

By: Fasale, Nirmal Kumar

Overview of this book

If you have a working knowledge of Hadoop 1.x but want to start afresh with YARN, this book is ideal for you. You will be able to install and administer a YARN cluster and also discover the configuration settings to fine-tune your cluster both in terms of performance and scalability. This book will help you develop, deploy, and run multiple applications/frameworks on the same shared YARN cluster.

Table of Contents (12 chapters)

1. Need for YARN
9. YARN – Alternative Solutions
Index

Journey – present and future

YARN was introduced with the Hadoop 0.23 release on November 11, 2011.

Since then, there has been no looking back, and a number of releases have followed.

Finally, on October 15, 2013, Apache Hadoop 2.2.0 became the GA (General Availability) release of Apache Hadoop 2.x.

In October 2013, Apache Hadoop YARN won the Best Paper award at ACM SoCC (Symposium on Cloud Computing) 2013.

Apache Hadoop 2.x, powered by YARN, is arguably the best platform for Hadoop ecosystem components such as MapReduce, Apache Hive, and Apache Pig, which use HDFS as the underlying data storage.

YARN has also been embraced by other open source communities for frameworks such as Apache Giraph, Apache Tez, Apache Spark, Apache Flink, and many others.

Vendors such as HP, Microsoft, SAS, Teradata, SAP, and Red Hat, among others, are moving towards YARN to run their existing products and services on Hadoop.

People willing to modify applications can already use YARN directly...