Learning AWS - Second Edition

By: Aurobindo Sarkar, Amit Shah

Overview of this book

Amazon Web Services (AWS) is the most popular and widely used cloud platform. Administering and deploying applications on AWS can help make them resilient and robust. This book first covers the basic concepts of cloud-based development and then shows how to run solutions in the AWS Cloud so that they operate at scale. It guides you through the trade-offs and ideas behind efficient cloud applications and serves as a comprehensive guide to getting the most out of AWS. In the first section, you will look at the key concepts of AWS, set up your AWS account, and learn to operate it. This guide also covers cloud service models, which will help you build highly scalable and secure applications on the AWS platform. We will then dive deep into cloud computing concepts with S3 storage, RDS, and EC2. Next, the book walks you through VPC, building real-time serverless environments, and deploying serverless APIs with microservices. Finally, it teaches you to monitor your applications, automate your infrastructure, and deploy with CloudFormation. By the end of this book, you will be well versed in the various services that AWS provides and will be able to leverage AWS infrastructure to accelerate the development process.
Table of Contents (12 chapters)

Configuring and using EMR-Spark clusters

In this section, we will present two simple examples of EMR clusters suitable for basic Spark development. In the first example, we will spin up an EMR cluster, start the Spark shell, and do some Spark-Scala work. In the second example, we will spin up an EMR cluster and run a simple Spark program.

Follow the step-by-step instructions specified next for this hands-on exercise:

  1. Log in to the AWS Management Console, open the Amazon EMR console, and click on the Create cluster button.
  2. We will use Create Cluster - Quick Options to configure our cluster. Specify a name for the cluster (FirstEMRSparkClusterUsingQuickOptions) and set Launch mode to Cluster.

In the Software configuration section, ensure that the latest available EMR release is selected, and select the Spark option.

Leave the defaults for Hardware...
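
The console choices described above can also be expressed as a request to the EMR API. The following is a minimal sketch in Python, not the book's own method: it only builds the keyword arguments that boto3's `run_job_flow` call accepts, so it runs without AWS credentials. The helper name `build_emr_spark_request`, the release label `emr-6.15.0`, the instance type `m5.xlarge`, the instance count, and the region in the commented-out call are all assumptions chosen for illustration. The role names are the EMR default roles and must already exist in your account.

```python
def build_emr_spark_request(name, release_label,
                            instance_type="m5.xlarge",  # assumed, not from the text
                            instance_count=3,           # assumed, not from the text
                            key_name=None):
    """Build keyword arguments for boto3's EMR run_job_flow call.

    Hypothetical helper: it mirrors the Quick Options choices
    (cluster name, Launch mode = Cluster, Spark application).
    """
    instances = {
        "MasterInstanceType": instance_type,
        "SlaveInstanceType": instance_type,
        "InstanceCount": instance_count,
        # Launch mode "Cluster": keep the cluster running when no steps remain.
        "KeepJobFlowAliveWhenNoSteps": True,
    }
    if key_name:
        # Optional EC2 key pair, needed to SSH in and start the Spark shell.
        instances["Ec2KeyName"] = key_name

    return {
        "Name": name,
        "ReleaseLabel": release_label,
        "Applications": [{"Name": "Spark"}],
        "Instances": instances,
        "JobFlowRole": "EMR_EC2_DefaultRole",  # default EC2 instance profile
        "ServiceRole": "EMR_DefaultRole",      # default EMR service role
    }


if __name__ == "__main__":
    request = build_emr_spark_request(
        "FirstEMRSparkClusterUsingQuickOptions",
        "emr-6.15.0",  # placeholder; pick the latest release available to you
    )
    print(request)

    # To actually launch the cluster (incurs AWS charges), uncomment:
    # import boto3
    # emr = boto3.client("emr", region_name="us-east-1")  # region is an assumption
    # response = emr.run_job_flow(**request)
    # print(response["JobFlowId"])
```

Keeping the request construction separate from the API call makes it easy to inspect the configuration before any resources are created.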