Learn AWS Serverless Computing

By: Scott Patterson
Overview of this book

Serverless computing is a way to run your code without having to provision or manage servers. Amazon Web Services provides serverless services that you can use to build and deploy cloud-native applications. Starting with the basics of AWS Lambda, this book takes you through combining Lambda with other services from AWS, such as Amazon API Gateway, Amazon DynamoDB, and AWS Step Functions. You’ll learn how to write, run, and test Lambda functions using examples in Node.js, Java, Python, and C# before you move on to developing and deploying serverless APIs efficiently using the Serverless Framework. In the concluding chapters, you’ll discover tips and best practices for leveraging the Serverless Framework to increase your development productivity. By the end of this book, you’ll have become well-versed in building, securing, and running serverless applications using Amazon API Gateway and AWS Lambda without having to manage any servers.
Table of Contents (20 chapters)

Section 1: Why We're Here
Section 2: Getting Started with AWS Lambda Functions
Section 3: Development Patterns
Section 4: Architectures and Use Cases

Exploring the units of compute

In the previous section, we were reminded of the progression from building data centers to consuming cloud resources. We can relate this shift to how we provision, deploy, and scale our compute resources. Let's have a look at how our thinking has moved from deploying physical servers, to scaling whole applications, and finally to scaling at the level of individual application functions:

  • Physical servers: Scale at the physical server layer
  • Virtual machines: Density efficiencies achieved by virtualizing the hardware
  • Containers: Better density with faster start times
  • Functions: Best density by abstracting the runtime

Physical servers – scale at the physical server layer

When designing for scale in a physical server world, we needed to predict when our peak load might exceed our current capacity. This is the point at which one or more of our servers are fully utilized and become a bottleneck for the application or service. With enough foresight, we could order new server hardware, get it racked, and bring it into service before it was needed. Unfortunately, internet scale doesn't work like that.

The lead time and work behind getting that new gear into service could include the following processes:

  • Securing a capital expenditure budget from finance or the Project Management Office (PMO)
  • Procuring hardware from a vendor
  • Capacity planning to confirm that there is space for the new hardware
  • Updates to the colocation agreement with the hosting provider
  • Scheduling engineers to travel to the site
  • Licensing considerations for new software being installed

The unit of scale here is an entire server and, as you can see from the preceding list, there is a considerable amount of work ahead of you once you make the decision to scale. Compounding the problem, once you have scaled up, there's no scaling back: the new hardware will live in service for years, and your baseline operational costs have increased for the duration.

Virtual machines – density efficiencies achieved by virtualizing the hardware

The drawback of scaling by physical nodes is that we always have to spec out the gear for peak load. This means that, during periods of low load, the server isn't doing much work and we have spare capacity.

If we could run more workloads on this single server, we could achieve more efficient use of the hardware. This is what we mean when we talk about density: the number of workloads that we can cram into a server, virtual machine, or operating system. Here we introduce the hypervisor. A hypervisor is a layer of software that abstracts the server's operating system and applications from the underlying hardware.

By running a hypervisor on the host server, we can share hardware resources between more than one virtual operating system running simultaneously on the host. We call these guest machines or, more commonly, virtual machines (VMs). Each VM operates independently of the others, allowing multiple tenants to use the same physical host.

The following diagram shows how the layers can be visualized. The hypervisor sits between the host operating system and the virtual machines, providing a layer of translation so that the guest operating systems can communicate with the underlying hardware:

Multiple virtual machines running on an abstracted hardware layer (IaaS)

Now, our unit of scale is the virtual machine. Virtual machines are configured and deployed in minutes using the hypervisor management software or through the APIs that are provided. The life cycle of a VM is typically weeks or months, though it can sometimes be years.
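To make this concrete, here is a minimal sketch of launching a VM through an API using boto3, the AWS SDK for Python, against Amazon EC2. It assumes AWS credentials and a default network are already configured, and the AMI ID is only a placeholder:

import boto3  # AWS SDK for Python; assumes credentials and a default VPC are configured

ec2 = boto3.resource("ec2")

# Launch a single small virtual machine. The AMI ID is a placeholder;
# substitute an image that exists in your region.
instances = ec2.create_instances(
    ImageId="ami-0123456789abcdef0",  # placeholder AMI ID
    InstanceType="t3.micro",
    MinCount=1,
    MaxCount=1,
)
print(instances[0].id)  # the new VM is typically booted and in service within minutes

The API call itself returns almost immediately; it's the provisioning and boot of the guest operating system that takes minutes.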

When we add virtual machines to a physical server, we also have to take into consideration that the underlying CPU and memory resources are finite.

With virtual machines, we still have to run and maintain entire operating systems. The next evolution of compute utilizes the hardware more efficiently.

Containers – better density with faster start times

Of course, a virtual machine can still sit under-utilized or even idle when demand is low. If you've ever worked in a role managing a fleet of servers, you will have noticed the dramatic under-utilization of compute during low-demand periods. Ideally, we don't want the overhead of deploying and configuring a full operating system every time we need to run a workload.

To achieve this, we can make use of containerization technologies such as Docker. Docker allows applications running on an operating system to be isolated from each other by running inside a container. Each container holds only the application and associated dependencies (binaries and libraries):

Containers sharing an operating system, orchestrated by a control plane (PaaS)

For every operating system running on a physical server, we can run many container workloads in that same OS. Containers are more efficient to run, partly because they share the resources of a single operating system. Because each container doesn't have to run its own full operating system environment, you can achieve better density with containers than with virtual machines. This means we can make better use of the hardware resources that are available.

By using containers to isolate workloads, we can increase the density of workloads running in an operating system by a factor of tens or even hundreds.

It may take minutes to deploy and configure a VM, but it takes only seconds to initialize a container. This makes it much more efficient to add capacity during a scale event, which is done by spinning up more containers. It also means you can achieve finer granularity of scale, so you're not deploying more capacity than you need at the time. Our unit of scale here is down to the application level, and the life cycle of a container typically runs for minutes or hours. A request or system may only use part of an application to achieve a result.
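As a rough illustration, here is a minimal sketch using the Docker SDK for Python to add capacity by starting extra containers from an existing image. The image name is hypothetical, and a local Docker daemon is assumed to be running:

import docker  # Docker SDK for Python; assumes a local Docker daemon is running

client = docker.from_env()

# Add capacity during a scale event by starting two more containers from the
# same (hypothetical) application image. Each starts in seconds because it
# shares the host kernel rather than booting a full operating system.
for _ in range(2):
    container = client.containers.run(
        "example/orders-api:latest",  # hypothetical image name
        detach=True,                  # return immediately; the container keeps running
    )
    print(container.short_id)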

The next evolution of compute breaks up the applications into functions so that we can achieve further efficiencies of infrastructure use.

Functions – best density by abstracting the runtime

Finally, the next evolutionary stage in our units of compute is functions. While containerization technologies such as Docker are commonplace in cloud-native applications, we can go a step further by dividing our applications into discrete functions. Function as a Service (FaaS) is another level of abstraction: we get to choose the runtime our code executes on and a few execution parameters, such as memory allocation, and that's about it. The hardware, operating system, and runtime are all managed for you.

The article on serverless architectures at https://martinfowler.com/articles/serverless.html puts it this way: "Fundamentally, FaaS is about running backend code without managing your own server systems or your own long-lived server applications."

Functions running on an abstracted runtime (FaaS)

AWS was the first of the major cloud providers to release a FaaS service, AWS Lambda, in 2014, and much of this book focuses on this service as a core building block for applications.
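As a first taste of what a Lambda function looks like, here is a minimal sketch of a Python handler; the function name, event field, and response shape are illustrative rather than prescriptive:

import json

# A minimal Lambda handler. Lambda calls this function with an event payload
# (for example, an API Gateway request) and a context object that exposes
# runtime metadata such as the remaining execution time.
def handler(event, context):
    name = event.get("name", "world")  # illustrative event field
    return {
        "statusCode": 200,
        "body": json.dumps({"message": f"Hello, {name}!"}),
    }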

There are many reasons and benefits behind running serverless compute, as follows:

  • You don't pay for idle resources. This means you only pay when your code is running.
  • Scaling is near-instantaneous and continuous: you can run multiple instances of the same code in parallel.
  • A sequential batch process running on a server is capped by the maximum throughput of that server's resources. With serverless, the same process can run in parallel, saving processing time and delivering results sooner (see the sketch after this list). Our unit of scale here is the function, so we can scale relative to the size of the workload.
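For instance, here is a minimal sketch of fanning a batch out across parallel executions with boto3. The function name and payload are hypothetical, and AWS credentials are assumed to be configured:

import json

import boto3  # assumes AWS credentials are configured

lambda_client = boto3.client("lambda")

# Fan a batch of records out to a hypothetical function named "process-record".
# Each asynchronous invocation runs as its own parallel execution, so the batch
# isn't limited by a single server's throughput.
records = [{"id": i} for i in range(100)]
for record in records:
    lambda_client.invoke(
        FunctionName="process-record",  # hypothetical function name
        InvocationType="Event",         # asynchronous: return without waiting for the result
        Payload=json.dumps(record),
    )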

One of the biggest shifts, and the one most pertinent to me, is that serverless means developers can focus more on building the code that directly relates to outcomes for the business. Developers don't have to grind their gears understanding how to harden an operating system, how to maintain configurations across platforms, or how to tune an application server to squeeze the most performance out of their runtimes.

The tasks of system administration and operations have also changed significantly. With the complexities of the platform abstracted away, the traditional tasks involved in making sure the underlying infrastructure is available have shifted. Instead, serverless operations focus more on proactive tasks, such as monitoring service latency and observing transactions, using meaningful data to ensure they are completed successfully.

I believe that the future of modern organizations lies in the minds and capabilities of visionary developers, and serverless compute reduces the complexities of turning their ideas into reality.

Functions can be deployed in milliseconds and typically live for seconds. Long-running processes may not be the best fit for a serverless workload, but we'll get to that later. Next, let's learn how these evolutions in hardware abstraction have influenced software architectures.