Book Image

Geospatial Data Analytics on AWS

By : Scott Bateman, Janahan Gnanachandran, Jeff DeMuth
Book Image

Geospatial Data Analytics on AWS

By: Scott Bateman, Janahan Gnanachandran, Jeff DeMuth

Overview of this book

Managing geospatial data and building location-based applications in the cloud can be a daunting task. This comprehensive guide helps you overcome this challenge by presenting the concept of working with geospatial data in the cloud in an easy-to-understand way, along with teaching you how to design and build data lake architecture in AWS for geospatial data. You’ll begin by exploring the use of AWS databases like Redshift and Aurora PostgreSQL for storing and analyzing geospatial data. Next, you’ll leverage services such as DynamoDB and Athena, which offer powerful built-in geospatial functions for indexing and querying geospatial data. The book is filled with practical examples to illustrate the benefits of managing geospatial data in the cloud. As you advance, you’ll discover how to analyze and visualize data using Python and R, and utilize QuickSight to share derived insights. The concluding chapters explore the integration of commonly used platforms like Open Data on AWS, OpenStreetMap, and ArcGIS with AWS to enable you to optimize efficiency and provide a supportive community for continuous learning. By the end of this book, you’ll have the necessary tools and expertise to build and manage your own geospatial data lake on AWS, along with the knowledge needed to tackle geospatial data management challenges and make the most of AWS services.
Table of Contents (23 chapters)
1
Part 1: Introduction to the Geospatial Data Ecosystem
4
Part 2: Geospatial Data Lakes using Modern Data Architecture
10
Part 3: Analyzing and Visualizing Geospatial Data in AWS
16
Part 4: Accessing Open Source and Commercial Platforms and Services

Deploying a SageMaker Geospatial example

In this section, we will stand up a SageMaker Geospatial ML environment and perform a simple Earth Observation job (EOJ) to monitor the amount of water present in satellite imagery of water bodies. EOJs are useful when the desired outcome is to acquire and transform data from the surface of our planet to make predictions – in this case, regarding water presence in a given area.

There is a one-time environment setup for SageMaker that can be accomplished with the following steps. However, if you already have a SageMaker domain configured, you can leverage the geospatial capabilities from there.

Note

At the time of this book’s writing, geospatial capabilities are only released for SageMaker in the Oregon (us-west-2) Region.

First-time use steps

We will use the following steps:

  1. Navigate to SageMaker from the AWS Console:
Figure 11.4: AWS Console with SageMaker typed into the top navigation panel

Figure 11.4: AWS Console with SageMaker typed into the top...