AWS for Solutions Architects - Second Edition

By : Saurabh Shrivastava, Neelanjali Srivastav, Alberto Artasanchez, Imtiaz Sayed

Overview of this book

Are you excited to harness the power of AWS and unlock endless possibilities for your business? Look no further than the second edition of AWS for Solutions Architects! Imagine crafting cloud solutions that are secure, scalable, and optimized – not just good, but industry-leading. This updated guide throws open the doors to the AWS Well-Architected Framework, design pillars, and cloud-native design patterns, empowering you to craft secure, performant, and cost-effective cloud architectures. Tame the complexities of networking by conquering edge deployments and crafting seamless hybrid cloud connections. Uncover the secrets of big data and streaming with EMR, Glue, Kinesis, and MSK, extracting valuable insights from data at speeds you never thought possible. Future-proof your cloud with game-changing insights! New chapters unveil CloudOps, machine learning, IoT, and blockchain, empowering you to build transformative solutions. Plus, unlock the secrets of storage mastery, container excellence, and data lake patterns. From simple configurations to sophisticated architectures, this guide equips you with the knowledge to solve any cloud challenge and impress even the most demanding clients. This book is your one-stop shop for architecting industry-standard AWS solutions. Stop settling for average – dive in and build like a pro!

Amazon Elastic MapReduce (EMR)

Back in 2009, AWS introduced EMR, a service for processing extremely large amounts of data (terabytes to petabytes) in the cloud using open-source big data tools such as Spark, Hive, Presto, HBase, Flink, and Hudi. Amazon EMR is a managed cluster platform that makes it easier to run big data frameworks, such as Apache Hadoop and Apache Spark, on AWS to process and analyze massive datasets. In effect, it is a wrapper around distributed open-source computing frameworks, and that wrapper abstracts away the effort required to set up infrastructure, security, network communication, disaster recovery, and scalability. EMR also offers 100% compliance with open-source APIs, so there is no need to change your application code when you move to EMR from an on-premises Hadoop system.
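
To make the "managed cluster platform" idea concrete, here is a minimal sketch of how a cluster request might be assembled in Python for the boto3 `run_job_flow` call. The helper names (`build_cluster_request`, `launch_cluster`), the cluster name, the release label, the instance type and count, and the Region are all illustrative assumptions rather than values from this book. The role names are the EMR default roles and must already exist in your account.

```python
from typing import Any, Dict, List


def build_cluster_request(
    name: str,
    applications: List[str],
    release_label: str = "emr-6.15.0",  # assumed release; use one available to you
    instance_type: str = "m5.xlarge",   # assumed instance type
    instance_count: int = 3,            # assumed size: 1 primary + 2 core nodes
) -> Dict[str, Any]:
    """Build the keyword arguments for boto3's EMR run_job_flow call."""
    return {
        "Name": name,
        "ReleaseLabel": release_label,
        # Each open-source framework to install is listed by name.
        "Applications": [{"Name": app} for app in applications],
        "Instances": {
            "MasterInstanceType": instance_type,
            "SlaveInstanceType": instance_type,
            "InstanceCount": instance_count,
            # Keep the cluster running after it has no more steps to execute.
            "KeepJobFlowAliveWhenNoSteps": True,
        },
        # Default EMR roles (assumed to exist in the account).
        "JobFlowRole": "EMR_EC2_DefaultRole",
        "ServiceRole": "EMR_DefaultRole",
    }


def launch_cluster(request: Dict[str, Any], region: str = "us-east-1") -> str:
    """Submit the request to EMR and return the cluster ID.

    Requires boto3 and valid AWS credentials; it is not called in this sketch.
    """
    import boto3  # imported lazily so the builder works without boto3 installed

    emr = boto3.client("emr", region_name=region)
    response = emr.run_job_flow(**request)
    return response["JobFlowId"]


# Build (but do not submit) a request for a Spark and Hive cluster.
request = build_cluster_request("demo-cluster", ["Spark", "Hive"])
print(request["Applications"])
```

Note that the request names only the frameworks and the capacity; provisioning, installation, and configuration of the frameworks are left to the service, which is the abstraction the paragraph above describes.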

EMR runs directly against the data stored in your S3 data lake, so you don’t need to move or transform that data. You can store data in the data lake...