AWS for Solutions Architects - Second Edition

By: Saurabh Shrivastava, Neelanjali Srivastav, Alberto Artasanchez, Imtiaz Sayed
Overview of this book

Are you excited to harness the power of AWS and unlock endless possibilities for your business? Look no further than the second edition of AWS for Solutions Architects! Imagine crafting cloud solutions that are secure, scalable, and optimized – not just good, but industry-leading. This updated guide throws open the doors to the AWS Well-Architected Framework, design pillars, and cloud-native design patterns empowering you to craft secure, performant, and cost-effective cloud architectures. Tame the complexities of networking, conquering edge deployments and crafting seamless hybrid cloud connections. Uncover the secrets of big data and streaming with EMR, Glue, Kinesis, and MSK, extracting valuable insights from data at speeds you never thought possible. Future-proof your cloud with game-changing insights! New chapters unveil CloudOps, machine learning, IoT, and blockchain, empowering you to build transformative solutions. Plus, unlock the secrets of storage mastery, container excellence, and data lake patterns. From simple configurations to sophisticated architectures, this guide equips you with the knowledge to solve any cloud challenge and impress even the most demanding clients. This book is your one-stop shop for architecting industry-standard AWS solutions. Stop settling for average – dive in and build like a pro!

Optimizing Amazon Athena

As with any SQL engine, you can take steps to optimize the performance of your queries and inserts. And, as with traditional databases, optimizing data access performance usually comes at the expense of data ingestion, and vice versa.

Let’s look at some tips that you can use to increase and optimize performance.

Optimization of data partitions

One way to improve performance is to break up a dataset's files into smaller groups of files called partitions. A common partitioning scheme splits the data on a field whose values recur regularly across the records. Some examples follow:

  • Country
  • Region
  • Date
  • Product

Partitions operate as virtual columns and reduce the amount of data that needs to be read by queries that filter on them. Partitions are normally defined at the time a table or file is created.
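
To make the effect concrete, here is a minimal Python sketch of the idea, not of how Athena works internally. It groups a handful of made-up rows by a partition column and then answers a query for one country by reading only that group. The sample rows, column names, and the helpers `partition_rows` and `total_amount` are all hypothetical.

```python
from collections import defaultdict

# Hypothetical sample rows; the column names are illustrative only.
ROWS = [
    {"country": "US", "product": "book", "amount": 12},
    {"country": "US", "product": "pen", "amount": 3},
    {"country": "DE", "product": "book", "amount": 15},
    {"country": "IN", "product": "pen", "amount": 2},
]


def partition_rows(rows, column):
    """Group rows by the value of the chosen partition column."""
    partitions = defaultdict(list)
    for row in rows:
        partitions[row[column]].append(row)
    return dict(partitions)


def total_amount(partitions, country):
    """Sum amounts for one country, touching only that partition.

    Returns (total, rows_read) so the reduction in rows read is visible.
    """
    rows = partitions.get(country, [])
    return sum(r["amount"] for r in rows), len(rows)


partitions = partition_rows(ROWS, "country")
total, rows_read = total_amount(partitions, "US")
print(f"total={total}, rows read={rows_read} of {len(ROWS)}")
```

The query on `country` reads two of the four rows. A query that filters only on `product` would gain nothing from this layout, which is why the choice of partition column should follow the most common filters.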

Amazon Athena can use Apache Hive partitions. Hive partitions use this naming convention:

s3://BucketName/TablePath/<PARTITION_COLUMN_NAME...
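
As a rough illustration, Hive-style layouts store each partition in a folder named `column=value` under the table path. The Python sketch below builds such a prefix and reads the partition values back out of an object path. The bucket name, table path, partition columns, and both helper functions are hypothetical and are not part of any AWS SDK.

```python
def partition_prefix(table_path, **partition_values):
    """Build a Hive-style prefix: one key=value folder per partition column."""
    folders = "/".join(f"{key}={value}" for key, value in partition_values.items())
    return f"{table_path.rstrip('/')}/{folders}/"


def parse_partitions(object_path):
    """Extract partition columns from the key=value folders in a path."""
    return dict(
        part.split("=", 1) for part in object_path.split("/") if "=" in part
    )


# Hypothetical bucket, table path, and partition columns.
prefix = partition_prefix(
    "s3://example-bucket/sales", country="US", date="2023-01-15"
)
print(prefix)
print(parse_partitions(prefix + "data.csv"))
```

Because the column name is part of the folder name, a query engine can work out which folders match a filter from the paths alone, without opening the files inside them.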