Data Wrangling on AWS

By: Navnit Shukla, Sankar M, Sampat Palani

Overview of this book

Data wrangling is the process of cleaning, transforming, and organizing raw, messy, or unstructured data into a structured format. It involves data cleaning, data integration, data transformation, and data enrichment to ensure that the data is accurate, consistent, and suitable for analysis. Data Wrangling on AWS equips you with the knowledge to realize the full potential of AWS data wrangling tools. First, you’ll be introduced to data wrangling on AWS and the data wrangling services it offers. You’ll learn how to work with AWS Glue DataBrew, AWS Data Wrangler, and Amazon SageMaker. Next, you’ll discover other AWS services such as Amazon S3, Redshift, Athena, and QuickSight. Additionally, you’ll explore advanced topics such as performing pandas data operations with AWS Data Wrangler, optimizing ML data with Amazon SageMaker, and building a data warehouse with Glue DataBrew, along with security and monitoring. By the end of this book, you’ll be well equipped to perform data wrangling using AWS services.
Table of Contents (19 chapters)

  • Part 1: Unleashing Data Wrangling with AWS
  • Part 2: Data Wrangling with AWS Tools
  • Part 3: AWS Data Management and Analysis
  • Part 4: Advanced Data Manipulation and ML Data Optimization
  • Part 5: Ensuring Data Lake Security and Monitoring

Who this book is for

Data Wrangling on AWS is for anyone who wants to master data wrangling and use AWS to manipulate and prepare data efficiently. The book caters to the following audiences:

  • Data Professionals: Data engineers, data analysts, and data scientists who work with large and complex datasets and want to enhance their data-wrangling skills on the AWS platform
  • AWS Users: Individuals who are already familiar with AWS and want to explore the specific services and tools available for data wrangling
  • Business Analysts: Professionals involved in data-driven decision-making and analysis who need to acquire data-wrangling skills to derive valuable insights from their data
  • IT Professionals: Technology enthusiasts and IT practitioners who want to expand their knowledge of data wrangling on the AWS platform

While prior experience with data wrangling or AWS is beneficial, the book provides a solid foundation for beginners and gradually progresses to more advanced topics. Familiarity with basic programming concepts and SQL is advantageous but not mandatory. The book combines theoretical explanations with practical examples and hands-on exercises, making it accessible to readers with different backgrounds and skill levels.