Serverless ETL and Analytics with AWS Glue: Your comprehensive reference guide to learning about AWS Glue and its features - Softcover

 
9781800564985: Serverless ETL and Analytics with AWS Glue: Your comprehensive reference guide to learning about AWS Glue and its features

Synopsis

Build efficient data lakes that can scale to virtually unlimited size using AWS Glue

Key Features

  • Learn to work with AWS Glue to overcome typical implementation challenges in data lakes
  • Create and manage serverless ETL pipelines that can scale to manage big data
  • Written by AWS Glue community members, this practical guide shows you how to implement AWS Glue in no time

Book Description

Organizations these days have gravitated toward services such as AWS Glue that undertake undifferentiated heavy lifting and provide serverless Spark, enabling you to create and manage data lakes in a serverless fashion. This guide shows you how AWS Glue can be used to solve real-world problems along with helping you learn about data processing, data integration, and building data lakes.

Beginning with AWS Glue basics, this book teaches you how to perform various aspects of data analysis such as ad hoc queries, data visualization, and real-time analysis using this service. It also provides a walk-through of CI/CD for AWS Glue and how to shift left on quality using automated regression tests. You'll find out how data security aspects such as access control, encryption, auditing, and networking are implemented, as well as getting to grips with useful techniques such as picking the right file format, compression, partitioning, and bucketing. As you advance, you'll discover AWS Glue features such as crawlers, Lake Formation, governed tables, lineage, DataBrew, Glue Studio, and custom connectors. The concluding chapters help you to understand various performance tuning, troubleshooting, and monitoring options.

By the end of this AWS book, you'll be able to create, manage, troubleshoot, and deploy ETL pipelines using AWS Glue.

What you will learn

  • Apply various AWS Glue features to manage and create data lakes
  • Use Glue DataBrew and Glue Studio for data preparation
  • Optimize data layout in cloud storage to accelerate analytics workloads
  • Manage metadata including database, table, and schema definitions
  • Secure your data with access control, encryption, auditing, and networking
  • Monitor AWS Glue jobs to detect delays and loss of data
  • Integrate Spark ML and SageMaker with AWS Glue to create machine learning models

Who this book is for

This book is for ETL developers, data engineers, and data analysts who want to understand how AWS Glue can help them solve their business problems. Basic knowledge of AWS data services is assumed.

Table of Contents

  1. Data Management – Introduction and Concepts
  2. Introduction to Important AWS Glue Features
  3. Data Ingestion
  4. Data Preparation
  5. Designing Data Layouts
  6. Data Management
  7. Metadata Management
  8. Data Security
  9. Data Sharing
  10. Data Pipeline Management
  11. Monitoring
  12. Tuning, Debugging, and Troubleshooting
  13. Data Analysis
  14. Machine Learning Integration
  15. Architecting Data Lakes for Real-World Scenarios and Edge Cases

About the Author

Vishal Pathak is a Data Lab Solutions Architect at AWS. Vishal works with customers on their use cases, architects solutions to solve their business problems, and helps them build scalable prototypes. Prior to his journey in AWS, Vishal helped customers implement business intelligence, data warehouse, and data lake projects in the US and Australia.

Subramanya Vajiraya is a Big Data Cloud Engineer at AWS Sydney specializing in AWS Glue. He obtained his Bachelor of Engineering degree, specializing in Information Science & Engineering, from NMAM Institute of Technology, Nitte, KA, India (Visvesvaraya Technological University, Belgaum) in 2015 and his Master of Information Technology degree, specializing in Internetworking, from the University of New South Wales, Sydney, Australia in 2017. He is passionate about helping customers solve challenging technical issues related to their ETL workloads and implementing scalable data integration and analytics pipelines on AWS.

Noritaka Sekiyama is a Senior Big Data Architect on the AWS Glue and AWS Lake Formation team. He has 11 years of experience working in the software industry. Based in Tokyo, Japan, he is responsible for implementing software artifacts, building libraries, troubleshooting complex issues and helping guide customer architectures.

Tomohiro Tanaka is a senior cloud support engineer at AWS. He works to help customers solve their issues and build data lakes across AWS Glue, AWS IoT, and big data technologies such as Apache Spark, Hadoop, and Iceberg.

Albert Quiroga works as a senior solutions architect at Amazon, where he is helping to design and architect one of the largest data lakes in the world. Prior to that, he spent four years working at AWS, where he specialized in big data technologies such as EMR and Athena, and where he became an expert on AWS Glue. Albert has worked with several Fortune 500 companies on some of the largest data lakes in the world and has helped to launch and develop features for several AWS services.

Ishan Gaur has more than 13 years of IT experience in software development and data engineering, building distributed systems and highly scalable ETL pipelines using Apache Spark, Scala, and various ETL tools such as Ab Initio and Datastage. He currently works at AWS as a senior big data cloud engineer and is an SME of AWS Glue. He is responsible for helping customers to build out large, scalable distributed systems and implement them in AWS cloud environments using various big data services, including EMR, Glue, and Athena, as well as other technologies, such as Apache Spark, Hadoop, and Hive.

Search results for Serverless ETL and Analytics with AWS Glue: Your comprehensi...

Vishal Pathak; Subramanya Vajiraya; Noritaka Sekiyama; Tomohiro Tanaka; Albert Quiroga
Published by Packt Publishing, 2022
ISBN 10: 1800564988 ISBN 13: 9781800564985
New Softcover

Seller: GreatBookPricesUK, Woodford Green, United Kingdom

Seller rating 5 out of 5 stars

Condition: New. Seller Inventory # 44649481-n

Buy New

£ 37.29
Shipping: FREE
Within United Kingdom

Quantity: Over 20 available

Vishal Pathak; Subramanya Vajiraya; Noritaka Sekiyama; Tomohiro Tanaka; Albert Quiroga
Published by Packt Publishing, 2022
ISBN 10: 1800564988 ISBN 13: 9781800564985
New Softcover

Seller: Ria Christie Collections, Uxbridge, United Kingdom

Seller rating 5 out of 5 stars

Condition: New. In. Seller Inventory # ria9781800564985_new

Buy New

£ 37.30
Shipping: FREE
Within United Kingdom

Quantity: Over 20 available

Vishal Pathak
Published by Packt Publishing Limited, 2022
ISBN 10: 1800564988 ISBN 13: 9781800564985
New PAP
Print on Demand

Seller: PBShop.store UK, Fairford, GLOS, United Kingdom

Seller rating 5 out of 5 stars

PAP. Condition: New. New Book. Delivered from our UK warehouse in 4 to 14 business days. THIS BOOK IS PRINTED ON DEMAND. Established seller since 2000. Seller Inventory # L0-9781800564985

Buy New

£ 37.87
Shipping: FREE
Within United Kingdom

Quantity: Over 20 available

Pathak, Vishal; Vajiraya, Subramanya; Sekiyama, Noritaka; Tanaka, Tomohiro; Quiroga, Albert; Gaur, Ishan
Published by Packt Publishing, 2022
ISBN 10: 1800564988 ISBN 13: 9781800564985
Used Softcover

Seller: GreatBookPricesUK, Woodford Green, United Kingdom

Seller rating 5 out of 5 stars

Condition: As New. Unread book in perfect condition. Seller Inventory # 44649481

Buy Used

£ 40.75
Shipping: FREE
Within United Kingdom

Quantity: Over 20 available

Vishal Pathak; Subramanya Vajiraya; Noritaka Sekiyama; Tomohiro Tanaka; Albert Quiroga
Published by Packt Publishing, 2022
ISBN 10: 1800564988 ISBN 13: 9781800564985
New Softcover

Seller: California Books, Miami, FL, U.S.A.

Seller rating 5 out of 5 stars

Condition: New. Seller Inventory # I-9781800564985

Buy New

£ 35.46
Shipping: £ 7.33
From U.S.A. to United Kingdom

Quantity: Over 20 available

Vishal Pathak
Published by Packt Publishing Limited, 2022
ISBN 10: 1800564988 ISBN 13: 9781800564985
New PAP
Print on Demand

Seller: PBShop.store US, Wood Dale, IL, U.S.A.

Seller rating 5 out of 5 stars

PAP. Condition: New. New Book. Shipped from UK. THIS BOOK IS PRINTED ON DEMAND. Established seller since 2000. Seller Inventory # L0-9781800564985

Buy New

£ 43.25
Shipping: FREE
From U.S.A. to United Kingdom

Quantity: Over 20 available

Pathak, Vishal
Published by Packt Publishing 8/30/2022, 2022
ISBN 10: 1800564988 ISBN 13: 9781800564985
New Paperback or Softback

Seller: BargainBookStores, Grand Rapids, MI, U.S.A.

Seller rating 5 out of 5 stars

Paperback or Softback. Condition: New. Serverless ETL and Analytics with AWS Glue: Your comprehensive reference guide to learning about AWS Glue and its features 1.63. Book. Seller Inventory # BBS-9781800564985

Buy New

£ 34.91
Shipping: £ 8.42
From U.S.A. to United Kingdom

Quantity: 5 available

Vishal Pathak
Published by Packt Publishing Limited, 2022
ISBN 10: 1800564988 ISBN 13: 9781800564985
New Paperback / softback
Print on Demand

Seller: THE SAINT BOOKSTORE, Southport, United Kingdom

Seller rating 5 out of 5 stars

Paperback / softback. Condition: New. This item is printed on demand. New copy - Usually dispatched within 5-9 working days 100. Seller Inventory # C9781800564985

Buy New

£ 43.63
Shipping: FREE
Within United Kingdom

Quantity: Over 20 available

Vishal Pathak; Subramanya Vajiraya; Noritaka Sekiyama; Tomohiro Tanaka; Albert Quiroga
Published by Packt Publishing, 2022
ISBN 10: 1800564988 ISBN 13: 9781800564985
New Softcover

Seller: GreatBookPrices, Columbia, MD, U.S.A.

Seller rating 5 out of 5 stars

Condition: New. Seller Inventory # 44649481-n

Buy New

£ 32.49
Shipping: £ 14.64
From U.S.A. to United Kingdom

Quantity: Over 20 available

Vishal Pathak; Subramanya Vajiraya; Noritaka Sekiyama; Tomohiro Tanaka; Albert Quiroga
Published by Packt Publishing, 2022
ISBN 10: 1800564988 ISBN 13: 9781800564985
Used Softcover

Seller: GreatBookPrices, Columbia, MD, U.S.A.

Seller rating 5 out of 5 stars

Condition: As New. Unread book in perfect condition. Seller Inventory # 44649481

Buy Used

£ 37.20
Shipping: £ 14.64
From U.S.A. to United Kingdom

Quantity: Over 20 available
