Apache Hudi - The Definitive Guide: Building Robust, Open, and High-Performing Data Lakehouses - Softcover

Xu, Shiyan ; Wason, Prashant ; Saktheeswaran, Bhavani Sudha ; Bilbro, Rebecca

9781098173838: Apache Hudi - The Definitive Guide: Building Robust, Open, and High-Performing Data Lakehouses

Softcover

ISBN 10: 109817383X ISBN 13: 9781098173838

Publisher: O'Reilly Media, 2025

View all copies of this ISBN edition

3 Used

From � 43.34

28 New

From � 38.80

Overcome challenges in building transactional guarantees on rapidly changing data by using Apache Hudi. With this practical guide, data engineers, data architects, and software architects will discover how to seamlessly build an interoperable lakehouse from disparate data sources and deliver faster insights using their query engine of choice.

Authors Shiyan Xu, Prashant Wason, Sudha Saktheeswaran, and Rebecca Bilbro provide practical examples and insights to help you unlock the full potential of data lakehouses for different levels of analytics, from batch to interactive to streaming. You'll also learn how to evaluate storage choices and leverage built-in automated table optimizations to build, maintain, and operate production data applications.

This book helps you:

Understand the need for transactional data lakehouses and the challenges associated with building them
Get up to speed with Apache Hudi and learn how it makes building data lakehouses easy
Explore data ecosystem support provided by Apache Hudi for popular data sources and query engines
Perform different write and read operations on Apache Hudi tables and effectively use them for various use cases, including batch and stream applications
Implement data engineering techniques to operate and manage Apache Hudi tables
Apply different storage techniques and considerations, such as indexing and clustering to maximize your lakehouse performance
Build end-to-end incremental data pipelines using Apache Hudi for faster ingestion and fresher analytics

"synopsis" may belong to another edition of this title.

About the Author

Shiyan Xu is a Founding Engineer at Onehouse and currently working as an Open Source Engineer. He has been an active contributor to Apache Hudi since 2019, and is serving as a PMC member of the project since 2021. Prior to joining Onehouse, Shiyan worked as a tech lead manager at Zendesk, leading the development of a large-scale data lake platform using Apache Hudi. He is passionate about open source development and engaging with community users. Prashant Wason is a Staff Software Engineer at Uber Technologies and a PMC member of the Apache Hudi project. He has been an active contributor to the Hudi project since 2019 with features like Metadata Table and Record Index. Prashant has been working in the Storage and Data Infrastructure space for over 15 years.Sudha Saktheeswaran is a Software Engineer at Onehouse and a PMC member of the Apache Hudi project. She comes with vast experience in real-time and distributed data systems through her work at Moveworks, Uber and Linkedin's data infra teams. Sudha is also a key contributor to the early Presto integrations of Hudi. She is passionate about engaging with and driving the Hudi community. Dr. Rebecca Bilbro is a data scientist, Python programmer, and author in Washington, DC. She specializes in data visualization for machine learning, from feature analysis to model selection and hyperparameter tuning. Rebecca is an active contributor to the open source community and has conducted research on natural language processing, semantic network extraction, entity resolution, and high dimensional information visualization. She earned her doctorate from the University of Illinois, Urbana-Champaign, where her research centered on communication and visualization practices in engineering. Rebecca is co-founder and CTO of Rotational Labs.

"About this title" may belong to another edition of this title.

Publisher: O'Reilly Media
Publication date: 2025
Language: English
ISBN 10: 109817383X
ISBN 13: 9781098173838
Binding: Paperback
Number of pages: 350

Search results for Apache Hudi - The Definitive Guide: Building Robust,...

Seller Image

Apache Hudi : The Definitive Guide: Building Robust, Open, and High-performing Data Lakehouses

Xu, Shiyan; Wason, Prashant; Saktheeswaran, Bhavani Sudha; Bilbro, Rebecca

Published by O'Reilly Media, 2025

ISBN 10: 109817383X ISBN 13: 9781098173838

New Softcover

Seller: GreatBookPrices, Columbia, MD, U.S.A.

Seller rating 5 out of 5 stars

Condition: New. Seller Inventory # 50255451-n

Contact seller

Buy New

� 38.80

� 1.98 shipping
Ships within U.S.A.

Quantity: Over 20 available

Add to basket

Seller Image

Apache Hudi: The Definitive Guide: Building Robust, Open, and High-Performing Data Lakehouses (Paperback or Softback)

Xu, Shiyan

Published by O'Reilly Media 12/2/2025, 2025

ISBN 10: 109817383X ISBN 13: 9781098173838

New Paperback or Softback

Seller: BargainBookStores, Grand Rapids, MI, U.S.A.

Seller rating 5 out of 5 stars

Paperback or Softback. Condition: New. Apache Hudi: The Definitive Guide: Building Robust, Open, and High-Performing Data Lakehouses. Book. Seller Inventory # BBS-9781098173838

Contact seller

Buy New

� 40.85

Free Shipping
Ships within U.S.A.

Quantity: 5 available

Add to basket

Stock Image

Apache Hudi - The Definitive Guide

Shiyan Xu

Published by O'Reilly Media, 2025

ISBN 10: 109817383X ISBN 13: 9781098173838

New PAP

Seller: PBShop.store US, Wood Dale, IL, U.S.A.

Seller rating 5 out of 5 stars

PAP. Condition: New. New Book. Shipped from UK. Established seller since 2000. Seller Inventory # WO-9781098173838

Contact seller

Buy New

� 41.07

Free Shipping
Ships within U.S.A.

Quantity: 15 available

Add to basket

Stock Image

Apache Hudi - The Definitive Guide

Shiyan Xu

Published by O'Reilly Media, 2025

ISBN 10: 109817383X ISBN 13: 9781098173838

New PAP

Seller: PBShop.store UK, Fairford, GLOS, United Kingdom

Seller rating 5 out of 5 stars

PAP. Condition: New. New Book. Shipped from UK. Established seller since 2000. Seller Inventory # WO-9781098173838

Contact seller

Buy New

� 37.29

� 5.02 shipping
Ships from United Kingdom to U.S.A.

Quantity: 15 available

Add to basket

Stock Image

Apache Hudi: The Definitive Guide: Building Robust, Open, and High-Performing Data Lakehouses

Xu, Shiyan; Wason, Prashant; Saktheeswaran, Bhavani Sudha; Bilbro, Rebecca

Published by O'Reilly Media, 2025

ISBN 10: 109817383X ISBN 13: 9781098173838

New Softcover

Seller: California Books, Miami, FL, U.S.A.

Seller rating 4 out of 5 stars

Condition: New. Seller Inventory # I-9781098173838

Contact seller

Buy New

� 44.74

Free Shipping
Ships within U.S.A.

Quantity: Over 20 available

Add to basket

Seller Image

Apache Hudi : The Definitive Guide: Building Robust, Open, and High-performing Data Lakehouses

Xu, Shiyan; Wason, Prashant; Saktheeswaran, Bhavani Sudha; Bilbro, Rebecca

Published by O'Reilly Media, 2025

ISBN 10: 109817383X ISBN 13: 9781098173838

Used Softcover

Seller: GreatBookPrices, Columbia, MD, U.S.A.

Seller rating 5 out of 5 stars

Condition: As New. Unread book in perfect condition. Seller Inventory # 50255451

Contact seller

Buy Used

� 43.34

� 1.98 shipping
Ships within U.S.A.

Quantity: Over 20 available

Add to basket

Seller Image

Apache Hudi - The Definitive Guide

Shiyan Xu

Published by O'Reilly Media, US, 2025

ISBN 10: 109817383X ISBN 13: 9781098173838

New Paperback

Seller: Rarewaves USA, OSWEGO, IL, U.S.A.

Seller rating 5 out of 5 stars

Paperback. Condition: New. Overcome challenges in building transactional guarantees on rapidly changing data by using Apache Hudi. With this practical guide, data engineers, data architects, and software architects will discover how to seamlessly build an interoperable lakehouse from disparate data sources and deliver faster insights using their query engine of choice.Authors Shiyan Xu, Prashant Wason, Sudha Saktheeswaran, and Rebecca Bilbro provide practical examples and insights to help you unlock the full potential of data lakehouses for different levels of analytics, from batch to interactive to streaming. You'll also learn how to evaluate storage choices and leverage built-in automated table optimizations to build, maintain, and operate production data applications.This book helps you:Understand the need for transactional data lakehouses and the challenges associated with building themGet up to speed with Apache Hudi and learn how it makes building data lakehouses easyExplore data ecosystem support provided by Apache Hudi for popular data sources and query enginesPerform different write and read operations on Apache Hudi tables and effectively use them for various use cases, including batch and stream applicationsImplement data engineering techniques to operate and manage Apache Hudi tablesApply different storage techniques and considerations, such as indexing and clustering to maximize your lakehouse performanceBuild end-to-end incremental data pipelines using Apache Hudi for faster ingestion and fresher analytics. Seller Inventory # LU-9781098173838

Contact seller

Buy New

� 45.60

Free Shipping
Ships within U.S.A.

Quantity: Over 20 available

Add to basket

Stock Image

Apache Hudi - The Definitive Guide (Paperback)

Shiyan Xu

Published by O'Reilly Media, Sebastopol, 2025

ISBN 10: 109817383X ISBN 13: 9781098173838

New Paperback

Print on Demand

Seller: Grand Eagle Retail, Bensenville, IL, U.S.A.

Seller rating 5 out of 5 stars

Paperback. Condition: new. Paperback. Overcome challenges in building transactional guarantees on rapidly changing data by using Apache Hudi. With this practical guide, data engineers, data architects, and software architects will discover how to seamlessly build an interoperable lakehouse from disparate data sources and deliver faster insights using their query engine of choice.Authors Shiyan Xu, Prashant Wason, Sudha Saktheeswaran, and Rebecca Bilbro provide practical examples and insights to help you unlock the full potential of data lakehouses for different levels of analytics, from batch to interactive to streaming. You'll also learn how to evaluate storage choices and leverage built-in automated table optimizations to build, maintain, and operate production data applications.This book helps you:Understand the need for transactional data lakehouses and the challenges associated with building themGet up to speed with Apache Hudi and learn how it makes building data lakehouses easyExplore data ecosystem support provided by Apache Hudi for popular data sources and query enginesPerform different write and read operations on Apache Hudi tables and effectively use them for various use cases, including batch and stream applicationsImplement data engineering techniques to operate and manage Apache Hudi tablesApply different storage techniques and considerations, such as indexing and clustering to maximize your lakehouse performanceBuild end-to-end incremental data pipelines using Apache Hudi for faster ingestion and fresher analytics Overcome challenges in building transactional guarantees on rapidly changing data by using Apache Hudi. With this practical guide, data engineers, data architects, and software architects will discover how to seamlessly build an interoperable lakehouse from disparate data sources and deliver faster insights using their query engine of choice. This item is printed on demand. Shipping may be from multiple locations in the US or from the UK, depending on stock availability. Seller Inventory # 9781098173838

Contact seller