Dataproc Cookbook: Running Spark and Hadoop Workloads in Google Cloud - Softcover

Sadineni, Narasimha; Venkataraman, Anuyogam

ISBN 10: 1098157702 ISBN 13: 9781098157708

Publisher: O'Reilly Media, 2025

Get up to speed with Dataproc, the fully managed and highly scalable service for running open source big data tools and frameworks, including Hadoop, Spark, Flink, and Presto. This cookbook shows data engineers, data scientists, data analysts, and cloud architects how to use Dataproc, integrated with Google Cloud, for data lake modernization, ETL, and secure data science at a fraction of the cost.

Narasimha Sadineni from Google and former Googler Anu Venkataraman show you how to set up and run Hadoop and Spark jobs on Dataproc. You'll learn how to create Dataproc clusters and run data engineering and data science workloads in long-running, ephemeral, and serverless ways. In the process, you'll gain an understanding of Dataproc, orchestration, logging and monitoring, Spark History Server, and migration patterns.

This cookbook includes hands-on examples for cluster configuration, logging, and security, and for migrating from on-premises systems to Dataproc. You'll learn how to:

Create Dataproc clusters on Compute Engine and Kubernetes Engine
Run data science workloads on Dataproc
Execute Spark jobs on Dataproc Serverless
Optimize Dataproc clusters to be cost effective and performant
Monitor Spark jobs in various ways
Orchestrate various workloads and activities
Use different methods for migrating data and workloads from existing Hadoop clusters to Dataproc

About the Authors

Narasimha Sadineni is a data engineer at Google with 12 years of experience in data and analytics. As a professional services team member at Google and Cloudera, he helped more than 50 organizations solve big data problems using Hadoop and Google Cloud technologies. He also has several years of experience teaching Hadoop.

Anu Venkataraman is a Senior Program Manager. She previously served as a Data Lake Engineer at Google, where she gained extensive experience in data technologies. Anu helps customers migrate large-scale distributed systems to the cloud. She enjoys speaking at universities and contributing technical blogs and videos to the data community to speed customers' journeys to the cloud. Anu was one of the leads for the Professional Services Tech Talk playlist on the Google Cloud Tech YouTube channel. She holds a master's degree in Electrical and Computer Engineering from Ryerson University, specializing in medical image processing and machine learning.

Publisher: O'Reilly Media
Publication date: 2025
Language: English
ISBN 10: 1098157702
ISBN 13: 9781098157708
Binding: Paperback
Edition number: 1
Number of pages: 436

Search results for Dataproc Cookbook: Running Spark and Hadoop Workloads...

Used paperback

Seller: Books From California, Simi Valley, CA, U.S.A.

Seller rating 4 out of 5 stars

paperback. Condition: Very Good. Seller Inventory # mon0003938329

Ships within U.S.A.

Quantity: 1 available

Used paperback

Seller: Books From California, Simi Valley, CA, U.S.A.

Seller rating 4 out of 5 stars

paperback. Condition: Good. Cover and edges may have some wear. Seller Inventory # mon0003958271

Ships within U.S.A.

Quantity: 1 available

Used paperback

Seller: Books From California, Simi Valley, CA, U.S.A.

Seller rating 4 out of 5 stars

paperback. Condition: Fine. Seller Inventory # mon0003938182

Ships within U.S.A.

Quantity: 1 available

New Softcover

Seller: Lakeside Books, Benton Harbor, MI, U.S.A.

Seller rating 5 out of 5 stars

Condition: New. Brand New! Not Overstocks or Low Quality Book Club Editions! Direct From the Publisher! We're not a giant, faceless warehouse organization! We're a small town bookstore that loves books and loves its customers! Buy from Lakeside Books! Seller Inventory # OTF-S-9781098157708

Ships within U.S.A.

Quantity: Over 20 available

New Softcover

Seller: GreatBookPrices, Columbia, MD, U.S.A.

Seller rating 5 out of 5 stars

Condition: New. Seller Inventory # 48274538-n

Ships within U.S.A.

Quantity: Over 20 available

Published by O'Reilly Media, 7/8/2025

New Paperback or Softback

Seller: BargainBookStores, Grand Rapids, MI, U.S.A.

Seller rating 5 out of 5 stars

Paperback or Softback. Condition: New. Dataproc Cookbook: Running Spark and Hadoop Workloads in Google Cloud. Book. Seller Inventory # BBS-9781098157708
