Spark: Big Data Cluster Computing in Production - Softcover

Ganelin, Ilya; Orhian, Ema; Sasaki, Kai

9788126562480: Spark: Big Data Cluster Computing in Production

Softcover

ISBN 10: 812656248X ISBN 13: 9788126562480

Publisher: Wiley india Pvt. Ltd

This specific ISBN edition is currently not available.

0 Used

0 New

BRAND NEW, perfect condition.

"synopsis" may belong to another edition of this title.

From the Back Cover

TIPS, TRICKS, AND SOLUTIONS FOR USING SPARK IN PRODUCTION

Spark′s popularity means the field is expanding in terms of both use and capability. Faster than Hadoop and MapReduce, but compatible with Java�, Scala, Python�, and R, this open source clustering framework is becoming a must–have skill. Spark: Big Data Cluster Computing in Production goes beyond the basics to show you how to bring Spark to real–world production environments. With expert instruction, real–life use cases, and frank discussion, this guide helps you move past the challenges and bring proof–of–concept Spark applications live.

Fine–tune your Spark app to run on production data
Manage resources, organize storage, and master monitoring
Learn about potential problems from real–world use cases, and see where Spark fits best
Estimate cluster size and nail down hardware requirements
Tune up performance with memory management, partitioning, shuffling, and more
Ensure data security with Kerberos
Head off Spark streaming problems in production
Integrate Spark with Yarn, Mesos, Tachyon, and more

About the Author

Ilya Ganelin is a data engineer working at Capital One Data Innovation Lab. Ilya is an active contributor to the core components of Apache Spark and a committer to Apache Apex.

Ema Orhian is a Big Data Engineer interested in scaling algorithms. She is the main committer on jaws–spark–sql–rest, a data warehouse explorer on top of Spark SQL.

Kai Sasaki is a software engineer working in distributed computing and machine learning. He is a Spark contributor who develops mainly MLlib, ML libraries.

Brennon York has been a core contributor to Apache Spark since 2014 including development on GraphX and the core build environment.

"About this title" may belong to another edition of this title.

PublisherWiley india Pvt. Ltd
ISBN 10 812656248X
ISBN 13 9788126562480
BindingPaperback
LanguageEnglish
Number of pages216

(No Available Copies)

Search Books: Create a Want

Can't find the book you're looking for? We'll keep searching for you. If one of our booksellers adds it to AbeBooks, we'll let you know!

Create a Want

Other Popular Editions of the Same Title

9781119254010: Spark: Big Data Cluster Computing in Production

Featured Edition

ISBN 10: 1119254019 ISBN 13: 9781119254010
Publisher: Wiley, 2016
Softcover

Items related to Spark: Big Data Cluster Computing in Production