High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark - Softcover

Karau, Holden; Warren, Rachel

9781491943205: High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark

Softcover

ISBN 10: 1491943203 ISBN 13: 9781491943205

Publisher: O'Reilly, 2017

18 Used from 4.37

1 New from 22.50

Apache Spark is amazing when everything clicks. But if you haven’t seen the performance improvements you expected, or still don’t feel confident enough to use Spark in production, this practical book is for you. Authors Holden Karau and Rachel Warren demonstrate performance optimizations to help your Spark queries run faster and handle larger data sizes, while using fewer resources.

Ideal for software engineers, data engineers, developers, and system administrators working with large-scale data applications, this book describes techniques that can reduce data infrastructure costs and developer hours. Not only will you gain a more comprehensive understanding of Spark, you’ll also learn how to make it sing.

With this book, you’ll explore:

How Spark SQL’s new interfaces improve performance over Spark’s RDD data structure
The choice between data joins in Core Spark and Spark SQL
Techniques for getting the most out of standard RDD transformations
How to work around performance issues in Spark’s key/value pair paradigm
Writing high-performance Spark code without Scala or the JVM
How to test for functionality and performance when applying suggested improvements
Using Spark MLlib and Spark ML machine learning libraries
Spark’s Streaming components and external community packages

"synopsis" may belong to another edition of this title.

About the Author

Holden Karau is a software development engineer at Databricks and is active in open source. She is the author of an earlier Spark book. Prior to Databricks, she worked on a variety of search and classification problems at Google, Foursquare, and Amazon. She graduated from the University of Waterloo with a Bachelor of Mathematics in Computer Science. Outside of software, she enjoys playing with fire, welding, and hula hooping.

Rachel Warren is a data scientist and software engineer at Alpine Data Labs, where she uses Spark to address real-world data processing challenges. She has experience working as an analyst both in industry and academia. She graduated with a degree in Computer Science from Wesleyan University in Connecticut.

"About this title" may belong to another edition of this title.

Publisher: O'Reilly
Publication date: 2017
Language: English
ISBN 10: 1491943203
ISBN 13: 9781491943205
Binding: Paperback
Edition number: 1
Number of pages: 358

Search results for High Performance Spark: Best Practices for Scaling...

High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark

Warren, Rachel; Karau, Holden

Published by O'Reilly Media, 2017

ISBN 10: 1491943203 ISBN 13: 9781491943205

Used paperback

Seller: New Legacy Books, Annandale, NJ, U.S.A.

Seller rating 5 out of 5 stars

paperback. Condition: Good. There is a signature or handwriting on the inside front cover. This is an ex library book, stickers and markings accordingly. Fast shipping and order satisfaction guaranteed. A portion of your purchase benefits Non-Profit Organizations, First Aid and Fire Stations! Seller Inventory # mon0000147052

Price: 4.37

Shipping: 1.86
Ships within U.S.A.

Quantity: 1 available

High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark

Karau, Holden; Warren, Rachel

Published by O'Reilly Media, Incorporated, 2017

ISBN 10: 1491943203 ISBN 13: 9781491943205

Used Softcover

Seller: Better World Books: West, Reno, NV, U.S.A.

Seller rating 5 out of 5 stars

Condition: Very Good. Former library copy. Pages intact with possible writing/highlighting. Binding strong with minor wear. Dust jackets/supplements may not be included. Includes library markings. Stock photo provided. Product includes identifying sticker. Better World Books: Buy Books. Do Good. Seller Inventory # 55404007-20

Price: 7.18

Free Shipping
Ships within U.S.A.

Quantity: 1 available

High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark

Warren, Rachel; Karau, Holden

Published by O'Reilly Media, 2017

ISBN 10: 1491943203 ISBN 13: 9781491943205

Used Paperback

Seller: HPB-Red, Dallas, TX, U.S.A.

Seller rating 5 out of 5 stars

Paperback. Condition: Good. Connecting readers with great books since 1972! Used textbooks may not include companion materials such as access codes, etc. May have some wear or writing/highlighting. We ship orders daily and Customer Service is our top priority! Seller Inventory # S_473687060

Price: 4.39

Shipping: 2.80
Ships within U.S.A.

Quantity: 1 available

High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark

Karau, Holden

Published by O'Reilly Media, 2017

ISBN 10: 1491943203 ISBN 13: 9781491943205

Used Softcover

Seller: More Than Words, Waltham, MA, U.S.A.

Seller rating 5 out of 5 stars

Condition: Good. A sound copy with only light wear. Overall a solid copy at a great price! Seller Inventory # BOS-T-08b-01531

Price: 4.85

Shipping: 2.98
Ships within U.S.A.

Quantity: 1 available

High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark

Karau, Holden; Warren, Rachel

Published by O'Reilly Media, 2017

ISBN 10: 1491943203 ISBN 13: 9781491943205

Used paperback First Edition

Seller: The Maryland Book Bank, Baltimore, MD, U.S.A.

Seller rating 5 out of 5 stars

paperback. Condition: Good. 1st Edition. Corners are slightly bent. Used - Good. Seller Inventory # 9-N-5-0184

Price: 4.45

Shipping: 3.51
Ships within U.S.A.

Quantity: 1 available

High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark

Karau, Holden; Warren, Rachel

Published by O'Reilly Media, 2017

ISBN 10: 1491943203 ISBN 13: 9781491943205

Used paperback

Seller: A1AMedia, Saint Augustine, FL, U.S.A.

Seller rating 5 out of 5 stars

paperback. Condition: Very Good. Minor wear C2. Seller Inventory # 182333

Price: 6.92

Shipping: 2.98
Ships within U.S.A.

Quantity: 1 available

High Performance Spark Best Pr

Karau, Holden

Published by O'Reilly Media, 2017

ISBN 10: 1491943203 ISBN 13: 9781491943205

Used Softcover

Seller: World of Books (was SecondSale), Montgomery, IL, U.S.A.

Seller rating 5 out of 5 stars

Condition: Good. Item in good condition. Textbooks may not include supplemental items i.e. CDs, access codes etc. Seller Inventory # 00102393754

Price: 10

Free Shipping
Ships within U.S.A.

Quantity: 4 available

High Performance Spark Best Pr

Karau, Holden

Published by O'Reilly Media, 2017

ISBN 10: 1491943203 ISBN 13: 9781491943205

Used Softcover

Seller: World of Books (was SecondSale), Montgomery, IL, U.S.A.

Seller rating 5 out of 5 stars

Condition: Very Good. Item in very good condition! Textbooks may not include supplemental items i.e. CDs, access codes etc. Seller Inventory # 00101681102

Price: 10

Free Shipping
Ships within U.S.A.

Quantity: 1 available

High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark

Karau, Holden; Warren, Rachel

Published by O'Reilly Media, Incorporated, 2017

ISBN 10: 1491943203 ISBN 13: 9781491943205

Used Softcover

Seller: Better World Books Ltd, Dunfermline, United Kingdom

Seller rating 5 out of 5 stars

Price: 5.16

Shipping: 5
Ships from United Kingdom to U.S.A.

Quantity: 1 available

High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark

Karau, Holden; Warren, Rachel

Published by O'Reilly Media, 2017

ISBN 10: 1491943203 ISBN 13: 9781491943205

Used Softcover

Seller: Goodwill of Silicon Valley, SAN JOSE, CA, U.S.A.

Seller rating 5 out of 5 stars

Condition: Good. Supports Goodwill of Silicon Valley job training programs. The cover and pages are in good condition. Any other included accessories are also in good condition, showing use. Use can include some highlighting and writing, page and cover creases, as well as other types of visible wear. Seller Inventory # GWSVV.1491943203.G

Price: 7.49

Shipping: 2.98
Ships within U.S.A.

Quantity: 1 available

There are 9 more copies of this book
