Items related to Fault-Tolerance Techniques for High-Performance Computing...

Fault-Tolerance Techniques for High-Performance Computing (Computer Communications and Networks) - Hardcover

 
9783319209425: Fault-Tolerance Techniques for High-Performance Computing (Computer Communications and Networks)

Synopsis

This timely text presents a comprehensive overview of fault tolerance techniques for high-performance computing (HPC). The text opens with a detailed introduction to the concepts of checkpoint protocols and scheduling algorithms, prediction, replication, silent error detection and correction, together with some application-specific techniques such as ABFT. Emphasis is placed on analytical performance models. This is then followed by a review of general-purpose techniques, including several checkpoint and rollback recovery protocols. Relevant execution scenarios are also evaluated and compared through quantitative models. Features: provides a survey of resilience methods and performance models; examines the various sources for errors and faults in large-scale systems; reviews the spectrum of techniques that can be applied to design a fault-tolerant MPI; investigates different approaches to replication; discusses the challenge of energy consumption of fault-tolerance methods in extreme-scale systems.

"synopsis" may belong to another edition of this title.

From the Back Cover

This timely text/reference presents a comprehensive overview of fault tolerance techniques for high-performance computing (HPC).

The text opens with a detailed introduction to the concepts of checkpoint protocols and scheduling algorithms, prediction, replication, silent error detection and correction, together with some application-specific techniques such as algorithm-based fault tolerance. Emphasis is placed on analytical performance models. This is then followed by a review of general-purpose techniques, including several checkpoint and rollback recovery protocols. Relevant execution scenarios are also evaluated and compared through quantitative models.

Topics and features:

  • Includes self-contained contributions from an international selection of preeminent experts
  • Provides a survey of resilience methods and performance models
  • Examines the various sources for errors and faults in large-scale systems, detailing their characteristics, with a focus on modeling, detection and prediction
  • Reviews the spectrum of techniques that can be applied to design a fault-tolerant message passing interface
  • Investigates different approaches to replication, comparing these to the traditional checkpoint-recovery approach
  • Discusses the challenge of energy consumption of fault-tolerance methods in extreme-scale systems, proposing a methodology to estimate such energy consumption

This authoritative volume is essential reading for all researchers and graduate students involved in high-performance computing.

Dr. Thomas Herault is a Research Scientist in the Innovative Computing Laboratory (ICL) at the University of Tennessee Knoxville, TN, USA. Dr. Yves Robert is a Professor in the Laboratory of Parallel Computing at the Ecole Normale Supérieure de Lyon, France, and a Visiting Research Scholar in the ICL.

"About this title" may belong to another edition of this title.

Buy Used

Condition: Fine
Zustand: Sehr gut | Seiten: 332...
View this item

£ 7.71 shipping from Germany to United Kingdom

Destination, rates & speeds

Other Popular Editions of the Same Title

9783319355603: Fault-Tolerance Techniques for High-Performance Computing (Computer Communications and Networks)

Featured Edition

ISBN 10:  3319355600 ISBN 13:  9783319355603
Publisher: Springer, 2016
Softcover

Search results for Fault-Tolerance Techniques for High-Performance Computing...

Stock Image

Unbekannt
ISBN 10: 3319209426 ISBN 13: 9783319209425
Used Hardcover

Seller: Buchpark, Trebbin, Germany

Seller rating 5 out of 5 stars 5-star rating, Learn more about seller ratings

Condition: Sehr gut. Zustand: Sehr gut | Seiten: 332 | Sprache: Englisch | Produktart: Bücher. Seller Inventory # 25708812/12

Contact seller

Buy Used

£ 11.25
Convert currency
Shipping: £ 7.71
From Germany to United Kingdom
Destination, rates & speeds

Quantity: 1 available

Add to basket

Stock Image

Herault, Thomas; Robert, Yves
Published by Cham, Springer., 2015
ISBN 10: 3319209426 ISBN 13: 9783319209425
Used Hardcover

Seller: Universitätsbuchhandlung Herta Hold GmbH, Berlin, Germany

Seller rating 5 out of 5 stars 5-star rating, Learn more about seller ratings

ix, 320p. Hardcover. Versand aus Deutschland / We dispatch from Germany via Air Mail. Einband bestoßen, daher Mängelexemplar gestempelt, sonst sehr guter Zustand. Imperfect copy due to slightly bumped cover, apart from this in very good condition. Stamped. Stamped. Computer Communications and Networks. Sprache: Englisch. Seller Inventory # 4823IB

Contact seller

Buy Used

£ 12.50
Convert currency
Shipping: £ 10.40
From Germany to United Kingdom
Destination, rates & speeds

Quantity: 2 available

Add to basket

Seller Image

Hérault, Thomas (EDT); Robert, Yves (EDT)
Published by Springer, 2015
ISBN 10: 3319209426 ISBN 13: 9783319209425
New Hardcover

Seller: GreatBookPricesUK, Woodford Green, United Kingdom

Seller rating 5 out of 5 stars 5-star rating, Learn more about seller ratings

Condition: New. Seller Inventory # 23922726-n

Contact seller

Buy New

£ 97.09
Convert currency
Shipping: FREE
Within United Kingdom
Destination, rates & speeds

Quantity: Over 20 available

Add to basket

Stock Image

Published by Springer, 2015
ISBN 10: 3319209426 ISBN 13: 9783319209425
New Hardcover

Seller: Majestic Books, Hounslow, United Kingdom

Seller rating 5 out of 5 stars 5-star rating, Learn more about seller ratings

Condition: New. pp. 320. Seller Inventory # 374278503

Contact seller

Buy New

£ 93.75
Convert currency
Shipping: £ 3.35
Within United Kingdom
Destination, rates & speeds

Quantity: 1 available

Add to basket

Stock Image

Published by Springer, 2015
ISBN 10: 3319209426 ISBN 13: 9783319209425
New Hardcover

Seller: Ria Christie Collections, Uxbridge, United Kingdom

Seller rating 5 out of 5 stars 5-star rating, Learn more about seller ratings

Condition: New. In. Seller Inventory # ria9783319209425_new

Contact seller

Buy New

£ 97.62
Convert currency
Shipping: FREE
Within United Kingdom
Destination, rates & speeds

Quantity: Over 20 available

Add to basket

Stock Image

Published by Springer, 2015
ISBN 10: 3319209426 ISBN 13: 9783319209425
New Hardcover

Seller: Books Puddle, New York, NY, U.S.A.

Seller rating 4 out of 5 stars 4-star rating, Learn more about seller ratings

Condition: New. pp. 320. Seller Inventory # 26372815544

Contact seller

Buy New

£ 92.03
Convert currency
Shipping: £ 6.67
From U.S.A. to United Kingdom
Destination, rates & speeds

Quantity: 1 available

Add to basket

Seller Image

Hérault, Thomas|Robert, Yves
ISBN 10: 3319209426 ISBN 13: 9783319209425
New Hardcover
Print on Demand

Seller: moluna, Greven, Germany

Seller rating 5 out of 5 stars 5-star rating, Learn more about seller ratings

Condition: New. Dieser Artikel ist ein Print on Demand Artikel und wird nach Ihrer Bestellung fuer Sie gedruckt. The first complete overview of this increasingly important fieldPresents a unique, rigorous approach based on the design of analytical models to predict performanceProvides a coherent collection of valuable insights from internationally-renown. Seller Inventory # 31406393

Contact seller

Buy New

£ 82.37
Convert currency
Shipping: £ 21.66
From Germany to United Kingdom
Destination, rates & speeds

Quantity: Over 20 available

Add to basket

Seller Image

Yves Robert
ISBN 10: 3319209426 ISBN 13: 9783319209425
New Hardcover
Print on Demand

Seller: BuchWeltWeit Ludwig Meier e.K., Bergisch Gladbach, Germany

Seller rating 5 out of 5 stars 5-star rating, Learn more about seller ratings

Buch. Condition: Neu. This item is printed on demand - it takes 3-4 days longer - Neuware -This timely text presents a comprehensive overview of fault tolerance techniques for high-performance computing (HPC). The text opens with a detailed introduction to the concepts of checkpoint protocols and scheduling algorithms, prediction, replication, silent error detection and correction, together with some application-specific techniques such as ABFT. Emphasis is placed on analytical performance models. This is then followed by a review of general-purpose techniques, including several checkpoint and rollback recovery protocols. Relevant execution scenarios are also evaluated and compared through quantitative models. Features: provides a survey of resilience methods and performance models; examines the various sources for errors and faults in large-scale systems; reviews the spectrum of techniques that can be applied to design a fault-tolerant MPI; investigates different approaches to replication; discusses the challenge of energy consumption of fault-tolerance methods in extreme-scale systems. 332 pp. Englisch. Seller Inventory # 9783319209425

Contact seller

Buy New

£ 95.51
Convert currency
Shipping: £ 9.53
From Germany to United Kingdom
Destination, rates & speeds

Quantity: 2 available

Add to basket

Stock Image

Published by Springer, 2015
ISBN 10: 3319209426 ISBN 13: 9783319209425
New Hardcover

Seller: Biblios, Frankfurt am main, HESSE, Germany

Seller rating 5 out of 5 stars 5-star rating, Learn more about seller ratings

Condition: New. pp. 320. Seller Inventory # 18372815538

Contact seller

Buy New

£ 100.27
Convert currency
Shipping: £ 6.89
From Germany to United Kingdom
Destination, rates & speeds

Quantity: 1 available

Add to basket

Seller Image

Yves Robert
ISBN 10: 3319209426 ISBN 13: 9783319209425
New Hardcover

Seller: AHA-BUCH GmbH, Einbeck, Germany

Seller rating 5 out of 5 stars 5-star rating, Learn more about seller ratings

Buch. Condition: Neu. Druck auf Anfrage Neuware - Printed after ordering - This timely text presents a comprehensive overview of fault tolerance techniques for high-performance computing (HPC). The text opens with a detailed introduction to the concepts of checkpoint protocols and scheduling algorithms, prediction, replication, silent error detection and correction, together with some application-specific techniques such as ABFT. Emphasis is placed on analytical performance models. This is then followed by a review of general-purpose techniques, including several checkpoint and rollback recovery protocols. Relevant execution scenarios are also evaluated and compared through quantitative models. Features: provides a survey of resilience methods and performance models; examines the various sources for errors and faults in large-scale systems; reviews the spectrum of techniques that can be applied to design a fault-tolerant MPI; investigates different approaches to replication; discusses the challenge of energy consumption of fault-tolerance methods in extreme-scale systems. Seller Inventory # 9783319209425

Contact seller

Buy New

£ 95.51
Convert currency
Shipping: £ 12.12
From Germany to United Kingdom
Destination, rates & speeds

Quantity: 1 available

Add to basket

There are 6 more copies of this book

View all search results for this book