Guide to Reliable Distributed Systems

Building High-Assurance Applications and Cloud-Hosted Services

By Kenneth P. Birman

This book describes the key concepts, principles and implementation options for creating high-assurance cloud computing solutions, beginning with a broad technical overview, and moving through the challenges of reliability and fault-tolerance.

  • ISBN13: 978-1-4471-2415-3
  • 752 Pages
  • User Level: Students
  • Publication Date: January 14, 2012
  • Available eBook Formats: PDF
  • eBook Price: $89.95
Full Description
This book describes the key concepts, principles and implementation options for creating high-assurance cloud computing solutions. The guide starts with a broad technical overview and basic introduction to cloud computing, looking at the overall architecture of the cloud, client systems, the modern Internet and cloud computing data centers. It then delves into the core challenges, showing how reliability and fault tolerance can be abstracted, how the resulting questions can be solved, and how the solutions can be leveraged to create a wide range of practical cloud applications. The author’s style is practical, and the guide should be readily understandable without any special background. Concrete examples are often drawn from real-world settings to illustrate key insights. Appendices show how the most important reliability models can be formalized, describe the API of the Isis2 platform, and offer more than 80 problems at varying levels of difficulty.
Table of Contents

  1. Part I: Computing in the Cloud.
  2. The Way of the Cloud.
  3. Client Perspective.
  4. Network Perspective.
  5. The Structure of Cloud Data Centers.
  6. Remote Procedure Calls and the Client/Server Model.
  7. CORBA: The Common Object Request Broker Architecture.
  8. System Support for Fast Client/Server Communication.
  9. Part II: Reliable Distributed Computing.
  10. How and Why Computer Systems Fail.
  11. Overcoming Failures in a Distributed System.
  12. Dynamic Membership.
  13. Group Communication Systems.
  14. Point to Point and Multi-group Considerations.
  15. The Virtual Synchrony Execution Model.
  16. Consistency in Distributed Systems.
  17. Part III: Applications of Reliability Techniques.
  18. Retrofitting Reliability into Complex Systems.
  19. Software Architectures for Group Communication.
  20. Part IV: Related Technologies.
  21. Security Options for Distributed Settings.
  22. Clock Synchronization and Synchronous Systems.
  23. Transactional Systems.
  24. Peer-to-Peer Systems and Probabilistic Protocols.
  25. Appendix A: Virtually Synchronous Methodology For Building Dynamic Reliable Services.
  26. Appendix B: Isis2 API.
  27. Appendix C: Problems.
Errata

No errata are currently published