Academia.eduAcademia.edu

Outline

From Static Distributed Systems to Dynamic Systems

2005, 24th IEEE Symposium on Reliable Distributed Systems (SRDS'05)

Abstract

A noteworthy advance in distributed computing is due to the recent development of peer-to-peer systems. These systems are essentially dynamic in the sense that no process can get a global knowledge on the system structure. They mainly allow processes to look up for data that can be dynamically added/suppressed in a permanently evolving set of nodes. Although protocols have been developed for such dynamic systems, to our knowledge, up to date no computation model for dynamic systems has been proposed. Nevertheless, there is a strong demand for the definition of such models as soon as one wants to develop provably correct protocols suited to dynamic systems. This paper proposes a model for (a class of) dynamic systems. That dynamic model is defined by (1) a parameter (an integer denoted) and (2) two basic communication abstractions (query-response and persistent reliable broadcast). The new parameter is a threshold value introduced to capture the liveness part of the system (it is the counterpart of the minimal number of processes that do not crash in a static system). To show the relevance of the model, the paper adapts an eventual leader protocol designed for the static model, and proves that the resulting protocol is correct within the proposed dynamic model. In that sense, the paper has also a methodological flavor, as it shows that simple modifications to existing protocols can allow them to work in dynamic systems.

References (24)

  1. Aguilera M.K., A Pleasant Stroll Through the Land of Infinitely Many Creatures. ACM SIGACT News, Distributed Computing Column, 35(2):36-59, 2004.
  2. Aguilera M.K., Delporte-Gallet C., Fauconnier H. and Toueg S., Communication-Efficient Leader Election and Consensus with Limited Link Synchrony. Proc. 23th ACM Symposium on Principles of Distributed Computing (PODC'04), ACM Press, pp. 328-337, St. John's, Newfoundland (Canada), 2004.
  3. Angluin D., Local and Global Properties in Networks of Processes. Proc. 12th ACM Symposium on Theory of Computing (STOC'80), ACM Press, pp. 82-93, Los Angeles (CA), 1980.
  4. Chandra T.D. and Toueg S., Unreliable Failure Detectors for Reliable Distributed Systems. Journal of the ACM, 43(2):225-267, 1996.
  5. Chandra T.D., Hadzilacos V. and Toueg S., The Weakest Failure Detector for Solving Consensus. Journal of the ACM, 43(4):685-722, 1996.
  6. Chandy K.M. and Misra J., How Processes Learn. Distributed Computing, 1(1):40-52, 1986.
  7. Chawathe Y., Ratnasamy S., Breslau L., Nick Lanham, Shenker S., Making Gnutella-like P2P Systems Scalable. Proc. of the 2003 Conf. On Applications, Technologies, Architectures, and Protocols for Computer Communications (SIGCOMM'03), pp. 407-418.
  8. Fischer M.J., Lynch N. and Paterson M.S., Impossibility of Distributed Consensus with One Faulty Process. Journal of the ACM, 32(2):374- 382, 1985.
  9. Flocchini P., Mans B. and Santoro N., Sense of Direction in Distributed Computing. Proc. 12th Int'l Symposium on Distributed Computing (DISC'98), Springer-Verlag LNCS #1499, pp. 1-16, 1998.
  10. Friedman R., Raynal M. and Travers C., Two Abstractions for Implementing Atomic Objects in Dynamic Systems. Tech Report #1692, IRISA, Université de Rennes (France), 2005. (http://www.irisa.fr/bibli/publi/pi/2005/1692/1692.html). Brief Announcement, Proc. 24th ACM Symposium on Principles of Distributed Computing (PODC'05), ACM Press, July 2005.
  11. Gummadi K., Dunn R., Saroiu S., Gribble S., Levy H., and Zahorjan J., Measurement, Modeling, and Analysis of a Peer-to-Peer File-Sharing Workload, Proceedings of the 19th ACM Symposium on Operating Systems Principles (SOSP-19), pp. 381-394, October 2003.
  12. Guerraoui R. and Raynal M., The Information Structure of Indulgent Consensus. IEEE Transactions on Computers, 53(4):453-466, April 2004.
  13. Hadzilacos V. and Toueg S., Reliable Broadcast and Related Problems. In Distributed Systems, ACM Press (S. Mullender Ed.), New-York, pp. 97-145, 1993.
  14. Lamport, L., Time, Clocks and the Ordering of Events in a Distributed System. Communications of the ACM, 21(7):558-565, 1978.
  15. Lamport L., The Part-Time Parliament. ACM Transactions on Computer Systems, 16(2):133-169, 1998. PI n ˚1713 [16] Larrea M., Fernández A. and Arèvalo S., Optimal Implementation of the Weakest Failure Detector for Solving Consensus. Proc. 19th Symposium on Reliable Distributed Systems (SRDS'00), IEEE Computer Society Press, pp. 52-60, Nürnberg (Germany), 2000.
  16. Merritt M. and Taubenfeld G., Computing Using Infinitely Many Processes. Proc. 14th Int'l Symposium on Distributed Computing (DIS- C'00), Springer-Verlag LNCS #1914, pp. 164-178, 2000.
  17. Mostefaoui A., Mourgaya E., and Raynal M., Asynchronous Implementation of Failure Detectors. Proc. Int. IEEE Conference on Dependable Systems and Networks (DSN'03), IEEE Computer Society Press, pp. 351-360, San Francisco (CA), 2003.
  18. Mostefaoui A. and Raynal M., Leader-Based Consensus. Parallel Processing Letters, 11(1):95-107, 2001.
  19. Mostefaoui A., Raynal M. and Travers C., Crash-Resilient Time-Free Eventual Leadership. Proc. 23th Symposium on Reliable Distributed Systems (SRDS'04), IEEE Computer Society Press, pp. 208-217, Florianõpolis (Brazil), 2004.
  20. Raynal M., A Short Introduction to Failure Detectors for Asynchronous Distributed Systems. ACM SIGACT News, Distributed Computing Column, 36(1):53-70, 2005.
  21. Rowstron A. and Druschel P., Pastry: Scalable, Distributed Object Location and Routing for Large Scale Peer-to-Peer Systems. Proc.18th IFIP/ACM International Conference on Distributed Systems Platforms (Middleware 2001), Springer-Verlag LNCS #2218, pp. 329-350, 2001.
  22. Stoica I., Morris R., Liben-Nowell D., Karger D., Kaashoek M.F., Dabek F. and Balakrishnan H., Chord: A Scalable Peer-to-Peer Lookup Protocol for Internet Applications. ACM/IEEE Transactions on Networking, 11(1):17-32, 2003.
  23. Zhao B., Kubiatowicz J. and Joseph A., Tapestry: An Infrastructure for Fault-Tolerant Wide-area Location and Routing. Technical Report UCB/CSD-01-1141, Computer Science Division, U.C. Berkeley, 2001.
  24. Irisa