Academia.eduAcademia.edu

Outline

A Distributed System with a Centralized Organization

Abstract

One of the most important potential benefits of workstation clusters (client/server computing systems) is resource sharing. By interconnecting a number of workstations using a suitable network, a large number of hardware and software resources can be made available to users. In comparison with centralized computer systems (mainframes) and shared memory multiprocessors systems, the availability of such resources can be achieved at a fraction of the costs entailed by mainframe solutions, since workstations allow the use of standard components. Another advantage of workstation clusters is their scalability and flexibility towards an upgrade or extension with some other components. The concept of a mainframe system is sharing centralized resources among all users by switching between users' processes (time sharing). The main advantage of such systems is that they are usually easy to administrate. In contrast to this concept, we will introduce a distributed system that does not only offer time sharing, but also sharing of computing nodes (workstations) according to the users' application requirements (workstation sharing). Nevertheless, the organization of that system remains centralized in order to reduce the expense for management and maintenance. In this paper, we will explain the architecture and mechanism of a transparent system that consists of some Unix workstations running a network file system (NFS).

References (21)

  1. Douglis F., Ousterhout J.: "Transparent process migration: Design alternatives and the Sprite implimentation", Software Practice and Experience, 21(8), pp. 757-785, August 1991.
  2. A. Bricker; M. Litzkow; M. Livny: "Condor Technical Summary", Technical Report 1096, Computer Science Department, University of Wisconsin-Madison, January 1992.
  3. Epema; Livny; van Dantzig; Evers; Pruyne: "A worldwide flock of Condors: load sharing among workstation clusters", Journal on Future Generations of Computer Systems, Volume 12, 1996.
  4. M.J. Litzkow and M.Livny: "Experience with the CONDOR Distributed Batch System", Proc. of the IEEE Workshop on Experimental Distributed Systems, Huntsville, AL, USA, 1990.
  5. S. Zhou, J. Wang, X. Zheng, P. Delisle: "UTOPIA: A Load Sharing Facility for Large, Heterogeneous Distributed Computer Systems", Computer Systems Research Institute, University of Toronto, Technical Report CSRI-257, April 1992.
  6. Platform Computing Corporation, "LSF User's Guide", Toronto, Canada, Third Edition, February 1996.
  7. Tanenbaum A.S., Kaashoek M.F., Renesse R. van, Bal H.E.; "The Amoeba Distributed Operating System -A Status Report", Computer Communications,Vol. 14, July/Aug., 1991, pp. 324-335.
  8. Tanenbaum A.S.: "The Amoeba Distributed Operating System", Department of Mathematics and Computer Science, Vrije Universiteit, The Netherlands, 1994.
  9. Tanenbaum A.S.: "Modern Operating System", Prentice Hall, 1992.
  10. Tanenbaum A.S.: "Distributed Operating System", Prentice Hall, 1995.
  11. Douglis F., Ousterhout, J.K., Kaashoek, M.F., and Tanenbaum, A.S.: "A Comparison of two Distributed Systems: Amoeba and Sprite", Computing Systems Journal Vol. 4, pp. 353-384, Fall 1991.
  12. Douglis F.: "Transparent Process Migration in the Sprite Operating System", Ph.D. Thesis, University of California (Berkeley) , Computer Science Division, September 1990.
  13. J. S. Kaplan, M. L. Nelson: "A Comparison of Queuing, Cluster and Distributed Computing Systems", NASA Langley Research Center, Technical Report, June 1994.
  14. M. A. Baker, G. C. Fox, H. W. Yau: "A Review of Commercial and Research Cluster Management Software", Northeast Parallel Architectures Center, Syracuse University, Version 1.2, June 1996.
  15. L. H. Turcotte.: "A Survey of Software Environments for Exploiting Networked Computing Resources", MSSU-EIRS-ERC-93-2, Engineering Research Center for Computational field Simulation, Mississippi State University, February 1993.
  16. Cheriton D.R.: "The V Distributed System", Communication of the ACM, 31(3)314-333, March 1988.
  17. Barak A., Guday S., and Wheeler R.G.: "The MOSIX Distributed Operating System load balancing for UNIX", Lecture Notes in Computer Science Vol. 672, Springer-Verlag, 1993.
  18. Barak A., Braverman A., Gilderman L., and La'adan O.: "The MOSIX Multicomputer Operating System for Scalable NOW and its Dynamic Resource Sharing Algorithms", Insititute of Computer Science, The Hebrew University of Jerusalem, Report 96-11, July 1996.
  19. T. L. Casavant and J. G. Kuhl.: "A taxonomy of scheduling in general-purpose distributed computing systems", IEEE Transaction on Software Engineering, SE-14:141-154, 1988.
  20. R. Lüling, B. Monien, F. Ramme.: "Load Balancing in Large Networks: A Comparative Study", 3rd IEE Symposium on Parallel and Distributed Processing 1991, pp. 686-689.
  21. Anna Hac.: "Load Balancing in Distributed Systems: A Summary", Performance Evaluation Review Vol. 16, 2-4, February 1989.