Abstract
While traditional parallel computing systems are still struggling to gain a wider acceptance, the largest parallel computer that has ever been available is currently growing with the communication resource Internet. Unfortunately it is also rarely used in the parallel computation field. The reason for the rejection of parallel computers is mainly the difficulty of parallel programming. In this paper we propose the Self Distributing Associative ARChitecture (SDAARC). It has been derived from the Cache Only Memory Architecture (COMA). COMAs provide a distributed shared memory (DSM) with automatic distribution of data. We show how this paradigm of data distribution can be extended to the automatic distribution of instruction sequences (microthreads). We show how microthreads can be extracted from legacy C code to produce code that can automatically be parallelized by SDAARC at run time. We also discuss how SDAARC can be implemented on a tightly coupled multiprocessor system, on heterogenous LAN based computer networks (Intranet) and on WANs of computing resources.
References (23)
- M. Annavaram and W. A. Najjar. Comparison of two stor- age models in data-driven multithreaded architectures. In Eighth IEEE Symposium on Parallel and Distributed Pro- cessing (SPDP) [9], pages 122-129.
- R. D. Blumofe, C. F. Joerg, B. C. Kuszmaul, C. E. Leiserson, K. H. Randall, and Y. Zhou. Cilk: An efficient multithreaded runtime system. Journal of Parallel and Distributed Computing, 37(1):55-69, Aug. 1996. <http://www.cs.utexas.edu/users/ rdb/papers/JPDC96.ps.gz> .
- D. E. Culler, S. C. Goldstein, K. E. Schauser, and T. von Eicken. TAM -A compiler controlled Threaded Abstract Machine. In Journal of Parallel and Distributed Computing, Special Issue on Dataflow, June 1993.
- I. Foster and C. Kesselman, editors. The Grid: Blueprint for a New Computing Infrastructure. Morgan Kaufmann Pub- lishers, Inc., 1999.
- J. Garcia, E. Ayguadé, and J. Labarta. A framework for au- tomatic dynamic data mapping. In Eighth IEEE Symposium on Parallel and Distributed Processing (SPDP) [9], pages 92-99.
- E. Hagersten, A. Landin, and S. Haridi. DDM -A Cache- Only Memory Architecture. IEEE Computer, 25(9):44-54, 1992.
- M. W. Hall, J. M. Anderson, S. P. Amarasinghe, B. R. Mur- phy, S.-W. Liao, E. Bugnion, and M. S. Lam. Maximizing multiprocessor performance with the SUIF compiler. IEEE Computer, Dec. 1996.
- S. Haridi and E. Hagersten. The cache coherence protocol of the Data Diffusion Machine. In Proceedings of the PARLE 89, volume 1, pages 1-18. Springer-Verlag, 1989.
- IEEE. Eighth IEEE Symposium on Parallel and Distributed Processing (SPDP), New Orleans, LA, Oct. 1996. IEEE Computer Society Press.
- Y. W. Lim, P. B. Bhat, and V. K. Prasanna. Efficient algo- rithms for block-cyclic redistribution of arrays. In Eighth IEEE Symposium on Parallel and Distributed Processing (SPDP) [9], pages 92-99.
- M. Livny and R. Raman. High-throughput resource man- agement. In Foster and Kesselman [4], chapter 13, pages 311-337.
- R. Moore, M. Klang, B. Klauer, and K. Waldschmidt. Combining static partitioning with dynamic distribution of threads. In Rammig [20], pages 85-96. <http: //www.ti.informatik.uni-frankfurt.de/ Papers/Adarc/dipes98.pdf> .
- R. Moore, B. Klauer, and K. Waldschmidt. Au- tomatic scheduling for Cache Only Memory Ar- chitectures. In Third International Conference on Massively Parallel Computing Systems (MPCS '98), Colorado Springs, Colorado, Apr. 1998. <http: //www.ti.informatik.uni-frankfurt.de/ Papers/Adarc/colosprings.pdf> .
- R. Moore, B. Klauer, and K. Waldschmidt. A combined virtual shared memory and network which sched- ules. International Journal of Parallel and Distributed Systems and Networks, 1(2):51-56, 1998. <http: //www.ti.informatik.uni-frankfurt.de/ Papers/Adarc/barcelona.pdf> .
- R. Moore, B. Klauer, and K. Waldschmidt. Tailoring a self- distributing architecture to a cluster computer environment. In 8th Euromicro Workshop on Parallel and Distributed Processing (EURO-PDP 2000), Rhodes, Greece, Jan. 2000. IEEE, IEEE Computer Society Press. <http: //www.ti.informatik.uni-frankfurt.de/ Papers/Adarc/rhodos.pdf> .
- H. L. Mueller, P. W. A. Stallard, and D. H. D. Warren. Hid- ing miss latencies with multithreading on the Data Diffusion Machine. In Proceedings of the 1995 International Con- ference on Parallel Processing, ICPP'95, volume 1, pages 178-185, Oconomowoc, WI, Aug. 1995. CRC Press.
- C. Newman. Security, accounting, and assurance. In Foster and Kesselman [4], chapter 16, pages 395-420.
- R. S. Nikhil. A multithreaded implementation of Id using P-RISC graphs. In Proceedings of the Sixth Annual Work- shop on Languages and Compilers for Parallel Computing, pages 390-405, Portland, Oregon, Aug. 1993. Springer Ver- lag LNCS 768.
- D. A. Patterson. Microprocessors in 2020. Scientific Amer- ican Special Issue: The Solid-State Century, 8(1):86-88, 1997.
- F. J. Rammig, editor. Distributed and Parallel Embedded Systems, IFIP WG10.3/WG10.5 International Workshop on Distributed and Parallel Embedded Systems (DIPES '98). Kluwer Academic Publishers, 1999.
- D. Sima, T. Fountain, and P. Kacsuk. Advanced Com- puter Architectures, A Design Space Approach, section 18.8 Cache-only memory acess (COMA) machines. Addison- Wesley, 1997.
- P. Stenström, T. Joe, and A. Gupta. Comparative perfor- mance evaluation of cache-coherent NUMA and COMA ar- chitectures. In Proceedings of the 19th Annual International Symposium on Computer Architecture, pages 80-91, Gold Coast, Australia, May 1992.
- D. H. D. Warren and S. Haridi. The Data Diffusion Machine -a scalable shared virtual memory multiprocessor. In Pro- ceedings of the 1988 International Conference on Fifth Gen- eration Computer Systems, pages 943-952, Tokyo, Japan, Dec. 1988.