Academia.eduAcademia.edu

Outline

Asynchronous Bypass Channel Routers

cegroup.ece.tamu.edu

Abstract

Network-on-Chip (NoC) designs have emerged as a replacement for traditional shared-bus designs for on-chip communications. Typically, these systems require fully balanced clock distribution trees to enable synchronous communication between all nodes on-chip, resulting in higher power consumption. One approach to reduce power consumption is to replace the balanced clock tree with a globally-asynchronous, locally-synchronous (GALS) mesochronous clocking scheme. NoCs implemented with a GALS clocking scheme, however, tend to have high latencies as packets must be synchronized at every hop between source and destination. In this paper, we propose a novel router microarchitecture for GALS NoCs which offers superior performance versus typical synchronizing router designs. Our approach features Asynchronous Bypass Channels (ABCs) at intermediate nodes thus avoiding synchronization delay. We also propose a new network topology that leverages the advantages of the bypass channel offered by our router design. Our experiments show that our design improves the performance of a conventional synchronizing design with similar resources by up to 26% at low loads and increases saturation throughput by up to 11%.

References (10)

  1. T. Meincke, A. Hemani, S. Kumar, P. Ellervee, J. Oberg, T. Ols- son, P. Nilsson, D. Lindqvist, and H. Tenhunen, "Globally asynchronous locally synchronous architecture for large high- performance ASICs," in The IEEE International Symposium on Circuits and Systems (ISCAS), vol. 2, pp. 512-515, 1999.
  2. P. Gratz, C. Kim, R. Mcdonald, S. Keckler, and D. Burger, "Imple- mentation and Evaluation of On-Chip Network Architectures," in International Conference on Computer Design, pp. 477-484, 2006.
  3. I. Panades and A. Greiner, "Bi-Synchronous FIFO for Synchronous Circuit Communication Well Suited for Network-on-Chip in GALS Architectures," in The First International Symposium on Networks-on-Chip (NOCS), pp. 83-94, 2007.
  4. W. Dally and J. Poulton, Digital systems engineering. Cambridge Univ Pr, 1998.
  5. D. Mangano, R. Locatelli, A. Scandurra, C. Pistritto, M. Coppola, L. Fanucci, F. Vitullo, and D. Zandri, "Skew insensitive physical links for network on chip," in The 1st International Conference on Nano-Networks and Workshops (NanoNet), pp. 1-5, 2006.
  6. W. Dally, "Express cubes: Improving the performance of k-ary n- cube interconnection networks," IEEE Transactions on Computers, vol. 40, no. 9, pp. 1016-1023, 1991.
  7. J. Kim, W. Dally, B. Towles, and A. Gupta, "Microarchitecture of a high-radix router," ACM SIGARCH Computer Architecture News, vol. 33, no. 2, pp. 420-431, 2005.
  8. B. Grot, J. Hestness, S. Keckler, and O. Mutlu, "Express Cube Topologies for on-Chip Interconnects," in The 15th International Symposium on High Performance Computer Architecture (HPCA), pp. 163-174, Feb. 2009.
  9. A. Kumar, L. Peh, P. Kundu, and N. Jha, "Express virtual channels: towards the ideal interconnection fabric," in The 34th annual International Symposium on Computer Architecture (ISCA), pp. 150- 161, 2007.
  10. W. Dally and B. Towles, Principles and practices of interconnection networks. Morgan Kaufmann, 2004.