Academia.eduAcademia.edu

Outline

Low-Overhead Core Swapping for Thermal Management

2005, Lecture Notes in Computer Science

Abstract

Technology scaling trends and the limitations of packaging and cooling have intensified the need for thermally efficient architectures and architecture-level temperature management techniques. To combat these trends, we evaluate the thermal efficiency of the microcore architecture, a deeply decoupled processor core with larger structures factored out as helper engines. We further investigate activity migration (core swapping) as a means of controlling the thermal profile of the chip in this study. Specifically, the microcore architecture presents an ideal platform for core swapping thanks to helper engines that maintain the state of each process in a shared fabric surrounding the cores. This results in significantly reduced migration overhead, enabling seamless swapping of cores. Our results show that our thermal mechanisms outperform traditional Dynamic Thermal Management (DTM) techniques by reducing the performance hit caused by slowing/swapping of cores. Our experimental results show that the microcore architecture has 86% fewer thermally critical cycles compared to a conventional monolithic core.

References (27)

  1. In International Technology Roadmap for Semiconductors, 2003.
  2. A. Ajami, K. Banerjee, M. Pedram, and L. van Ginneken. Analysis of non-uniform temperature-dependent interconnect performance in high performance ics. In 41st Design Automation Conference, pages 567-572, June 2001.
  3. R. Balasubramonian, S. Dwarkadas, and D. Albonesi. Reducing the complexity of the register file in dynamic superscalar processors. In Proceedings of the 34th Annual International Symposium on Microarchitecture, December 2001.
  4. D. Brooks and M. Martonosi. Adaptive thermal management for high-performance microprocessors. In Workshop on Complexity Effective Design, June 2000.
  5. D. Brooks, V. Tiwari, and M. Martonosi. Wattch: A framework for architectural- level power analysis and optimization. In 27th Annual International Symposium on Computer Architecture, pages 83-94, June 2000.
  6. D. C. Burger and T. M. Austin. The simplescalar tool set, version 2.0. Technical Report CS-TR-97-1342, U. of Wisconsin, Madison, June 1997.
  7. J.A. Butts and G.S. Sohi. A static power model for architects. In 27th Annual International Symposium on Computer Architecture, pages 191-201, June 2000.
  8. D.Brooks and M.Martonosi. Dynamic thermal management for high-performance microprocessors. In International Symposium on High-Performance Computer Ar- chitecture (HPCA-7), pages 171-182, January 2001.
  9. M. Franklin and G. S. Sohi. Arb: A hardware mechanism for dynamic reordering of memory references. IEEE Transactions on Computers, 46(5), May 1996.
  10. S. Gunther, F. Binns, D. Carmean, and J. Hall. Managing the impact of increasing microprocessor power consumption. In Intel Technology Journal Q1, 2001.
  11. S. Heo, K. Barr, and K. Asanovic. Reducing power density through activity migra- tion. In International Symposium on Low Power Electronics and Design, August 2003.
  12. G. Hinton, D. Sager, M. Upton, D. Boggs, D. Carmean, A. Kyker, and P. Roussel. The microarchitecture of the pentium 4 processor. Intel Technology Journal Q1, 2001.
  13. N. Jouppi. Improving direct-mapped cache performance by the addition of a small fully associative cache and prefetch buffers. In Proceedings of the 17th Annual International Symposium on Computer Architecture, May 1990.
  14. K.Skadron, M.Stan, W. Huang, S.Velusamy, K. Sankaranarayanan, and D. Tarjan. Temperature-aware microarchitecture. In 30th Annual International Symposium on Computer Architecture, pages 2-13, June 2003.
  15. C-H. Lim, W. Daasch, and G.Cai. A thermal-aware superscalar microprocessor. In International Symposium on Quality Electronic Design, pages 517-522, March 2002.
  16. L.T.Yeh and R.Chu. Thermal management of microelectronic equipment. In Amer- ican Society of Mechanical Engineers -ISBN:0791801683, 2001.
  17. G. Reinman, T. Austin, and B. Calder. A scalable front-end architecture for fast instruction delivery. In 26th Annual International Symposium on Computer Ar- chitecture, May 1999.
  18. A. Shayesteh, E. Kursun, S. Sair, T. Sherwood, and G. Reinman. An evaluation of deeply decoupled cores. In University of California Los Angeles Tech Report CS-2004-09, 2004.
  19. T. Sherwood, E. Perelman, G. Hamerly, and B. Calder. Automatically charac- terizing large scale program behavior. In Proceedings of the 10th International Conference on Architectural Support for Programming Languages and Operating Systems, October 2002.
  20. T. Sherwood, S. Sair, and B. Calder. Predictor-directed stream buffers. In 33rd International Symposium on Microarchitecture, December 2000.
  21. P. Shivakumar and Norman P. Jouppi. Cacti 3.0: An integrated cache timing, power, and area model. In Technical Report, 2001.
  22. J. E. Smith. Instruction-level distributed processing. IEEE Computer, 34(4):59-65, April 2001.
  23. R. Viswanath, V. Wakharkar, A. Wathe, and V.Lebonheur. Thermal performance challenges from silicon to systems. In Intel Technology Journal Q3, 2000.
  24. K. Wang and M. Franklin. Highly accurate data value prediction using hybrid predictors. In 30th Annual International Symposium on Microarchitecture, pages 281-290, December 1997.
  25. W.Huang, J.Renau, S-M.Yoo, and J. Torrellas. A framework for dynamic energy effiency and temperature management. In 33rd International Symposium on Mi- croarchitecture, pages 202-213, December 2000.
  26. T. Yeh and Y. Patt. A comprehensive instruction fetch mechanism for a processor supporting speculative execution. In Proceedings of the 25th Annual International Symposium on Microarchitecture, pages 129-139, December 1992.
  27. Y. Zhang, D. Parikh, K. Sankaranarayanan, K. Skadron, and M. Stan. Hotleakage: A temperature-aware model of subthreshold and gate leakage for architects. In University of Virginia Dept of Computer Science Tech Report CS-2003-05, March 2003.