Design Principles for End-to-End Multicore Schedulers

2010, Proceedings of the …

Abstract

As personal computing devices become increasingly parallel multiprocessors, the requirements for operating system schedulers change considerably. Future general-purpose machines will need to handle a dynamic, bursty, and interactive mix of parallel programs sharing a ...
