Academia.eduAcademia.edu

Outline

Exploratory data analysis for data center energy management

Proceedings of the Thirteenth ACM International Conference on Future Energy Systems

https://doi.org/10.1145/3538637.3539654

Abstract

The continuous improvement in energy efficiency of existing data centers would help reduce their environmental footprints. Greening of Data Centers could be attained using renewable energy sources or more energy efficient compute systems and effective cooling systems. A reliable cooling system is necessary to generate a persistent flow of cold air to cool servers that are subjected to increasing computational load demand. As a matter of fact, servers' dissipated heat effects a strain on the cooling systems and consequently, on electricity consumption. Generated heat in the data center is categorized into different granularity levels namely: server level, rack level, room level, and data center level. Several datasets are collected at ENEA Portici Data Center from CRESCO 6 cluster-a High-Performance Computing Cluster. The cooling and environmental aspects of the data center is also considered for data analysis. This research aims to conduct a rigorous exploratory data analysis on each dataset separately and collectively followed in various stages. This work presents descriptive and inferential analyses for feature selection and extraction process. Furthermore, a supervised Machine learning modelling and correlation estimation is performed on all the datasets to abstract relevant features. that would have an impact on energy efficiency in data centers.

References (20)

  1. Jonathan G Koomey. 2008. Worldwide electricity used in data centers. Environmental Research Letters 3, 3 (2008), 034008. DOI: https://doi.org/10.1088/1748
  2. Joonho K., Sung Woo C., and Kevin S.. 2012. Recent thermal management techniques for microprocessors. ACM Comput. Surv. 44, 3, Article 13 (June 2012), 42 pages. https://doi.org/10.1145/2187671.2187675.
  3. Tobias Van Damme, Claudio De Persis, and Pietro Tesi. 2019. Optimized Thermal-Aware Job Scheduling and Control of Data Centers. IEEE Transactions on Control Systems Technology 27, 2 (2019), 760-771. DOI: https://doi.org/10.1109/tcst.2017.2783366
  4. Georgios Varsamopoulos, Ayan Banerjee, and Sandeep K. S. Gupta. 2009. Energy Efficiency of Thermal-Aware Job Scheduling Algorithms under Various Cooling Models. Communications in Computer and Information Science (2009), 568-580. DOI: https://doi.org/10.1007/978-3-642-03547-0_54
  5. Muhammad Tayyab Chaudhry, Teck Chaw Ling, Atif Manzoor, Syed Asad Hussain, and Jongwon Kim. 2015. Thermal-Aware Scheduling in Green Data Centers. ACM Computing Surveys 47, 3 (2015), 1-48. DOI: https://doi.org/10.1145/2678278
  6. Xibo Jin, Fa Zhang, Athanasios V. Vasilakos, and Zhiyong Liu. 2016. Green Data Centers: A Survey, Perspectives, and Future Directions. arXiv (August 2016). DOI: https://doi.org/ https://doi.org/10.48550/arXiv.1608.00687
  7. Greenpeace International Greenpeace International. 2018. How dirty is your data? (June 2018). Retrieved May 28, 2022 from https://www.greenpeace.org/international/publication/71 96/how-dirty-is-your-data/
  8. M. Chinnici, A. Capozzoli, and G. Serale. 2016. Measuring energy efficiency in data centers. Pervasive Computing (2016), 299-351. DOI: https://doi.org/10.1016/b978-0-12-803663-1.00010-3
  9. Alfonso Capozzoli, Marta Chinnici, Marco Perino, and Gianluca Serale. 2015. Review on Performance Metrics for Energy Efficiency in Data Center: The Role of Thermal Management. Energy Efficient Data Centers (2015), 135-151. DOI: https://doi.org/10.1007/978-3- 319-15786-3_9
  10. Yeo, Sungkap & Hossain, Mohammad Mosaddek & Huang, Jen-Cheng & Lee, Hsien-Hsin. (2014). ATAC: Ambient Temperature-Aware Capping for Power Efficient Datacenters. Proceedings of the 5th ACM Symposium on Cloud Computing, SOCC 2014. 10.1145/2670979.2670996
  11. Lizhe Wang, Samee U. Khan, and Jai Dayal. 2011. Thermal aware workload placement with task- temperature profiles in a data center. The Journal of Supercomputing 61, 3 (2011), 780-803. DOI: https://doi.org/10.1007/s11227-011-0635-z [13] SeyedMorteza MirhoseiniNejad, Hosein
  12. Moazamigoodarzi, Ghada Badawy, and Douglas G. Down. 2020. Joint data center cooling and workload management: A thermal-aware approach. Future Generation Computer Systems 104, (2020), 174-186. DOI: https://doi.org/10.1016/j.future.2019.10.040
  13. Luca Parolini, Bruno Sinopoli, Bruce H. Krogh, and Zhikui Wang. 2012. A Cyber-Physical Systems Approach to Data Center Modeling and Control for Energy Efficiency. Proceedings of the IEEE 100, 1 (2012), 254-268. DOI: https://doi.org/10.1109/jproc.2011.2161244
  14. Equipment Thermal Guidelines for Data Processing Environments. Retrieved May 28, 2022 from https://www.ashrae.org/File%20Library/Technical%20R esources/Bookstore/datacom1_4th/ReferenceCard_7-25- 16.pdf
  15. Ana Azevedo and M.F. Santos. KDD, SEMMA AND CRISP-DM: A PARALLEL OVERVIEW Ana. Retrieved May 28, 2022 from https://recipp.ipp.pt/bitstream/10400.22/136/3/KDD- CRISP-SEMMA.pdf
  16. Zach (2021). When to Use Mean vs. Median (With Examples). [online] Statology. Available at: https://www.statology.org/when-to-use-mean-vs- median/.[18]Patrick Schober, Christa Boer, and Lothar A. Schwarte. 2018. Correlation Coefficients. Anesthesia & Analgesia 126, 5 (2018), 1763-1768. DOI: https://doi.org/10.1213/ane.0000000000002864
  17. CRISP-DM -Data Science Process Alliance. Retrieved May 28, 2022 from https://www.datascience- pm.com/crisp-dm-2/
  18. Introduction to SEMMA -SAS Help Center. Retrieved May 28, 2022 from https://documentation.sas.com/doc/en/emref/14.3/n061b zurmej4j3n1jnj8bbjjm1a2.htm
  19. Anastasia Grishina, Marta Chinnici, Ah-Lian Kor, Davide. De Chiara, Guido Guarnieri, Eric Rondeau, and Jean Philippe Georges. 2022. Thermal awareness to enhance data center energy efficiency. Cleaner Engineering and Technology 6, (2022), 100409. DOI: https://doi.org/10.1016/j.clet.2022.100409
  20. Davide De Chiara, Marta Chinnici, and Ah-Lian Kor. 2020. Data Mining for Big Dataset-Related Thermal Analysis of High Performance Computing (HPC) Data Center. Lecture Notes in Computer Science (2020), 367- 381. DOI: https://doi.org/10.1007/978-3-030-50436- 6_27