Academia.eduAcademia.edu

Outline

Novel Approach for Spatiotemporal Weather Data Analysis

International Journal of Advanced Computer Science and Applications

https://doi.org/10.14569/IJACSA.2022.0130743

Abstract

Massive volumes of multidimensional array-based spatiotemporal data are generated by climate observations and model simulations. The growth in climate data leads to new opportunities for climate studies at multiple spatial and temporal scales. Managing, analyzing and processing of big climate data is considered to be challenging because of huge data volume. In this work multidimensional climate data such as precipitation and temperature are processed and analyzed in the Spark MapReduce Platform, since Spark platform is computationally more efficient than Hadoop-MapReduce Platform of same configuration. In temporal scale monthly and seasonal analysis of climate data has been carried out to understand the regional climate. The prediction of Rainfall on monthly and seasonal time scales is very much important for planning and devising agricultural strategies and disaster management, etc. As the prediction of climate state is very challenging, in this study an attempt is being made for the prediction of the rainfall using the time series analysis in the same framework. As a test case the time series approach such as Auto Regression Integrated Moving Average (ARIMA) has been used for the prediction of rainfall. The proposed approach is evaluated and found to be accurate in the analysis and prediction of climate data and this will surely guide for the researcher for better understanding of the climate and its application to multiple sectors.

References (24)

  1. Z. Li, F. Hu, J. L. Schnase, D. Q. Duffy, T. Lee, M. K. Bowen, and C. Yang. A spatiotemporal indexing approach for efficient processing of big array-based climate data with MapReduce, International Journal of Geographical Information Science, pages 17-35, 2017.
  2. Chaowei Yang, Manzhu Yu, Fei Hu,Yongyao Jiang,Yun Li, Utilizing cloud computing to address big geospatial data challenges, Journal of Computers, Environment and Urban Systems, 2016, http://dx.doi.org/10.1016/j.compenvurbsys.2016.10.010.
  3. James H. Faghmous and Vipin Kumar, A big data guide to understanding climate change: The case for theory-guided data science, Journal of Big Data, Vol. 2, No. 3, Sep 2014, Pages 155-163, https://doi.org/10.1089/big.2014.0026.
  4. Markus Götz, Christian Bodenstein, Matthias Richerzhagen, Gabriele Cavallaro. On Scalable Data Mining Techniques for Earth Science, Procedia Computer Science, December 2015, Volume 51, Pages 2188- 2197.
  5. Jinsong Wu, Song Guo, Jie Li,Deze zeng, Big data meet green challenges: Greening big data. IEEE Systems Journal, Volume: 10, Issue: 3, 19 May 2016, Pages 873 -887.
  6. Ralf Hartmut Guting, M. H. Bohlen, Martin Erwig, Christian S. Jensen, Nikos A. Lorentzos, Markus Schneider, and Michalis Vazirgiannis, A foundation for representing and querying moving objects, ACM Transactions on Database Systems (TODS), Vol. 25, No. 1, March 2000, Pages 1-42.
  7. Sebestyén Viktor, Czvetkó Tímea, Abonyi János, The Applicability of Big Data in Climate Change Research: The Importance of System of Systems Thinking, Frontiers in Environmental Science, Volume 9, March 2021,DOI:10.3389/fenvs.2021.619092.
  8. Yang C., Clarke K., Shekhar S., Tao C.V, Big Spatiotemporal Data Analytics: a research and innovation frontier, International Journal of Geographical Information Science, April 2020, https://doi.org/10.1080/13658816.2019.1698743.
  9. Fei Hu, Chaowei Yang, Daniel Q. Duffy, Michael Bowen, Weiwei Song, Tsengdar Lee, Mengchao Xu and John L. Schnase, ClimateSpark: An in-memory distributed computing framework for big climate data analytics, Journal of Computers and Geosciences, March 2018, Pages 154-166, https://doi.org/10.1016/j.cageo.2018.03.011.
  10. Christopher Bartz, Konstantinos Chasapis, Michael Kuhn, Petra Nerge & Thomas Ludwig, A Best Practice Analysis of HDF 5 and NetCDF-4 Using Lustre, International Conference on High Performance Computing, ISC High Performance 2015: High Performance Computing, volume 9137, Pages 274-281.
  11. Gowtham Atluri,Anuj Karpatne, Vipin kumar, Spatio-Temporal Data Mining: A Survey of Problems and Methods, ACM Computing Surveys, Volume 51, Issue 4, Article No.: 83, July 2019 Pages 1- 41,https://doi.org/10.1145/3161602.
  12. R. Rew, G. Davis, NetCDF: an interface for scientific data access, Volume: 10, Issue: 4, July 1990, pages: 76 -82, DOI: 10.1109/38.56302.
  13. Salman Salloum, Ruslan Dautov, Xiaojun Chen, Patrick Xiaogang Peng, Joshua Zhexue Huang, Big data analytics on Apache Spark, International Journal of Data Science and Analytics, Springer International Publishing Switzerland 2016, Pages 145-164.
  14. Abdul Salam, Internet of Things for Sustainable Human Health, Book chapter in Internet of Things for Sustainable Community Development. Internet of Things, Springer, January 2020, Pages 217-242, https://doi.org/10.1007/978-3-030-35291-2_7.
  15. Pankaj Mudholkar and Megha Mudholkar, Internet of Things (IoT) and Big Data: A Review, International Journal of Management, Technology and Engineering, Volume 8, Issue XII, December 2018, ISSN NO: 2249-7455, Pages 5001-5007.
  16. Mark McKenney, Niharika Nyalakonda, Jarrod McEvers, Mitchell Shipton, Pyspatiotemporalgeom: A Python Library for Spatiotemporal Types and Operations, Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, October 2016, Article No.: 93, Pages 1-4.
  17. R. Rew, G. Davis, NetCDF: an interface for scientific data access, IEEE Journal of Computer Graphics and Applications, Volume: 10, Issue: 4, July 1990, Pages 76 -82.
  18. Dimitar Misev, Peter Baumann, Jürgen Seib, Towards Large-Scale Meteorological Data Services: A Case Study, Journal of Datenbank Spektrum-Springer, Volume 21,Issue 1,Pages183-192, 22nd September 2012.
  19. A. Joshi, E. Pebesma, R. Henriques, M. Appel, SCIDB Based Framework For Storage And Analysis Of Remote Sensing Big Data, The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Volume XLII-5/W3, Capacity Building and Education Outreach in Advance Geospatial Technologies and Land Management,pp.10-11 December 2019, Dhulikhel, Nepal.
  20. Louai Alarabi, Mohamed F. Mokbel, A Demonstration of STHadoop: A MapReduce Framework for Big Spatiotemporal Data, proceedings of the VLDB Endowment, Vol. 10, No. 12, August 2017.
  21. Ahmed Eldawy, Mohamed F. Mokbel, Demonstration of SpatialHadoop: An Efficient MapReduce Framework for Spatial Data, proceedings of the VLDB Endowment, Volume 6, Issue 12, August 2013, Pages 1230- 1233, https://doi.org/10.14778/2536274.2536283.
  22. Joe B. Buck, Noah Watkins, Jeff LeFevre, Kleoni Ioannidou, SciHadoop: Array-based query processing in Hadoop, Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, November 2011, Article No.: 66, Pages 1-11 https://doi.org/10.1145/2063384.2063473.
  23. Timothy Olatayo, A. I. Taiwo, Statistical Modelling and Prediction of Rainfall Time Series Data, Global Journal of Computer Science and Technology: Interdisciplinary, Volume 14, Issue 1 Version1.0, 2014, Online ISSN: 0975-4172 & Print ISSN: 0975-4350.
  24. Nikita Shivhare, Atul Kumar Rahul, Shyam Bihari Dwivedi and Prabhat Kumar Singh Dikshit, ARIMA based daily weather forecasting tool: A case study for Varanasi, Journal Mausam 70(1), January 2019, Pages 133-140.