Academia.eduAcademia.edu

Outline

Optimal statistical model for forecasting ozone

2007, Journal of Computational Methods in Sciences and Engineering

https://doi.org/10.3233/JCM-2006-6S210

Abstract

The objective of this paper is to apply time series analysis and multiple regression method to ozone data in order to obtain the optimal statistical model for forecasting next day ozone level. The best estimated model is then used to produce one-step ahead point and interval estimates of future values of the ozone series. Ozone data is analyzed using time series analysis, which resulted in an Auto Regressive Moving Average, ARMA (20, 2) with Mean Absolute Percentage Error (MAPE) = 42%. Applying multiple regression method and examination of several possible contributing factors, showed that Wind speed, Mixing height where the complex chemical reactions that produce ozone take place, current and predicted next day temperatures and current ozone concentration are influential on the next day ozone concentration levels. Diagnostics tests and statistics including R-square, residual analysis and Durbin-Watson Statistic (DW) were applied in order to select the best fitted model and finally the best prediction model was found using MAPE and Mean Absolute Deviation (MAD) as predictive criteria. Regression analysis of this data, using tomorrow's and today's maximum temperature, today's wind speed and tomorrow's maximum height at 10 am, as explanatory variables results in R-square of 50.7% with MAD = 12.423, MAPE = 30% and DW = 1.66.

References (10)

  1. H. Akaike, Information Theory and an Extension of the Maximum Likelihood Principal, in: 2nd International Symposium on Information Theory, B.N. Petrov and F. Csaki, eds, Akademiai kiado, Budapest, 1973, pp. 267-281.
  2. P.J. Brockwell and R.A. Davis, Introduction to Time Series and Forecasting, Springer-Verlag, New York, 2001.
  3. J. Durbin and G.S. Watson, Testing for Serial Correlation in Least Squares Regression II, Biometrika 38 (1951), 159-178.
  4. E.P.A., Discussion Paper on Particles, Publication 297, December 1991.
  5. E.P.A., Air Monitoring Data 1991, Publication 402, November 1993.
  6. E.P.A., Air Monitoring Data 1996, Publication 614, May 1998.
  7. R.J. Hyndman, Computing and Graphing Highest Density Regions, Research Report Series, Report No. 6, Department of Statistics, the University of Melbourne, 1993.
  8. D.G. Kleinbaum, L.L. Kupper and K.E. Muller, Applied Regression Analysis and Other Multivariate Methods, Second Edition, PWS-KENT 1998.
  9. National Tuberculosis and respiratory disease Association, Air Pollution primer, New York, 1971.
  10. J.A. Streeton, Air Pollution Health Effects and Air Quality Objectives in Victoria, report to E.P.A. Victoria, Australia, 1990.