Incorporating Open Data Into Introductory Courses in Statistics
2019, Journal of Statistics Education
https://doi.org/10.1080/10691898.2019.1669506Abstract
The 2016 Guidelines for Assessment and Instruction in Statistics Education (GAISE) College Report emphasized six recommendations to teach introductory courses in statistics. Among them: use of real data with context and purpose. Many educators have created databases consisting of multiple data sets for use in class; sometimes making hundreds of data sets available. Yet 'the context and purpose' component of the data may remain elusive if just a generic database is made available. We describe the use of open data in introductory courses. Countries and cities continue to share data through open data portals. Hence, educators can find regional data that engages their students more effectively. We present excerpts from case studies that show the application of statistical methods to data on: crime, housing, rainfall, tourist travel, and others. Data wrangling and discussion of results are recognized as important case study components. Thus the open data based case studies attend most GAISE College Report recommendations. Reproducible R code is made available for each case study. Example uses of open data in more advanced courses in statistics are also described.
References (21)
- Baloğlu, M., Deniz, M. E., and Kesici, Ş. (2011), "A descriptive study of individual and cross-cultural differences in statistics anxiety," Learning and Individual Differences, 21, 387-391.
- Baumer, B. (2015), "A Data Science Course for Undergraduates: Thinking With Data," The American Statistician, 69, 334-342.
- Baumer, B. S., Kaplan, D. T., and Horton, N. J. (2017), Modern Data Science with R, Chapman and Hall/CRC Press: Boca Raton.
- Chew, P. K. and Dillon, D. B. (2014), "Statistics Anxiety Update: Refining the Construct and Recommendations for a New Research Agenda," Perspectives on Psychological Science, 9, 196-208.
- -(2015), "Statistics anxiety and attitudes toward statistics," in D. Chhabra (Ed.), Proceedings of the 4th Annual International Conference on Cognitive and Behavioral Psychology (CBP 2015). Singapore.
- Cruise, R. J., C. R. W.. B. D. L. (1985), "Development and validation of an instrument to measure statistical anxiety," Paper presented at the annual meeting of the Statistical Education Section, Chicago, IL., 92.
- GAISE (2016), "Guidelines for Assessment and Instruction in Statistics Education College Report," Tech. rep., ASA Revision Committee.
- Grimshaw, S. (2015), "A Framework for Infusing Authentic Data Experiences Within Statistics Courses," The American Statistician, 69.
- Horton, N. J. and Hardin, J. S. (2015), "Teaching the Next Generation of Statistics Students to "Think With Data": Special Issue on Statistics and the Undergraduate Curriculum," The American Statistician, 69, 259-265.
- Hullman, J., Resnick, P., and Adar, E. (2015), "Hypothetical Outcome Plots Outperform Error Bars and Violin Plots for Inferences About Reliability of Variable Ordering," PLOS ONE, 10.
- Kalesan, B. and Galea, S. (2017), "Patterns of gun deaths across US counties 1999-2013," Annals of epidemiology, 27, 302-307.
- Keeley, J., Zayac, R., and Correia, C. (2008), "Curvilinear relationships between statistics anxiety and performance among undergraduate students: Evidence for optimal anxiety," Statistics Education Research Journal, 7, 4-15.
- Manyika, J., Chui, M., Groves, P., Farrell, D., Van Kuiken, S., and Doshi, E. A. (2013), "Open data: Unlocking innovation and performance with liquid information," McKinsey Global Institute, 21.
- McKee, T. B., Doesken, N. J., and Kleist, J. (1993), "The relationship of drought frequency and duration to time scales," in Proceedings of the 8th Conference on Applied Climatology, American Meteorological Society Boston, MA, vol. 17, pp. 179-183.
- Neumann, D. L., M., H., and Neumann, M. M. (2013), "Using Real-Life Data when Teaching Statistics: Student Perceptions of this Strategy in an Introductory Statistics Course," Statistics Education Research Journal, 12, 59-70.
- Nolan, P. and Perrett, J. (2016), "Teaching and Learning Data Visualization: Ideas and Assignments," The American Statistician, 70, 260-269.
- O'Neill, J. (2018), "Swansong of Hans Rosling, data visionary." Nature, 556,
- Onwuegbuzie, A. J. and Wilson, V. A. (2003), "Statistics anxiety: Nature, etiology, antecedents, effects, and treatments-A comprehensive review of the literature," Teaching in Higher Education, 8, 195-209.
- R Core Team (2016), R: A Language and Environment for Statistical Computing, R Foundation for Statistical Computing, Vienna, Austria.
- Ridgway, J. (2016), "Implications of the Data Revolution for Statistics Education," International Statistics Review, 84, 528-549.
- Rivera, R. (2016), "A dynamic linear model to forecast hotel registrations in Puerto Rico using Google Trends data," Tourism Management, 57, 12-20.