INTRODUCING HIGH SCHOOL STATISTICS TEACHERS TO PREDICTIVE MODELLING AND APIs USING CODE-DRIVEN TOOLS
DOI:
https://doi.org/10.52041/serj.v21i2.49Keywords:
Statisitcs education research, Data science education, Predictive modeling, Integrating statistical and computational thinking, Task design, High school teachers, APIsAbstract
Tasks for teaching predictive modelling and APIs often require learners to use code-driven tools. Minimal research, however, exists about the design of tasks that support the introduction of high school students and teachers to these new statistical and computational methods. Using a design-based research approach, a web-based task was developed. The task was constructed using our design framework and implemented within a face-to-face professional development workshop involving six high school statistics teachers. The teachers were guided through the process of developing a prediction model using: an informal approach; visual prediction intervals; data about movie ratings from an API; and R code that ran in the browser. Our findings from this exploratory study indicate that the web-based task supported the development of new statistical and computational ideas related to predictive modelling and APIs.
References
Allaire, J., Xie, Y., McPherson, J., Luraschi, J., Ushey, K., Atkins, A., Wickham, H., Cheng, J., Chang, W., & Iannone, R. (2021). rmarkdown: Dynamic documents for R. RStudio. https://rmarkdown.rstudio.com
Anderson, T., & Shattuck, J. (2012). Design-based research: A decade of progress in education research?. Educational Researcher, 41(1), 16–25.
https://doi.org/10.3102%2F0013189X11428813
Bakker, A. (2018). Design research in education: A practical guide for early career researchers. Routledge. https://doi.org/10.4324/9780203701010
Bakker, A., & van Eerde, D. (2015). An introduction to design-based research with an example from statistics education. In A. Bikner-Ahsbahs, C. Knipping, & N. Presmeg (Eds.), Approaches to qualitative research in mathematics education (pp. 429–466). Springer. https://doi.org/10.1007/978-94-017-9181-6_16
Bargagliotti, A., Franklin, C., Arnold, P., Gould, R., Johnson, S., Perez, L., & Spangler, D. (2020). Pre-K–12 Guidelines for Assessment and Instruction in Statistics Education (GAISE) report II. American Statistical Association.
Ben-Zvi, D. (2000). Toward understanding the role of technological tools in statistical learning. Mathematical Thinking and Learning, 2(1-2), 127–155. https://doi.org/10.1207/S15327833MTL0202_6
Biehler, R. (2018). Design principles, realizations and uses of software supporting the learning and the doing of statistics: A reflection on developments since the late 1990s. In M. A. Sorto, A. White, & L. Guyot (Eds.), Looking back, looking forward. Proceedings of the Tenth International Conference on Teaching Statistics (ICOTS10), Kyoto, Japan, July 8–13. International Statistical Institute. https://iase-web.org/icots/10/proceedings/pdfs/ICOTS10_1B1.pdf
Biehler, R., & Schulte, C. (2017). Perspectives for an interdisciplinary data science curriculum at German secondary schools. In R. Biehler, L. Budde, D. Frischemeier, B. Heinemann, S. Podworny, C. Schulte, & T. Wassong (Eds.), Paderborn Symposium on Data Science Education at School Level 2017: The Collected Extended Abstracts (pp. 2–14). Universitätsbibliothek Paderborn.
Burr, W., Chevalier, F., Collins, C., Gibbs, A. L., Ng, R., & Wild, C. J. (2021). Computational skills by stealth in introductory data science teaching. Teaching Statistics, 43, S34–S51. https://doi.org/10.1111/test.12277
Casey, S. A., & Wasserman, N. H. (2015). Teachers’ knowledge about informal line of best fit. Statistics Education Research Journal, 14(1), 8–35. https://doi.org/10.52041/serj.v14i1.267
Cetinkaya-Rundel, M., & Rundel, C. (2018). Infrastructure and tools for teaching computing throughout the statistical curriculum. The American Statistician, 72(1), 58–65. https://doi.org/10.1080/00031305.2017.1397549
De Veaux, R. D., Agarwal, M., Averett, M., Baumer, B. S., Bray, A., Bressoud, T. C., Bryant, L., Cheng, L. Z., Francis, A., Gould, R., Kim, A. Y., Kretchmar, M., Lu, Q., Moskol, A., Nolan, D., Pelayo, R., Raleigh, S., Sethi, R. J., Sondjaja, M., … Ye, P. (2017). Curriculum guidelines for undergraduate programs in data science. Annual Review of Statistics and Its Application, 4, 15–30. https://doi.org/10.1146/annurev-statistics-060116-053930
Edelson, D. C. (2002). Design research: What we learn when we engage in design. The Journal of the Learning sciences, 11(1), 105–121. https://doi.org/10.1207/S15327809JLS1101_4
Engel, J. (2017). Statistical Literacy for Active Citizenship: A Call for Data Science Education. Statistics Education Research Journal, 16 (1), 44–49. https://doi.org/10.52041/serj.v16i1.213
Erickson, T. (2020). The BART Data Portal. An Introduction to Data Science with CODAP. http://codap.xyz/awash/bart-chapter.html
Fergusson, A., & Pfannkuch, M. (2020). Development of an informal test for the fit of a probability distribution model for teaching. Journal of Statistics Education, 28(3), 344–357. https://doi.org/10.1080/10691898.2020.1837039
Fergusson, A., & Pfannkuch, M. (2021). Introducing teachers who use GUI-driven tools for the randomization test to code-driven tools. Mathematical Thinking and Learning. https://doi.org/10.1080/10986065.2021.1922856
Fergusson, A., & Wild, C. J. (2021). On traversing the data landscape: Introducing APIs to data-science students. Teaching Statistics, 43, S71–S83.
https://doi.org/10.1111/test.12266
Finzer, W. (2013). The data science education dilemma. Technology Innovations in Statistics Education, 7(2). https://doi.org/10.5070/T572013891
Gould, R. (2010). Statistics and the modern student. International Statistical Review, 78(2), 297–315. https://doi.org/10.1111/j.1751-5823.2010.00117.x
Gould, R. (2017). Data literacy is statistical literacy. Statistics Education Research Journal, 16(1), 22–25. https://doi.org/10.52041/serj.v16i1.209
Gould, R. (2021). Toward data-scientific thinking. Teaching Statistics, 43, S11–S22. https://doi.org/10.1111/test.12267
Hardin, J. (2018). Dynamic data in the statistics classroom. Technology Innovations in Statistics Education, 11(1). https://doi.org/10.5070/T5111031079
Kaplan, D. (2007). Computing and introductory statistics. Technology Innovations in Statistics Education, 1(1). https://doi.org/10.5070/T511000030
Konold, C., & Miller, C. (2015). TinkerPlots™ Version 2.3 [Computer Software]. Learn Troop. http://www.tinkerplots.com/
Magana, A. J., Vasileska, D., & Ahmed, S. (2011). Work in progress—a transparency and scaffolding framework for computational simulation tools. 2011 Frontiers in Education Conference (FIE), (pp. S4G–1). IEEE.
https://doi.org/10.1109/FIE.2011.6142803
Makar, K., & Rubin, A. (2018). Learning about statistical inference. In D. Ben-Zvi, K. Makar, & J. Garfield (Eds.), International handbook of research in statistics education (pp. 261–294). Springer. https://doi.org/10.1007/978-3-319-66195-7_8
McKenney, S., & Reeves, T. C. (2018). Conducting educational design research. Routledge. https://doi.org/10.4324/9781315105642
National Academies of Sciences, Engineering, and Medicine. (2018). Data science for undergraduates: Opportunities and options. The National Academies of Sciences Engineering Medicine. https://doi.org/10.17226/25104
Nolan, D., & Temple Lang, D. (2010). Computing in the statistics curricula. The American Statistician, 64(2), 97–107. https://doi.org/10.1198/tast.2010.09132
New Zealand Qualifications Authority. (2019). Annotated exemplar Level 3 AS91581. Author. https://www.nzqa.govt.nz/ncea/subjects/mathematics/exemplars/level-3-as91581/
Pfannkuch, M. (2011). The role of context in developing informal statistical inferential reasoning: A classroom study. Mathematical Thinking and Learning, 13(1–2), 27–46. https://doi.org/10.1080/10986065.2011.538302
Pruim, R., Kaplan, D. T., & Horton, N. J. (2017). The mosaic package: Helping students to ‘think with data’ using R. The R Journal, 9(1), 77–102.
R Core Team. (2020). R: A language and environment for statistical computing. https://www.R-project.org/
Reeves, T. C. (2007). Design-based research from a technology perspective. In J. Van den Akker, K. Gravemeijer, S. McKenney & N. Nieveen (Eds.), Educational design research, (pp. 52–56). Routledge.
Ridgway, J. (2016). Implications of the data revolution for statistics education. International Statistical Review, 84(3), 528–549. https://doi.org/10.1111/insr.12110
Sentance, S., Waite, J., & Kallia, M. (2019). Teaching computer programming with PRIMM: a sociocultural perspective. Computer Science Education, 29(2-3), 136–176. https://doi.org/10.1080/08993408.2019.1608781
Shaughnessy, J. M. (1997). Missed opportunities in research on the teaching and learning of data and chance. In F. Biddulph & K. Carr (Eds.), People in mathematics education. Proceedings of the Twentieth Annual Conference of the Mathematics Research Group of Australasia (MERGA-20, July, 1990), Rotorua, New Zealand (Vol. 1, pp. 6–22). MERGA.
Schloerke, B., Allaire, J., & Borges, B. (2018). Learnr: Interactive tutorials for R. CRAN. https://CRAN.R-project.org/package=learnr
Son, J. Y., Blake, A. B., Fries, L., & Stigler, J. W. (2021). Modeling first: Applying learning science to the teaching of introductory statistics. Journal of Statistics and Data Science Education, 29(1), 4–21. https://doi.org/10.1080/10691898.2020.1844106
Sweller, J., van Merriënboer, J. J. G., & Paas, F. G. W. (1998). Cognitive architecture and instructional design. Educational Psychology Review, 10(3), 251–296. https://doi.org/10.1023/A:1022193728205
Van den Akker, J. (1999). Principles and methods of development research. In J. Van den Akker, R. M. Branch, K. Gustafson, N. Nieveen & T. Plomp (Eds.), Design approaches and tools in education and training, (pp. 1–14). Springer. https://doi.org/10.1007/978-94-011-4255-7_1
Van Someren, M. W., Barnard, Y. F., & Sandberg, J. A. C. (1994). The think aloud method: A practical approach to modelling cognitive processes. AcademicPress.
Weiland, T. (2017). The importance of context in task selection. Teaching Statistics, 39(1), 20–25. https://doi.org/10.1111/test.12116
Wickham, H. (2016). ggplot2: Elegant graphics for data analysis. Springer-Verlag.
Wickham, H. (2017). Tidyverse: Easily install and load the “tidyverse”. CRAN. https://CRAN.R-project.org/package=tidyverse
Wiedemann, K., Chao, J., Galluzzo, B., & Simoneau, E. (2020). Mathematical modeling with R: Embedding computational thinking into high school math classes. ACM Inroads, 11(1), 33–42. https://doi.org/10.1145/3380956
Wild, C. J., & Pfannkuch, M. (1999). Statistical thinking in empirical enquiry. International Statistical Review, 67(3), 223–248. https://doi.org/10.1111/j.1751-5823.1999.tb00442.x
Wild, C. J., Pfannkuch, M., Regan, M., & Parsonage, R. (2017). Accessible conceptions of statistical inference: Pulling ourselves up by the bootstraps. International Statistical Review, 85(1), 84–107. https://doi.org/10.1111/insr.12117
Wouters, P., Paas, F., & van Merriënboer, J. J. (2008). How to optimize learning from animated models: A review of guidelines based on cognitive load. Review of Educational Research, 78(3), 645–675.
https://doi.org/10.3102%2F0034654308320320
Zieffler, A., Justice, N., delMas, R., & Huberty, M. D. (2021). The use of algorithmic models to develop secondary teachers’ understanding of the statistical modeling process. Journal of Statistics and Data Science Education, 29(1), 131–147. https://doi.org/10.1080/26939169.2021.1900759