Curating datasets to support middle school student inquiry
DOI:
https://doi.org/10.52041/iase2023.106Abstract
We examine how developers of data science curricula determine what makes a pedagogically effective dataset enabling 10–14 year-old students (“middle school” in the United States) to engage in the data investigation cycle by posing their own questions about relationships among variables. We describe strategies for curating existing datasets to address goals for learning about data, and for optimizing the use of these datasets once they are curated. We investigate how data science educators can transform existing datasets into ones appropriate for students with little data experience, drawing on our experience working with several publicly available datasets, which students explored in CODAP (the Common Online Data Analysis Platform) (Concord Consortium, n.d.).References
Bargagliotti, A., Franklin, C., Arnold, P., Gould, R., Johnson, S., Perez, L., & Spangler, D. A. (2020). Pre-K–12 Guidelines for Assessment and Instruction in Statistics Education II (GAISE II): A Framework for Statistics and Data Science Education. American Statistical Association. https://www.amstat.org/asa/files/pdfs/GAISE/GAISEIIPreK-12_Full.pdf
Concord Consortium. (n.d.). Common Online Data Analysis Platform (CODAP) [Computer software]. https://codap.concord.org/
diSessa, A. A., & Cobb, P. (2004). Ontological innovation and the role of theory in design experiments. The Journal of the Learning Sciences, 13(1), 77–103. https://doi.org/10.1207/s15327809jls1301_4
Erickson, T. (2022). Awash in data. https://codap.xyz/awash/
Higgins, T., Mokros, J., Rubin, A., & Sagrans, J. (2023). Students’ Approaches to Exploring Relationships between Categorical Variables. Teaching Statistics. https://doi.org/10.1111/test.12331
Nilsson, P., Schindler, M., & Bakker, A. (2018). The nature and use of theories in statistics education. In D. Ben-Zvi, K. Makar, & J. Garfield (Eds.), International handbook of research in statistics 66195-7_11
Noy, N. (2020, January 23). Discovering Millions of Datasets on the Web. The Keyword. https://blog.google/products/search/discovering-millions-datasets-web/
Palumbo, A. (2020, June 5). State Health Leaders Dispute CDC’s Claim of Drop in Lyme Disease Cases. NECN. https://www.necn.com/news/national-international/state-health-leaders-dispute-cdcs- claim-of-drop-in-lyme-cases/2013230/
Rosenberg, J. M., Schultheis, E. H., Kjelvik, M. K., Reedy, A., & Sultana, O. (2022). Big Data, Big Changes? The Technologies and Sources of Data Used in Science Classrooms. British Journal of Educational Technology, 53, 1179–1201. https://doi.org/10.1111/bjet.13245
Rubin, A. (2019). Facebook or Instagram? Teens Explore Data about Technology Use. Hands On. https://www.terc.edu/facebook-or-instagram-teens-explore-data-about-technology-use/
Rubin, A. (2021). What to Consider When We Consider Data. Teaching Statistics, 43(S1). https://doi.org/10.1111/test.12275
Schanzer, E., Pfenning, N., Denny, F., Dooman, S., Gibbs Politz, J., Lerner, B. S., Fisler, K., & Krishnamurthi, S. (2022). Integrated Data Science for Secondary Schools: Design and Assessment of a Curriculum. ACM Technical Symposium on Computer Science Education. https://cs.brown.edu/~sk/Publications/Papers/Published/spddplfk-integ-ds-desn-assm- curric/paper.pdf