Teaching classification module in data science course for undergraduate non-STEM students using project based learning methodology
DOI:
https://doi.org/10.52041/iase2023.104Abstract
The current paper presents a detailed plan for teaching the classification module as one of the most important parts of the full data science course for non-STEM undergraduate students. Classification is one of the most common data analytics tasks. It is employed in myriad disciplines including marketing, finance, sociology, psychology, education, medicine, and other non-STEM areas. It is, therefore, appropriate to extend carry off with data mining methods to students who will deal with such problems during their professional careers. The overall data science course combines theory and practice and is taught in a "hands-on" format. The main assignment that is run throughout the course is a practical project, which simulates comprehensive research, starting from research questions, through data mining, to interpretation and decision-making. The contents and highlights may vary between different students' majors. We recommend addressing the teaching of the data science course as an interactive and dynamic process.References
Cardiovascular Study Dataset. (n.d.). Retrieved June 17, 2023 from www.kaggle.com
Carver, R., Everson, M., Gabrosek, J., Horton, N., Lock, R., Mocko, M., Rossman, A., Roswell, G., Velleman, P., Witmer, J., & Wood, B. (2016). Guidelines for Assessment and Instruction in Statistics Education (GAISE) College Report 2016. Guidelines for Assessment and Instruction in Statistics Education (GAISE) College Report 2016.
Chris, B., & Duncan, H. (2013). Evaluating and Assessing for Learning. Routledge.
Company Bankruptcy Prediction. (n.d.). Retrieved June 17, 2023 from www.kaggle.com
Company’s Ideal Customers | Marketing Strategy. (n.d.). Retrieved June 17, 2023 from www.kaggle.com
Diabetes Dataset. (n.d.). Retrieved June 17, 2023 from www.kaggle.com
Donoghue, T., Voytek, B., & Ellis, S. E. (2021). Teaching Creative and Practical Data Science at Scale. Journal of Statistics and Data Science Education, 29(sup1), S27–S39.
Free Machine Learning Services—AWS. (n.d.). Retrieved May 4, 2022, from https://aws.amazon.com/free/machine-learning
Garfield, J., & Ben-Zvi, D. (2008). Developing Students’ Statistical Reasoning: Connecting Research and Teaching Practice. Springer Science & Business Media.
Gary, K. (2015). Project-Based Learning. Computer, 48(9), 98–100.
Gordon, A. D. (1999). Classification, 2nd Edition. CRC Press.
Hawkins, A., Jolliffe, F., & Glickman, L. (2014). Teaching Statistical Concepts. Routledge.
Irimia-Dieguez, A. I., Blanco-Oliver, A., & Vazquez-Cueto, M. J. (2015). A Comparison of Classification/Regression Trees and Logistic Regression in Failure Models. Procedia Economics and Finance, 23, 9–14.
Kaggle: Your Home for Data Science. (n.d.). Retrieved February 25, 2019 from https://www.kaggle.com/
Martignon, L., & Laskey, K. (2019). Statistical literacy for classification under risk: An educational perspective. AStA Wirtschafts- Und Sozialstatistisches Archiv, 13(3), 269–278.
Nolan, D., & Temple Lang, D. (2015). Explorations in Statistics Research: An Approach to Expose Undergraduates to Authentic Data Analysis. The American Statistician, 69(4), 292–299.
Nonaka, I., Toyama, R., & Konno, N. (2000). SECI, Ba and Leadership: A Unified Model of Dynamic Knowledge Creation. Long Range Planning, 33(1), 5–34.
Personality classification Data: 16 Personalities. (n.d.). Retrieved June 17, 2023 from www.kaggle.com
Saltz, J., & Heckman, R. (2016). Big Data science education: A case study of a project-focused introductory course. Themes in Science and Technology Education, 8(2), 85–94.
Student performance prediction. (n.d.). Retrieved June 17, 2023 from www.kaggle.com
UCI Machine Learning Repository: Data Sets. (n.d.). Retrieved December 10, 2016, from https://archive.ics.uci.edu/ml/datasets.html
Voters and Non-Voters. (n.d.). Retrieved June 17, 2023 from https://www.kaggle.com
Wagaman, A. (2016). Meeting Student Needs for Multivariate Data Analysis: A Case Study in Teaching an Undergraduate Multivariate Data Analysis Course. The American Statistician, 70(4), 405–412.