Teaching classification module in data science course for undergraduate non-STEM students using project based learning methodology

Authors

  • Anna Khalemsky Hadassah Academic College
  • Yelena Stukalin Academic College of Tel Aviv–Yaffo

DOI:

https://doi.org/10.52041/iase2023.104

Abstract

The current paper presents a detailed plan for teaching the classification module as one of the most important parts of the full data science course for non-STEM undergraduate students. Classification is one of the most common data analytics tasks. It is employed in myriad disciplines including marketing, finance, sociology, psychology, education, medicine, and other non-STEM areas. It is, therefore, appropriate to extend carry off with data mining methods to students who will deal with such problems during their professional careers. The overall data science course combines theory and practice and is taught in a "hands-on" format. The main assignment that is run throughout the course is a practical project, which simulates comprehensive research, starting from research questions, through data mining, to interpretation and decision-making. The contents and highlights may vary between different students' majors. We recommend addressing the teaching of the data science course as an interactive and dynamic process.

References

Cardiovascular Study Dataset. (n.d.). Retrieved June 17, 2023 from www.kaggle.com

Carver, R., Everson, M., Gabrosek, J., Horton, N., Lock, R., Mocko, M., Rossman, A., Roswell, G., Velleman, P., Witmer, J., & Wood, B. (2016). Guidelines for Assessment and Instruction in Statistics Education (GAISE) College Report 2016. Guidelines for Assessment and Instruction in Statistics Education (GAISE) College Report 2016.

Chris, B., & Duncan, H. (2013). Evaluating and Assessing for Learning. Routledge.

Company Bankruptcy Prediction. (n.d.). Retrieved June 17, 2023 from www.kaggle.com

Company’s Ideal Customers | Marketing Strategy. (n.d.). Retrieved June 17, 2023 from www.kaggle.com

Diabetes Dataset. (n.d.). Retrieved June 17, 2023 from www.kaggle.com

Donoghue, T., Voytek, B., & Ellis, S. E. (2021). Teaching Creative and Practical Data Science at Scale. Journal of Statistics and Data Science Education, 29(sup1), S27–S39.

Free Machine Learning Services—AWS. (n.d.). Retrieved May 4, 2022, from https://aws.amazon.com/free/machine-learning

Garfield, J., & Ben-Zvi, D. (2008). Developing Students’ Statistical Reasoning: Connecting Research and Teaching Practice. Springer Science & Business Media.

Gary, K. (2015). Project-Based Learning. Computer, 48(9), 98–100.

Gordon, A. D. (1999). Classification, 2nd Edition. CRC Press.

Hawkins, A., Jolliffe, F., & Glickman, L. (2014). Teaching Statistical Concepts. Routledge.

Irimia-Dieguez, A. I., Blanco-Oliver, A., & Vazquez-Cueto, M. J. (2015). A Comparison of Classification/Regression Trees and Logistic Regression in Failure Models. Procedia Economics and Finance, 23, 9–14.

Kaggle: Your Home for Data Science. (n.d.). Retrieved February 25, 2019 from https://www.kaggle.com/

Martignon, L., & Laskey, K. (2019). Statistical literacy for classification under risk: An educational perspective. AStA Wirtschafts- Und Sozialstatistisches Archiv, 13(3), 269–278.

Nolan, D., & Temple Lang, D. (2015). Explorations in Statistics Research: An Approach to Expose Undergraduates to Authentic Data Analysis. The American Statistician, 69(4), 292–299.

Nonaka, I., Toyama, R., & Konno, N. (2000). SECI, Ba and Leadership: A Unified Model of Dynamic Knowledge Creation. Long Range Planning, 33(1), 5–34.

Personality classification Data: 16 Personalities. (n.d.). Retrieved June 17, 2023 from www.kaggle.com

Saltz, J., & Heckman, R. (2016). Big Data science education: A case study of a project-focused introductory course. Themes in Science and Technology Education, 8(2), 85–94.

Student performance prediction. (n.d.). Retrieved June 17, 2023 from www.kaggle.com

UCI Machine Learning Repository: Data Sets. (n.d.). Retrieved December 10, 2016, from https://archive.ics.uci.edu/ml/datasets.html

Voters and Non-Voters. (n.d.). Retrieved June 17, 2023 from https://www.kaggle.com

Wagaman, A. (2016). Meeting Student Needs for Multivariate Data Analysis: A Case Study in Teaching an Undergraduate Multivariate Data Analysis Course. The American Statistician, 70(4), 405–412.

Downloads

Published

2024-03-29

Conference Proceedings Volume

Section

Topic 1: Fostering Learning in the Current Data Landscape