Dissertations, Theses, and Capstone Projects
Date of Degree
6-2022
Document Type
Capstone Project
Degree Name
M.S.
Program
Data Analysis & Visualization
Advisor
Howard T. Everson
Subject Categories
Data Science | Educational Assessment, Evaluation, and Research | Educational Methods
Keywords
machine learning, math proficiency, multinominal logistic regression, SPSS Modeler
Abstract
A principal goal of this project was to compare several machine learning (ML) algorithms to explore and validate math proficiency classifications based on standardized test scores. The data used in these analyses came from the 6th-grade students’ mathematics assessment records of the New York State Education Department’s Testing Program (NYSTP). Our approach was to test a number of competing machine learning (ML) algorithms for classifying students’ as proficient based on their test scores and other demographic information. Our samples were drawn from the 2016 test-taking cohort of 6th-grade students (N=156,800). Five classifiers including multinominal logistic regression (MLR), XGBoost, Tree-As, Lagrangian support vector machine (LSVM), and C5.0 Decision Tree algorithm were used to establish the best predictive model. Experimental results demonstrated that multinominal logistic regression had a better performance than other ML algorithms.
Recommended Citation
Avdeev, Alexander, "A Comparison of Machine Learning Techniques for Validating Students’ Proficiency in Mathematics" (2022). CUNY Academic Works.
https://academicworks.cuny.edu/gc_etds/4869
Zip file of the GitHub repository for the capstone project
Included in
Data Science Commons, Educational Assessment, Evaluation, and Research Commons, Educational Methods Commons