Modern predictive models for modeling the college graduation rates

Emma A. Gunu, Carl Lee, Wilson K. Gyasi, Robert M. Roe

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Modern predictive modeling techniques are commonly used for modeling a target of interest based on a list of input variables. In general, these techniques are capable of identifying input variables associated with the target, but not for the purpose of identifying the causation relationship between target and inputs due to the fact that the data are observational data. Advanced technology has made data collection very easy and fast. As a result, when applying predictive modeling methods, the issue of data cleansing becomes critical. This article aims at comparing ten modern predictive modeling techniques for predicting college graduation rate within 6 years. The input variables include variables on 'pre-college' performance, 'first-year' college performance and various social-economic variables, as well as some variables related to university learning environment. The issue of data quality and modeling technique selection are discussed. Some pitfalls and cautions of applying predictive modeling techniques are discussed.

Original languageEnglish
Title of host publicationProceedings - 2017 15th IEEE/ACIS International Conference on Software Engineering Research, Management and Applications, SERA 2017
EditorsLiz Bacon, Jixin Ma, Lachlan MacKinnon
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages39-45
Number of pages7
ISBN (Electronic)9781509057566
DOIs
StatePublished - Jun 30 2017
Externally publishedYes
Event15th IEEE/ACIS International Conference on Software Engineering Research, Management and Applications, SERA 2017 - London, United Kingdom
Duration: Jun 7 2017Jun 9 2017

Publication series

NameProceedings - 2017 15th IEEE/ACIS International Conference on Software Engineering Research, Management and Applications, SERA 2017

Conference

Conference15th IEEE/ACIS International Conference on Software Engineering Research, Management and Applications, SERA 2017
Country/TerritoryUnited Kingdom
CityLondon
Period06/7/1706/9/17

Keywords

  • Decision Tree
  • Ensemble
  • Gradient Boosting
  • LARS
  • LASSO
  • Logistic Regression
  • Neural Network
  • Random Forest
  • Support Vector Machine

Fingerprint

Dive into the research topics of 'Modern predictive models for modeling the college graduation rates'. Together they form a unique fingerprint.

Cite this