Career prediction using XGBoost model and students' academic results
Abstract
This work shows an approach for constructing a system for career prediction by applying
the eXtreme Gradient Boosting (XGBoost) Decision Tree model on the academic results of
International University’s School of Computer Science & Engineering graduates in the past 5
years. Initially, the dataset is cleaned up and normalized to be usable for the prediction algorithm
using Microsoft Excel and Google Colaboratory (with the help of Python 3 programming
language). It is then split into 2 subsets: one for training (80 percent) and the other for testing (20
percent). After that, the algorithm uses the training subset to build the classification model.
Finally, the testing subset is loaded into the model to predict each student’s career path based on
the respective inputs and hyperparameters tuning is employed to boost the model’s accuracy. By
utilizing this solution, the problem related to predicting students’ future career paths based on
their performance throughout their years studying at the university can be adequately addressed
and handled