Classification Model to Predict the University Major Using Decision Tree Algorithm

Main Article Content

Yahyia Benyahmed
Almahdi Alshareef
Salah Amar
Mohammed Zadana

Abstract

Data mining education has emerged as a powerful method for predicting students' academic success and uncovering hidden links in educational data. The goal of this paper is to create an intelligent prediction system to assist in guiding and counseling students who were preparing to start college. The main issue addresses in the study is that many students end up specializing in areas they do not desire, often influenced by the advice of friends or family. The performances of Decision Tree (ID3), Naïve Bayes (NB), and k-Nearest Neighbor (kNN) algorithms, among the data mining algorithms, are calculated and compared to predict the final appropriate university major for students. This involves analyzing the data and evaluating the training performance of classifiers in terms of accuracy, precision, sensitivity (recall), and F-measure. The study utilizes actual, private data to construct the system. After undergoing various processing procedures, the student data from Sebha Universities are organized into different categories and prepared for the algorithms to be trained on, with an accuracy rate of 64.28%, the Decision Tree method proves to be the most effective among the three classification algorithms that are tested.             

Article Details

How to Cite
Benyahmed, Y., Almahdi Alshareef, Salah Amar, & Mohammed Zadana. (2025). Classification Model to Predict the University Major Using Decision Tree Algorithm. Sebha University Conference Proceedings, 4(3), 1–8. https://doi.org/10.51984/sucp.v4i3.4299
Section
Confrence Proceeding

References

Aggarwal CC. Data mining: the textbook. Springer; 2015.

Yağcı M. Educational data mining: prediction of students' academic performance using machine learning algorithms. Smart Learning Environments 2022; 9:11.

Batool S, Rashid J, Nisar MW et al. Educational data mining to predict students' academic performance: A survey study. Education and Information Technologies 2023; 28:905-971.

Kukkar A, Sharma A, Fan J, Zhang M. Data mining applications in university information management system development. 2022.

Jacob D, Henriques R. Educational data mining to predict bachelors students’ success. Emerging Science Journal 2023; 7:159-171.

Shoaib M, Sayed N, Singh J et al. AI student success predictor: Enhancing personalized learning in campus management systems. Computers in Human Behavior 2024; 158:108301.

Angeioplastis A, Aliprantis J, Konstantakis M, Tsimpiris A. Predicting student performance and enhancing learning outcomes: a data-driven approach using educational data mining techniques. Computers 2025; 14:83.

Alyahyan E, Düştegör D. Predicting academic success in higher education: literature review and best practices. International Journal of Educational Technology in Higher Education 2020; 17:1-21.

Ciolacu M, Tehrani AF, Binder L, Svasta PM. Education 4.0-Artificial Intelligence assisted higher education: early recognition system with machine learning to support students' success. In: 2018 IEEE 24th International Symposium for Design and Technology in Electronic Packaging(SIITME). IEEE; 2018. pp. 23-30.

Xie Y. Efficiency of Hybrid Decision Tree Algorithms in Evaluating the Academic Performance of Students. International Journal of Advanced Computer Science & Applications 2024; 15.

Matzavela V, Alepis E. Decision tree learning through a predictive model for student academic performance in intelligent m-learning environments. Computers and Education: Artificial Intelligence 2021; 2:100035.

Frazier A, Silva J, Meilak R et al. Decision Tree-Based Predictive Models for Academic Achievement Using College Students' Support Networks. arXiv preprint arXiv:2108.13947 2021.

Izza Y, Ignatiev A, Marques-Silva J. On explaining decision trees. arXiv preprint arXiv:2010.11034 2020.

Ibrahim IH, Garba EJ, Adejumo A. Predictive Model for Identification and Analysis of Factors Impacting Students Academic Performance Using Machine Learning Algorithms. Kasu Journal of Computer Science 2024; 1.

Bridet L, Leighton MA. The major decision: Labor market implications of the timing of specialization in college. 2015.

Singh LB, Chaturvedi RK, Mehdi SA, Srivastava S. Student's Preference towards Specialization Selection: An Exploratory Perspective. Adhyayan: A Journal of Management Sciences 2020; 10:10-16.

Obias DC, Matitu CAB, Almodovar BJ et al. Exploring the Influence of Out-of-Specialized-Field Teaching on Educators Development and Motivation. International Journal of Multidisciplinary: Applied Business and Education Research 2025; 6:787-804.

Villar A, de Andrade CRV. Supervised machine learning algorithms for predicting student dropout and academic success: a comparative study. Discover Artificial Intelligence 2024; 4:2.

Bansal PK, Ahmed M. Fuzzy logic-based analysis of student behavior patterns in educational environments. In: AIP Conference Proceedings. AIP Publishing LLC; 2025. p. 020047.

Alsayed AO, Rahim MSM, AlBidewi I et al. Selection of the right undergraduate major by students using supervised learning techniques. Applied Sciences 2021; 11:10639.

Roman M, Ullah A, Ullah MA et al. PREDICTING ACADEMIC SUCCESS: A MACHINE LEARNING APPROACH USING DECISION TABLES AND RANDOM FORESTS ALGORITHMS. Spectrum of Engineering Sciences 2025; 3:205-213.

Hernandez R, Atienza R. Career track prediction using deep learning model based on discrete series of quantitative classification. Applied Computer Science 2021; 17.

Iwadi I, Ali D, Jabari M, Sukic E. The Relation of Artificial Intelligence Technology Application with Administrative Performance: A Case Study of Staff in Directorates of Education in the Hebron Governorate in Palestine. TEM Journal 2024; 13.

Kuczera K, Dziembek D. Application of Rough Set Theory to Improve the Efficiency of Higher Education Systems. In: European Conference on Artificial Intelligence. Springer; 2024. pp. 237-249.

Lahoud C, Moussa S, Obeid C et al. A comparative analysis of different recommender systems for university major and career domain guidance. Education and Information Technologies 2023; 28:8733-8759.

Zayed Y, Salman Y, Hasasneh A. A Recommendation System for Selecting the Appropriate Undergraduate Program at Higher Education Institutions Using Graduate Student Data. Applied Sciences 2022; 12:12525.

Al-Shalabi L. A Data Mining Model for Students’ Choice of College Major Based on Rough Set Theory. J. Comput. Sci 2019; 15:1150-1160.

Beseiso M. Enhancing student success prediction: A comparative analysis of machine learning technique. TechTrends 2025; 69:372-384.

Salloum SA, Salloum A, Shaalan K et al. Comparative Analysis of Classical Machine Learning Techniques for Predicting Students’ Exam Performance. In: International Conference on Breaking Barriers with Generative Intelligence. Springer; 2024. pp. 219-227.

Zhang Y, Yun Y, An R et al. Educational data mining techniques for student performance prediction: method review and comparison analysis. Frontiers in psychology 2021; 12:698490.

Richards Z, Kelly AM. STEM enrollment decision trees as graduation predictors for community college students enrolled in remedial mathematics. Community College Review 2025; 53:85-104.

Huynh-Cam T-T, Chen L-S, Le H. Using decision trees and random forest algorithms to predict and determine factors contributing to first-year university students’ learning performance. Algorithms 2021; 14:318.

Latif G, Abdelhamid SE, Fawagreh KS et al. machine learning in higher education: students’ performance assessment considering online activity logs. IEEE access 2023; 11:69586-69600.

Saeedi S, Božanić D, Safa R. Strategic analytics for predicting students’ academic performance using cluster analysis and Bayesian networks. Educ. Sci. Manag 2024; 2:197-214.

Hussen AAME, Saikhu A. Modeling Of Student Graduation Prediction Using the Naive Bayes Classifier Algorithm. In: 2024 3rd International Conference on Creative Communication and Innovative Technology (ICCIT). IEEE; 2024. pp. 1-8.

Zheng X, Li C. Predicting students’ academic performance through machine learning classifiers: a study employing the Naive Bayes classifier (NBC). International Journal of Advanced Computer Science and Applications 2024; 15.

Akanbi OB. Application of naive bayes to students’ performance classification. Asian journal of probability and statistics 2023; 25:35-47.

Meiriza A, Lestari E, Putra P et al. Prediction graduate student use naive bayes classifier. In: Sriwijaya International Conference on Information Technology and Its Applications (SICONIAN 2019). Atlantis Press; 2020. pp. 370-375.

Rawal A, Lal B. Predictive model for admission uncertainty in high education using Naïve Bayes classifier. Journal of Indian Business Research 2023; 15:262-277.

Perez JG, Perez ES. Predicting student program completion using Naïve Bayes classification algorithm. International Journal of Modern Education and Computer Science 2021; 13:57-67.

Rimal Y, Sharma N, Alsadoon A. The accuracy of machine learning models relies on hyperparameter tuning: student result classification using random forest, randomized search, grid search, bayesian, genetic, and optuna algorithms. Multimedia Tools and Applications 2024; 83:74349-74364.

Thamilselvan R, Rajalaxmi R, Gothai E et al. Forecasting Students Academic Achievement Using Machine Learning Techniques. In: 2024 2nd International Conference on Artificial Intelligence and Machine Learning Applications Theme: Healthcare and Internet of Things (AIMLA). IEEE; 2024. pp. 1-6.

Jawthari M, Stoffová V. Predicting students’ academic performance using a modified kNN algorithm. Pollack Periodica 2021; 16:20-26.

Contreas-Bravo LE, Nieves-Pimiento N, González Guerrero K. Prediction of University-Level Academic Performance through Machine Learning Mechanisms and Supervised Methods. Ingeniería 2023; 28.

Alturki S, Cohausz L, Stuckenschmidt H. Predicting Master’s students’ academic performance: an empirical study in Germany. Smart Learning Environments 2022; 9:1-22.

Gupta V, Singhal P, Khattri V. Enhancing Predictive Accuracy in Education: A Detailed Analysis of Student Performance Using Machine Learning Models. Tuijin Jishu/Journal of Propulsion Technology 2024; 45:2024.

Fayyad U, Piatetsky-Shapiro G, Smyth P. From data mining to knowledge discovery in databases. AI magazine 1996; 17:37-37.

Dehghani AA, Movahedi N, Ghorbani K, Eslamian S. Decision tree algorithms. In: Handbook of hydroinformatics. Elsevier; 2023. pp. 171-187.

Sabzevari Y, Eslamian S. Bayesian theory: Methods and applications. In: Handbook of Hydroinformatics. Elsevier; 2023. pp. 57-68.

Li H. K-nearest neighbor. In: Machine Learning Methods. Springer; 2023. pp. 55-66.

Reeta R, Pavithra G, Priyanka V, Raghul J. Predicting autism using naive Bayesian classification approach. In: 2018 International Conference on Communication and Signal Processing (ICCSP). IEEE; 2018. pp. 0109-0113.