Optimization of academic performance prediction using linear regression with selectk-best
DOI:
https://doi.org/10.35335/cit.Vol16.2025.994.pp386-393Keywords:
Feature Selection, Linear Regression, Prediction, SelectK-Best, Student performanceAbstract
This study discusses the prediction of student performance by considering factors that can influence academic performance. In this research, the SelectK-Best feature selection technique and linear regression were used to enhance the accuracy of the prediction. The selection of this topic is based on the importance of understanding the factors that influence student performance and how feature selection can help build more efficient models. The methods applied in this study include data exploration through EDA, the use of SelectK-Best to select the most significant features, and linear regression to build the prediction model. The evaluation metrics show that the model with feature selection achieved MAE of 0.6293, MSE of 0.5945, RMSE of 0.7711, and R² Score of 0.9144, demonstrating the model's excellent performance. In contrast, the model without feature selection did not produce better results than the model with feature selection. This emphasizes the importance of applying feature selection techniques in building more accurate prediction models. This study contributes to predicting student performance through the use of systematic and effective methods, while also opening opportunities for further research in the context of education and more diverse data.
Downloads
References
Dyah Vierdiana, “ANALISIS FAKTOR-FAKTOR YANG MEMPENGARUHI KESEHATAN MENTAL DI KALANGAN MAHASISWA PERGURUAN TINGGI,” Jurnal Review Pendidikan dan Pengajaran, vol. 7, no. 1, pp. 1553–1558, 2024.
Ulfah, “Pengaruh Kesehatan Mental Terhadap Prestasi Akademik Mahasiswa Tingkat Akhir,” 2023.
I. Widiasanti, J. T. Adelia, L. Rosidin, M. F. Viola, and M. Daniarista, “Implementasi Penggunaan Big Data Dalam Menganalisis Faktor Yang Mempengaruhi Kinerja Siswa Dalam Hasil Ujian,” Akademika, vol. 12, no. 01, pp. 239–250, 2023.
A. M. Br Peranginangin and N. Izzati, “Analisis Faktor-Faktor Yang Mempengaruhi Keaktifan Siswa Kelas VIII-1 SMPN 11 Tanjungpinang Dalam Pembelajaran Matematika,” MATH-EDU: Jurnal Ilmu Pendidikan Matematika, vol. 8, no. 1, pp. 24–36, Apr. 2023, doi: 10.32938/jipm.8.1.2023.24-36.
S. Asyura, S. Mulyani Jamil, and F. Ilmu Kesehatan, “Faktor-Faktor yang Mempengaruhi Prestasi Belajar pada Siswa di SMP Negeri 1 Baitussalam Kabupaten Aceh Besar Factors Affecting Student’s Learning Achievement in Junior High School State 1 District Baitussalam Regency Aceh Besar,” 2022.
J. Ablian, M. Gantang, and A. Panes, “The Effects of Alcohol Consumption on Academic Performance: A Literature Review,” vol. Volume 5, pp. 77–84, Feb. 2023.
M. W. Manoppo, F. F. Pitoy, and K. B. Tampi, “Hubungan Tingkat Stres dengan Konsumsi Alkohol pada Remaja,” MAHESA?: Malahayati Health Student Journal, vol. 3, no. 6, pp. 1710–1725, Jun. 2023, doi: 10.33024/mahesa.v3i6.10585.
S. Calnan and M. P. Davoren, “College students’ perspectives on an alcohol prevention programme and student drinking–A focus group study,” Nordic studies on alcohol and drugs, vol. 39, no. 3, pp. 301–321, 2022.
R. Ajeng, “CONSUME ALCOHOL BEHAVIOR ON STUDENTS FACULTY OF SPORT SCIENCE STATE UNIVERSITY OF SURABAYA.”
Julia C. Pulumbara, Sekplin A.S. Sekeon, and Wulan P.J. Kaunang, “HUBUNGAN ANTARA KONSUMSI ALKOHOL DENGAN GANGGUAN FUNGSI KOGNITIF PADA PENDUDUK DI KELURAHAN TUMUMPA DUA KECAMATAN TUMINTING KOTA MANADO,” 2018.
T. A. Harmawan and L. S. Istiyowati, “Big Data Dan Pemahaman Faktor Penunjang Kinerja Akademik Siswa Untuk Meningkatkan Efektivitas Pembelajaran,” JKTP: Jurnal Kajian Teknologi Pendidikan, vol. 7, no. 1, p. 035, Apr. 2024, doi: 10.17977/um038v7i12024p035.
P. P. P. Thereza, G. Lumacad, and R. Catrambone, “Predicting Student Performance Using Feature Selection Algorithms for Deep Learning Models,” in 2021 XVI Latin American Conference on Learning Technologies (LACLO), 2021, pp. 1–7. doi: 10.1109/LACLO54177.2021.00009.
N. Kartik, R. Mahalakshmi, and K. A. Venkatesh, “Predicting Students’ Performance Using Feature Selection-Based Machine Learning Technique,” in Proceedings of Data Analytics and Management, A. Swaroop, Z. Polkowski, S. D. Correia, and B. Virdee, Eds., Singapore: Springer Nature Singapore, 2024, pp. 389–397.
Monica Tiara Gunawan, Jeane Yosefa Tine, and Chatarina Enny Murwaningtyas, “Model decision tree untuk prediksi prestasi akademik matematika siswa kelas VIII SMP Frater Don Bosco Manado,” Jurnal Pendidikan Informatika dan Sains, vol. 13, no. 2, 2024.
E. Ahmed, “Student Performance Prediction Using Machine Learning Algorithms,” Applied Computational Intelligence and Soft Computing, vol. 2024, no. 1, p. 4067721, Jan. 2024, doi: https://doi.org/10.1155/2024/4067721.
R. Habibie Sukarna et al., “ANALISIS PREDIKSI KELULUSAN MAHASISWA TEPAT WAKTU MENGGUNAKAN METODE ALGORITMA MACHINE LEARNING DAN FEATURE SELECTION,” vol. 8, no. 2, 2024.
N. R. Abid-Althaqafi and H. A. Alsalamah, “The Effect of Feature Selection on the Accuracy of X-Platform User Credibility Detection with Supervised Machine Learning,” Electronics (Switzerland), vol. 13, no. 1, Jan. 2024, doi: 10.3390/electronics13010205.
M. Saelan and A. Subekti, “K-BEST SELECTION UNTUK MENINGKATKAN KINERJA ARTIFICIAL NEURAL NETWORK DALAM MEMPREDIKSI RANGE HARGA PONSEL,” INTI Nusa Mandiri, vol. 19, pp. 10–16, Jul. 2024, doi: 10.33480/inti.v19i1.5554.
Andy Hermawan, Nila Rusiardi Jayanti, Zia Tabaruk, Faizal Lutfi Yoga Triadi, Aji Saputra, and M.Rahmat Hidayat Syachrudin, “Membangun Model Prediksi Churn Pelanggan yang Akurat,” Merkurius?: Jurnal Riset Sistem Informasi dan Teknik Informatika, vol. 2, no. 6, pp. 67–81, Oct. 2024, doi: 10.61132/merkurius.v2i6.398.
G. L. Pritalia, “Analisis Komparatif Algoritme Machine Learning pada Klasifikasi Kualitas Air Layak Minum,” 2022.
Y. Zhai, W. Song, X. Liu, L. Liu, and X. Zhao, A Chi-Square Statistics Based Feature Selection Method in Text Classification. 2018. doi: 10.1109/ICSESS.2018.8663882.
A. Qoiriah and Y. Yamasari, “Prediksi Nilai Akhir Mahasiswa dengan Metode Regresi (Studi Kasus Mata Kuliah Pemrograman Dasar).”
M. A. Kurniawan, B. Very Christioko, M. R. Abidin, S. V. Rivaldo, and T. R. U. Utami, “Analisis Prediksi Kinerja Akademik Menggunakan Regresi Linear Berganda.”
D. Jatikusumo and R. R. Hidayat, “OPTIMASI PENENTUAN LOKASI BENCANA ALAM DENGAN REGRESI LINIER SEDERHANA DAN BERGANDA,” Jurnal Informatika dan Teknik Elektro Terapan, vol. 12, no. 3S1, Oct. 2024, doi: 10.23960/jitet.v12i3S1.5257.
D. Ruswanti, D. Susilo, and R. Riani, “Implementasi CRISP-DM pada Data Mining untuk Melakukan Prediksi Pendapatan dengan Algoritma C.45,” Go Infotech: Jurnal Ilmiah STMIK AUB, vol. 30, no. 1, pp. 111–121, Jun. 2024, doi: 10.36309/goi.v30i1.266.
C. Schröer, F. Kruse, and J. M. Gómez, “A systematic literature review on applying CRISP-DM process model,” in Procedia Computer Science, Elsevier B.V., 2021, pp. 526–534. doi: 10.1016/j.procs.2021.01.199.
S. N. Luqman et al., “Komparasi Algoritma Klasifikasi Genre Musik pada Spotify Menggunakan CRISP-DM,” 2021.
P. Cortez and A. Silva, “Student Alcohol Consumption,” Kaggle. Accessed: Jan. 09, 2025. [Online]. Available: https://www.kaggle.com/datasets/uciml/student-alcohol-consumption
M. Komorowski, D. C. Marshall, J. D. Salciccioli, and Y. Crutain, “Exploratory Data Analysis,” in Secondary Analysis of Electronic Health Records, M. I. T. C. Data, Ed., Cham: Springer International Publishing, 2016, pp. 185–203. doi: 10.1007/978-3-319-43742-2_15.
D. Chicco, M. J. Warrens, and G. Jurman, “The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation,” PeerJ Comput Sci, vol. 7, pp. 1–24, 2021, doi: 10.7717/PEERJ-CS.623.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 M. Rangga Ramadhan Saelan

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

