Penanganan Data Tidak Seimbang pada Pemodelan Rotation Forest Keberhasilan Studi Mahasiswa Program Magister IPB

Authors

  • Junjun Wijaya Institut Pertanian Bogor
  • Agus M Soleh Department of Statistics, IPB
  • Akbar Rizki Department of Statistics, IPB

DOI:

https://doi.org/10.29244/xplore.v2i2.99

Abstract

Graduate school of Bogor Agricultural University (SPs-IPB) stated that not all students of IPB master program successfully complete their studies. This becomes an evaluation for IPB to be more selective in choosing students in the future. This study aims to model the success classification of IPB master students in 2011 to 2015. The classification method used is rotation forest. The percentage of students who graduated is very large compared to those who did not pass, this can cause the evaluation value different. SMOTE (Synthetic Minority Oversampling Technique) is one of method to handle such unbalanced data by generating artificial data. The ROC (Receiver Operating Characteristic) curve is built to see the optimum cut off value. There are two classification models, they are rotation forest models before and after handled by SMOTE. The comparison results show that the rotation forest model after SMOTE with cut off value 0.6 is the best model. This model can increase the sensitivity value more than 50% although the accuracy and specificity value decreased compared to the modeling before SMOTE.

Downloads

Published

2018-08-31

How to Cite

Wijaya, J., Soleh, A. M., & Rizki, A. (2018). Penanganan Data Tidak Seimbang pada Pemodelan Rotation Forest Keberhasilan Studi Mahasiswa Program Magister IPB. Xplore: Journal of Statistics, 2(2), 32–40. https://doi.org/10.29244/xplore.v2i2.99

Most read articles by the same author(s)

1 2 > >>