Classification of Village Development Status in Bekasi Regency Using Ensemble Learning and SMOTE-Based Class Balancing

Authors

  • Ridwan Mochamad Ridwan BPS-Statistics Bekasi Regency, Bekasi, Indonesia https://orcid.org/0000-0001-8482-4071
  • Erwin Tanur Training and Education Center, Statistics Indonesia, Jakarta, Indonesia

DOI:

https://doi.org/10.34123/jurnalasks.v18i1.870

Keywords:

Random Forest, SMOTE, Village Potential Statistics (PODES), Village Development Index (IDM), Classification, Machine Learning, Ensemble Learning, Bekasi Regency

Abstract

Introduction/Main Objectives: This study aims to classify village development status in Bekasi Regency using machine learning based on the 2024 Village Potential Statistics (PODES) and the Village Development Index (IDM). Background Problems: Conventional descriptive assessments ignore complex socio-economic relationships, and class imbalance further reduces model predictive performance. Novelty: This study integrates PODES data, ensemble learning, and SMOTE to improve classification, providing a reliable, data-driven framework for village profiling and planning. Research Methods: Following preprocessing and a 70:30 split, SMOTE was applied to the training data, and four tree-based models (Decision Tree, Bagging, Random Forest, XGBoost) were evaluated using standard classification metrics. Finding/Results: The Random Forest model combined with SMOTE achieved the best classification performance, with an accuracy of 0.7778 and consistently high AUC values across all classes. The most influential predictors were the dominant economic sector, number of farmer groups, availability of basic health services, and presence of micro-business units. These findings demonstrate that combining ensemble learning with SMOTE improves village development classification and provides valuable support for evidence-based rural development planning in Bekasi Regency.

Downloads

Download data is not yet available.

References

B. Rahmasari, "Paradigma Pembangunan Desa Dalam Pengelolaan Keuangan Desa Berdasarkan Undang-Undang Nomor 6 Tahun 2014 Tentang Desa," Volksgeist: Jurnal Ilmu Hukum dan Konstitusi, vol. 3, no. 2, 2020, doi: 10.24090/volksgeist.v3i2.4001.

S. Agung, Pemerintahan asli masyarakat adat: sebuah studi kepemimpinan adat di Lembah Timur Ciamis, Jawa Barat. Deepublish, 2020.

P. Hendrarso, P. Handoko, M. Faiz Ali Ramdhani, N. Andayani, and R. Tania, "Kajian Pengentasan Desa Tertinggal Melalui Pendekatan Indeks Desa Membangun," Transparansi: Jurnal Ilmiah Ilmu Administrasi, vol. 4, no. 1, 2021, doi: 10.31334/transparansi.v4i1.1607.

S. Sriningsih, E. Astuti, B. Ismiwati, and F. Ekonomi, "Implementasi PERMENDESAPDTTRANS NO. 2 Tahun 2016 Terkait Status Desa di Desa Sukarara Lombok Tengah," Jurnal Kompetitif: Media Informasi Ekonomi Pembangunan, Manajemen dan Akuntansi, vol. 6, no. 1, 2020.

A. M. Gai, A. Witjaksono, and R. R. Maulida, "Perencanaan dan Pengembangan Desa," Dream Litera Buana, 2020.

A. Amka, M. Anshar Nur, and J. Jamalluddin, "Kebijakan dan Strategi Pembangunan Daerah untuk Masa Depan," CV BRAVO PRESS Indonesia, n.d.

A. H. Nasrullah, "Implementasi algoritma Decision Tree untuk klasifikasi produk laris," Jurnal Ilmiah Ilmu Komputer Fakultas Ilmu Komputer Universitas Al Asyariah Mandar, vol. 7, no. 2, pp. 45-51, 2021.

H. Kurniawan, "Deteksi Twitter Bot menggunakan Klasifikasi Decision Tree," Jurnal Sustainable: Jurnal Hasil Penelitian dan Industri Terapan, vol. 9, no. 1, pp. 31-37, 2020.

I. Setiawan, R. F. A. Cahyani, and I. Sadida, "Exploring complex decision trees: Unveiling data patterns and optimal predictive power," Journal of Innovation And Future Technology (IFTECH), vol. 5, no. 2, pp. 112-123, 2023.

R. N. Ramadhon, A. Ogi, A. P. Agung, R. Putra, S. S. Febrihartina, and U. Firdaus, "Implementasi Algoritma Decision Tree untuk Klasifikasi Pelanggan Aktif atau Tidak Aktif pada Data Bank," Karimah Tauhid, vol. 3, no. 2, pp. 1860-1874, 2024.

S. Joses, D. Yulvida, and S. Rochimah, "Pendekatan metode ensemble learning untuk prakiraan cuaca menggunakan soft voting classifier," Journal of Applied Computer Science and Technology, vol. 5, no. 1, pp. 72-80, 2024.

T. H. Handoko, Manajemen personalia dan sumberdaya manusia. Bpfe, 2016.

D. S. Lindawaty, "Pembangunan desa pasca Undang-Undang No. 6 Tahun 2014 tentang desa [Village development post Law No. 6 of 2014 on villages]," Jurnal Politica Dinamika Masalah Politik Dalam Negeri Dan Hubungan Internasional, vol. 14, no. 1, pp. 1-21, 2023.

Direktorat Jenderal Pembangunan Desa dan Perdesaan | KDPDTT, "Tentang Indeks Desa Membangun," n.d.

H. Blockeel, L. Devos, B. Frenay, G. Nanfack, and S. Nijssen, "Decision trees: from efficient prediction to responsible AI," Front. Artif. Intell., vol. 6, p. 1124553, 2023.

L. Breiman, "Bagging predictors," Mach. Learn., vol. 24, no. 2, pp. 123-140, 1996.

H. A. Salman, A. Kalakech, and A. Steiti, "Random Forest Algorithm Overview," Babylonian Journal of Machine Learning, vol. 2024, pp. 69-79, Jun. 2024, doi: 10.58496/BJML/2024/007.

M. Niazkar et al., "Applications of XGBoost in water resources engineering: A systematic literature review (Dec 2018-May 2023)," Environmental Modelling & Software, vol. 174, p. 105971, Mar. 2024, doi: 10.1016/j.envsoft.2024.105971.

N. V. Chawla, K. W. Bowyer, L. O. Hall, and W. P. Kegelmeyer, "SMOTE: synthetic minority over-sampling technique," Journal of Artificial Intelligence Research, vol. 16, pp. 321-357, 2002.

T. Wu, H. Fan, H. Zhu, C. You, H. Zhou, and X. Huang, "Intrusion detection system combined enhanced random forest with SMOTE algorithm," EURASIP J. Adv. Signal Process., vol. 2022, no. 1, p. 39, 2022.

R. D. Fitriani, H. Yasin, and T. Tarno, "Penanganan klasifikasi kelas data tidak seimbang dengan random oversampling pada naive bayes (Studi kasus: Status peserta KB IUD di Kabupaten Kendal)," Jurnal Gaussian, vol. 10, no. 1, pp. 11-20, 2021.

S. K. Corbacioglu and G. Aksel, "Receiver operating characteristic curve analysis in diagnostic accuracy studies: A guide to interpreting the area under the curve value," Turk. J. Emerg. Med., vol. 23, no. 4, pp. 195-198, 2023.

L. Breiman, "Random forests," Mach. Learn., vol. 45, no. 1, pp. 5-32, 2001.

M. P. Pulungan, A. Purnomo, and A. Kurniasih, "Penerapan SMOTE untuk Mengatasi Imbalance Class dalam Klasifikasi Kepribadian MBTI Menggunakan Naive Bayes Classifier," Jurnal Teknologi Informasi dan Ilmu Komputer, vol. 11, no. 5, pp. 1033-1042, 2024.

D. Chicco and G. Jurman, "The Matthews correlation coefficient (MCC) should replace the ROC AUC as the standard metric for assessing binary classification," BioData Min., vol. 16, no. 1, p. 4, 2023.

Downloads

Published

2026-06-30

How to Cite

Mochamad Ridwan, R., & Tanur, E. (2026). Classification of Village Development Status in Bekasi Regency Using Ensemble Learning and SMOTE-Based Class Balancing. Jurnal Aplikasi Statistika & Komputasi Statistik, 18(1), 76–97. https://doi.org/10.34123/jurnalasks.v18i1.870