Analysis Of Online Loan Regional Clustering in Indonesia In 2024 Based On Outstanding And Default Rate (TWP90) Using K-Means Clustering
DOI:
https://doi.org/10.24256/kharaj.v8i1.9342Keywords:
Online Loans, K-Means Clustering, Default Risk, Outstanding, TWP90Abstract
The increase in online lending distribution in Indonesia in 2024 was not accompanied by a uniform level of credit risk across regions. This study aims to categorize online lending regions in Indonesia based on outstanding values and 90-day default rates (TWP90) using a quantitative approach based on the K-Means algorithm. Secondary data from all provinces was analyzed using RapidMiner and evaluated using the Davies–Bouldin Index (DBI). The test results showed a DBI of 0.746 at K=2, 0.376 at K=3, and 0.564 at K=4. Although K=2 yielded the lowest DBI, the K=3 model was chosen because it provided a more informative and policy-relevant risk classification. The clustering resulted in three risk clusters: Low Risk, with outstanding values and TWP90 below average; Medium Risk, with values above average; and High Risk, characterized by a very high TWP90 level despite relatively low outstanding values. These findings confirm the effectiveness of K-Means in mapping online lending risks based on regions and support more precise credit monitoring. Keywords: online loans, K-Means clustering, default risk, outstanding, TWP90.
References
Arbelaitz, O., Gurrutxaga, I., Muguerza, J., Pérez, J. M., & Perona, I. (2021). An extensive comparative study of cluster validity indices. Pattern Recognition, 115, 107870. https://doi.org/10.1016/j.patcog.2021.107870
Baesens, B., Van Gestel, T., Viaene, S., Stepanova, M., Suykens, J., & Vanthienen, J. (2003). Benchmarking state-of-the-art classification algorithms for credit scoring. Journal of the Operational Research Society, 54(6), 627–635. https://doi.org/10.1057/palgrave.jors.2601545
Davies, D. L., & Bouldin, D. W. (1979). A cluster separation measure. IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI-1(2), 224–227. https://doi.org/10.1109/TPAMI.1979.4766909
Hair, J. F., Black, W. C., Babin, B. J., & Anderson, R. E. (2020). Multivariate data analysis (8th ed.). Cengage Learning.
Han, J., Kamber, M., & Pei, J. (2012). Data mining: Concepts and techniques (3rd ed.). Morgan Kaufmann.
Han, J., Kamber, M., & Pei, J. (2022). Data mining: Concepts and techniques (4th ed.). Morgan Kaufmann.
Irawati, N., Prasetyo, E., & Hidayat, R. (2025). Penerapan algoritma K-Means clustering untuk pengelompokan wilayah berbasis indikator ekonomi. Jurnal Ilmu Komputer dan Informatika, 10(1), 45–54.
Jain, A. K. (2010). Data clustering: 50 years beyond K-means. Pattern Recognition Letters, 31(8), 651–666. https://doi.org/10.1016/j.patrec.2009.09.011
Jain, A. K. (2020). Data clustering: 50 years beyond K-means. Pattern Recognition Letters, 31(8), 651–666. https://doi.org/10.1016/j.patrec.2019.09.011
Khandani, A. E., Kim, A. J., & Lo, A. W. (2021). Consumer credit-risk models via machine-learning algorithms. Journal of Banking & Finance, 34(11), 2767–2787. https://doi.org/10.1016/j.jbankfin.2021.03.001
Kou, G., Peng, Y., & Wang, G. (2014). Evaluation of clustering algorithms for financial risk analysis. Knowledge-Based Systems, 56, 1–13. https://doi.org/10.1016/j.knosys.2013.10.005
Lessmann, S., Baesens, B., Seow, H. V., & Thomas, L. C. (2015). Benchmarking state-of-the-art classification algorithms for credit scoring: An update. European Journal of Operational Research, 247(1), 124–136. https://doi.org/10.1016/j.ejor.2015.05.030
Li, Y., Wang, Y., & Zhao, Y. (2020). Credit risk assessment in peer-to-peer lending: A clustering-based approach. Financial Innovation, 6(1), 1–18. https://doi.org/10.1186/s40854-020-00190-3
Liao, S. H., Chu, P. H., & Hsiao, P. Y. (2012). Data mining techniques and applications: A decade review. Expert Systems with Applications, 39(12), 11303–11311. https://doi.org/10.1016/j.eswa.2012.02.063
MacQueen, J. (1967). Some methods for classification and analysis of multivariate observations. In Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability (Vol. 1, pp. 281–297). University of California Press.
OECD. (2020). Consumer policy and fraud: Evidence-based policy responses. OECD Publishing.
Otoritas Jasa Keuangan. (2021). Statistik fintech lending Indonesia. OJK.
Otoritas Jasa Keuangan. (2024). Statistik fintech lending Indonesia. OJK.
Rahman, M. A., Hasan, M. K., & Sarker, I. H. (2023). Machine learning-based regional financial risk profiling using clustering techniques. Expert Systems with Applications, 213, 118919. https://doi.org/10.1016/j.eswa.2022.118919
Singh, A., & Yadav, A. (2022). Evaluation of clustering techniques using Davies–Bouldin Index in financial risk analysis. International Journal of Data Science and Analytics, 13(2), 145–158. https://doi.org/10.1007/s41060-021-00290-4
Tang, H., & Liu, Y. (2020). Credit risk assessment of peer-to-peer lending using data mining techniques. Journal of Risk and Financial Management, 13(9), 207. https://doi.org/10.3390/jrfm13090207
Zhang, Y., & Chen, W. (2021). Regional credit risk classification based on unsupervised learning methods. Journal of Risk and Financial Management, 14(9), 421. https://doi.org/10.3390/jrfm14090421
Downloads
Published
How to Cite
Issue
Section
Citation Check
License
Copyright (c) 2026 Fina Sherli Wewengkang, Arief Wibowo

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. In line with the license, authors are allowed to share and adapt the material. In addition, the material must be given appropriate credit, provided with a link to the license, and indicated if changes were made. If authors remix, transform or build upon the material, authors must distribute their contributions under the same license as the original.







