Analysis of Online Loan Regional Clustering in Indonesia in 2024 Based on Outstanding and Default Rate (TWP90) Using K-Means Clustering

Fina Sherli Wewengkang; Arief Wibowo

doi:10.24256/kharaj.v8i1.9342

Authors

Fina Sherli Wewengkang, Budi Luhur University, Indonesia
Arief Wibowo, Budi Luhur University, Indonesia

Keywords:

Online Loans, K-Means Clustering, Default Risk, Outstanding, TWP90

Abstract

The growth of online lending in Indonesia in 2024 was not accompanied by a uniform level of credit risk across regions. This study groups Indonesia's online lending regions by outstanding loan value and 90-day default rate (TWP90), using a quantitative approach based on the K-Means algorithm. Secondary data from all provinces were analyzed in RapidMiner, and the resulting clusters were evaluated with the Davies–Bouldin Index (DBI). The tests produced a DBI of 0.746 at K=2, 0.376 at K=3, and 0.564 at K=4. Although K=2 yielded the lowest DBI, the K=3 model was chosen because it provided a more informative and policy-relevant risk classification. The clustering produced three risk clusters: Low Risk, with outstanding values and TWP90 below average; Medium Risk, with values above average; and High Risk, characterized by a very high TWP90 despite relatively low outstanding values. These findings indicate that K-Means is effective for mapping online lending risk by region and can support more precise credit monitoring.
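The workflow summarized in the abstract (cluster regions on two features, then compare DBI across candidate values of K) can be sketched in a few lines of Python. This is only an illustrative sketch, not the authors' RapidMiner process: the data points are made up, the units are hypothetical, the z-score standardization step is an assumption (the abstract does not say how the features were scaled), and the helper names `standardize`, `kmeans`, and `davies_bouldin` are introduced here for illustration. Lower DBI values indicate more compact, better separated clusters.

```python
import math
import random

def standardize(rows):
    """Z-score each column so both features weigh equally in distances (assumed step)."""
    cols = list(zip(*rows))
    means = [sum(c) / len(c) for c in cols]
    # Population standard deviation; fall back to 1.0 for a constant column.
    stds = [math.sqrt(sum((v - m) ** 2 for v in c) / len(c)) or 1.0
            for c, m in zip(cols, means)]
    return [tuple((v - m) / s for v, m, s in zip(r, means, stds)) for r in rows]

def kmeans(points, k, seed=0, max_iter=100):
    """Plain Lloyd's algorithm with centers initialized from k random data points."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)
    labels = None
    for _ in range(max_iter):
        # Assignment step: each point goes to its nearest center.
        new_labels = [min(range(k), key=lambda j: math.dist(p, centers[j]))
                      for p in points]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Update step: move each center to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # an empty cluster keeps its previous center
                centers[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return labels, centers

def davies_bouldin(points, labels, centers):
    """Davies-Bouldin Index over the non-empty clusters; lower is better."""
    ids = sorted(set(labels))
    scatter = {}
    for j in ids:
        members = [p for p, lab in zip(points, labels) if lab == j]
        scatter[j] = sum(math.dist(p, centers[j]) for p in members) / len(members)
    # For each cluster, take its worst (largest) similarity ratio to any other cluster.
    worst = [max((scatter[i] + scatter[j]) / math.dist(centers[i], centers[j])
                 for j in ids if j != i)
             for i in ids]
    return sum(worst) / len(ids)

# Synthetic (outstanding, TWP90) pairs in hypothetical units; NOT the study's data.
regions = [
    (0.4, 1.2), (0.6, 1.5), (0.5, 1.0), (0.8, 1.8),  # low outstanding, low TWP90
    (9.0, 2.9), (11.5, 3.1), (10.2, 2.7),            # high outstanding
    (0.7, 6.5), (0.9, 7.2), (0.5, 6.9),              # low outstanding, very high TWP90
]

X = standardize(regions)
for k in (2, 3, 4):
    labels, centers = kmeans(X, k)
    print(f"K={k}: DBI={davies_bouldin(X, labels, centers):.3f}")
```

Because K-Means depends on its starting centers, a single run per K can land in a local optimum; repeating the run with several `seed` values and keeping the best result is a common precaution before comparing DBI across K.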
