CLV-Enhanced RFM Framework for Customer Segmentation in Indonesian SMEs Using K-Means Clustering



Metrics Analysis (Dimensions & PlumX)

Indexing:
Similarity:

© 2026 Dio Rizkyandita, Heri Wijayanto, I Gde Putu Wirarama Wedashwara Wirawan, Mosiur Rahaman

Micro, Small, and Medium Enterprises (MSMEs) contribute more than 61% of Indonesia's Gross Domestic Product, yet most of them still face limitations in leveraging transactional data for customer retention strategies. Prior studies have extensively combined Recency, Frequency, and Monetary (RFM) analysis with the K-Means algorithm for customer segmentation, but the majority treat Customer Lifetime Value (CLV) only as a post-hoc label assigned to clusters after the clustering process is finalized, rather than as a feature that shapes the segment structure from the beginning. This study addresses three research questions: how CLV can be effectively integrated as a clustering input, what segmentation structure emerges from this approach, and what concrete retention strategies can be derived for MSMEs with limited analytical capabilities. The proposed framework incorporates CLV, calculated as the discounted historical net sales using a 10% annual discount rate, as the fourth feature in the K-Means feature space with K-Means++ initialization, alongside RFM variables standardized using Z-scores. The framework is applied to a real dataset from a coffee shop MSME in Indonesia, comprising 19,126 transactions, of which 4,310 are member transactions from 472 unique registered customers, recorded throughout January–December 2023. The optimal number of clusters is determined through the convergence of the Elbow Method and the Silhouette Coefficient, both indicating four clusters as the best solution with a silhouette score of 0.5105. The segmentation divided customers into four tiers: Platinum, Gold, Silver, and Bronze. A key finding was a highly concentrated value distribution, with just 1.48% of customers (n = 7) contributing 21.66% of total revenue and CLV. This pattern is significantly more skewed than the traditional 80/20 Pareto rule. This concentration is interpreted through the lenses of habit formation, small base amplification, and comparative empirical evidence. The four-tier framework translates into differentiated retention strategies: VIP retention, value uplift, repeat-purchase incentives, and win-back campaigns, with monetary thresholds calibrated to each segment's median value.

 

Keywords: K-Means, Clustering, RFM, Customer Lifetime Value, Customer Segmentation, MSMEs.

Alves Gomes, M., & Meisen, T. (2023). A review on customer segmentation methods for personalized customer targeting in e-commerce use cases. Information Systems and E-Business Management, 21(3), 527–570. https://doi.org/10.1007/s10257-023-00640-4

Arthur, D., & Vassilvitskii, S. (2007). k-means++: The advantages of careful seeding. In Proceedings of the Eighteenth Annual ACM-SIAM Symposium on Discrete Algorithms (SODA '07) (pp. 1027–1035). Society for Industrial and Applied Mathematics.

Aslantaş, G., Gençgül, M., Rumelli, M., Özsaraç, M., & Bakırlı, G. (2023). Customer segmentation using K-means clustering algorithm and RFM model. Deu Muhendislik Fakultesi Fen ve Mühendislik, 25(74), 491–503. https://doi.org/10.21205/deufmd.2023257418

Bank Indonesia. (2024). BI 7-day reverse repo rate (BI7DRR). Bank Indonesia. https://www.bi.go.id/en/statistik/indikator/bi-7day-rr.aspx

Djun, S. F., Gunadi, I. G. A., & Sariyasa, S. (2024). Analisis Segmentasi Pelanggan pada Bisnis dengan Menggunakan Metode K-Means Clustering pada Model Data RFM [Customer Segmentation Analysis in Business Using the K-Means Clustering Method on the RFM Data Model]. JTIM : Jurnal Teknologi Informasi Dan Multimedia, 5(4), 354–364. https://doi.org/10.35746/jtim.v5i4.434

Fader, P. S., & Hardie, B. G. S. (2009). Probability Models for Customer-Base Analysis. Journal of Interactive Marketing, 23(1), 61–69. https://doi.org/10.1016/j.intmar.2008.11.003

Gupta, S., Hanssens, D., Hardie, B., Kahn, W., Kumar, V., Lin, N., Ravishanker, N., & Sriram, S. (2006). Modeling Customer Lifetime Value. Journal of Service Research, 9(2), 139–155. https://doi.org/10.1177/1094670506293810

Gupta, S., Lehmann, D. R., & Stuart, J. A. (2004). Valuing Customers. Journal of Marketing Research, 41(1), 7–18. https://doi.org/10.1509/jmkr.41.1.7.25084

Gustriansyah, R., Suhandi, N., & Antony, F. (2019). Clustering optimization in RFM analysis based on k-means. Indonesian Journal of Electrical Engineering and Computer Science, 18(1), 470–477. https://doi.org/10.11591/ijeecs.v18.i1.pp470-477

Haddadi, A. M., & Hamidi, H. (2025). A hybrid model for improving customer lifetime value prediction using stacking ensemble learning algorithm. Computers in Human Behavior Reports, 18, 100616. https://doi.org/10.1016/j.chbr.2025.100616

Han, J., Kamber, M., & Pei, J. (2012). Data mining: Concepts and techniques (3rd ed.). Morgan Kaufmann. https://doi.org/10.1016/C2009-0-61819-5

Hicham, N., & Karim, S. (2022). Analysis of unsupervised machine learning techniques for an efficient customer segmentation using clustering ensemble and spectral clustering. International Journal of Advanced Computer Science and Applications, 13(10), 514–523. https://doi.org/10.14569/IJACSA.2022.0131016

Ikotun, A. M., Ezugwu, A. E., Abualigah, L., Abuhaija, B., & Heming, J. (2023). K-means clustering algorithms: A comprehensive review, variants analysis, and advances in the era of big data. Information Sciences, 622, 178–210. https://doi.org/10.1016/j.ins.2022.11.139

Iqbal, N. M., Iskandar, Y. A., & Zulvia, F. E. (2024). Customer Segmentation Using the K-means Clustering Algorithm and Recency Frequency Monetary Model at Pharmaceutical Product Wholesaler. International Journal of Research in Vocational Studies, 4(2), 53–60. https://doi.org/10.53893/ijrvocas.v4i2.293

Jamunadevi, C., Tamil Selvan, S., Govindarajan, M., Saravanan, C., & Janaki Raman, B. R. (2021). LRFM model for customer purchase behaviour using K-Means algorithm. IOP Conference Series: Materials Science and Engineering, 1055(1), 012111. https://doi.org/10.1088/1757-899x/1055/1/012111

Kanchanapoom, K., & Chongwatpol, J. (2023). Integrated customer lifetime value (CLV) and customer migration model to improve customer segmentation. Journal of Marketing Analytics, 11(2), 172–185. https://doi.org/10.1057/s41270-022-00158-7

Khajvand, M., & Tarokh, M. J. (2011). Estimating customer future value of different customer segments based on adapted RFM model in retail banking context. Procedia Computer Science, 3, 1327–1332. https://doi.org/10.1016/j.procs.2011.01.011

Kumar, V., & Reinartz, W. (2018). Customer relationship management: Concept, strategy, and tools (3rd ed.). Springer. https://doi.org/10.1007/978-3-662-55381-7

Laksono, F. A., Rachmat, B., & Sutarso, Y. (2023). B2B customer segmentation based on customer lifetime value concept and RFM modeling. International Journal of Economics Development Research, 4(3), 1185–1199. https://doi.org/10.37385/ijedr.v4i3.2952

Li, Y., Chu, X., Tian, D., Feng, J., & Mu, W. (2021). Customer segmentation using K-means clustering and the adaptive particle swarm optimization algorithm. Applied Soft Computing, 113, 107924. https://doi.org/10.1016/j.asoc.2021.107924

Ling, S. S., Too, C. W., Wong, W. Y., & Hoo, M. H. (2024). Customer Relationship Management System for Retail Stores Using Unsupervised Clustering Algorithms with RFM Modeling for Customer Segmentation. 2024 IEEE 14th Symposium on Computer Applications & Industrial Electronics (ISCAIE), 1–6. https://doi.org/10.1109/ISCAIE61308.2024.10576353

Widhyastuti, L. P. W., Sukajaya, I. N., & Aryanto, K. Y. E. (2022). Customer profiling berdasarkan model RFM dengan metode K-Means pada institusi pendidikan untuk menunjang strategi bisnis di masa pandemi Covid-19 [Customer profiling based on the RFM model with the K-Means method in educational institutions to support business strategies during the Covid-19 pandemic]. JTIM: Jurnal Teknologi Informasi dan Multimedia, 4(2), 94–108. https://doi.org/10.35746/jtim.v4i2.232

Marisa, F., Ahmad, S. S. S., Yusof, Z. I. M., Fachrudin, & Aziz, T. M. A. (2019). Segmentation model of customer lifetime value in small and medium enterprise (SMEs) using K-Means clustering and LRFM model. International Journal of Integrated Engineering, 11(3), 169–180. https://doi.org/10.30880/ijie.2019.11.03.018

Matz, A., & Hermawan, A. T. (2020). Customer loyalty clustering model using K-Means algorithm with LRIFMQ parameters. Jurnal INFORM, 5(2), 54–61. https://doi.org/10.25139/inform.v0i1.2691

Nugroho, N., & Adhinata, F. (2022). Penggunaan Metode K-Means dan K-Means++ Sebagai Clustering Data Covid-19 di Pulau Jawa. Teknika, 11, 170–179. https://doi.org/10.34148/teknika.v11i3.502

Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., & Duchesnay, E. (2011). Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12, 2825–2830.

Perdana, S. A., Florentin, S. F., & Santoso, A. (2022). Analisis segmentasi pelanggan menggunakan K-Means clustering studi kasus aplikasi Alfagift [Customer segmentation analysis using K-Means clustering case study of Alfagift application]. Sebatik, 26(2), 420–427. https://doi.org/10.46984/sebatik.v26i2.2134

Reichheld, F. F., & Sasser, W. E. (1990). Zero defections: quality comes to services. Harvard Business Review, 68(5), 105—111. http://europepmc.org/abstract/MED/10107082

Reinartz, W. J., & Kumar, V. (2000). On the profitability of long-life customers in a noncontractual setting: An empirical investigation and implications for marketing. Journal of Marketing, 64(4), 17–35. https://doi.org/10.1509/jmkg.64.4.17.18077

Rhomadhona, H., Kusrini, W., Aprianti, W., & Permadi, J. (2025). Implementation of K-Means Clustering for Social Assistance Recipients with Silhouette Score Evaluation. Brilliance: Research of Artificial Intelligence, 5(1), 136–143. https://doi.org/10.47709/brilliance.v5i1.5900

Schmittlein, D. C., Morrison, D. G., & Colombo, R. (1987). Counting your customers: Who are they and what will they do next? Management Science, 33(1), 1–24. https://doi.org/10.1287/mnsc.33.1.1

Sinaga, K. P., & Yang, M.-S. (2020). Unsupervised K-Means Clustering Algorithm. IEEE Access, 8, 80716–80727. https://doi.org/10.1109/ACCESS.2020.2988796

Tabianan, K., Velu, S., & Ravi, V. (2022). K-Means clustering approach for intelligent customer segmentation using customer purchase behavior data. Sustainability, 14(12), 7243. https://doi.org/10.3390/su14127243

Valentini, T., Roederer, C., & Castéran, H. (2024). From redesign to revenue: Measuring the effects of servicescape remodeling on customer lifetime value. Journal of Retailing and Consumer Services, 77, 103681. https://doi.org/10.1016/j.jretconser.2023.103681

Wood, W., & Rünger, D. (2016). Psychology of habit. Annual Review of Psychology, 67, 289–314. https://doi.org/10.1146/annurev-psych-122414-033417

Wu, J., Shi, L., Lin, W.-P., Tsai, B., Li, Y., Yang, L., & Xu, G. (2020). An Empirical Study on Customer Segmentation by Purchase Behaviors Using a RFM Model and K-Means Algorithm. Mathematical Problems in Engineering, 2020, 1–7. https://doi.org/10.1155/2020/8884227

Data Set Transaksi 2023
Data Set Transaksi 2023
Analysis Code in google Collab
Data result table RFM-CLV-SC-K-Means

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

The copyright is reserved to Research in Education, Technology, and Multiculture that is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Address:

CV. IMRECS: Jalan Gagak I, Kecamatan Larangan, Kota Tangerang. Postal Code: 15155

STKIP Al Islam Tunas Bangsa: Jalan Z.A. Pagar Alam No. 41 Gedong Meneng Bandar Lampung