Please use this identifier to cite or link to this item: http://repository.kln.ac.lk/handle/123456789/20373
Title: An Unsupervised Machine Learning Approach for Churn Prediction
Authors: Prasanth, S.
Rathnayaka, R.M.K.T.
Arumawadu, H.
Keywords: Churn
Unsupervised Machine Learning
Principal Component Analysis
K-Means Clustering
Issue Date: 2019
Publisher: 4th International Conference on Advances in Computing and Technology (ICACT ‒ 2019), Faculty of Computing and Technology, University of Kelaniya, Sri Lanka
Citation: Prasanth, S., Rathnayaka, R.M.K.T. and Arumawadu, H. (2019). An Unsupervised Machine Learning Approach for Churn Prediction. 4th International Conference on Advances in Computing and Technology (ICACT ‒ 2019), Faculty of Computing and Technology, University of Kelaniya, Sri Lanka. p16.
Abstract: Customer churn is one of the critical issues faced by the firms nowadays. Telecommunication industry is no exception to this rule. In this industry, keeping the existing subscriber (customer) is more valuable than acquiring a new subscriber (attracting new customers costs approximately 5 times higher than retaining the existing customers). Therefore, predicting the attrition behavior of customers in advance is a significant task. This behavior has triggered most of the researchers to focus on developing the churn prediction model in several industries. Anyhow, in most of the time supervised machine learning techniques have been incorporated in this regard. But in here, an unsupervised machine learning approach has been proposed. A local telecommunication company can be approached for the purpose of conducting this research. Around 10,000 postpaid subscriber details with 20 attributes have been obtained and analyzed during this research. Further, Principal Component Analysis (PCA) and Kmeans clustering algorithm have been utilized with the intention of reducing the dimensionality between features and to find the churners and non-churners respectively. The results obtained from the PCA have revealed that, 16 principal components which represent all the 20 features are considered as most important aspects to cover the entire data. Moreover, totally 6 clusters have been generated and some particular features that tend to show high contributions were identified during the principal component analysis have been analyzed towards each cluster. The proposed approach has finally revealed that out of the 6 clusters three (3) representing 4888 are churners and the other three (3) representing 5112 are non-churners. It could be ensured that, this approach would assist the future researchers to have a promising start for combining the unsupervised technique with the supervised one.
URI: http://repository.kln.ac.lk/handle/123456789/20373
Appears in Collections:ICACT 2019

Files in This Item:
File Description SizeFormat 
16.pdf11.53 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.