Paper Title : A LION OPTIMIZATION BASED K-PROTOTYPE CLUSTERING ALGORITHM FOR MIXED DATA
ISSN : 2394-2231
Year of Publication : 2022
10.5281/zenodo.6410014
MLA Style: A LION OPTIMIZATION BASED K-PROTOTYPE CLUSTERING ALGORITHM FOR MIXED DATA " Mr.C.Mani M.C.A.,M.Phil.,M.E., C. Mehala " Volume 9 - Issue 2 International Journal of Computer Techniques (IJCT) ,ISSN:2394-2231 , www.ijctjournal.org
APA Style: A LION OPTIMIZATION BASED K-PROTOTYPE CLUSTERING ALGORITHM FOR MIXED DATA " Mr.C.Mani M.C.A.,M.Phil.,M.E., C. Mehala " Volume 9 - Issue 2 International Journal of Computer Techniques (IJCT) ,ISSN:2394-2231 , www.ijctjournal.org
Abstract
Data Mining is used to gather information from huge set of data. Clustering is a grouping task for a set of objects. Clustering algorithms are divided by several types including hierarchical clustering algorithms,partitioning clustering and density based. The partitioning clustering includes K-Means clustering, K-Modes Clustering and CLARA algorithm. The K-Means clustering is only used for numeric data which has original optima. The K-Modes extends to the K-Means when the sphere is categorical. One of the most important algorithms for clustering heterogeneous type of data is the K- Prototype algorithm. This algorithm is veritably salutary for clustering large data sets. One of the simple optimization methods is Lion Optimization, that could be applied effectively for enhancing clustering results. It's useful for handling mixed data set. This leads a good optimization to calculate the centroid with K- Prototype clustering method. To overcome the problem in this clustering, Lion optimization Algorithm can be used. The proposed algorithm is enforced on standard standard dataset taken from UCI Machine Learning Repository. The Lion Optimization grounded KPrototype clustering algorithm yields a better result when compared with the K- Prototype clustering.
Reference
[1] Arun Prabha .K, N. Karthi Keyani Visalakshi,”Particle Swarm Optimization based K-Prototype Clustering Algorithm” IOSR Journal of Computer Engineering (IOSR-JCE) eISSN: 2278-0661, ISSN: 2278-8727, Vol 17, pp. 56-62, [2015]. [2] Arun K Pujari,”Data Mining Techniques”, Third Edison ISBN: 978-81-7371-884-7[2013]. [3] Fengmei W and H. Lixia, “A Missing Data Imputation Method Based on Neighbor Rules”, Computer Engineering, vol. 38, no. 21, [2012]. [4] Gong Jing,Li Anming, “An Implementation of Clustering Algorithm Based on K-means”, Journal of Hunan University of Technology, vol. 22, pp. 52–54, [2008]. [5] Izhar Ahmad,”K-Mean and K-Prototype Algorithms Performance Analysis”, American Review of Mathematics and Statistics, ISSN 2374-2348, Vol. 2, pp. 95-109, [2014]. [6] Jinchao Ji, Wei Pang, Chunguang Zhou, Xiao Han,Zhe Wang,”A fuzzy k-prototype clustering algorithm for mixed numeric and categorical data”, Elsevier , ISSN 0950-7051,Vol. 30,[2012]. [7] Li Xinwu,”A New Text Clustering Algorithm Based on Improved K-Means”, Journal of Software, Vol. 7, doi:10.4304/jsw.7.1.95-101, [2012]. [8] Ming-Yi Shih, Jar-Wen Jheng and Lien-Fu Lai,” A Two-Step Method for Clustering Mixed Categroical and Numeric Data”, Tamkang Journal of Science and Engineering, Vol. 13, pp. 11-19, [2010]. [9] Maziar Yazdani, Fariborz Jolai,”Lion Optimization Algorithm (LOA): A Nature-Inspired Metaheuristic Algorithm”, Elsevier, Journal of Computational Engineering, [2015]. [10] Madhuri R, M Ramakrishna Murty, JVR Murthy, PVGD Prasad Reddy, and Suresh C Satapathy, "Cluster Analysis on Different Data Sets Using K-Modes and K-Prototype Algorithms," ICT and Critical Infrastructure: Proceedings of the 48th Annual Convention of Computer Society of India-Vol II, pp. 137-144, [2014]. [11] Meng Ying, Luo Ke, et al,”K-medoids clustering algorithm method based on ant colony Algorithm”, J.Computer Engineering and Applications, 48(16):136-139, [2012]. [12] Navneet, Nasib Singh Gill,”A Novel Algorithm for Big Data Classification Based on Lion Optimization”, Journal of Theoretical and Applied Information Technology, ISSN: 1992-8645, EISSN: 1817-3195, Vol.95, [2017]. [13] Preeti Arora, Dr. Deepali , Shipra Varshney ”Analysis of K-Means and K-Medoids Algorithm For Big Data” Elsevier,Procedia Computer Science 78, pp. 507 – 512, [2016]. [14] Pang-Ning Tan, Michael Steinbach, Vipin Kumar, “Introduction to Data Mining”, Pearson, ISBN: 978-93-82291-49-7 [2015]. [15] T.V Suresh Kumar, B.Eswara Reddy, Jagadish S Kallimani,”Data Mining Principles & Applications”, Elsevier, ISBN: 978-93-82291-49- 7 [2013]. [16] Tang Zhe, Luo Keb,”K-means Clustering Algorithm Method Based on Shuffled Frog Leaping Algorithm”, Advanced Materials Research, Vol. 989-994, pp 2245-2249,[2014] [17] UCI Machine Learning Repository, http://archive.ics.uci.edu/ml/datasets.html. [18] Wu Sen, Chen Hong and Feng Xiaodong ,”Clustering Algorithm for Incomplete Data Sets with Mixed Numeric and Categorical Attributes”, International Journal of Database Theory and Application, ISSN: 2005-4270 IJDTA ,Vol.6, pp.95-104,[2013]. [19] Zhexue Huang,”Extensions to the K-Means Algorithm for Clustering Large Data Sets with Categorical Values”, Data Mining and Knowledge Discovery 2, pp. 283–304, [1998]. [20] Zhexue Huang,”Clustering Large Data Sets with Mixed Numeric and Categorical Values”, CSIRO Mathematical and Information Sciences, Conference, pp-21-34, [1997]. [21] C.-H. Cheng and Y.-S. Chen, ‘‘Classifying the segmentation of customer value via RFM model and RS theory,’’ Expert Syst. Appl., vol. 36, no. 3, pp. 4176–4184, Apr. 2009. [22] C.-C.-H. Chan, C.-B. Cheng, and W.-C. Hsien, ‘‘Pricing and promotion strategies of an online shop based on customer segmentation and multiple objective decision making,’’ Expert Syst. Appl., vol. 38, no. 12, pp. 14585–14591, Nov. 2011. [23] R.-S. Wu and P.-H. Chou, ‘‘Customer segmentation of multiple category data in ecommerce using a soft-clustering approach,’’ Electron. Commerce Res. Appl., vol. 10, no. 3, pp. 331–341, May 2011
Keywords
— Kmeans Clustering, Lion Optimization, Data Mining, Machine Learning.