Cluster Method Using A Combination of Cluster K-Prototype Algorithm and Genetic Algorithm for Mixed Data
DOI:
https://doi.org/10.34123/jurnalasks.v7i2.23Keywords:
Data Mining, Cluster Analysis, Mixed Data, K-Prototype Algorithm, Genetic AlgorithmAbstract
Clustering is one of the main methods in data mining that useful to explore the data. One conventional clustering methods namely the K -Means algorithm efficient for large dataset and numeric data types but not for categorical data type. K-prototype algorithm eliminates the limitations of the numerical data but can also be used on categorical data. But the solutions generated by the algorithm is a local optimal solution in which one of the causes is the determination of the initial cluster’s center. Deal with these problems, the genetic algorithm was proposed for solving this global optimasitation problem. The results of the study indicate that the cluster’s center optimization with genetic algorithm success to improve the accuracy of the results of the cluster with K–Prototype algorithm.
Downloads
References
Gil David, Amir Averbuch, "SpectralCAT:Categorical spectral clustering of numerical and nominal data," Pattern Recognition, vol. 45, pp. 416-433, 2012.
J.Han Kamber, Data Mining Concepts and Techniques, 2nd ed. San Fransisco, United States of America: Dianne Cerra, 2006.
J. Suguna, M.Arul Selvi, "Ensemble Fuzzy Clustering for Mixed Numeric and Categorical Data," International Journal of Computer Applications (0975-8887), vol. 42 - No 43, Maret 2012.
M. Ramakrishnan, D. Tennyson Jayaraj, "Modified K-Means Algorithm for effective Clustering of Categorical Data Sets," International Journal of Computer Applications (0975-8887), vol. 89 - No 7, Maret 2014.
Ramesh Valaboju, N. Raghava Rao V.N. Prasad Pinisetty, "Hybrid Algorith for Clustering Mixed Data Sets," IOSR Jornal of Computer Engineeering (IOSRJCE), vol. 6, no. 2, pp. 09-13, Sep-Okt 2012.
Zhexue Huang, "Clustering Large Data Sets with Mixed Numeric and Categorical Values".