Open Access   Article

A Comparative Study on Various Clustering Techniques in Data Mining

M. Kasthuri1 , S. Kanchana2 , R. Hemalatha3

Section:Research Paper, Product Type: Journal Paper
Volume-06 , Issue-11 , Page no. 1-8, Dec-2018

Online published on Dec 31, 2018

Copyright © M. Kasthuri, S. Kanchana, R. Hemalatha . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

Citation

IEEE Style Citation: M. Kasthuri, S. Kanchana, R. Hemalatha, “A Comparative Study on Various Clustering Techniques in Data Mining”, International Journal of Computer Sciences and Engineering, Vol.06, Issue.11, pp.1-8, 2018.

MLA Style Citation: M. Kasthuri, S. Kanchana, R. Hemalatha "A Comparative Study on Various Clustering Techniques in Data Mining." International Journal of Computer Sciences and Engineering 06.11 (2018): 1-8.

APA Style Citation: M. Kasthuri, S. Kanchana, R. Hemalatha, (2018). A Comparative Study on Various Clustering Techniques in Data Mining. International Journal of Computer Sciences and Engineering, 06(11), 1-8.

           

Abstract

Clustering is the process of combination of identical objects into same classes. A cluster is a grouping of data objects that are analogous to one another within the same cluster and are disparate to the objects in other clusters. Data clustering can be performed on various areas such as data mining, statistics, machine learning, spatial database, biology and marketing. Machine learning is classified into supervised and unsupervised learning. Clustering is the example of unsupervised learning that has no predefined classes and deals with unknown samples. Cluster analysis can be done with different types of methods includes partitioning methods, hierarchical methods, density based methods, grid based methods and model based methods. Quality of clusters can be determined by the two factors that they are high intra-cluster similarity and low inter-cluster similarity. In this paper, various clustering techniques has been analyzed in data mining in terms of methodology adopted, dataset handled, accuracy, advantages and limitations.

Key-Words / Index Term

Agglomative approach, Clustering, K-means, K-medoid

References

[1] Pushpalatha K.P., and Raju G., “Text Document Retrieval through Clustering using Meaningful Frequent Ordered Word Patterns", In: International Journal of Applied Engineering Research, Volume.13, pp. 4822-4833, ISSN: 0973-4562, 2018.
[2] Hemanth Somasekar and Kavya Naveen, “Text Categorization and graphical representation using Improved Markov Clustering", In: International Journal of Intelligent Engineering and Systems,Volume.11, pp.107-116, March 2018.
[3] Dr.Vo Ngoc Phu and Dr.Vo Thi Ngoc Tran, “English Sentiment Classification Using an YULEQ Similarity Measure and the One dimensional Vectors in a Parallel Network Environment”, In: Journal of Theoretical and Applied Information Technology, Volume.96, pp.3356-3382, ISSN: 1992-8645, June 2018.
[4] Arpit Bansal, Mayur Sharma, Shalini Goel, “Improved K-mean Clustering Algorithm for Prediction Analysis using Classification Technique in Data Mining”, In: International Journal of Computer Applications, Volume.157, pp.35- 40, January 2017.
[5] Vairaprakash Gurusamy S., Kannan, J. and Regan Prabhu, “Mining the Attitude of Social Network Users using K-means”, In: International Journal of Advanced Research in Computer Science and Software Engineering, Volume.7, Issue.5, pp.226-230, ISSN: 2277-128X, May 2017.
[6] Suman, Pinki Rani, “A Survey on STING and CLIQUE Grid Based Clustering Methods”, In: International Journal of Advanced Research in Computer Science, Volume.8, pp.1510-1512, ISSN: 0976-5697, May-June 2017.
[7] Chitra K., and Maheswari D., “A Comparative Study of Various Clustering Algorithms in Data Mining”, In: International Journal of Computer Science and Mobile Computing, Volume.6, Issue.8, pp.109-115, ISSN: 2320–088X, August 2017.
[8] Sajana T., Sheela Rani C.M., and Narayana K.V, “A Survey on Clustering Techniques for Big Data Mining”, In: Indian Journal of Science and Technology, Volume.9, pp.1-12, ISSN: 0974-6846, January 2016.
[9] Shilna S. and Navya EK, “Heart Disease Forecasting System using K-mean Clustering Algorithm with PSO and other Data Mining Methods”, In: International Journal On Engineering Technology and Sciences, Volume.3, Issue.4, pp.51-55, ISSN(P): 2349-3968, ISSN (O): 2349-3976, April 2016.
[10] Kiranjeet Kaur and Lalit Mann Singh, “Heart Disease Prediction System Using ANOVA, PCA And SVM Classification”, In: International Journal of Advance Research, Ideas and Innovations in Technology, Volume.2, Issue.3, pp.1-6, ISSN: 2454-132X, 2016.
[11] Swarndeep Saket J. and Sharnil Pandya, “An Overview of Partitioning Algorithms in Clustering Techniques”, In: International Journal of Advanced Research in Computer Engineering & Technology, Volume.5, Issue.6, pp.1943-1946, ISSN: 2278–1323, June 2016.
[12] Abdullah Z. and Hamdan A. R, “Hierarchical Clustering Algorithms in Data Mining”, In: International Journal of Computer and Information Engineering, Volume.9, pp.2201- 2206, 2015.
[13] Rajalakshmi K, and Dhenakaran, Roobini N, “Comparative Analysis of K-Means Algorithm in Disease Prediction”, In: International Journal of Science, Engineering and Technology Research (IJSETR), Volume.4, Issue.7, pp.2697-2699, ISSN: 2278-7798, July 2015.
[14] Abdullah Z, Hamdan A. R, “Hierarchical Clustering Algorithms in Data Mining”, In: International Journal of Computer and Information Engineering, Volume.9, pp.2201- 2206, 2015.
[15] Jiawei Han and Micheline Kamber, “Data Mining Concepts and Techniques”, Morgan Kaufmann, Third Edition, 2012.
[16] Mohammed M. Abu Tair, Alaa M. El-Halees, “Mining Educational Data to Improve Students’ Performance: A Case Study”, In: International Journal of Information and Communication Technology Research, Volume.2, pp.140-146, February 2012.
[17] M. Kasthuri, Dr. S. Britto Ramesh Kumar, “A Framework for Language Independent Stemmer Using Dynamic Programming”, In: International Journal of Applied Engineering Research (IJAER), Print ISSN 0973-4562, Volume.10, Number.18, pp.39000-39004 Online ISSN1087-1090, 2015.
[18] M. Kasthuri, Dr. S. Britto Ramesh Kumar, “PLIS: Proposed Language Independent Stemmer for Information Retrieval Systems Using Dynamic Programming”, In: 2016 World Congress on Computing and Communication Technologies, ISBN: 978-1-5090-5573-9, pp.132-135, IEEE, 2016.
[19] M. Kasthuri and Dr. S. Britto Ramesh Kumar, “Multilingual Phonetic Based Stem Generation”, In: Proceedings of the Second International Conference on Emerging Research in Computing, Information Communication and Applications (ERCICA-2014), ELSEVIER Science :: Technology India, ISBN:9789351072 607, Volume.1, pp.437-442, 01-02 August, 2014.
[20] Dr.M.Kasthuri and Dr.S.Britto Ramesh Kumar, “PLIS: Proposed Language Independent Stemmer Performance Evaluation”, In: International Journal of Advanced Research in Computer Science & Technology (IJARCST 2017), Volume.5, Issue.4, ISSN: 2347-8446 (Online), ISSN:2347-9817 (Print), Oct-Dec, 2017.