Open Access   Article Go Back

Comparing clustering Algorithms with Diabetic Datasets in WEKA Tool

G.G.Gokilam 1 , K.Shanthi 2

Section:Review Paper, Product Type: Journal Paper
Volume-3 , Issue-2 , Page no. 1-5, Feb-2015

Online published on Feb 28, 2015

Copyright © G.G.Gokilam , K.Shanthi . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: G.G.Gokilam , K.Shanthi, “Comparing clustering Algorithms with Diabetic Datasets in WEKA Tool,” International Journal of Computer Sciences and Engineering, Vol.3, Issue.2, pp.1-5, 2015.

MLA Style Citation: G.G.Gokilam , K.Shanthi "Comparing clustering Algorithms with Diabetic Datasets in WEKA Tool." International Journal of Computer Sciences and Engineering 3.2 (2015): 1-5.

APA Style Citation: G.G.Gokilam , K.Shanthi, (2015). Comparing clustering Algorithms with Diabetic Datasets in WEKA Tool. International Journal of Computer Sciences and Engineering, 3(2), 1-5.

BibTex Style Citation:
@article{_2015,
author = {G.G.Gokilam , K.Shanthi},
title = {Comparing clustering Algorithms with Diabetic Datasets in WEKA Tool},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {2 2015},
volume = {3},
Issue = {2},
month = {2},
year = {2015},
issn = {2347-2693},
pages = {1-5},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=392},
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=392
TI - Comparing clustering Algorithms with Diabetic Datasets in WEKA Tool
T2 - International Journal of Computer Sciences and Engineering
AU - G.G.Gokilam , K.Shanthi
PY - 2015
DA - 2015/02/28
PB - IJCSE, Indore, INDIA
SP - 1-5
IS - 2
VL - 3
SN - 2347-2693
ER -

VIEWS PDF XML
3096 2577 downloads 2607 downloads
  
  
           

Abstract

Data mining is the process of discover useful information from large datasets. The data mining techniques are used to analyze and evaluate diabetic dataset in the field of bio-medical. One of the most important techniques of data mining is clustering which is used to analyzing data from different perspectives and summarizing into useful information. Clustering is the task of assigning a set of objects into group called clusters. This paper discusses different clustering algorithms like cobweb, DBSCAN, EM, Farthest first, filtered cluster hierarchical cluster, OPTICS, simple Kmeans. The algorithms are used to compare its performance by Time taken to build the clusters, the cluster differentiated by its true positive and true negative values. Our main aim to show the comparison of the different cluster algorithms are evaluated in weka tool (Data mining Tool) and find out which algorithm will be most suitable for the diabetes dataset.

Key-Words / Index Term

Cluster, Diabetes , Weka ,Data Mining

References

[1] Jiawei Han and Micheline Kamber, “Data Mining Concepts and Techniques”, second edition, Morgan Kaufmann Publishers an imprint of Elsevier.
[2]A.K. JAIN Michigan State University, M.N.MURTY Indian Institute of Science AND P.J. FLYNN The Ohio State University: “Data Clustering”.
[3] P. Vijaya, M N Murthy and D K Subramanian. Leaders-sub leaders, “An efficient hierarchical clustering algorithm for large data sets”,Pattern Recognition Letters (2004) 505-513.
[4] Rama. B,“A Survey on clustering Current status and challenging issues” (IJCSE) International Journal on Computer Science and Engineering Vol. 02, No. 09, 2010, 2976-2980.
[5] M. Pramod Kumar “Simultaneous Pattern and Data Clustering Using Modified K-Means Algorithm” International Journal on Computer Science and Engineering Vol. 02, No. 06, 2010, 2003-2008.
[6] Miroslav Marinov, M.S.,1 Abu Saleh Mohammad Mosa, M.S.,1 Illhoi Yoo, Ph.D.,1,2 and Suzanne Austin Boren, Ph.D., MHA1,2 “ Data-Mining Technologies for Diabetes: A Systematic Review” Journal of Diabetes Science and Technology Volume 5, Issue 6, November 2011 © Diabetes Technology Society.
[7] Celeux, G. and Govaert, G. (1992). “A classification EM algorithm for clustering and two stochastic versions. Computational statistics and data analysis”, 14:315–332
[8] Narendra Sharma , Aman Bajpai , Mr. Ratnesh Litoriya, “Comparison the various clustering algorithms of weka tools” International Journal of Emerging Technology and Advanced Engineering Website: www.ijetae.com (ISSN 2250-2459, Volume 2, Issue 5, May 2012).
[9] Dr. Wenjia Wang, “Tutorial for DM tool Weka 1 CMP: Data Mining and Statistics within the Health Services”.
[10] K. Rajesh, V. Sangeetha ,“ Application of Data Mining Methods and Techniques for Diabetes Diagnosis” International
Journal of Engineering and Innovative Technology (IJEIT) Volume 2, Issue 3, September 2012