Open Access   Article Go Back

Automatic Clustering Based On Outward Statistical Testing Using Advanced Density Metrics

A.A. Jadhav1 , V.S. Gaikwad2

Section:Survey Paper, Product Type: Journal Paper
Volume-5 , Issue-1 , Page no. 32-35, Jan-2017

Online published on Jan 31, 2017

Copyright © A.A. Jadhav, V.S. Gaikwad . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: A.A. Jadhav, V.S. Gaikwad, “Automatic Clustering Based On Outward Statistical Testing Using Advanced Density Metrics,” International Journal of Computer Sciences and Engineering, Vol.5, Issue.1, pp.32-35, 2017.

MLA Style Citation: A.A. Jadhav, V.S. Gaikwad "Automatic Clustering Based On Outward Statistical Testing Using Advanced Density Metrics." International Journal of Computer Sciences and Engineering 5.1 (2017): 32-35.

APA Style Citation: A.A. Jadhav, V.S. Gaikwad, (2017). Automatic Clustering Based On Outward Statistical Testing Using Advanced Density Metrics. International Journal of Computer Sciences and Engineering, 5(1), 32-35.

BibTex Style Citation:
@article{Jadhav_2017,
author = {A.A. Jadhav, V.S. Gaikwad},
title = {Automatic Clustering Based On Outward Statistical Testing Using Advanced Density Metrics},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {1 2017},
volume = {5},
Issue = {1},
month = {1},
year = {2017},
issn = {2347-2693},
pages = {32-35},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=1151},
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=1151
TI - Automatic Clustering Based On Outward Statistical Testing Using Advanced Density Metrics
T2 - International Journal of Computer Sciences and Engineering
AU - A.A. Jadhav, V.S. Gaikwad
PY - 2017
DA - 2017/01/31
PB - IJCSE, Indore, INDIA
SP - 32-35
IS - 1
VL - 5
SN - 2347-2693
ER -

VIEWS PDF XML
827 689 downloads 564 downloads
  
  
           

Abstract

Clustering is the process of organizing objects into several groups whose members are similar in some way and is very important technique in data mining as it has applications spread extensively example marketing, biology, pattern recognition etc. Various algorithms have been proposed, published, implemented for clustering like the one published by Rodriguez and Liao but this algorithm is dependent and sensitive to specified parameters and also faces difficulties in identification of ideal problems. Another one published by G. Wang and Q. Song this algorithm do not list all possible numbers of the nearest neighbors and the accuracy is not better in terms of Olivetti face data set then impact of this on performance. This paper overcome the problem faces by above algorithm so, here proposes a new clustering method that will identify cluster centers automatically via statistical testing. Here first define a new metric to evaluate the local density of an object which is named K-density and second metric is define to evaluate the distance of an object to its neighbors with higher density. Then, product of these two metrics is used to evaluate the centrality of each object. After analyzing the distribution of these metrics further transformed the clustering center identification into a problem of extreme-value detection from a long-tailed distribution Finally, apply outward statistical testing method to detect the clustering centers automatically and then completed the clustering process by assigning each of the rest objects to the cluster that contains its nearest neighbor with higher K-density.

Key-Words / Index Term

Clustering, Clustering Center Identification, Long-tailed Distribution, Outward Statistical Testing

References

[1] G. Wang and Q. Song, "Automatic Clustering via Outward Statistical Testing on Density Metrics," in IEEE Transactions on Knowledge and Data Engineering, vol. 28, no. 8, pp. 1971-1985, Aug. 1 2016.
[2] A. Rodriguez and A. Laio, �Clustering by fast search and find of density peaks,� Science, vol. 344, no. 6191, pp. 1492�1496, 2014.
[3] C.-P. Lai, P.-C. Chung, and V. S. Tseng, �A novel two-level clustering method for time Series data analysis,� Expert Systems with Applications, vol. 37, no. 9, pp. 6319�6326, 2010.
[4] I. Rui Xu, Donald C. Wunsch, �Clustering algorithms in biomedical research: a review,� IEEE Reviews in Biomedical Engineering, vol. 3, pp. 120�154, 2010.
[5] W. C. Xiankun Yang, �A novel spatial clustering algorithm based on delaunay triangulation,� J. Software Engineering & Applications, vol. 3, pp. 141�149, 2010.
[6] B. Nadler and M. Galun, �Fundamental limitations of spectral clustering,� in Advances in Neural Information Processing Systems, 2006, pp. 1017�1024.
[7] T. Warren Liao, �Clustering of time series data-a survey,� Pattern Recognition, vol. 38, no. 11, pp. 1857�1874, Nov. 2005.
[8] o. Dongquan Liu, Sourina, �Free-parameters clustering of spatial data with non-uniform density,� in IEEE conference on cybernetics and intelligent systems, 2004, pp. 387 � 392.
[9] T. Kanungo, D. M. Mount, N. S. Netanyahu, C.D. Piatko, R. Silverman, and A. Y.Wu, �An efficient k-means clustering algorithm: analysis and implementation,� IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 24, no. 7, pp. 881 � 892, 2002.
[10] V. Estivill-Castro and I. Lee, �Argument free clustering for large spatial point-data sets via boundary extraction from Delaunay diagram,� Computers, Environment and Urban Systems, vol. 26, no. 4, pp. 315�334, 2002.
[11] P. S. Bradley, O. L. Mangasarian, and W. N. Street, �Clustering via concave minimization,� Advances in neural information processing systems, pp. 368�374, 1997.
[12] M. Ester, H.-P. Kriegel, J. Sander, and X. Xu, �A density-based algorithm for discovering clusters in large spatial databases with noise.� in Proceedings of International Conference on Knowledge Discovery and Data Mining, vol. 96, no. 34, 1996, pp. 226�231.
[13] L. Hagen and A. B. Kahng, �New spectral methods for ratio cut partitioning and clustering,� IEEE Transactions on Computer-aided design of integrated circuits and systems, vol. 11, no. 9, pp. 1074�1085,1992.
[14] W. E. Donath and A. J. Hoffman, �Lower bounds for the partitioning of graphs,� IBM Journal of Research and Development, vol. 17, no. 5, pp. 420�425, 1973.