Open Access   Article Go Back

An Enhanced Fuzzy Based Linkage Clustering Algorithm (EFCA) in High Dimensional Data

R. Kiruthika1 , V. Vijayakumar2

Section:Research Paper, Product Type: Journal Paper
Volume-8 , Issue-2 , Page no. 12-17, Feb-2020

CrossRef-DOI:   https://doi.org/10.26438/ijcse/v8i2.1217

Online published on Feb 28, 2020

Copyright © R. Kiruthika, V. Vijayakumar . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: R. Kiruthika, V. Vijayakumar, “An Enhanced Fuzzy Based Linkage Clustering Algorithm (EFCA) in High Dimensional Data,” International Journal of Computer Sciences and Engineering, Vol.8, Issue.2, pp.12-17, 2020.

MLA Style Citation: R. Kiruthika, V. Vijayakumar "An Enhanced Fuzzy Based Linkage Clustering Algorithm (EFCA) in High Dimensional Data." International Journal of Computer Sciences and Engineering 8.2 (2020): 12-17.

APA Style Citation: R. Kiruthika, V. Vijayakumar, (2020). An Enhanced Fuzzy Based Linkage Clustering Algorithm (EFCA) in High Dimensional Data. International Journal of Computer Sciences and Engineering, 8(2), 12-17.

BibTex Style Citation:
@article{Kiruthika_2020,
author = {R. Kiruthika, V. Vijayakumar},
title = {An Enhanced Fuzzy Based Linkage Clustering Algorithm (EFCA) in High Dimensional Data},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {2 2020},
volume = {8},
Issue = {2},
month = {2},
year = {2020},
issn = {2347-2693},
pages = {12-17},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=5023},
doi = {https://doi.org/10.26438/ijcse/v8i2.1217}
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
DO = {https://doi.org/10.26438/ijcse/v8i2.1217}
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=5023
TI - An Enhanced Fuzzy Based Linkage Clustering Algorithm (EFCA) in High Dimensional Data
T2 - International Journal of Computer Sciences and Engineering
AU - R. Kiruthika, V. Vijayakumar
PY - 2020
DA - 2020/02/28
PB - IJCSE, Indore, INDIA
SP - 12-17
IS - 2
VL - 8
SN - 2347-2693
ER -

VIEWS PDF XML
286 443 downloads 137 downloads
  
  
           

Abstract

In data mining, clustering algorithm is a powerful meta-learning tool to precisely examine the huge volume of data created by recent applications. In particular, their major objective is to group data into clusters such that data points are grouped in the similar cluster when they are “similar” according to specific metrics. Several clustering algorithms have been developed to deal with very large number of features or with a very high number of dimensions, but they are often not practical when the data is large in both aspects. To address these issues, this paper work, developed an Enhanced Fuzzy based Linkage Clustering Algorithm (EFCA), which combines FCM and cluster assignment strategy to solve the optimization problem during high dimensional data processing. The proposed EFCA approach it can work with large volumes of high dimensional dataset for discovering the outliers. The experimental results shown that the proposed EFCA performance to improve 21.9% especial in terms of Partition Accuracy (PA), Dunn Index (DI) improves 28 %, and Computational time improves 16.4% compared with other existing clusiVAT and FensiVAT algorithms.

Key-Words / Index Term

Data mining, Big data cluster analysis, Fuzzy, Linkage

References

[1]    
U. M. Fayyad, G.
Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy. Advances in Knowledge Discovery
and Data Mining. AAAI/MIT Press, 1996.



[2]    
Hoppner, F.;
Klawnn, F.; Kruse, R.; Runkler, T.;. Fuzzy Cluster Analysis: “Methods for
classification, data analysis and image recognition”, John Wiley & Sons,
Inc., New York NY., 2000.



[3]    
H. Gunadi,
“Comparing nearest neighbor algorithms in highdimensional space,” 2011.



[4]    
T. C. Havens and
J. C. Bezdek, “An efficient formulation of the improved visual assessment of
cluster tendency (iVAT) algorithm,” IEEE Trans. Knowl. Data Eng., vol. 24, no. 5, pp. 813–822, May
2012.



[5]    
D. Kumar, M.
Palaniswami, S. Rajasegarar, C. Leckie, J. C. Bezdek, and T. C. Havens,
“clusiVAT: A mixed visual/numerical clustering algorithm for big data,” in Proc.
IEEE Int. Conf. Big Data, pp. 112–117,
2013
.



[6]    
A. Fahad, N.
Alshatri, Z. Tari, A. Alamri, I. Khalil, A. Y. Zomaya, S. Foufou, and A.
Bouras, “A survey of clustering algorithms for big data: Taxonomy and empirical
analysis,” IEEE Trans. Emerging Topics Comput., vol. 2, no. 3, pp. 267–279, Sep. 2014.



[7]    
M. Popescu, J.
Keller, J. Bezdek, and A. Zare, “Random projections fuzzy c-means (RPFCM) for
big data clustering,” in Proc. IEEE Int. Conf. Fuzzy Syst., pp. 1–6, 2015.



[8]    
D. Kumar, J. C.
Bezdek, M. Palaniswami, S. Rajasegarar, C. Leckie, and T. C. Havens, “A hybrid
approach to clustering in big data,” IEEE Trans. Cybern., vol. 46, no. 10, pp.
2372–2385, Oct. 2016.



[9]    
J. C. Bezdek,
Primer on Cluster Analysis: Four Basic Methods that (Usually) Work, vol. 1.
Sarasota, FL, USA: First Edition Design Publishing, 2017.



[10]  P. Rathore, J. C. Bezdek, S. M. Erfani, S. Rajasegarar,
and M. Palaniswami, “Ensemble fuzzy clustering using cumulative aggregation on
random projections,” IEEE Trans. Fuzzy Syst., vol. 26, no. 3, pp.
1510–1524, Jun. 2018.



[11]  P. Rathore, A. S. Rao, S. Rajasegarar, E. Vanz, J.
Gubbi, and M. Palaniswami, “Real-time urban microclimate analysis using
internet of things,” IEEE Internet Things J., vol. 5, no. 2, pp.
500–511, Apr. 2018.



[12]  Punit Rathore, Dheeraj Kumar, James C. Bezdek,
Sutharshan Rajasegarar, and Marimuthu Palaniswami, "A Rapid Hybrid
Clustering Algorithm for Large Volumes of High Dimensional Data", IEEE
TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 31, NO. 4, APRIL 2019.



[13]  J. C. Dunn, “A fuzzy relative of the ISODATA process
and its use in detecting compact well-separated clusters,” J. Cybern., vol. 3, no. 3, pp. 32–57, 1973.



[14]  P. Rathore, Z. Ghafoori, J. C. Bezdek, M. Palaniswami,
and C. Leckie, “Approximating Dunn’s cluster validity indices for partitions of
big data,” IEEE Trans. Cybern.