Open Access   Article Go Back

An Overview of Ontology Based Text Document Clustering Algorithms

A. Awachar1 , R. Bairagi2 , V.L. Hegade3 , M. Khandagale4

Section:Technical Paper, Product Type: Journal Paper
Volume-2 , Issue-2 , Page no. 60-64, Feb-2014

Online published on Feb 28, 2014

Copyright © A. Awachar, R. Bairagi, V.L. Hegade, M. Khandagale . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: A. Awachar, R. Bairagi, V.L. Hegade, M. Khandagale, “An Overview of Ontology Based Text Document Clustering Algorithms,” International Journal of Computer Sciences and Engineering, Vol.2, Issue.2, pp.60-64, 2014.

MLA Style Citation: A. Awachar, R. Bairagi, V.L. Hegade, M. Khandagale "An Overview of Ontology Based Text Document Clustering Algorithms." International Journal of Computer Sciences and Engineering 2.2 (2014): 60-64.

APA Style Citation: A. Awachar, R. Bairagi, V.L. Hegade, M. Khandagale, (2014). An Overview of Ontology Based Text Document Clustering Algorithms. International Journal of Computer Sciences and Engineering, 2(2), 60-64.

BibTex Style Citation:
@article{Awachar_2014,
author = {A. Awachar, R. Bairagi, V.L. Hegade, M. Khandagale},
title = {An Overview of Ontology Based Text Document Clustering Algorithms},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {2 2014},
volume = {2},
Issue = {2},
month = {2},
year = {2014},
issn = {2347-2693},
pages = {60-64},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=53},
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=53
TI - An Overview of Ontology Based Text Document Clustering Algorithms
T2 - International Journal of Computer Sciences and Engineering
AU - A. Awachar, R. Bairagi, V.L. Hegade, M. Khandagale
PY - 2014
DA - 2014/02/28
PB - IJCSE, Indore, INDIA
SP - 60-64
IS - 2
VL - 2
SN - 2347-2693
ER -

VIEWS PDF XML
3545 3457 downloads 3697 downloads
  
  
           

Abstract

Text document clustering is an important activity in data mining. It is emerged from text retrieval, and had important application in establishing information retrieval, knowledge management system. Clustering can help to get solutions for many problems associated with real time applications such as, in commercial; in biotechnology; in geography; in the banking sector; in the insurance industry; in the Internet etc. Hence It is important to know different ways available to implement clustering. In Text based clustering approach using title of document it is found out to which cluster this document belongs But It doesn�t give better results because it may possible that same document is renamed with two different names, as content of both documents are similar it is expected that the document should go to the same cluster but depending on the name of the document it may go to two different clusters . A new approach called semantic based text clustering [1] comes into picture in which entire document is parsed and depending on its content it is clustered. Ontology based text clustering [2] is a way to implement semantic based clustering. In this paper we discussed about different Ontology based algorithms like K-means, DBScan, SOM etc.

Key-Words / Index Term

Term-Clustering, k-means, Single-Linkage, DBSCAN, Self-Organizing Maps, F1Measure

References

[1]. �Ontology-based Semantic clustering� by dr. Aida alls and dr. Karina gibert computer science and mathematics sanroma aPh.d. thesis supervised by department of Tarragona.
[2]. �Ontology-based Text Document Clustering� by Andreas Hotho and Alexander Maedche and teffen Staab Institute AIFB, University of Karlsruhe, 76128 Karlsruhe, Germany.
[3]. �Ontology-based Text Clustering� by A. Hotho and S. Staab A. Maedche.
[4]. � Survey of Clustering Algorithms � by Rui Xu, Student Member, IEEE and Donald Wunsch II, Fellow, IEEE.
[5]. �Support Vector Clustering� by Asa Ben-Hur asa ,Raymond and Beverly, Nello Critianini, John Shawe-Taylor and Bob Williamson.
[6]. �Modern Information Retrieval � a book written by yates & neto.