Information Retrieval System Using Vector Space Model for Document Summarization

Vaibhav A. Chavan, Santosh R. Durugkar

Section: Research Paper, Product Type: Journal Paper
Volume 2, Issue 10, pp. 46-50, Oct 2014

Online published on Nov 02, 2014

Copyright © Vaibhav A. Chavan, Santosh R. Durugkar. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to Cite this Paper

IEEE Style Citation: Vaibhav A. Chavan, Santosh R. Durugkar, “Information Retrieval System Using Vector Space Model for Document Summarization,” International Journal of Computer Sciences and Engineering, Vol.2, Issue.10, pp.46-50, 2014.

MLA Style Citation: Vaibhav A. Chavan, Santosh R. Durugkar. "Information Retrieval System Using Vector Space Model for Document Summarization." International Journal of Computer Sciences and Engineering 2.10 (2014): 46-50.

APA Style Citation: Vaibhav A. Chavan, Santosh R. Durugkar (2014). Information Retrieval System Using Vector Space Model for Document Summarization. International Journal of Computer Sciences and Engineering, 2(10), 46-50.

BibTex Style Citation:
@article{Chavan_2014,
author = {Vaibhav A. Chavan and Santosh R. Durugkar},
title = {Information Retrieval System Using Vector Space Model for Document Summarization},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {10 2014},
volume = {2},
Issue = {10},
month = {10},
year = {2014},
issn = {2347-2693},
pages = {46-50},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=283},
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=283
TI - Information Retrieval System Using Vector Space Model for Document Summarization
T2 - International Journal of Computer Sciences and Engineering
AU - Vaibhav A. Chavan
AU - Santosh R. Durugkar
PY - 2014
DA - 2014/11/02
PB - IJCSE, Indore, INDIA
SP - 46
EP - 50
IS - 10
VL - 2
SN - 2347-2693
ER -

Abstract

Document summarization is the process of reducing the size of a text document while retaining its most important content in the reduced document (the summary). In recent years, a great deal of work has been done on document summarization. Various techniques are available, but most of them extract sentences based on sentence similarity alone. Because the context of the document is also important for summarization, our method uses a term indexing model that assigns an index weight to the document as well as to the sentences in it. The proposed system uses context-based document indexing built on the vector space model. This indexing model works with document frequency (DF) and term frequency (TF); together, DF and TF give a document indexing weight that is used for summarization. We compare our system with a traditional term-based indexing model and aim to show that it gives better results.
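
The abstract does not state the exact weighting formula, so the following is only a minimal sketch of how TF and DF weights can drive extractive summarization in a vector space model. It assumes the textbook form tf * log(N / df), treats each sentence as one "document", and normalises by sentence length; the function names `tokenize` and `summarize` are hypothetical, and the paper's context-based indexing may differ.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase a string and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def summarize(sentences, k=1):
    """Return the k highest-scoring sentences, in their original order.

    Assumption: each sentence is treated as one "document", and a term's
    weight is tf * log(N / df), the textbook TF-IDF form. The paper's
    own weighting scheme is not given in the abstract and may differ.
    """
    tokens = [tokenize(s) for s in sentences]
    n = len(sentences)
    # Document frequency: number of sentences that contain each term.
    df = Counter(term for toks in tokens for term in set(toks))

    scores = []
    for toks in tokens:
        tf = Counter(toks)  # term frequency within this sentence
        weight = sum(count * math.log(n / df[term]) for term, count in tf.items())
        # Normalise by length so long sentences are not favoured automatically.
        scores.append(weight / len(toks) if toks else 0.0)

    # Pick the k best sentences, then restore document order.
    top = sorted(range(n), key=lambda i: scores[i], reverse=True)[:k]
    return [sentences[i] for i in sorted(top)]


if __name__ == "__main__":
    doc = [
        "the cat sat",
        "the dog sat",
        "the vector space model ranks documents",
    ]
    print(summarize(doc, k=1))
```

A term that appears in every sentence (here "the") gets weight log(1) = 0, so sentences are ranked by the terms that distinguish them from the rest of the document.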

Keywords / Index Terms

Vector space model, document frequency, term frequency, document context
