Open Access   Article Go Back

Detection of Longest Common Sub Sequence in Normal DNA and Dengue Virus Affected Human DNA using Self Organizing Map

G. Tamilpavai1 , C. Vishnuppriya2

  1. Dept. of Computer Science and Engineering, Government College of Engineering, Tirunelveli, Tamil Nadu, India.
  2. Dept. of Computer Science and Engineering, Government College of Engineering, Tirunelveli, Tamil Nadu, India.

Section:Research Paper, Product Type: Journal Paper
Volume-8 , Issue-1 , Page no. 1-7, Jan-2020

CrossRef-DOI:   https://doi.org/10.26438/ijcse/v8i1.17

Online published on Jan 31, 2020

Copyright © G. Tamilpavai, C. Vishnuppriya . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: G. Tamilpavai, C. Vishnuppriya, “Detection of Longest Common Sub Sequence in Normal DNA and Dengue Virus Affected Human DNA using Self Organizing Map,” International Journal of Computer Sciences and Engineering, Vol.8, Issue.1, pp.1-7, 2020.

MLA Style Citation: G. Tamilpavai, C. Vishnuppriya "Detection of Longest Common Sub Sequence in Normal DNA and Dengue Virus Affected Human DNA using Self Organizing Map." International Journal of Computer Sciences and Engineering 8.1 (2020): 1-7.

APA Style Citation: G. Tamilpavai, C. Vishnuppriya, (2020). Detection of Longest Common Sub Sequence in Normal DNA and Dengue Virus Affected Human DNA using Self Organizing Map. International Journal of Computer Sciences and Engineering, 8(1), 1-7.

BibTex Style Citation:
@article{Tamilpavai_2020,
author = {G. Tamilpavai, C. Vishnuppriya},
title = {Detection of Longest Common Sub Sequence in Normal DNA and Dengue Virus Affected Human DNA using Self Organizing Map},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {1 2020},
volume = {8},
Issue = {1},
month = {1},
year = {2020},
issn = {2347-2693},
pages = {1-7},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=4987},
doi = {https://doi.org/10.26438/ijcse/v8i1.17}
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
DO = {https://doi.org/10.26438/ijcse/v8i1.17}
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=4987
TI - Detection of Longest Common Sub Sequence in Normal DNA and Dengue Virus Affected Human DNA using Self Organizing Map
T2 - International Journal of Computer Sciences and Engineering
AU - G. Tamilpavai, C. Vishnuppriya
PY - 2020
DA - 2020/01/31
PB - IJCSE, Indore, INDIA
SP - 1-7
IS - 1
VL - 8
SN - 2347-2693
ER -

VIEWS PDF XML
336 485 downloads 139 downloads
  
  
           

Abstract

Bioinformatics is an active research area which combines biological matter as well as computer science research. Detection of disease causing human Deoxyribo Nucleic Acid (DNA) sequence analysis is one of the major application areas under bioinformatics. Among the severe diseases, the number of Dengue cases and deaths are raised in Tamil Nadu. Identification of sequence motifs involved in Dengue virus is essential for early prediction and saving human life. It includes wide ranges of steps for disease diagnosing. The scope of this proposed work is to provide the longest common subsequence which present in a normal and Dengue virus affected human DNA sequence. The human DNA sequences are collected from National Center for Biotechnology Information (NCBI) database. Human DNA sequence is separated as k-mer using k-mer separation rule. From that, the separated k-mers are clustered using Self Organizing Map (SOM) algorithm. In which mean, median and standard deviation are used as features for clustering k-mers. Then obtained k-mers clusters are given to the Longest Common Subsequence (LCSS) algorithm to find common subsequence with higher length, which presents in every k-mers clusters. Time consumption for identification of LCSS is compared for both normal and Dengue virus affected DNA.

Key-Words / Index Term

Bioinformatics, K-mers, Longest Common Sub Sequence (LCSS), String pattern matching algorithms

References

[1] Vinayak Majki, Sudip Paul and Rachna Jain, “Bioinformatics for Healthcare Applictions”, IEEE Conference, pp.2014-207, 2019.
[2] Terasa K.Attwood, David J.Parry-Smith and Phukan, Introduction to bioinformatics, Noida(U.P), India: Pearson India Education Services Pvt. Ltd, pp.221, 2008.
[3] Izzat Alsmadi and Maryam Nuser, “String Matching Evaluation Methods for DNA Comparison”, International Journal of Advanced Science and Technology”, Vol.47, pp.13-32, 2012.
[4] Sasikala S, Ratha Jeyalakshmi T, “Extensive Review on Computational Predictions of Genomic Regulatory Sequences”, International Journal of Computer Sciences and Engineering, Vol.07, Issue.08, pp.91-94, 2019.
[5] Amit U Sinha and Raj Bhatnagar, “Efficient and Scalable Motif Discovery using Graph-based Search”, IEEE symposium on Computational Intelligence in Bioinformatics and Computational Biology, pp.197-204, 2007.
[6] Khumukcham Robindro, Ashoke Das, “Effectiveness of Ssaha Algorithm for Searching Motif in Large Databases of DNA Sequences” , International Journal of Scientific Research in Computer Science and Engineering, Vol.5, Issue.4, pp.79-87, 2017.
[7] S.Rajesh, S.Prathima and Dr.L.S.S.Reddy, “Unusual Pattern Detection in DNA Database using KMP Algorithm”, International Journal of Computer Applications (0975-8887), Vol.1, Issue.22, pp.1-5, 2010.
[8] Benjamin Schuster-Bockler and Alex Bateman, “Protein interactions in human genetic diseases”, Genome Biology, Vol.9, Issue 1, Article R9, pp.R9.1-R9.12, 2008.
[9] Chein-Hung Huang, Huai Shun Peng and KA-Lok Ng, “Prediction of Cancer Proteins by Integrating Protein Interaction, Domain frequency and Domain Interaction Data using Machine Learning Algorithms”, BioMed Research International, Vol.2015,pp.1-10, 2015.
[10] Lei Yang, Xudong Zhao and Xianglong Tang, “Predicting Disease-Related Proteins Based on Clique Backbone in Protein-Protein Interaction Network”, International Journal of Biological Sciences, Vol.10, Issue.7, pp.677-688, 2014.
[11] Pankaj Bhanbri, O.P. Gupta, “Phylogenetic Tree Construction for Distance based Methods”, International Journal of Scientific Research in Computer Science and Engineering, Vol.5, Issue.3, pp.142-149, 2017.
[12] Sumedha S.Gunawardena, “Optimum-time, Optimum-space, Algorithms for k-mer Analysis of Whole Genome Sequences”, Journal of Bioinformatics and Comparative Genomics, Vol.1, pp.1-12, 2014.
[13] Teuvo Kohonen and Panu Somervuo, “Self-organizing maps of symbol strings”, Elsevier, Neurocomputing 21, pp.19-30, 1998.
[14] Marghny Mohamed, Abeer A. Al-Mehdhar, Mohamed Bamatraf and Moheb R.Girgis, “Enhanced Self-Organizing Map Neural Network for DNA Sequence Classification”, Intelligent Information Management, Vol.5, pp.25-33, 2013.
[15] Dr.S.A.M.Rizvi and Pankaj Agarwal, “A New Bucket-Based Algorithm for Finding LCS from two given Molecular Sequences”, IEEE, Third International Conference on Information Technology: New Generations, 2006.
[16] Xuyu Xiang, Dafang Zhang and Jiaohua Qin, “A New Algorithm for the Longest Common Subsequence Problem”, IEEE, International Conference on Computational Intelligence and Security Workshops, pp.112-115, 2007.
[17] Coasts S. Iliopoulos and M. Sohel Rahman, “Algorithms for Computing Variants of the Longest Common Subsequence Problem”, Elsevier – Theoretical Computer Science, pp.255-267, 2008.