<fmt:message key='jsp.layout.header-default.alt'/>  

DSpace@UM >
Faculty of Computer Science and Information Technology >
Conference Proceedings >
International Conference on Informatics >
Informatics 2007 >

Please use this identifier to cite or link to this item: http://hdl.handle.net/1812/344

Authors: M Nordin A Rahman
M Yazid M Saman
Aziz Ahmad
A Osman M Tap
Keywords: DNA sequence
Database classification
Optimal alignment
Rough sets theory
Issue Date: 2007
Abstract: Data classification is a vital task in large scale data mining application. DNA sequences are the basis of life and they encode all the necessary information needed to reproduce life. The size of public DNA sequence databases are growing doubling every year. This situation makes automatic classification and reduction of DNA sequences has become important for effective sequence similarity search problem. A challenge in DNA sequence similarity search is that the sequence record structure does not have any attribute that can be used for implementing classification process. In this paper, by means of filtering process an automaton based exact string matching is employed to generate a special attribute used for DNA sequence database classification and reduction. Rough sets theory provides an indiscernibility relation technique which can be used to classify and reduct the database based on some definition of ‘equivalence’. The generated attribute is used for constructing indiscernibility relation among sequences. With computational implementation, the experiments are executed to investigate the effectiveness of rough sets theory on generating DNA sequence database classification and reduction. Moreover, the experiments will demonstrate that the DNA sequence similarity search performance is significantly improved by using this approach.
Description: Proceeding of the 2nd International Conference on Informatics (Informatics 2007), 27th-28th November 2007, Hilton Petaling Jaya Hotel, Petaling Jaya, Selangor, Malaysia. page T1-41 - T1-48
URI: http://dspace.fsktm.um.edu.my/handle/1812/344
ISBN: 978-983-43491-1-0
Appears in Collections:Informatics 2007

Files in This Item:

File Description SizeFormat
AIA.pdf3.42 MBAdobe PDFView/Open

This item is protected by original copyright

Your Tags:


  © Copyright 2008 DSpace Faculty of Computer Science and Information Technology, University of Malaya . All Rights Reserved.
DSpace@UM is powered by MIT - Hawlett-Packard. More information and software credits. Feedback