Please use this identifier to cite or link to this item:
http://hdl.handle.net/1812/344
| Title: | DNA SEQUENCE DATABASE CLASSIFICATION AND REDUCTION: ROUGH SETS THEORY APPROACH |
| Authors: | M Nordin A Rahman; M Yazid M Saman; Aziz Ahmad; A Osman M Tap |
| Keywords: | DNA sequence; Filtering; Database classification; Optimal alignment; Rough sets theory |
| Issue Date: | 2007 |
| Abstract: | Data classification is a vital task in large-scale data mining applications. DNA sequences are the basis of life and
encode all the information needed to reproduce it. Public DNA sequence databases are doubling in size
every year. This growth makes automatic classification and reduction of DNA sequences
important for effective sequence similarity search. A challenge in DNA sequence similarity search is that the
sequence record structure has no attribute that can be used to implement a classification process. In this
paper, a filtering process based on automaton-based exact string matching is employed to generate a special
attribute used for DNA sequence database classification and reduction. Rough sets theory provides an indiscernibility
relation technique that can be used to classify and reduce the database based on some definition of ‘equivalence’.
The generated attribute is used to construct the indiscernibility relation among sequences. Experiments with a
computational implementation investigate the effectiveness of rough sets theory in generating DNA
sequence database classification and reduction. Moreover, the experiments demonstrate that DNA sequence
similarity search performance is significantly improved by this approach. |
| Description: | Proceedings of the 2nd International Conference on Informatics (Informatics 2007), 27th-28th November 2007, Hilton Petaling Jaya Hotel, Petaling Jaya, Selangor, Malaysia. Pages T1-41 to T1-48 |
| URI: | http://dspace.fsktm.um.edu.my/handle/1812/344 |
| ISBN: | 978-983-43491-1-0 |
| Appears in Collections: | Informatics 2007 |
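
The pipeline outlined in the abstract (automaton-based exact matching to generate an attribute, then an indiscernibility relation to classify and reduce the database) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the record does not say how the attribute is defined, so the sketch assumes it is a tuple of flags recording which probe words occur in a sequence. The probe words, the function names (`build_dfa`, `contains`, `attribute`, `indiscernibility_classes`, `reduce_database`) and the reduction rule (drop sequences with no hit) are all hypothetical choices made for illustration.

```python
from collections import defaultdict

ALPHABET = "ACGT"


def build_dfa(pattern):
    """Transition table of a DFA recognising `pattern` (KMP-style construction).

    Assumes `pattern` is non-empty and uses only the letters in ALPHABET.
    """
    m = len(pattern)
    dfa = [{c: 0 for c in ALPHABET} for _ in range(m)]
    dfa[0][pattern[0]] = 1
    x = 0  # state reached by the longest proper border seen so far
    for j in range(1, m):
        for c in ALPHABET:
            dfa[j][c] = dfa[x][c]      # mismatch: behave like the restart state
        dfa[j][pattern[j]] = j + 1     # match: advance one state
        x = dfa[x][pattern[j]]
    return dfa


def contains(dfa, text):
    """Return True if the DFA reaches its accepting state while reading `text`."""
    state, accept = 0, len(dfa)
    for ch in text:
        state = dfa[state].get(ch, 0)  # symbols outside ACGT reset the automaton
        if state == accept:
            return True
    return False


def attribute(seq, dfas):
    """Assumed attribute: one flag per probe word, True if the word occurs in seq."""
    return tuple(contains(d, seq) for d in dfas)


def indiscernibility_classes(db, dfas):
    """Group sequence ids whose attribute values are equal (equivalence classes)."""
    classes = defaultdict(list)
    for seq_id, seq in db.items():
        classes[attribute(seq, dfas)].append(seq_id)
    return dict(classes)


def reduce_database(classes):
    """Assumed reduction rule: keep only sequences that matched at least one word."""
    return sorted(sid for attr, ids in classes.items() if any(attr) for sid in ids)


# Toy database and probe words (hypothetical data, for illustration only).
db = {"s1": "ACGTACGT", "s2": "TTTTACGA", "s3": "GGGGCCCC", "s4": "CCACGTTT"}
dfas = [build_dfa(w) for w in ["ACGT", "TTTT"]]

classes = indiscernibility_classes(db, dfas)
reduced = reduce_database(classes)
print(classes)
print(reduced)
```

Under these assumptions, sequences with the same attribute tuple are indiscernible and fall into one class, and a later similarity search would only need to align the query against the reduced set rather than the whole database.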
This item is protected by original copyright