Fast Algorithms for Finding the Common Subsequence of Multiple Sequences

Huang, Kuo-Si; Yang, Chang-Biau; Tseng, Kuo-Tsung

題名:	Fast Algorithms for Finding the Common Subsequence of Multiple Sequences
作者:	Huang, Kuo-Si Yang, Chang-Biau Tseng, Kuo-Tsung
關鍵字:	computational biology common subsequence approximate algorithm
期刊名/會議名稱:	2004 ICS會議
摘要:	The longest common subsequence (LCS) algorithm is a useful method for measuring the identities and for finding similar subsequences in several sequences. Unfortunately, the longest common subsequence problem is NP-hard. In the past years, some algorithms, with several different approaches, have been proposed for finding the LCS of two given sequences. The complexity of these algorithms is about O(n2) in general and worst cases, where n is the length of sequences. When the given sequences are very long, these algorithms will take very long time and thus will become impractical. To overcome the disadvantage of time consuming, some efforts are devoted to the development of heuristic and approximate algorithms for finding the LCS. Such algorithms provide feasible solutions in practical application, such as searching in databases. However, there are few efforts for finding the LCS of more than two sequences. In this paper, we propose two approximate algorithms for finding the LCS of multiple sequences. The time complexity of our algorithms are O(kn) and O(2kn + 3n), where  is the size of symbol set, k and n are the number and length of input sequences, respectively. In the experimental results, our algorithm finds the common subsequences whose lengths are about 0.8\|LCS\| in average for two random sequences with uniform distribution. In the rank-identity experimental result, it shows that our methods are suitable in practical application.
日期:	2006-10-12T07:59:04Z
分類:	2004年 ICS 國際計算機會議

文件中的檔案：

檔案	描述	大小	格式
ce07ics002004000173.pdf		315.83 kB	Adobe PDF	檢視/開啟

顯示文件完整紀錄

在 DSpace 系統中的文件，除了特別指名其著作權條款之外，均受到著作權保護，並且保留所有的權利。

逢甲大學校園典藏知識庫