完整後設資料紀錄
DC 欄位 | 值 | 語言 |
---|---|---|
dc.contributor.author | Chang, Ye-In Jr | |
dc.contributor.author | Huang, Lee-wen Jr | |
dc.contributor.author | Chang, Hsi-Mei Jr | |
dc.date.accessioned | 2011-01-26T01:05:19Z | |
dc.date.accessioned | 2020-05-18T03:10:40Z | - |
dc.date.available | 2011-01-26T01:05:19Z | |
dc.date.available | 2020-05-18T03:10:40Z | - |
dc.date.issued | 2011-01-26T01:05:19Z | |
dc.date.submitted | 2010-12-16 | |
dc.identifier.uri | http://dspace.lib.fcu.edu.tw/handle/2377/29970 | - |
dc.description.abstract | Proteins are the structural components of living cells and tissues, and thus an important building block in all living organisms. Patterns in proteins sequences are some subsequences which appear frequently. Patterns often denote important functional regions in proteins and can be used to characterize a protein family or discover the function of proteins. Moreover, it provides valuable information about the evolution of species. Grouping protein sequences that share similar structure helps in identifying sequences with similar functionality. Many algorithms have been proposed for clustering proteins according to their similarity. Feature-based clustering algorithms use a near–linear complexity -means based clustering algorithm. Although feature-based clustering algorithms are scalable and lead to reasonably good clusters, they consume time on performing the global approach and the local approach separately. Therefore, in this paper, we propose the hybrid algorithm to find and mark features for feature-based clustering algorithms. We observe an interesting result from the relation between the local features and the closed frequent sequential patterns. In our hybrid algorithm (CLoseLG), we first find the closed frequent sequential patterns directly. Next, we find local candidate features efficiently from the closed frequent sequential patterns and then mark the local features. Finally, we find and mark the global features. From our performance study based on the biological data and the synthetic data, we show that our proposed hybrid algorithm is more efficient than the feature-based algorithm. | |
dc.description.sponsorship | National Cheng Kung University,Tainan | |
dc.format.extent | 6p. | |
dc.relation.ispartofseries | 2010 ICS會議 | |
dc.subject | Clustering | |
dc.subject | Features | |
dc.subject | Protein databases | |
dc.subject | Sequential data | |
dc.subject.other | Biomedical Informatics | |
dc.title | A Hybrid Algorithm of Finding Features for Clustering Sequential Data | |
分類: | 2010年 ICS 國際計算機會議(如需查看全文,請連結至IEEE Xplore網站) |
文件中的檔案:
沒有與此文件相關的檔案。
在 DSpace 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。