Title: | Clustering Gene Expression Time Series Data |
Authors: | Wu, Cheng-Che; Kuo, Chun-Nan; Huang, Jen-Peng; Kuo, Huang-Cheng |
Keywords: | microarray; gene expression; time series; clustering analysis |
Journal/Conference name: | 2004 ICS Conference |
Abstract: | Efficiently and effectively finding genes with similar behavior in microarray data is an important task in the bioinformatics community. Co-expressed genes have the same behavior or are controlled by the same regulatory mechanisms. Clustering analysis is a popular technique for grouping co-expressed genes into the same cluster. A key issue in clustering gene expression time series data is how to define the similarity between two time series. Distance measures and correlation coefficients are the commonly used similarity definitions. Under these definitions two time series may appear very distant, yet they may be similar once a few items are dropped from one of them. In this paper, we consider this new aspect of time series similarity, denoted the “shift effect,” which indicates a temporal gap between two time series. For partition-based clustering methods, users have to specify the target number of clusters. This is usually done by trial and error, picking a number from a large range. To solve this problem, we apply a sequential pattern mining technique, treating the time series as sequences. The number of frequent patterns is the number of target clusters. All the time series supporting a sequential pattern are the initial members of a cluster. Then, each time series is iteratively reassigned to a suitable cluster. |
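The “shift effect” described in the abstract can be illustrated with a small sketch. The abstract does not give the paper's actual similarity formula, so everything below is an assumption made for illustration: the function names `rms` and `shift_distance`, the `max_shift` parameter, and the choice of root-mean-square distance over the overlapping items are all hypothetical. The idea shown is only that dropping the first few items of one series can turn two distant series into near-identical ones.

```python
import math


def rms(x, y):
    """Root-mean-square difference between two equal-length sequences."""
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(x, y)) / len(x))


def shift_distance(a, b, max_shift=2):
    """Return (distance, shift) for the best alignment of a and b.

    Assumption (not from the paper): for a shift s > 0 the first s items
    of b are dropped, for s < 0 the first |s| items of a are dropped, and
    the remaining overlap is compared with an RMS distance.
    A positive shift therefore means that b lags behind a.
    """
    if len(a) != len(b):
        raise ValueError("series must have the same length")
    n = len(a)
    if not 0 <= max_shift < n:
        raise ValueError("max_shift must leave at least one overlapping item")

    best_dist, best_shift = float("inf"), 0
    for s in range(-max_shift, max_shift + 1):
        if s >= 0:
            x, y = a[: n - s], b[s:]   # drop the first s items of b
        else:
            x, y = a[-s:], b[: n + s]  # drop the first |s| items of a
        d = rms(x, y)
        if d < best_dist:
            best_dist, best_shift = d, s
    return best_dist, best_shift


# Two hypothetical expression profiles: b repeats a one time point later.
a = [1, 2, 4, 8, 4, 2, 1]
b = [0, 1, 2, 4, 8, 4, 2]

print(rms(a, b))             # distance with no shift allowed
print(shift_distance(a, b))  # best distance and the shift that achieves it
```

In this toy input the unshifted distance is large, while allowing a shift of one time point makes the two profiles match exactly on their overlap. A clustering method that used such a measure would place the two genes in the same cluster even though a plain distance measure would keep them apart.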
Date: | 2006-10-12T08:00:38Z |
Category: | 2004 ICS International Computer Symposium |
Files in this item:
File | Description | Size | Format | |
---|---|---|---|---|
ce07ics002004000206.pdf | | 234.86 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.