題名: | Hidden Markov Model Based DNA-binding Proteins Prediction by Mining on Sequence and Structure Information |
作者: | Chen, Wei-Jhih Chuang, Po-Cheng Kao, Hung-Yu |
關鍵字: | Machine learning Hidden Markov Model DNA-binding proteins Support vector machines |
期刊名/會議名稱: | 2008 ICS會議 |
摘要: | In the post-genome period, the protein domain structures are published rapidly. For figuring out the cell function, the mechanism of protein-DNA interaction is an important subject in resent bioinformatics research and has not been comprehensively studied. Several machine learning based methods have been attempted to solve this issue. Until recently, few studies have been successful in translating the tertiary structure characteristics of proteins into appropriate features for utilizing the learning mechanism to predict DNA-binding proteins. In this work, a novel machine learning approach based on using HMMs (hidden Markov Models) to express the characteristics of DNA-binding proteins in the both aspects of amino acid sequence and tertiary structure are presented. Moreover, several helpful features of DNA-binding proteins are also utilized in the proposed method, such as residue composition, structure pattern composition and accessible surface area of residues. We develop a SVM (Support Vector Machine) based classifier to predict general DNA-binding proteins, and obtain the accuracy of 88.45% through 5-folds cross-validation. Furthermore, a response element specific classifier is constructed for predicting response element specific DNA-binding proteins, and is obtained the precision of 96.57% with recall rate as 88.83% in average. |
日期: | 2009-02-12T03:21:56Z |
分類: | 2008年 ICS 國際計算機會議 |
文件中的檔案:
檔案 | 描述 | 大小 | 格式 | |
---|---|---|---|---|
ce07ics002008000106.pdf | 225.6 kB | Adobe PDF | 檢視/開啟 |
在 DSpace 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。