CSpace  > 应用数学研究所
Support vector machine prediction of enzyme function with conjoint triad feature and hierarchical context
Wang, Yong-Cui2,3; Wang, Yong4; Yang, Zhi-Xia1; Deng, Nai-Yang2
2011-06-20
发表期刊BMC SYSTEMS BIOLOGY
ISSN1752-0509
卷号5页码:11
摘要Background: Enzymes are known as the largest class of proteins and their functions are usually annotated by the Enzyme Commission (EC), which uses a hierarchy structure, i.e., four numbers separated by periods, to classify the function of enzymes. Automatically categorizing enzyme into the EC hierarchy is crucial to understand its specific molecular mechanism. Results: In this paper, we introduce two key improvements in predicting enzyme function within the machine learning framework. One is to introduce the efficient sequence encoding methods for representing given proteins. The second one is to develop a structure-based prediction method with low computational complexity. In particular, we propose to use the conjoint triad feature (CTF) to represent the given protein sequences by considering not only the composition of amino acids but also the neighbor relationships in the sequence. Then we develop a support vector machine (SVM)-based method, named as SVMHL (SVM for hierarchy labels), to output enzyme function by fully considering the hierarchical structure of EC. The experimental results show that our SVMHL with the CTF outperforms SVMHL with the amino acid composition (AAC) feature both in predictive accuracy and Matthew's correlation coefficient (MCC). In addition, SVMHL with the CTF obtains the accuracy and MCC ranging from 81% to 98% and 0.82 to 0.98 when predicting the first three EC digits on a low-homologous enzyme dataset. We further demonstrate that our method outperforms the methods which do not take account of hierarchical relationship among enzyme categories and alternative methods which incorporate prior knowledge about inter-class relationships. Conclusions: Our structure-based prediction model, SVMHL with the CTF, reduces the computational complexity and outperforms the alternative approaches in enzyme function prediction. Therefore our new method will be a useful tool for enzyme function prediction community.
DOI10.1186/1752-0509-5-S1-S6
语种英语
资助项目National Natural Science Foundation of China[10801131] ; National Natural Science Foundation of China[10801112] ; National Natural Science Foundation of China[10971223] ; National Natural Science Foundation of China[11071252] ; Ph.D Graduate Start Research Foundation of Xinjiang University[BS080101] ; SRF for ROCS, SEM ; Shanghai Key Laboratory of Intelligent Information Processing[IIPL-2010-008]
WOS研究方向Mathematical & Computational Biology
WOS类目Mathematical & Computational Biology
WOS记录号WOS:000297313700005
出版者BIOMED CENTRAL LTD
引用统计
文献类型期刊论文
条目标识符http://ir.amss.ac.cn/handle/2S8OKBNM/240
专题应用数学研究所
通讯作者Yang, Zhi-Xia
作者单位1.Xinjiang Univ, Coll Math & Syst Sci, Urumuchi 830046, Peoples R China
2.China Agr Univ, Coll Sci, Beijing 100083, Peoples R China
3.Chinese Acad Sci, NW Inst Plateau Biol, Key Lab Adaptat & Evolut Plateau Biota, Xining 810001, Peoples R China
4.Chinese Acad Sci, Acad Math & Syst Sci, Beijing 100190, Peoples R China
推荐引用方式
GB/T 7714
Wang, Yong-Cui,Wang, Yong,Yang, Zhi-Xia,et al. Support vector machine prediction of enzyme function with conjoint triad feature and hierarchical context[J]. BMC SYSTEMS BIOLOGY,2011,5:11.
APA Wang, Yong-Cui,Wang, Yong,Yang, Zhi-Xia,&Deng, Nai-Yang.(2011).Support vector machine prediction of enzyme function with conjoint triad feature and hierarchical context.BMC SYSTEMS BIOLOGY,5,11.
MLA Wang, Yong-Cui,et al."Support vector machine prediction of enzyme function with conjoint triad feature and hierarchical context".BMC SYSTEMS BIOLOGY 5(2011):11.
条目包含的文件
条目无相关文件。
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Wang, Yong-Cui]的文章
[Wang, Yong]的文章
[Yang, Zhi-Xia]的文章
百度学术
百度学术中相似的文章
[Wang, Yong-Cui]的文章
[Wang, Yong]的文章
[Yang, Zhi-Xia]的文章
必应学术
必应学术中相似的文章
[Wang, Yong-Cui]的文章
[Wang, Yong]的文章
[Yang, Zhi-Xia]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。