Recognition of Subcellular Localization of Proteins Using Sequence Fusion

Author: Jia Yun (贾芸)

Keywords: Subcellular Localization; F-Value; Quadratic Discriminant Analysis

Abstract: Functional annotation of unknown proteins is a major goal of proteomics, and a key annotation is the prediction of a protein's subcellular localization. We applied a sequence-based method, Increment of Diversity with Quadratic Discriminant analysis (IDQD), to predict the subcellular localization of proteins. In 5-fold cross-validation, the method reached accuracies of 87.4 (±0.5)% for the four plant localization categories and 91.2 (±0.2)% for the three non-plant categories. These results are better than those of comparable existing methods.
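The increment of diversity named in the abstract can be illustrated with a minimal sketch. It uses the standard diversity measure D(X) = N log N - Σ n_i log n_i and the increment ID(X, Y) = D(X + Y) - D(X) - D(Y), which is small when two count vectors have similar profiles. The use of plain 20-letter amino acid composition as the feature vector, and the function names below, are assumptions made for illustration only; the abstract does not state which sequence features the author used.

```python
import math
from collections import Counter

# The 20 standard amino acids, in a fixed order (illustrative feature space).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def diversity(counts):
    """Diversity measure D(X) = N log N - sum(n_i log n_i); zero counts add 0."""
    total = sum(counts)
    if total == 0:
        return 0.0
    return total * math.log(total) - sum(n * math.log(n) for n in counts if n > 0)


def increment_of_diversity(x, y):
    """ID(X, Y) = D(X + Y) - D(X) - D(Y) for two count vectors of equal length."""
    merged = [a + b for a, b in zip(x, y)]
    return diversity(merged) - diversity(x) - diversity(y)


def composition(seq):
    """Amino acid counts of a sequence (hypothetical helper for this sketch)."""
    counts = Counter(seq)
    return [counts.get(a, 0) for a in AMINO_ACIDS]


# Toy sequences, not real proteins.
query = composition("MKKLLAAVLLA")
similar = composition("MKLLKAAVLAL")     # same composition as the query
dissimilar = composition("DDEEDDEEGGS")  # shares no residue type with the query

id_similar = increment_of_diversity(query, similar)
id_dissimilar = increment_of_diversity(query, dissimilar)
```

Here `id_similar` is zero up to rounding, because the two compositions are identical, while `id_dissimilar` equals 22·ln 2, the value for two disjoint 11-residue profiles. In an IDQD-style pipeline, one such increment per localization class would typically serve as an input feature to the quadratic discriminant step.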

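Cite this article: Jia Yun (2011) Recognition of Subcellular Localization of Proteins Using Sequence Fusion. Hans Journal of Computational Biology (计算生物学), 1, 1-3. doi: 10.12677/hjcb.2011.11001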

