计算生物学

Vol.4 No.2 (June 2014)

编码和非编码DNA序列的可视化分析
The Visual Analysis of Coding and Non-Coding DNA Sequences

 

作者:

刘玉倩 , 郑智捷 :云南大学软件学院,昆明

 

关键词:

非编码序列图形表示方法概率测量Non-Coding Sequences Graphic Representation Technique Probability Measurements

 

摘要:

DNA序列作为一种复杂的遗传信息,其具体特性不仅体现在编码序列之中,也包含在非编码序列之中。在高等生物体中主要基因成分为非编码序列,在ENCODE计划中,有证据表明,在人类基因中有98%为非编码形式,其中80%具有功能性,所以对编码区和非编码区的研究已经成为一类重要研究热点。本文提供的模型和实验结果,使用图形表示方法对编码区以及非编码区基因的差异进行区分。该模型采用的是对编码区以及非编码区的DNA序列进行分段概率测量,从而对不同的基因特征分布进行比较。

DNA sequences include complex genetic information; their specific characteristics are contained in both the coding and non-coding sequences. Major gene components in higher levels of organisms are composed of non-coding sequences. In ENCODE project, there are evidences that 98% of the human genomes are non-coding forms and 80% of them with functions, so the research on coding region and non-coding region has become an important research hotspot. This paper provides models and experiment results which using visual representation techniques to distinguish differences between coding and non-coding sequences. This model uses probability measurements on the DNA sequences to coding and non-coding regions respectively to distinguish patterns identified from different sequences.


文章引用:

刘玉倩 , 郑智捷 (2014) 编码和非编码DNA序列的可视化分析。 计算生物学, 4, 20-31. doi: 10.12677/HJCB.2014.42003

 

参考文献

分享
Top