DNA序列互补匹配特征分析及可视化系统
The Analysis and Visualization System of the Complementary and Matching Features of DNA Sequences

作者: 刘文嘉 :云南大学软件学院,云南 昆明; 郑智捷 :云南大学软件学院信息安全重点实验室,云南 昆明;

关键词: DNA空间结构长距离匹配子串–互补串匹配可视化DNA Spatial Structure Long Distance Matching Substring Complementary-String Matching Visualization

摘要:
由于DNA序列双螺旋结构的互补对称性质以及空间上的复杂结构,探索长距离DNA片段的匹配特征尤其是在互补关系上有着重要的意义。本文利用子串–互补串匹配技术通过对DNA序列已有结构的检测分析着重对回文序列、发夹结构等可能存在的结构进行预测。通过统计测量的方法对DNA数据进行处理,对结果进行对比分析,将测量数据转换为可视图,对批量复杂DNA序列的提取特征数据进行可视化分析。通过结果图示,可以看到选择的DNA序列中的确存在着长距离匹配结构。文中给出的可视化方法,提出的分析测量模型以及提取的测量特征可视化机制能为后续不同DNA序列数据以及结构的可视化分析的应用研究提供坚实的模型和实践基础。

Abstract: Due to the complementary symmetry properties of the double helix of DNA sequence and complicated space structure, exploring long-range DNA pieces matching characteristics, especially on the complementary relationship has important significance. In this paper, the substring-complementary string matching technique is used to predict the possible structure of hairpin structure and detect palindromic structure in DNA sequences. With using statistical measurement method to processing DNA, and comparing with the results of analysis, visualization of measured-date, large amounts of complex DNA sequences can be analyzed speedy. Through the results, the matching structure in selective DNA sequence does exists in long distance. Visualization methods, the analysis of the characteristics of the measuring model and extract visual mechanism given in the paper can provide a model and practice foundation for different DNA sequence data, and the application of visualization analysis of structure research.

文章引用: 刘文嘉 , 郑智捷 (2015) DNA序列互补匹配特征分析及可视化系统。 计算生物学, 5, 49-57. doi: 10.12677/HJCB.2015.54006

参考文献

[1] Cantor, C.R. and Lim, H.A. (1991) The First International Conference on Electrophoresis, Supercomputing, and the Human Genome: Proceedings of the April 10-13 Conference at Florida State University, Tallahassee, Florida. International Conference on Electrophoresis, Supercomputing, and the Human Genome, World Scientific.

[2] 骆嘉伟, 刘芳, 杨华. 基于信息离散度的DNA序列相似性分析[J]. 计算机应用, 2009, 29(1): 269-272.

[3] 白凤兰. DNA 序列的特征数值及相似性分析[J]. 数学的实践与认识, 2007, 37(18): 95-99.

[4] 张巍琼, 郑智捷. 基于不同产生机制的伪随机序列和DNA序列的随机性测量[J]. 成都信息工程学院学报, 2012(6), 548-555.

[5] 刘玉倩, 郑智捷. 编码和非编码 DNA 序列的可视化分析[J]. 计算生物学, 2014, 4(2): 20-31.

[6] 完竹, 郑智捷. DNA 序列一维分段测量分布可视化[J]. 云南大学学报(自然科学版), 2013(35): 1-6.

[7] 杜磊, 郑智捷. 在非线性函数下的DNA概率测量聚类分布[J]. 软件工程与应用, 2014, 3(3): 41-49.

[8] 敖丽敏, 罗存金. 基于神经网络集成的 DNA 序列分类方法研究[J]. 计算机仿真, 2012, 29(6): 171-175.

分享
Top