# 在非线性函数下的DNA概率测量聚类分布DNA Clustering Distribution Measured with Probability under Nonlinear Function

Abstract: Typical clustering analysis can make similarity data together and show the use of the same or different spatial distribution of fragments presented in the sequence. This paper deals with DAN sequences from different sources using statistical calculations and projection characteristics grouped in three different nonlinear functions of the probability value measurements, getting a visual on the genetic characteristics of the formation of the cluster distribution. Comparison showed that similar stratification had the same trend and complementary characteristics at a higher level, but there are obvious differences between the distributions of different types of gene sequences.

[1] Lieberman-Aiden, E., et al. (2009) Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science, 326, 289-293.

[2] Zheng, J., Zhang, W.Q., Luo, J., et al. (2013) Variant map system to simulate complex properties of DNA interactions using binary sequences. Advances in Pure Mathematics, 3, 5-24.

[3] Eisen, M., Spellman, P., Brown, P., et al. (1998) Parallel human genome analysis: Cluster analysis and display of genome-wide expression patterns. PNAS, 95, 14863-14868.

[4] Tavazoie, S., Hughes, J.D., Campbell, M.J., et al. (1999) System-attic determination of genetic network architecture. Nature Genetics, 22, 281-85.

[5] Yeung, K.Y., Raley, C., Murua, A., et al. (2001) Model-based clustering and data transformations for gene expression data. Bioinformatics, 17, 977-987.

[6] Beyer, O., Hackel, H., Pieper, V. and Tiedge, J. (1980) 概率计算和数学统计. Harri Deutsch出版社.

[7] Chance, B.L. and Rossman, A.J. (2005) Preface. In: Investigating Statistical Concepts, Applications, and Methods, Duxbury Press, New York.

[8] 吴赣昌 (2008) 概率论与数理统计. 中国人民大学出版社, 北京.

[9] Schneier, B. (1995) Chapter 17—Other Stream Ciphers and Real Random-Sequence Generators. In: Applied Cryptography: Protocols, Algorithms, and Source Code in C, 2nd Edition, Wiley, New York.

[10] 张巍琼, 郑智捷 (2012) 基于不同产生机制的伪随机序列和DNA序列的随机性测量. 成都信息工程学院学报, 6,文章编号: 1671.

[12] ftp://ftp.ncbi.nih.gov/genomes/

[13] Chapman, S.J. (2008) MATLAB Programming for Engineers. 2nd Edition, 清华大学出版社, 北京.

[14] Bu, Q.X. and Zheng, J.Z.J. (2013) 2D Conjugate Maps of DNA Sequences. Journal of Information Security, 4, 193196.

Top