Chinese Agricultural Science Bulletin (中国农学通报) ›› 2010, Vol. 26 ›› Issue (17): 51-53.

Topic: Biotechnology

• Biotechnology Science •

The Statistical Characteristics of Tobacco Mild Green Mosaic Virus Complete Genome

杨硕 李建学

  • Received: 2010-02-14 Revised: 2010-04-23 Online: 2010-09-05 Published: 2010-09-05

Abstract:

The aim was to extract statistical features of complete tobacco mild green mosaic virus (TMGMV) genomes and to perform cluster analysis on them. For each complete TMGMV genome sequence, every base together with the two bases that follow it was taken as a three-base group, and these groups were arranged into a new sequence S. The probability of occurrence in S of each of the 64 possible three-base groups was calculated, giving a 64-dimensional vector L. Comparing the L vectors of the genomes identified 6 three-base groups whose probabilities differ markedly. Conclusion: the occurrence probabilities of 6 three-base groups (AAG, AGA, TGA, GAC, GAG, GTT) are closely associated with the genetic variation of the TMGMV genome; the 4 complete TMGMV genomes from different sources fall into 2 major classes according to their genetic variation.

Keywords: maize, high nitrogen efficiency, light, nitrogen, nitrate reductase, glutamine synthetase
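
The counting step described in the abstract (overlapping three-base groups, 64 probabilities per genome) can be illustrated with a minimal Python sketch. This is not the authors' code. The function names `triplet_probabilities` and `most_variable_triplets` are hypothetical, as are the lexicographic ordering of the 64 groups and the handling of ambiguous bases. The abstract does not say how "markedly different" probabilities were identified, so ranking by the range (maximum minus minimum) across genomes is purely an assumption, and the clustering step is not reproduced.

```python
from itertools import product

BASES = "ACGT"
# All 64 three-base groups in a fixed (assumed) lexicographic order.
TRIPLETS = ["".join(p) for p in product(BASES, repeat=3)]


def triplet_probabilities(genome: str) -> list[float]:
    """Return a 64-dimensional vector of overlapping three-base group frequencies.

    Each base and the two bases following it form one group, so a sequence
    of length n yields n - 2 groups (the sequence S in the abstract).
    """
    # Assumption: RNA-style sequences are mapped to the DNA alphabet.
    seq = genome.upper().replace("U", "T")
    counts = dict.fromkeys(TRIPLETS, 0)
    total = 0
    for i in range(len(seq) - 2):
        group = seq[i:i + 3]
        # Assumption: windows containing ambiguous bases (e.g. N) are skipped.
        if group in counts:
            counts[group] += 1
            total += 1
    if total == 0:
        raise ValueError("sequence contains no valid three-base groups")
    return [counts[t] / total for t in TRIPLETS]


def most_variable_triplets(vectors: list[list[float]], k: int = 6) -> list[str]:
    """Rank groups by the range of their probability across genomes.

    The range criterion is an illustrative assumption, not the paper's method.
    """
    spreads = {
        t: max(v[i] for v in vectors) - min(v[i] for v in vectors)
        for i, t in enumerate(TRIPLETS)
    }
    return sorted(TRIPLETS, key=lambda t: spreads[t], reverse=True)[:k]
```

With real data, each of the four complete genome sequences would be passed to `triplet_probabilities`, and the resulting vectors compared or handed to a clustering routine of the reader's choice.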