KMS Of Academy of mathematics and systems sciences, CAS
A measure of discrepancy of multiple sequences | |
Fang, WW; Roberts, FS; Ma, ZR | |
2001-09-01 | |
发表期刊 | INFORMATION SCIENCES |
ISSN | 0020-0255 |
卷号 | 137期号:1-4页码:75-102 |
摘要 | Multiple sequence comparison is a basic problem for molecular biology and other sciences. In this paper, we introduce the concept of complete information set and some measurement principles for measuring discrepancy among multiple sequences. Based on them, we present a new measurement method satisfying the principles for comparing multiple sequences. We illustrate that this method can effectively distinguish different random sequences or DNA sequences of length 8000 by comparisons of 6-8 symbol (base) strings or protein sequences of length 8000 by comparisons of 3-4 symbol (amino acid) strings. It can also measure slight changes of a sequence, e.g., insertion or deletion of a symbol (a base or an amino acid) in a sequence. It is applied in the study of molecular evolution, and the elementary result shows a hierarchic relationship among the cytochrome C protein sequences of different species, much as that in taxonomy. (C) 2001 Elsevier Science Inc. All rights reserved. |
关键词 | multiple sequence comparison entropy DNA information discrepancy |
语种 | 英语 |
WOS研究方向 | Computer Science |
WOS类目 | Computer Science, Information Systems |
WOS记录号 | WOS:000170199000006 |
出版者 | ELSEVIER SCIENCE INC |
引用统计 | |
文献类型 | 期刊论文 |
条目标识符 | http://ir.amss.ac.cn/handle/2S8OKBNM/15974 |
专题 | 中国科学院数学与系统科学研究院 |
通讯作者 | Fang, WW |
作者单位 | 1.Chinese Acad Sci, Inst Appl Math, Acad Math & Syst Sci, Beijing 100080, Peoples R China 2.Rutgers State Univ, Ctr Discrete Math, Piscataway, NJ 08855 USA 3.Rutgers State Univ, Theoret Comp Sci Ctr, DIMACS, Piscataway, NJ 08855 USA 4.Rutgers State Univ, Waksman Inst Microbiol, Piscataway, NJ 08855 USA |
推荐引用方式 GB/T 7714 | Fang, WW,Roberts, FS,Ma, ZR. A measure of discrepancy of multiple sequences[J]. INFORMATION SCIENCES,2001,137(1-4):75-102. |
APA | Fang, WW,Roberts, FS,&Ma, ZR.(2001).A measure of discrepancy of multiple sequences.INFORMATION SCIENCES,137(1-4),75-102. |
MLA | Fang, WW,et al."A measure of discrepancy of multiple sequences".INFORMATION SCIENCES 137.1-4(2001):75-102. |
条目包含的文件 | 条目无相关文件。 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论