A measure of discrepancy of multiple sequences
Fang, WW; Roberts, FS; Ma, ZR
2001-09-01
Source Publication: INFORMATION SCIENCES
ISSN: 0020-0255
Volume: 137; Issue: 1-4; Pages: 75-102
Abstract: Multiple sequence comparison is a basic problem in molecular biology and other sciences. In this paper, we introduce the concept of a complete information set and some measurement principles for measuring discrepancy among multiple sequences. Based on these, we present a new measurement method that satisfies the principles for comparing multiple sequences. We illustrate that this method can effectively distinguish different random sequences or DNA sequences of length 8000 by comparing strings of 6-8 symbols (bases), and protein sequences of length 8000 by comparing strings of 3-4 symbols (amino acids). It can also measure slight changes in a sequence, e.g., the insertion or deletion of a single symbol (a base or an amino acid). The method is applied to the study of molecular evolution, and a preliminary result shows a hierarchical relationship among the cytochrome C protein sequences of different species, much like that in taxonomy. © 2001 Elsevier Science Inc. All rights reserved.
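The abstract does not give the formula for the discrepancy measure, so the following is only an illustrative sketch of the general idea it describes: comparing several sequences through the frequencies of their short substrings using an entropy-based quantity. It substitutes a well-known stand-in, the generalized Jensen-Shannon divergence over k-symbol substring frequencies, which is an assumption and not the measure defined in the paper. The function names `kmer_distribution`, `entropy`, and `js_discrepancy` are hypothetical.

```python
from collections import Counter
from math import log2


def kmer_distribution(seq, k):
    """Relative frequencies of all overlapping length-k substrings of seq."""
    total = len(seq) - k + 1
    if total <= 0:
        raise ValueError("sequence is shorter than k")
    counts = Counter(seq[i:i + k] for i in range(total))
    return {kmer: n / total for kmer, n in counts.items()}


def entropy(dist):
    """Shannon entropy (in bits) of a frequency distribution."""
    return -sum(p * log2(p) for p in dist.values() if p > 0)


def js_discrepancy(seqs, k):
    """Generalized Jensen-Shannon divergence among the k-mer distributions
    of several sequences, with equal weights.

    NOTE: a stand-in for illustration only, not the paper's measure.
    The value is 0 when all distributions coincide and reaches
    log2(len(seqs)) when no k-mer is shared between any two sequences.
    """
    dists = [kmer_distribution(s, k) for s in seqs]
    m = len(dists)
    # Equal-weight mixture of the individual distributions.
    mixture = Counter()
    for d in dists:
        for kmer, p in d.items():
            mixture[kmer] += p / m
    # Entropy of the mixture minus the mean entropy of the parts.
    return entropy(mixture) - sum(entropy(d) for d in dists) / m


if __name__ == "__main__":
    original = "ACGTACGTAC"
    with_deletion = "ACGTCGTAC"  # one base removed from `original`
    print(js_discrepancy([original, original], 2))
    print(js_discrepancy([original, with_deletion], 2))
```

Under this stand-in, identical sequences give a discrepancy of zero and a single deletion gives a small positive value, which mirrors the sensitivity to slight changes that the abstract reports for the authors' own measure.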
Keywords: multiple sequence comparison; entropy; DNA; information discrepancy
Language: English
WOS Research Area: Computer Science
WOS Subject: Computer Science, Information Systems
WOS ID: WOS:000170199000006
Publisher: ELSEVIER SCIENCE INC
Document Type: Journal article
Identifier: http://ir.amss.ac.cn/handle/2S8OKBNM/15974
Collection: Academy of Mathematics and Systems Science, Chinese Academy of Sciences
Corresponding Author: Fang, WW
Affiliation:
1. Chinese Acad Sci, Inst Appl Math, Acad Math & Syst Sci, Beijing 100080, Peoples R China
2. Rutgers State Univ, Ctr Discrete Math, Piscataway, NJ 08855 USA
3. Rutgers State Univ, Theoret Comp Sci Ctr, DIMACS, Piscataway, NJ 08855 USA
4. Rutgers State Univ, Waksman Inst Microbiol, Piscataway, NJ 08855 USA
Recommended Citation
GB/T 7714: Fang, WW, Roberts, FS, Ma, ZR. A measure of discrepancy of multiple sequences[J]. INFORMATION SCIENCES, 2001, 137(1-4): 75-102.
APA: Fang, WW, Roberts, FS, & Ma, ZR. (2001). A measure of discrepancy of multiple sequences. INFORMATION SCIENCES, 137(1-4), 75-102.
MLA: Fang, WW, et al. "A measure of discrepancy of multiple sequences". INFORMATION SCIENCES 137.1-4 (2001): 75-102.
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.