Articles

Chance and statistical significance in protein and DNA sequence analysis

See allHide authors and affiliations

Science  03 Jul 1992:
Vol. 257, Issue 5066, pp. 39-49
DOI: 10.1126/science.1621093

Abstract

Statistical approaches help in the determination of significant configurations in protein and nucleic acid sequence data. Three recent statistical methods are discussed: (i) score-based sequence analysis that provides a means for characterizing anomalies in local sequence text and for evaluating sequence comparisons; (ii) quantile distributions of amino acid usage that reveal general compositional biases in proteins and evolutionary relations; and (iii) r-scan statistics that can be applied to the analysis of spacings of sequence markers.