Computational Biology

Decoding Noncoding Regions

See allHide authors and affiliations

Science  01 Sep 2000:
Vol. 289, Issue 5484, pp. 1433-1435
DOI: 10.1126/science.289.5484.1433e

As the genomes of complex organisms are deciphered, an increasing amount of DNA sequence becomes available, of which only a small fraction (about 3% of human chromosome 22) encodes proteins. Knowledge of the genetic code makes it feasible to recognize the amino acid-coding parts of genes and to perform sophisticated comparisons across species and individuals.

Bussemaker et al. are tackling the problem of recognizing gene regulatory sequences, making use of recent data collected from microarray studies. Their algorithm, MobyDick, reveals common motifs in the form of words from the unbroken string of As, Ts, Cs, and Gs; many of the words in their dictionary correspond to those associated with genes whose expression changes during either sporulation or general repression in yeast. — GJC

Proc. Natl. Acad. Sci. U.S.A., in press.

Navigate This Article