You are currently viewing the abstract.
View Full TextLog in to view the full text
AAAS login provides access to Science for AAAS members, and access to other journals in the Science family to users who have purchased individual subscriptions.
Register for free to read this article
As a service to the community, this article is available for free. Existing users log in.
More options
Download and print this article for your personal scholarly, research, and educational use.
Buy a single issue of Science for just $15 USD.
Transcription Factor Binding Sites
Transcription factors (TFs) are proteins that bind to DNA to control gene transcription. Stergachis et al. (p. 1367; see the Perspective by Weatheritt and Babu) examined TF binding within the human genome in more than 80 cell types. Nearly 15% of coding regions simultaneously specify both amino acid sequence and TF recognition sites. The distribution of the TF binding sites evolutionarily constrains how codons within these regions can change, independent of encoded protein function. Thus, TF binding may represent a widespread and strong evolutionary force in coding regions.
Abstract
Genomes contain both a genetic code specifying amino acids and a regulatory code specifying transcription factor (TF) recognition sequences. We used genomic deoxyribonuclease I footprinting to map nucleotide resolution TF occupancy across the human exome in 81 diverse cell types. We found that ~15% of human codons are dual-use codons (“duons”) that simultaneously specify both amino acids and TF recognition sites. Duons are highly conserved and have shaped protein evolution, and TF-imposed constraint appears to be a major driver of codon usage bias. Conversely, the regulatory code has been selectively depleted of TFs that recognize stop codons. More than 17% of single-nucleotide variants within duons directly alter TF binding. Pervasive dual encoding of amino acid and regulatory information appears to be a fundamental feature of genome evolution.