TOOLS: A Human Gene Master List

Science  18 Mar 2005:
Vol. 307, Issue 5716, pp. 1701
DOI: 10.1126/science.307.5716.1701c

If you search several genome databases for information about a particular human gene, the results won't always match. That's because the various sites apply different criteria to pinpoint genes and often marshal different evidence to infer their functions. To straighten out these discrepancies, genome mavens have crafted a master catalog of nearly 15,000 of our genes that almost certainly code for proteins. The Consensus CoDing Sequence project involved organizations such as the National Center for Biotechnology Information, the University of California, Santa Cruz, and the European Bioinformatics Institute and entailed comparing the latest gene rosters compiled by researchers and by computers. Experts weeded out problem sequences such as pseudogenes, which lack a corresponding protein. Recent estimates suggest that humans might carry up to 10,000 more genes, but many of these didn't make the cut because of insufficient evidence.

www.ncbi.nlm.nih.gov/projects/CCDS

Related Content

Navigate This Article