Population Biology, Evolution, and Infectious Disease: Convergence and Synthesis

See allHide authors and affiliations

Science  05 Feb 1999:
Vol. 283, Issue 5403, pp. 806-809
DOI: 10.1126/science.283.5403.806


Traditionally, the interest of population and evolutionary biologists in infectious diseases has been almost exclusively in their role as agents of natural selection in higher organisms. Recently, this interest has expanded to include the genetic structure and evolution of microparasite populations, the mechanisms of pathogenesis and the immune response, and the population biology, ecology, and evolutionary consequences of medical and public health interventions. This article describes recent work in these areas, emphasizing the ways in which quantitative, population-biological approaches have been contributing to the understanding of infectious disease and the design and evaluation of interventions for their treatment and prevention.

Ever since Darwin, population and evolutionary biologists have been interested in infectious disease (Fig. 1). For most of the history of this enterprise, however, infectious diseases have been seen as primarily agents of natural selection in “higher” organisms, rather than as subjects of study in their own right. In the last two decades, this situation has changed. An increasing number of population and evolutionary biologists have turned to infectious diseases and the microorganisms responsible as the subjects of their research. In many areas, the infectious disease problems being addressed by population and evolutionary biologists have converged with (or complement) those studied by microbiologists, immunologists, epidemiologists, and clinicians. Here, we consider three of these areas: molecular epidemiology, the mechanisms of pathogenesis, and intervention. In place of a literature review of these areas, we offer a critical perspective that is less comprehensive but more personal.

Figure 1

This quotation from Darwin's Variation in Animals and Plants Under Domestication illustrates not only how well developed the contagion school was before Pasteur and Koch's demonstration of bacteria as the etiologic agents of disease, but also how infectious disease has played a secondary role in evolutionary biology. Here Darwin was using germs to bolster his argument that the “gemmules” he postulates in his “provisional hypothesis of pangenesis” as the carriers of the information of inheritance and development could be small enough to fit in the sperm of higher organisms. Darwin's other references to infectious disease (and vices) in The Descent of Man are as agents of natural selection in higher organisms, the role they have played for most of the history of population and evolutionary biology.

Molecular Epidemiology

As measured by the number of practitioners and international meetings (1), the most developed area of this convergence is molecular epidemiology. The practical goals of molecular epidemiology are to identify the microparasites (viruses, bacteria, fungi, and protozoa) responsible for infectious diseases and determine their physical sources, their biological (phylogenetic) relationships, and their routes of transmission and those of the genes (and accessory elements) responsible for their virulence, vaccine-relevant antigens, and drug resistance.

Early in the development of the “germ theory,” it became clear that there was considerable variation in the incidence and severity of infections with microbes classified as members of the same “species.” As a means of understanding the microbiological basis of this variation and classifying the organisms within species more finely, a variety of typing schemes were developed using serological and other phenotypic markers. The widespread application of these methods led to the observation that specific arrays of disease symptoms could be attributed to particular serotypes or phage types within species such as Escherichia coli and Salmonella.This discovery supported the notion that bacterial lineages maintain their genetic integrity over long time intervals and over great distances—that is, their genomes are not rapidly broken down or reshuffled by recurrent mutation and recombination. This view of the genetic structure of bacterial populations became known as the “clone concept” (2).

As the data accumulated, however, it became clear that bacterial populations and species are not merely arrays of genetically separate lineages. The degree of clonality varies among different species: Some, like E. coli and Salmonella, are highly clonal, whereas the populations of other species like Neisseria gonorrhoeae and Bacillus subtilis are effectively panmictic (3). Moreover, it also became apparent that even the most clonal bacteria, like E. coli, are chimeras bearing chromosomal genes (4) and portions of genes of different ancestries (5).

Among viruses, extensive attention has been devoted to the molecular epidemiology of the human immunodeficiency virus (HIV). These studies have revealed identifiable lineages of recent common ancestry, but they have also shown that some clinical isolates are chimeras of two or more ancestral lines (6). Clonal population structures have also been suggested for eukaryotic microparasites, likeTrypanosoma cruzii (7), in which it is unknown whether recombination occurs during the life cycle. Most surprisingly, some studies have indicated a degree of clonal population structure (with important caveats) for eukaryotes for which recombination occurs as part of the reproductive cycle, such as the malarial parasitePlasmodium falciparum (8). Thus, in contrast to the naı̈ve expectation, it appears that some organisms for which recombination is not a normal part of the life cycle are nearly panmictic, whereas some microparasites with well-characterized mechanisms for recombination maintain a nearly clonal population genetic structure.

A perusal of the literature on the genetic epidemiology of microparasites would yield the impression that the clone concept (the “clone controversy,” as some prefer to see it) is central to the enterprise. Is it? From a practical perspective, understanding the genetic structure of microparasite populations is important for two main reasons. First, for epidemiological and forensic investigations, a clonal population structure makes it possible to trace the sources of different isolates of a pathogen. However, microparasites that on larger spatial and temporal scales are effectively panmictic have clonal population structures when isolated from outbreaks (3). Consequently, a detailed knowledge of the overall genetic structure of a pathogen population may be unnecessary for short-term epidemiological tracking. Rather, the challenge is to use typing characters that change at rates that are informative for the particular question at hand (9). Thus, a relatively fast-changing genetic marker, the restriction pattern of insertion sequences, has been useful for tracking the transmission of tuberculosis in San Francisco (10) but would be unhelpful for looking at trends in the global population of Mycobacterium tuberculosis over many decades. Similarly, broad categories such as serogroup and serotype are useful in global surveys of disease patterns for some bacterial infections but might provide inadequate discrimination for investigating local outbreaks.

Second, knowledge of the genetic structure is important to understand (and, ideally, predict) the responses of pathogen populations to selective pressures imposed by host immunity, both natural and vaccine-induced, and is important for the effective management of antimicrobial drugs. These areas are discussed further below.

Within-Host Population Dynamics of Pathogen Proliferation

If the course of a microparasite infection in a vertebrate host were described without jargon, the process would be readily recognized as one of population dynamics and evolution. A microbe proliferates in a habitat (host tissue), in which it is normally limited by physical and chemical barriers or by predators and competitors (phagocytic and other cells of the host immune system, and other microorganisms). These microbial invaders often have phenotypes that enhance their survival in this novel habitat (virulence factors, including the ability to attach to and invade cells and to resist the host's constitutive chemical and cellular defenses). Evolution occurs in both the parasite population and the host's immune system. Parasite-mediated selection favors the proliferation of predators (such as B and T cells) that specifically target the microparasite. Selection in the microparasite population favors mutants (antigenic variants) that are able to evade these targeted predators or avoid predation and competition by invading different microhabitats (cells and tissues) where the (immunological) predation pressure is less intense. Either the microparasite population will be controlled or cleared by the response of the cell community (host), with only modest disturbance (morbidity), or the defenses will fail and the cell community will be destroyed (mortality). Figure 2 is a schematic diagram of these interactions.

Figure 2

Schematic illustration of the population dynamics of a virus infection. The replication of a virus in an infected host can be likened to a natural ecosystem with resources, predators, and prey. By infecting a cell the virus preys on its resource, the susceptible target cells, while itself being subjected to predation by the host's immune response. Classical virological and immunological research is mostly concerned with uncovering the direct interactions between the virus and different cell types of its host. In contrast, population-dynamical research aims to disentangle the complicated and often counterintuitive dynamical behavior resulting from the web of nonlinear interactions.

This perspective on pathogenesis and the immune response as ecological, population-dynamical, and evolutionary processes has been well recognized for some time (11). However, it has had little impact on contemporary research on the mechanisms of pathogenesis. Much of this research is qualitative rather than quantitative, and it can be described as a quest to characterize (genetically, biochemically, and physiologically) the interaction between infectious pathogens and the host's immune defenses. Although this research provides an indispensable basis for understanding pathogenesis and the host's response to infection, it tells only a part of the story. A complete account of the course of an infectious disease must include a quantitative description of the major forces that determine the abundance, diversity, and distribution of a pathogen population within an infected host and the immune defenses involved in its control.

During the past decade, population and evolutionary biologists have begun to develop a quantitative theory of the within-host population dynamics of infections (12). Among the most fruitful of these investigations are those that have involved direct collaborations between population biologists, immunologists, and virologists working on HIV. Simple mathematical models describing the nonlinear interaction between populations of HIV and of CD4+ T cells were instrumental in uncovering the rapid viral replication that underlies an apparently slow disease. Mathematical analysis of clinical data showed that more than 1010 virus particles are produced and cleared every day and that about one-third of the actively virus-producing cells are replaced every day (13). Before these analyses, the low viral load during the asymptomatic phase of the infection was commonly attributed to slow virus replication. The population-dynamical analysis, however, showed that the low virus load reflects a dynamic equilibrium between very high rates of production and clearance.

Analogous studies of the dynamics of hepatitis B and C virus and simian immunodeficiency virus (SIV) are now beginning to provide a comparative perspective (14). In addition to contributing to our understanding of viral pathogenesis, the population-dynamical approach led to predictions about the minimum duration of treatment for the elimination of a viral infection, provided the rationale for the current recommendation to treat HIV early during the asymptomatic phase, and helped explain the differences in the rates at which drug resistance appears in the treatment of different infections (13,14).

The Ecology of the Immune Response

An “ecological” perspective may also be helpful in studying the dynamic nature of the immune response to infectious pathogens in vivo. For example, a population-dynamical analysis of cross-sectional data of T cell counts in humans after radiotherapy provided the first estimates for the proliferation, death, and interconversion rates of naı̈ve and memory T cells in vivo (15). A more direct estimate of these rates in macaques was recently obtained by collecting longitudinal data for the uptake and washout of a marker that labels dividing cells (16). Mathematical modeling and analysis not only enabled the estimation of proliferation and death rates of CD4+ and CD8+ T cells in normal macaques, but also enabled the quantification of how these rates change in the presence of SIV infection.

Quantifying the dynamics of immune responses not only provides valuable information for the study of infectious disease but also sheds light on many central issues in immunology and identifies critical questions for further experimental analysis. The macaque study, for example, suggests that the death rate exceeds the proliferation rate of T cells in the blood (16). If so, what source makes up for the difference? Could immigration of T cells from the thymus play numerically a more important role than commonly believed? What regulates the cell populations in the periphery? Many experimental and theoretical studies have addressed these questions separately, but only a few combine both approaches (17). However, it is clear that a combination of both is necessary to develop a comprehensive quantitative understanding of the immune response to infectious pathogens.


Population-biological models have already played a major role in the design and targeting of vaccination programs. They have been used, for example, to calculate the critical fraction of a population that must be vaccinated to eliminate a particular pathogen; to design clinical trials of vaccines; to target vaccination programs (or other interventions) to achieve maximal reductions in disease; to anticipate possible “perverse” effects of vaccination (for example, increases in congenital rubella syndrome as vaccination increases the average age of rubella infection); and to predict potential outbreaks of a disease in time to take preventive measures (18,19). These successful, practical applications rest on more basic insights provided by the models into the mechanisms underlying observed spatial and temporal patterns in disease incidence (19, 20). In both basic and applied settings, the success of these models stems from their extensive use of epidemiological data to adapt the models to individual diseases and to estimate key parameters of disease transmission.

The successful vaccines of the past were directed at organisms with little antigenic variation, whereas many of the current efforts at vaccine development target microparasites with considerable antigenic diversity, such as HIV, Plasmodium spp., andStreptococcus pneumoniae. This diversity raises a number of new population-biological and evolutionary questions that are beginning to be addressed. How will populations of antigenically diverse microparasites evolve in response to mass vaccination, which may only target a subset of the antigenic variants, and how will this evolution affect the vaccine's benefits in reducing total disease (21,22)? How must calculations of key epidemiological parameters like the basic reproductive rate of a microparasite be changed when considering antigenically diverse organisms, and how can clinical trials be designed to provide maximal information about the evolutionary effects of the vaccine (22, 23)?

Antimicrobial Resistance

Resistance to antimicrobial drugs is now a serious clinical problem in a wide range of infections (24, 25). New drugs are unlikely to appear soon enough and in sufficient numbers to solve many of these resistance problems. Hence, there is a growing need to understand the factors that lead to the evolution of the spread of resistance, and to design strategies to maximize the effectiveness of existing drugs while minimizing the spread of resistance to them.

The prevalence of resistance of a particular organism to a given drug shows a strong positive correlation with the extent of use of the drug (25, 26). However, the pattern becomes more complex in the broader context of the patterns of antimicrobial use and resistance. For some drugs used to treat some infections, resistance has been an obvious problem since treatment was first introduced (27); in other cases, resistance has emerged relatively recently (or not at all), despite several decades of use (28). A variety of pharmacological, genetic, ecological, and social factors are responsible for these varying patterns of resistance. Mathematical models of the population dynamics of sensitive and resistant organisms are beginning to provide explanations for these patterns of resistance, as well as testable predictions for the impact of changes in antimicrobial use on the extent of resistance in individuals and populations (29–36).

These models predict that the frequency of drug resistance and the rate of change of that frequency depend primarily on a few measurable properties of the pathogen and the drug in question, the most important of which are (i) the duration of infectiousness of infected individuals, (ii) the incidence of drug treatment, (iii) the extent to which treatment of a sensitive infection reduces the transmission of that infection, (iv) the degree to which resistance reduces the competitive fitness of the microparasite in the absence of treatment, and (v) the probability that a drug-sensitive infection becomes resistant upon treatment (29–34).

The effects of these factors are apparent in the contrasting predictions of two recent mathematical models of antiviral resistance. A model of acyclovir treatment of genital herpes (32) predicted that a sustained increase in antiviral treatment would probably result in a very slow increase in resistance, taking several decades to increase to 5 to 10%. The reasons for this slow increase include the long infectious period of genital herpes, the substantial fitness burden imposed on resistant viruses in the absence of treatment, and the fact that treatment of an individual with a drug-sensitive infection rarely causes a permanent switch to drug resistance. By contrast, a model of influenza A infections predicted that resistance would rapidly reach substantial frequencies, in the tens of percents, in a community where anti-influenza drugs were used to treat large numbers of infected individuals. The reasons for this rapid rise include the short duration of infectiousness and the fact that treatment of drug-sensitive cases commonly results in “acquired” resistance in the treated patients as well as the transmission of resistant viruses (33).

A key question now is how drug use can be altered to reduce the spread of resistance while maintaining the quality of treatment for individual patients. Several studies, both empirical and theoretical, have addressed the question of how the extent of resistance will change after reductions in the extent of antimicrobial use (29,35). At present, there is still a considerable distance between the theory and empirical work, largely because epidemiological studies rarely measure most of the basic parameters of the models, as listed above. Nonetheless, models have already produced results that should be useful, or at least provocative, to empirical researchers and public health practitioners. For instance, models suggest that cycling of antibiotics, which is often recommended as a tool for reducing resistance, may in fact speed the ascent of resistance in the population (34). The model of antiviral treatment of influenza A described above (33) suggests that the use of anti-influenza drugs for prophylaxis, rather than treatment, would prolong the efficacy of these drugs considerably.

Finally, population-biological models have been used to study the evolutionary response of bacterial and viral populations within a treated patient (36). These models help to define the relative importance of noncompliance with drug regimens, heterogeneity in drug concentrations, pathogen population size and diversity, and other factors that are implicated in the emergence of drug resistance in a treated individual. The models offer quantitative support for the long-standing idea that, to overcome the problems of acquired resistance, as Paul Ehrlich said long ago, it is best to “hit hard and hit early,” and to combine drugs (37). Moreover, if integrated into realistic models of within-host dynamics of infections and their control by host defenses, these models may aid in the design of treatment protocols that maximize the efficacy of treatment while minimizing the side effects to the treated host and the dissemination of drug resistance into the community. Particularly in the current, early stages of these modeling efforts, carefully designed trials are necessary to test the effectiveness of policies suggested by the models.

Conclusions and Outlook

In this article, we have focused on the ways in which population and evolutionary biology have been (and can be) contributing to microbiology and medicine. We have not considered the fascinating opportunities that microbiology offers to study evolution and population biology in real time and experimentally (38). Furthermore, we have omitted two topics that actually belong in the broader context discussed here: emerging infectious diseases and the evolution of virulence. There are two reasons for this omission. First, these areas have already been extensively reviewed (39–41). Second, and more important, it is our conviction that the research questions and approaches we have described are fundamental to making real progress in understanding the factors responsible for the emergence or reemergence of old and new diseases and the evolution and maintenance of virulence (42, 43).

The work described above represents some of the early steps toward a convergence of population biology and infectious disease. The examples discussed here have shown that this convergence has led to new fundamental insights, but they also highlight the need to further intensify the collaborative efforts of infectious disease biologists and clinicians with population biologists to advance our quantitative understanding of infectious diseases and to apply this knowledge to their control and treatment.


View Abstract

Navigate This Article