A Distinctive Class of Integron in the Vibrio cholerae Genome

See allHide authors and affiliations

Science  24 Apr 1998:
Vol. 280, Issue 5363, pp. 605-608
DOI: 10.1126/science.280.5363.605


The ability of bacteria to acquire and disseminate heterologous genes has been a major factor in the development of multiple drug resistance. A gene, intI4, was identified that encodes a previously unknown integrase that is associated with a “gene-VCR” organization (VCRs are Vibrio cholerae repeated sequences), similar to that of the well-characterized antibiotic resistance integrons. The similarity was confirmed by IntI1-mediated recombination of a gene-VCR cassette into a class 1 integron. VCR cassettes are found in a number of Vibrio species including a strain of V. metschnikovii isolated in 1888, suggesting that this mechanism of heterologous gene acquisition predated the antibiotic era.

Integrons are gene expression elements that acquire open reading frames (gene cassettes) and convert them to functional genes. More than 40 different antibiotic resistance cassettes have been characterized in these structures, permitting their bacterial hosts to become resistant to a broad spectrum of antimicrobial compounds (1). The insertion of a gene cassette takes place by site-specific recombination between the circularized cassette and the recipient integron; the essential components are an integrase gene (intI) and a linked attachment site (attI) required for the efficient site-specific integration of the gene cassettes into the integron structure (2). To date, three classes of integron have been defined on the basis of the sequence of their associated integrase (1); the integrases have 43 to 58% amino acid sequence identity and are related to the integrases of temperate bacteriophages. The similarity between the three integrases suggests that their evolutionary divergence extended beyond the half century of the antibiotic era, and probably much longer according to the substitution rates calculated by Ochman and Wilson (3).

All integron-inserted cassettes identified share specific structural characteristics (Fig. 1A); the boundaries of each integrated cassette are defined by two GTTRRRY (core-site) (4) sequences in the same orientation, which are the targets of the recombination process. The integrated cassettes include a gene and an imperfect inverted repeat located at the 3′ end of the gene called a 59-base element, a diverse family of sequences that function as recognition sites for the site-specific integrase. The organization of these elements starts with an inverse core-site, which is complementary to the core-site located upstream from the gene cassette, followed by a sequence of imperfect dyad symmetry, and ending with the downstream core site sequence (Fig. 1). The 59-base elements vary in size from 60 to 141 base pairs (bp); their nucleotide sequence similarities are primarily restricted to the inverse core-site and the core-site. Both the source of gene cassettes and the mechanism of their genesis are unknown, although they are presumed to be bacterial in provenance.

Figure 1

(A) Schematic representation of antibiotic resistance integron In30 (25) and (B) the VCR clusters characterized in V. cholerae. Genes and ORFs are indicated by boxes and labeled when known. M.F.R., mannose-fucose resistant; nt, nucleotide. In (A), the genes belonging to cassettes (white) are distinguished from the nonmobile genes of class 1 integrons (gray). The imperfect dyad symmetries present in the sequences of both the 59-base element and the VCR are symbolized by gray arrows; the range in size of these elements is marked above.

A number of species of the Vibrio genus, a widely distributed bacterial group found primarily in aqueous environments, are pathogenic for humans and animals. Vibrio cholerae has been responsible for major epidemics of human disease, especially since the early 19th century. It is known to have at least two pathogenicity islands (5): the locus encoding the cholera toxin, which is carried by a M13-like phage inserted into the chromosome (6), and a large cluster of genes encoding the accessory colonization factor and genes for the biogenesis of the toxin–co-regulated pilus, which is harbored in a prophage-like structure (7). The V. cholerae genome has also been found to contain repeated sequences (VCRs) in clusters that have a similar organization to the integron–gene cassette structures (8). They were first identified surrounding the mannose-fucose–resistant hemagglutinin gene (mrhA) (9) and the heat-stable toxin gene (sto) (10), two pathogenicity genes in V. cholerae O:1. VCRs are highly repeated (60 to 100 copies) and situated in a single restriction fragment corresponding to about 10% of the V. cholerae genome (11). The VCRs are a family of 123- to 126-bp sequences of imperfect dyad symmetry, and the 13 examples sequenced thus far show an overall identity of 92%. Two such sequences flank the heat-stable toxin gene sto (Fig. 1B), in both O:1 and non-O:1 V. cholerae isolates (10), and nine other VCRs have been found in a 6-kb V. cholerae fragment encoding the hemagglutinin gene, a lipoprotein gene, and eight other unidentified open reading frames (ORFs) (11, 12). Within these clusters, the VCRs are separated from one another by up to two ORFs (Fig. 1B).

We found a 90% sequence identity between the VCR sequences and the 59-base element associated with blaP3, which is an integron-associated antibiotic resistance gene encoding the carbenicillinase CARB-4 isolated fromPseudomonas (13). Further investigation of the structures of VCR clusters revealed that the gene-VCR organization is essentially identical to that of the resistance gene cassette array typically found in integrons (Fig. 1): (i) the VCRs usually abut a single ORF; (ii) the VCRs have imperfect dyad symmetries starting with an inverse core-site and ending with a core-site identical to the integron cassette consensus GTTRRRY (4); (iii) the inverse core-site is always complementary to the upstream VCR core-site; and (iv) all VCRs are in the same orientation to each other.

The ability of an integrase to recognize potential recombination sites can be assayed by measuring the integration of a single cassette or the co-integration of a plasmid carrying the cassette into a target integron (14). Using such an assay we demonstrated that the blaP3 cassette (pSU38::CARB4) (15) and the ORF1 cassette (pSU38::ORF1-cat) (16) found in the VCR locus containing the hemagglutinin gene (Fig. 1B) can be directed to the insertion sites of integrons. To track the ORF1 cassette, we tagged it by inserting a catgene [for chloramphenicol resistance (CmR)]. As shown in Table 1, in both cases, integration of the cassette into the integron was observed. The precise location of the cassette insertion events was established by polymerase chain reaction (PCR). In the 48 transconjugants studied, the cassettes were inserted at the attI site of In3, with recombination occurring at the core-sites of the VCRs (17). For bothblaP3 and ORF1::cat, the frequencies of transfer by recombination (conduction) were about 10−2 and were strictly dependent on IntI1 activity. These frequencies are comparable to those found in studies of integration of several different 59-base elements of known integron cassettes (14,18).

Table 1

Recombination frequencies of the blaP3and ORF1::cat cassettes. The indicated donor strains were mated with Escherichia coli UB5201 (14) and transconjugants separately selected for trimethoprim resistance (Tpr) and ampicillin resistance (Ampr) for ω3 and ω5 or chloramphenicol resistance (Cmr) for ω8 and ω9.

View this table:

To investigate the relationship between VCR clusters and integrons, we used nucleic acid hybridization to isolate a gene for an integron integrase in V. cholerae (19) and identified theintI4 gene, the product of which has 45 to 50% identity with the three known integrases (Fig. 2). An array of four gene-VCR cassettes is found upstream ofintI4 in the characterized V. cholerae fragment. This organization is identical to that of antibiotic resistance integrons; the location of the putative attI site is at the proximal boundary of the first cassette, 225 bp upstream from theintI4 start codon. A cluster of ribosomal protein genes is located downstream from intI4. Specific signals were detected in both V. mimicus and V. metschnikoviiby Southern (DNA) hybridization with an intI4 probe (20).

Figure 2

The V. cholerae gene intI4. (A) Schematic representation of the intI4locus, putative attI site, and VCR and associated ORFs are as shown in Fig. 1. The light gray box symbolizes the intI4gene encoding the integrase, and the dark gray boxes indicate the genes encoding ribosomal proteins and initiation factor 3; arrows show the directions of transcription. (B) Alignment of IntI4 (accession number AF 055586) and the three integron integrases (26).

A number of Vibrio isolates dating from 1888 to 1982 have been screened by means of oligonucleotide primers (17) corresponding to the most conserved regions of the VCR sequences (Table2). Vibrio cholerae O:1 569B exhibited a complex pattern of amplification composed of more than 10 different PCR products, and both V. metschnikovii isolates showed a pattern of at least six distinct products (20). This result demonstrates that VCR cassettes were found in theVibrio lineage before the emergence of antibiotic resistance integrons. At least three VCR cassettes were found in V. mimicus and V. parahaemolyticus, but none was present in the well-studied luminescent, nonpathogenic V. fisheri.Vibrio mimicus is phylogenetically close to V. cholerae, whereas V. metschnikovii, V. parahaemolyticus, and V. fischeri belong to other lines of descent in the Vibrio genus (21). The presence of relatively fewer VCR cassettes in Vibrio species other than V. cholerae may be real or it may indicate substantial nucleotide sequence variation between the VCR sequences present in these strains. Despite the high variability in 59-base element sequences, it has been established that antibiotic resistance cassettes are all substrates for the integron-encoded integrases; thus, we predict the same to be true for the gene-VCR clusters. These results do not distinguish between the possibilities that intI4 and its associated cassettes were acquired by a Vibrio ancestor before the separation of these species and lost in some (for example,V. fisheri), or that this integron invaded only pathogenicVibrio during their evolution.

Table 2

Vibrio strains screened for the presence of VCR cassettes. Abbreviations: V. chol., V. cholerae;V. met., V. metschnikovii; V. par., V. parahaemolyticus; V. mim.,V. mimicus; V. fis., V. fischeri; Path., pathogenicity; cass., cassettes. Symbols: +, more than two different amplification products (24); −, no product. PCR primer sequences are given in (17).

View this table:

Of the six different cassettes sequenced from the V. metschnikovii PCR products, one showed 67% identity to the previously described ORF5 V. cholerae cassette (Fig. 1B), which encodes a 15-kD protein of unknown function, suggesting thatV. cholerae and V. metschnikoviishared the same pool of cassettes and that VCR cassettes are disseminated among Vibrio species. The gene encoding the heat-stable toxin (sto) is harbored by a VCR cassette inV. cholerae (10) and is dispersed in V. cholerae (both O:1 and non-O:1 serotypes), as well as amongV. mimicus strains; the sto genes found inV. cholerae and V. mimicus are highly similar (22). In the case of V. mimicus, it is not known whether the sto gene is a component of a VCR cassette, but its variable distribution among different V. mimicusisolates suggests the presence of such a structure.

Our studies show that VCR islands are integron-like structures and that their formation likely occurred by typical integrase-mediated processes, suggesting that integrons (widely spread in antibiotic-resistant Gram-negative bacteria) also existed for the purpose of gene capture in Vibrio species. The variation observed in codon usage of the gene-VCR cassettes as well as in their GC content (between 33 and 45%, compared with 47% for the V. cholerae genome) is consistent with the idea that the VCR-associated genes were recruited from other microbial sources. The observation that there are 60 to 100 VCR copies in the V. cholerae genome (11) implies that there must be an equivalent number of cassettes, that is, more than 10 times the number found in the largest antibiotic resistance integron. The function of this “super-integron” may extend beyond the clustering of genes for pathogenicity into a generalized system for the entrapment and spread of other biochemical functions, for example, the antibiotic resistance–encoding cassette blaP3 (which is a VCR cassette). It is well established that integrons had a major role in the recent spread of multidrug resistance among Gram-negative bacteria. Our studies support their function in bacterial genome evolution, through the fixed integration of genes at secondary sites, as has already been observed for antibiotic resistance cassettes (1). A striking feature of the VCR integron, compared with the antibiotic resistant integrons, is the conserved sequence of the recombination elements of its endogenous cassettes (the VCR itself). Given the relationship between VCR sequences and the numerous 59-base elements, it is possible that each resistance gene cassette with a different 59-base element represents a single member of a clustered group of cassettes. If this is true, we may expect to find such structures in many bacterial genera.

  • * Present address: Institut Pasteur, 25-28 rue du Dr. Roux, 75724 Paris Cedex 15, France.


View Abstract

Navigate This Article