| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Previous Article | Next Article ![]()
Eukaryotic Cell, April 2007, p. 721-733, Vol. 6, No. 4
1535-9778/07/$08.00+0 doi:10.1128/EC.00338-06
Copyright © 2007, American Society for Microbiology. All Rights Reserved.
,
bieta Rozp
dowska,2
Jure Pi
kur,2 and
Kenneth H. Wolfe1*
Department of Genetics, Smurfit Institute, Trinity College Dublin, Dublin 2, Ireland,1 Cell and Organism Biology, Lund University, Sölvegatan 35, 22362 Lund, Sweden2
Received 24 October 2006/ Accepted 19 January 2007
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
D. bruxellensis is probably the major microbial cause of wine spoilage worldwide and causes substantial economic losses within the wine industry (22). Wines infected by D. bruxellensis are said to have "Brett" character: they may smell mousy or medicinal or of wet wool, burnt plastic, or horse sweat (38). The mousy taint is the result of pyridines synthesized by D. bruxellensis from lysine and ethanol (29), while medicinal or barnyard odors are caused by the volatile phenols 4-ethylguaiacol and 4-ethylphenol, secondary metabolites produced by D. bruxellensis from phenolic acids naturally present in the grape must (14, 30). D. bruxellensis may have been present in wineries for centuries, and a very slight Brett character is traditionally considered desirable in certain styles of wine (22). The frequency and severity of D. bruxellensis contamination have increased in recent decades, however, as winemaking trends have changed to favor the production of wines that contain more residual sugars and that may be unsulfited, unfiltered, or aged on lees (dead yeast cells), all factors that are favorable to D. bruxellensis growth.
D. bruxellensis possesses a number of adaptations that allow it to survive in the physiologically challenging environments of fermenting must and maturing wine. Like S. cerevisiae, it is ethanol tolerant, facultatively anaerobic, and petite positive (it can survive without mitochondrial DNA). Both species are also Crabtree positive and thus ferment preferentially in the presence of high glucose under aerobic conditions (36, 48). This suite of characteristics has evolved independently in each of the two lineages, possibly in response to similar selective pressures. The species differ in a number of other regards, however. For example, D. bruxellensis utilizes glucose less efficiently and grows much more slowly than S. cerevisiae (57) but can assimilate a wider variety of alternative carbon sources (16). Together, these characteristics explain the typical ecological succession observed during the course of D. bruxellensis-contaminated alcoholic fermentations, with S. cerevisiae dominating throughout the primary fermentation phase and then being replaced by D. bruxellensis in the maturation phase, when ethanol concentrations are high and only minute amounts of residual sugars remain (51).
Despite its economic importance and physiological interest, D. bruxellensis has remained largely unstudied at the genomic level. Previous work has focused on sequencing rRNA regions for phylogenetic analysis (67) or to aid in molecular detection of D. bruxellensis contamination (45), and only one nuclear protein-coding gene has been sequenced from this species to date (33). Consequently, the genetic bases of the physiological capabilities of D. bruxellensis remain largely unknown. To investigate these, and to provide a resource for further research on this species, we undertook genome survey sequencing of strain CBS 2499, isolated from wine. We report here a preliminary analysis of the genome organization and gene content of this strain.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Gene identification. To identify protein-coding genes in D. bruxellensis, we first compared the contig sequences to a database of S. cerevisiae protein sequences (5,770 protein-coding genes, excluding dubious open reading frames and transposable element genes, downloaded from the Saccharomyces Genome Database in July 2005). Regions of the contigs greater than 100 nucleotides in length were annotated as orthologs to S. cerevisiae genes to which they were reciprocal best BLAST hits (using BLASTX and TBLASTN bit scores) (1).
To identify D. bruxellensis genes that do not have S. cerevisiae orthologs, we repeated this analysis using the nonredundant (NR) protein database (downloaded from the National Center for Biotechnology Information in July 2005). Regions of the D. bruxellensis contigs were considered to be orthologs of characterized genes in the NR database to which they were reciprocal best hits if (i) the contig region had no reciprocal best S. cerevisiae hit or (ii) the BLASTX bit score of the NR reciprocal best hit to the region was at least 100 bits higher than that of the S. cerevisiae hit.
Finally, we used S. cerevisiae rRNA and tRNA gene sequences as BLASTN queries against the contigs, with an upper E-value threshold of 1e-10, to identify D. bruxellensis homologs of these genes.
Phylogenetic position. We constructed phylogenetic trees using amino acid sequence data from D. bruxellensis, S. cerevisiae, and eight other fungal species for which the whole genome sequence was available (Candida glabrata, Kluyveromyces lactis, Debaryomyces hansenii, Yarrowia lipolytica [19], Candida albicans [34], Ashbya gossypii [18], Aspergillus nidulans [23], and Schizosaccharomyces pombe [66]). We accepted as sets of orthologous genes those genes that were best mutual BLASTP hits, with E values of <1e-40, in all possible pairwise comparisons among all 10 taxa. This yielded 396 sets of orthologous genes.
In those cases where only a partial gene sequence was present in the D. bruxellensis data, we extracted the corresponding segment of the orthologous gene in each of the other species based on BLASTP HSP coordinates; in all other cases, complete gene sequences were used. The amino acid sequences of each set of orthologs were aligned using T-Coffee (46), and regions of uncertain alignment were removed using Gblocks (13). Each of the 396 alignments was separately used to estimate the species phylogeny, using PHYML (26) with the JTT model of amino acid substitution and a gamma distribution of substitution rates with four categories.
We visualized the degree of topological conflict among these trees by constructing a consensus network (31) using SplitsTree4 (32). To test whether one of the three best-supported topologies suggested by this consensus network was a significantly better fit to the total sequence data than the others, we performed the Shimodaira-Hasegawa test (56) as implemented in Tree-Puzzle (54). We concatenated the 396 gene alignments to produce an alignment of 115,036 residues and calculated the log likelihood of each of the three topologies given these sequence data and a JTT model of substitution. To ensure that model misspecification did not affect our results, we repeated the Shimodaira-Hasegawa test using four other substitution models (WAG, VT, BLOSUM 62, and Dayhoff).
Genome architecture. For each pair of genes adjacent in D. bruxellensis, we determined whether their orthologs were also adjacent (or separated by fewer than five genes) in the S. cerevisiae genome. We also used the same method to calculate the degree of gene order conservation between D. bruxellensis and C. albicans. For this purpose, we considered as orthologous any pair of D. bruxellensis and C. albicans genes that shared a mutual S. cerevisiae ortholog. The list of C. albicans/S.cerevisiae orthologs we used here was obtained from http://www.candidagenome.org.
We used two methods to identify introns in D. bruxellensis. Because many introns occur at conserved genomic locations across the hemiascomycetes (8), we first examined those D. bruxellensis genes orthologous to intron-containing S. cerevisiae genes. We aligned each pair of orthologous genes and determined whether the D. bruxellensis gene contained known hemiascomycete intron splice or branch sites (obtained from reference 27) and/or noncoding, presumably intronic, sequence in the region of the gene corresponding to the S. cerevisiae intron. Secondly, to detect D. bruxellensis-specific introns, we also identified genes that appeared to contain intron length segments of extra sequence when compared to their S. cerevisiae orthologs. To do this, we searched for genes that returned two significant BLASTX HSPs to a single S. cerevisiae protein, where the interval between those HSPs was 50 nucleotides (approximately the minimum known hemiascomycete intron length [8]) or greater in D. bruxellensis but 10 nucleotides or smaller in S. cerevisiae. These genes were then screened for known intronic splice and branch sites, as described above.
We used Tandem Repeats Finder (6) to identify microsatellites in the assembly. To identify transposable elements in this genome, we used the D. bruxellensis contigs as BLASTX queries against a database of protein sequences of TYA and TYB genes from Ty elements of multiple hemiascomycete species (43) and protein-coding regions of non-long terminal repeat (LTR) retrotransposons and DNA transposons from C. albicans and Y. lipolytica (12, 25, 44). Regions of the contigs that had hits with E values of <1e-10 to any of these proteins were counted as transposable elements. If a region had significant hits to more than one type of element, it was annotated as belonging to the element type of the best hit. Because many of the elements identified were fragmentary and/or substantially diverged from the query sequences used, we identified the longest D. bruxellensis elements from each of the three classes found and repeated the analysis, using these sequences as a TBLASTX database, to identify more diverged elements in our data. To ensure that our estimate of the number of elements was not inflated by double counting 5' and 3' ends of a single element on different contigs, we counted only hits that shared a common region of homology to the query as unique elements.
Gene content analyses. To further examine the differences in the sets of proteins encoded by D. bruxellensis and other yeasts, we searched for three types of loci: individual genes that have undergone recent lineage-specific duplication in D. bruxellensis, gene families that have expanded in D. bruxellensis relative to other species, and genes that have been horizontally transferred. To identify genes in the first two categories, we created a database of "putative genes" in D. bruxellensis. Each such gene was a region of a contig that was over 100 nucleotides long, had a BLASTX hit to an S. cerevisiae protein with E < 1e-10 (but was not necessarily the reciprocal best BLAST hit to that protein), and did not contain a stop codon.
A pair of genes arising from a lineage-specific gene duplication are likely to be one another's best BLAST hits, to the exclusion of orthologs in other lineages. We used the translated putative D. bruxellensis genes as BLASTP queries against a database containing the same putative D. bruxellensis proteins plus the proteomes of S. cerevisiae, C. glabrata, K. lactis, A. gossypii, C. albicans, D. hansenii, and Y. lipolytica. Any D. bruxellensis gene whose top two hits were itself and another D. bruxellensis protein was considered a probable lineage-specific duplication.
To assess whether genes with particular functions are more likely to be duplicated in D. bruxellensis, we tested whether the S. cerevisiae orthologs of the duplicated genes represent a functionally biased subset of the S. cerevisiae proteome. We determined the distribution of the orthologs of the duplicates among gene ontology categories, for each of the three ontologies, using GOToolBox (42). For each ontology, we then compared this distribution to that of the 5,770 S. cerevisiae proteins used in our analyses, using Fisher's exact test. We corrected for multiple tests (5).
We also searched for gene families that have expanded to a greater size in D. bruxellensis than in two closely related species, C. albicans and D. hansenii. The proteomes of these two species were combined with the translated putative gene set of D. bruxellensis into a single database. We queried this database with the protein sequences of S. cerevisiae genes belonging to previously defined gene families with two or more members (a subset, defined by phyletic pattern, of the "consensus families" from http://cbi.labri.fr/Genolevures/fam/index.html). We then calculated the total number of unique hits with an E value of <1e-10 to D. bruxellensis, C. albicans, and D. hansenii proteins across all genes in each family. Families that had more hits to D. bruxellensis than to either C. albicans or D. hansenii were considered to have undergone gene family expansion in D. bruxellensis. This is a conservative threshold, as the D. bruxellensis genome data are far less complete than those of either C. albicans or D. hansenii.
Nucleotide sequence accession numbers. Read sequences have been deposited in GenBank with accession numbers EI011584 [GenBank] to EI026443 [GenBank] , and contig sequences are available on request from the corresponding author. In addition to the genome survey data, we completely sequenced a number of clones containing adenyl deaminase genes and nitrate/nitrite utilization genes (GenBank accession numbers EF364424 [GenBank] to EF364429 [GenBank] ).
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
|
Genes and genetic code. We identified complete or partial sequences of 2,606 D. bruxellensis protein-coding genes with orthologs in S. cerevisiae, listed in Table S1 of the supplemental material. Additionally, 277 genes without S. cerevisiae orthologs, but with orthologs from other Saccharomycetales species, were identified (most of these genes were annotated as hypothetical; the 50 with functional annotations are listed in Table 2), as was a single example of a gene horizontally transferred from a bacterial species (see below). No other genes without fungal homologs were identified. An almost complete rDNA repeat, including 18S, 5.8S, 25S, and 5S rDNAs, is present in the data, as are at least 24 tRNA genes. If we assume that the 40% of the genome represented in our data are a random sample of the whole genome with respect to gene density, the total protein-coding gene complement of D. bruxellensis is likely to be on the order of 7,430 genes, which is comparable to that of other hemiascomycetes.
|
Phylogenetic position. To determine the phylogenetic position of D. bruxellensis among the hemiascomycetes, we constructed 396 trees, based on protein sequences of orthologous genes from D. bruxellensis and nine other fungal species (see Materials and Methods). We then calculated a consensus network to quantify the degree of incongruence between these trees (31). This involves generating a list of splits (bipartitions of the taxa corresponding to branches in the trees) present in the complete set of trees and weighting them by the frequency with which they occur; all splits that occur at a frequency above a certain threshold can then be displayed in a network. Boxes in the network indicate the presence of incompatible splits. Although the major groupings in our network are well supported, there is conflict between the gene trees regarding the position of D. bruxellensis: three incompatible splits are each present in over 25% of trees (Fig. 1A).
|
Previous estimates of the phylogenetic position of D. bruxellensis reflect the uncertainty we observe in our data: separate analyses based on 18S sequence data have supported both topology 1 (11) and topology 3 (47). Our analysis of almost 400 protein-coding genes significantly rejects topology 3, but we remain unable to determine whether D. bruxellensis diverged from the hemiascomycete lineage shortly before C. albicans and D. hansenii (topology 1) or if these three species briefly formed a shared divergent lineage before the speciation of D. bruxellensis (topology 2). It is likely that very few substitutions were fixed in the short time periods separating these divergence events, making it difficult to accurately reconstruct the true topology. The proteins used in our analysis were well conserved across taxa and thus relatively slowly evolving, and it is possible that more rapidly evolving proteins would provide more substitutions on the branches of interest. Such proteins, however, are more difficult to assign to orthology groups and to reliably align and are more likely to be saturated, possibly leading to decreased phylogenetic accuracy.
Gene sequence statistics. We calculated the amino acid identity between each D. bruxellensis protein and its S. cerevisiae ortholog by aligning their amino acid sequences and removing gapped sites. D. bruxellensis and S. cerevisiae orthologous proteins have a mean amino acid identity of 48.8% (N = 2,615, standard deviation, 14.6). This is significantly lower than the amino acid identity between C. albicans and S. cerevisiae orthologs for the subset of proteins that are present in all three species (mean C. albicans-S. cerevisiae identity, 52.9%; mean D. bruxellensis-S. cerevisiae identity, 49.9%; N = 2,115; P < 1 x 1016, Wilcoxon signed-rank test). If we accept that the divergence of D. bruxellensis, S. cerevisiae, and C. albicans is effectively a trichotomy (or that the best-supported tree, topology 2, reflects the real phylogeny), this suggests that the rate of amino acid sequence evolution in D. bruxellensis is higher than that of C. albicans.
D. bruxellensis genes tend to have higher GC content than their S. cerevisiae orthologs at all classes of sites (Table 1). The difference in GC content across orthologous sites is statistically significant (P < 1 x 1016 for all genic sites and for codon third position sites; Wilcoxon signed-rank test). GC content also varies among D. bruxellensis genes substantially more than among the orthologous S. cerevisiae genes, a trend that is particularly obvious at third position sites (Fig. 2).
|
We compared the order of genes along D. bruxellensis contigs with that of their orthologs on the S. cerevisiae genome to determine the degree of synteny conservation between these two species. Most of our D. bruxellensis contigs contain only a single gene, but 286 contain two or more genes, allowing us to analyze 318 pairs of adjacent genes. Of these gene pairs, 19% are also immediate neighbors in S. cerevisiae and a further 4% are close neighbors separated by one to five genes. Because we expect gene order relationships between D. bruxellensis and S. cerevisiae to have been affected by the whole genome duplication that occurred on the S. cerevisiae lineage (65), we also assessed the degree of gene order conservation between D. bruxellensis and C. albicans, a species that has not undergone whole genome duplication. In this case, of 233 pairs of adjacent D. bruxellensis genes with C. albicans orthologs, 18% have adjacent C. albicans orthologs and an additional 8% are separated by one to five genes. Only 17 pairs of adjacent genes in D. bruxellensis are also adjacent in both C. albicans and S. cerevisiae (see Table S2 in the supplemental material); these genes show no obvious functional bias.
These results suggest that D. bruxellensis has a somewhat lower rate of small-scale rearrangements than C. albicans (Fig. 3). An earlier analysis using the same method determined that only 9% of gene pairs adjacent in C. albicans were also adjacent in S. cerevisiae (55). As we expect approximately half of the links between gene pairs in pre-whole-genome duplication species to be broken due to random gene loss in the S. cerevisiae lineage subsequent to duplication (65), we can estimate that approximately 62% of D. bruxellensis and 82% of C. albicans neighboring gene pairs have been disrupted in S. cerevisiae due to chromosomal rearrangement. The lower value for D. bruxellensis is not unusual: it has been shown that the rate of small chromosomal inversions differs by over an order of magnitude among six completely sequenced yeast genomes (21), and our estimate of the percentage of adjacent gene pairs in D. bruxellensis conserved in S. cerevisiae (19%) is consistent with those for D. hansenii (16%) and C. tropicalis (18%) (39).
|
We identified introns in 40 D. bruxellensis genes whose S. cerevisiae orthologs also contain introns (Table 3). Of these, two (DYN2 and RPL7A and their D. bruxellensis orthologs) each contain two introns. We found an additional eight introns that appear to be specific to D. bruxellensis (Table 3). This may be an underestimate of the true number of novel introns, as our detection method requires that exons both upstream and downstream of the intron be identified as HSPs by BLAST. Hemiascomycete introns (including those we observe in our data) have a strong bias towards the 5' end of genes, and in many cases the first exon encodes five or fewer amino acids; such first exons may not be detected as an HSP.
|
We examined two kinds of repeated sequences in the D. bruxellensis genome: microsatellites and transposable elements. The assembly contains 48 microsatellites with between 8 and 34 dinucleotide or trinucleotide repeat units. On the basis of results from other hemiascomycete species (37), it is likely that at least some of these are polymorphic and may thus be of use in future analyses of population diversity.
The great majority of transposable elements so far described in hemiascomycete yeasts are LTR retrotransposons, although non-LTR retrotransposons and DNA transposons have been found in C. albicans and Y. lipolytica (12, 25, 44). The LTR retrotransposons fall into two broad categories: Ty1/copia (including Ty1, Ty2, T4, and Ty5 elements) and Ty3/gypsy (43). The D. bruxellensis data contain at least 42 transposable elements, all of which are LTR retrotransposons: three Ty1-like, 16 Ty3-like, and 13 Ty5-like. This number includes only those elements that retain at least fragments of the TYA or TYB protein-coding genes and is thus a minimum estimate of the number of elements present; many solo LTR sequences are likely to be present but too degenerate to identify by comparison with LTR sequences from other species.
It has previously been suggested that Ty1 elements first appeared in the genomes of hemiascomycetes at the base of the S. cerevisiae/K. lactis clade (43), after the divergence of D. bruxellensis from the main hemiascomycete lineage. The presence of three putative Ty1 elements in our data may indicate that elements of this class arose at an earlier stage in hemiascomycete evolution and have subsequently been lost from other species in the C. albicans/D. hansenii clade (as they have been from Saccharomyces bayanus, for example). The Ty1-like elements present in D. bruxellensis are substantially diverged from one another and from their closest homologs in other species (the most similar has 26% amino acid identity to a Tkm1 element from Kluyveromyces marxianus), indicating that they are unlikely to be the result of a recent horizontal gene transfer event.
Differences in gene content. The variations in physiological capabilities between D. bruxellensis and other yeasts are caused by differences in gene complement and regulation in these species. To investigate the extent of gene content differences, we identified genes in D. bruxellensis that are absent from S. cerevisiae (Table 2), and we also sought evidence for increases in gene number that could underlie various traits. We identified 89 pairs of lineage-specific duplicated genes (see Table S3 in the supplemental material). Enzymes and transporters were notably overrepresented in this set of genes, among other significantly enriched GO categories (see Tables S4a to c in the supplemental material). Notably, we found duplicated copies of three genes in the allantoin catabolism pathway (DAL1, DCG1, and MLS1), all with 97% DNA sequence identity between copies. We also identified 18 gene families that are expanded in D. bruxellensis relative to both C. albicans and D. hansenii, including several families functioning in sterol, phosphocholine, and phospholipid synthesis (Table 4).
|
|
D. bruxellensis also possesses genes coding for several subunits of the respiratory chain complex I and for a SHAM-sensitive alternative oxidase (AOX1) (Table 2), suggesting that this species is capable of alternative respiration. This pathway bypasses complexes III and IV of the respiratory chain; instead, electrons from the ubiquinone pool are used by the alternative oxidase to reduce oxygen to water. Alternative respiration and aerobic fermentation have previously been hypothesized to represent two alternative responses to glycolytic overflow (63). Under this theory, when high levels of glucose are available to the yeast cell, rapid glycolysis may lead to saturation of the respiratory pathway and subsequent accumulation of pyruvate and NADH. To prevent glycolysis from stalling, this NADH may then be reoxidized to NAD+ either via alternative respiration (in Crabtree-negative yeasts) or by the reduction of pyruvate to ethanol (in Crabree-positive yeasts). The presence of complex I and AOX1 in D. bruxellensis, a Crabtree-positive yeast capable of aerobic fermentation, is therefore somewhat unexpected (62), although alternative respiration has previously been demonstrated in some strains of Dekkera and Brettanomyces yeasts (7). It has been suggested that this alternative respiratory pathway may be implicated in the Custers effect (anaerobic inhibition of fermentation) shown by these yeasts (7) or NADH reoxidation through glycerol production, but little is known of the mechanisms of these processes.
Finally, seven D. bruxellensis genes coding for peroxisome structural and regulatory proteins were also identified as having ascomycete but no S. cerevisiae orthologs. Peroxisomes are found in almost all eukaryotes, and some peroxins (including that encoded by PEX19) are conserved across this taxonomic spectrum (53), suggesting that S. cerevisiae orthologs to at least some of these genes are likely to exist. Many of the ascomycete peroxisome genes in Table 2 have in fact been previously annotated as homologous to S. cerevisiae genes (59), although their products often have low amino acid identity. Phylogenetic analyses suggest that S. cerevisiae peroxin genes are indeed orthologous to the D. bruxellensis genes but are too divergent to be detected using our method (data not shown).
Horizontal gene transfer.
While comparing our contig sequences to the entire NCBI database, we identified a gene fragment in D. bruxellensis that was the reciprocal best BLAST hit to a Burkholderia cenocepacia (
-Proteobacteria) gene annotated as encoding an adenosine deaminase (ADA). We sequenced three additional plasmid clones to obtain the full sequence of this gene, the surrounding intergenic regions, and portions of the neighboring genes. The gene is 1,053 nucleotides long and is located downstream of a URA4 homolog and upstream of a transporter of the DAL5 family. Both of the neighboring genes show clear evidence of fungal origin, which rules out the possibility that the ADA gene-like gene arose from bacterial contamination of our D. bruxellensis library. The complete gene sequence identifies the gene as a member of the adenyl deaminase gene subfamily, which includes both adenosine deaminase and adenine deaminase (ADE) genes (41, 52).
We constructed a phylogenetic tree using sequences of the D. bruxellensis gene, its close BLAST hits, and a number of known ADA and ADE genes (Fig. 5). The D. bruxellensis gene clusters with neither the ADA nor the ADE gene clade but forms a separate clade with sequences from four species of the Burkholderia cepacia complex (40) and one sequence from Kineococcus radiotolerans. The clear separation of this clade from the fungal ADE genes, together with the existence of a canonical ADE gene in D. bruxellensis (DbADE1) (Fig. 5), suggests that horizontal gene transfer from a bacterial species is the most likely origin for the Burkholderia-like gene, which we named DbHAD1 for D. bruxellensis horizontally transferred adenyl deaminase gene.
|
It seems likely that both D. bruxellensis and the four B. cepacia complex species received this gene as a horizontal transfer, either independently or sequentially, from some other species not yet represented in the public sequence databases. The four B. cepacia complex species, K. radiotolerans, and D. anomala (the sister species of D. bruxellensis) have all been isolated from soil (15, 50, 61), a common habitat that provides the opportunity for genetic transfers, and B. cepacia complex species are known to have genomic characteristics that facilitate the receipt and transfer of nonendogenous DNA (15). Comparative analysis of conserved amino acid residues gave no indication of the substrate specificity of DbHAD1, although the residues required for catalytic function are conserved (see Table S5 in the supplemental material). It therefore seems likely that this gene remains functional in D. bruxellensis; it is difficult to speculate on precisely what role it plays, however.
While the genomes of some other hemiascomycete species have been shown to contain a number of horizontally transferred genes (19, 28), DbADE1 was the only example found in our data. We found no evidence of an ortholog of URA1, the horizontally transferred cytosolic dihydroorotate dehydrogenase (DHODase) gene present in S. cerevisiae and closely related species (24). Most basal hemiascomycetes, like other eukaryotes, have a mitochondrial DHODase, encoded by URA9; the activity of this enzyme is coupled to the mitochondrial respiratory chain because a quinone is the terminal electron acceptor. As URA1 uses fumarate as an electron acceptor, the presence of this gene effectively decouples uracil biosynthesis from respiration, and it has been proposed that the transfer of URA1 from an anaerobic bacterium to an ancestor of S. cerevisiae and K. lactis was an essential step in the evolution of facultative anaerobiosis in this group of yeasts (24, 28, 48).
Some species in the URA1-possessing clade, such as K. lactis, retain both URA1 and URA9, while in others, like Saccharomyces cerevisiae, URA9 has been lost and only URA1 has been retained (24, 28). Our sequence data contain a complete URA9 gene (Table 2), indicating that a parallel gene displacement event has not occurred in D. bruxellensis. It is not possible to determine from our data, however, whether D. bruxellensis has an undetected URA1 gene that allows it to live anaerobically or whether it has found an alternative evolutionary solution to this challenge.
Conclusions. The genome survey sequence data analyzed here show that D. bruxellensis strain CBS 2499 has genome characteristics typical of other hemiascomycetes in many respects, including estimated gene number, intron size and number, intergenic length, and gene content. Its rate of sequence evolution is faster than that of its relative C. albicans, though it has a lower rate of genomic rearrangement; neither rate, however, is extreme. Although aneuploidy may contribute to the variation in karyotypes observed between strains of D. bruxellensis (58), the relatively low proportion of duplicated sequence we see in our data, together with the small number of lineage-specific duplicate genes identified, suggests that this strain is haploid; this is consistent with its relatively small estimated genome size when compared with other strains.
These data should additionally provide a resource for future genetic study of this species. For example, although our sequences do not include the gene coding for the cinnamate decarboxylase required for the production of ethylphenols and ethylguaiacols (30), they contain numerous uncharacterized enzymes, some of which may be involved in the generation of the other volatile compounds that contribute to "Brett" character in wine (29, 38). These data may also facilitate analyses of population structure and variation in D. bruxellensis. Such analyses are essential if we are to develop optimal strategies to control contamination and to determine whether strains of D. bruxellensis that are neutral or even beneficial to wine character exist and may be harnessed by the viticultural industry (22).
| ACKNOWLEDGMENTS |
|---|
This work was supported by Science Foundation Ireland, the E. P. Sörensens Foundation, and the Nilsson-Ehle Foundation.
| FOOTNOTES |
|---|
Published ahead of print on 2 February 2007. ![]()
Supplemental material for this article may be found at http://ec.asm.org/. ![]()
| REFERENCES |
|---|
|
|
|---|
kur. 2004. Horizontal gene transfer promoted evolution of the ability to propagate under anaerobic conditions in yeasts. Mol. Genet. Genomics 271:387-393.[CrossRef][Medline]
kur, J., E. Ro
p
dowska, S. Polakova, A. Merico, and C. Compagno. 2006. How did Saccharomyces evolve to become a good brewer? Trends Genet. 22:183-186.[CrossRef][Medline]
/ß barrel enzymes: a new adenine deaminase family. J. Mol. Biol. 334:1117-1131.[CrossRef][Medline]