American Journal of Science, Vol. 305, June/September/October 2005, P.467-502
Biogeochemical signatures through time as inferred from whole microbial genomes
Aubrey L. Zerkle,
Christopher H. House
and
Susan L. Brantley
Department of Geosciences, Pennsylvania State University, 212 Deike Building, University Park, Pennsylvania, 16802
Corresponding author: Christopher House, chouse{at}geosc.psu.edu
 |
ABSTRACT
|
|---|
Throughout geologic time, a strong feedback has existed between the geosphere and the biosphere; therefore biological evolution and innovation can be linked to the evolution of ancient environments on Earth. Here we deduce geochemical signatures and phylogenetic relationships of prokaryotes from whole genome sequences and use this link to infer geochemical aspects of the biosphere through time. In particular, we have investigated two potential biosignatures for modern and ancient biochemistry: the magnitude of microbial carbon isotopic fractionation, and the use of metals in microbial cells.
The distribution of carbon fixation pathways on microbial phylogenies suggests that: (1) both low and moderate carbon isotope fractionation were established quickly in the early evolution of life; and (2) methanotrophic and ethanotrophic metabolisms capable of producing biomass with extreme 13C depletions are not primitive, but rather evolved after the major groups of Prokaryotes had already diverged. The universal importance of the TCA cycle, which results in low carbon isotopic fractionation, indicates it may have evolved especially early making it perhaps the most likely carbon fixation pathway for the last common ancestor (LCA). Low isotopic fractionation by the reductive TCA cycle can be considered consistent with carbon isotope ratios found in 3,800 million year old Isua sediments. Additionally, moderately fractionated biomass from up to 3,500 million years ago can now likely be attributed to carbon fixation by anoxygenic photoautotrophs using the reductive pentose phosphate cycle (Calvin cycle).
In addition to carbon, cells require a number of other elements that could potentially provide biosignatures, including bioactive trace metals. We calculated "model metallomes" for 52 prokaryotes based on the number of atoms of trace metals required to express one molecule of each metallo-enzyme coded for in the corresponding genomes. Our results suggest that the use of metals in prokaryotes as a group generally follows the hierarchy: Fe >> Zn > Mn >> Mo, Co, Cu >> Ni > W, V. However, model metallomes vary with metabolism, oxygen tolerance, optimum growth temperature, and phylogeny. The model metallome of methanogens shows a unique metal signature, suggesting that elevated requirements of nickel and tungsten might be translated to the expressed metallome providing a biosignature for methanogenesis. The model metallomes of diazotrophs and cyanobacteria do not show unique signatures; however changes in enzyme expression under some conditions could still translate into a metal biosignature in the expressed cellular metal content.
In a separate analysis, we made inferences about the timing of the evolution of individual metallo-enzymes based on their function and occurrence in modern organisms. The results suggest that fluctuations in the redox state of the Earths oceans and atmosphere have forced changes in the proportions of metals used in biology. In this model, biological use of copper and molybdenum has developed along with bioavailability; biological use of iron and manganese has developed counter to bioavailability; and biological use of zinc, cobalt, and nickel has not changed significantly through time. Using this technique, we estimated a model metallome for the LCA based on the metallo-enzymes we infer to have been present at that time. This metallome for the LCA differs greatly from one extrapolated from the distribution of model metallomes on microbial phylogenies, supporting the idea that gene loss, metal substitution, and lateral gene transfer have been important in shaping the enzymatic composition of extant organisms.
 |
INTRODUCTION
|
|---|
Using Microbial Genomes to Infer Past Biochemistry
Microbial genomes provide information about metabolic capabilities and gene regulation in extant organisms, as well as molecular clues to the events leading to the evolution of these genes in the geologic past (Macalady and Banfield, 2003). In this context, modern prokaryotic genomes can provide a link between the geosphere and the biosphere through the evolutionary signatures they contain. Here we demonstrate this by using genomes to examine the modern phylogenetic distribution of two potentially important biochemical signatures, and interpreting their evolution and possible distribution in ancient life. We used the gene content of prokaryotic genomes to predict cellular metal requirements in modern organisms, and to determine how metal use and biological carbon isotope fractionation have evolved through geologic time. We first refined the topology of the tree of life using whole genome sequences of composite phylogenetic groups of prokaryotes. We then mapped biochemical characteristics onto the topology of this composite tree, and used the result as a foundation for discussion about microbial biochemistry on early earth. As part of this study, we have reviewed and updated the literature on biological carbon isotope fractionation by prokaryotes from carbon dioxide and methane, establishing a range of fractionation factors produced by specific carbon fixation pathways. We have also examined the cellular metallome inferred from gene content, and evaluated the use of a model metallome as a biosignature for specific microbial metabolisms. Ultimately we suggest that such analyses will be useful in interpreting biosignatures in both modern prokaryotic ecosystems and in the rock record.
Biological Carbon Isotope Fractionation
In recent decades, the biological fractionation of carbon isotopes has become an important means of recognizing and studying ancient life (for a review, see Schidlowski, 2001). Autotrophic microorganisms preferentially incorporate 12CO2 into their biomass to a varying degree based partly on their carbon fixation pathway. Therefore, specific carbon fixation pathways recognizably affect the isotopic signature of organic matter preserved in the geologic record.
Among the Bacteria and the Archaea, the two prokaryotic domains of life, there are four known CO2 carbon fixation pathways: (1) the reductive tricarboxylic acid (TCA) cycle; (2) the 3-hydroxypropionate cycle (3-HP); (3) the reductive pentose phosphate cycle (or Calvin Cycle), using Rubisco (PP); and (4) the reductive acetyl-CoA pathway (AP). There may be additional presently unknown CO2 carbon fixation pathways, operating in organisms such as the recently described Candidatus Chlorothrix halophila, recovered from a hypersaline microbial mat (Klappenbach and Pierson, 2004).
Cell carbon can also be fixed from methane or other hydrocarbon gases. In the Bacteria, methanotrophy is accomplished using the enzyme methane monooxygenase in either the RuMp pathway (RuMP) or the Serine Pathway (S) (Hanson and Hanson, 1996). In the Archaea, it has been suggested that anaerobic methanotrophy (AOM) occurs using a modified methyl co-enzyme M reductase (Hallam and others, 2003; Krüger and others, 2003). The consumption of non-methane hydrocarbons, such as ethane, is known in the Actinobacteria and in the
-Proteobacteria (Bokova, 1954; Davis and others, 1956; de Bont, 1976) but the process, ethanotrophy (E), remains considerably less studied than bacterial methanotrophy despite the potential of ethane as a global microbial substrate (DHondt and others, 2003).
Figure 1A shows the degree of carbon isotopic fractionation from either CO2 or CH4 to cellular biomass observed in organisms using different carbon fixation pathways (data found in tables 1A and 1B; adopted and updated from House and others, 2003a; also reviewed by Knoll and Canfield, 1998; van der Meer and others, 2001a). Fractionation factors in figure 1A are given in
(Hayes, 2002):
 | (1) |
where A is the substrate being used, B is the resulting biomass, and
is the fractionation factor, the ratio of the rate of carbon fixation for 12C to the rate of carbon fixation for 13C, calculated as
 | (2) |

View larger version (12K):
[in this window]
[in a new window]
|
Fig. 1. (A) A generalized summary of published carbon-isotopic fractionation factors from carbon dioxide or methane to biomass by prokaryotes grouped by their known or suspected carbon fixation pathway (modified from House and others, 2003a, with data from tables 1A and 1B). Note that only defines a relative value of fractionation produced by the specific pathway, and that the actual carbon isotopic compositions produced will vary according to the isotopic composition of the substrate used. The carbon fixation pathways from carbon dioxide are: the reductive TCA cycle (TCA), the 3-hydroxypropionate cycle (3-HP), the reductive pentose phosphate cycle (PP), and the acetyl-CoA pathway in species that form a metabolic product from carbon dioxide such as methane or acetate (APMP), or in species that do not (AP). The shape of the bars indicates the estimated relative importance of that pathway in producing a particular fractionation factor. The changing carbon isotopic fractionation that has been observed for methanogens with growth phase (listed in APMP) is shown as a thin solid line extending up to the values expected for a culture at stationary growth. Fractionation factors observed in the metabolic products produced (acetate and methane) are shown as white boxes. Other work has shown that larger fractionations to methane can be obtained during methanogenesis in open systems (Botz and others, 1996; Valentine and others, 2004). The published fractionation factors for AP have been connected by a thin solid line; while these organisms all use the acetyl-CoA pathway, they demonstrate a wide range of isotopic fractionations. The carbon fixation pathways from methane are bacterial methanotrophy using the RuMP pathway (RuMP) or the Serine Pathway (S), and anaerobic methanotrophy (AOM). For the RuMP pathway, the different fractionation factors associated with soluble versus particulate methane monooxygenase is connected by a solid line because both forms of the enzyme is found in some species causing them to show a reduction in fractionation at the initiation of stationary growth. (B) The predicted carbon isotopic composition produced by natural microbial cells using each of the pathways shown, assuming fixation of CO2 with a typical carbon isotopic composition of about 7 per mil, or CH4 with a composition of 55 per mil. The labels "extreme", "moderate", and "low" refer to terminology used in fig. 3 and the corresponding discussion.
|
|
View this table:
[in this window]
[in a new window]
|
TABLE 1A Published results of growth experiments estimating carbon-isotopic fractionation from carbon dioxide to cell biomass, permil. The carbon fixation pathways are those listed in figure 1. Where available, isotopic fractionation to corresponding metabolic products are given in parentheses (MP; methane in most cases, except acetate for A. woodii). Updated from House and others, 2003a.
|
|
Note that
only defines a relative value of fractionation produced by the specific pathway, and that the actual carbon isotopic compositions produced will vary according to the isotopic composition of the initial substrate used.
The results in figure 1A indicate that the reductive pentose phosphate cycle, acetyl-CoA pathway, and some methanotrophic pathways can yield larger isotopic fractionations than the reductive TCA cycle and 3-hydroxypropionate cycle (Preuß and others, 1989; House and others, 2003a; Londry and Des Marais, 2003). House and others (2003a) also showed that biomass produced by acetyl-CoA pathway utilizing microbes can vary over a remarkably wide range, with
values of 2.7 to 8.0 permil for the Archaeoglobales and
values of 4.8 to 26.7 permil for the methanogens. The magnitude of carbon isotopic fractionation observed in methanogens is dependent on the growth status of the culture, with isotopic fractionation increasing as the cultures proceeded toward stationary growth phase. Furthermore, Londry and Des Marais (2003) have shown that different sulfate reducers using the acetyl-CoA pathway can demonstrate widely different fractionations (
values of both 10.0 and 30.5
). The fractionation associated with bacterial methanotrophy by RuMP has been shown to be up to about 30 permil for the particulate methane monooxygenase, but lower for the soluble methane monooxygenase (Summons and others, 1994; Jahnke and others, 1999). Finally, the fractionation associated with anaerobic methane oxidation has been estimated to be around 32 permil based on studies of natural cell carbon from seep environments (Orphan and others, 2001a; Orphan and others, 2002).
Figure 1B shows the predicted carbon isotopic composition produced by natural microbial cells using each of the pathways shown, assuming fixation of CO2 with a typical carbon isotopic composition of about 7 permil, or CH4 with a composition of 55 permil. The global range of carbon isotopic compositions for methane is between 50 to 110 permil (Whiticar, 1999), but is typically heavier (about 55
) in marine seeps subject to microbial methane oxidation (Whiticar, 1999; Orphan and others, 2004). This figure provides only an approximate guide to the expected carbon isotopic signatures that each metabolic pathway should produce in a natural setting, because the actual isotopic composition of either CO2 or CH4 can be highly variable in nature. Nevertheless, there is expected to be demonstrably different values of carbon isotopic composition found in natural microbial biomass. The most 13C-depleted carbon is produced by cells that have grown on natural biogenic methane. Methanotrophic cells using either an aerobic pathway (RuMP) or an anaerobic pathway (AOM) typically have cellular biomass that is around 85 permil (Orphan and others, 2001b, 2002) due to both a 13C-depleted methane source and isotopic fractionation during carbon fixation. Of the CO2 fixation pathways, the acetyl-Co A pathway can produce the most 13C-depleted biomass, but in other cases produces significantly less 13C-depleted biomass (from
13C of about 45
to about 10
). In methanogens and acetogens, this pathway produces biomass that is highly 13C-depleted (
13C about 30
). The same level of 13C-depletion is found for natural microbial cells using the reductive pentose phosphate cycle with type I rubisco (
13C around 30
), and somewhat less 13C-depletion is found for cells using the reductive pentose phosphate cycle with type II rubisco (
13C around 20
; Roeske and OLeary, 1984, 1985; Robinson and Cavanaugh, 1995). Finally, the two remaining carbon fixation pathways, the 3-hydroxypropionate cycle and the reductive TCA cycle, produce the least 13C-depleted biomass (with
13C values expected between 0
to 20
).
The Microbial Metallome
In addition to carbon and other traditional macro-nutrients such as nitrogen and phosphorous, organisms require more than 20 elements for cellular growth. In fact, a biological cell can be compositionally described by: (1) its genome, which contains a blueprint of all molecules required to carry out cellular processes (DNA); (2) its proteome, the individual proteins encoded for by the genome and expressed; or (3) its metallome, the twenty-some elements (including carbon, nitrogen, phosphorous, as well as bioactive trace metals) that make up both DNA and proteins and are involved in cellular reactions (Frausto da Silva and Williams, 2001). Bioactive metals are essential nutrients that function as structural elements and catalytic centers in metalloproteins and metal-activated enzymes involved in virtually all cell functions, including DNA and RNA synthesis, respiration and photosynthesis, electron transport, and cellular detoxification (Holm and others, 1996; Frausto da Silva and Williams, 2001). Iron, for example, is almost universally required for life, forming iron-sulfur proteins involved in mitochondrial electron transport, and important heme proteins used for electron transfer, oxidase formation, storage, and transport. Other metals have more limited biological uses associated with specific microbial physiologies. Molybdenum, for example, is an important enzyme in the nitrogen cycle, used in nitrate reductase and the nitrogenase enzyme necessary for nitrogen fixation (Burgess and Lowe, 1996; Eady, 1996; Hille, 1996; Kisker and others, 1997); nickel is found in enzymes catalyzing carbon reduction reactions of acetogenic and methanogenic organisms (Hausinger, 1987; Ragsdale and Kumar, 1996; Eitinger and Friedrich, 1997; Taha and others, 1997); and tungsten is most commonly used by hyperthermophiles and methanogenic archaea (Johnson and others, 1996; Lvov and others, 2002).
Conflicting ideas exist as to the extent of variance in the metallome of prokaryotic organisms. Previous work indicates that E. coli exert tight genetic control over cellular metal concentrations, even for relatively low-toxicity metals such as zinc (Outten and OHalloran, 2001). These researchers conclude that heterotrophic microbial cells cultured in the laboratory contain a background metal content, or "metal quota", that is conserved regardless of the metal content of the media. Other studies (for example Raven, 1988) suggest that organisms with specialized metabolic functions, such as nitrogen fixation (diazotrophy), may require up to 100 times more of a specific trace metal to sustain growth under nitrogen-limited conditions. Specialized prokaryotic trace metal requirements in excess of background cellular metal content could potentially provide a useful fingerprint of physiological function in poorly characterized microbial communities in the environment.
In addition to providing a possible link between cellular chemistry and physiology, the metallome also offers the most direct chemical link between biological systems and the environment. An element that is widely used by biological systems must be biologically available, or present in an easily extractable form, from the atmosphere or from solutions. Conditions such as temperature and the oxidation state of the atmosphere and oceans have fluctuated through geologic time, considerably affecting the bioavailability of redox sensitive elements. Some researchers suggest that decreased organic matter production and carbon isotope fractionation during the mid-Proterozoic could have been due to metal limitation of nitrogen fixation and primary productivity in a sulfidic ocean (Canfield, 1998; Anbar and Knoll, 2002). The trace metal preferences and sensitivities of cyanobacteria could also reflect evolution in a sulfidic environment (Saito and others, 2003). Modern biological systems have evolved sophisticated methods of acquiring essential metals under metal-limited conditions, and can significantly alter geochemical cycling in natural environments. For example, in terrestrial settings bacteria can enhance weathering rates through the production of chelators to promote solubilization of metals from minerals in soils (Kalinowski and others, 2000; Liermann and others, 2000; Brantley and others, 2001). In response to low concentrations of some essential metals in the surface of modern oceans due to phytoplankton depletion (Bruland, 1980, 1989; Martin and others, 1989; Rue and Bruland, 1995), various microorganisms release strong complexing agents and catalyze redox reactions that modify the bioavailability of these metals and promote their rapid cycling in the upper water column (for a review see Morel and Price, 2003).
We examine the trace metal content of the modern prokaryotic metallome predicted by the presence or absence of genes coding for metallo-enzymes in whole genome sequences. We utilize these data to determine if prokaryotic metallomes could provide physiological biosignatures, and to clarify the evolution of biological chemistry and its inter-dependence with the Earth surface environment. Due to the nearly universal biological role of iron, it is assumed that all organisms require cellular iron concentrations far in excess of other redox-sensitive metals. For this reason, we focused only on the predicted metallome contribution of non-iron trace metals (Zn, Mn, Cu, Mo, Co, Ni, W, and V) from prokaryotic genomes. Iron-containing enzymes were considered only if they contained an additional trace metal. In a second analysis, we estimated the approximate timing of evolution of metallo-enzymes (including those containing iron) from their functions and occurrences in modern organisms. This view of metallome evolution is then compared to modeled changes in the bioavailability of these metals through time from the literature (Saito and others, 2003).
 |
MATERIALS AND METHODS
|
|---|
Estimation of a Consensus Tree of Life
We estimated the topology of the tree of life from the combined gene content of taxonomic groups of prokaryotes. For this, we downloaded data matrices containing the presence or absence of homologous gene families in individual prokaryotes, as calculated in House and others (2003b). We then combined the data matrices for individual organisms in each group to produce a representative data matrix scoring the presence or absence of gene families in a specified percentage of individual organisms in that group, including gene families present in greater than 1, 10, 15, 20, 25, 50, and 99 percent of the organisms in the group (combined matrices available at http://www.geosc.psu.edu/
chouse/ajs1.html). We constructed phylogenetic trees based on the presence and absence of informative gene families in composite genomes, as previously described for individual organisms (Fitz-Gibbon and House, 1999; House and others, 2003b). Parsimony and distance analyses were performed on the combined matrices using PAUP version 4.0b (Swofford, 2002); compatibility and threshold parsimony analyses were applied using the Phylip software package (Felsenstein, 1993). We compared these results to published trees based on rRNA (Fox and others, 1980; Woese, 1987), concatenated protein datasets (Hansmann and Martin, 2000; Brown and others, 2001; Brochier and others, 2002; Matte-Tailiez and others, 2002), and other gene content methods (Gerstein, 1998; Gerstein and Hedgyi, 1998; Snel and others, 1999; Tekaia and others, 1999; Lin and Gerstein, 2000; Wolf and others, 2001; Bansal and Meyer, 2002; Clarke and others, 2002; Korbel and others, 2002; Li and others, 2002; Blank, 2004; Yang and others, 2005), and produced a consensus microbial tree of life based on this comparison. This tree does not represent a strict consensus, but simply our best estimate of phylogenetic relationships based on these disparate studies.
Predicted Carbon Isotope Fractionation
We produced a compilation of experimentally determined carbon isotope fractionation values as a function of carbon fixation pathways from data compiled in House and others, 2003a (data from Calder and Parker, 1973; Wong and others, 1975; Pardue and others, 1976; Quandt and others, 1977; Sirevåg and others, 1977; Fuchs and others, 1979; Mizutani and Wada, 1982; Belyaev and others, 1983; Holo and Sirevåg, 1986; Ruby and others, 1987; Preuß and others, 1989; Mirkin and Ruby, 1991; Popp and others, 1998; Jahnke and others, 2001; House and others, 2003a) and updated with additional published literature (Madigan and others, 1989; Summons and others, 1994; Jahnke and others, 1999; van der Meer and others, 2001a, 2001b; Orphan and others, 2001a, 2002; Londry and Des Marais, 2003; Londry and others, 2004; tables 1A and 1B; fig. 1A). We superimposed the results onto our consensus microbial tree of life based on the phylogenetic distribution of carbon fixation pathways (Andreesen and Gottschalk, 1969; Sirevag, 1974; Fuchs and Stupperich, 1978; Fuchs and others, 1980; Schauder and others, 1987, 1989; Holo, 1989; Altekar and Rajagopalan, 1990; Windhovel and Bowien, 1990; Meijer and others, 1991; Beh and others, 1993; Strauss and Fuchs, 1993; Kandler, 1994; Vorholt and others, 1995; Delwiche and Palmer, 1996; Ishii and others, 1996; Wahlund and Tabita, 1997; Ishii, 1998; Shively and others, 1998; Menendez and others, 1999; Watson and others, 1999; Wirsen and others, 2002; Hügler and others, 2003; Macalady and Banfield, 2003). We then used parsimony to extrapolate carbon isotopic fractionation at various nodes on the tree, including the last common ancestor (LCA).
Computing the Model Metallome
We compiled a list of known metallo-enzymes utilizing the bioactive metals iron, zinc, copper, molybdenum, manganese, cobalt, nickel, tungsten, and vanadium, and the number of atoms of metal required per molecule of enzyme, when known (data tables available at http://www.geosc.psu.edu/
chouse/ajs1.html; Burgess and Lowe, 1996; Dismukes, 1996; Hille, 1996; Holm and others, 1996; Johnson and others, 1996; Lipscomb and Sträter, 1996; Ragsdale and Kumar, 1996; Berman and others, 2000; Frausto da Silva and Williams, 2001; Lvov and others, 2002; and references therein). We created a mysql database of genome content for the complete genomes of 52 prokaryotes from data downloaded from the NCBI site (www.nbi.nih.gov or ftp.ncbi.nih.gov), which contains reannotated microbial genomes for maximum similarity. For each gene in a genome, the NCBI database contains information including, but not limited to, fields for the gene name, the Clusters of Orthologous Groups (COG) code (Tatusov and others, 1997), and the COG product. We then used multiple arrangements of the common names and synonyms of each of the metal-containing enzymes we identified to search within our mysql genome database for the presence of a gene or genes encoding each metallo-enzyme. The presence of genes coding for metal-containing enzymes in each genome was then used to compute a "model metallome" for each organism.
In computing the model metallome, we made a series of assumptions. First, for each gene known to code for a metallo-enzyme, we assumed that one enzyme molecule would be expressed. We then defined the model metallome, TMe, for this one enzyme per gene system by summing all the metal atoms in the expressed enzymes, such that if Mei is the total number of atoms of any individual metal, i, in that model metallome, then
 | (3) |
We calculated the fractional metal contribution of each metal i to the model metallome (Fi), by the following equation:
 | (4) |
Due to the nearly ubiquitous use of iron in biological systems, iron-containing enzymes were considered only if they contained an additional metal cofactor. In that case, only the number of non-Fe atoms appeared in the TMe summation. Thus, strictly, Fi is the fractional metal contribution to a model metallome where Fe atoms have been excluded. Values of Mei, TMe, and Fi for individual metals are shown for each organism in table 2.
View this table:
[in this window]
[in a new window]
|
TABLE 2 The organisms considered in this study, their groupings in discussed analyses (including phylogenetic groups, optimum growth temperature, oxygen tolerance, and any specialized metabolisms considered), the number of atoms of each individual metal in a given model metallome (Mei), the total model metallome (TMe), and the fractional contribution of each metal to the model metallome (Fi, in %). Abbreviations used are defined below.
|
|
In any natural system, the microbial metallome can be altered either by changing the genome through the addition or loss of genes encoding for a metallo-enzyme, or by changing the proportion of genes expressed by the organism. In order to make generalizations about TMe across a wide range of microbial cellular functions, we assumed that all enzymes encoded in the genome are produced in equal proportions within the cell at all times. Thus, the "model metallome" represents the metals in all metallo-enzymes that can be produced by an organism (excluding Fe-only enzymes). At any given time under any given environmental condition, the actual expressed cellular metallome in the organism may differ substantially from the theoretical "model metallome" we calculate, due to changes in gene expression.
We then sought to look at how the model metallome varied across the consensus phylogenetic tree. We tested sets of individual Fi values of organisms within phylogenetic groups for similarity using the chi-square (Pearson goodness of fit) test for general association of variables. The test was used to determine whether or not the organisms have statistically similar model metallomes within the groups, and statistically different model metallomes between groups. This particular statistical analysis is generally weakened by the occurrence of counts of less than five for the variable in question; therefore tests performed using each individual Fi count were checked against tests combining Fi counts less than five, commonly FNi, FV, and FW. The model metallome for phylogenetic groups was statistically significant in all but two cases: the model metallomes for the firmicutes fit into three statistically significant categories that were unrelated to previously recognized phylogenetic groupings, and the model metallome of methanogenic archaea fit into two statistically significant groups, Methanosarcina sp. and chemoautotrophic methanogens (excluding M. thermoautotrophicum).
We extrapolated these model metallomes on a consensus tree representing our analysis of combined gene families considered along with the most widely accepted microbial tree topologies (see discussion below). We calculated the extrapolated metallomes for each node by computing the mean Fi for each metal of all branches diverging from the node. For example (see fig. 7A), the Fi values assigned to the branching point of the firmicutes and actinobacteria is calculated as an average of the Fi values for the three groups of firmicutes and the Fi values for the actinobacteria. In turn, the Fi values of earlier nodes on the tree were calculated based on the values determined for the lineages that diverge from them.

View larger version (22K):
[in this window]
[in a new window]
|
Fig. 7. (A) Model metallomes extrapolated onto a consensus tree of life. (B) Possible scenario for metallome evolution based on this extrapolation.
|
|
 |
COMPOSITE ORGANISM PHYLOGENY AND A CONSENSUS TREE OF LIFE
|
|---|
We estimated the topology of the tree of life from the combined gene content of taxonomic groups of prokaryotes. Trees based on gene families present in
20 percent of individual genomes in each taxonomic group are shown in figure 2A (maximum parsimony) and in figure 2B (compatibility). We chose to present the trees based on twenty percent because using gene families present in
25 percent of genomes in a group resulted in composite genomes with very few gene families unique to each group, and thus produced primarily unresolved trees. Trees produced with gene families present in
15 percent and
20 percent of genomes in a group were nearly identical. Additionally, a cutoff point of 20 percent produced the most consistent trees between the various tree-building methods we employed.

View larger version (12K):
[in this window]
[in a new window]
|
Fig. 2. (A) Composite organism genomic tree generated by maximum parsimony. (B) Composite organism genomic tree generated by compatibility. (C) Consensus tree representing our analysis of combined gene families considered along with the most widely accepted microbial tree topologies.
|
|
In general, different tree-building methods have different strengths and weaknesses, leading to differing tree topologies. Therefore, based on comparisons of our results with trees in the literature constructed from ribosomal RNA, the concatenation of large protein-data sets, and other gene content methods (see references above), we developed a tree best representing the present consensus on microbial relationships (fig. 2C). For the consensus tree, the two groups of gram positive bacteria (the firmicutes and the actinobacteria) were united based on their overall similar cell structure, in spite of the limited molecular support for this grouping (Brown and others, 2001; Wolf and others, 2001; Fu and Fu-Liu, 2002; Blank, 2004). Our resulting tree of life, shown in figure 2C, is broadly similar to that based on rRNA (for example: Fox and others, 1980; Woese, 1987) with the following exceptions: (1) the methanogenic archaea are united together based on whole genome analysis (Slesarev and others, 2002; House and others, 2003b; Yang and others, 2005); and (2) cyanobacteria have been placed as a sister group to the gram positive bacteria (Gerstein, 1998; Hansmann and Martin, 2000; Blank, 2004; this study). Our trees of composite organisms (fig. 2A and 2B) differ from the consensus tree (fig. 2C) in that our genomic trees failed to unite the Proteobacteria, and placed the Thermoplasmales at the base of the tree, presumably due to difficulties in analyzing genomes of differing size (House and Fitz-Gibbon, 2002).
 |
INFERRED CARBON ISOTOPE SIGNATURES OF ANCIENT ORGANISMS
|
|---|
Based on the observation that the reductive TCA and 3-hydroxypropionate cycles result in low carbon isotope fractionation between CO2 and biomass, the reductive pentose-phosphate cycle and acetyl-CoA pathway result in moderate fractionation between CO2 and biomass, and pathways using other lighter substrates result in extremely 13C depleted biomass (as in fig. 1), we superimposed the distribution of modern carbon fixation pathways and the resulting carbon isotope signatures onto a modified version of our consensus tree of life (fig. 3). From extrapolation by parsimony, we then inferred the carbon fixation and approximate isotopic signature for the common ancestors of various ancient prokaryotic groups, including the last common ancestor (LCA). For ethanotrophy (E), a biological process for which fractionation factors have not been measured, we assumed some degree of biological fractionation from natural ethane, which is itself isotopically depleted. For example, measured
13C values for ethane in deep sea sediments are around 60 permil due to biological fractionation during synthesis (Oremland and others, 1988; Waseda and Didyk, 1995; Paull and others, 2000).

View larger version (25K):
[in this window]
[in a new window]
|
Fig. 3. Carbon isotopic signatures from the literature (fig. 1; tables 1A and 1B) superimposed on our consensus tree of life (fig. 2C). For this analysis, we additionally added single organisms with known carbon fixation pathways that did not have representative whole genome sequences or did not fit into previously assigned phylogenetic groupings. Carbon fixation pathways are noted as in fig. 1, with the addition of ethanotrophy (E). Note that we positioned the Haloarchaea as a sister group to the Methanoarchaea and Archaeoglobus, within the Euryarchaeota. This placement is primarily based on arguments presented in House and others (2003b), however it is currently the subject of debate (for example, Korbel and others, 2002; Yang and others, 2005). Additionally, the anaerobic methane oxidizing archaea (ANME groups) have been included with the Methanoarchaea (Orphan and others, 2001a). Ancient isotopic signatures are extrapolated using simple parsimony.
|
|
The results shown in figure 3 suggest the following: (1) both low and moderate carbon isotope fractionation were established quickly in the early evolution of life; and (2) methanotrophic and ethanotrophic metabolisms capable of producing biomass with extreme 13C depletions are not primitive, but rather evolved after the major groups of Prokaryotes had already diverged. Based on the distribution of carbon fixation pathways in figure 3, it appears more or less equally parsimonious to assign low or moderate carbon isotope fractionation to the base of the Bacteria and the base of the Archaea, and thus to the last common ancestor. This conclusion follows because it is difficult to extrapolate the relative antiquities of the reductive TCA cycle, acetyl CoA pathway, and reductive pentose phosphate cycle. For example, the last common ancestor of the Archaea could have exhibited high fractionation through either a variant of the acetyl CoA pathway as found in methanogens, or through the reductive pentose phosphate cycle, as found in the haloarchaea (Rawal and others, 1988).
It has been suggested that the acetyl Co-A pathway is the oldest carbon fixation pathway (Huber and Wachtershauser, 1997; Martin and Russell, 2004). Alternatively, the wide phylogenetic distribution of the reductive pentose phosphate cycle (Rawal and others, 1988; Ivanovsky and others, 1999), and the ubiquitous occurrence of diverged forms of Rubisco in both Bacteria and Archaea support the antiquity of this pathway (Finn and Tabita, 2003; Hügler and others, 2003). The Rubisco enzyme, however, could have had a complex history, originally evolving as an oxygenase (Hanson and Tabita, 2001). The TCA cycle, which seems to have evolved originally as a reductive pathway (Romano and Conway, 1996), is universally important in biosynthesis and therefore must have evolved especially early making it perhaps the most likely carbon fixation pathway for the last common ancestor (Wachtershaüser, 1990; Morowitz and others, 2000). For the purposes of extrapolating carbon isotopic signatures to the LCA, if the reductive TCA cycle is the oldest carbon fixation pathway, then the LCA would demonstrate low fractionation.
We expect that carbon isotopic signatures of biomass identified experimentally in modern organisms using different carbon fixation pathways have been manifested in isotopic compositions of ancient microorganisms, and subsequently preserved in microfossils (House and others, 2000; Ueno and others, 2001; Kaufman and Xiao, 2003) and other organic matter in the rock record (for example, Hayes and others, 1983). Carbon isotopic studies of Precambrian sediments have traced a distinctive isotopic signature of highly fractionated biomass, resulting from biological carbon fixation, to
3,500 million years ago (Schidlowski and others, 1983; Hayes and others, 1992). For example, recent studies of Buck Reef Chert, South Africa, have shown that photosynthetic mats were an important microbial ecosystem in shallow water environments at 3,416 million years ago (Tice and Lowe, 2004). These laminated cherts contain carbonaceous matter with a carbon isotopic composition of 20 to 30 permil, consistent with fractionation by the reductive pentose phosphate cycle (Tice and Lowe, 2004). It is now possible to partially explain the isotopic fractionations found throughout the Archean through anoxygenic photoautotrophy. In the past, such an explanation was hindered (House, ms, 1999) because the known deeply diverging lineages of anoxygenic phototrophs, Chloroflexus and Chlorobium, do not use the reductive pentose phosphate cycle; and therefore do not demonstrate sufficiently large isotopic fractionations during carbon fixation to explain the isotopic record prior to the evolution of Cyanobacteria or photosynthetic Proteobacteria (like Chromatium). However, it is now known that Oscillochloris (Keppen and others, 2000), a Green Non-sulfur Bacteria, uses the reductive pentose phosphate cycle for carbon fixation (Ivanovsky and others, 1999). This lineage, along with the recently described Candidatus Chlorothrix which has been shown not to the use the 3-hydroxypropionate cycle (Klappenbach and Pierson, 2004), may be modern examples of the kinds of anoxygenic photoautotrophs that built microbial mats 3,500 million years ago producing organic material with moderately high 13C-depletions. Furthermore, the prospect that anoxygenic photoautotrophs, such as Oscillochloris, were mat builders in the early Archean is consistent with reported "cyanobacteria-like" microfossils of that Eon (Schopf and Packer, 1987; Schopf, 1993; Schopf and others, 2002), as the morphology of Oscillochloris is remarkably similar to that of filamentous cyanobacteria (Gorlenko and Pivovarova, 1977; Keppen and others, 1994).
Compared to younger Archean sediments, a relatively low fractionation has been observed in turbidites from the 3,800 million years old Isua terrain in central West Greenland (Rosing, 1999), and a large amount of 13C-depletion has been reported in graphite inclusions in apatite from the 3,850 million years old Akilia Island metasediments in southwest Greenland (Mojzsis and others, 1996). For the Isua terrain, the carbon isotopic composition of the organic matter in turbidite sediments (about 19
; Rosing, 1999) and inclusions in various mineral phases (down to about 18
; Ueno and others, 2002) could be consistent with fractionation by a primitive autotroph using the reductive TCA cycle (van der Meer and others, 2000). However, since the carbon isotopic results from the more ancient Akilia Island seem to show greater 13C-depletion, an explanation of this carbon is more complicated (also see Knoll, 2003). The Alkilia Island results would suggest that: 1) the last common ancestor of life dates from earlier than 3,850 million years ago; 2) processes other than biologic carbon fixation have contributed to the apparently large magnitude fractionations reflected in the ancient Akilia samples; and/or 3) the reductive TCA cycle is not the oldest carbon fixation pathway. (Note: during final preparation of this manuscript, Lepland and others, 2005, published concerns about the relevance of Akilia apatites as a recorder of the past record of life on Earth based on their inability to reproduce the earlier results of Mojzsis and others, 1996).
For example, it is possible that life originated and diversified extensively before the 3,800 to 3,900 million year old sedimentary units were deposited. Growing evidence suggests that the Earth was habitable before the oldest preserved sediments. Zircons from arc volcanism, dated from 4,200 to 4,400 million years ago, have been recovered from Jack Hills, Western Australia, suggesting that oceans were present at that time (Mojzsis and others, 2001; Wilde and others, 2001). Additionally, studies of the history of lunar impacts suggest that the Earth was comparatively peaceful from
4,400 to
3,900 million years ago, between the completion of an accretionary event
4,450 million years ago and a late intensive bombardment at
3, 900 million years ago (Ryder, 2003), allowing for a period of prior planetary habitability.
 |
MODEL METALLOMES OF MODERN AND ANCIENT ORGANISMS
|
|---|
Comparing Model Metallomes from Modern Genome Sequences
We hypothesized that the prokaryotic metallome could provide a further biosignature of microbial physiology in the environment. To test this hypothesis, we examined the model metallomes of modern prokaryotes predicted by the presence or absence of genes encoding for metallo-enzymes in whole genome sequences, and compared these values across specific metabolisms, oxygen tolerance, optimum growth temperature, and phylogeny (fig. 4 and fig. 5). When making broad comparisons across groups of organisms, if we suppose equal gene expression and a relatively constant TMe, then a higher Fi value for a specific metal means that the organism generally requires more atoms of that metal, if all enzymes are being expressed equally.

View larger version (13K):
[in this window]
[in a new window]
|
Fig. 4. Average Fi values predicted by whole genomes compared across: (A) specific metabolisms; (B) oxygen tolerance; and (C) optimum growth temperature. Note the scaling (Zn/2, Mn/2, Nix2, Wx2, and Vx2).
|
|

View larger version (14K):
[in this window]
[in a new window]
|
Fig. 5. Average Fi values compared across phylogenies: (A) Bacteria and Archaea; (B) individual groups of archaea; and (C) individual groups of bacteria. Note the scaling as in figure 4 (Zn/2, Mn/2, Nix2, Wx2, and Vx2).
|
|
Results of these comparisons suggest that the use of metals in prokaryotes when analyzed as a group follows the general hierarchy Fe >> Zn > Mn >> Co, Cu, Mo >> Ni > W, V (fig. 4 and fig. 5). As explained above, iron is nearly universally required for life and therefore not even considered in the model metallome as we define it. Zinc (FZn 30 50%) forms metallopeptidases, metallophosphatases, and many other zinc polymerases involved in DNA and RNA binding and synthesis (Lipscomb and Sträter, 1996). Manganese (FMn 20 30%) catalyzes several redox reactions, hydration-dehydration, isomerizations, phosphorylation-dephosphorylation, phosphoryl transfer, and dioxygen production in photosynthesis (Dismukes, 1996; Yachandra and others, 1996). Cobalt (FCo 10 20%) forms the cobalamin cofactor of the essential vitamin B12, which is required for a number of enzymes (Marsh, 1999). Copper (FCu 10 20%) forms the common blue copper proteins used in electron transfer, often associated with oxidative enzymes and energy capture (Guss and Freeman, 1983; Solomen and others, 1996). And molybdenum (FMo 5 20%), in addition to its involvement in nitrogen metabolism (in nitrogenase and nitrate reductase), is used in a wide variety of other enzymes that catalyze an oxygen atom transfer or two-electron reaction, including xanthine oxidase, sulfate reductase, and fumarate dehydrogenase (Burgess and Lowe, 1996; Eady, 1996; Hille, 1996; Kisker and others, 1997; Frausto da Silva and Williams, 2001). Nickel, tungsten, and vanadium have much more limited biological uses, as discussed below.
We compared model metallomes for organisms with specialized metabolic functions to determine if their use of specific metallo-enzymes produces a unique metal signature that could be apparent in the expressed cellular metallome (fig. 4A). We chose to compare organisms that perform methanogenesis, nitrogen fixation, and oxygenic photosynthesis, as these are metabolic functions that are important in elemental cycling in the environment and are each associated with one or more specific metallo-enzymes.
Model metallome comparisons indicate that of the metabolisms we tested, only the model metallome of methanogens contains a unique signature. Methanogenic archaea have significantly elevated FNi and FW (FNi
9% and FW
5% for methanogens, versus FNi
2% and FW
2% for non-methanogens; fig. 4A and fig. 6A). Nickel is extremely important in methanogenesis, forming the cofactor of two primary enzymes involved in methane production, methyl-coenyzme M reductase and acetyl-CoA synthase. It is also used in carbon monoxide dehydrogenase, which can catalyze the reduction of CO2 to carbon CO and subsequent assembly of acetyl-CoA, key to pathway of carbon fixation for acetogenic methanogens (Ragsdale and Kumar, 1996). In non-methanogens nickel is additionally used in urease, some hydrogenases, and rarely in superoxide dismutase (Frausto da Silva and Williams, 2001). Hyperthermophilic methanogens use tungsten in aldehyde ferredoxin oxidoreductase (AOR) enzymes, and in some thermophilic methanogens tungsten is also used preferentially to molybdenum in formylmethanofuran dehydrogenase, catalyzing the first step in the conversion of CO2 to CH4 (Lvov and others, 2002).

View larger version (13K):
[in this window]
[in a new window]
|
Fig. 6. (A) FCu versus FNi values for aerobes, methanogens, and non-methanogenic anaerobes. Average values for each group are shown as open symbols. (B) FMo and FW for organisms plotted against optimum growth temperature (for organisms with a range of optimum growth temperatures, the mean temperature was used). Average values for each group (abbreviated as in table 1) are connected with a line and shown as open symbols.
|
|
Our examinations indicate the model metallomes of nitrogen fixers and cyanobacteria, on the other hand, do not contain unique metal signatures. We calculate only slightly higher FMo values in nitrogen-fixing microorganisms (FMo
12% for diazotrophs, versus FMo of 10% for non N-fixers), despite their requirement for molybdenum in the Fe-Mo cofactor of the nitrogenase enzyme responsible for nitrogen-fixation. In the absence of molybdenum, some organisms can form alternative nitrogenases containing vanadium and iron or iron only (Eady, 1996; Burgess and Lowe, 1996; Howard and Rees, 1996), so that diazotrophs additionally have a small but insubstantial FV (1 2%) not found in most non N-fixers, as the only other well-understood use of vanadium in biology is in haloperoxidases (Frausto da Silva and Williams, 2001). Due to the substantial requirement of manganese in photosystem II, which is responsible for oxygenic photosynthesis (using 4 atoms of manganese per molecule of enzyme; Yachandra and others, 1996), we expected that oxygenic phototrophs might exhibit elevated FMn values over non-phototrophs. However, the two cyanobacteria in our database have a mean 23 percent FMn, an increase of only one percent over non-phototrophs.
High FNi and FW values in methanogenic archaea support the idea that these organisms may contain elevated cellular nickel and tungsten concentrations in excess of other organisms, providing a biosignature for methanogens utilizing the known Ni- and W-containing enzymes in the environment. In fact initial measurements of the cellular metal content of Methanosarcina demonstrate high nickel and tungsten concentrations (House, unpublished data). Furthermore, in our calculations of a model metallome we ignored gene expression, assuming that all enzymes were expressed to the same extent in all organisms at all times. However, in different environments under different conditions, specific proteins will be expressed in higher quantity than others in a given organism. For example, Raven (1990) calculated an increase in both manganese and iron requirements for photosynthesis at low photon flux densities and with different ATP sources and CO2 assimilation mechanisms. Additionally, growth in the absence of fixed nitrogen requires a high concentration of the nitrogenase enzyme, and as a result some diazotrophs accumulate 10 to 20 percent of the total cell protein as nitrogenase under nitrogen-limited conditions (Martinez-Argudo and others, 2004). If the metallo-enzymes responsible for specific metabolic capabilities such as nitrogen fixation and oxygenic photosynthesis are expressed in excess of other enzymes under conditions when these metabolisms are active, then this would result in an amplification of the metal in the expressed cellular metallome beyond that predicted by our model. Thus the expressed metallome could still potentially contain a biosignature for the activation of these metabolisms in nature.
Next we compared the model metallomes of strict aerobes and strict anaerobes (fig. 4B). This comparison indicates that anaerobes have elevated FNi and FW and slightly lower FCu and FMo than aerobes (average for anaerobes: FNi and FW > 3%, FCu
8%, FMo
9%; average for aerobes FNi and FW
2%, FCu
10%, FMo
11%). A closer examination of FNi for individual organisms (fig. 6A) clearly shows that the increased nickel signature for anaerobes is due to the bias in our anaerobic database for methanogenic archaea, which have elevated FNi as explained above. This examination also shows that most anaerobes have FCu less than 9 percent, while aerobes have FCu values up to twice that (fig. 6A). The two exceptions were M. jannaschii and A. fulgidus. This is difficult to explain for A. fulgidus; however, M. jannaschii has a relatively small total number of identified metallo-enzymes, most likely due to poor genome annotation (TMe of 30 for this organism, much less than the average TMe of 51 for all organisms examined), such that even though it only requires 3 atoms of copper in our model (in amine oxidase and nitrate reductase), this leads to a higher overall FCu.
We also compared the model metallomes of organisms with varying optimum growth temperatures (fig. 4C). Increasing trends in FMo and FW are seen with increasing thermophily (fig. 4C and fig. 6B). FMo increases from around 8 percent in the one psychrophile we examined to greater than 12 percent in hyperthermophiles. At high temperatures, molybdenum forms the primary cofactor of enzymes such as formyl methanofuran dehydrogenase (Lvov and others, 2002). FW is
4 percent for hyperthermophiles,
3 percent for thermophiles,
2 percent for mesophiles, and
1 percent for the one psychrophile examined. As mentioned above, tungsten is used in AOR enzymes, best known from the hyperthermophile Pyrococcus furiosus (Mukund and Adams, 1991; Chan and others, 1995; Kletzin and others, 1995). Hyperthermophilic archaea also contain two other AOR-type tungstoenzymes that catalyze the oxidation of aldehydes, formaldehyde ferredoxin oxidoreductase and glyceraldehyde ferredoxin-oxidoreductase (Johnson and others, 1996). In some thermophilic organisms, tungsten can also be substituted for molybdenum in molybdo-enzymes such as formylmethanofuran dehydrogenase (Lvov and others, 2002). Despite the small increase in FW, tungsten is such a minor contributor to the total metallome (
6% for all organisms in this study), that unless the proportion of tungsten-containing proteins expressed in an organism or group of organisms (that is hyperthermophiles) is found to be higher than that of other metallo-enzymes, tungsten content will not be a useful biomarker for thermophiles. Preliminary studies indicate cellular tungsten concentrations in all hyperthermophiles studied thus far are less than 0.1 parts per billion (Cameron and others, 2004).
Similar comparisons can be made between model metallomes across phylogenetic groups (fig. 5). These comparisons suggest relatively constant FZn, FMn, FCu, and FCo, with fluctuating FMo, FNi, and FW. The average archaeal metallome predicts higher FMo (
13%), FNi (
4%), and FW (
4%), when compared to the average bacterial metallome (FMo 10%, FNi
2%, and FW
2%; fig. 5A). Individual groups of Archaea have much more widely variant model metallomes (fig. 5B) than individual groups of Bacteria (fig. 5C). All archaeal and bacterial groups shown in figures 5B and 5C have statistically significant model metallomes, except for the firmicutes, which had to be separated into three groups unrelated to previous phylogenetic groupings. Mean values for FMo, FCo, FNi, and FW for Archaeal groups all vary significantly (fig. 5B). Again, the most clear signals are elevated FNi and FW for Methanosarcina sp. and chemoautotrophic methanogens, and elevated FW for the methanogens and thermophilic Thermoplasmales. Compared to the groups of Archaea, groups of Bacteria are fairly uniform, with only the Firmicutes (a mean of all three statistically significant groups) and cyanobacterial FNi differing significantly.
Evolution of the Metallome from Modern Genome Sequences
Model metallomes of phylogenetic groups can be mapped onto our consensus tree of life (fig. 2C), and used to infer evolutionary changes (fig. 7). Extrapolation to the base of the tree of life by consecutive calculation of each mean Fi from the Fi of converging branches projects a model metallome for the LCA: FZn
40 percent, FMn
20 percent, FMo, FCo, and FCu
10 percent each, FNi and FW
3 percent (fig. 7). The predicted model metallome of the LCA is very similar to the average model metallome we calculated from genomes of modern anaerobes in this study, as in figure 4 (FZn = 44%, FMn = 21%, FMo = 9%, FCo = 11%, FCu = 8%, FNi = 3% and FW = 3%; fig. 8). This similarity is supported by the isotopic evidence for life more than 3,800 million years ago (see discussion above), prior to the oxygenation of the Earths atmosphere (Rye and Holland, 1998).

View larger version (14K):
[in this window]
[in a new window]
|
Fig. 8. Model metallomes for: (A) the LCA, based on metallo-enzymes inferred to be present at this time, from figure 9; (B) the LCA, based on extrapolation of model metallomes from our consensus tree in figure 7; and (C) the average of modern anaerobes, from figure 5.
|
|
A hypothesis for the possible evolution of trace metal use in the microbial metallome can be formed by assuming that Bacteria and Archaea with mean Fi values evolved from the last common ancestor, and then specific groups of Bacteria and Archaea evolved from these organisms (fig. 7). The predicted changes in the metallome with each evolutionary step may have been caused by: 1) changes in trace metal availability due to fluctuating redox conditions in the organisms environment; 2) competition between evolving organisms for available trace metals; 3) opportunistic evolution of enzymatic uses for widely available but previously unused trace metals; 4) gain of genes encoding for metallo-enzymes by lateral gene transfer; and/or 5) loss of genes encoding for metallo-enzymes along with unnecessary metabolic functions. Early in the evolution of life, it is likely that changes in trace metal availability due to variable redox conditions and competition with other organisms were important in driving biological metal use. As complex life evolved and developed specialized lifestyles, lateral gene transfer and loss of extraneous genetic material likely became important in shaping trace metal utilization (for example, Gogarten and others, 2002).
Evolution of the Metallome from Metallo-enzyme Functions
Another way to view the evolution of the use of metals in biology is by considering the timing of evolution of modern metallo-enzymes, based on their individual function and occurrence in life today (fig. 9). Although some enzymes are believed to have initially developed for one biological use and later modified and adapted to fit a different function (for example nitrogenase; Silver and Postgate, 1973; Normand and Bousquet, 1989), we assume a unique origin for each individual metallo-enzyme, and assign it an inferred timing of evolution based on the rules presented in table 3. The metallo-enzymes can then be mapped onto a geochemical timeline based on the inferred timing of major evolutionary events and we can examine how the use of these metals has developed with changes in global ocean redox chemistry (fig. 9). Note: a growing body of evidence indicates that beginning around 1,800 million years ago, after an initial pulse of oxygen into the atmosphere, the Earths oceans experienced a prolonged period of deep water euxinia, before a second pulse of O2 brought atmospheric levels to near current values (Canfield, 1998; Shen and others, 2002, 2003; Arnold and others, 2004; Poulton and others, 2004; shown as a dark hatch in fig. 9B). This could have dramatically altered the bioavailability of many redox-sensitive metals during this time (fig. 9B; Saito and others, 2003). Some researchers have speculated that this crisis in metal bioavailability could have forced a change in the use of specific metals in metallo-enzymes (Anbar and Knoll, 2002; Saito and others, 2003). However, as we have no conclusive evidence for biological innovation at this time, we can only speculate as to what these changes might have been. Therefore, although the bioavailability of metals during the mid-Proterozoic calculated by Saito and others (2003) is shown in figure 9B, our interpretation of the evolution of metal use with bioavailability considers only a single oxygenation event.

View larger version (17K):
[in this window]
[in a new window]
|
Fig. 9. (A) A second way to view the evolution of the use of metals in biology, considering the timing of evolution of known modern metallo-enzymes based on their individual function and occurrence in life today (using rules outlined in table 3). The inferred evolutionary timing of these enzymes is based on the estimated timing of major biological innovations (from Young, 1992; Golubic and others, 1995; Beaumont and Robert, 1999; Brocks and others, 1999; Summons and others, 1999; Shen and others, 2001; Habicht and others, 2002; Battistuzzi and others, 2004) and the estimated timing of oxygenation of the Earths atmosphere (Holland, 1984; Knoll and others, 1986; Holland and others, 1990; Des Marais and others, 1992; Canfield and Teske, 1996; Rye and Holland, 1998). The numbers on the bars refer to the number of metallo-enzymes containing a given metal inferred to be in use during that time (for example, 88 Fe-containing enzymes were considered to exist on Earth today), and the thickness of the bars is scaled to reflect the proportions of each metal used among total metal-containing enzymes at any given time. (B) Approximate concentrations of the considered metals and other species in seawater with progressive oxygenation of the oceans, based on assumptions made by Saito and others (2003), except molybdenum concentrations, which were derived from other sources (Bertine and Turekian, 1973; Collier, 1985; Lewis and Landing, 1992; Stumm and Morgan, 1996; Morford and Emerson, 1999; Anbar and Knoll, 2002). The hatched area refers to the period of time during which the Earths oceans likely experienced a prolonged period of deep water euxinia (based on the most recent dates from Poulton and others, 2004).
|
|
The metals we examined fall into one of three categories: (1) metals with biological uses that developed along with their bioavailability (Cu and Mo); (2) metals with biological uses that developed counter to their bioavailability (Fe and Mn); and (3) metals with biological uses that did not fluctuate significantly through time (Zn, Co, Ni, W, and V).
The biological uses of both copper and molybdenum have increased along with their bioavailability since oxygenation of the Earths atmosphere. We speculate the maximum number of metallo-enzymes utilizing copper doubled from 10 to 20 enzymes with oxygenation. Today copper is commonly used in blue proteins involved in electron transfer, especially in oxidative enzymes and energy capture (Guss and Freeman, 1983). Copper enzymes are also commonly used in oxidase reactions involving small non-metal compounds that only became available to any extent with the rise of dioxygen, including phenols, amines, ferrous ions, and oxidized nitrogen species (Solomen and others, 1996). Similarly, the maximum number of metallo-enzymes utilizing molybdenum more than doubled from 13 to 27 enzymes with oxygenation. Today, molybdenum enzymes catalyze reactions ranging from six-electron reductions (for example nitrogenase) to oxygen atom transfer. The increased use of copper and molybdenum in oxygen transformation is consistent with an increase in the biological availability of these metals with complete oxidation of the oceans (fig. 9). Under oxidizing conditions such as the modern ocean, copper is moderately available as Cu2+, and molybdenum forms the highly mobile molybdate anion, MoO42-, which is unreactive in seawater, exhibiting a conservative profile with an average concentration of 105 nanomolar (Emerson and Huested, 1991).
In contrast to Cu and Mo, biological use of iron and manganese has increased counter to their bioavailability since oxidation of Earths atmosphere (fig. 9). We speculate that the maximum number of metallo-enzymes using iron has nearly tripled from 26 to 74 enzymes with oxygenation, while the maximum number of enzymes using manganese has increased from 13 to 18, despite the predicted decrease in the availability of these metals in Earth systems with progressive oxidation of the atmosphere (fig. 9). Despite the low solubility under oxidizing conditions of minerals containing these elements, both Fe and Mn are ubiquitous in natural environments. Reduced ferrous iron (Fe2+) is readily available under anoxic conditions such as on the early Earth, where it could have easily been used to form early metallo-enzymes (Holland, 1973). However, in a transitional sulfidic ocean iron would have been highly complexed with sulfides, and might have existed in concentrations three orders of magnitude lower than Archean seawater (Lewis and Landing, 1991, 1992; Saito and others, 2003). In modern oxygenated oceans iron is estimated to exist at no more than nanomolar concentrations, due to the formation of insoluble Fe-hydroxides (Gordon and others, 1982; Landing and Bruland, 1987; Martin and Gordon, 1988; Bruland and others, 1991; Donat and Bruland, 1995). Despite the decreasing availability of iron under surface conditions with the progressive oxidation of the Earth, the use of iron in transformations of molecular oxygen and in dealing with oxygen radical toxicity seems to have increased with the rise of oxygen in the Earths atmosphere. Irons use in enzymes dealing with oxygen radicals (for example, superoxide dismutase), and in multiple reductase enzymes (sulfite, nitrite, et cetera) could have evolved in organisms living in anoxic environments where iron was still abundant. In contrast, the wide use of iron in molecular oxygen transformations in most dioxygenases suggests that mechanisms for acquiring iron in oxic environments might have evolved very early in life as an adaptation to rising oxygen in the Earths atmosphere (Raven, 1995). For instance, some microorganisms secrete ligands, called siderophores, designed almost exclusively to bind and take up Fe3+ (for example, Neilands, 1984; Hughes and Poole, 1989). These siderophores are low-molecular weight organic molecules, commonly catechols or hydroxamates, with formation constants for ferric iron as high as 1051 (Hider, 1984; Hughes and Poole, 1989; Winkelmann, 1991; Hersman and others, 1995).
Manganese has similar redox sensitivity, and can therefore substitute into some iron-containing enzymes (such as superoxide dismutase). Other manganese enzymes are used in a wide variety of functions, most notably dioxygen production, as in photosystem II, catalase, superoxide dismutase, and in the organelles of eukaryotes (Frausto da Silva and Williams, 2001). These functions might have evolved later in life after organisms began to produce molecular oxygen and learn to deal with free oxygen radicals. Furthermore, over some pH ranges manganese minerals tend to have higher solubility than iron minerals even under oxidizing conditions.
Biological uses of other metals (Zn, Co, Ni, W, and V) do not seem to have fluctuated significantly through time (fig. 9). This may be because organisms established very specific roles for these metals early in life, or because the metals have been subsequently replaced in metallo-enzymes by more bioavailable cofactors. Zinc, for example, is used in many ubiquitous enzymes such as metallopeptidases, metallophosphatases, and other zinc polymerases involved in DNA and RNA binding and synthesis (Lipscomb and Sträter, 1996). Zinc is readily available in modern freshwater and marine systems, where it can exist at up to 0.45 micromolar concentrations, nearly half of which is the bioavailable Zn2+ ion (Turner and others, 1981; Stumm and Morgan, 1996). However Saito and others (2003) calculate that zinc would have been scarce in early anoxic oceans, and suggest that during this time prokaryotes (specifically cyanobacteria) either used less zinc due to its scarcity or made use of alternate metal centers. They further suggest that organisms may have obtained zinc enzymes later through lateral gene transfer.
Cobalt and nickel-containing enzymes are most commonly used in metabolisms based on methane, carbon monoxide, and hydrogen, such as those likely operating on early Earth. N