-
Research
-
Publications
-
All publications
-
Benner, SA
-
Carrigan, MA
-
Chamberlin, SG
-
Chen, F
-
Hughes, E
-
Hutter, D
-
Hoshika, S
-
Karalkar, N
-
Kim, HJ
-
Kim, MJ
-
Laos, R
-
Leal, NA
-
Shaw, RW
-
Yang, ZY
-
People
-
Benner, Steven
-
Carrigan, Matthew
-
Chamberlin, Steve
-
Chen, Fei
-
Davis, Ross
-
Hoshika, Shuichi
-
Hughes, Ewa
-
Hughes, Romaine
-
Hutter, Daniel
-
Karalkar, Nilesh
-
Kim, Hyo-Joong
-
Kim, Myong
-
Laos, Roberto
-
Leal, Nicole
-
Opalko, Jeff
-
Shaw, Ryan
-
Yang, Zunyi
-
Software
-
News and Events
-
Press Coverage
-
Our Foundation
|
Distinguished Fellow Steven Benner
Education
- BS in Molecular Biophysics and Biochemistry. Yale University
(1976)
- MS in Molecular Biophysics and Biochemistry. Yale University
(1976)
- PhD in Chemistry. Harvard University
(1979)
Research summary
The Benner group has:
- Initiated synthetic biology as a field. The Benner group was the first to synthesize a gene for an enzyme, and used organic synthesis to prepare the first artificial genetic systems. These systems have been used to direct the synthesis of artificial proteins having unnatural amino acids, in FDA-approved clinical assays for HIV, hepatitis B and hepatitis C that improves the medical care of over 400,000 patients annually, and to support the first artificial chemical system capable of Darwinian evolution.
- Invented dynamic combinatorial chemistry, combining ideas from molecular evolution, enzymology, analytical chemistry, and organic chemistry to generate a strategy to discover small molecule therapeutic leads. A German company, Alantos, is today using this technology to develop drug leads.
- Established paleomolecular biology, where researchers resurrect ancestral proteins from extinct organisms for study in the laboratory, The strategy allows scientists to connect chemistry to function in biology, which is defined by an organism's fitness in a complex and changing environment.
- Helped found evolutionary bioinformatics, in 1991, launched one of the first web-based bioinformatics servers with Gaston Gonnet, generated the first naturally organized protein sequence databases, and helped develop the MasterCatalog that generated ca. $4 million in sales. This work also supported the first exhaustive matching of a modern protein sequence database, the first convincing tools to predict structure in proteins from sequence data, strategies to detect distant homologs using structure prediction, and "post-genomic" tools to detect changing protein function.
Awards
- National Science Foundation Graduate Fellow
- Junior Fellowship, Harvard Society of Fellows
- Dreyfus Award for Young Faculty, 1982
- Searle Scholar, 1984-86
- Sloan Foundation Fellow, 1984-86
- Anniversary Prize, Federation of European Biochemical Societies, 1993
- Nolan Summer Award, 1998
- Arun Gunthikonda Memorial Award, 1998
- Townes R. Leigh Commemorative Professor, 1999
- B. R. Baker Award, 2001
- Sigma Xi Senior Faculty Award 2005
Selected Publications

Signatures of a Shadow Biosphere
Davies, PCW
Benner, SA
Cleland, CE
Lineweaver, CH
McKay, CP
Wolfe-Simon, F
Astrobiology 9
(2)
241-249
(2009)
<Abstract>
Astrobiologists are aware that extraterrestrial life might differ from
known life, and considerable thought has been given to possible
signatures associated with weird forms of life on other planets. So
far, however, very little attention has been paid to the possibility
that our own planet might also host communities of weird life. If
life arises readily in Earth-like conditions, as many astrobiologists
contend, then it may well have formed many times on Earth itself,
which raises the question whether one or more shadow biospheres have
existed in the past or still exist today. In this paper, we discuss
possible signatures of weird life and outline some simple strategies
for seeking evidence of a shadow biosphere.

Incorporation of Multiple Sequential Pseudothymidines by DNA Polymerases and Their Impact on DNA Duplex Structure
Havemann, SA
Hoshika, S
Hutter, D
Benner, SA
Nuc. Nuc. Nuc. acids 27
(3)
261-278
(2008)
<Abstract>
In this article, we focus on the synthesis of aryl C-glycosides via
Heck coupling. It is organized based on the type of structures used in
the assembly of the C-glycosides (also called C-nucleosides) with the
following subsections: pyrimidine C-nucleosides, purine C-nucleosides,
and monocyclic, bicyclic, and tetracyclic C-nucleosides. The reagents
and conditions used for conducting the Heck coupling reactions are
discussed. The subsequent conversion of the Heck products to the
corresponding target molecules and the application of the target
molecules are also described.

The evolution of seminal ribonuclease: Pseudogene reactivation or multiple gene inactivation events?
Sassi, SO
Braun, EL
Benner, SA
Mol. Biol. Evol. 24
(4)
1012-1024
(2007)
<Abstract>
Two approaches, one novel, are applied to analyze the divergent
evolution of ruminant seminal ribonucleases (RNases), paralogs of the
well-known pancreatic RNases of mammals. Here, the goal was to identify
periods of divergence of seminal RNase under functional constraints,
periods of divergence as a pseudogene, and periods of divergence driven
by positive selection pressures. The classical approach involves the
analysis of nonsynonymous to synonymous replacements ratios (omega) for
the branches of the seminal RNase evolutionary tree. The novel approach
coupled these analyses with the mapping of substitutions on the folded
structure of the protein. These analyses suggest that seminal RNase
diverged during much of its history after divergence from pancreatic
RNase as a functioning protein, followed by homoplastic inactivations
to create pseudogenes in multiple ruminant lineages. Further, they are
consistent with adaptive evolution only in the most recent episode
leading to the gene in modern oxen. These conclusions contrast sharply
with the view, cited widely in the literature, that seminal RNase
decayed after its formation by gene duplication into an inactive
pseudogene, whose lesions were repaired in a reactivation event.
Further, the 2 approaches, omega estimation and mapping of replacements
on the protein structure, were compared by examining their utility for
establishing the functional status of the seminal RNase genes in 2 deer
species. Hog and roe deer share common lesions, which strongly suggests
that the gene was inactive in their last common ancestor. In this
specific example, the crystallographic approach made the correct
implication more strongly than the omega approach. Studies of this type
should contribute to an integrated framework of tools to assign
functional and nonfunctional episodes to recently created gene
duplicates and to understand more broadly how gene duplication leads to
the emergence of proteins with novel functions.

Nucleoside alpha-thiotriphosphates, polymerases and the exonuclease III analysis of oligonucleotides containing phosphorothioate linkages
Yang, ZY
Sismour, AM
Benner, SA
Nucl. Acids Res. 35
(9)
3118-3127
(2007)
<Abstract>
The use of DNA polymerases to incorporate phosphorothioate linkages
into DNA, and the use of exonuclease III to determine where those
linkages have been incorporated, are re- examined in this work. The
results presented here show that exonuclease III degrades single-
stranded DNA as a substrate and digests through phosphorothioate
linkages having one absolute stereochemistry, assigned ( assuming
inversion in the polymerase reaction) as S, but not the other absolute
stereochemistry. This contrasts with a general view in the literature
that exonuclease III favors double-stranded nucleic acid as a substrate
and stops completely at phosphorothioate linkages. Furthermore, not all
DNA polymerases appear to accept exclusively the ( R) stereoisomer of
nucleoside alpha- thiotriphosphates [ and not the ( S) diastereomer], a
conclusion inferred two decades ago by examination of five Family- A
polymerases and a reverse transcriptase. This suggests that caution is
appropriate when extrapolating the detailed behavior of one polymerase
from the behaviors of other polymerases. Furthermore, these results
provide constraints on how exonuclease III - thiotriphosphate -
polymerase combinations can be used to analyze the behavior of the
components of a synthetic biology.

Enzymatic incorporation of a third nucleobase pair
Yang, ZY
Sismour, AM
Sheng, PP
Puskar, NL
Benner, SA
Nucl. Acids Res. 35
(13)
4238-4249
(2007)
<Abstract>
DNA polymerases are identified that copy a nonstandard nucleotide pair
joined by a hydrogen bonding pattern different from the patterns
joining the dA:T and dG:dC pairs.
6-Amino-5-nitro3-(l'-p-D-2'-deoxyribofuranosyl)-2(1H)-pyridone (dZ)
implements the non-standard 'small' donordonor-acceptor (pyDDA)
hydrogen bonding pattern.
2-Amino-8-(1-beta-D-2'-deoxyribofuranosyl)imidazo[1,2-a]-1,3,5-triazin-4
(8H)-one [dP) implements the 'large' acceptor-acceptor-donor (puAAD)
pattern. These nucleobases were designed to present electron density to
the minor groove, density hypothesized to help determine specificity
for polymerases. Consistent with this hypothesis, both dZTP and dPTP
are accepted by many polymerases from both Families A and B. Further,
the dZ:dP pair participates in PCR reactions catalyzed by Taq, Vent
(exo(-)) and Deep Vent (exo-) polymerases, with 94.4%, 97.5% and 97.5%,
respectively, retention per round. The dZ:dP pair appears to be lost
principally via transition to a dC:dG pair. This is consistent with a
mechanistic hypothesis that deprotonated dZ (presenting a pyDAA
pattern) complements dG (presenting a puADD pattern), while protonated
dC (presenting a pyDDA pattern) complements dP (presenting a puAAD
pattern). This hypothesis, grounded in the Watson-Crick model for
nucleobase pairing, was confirmed by studies of the pH-dependence of
mismatching. The dZ:dP pair and these polymerases, should be useful in
dynamic architectures for sequencing, molecular-, systems- and
synthetic-biology.

The origin of proteins and nucleic acids
Ricardo, A
Benner, SA
Planets and Life: The Emerging Science of Astrobiology, ed. Woodruff T. Sullivan and John A. Baross, Cambridge University Press 154-173
(2007)

Alien biochemistries
Ward, PD
Benner, SA
Planets and Life: The Emerging Science of Astrobiology, ed. Woodruff T. Sullivan and John A. Baross, Cambridge University Press 537-544
(2007)

Integrating protein structures and precomputed genealogies in the Magnum database: Examples with cellular retinoid binding proteins
Bradley, ME
Benner, SA
BMC Bioinformatics 7 89
(2006)
<Abstract>
Background: When accurate models for the divergent evolution of protein
sequences are integrated with complementary biological information,
such as folded protein structures, analyses of the combined data often
lead to new hypotheses about molecular physiology. This represents an
excellent example of how bioinformatics can be used to guide
experimental research. However, progress in this direction has been
slowed by the lack of a publicly available resource suitable for
general use.
Results: The precomputed Magnum database offers a solution to this
problem for ca. 1,800 full-length protein families with at least one
crystal structure. The Magnum deliverables include 1) multiple sequence
alignments, 2) mapping of alignment sites to crystal structure sites,
3) phylogenetic trees, 4) inferred ancestral sequences at internal tree
nodes, and 5) amino acid replacements along tree branches.
Comprehensive evaluations revealed that the automated procedures used
to construct Magnum produced accurate models of how proteins
divergently evolve, or genealogies, and correctly integrated these with
the structural data. To demonstrate Magnum's capabilities, we asked for
amino acid replacements requiring three nucleotide substitutions,
located at internal protein structure sites, and occurring on short
phylogenetic tree branches. In the cellular retinoid binding protein
family a site that potentially modulates ligand binding affinity was
discovered. Recruitment of cellular retinol binding protein to function
as a lens crystallin in the diurnal gecko afforded another opportunity
to showcase the predictive value of a browsable database containing
branch replacement patterns integrated with protein structures.
Conclusion: We integrated two areas of protein science, evolution and
structure, on a large scale and created a precomputed database, known
as Magnum, which is the first freely available resource of its kind.
Magnum provides evolutionary and structural bioinformatics resources
that are useful for identifying experimentally testable hypotheses
about the molecular basis of protein behaviors and functions, as
illustrated with the examples from the cellular retinoid binding
proteins.

Analysis of transitions at two-fold redundant sites in mammalian genomes. Transition redundant approach-to-equilibrium (TREx) distance metrics
Li, T
Chamberlin, SG
Caraco, MD
Liberles, DA
Gaucher, EA
Benner, SA
BMC Evol. Biol. 6 25
(2006)
<Abstract>
Background: The exchange of nucleotides at synonymous sites in a gene
encoding a protein is believed to have little impact on the fitness of
a host organism. This should be especially true for synonymous
transitions, where a pyrimidine nucleotide is replaced by another
pyrimidine, or a purine is replaced by another purine. This suggests
that transition redundant exchange ( TREx) processes at the third
position of conserved two-fold codon systems might offer the best
approximation for a neutral molecular clock, serving to examine, within
coding regions, theories that require neutrality, determine whether
transition rate constants differ within genes in a single lineage, and
correlate dates of events recorded in genomes with dates in the
geological and paleontological records. To date, TREx analysis of the
yeast genome has recognized correlated duplications that established a
new metabolic strategies in fungi, and supported analyses of functional
change in aromatases in pigs. TREx dating has limitations, however.
Multiple transitions at synonymous sites may cause equilibration and
loss of information. Further, to be useful to correlate events in the
genomic record, different genes within a genome must suffer transitions
at similar rates.
Results: A formalism to analyze divergence at two fold redundant codon
systems is presented. This formalism exploits two-state
approach-to-equilibrium kinetics from chemistry. This formalism
captures, in a single equation, the possibility of multiple
substitutions at individual sites, avoiding any need to "correct" for
these. The formalism also connects specific rate constants for
transitions to specific approximations in an underlying evolutionary
model, including assumptions that transition rate constants are
invariant at different sites, in different genes, in different
lineages, and at different times. Therefore, the formalism supports
analyses that evaluate these approximations.
Transitions at synonymous sites within two-fold redundant coding
systems were examined in the mouse, rat, and human genomes. The key
metric (f(2)), the fraction of those sites that holds the same
nucleotide, was measured for putative ortholog pairs. A transition
redundant exchange ( TREx) distance was calculated from f(2) for these
pairs. Pyrimidine-pyrimidine transitions at these sites occur
approximately 14% faster than purine-purine transitions in various
lineages. Transition rate constants were similar in different genes
within the same lineages; within a set of orthologs, the f(2)
distribution is only modest overdispersed. No correlation between
disparity and overdispersion is observed. In rodents, evidence was
found for greater conservation of TREx sites in genes on the X
chromosome, accounting for a small part of the overdispersion, however.
Conclusion: The TREx metric is useful to analyze the history of
transition rate constants within these mammals over the past 100
million years. The TREx metric estimates the extent to which silent
nucleotide substitutions accumulate in different genes, on different
chromosomes, with different compositions, in different lineages, and at
different times.

Application of DETECTER, an Evolutionary Genomic Tool to Analyze Genetic Variation, to the Cystic Fibrosis Gene Family
Gaucher, EA
DeKee, DW
Benner, SA
BMC Genomics 7 44
(2006)
<Abstract>
Background: The medical community requires computational tools that
distinguish genetic differences having phenotypic impact within the
vast number of mutations that do not. Tools that do this will become
increasingly important for those seeking to use human genome sequence
data to predict disease, make prognoses, and customize therapy to
individual patients.
Results: An approach, termed DETECTER, is proposed to identify sites
in a protein sequence where amino acid replacements are likely to have
a significant effect on phenotype, including causing genetic
disease. This approach uses a model-dependent tool to estimate the
normalized replacement rate at individual sites in a protein sequence,
based on a history of those sites extracted from an evolutionary
analysis of the corresponding protein family. This tool identifies
sites that have higher-than-average, average, or lower- than-average
rates of change in the lineage leading to the sequence in the
population of interest. The rates are then combined with sequence data
to determine the likelihoods that particular amino acids were present
at individual sites in the evolutionary history of the gene
family. These likelihoods are used to predict whether any specific
amino acid replacements, if introduced at the site in a modern human
population, would have a significant impact on fitness. The DETECTER
tool is used to analyze the cystic fibrosis transmembrane conductance
regulator (CFTR) gene family.
Conclusions: In this system, DETECTER retrodicts amino acid
replacements associated with the cystic fibrosis disease with greater
accuracy than alternative approaches. While this result validates this
approach for this particular family of proteins only, the approach may
be applicable to the analysis of polymorphisms generally, including
SNPs in a human population.

2-Hydroxymethylboronate as a Reagent To Detect Carbohydrates: Application to the Analysis of the Formose Reaction
Ricardo, A
Frye, F
Carrigan, MA
Tipton, JD
Powell, DH
Benner, SA
J. Org. Chem. 71
(25)
9503-9505
(2006)
<Abstract>
2-Hydroxymethylphenylboronate is described as a reagent that converts
neutral 1,2-diols, as found in simple carbohydrates, into 1:1 anionic
complexes that are easily detected by Fourier transform ion cyclotron
resonance mass spectrometry. The value of this reagent was demonstrated
through its application to analyze complex mixtures of carbohydrates
formed in the formose process, often cited as a way that biologically
significant carbohydrates might have been generated from formaldehyde
under prebiotic conditions. Coupled with isotope studies, the reagent
shows that the simplest autocatalytic cycle for the consumption of
formaldehyde in this process cannot account for the bulk consumption of
formaldehyde.

Dynamic assembly of primers on nucleic acid templates
Leal, NA
Sukeda, M
Benner, SA
Nucl. Acids Res. 34 4702-4710
(2006)
<Abstract>
A strategy is presented that uses dynamic equlibria to assemble in situ
composite DNA polymerase primers, having lengths of 14 or 16 nt, from DNA
fragments that are 6 or 8 nt in length. In this implementation, the
fragments are transiently joined under conditions of dynamic equilibrium by
an imine linker, which has a dissociation constant of 1 µM. If a polymerase
is able to extend the composite, but not the fragments, it is possible to
prime the synthesis of a target DNA molecule under conditions where two
useful specificities are combined: (i) single nucleotide discrimination
that is characteristic of short oligonucleotide duplexes (four to six
nucleobase pairs in length), which effectively excludes single mismatches,
and (ii) an overall specificity of priming that is characteristic of long
(14 to 16mers) oligonucleotides, potentially unique within a genome. We
report here the screening of a series of polymerases that combine an
ability not to accept short primer fragments with an ability to accept the
long composite primer held together by an unnatural imine linkage. Several
polymerases were found that achieve this combination, permitting the
implementation of the dynamic combinatorial chemical strategy.

Artificially expanded genetic information system: a new base pair with an alternative hydrogen bonding pattern
Yang, ZY
Hutter, D
Sheng, PP
Sismour, AM
Benner, SA
Nucl. Acids Res. 34
(21)
6095-6101
(2006)
<Abstract>
To support efforts to develop a 'synthetic biology' based on an
artificially expanded genetic information system (AEGIS), we have
developed a route to two components of a non-standard nucleobase pair,
the pyrimidine analog
6-amino-5-nitro-3-(1'-beta-D-2'-deoxyribofuranosyl)-2(1H)-pyridone (dZ)
and its Watson-Crick complement, the purine analog
2-amino-8-(1'-beta-D-2'-deoxyribofuranosyl)-imidazo[1,2-a]-1,3,5-triazin
-4(8H)-one (dP). These implement the pyDDA:puAAD hydrogen bonding
pattern (where 'py' indicates a pyrimidine analog and 'pu' indicates a
purine analog, while A and D indicate the hydrogen bonding patterns of
acceptor and donor groups presented to the complementary nucleobases,
from the major to the minor groove). Also described is the synthesis of
the triphosphates and protected phosphoramidites of these two
nucleosides. We also describe the use of the protected phosphoramidites
to synthesize DNA oligonucleotides containing these AEGIS components,
verify the absence of epimerization of dZ in those oligonucleotides,
and report some hybridization properties of the dZ:dP nucleobase pair,
which is rather strong, and the ability of each to effectively
discriminate against mismatches in short duplex DNA.

A review: Synthesis of aryl C-glycosides via the heck coupling reaction
Wellington, KW
Benner, SA
Nuc. Nuc. Nuc. acids 25
(12)
1309-1333
(2006)
<Abstract>
In this article, we focus on the synthesis of aryl C-glycosides via
Heck coupling. It is organized based on the type of structures used in
the assembly of the C-glycosides (also called C-nucleosides) with the
following subsections: pyrimidine C-nucleosides, purine C-nucleosides,
and monocyclic, bicyclic, and tetracyclic C-nucleosides. The reagents
and conditions used for conducting the Heck coupling reactions are
discussed. The subsequent conversion of the Heck products to the
corresponding target molecules and the application of the target
molecules are also described.

Desorption/ionization on porous silicon mass spectrometry studies on pentose-borate complexes
Li, Q
Ricardo, A
Benner, SA
Winefordner, JD
Powell, DH
Anal. Chem. 77
(14)
4503-4508
(2005)
<Abstract>
Desorption/ionization on porous silicon mass spectrometry (DIOS-MS) was
used to investigate the binding affinities between aldopentose isomers
and boron. Boron has been recognized for its importance in pentose
synthesis and stabilization in prebiotic conditions. Boron may also
account for the fact that ribose, among other aldopentoses, is the
favored building block in RNA synthesis. This research started with the
detection of aldopentoses in the positive mode through cationization
and the aldopentose-borate complexes in the negative mode. Then two
competition schemes, one using a pentose structure analogue and the
other using C-13-labeled ribose, were designed to compare the relative
binding affinities of four aldopentoses (xylose, lyxose, arabinose, and
ribose) to boron. Both approaches determined the binding preference to
be ribose > lyxose > arabinose > xylose. This work illustrates the
potential of DIOS-MS in the analyses of nonvolatile, small molecules in
delicate chemical equilibria. Without externally introduced matrices,
background signals are not a limiting factor. Furthermore, the possible
dramatic change of pH associated with the matrix introduction, which
may disturb the equilibria of interest, is avoided.

Synthetic biology
Sismour, AM
Benner, SA
Expert Opin. Biol. Ther. 5
(11)
1409-1414
(2005)
<Abstract>
Chemistry is a broadly powerful discipline in contemporary science
because it has the ability to create new forms of the matter that it
studies. By doing so, chemistry can test models that connect molecular
structure to behaviour without having to rely on what nature has
provided. This creation, known as synthesis', began to be applied to
living systems in the 1980s as recombinant DNA technologies allowed
biologists to deliberately change the molecular structure of the
microbes that they studied, and automated chemical synthesis of DNA
became widely available to support these activities. The impact of the
information that has emerged has made biologists aware of a truism that
has long been known in chemistry: synthesis drives discovery and
understanding in ways that analysis cannot. Synthetic biology is now
setting an ambitious goal: to recreate in artificial systems the
emergent properties found in natural biology. By doing so, it is
advancing our understanding of the molecular basis of genetics in ways
that analysis alone cannot. More practically, it has yielded artificial
genetic systems that improve the healthcare of some 400,000 Americans
annually. Synthetic biology is now set to take the next step, to create
artificial Darwinian systems by direct construction. Supported by the
National Science Foundation as part of its Chemical Bonding program,
this work cannot help but generate clarity in our understanding of how
biological systems work.

The use of thymidine analogs to improve the replication of an extra DNA base pair: a synthetic biological system
Sismour, AM
Benner, SA
Nucl. Acids Res. 33 5640-5646
(2005)
<Abstract>
Synthetic biology based on a six-letter genetic alphabet that
includes the two non-standard nucleobases isoguanine (isoG) and
isocytosine (isoC), as well as the standard A, T, G and C, is
known to suffer as a consequence of a minor tautomeric form of
isoguanine that pairs with thymine, and therefore leads to
infidelity during repeated cycles of the PCR. Reported here is a
solution to this problem. The solution replaces thymidine
triphosphate by 2-thiothymidine triphosphate (2-thioTTP). Because
of the bulk and hydrogen bonding properties of the thione unit in
2-thioT, 2-thioT does not mispair effectively with the minor
tautomer of isoG. To test whether this might allow PCR
amplification of a six-letter artificially expanded genetic
information system, we examined the relative rates of
misincorporation of 2-thioTTP and TTP opposite isoG using affinity
electrophoresis. The concentrations of isoCTP and 2-thioTTP were
optimal to best support PCR amplification using thermostable
polymerases of a six-letter alphabet that includes the isoC-isoG
pair. The fidelity-per-round of amplification was found to be
approximately 98% in trial PCRs with this six-letter DNA
alphabet. The analogous PCR employing TTP had a fidelity-per-round
of only approximately 93%. Thus, the A, 2-thioT, G, C, isoC, isoG
alphabet is an artificial genetic system capable of Darwinian
evolution.

Resurrecting ancestral alcohol dehydrogenases from yeast
Thomson, JM
Gaucher, EA
Burgan, MF
De Kee, DW
Li, T
Aris, JP
Benner, SA
Nature Genet. 37
(6)
630-635
(2005)
<Abstract>
Modern yeast living in fleshy fruits rapidly convert sugars into
bult ethanol through pyruvate. Pyruvate loses carbon dioxide to
become acetaldehyde, which is reduced by alcohol dehydrogenase 1
(Adh1) to ethanol, which accumulates. Yeast later consumes the
accumulated ethanol, exploiting Adh2, an Adh1 homolog differing by
24 (of 348) amino acids. Because many microorganisms cannot grow
in ethanol, accumulated ethanol may help yeast defend resources in
the fruit. We report here the reconstruction of the last common
ancestor of Adh1 and Adh2, called AdhA. The kinetic behavior of
AdhA suggests that it was optimized to make (not consume) ethanol.
This is consistent with the hypothesis that before the Adh1-Adh2
duplication, yeast did not accumulate ethanol for later consumption
but rather used AdhA to recycle NADH generated in the glycolytic
pathway. Silent nucleotide dating suggests that the Adh1-Adh2
duplication occurred near the time of duplication of several other
proteins involved in the accumulation of ethanol, possibly in the
Cretaceous age when fleshy fruits arose. These results help to
connect the chemical behavior of these enzymes through systems
analysis to a time of global ecosystem change, a small but useful
step towards a planetary systems biology.
 Synthetic Biology
Sismour, AM
Benner, SA
Nat. Rev. Genet. 6 533-543
(2005)
<Abstract>
Synthetic biologists come in two broad classes. One uses unnatural
molecules to reproduce emergent behaviours from natural biology,
with the goal of creating artificial life. The other seeks
interchangeable parts from natural biology to assemble into
systems that function unnaturally. Either way, a synthetic goal
forces scientists to cross uncharted ground to encounter and solve
problems that are not easily encountered through analysis. This
drives the emergence of new paradigms in ways that analysis cannot
easily do. Synthetic biology has generated diagnostic tools that
improve the care of patients with infectious diseases, as well as
devices that oscillate, creep and play tic-tac-toe.

Understanding nucleic acids using synthetic chemistry
Benner, SA
Acc. Chem. Res. 37
(10)
784-797
(2004)
<Abstract>
This Account describes work done in these laboratories that has used
synthetic, physical organic, and biological chemistry to understand the
roles played by the nucleobases, sugars, and phosphates of DNA in the
molecular recognition processes central to genetics. The number of
nucleobases has been increased from 4 to 12, generating an artificially
expanded genetic information system. This system is used today in the
clinic to monitor the levels of HIV and hepatitis C viruses in
patients, helping to manage patient care. Work with uncharged phosphate
replacements suggests that a repeating charge is a universal feature of
genetic molecules operating in water and will be found in
extraterrestrial life (if it is ever encountered). The use of ribose
may reflect prebiotic processes in the presence of borate-containing
minerals, which stabilize ribose formed from simple organic precursors.
A new field, synthetic biology, is emerging on the basis of these
experiments, where chemistry mimics biological processes as complicated
as Darwinian evolution.

The planetary biology of cytochrome P450 aromatases
Gaucher, EA
Graddy, LG
Li, T
Simmen, RC
Simmen, FA
Schreiber, DR
Liberles, DA
Janis, CM
Benner, SA
BMC Biology 2
(1)
19
(2004)
<Abstract>
BACKGROUND: Joining a model for the molecular evolution of a
protein family to the paleontological and geological records
(geobiology), and then to the chemical structures of substrates,
products, and protein folds, is emerging as a broad strategy for
generating hypotheses concerning function in a post-genomic
world. This strategy expands systems biology to a planetary
context, necessary for a notion of fitness to underlie (as it
must) any discussion of function within a biomolecular
system.
RESULTS: Here, we report an example of such an expansion,
where tools from planetary biology were used to analyze three
genes from the pig Sus scrofa that encode cytochrome P450
aromatases-enzymes that convert androgens into estrogens. The
evolutionary history of the vertebrate aromatase gene family was
reconstructed. Transition redundant exchange silent substitution
metrics were used to interpolate dates for the divergence of
family members, the paleontological record was consulted to
identify changes in physiology that correlated in time with the
change in molecular behavior, and new aromatase sequences from
peccary were obtained. Metrics that detect changing function in
proteins were then applied, including KA/KS values and those
that exploit structural biology. These identified specific amino
acid replacements that were associated with changing substrate
and product specificity during the time of presumed adaptive
change. The combined analysis suggests that aromatase paralogs
arose in pigs as a result of selection for Suoidea with larger
litters than their ancestors, and permitted the Suoidea to
survive the global climatic trauma that began in the
Eocene.
CONCLUSIONS: This combination of bioinformatics analysis,
molecular evolution, paleontology, cladistics, global
climatology, structural biology, and organic chemistry serves as
a paradigm in planetary biology. As the geological,
paleontological, and genomic records improve, this approach
should become widely useful to make systems biology statements
about high-level function for biomolecular systems.

Is there a common chemical model for life in the universe?
Benner, SA
Ricardo, A
Carrigan, MA
Curr. Op. Chem Biol. 8
(6)
672-689
(2004)
<Abstract>
A review of organic chemistry suggests that life, a chemical system
capable of Darwinian evolution, may exist in a wide range of
environments. These include non-aqueous solvent systems at low
temperatures, or even supercritical dihydrogen-helium mixtures. The
only absolute requirements may be a thermodynamic disequilibrium and
temperatures consistent with chemical bonding. A solvent system,
availability of elements such as carbon, hydrogen, oxygen and nitrogen,
certain thermodynamic features of metabolic pathways, and the
opportunity for isolation, may also define habitable environments. If
we constrain life to water, more specific criteria can be proposed,
including soluble metabolites, genetic materials with repeating
charges, and a well defined temperature range.

Inferring the palaeoenvironment of ancient bacteria on the basis of resurrected proteins
Gaucher, EA
Thomson, JM
Burgan, MF
Benner, SA
Nature 425
(6955)
285-288
(2003)
<Abstract>
Features of the physical environment surrounding an ancestral
organism can be inferred by reconstructing sequences(1-9) of
ancient proteins made by those organisms, resurrecting these
proteins in the laboratory, and measuring their
properties. Here, we resurrect candidate sequences for
elongation factors of the Tu family (EF-Tu) found at ancient
nodes in the bacterial evolutionary tree, and measure their
activities as a function of temperature. The ancient EF-Tu
proteins have temperature optima of 55-65degreesC. This value
seems to be robust with respect to uncertainties in the
ancestral reconstruction. This suggests that the ancient
bacteria that hosted these particular genes were thermophiles,
and neither hyperthermophiles nor mesophiles. This conclusion
can be compared and contrasted with inferences drawn from an
analysis of the lengths of branches in trees joining proteins
from contemporary bacteria(10), the distribution of thermophily
in derived bacterial lineages(11), the inferred G+C content of
ancient ribosomal RNA(12), and the geological record combined
with assumptions concerning molecular clocks(13). The study
illustrates the use of experimental palaeobiochemistry and
assumptions about deep phylogenetic relationships between
bacteria to explore the character of ancient life.

Nucleobase pairing in Watson-Crick-like genetic expanded information systems
Geyer, CR
Battersby, TR
Benner, SA
Structure 11
(12)
1485-1498
(2003)
<Abstract>
To guide the design of alternative genetic systems, we measured melting
temperatures of DNA duplexes containing matched and mismatched
nucleobase pairs from natural and unnatural structures. The pairs were
analyzed in terms of structural features, including nucleobase size,
number of hydrogen bonds formed, the presence of uncompensated hydrogen
bonding functional groups, the nature of the bond joining the
nucleobase to the sugar, and nucleobase charge. The results suggest
that stability of nucleobase pairs correlates with the number of
H-bonds, size complementarity, the presence of uncompensated functional
groups, and the presence of charge on a nucleobase. Each of these
properties appear to be more significant than the nature of the
glycosidic bond and sequence context. The results provide guidelines
for constructing stable Watson-Crick like nucleobase pairs with
unnatural nucleobases. The experiments also demonstrate that expanded
genetic systems can be constructed using size complementary nucleobase
pairs that contain three hydrogen bonds.

Evolution - Planetary biology - Paleontological, geological, and molecular histories of life
Benner, SA
Caraco, MD
Thomson, JM
Gaucher, EA
Science 296
(5569)
864-868
(2002)
<Abstract>
The history of life on Earth is chronicled in the geological
strata, the fossil record, and the genomes of contemporary
organisms. When examined together, these records help identify
metabolic and regulatory pathways, annotate protein sequences,
and identify animal models to develop new drugs, among other
features of scientific and biomedical interest. Together,
planetary analysis of genome and proteome databases is providing
an enhanced understanding of how life interacts with the
biosphere and adapts to global change.

How small can a microorganism be?
Benner, SA
Size Limits of Very Small Microorganisms: Proceedings of a Workshop, Steering Group on Astrobiology of the Space Studies Board, National Research Council 126-135
(1999)
(Click here to see all publications by this author)
|
- Chemical genetics
- Synthetic biology
- Paleogenetics
- Planetary biology
- Systems biology
- The connection of natural history to the physical sciences
|
|