-
Research
-
Publications
-
People
-
Benner, Steven
-
Carrigan, Matthew
-
Chamberlin, Steve
-
Davis, Ross
-
Gaucher, Eric
-
Hughes, Romaine
-
Hutter, Daniel
-
Kim, Hyo-Joong
-
Leal, Nicole
-
Shaw, Ryan
-
Yang, Zunyi
-
Software
-
News and Events
-
Our Foundation
|
All FfAME Publications

Incorporation of Multiple Sequential Pseudothymidines by DNA Polymerases and Their Impact on DNA Duplex Structure
Havemann, SA
Hoshika, S
Hutter, D
Benner, SA
Nuc. Nuc. Nuc. acids 27
(3)
261-278
(2008)
<Abstract>
In this article, we focus on the synthesis of aryl C-glycosides via
Heck coupling. It is organized based on the type of structures used in
the assembly of the C-glycosides (also called C-nucleosides) with the
following subsections: pyrimidine C-nucleosides, purine C-nucleosides,
and monocyclic, bicyclic, and tetracyclic C-nucleosides. The reagents
and conditions used for conducting the Heck coupling reactions are
discussed. The subsequent conversion of the Heck products to the
corresponding target molecules and the application of the target
molecules are also described.
 Paleotemperature trend for Precambrian life inferred from resurrected proteins
Gaucher, EA
Ganesh, O
Govindarajan, S
Nature
(2007)
In press
 Leishmania promastigotes activate PI3K/Akt signalling to confer host cell resistance to apoptosis
Ruhland, A
Leal, N
Kima, PE
Cell Microbiol. 9
(1)
84-96
(2007)
<Abstract>
Previous reports have shown that cells infected with promastigotes of
some Leishmania species are resistant to the induction of apoptosis.
This would suggest that either parasites elaborate factors that block
signalling from apoptosis inducers or that parasites engage endogenous
host signalling pathways that block apoptosis. To investigate the
latter scenario, we determined whether Leishmania infection results in
the activation of signalling pathways that have been shown to mediate
resistance to apoptosis in other infection models. First, we showed
that infection with the promastigote form of Leishmania major,
Leishmania pifanoi and Leishmania amazonensis activates signalling
through p38 mitogen-activated protein kinase (MAPK), NF kappa B and
PI3K/Akt. Then we found that inhibition of signalling through the
PI3K/Akt pathway with LY294002 and Akt IV inhibitor reversed resistance
of infected bone marrow-derived macrophages and RAW 264.7 macrophages
to potent inducers of apoptosis. Moreover, reduction of Akt levels with
small interfering RNAs to Akt resulted in the inability of infected
macrophages to resist apoptosis. Further evidence of the role of
PI3K/Akt signalling in the promotion of cell survival by infected cells
was obtained with the finding that Bad, which is a substrate of Akt,
becomes phosphorylated during the course of infection. In contrast to
the observations with PI3K/Akt signalling, inhibition of p38 MAPK
signalling with SB202190 or NF kappa B signalling with wedelolactone
had limited effect on parasite-induced resistance to apoptosis. We
conclude that Leishmania promastigotes engage PI3K/Akt signalling,
which confers to the infected cell, the capacity to resist death from
activators of apoptosis.
 Molecular evolutionary models to guide experiments in protein engineering and directed evolution
Gaucher, EA
(2007)
In review
 PduL is an evolutionarily distinct phosphotransacylase involved in B-12-dependent 1,2-propanediol degradation by Salmonella enterica serovar typhimurium LT2
Liu, Y
Leal, NA
Sampson, EM
Johnson, CLV
Havemann, GD
Bobik, TA
J. Bacteriol. 189
(5)
1589-1596
(2007)
<Abstract>
Salmonella enterica degrades 1,2-propanediol (1,2-PD) in a coenzyme
B-12-dependent manner. Previous enzymatic assays of crude cell extracts
indicated that a phosphotransacylase (PTAC) was needed for this
process, but the enzyme involved was not identified. Here, we show that
the pduL gene encodes an evolutionarily distinct PTAC used for 1,2-PD
degradation. Growth tests showed that pduL mutants were unable to
ferment 1,2-PD and were also impaired for aerobic growth on this
compound. Enzyme assays showed that cell extracts from a pduL mutant
lacked measurable PTAC activity in a background that also carried a pta
mutation (the pta gene was previously shown to encode a PTAC enzyme).
Ectopic expression of pduL corrected the growth defects of a pta
mutant. PduL fused to eight C-terminal histidine residues (PduL-His(8))
was purified, and its kinetic constants were determined: the V-max was
51.7 +/- 7.6 mu mol min(-1) mg(-1), and the K-m values for
propionyl-PO42- and acetyl-PO42- were 0.61 and 0.97 mM, respectively.
Sequence analyses showed that PduL is unrelated in amino acid sequence
to known PTAC enzymes and that PduL homologues are distributed among at
least 49 bacterial species but are absent from the Archaea and Eukarya.
 In vivo expression of human ATP : cob(I)atamin adenosyltransferase (ATR) using recombinant adeno-associated virus (rAAV) serotypes 2 and 8
Erger, KE
Conlon, TJ
Leal, NA
Zori, R
Bobik, TA
Flotte, TR
J. Gene Med. 9
(6)
462-469
(2007)
<Abstract>
Background Methylmalonic aciduria (MMA) is an autosomal recessive
disease with symptoms that include ketoacidosis, lethargy, recurrent
vomiting, dehydration, respiratory distress, muscular hypotonia and
death due to methylmalonic acid levels that are up to 1000-fold greater
than normal. CblB MMA, a subset of the mutations leading to MMA, is
caused by a deficiency in the enzyme cob(I)alamin adenosyltransferase
(ATR). No animal model currently exists for this disease. ATR functions
within the mitochondria matrix in the final conversion of cobalamin
into coenzyme B-12, adenosylcobalamin (AdoCbl). AdoCbl is. a required
coenzyme for the mitochondrial enzyme methylmalonyl-CoA mutase (MCM).
Methods The human ATR cDNA was cloned into a recombinant
adenoassociated virus (rAAV) vector and packaged into AAV 2 or 8
capsids and delivered by portal vein injection to C57/B16 mice at a
dose of 1 x 10(10) and 1 x 10(11), particles. Eight weeks
post-injection RNA, genomic DNA and protein were then extracted and
analyzed.
Results Using primer pairs specific to the cytomegalovirus (CMV)
enhancer/chicken P-actin (CBAT) promoter within the rAAV vectors,
genome copy numbers were found to be 0.03, 2.03 and 0.10 per cell in
liver for the rAAV8 low dose, rAAV8 high dose and rAAV2 high dose,
respectively. Western blotting performed on mitochondrial protein
extracts demonstrated protein levels were comparable to control levels
in the rAAV8 low dose and rAAV2 high dose animals and 3- to 5-fold
higher than control levels were observed in high dose animals.
Immunostaining demonstrated enhanced transduction efficiency of
hepatocytes to over 40% in the rAAV8 high dose animals, compared to 9%
and 5% transduction in rAAV2 high dose and rAAV8 low dose animals,
respectively.
Conclusions These data demonstrate the feasibility of efficient ATR
gene transfer to the liver as a prelude to future gene therapy
experiments. Copyright (C) 2007 John Wiley & Sons, Ltd.

The evolution of seminal ribonuclease: Pseudogene reactivation or multiple gene inactivation events?
Sassi, SO
Braun, EL
Benner, SA
Mol. Biol. Evol. 24
(4)
1012-1024
(2007)
<Abstract>
Two approaches, one novel, are applied to analyze the divergent
evolution of ruminant seminal ribonucleases (RNases), paralogs of the
well-known pancreatic RNases of mammals. Here, the goal was to identify
periods of divergence of seminal RNase under functional constraints,
periods of divergence as a pseudogene, and periods of divergence driven
by positive selection pressures. The classical approach involves the
analysis of nonsynonymous to synonymous replacements ratios (omega) for
the branches of the seminal RNase evolutionary tree. The novel approach
coupled these analyses with the mapping of substitutions on the folded
structure of the protein. These analyses suggest that seminal RNase
diverged during much of its history after divergence from pancreatic
RNase as a functioning protein, followed by homoplastic inactivations
to create pseudogenes in multiple ruminant lineages. Further, they are
consistent with adaptive evolution only in the most recent episode
leading to the gene in modern oxen. These conclusions contrast sharply
with the view, cited widely in the literature, that seminal RNase
decayed after its formation by gene duplication into an inactive
pseudogene, whose lesions were repaired in a reactivation event.
Further, the 2 approaches, omega estimation and mapping of replacements
on the protein structure, were compared by examining their utility for
establishing the functional status of the seminal RNase genes in 2 deer
species. Hog and roe deer share common lesions, which strongly suggests
that the gene was inactive in their last common ancestor. In this
specific example, the crystallographic approach made the correct
implication more strongly than the omega approach. Studies of this type
should contribute to an integrated framework of tools to assign
functional and nonfunctional episodes to recently created gene
duplicates and to understand more broadly how gene duplication leads to
the emergence of proteins with novel functions.

Nucleoside alpha-thiotriphosphates, polymerases and the exonuclease III analysis of oligonucleotides containing phosphorothioate linkages
Yang, ZY
Sismour, AM
Benner, SA
Nucl. Acids Res. 35
(9)
3118-3127
(2007)
<Abstract>
The use of DNA polymerases to incorporate phosphorothioate linkages
into DNA, and the use of exonuclease III to determine where those
linkages have been incorporated, are re- examined in this work. The
results presented here show that exonuclease III degrades single-
stranded DNA as a substrate and digests through phosphorothioate
linkages having one absolute stereochemistry, assigned ( assuming
inversion in the polymerase reaction) as S, but not the other absolute
stereochemistry. This contrasts with a general view in the literature
that exonuclease III favors double-stranded nucleic acid as a substrate
and stops completely at phosphorothioate linkages. Furthermore, not all
DNA polymerases appear to accept exclusively the ( R) stereoisomer of
nucleoside alpha- thiotriphosphates [ and not the ( S) diastereomer], a
conclusion inferred two decades ago by examination of five Family- A
polymerases and a reverse transcriptase. This suggests that caution is
appropriate when extrapolating the detailed behavior of one polymerase
from the behaviors of other polymerases. Furthermore, these results
provide constraints on how exonuclease III - thiotriphosphate -
polymerase combinations can be used to analyze the behavior of the
components of a synthetic biology.

Enzymatic incorporation of a third nucleobase pair
Yang, ZY
Sismour, AM
Sheng, PP
Puskar, NL
Benner, SA
Nucl. Acids Res. 35
(13)
4238-4249
(2007)
<Abstract>
DNA polymerases are identified that copy a nonstandard nucleotide pair
joined by a hydrogen bonding pattern different from the patterns
joining the dA:T and dG:dC pairs.
6-Amino-5-nitro3-(l'-p-D-2'-deoxyribofuranosyl)-2(1H)-pyridone (dZ)
implements the non-standard 'small' donordonor-acceptor (pyDDA)
hydrogen bonding pattern.
2-Amino-8-(1-beta-D-2'-deoxyribofuranosyl)imidazo[1,2-a]-1,3,5-triazin-4
(8H)-one [dP) implements the 'large' acceptor-acceptor-donor (puAAD)
pattern. These nucleobases were designed to present electron density to
the minor groove, density hypothesized to help determine specificity
for polymerases. Consistent with this hypothesis, both dZTP and dPTP
are accepted by many polymerases from both Families A and B. Further,
the dZ:dP pair participates in PCR reactions catalyzed by Taq, Vent
(exo(-)) and Deep Vent (exo-) polymerases, with 94.4%, 97.5% and 97.5%,
respectively, retention per round. The dZ:dP pair appears to be lost
principally via transition to a dC:dG pair. This is consistent with a
mechanistic hypothesis that deprotonated dZ (presenting a pyDAA
pattern) complements dG (presenting a puADD pattern), while protonated
dC (presenting a pyDDA pattern) complements dP (presenting a puAAD
pattern). This hypothesis, grounded in the Watson-Crick model for
nucleobase pairing, was confirmed by studies of the pH-dependence of
mismatching. The dZ:dP pair and these polymerases, should be useful in
dynamic architectures for sequencing, molecular-, systems- and
synthetic-biology.

The origin of proteins and nucleic acids
Ricardo, A
Benner, SA
Planets and Life: The Emerging Science of Astrobiology, ed. Woodruff T. Sullivan and John A. Baross, Cambridge University Press 154-173
(2007)

Alien biochemistries
Ward, PD
Benner, SA
Planets and Life: The Emerging Science of Astrobiology, ed. Woodruff T. Sullivan and John A. Baross, Cambridge University Press 537-544
(2007)

Synthesis and antiviral activity of 7-deazaneplanocin A against orthopoxviruses (vaccinia and cowpox virus)
Arumugham, B
Kim, HJ
Prichard, MN
Kern, ER
Chu, CK
Bioorg. Med. Chem. Lett. 16
(2)
285-287
(2006)
<Abstract>
An efficient method for the synthesis of 7-deazaneplanocin A (2) has
been accomplished by the condensation of cyclopentenol 3 with
6-chloro-7-deazapurine followed by subsequent functional group
manipulations. The synthesized 7-deazaneplanocin A (2) exhibited potent
antiviral activity against cowpox and vaccinia viruses without
cytotoxicity in HFF cells. (c) 2005 Elsevier Ltd. All rights reserved.

Integrating protein structures and precomputed genealogies in the Magnum database: Examples with cellular retinoid binding proteins
Bradley, ME
Benner, SA
BMC Bioinformatics 7 89
(2006)
<Abstract>
Background: When accurate models for the divergent evolution of protein
sequences are integrated with complementary biological information,
such as folded protein structures, analyses of the combined data often
lead to new hypotheses about molecular physiology. This represents an
excellent example of how bioinformatics can be used to guide
experimental research. However, progress in this direction has been
slowed by the lack of a publicly available resource suitable for
general use.
Results: The precomputed Magnum database offers a solution to this
problem for ca. 1,800 full-length protein families with at least one
crystal structure. The Magnum deliverables include 1) multiple sequence
alignments, 2) mapping of alignment sites to crystal structure sites,
3) phylogenetic trees, 4) inferred ancestral sequences at internal tree
nodes, and 5) amino acid replacements along tree branches.
Comprehensive evaluations revealed that the automated procedures used
to construct Magnum produced accurate models of how proteins
divergently evolve, or genealogies, and correctly integrated these with
the structural data. To demonstrate Magnum's capabilities, we asked for
amino acid replacements requiring three nucleotide substitutions,
located at internal protein structure sites, and occurring on short
phylogenetic tree branches. In the cellular retinoid binding protein
family a site that potentially modulates ligand binding affinity was
discovered. Recruitment of cellular retinol binding protein to function
as a lens crystallin in the diurnal gecko afforded another opportunity
to showcase the predictive value of a browsable database containing
branch replacement patterns integrated with protein structures.
Conclusion: We integrated two areas of protein science, evolution and
structure, on a large scale and created a precomputed database, known
as Magnum, which is the first freely available resource of its kind.
Magnum provides evolutionary and structural bioinformatics resources
that are useful for identifying experimentally testable hypotheses
about the molecular basis of protein behaviors and functions, as
illustrated with the examples from the cellular retinoid binding
proteins.

Analysis of transitions at two-fold redundant sites in mammalian genomes. Transition redundant approach-to-equilibrium (TREx) distance metrics
Li, T
Chamberlin, SG
Caraco, MD
Liberles, DA
Gaucher, EA
Benner, SA
BMC Evol. Biol. 6 25
(2006)
<Abstract>
Background: The exchange of nucleotides at synonymous sites in a gene
encoding a protein is believed to have little impact on the fitness of
a host organism. This should be especially true for synonymous
transitions, where a pyrimidine nucleotide is replaced by another
pyrimidine, or a purine is replaced by another purine. This suggests
that transition redundant exchange ( TREx) processes at the third
position of conserved two-fold codon systems might offer the best
approximation for a neutral molecular clock, serving to examine, within
coding regions, theories that require neutrality, determine whether
transition rate constants differ within genes in a single lineage, and
correlate dates of events recorded in genomes with dates in the
geological and paleontological records. To date, TREx analysis of the
yeast genome has recognized correlated duplications that established a
new metabolic strategies in fungi, and supported analyses of functional
change in aromatases in pigs. TREx dating has limitations, however.
Multiple transitions at synonymous sites may cause equilibration and
loss of information. Further, to be useful to correlate events in the
genomic record, different genes within a genome must suffer transitions
at similar rates.
Results: A formalism to analyze divergence at two fold redundant codon
systems is presented. This formalism exploits two-state
approach-to-equilibrium kinetics from chemistry. This formalism
captures, in a single equation, the possibility of multiple
substitutions at individual sites, avoiding any need to "correct" for
these. The formalism also connects specific rate constants for
transitions to specific approximations in an underlying evolutionary
model, including assumptions that transition rate constants are
invariant at different sites, in different genes, in different
lineages, and at different times. Therefore, the formalism supports
analyses that evaluate these approximations.
Transitions at synonymous sites within two-fold redundant coding
systems were examined in the mouse, rat, and human genomes. The key
metric (f(2)), the fraction of those sites that holds the same
nucleotide, was measured for putative ortholog pairs. A transition
redundant exchange ( TREx) distance was calculated from f(2) for these
pairs. Pyrimidine-pyrimidine transitions at these sites occur
approximately 14% faster than purine-purine transitions in various
lineages. Transition rate constants were similar in different genes
within the same lineages; within a set of orthologs, the f(2)
distribution is only modest overdispersed. No correlation between
disparity and overdispersion is observed. In rodents, evidence was
found for greater conservation of TREx sites in genes on the X
chromosome, accounting for a small part of the overdispersion, however.
Conclusion: The TREx metric is useful to analyze the history of
transition rate constants within these mammals over the past 100
million years. The TREx metric estimates the extent to which silent
nucleotide substitutions accumulate in different genes, on different
chromosomes, with different compositions, in different lineages, and at
different times.

Application of DETECTER, an Evolutionary Genomic Tool to Analyze Genetic Variation, to the Cystic Fibrosis Gene Family
Gaucher, EA
DeKee, DW
Benner, SA
BMC Genomics 7 44
(2006)
<Abstract>
Background: The medical community requires computational tools that
distinguish genetic differences having phenotypic impact within the
vast number of mutations that do not. Tools that do this will become
increasingly important for those seeking to use human genome sequence
data to predict disease, make prognoses, and customize therapy to
individual patients.
Results: An approach, termed DETECTER, is proposed to identify sites
in a protein sequence where amino acid replacements are likely to have
a significant effect on phenotype, including causing genetic
disease. This approach uses a model-dependent tool to estimate the
normalized replacement rate at individual sites in a protein sequence,
based on a history of those sites extracted from an evolutionary
analysis of the corresponding protein family. This tool identifies
sites that have higher-than-average, average, or lower- than-average
rates of change in the lineage leading to the sequence in the
population of interest. The rates are then combined with sequence data
to determine the likelihoods that particular amino acids were present
at individual sites in the evolutionary history of the gene
family. These likelihoods are used to predict whether any specific
amino acid replacements, if introduced at the site in a modern human
population, would have a significant impact on fitness. The DETECTER
tool is used to analyze the cystic fibrosis transmembrane conductance
regulator (CFTR) gene family.
Conclusions: In this system, DETECTER retrodicts amino acid
replacements associated with the cystic fibrosis disease with greater
accuracy than alternative approaches. While this result validates this
approach for this particular family of proteins only, the approach may
be applicable to the analysis of polymorphisms generally, including
SNPs in a human population.

The diverse biological functions of phosphatidylinositol transfer proteins in eukaryotes
Phillips, SE
Vincent, P
Rizzieri, KE
Schaaf, G
Bankaitis, VA
Gaucher, EA
Crit. Rev. Biochem. Mol. Biol. 41
(1)
21-49
(2006)
<Abstract>
Phosphatidylinositol/phosphatidylcholine transfer proteins (PITPs)
remain largely functionally uncharacterized, despite the fact that
they are highly conserved and are found in all eukaryotic cells thus
far examined by biochemical or sequence analysis approaches. The
available data indicate a role for PITPs in regulating specific
interfaces between lipid-signaling and cellular function. In this
regard, a role for PITPs in controlling specific membrane trafficking
events is emerging as a common functional theme. However, the
mechanisms by which PITPs regulate lipid-signaling and
membrane-trafficking functions remain unresolved. Specific PITP
dysfunctions are now linked to neurodegenerative and intestinal
malabsorbtion diseases in mammals, to stress response and
developmental regulation in higher plants, and to previously
uncharacterized pathways for regulating membrane trafficking in yeast
and higher eukaryotes, making it clear that PITPs are integral parts
of a highly conserved signal transduction strategy in
eukaryotes. Herein, we review recent progress in deciphering the
biological functions of PITPs, and discuss some of the open questions
that remain.
 Reaction of tetramethylimidazol-2-ylidene with (Tp(tBu,Me))YbE(thf) (E=I, CH2SMe3): simple adduct and a hydrocarbyl tethered carbene ligand
Ferrence, GM
Arduengo, AJ
Jockisch, A
Kim, HJ
McDonald, R
Takats, J
J. Alloys Compd. 418
(1-2)
184-188
(2006)
<Abstract>
Treatment of (Tp(tBu,Me))YbE(thf) (E=I (1), CH2SiMe3 (2)) with
tetramethylimidazol-2-ylidene (ImMe(4)) resulted in very different
outcomes depending on the nature of the anionic ligand E. ImMe(4) acts
as a simple Lewis base toward 1 resulting in substitution of the thf
(tetrahydrofuran) ligand and formation of (Tp(tBu,Me))YbI(ImMe(4)) (3).
However, reaction with 2, in addition to displacement of thf, proceeded
by metalation of one of the N-CH3 substituents of ImMe(4) and gave
(Tp(tBu,Me))Yb(ImMe(4))(CH2N(C(CH3)C(CH3)N(CH3)C) (4), featuring a
hydrocarbyl tethered carbene ligand. The X-ray structures of 3 and 4
are reported. (c) 2006 Published by Elsevier B.V.

2-Hydroxymethylboronate as a Reagent To Detect Carbohydrates: Application to the Analysis of the Formose Reaction
Ricardo, A
Frye, F
Carrigan, MA
Tipton, JD
Powell, DH
Benner, SA
J. Org. Chem. 71
(25)
9503-9505
(2006)
<Abstract>
2-Hydroxymethylphenylboronate is described as a reagent that converts
neutral 1,2-diols, as found in simple carbohydrates, into 1:1 anionic
complexes that are easily detected by Fourier transform ion cyclotron
resonance mass spectrometry. The value of this reagent was demonstrated
through its application to analyze complex mixtures of carbohydrates
formed in the formose process, often cited as a way that biologically
significant carbohydrates might have been generated from formaldehyde
under prebiotic conditions. Coupled with isotope studies, the reagent
shows that the simplest autocatalytic cycle for the consumption of
formaldehyde in this process cannot account for the bulk consumption of
formaldehyde.

Dynamic assembly of primers on nucleic acid templates
Leal, NA
Sukeda, M
Benner, SA
Nucl. Acids Res. 34 4702-4710
(2006)
<Abstract>
A strategy is presented that uses dynamic equlibria to assemble in situ
composite DNA polymerase primers, having lengths of 14 or 16 nt, from DNA
fragments that are 6 or 8 nt in length. In this implementation, the
fragments are transiently joined under conditions of dynamic equilibrium by
an imine linker, which has a dissociation constant of 1 µM. If a polymerase
is able to extend the composite, but not the fragments, it is possible to
prime the synthesis of a target DNA molecule under conditions where two
useful specificities are combined: (i) single nucleotide discrimination
that is characteristic of short oligonucleotide duplexes (four to six
nucleobase pairs in length), which effectively excludes single mismatches,
and (ii) an overall specificity of priming that is characteristic of long
(14 to 16mers) oligonucleotides, potentially unique within a genome. We
report here the screening of a series of polymerases that combine an
ability not to accept short primer fragments with an ability to accept the
long composite primer held together by an unnatural imine linkage. Several
polymerases were found that achieve this combination, permitting the
implementation of the dynamic combinatorial chemical strategy.

Artificially expanded genetic information system: a new base pair with an alternative hydrogen bonding pattern
Yang, ZY
Hutter, D
Sheng, PP
Sismour, AM
Benner, SA
Nucl. Acids Res. 34
(21)
6095-6101
(2006)
<Abstract>
To support efforts to develop a 'synthetic biology' based on an
artificially expanded genetic information system (AEGIS), we have
developed a route to two components of a non-standard nucleobase pair,
the pyrimidine analog
6-amino-5-nitro-3-(1'-beta-D-2'-deoxyribofuranosyl)-2(1H)-pyridone (dZ)
and its Watson-Crick complement, the purine analog
2-amino-8-(1'-beta-D-2'-deoxyribofuranosyl)-imidazo[1,2-a]-1,3,5-triazin
-4(8H)-one (dP). These implement the pyDDA:puAAD hydrogen bonding
pattern (where 'py' indicates a pyrimidine analog and 'pu' indicates a
purine analog, while A and D indicate the hydrogen bonding patterns of
acceptor and donor groups presented to the complementary nucleobases,
from the major to the minor groove). Also described is the synthesis of
the triphosphates and protected phosphoramidites of these two
nucleosides. We also describe the use of the protected phosphoramidites
to synthesize DNA oligonucleotides containing these AEGIS components,
verify the absence of epimerization of dZ in those oligonucleotides,
and report some hybridization properties of the dZ:dP nucleobase pair,
which is rather strong, and the ability of each to effectively
discriminate against mismatches in short duplex DNA.

A review: Synthesis of aryl C-glycosides via the heck coupling reaction
Wellington, KW
Benner, SA
Nuc. Nuc. Nuc. acids 25
(12)
1309-1333
(2006)
<Abstract>
In this article, we focus on the synthesis of aryl C-glycosides via
Heck coupling. It is organized based on the type of structures used in
the assembly of the C-glycosides (also called C-nucleosides) with the
following subsections: pyrimidine C-nucleosides, purine C-nucleosides,
and monocyclic, bicyclic, and tetracyclic C-nucleosides. The reagents
and conditions used for conducting the Heck coupling reactions are
discussed. The subsequent conversion of the Heck products to the
corresponding target molecules and the application of the target
molecules are also described.

Desorption/ionization on porous silicon mass spectrometry studies on pentose-borate complexes
Li, Q
Ricardo, A
Benner, SA
Winefordner, JD
Powell, DH
Anal. Chem. 77
(14)
4503-4508
(2005)
<Abstract>
Desorption/ionization on porous silicon mass spectrometry (DIOS-MS) was
used to investigate the binding affinities between aldopentose isomers
and boron. Boron has been recognized for its importance in pentose
synthesis and stabilization in prebiotic conditions. Boron may also
account for the fact that ribose, among other aldopentoses, is the
favored building block in RNA synthesis. This research started with the
detection of aldopentoses in the positive mode through cationization
and the aldopentose-borate complexes in the negative mode. Then two
competition schemes, one using a pentose structure analogue and the
other using C-13-labeled ribose, were designed to compare the relative
binding affinities of four aldopentoses (xylose, lyxose, arabinose, and
ribose) to boron. Both approaches determined the binding preference to
be ribose > lyxose > arabinose > xylose. This work illustrates the
potential of DIOS-MS in the analyses of nonvolatile, small molecules in
delicate chemical equilibria. Without externally introduced matrices,
background signals are not a limiting factor. Furthermore, the possible
dramatic change of pH associated with the matrix introduction, which
may disturb the equilibria of interest, is avoided.

Phylogenomic approaches to common problems encountered in the analysis of low copy repeats: The sulfotransferase IA gene family example
Bradley, ME
Benner, SA
BMC Evol. Biol. 5 22
(2005)
<Abstract>
Background: Blocks of duplicated genomic DNA sequence longer than 1000
base pairs are known as low copy repeats (LCRs). Identified by their
sequence similarity, LCRs are abundant in the human genome, and are
interesting because they may represent recent adaptive events, or
potential future adaptive opportunities within the human lineage.
Sequence analysis tools are needed, however, to decide whether these
interpretations are likely, whether a particular set of LCRs represents
nearly neutral drift creating junk DNA, or whether the appearance of
LCRs reflects assembly error. Here we investigate an LCR family
containing the sulfotransferase (SULT) IA genes involved in drug
metabolism, cancer, hormone regulation, and neurotransmitter biology as
a first step for defining the problems that those tools must manage.
Results: Sequence analysis here identified a fourth sulfotransferase
gene, which may be transcriptionally active, located on human
chromosome 16. Four regions of genomic sequence containing the four
human SULTIA paralogs defined a new LCR family. The stem hominoid
SULTIA progenitor locus was identified by comparative genomics
involving complete human and rodent genomes, and a draft chimpanzee
genome. SULTIA expansion in hominoid genomes was followed by positive
selection acting on specific protein sites. This episode of adaptive
evolution appears to be responsible for the dopamine sulfonation
function of some SULT enzymes. Each of the conclusions that this
bioinformatic analysis generated using data that has uncertain
reliability (such as that from the chimpanzee genome sequencing
project) has been confirmed experimentally or by a "finished"
chromosome 16 assembly, both of which were published after the
submission of this manuscript.
Conclusion: SULTIA genes expanded from one to four copies in hominoids
during intra-chromosomal LCR duplications, including (apparently) one
after the divergence of chimpanzees and humans. Thus, LCRs may provide
a means for amplifying genes (and other genetic elements) that are
adaptively useful. Being located on and among LCRs, however, could make
the human SULTIA genes susceptible to further duplications or deletions
resulting in 'genomic diseases' for some individuals. Pharmacogenomic
studies of SULTIAsingle nucleotide polymorphisms, therefore, should
also consider examining SULTIA copy number variability when searching
for genotype-phenotype associations. The latest duplication is,
however, only a substantiated hypothesis; an alternative explanation,
disfavored by the majority of evidence, is that the duplication is an
artifact of incorrect genome assembly.

One-pot glycosylation (OPG) for the chemical synthesis of oligosaccharides
Yu, B
Yang, ZY
Cao, HZ
Curr. Org. Chem. 9
(2)
179-194
(2005)
<Abstract>
This review provides a comprehensive survey of the "one pot
glycosylation" (OPG) strategy for the chemical synthesis of
oligosaccharides, covering literatures from the first example reported
by Kahne and Raghavan in 1993 through May 2003. The essence of the OPG
is to distinguish the reactivity difference of a pair of the
glycosylation donors or acceptors so as to carry out two glycosylation
steps sequentially without purification of the first-step coupling
product. Accordingly, the literature reports are grouped based on the
major stereoelectronic factors causing the reactivity differences,
those include the "armed-disarmed effect", "orthogonality of leaving
groups", "distinguishable acceptors", and "the hybrid". "The hybrid"
OPG procedure takes advantage of a combination of the reactivity
disparity of a set of the armed-disarmed donors, orthogonal leaving
groups, as well as acceptors so as to proceed three or more steps of
glycosylation sequentially in one pot. Relevant conception and
exploitation of the reactivity differences of the donors and acceptors
in the synthesis of oligosaccharides, which finally evolve the OPG or
advance parallelly, are briefly described at the beginning.

Synthetic biology
Sismour, AM
Benner, SA
Expert Opin. Biol. Ther. 5
(11)
1409-1414
(2005)
<Abstract>
Chemistry is a broadly powerful discipline in contemporary science
because it has the ability to create new forms of the matter that it
studies. By doing so, chemistry can test models that connect molecular
structure to behaviour without having to rely on what nature has
provided. This creation, known as synthesis', began to be applied to
living systems in the 1980s as recombinant DNA technologies allowed
biologists to deliberately change the molecular structure of the
microbes that they studied, and automated chemical synthesis of DNA
became widely available to support these activities. The impact of the
information that has emerged has made biologists aware of a truism that
has long been known in chemistry: synthesis drives discovery and
understanding in ways that analysis cannot. Synthetic biology is now
setting an ambitious goal: to recreate in artificial systems the
emergent properties found in natural biology. By doing so, it is
advancing our understanding of the molecular basis of genetics in ways
that analysis alone cannot. More practically, it has yielded artificial
genetic systems that improve the healthcare of some 400,000 Americans
annually. Synthetic biology is now set to take the next step, to create
artificial Darwinian systems by direct construction. Supported by the
National Science Foundation as part of its Chemical Bonding program,
this work cannot help but generate clarity in our understanding of how
biological systems work.

Planetary systems biology
Benner, SA
Ricardo, A
Mol. Cell 17
(4)
471-472
(2005)
<Abstract>
Combining paleogenetics, protein engineering, synthetic biology, and
metabolic modeling, a planetary biology perspective is brought to bear
on adaptive evolutionary events in ancient bacteria.

A call for likelihood phylogenetics even when the process of sequence evolution is heterogeneous
Gaucher, EA
Miyamoto, MM
Mol. Phylogenet. Evol. 37
(3)
928-931
(2005)
<Abstract>
All methods of phylogenetic inference make assumptions about the
underlying evolutionary process of their characters and it is
these assumptions that determine their relative successes and
failures in the estimation of the true phylogeny for a group.
This dependency of phylogenetic accuracy and robustness on
evolutionary assumptions has been most extensively studied for
the classic case of Felsenstein (1978) and its four-taxon
phylogeny with two long, unrelated, terminal branches
interspersed with two short ones. Given this model phylogeny,
"long branch attraction" can occur and thereby lead to the
convergence of a phylogenetic method onto an incorrect tree with
the two long and two short terminal branches directly connected
rather than interspersed. The extent to which a particular
phylogenetic method is susceptible to this problem depends on
what assumptions it makes about the evolution of the characters
and data themselves.

The use of thymidine analogs to improve the replication of an extra DNA base pair: a synthetic biological system
Sismour, AM
Benner, SA
Nucl. Acids Res. 33 5640-5646
(2005)
<Abstract>
Synthetic biology based on a six-letter genetic alphabet that
includes the two non-standard nucleobases isoguanine (isoG) and
isocytosine (isoC), as well as the standard A, T, G and C, is
known to suffer as a consequence of a minor tautomeric form of
isoguanine that pairs with thymine, and therefore leads to
infidelity during repeated cycles of the PCR. Reported here is a
solution to this problem. The solution replaces thymidine
triphosphate by 2-thiothymidine triphosphate (2-thioTTP). Because
of the bulk and hydrogen bonding properties of the thione unit in
2-thioT, 2-thioT does not mispair effectively with the minor
tautomer of isoG. To test whether this might allow PCR
amplification of a six-letter artificially expanded genetic
information system, we examined the relative rates of
misincorporation of 2-thioTTP and TTP opposite isoG using affinity
electrophoresis. The concentrations of isoCTP and 2-thioTTP were
optimal to best support PCR amplification using thermostable
polymerases of a six-letter alphabet that includes the isoC-isoG
pair. The fidelity-per-round of amplification was found to be
approximately 98% in trial PCRs with this six-letter DNA
alphabet. The analogous PCR employing TTP had a fidelity-per-round
of only approximately 93%. Thus, the A, 2-thioT, G, C, isoC, isoG
alphabet is an artificial genetic system capable of Darwinian
evolution.

Resurrecting ancestral alcohol dehydrogenases from yeast
Thomson, JM
Gaucher, EA
Burgan, MF
De Kee, DW
Li, T
Aris, JP
Benner, SA
Nature Genet. 37
(6)
630-635
(2005)
<Abstract>
Modern yeast living in fleshy fruits rapidly convert sugars into
bult ethanol through pyruvate. Pyruvate loses carbon dioxide to
become acetaldehyde, which is reduced by alcohol dehydrogenase 1
(Adh1) to ethanol, which accumulates. Yeast later consumes the
accumulated ethanol, exploiting Adh2, an Adh1 homolog differing by
24 (of 348) amino acids. Because many microorganisms cannot grow
in ethanol, accumulated ethanol may help yeast defend resources in
the fruit. We report here the reconstruction of the last common
ancestor of Adh1 and Adh2, called AdhA. The kinetic behavior of
AdhA suggests that it was optimized to make (not consume) ethanol.
This is consistent with the hypothesis that before the Adh1-Adh2
duplication, yeast did not accumulate ethanol for later consumption
but rather used AdhA to recycle NADH generated in the glycolytic
pathway. Silent nucleotide dating suggests that the Adh1-Adh2
duplication occurred near the time of duplication of several other
proteins involved in the accumulation of ethanol, possibly in the
Cretaceous age when fleshy fruits arose. These results help to
connect the chemical behavior of these enzymes through systems
analysis to a time of global ecosystem change, a small but useful
step towards a planetary systems biology.
 Synthetic Biology
Sismour, AM
Benner, SA
Nat. Rev. Genet. 6 533-543
(2005)
<Abstract>
Synthetic biologists come in two broad classes. One uses unnatural
molecules to reproduce emergent behaviours from natural biology,
with the goal of creating artificial life. The other seeks
interchangeable parts from natural biology to assemble into
systems that function unnaturally. Either way, a synthetic goal
forces scientists to cross uncharted ground to encounter and solve
problems that are not easily encountered through analysis. This
drives the emergence of new paradigms in ways that analysis cannot
easily do. Synthetic biology has generated diagnostic tools that
improve the care of patients with infectious diseases, as well as
devices that oscillate, creep and play tic-tac-toe.

Selection influences the proteomic usage of a majority of amino acids.
Brooks, DJ
Singh, M
Fresco, JR
(2005)
Submitted
<Abstract>
<Tables and figures>
Background: The comparison of observed and expected amino acid frequencies has been used historically to establish the influence of both neutral evolution and selection on protein sequences. With the availability of many complete proteomes, it is an opportune time to reexamine this issue by determining whether there is consistent cross-proteomic occurrence of bias in, and thus evidence for the action of selection upon, amino acid frequencies.
Results: We determined which amino acids consistently occur at biased frequencies across twenty-six proteomes encoded by genomes of wide-ranging GC content. More than half of the twenty amino acids were found to occur at biased frequencies: Lys, Asp, Glu, Ala, and Phe are over-represented and Arg, Ser, Cys, His, Thr, and Trp are underrepresented in a non-random number of the proteomes. Three of these eleven amino acids, Phe, Thr, and Trp, had not been previously identified as biased, and some amino acids formerly identified as biased were reassessed in light of the current data. In ten proteomes with only moderate nucleotide bias, Met, Ile, and Pro were additionally found to display a consistent bias in frequency, bringing the total number of biased amino acids to fourteen.
Conclusions: Genomic analysis provides a more sensitive means of detecting amino acid frequencies that deviate from the neutral expectation than do methods that have been applied in the past to more limited data sets. Our finding that the majority of amino acids are present at biased frequencies suggests that selection plays a greater role than previously appreciated in determining amino acid frequencies. Knowledge of which amino acids are most prone to selection may contribute to methods for predicting residues that are undergoing selection within proteins.

Inferred thermophily of the Last Universal Ancestor based on estimated amino acid composition
Brooks, DJ
Gaucher, EA
(2005)
Submitted
<Abstract>
The environmental temperature of the last universal ancestor (LUA) of all extant organisms is the subject of heated debate. Because the amino acid composition of proteins differs between mesophiles and thermophiles, the inferred amino acid composition of proteins in the LUA could be used to classify it as one or the other. We applied expectation maximization (EM) to estimate the amino acid composition of a set of thirty-one proteins in the LUA based on alignments of their modern day descendants, a phylogenetic tree relating those descendants and a model of evolution. Separate estimates of amino acid composition in LUA proteins were derived using modern day sequences of eight mesophilic species, eight thermophilic species or the sixteen species combined. We show that the relative mean Euclidean distance between the amino acid composition in one species and that of a set of mesophiles or thermophiles can be employed as a classifier with 100% accuracy. Applying this classifier to the estimated amino acid composition of the ancestral protein set in the LUA, we find it to be classified as a thermophile even when only the proteins of mesophilic species are used to derive the estimate. Based on the estimated amino acid composition of proteins in the LUA, we infer that it was a thermophile. We discuss our findings in the context of previous data pertaining to the OGT of the LUA, particularly the inferred G + C content of its rRNA. We conclude that the gathering evidence strongly supports a thermophilic LUA.

Understanding nucleic acids using synthetic chemistry
Benner, SA
Acc. Chem. Res. 37
(10)
784-797
(2004)
<Abstract>
This Account describes work done in these laboratories that has used
synthetic, physical organic, and biological chemistry to understand the
roles played by the nucleobases, sugars, and phosphates of DNA in the
molecular recognition processes central to genetics. The number of
nucleobases has been increased from 4 to 12, generating an artificially
expanded genetic information system. This system is used today in the
clinic to monitor the levels of HIV and hepatitis C viruses in
patients, helping to manage patient care. Work with uncharged phosphate
replacements suggests that a repeating charge is a universal feature of
genetic molecules operating in water and will be found in
extraterrestrial life (if it is ever encountered). The use of ribose
may reflect prebiotic processes in the presence of borate-containing
minerals, which stabilize ribose formed from simple organic precursors.
A new field, synthetic biology, is emerging on the basis of these
experiments, where chemistry mimics biological processes as complicated
as Darwinian evolution.

Cytoplasmic glycosylation of protein-hydroxyproline and its relationship to other glycosylation pathways
West, CM
van der Wel, H
Sassi, S
Gaucher, EA
Biochim. Biophys. Acta 1673
(1-2)
29-44
(2004)
<Abstract>
The Skp1 protein, best known as a subunit of E3(SCF)-ubiquitin ligases, is subject to complex glycosylation in the cytoplasm of the cellular slime mold Dictyostelium. Pro143 of this protein is sequentially modified by a prolyl hydroxylase and five soluble glycosyltransferases (GT), to yield the structure Galalpha1,Galalpha1,3Fucalpha1,2Galbeta1,3GlcNAcalpha1-HyPro143. These enzymes are unusual in that they are expressed in the cytoplasmic compartment of the cell, rather than the secretory pathway where complex glycosylation of proteins usually occurs. The first enzyme in the pathway appears to be related to the soluble animal prolyl 4-hydroxylases (P4H), which modify the transcriptional factor subunit HIF-1alpha in the cytoplasm, and more distantly to the P4Hs that modify collagen and other proteins in the rER, based on biochemical and informatics analyses. The soluble alphaGlcNAc-transferase acting on Skp1 has been cloned and is distantly related to the mucin-type polypeptide N-acetyl-alpha-galactosaminyltransferase in the Golgi of animals. Its characterization has led to the discovery of a family of related polypeptide N-acetyl-alpha-glucosaminyltransferases in the Golgi of selected lower eukaryotes. The Skp1 GlcNAc is extended by a bifunctional diglycosyltransferase that sequentially and apparently processively adds beta1,3Gal and alpha1,2Fuc. Though this structure is also formed in the animal secretory pathway, the GTs involved are dissimilar. Conceptual translation of available genomes suggests the existence of this kind of complex cytoplasmic glycosylation in other eukaryotic microorganisms, including diatoms, oomycetes, and possibly Chlamydomonas and Toxoplasma, and an evolutionary precursor of this pathway may also occur in prokaryotes. (C) 2004 Elsevier B.V. All rights reserved.

Quantitative analysis of a RNA-cleaving DNA catalyst obtained via in vitro selection
Carrigan, MA
Ricardo, A
Ang, DN
Benner, SA
Biochemistry 43
(36)
11446-11459
(2004)
<Abstract>
In vitro selections performed in the presence of Mg2+ generated DNA
sequences capable of cleaving an internal ribonucleoside linkage.
Several of these, surprisingly, displayed intermolecular catalysis and
catalysis independent of Mg2+, features that the selection protocol was
not explicitly designed to select. A detailed physical organic analysis
was applied to one of these DNAzymes, termed 614. First, the progress
curve for the reaction was dissected to identify factors that prevented
the molecule from displaying clean first-order transformation kinetics
and 100% conversion. Several factors were identified and quantitated,
including (a) competitive intra- and intermolecular rate processes, (b)
alternative reactive and unreactive conformations, and (c) mutations
within the catalyst. Other factors were excluded, including "approach
to equilibrium" kinetics and product inhibition. The possibility of
complementary strand inhibition was demonstrated but was shown to not
be a factor under the conditions of these experiments. The rates of the
intra- and intermolecular processes were compared, and saturation
models for the intermolecular process were built. The rate-limiting
step for the intermolecular reaction was found to be the association/
folding of the enzyme with the substrate and not the cleavage step. The
DNAzyme 614 is more active in trans than in cis and more active at
temperatures below the selection temperature than at the selection
temperature. Many of these properties have not been reported in similar
systems; these results therefore expand the phenomenology known for
this class of DNA-based catalysts. A brief survey of other catalysts
arising from this selection found other Mg2+-independent DNAzymes and
provided a preliminary view of the ruggedness of the landscape,
relating function to structure in sequence space. Hypotheses are
suggested to account for the fact that a selection in the presence of
Mg2+ did not exploit this Mg2+. This study of a specific catalytically
active DNAzyme is an example of studies that will be necessary
generally to permit in vitro selection to help us understand the
distribution of function in sequence space.

A novel method for estimating ancestral amino acid composition and its application to proteins of the Last Universal Ancestor.
Brooks, DJ
Fresco, JR
Singh, M
Bioinformatics 20 2251-2257
(2004)
<Abstract>
Motivation: Knowledge of how proteomic amino acid composition has changed over time is important for constructing realistic models of protein evolution and increasing our understanding of molecular evolutionary history. The proteomic amino acid composition of the Last Universal Ancestor (LUA) of life is of particular interest, since that might provide insight into the early evolution of proteins and the nature of the LUA itself.
Results: We introduce a method to estimate ancestral amino acid composition that is based on expectation.maximization. On simulated data, the approach was found to be very effective in estimating ancestral amino acid composition, with accuracy improving as the number of residues in the dataset was increased. The method was then used to infer the amino acid composition of a set of proteins in the LUA. In general, as compared with the modern protein set, LUA proteins were found to be richer in amino acids that are believed to have been most abundant in the prebiotic environment and poorer in those believed to have been unavailable or scarce. Additionally, we found the inferred amino acid composition of this protein set in the LUA to be more similar to the observed composition of the same set in extant thermophilic species than in extant mesophilic species, supporting the idea that the LUA lived in a thermophilic environment.
Availability: The program is available at http://compbio.cs.princeton.edu/ancestralaa

The planetary biology of cytochrome P450 aromatases
Gaucher, EA
Graddy, LG
Li, T
Simmen, RC
Simmen, FA
Schreiber, DR
Liberles, DA
Janis, CM
Benner, SA
BMC Biology 2
(1)
19
(2004)
<Abstract>
BACKGROUND: Joining a model for the molecular evolution of a
protein family to the paleontological and geological records
(geobiology), and then to the chemical structures of substrates,
products, and protein folds, is emerging as a broad strategy for
generating hypotheses concerning function in a post-genomic
world. This strategy expands systems biology to a planetary
context, necessary for a notion of fitness to underlie (as it
must) any discussion of function within a biomolecular
system.
RESULTS: Here, we report an example of such an expansion,
where tools from planetary biology were used to analyze three
genes from the pig Sus scrofa that encode cytochrome P450
aromatases-enzymes that convert androgens into estrogens. The
evolutionary history of the vertebrate aromatase gene family was
reconstructed. Transition redundant exchange silent substitution
metrics were used to interpolate dates for the divergence of
family members, the paleontological record was consulted to
identify changes in physiology that correlated in time with the
change in molecular behavior, and new aromatase sequences from
peccary were obtained. Metrics that detect changing function in
proteins were then applied, including KA/KS values and those
that exploit structural biology. These identified specific amino
acid replacements that were associated with changing substrate
and product specificity during the time of presumed adaptive
change. The combined analysis suggests that aromatase paralogs
arose in pigs as a result of selection for Suoidea with larger
litters than their ancestors, and permitted the Suoidea to
survive the global climatic trauma that began in the
Eocene.
CONCLUSIONS: This combination of bioinformatics analysis,
molecular evolution, paleontology, cladistics, global
climatology, structural biology, and organic chemistry serves as
a paradigm in planetary biology. As the geological,
paleontological, and genomic records improve, this approach
should become widely useful to make systems biology statements
about high-level function for biomolecular systems.

Multiplexed genetic analysis using an expanded genetic alphabet
Johnson, SC
Marshall, DJ
Harms, G
Miller, CM
Sherrill, CB
Beaty, EL
Lederer, SA
Roesch, EB
Madsen, G
Hoffman, GL
Laessig, RH
Kopish, GJ
Baker, MW
Benner, SA
Farrell, PM
Prudent, JR
Clin. Chem. 50
(11)
2019-2027
(2004)
<Abstract>
Background: All states require some kind of testing for newborns, but
the policies are far from standardized. In some states, newborn
screening may include genetic tests for a wide range of targets, but
the costs and complexities of the newer genetic tests inhibit expansion
of newborn screening. We describe the development and technical
evaluation of a multiplex platform that may foster increased newborn
genetic screening.
Methods: MultiCode(R) PLx involves three major steps: PCR,
target-specific extension, and liquid chip decoding. Each step is
performed in the same reaction vessel, and the test is completed in
similar to3 h. For site-specific labeling and room-temperature
decoding, we use an additional base pair constructed from isoguanosine
and isocytidine. We used the method to test for mutations within the
cystic fibrosis transmembrane conductance regulator (CFTR) gene. The
developed test was performed manually and by automated liquid handling.
Initially, 225 samples with a range of genotypes were tested
retrospectively with the method. A prospective study used samples from
>400 newborns.
Results: In the retrospective study, 99.1% of samples were correctly
genotyped with no incorrect calls made. In the perspective study, 95%
of the samples were correctly genotyped for all targets, and there were
no incorrect calls.
Conclusions: The unique genetic multiplexing platform was successfully
able to test for 31 targets within the CFTR gene and provides accurate
genotype assignments in a clinical setting. (C) 2004 American
Association for Clinical Chemistry.

Is there a common chemical model for life in the universe?
Benner, SA
Ricardo, A
Carrigan, MA
Curr. Op. Chem Biol. 8
(6)
672-689
(2004)
<Abstract>
A review of organic chemistry suggests that life, a chemical system
capable of Darwinian evolution, may exist in a wide range of
environments. These include non-aqueous solvent systems at low
temperatures, or even supercritical dihydrogen-helium mixtures. The
only absolute requirements may be a thermodynamic disequilibrium and
temperatures consistent with chemical bonding. A solvent system,
availability of elements such as carbon, hydrogen, oxygen and nitrogen,
certain thermodynamic features of metabolic pathways, and the
opportunity for isolation, may also define habitable environments. If
we constrain life to water, more specific criteria can be proposed,
including soluble metabolites, genetic materials with repeating
charges, and a well defined temperature range.
 Contribution of a conserved phenylalanine residue to the activity of Escherichia coli uracil DNA glycosylase
Shaw, RW
Feller, JA
Bloom, LB
DNA Repair 3
(10)
1273-1283
(2004)
<Abstract>
Uracil DNA glycosylase (UDG) excises uracil from DNA to initiate repair
of this lesion. This important DNA repair enzyme is conserved in
viruses, bacteria, and eukaryotes. One residue that is conserved among
all the members of the UDG family is a phenylalanine that stacks with
uracil when it is flipped out of the DNA helix into the enzyme active
site. To determine what contribution this conserved Phe residue makes
to the activity of UDG, Phe-77 in the Escherichia coli enzyme was
mutated to three different amino acid residues, alanine (UDG-F77A),
asparagine (UDG-F77N), and tyrosine (UDG-F77Y). The effects of these
mutations were measured on the steady-state and pre-steady-state
kinetics of uracil excision in addition to enzyme-DNA binding kinetics.
The overall excision activity of each of the mutants was reduced
relative to the wild-type enzyme; however, each mutation gave rise to a
different kinetic phenotype with different effects on substrate binding
and catalysis. The excision activity of UDG-F77N was the most severely
compromised, but this enzyme still bound to uracil-containing DNA at
about the same rate as wild-type UDG. In contrast, the decrease in the
excision activity of UDG-F77A is likely to reflect a greater reduction
in uracil-DNA binding than in the catalytic step. Overall, the effects
of the mutations on catalysis are best correlated with the polarity of
the substituted residue such that an increase in polarity decreases the
efficiency of uracil excision. (C) 2004 Elsevier B.V. All rights
reserved.
 Significance of cytoplasmic prolyl hydroxylation and complex glycosylation in the cellular slime mold Dictyostelium
West, CM
van der Wel, H
Sassi, S
Gaucher, E
Ercan, A
Glycobiology 14
(11)
1063-1063
(2004)

Expanding the genetic alphabet: Pyrazine nucleosides that support a donor-donor-acceptor hydrogen-bonding pattern
von Krosigk, U
Benner, SA
Helv. Chim. Acta 87
(6)
1299-1324
(2004)
<Abstract>
The 6-aminopyrazin-2(1H)-one, when incorporated as a pyrimidine-base
analog into an oligonucleotide chain, presents a H-bond donor- donor-
acceptor pattern to a complementary DNA or RNA strand. When paired with
the corresponding acceptor-acceptor-donor purine in oligonucleotides,
the heterocycle selectively contributes to the stability of the duplex,
presumably by forming a base pair of Watson-Crick geometry joined by a
nonstandard H-bonding pattern, expanding the genetic alphabet. Reported
here is a short, high yielding, beta-D-selective synthesis of a
6-aminopyrazin-2(l H) -one nucleoside via the glycine riboside
derivative 28. The key steps include a Wittig-Horner reaction of an
appropriately protected ribose derivative (Scheme 10, 19 --> 21)
followed by a Michael-like ring closure (Scheme 12, 30 --> la and 32
--> 1b). Thus, a variety of pyrazine nucleosides (Scheme 73) including
the target 6-aminopyrazin-2(1H)-one riboside la, and its 5-methyl
derivative 1b, 6-amino-5-methylpyrazin-2(1H)-one riboside, are obtained.

Human ATP:Cob(I)alamin adenosyltransferase and its interaction with methionine synthase reductase
Leal, NA
Olteanu, H
Banerjee, R
Bobik, TA
J. Biol. Chem. 279
(46)
47536-47542
(2004)
<Abstract>
The final step in the conversion of vitamin B(12) into coenzyme B(12)
(adenosylcobalamin, AdoCbl) is catalyzed by ATP:cob(I)alamin
adenosyltransferase (ATR). Prior studies identified the human ATR and
showed that defects in its encoding gene underlie cblB methylmalonic
aciduria. Here two common polymorphic variants of the ATR that are
found in normal individuals are expressed in Escherichia coli,
purified, and partially characterized. The specific activities of ATR
variants 239K and 239M were 220 and 190 nmol min(-1) mg(-1), and their
K(m) values were 6.3 and 6.9 mum for ATP and 1.2 and 1.6 mum for
cob(I)alamin, respectively. These values are similar to those obtained
for previously studied bacterial ATRs indicating that both human
variants have sufficient activity to mediate AdoCbl synthesis in
vivo. Investigations also showed that purified recombinant human
methionine synthase reductase (MSR) in combination with purified ATR
can convert cob(II)alamin to AdoCbl in vitro. In this system, MSR
reduced cob(II)alamin to cob(I)alamin that was adenosylated to AdoCbl
by ATR. The optimal stoichiometry for this reaction was approximately
4 MSR/ATR and results indicated that MSR and ATR physically interacted
in such a way that the highly reactive reaction intermediate
[cob(I)alamin] was sequestered. The finding that MSR reduced
cob(II)alamin to cob(I)alamin for AdoCbl synthesis (in conjunction
with the prior finding that MSR reduced cob(II)alamin for the
activation of methionine synthase) indicates a dual physiological role
for MSR.

Empirical analysis of protein insertions and deletions determining parameters for the correct placement of gaps in protein sequence alignments
Chang, MSS
Benner, SA
J. Mol. Biol. 341
(2)
617-631
(2004)
<Abstract>
To understand how protein segments are inserted and deleted during
divergent evolution, a set of pairwise alignments contained exactly one
gap, and therefore arising from the first insertion-deletion (indel)
event in the time separating the homologs, was examined. The alignments
showed that "structure breaking" amino acids (PGDNS) were preferred
within and flanking gapped regions, as are two residues with
hydrophilic side-chains (QE) that frequently occur at the surface of
protein folds. Conversely, hydrophobic residues (FMILYVW) occur
infrequently within and flanking the gapped region. These preferences
are modestly different in protein pairs separated by an episode of
adaptive evolution, than in pairs diverging under strong functional
constraints. Surprisingly, regions near an indel have not evolved more
rapidly than the sequence pair overall, showing no evidence that an
indel event must be compensated by local amino acid replacement. The
gap-lengths are best approximated by a Zipfian distribution, with the
probability of a gap of length L decreasing as a function of L-1.8.
These features are largely independent of the length of the gap and the
extent of divergence (measured by both silent and non-silent sequence
changes) separating the two proteins. Surprisingly, amino acid repeats
were discovered in more than a third of the polypeptide segments in and
around the gap. These correspond to repeats in the DNA sequence. This
suggests that a signature of the mechanism by which indels occur in the
DNA sequence remains in the encoded protein sequences. These data
suggest specific tools to score gap placement in an alignment. They
also suggest tools that distinguish true indels from gaps created by
mistaken gene finding, including under-predicted and overpredicted
introns. By providing mechanisms to identify errors, the tools will
enhance the value of genome sequence databases in support of integrated
paleogenomics strategies used to extract functional information in a
post-genomic environment.

The Sup35 domains required for maintenance of weak, strong or undifferentiated yeast [PSI+] prions
Bradley, ME
Liebman, SW
Mol. Microbiol. 51
(6)
1649-1659
(2004)
<Abstract>
The Sup35 protein can exist in a non-infectious form or in various
infectious forms called [PSI+] prion variants (or prion strains). Each
of the different [PSI+] prion variants converts non-infectious Sup35
molecules into that prion variant's infectious form. One definition of
a 'prion domain' is the minimal fragment of a prion protein that is
necessary and sufficient to maintain the prion form. We now demonstrate
that the Sup35 N region (residues 1-123), which is frequently referred
to as the 'prion domain', is insufficient to maintain the weak or
strong [PSI (+)] variants per se, but appears to maintain them in an
'undifferentiated'[PSI+] state that can differentiate into weak or
strong [PSI+] variants when transferred to the full-length Sup35
protein. In contrast, Sup35 residues 1-137 are necessary and sufficient
to faithfully maintain weak or strong [PSI+] variants. This implicates
Sup35 residues 124-137 in the variant-specific maintenance of the weak
or strong [PSI+] forms. Structure predictions indicate that the
residues in the 124-137 region form an alpha-helix and that the 1-123
region may have beta structure. In view of these findings, we discuss a
plausible molecular basis for the [PSI+] prion variants as well as the
inherent difficulties in defining a 'prion domain'.

Probing minor groove recognition contacts by DNA polymerases and reverse transcriptases using 3-deaza-2 '-deoxyadenosine
Hendrickson, CL
Devine, KG
Benner, SA
Nucl. Acids Res. 32
(7)
2241-2250
(2004)
<Abstract>
Standard nucleobases all present electron density as an unshared pair
of electrons to the minor groove of the double helix. Many heterocycles
supporting artificial genetic systems lack this electron pair. To
determine how different DNA polymerases use the pair as a substrate
specificity determinant, three Family A polymerases, three Family B
polymerases and three reverse transcriptases were examined for their
ability to handle 3-deaza-2'-deoxyadenosine (c(3)dA), an analog of
2'-deoxyadenosine lacking the minor groove electron pair. Different
polymerases differed widely in their interaction with c(3)dA. Most
notably, Family A and Family B polymerases differed in their use of
this interaction to exploit their exonuclease activities. Significant
differences were also found within polymerase families. This plasticity
in polymerase behavior is encouraging to those wishing to develop a
synthetic biology based on artificial genetic systems. The differences
also suggest either that Family A and Family B polymerases do not share
a common ancestor, that minor groove contact was not used by that
ancestor functionally or that this contact was not sufficiently
critical to fitness to have been conserved as the polymerase families
diverged. Each interpretation is significant for understanding the
planetary biology of polymerases.

PCR amplification of DNA containing non-standard base pairs by variants of reverse transcriptase from Human Immunodeficiency Virus-1
Sismour, AM
Lutz, S
Park, JH
Lutz, MJ
Boyer, PL
Hughes, SH
Benner, SA
Nucl. Acids Res. 32 728-735
(2004)
<Abstract>
As the next step towards generating a synthetic biology from
artificial genetic information systems, we have examined variants
of HIV reverse transcriptase (RT) for their ability to synthesize
duplex DNA incorporating the non-standard base pair between
2,4-diaminopyrimidine (pyDAD), a pyrimidine presenting a hydrogen
bond 'donor-acceptor-donor' pattern to the complementary base,
and xanthine (puADA), a purine presenting a hydrogen bond
'acceptor-donor-acceptor' pattern. This base pair fits the
Watson-Crick geometry, but is joined by a pattern of hydrogen
bond donor and acceptor groups different from those joining the
GC and AT pairs. A variant of HIV-RT where Tyr 188 is replaced by
Leu, has emerged from experiments where HIV was challenged to
grow in the presence of drugs targeted against the RT, such as
L-697639, TIBO and nevirapine. These drugs bind at a site near,
but not in, the active site. This variant accepts the pyDAD-puADA
base pair significantly better than wild type HIV-RT, and we used
this as a starting point. A second mutation, E478Q, was
introduced into the Y188L variant, in the event that the residual
nuclease activity observed is due to the RT, and not a
contaminant. The doubly mutated RT incorporated the non-standard
pair with sufficient fidelity that the variant could be used to
amplify oligonucleotides containing pyDAD and puADA through
several rounds of a polymerase chain reaction (PCR) without
losing the non-standard base pair. This is the first time where
DNA containing non-standard base pairs with alternative hydrogen
bonding patterns has been amplified by a full PCR. This work also
illustrates a research strategy that combines in clinico
pre-evolution of proteins followed by rational design to obtain
an enzyme that meets a particular technological specification.

2 '-deoxycytidines carrying amino and thiol functionality: Synthesis and incorporation by vent (exo(-)) polymerase
Roychowdhury, A
Illangkoon, H
Hendrickson, CL
Benner, SA
Org. Lett. 6
(4)
489-492
(2004)
<Abstract>
The synthesis of 2'-deoxycytidine nucleosides bearing amino and thiol
groups appended to the 5-position of the nucleobase via a butynyl
linker is described. The corresponding triphosphates were then
synthesized from the nucleoside and incorporated into oligonucleotides
by Vent (exo(-)) DNA polymerase. The ability of Vent (exo(-))
polymerase to amplify oligonucleotides containing these functionalized
cytidine derivatives in a polymerase chain reaction (PCR) was
demonstrated for the amino-functionalized derivative.

PduP is a Coenzyme-A-Acylating Propionaldehyde Dehydrogenase Associated with the Polyhedral Bodies Involved in B12-dependent 1,2-propanediol Degradation by Salmonella enterica serovar Typhimurium LT2
Leal, NA
Havemann, GD
Bobik, TA
Arch. Microbiol. 180
(5)
353-361
(2003)
<Abstract>
Salmonella enterica forms polyhedral bodies involved in
coenzyme-B12-dependent 1,2-propanediol degradation. Prior studies
showed that these bodies consist of a proteinaceous shell partly
composed of the PduA protein, coenzyme-B12-dependent diol dehydratase,
and additional unidentified proteins. In this report, we show that the
PduP protein is a polyhedral-body-associated CoA-acylating aldehyde
dehydrogenase important for 1,2-propanediol degradation by
S. enterica. A PCR-based method was used to construct a precise
nonpolar deletion of the gene pduP. The resulting pduP deletion strain
grew poorly on 1,2-propanediol minimal medium and expressed 105-fold
less propionaldehyde dehydrogenase activity (0.011 micromol min(-1)
mg(-1)) than did wild-type S. enterica grown under similar conditions
(1.15 micromol min(-1) mg(-1)). An Escherichia coli strain was
constructed for high-level production of His8-PduP, which was purified
by nickel-affinity chromatography and shown to have 15.2 micromol
min(-1) mg(-1) propionaldehyde dehydrogenase activity. Analysis of
assay mixtures by reverse-phase HPLC and mass spectrometry established
that propionyl-CoA was the product of the PduP reaction. For
subcellular localization, purified His8-PduP was used as antigen for
the preparation of polyclonal antiserum. The antiserum obtained was
shown to have high specificity for the PduP protein and was used in
immunogold electron microscopy studies, which indicated that PduP was
associated with the polyhedral bodies involved in 1,2-propanediol
degradation. Further evidence for the localization of the PduP enzyme
was obtained by showing that propionaldehyde dehydrogenase activity
co-purified with the polyhedral bodies. The fact that both
Ado-B12-dependent diol dehydratase and propionaldehyde dehydrogenase
are associated with the polyhedral bodies is consistent with the
proposal that these structures function to minimize propionaldehyde
toxicity during the growth of S. enterica on 1,2-propanediol.

The NASA astrobiology roadmap
Marais, DJD
Allamandola, LJ
Benner, SA
Boss, AP
Deamer, D
Falkowski, PG
Farmer, JD
Hedges, SB
Jakosky, BM
Knoll, AH
Liskowsky, DR
Meadows, VS
Meyer, MA
Pilcher, CB
Nealson, KH
Spormann, AM
Trent, JD
Turner, WW
Woolf, NJ
Yorke, HW
Astrobiology 3
(2)
219-235
(2003)
<Abstract>
The NASA Astrobiology Roadmap provides guidance for research and
technology development across the NASA enterprises that encompass the
space, Earth, and biological sciences. The ongoing development of
astrobiology roadmaps embodies the contributions of diverse scientists
and technologists from government, universities, and private
institutions. The Roadmap addresses three basic questions: How does
life begin and evolve, does life exist elsewhere in the universe, and
what is the future of life on Earth and beyond? Seven Science Goals
outline the following key domains of investigation: understanding the
nature and distribution of habitable environments in the universe,
exploring for habitable environments and life in our own solar system,
understanding the emergence of life, determining how early life on
Earth interacted and evolved with its changing environment,
understanding the evolutionary mechanisms and environmental limits of
life, determining the principles that will shape life in the future,
and recognizing signatures of life on other worlds and on early Earth.
For each of these goals, Science Objectives outline more specific
high-priority efforts for the next 3-5 years. These 18 objectives are
being integrated with NASA strategic planning.
 First PCR amplification of DNA containing a nonstandard base pair A.
Lutz, S
Park, JH
Benner, SA
Biochemistry 42
(28)
8598-8598
(2003)

Greater GNN pattern bias in sequence elements encoding conserved residues of ancient proteins may be an indicator of amino acid composition of early proteins.
Brooks, DJ
Fresco, JR
Gene 303 177-185
(2003)
<Abstract>
The possibility that RNY pattern bias in extant sequences is a remnant of more pronounced bias of this type in early ancestors was investigated. To this end, conserved residues (those residues for which the inferred ancestral and known descendant amino acids are identical) and non-conserved residues of ancient proteins dating to the Last Universal Ancestor were identified within six species: two archaea, two eubacteria and two eukaryotes. Bias within sequence elements encoding each subset of residues, conserved and non-conserved, was then determined. In all species, GNN bias is greater within conserved than non-conserved sequence elements, whereas ANN is not. This difference is statistically significant in all six species examined. Since the relative mutability of the GNN-encoded amino acids does not explain the greater bias in conserved sequences, it is concluded that early sequences probably possessed a strong GNN bias. It is suggested that this bias may be a consequence of the GNN codons being the first introduced into the genetic code. Although NNY bias is also greater within conserved sequence elements of the six species, that difference is statistically significant in only half of them. Therefore, the evidence for early NNY bias remains inconclusive. The findings of this study do not support the proposal of Diaz-Lazcoz et al. (J. Mol. Biol. 250 (1995) 123) that the codons of the TCN four-codon block were the first assigned to serine during the evolution of the genetic code.

Destabilizing interactions among [PSI+] and [PIN+] yeast prion variants
Bradley, ME
Liebman, SW
Genetics 165
(4)
1675-1685
(2003)
<Abstract>
The yeast Sup35 and Rnq1 proteins can exist in either the noninfectious
soluble forms, [psi(-)] or [pin(-)], respectively, or the multiple
infectious amyloid-like forms called [PSI+] or [PIN+] prion variants
(or prion strains). It was previously shown that [PSI+] and [PIN+]
prions enhance one another's de novo appearance. Here we show that
specific prion variants of [PSI+] and [PIN+] disrupt each other's
stable inheritance. Acquiring [PSI+] often impedes the inheritance of
particular [PIN+] variants. Conversely, the presence of some [PIN+]
variants impairs the inheritance of weak [PSI+] but not strong [PSI+]
variants. These same [PIN+] variants generate a single-dot fluorescence
pattern when a fusion of Rnq1 and green fluorescent protein is
expressed. Another [PIN+] variant, which forms a distinctly different
multiple-dot fluorescence pattern, does not impair [PSI+] inheritance.
Thus, destabilization of prions by heterologous prions depends upon the
variants involved. These findings may have implications for
understanding interactions among other amyloid-forming proteins,
including those associated with certain human diseases.
 Initiation of mucin-type O-glycosylation in lower eukaryotes (O-alpha-GlcNAc-type) and higher eukaryotes (O-alpha-GalNAc-type) is homologous
West, CM
Wang, F
van der Wel, H
Gaucher, E
Sassi, S
Metcalf, T
Heise, N
Mendonca-Previato, L
Previato, JO
Glycobiology 13
(11)
875-876
(2003)

A direct synthesis of nucleoside analogs homologated at the 3 '- and 5 '-positions
Schmidt, J
Eschgfaller, B
Benner, SA
Helv. Chim. Acta 86
(9)
2937-2958
(2003)
<Abstract>
A new route is presented to prepare analogs of nucleosides homologated
at the 3'- and 5'-positions. This route, applicable to both the D- and
L-enantiomeric forms, is suitable for the preparation of monomeric
bis-homonucleosides needed for the synthesis of oligonucleotide
analogs. It begins with the known monobenzyl ether 3 of
pent-2-yne-1,5-diol, which is reduced to alkenol 4. Sharpless
asymmetric epoxidation of 4, followed by opening of the epoxide 5 with
allylmagnesium bromide, gives a mixture of diols 6 and 7 Protection of
the primary alcohol as a silyl ether followed by treatment with OsO4,
NalO(4), and mild acid in MeOH, followed by reduction, yields (2R,3R)
{{[(tert-butyl)diphenylsilyl]oxy}methyl}tetrahydro-2-(2-hydroxyethyl)-5-
methoxyfuran (=methyl 3-{{[(tert-butyl)diphenylsilyl]oxy}methyl}-2
3,5-trideoxy-alpha/beta-D-erythro-hexafuranoside: 10) (Scheme 1).
Protected nucleobases are added to this skeleton with the aid of
trimethylsilyl triflate (Scheme 2). The o-toluoyl (2-MeC6H4CO) and
p-anisoyl (4-MeOC6H4CO) groups were used to protect the exocyclic amino
group of cytosine. The bis-homonucleoside analogs 11 and 14a are then
converted to monothiol derivatives suitable for coupling (Schemes 3 and
4) to oligonucleotide analogs with bridging S-atoms. This synthesis
replaces a much longer synthesis for analogous nucleoside analogs that
begins with diacetoneglucose (= 1,2:5,6-di-O-isopropylideneglucose),
with the stereogenic centers in the final products derived from the
Sharpless asymmetric epoxidation. The new route is useful for
large-scale synthesis of these building blocks for the synthesis of
oligonucleotide analogs.

Synthesis and properties of oligodeoxynucleotide analogs with bis(methylene) sulfone bridges
Eschgfaller, B
Schmidt, JG
Konig, M
Benner, SA
Helv. Chim. Acta 86
(9)
2959-2997
(2003)
<Abstract>
A convergent, solution-phase synthesis was developed for the
bis(methylene) sulfone-bridged oligodeoxynucleotide analogs (SNA)
5'-d(HOCH2-Tso(2)Tso(2)Tso(2)Cso(2)Tso(2)Tso(2)Tso(2)T-CH2SO3-)-3'
(35b) and
5'-d(HOCH2-Tso(2)Tso(2)Tso(2)Tso(2)Tso(2)Tso(2)Tso(2)T-CH2SO3-)-3'
(34c) (SO2 corresponds to CH2SO2CH2 instead of OP(=O)(O-)(O). In these,
the phosphodiester linkages are replaced by non-ionic bis(methylene)
sulfone linkers. The general strategy involved convergent coupling of
3',5'-bishomo-beta-D-deoxyribonucleotide analogs functionalized at the
6'-end (=CH2-C(5')) as bromides or mesylates and at the CH2-C(3')
position as thiols. with the resulting thioether being oxidized to the
corresponding sulfone. A single charge was introduced at the terminal
CH2-C(3') position of the octamers to increase their solubility in
water. During the synthesis, it became apparent that the key
intermediates generated secondary structures through either folding or
aggregation in a variety of solvents. This generated unusual reactivity
and was unique for very similar structures. For example, although the
dimeric thiol d(BzOCH(2)-Tso(2)C-CH2SH) (14b) was a well-behaved
synthetic intermediate, the tetrameric thiol
d(TrOCH2-Tso(2)Tso(2)Tso(2)(10)C-CH2SH) derived from the corresponding
thioacetate was rapidly converted to a disulfide by very small amounts
of oxidant (28 --> 29, Scheme 6). while the analogous tetrameric thiol
d(BzOCH(2)-Tso(2)TsTso(2)T-CH2SH) (26), differing only by a single
heterocycle, was oxidized much more slowly (Bz = PhCO, Tr = Ph3C, to =
2-MeC6H4CO (at N-4 of dc)). The sequence-dependent reactivity, well
known in many classes of natural products (including polypeptides), is
not prominent in natural oligonucleotides. These results are discussed
in light of the proposal that the repeating negative charge in nucleic
acids is key to their ability to serve as genetic molecules, in
particular, their capability to support Darwinian evolution. The
ability of
5'-d(HOCH2-Tso(2)Tso(2)Tso(2)Cso(2)Tso(2)Tso(2)Tso(2)T-CH2SO3-)-3'
(35b) to bind as a third strand to duplex DNA was also examined. No
triple-helix-forming propensity was detected in this molecule.

Identification of the Human and Bovine ATP:cob(I)alamin Adenosyltransferase cDNAs Based on Complementation of a Bacterial Mutant
Leal, NA
Park, SD
Kima, PE
Bobik, TA
J. Biol. Chem. 278
(11)
9227-9234
(2003)
<Abstract>
In humans, deficiencies in coenzyme B12-dependent methylmalonyl-CoA
mutase (MCM) lead to methylmalonyl aciduria, a rare disease that is
often fatal in newborns. Such deficiencies can result from inborn
errors in the MCM structural gene or from mutations that impair the
assimilation of dietary cobalamins into coenzyme B12 (Ado-B12), the
required cofactor for MCM. ATP:cob(I)alamin adenosyltransferase (ATR)
catalyzes the terminal step in the conversion of cobalamins into
Ado-B12. Substantial evidence indicates that inherited defects in this
enzyme lead to methylmalonyl aciduria, but the corresponding ATR gene
has not been identified. Here we report the identification of the
bovine and human ATR cDNAs as well as the corresponding human gene. A
bovine liver cDNA expression library was screened for clones that
complemented an ATR-deficient bacterial strain for color formation on
aldehyde indicator medium, and four positive clones were isolated. The
DNA sequences of two clones were determined and found to be
identical. Sequence similarity searching was then used to identify a
homologous human cDNA (89% identity) and its corresponding gene that
is located on chromosome XII. The bovine and human cDNAs were
independently cloned and expressed in Escherichia coli. Enzyme assays
showed that expression strains produced 87 and 98 nmol/min/mg ATR
activity, respectively. These specific activities are in line with
va | |