Our Team

FfAME Our Team Cen Chen

Scientist

Cen Chen

  • (386) 418-8085

Research Summary

My research focuses on the novel modified oligonucleotides synthesis, new purification method development, Gene synthesis as well as molecular diagnostics based Self-avoiding Molecular Recognition Systems (SAMRS) and Artificially Expanded Genetic Information Systems (AEGIS).

Current projects include:

  • Novel modified oligonucleotides synthesis. This including Self-avoiding Molecular Recognition Systems (SAMRS) and Artificially Expanded Genetic Information Systems (AEGIS), such as A*, T*, dP, dZ as well as other artificial nucleobases. These modification were applied in molecular diagnostics, sequencing, gene synthesis, as well as AEGISZymes and AEGISBody development.
  • Molecular diagnostics with SAMRS. Nucleic acid amplification reactions suffer from the formation of off-target amplification products. SAMRS were approved to minimize the primer dimer formation, improved the sensitivity and specificity and achieved highly multiplex PCR. We are developing alternative chemistry combined with SAMRS to improve their performance.
Research Focus:
  • Molecular Diagnostics
  • Oligo Synthesis
  • Novel Modified Oligos
Education:
  • B.S. in Chemistry, Sichuan Normal University, China (2011)
  • M.S. in Organic Chemistry, Chengdu Institute of Organic
  • Chemitry, Chinese Academy of Science (CAS), China (2015)
  • Ph.D. in Organic Chemistry, Georgia State University, Georgia (2020)
  • Instructor, Georgia State University, Georgia (2018-2019)
  • Chemist, Affinity Research Chemicals, Delaware (2020)
  • Scientist, Firebird Biomolecular Sciences, LLC, Florida (2020-present)

Publications

Brazzill, M., Ma, R., Munn, K., Prestifilippo, L., Pickford, A.R., Kim, H.J., Chen, C., Hoshika, S., Benner, S.A., Rusling, D.A. Nat. Commun., Nature (2026) PMC12934894, doi: 10.1038/s41467-026-74375-4

The sequence-specific recognition of double-stranded DNA by biocompatible molecules is fundamental to molecular medicine and synthetic biology. Triplex-forming oligonucleotides (TFOs) enable programmable major groove recognition via Hoogsteen base pairing; however, the limited repertoire of natural nucleobases imposes strict constraints on target sequences and parallel motif triplexes require acidic conditions for stability. Here, we have expanded the triplex recognition space using nucleobases from an artificially expanded genetic information system (AEGIS). Through a systematic evaluation of 120 base triad combinations, we identify at least 12 modular triads that can be combined interchangeably to target duplex DNA containing standard, damaged, or synthetic base pairs with nanomolar affinity at neutral pH. We further demonstrate the versatility of this expanded recognition code by detecting oxidative lesions or AEGIS base pairs in enzymatically assembled duplex constructs using both chemically and enzymatically synthesized TFOs. This generalized framework provides a robust platform for precision gene-targeting, molecular sensing, and nucleic acid nanotechnology.

Kawabe, H., Manfio, L., Magana Pena, S., Zhou, N., Bradley, K., Chen, C., McLendon, C., Benner, S.A., Levy, K., Yang, Z., Marchand, J., Fuhrmeister, E. Synth. Bio. 14 (2) 470-484 (2025) PMC11419210, doi.org/10.1021/acssynbio.4c00619

Environmental surveillance and clinical diagnostics heavily rely on the polymerase chain reaction (PCR) for target detection. A growing list of microbial threats warrants new PCR-based detection methods that are highly sensitive, specific, and multiplexable. Here, we introduce a PCR-based icosaplex (20-plex) assay for detecting 18 enteropathogen and two antimicrobial resistance genes. This multiplexed PCR assay leverages the self-avoiding molecular recognition system (SAMRS) to avoid primer dimer formation, the artificially expanded genetic information system (AEGIS) for amplification specificity, and next-generation sequencing for amplicon identification. Using parallelized multitarget TaqMan Array Cards (TAC) to benchmark performance of the 20-plex assay on wastewater, soil, and human stool samples, we found 90% agreement on positive calls and 89% agreement on negative calls. Additionally, we show how long-read and short-read sequencing information from the 20-plex can be used to further classify allelic variants of genes and distinguish subspecies. The strategy presented offers sensitive, affordable, and robust multiplex detection that can be used to support efforts in wastewater-based epidemiology, environmental monitoring, and human/animal diagnostics.

Bang Wang, Hyo-Joong Kim, Kevin M. Bradley, Cen Chen, Chris McLendon, Zunyi Yang, Steven A. Benner J. Am. Chem. Soc. 146 (51) 35129-35138 (2024) doi: 10.1021/jacs.4c11043

By rearranging hydrogen bond donor and acceptor groups within a standard Watson–Crick geometry, DNA can add eight independently replicable nucleotides forming four additional not found in standard Terran DNA. For many applications, the orthogonal pairing of standard and nonstandard pairs offers a key advantage. However, other applications require standard and nonstandard nucleotides to communicate with each other. This is especially true when seeking to recruit high-throughput instruments (e.g., Illumina), designed to sequence standard 4-nucleotide DNA, to sequence DNA that includes added nucleotides. For this purpose, PCR workflows are needed to replace nonstandard nucleotides in (for example) a 6-letter DNA sequence by defined mixtures of standard nucleotides built from 4 nucleotides. High-throughput sequencing can then report the sequences of those mixtures to bioinformatic alignment tools, which infer the original 6-nucleotide sequence by analysis of the mixtures. Unfortunately, the intrinsic orthogonality of standard and nonstandard nucleotides often demand polymerases that violate pairing biophysics to do this replacement, leading to inefficiencies in this “transliteration” process. Thus, laboratory in vitro evolution (LIVE) using “anthropogenic evolvable genetic information systems” (AEGIS), an important “consumer” of new sequencing tools, has been slow to be democratized; robust sequencing is needed to identify the AegisBodies and AegisZymes that AEGIS-LIVE delivers. This work introduces a new way to connect synthetic and standard molecular biology: biversal nucleotides. In an example presented here, a pyrimidine analogue (pyridine-2-one, y) pairs with Watson–Crick geometry to both a nonstandard base (2-amino-8-imidazo-[1,2a]-1,3,5-triazin-[8H]-4-one, P, the Watson–Crick partner of 6-amino-5-nitro-[1H]-pyridin-2-one, Z) and a base that completes the Watson–Crick hydrogen bond pattern (2-amino-2'-deoxyadenosine, amA). PCR amplification of GACTZP DNA with dyTP delivers products where Z:P pairs are cleanly transliterated to A:T pairs. In parallel, PCR of the same GACTZP sample at higher pH delivers products where Z:P pairs are cleanly transliterated to C:G pairs. By allowing robust sequencing of 6-letter GACTZP DNA, this workflow will help democratize AEGIS-LIVE. Further, other implementations of the biversal concept can enable communication across and between standard DNA and synthetic DNA more generally.

Bang Wang, James R. Rocca, Shuichi Hoshika, Cen Chen, Zunyi Yang, Reza Esmaeeli, Jianguo Wang, Xiaoshu Pan, Jianrong Lu, Kevin K. Wang, Y. Charles Cao, Weihong Tan & Steven A. Benner Nat. Chem., Nature (2024) https://doi.org/10.1038/s41557-024-01552-7

Adding synthetic nucleotides to DNA increases the linear information density of DNA molecules. Here we report that it also can increase the diversity of their three-dimensional folds. Specifically, an additional nucleotide (dZ, with a 5-nitro-6-aminopyridone nucleobase), placed at twelve sites in a 23-nucleotides-long DNA strand, creates a fairly stable unimolecular structure (that is, the folded Z-motif, or fZ-motif) that melts at 66.5°C at pH 8.5. Spectroscopic, gel and two-dimensional NMR analyses show that the folded Z-motif is held together by six reverse skinny dZ-:dZ base pairs, analogous to the crystal structure of the free heterocycle. Fluorescence tagging shows that the dZ-:dZ pairs join parallel strands in a four-stranded compact down-up-down-up fold. These have two possible structures: one with intercalated dZ-:dZ base pairs, the second without intercalation. The intercalated structure would resemble the i-motif formed by dC:dC+-reversed pairing at pH ≤ 6.5. This fZ-motif may therefore help DNA form compact structures needed for binding and catalysis.

Bang Wang, Kevin M. Bradley, Myong-Jung Kim, Roberto Laos, Cen Chen, Dietlind L. Gerloff, Luran Manfio, Zunyi Yang & Steven A. Benner Nat. Commun. 15 (4057), Nature (2024) https://doi.org/10.1038/s41467-024-48408-9

With just four building blocks, low sequence information density, few functional groups, poor control over folding, and difficulties in forming compact folds, natural DNA and RNA have been disappointing platforms from which to evolve receptors, ligands, and catalysts. Accordingly, synthetic biology has created "artificially expanded genetic information systems" (AEGIS) to add nucleotides, functionality, and information density. With the expected improvements seen in AegisBodies and AegisZymes, the task for synthetic biologists shifts to developing for expanded DNA the same analytical tools available to natural DNA. Here we report one of these, an enzyme-assisted sequencing of expanded genetic alphabet (ESEGA) method to sequence six-letter AEGIS DNA. We show how ESEGA analyses this DNA at single base resolution, and applies it to optimized conditions for six-nucleotide PCR, assessing the fidelity of various DNA polymerases, and extending this to AEGIS components with functional groups. This supports the renewed exploitation of expanded DNA alphabets in biotechnology.