ISB News

New Tool Uses 3-D Protein-DNA Structures to Predict Locations of Genetic ‘On-Off’ Switches

3 Bullets:

  • Novel systems approach uses high-resolution structures of protein-DNA complexes to predict where transcription factors (genetic switches) bind and regulate the genome.
  • This approach can help researchers better understand and predict binding sites for non-model organisms or ‘exotic’ species.
  • Having such insight and predictive capabilities is critical for reverse- and forward-engineering organisms that could be pivotal for new green biotechnologies.

By Jake Valenzuela and Justin Ashworth

Researchers at the Institute for Systems Biology (ISB) have developed a new method to predict the locations of genetic on-off switches (transcription factors) that is more powerful and accurate than other existing approaches. Having this predictive capability allows researchers to better understand the gene networks that control cellular physiology and adaptation in new or understudied organism species that may impact the development of green biotechnologies.

In the study, lead author, Justin Ashworth, and colleagues in the Baliga Lab at ISB, demonstrated the power of the new approach by accurately predicting how subtle changes in the DNA-recognition sequences within proteins has altered where they bind and regulate the genome of a salt-loving microbe, Halobacterium salinarum.

Title: Structure-based inference of feast/famine regulatory targets
Publication: PLOS ONE
Authors: Justin Ashworth, Christopher L. Plaisier, Fang Yin Lo, David J. Reiss, Nitin S. Baliga
Link: http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0107863

Transcription factors (TFs) are fundamental to gene regulation, binding to DNA and typically turning on or off from one to hundreds of genes. For non-model organisms, it is sometimes difficult to make predictions about gene regulation due to their novelty: not everything looks or functions like E. coli or any other long-studied laboratory organism. In many archaeal species, including H. salinarum, new and expanded families of transcription factors appear to regulate metabolism in ways that we have not seen before. In order to understand the ways that these TFs form gene regulatory networks and control cellular physiology, more powerful prediction methods are needed than existing gene lineage methods.

Our detailed understanding of the relationship between sequence, molecular structure, and function in protein and DNA coding can be used to improve the prediction of gene regulatory relationships and networks by explicitly considering the high-resolution three-dimensional structure of proteins and DNA, the energetic implications of evolutionary variations in proteins and DNA sequences can be more precisely estimated. In this paper, ISB researchers obtained greater power to predict the different DNA binding site preferences of eight Lrp-like feast/famine regulatory proteins (FFRPs) that were recently discovered in the H. salinarum genome. These FFRPs are distantly related to a transcription factor (Lrp) in E. coli, but sense different signals as well as regulate very different genes in archaea. Through a combination of structure-based predictions and experimental measurements, ISB researchers were able to infer that the archaeon H. salinarum utilizes an FFRP protein to coordinately regulate its carA and carB genes, whereas the same genes in E. coli are regulated collectively in an operon by a completely different transcription factor. Such distinctions can have enormous impacts on the gene regulatory networks of the organism.

Incorporating molecular and physics-based information into the prediction of gene regulatory networks is useful particularly in new species and non-model organisms, for which simple homology to well-studied organisms (e.g. E. coli) is insufficient to explain their biology.

The increased resolution and discriminatory power that this new method affords will be particularly useful for understanding non-model organisms found in new environments and ecosystems, as well as for reverse- and forward-engineering non-model organisms that are desirable for bioproduction processes. Understanding how genes are regulated and by which transcription factors is a critical aspect of understanding how cells function and adapt. These increased predictive capabilities will facilitate a systems-level understanding of non-model organisms, as well as the re-engineering of cellular control systems for new green biotechnologies.

About Dr. Jake Valenzuela and Dr. Justin Ashworth: Jake is a postdoc in the Baliga Lab and a member of the ISB Editorial Board. Justin is postdoc in the Baliga Lab and lead author of the paper.

Analysis of evolutionary variation in Lrp-like feast/famine regulatory proteins reveals that particular amino acids vary in tandem to encode divergent gene regulatory interactions. Shown here is a rendering of the high-resolution structure of one such protein named FL11 from the species Pyrococcus horikoshii bound to DNA (Protein Data Bank accession code: 2e1c; Yokoyama et al. 2007). Space-filling spheres indicate amino acids with a high degree evolutionary co-variation. Using a physics-based molecular modeling approach, ISB scientists were able to infer the preferred DNA sequences and downstream gene targets for an expanded and diverged family of these proteins in the archaeon Halobacterium salinarum.

Analysis of evolutionary variation in Lrp-like feast/famine regulatory proteins reveals that particular amino acids vary in tandem to encode divergent gene regulatory interactions. Shown here is a rendering of the high-resolution structure of one such protein named FL11 from the species Pyrococcus horikoshii bound to DNA (Protein Data Bank accession code: 2e1c; Yokoyama et al. 2007). Space-filling spheres indicate amino acids with a high degree evolutionary co-variation. Using a physics-based molecular modeling approach, ISB scientists were able to infer the preferred DNA sequences and downstream gene targets for an expanded and diverged family of these proteins in the archaeon Halobacterium salinarum. Figure created by Dr. Justin Ashworth.

Recent Articles

  • Drs. Jennifer Hadlock and Alexandra Ralevski

    ISB Study Highlights AI’s Potential and Pitfalls in Analyzing Health Data

    New peer-reviewed research out of ISB highlights the strengths of large language models in uncovering social determinants of health while underscoring the need for human oversight and improved de-identification methods.

  • Dr. Sid Venkatesh

    Sid Venkatesh Publishes Co-First Authored Paper in Science

    ISB Assistant Professor Dr. Sid Venkatesh is the co-first author of a paper in the journal Science. While at Washington University in St. Louis, Venkatesh and colleagues identified a novel gut microbial enzyme that impacts satiety-related signaling pathways in undernourished children treated with microbiota-directed complementary foods.

  • AmeriCorps Member Faduma Hussein Joins ISB as Public Health Ambassador Coordinator

    Faduma Hussein recently joined the ISB Education team as the Public Health Ambassador Coordinator, becoming only the fourth AmeriCorps member to serve at ISB. In this Q&A, she shares insights into her education, what drew her to ISB, career aspirations, and more.