Team:SASTRA Thanjavur/Design

Design

Our project, House of Toeholds, is aimed at the detection of the microRNA biomarkers of cervical cancer. We used toehold switches, an mRNA based regulator of gene expression, as a biosensory element that is engineered to bind to the unique biomarker target. We had the following design guidelines and steps that we followed during the course of our project.

Cycle 1: Biomarker Panel

Our project’s first essential step was to determine the choice of our biomarkers. microRNAs are short oligonucleotides about 22 nt in length that have been cited in recent literature as potent indicators of various cancers. In addition, mircoRNAs are also circulatory biomarkers, thus identified in the bloodstream and aren’t localised to the tumour origin.

However, we realised early into our project that no single miRNA could act as the key indicator of cervical cancer as cancer reigns the “emperor of all maladies”, quoting the famous oncologist and writer Siddharth Mukherjee.

microRNAs are differentially expressed in the bloodstream, either as upregulated or downregulated factors in the advent of cervical cancer. We identified the prominent microRNAs using differential expression profiling and subsequent survival analysis based on multivariate Cox regression models and hence came up with a biomarker panel for cervical cancer.

Cycle 2: Toehold Switch Design

Translational riboregulators are mRNA elements whose secondary structures effectively inhibit the binding of the ribosome to carry out the process of translation for any downstream gene. Therefore, translational riboregulators have been incorporated in synthetic biology to study the expression of a gene or genes of interest downstream of the riboregulators.

Toehold switches are a class of completely synthetic riboregulators de novo designed by Green et al.[3] in 2014. Their high dynamic range, orthogonality and low crosstalk are a result of highly specific nucleotide composition. Each switch has a computationally optimized sequence that forms a double looped structure in the inactivated/ off state. Upon base pairing with a unique, linear trigger mRNA sequence, the toehold structure of the switch unfolds, allowing translational gene expression for the gene of interest downstream.

The toehold switch, like all engineered elements in science, is carefully designed with intricate parameters. The illustration depicts the key domains or regions by which a toehold switch is identified. This section shall explain the features of every domain in a toehold switch and how we came up with our switch designs based on these requirements.

The leader sequence is special GGG region that enhances efficient transcription by RNA Polymerase in the T7 promoter system. The toehold domain, the unique complement to the trigger sequence. The bottom stem sequence is the region above the toehold domain that is base-paired extensively. The top stem is the base paired region preceding the loop containing the ribosome binding site. The pre Ribosome Binding Site sequence, a short domain part of the primary loop before the RBS sequence that increases the size of the loop. The Ribosome Binding Site (shorthand is RBS) is the consensus sequence to which the ribosome binds to translate an mRNA strand. These sequences are categorised and designed due to their ability to bind with prokaryotic ribosomes and initialise translation. The Registry of Standard Biological Parts contains two major RBS collections, the Anderson RBS Library created and curated by the Anderson lab at the University of California, Berkeley and the RBS collection from the synthetic biology community. The sequestered start codon AUG is the first codon of any gene to be expressed. The final domain is a horizontal linker domain to the gene of interest. This domain partially base pairs with the toehold domain and forms the stem bottom of the toehold switch.

In the above figure, a trigger sequence binds to the toehold switch, causing the toehold domain of the switch to base pair with the trigger and thereby losing its double looped structure. As a result, the switch becomes linear and the gene expression of the particular gene of interest is observed.

The dynamic range is a metric to measure the sensitivity of the toehold switch, based on the minimum and maximum intensity of fluorescent protein expression, typically GFP expression.

The dynamic range of the toehold is quantified by the ON/OFF ratio values obtained by GFP expression. The greater the ON/OFF ratio, the larger the dynamic range of the toehold switch.

We have analysed the sequence and structural features of domains unique to each toehold switch and constructed a multivariate regression model to analyse the dependence of the dynamic range on each of our engineered features.

We engineered our toehold domain to be longer than 10 nucleotides to allow effective binding of the trigger to the switch. We also ensured that the toehold domain did not base-pair within itself or to the horizontal linker towards the 3’ end of the sequence. We felt that this was an important yet overlooked design aspect as the intra-domain base pairing of the toehold region could potentially hinder the binding of the trigger to the toehold switch. Similarly, an association of the toehold domain with the horizontal linker also negates the trigger binding event and eventually may not result in any reporter expression.

The bottom stem of the toehold switch was kept at a minimum of 12 nucleotides to allow for 4 amino acids after the start codon in the open reading frame during translation before the horizontal linker. The thermodynamic stability of the toehold switch in the OFF state is also increased by having a longer, base paired bottom stem region.

The AUG containing secondary loop of the toehold switch consists of three diametrically opposite bases that don’t base pair with the start codon. We were unsure of having the wobble base pair in the start codon region as the binding event of the trigger may not break the base pairing. Our results validate the rationale behind the choice of keeping the AUG intact without base pairing.

The RBS loop or the primary loop was designed around the Anderson RBS with the conserved RBS sequence for prokaryotic ribosomal binding. The Anderson ABS is AGAGGA and we designed the primary loop sequence such that the preRBS sequence part of the loop was long enough to sequester the RBS effectively to prevent ribosomal binding. We kept the primary loop length at 11 nucleotides and added adenines prior to the RBS for enhancing transcription.

The small linker portion below the RBS, the constitutive top stem element, was comprised of a scar site sequence.

The horizontal linker that we used after the bottom stem is AACCUGGCGGCAGCGCAAAAG, the 21nt long linker used in Green et al., 2014. The purpose of the linker is to sequester the proteins formed by the construct so that it doesn’t interfere with the reporter expression. The linker produces flexible amino acids such as glycine or serine and doesn’t affect the translation of amino acids essential to the green fluorescent protein.

Second Generation Toehold Switch Design

As mentioned before, the bottom stem of the toehold switch consists of an ascending sequence at the 5’ end and a descending sequence in the 3’ end that are base-complementary to each other. The full length of our miRNA was used as a trigger to our designed toehold switch.

The orientation of the toehold switch is from 5’ to 3’ and the trigger runs from 3’ to 5’. We noticed that the triggers miR21 and miR20a consist of a stop codon in the ORF towards the 5’ region of their sequence. Specifically, the stop codon was observed as the first three nucleotides in the sequences. Logically, when the binding event is to occur, this stop codon would bind to the 3’ end of the toehold switch, or the last three nucleotides of the ascending bottom stem sequence (at the top, just below the secondary loop). This would imply that in order for a proposed switch design to bind to the trigger, the descending bottom stem sequence would have to be base-complementary to the ascending sequence, thus resulting in the same sequence of the miR trigger. In other words, the 12 nucleotides of the descending bottom stem sequence would possess a stop codon in the ORF just after the start codon. This means that no GFP can be expressed as translation is brought to a halt because of this molecular roadblock.

antimiR Design

We thus came up with a design inspired from Green et al.(2017), to effectively isolate the portion of the miRNA trigger containing the stop codon. We designed a partially complementary antisense RNA strand (or antimiR), to bind to 12 nt starting from the 5’ end of the miR sequences. Thus, we created a hybrid trigger molecule shaped like a half pulled zipper, with a base-paired region and two free ends.

The advantage of this design is that it allows for the customizability of the free end of the non-complementary region of the anti-miR. We used this to maximise that sequence to bind to the toehold domain.

The second generation toehold switch thus binds to two different molecules treated as a single hybrid structure. The customizable nucleotide design of the anti-miR sequence is transferred to the toehold switch, improving the trigger binding event and allowing for better unfolding of the toehold switch.

Cycle 3: Modelling and Experimental Characterization

The reader is referred to the Modelling section of our wiki for a detailed discussion of the reaction-kinetics modelling of our designed toeholds. Further to the computational validation, we sought to experimentally characterize our toehold switch constructs by embedding in a genetic circuit that when triggered expresses the Green Fluorescent Protein. We measured fluorescence to determine the performance as well as leakiness of the switch, thereby evaluating its specificity. Upon confirmation of the specificity, we moved on to perform concentration studies where increasing concentrations of the microRNA biomarkers varying across orders of magnitude were considered to estimate the calibration curve for the proposed biosensor. From the calibration curve, the biosensor parameters such as linear range, limit of detection were estimated to define the performance of the biosensor towards the detection of the chosen microRNA markers. The biosensor we have developed is a solution based proof-of-concept, which we were able to achieve by cell-free characterization of our designed Biobricks.

Our technology could be embedded in a paper based diagnostic device for the detection of early stage biomarkers of cervical cancer. The development of a prototype would lead to a low cost, minimally invasive device that is complementary to existing screening methods.

References

Alexander A. Green, Pamela A. Silver, James J. Collins, Peng Yin. (2014) Toehold Switches: De-Novo-Designed Regulators of Gene Expression. Cell 159, 925–939 Elsevier BV
Keith Pardee, Alexander A. Green, Tom Ferrante, D. Ewen Cameron, Ajay Daley Keyser, Peng Yin, James J. Collins. (2014) Paper-Based Synthetic Gene Networks.Cell 159, 940–954 Elsevier BV
Keith Pardee, Alexander A. Green, Melissa K. Takahashi, Dana Braff, Guillaume Lambert, Jeong Wook Lee, Tom Ferrante, Duo Ma, Nina Donghia, Melina Fan, Nichole M. Daringer, Irene Bosch, Dawn M. Dudley, David H. O’Connor, Lee Gehrke, James J. Collins. (2016) Rapid Low-Cost Detection of Zika Virus Using Programmable Biomolecular Components. Cell 165, 1255–1266 Elsevier B
J Kim, P Yin, AA Green. Ribocomputing: Cellular Logic Computation Using RNA Devices.. Biochemistry 57, 883-885 (2018)