Difference between revisions of "Team:Marburg/Model"

Line 1,193: Line 1,193:
 
For the in vivo efficiency approximation of the sequences we calculated the relative decline in average base counts in 25-base windows before and after the terminator candidates (Creecy et al., 2015). Sequences which had an approximated efficiency below a high threshold of 80% were ignored for further consideration.
 
For the in vivo efficiency approximation of the sequences we calculated the relative decline in average base counts in 25-base windows before and after the terminator candidates (Creecy et al., 2015). Sequences which had an approximated efficiency below a high threshold of 80% were ignored for further consideration.
 
</p>
 
</p>
 +
 
<figure>
 
<figure>
   <img src="https://2019.igem.org/File:T--Marburg--m_terminator_bpcount.jpg">
+
   <img src="https://2019.igem.org/File:T--Marburg--m_terminator_bpcount.jpg" alt="Placeholder image">
 
   <figcaption>
 
   <figcaption>
 
     Exemplary efficiency analysis of a predicted terminator on the sense strand. The x-axis is the nucleotide position on the genome, y axis the counts for the associated base. The predicted terminator is displayed in read, the 25 bases before and after the terminator sequence are colored blue.
 
     Exemplary efficiency analysis of a predicted terminator on the sense strand. The x-axis is the nucleotide position on the genome, y axis the counts for the associated base. The predicted terminator is displayed in read, the 25 bases before and after the terminator sequence are colored blue.
 
   </figcaption>
 
   </figcaption>
 
</figure>
 
</figure>
 +
 
<p>
 
<p>
 
After the careful separation of the unsuitable candidates we were left with the most promising terminators. To further analyze the functions of these terminators a kinetic approach was indispensable.  
 
After the careful separation of the unsuitable candidates we were left with the most promising terminators. To further analyze the functions of these terminators a kinetic approach was indispensable.  
Line 1,204: Line 1,206:
 
The RNA secondary structures were predicted using KineFold. To choose the most likely formation we performed multiple independent runs using different random seeds and chose the most frequent structure.   
 
The RNA secondary structures were predicted using KineFold. To choose the most likely formation we performed multiple independent runs using different random seeds and chose the most frequent structure.   
 
</p>
 
</p>
 +
 
<figure>
 
<figure>
   <img src="https://2019.igem.org/File:T--Marburg--m_terminator_folding.jpg">
+
   <img src="https://2019.igem.org/File:T--Marburg--m_terminator_folding.jpg" alt="Placeholder image">
 
   <figcaption>
 
   <figcaption>
 
     Example of a secondary structure prediction using MFOLD. We can clearly see that a poly(U) region is part of the hairpin. Additionally we can see the high GC content of the base, a typically small stem loop and the poly(A) region on the 5’ end.
 
     Example of a secondary structure prediction using MFOLD. We can clearly see that a poly(U) region is part of the hairpin. Additionally we can see the high GC content of the base, a typically small stem loop and the poly(A) region on the 5’ end.
 
   </figcaption>
 
   </figcaption>
 
</figure>
 
</figure>
 +
 +
 
Based upon these results we were tasked with the correct identification of the U-tract, hairpin and the A-tract regions. The predicted secondary structures were often hairpins that extended beyond the terminator hairpin. The reason for this was the formation of base pairs between the upstream poly(A) sequences and the U-tract. For the precise identification of these regions it was important that the poly(U) region was part of the U-tract and not the hairpin. To correctly distinguish these two several steps had to be taken. Given a stem loop structure, we screened for possible U-tracts in the region between the sixth nucleotide in the 3’-arm of the stem loop and the eighth nucleotide after the stem by evaluating every 8 base pairs.   
 
Based upon these results we were tasked with the correct identification of the U-tract, hairpin and the A-tract regions. The predicted secondary structures were often hairpins that extended beyond the terminator hairpin. The reason for this was the formation of base pairs between the upstream poly(A) sequences and the U-tract. For the precise identification of these regions it was important that the poly(U) region was part of the U-tract and not the hairpin. To correctly distinguish these two several steps had to be taken. Given a stem loop structure, we screened for possible U-tracts in the region between the sixth nucleotide in the 3’-arm of the stem loop and the eighth nucleotide after the stem by evaluating every 8 base pairs.   
 
<br>
 
<br>
 
For this we have calculated the Gibbs free energy of all possible U-tracts with the formula
 
For this we have calculated the Gibbs free energy of all possible U-tracts with the formula
 
<br>
 
<br>
 +
 
<figure>
 
<figure>
   <img src="https://via.placeholder.com/150" alt="Placeholder image">
+
   <img src="https://2019.igem.org/File:T--Marburg--m_terminator_energy_gu.jpg" alt="Placeholder image">
 
</figure>
 
</figure>
 +
 
<p>
 
<p>
 
Where N_U = 8 is the length of the U-tract, Delta G_RNA:DNA is the free-energy contribution of the RNA:DNA hybridization from the two nucleotides pairs at position i and i+1.
 
Where N_U = 8 is the length of the U-tract, Delta G_RNA:DNA is the free-energy contribution of the RNA:DNA hybridization from the two nucleotides pairs at position i and i+1.
Line 1,226: Line 1,233:
 
<br>
 
<br>
  
<b>< TABLE OF 10 TER HERE></b>
+
<table class="table table-bordered table-hover table-condensed">
 +
<thead><tr><th title="Field #1">ID</th>
 +
<th title="Field #2">Strand</th>
 +
<th title="Field #3">Starting Site</th>
 +
<th title="Field #4">End Site</th>
 +
<th title="Field #5">Length</th>
 +
<th title="Field #6">bp counts before</th>
 +
<th title="Field #7">bp counts after</th>
 +
<th title="Field #8">read trough</th>
 +
<th title="Field #9">A-tract</th>
 +
<th title="Field #10">Hairpin</th>
 +
<th title="Field #11">Loop</th>
 +
<th title="Field #12">U-Tract</th>
 +
<th title="Field #13">Structure</th>
 +
</tr></thead>
 +
<tbody><tr>
 +
<td align="right">1036</td>
 +
<td>+</td>
 +
<td align="right">166606</td>
 +
<td align="right">166645</td>
 +
<td align="right">39</td>
 +
<td align="right">2182747.4</td>
 +
<td align="right">2.76</td>
 +
<td align="right">1.26E-06</td>
 +
<td>CAACUAAAGA</td>
 +
<td>GAGUCGCUCAGAGAGCGGCUC</td>
 +
<td>AGA</td>
 +
<td>UUUUUUGUUG</td>
 +
<td>(((((((((...)))))))))</td>
 +
</tr>
 +
<tr>
 +
<td align="right">1000</td>
 +
<td>+</td>
 +
<td align="right">2622887</td>
 +
<td align="right">2622925</td>
 +
<td align="right">38</td>
 +
<td align="right">295441.36</td>
 +
<td align="right">0.76</td>
 +
<td align="right">2.57E-06</td>
 +
<td>UAAACAACCU</td>
 +
<td>CUUCAGUCACAGGACUGAGG</td>
 +
<td>ACAG</td>
 +
<td>GUUUUGUUUU</td>
 +
<td>((((((((....))))))))</td>
 +
</tr>
 +
<tr>
 +
<td align="right">904</td>
 +
<td>-</td>
 +
<td align="right">2379456</td>
 +
<td align="right">2379414</td>
 +
<td align="right">42</td>
 +
<td align="right">585709.4</td>
 +
<td align="right">3.08</td>
 +
<td align="right">5.26E-06</td>
 +
<td>AGCAAAAAGC</td>
 +
<td>CUGUCUAAGCAUUGUCUUGGACAG</td>
 +
<td>CAUUGU</td>
 +
<td>GCUUUUUGCU</td>
 +
<td>(((((((((......)))))))))</td>
 +
</tr>
 +
<tr>
 +
<td align="right">743</td>
 +
<td>-</td>
 +
<td align="right">1899716</td>
 +
<td align="right">1899683</td>
 +
<td align="right">33</td>
 +
<td align="right">23739.36</td>
 +
<td align="right">0.16</td>
 +
<td align="right">6.74E-06</td>
 +
<td>UAAAAAACGC</td>
 +
<td>CCGGGCAACGCUCGG</td>
 +
<td>AAC</td>
 +
<td>GCGUUUUUUA</td>
 +
<td>((((((...))))))</td>
 +
</tr>
 +
<tr>
 +
<td align="right">279</td>
 +
<td>-</td>
 +
<td align="right">709603</td>
 +
<td align="right">709570</td>
 +
<td align="right">33</td>
 +
<td align="right">437192.08</td>
 +
<td align="right">12.24</td>
 +
<td align="right">2.80E-05</td>
 +
<td>CGAACCCCUA</td>
 +
<td>GUCAUCAAUGGUGAU</td>
 +
<td>CAAUG</td>
 +
<td>AGGGGUUCGU</td>
 +
<td>((((((...))))))</td>
 +
</tr>
 +
<tr>
 +
<td align="right">1193</td>
 +
<td>-</td>
 +
<td align="right">1170678</td>
 +
<td align="right">1170642</td>
 +
<td align="right">36</td>
 +
<td align="right">69043.84</td>
 +
<td align="right">7.96</td>
 +
<td align="right">0.000115289</td>
 +
<td>UAUCAGGAUG</td>
 +
<td>UGACUGAGAACUCAAUCA</td>
 +
<td>GAAC</td>
 +
<td>UCCUGAUCGU</td>
 +
<td>(((.(((....))).)))</td>
 +
</tr>
 +
<tr>
 +
<td align="right">349</td>
 +
<td>+</td>
 +
<td align="right">908409</td>
 +
<td align="right">908444</td>
 +
<td align="right">35</td>
 +
<td align="right">73266.04</td>
 +
<td align="right">10.08</td>
 +
<td align="right">0.000137581</td>
 +
<td>CAAACCCAGU</td>
 +
<td>GUCUUCUUGUUGGAGGC</td>
 +
<td>UUGUU</td>
 +
<td>UGGGUUUUUG</td>
 +
<td>((((((.....))))))</td>
 +
</tr>
 +
<tr>
 +
<td align="right">498</td>
 +
<td>+</td>
 +
<td align="right">1270707</td>
 +
<td align="right">1270744</td>
 +
<td align="right">37</td>
 +
<td align="right">134.76</td>
 +
<td align="right">0.04</td>
 +
<td align="right">0.000296824</td>
 +
<td>GGCAUUUGGG</td>
 +
<td>GGGCGGCGGUGGGUCGCCC</td>
 +
<td>GGUGG</td>
 +
<td>UUUUUUUCUG</td>
 +
<td>(((((((.....)))))))</td>
 +
</tr>
 +
<tr>
 +
<td align="right">586</td>
 +
<td>+</td>
 +
<td align="right">1518890</td>
 +
<td align="right">1518927</td>
 +
<td align="right">37</td>
 +
<td align="right">683.16</td>
 +
<td align="right">0.4</td>
 +
<td align="right">0.000585514</td>
 +
<td>CCACAUUAGC</td>
 +
<td>GCUCUCGCCUGUCGAGAGC</td>
 +
<td>CCUGU</td>
 +
<td>UUUUUUAUGC</td>
 +
<td>(((((((.....)))))))</td>
 +
</tr>
 +
<tr>
 +
<td align="right">909</td>
 +
<td>+</td>
 +
<td align="right">2385885</td>
 +
<td align="right">2385928</td>
 +
<td align="right">43</td>
 +
<td align="right">731.4</td>
 +
<td align="right">0.56</td>
 +
<td align="right">0.000765655</td>
 +
<td>GUCUAAAACC</td>
 +
<td>CCGCUGGUUCCCAGAGAGCUAGCGG</td>
 +
<td>CCAGA</td>
 +
<td>UUUUCCUUAU</td>
 +
<td>((((((((((.....))))))))))</td>
 +
</tr>
 +
</tbody></table>
  
 
<br>
 
<br>

Revision as of 01:47, 22 October 2019

Modelling


This year we used our mathematical and programming background to look for artificial Neutral integration Site option (aNSo) and suitable terminators for our project. We took advantage of genome data bank of UTEX2973 and used bioinformatics tools to gain insights and implement it to our project. In addition to that, we designed a model to predict the doubling times of UTEX2973 that was only possible after a thorough investigation and standardization of the current state of the art methods. To achieve this level of standardization we also implemented a light model to properly predict light intensities for our cultures.


Growth Curve Model


artificial Neutral integration Site options


Terminator Model