Line 2,060: | Line 2,060: | ||
<div class="row para1"> | <div class="row para1"> | ||
<div class="col"> | <div class="col"> | ||
− | We use Gillespie algorithm in stochastic modelling. | + | We use Gillespie algorithm in stochastic modelling. The procedure of this algorithm[7] is shown as follows in the form of pseudocode:<br /><br /> |
− | + | Step 1: Initialize the reaction system at \(t=0\) with rate constants \(c1, c2, ......, cv\) as initial numbers of molecules \(x1, x2, ......, xu\) corresponding to \(v\) reactions and \(u\) sustances (both reactants and products) involved in the reaction system.<br /><br /> | |
+ | Step 2: For each \(i=1,2,......,v\), calculate the hazard for the i<sup>th</sup>> type of reaction, denoted as \(h_i(x,c_i)\) based on current substance number x.<br /><br /> | ||
+ | Step 3: Calculate the combined reaction hazard \(h_0(x,c)=\sigma_{i=1}^{v}h_i(x,c_i)\).<br /><br /> | ||
+ | Step 4: Simulate the time to the next reaction, \(t^'\) , which is a random quantity that obeys exponential distribution with parameter \(\lambda\).<br /><br /> | ||
+ | Step 5: Put \(t:=t+t^'\). <br /><br /> | ||
+ | Step 6: Simulate the reaction index, \(j\). The probability that the \(j^{th}\) reaction can occur is \(\frac{h_i(x,c_i)}{h_0(x,c)}, i=1,2,......,v.\).<br /><br /> | ||
+ | Step 7: Update \(x\) according to reaction \(j\), which means putting \(x:=x+S^{(j)}\), where \(S^{(j)}\) denotes the \(j^{th}\) colomn of the stoichiometry matrix \(S\). The \(j^{th}\) column of denotes the change in substance number caused by the \(j^{th}\) reaction.<br /><br /> | ||
+ | Step 8: Record time \(t\) and current substance number \(x\).<br /><br /> | ||
+ | Step 9: If \(t《T_max\), return to step 2. \(T_{max}\) corresponds to the maximum duration of the reaction set by the user.<br /><br /> | ||
+ | Step 10: Plot the result to see the dynamic of the quantity of the substance that you are interested in.<br /><br /> | ||
+ | Although the algorithm is rather simple, basic mathematical skills is required to understand its theoretical basis. You may consult the book written by <a href="#Ref4">Wilkinson and Darren J.</a> for a thorough understanding. The result is shown in <a href="#Fig5">Fig. 5</a>.<br /><br /> | ||
+ | The result demonstrates that recombined \(P_{target}\)s do occur and two rounds of reverse transcription and recombination can take place in one replication cycle of E. coli (1200 s) (<a href="#Fig5">Fig 5a</a>). On the contrary, no recombined will come out within that period if the initial cDNA is 5 molecules and initial Cre is 3228 molecules (<a href="#Fig5">Fig 5b</a>). This again demonstrates the necessity of putting RT and Cre under different induction setups. The fluctuation of the number of recombined \(P_{target}s\) is due to the backward reaction that Cre can rebind with recombined and reverting the action, making it not counted as recombined \(P_{target}\) by the algorithm. | ||
</div> | </div> | ||
</div> | </div> | ||
Line 2,080: | Line 2,091: | ||
<div class="row para1"> | <div class="row para1"> | ||
<div class="col"> | <div class="col"> | ||
− | + | To ensure the evolved protein is encoded by the mutated GOI sequence that is recombined into \(P_{target}\), we decided to use degradation tag to accelerate the degradation process of Cre. This design would make Cre only function when inducer is in the system, thus allowing stringent control of the protein. However, we then face the problem of how to select the optimal degradation tag. Empirically, to minimize the duration of recombination, we tend to choose degradation tags with higher efficiency, but extremely high degradation rate will also reduce the yield of recombined \(P_{target}\), leading to decreased library size. Also, it is impractical for researchers to do experiments to test these degradation tags one by one. For these reasons, we are going to use models to find out the optimal degradation tag that should be added to Cre based on the average yield of recombined \(P_{target}\) at the end of R-Evolution functioning period (8 hours).<br /><br /> | |
− | + | We intend to use the models described in Part I, combined with aTc induction model proposed by <a href="#Ref5">Steel et al</a>. to compute the yield of recombined \(P_{tartget}\) under different degradation rate of Cre (the reason why Tet operon is used has been elaborated in Part I; the schematic diagram of this process is shown in <a href="#Fig6">Fig 6a</a>. Details of the substance names, parameter names and mathematical equations can be found in the appendix). <br /><br /> | |
− | + | Although the setup in Part I successfully provided us with a clear insight into the reactions and dynamic changes of substances that underlie our mutagenesis system, the simplification that the steady-state substance concentrations of previous models can be used as inputs for subsequent models doesn’t match real reaction situation. For example, when Cre is expressed, it can immediately bind with cDNA and initiate recombination. This fact contradicts with our model assumption that recombination only takes place after both Cre and cDNA has reached their steady-state concentration.<br /><br /> | |
− | + | To overcome this issue, we decided to combine all three minor models together and calculate the expected output.<br /><br /> | |
+ | As a result of the impreciseness of the basic assumption of the models in part I, we only gave a qualitative conclusion that the amount of RT and Cre should be different. Here we need to quantify how Cre degradation rate and steady-state concentration affects the yield of recombined \(P_{target}\). That’s why we employed deterministic model here to combine the separate steps together into one and better simulate real intracellular circumstances.<br /><br /> | ||
+ | By combining the models that have been talked above, we revealed the reason why the degradation tag with a moderate degradation rate, which can’t be too high or too low, should be selected (<a href="#Fig6">Fig 6a</a>).: under appropriate inducer concentration (20~22uM), when the degradation rate is relatively low (below 0.1 min^{-1}), the yield of recombined \(P_{target}\) will increase according to the increase of Cre degradation rate, but when that rate is sufficiently high (above 0.1 min^{-1}), the increase of Cre degradation rate will do harm to the yield of recombined \(P_{target}\).<br /><br /> | ||
+ | The average degradation rate acquired from literature is 0.2 min^{-1}<sup><a href="#Ref1">[1]</a></sup>> and the degradation rate of Cre when tagged with the most efficient degradation tag is 0.69 min^{-1}. Within this range of degradation rate, the maximum yield of recombined \(P_{target}\) will decrease according to the increase of Cre degradation efficiency (<a href="#Fig6">Fig 6b</a>). So we decided to use the least efficient degradation tag.<br /><br /> | ||
+ | We also revealed the dynamic change of the recombined \(P_{target}\). It will continuously accumulate within Cre function period (<a href="#Fig6">Fig. 6c</a>). However, the concentration remains to be low within that period, due to Cre degradation (<a href="#Fig6">Fig. 6d</a>).<br /><br /> | ||
+ | Finally, there is another interesting phenomenon that is worth mentioning. From <a href="#Fig6">Fig. 6a</a> and <a href="#Fig6">Fig. 6b</a>, we can find that for each degradation tag rate greater than 0.2 min^{-1}, there exits a range of aTc dosage that can make the yield of recombined relatively big. Also, decreased degradation efficiency enlarges that range. This discovery provides us with another reason for using less efficient degradation tag in that it can increase the robustness of our mutagenesis system by decreasing its sensitivity to the change of inducer dosage. | ||
</div> | </div> | ||
</div> | </div> | ||
Line 2,110: | Line 2,126: | ||
<div class="col"> | <div class="col"> | ||
<ul class="paraUl" style="list-style:none;"> | <ul class="paraUl" style="list-style:none;"> | ||
− | <li>[1]. Stamatakis M, Mantzaris N V. Comparison of Deterministic and Stochastic Models of the lac Operon Genetic Network[J]. Biophysical Journal, 2009, 96(3):887-906.</li> | + | <li>[1]. <a name="Ref1">Stamatakis M, Mantzaris N V. Comparison of Deterministic and Stochastic Models of the lac Operon Genetic Network[J]. Biophysical Journal, 2009, 96(3):887-906.</a></li> |
− | <li>[2]. Kulpa, D. Determination of the site of first strand transfer during Moloney murine leukemia virus reverse transcription and identification of strand transfer-associated reverse transcriptase errors[J]. EMBO (European Molecular Biology Organization) Journal, 1997, 16(4):856-865.</ | + | <li>[2]. <a name="Ref2">Kulpa, D. Determination of the site of first strand transfer during Moloney murine leukemia virus reverse transcription and identification of strand transfer-associated reverse transcriptase errors[J]. EMBO (European Molecular Biology Organization) Journal, 1997, 16(4):856-865.</a></li> |
− | + | <li>[3]. <a name="Ref3">Ringrose L, Lounnas V, Ehrlich L, et al. Comparative kinetic analysis of FLP and cre recombinases: mathematical models for DNA binding and recombination[J]. Journal of Molecular Biology, 1998, 284(2):0-384.</a></li> | |
− | <li>[ | + | <li>[4]. <a name="Ref4">Wilkinson, Darren J. Stochastic Modelling for Systems Biology, Second Edition[M]. Crc Press, 2011.</a></li> |
− | + | <li>[5]. <a name="Ref5">Harris A W K, Kelly C L, Steel H, et al. The autorepressor: A case study of the importance of model selection[C]. Decision & Control. IEEE, 2018.</a></li> | |
− | <li>[ | + | |
− | + | ||
</ul> | </ul> | ||
</div> | </div> |
Revision as of 18:48, 19 October 2019