MODELING
DELAY SYSTEM SIMULATION
Introduction
One of the most novel designs in our project is the delay system of the cordycepin expression. Cordycepin, which is also called 3'-deoxyadenosine will be easily converted to functionless substance 3'-deoxyinosine (3'-dI) by adenosine deaminase (ADA) in cell when its concentration is high. Pentostatin in our project, acts as the protector against ADA. In this system, pentostatin needs to be expressed before the expression of cordycepin to protect it. Based on the background, we designed a delay system using a pMET3 promoter, which is suppressed by the presence of methionine. More details about delay system.
As the natural consumption of methionine, the expression of cns1-cns2 fusion protein will turn on when methionine is depleted. Therefore, we realize delay production of cordycepin. Additionally, we can adjust the delayed time by changing the initial concentration of added methionine.
Prerequisite Knowledge
General Description
Considering the gene expression process, we can divide the process into two major sections, protein synthesis and protein degradation.
Assumption 1: Protein synthesis and degradation are deterministic processes in the system.
Therefore, we can derive following equation:
where in this equation, represents the concentration of the protein, while the represents the velocity of the synthesis and represents the degradation rate of the protein.
While focusing on the synthesis of the protein, it can be divided into transcription, translation and degradation of mRNA.
Assumption 2: Protein Synthesis is considered to be the process of transcription, translation and mRNA degradation.
Thus, we can get these expressions:
If transcription factor (TF) is an activator that increases the production of the protein, , then the rate of production of will be proportional to the fraction of the factors that are bound, . If TF is a repressor, then the rate will be
proportional to . is the degradation constant of mRNA. (Chandran D et al., 2008)
can be represented by the transcriptional constant and the fraction of TF that are bound:
if the TF is the activator, while
if the TF is the repressor.
As a whole, the kinetic equations of a protein is:
Hill-Langmuir Equation
Hill function is widely used in describing the interaction between promoter and its transcription factor. (Goldenfeld N et al., 2007)
A useful function that describes many real gene input functions is called the Hill function. The Hill function can be derived from considering the equilibrium binding of the transcription factor to its site on the promoter.
Likewise, the description of Hill equation in Wikipedia is:
In biochemistry and pharmacology, the Hill equation refers to two closely related equations that reflect the binding of ligands to macromolecules, as a function of the ligand concentration.
According to Hill-Langmuir equation, the fraction of the receptor protein concentration that is bound by the ligand is represented by the concentration of the ligand , the dissociation constant and the Hill coefficient .
In this equation, is the ligand concentration producing half occupation.
Kinetic Simulation of a protein
As illustrated before, the kinetic equations of a protein is:
If the TF is an activator, the is represented as:
While if the TF is a repressor, the is represented as:
Therefore, we can get that, if the TF is an activator:
If the TF us a repressor:
Analysis of delay system
The interaction of inducer, transcription factors and promoters are illustrated below:
One of the most important parts of the delay system simulation is the methionine consumption. The consumption of methionine controls the switching time of the system, so the model used in this session needs to be carefully evaluated.
Methionine Consumption
Since we use the natural consumption as the time delay to postpone the expression of fusion protein cns1A2, we'd like to simulate the consumption of the methionine.
Assumption 3: The consumption of the methionine of a yeast cell per unit time is fixed, and the total consumption is dependent on the population of the yeast cells and the time.
Logistic Population Model
The culture of engineered yeast cells is limited owing to the restricted space and nutrition. One of the methods to describe the population in symmetric way is logistic population model.
According to the population dynamics, population increasing rate is proportional to the population size.
where is the population size, and r is the intrinsic rate of increase.
In logistic population model, the carrying capacity of the system being studied is defined as . Carrying capacity is the population level at which the birth and death rates of a species precisely match, resulting in a stable population over time.
Therefore, the population size can be represented by the initial intrinsic rate of increase , the carrying capacity and the population size itself.
Gompertz Curve
In bacteria growth researches, it is found that a method called Gompertz Curve has a better performance in fitting with the experimental data (Zwietering M et al., 1990).
Gompertz curve is raised originally to describe a person's resistance to death decreases as his age increases. (Winsor et al., 1932)
Used in Biology to describe growth curve, it is written in:
where N is the population size which is dependent on the time, asymptote , the x-axis skewing of the inflection point and the growth rate .
Considering the differenciation of the Gompertz curve:
Reparameterization of Growth Models
Since the parameters in the equation above (a, b, c) do not have biological meaning, it is difficult to estimate the values of the parameters. Therefore, all the growth models were rewritten to substitute the mathematical parameters with , , and . (Zwietering M et al., 1990)
Figure. 2 Biological meaning of parameter reparameterization of the growth model. is the asymptote, is the x-axis skewing and is the max growth rate.
According to the article, the logistic equation can be reparameterized to:
and
Likewise, Gompertz curve can be reparameterized in the following form:
and
Methionine Comsumption Calculation
In the case of our project, the yeast strain we use is able to produce methionine intrinsically but it will not produce methionine until the externally added methionine is depleted. The consumption of methionine is dependent on the population of the yeast cell and time.
Assumption 4: The consumption of methionine will not affect the protein synthesis in the cell, since the yeast strain can intrinsically produce methionine, and the strength of the promoter will not be weakened because of it.
Since the cells consume methionine more quickly when it is abundant and slowly when it is nearly depleted, the change of concentration of methionine can be represented as:
In the equation above, is the initial concentration of methionine, using which we eliminate the influence of the addition of to the equation. This equation shows that the cell consumes methionine more slowly as the gradual depletion, which is reasonable in the real world.
In our simulation for delay time system, we use logistic model to represent the cell growth and the equation above becomes:
In the equation above, is the OD value of cell growth, and we simply evaluate the cell number based on this OD value. Therefore, is set optimally based on the result from British_Columbia 2012. is the asymptote, is the x-axis skewing of the inflection point and is the growth rate.
Simulation of cns1A2 Expression
Putting all the equations together, we can derive the following ordinary differential equation (ODE) system:
In the equations, since the ordinary differential equation of methionine concentration causes minus concentration after a certain period, we did an alteration, multiplying , to keep the concentration around zero when it is depleted. The equations above describes the genetic circuit involved in delay system, leaving alone the expression of cns3-HisG domain.
Simulation of Delay System
In our project, cns3 is constitutively expressed so it will not be affected by the concentration of the transcription factor.
In S. cerevisiae, the repression of MET3 by methionine is unusually stringent, with absence of leaky expression. (Care R S et al., 1999). Thus, in this modelling project, we do not consider the leaky expression in the equations.
The expression of the whole system is represented by the following equations:
Parameter Determination
All the parameters used in delay system model are based on experimental data or derived from research articles, which make the quantitative model reliable to use.
Methionine Consumption
Total consumption rate of methionine is related to the amount of yeast cell in culture, and it is evaluated by logistic model. The parameters (A, b, e) are from the fitting of logistic model to the experimental data of S.cerevisiae YM4271 straingrowth. More details about YM4271 growth measurement.
We estimated the parameters using nonlinear regression of logistic model in R and get the result below.
Figure. 3 Data fitting of experimental data and logistic model and parameter estimates of the model. In the figure, b is the opposite number of growth rate, d is the asymptote and e is the x-axis skewing.
From the p-value of the parameter estimates, we could regard it a reliable simulation. Therefore, we chose the parameters of yeast growth based on this result.
mRNA Synthesis
The transcriptional activator GAL4 binds as a dimer, so the hill constant of it is . Originally, there is an interaction between GAL4 and GAl80, which has more interaction with other proteins like GAL3. However, in our project, after consulting Prof. Liu for genetic engineering strategies in yeast, we change our yeast strain from AH109 to YM4271, which is GAL4 and GAL80 deficient (GAL4, GAL80). Therefore, the GAL4 protein acts as the pure activator in this case. (Traven A et al., 2006)
Hong et al. discovered that Gal4 binds DNA with an apparent dissociation constant () of 24 nM. Since is defined as and , equals 4.9 nM. (Hong M et al., 2008)
In the equations, parameters like mRNA synthesis rate , mRNA translation rate , mRNA decay rate and protein decay rate are of vital importance in the simulation. According to Miller et al, mRNA decay and synthesis are functionally independent during normal growth.
(Miller C et al., 2011)
In dynamic transcriptome analysis of yeast, the average synthesis rate of transcription factor (TF) is around mRNA/cell/cellCycle. A cell cycle is defined as 150 min. Here, we specify the transcription rate of GAL4 controlled by pMET3 to be this average value. (Miller C et al., 2011)
Referring to Mitre et al. research, there is a conversion of unit between and , it assumes the cell volume of yeast is : (Mitre T et al., 2016)
Using this calculation in our project,
Likely, paper from Sun et al. reveals the mean mRNA synthesis rate, mRNA/cell/cellCycle. Therefore, we define the transcription rate of our uncharacterised protein cns3 to this value. (Sun M et al., 2012)
In another paper, the transcription rate (synthesis rate) of GAL1 gene after pGAL1 promoter is 1.042 molecules/(cell × min) (Mitre T et al., 2016), which can be converted to M/h:
The unit of these constants are .
The mechanism and kinetic of repression by methionine to MET3 promoter is is poorly known, resulting in no data for Hill coefficient of methionine and dissociation constant binding to its target.
From the only paper found to describe the -galactosidase activity driven by MET3 promoter, we estimate the , which is the ligand concentration producing half occupation, to be , which is standardized to . (Mao X et al., (2002)
mRNA Degradation
Considering the decay rate of mRNAs, we can derive the average decay rate of the transcription factor which substitutes the specific decay rate for each mRNA, since little data are available for the decay rate of .
The decay rate is derived from the half-life of mRNA.
(Pérez‐Ortín J et al., 2013)
In which is the mRNA half-life and represents the degradation constant.
In dynamic transcriptome analysis, the median half-lives of transcription factor mRNAs calculated in yeast cell is around 11 min (Miller C et al., 2011), from which we can get the median decay rate in unit of .
Since cns3 and cns1A2 protein are poorly characterised, we define the decay rate of the mRNAs of these two protein the median decay rate of yeast mRNAs, which are calculated above, .
However, the decay rate of GAL4 mRNA is described in the literature, which is around which is derived from the half-life of 16 min (Mitre T et al., 2016).
From the data above, we can get the decay rate of the GAL4 mRNA.
Protein Synthesis
Considering the translation rate of the mRNAs, Mitre et al. estimated the parameters based on the equation below:
In, Arava et al. (2003), "fraction of translated mRNA" is defined to be relative translation rate, which is unitless:
Since there is no data available for the relative translation rate of GAL4, we use the median relative translation rates of all associated GAL mRNAs reported, which is given by 0.145. (Arava Y et al., 2003)
As the ORF length increased, the ribosome density dropped from an average of 1.2 ribosomes per 100 nts for ORFs shorter than 400 nts to an average of 0.14 ribosomes per 100 nts for ORFs longer than 3,600 nts.
The length of cns3 protein is around aa, while the length of fusion protein of cns1 and cns2, cns1A2, is around aa. According to the relation illustrated in the literature, the ribosome density is for cns3 protein and for cns1A2.
With the data given that the average ribosome occupancy is , we can calculate the relative translation rate of cns3 is while that of cns1A2 is .
Additionally, from the article of Bonven et al. (1979), the mRNA elongation rate if around 9.3 a.a per second. (Bonven B et al., 1979)
Considering the paper of Ideker et al., ratios of proteins to mRNA lies between 4200 and 4800. (Ideker T et al., 2001)
Integrating the constant about translation above, we can calculate the translation rate of the proteins used in our project.
Protein Degradation
Using an epitope-tagged strain collection, researchers find that the mean half-life of proteins in yeast is about (Belle A et al., 2006). From the equation in mRNA degradation, we can calculate the degradation rate, which is .
Since we can no direct data for the degradation rate of GAL4, cns1A2 and cns3, we use this mean degradation rate.
MATLAB code
MATLAB code to solve the equation system:
function [f] = dXdT(t, x)
%constant
A = 1.900786;
e = 28.689490;
b = 0.270983;
kmet = 1.15*10**(-4);
kTC1 = 2.37*10**(-7);
kA1 = 4*10**(-8);
D1 = 2.60;
kTL1 = 23117;
B1 = 0.97;
kTC2 = 1.48*10**(-6);
kA2 = 4.9*10**(-9);
D2 = 3.78;
kTL2 = 33502;
B2 = 0.97;
kTC3 = 6.83*10**(-7);
D3 = 3.78;
kTL3 = 68774;
B3 = 0.97;
N = 2;
MET=0.0001; %Same with the initial concentration of methionine
%parameter
met=x(1);
mgal=x(2);
gal=x(3);
m1A2=x(4);
p1A2=x(5);
mc3=x(6);
c3=x(7);
%ODE
dmetdt = -kmet*(met/MET)*10**(A/(1+exp(-b*t+b*e)));
dmgaldt = kTC1/(1+(met/kA1)**N)-D1*mgal;
dgaldt = kTL1*mgal-B1*gal;
dm1A2dt = kTC2/(1+(kA2/(gal+10**(-10)))**N)-D2*m1A2;
dp1A2dt = kTL2*m1A2-B2*p1A2;
dmc3dt = kTC3-D3*mc3;
dc3dt = kTL3*mc3-B3*c3;
f = [dmetdt; dmgaldt; dgaldt; dm1A2dt; dp1A2dt; dmc3dt; dc3dt];
end
tspan = 0:0.001:60;
initial = [0.0001,0,0,0,0,0,0];
[t,x] = ode45( @dXdT, tspan, initial);
Result
The result of the ODE solution is plotted below. We compared the expression of cns1A2 and HisG in our delay system and concluded that we successfully delayed the expression of cns1A2. Since the typical concentration of methionine used as repressing signal is 1mM, the delayed time under 1mM methionine is around 20 hours, which means that cns1A2 starts expression 20 hours after the addition of methionine:
Figure. 4 Elucidation of expression level of cns1A2 and HisG protein under 1.0mM initial methionine when A = 1.900786, e = 28.689490, b = 0.270983, kmet = 1.15E-4, kTC1 = 2.37E-7, kA1 = 4E-8, D1 = 2.60, kTL1 = 23117, B1 = 0.97, kTC2 = 1.48E-6, kA2 = 4.9E-9, D2 = 3.78, kTL2 = 33502, B2 = 0.97, kTC3 = 6.83E-7, D3 = 3.78, kTL3 = 68774, B3 = 0.97, N = 2.
The figure above demonstrates our design of the delay system as methionine successfully delays the expression time of cns1A2.
Our goal of designing delay system is flexible manipulation of expression time of our target protein. We would like to investigate the connection of initial methionine concentration and its corresponding delayed time. We tested the methionine concentration between 0.1mM and 0.5mM and plotted the result in the following figure.
Figure. 5 Elucidation of concentration of cns1A2 under various initial methionine when A = 1.900786, e = 28.689490, b = 0.270983, kmet = 1.15E-4, kTC1 = 2.37E-7, kA1 = 4E-8, D1 = 2.60, kTL1 = 23117, B1 = 0.97, kTC2 = 1.48E-6, kA2 = 4.9E-9, D2 = 3.78, kTL2 = 33502, B2 = 0.97, kTC3 = 6.83E-7, D3 = 3.78, kTL3 = 68774, B3 = 0.97, N = 2.
The delayed time under 0.1mM methionine in our model is about 1 or 2 hours, comparing to around 70 hours under 50mM methionine. We noticed that the upper bound of cns1A2 protein under various methionine concentrations is constant, suggesting that methionine only affects the delayed time.
This result testifies the flexibility of the adjustment on the delayed time by proper determination of methionine concentration. However, 70 hours is not the limit of the system. The solubility of methionine is about 379mM. Therefore, we further investigated the delayed time when increasing methionine concentration.
Figure. 6 Elucidation of concentration of cns1A2 under initial methionine concentration of 100mM, 200mM, 300mM and 379mM when A = 1.900786, e = 28.689490, b = 0.270983, kmet = 1.15E-4, kTC1 = 2.37E-7, kA1 = 4E-8, D1 = 2.60, kTL1 = 23117, B1 = 0.97, kTC2 = 1.48E-6, kA2 = 4.9E-9, D2 = 3.78, kTL2 = 33502, B2 = 0.97, kTC3 = 6.83E-7, D3 = 3.78, kTL3 = 68774, B3 = 0.97, N = 2.
Initial concentration from 100mM to 379mM causes an extension of delayed time from around 100 hours to 360 hours. So that we can theoretically delay the expression of cns1A2 to maximal 360 hours, which is 15 days. Definitely yeast cells are not likely to live more than 15 days in normal cases, so in theory, our design of this delay system can satisfy any requirement of the delayed time.
Conclusion
In the modeling part of the delay system, we used the Hill equation to describe the binding between transcription factor and promoter. We took transcription, translation, RNA degradation, and protein degradation into consideration. We ignore the leaking expression of GAL4 protein because pMET3 is reported as a tight promoter. We used the logistic model as the population simulation of yeast cells and calculate the methionine consumption based on this model. All the parameters are chosen according to corresponding research articles. The model shows a range of delayed time from 1 hour to maximal 360 hours in theory.
The result of modeling in the delay system will give not only a profile of our project but also a guideline for the experiment. According to the details of our demand, we can alter the concentration of methionine and then adjust the delayed time of cordycepin production. In conclusion, we successfully realized flexible control of delayed time using our delay time system.
Appendix
Appendix shows all the symbols and parameters used in delay system model, together with their values and references
Symbol | Description |
---|---|
[ ] | The concentration symbol |
population size of the cells in the culture | |
methionine | |
initial concentration of methionine | |
mRNA transcribed by gal4 gene | |
GAL4 protein, which is translated by | |
mRNA transcribed by cns1-cns2 fusion protein | |
fusion protein of cns1-cns2 | |
keto form of the product of cns2, at location of enzyme cns1 | |
cns3 protein, which is translated by |
Constant | Description | Value | Reference |
---|---|---|---|
Carrying capacity (in OD value) of the system | 1.900786 | nonlinear regression | |
rate of populational increase | 0.270983 | nonlinear regression | |
x-axis offset of the inflection point | 28.689490 | nonlinear regression | |
Hill constant | 2 | Traven et al. (2006) | |
Consumption rate of methionine | British_Columbia 2012 | ||
Transcription strength of binded promoter of gal4 gene | Miller C etal. (2014) | ||
ligand concentration producing half occupation of pMET3 promoter | Mao et al. (2002) | ||
Degradation rate of | Mitre et al. (2016) | ||
Translation rate of | Arava et al. (2003) | ||
Degradation rate of GAL4 protein | Belle et al. (2006) | ||
Transcription strength of binded promoter of cns1A2 gene | Mitre et al. (2016) | ||
ligand (GAL4) concentration producing half occupation of pGAL1 promoter | Hong et al. (2008) | ||
Degradation rate of | Miller C et al. (2004) | ||
Translation rate of | Arava et al. (2003) | ||
Degradation rate of cns1A2 protein | Belle et al. (2006) | ||
Transcription strength of binded promoter of cns3 gene | Miller C etal. (2014) | ||
Degradation rate of | Miller C et al. (2004) | ||
Translation rate of | Arava et al. (2003) | ||
Degradation rate of cns3 protein | Belle et al. (2006) |
Reference
Chandran, D., Copeland, W. B., Sleight, S. C., & Sauro, H. M. (2008). Mathematical modeling and synthetic biology. Drug Discovery Today: Disease Models, 5(4), 299-309.
Goldenfeld, N. (2007). An introduction to systems biology: design principles of biological circuits; systems biology: properties of reconstructed networks. Physics Today, 60(6), 63-64.
Zwietering, M. H. . (1990). Modeling of the bacterial growth curve. Appl. Environ. Microbiol., 56(6), 1875-1881.
Winsor, & C., P. . (1932). The gompertz curve as a growth curve. Proceedings of the National Academy of Sciences, 18(1), 1-8.
Care, R. S. , Trevethick, J. , Binley, K. M. , & Sudbery, P. E. . (1999). The met3 promoter: a new tool for candida albicans molecular genetics. Molecular Microbiology, 34.
Traven, A. , Jelicic, B. , & Sopta, M. . (2006). Yeast gal4: a transcriptional paradigm revisited. Embo Journal, 7(5), 496-499.
Hong, M. , Fitzgerald, M. X. , Harper, S. , Luo, C. , Speicher, D. W. , & Marmorstein, R. . (2008). Structural basis for dimerization in dna recognition by gal4. Structure, 16(7), 1019-1026.
Miller, C., Schwalb, B., Maier, K., Schulz, D., Dümcke, S., Zacher, B., ... & Martin, D. E. (2011). Dynamic transcriptome analysis measures rates of mRNA synthesis and decay in yeast. Molecular systems biology, 7(1).
Mitre, T. M., Mackey, M. C., & Khadra, A. (2016). Mathematical model of galactose regulation and metabolic consumption in yeast. Journal of theoretical biology, 407, 238-258.
Sun, M., Schwalb, B., Schulz, D., Pirkl, N., Etzold, S., Larivière, L., ... & Cramer, P. (2012). Comparative dynamic transcriptome analysis (cDTA) reveals mutual feedback between mRNA synthesis and degradation. Genome research, 22(7), 1350-1359.
Mao, X., Hu, Y., Liang, C., & Lu, C. (2002). MET3 promoter: a tightly regulated promoter and its application in construction of conditional lethal strain. Current microbiology, 45(1), 37-40.
Pérez‐Ortín, J. E., Medina, D. A., Chávez, S., & Moreno, J. (2013). What do you mean by transcription rate? The conceptual difference between nascent transcription rate and mRNA synthesis rate is essential for the proper understanding of transcriptomic analyses. BioEssays, 35(12), 1056-1062.
Arava, Y., Wang, Y., Storey, J. D., Liu, C. L., Brown, P. O., & Herschlag, D. (2003). Genome-wide analysis of mRNA translation profiles in Saccharomyces cerevisiae. Proceedings of the National Academy of Sciences, 100(7), 3889-3894.
Bonven, B., & Gulløv, K. (1979). Peptide chain elongation rate and ribosomal activity in Saccharomyces cerevisiae as a function of the growth rate. Molecular and General Genetics MGG, 170(2), 225-230.
Ideker, T., Thorsson, V., Ranish, J. A., Christmas, R., Buhler, J., Eng, J. K., ... & Hood, L. (2001). Integrated genomic and proteomic analyses of a systematically perturbed metabolic network. Science, 292(5518), 929-934.
Belle, A., Tanay, A., Bitincka, L., Shamir, R., & O’Shea, E. K. (2006). Quantification of protein half-lives in the budding yeast proteome. Proceedings of the National Academy of Sciences, 103(35), 13004-13009.