Team:SCU-China/Enzyme

MODELING

SCU-China Header


ENZYME KINETIC SIMULATION

a

Introduction

Our project this year conentrates mainly on the metabolic circuit of cordycepin (COR) and pentostatin (PTN). It involves three enzymes, cns1, cns2 and cns3-HisG domain. Since all our work is related to these three enzymes, it is necessary to depict the overall picture about the enzymatic system.

Generally, our aim of constructing the enzyme kinetic simulation is devided into two parts. Firstly, we would like to theoretically give the model that testifies the necessity of conducting fusion expression of cns1 and cns2 in experiments, which can give powerful support and confidence to the later experiments of fusion expression. Additionally, we would like to discover how diffusion rate influence the production rate. Secondly, since our goal is to co-ferment cordycepin and pentostatin furthestly, we have to build a model which consists of all the elements involved in the metabolic circuit of cordycepin and pentostatin. More importantly, we need to find the correlation between the concentration of cordycepin and pentostatin, as well as give the optimal substitute ratio and optimal expression strength of the enzymes.


1 Fusion-Expression Simulation

Our task is to deal with the circuit below. In the cirsuit, cns1 converts 3'-AMP to the enol state of 2'-C-3'-dA, which then natually converts to the keto state.

This modeling section aims to compare the efficiency of the enzymatic reactions using a fusion-expression strategy to that using a separate-expression strategy, which in result will give theoretical support for the corresponding experiments.

It is reported in literature that there exists interaction between cns1 and cns2 in vivo by yeast two-hybrid analysis [Xia Y et al. (2017)]. The phenomenon of interaction between two or more enzymes is not peculiar. It is common to see some multienzyme complex in nature, including pyruvate dehydrogenase, rubisco and even respiratory electron-transport chain, which is recently proven to be a megacomplex. The common consensus about the multienzyme complex is that it allows to segregate certain biochemical pathways into one place in the cell, which increases the catalytic efficiency and reduces side effects. In nature, the expression of enzymes in the multienzyme complex is generally independent, after which they would meet and combine into a multienzyme complex.

In the theory of structural biology, the fusion-expression strategy connects the two enzymes during expression so that the substitute of the first enzyme is sequentially converted to the product of the second enzyme, which is more efficient than separate-expression strategy. However, it has not been mathematically proven in enzyme kinetics. Additionally, the evaluation of the linker connecting the enzymes is obscure before doing specific enzyme-activity experiments.

Here, we would like to build an enzyme kinetic model to simulate the separate-expression strategy and the fusion-expression strategy. Determination of the linker is conducted in computational structural biology model.

It is reported that keto-enol tautomerism can be simulated as a first-order reaction [Huang L et al. (2007)] and can be represented as:

dCdt=keqC\frac{dC}{dt} = -k_{eq}C

in which CC is the concentration of the keto form.

  • General Assumption: Our enzyme kinetic simulation is assumesdin vivo, so all the enzymatic reaction is reversable and 2'-C-3'-dA is considered to be all in keto form (keq=[Enolform][Ketoform]1k_{eq} = \frac{[Enol form]}{[Keto form]} \ll 1).

1.1 Distance between molecules in solution

The difference between these two strategies is the relative concentration of the substitute of the second enzyme, since the concentration is lower in seperate-expression strategy because the product of the first enzyme (which is at the same time the substitute of the second enzyme) need to diffuse to the second enzyme. These can be illustrated by enzyme kinetic equations.

Considering cns1 and cns2 enzyme diffusing in the solvent, the initial concentration of the enzymes is uniform in the whole system. When the substitute of the first enzyme is converted into the product, it need to diffuse to the second enzyme.

Therefore, the diffusing distance is the key issue here.

Firstly, we shoule consider the distance

  • Assumption 1: The solution is uniform, which means that the concentration of the solvent at one position is the same with that at any other position in the solution.

  • Assumption 2: All the enyzmes can be assumed as spheres.

The average distance between molecules in a solution and the average radius of a molecule is calculated by Kozanecki et al. [Kozanecki M et al. (2016)]

It is frequently useful to know the average volume of solution occupied by each molecule, or more directly, the average distance separating molecules in solution. This is a simple calculation based only on the molar concentration. In a 1 M solution, there are 6×10236×10^{23} molecules/L, = 0.6 molecules/nm3\text{nm}^3, or inverting, the volume per molecule is V=1.66 nm3nm^3/molecule at 1 M. For a concentration C, the volume per molecule is V=1.66/C. So, the distance between molecule in average is:

d=V13=1.18/C13d = V^{\frac{1}{3}} = 1.18/C^{\frac{1}{3}}

What we really want is a physically intuitive parameter for the
size of the protein. If we assume the protein has the simplest
shape, a sphere, we can calculate its radius. We will refer to this
as RminR_\text{min}, because it is the minimal radius of a sphere that could contain the given mass of protein:

Rmin=(3V/4π)13=0.066M13(for M in Dalton,Rminin nanometer)R_\text{min} = (3V/4\pi)^{\frac{1}{3}} = 0.066M^{\frac{1}{3}} (\text{for M in Dalton}, R_{min} \text{in nanometer})

Since the concentration of the enzyme in the cell is in the μmol\mu \text{mol} magnitude, the distance between protein molecules is in the hundred nanometer magnitude. In our project, the mass of the cns1, cns2, cns3 is around 80, 40, 100 kD respectively, so the radius of the enzyme is around 3 nm, which is in the unit nanometer magnitude. Compared to the distance between enzymes, the radius is negligible.

  • Assumption 3: In the simulation of the fusion-expression, the enzyme is treated as a point, rather than a sphere.

To calculate the distance between cns1 and cns2 in the mixed solution of the system, we make another assumption to simplify the model.

  • Assumption 4: The two enzymes are evenly distributed in the whole solution.

Thus, if we assume the concentration of cns1 to be Ccns1C_\text{cns1}, and concentration of cns2 to be Ccns2C_\text{cns2}, there are totally (Ccns1+Ccns2)×6.02×1023(C_\text{cns1}+C_\text{cns2})\times 6.02\times 10^{23} molecules in 1 L system. As the calculation above, the distance between two enzymes can be represented by:

dcns1,cns2=1.18/(Ccns1+Ccns2)13d_{cns1,cns2} = 1.18/(C_{cns1}+C_{cns2})^{\frac{1}{3}}

Compared to the seperate-expression strategy, the fusion-expression strategy enables relatively no distance between cns1 and cns2, as they are connected using the linker.


1.2 Diffuson Simulation

During the diffusion process, the concentration of the substitute of the second enzyme is related to the product of the first enzyme by the diffusion model. Since all the molecule are evenly distributed, the three-dimentional system can be simplified to one-dimentional system because we only need to consider the relation between one cns2 enzyme and the nearest cns1 enzyme.

Here, we use Fick's law to describe the diffusion process.

  • Assumption 5: A concentration gradient is the only force driving substrate motion in the system, ignoring the influence of charge interactions or the shape of the molecule.

[Eun C et al. (2014)]

According to Fick's first law of diffusion, [Rice S et al. (1985)]
J(z,t)=Dc(z,t)zJ(z,t) = -D \frac{\partial c(z,t)}{\partial z}

Where JJ is the diffusion flux, of which the dimension is amount of substance per unit area per unit time. JJ measures the amount of substance that will flow through a unit area during a unit time interval.

DD is the diffusion coefficient or diffusivity. Its dimension is area per unit time.

cc is the concentration, of which the dimension is amount of substance per unit volume.

zz is position, the dimension of which is length.

In the equation, the minus sign indicates that the flow occurs from the concentrated to the dilute region of the solution. [Macheras P et al. (2006)]

If we treat the situation in a discretized way, the Fick's first law of diffusion can be interpreted in:

J(z,t)=DΔc(z,t)Δz=Dc(z+1,t)c(z,t)(z+1)zJ(z,t) = -D \frac{\Delta c(z,t)}{\Delta z} =-D\frac{c(z+1,t)-c(z,t)}{(z+1)-z}

Therefore, the flux of a position is related to the concetration gradient of this position.

When we consider the change of the concentration as the fuction of the time, we can get:

c(z,t)t=xJ=x(Dxc(z,t))=D2c(z,t)x2\frac{\partial c(z,t)}{\partial t} = - \frac{\partial}{\partial x} J = \frac{\partial}{\partial x}(D\frac{\partial}{\partial x} c(z,t)) = D\frac{\partial ^2 c(z,t)}{\partial x^2}

This formula is called Fick's second law of diffusion. Given the initial condition that c(0,0)=cl(t)c(0,0) = c_l(t) and c(δ,0)=cr(t)c(\delta,0) = c_r(t), the differential ordinary equation can be solved as the form below. [M.H. Jacobs. (1935)]

c(z,t)=cl(t)+[cr(t)cl(t)]zδ+2πi=1cr(t)cosiπcl(t)isin(iπzδ)exp(Dδ2i2π2t)c(z,t) = c_l(t)+[c_r(t)-c_l(t)]\frac{z}{\delta}+\frac{2}{\pi}\sum_{i=1}^{\infty}{\frac{c_r(t)cosi\pi -c_l(t)}{i}}sin(i\pi \frac{z}{\delta})exp(-\frac{D}{\delta^2}i^2\pi^2 t)

In this equation, cr(t)c_r(t) is the concentration of the substitute of the second enzyme around the second enzyme, while cl(t)c_l(t) is the concetration of the substitute of the second enzyme in the context of the first enzyme. These two concentration is independent to the diffusion process, but dependent on the production and consumption of substance.

If we fix zz to δ\delta, we can get:
c(z=δ,t)=cr(t)+2πi=1cr(t)cosiπcl(t)isin(iπ)exp(Dδ2i2π2t)c(z=\delta,t) = c_r(t)+\frac{2}{\pi}\sum_{i=1}^{\infty}{\frac{c_r(t)cosi\pi -c_l(t)}{i}}sin(i\pi )exp(-\frac{D}{\delta^2}i^2\pi^2 t)

Since sin(iπ)=0sin(i\pi) = 0,
c(z=δ,t)=cr(t)c(z=\delta,t) = c_r(t)

In this case, this equation is meaningless at the position of the second enzyme, but if we consider the position near the second enzyme, which makes the sin(iπ)0sin(i\pi) \neq 0 but very small. Based on this manupulation, the equation above is meaningful. Since the equation is too complicated to fit in our case, we'd like to simplify it to this form.

c(z=δ,t)=cr(t)kdiffcr(t)cl(t)exp(δ)c(z=\delta,t) = c_r(t) - k_\text{diff}\frac{c_r(t)-c_l(t)}{exp(\delta)}

Since δ\delta is the distance between the source and the point of interest, if we do not fix δ\delta, but set it to a parameter xx, we can get:

c(x,t)=cr(t)kdiffcr(t)cl(t)exp(x)c(x,t) = c_r(t) - k_\text{diff}\frac{c_r(t)-c_l(t)}{exp(x)}

Because cr(t)c_r(t) and cl(t)c_l(t) is independent to the diffusion process, so they can be represented to crc_r and clc_l, which is consistent to the equations in enzyme kinetics. c(x,t)c(x,t) is the updated concentration of the substance near the second enzyme which is affected by diffusion. So the equation looks like:

dcrdt=kdiffclcrexp(x)\frac{dc_r}{dt} = k_\text{diff}\frac{c_l-c_r}{exp(x)}

This equation describes that the derivation of the concentration in the low concentration area is proportional to the difference between the concentrations and nonlinearly inversely proportional to the distance xx. The coefficient kdiffk_\text{diff} represents the diffusion rate of stubstances in the solution. Since we have no access to the value of this parameter, we set it to an optional value and investigate the influence on the model by this value.


1.3 Enzyme Kinetics Simulation of Fusion Expression

1.3.1 Fusion Expression Simulation with limited resource

Fusion-expression strategy enables the enzyme cns1 and cns2 to be expression linkly and closely in distance. The product of the first enzyme instantaneously functions as the substitute of the second enzyme and is converted to the end product, cordycepin.

Based on all the assumption and prerequisite derivation. we can create the follow diagram based on typical enzyme kinetics:

Based on mass action law, we can now list the equations:
d[S1]dt=k1[Ecns2][S1]+k1[Ecns2S1]\frac{d[S_1]}{dt} = -k_1[E_\text{cns2}][S_1] + k_{-1}[E_\text{cns2}S_1]

d[Ecns2]dt=k1[Ecns2][S1]+k1[Ecns2S1]+k2[Ecns2S1]k2[P1][Ecns2]\frac{d[E_\text{cns2}]}{dt} = -k_1[E_\text{cns2}][S_1] + k_{-1}[E_\text{cns2}S_1] + k_2[E_\text{cns2}S_1] - k_{-2}[P_{1}][E_\text{cns2}]

d[Ecns2S1]dt=k1[Ecns2][S1]k1[Ecns2S1]k2[Ecns2S1]+k2[P1][Ecns2]\frac{d[E_\text{cns2}S_1]}{dt} = k_1[E_\text{cns2}][S_1] - k_{-1}[E_\text{cns2}S_1] - k_2[E_\text{cns2}S_1] + k_{-2}[P_{1}][E_\text{cns2}]

d[P1]dt=k2[Ecns2S1]k2[P1][Ecns2]k3[Ecns1][P1]+k3[Ecns1S2]\frac{d[P_1]}{dt} = k_2[E_\text{cns2}S1]-k_{-2}[P_1][E_\text{cns2}]-k_3[E_\text{cns1}][P_1]+k_{-3}[E_\text{cns1}S_2]

d[Ecns1]dt=k3[Ecns1][P1]+k3[Ecns1S2]+k4[Ecns1S2]k4[P2][Ecns1]\frac{d[E_\text{cns1}]}{dt} = -k_{3}[E_\text{cns1}][P_{1}] + k_{-3}[E_{cns1}S_2] + k_4[E_{cns1}S_2] - k_{-4}[P_2][E_{cns1}]

d[Ecns1S2]dt=k3[Ecns1][P1]k3[Ecns1S2]k4[Ecns1S2]+k4[P2][Ecns1]\frac{d[E_\text{cns1}S_2]}{dt} = k_3[E_\text{cns1}][P_{1}] - k_{-3}[E_\text{cns1}S_2] - k_4[E_\text{cns1}S_2] + k_{-4}[P_2][E_\text{cns1}]

d[P2]dt=k4[Ecns1S2]k4[P2][Ecns1]\frac{d[P_2]}{dt} = k_4[E_\text{cns1}S_2] - k_{-4}[P_2][E_\text{cns1}]

Based on the derivation of Michaelis-Menten equation, from the equation of d[Ecns2]dt\frac{d[E_\text{cns2}]}{dt} and d[Ecns2S1]dt\frac{d[E_\text{cns2}S_1]}{dt}, we can find that d[Ecns2]dt+d[Ecns2S1]dt=0\frac{d[E_\text{cns2}]}{dt}+\frac{d[E_\text{cns2}S_1]}{dt} = 0, which means that the value of [Ecns2]+[Ecns2S1][E_\text{cns2}]+[E_\text{cns2}S_1] is a constant. Here, we defind this constant as Eini2E_\text{ini2}, which represents the constant of the initial concentration of cns2 enzyme. Likewise, considering to cns1 enzyme, value of [Ecns1]+[Ecns1S2][E_\text{cns1}]+[E_\text{cns1}S_2] is a constant, which is defined as Eini1E_\text{ini1}.

Therefore, the equations convert to:

d[S1]dt=k1[Ecns2][S1]+k1(Eini2[Ecns2])\frac{d[S_1]}{dt} = -k_1[E_\text{cns2}][S_1] + k_{-1}(E_\text{ini2}-[E_\text{cns2}])

d[Ecns2]dt=k1[Ecns2][S1]+(k1+k2)(Eini2[Ecns2])k2[P1][Ecns2]\frac{d[E_\text{cns2}]}{dt} = -k_1[E_\text{cns2}][S_1] + (k_{-1}+k_2)(E_\text{ini2}-[E_\text{cns2}]) - k_{-2}[P_{1}][E_\text{cns2}]

d[P1]dt=k2(Eini2[Ecns2])k2[P1][Ecns2]k3[Ecns1][P1]+k3(Eini1[Ecns1])\frac{d[P_1]}{dt} = k_2(E_\text{ini2}-[E_\text{cns2}])-k_{-2}[P_1][E_\text{cns2}]-k_3[E_\text{cns1}][P_1]+k_{-3}(E_\text{ini1}-[E_\text{cns1}])

d[Ecns1]dt=k3[Ecns1][P1]+(k3+k4)(Eini1[Ecns1])k4[P2][Ecns1]\frac{d[E_\text{cns1}]}{dt} = -k_{3}[E_\text{cns1}][P_{1}] + (k_{-3}+k_4)(E_\text{ini1}-[E_\text{cns1}]) - k_{-4}[P_2][E_\text{cns1}]

d[P2]dt=k4(Eini1[Ecns1])k4[P2][Ecns1]\frac{d[P_2]}{dt} = k_4(E_\text{ini1}-[E_\text{cns1}]) - k_{-4}[P_2][E_\text{cns1}]

MATLAB Code:

function [f] = dXdT(t, x)

k1F = 1;
k1R = 0.1;
k2F = 2;
k2R = 0.1;
k3F = 1;
k3R = 0.1;
k4F = 1;
k4R = 0.1;
E01 = 1;
E02 = 1;

s1=x(1);
e1=x(2);
p1=x(3);
e2=x(4);
p2=x(5);

%ds1dt = 0;
ds1dt = -k1F*e1*s1 + k1R*(E01-e1);
de1dt = -k1F*e1*s1 + (k1R+k2F)*(E01-e1) - k2R*p1*e1;
dp1dt = k2F*(E01-e1) - k2R*p1*e1 - k3F*e2*p1 + k3R*(E02-e2);
de2dt = -k3F*e2*p1 + (k3R+k4F)*(E02-e2) - k4R*p2*e2;
dp2dt = k4F*(E02-e2) - k4R*p2*e2;
f = [ds1dt; de1dt; dp1dt; de2dt; dp2dt];

end

tspan = 0:0.001:10;
initial = [1,1,0,1,0];

[t,x] = ode45( @dXdT, tspan, initial);

Solving the ordinary differential equation system will give us the concentration deviation of the substitue, intermediate product and end product, which is 3'-AMP, 2'-C'-3'-dA respectly. In this simulation, the resource is limitied, which means that the substitutes will deplete without any supplement.

In the result, the final concentration of the end product, cordycepin, under this set of paramter, is around 0.89. The value of the cordycepin concentration is meaningless, since all the parameters are set optionally. However, it provides a model which can be used in the comparison of the separate-expression strategy with the same set of parameter.

1.3.2 Fusion Expression Simulation with abundant resource

The substitute of cns2 and 3'-AMP, while the substitue of cns3-HisG domain is adenosine, which both are the basic substance and liable to produce in a cell. If we provide abundant resource, the concentration of the substitutes is considered to be constant.

Therefore, we need to alter the equations of the concentration of substutute. We set the concentration of S1S_1 (3'AMP) to be constant, and let other equations unchanged.

d[S1]dt=0\frac{d[S_1]}{dt} = 0

The result is below:

From the figure above, we can see that the concentration of the intermediate product 2'-C-3'-dA rises slowly while the end product cordycepin rises much more quickly.


1.4 Enzyme Kinetics Simulation of Separate Expression with Diffusion

Considering the separate-expression strategy, we add the diffusion process into the diagram and we can create the following diagram:

1.4.1 Separate Expression Simulation with limited resource

Likewise, we can now list the ordinary differential equations:

d[S1]dt=k1[Ecns2][S1]+k1[Ecns2S1]\frac{d[S_1]}{dt} = -k_1[E_\text{cns2}][S_1] + k_{-1}[E_\text{cns2}S_1]

d[Ecns2]dt=k1[Ecns2][S1]+k1[Ecns2S1]+k2[Ecns2S1]k2[P1l][Ecns2]\frac{d[E_\text{cns2}]}{dt} = -k_1[E_\text{cns2}][S_1] + k_{-1}[E_\text{cns2}S_1] + k_2[E_\text{cns2}S_1] - k_{-2}[P_{1l}][E_\text{cns2}]

d[Ecns2S1]dt=k1[Ecns2][S1]+k1[Ecns2S1]k2[Ecns2S1]+k2[P1l][Ecns2]\frac{d[E_\text{cns2}S_1]}{dt} = k_1[E_\text{cns2}][S_1] + - k_{-1}[E_\text{cns2}S_1] - k_2[E_\text{cns2}S_1] + k_{-2}[P_{1l}][E_\text{cns2}]

d[P1l]dt=k2[Ecns2S1]k2[P1l][Ecns2]Kdiff[P1l][P1r]exp(dcns1,cns2)\frac{d[P_{1l}]}{dt} = k_2[E_\text{cns2}S_1] - k_{-2}[P_{1l}][E_\text{cns2}] - K_\text{diff}\frac{[P_{1l}]-[P_{1r}]}{exp(d_\text{cns1,cns2})}

d[P1r]dt=Kdiff[P1l][P1r]exp(dcns1,cns2)k3[Ecns1][P1r]+k3[Ecns1S2]\frac{d[P_{1r}]}{dt} = K_\text{diff}\frac{[P_{1l}]-[P_{1r}]}{exp(d_\text{cns1,cns2})} - k_3[E_\text{cns1}][P_{1r}] + k_{-3}[E_\text{cns1}S_2]

d[Ecns1]dt=k3[Ecns1][P1r]+k3[Ecns1S2]+k4[Ecns1S2]k4[P2][Ecns1]\frac{d[E_\text{cns1}]}{dt} = -k_{3}[E_\text{cns1}][P_{1r}] + k_{-3}[E_\text{cns1}S_2] + k_4[E_\text{cns1}S_2] - k_{-4}[P_2][E_\text{cns1}]

d[Ecns1S2]dt=k3[Ecns1][P1r]k3[Ecns1S2]k4[Ecns1S2]+k4[P2][Ecns1]\frac{d[E_\text{cns1}S_2]}{dt} = k_3[E_\text{cns1}][P_{1r}] - k_{-3}[E_\text{cns1}S_2] - k_4[E_\text{cns1}S_2] + k_{-4}[P_2][E_\text{cns1}]

d[P2]dt=k4[Ecns1S2]k4[p2][Ecns1]\frac{d[P_2]}{dt} = k_4[E_\text{cns1}S_2] - k_{-4}[p_2][E_\text{cns1}]

From the equation of d[Ecns2]dt\frac{d[E_\text{cns2}]}{dt} and d[Ecns2S1]dt\frac{d[E_\text{cns2}S_1]}{dt}, we can find that d[Ecns2]dt+d[Ecns2S1]dt=0\frac{d[E_\text{cns2}]}{dt}+\frac{d[E_\text{cns2}S_1]}{dt} = 0, which means that the value of [Ecns2]+[Ecns2S1][E_\text{cns2}]+[E_\text{cns2}S_1] is a constant. Here, we defind this constant as Eini2E_{ini2}, which represents the constant of the initial concentration of cns2 enzyme. Likewise, considering to cns1 enzyme, value of [Ecns1]+[Ecns1S2][E_\text{cns1}]+[E_\text{cns1}S_2] is a constant, which is defined as Eini1E_\text{ini1}.

Therefore, the equations convert to:

d[S1]dt=k1[Ecns2][S1]+k1(Eini2[Ecns2])\frac{d[S_1]}{dt} = -k_1[E_\text{cns2}][S_1] + k_{-1}(E_\text{ini2}-[E_\text{cns2}])

d[Ecns2]dt=k1[Ecns2][S1]+(k1+k2)(Eini2[Ecns2])k2[P1l][Ecns2]\frac{d[E_\text{cns2}]}{dt} = -k_1[E_\text{cns2}][S_1] + (k_{-1}+k_2)(E_\text{ini2}-[E_\text{cns2}]) - k_{-2}[P_{1l}][E_\text{cns2}]

d[P1l]dt=k2(Eini2[Ecns2])k2[P1l][Ecns2]kdiff[P1l][P1r]exp(dcns1,cns2)\frac{d[P_{1l}]}{dt} = k_2(E_\text{ini2}-[E_\text{cns2}]) - k_{-2}[P_{1l}][E_\text{cns2}] - k_\text{diff}\frac{[P_{1l}]-[P_{1r}]}{exp(d_\text{cns1,cns2})}

d[P1r]dt=kdiff[P1l][P1r]exp(dcns1,cns2)k3[Ecns1][P1r]+k3(Eini1[Ecns1])\frac{d[P_{1r}]}{dt} = k_\text{diff}\frac{[P_{1l}]-[P_{1r}]}{exp(d_\text{cns1,cns2})} - k_3[E_\text{cns1}][P_{1r}] + k_{-3}(E_\text{ini1}-[E_\text{cns1}])

d[Ecns1]dt=k3[Ecns1][P1r]+(k3+k4)(Eini1[Ecns1])k4[P2][Ecns1]\frac{d[E_\text{cns1}]}{dt} = -k_{3}[E_\text{cns1}][P_{1r}] + (k_{-3}+k_4)(E_\text{ini1}-[E_\text{cns1}]) - k_{-4}[P_2][E_\text{cns1}]

d[P2]dt=k4(Eini1[Ecns1])k4[P2][Ecns1]\frac{d[P_2]}{dt} = k_4(E_{ini1}-[E_{cns1}]) - k_{-4}[P_2][E_{cns1}]

MATLAB Code:

function [f] = dXdT(t, x)

k1F = 1
k1R = 0.1
k2F = 2
k2R = 0.1
k3F = 1
k3R = 0.1
k4F = 1
k4R = 0.1
E1 = 1
E2 = 1

s1=x(1);
e1=x(2);
pl=x(3);
pr=x(4);
e2=x(5);
p2=x(6);

kdiff = 10;
d = 1.118/((e1+e2)**(1/3));

ds1dt = -k1F*e1*s1 + k1R*(E1-e1);
de1dt = -k1F*e1*s1 + (k1R+k2F)*(E1-e1) - k2R*pl*e1;
dpldt = k2F*(E1-e1) - k2R*pl*e1 - (kdiff)*(pl-pr)/exp(d);
dprdt = (kdiff)*(pl-pr)/exp(d) - k3F*e2*pr + k3R*(E2-e2);
de2dt = -k3F*e2*pr + (k3R+k4F)*(E2-e2) - k4R*p2*e2;
dp2dt = k4F*(E2-e2) - k4R*p2*e2;
f = [ds1dt; de1dt; dpldt; dprdt; de2dt; dp2dt];

end

tspan = 0:0.001:40;
initial = [1,1,0,0,1,0];

[t,x] = ode45( @dXdT, tspan, initial);

The concentration of the substitute, intermediate product and end product is plotted below:

Furthermore, we would like to investigate the influence of the diffusion efficiency kdiffk_\text{diff} on the concentration of the intermediate product and the end product.

In the figure, p1lp_{1l} represents the concentration at the location of the first enzyme cn2 (before diffusion), while plrp_{lr} represents the concentration at the location of the second enzyme cns1 (after diffusion).

The concentration deviation of the end product cordycepin in relation to the increase of kdiffk_\text{diff} suggests that the production efficiency raises when the diffusion rate increases, while the final conentration of the end product is the same value.

1.4.2 Separate Expression Simulation with abundant resource

Just as the manupulation of the fusion expression model, we set the concentration of the substitute S1S_1 to a constant. The result is below:


1.4 Comparison between fusion expression and separate expression


2 Systematic Enzyme Kinetics

In fusion expression simulation, we discussed the catalytic process of cns1 and cns2, leaving alone the function of cns3 and its product pentostatin, which is an inhibitor to the adenosine deaminase (ADA). When we add the role of cns3-HisG domain and ADA, the systematic enzyme kinetics is represented by the following diagram.


2.1 Systematic simulation with limited resource

As the cases in the above sections, we can derive the kinetic equations from the diagram.

d[S1]dt=k1[Ecns2][S1]+k1[Ecns2S1]\frac{d[S_1]}{dt} = -k_1[E_\text{cns2}][S_1] + k_{-1}[E_\text{cns2}S_1]

d[Ecns2]dt=k1[Ecns2][S1]+k1[Ecns2S1]+k2[Ecns2S1]k2[P1][Ecns2]\frac{d[E_\text{cns2}]}{dt} = -k_1[E_\text{cns2}][S_1] + k_{-1}[E_\text{cns2}S_1] + k_2[E_\text{cns2}S_1] - k_{-2}[P_{1}][E_\text{cns2}]

d[Ecns2S1]dt=k1[Ecns2][S1]k1[Ecns2S1]k2[Ecns2S1]+k2[P1][Ecns2]\frac{d[E_\text{cns2}S_1]}{dt} = k_1[E_\text{cns2}][S_1] - k_{-1}[E_\text{cns2}S_1] - k_2[E_\text{cns2}S_1] + k_{-2}[P_{1}][E_\text{cns2}]

d[P1]dt=k2[Ecns2S1]k2[P1][Ecns2]k3[Ecns1][P1]+k3[Ecns1S2]\frac{d[P_1]}{dt} = k_2[E_\text{cns2}S1]-k_{-2}[P_1][E_\text{cns2}]-k_3[E_\text{cns1}][P_1]+k_{-3}[E_\text{cns1}S_2]

d[Ecns1]dt=k3[Ecns1][P1]+k3[Ecns1S2]+k4[Ecns1S2]k4[P2][Ecns1]\frac{d[E_\text{cns1}]}{dt} = -k_{3}[E_\text{cns1}][P_{1}] + k_{-3}[E_\text{cns1}S_2] + k_4[E_\text{cns1}S_2] - k_{-4}[P_2][E_\text{cns1}]

d[Ecns1S2]dt=k3[Ecns1][P1]k3[Ecns1S2]k4[Ecns1S2]+k4[P2][Ecns1]\frac{d[E_\text{cns1}S_2]}{dt} = k_3[E_\text{cns1}][P_{1}] - k_{-3}[E_\text{cns1}S_2] - k_4[E_\text{cns1}S_2] + k_{-4}[P_2][E_\text{cns1}]

d[P2]dt=k4[Ecns1S2]k4[P2][Ecns1]k5[EADA][P2]+k5[EADAS3]\frac{d[P_2]}{dt} = k_4[E_\text{cns1}S_2] - k_{-4}[P_2][E_\text{cns1}] - k_5[E_\text{ADA}][P_2] + k_{-5}[E_\text{ADA}S_3]

d[EADA]dt=k5[EADA][P2]+k5[EADAS3]+k6[EADAS3]k6[P3][EADA]k9[I][EADA]+k9[EADAI]\frac{d[E_\text{ADA}]}{dt} = -k_5[E_\text{ADA}][P_2] + k_{-5}[E_\text{ADA}S_3] + k_6[E_\text{ADA}S_3] - k_{-6}[P_3][E_\text{ADA}] - k_9[I][E_\text{ADA}] + k_{-9}[E_\text{ADA}I]

d[EADAS3]dt=K5[EADA][P2]k5[EADAS3]k6[EADAS3]+k6[P3][EADA]\frac{d[E_\text{ADA}S_3]}{dt} = K_5[E_\text{ADA}][P_2] - k_{-5}[E_\text{ADA}S_3] - k_6[E_\text{ADA}S_3] + k_{-6}[P_3][E_\text{ADA}]

d[P3]dt=k6[EADAS3]k6[P3][EADA]\frac{d[P_3]}{dt} = k_6[E_\text{ADA}S_3] - k_{-6}[P_3][E_\text{ADA}]

d[S4]dt=k7[EHisG][S4]+k7[EHisGS4]\frac{d[S_4]}{dt} = -k_7[E_\text{HisG}][S_4] + k_{-7}[E_\text{HisG}S_4]

d[EHisG]dt=k7[EHisG][S4]+k7[EHisGS4]+k8[EHisGS4]k8[EHisG][S4]\frac{d[E_\text{HisG}]}{dt} = -k_7[E_\text{HisG}][S_4] + k_{-7}[E_\text{HisG}S_4] + k_8[E_\text{HisG}S_4] - k_{-8}[E_\text{HisG}][S_4]

d[EHisGS4]dt=k7[EHisG][S4]k7[EHisGS4]k8[EHisGS4]+k8[EHisG][I]\frac{d[E_\text{HisG}S_4]}{dt} = k_7[E_\text{HisG}][S_4] - k_{-7}[E_\text{HisG}S_4] - k_8[E_\text{HisG}S_4] + k_{-8}[E_\text{HisG}][I]

d[I]dt=k8[EHisGS4]k8[EHisG][S4]k9[I][EADA]+k9[EADAI]\frac{d[I]}{dt} = k_8[E_\text{HisG}S_4] - k_{-8}[E_\text{HisG}][S_4] - k_9[I][E_\text{ADA}] + k_{-9}[E_\text{ADA}I]

d[EADAI]dt=k9[I][EADA]k9[EADAI]\frac{d[E_\text{ADA}I]}{dt} = k_9[I][E_\text{ADA}] - k_{-9}[E_\text{ADA}I]

Likewise, we can simplify this 14-equation system to 10-equation system because the initial concentrations of the enzymes are fixed. In the enzymes, EADAE_\text{ADA} is a little different, since the case that [EADA]+[EADAS3]+[EI]=Eini3[E_\text{ADA}] + [E_\text{ADA}S_3] + [EI] = E_\text{ini3}, so [EADAS3][E_\text{ADA}S_3] can be represented by Eini3[EADA][EI]E_\text{ini3} - [E_\text{ADA}] - [EI].

d[S1]dt=k1[Ecns2][S1]+k1(Eini2[Ecns2])\frac{d[S_1]}{dt} = -k_1[E_\text{cns2}][S_1] + k_{-1}(E_\text{ini2}-[E_\text{cns2}])

d[Ecns2]dt=k1[Ecns2][S1]+(k1+k2)(Eini2[Ecns2])k2[P1][Ecns2]\frac{d[E_\text{cns2}]}{dt} = -k_1[E_\text{cns2}][S_1] + (k_{-1}+k_2)(E_\text{ini2}-[E_\text{cns2}]) - k_{-2}[P_{1}][E_\text{cns2}]

d[P1]dt=k2(Eini2[Ecns2])k2[P1][Ecns2]k3[Ecns1][P1]+k3(Eini1[Ecns1])\frac{d[P_1]}{dt} = k_2(E_\text{ini2}-[E_\text{cns2}])-k_{-2}[P_1][E_\text{cns2}]-k_3[E_\text{cns1}][P_1]+k_{-3}(E_\text{ini1}-[E_\text{cns1}])

d[Ecns1]dt=k3[Ecns1][P1]+(k3+k4)(Eini1[Ecns1])k4[P2][Ecns1]\frac{d[E_\text{cns1}]}{dt} = -k_{3}[E_\text{cns1}][P_{1}] + (k_{-3}+k_4)(E_\text{ini1}-[E_\text{cns1}]) - k_{-4}[P_2][E_\text{cns1}]

d[P2]dt=k4(Eini1[Ecns1])k4[P2][Ecns1]k5[EADA][P2]+k5(Eini3[EADA][EI])\frac{d[P_2]}{dt} = k_4(E_\text{ini1}-[E_\text{cns1}]) - k_{-4}[P_2][E_\text{cns1}] - k_5[E_\text{ADA}][P_2] + k_{-5}(E_\text{ini3} - [E_\text{ADA}] - [EI])

d[EADA]dt=k5[EADA][P2]+(k5+k6)(Eini3[EADA][EI])k6[P3][EADA]k9[I][EADA]+k9[EADAI]\frac{d[E_\text{ADA}]}{dt} = -k_5[E_\text{ADA}][P_2] + (k_{-5}+k_6)(E_\text{ini3} - [E_\text{ADA}] - [EI]) - k_{-6}[P_3][E_\text{ADA}] - k_9[I][E_\text{ADA}] + k_{-9}[E_\text{ADA}I]

d[P3]dt=k6(Eini3[EADA][EI])k6[P3][EADA]\frac{d[P_3]}{dt} = k_6(E_\text{ini3} - [E_\text{ADA}] - [EI]) - k_{-6}[P_3][E_\text{ADA}]

d[S4]dt=k7[EHisG][S4]+k7(EiniH[EHisG])\frac{d[S_4]}{dt} = -k_7[E_\text{HisG}][S_4] + k_{-7}(E_\text{iniH}-[E_\text{HisG}])

d[EHisG]dt=k7[EHisG][S4]+(k7+k8)(EiniH[EHisG])k8[EHisG][S4]\frac{d[E_\text{HisG}]}{dt} = -k_7[E_\text{HisG}][S_4] + (k_{-7}+k_8)(E_\text{iniH}-[E_\text{HisG}]) - k_{-8}[E_\text{HisG}][S_4]

d[I]dt=k8(EiniH