Introduction
1. What is our model
We use mathematical modeling to determine parameters and narrow our range of options. Specifically, three models were adopted for our project: The binding of aptamer to target sites,Determination of inhibitor and the appropriate time for growth of hydrogels.In the first model,we establish a differential equation based on the molecular mechanism in cell micro-environment and solving them.
In the second model, we interpreted the issue as an optimization problem and used thermodynamics theory to quantify objective function and constraint function.
In the third model, we use post feedback neural network to predict the radius of colloidal particles in given time. By training the network with experimental data, we can get an approximate mapping from time to radius of colloidal particles.
2. Why modeling?
The main purpose of our modeling is to determine or narrow some options for experiments. For example, the determination of inhibitor, the binding of aptamer to target sites, and the appropriate time for growth of hydrogels need to be modeled. With the preliminary theoretical reference, the members in the experimental group can do the experiment effectively.3. What help does the experiment provide for the model?
The experiments provide us with necessary data, such as MFI in the binding of aptamer to target sites and the data of micelle radius in the appropriate time for growth of hydrogels. The former helps us determine the parameters and verify the model, and the latter helps us train the network.4. How does the model guide the experiment?
The process of solving the model parameters guides the experiment. For example, we need to determine the reaction rate constant in the binding of aptamer to target sites. Therefore, we did the experiment of target binding to aptamer by fluorescence labeling on aptamer. The residual concentration of the aptamers in the reaction equilibrium is obtained by fluorescence intensity, and the rate constants can be solved by it.The binding of aptamer to target sites
In our experiments, an important step is to combine the aptamers with the cell targets, and subsequently produce hydrogels. However, Cells will not be fully reacted if they react for too short time and be damaged if they react for too long time, which both affect the formation of hydrogels. Therefore, it is very important to find a suitable combination time.
The binding reaction between aptamer and target spots is the first-order kinetics:
There is a chemical positive reaction rate equation: $$\begin{equation} v_1=kc_a(t)c_t(t) \end{equation} $$
In the experiment, we added the concentration of target point, \(\begin{equation} c_t(0)=8.3*10^{-10}mol/L - 1.6*10^{-8}mol/L \end{equation}\), which had little effect on M, so we finally confirm \(\begin{equation} c_t(0)=10^{-8}mol/L \end{equation} \), \(\begin{equation} c_a(0)=2*10^{-7}mol/L \end{equation}\). According to the experimental data, when the reaction is in equilibrium, \(\begin{equation} c_a=115*10^{-9}mol/L \end{equation}\).
When the reaction is in equilibrium,we can consider the time as infinity.
Conclusion
We set up an ordinary differential equation with parameters through the reaction rate equation, and solved the parameters through the experimental data, and determined a suitable reaction time of 1 hour for the experiment. This model can be applied to other first-order kinetics, the parameters can be determined by similar methods, then the theoretical value of the reaction can be obtained.Determination of inhibitor
The reaction of aptamer binding to target is also the reaction of primer exposure. The exposure of Primer is directly related to the formation of hydrogel, so we need a suitable cover length to make the Primer stable enough before aptamer binds to the target, and to make the Primer smoothly exposed when aptamer binds to the target. In this way, we can compute a feasible region where we can optimize.
3.1 Objective function
Aptamer is a single strand of DNA that is easy to have the phenomenon of self-rotation. In order to suppress the self-rotation of DNA, we need to maximize the length of the masking sequence in the feasible domain.2, The establishment of feasible domain
The assumption of nearest neighbor method [1] :
The initial concentration of two single nucleic acid chains in the hybridization reaction is the same;
∙∙The nucleic acid hybridization reaction satisfies the distate model, i.e., the dechain and the formation of the double strand are always dynamically balanced;
∙∙The hybrid nucleic acid double strand is a single double helix structure.
3.2 The first of the constraint function
Based on the assumption, we can compute the model in this way. This reaction can be divided into two different reaction. ①The first reaction is between aptamer and target protein: $$\begin{equation} AT \leftrightarrow A+T \end{equation} $$ The equilibrium constant of the reaction is:$$\begin{equation} K_d1=\frac{c_ac_t}{c_at} \end{equation} $$ ②Another reaction is between aptamer and inhibitor $$\begin{equation} AI \leftrightarrow A+I \end{equation} $$ and the equilibrium constant of the reaction is: $$\begin{equation} K_d2=\frac{c_ac_I}{c_at} \end{equation} $$ The whole reaction is: $$\begin{equation} AC+P \leftrightarrow AP+C \end{equation} $$ To describe the direction and stability of the reaction, we introduce the thermodynamic concept of free energy. $$\begin{equation} \ ΔG_c=-RTln(K) \end{equation} $$
3.3 The second constraint function
According to the thermodynamic model, in order to expose the inhibitor, it’s necessary that the Gibbs energy of aptamer is greater than that of inhibitor. The compute of the energy is another constraint condition.$$\begin{equation} \sum_{i=1}^{k-1} ΔG_i\geq\ ΔG_a \end{equation} $$
3.4 Solution
The results show that the inhibitor length of aptamer is 5-10. Considering the multi-objective optimization, and trying to prevent DNA self-folding, we found that the suitable number of inhibitor are 8 and 9 by experiment, so we finally chose 9 bases as the inhibitor of aptamer part, and the inhibitor of linker part is not limited, so we took 6 bases. Then we determined the masking sequence as TCAGGC-CACTACAGABecause the second constraint function needs to use the oligo analyzer,we just show a built-in code to calculate the first constraint. Just enter the sequence in uppercase.
3.5 Conclusion
The assumption of nearest neighbor method provides the theoretical basis for the quantification of the constraint function. It is the establishment of constraint functions that allow us to transform it into an optimization problem. Because of this model’s small computation, it’s easy to confirm the best length by exhaustive method.The basic idea is from another IGEM team in Xiamen university. Based on the existing results, we add a restriction function to prevent DNA single strand from self folding and find out the feasible region of inhibitor. Finally, we determined the inhibitor by experiments as follows: TCAGGC-CACTACAGA.
The appropriate time for growth of hydrogels
Hydrogel is a large number of DNA single chains generated by DNA rolling ring amplification. Because of its good structural characteristics, we use it as prison clothes. Because the whole environment is not suitable for cell survival, too long time will lead to cell death, and too short time will lead to failure to form, it is important to determine the forming time of hydrogel. First, we consider establishing differential equations to solve this problem, but the rate of DNA rolling ring amplification is difficult to express (There are many effects that are difficult to quantify, such as DNA degradation), and in order to describe this process theoretically, we still describe the derivation of differential equations. Finally, we decided to adopt the BP neural network to approximate the mapping from time to the radius of the hydrogel to solve it.
4.1 Establishment and solution of network
We normalize the data and use two hidden layers in network, the number of neurons in each layer is 3, the active function from input layer to hidden layer is logsig, and the active function from hidden layer to output layer is tansig. The specific structure is as follows: After training with experimental data (Table 3), the mapping of the network is:4.2 Derivation of differential equation
Notion:A and B are linear structures of complementary ring chains
\(n_1\): The number of whole circle sequence in DNA chain A
\(n_2\): The number of whole circle sequence in DNA chain B
\(c_1\): The number of DNA chain A
\(c_2\): The number of DNA chain B
\(V(n_1,n_2,t)\): The rate of DNA rolling ring amplification
Considering the simultaneous reaction of multiple chains, the growth rate of the number of rings of A(B) is directly proportional to the number of base points, the ratio is V, where V is the reaction rate of a single chain, which is related to N1, N2 and time. The basic point is the starting position of the chain, and the number of base points of A and B is n_2-c_2+1, n_1-c_1.Similarly, the growth rate of A (B) chain number is directly proportional to B (A) chain number, and the ratio is V[2][3].
So we can get:
4.3 Conclusion
We use experimental data and BP neural network to fit the mapping from time to hydrogel particle diameter. The suitable time for rolling ring amplification is 8 hours-8 hours and 20 minutes. After that, we set up the differential equations. Theoretically, knowing the change of the rate V about N1, N2 and T, we can numerically solve the change of the number of bases with time, but we did not find this relationship. We hope that our later research can make a breakthrough in this part.Reference
[1] JOHN SANTALUCIA, JR*, A unified view of polymer, dumbbell, and oligonucleotide DNA nearest-neighbor thermodynamics, Proc Natl Acad Sci Anal Chem, 1998, 95, 1460-1465.
[2] Gevertz, J. L., Dunn, S. M., & Roth, C. M. (2005). Mathematical model of real-time PCR kinetics. Biotechnology and Bioengineering, 92(3), 346–355. doi:10.1002/bit.20617
[3] Beals T P , Smith J H , Nietupski R M , et al. A mechanism for ramified rolling circle amplification[J]. BMC Molecular Biology, 2010, 11(1):94.