Team:SYSU-Medicine/Future

<!doctype html> SYSU-Medicine










Modeling Part 4




Future*

In the future, our method may be of clinical application. As we know, an excellent medical method is hoped to solve the problem that can not be solved nowadays in the future. However, clinical issue is very long. In addition, during the issue, some patients may exit the experiment or try other methods when he/she notice the low validity of this regime. Thus we want to think a little farther.

In this section, we will give the predicted survival situation of patients who receive our method. And we also try to provide the method of Dynamic Treatment Regime. All models in this section are based on Machine Learning. Because in the research of human body, the situation become so complex that we could not just use the main paramter to predict what will happen. And in many situation, fixed models can not fit the data well. So the method of Machine Learning is very significant.

Survival Time Prediction And Effect Prediction

Regression

In medical research, survival time and effect size are significant in judging a method successful or unsuccessful. In the experiment on mouse, we have the survival time and effect size of mice. In the clinical research, we have the survival time and effect size of patients. How does these two survival time relate with each other? There is no doubt that mouse have similarity with human, because we often use mouse for experiment. (it’s not only easy to experiment but cheap) Thus, how can we measure this similarity? And can we apply this similarity to our prediction?

Actually, a lot of mice theraputic experiments fail in the clinical trials on human. Thus, a research of cross-species difference by Rachelly Normand and his team provide us a statistical model FIT to measure this similarity.

We collect the mouse-to-man data on the website provided by Rachelly Normand and his team. After cleaning the data, we obtain 14757 samples. After trying a lot of time, we find that the BP neural network in regression is better than other traditional methods. So we use the BP neural network regression.

In our data analysis, we find that the number of neuron can not greatly influence the fitting. We try the neural network fitting from 10 neurons to 30 neurons, and get a conclusion that the MSE is approximately 0.1 and the R2 is approximately 0.8, which shows that our model fit the data well.

We finally decide to use 10 neurons, because it can decrease the cost of internal storage without a lost of accuracy. We also choose the Levenberg-Marquardt Algorithm for our training, and choose 70% data for training, 15% data for validation, and 15% data for testing. Finally, we get the fitting figure and regression figure below.

Fitting
Regression

In the experiment of M1, we got a conclusion that 29 lines showed a more than 30% decrease in viability 48h after exposure. In those lines, our prediction on human body is about 60%. That is to say, our method got a positive result based on the data.

Whether the method would be a success?

Survival time is hard to collect because it always takes a long time. Can we just predict the clinical research a success or failure based on the survival time of mouse? Because of limited time, we haven’t recorded enough data for survival time. However, we tried to provide a probable success or failure label for different kinds of cancer.

And we consider that some survival data may have deletion in the actual research, We simulate doctors’ judging rules. Doctors’ judgements are often subjective, which means, in our simulation, the threshold is often random around a fixed value. Thus, we choose the mean of the distribution to be that fixed value. And we use the random values created in the distribution N(μ, σ2) as the thresholds in every judgement. When the actual value is bigger than the threshold, we consider it successful. Otherwise, we judge it as unsuccessful. Then we use our Classification Model to make judgements according to every data from mouse. Next we could judge whether our method is successful or not when we get a new survival time difference in the control group and experiment group. The ROC is shown below. The total set AUC is approximately 0.7.

ROC in Classification

DTR*

To optimize the sequence of specific treatments for specific patients is a significant question of precision medicine. DTR (Dynamic Treatment Regime) is such a method that can explore and examine the best strategies. In this part, we will give a brief introduction of DTR model based on our method.

To be brief, we choose 2 stages of treatment and 2 treatments at each stage (for example, conservative treatment(A =  − 1) and multiple treatment of M1 and actinomycin D (A=1)). We use Oi, i = 1, 2 to describe the observation, which can be -1 or 1. Oi = 1 can mean the growth of the tumor, while -1 means the shrink of the tumor.

The model is described as follows:
$$\begin{array}{lcl} 1.\;O_i \in \{-1,1\},A_i\in\{-1,1\},for\;i=1,2;\\ 2.\;P(A_1=1)=P(A_1=-1)=0.5,P(A_2=1)=P(A_2=-1)=0.5;\\ 3.\;O_1\sim Bernoulli(0.5);\;O_2|O_1,A_1\sim Bernoilli(expit(\delta O_1+\delta A_1));\\ 4.\;Stage\;1\;outcome\;Y_1 \equiv 0;\; \\ 5.\;Stage\;2\;outcome\;Y_2=\gamma_1+\gamma_2 O_1+\gamma_3 A_1+\gamma_4 O_1 A_1+\gamma_5 A_2+\gamma_6 O_2 A_2+\gamma_7 A_1 A_2 +\varepsilon . \end{array}$$
In the model above, $expit(x)=\frac{e^x}{1+e^x}$. And ε ∼ N(0, 1). The parameters γi, i = 1 ∼ 7 and δi, i = 1, 2 can be estimated by the method of nonfitting or neural network. Thus, we can estimate the best combination of regimes as follows.(for simpleness we choose 9 parameters for our simulation, and choose the best regime based on the observation)

Simulation with γ = (0, 0,  − 0.5, 0, 0.25, 0.5, 0.5)
(O1, O2) (A1, A2) Outcome Y2 Remark
( − 1,  − 1) ( − 1,  − 1) 1.25 Best
( − 1,  − 1) ( − 1, 1) -0.25
( − 1,  − 1) (1,  − 1) -0.75
( − 1,  − 1) (1, 1) -0.25
( − 1, 1) ( − 1,  − 1) 0.25
( − 1, 1) ( − 1, 1) 0.75 Best
( − 1, 1) (1,  − 1) -1.75
( − 1, 1) (1, 1) 0.75
(1,  − 1) ( − 1,  − 1) 1.25 Best
(1,  − 1) ( − 1, 1) -0.25
(1,  − 1) (1,  − 1) -0.75
(1,  − 1) (1, 1) -0.25
(1, 1) ( − 1,  − 1) 0.25
(1, 1) ( − 1, 1) 0.75 Best
(1, 1) (1,  − 1) -1.75
(1, 1) (1, 1) 0.75

And we can choose the best regime based on the current observation. In this case, we use the Q-function:
$$\left \{ \begin{array}{lcl} Q_2^{opt}(H_2,A_2;\beta_2,\psi_2)=H_{20}^T\beta_2+H_{21}^T\psi_2 A_2\\ Q_1^{opt}(H_1,A_1;\beta_1,\psi_1)=H_{10}^T\beta_1+H_{1}^T\psi_1 A_1 \end{array} \right .$$

In Q-function above, parameters are given by
$$\left \{ \begin{array}{lcl} H_{20}=(1,O_1,A_1,O_1A_1),\\ H_{21}=(1,O_2,A_1),\\ H_{10}=(1,O_1)^T,\\ H_{11}=(1,O_1)^T,\\ \psi_1=\gamma_3+q_1|f_1|-q_2|f_2|+q_3|f_3|-q_4|f_4|,\\ \psi_2=\gamma_4+q_1'|f_1|-q_2'|f_2|-q_3'|f_3|+q_4'|f_4|,\\ \end{array} \right.$$
where $q_1'=q_3'=\frac{1}{4}(expit(\delta_1+\delta_2)-expit(-\delta_1+\delta_2))$,and $q_2'=q_4'=\frac{1}{4}(expit(\delta_1-\delta_2)-expit(-\delta_1-\delta_2))$.

Thus we could compare the effect of different regimes based on every history. The contrast is below:
$$\left \{ \begin{array}{lcl} \Delta_1=Q_1^{opt}(H_1,1;\hat{\beta_1},\hat{\psi_1})-Q_1^{opt}(H_1,-1;\hat{\beta_1},\hat{\psi_1})\\ \Delta_2=Q_2^{opt}(H_2,1;\hat{\beta_2},\hat{\psi_2})-Q_2^{opt}(H_2,-1;\hat{\beta_2},\hat{\psi_2})\\ \end{array} \right.$$

And we simulate the stage 2 mean outcome slope[]. We can see that while patients prefer different treatments in stage 1, there will be probably opposite best regime for patients.

Decision Slope

Conclusion

In this section, we will review our modeling detailedly. Our models have some advantages. But in the same time, the disadvantages of them are obvious. We will have a discussion about them in this section. Finally we will describe what can we do in the future.

Advantage

In the part of M1 Cultivation, Poisson Process is proper and it is with a low quantity of culculation, and the best times for collecting targeted virus are actually convincing because it is confirmed by the experimental results.

In the part of Injection System, Improved Michaelis-Menten Equation considers the variation of concentration of nitroreductase and the best time interval for injection is convincing, confirmed by experiment as well.Integrated Injection combined the parial process and injection process together, successfully describe the relationship of microorganism and medicine; Gaussian Mixture Model realize the variation of β with the position.

In the part of Targeting Ability, Effectiveness and Safety, based on the growth mechanism of M1 and the immunological effect in animals, we established an ordinary differential equation system about the number of M1, tumor cells and the time of dosing. Using ode15s and ode45s functions in MATLAB, we obtained the change curve of the number of M1, infected and uninfected tumor cells with the time of dosing, which verified the effectiveness of our system. In additional, We combined the process and results of other classical experiments to further verify the targeting ability, effectiveness and stability of our system, and used mathematical methods such as function fitting and hypothesis testing to obtain convincing results.

In the final part of our model, with the help of the popular BP neural network, we predicted the effect of our system in human body, making the whole project more comprehensive and in-depth.

Disadvantage

In ODE model of part 1, the descent of concentration of uninfected tumor cells is too quick;

The initial conditions of our ODE model are not correct enough. Our data is not enough, and the curve only show the trend of concentration.

The time-varying distribution of β had not been added to the ODE model.

Due to the difficulty of the experiment, our results can still be improved. Therefore, we have resorted to many experiments done by others to obtain our desired conclusion.

Expectation

It is possible to mutate M1 through directed evolution to increase its lethality and range of action while still being safe.

We can improve the targeting ability, effectiveness and safety of M1 through our dosing system model and testing model.

By optimizing our neural network model, we can reliably predict the effect of m1-based oncolytic virus therapy in the human body, thus accelerating the development of oncolytic virus therapy in medicine.

Renference

[1]Normand Rachelly,Du Wenfei,Briller Mayan,Gaujoux Renaud,Starosvetsky Elina,Ziv-Kenet Amit,Shalev-Malul Gali,Tibshirani Robert J,Shen-Orr Shai S. Found In Translation: a machine learning model for mouse-to-human inference.[J]. Nature methods,2018.

[2]Bibhas Chakraborty, Erica E.M. Moodie (2013). Statistical Methods for Dynamic Treatment Regimes. New York: Springer 154-164.


Address: No.74 Zhongshan Rd.2, Guangzhou, P.R. China

ZipCode: 510080

E-mail: sysumedicine2019@163.com