Introduction
We developed to enable RNA export out of mammalian cells for minimal invasive longterm monitoring.
Incorporating the feedback given by Prof. Hohendorf, we developed an in-silico model for calculating the amount of obtainable RNA based on the experimental setup to account for the various factors involved. The image above shows some of these factors.
Therefore, we explored the influences of parameters such as cell number, cellular RNA levels, RNA to binding protein affinity, or binding protein export to create this model. We then developed the following rearrangeable formula, which allows for specific optimizations to the experiment at hand.
When applied, one can estimate the number of cells required to produce a detectable signal. The accuracy of this estimation was then experimentally tested by comparing the formula's results to our own data. The cell number detection limit predicted by our model correlated with the experimental data from a cell titration experiment.
The model can now be used to determine whether ALiVE would be suited for one's needs.
Incorporating the feedback given by Prof. Hohendorf, we developed an in-silico model for calculating the amount of obtainable RNA based on the experimental setup to account for the various factors involved. The image above shows some of these factors.
Therefore, we explored the influences of parameters such as cell number, cellular RNA levels, RNA to binding protein affinity, or binding protein export to create this model. We then developed the following rearrangeable formula, which allows for specific optimizations to the experiment at hand.
When applied, one can estimate the number of cells required to produce a detectable signal. The accuracy of this estimation was then experimentally tested by comparing the formula's results to our own data. The cell number detection limit predicted by our model correlated with the experimental data from a cell titration experiment.
The model can now be used to determine whether ALiVE would be suited for one's needs.
Formula Overview
RNAO = Ncells * RNAC * Dbound(NBP, RNAC, KD) * EBP * T(t, tM) * Dvol * η
RNAO: Total amount of extracted RNA
Ncells: Number of cells
RNAC: Average amount of RNA of interest per cell
NBP: RNA binding proteins per cell
Dbound(NBP, RNAC, KD): Fraction of RNA bound to RNA binding protein
EBP: Fraction of RNA binding protein exported
T(t, tM): Time(t)- and time-since-last-medium-change(tM)-dependant medium satuation function
Dvol: Fraction of Volume used
η: Fraction of signal retrieved by measurement after preparation
Ncells: Number of cells
RNAC: Average amount of RNA of interest per cell
NBP: RNA binding proteins per cell
Dbound(NBP, RNAC, KD): Fraction of RNA bound to RNA binding protein
EBP: Fraction of RNA binding protein exported
T(t, tM): Time(t)- and time-since-last-medium-change(tM)-dependant medium satuation function
Dvol: Fraction of Volume used
η: Fraction of signal retrieved by measurement after preparation
Assumptions
We assumed the following:- All the factors are directly proportional to the RNA output
- RNAC and NBP are constant for a specific cell line
- EBP is constant for a specific adapter-marker (Gag for VLPs, CD63 for Exosomes) couple
- Dvol and η are constant for the experimental setup
- Dbound needs to be derived from the KD formula
- T levels off
Parameter Determination
Prior knowlegde
Dvol
The Fraction of Volume used was Dvol=1 in our case, since we took all of the sample at the beginning of each assay and later accounted for steps where we used less than the full volume.Ncells
We used the number of cells at confluency for the respective well format, if the cells were seeded as suggested.Most of the model data was obtained from 6-well plate wells, resulting in Ncells=1,2*106 for these data points.
η
The fraction of signal retrieved by measurement after preparation is the product of all quantified assay efficiencies, especially the vesicle purification (determined by HiBiT-Assay) and an estimated error. We assumed η=0.4 overall.HiBiT-Assay
To assess and quantify the amount of exosomal and VLP marker in the supernatant above the cells and in the cells themselves we used Promega's Nano-Glo® HiBiT Extracellular Detection System. It provides the LgBiT and the substrate that the HiBiT tag on CD63 or Gag needs to luminesce. Two measurements are made on the same sample: One with unlysed supernatant and another one with detergent- and heat-treated supernatant. The signal coming from untreated supernatant provides information about how much tagged protein was not packaged into vesicles, thus determining the leakage of the system. The detergent- and heat-treatment ruptures the vesicles, allowing the LgBiT access to all HiBiT-molecules in the sample, therefore providing information about the total content of CD63-HiBiT and Gag-HiBiT. Cells can also be lysed in order to solubilize the cytosolic, exosomal and VLP markers not secreted yet. With a HiBiT standard curve, the measured RLU signal is then converted into pure HiBiT equivalents that we assumed to be the best possible approximation for the quantity of the target molecule, since there is exactly one HiBiT domain on each protein-of-interest.
NBP
The RNA binding proteins per cell were determined by dividing the cellular HiBiT-signal by the number of cells. NBP=2,4*109 was determined as mean for the exosomes, NBP=3,7*108 for the VLPs.EBP
The fraction of RNA binding protein exported was computed by dividing the difference between the lysed supernatant and the unlysed supernatant signal (resulting in the HiBiT signal originating from intact vesicles) by the sum of the lysed supernatant and cell signal (total HiBiT signal). It was found that EBP=0,02 for exosomes and EBP=0,45 for VLPs.RT-qPCR
The RNA in the vesicles was isolated either by a PEG-based precipitation and phenol-chloroform extraction for exosomes or a viral RNA extraction-specific protocol for VLPs. Cellular RNA was extracted via the TRIzolTM reagent. The samples were then reverse transcribed into cDNA and a qPCR was carried out to precisely quantify the FLuc transcript content of exosomes and VLPs. For more elaborate applications, the transcript-of-interest would be quantified with this method. The relative CT-value detected by the qPCR-maschine can be converted into absolute copy numbers by using a standard curve of known copy numbers. To get increase the precission of the value, a noRT control copy number is subtracted from the reverse-transcribed copy number.
RNAC
The average amount of RNA of interest per cell is calculated by dividing the mRNA of interest copy number of extracted cells by the number of cells. Our best RT-qPCR result for FLuc-mRNA without export (to not falsify the result by said export) showed RNAC=3,3*104RNAO
The total amount of extracted RNA also needs to be measured as a reference. The value is the number of transcripts of the extracted vesicles. For some examples, look at the section belowComplex parameters
Dbound(NBP, RNAC, KD)
To get the fraction of RNA bound to RNA binding proteinDbound(NBP, RNAC, KD) = (0,5*(-√(POTENZ(RNAC)-(2*RNAC*(NBP-KD))+POTENZ(NBP+KD)2 ))+(RNAC+NBP+KD)))/RNAC,
Afree = Atotal/AB and Bfree = Btotal/AB were put into KD = (Afree*Bfree)/AB, with Atotal representing RNAC and Btotal representing NBP. The resulting formula was solved for AB and divided by Atotal to obtain Dbound. The only parameter, that was not already determined is the KD. From literature KD=1*10-11 was obtained for the RNA binding protein L7Ae(Schroeder et al. 2010).T(t, tM)
The time(t)- and time-since-last-medium-change(tM)-dependant medium satuation function was proposed and designed to account for satuation states of the RNA in the cells and the vesicles in the medium. These satuation states are theorized to result from constant growth with each generation decaying over time, modeled as a factor to multiply RNAC. This factor for each of the two layers fl = Σi=0tl (1-a)i = (1-(1-a)tl+1)/a is dependent on the parameter a representing the specific decay rate for the RNA or vesicle respectively. After rescaling the factors and multiplying them with each other, we arrive at the following final formula:T(t, tM) = (1-(1-aRNA)t+1)*(1-(1-avesicle)t-tM+1)*TMax
As the parameters in this formula were hard to measure, they were fitted based on our data. aRNA wasValidation and Range of uncertainty
To validate our model, we rearranged the formula to calculate the cell number.
We decided to desire a yield of 100000 RNA molecules for a clean and reliable readout, although this is above the detection limit of qPCR.
t=72 and tM=48 were assumed as this were the prefered settings of the cell titration assay we performed to check the numbers obtained from the above calculations. Besides the resulting cell numbers and RT-qPCR copy numbers (omRNA), you can see the model applied to the different conditions with the resulting predicted transcript number (omRNA*). This additional information was not generated for fun but to estimate the error of the model.
The sample name is made to display the corresponding conditions with
For VLPs, it is a bit less obvious, both of the values are below the sample with the minimal pipettable cell number, "TitVLP_SN8", indicating the predictions approximate correctness.
The mean deviation is only around 23% making the model fairly accurate over all.
Ncells = RNAO / (RNAC * Dbound(NBP, RNAC, KD) * EBP * T(t, tM) * Dvol * η)
N_cells | omRNA | cmRNA/cell | D_bound | BP_E | T(t, t_M) | Dvol | eta | |
Exo Minimum Cells | 851,174482306478 | 100.000,00 | 33111,1126625632 | 1 | 0,02 | 0,443524528380608 | 1 | 0,4 |
VLP Minimum Cells | 30,2754578514225 | 100.000,00 | 33111,1126625632 | 1 | 0,45 | 0,554195506674102 | 1 | 0,4 |
We decided to desire a yield of 100000 RNA molecules for a clean and reliable readout, although this is above the detection limit of qPCR.
t=72 and tM=48 were assumed as this were the prefered settings of the cell titration assay we performed to check the numbers obtained from the above calculations. Besides the resulting cell numbers and RT-qPCR copy numbers (omRNA), you can see the model applied to the different conditions with the resulting predicted transcript number (omRNA*). This additional information was not generated for fun but to estimate the error of the model.
Ncells | omRNA | omRNA* | N_cells | cmRNA/cell | D_bound | BP_E | T(t, t_M) | Dvol | eta | Deviation | ABS Deviation | |||
TitExo_SN1 | 25.000,00 | 966.712,46 | 2.937.118,13 | 25.000,00 | 33.111,11 | 1,00 | 0,02 | 0,44 | 1,00 | 0,40 | 0,67 | 0,67 | ||
TitExo_SN2 | 12.500,00 | 428.546,79 | 1.468.559,06 | 12.500,00 | 33.111,11 | 1,00 | 0,02 | 0,44 | 1,00 | 0,40 | 0,71 | 0,71 | ||
TitExo_SN3 | 6.250,00 | 235.834,36 | 734.279,53 | 6.250,00 | 33.111,11 | 1,00 | 0,02 | 0,44 | 1,00 | 0,40 | 0,68 | 0,68 | ||
TitExo_SN4 | 3.125,00 | 124.509,18 | 367.139,77 | 3.125,00 | 33.111,11 | 1,00 | 0,02 | 0,44 | 1,00 | 0,40 | 0,66 | 0,66 | ||
TitExo_SN5 | 1.562,50 | 110.154,58 | 183.569,88 | 1.562,50 | 33.111,11 | 1,00 | 0,02 | 0,44 | 1,00 | 0,40 | 0,40 | 0,40 | ||
TitExo_SN6 | 781,25 | 74.214,07 | 91.784,94 | 781,25 | 33.111,11 | 1,00 | 0,02 | 0,44 | 1,00 | 0,40 | 0,19 | 0,19 | ||
TitExo_SN7 | 390,63 | 60.506,59 | 45.892,47 | 390,63 | 33.111,11 | 1,00 | 0,02 | 0,44 | 1,00 | 0,40 | -0,32 | 0,32 | ||
TitExo_SN8 | 195,31 | 77.629,42 | 22.946,24 | 195,31 | 33.111,11 | 1,00 | 0,02 | 0,44 | 1,00 | 0,40 | -2,38 | 2,38 | ||
TitVLP_SN1 | 25.000,00 | 27.136.224,28 | 82.575.134,36 | 25.000,00 | 33.111,11 | 1,00 | 0,45 | 0,55 | 1,00 | 0,40 | 0,67 | 0,67 | ||
TitVLP_SN2 | 12.500,00 | 23.970.752,32 | 41.287.567,18 | 12.500,00 | 33.111,11 | 1,00 | 0,45 | 0,55 | 1,00 | 0,40 | 0,42 | 0,42 | ||
TitVLP_SN3 | 6.250,00 | 12.586.309,84 | 20.643.783,59 | 6.250,00 | 33.111,11 | 1,00 | 0,45 | 0,55 | 1,00 | 0,40 | 0,39 | 0,39 | ||
TitVLP_SN4 | 3.125,00 | 7.882.675,82 | 10.321.891,80 | 3.125,00 | 33.111,11 | 1,00 | 0,45 | 0,55 | 1,00 | 0,40 | 0,24 | 0,24 | ||
TitVLP_SN5 | 1.562,50 | 2.582.841,55 | 5.160.945,90 | 1.562,50 | 33.111,11 | 1,00 | 0,45 | 0,55 | 1,00 | 0,40 | 0,50 | 0,50 | ||
TitVLP_SN6 | 781,25 | 2.119.567,10 | 2.580.472,95 | 781,25 | 33.111,11 | 1,00 | 0,45 | 0,55 | 1,00 | 0,40 | 0,18 | 0,18 | ||
TitVLP_SN7 | 390,63 | 1.080.091,83 | 1.290.236,47 | 390,63 | 33.111,11 | 1,00 | 0,45 | 0,55 | 1,00 | 0,40 | 0,16 | 0,16 | ||
TitVLP_SN8 | 195,31 | 306.911,71 | 645.118,24 | 195,31 | 33.111,11 | 1,00 | 0,45 | 0,55 | 1,00 | 0,40 | 0,52 | 0,52 |
The sample name is made to display the corresponding conditions with
- "Tit" refering to the titration assay
- "Exo" and "VLP" to the respective vesicle type
- "SN" to the supernatant (containing the vesicles but no cells)
- the number indicating the dilution factor with "1" referring to conditions seeded with the suggested cell number, "2" 1/2 of the suggested cell number, "3" 1/4 up to "8" being 1/128 of the suggested cell number
Detection Limit
When looking at the exosomal samples, one can clearely see the predicted cell number as well as the required 100000 copy numbers located between "TitExo_SN5" and "TitExo_SN6", proving the prediction correct.For VLPs, it is a bit less obvious, both of the values are below the sample with the minimal pipettable cell number, "TitVLP_SN8", indicating the predictions approximate correctness.
Deviation
The last two columns show the deviation of our predicted results from the measured values. The maximum error of our model (taking the maximum of the last column) is around 2.4. This number is quite high but still in the right order of magnitude. It is also the only deviation above 0.71, so it can be attributed to errors resulting from the low cell number.The mean deviation is only around 23% making the model fairly accurate over all.
Conclusion
Incorporating the feedback given by Prof. Hohendorf, we decided to create an in-silico model to predict the outcome of ALiVE experiments. This allowed us to predict the minimum number of cells needed for a detectable signal that we were in fact able to validate. We were event able to show the models high accurecy.