Difference between revisions of "Team:Marburg/Model"

Line 679: Line 679:
 
<p>
 
<p>
 
Due to the small amount of data we were able to collect we decided to use a polynomial regression model instead of a more data demanding approach like k nearest neighbors, support vector machines or neural networks.  
 
Due to the small amount of data we were able to collect we decided to use a polynomial regression model instead of a more data demanding approach like k nearest neighbors, support vector machines or neural networks.  
This regressional model was built using<a href="https://scikit-learn.org/stable/">scikit learn</a> [<i>Pedregose et.al.</i> 2011].  
+
This regressional model was built using <a href="https://scikit-learn.org/stable/">scikit learn</a> [<i>Pedregose et.al.</i> 2011].  
 
Even with this approach, the amount of data we have at our disposal is not enough to deliver a model that we would describe as accurate within and especially not outside of our training data.  
 
Even with this approach, the amount of data we have at our disposal is not enough to deliver a model that we would describe as accurate within and especially not outside of our training data.  
 
Nevertheless, we think a model like this is the best way forward if we want to properly predict the doubling time and with more data a very accurate model can be built.  
 
Nevertheless, we think a model like this is the best way forward if we want to properly predict the doubling time and with more data a very accurate model can be built.  
Line 840: Line 840:
 
However, for many of the parameters we cannot do that in one measurement, since the rpm, CO<sub>2</sub> concentration and temperature has to be identical.  
 
However, for many of the parameters we cannot do that in one measurement, since the rpm, CO<sub>2</sub> concentration and temperature has to be identical.  
 
For the light intensity there could have been more sampling which would have improved the performance of the model.
 
For the light intensity there could have been more sampling which would have improved the performance of the model.
 +
In addition to that, we used doubling times that we calculated by hand and by manually choosing datapoints for the calulations.
 +
This can also introduce an error.
 +
By automating that process and maybe not only predicting doubling times but the optical densities at different timepoints this manual error could be circumvented.
 +
However, the automated calculation of doubling times can be troublesome for some suboptimal growth curves, since the automatic definition of the exponential phase can be troublesome.
 +
If this problem would be solved, this would take all the manual work out of the process and further improve the model.
  
  

Revision as of 22:47, 21 October 2019

Modelling


This year we used our mathematical and programming background to look for artificial Neutral integration Site option (aNSo) and suitable terminators for our project. We took advantage of genome data bank of UTEX2973 and used bioinformatics tools to gain insights and implement it to our project. In addition to that, we designed a model to predict the doubling times of UTEX2973 that was only possible after a thorough investigation and standardization of the current state of the art methods. To achieve this level of standardization we also implemented a light model to properly predict light intensities for our cultures.


Growth Curve Model


artificial Neutral integration Site options


Terminator Model