Team:Amazonas-Brazil/Model

Team:Amazonas Brazil - 2019.igem.org

Model

Disclaimer: If you are already familiar with modelling and its tools, skip to the Our model section.

Introduction

What is modelling?

Math is grounded in axioms based on logic, which is the most basic way one may achieve conclusions. These axioms construct a system that always needs to respect them. So, math is as deep as one can get, and have results from logic assumptions. Bearing this in mind, it's only natural that this axiomatic logic can be used as a basis for constructing more and more complex things. But what is modelling?

Modelling describes a system through assumptions that can be interpreted as logic respecting the proposed axioms. This means that when a certain set of axioms is chosen, all the tools developed for it can also be used for the constructed model.

This line of reasoning is crucial for modern science because a good model can encapsulate every important aspect of a system through its assumptions, granting a way to get data without experiments! The first science to use it successfully was physics, developing ways to predict behaviors otherwise unknown, going beyond lab data from only math itself.

How to build a model?

As explained earlier, a model consists of simplifications of reality, in a way possible to describe it mathematically. So, first of all, we need to make assumptions to simplify and turn the system into something treatable with a certain mathematical framework enabling all of its tools.

This seems pretty abstract, but is actually very intuitive and can be better understood with an example. Let's say you want to build a model to describe winds in a certain place. Wind currents have a velocity, also a predominant direction and orientation... Very interesting way to think about it, because it looks just like vectors! Where the speed can be represented as the module of the vector. Not only that, but every point in space has a vector attached to it, meaning we have a vector field! (See Figure 1)

Figure 1: Vector field representing the speeds of wind above 40m in Southern California[1]. Available via license: Creative Commons Attribution 4.0 International

After choosing the framework and making necessary assumptions to treat this problem, all of the vector field treatment tools are enabled for us. For example, there is an algebraic topology theorem, the hairy ball theorem, which states that in a vector field tangent to the surface of a sphere, there needs to exist at least one point where the vector is zero. Extending the vector field treatment to the whole earth, this suggests that at all times, wind (our vector field) must have at least one zero point, in this case, a zero-point represents a cyclone/anticyclone, like the ones observed in Figure 2. Which seems strange but at the same time is incredible! From pure math and assumptions, we come to a true observable conclusion: at all times there must exist at least one cyclone/anticyclone on the earth! Even though we can't know exactly where it exists, we know it does.

Figure 2: Gif showing how each of the behaviors of a cyclone/anticyclone and saddle that can occur in a zero wind place. Source: http://chalkdustmagazine.com/blog/hairy-balls-cyclones-computer-graphics/

This simple example shows us how a model works and predicts behaviors, arriving at conclusions otherwise unknown, without experiments. Not only that, but models can help the experiments to be more precise and also expand the data beyond measurements, guaranteeing a certain behavior, the same way that experiments validate and enhance models, turning into a feedback loop, where experiments feed the model and vice-versa. Furthermore, models also turn works more scientific, showing that the experimental data can be expected, turning it more credible and reproducible, as we now know what aspects are essential for it to work.

Models in Synthetic Biology

In Synthetic Biology, experimentation can be pretty expensive, in an ideal case, we want to do the least experiments possible. This aspect is what makes models essential for this new scientific branch, cutting costs, predicting behaviors, helping characterization and expanding data overall. This can be noted from the works of the repressilator[2] and the Collins toggle switch[3], that heavily used mathematical modelling in their works, showing the expected performance.

Mathematical tools

This subsection aims to give a brief introduction to the math used in our model, which includes: logic gates and dynamical systems, also talking a bit about the construction of circuits with the help of Hill function, allowing the unfamiliar to keep up with the work developed here. Synthetic Biology draws lots of tools from electronics and engineering such as logic gates and dynamical systems which are used to describe circuits with varying complexity.

$\boldsymbol{\cdot} $ Logic gates

In electronics, circuits can have an unending amount of complexity added to it, in a way that it can get quite convoluted and almost impossible to take information from it. Logic gates come to simplify the behaviors implementing a Boolean algebra, where 1 represents the presence of a certain component, and 0 the absence. The most common are the: NOT, AND, OR, NAND, NOR, XOR and XNOR gates, represented in Figure 3.

Figure 3: All logic gates representations possible, where these symbols are implemented to build a certain circuit. Source: https://physicsabout.com/logic-gates/.

For example, let's take a look at the NOT gate. The NOT gate means that when there is no input, we have an output, and where there is input, there is no output. For example, every time it snows you don't go to school, snow is the input and going to school is the output, and the relation between going to school and snowing can be seen as a NOT gate. Even though this is a very simple example, this tool is very important for the construction of complex circuits, as one might look at the funny symbol and know what to expect from it, even if we don't see the full machinations behind it.

Snow	Going to school
0	1
1	0

Table 1: Demonstration of how the NOT gate works utilizing Boolean algebra for the example given above.

$\boldsymbol{\cdot} $ Dynamical Systems

The first thing to note before starting to talk about dynamical systems is that I'm assuming the reader already has an understanding of basic calculus and its more "physical" interpretations. From calculus, it's important the notion that a rate of change can be described as a derivative, as can be seen from the Newton quotient. This idea is the foundation for this whole branch of mathematics known as Dynamical Systems.

To understand this, you can think of how speed is the derivative of distance with relation to time meanwhile, acceleration is the derivative of speed $\vec{a}=\frac{d\vec{v}}{dt}$. In this way, you see the speed as the rate of change of the position of a certain object, which is very intuitive. Such intuition is so important that every science and engineer should understand it, as it is the starting point for most mathematical formalizations.

The cases described above are very simple, but gives us an idea of how these equations behave, other rates can also be considered, adding complexity. Let's say that a raindrop is falling, but as it gains speed, it accumulates more water and its radius becomes bigger by a factor k and g is the gravity. So we can describe the system as:

$$ \frac {dv}{dt} = g$$ $$ \frac {dR}{dt} = kv$$

Even though it is not physically correct, it gives us a feeling of how these ideas work. This particular dynamical system can be solved analytically, but the majority actually can't. So why study something we can't solve? One of the most important tools for dynamical systems is the phase portrait or phase diagram. Such a tool allows us to have more qualitative behaviors, without an analytical solution, predicting if the solutions will oscillate, be attracted by a particular point or repelled, and many more interesting behaviors [4]. The way it works is that we implement the rates as vector in a vector field, where the axis are the analyzed quantities, as seen in Figure 4.

Figure 4: Phase portrait of a pendulum, where its axis are the value theta of the angle and the angular velocity, where the balls represent a specific initial condition and latter how it evolves with time. Source: https://gereshes.com/2019/03/04/an-introduction-to-phase-portraits/

Even though there are other ways to mathematically describe circuits created from Synthetic Biology, such as the thermodynamical or the stochastic approach, the Dynamical System way is for sure the most widely used, being essential for the Synthetic biologist to understand the basics.

Model construction

Now that we have an idea of how Dynamical Systems work, we can start building the mathematical description of simple circuits and the assumptions needed.

The first and more important assumption is that there are enough copies of the analyzed component, in a way that a continuous approximation is good. When dealing with Synthetic Biology, in general, there are enough copies to make this consideration, as the colonies tend to be big enough. Another important aspect that in general is used is that a rate instantly interferes with the others, such that there are no delays from one to another. More complex models may consider this, with discrete-time lags, but it will not be considered further in this work. Also, this type of models doesn't consider how the reactions occur, only how their rate change and affect other rates.

With these assumptions, a simple model can be built for a chemical reaction such that:

$$ E + S \rightarrow ES $$

This can be mathematically described as, where the brackets represent the concentration and k is the reaction constant:

$$ \frac{d[E]}{dt} = -k[E][S] $$ $$ \frac{d[S]}{dt} = -k[E][S] $$ $$ \frac{d[ES]}{dt} = k[E][S] $$

As more and more reactions occur, more terms appear in each equation, the positive ones are the production rates, while the negative ones are the consumption rates. These equations can be treated as previously with dynamical systems, allowing us to describe how each concentration of E, S and ES change with time.

Another important tool for Syn Biology is the Hill equation, observed in Figure 5, where it usually appears every time there is activation or repression of a certain concentration. For example, suppose a concentration of A regulated by L, the Hill equation is written as for activation:

$$ \frac{dA}{dt} = \frac{v_{max}[L]^ {n} }{ K + [L]^n} $$

And for repression:

$$ \frac{dA}{dt} = \frac{v_{max} }{ 1 + \frac{[L]^ {n}}{K} } $$

Figure 5: Representation of the Hill function for activation, where we can see its sigmoidal shape[5].

Where $v_{max}$ is the maximal production rate, K is the Michaelis Menten constant, which tells us about the half-maximum production and n is the Hill coefficient, which is related to the steepness of the curve, or how fast it goes to its minimal or maximal value. Another interesting thing to notice is that for the first equation, as L grows, the rate goes to $v_{max}$ in the limit. And for the second equation, the rate is $v_{max}$ when [L] = 0 and goes to zero as [L] grows [6]. This is exactly the behavior wanted from a repression and activation function.

Team:Amazonas-Brazil/Model

Our model

Design influence

AND gate validation

Tumor growth model

Lactate	Hypoxia	Output
0	0	0
1	0	0
0	1	0
1	1	1