ETHZ/Modeling Basics

From 2007.igem.org

Revision as of 07:50, 12 October 2007 by Uhrm

Introduction

The functioning of our model depends mostly on protein concentrations. These proteins are produced within the E. coli cells from genes that we introduced into them. To understand the system, it is crucial to model gene expression and regulation accurately.

This wiki page presents the basic mechanisms and assumptions that went into the mathematical description of our iGEM model.

Constitutive Protein Production

The simplest parts of the system are genes that are continuously transcribed to produce a protein. At the same time, proteins have a finite half-life, which means that they are degraded over time. This leads to the simple model of protein production and degradation shown in Fig. 1.

Fig. 1: System of a constitutively produced protein P. Production rate is cmax and degradation rate is dP.

To find the concentration of protein P (as a function of time), the system of Fig. 1 can be written as an ODE

\[ \frac{d[P]}{dt} = c_{max} - d_P\,[P] \]

This equation states that the change of protein concentration over time is the difference between protein production (cmax) and protein degradation (dP[P]).
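As a quick numerical check of this production/degradation balance, the following sketch integrates the ODE with a simple forward-Euler step and compares the result with the closed-form solution of the linear equation. The function names and the parameter values are illustrative assumptions and are not taken from our model.

```python
import math

def simulate_constitutive(c_max, d_p, p0=0.0, t_end=10.0, dt=0.001):
    """Integrate d[P]/dt = c_max - d_p*[P] with forward Euler (illustrative only)."""
    p = p0
    for _ in range(int(round(t_end / dt))):
        p += dt * (c_max - d_p * p)
    return p

def analytic_constitutive(c_max, d_p, t, p0=0.0):
    """Closed-form solution of the same linear ODE."""
    p_ss = c_max / d_p  # steady state: production balances degradation
    return p_ss + (p0 - p_ss) * math.exp(-d_p * t)

# Hypothetical parameters in arbitrary units, chosen only for this example.
C_MAX = 2.0
D_P = 0.5

p_numeric = simulate_constitutive(C_MAX, D_P, t_end=10.0)
p_exact = analytic_constitutive(C_MAX, D_P, t=10.0)
print(f"numeric: {p_numeric:.4f}  analytic: {p_exact:.4f}  steady state: {C_MAX / D_P:.4f}")
```

Setting the derivative to zero gives the steady state [P] = cmax/dP. For first-order degradation as assumed here, a half-life t½ corresponds to dP = ln 2 / t½.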

It is worth looking at protein production more closely. The production of protein P depends on the expression of a gene that codes for this protein. In the case of constant protein production, the gene can be modeled by a constitutive promoter and a coding region for the protein (Fig. 2).

Fig. 2: Model of the gene coding for protein P. The promoter is constitutively active, so the gene is continuously expressed.

Regulated Protein Production

To build a system with logic functionality, it is necessary to produce a protein depending on the concentration of another protein (i.e. a transcription factor). To model such a system, the promoter of the constitutive production system in Fig. 2 must be extended to take into account the presence of the regulatory protein R. There are two possible cases: the regulatory protein either inhibits or activates the expression of the gene (see Figs. 3 and 4, respectively).

Fig. 3: Regulated protein production where the regulatory protein has an inhibitory effect. That is, the higher the concentration of R, the lower the expression of protein P.
Fig. 4: Regulated protein production where the regulatory protein activates protein production. That is, the higher the concentration of R, the higher the expression of protein P.

Inhibition

To derive the equations describing regulated transcription, we first need a model of how the transcription factor interacts with DNA. The simplest model is that the protein binds reversibly to the DNA:

\[ n\,R + \mathrm{DNA} \rightleftharpoons R_n\!\cdot\!\mathrm{DNA} \]

Note that the reaction above involves n transcription factor molecules. For certain transcription factors, the number of molecules involved is indeed greater than 1. Such cooperative binding makes the response to R steeper, which enables applications such as toggle switches.

When the transcription factor binds to DNA, it blocks RNA polymerase, the enzyme that transcribes the gene. Thus, the higher the concentration of R, the lower the transcription of the gene. By controlling the concentration of the regulatory protein R, the expression of protein P can effectively be regulated.

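To understand this process in more detail, we make the simplification that the binding of R to DNA is in equilibrium. That is, one can write

\[ K^n = \frac{[R]^n\,[\mathrm{DNA}]}{[R_n\!\cdot\!\mathrm{DNA}]} \]

where K is the equilibrium (dissociation) constant of the binding reaction, written here so that it has units of concentration.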

In the case of inhibition, the expression of the gene is proportional to the probability that the DNA is 'free' (meaning that there is no transcription factor bound to it). After some algebraic manipulation of the above equation, an expression for the fraction of 'free DNA' as a function of the transcription factor concentration can be derived:

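\[ \frac{[\mathrm{DNA}]}{[\mathrm{DNA}]_{tot}} = \frac{1}{1 + \left([R]/K\right)^{n}} \]

where [DNA]_tot = [DNA] + [R_n·DNA] is the total DNA concentration and K is the equilibrium constant of the binding equilibrium above.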
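The algebraic manipulation mentioned above can be spelled out in a few lines. This is a sketch under assumptions that the text does not state explicitly: the total DNA concentration [DNA]_tot is conserved, the concentration of free R is not noticeably reduced by binding, and the equilibrium constant is written as K^n so that K has units of concentration.

```latex
\begin{align*}
[R_n\!\cdot\!\mathrm{DNA}] &= [\mathrm{DNA}]\left(\frac{[R]}{K}\right)^{n}
  \quad\text{(equilibrium, } K^n = [R]^n[\mathrm{DNA}]/[R_n\!\cdot\!\mathrm{DNA}]\text{)} \\
[\mathrm{DNA}]_{tot} &= [\mathrm{DNA}] + [R_n\!\cdot\!\mathrm{DNA}]
  = [\mathrm{DNA}]\left(1 + \left(\frac{[R]}{K}\right)^{n}\right) \\
\frac{[\mathrm{DNA}]}{[\mathrm{DNA}]_{tot}} &= \frac{1}{1 + \left([R]/K\right)^{n}}
\end{align*}
```

With this convention, K is the concentration of R at which half of the DNA is free.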

We now have everything needed to write down a differential equation for the concentration of protein P, whose expression is regulated by protein R:

\[ \frac{d[P]}{dt} = c_{max}\,\frac{1}{1 + \left([R]/K\right)^{n}} - d_P\,[P] \]

Transcription no longer always takes place at the maximum rate cmax, as it did for the constitutively produced proteins, but is modulated by the concentration of protein R.
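To illustrate how the concentration of R modulates production, the sketch below evaluates the steady-state concentration of P for several repressor levels. It assumes that the repression term has the Hill form 1/(1 + ([R]/K)^n); the function names and all parameter values are hypothetical and chosen only for illustration.

```python
def free_dna_fraction(r, k, n):
    """Fraction of DNA without bound repressor, assuming 1 / (1 + (r/k)**n)."""
    return 1.0 / (1.0 + (r / k) ** n)

def repressed_rate(p, r, c_max, d_p, k, n):
    """Right-hand side of d[P]/dt for a gene repressed by R."""
    return c_max * free_dna_fraction(r, k, n) - d_p * p

def repressed_steady_state(r, c_max, d_p, k, n):
    """Concentration of P at which production and degradation balance."""
    return c_max * free_dna_fraction(r, k, n) / d_p

# Hypothetical parameters in arbitrary units.
C_MAX, D_P, K, N = 2.0, 0.5, 1.0, 2

for r in (0.0, 0.5, 1.0, 2.0, 10.0):
    p_ss = repressed_steady_state(r, C_MAX, D_P, K, N)
    print(f"[R] = {r:5.1f}  ->  steady-state [P] = {p_ss:.3f}")
```

Without repressor the steady state equals the constitutive value cmax/dP, and at [R] = K it drops to half of that value.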

Activation

To understand activation, we start from the same model of the transcription factor binding reversibly to DNA:

\[ n\,R + \mathrm{DNA} \rightleftharpoons R_n\!\cdot\!\mathrm{DNA} \]

In the case of activation, however, we are interested in the DNA-transcription factor complex, because the transcription rate is proportional to the probability that the transcription factor is bound to the DNA.

Assuming equilibrium conditions as in the inhibition case, we can derive an expression for the DNA-protein complex concentration as a function of protein concentration:

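\[ \frac{[R_n\!\cdot\!\mathrm{DNA}]}{[\mathrm{DNA}]_{tot}} = \frac{\left([R]/K\right)^{n}}{1 + \left([R]/K\right)^{n}} \]

where [DNA]_tot is the total DNA concentration and K is the equilibrium constant of the binding reaction, as in the inhibition case.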
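A short consistency check connects this expression to the inhibition case: DNA is either free or bound, so the bound fraction is one minus the free fraction. The notation below assumes that the equilibrium constant is written as K^n and that [DNA]_tot denotes the conserved total DNA concentration.

```latex
\[
\frac{[R_n\!\cdot\!\mathrm{DNA}]}{[\mathrm{DNA}]_{tot}}
  = 1 - \frac{[\mathrm{DNA}]}{[\mathrm{DNA}]_{tot}}
  = 1 - \frac{1}{1 + \left([R]/K\right)^{n}}
  = \frac{\left([R]/K\right)^{n}}{1 + \left([R]/K\right)^{n}}
\]
```

The bound fraction is 0 without R, reaches 1/2 at [R] = K, and approaches 1 for [R] much larger than K.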

Analogously, we can now write the full differential equation for the protein concentration when transcription is activated by protein R:

\[ \frac{d[P]}{dt} = c_{max}\,\frac{\left([R]/K\right)^{n}}{1 + \left([R]/K\right)^{n}} - d_P\,[P] \]
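The activated case can be explored in the same way. The following sketch integrates the equation for a fixed activator concentration with forward Euler, assuming the activation term has the Hill form ([R]/K)^n / (1 + ([R]/K)^n). Function names and parameter values are hypothetical.

```python
def bound_dna_fraction(r, k, n):
    """Fraction of DNA with bound activator, assuming x / (1 + x) with x = (r/k)**n."""
    x = (r / k) ** n
    return x / (1.0 + x)

def simulate_activated(r, c_max, d_p, k, n, p0=0.0, t_end=40.0, dt=0.01):
    """Forward-Euler integration of d[P]/dt at a constant activator level r."""
    production = c_max * bound_dna_fraction(r, k, n)  # constant because r is fixed
    p = p0
    for _ in range(int(round(t_end / dt))):
        p += dt * (production - d_p * p)
    return p

# Hypothetical parameters in arbitrary units.
C_MAX, D_P, K, N = 2.0, 0.5, 1.0, 2

for r in (0.0, 1.0, 10.0):
    p_end = simulate_activated(r, C_MAX, D_P, K, N)
    print(f"[R] = {r:5.1f}  ->  [P] after t = 40: {p_end:.3f}")
```

In this idealized form there is no production at all when the activator is absent, which is the assumption of perfect activation.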

Basic Production

The equations so far assume perfect inhibition or activation. In the case of inhibition, this means that the transcription of the gene for protein P is practically zero if the inhibitor concentration is high enough. In the case of activation, it means that there is no transcription in the absence of the activator protein. In reality, one always observes some basic production, despite high inhibitor concentrations or the absence of activator protein. The basic transcription is usually around 10-20% of the maximum transcription rate. The case of inhibition is illustrated in Fig. 5.

Fig. 5: Inhibition by the transcription factor is only effective between basic transcription and maximum transcription. That is, protein production cannot be shut off completely by inhibition.

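Thus we have to introduce some 'basic transcription' that always takes place. In the case of inhibition, this gives

\[ \frac{d[P]}{dt} = c_{max}\left(a + (1 - a)\,\frac{1}{1 + \left([R]/K\right)^{n}}\right) - d_P\,[P] \]

with K and n as in the inhibition model above. Note that the basic transcription rate is introduced through a leakiness factor a, which is a fraction of the maximum transcription rate cmax. Regulation by protein R is now only effective in the range between a·cmax and cmax.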
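The effect of the leakiness factor can be checked numerically. The sketch below assumes one common way of writing a leaky rate, cmax·(a + (1 − a)·f([R])), where f is the free-DNA fraction for inhibition or the bound-DNA fraction for activation. The function names and parameter values are hypothetical.

```python
def leaky_repression(r, c_max, a, k, n):
    """Production rate under repression with leakiness a (assumed form)."""
    return c_max * (a + (1.0 - a) / (1.0 + (r / k) ** n))

def leaky_activation(r, c_max, a, k, n):
    """Production rate under activation with leakiness a (assumed form)."""
    x = (r / k) ** n
    return c_max * (a + (1.0 - a) * x / (1.0 + x))

# Hypothetical parameters; a = 0.15 lies in the 10-20% range quoted in the text.
C_MAX, A, K, N = 2.0, 0.15, 1.0, 2

for r in (0.0, 1.0, 100.0):
    rep = leaky_repression(r, C_MAX, A, K, N)
    act = leaky_activation(r, C_MAX, A, K, N)
    print(f"[R] = {r:6.1f}  repression: {rep:.3f}  activation: {act:.3f}")
```

Both rates stay between a·cmax and cmax for every value of [R], so production is never switched off completely.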