ETHZ/Model

From 2007.igem.org

< ETHZ(Difference between revisions)
(Equations)
(Mathematical Model)
 
(180 intermediate revisions not shown)
Line 1: Line 1:
-
== Test Protocol ==
+
[[Image:ETHZ_banner.png|830px]]
 +
<!--
 +
<center>[[ETHZ | Main Page]] &nbsp;&nbsp;&nbsp;&nbsp; [[ETHZ/Model | System Modeling]] &nbsp;&nbsp;&nbsp;&nbsp; [[ETHZ/Simulation | Simulations]] &nbsp;&nbsp;&nbsp;&nbsp; [[ETHZ/Biology | System Implementation]] &nbsp;&nbsp;&nbsp;&nbsp; [[ETHZ/Biology/Lab| Lab Notes]] &nbsp;&nbsp;&nbsp;&nbsp; [[ETHZ/Meet_the_team | Meet the Team]] &nbsp;&nbsp;&nbsp;&nbsp; [[ETHZ/Internal | Team Notes]] &nbsp;&nbsp;&nbsp;&nbsp; [[ETHZ/Pictures | Pictures!]]</center><br>
 +
-->
 +
__NOTOC__
 +
<html>
 +
<script type="text/javascript" src="http://christos.bergeles.net/eth_dropdowntabs.js">
-
For our project we decided on designing a systems that is able to learn or adapt to its environment. Please note that this is only a minimal system that should be able to act as a proof of concept. A protocol how the system should react according to an input is shown in Figure 1.
+
/***********************************************
 +
* Drop Down Tabs Menu- (c) Dynamic Drive DHTML code library (www.dynamicdrive.com)
 +
* This notice MUST stay intact for legal use
 +
* Visit Dynamic Drive at http://www.dynamicdrive.com/ for full source code
 +
***********************************************/
-
[[Image:ETHZFlowdiagram.png|center|thumb|Fig. 1: Flow diagram. This figure shows the protocol with which the final system should be tested as well as the test results in form of the reported colors. There are tree phases the systems has to perform: (1) a training or learning phase in which the system learns an input and stores it in its memory, (2) a memory phase in which the system has to keep the content of its memory and finally (3) a recognition phase where the output of the system depends on the content of its memory as well as on the current input.|480px]]
+
</script>
-
The idea behind this protocol is that
+
<!-- CSS for Drop Down Tabs Menu #1 -->
-
* The system will be able to learn one of two input signals - aTc or IPTG - furing a learning phase if no input signal AHL is present. Depending on the input it will report by either green or yellow florescence.  
+
<link rel="stylesheet" type="text/css" href="http://christos.bergeles.net/eth_ddcolortabs.css" />
-
* Once the system learned, the inputs - aTc or IPTG - can be released and the system goes into a memory state in the presence of AHL. In this state no output color is reported. Since the inputs aTc and IPTG are not present during this state we force our system to really memorize to perform the next phase properly.
+
<div id="colortab" class="ddcolortabs">
-
* During a recognition phase the inputs aTc or IPTG are (re-) presented again. The system reports by changing its color depending on the input and its current memory state. That is why the system can have different florescence properties even in the presence of the same input. The recognition phase takes place in the presence of AHL to keep the memory enabled and avoid another learning phase.
+
<ul>
 +
<li><a href="https://2007.igem.org/wiki/index.php?title=ETHZ" title="Home" rel="dropmenu_home"><span>Home</span></a></li>
 +
<li><a href="https://2007.igem.org/wiki/index.php?title=ETHZ/Model" title="Modeling" rel="dropmenu_modeling"><span>System Modeling</span></a></li>
 +
<li><a href="https://2007.igem.org/wiki/index.php?title=ETHZ/Simulation" title="Simulations" rel="dropmenu_simulation"><span>Simulations</span></a></li>
 +
<li><a href="https://2007.igem.org/wiki/index.php?title=ETHZ/Biology" title="System Implementation" rel="dropmenu_biology"><span>System Implementation</span></a></li>
 +
<li><a href="https://2007.igem.org/wiki/index.php?title=ETHZ/Meet_the_team" title="Meet the team" rel="dropmenu_meettheteam"><span>Meet the team</span></a></li>
 +
<li><a href="https://2007.igem.org/wiki/index.php?title=ETHZ/Pictures" title="Pictures!" rel="dropmenu_pictures"><span>Pictures!</span></a></li>
 +
</ul>
 +
</div>
 +
<div class="ddcolortabsline">&nbsp;</div>
-
== Model Overview ==
 
-
To define our system we start with the classical back box approach as shown in Figure 2.
+
<!--1st drop down menu -->
 +
<div id="dropmenu_home" class="dropmenudiv_a">
 +
<a href="https://2007.igem.org/wiki/index.php?title=ETHZ#Introduction">Introduction</a>
 +
<a href="https://2007.igem.org/wiki/index.php?title=ETHZ#Team_Members">Team Members</a>
 +
<a href="https://2007.igem.org/wiki/index.php?title=ETHZ#Acknowledgments">Acknowledgments</a>
 +
<a href="https://2007.igem.org/wiki/index.php?title=ETHZ#Site_Map">Site map</a>
 +
</div>
-
[[Image:ETHZBlackbox.png|center|thumb|Fig. 2: Black box |480px]]
 
-
To fill that back box we have to think a bit more about the properties of our system. From our protocol we know that we need:
+
<!--2nd drop down menu -->
-
* 2 inputs that should be learned/detected/adapted to,
+
<div id="dropmenu_modeling" class="dropmenudiv_a" style="width: 150px;">
-
* 1 input to switch on the memory.
+
<a href="https://2007.igem.org/wiki/index.php?title=ETHZ/Model#Introduction">Introduction</a>
-
* We need to store at least 3 states. That is why we decided to use 2 state variables - cI and p22cII.
+
<a href="https://2007.igem.org/wiki/index.php?title=ETHZ/Model#Model_Overview">Model Overview</a>
-
* We need 4 florescense signals for the outputs. Actually one could also decide to take 6 output signals into account to further distinguish the learning phase from the recognition phase but we restricted ourself to 4 outputs to reduce the number of genes that we will need to implement the signals.
+
<a href="https://2007.igem.org/wiki/index.php?title=ETHZ/Model#Detailed_Model">Detailed Model</a>
 +
<a href="https://2007.igem.org/wiki/index.php?title=ETHZ/Model#Final_Model">Final Model</a>
 +
<a href="https://2007.igem.org/wiki/index.php?title=ETHZ/Modeling_Basics">Modeling Basics Page</a>
 +
<a href="https://2007.igem.org/wiki/index.php?title=ETHZ/Model#Mathematical_Model">Mathematical Model</a>
 +
<a href="https://2007.igem.org/wiki/index.php?title=ETHZ/FSM">FSM View Page</a>
 +
<a href="https://2007.igem.org/wiki/index.php?title=ETHZ/FlipFlop">Flip-Flop View Page</a>
 +
<a href="https://2007.igem.org/wiki/index.php?title=ETHZ/Parameters">Parameters Page</a>
 +
</div>
-
An overview about the final system is shown in Figure 3.
+
<!--3rd drop down menu -->
 +
<div id="dropmenu_simulation" class="dropmenudiv_a" style="width: 150px;">
 +
<a href="https://2007.igem.org/wiki/index.php?title=ETHZ/Simulation#Introduction">Introduction</a>
 +
<a href="https://2007.igem.org/wiki/index.php?title=ETHZ/Simulation#Simulation_of_Test_Cases">Test Cases</a>
 +
<a href="https://2007.igem.org/wiki/index.php?title=ETHZ/Simulation#Sensitivity_Analysis">Sensitivity Analysis</a>
 +
</div>
-
[[Image:ETHZFullsystemmodel.png|center|thumb|Fig.3: System overview. AHL, IPTG and aTc are passing the cell membrane where they build complexes with the sensor proteins LuxR, LacI and TetR. These sensor proteins and/or complexes are used to control memory formation and the production of the reporter proteins. The memory content is represented by the proteins cI and p22cII that repress the production of each other. YFP, RFP, CFP and GFP stand for yellow, red, cyan and green florescence protein, respectively.|480px]]
+
<!--4th drop down menu -->
 +
<div id="dropmenu_biology" class="dropmenudiv_a" style="width: 150px;">
 +
<a href="https://2007.igem.org/wiki/index.php?title=ETHZ/Biology#Introduction">Introduction</a>
 +
<a href="https://2007.igem.org/wiki/index.php?title=ETHZ/Biology#The_Complete_System">The Complete System</a>
 +
<a href="https://2007.igem.org/wiki/index.php?title=ETHZ/Biology#System_Phases">System Phases</a>
 +
<a href="https://2007.igem.org/wiki/index.php?title=ETHZ/Biology#Current_Cloning_Status">Current Cloning Status</a>
 +
<a href="https://2007.igem.org/wiki/index.php?title=ETHZ/Biology/parts">System Parts Page</a>
 +
<a href="https://2007.igem.org/wiki/index.php?title=ETHZ/Biology/Lab">Lab Notes Page</a>
 +
</div>
-
* To be more robust against pertubations we couple the state variables cI and p22cII in the way that is well known from memory circuits that engineers build where one  state variable is depressing the other one.
+
<!--5th drop down menu -->
-
* We know that the system should finally be implemented in form of DNA and proteins in a bacteria. Since - due to their size - proteins can only hardly pass the cell membrane (if they are not actively transported through the cell membrane) we decided on the much smaller inducer molecules AHL, IPTG and aTc to act as the inputs. However, since those inducers cannot directly act on the transciption of DNA nor on the production of proteins we need to produce the sensor proteins LuxR, LacI and TetR that build complexes with AHL, IPTG and aTc, respectively.
+
<div id="dropmenu_meettheteam" class="dropmenudiv_a" style="width: 150px;">
-
* The sensor proteins and complexes are used to control the memory formation and the production of the florescence reporter proteins YFP, RFP, CFP and GFP.
+
<a href="https://2007.igem.org/wiki/index.php?title=ETHZ/Meet_the_team#The_ETH_Zurich_07_Team">The ETH Zurich 07 Team</a>
 +
<a href="https://2007.igem.org/wiki/index.php?title=ETHZ/Meet_the_team#Team_Description">Team Description</a>
 +
<a href="https://2007.igem.org/wiki/index.php?title=ETHZ/Internal">Brainstorming Page</a>
 +
</div>
-
== Detailed Model ==
+
<script type="text/javascript">
-
In this section we are transfering our model into a more detailed desciptions of the involved molecules and proteins.
+
//SYNTAX: tabdropdown.init("menu_id", [integer OR "auto"])
 +
tabdropdown.init("colortab", 3)
 +
</script>
-
==== Sensors ====
+
</html>
 +
__NOTOC__
-
As shown in Figure 4, the proteins that act as sensors for the inducer signals are constituitively produced.
+
=Introduction=
-
[[Image:Model01b.png|center|thumb|Fig. 4: The proteins that act as sensors are constituitively produced.|140px]]
+
As previously discussed in the main page, we are interested in designing a system that is able to adapt to its environment. Our ideas are based on discussions about neural networks, and how we can create a biological system that exhibits the behavior of learning without having to resort to evolutionary processes.  
-
==== Memory ====
+
[[Image:ETHzFlowdiagram2.png|thumb|<b>Fig. 1</b>: Flow diagram. This figure shows the protocol with which the final system should be tested as well as the test results in form of the reported colors. The protocol is divided into three phases: (1) a training or learning phase in which the system learns an input and stores it in its memory, (2) a memory phase in which the system keeps the content of its memory, and finally (3) a recognition phase where the output of the system depends on the content of its memory as well as the current input.|450px]]
-
The mechanisms for the memory control are shown in Figure 5.
+
Learning can be considered as a switching of behavior, based on some external stimuli. Thus, it comes naturally to work on existing ideas of toggle switches and [[ETHZ/FSM | finite state machines]].
 +
 
 +
The proposed system is able to distinguish between two chemicals. It represents a minimal test system that is intended as a proof of concept. By introducing the ability to distinguish more than two chemical and thereby introducing new [[ETHZ/FSM | system states]], the power of the system or its "intelligence" can be scaled. A protocol depicting how the system should react according to an input is shown in Fig. 1.
 +
 
 +
The idea behind this protocol is that:
 +
* The system will be able to learn one of two input signals - ATC or IPTG - during a learning phase, while a "learning signal" (AHL) is not yet present. Depending on the input it will report by producing either cyan or yellow florescence.
 +
* Once the system has learned, the inputs - ATC or IPTG - can be removed and the system goes into a memory state in the presence of AHL. In this state, no output color is reported. Memorizing is guaranteed by removing the input chemicals.
 +
* During the recognition phase, the inputs ATC or IPTG are (re-)inserted. The system reports by changing its color depending on the input and its current memory state. This is why the system has different florescence properties even in the presence of the same input. The recognition phase takes place in the presence of AHL, to keep the memory enabled and avoid another learning phase. Since we would like to separate four different end states, we use four different fluorescent proteins to encode them.
 +
 
 +
==Model Overview==
 +
 
 +
The model for the proposed system is developed using a top-down approach. We start with a black box as shown in Fig. 2.
 +
 
 +
[[Image:ETHZBlackbox.png|thumb|<b>Fig. 2</b>: Black box |280px]]
 +
 
 +
The system is sketched in Fig. 3. It can be summarized as follows:
 +
* There are two inputs to be learned/detected/adapted to.
 +
* There is one separate input to switch on the memory.
 +
* The system has to alternate between at least three states. Hence, we decided to use two state variables - CI and P22CII (when interpreted as binary variables, in principle allowing for up to four different states).
 +
* There are four different output signals (synthesis of four fluorescent proteins). One could also decide to take six output signals into account to further distinguish the learning phase from the recognition phase. However, we restricted ourselves to four outputs to reduce the number of genes that are needed to implement the signals.
 +
 
 +
[[Image:ETHZFullsystemmodel.png|left|thumb|<b>Fig. 3</b>: System overview. AHL, IPTG and ATC pass the cell membrane where they build complexes with the sensor proteins LuxR, LacI and TetR. These sensor proteins and/or complexes are used to control the internal system state: the memory represented by the proteins CI and P22CII (mutually repressing their synthesis) and the sensed input (IPTG, ATC). CFP, RFP, GFP and YFP stand for yellow, red, cyan and green florescent protein, respectively.|420px]]
 +
 
 +
However, we had to keep in mind that the proposed system should be implemented in DNA, and that it would be sensitive to noise. As a result, we took several actions to achieve better experimental results and easier DNA construction:
 +
* To be more robust against perturbations, we coupled the state variables CI and P22CII like it is known from toggle switches [1]. Based on this approach, one state variable is depressing the other one, and the system's internal toggle has the possibility of reaching two stable states.
 +
* Since - due to their size - proteins can only hardly pass the cell membrane (if they are not actively transported through the cell membrane), we decided to use the much smaller inducer molecules AHL, IPTG and ATC as  inputs. However, since these inducers cannot directly act on the transcription of the DNA nor on the production of proteins, we need to produce the sensor proteins LuxR, LacI and TetR that build complexes with AHL, IPTG and ATC, respectively.
 +
* The sensor proteins and complexes are used to control the memory formation and the production of the florescent reporter proteins CFP, RFP, GFP and YFP.<br><br>
 +
 
 +
==Detailed Model==
 +
 
 +
In order to test our ideas, we came up with a detailed model of all the interactions in the system.
 +
After defining the desired behavior of our system (as shown in the introduction) and a [[ETHZ/FSM | formalized description of the system]] we identified necessary biological components and their interactions. As we can observe in Fig. 3, our system is composed from three basic subparts:
 +
* sensors,
 +
* memory, and
 +
* reporters.
 +
 
 +
===Sensors===
 +
 
 +
The first part contains the sensors. Our sensors are the proteins LacI, luxR and TetR, which are constitutively produced. The sensing subsystem is shown in Fig. 4.
 +
 
 +
[[Image:Model01b.png|center|thumb|<b>Fig. 4</b>: The proteins that act as sensors are constitutively produced.|140px]]
 +
 
 +
===Memory===
 +
 
 +
The second subsystem implements the memory. The memory control is based on the following underlying mechanisms:
 +
* The sensor proteins form complexes together with the inducers. These complexes are used to activate the transcription of the genes for the proteins CI and P22CII.
 +
* P22CII and CI repress the DNA transcription of each other, so that the closed loop system behaves as a toggle; a dynamic system with only two possible steady states (see Fig. 6).
 +
 
 +
[[Image:ETHZModelLearning.png|center|thumb|<b>Fig. 5</b>: Learning system: Depending on the inputs IPTG or ATC the proteins CI and P22CII are produced.|300px]]
 +
 
 +
* Fig. 5 shows the protein production system that is used during the learning phase. During the learning phase, there is still no CI or P22CII produced. They are produced, only if either IPTG or ATC is added, respectively. Since no AHL is present, the inner toggle switch (see Figure 6) is turned off.
   
   
-
[[Image:Model02b.png|center|thumb|Fig. 5: The memory content is represented by the concentrations of the proteins cI and p22cII.|520px]]
+
[[Image:ETHZModelMemory.png|center|thumb|<b>Fig. 6</b>: Memory system. If AHL is present the production of either CI or P22CII is continued.|420px]]
 +
 
 +
* During the memory phase, AHL is added and the IPTG and ATC are removed. This is why only the inner toggle switch (see Fig. 6) is turned on while the protein production systems shown in Fig. 5 are deactivated. Depending on what was produced during the learning phase, the production of either CI or P22CII is continued. This is why the system can act as memory, effectively storing the information it is exposed to.
 +
 
 +
Based on all the above, we present the final assembly of the memory subsystem in Fig. 7.
 +
 
 +
[[Image:Model02b.png|center|thumb|<b>Fig. 7</b>: Final interaction of the learning and memory system. The memory content is represented by the concentrations of the proteins CI and P22CII.|560px]]
 +
 
 +
===Reporters===
 +
 
 +
Fig. 8 gives an overview of the reporter subsystem. Florescent reporter proteins are expressed depending on the inducer concentrations, and the concentrations of CI and P22CII. For example, the presence of either TetR or CI will repress the production of YFP. However, if the inducer ATC is present, ATC will bind to TetR which can no longer block the production of YFP. We are using four fluorescent proteins, to encode the steady states of our system at the final recognition stage. In this way, we are able to distinguish between all the different transition paths in the system.
-
WE NEED BETTER FIGURE!!! SHOW PATHWAYS FOR LEARNING AND MEMORY (INNER TOGGLE SWITCH)
+
[[Image:Model03b.png|center|thumb|<b>Fig. 8</b>: The production of the florescent reporter proteins depends on the memory content (CI or P22CII) and the current input (ATC or IPTG).|600px]]
-
* The sensor proteins form complexes together with the inducers. These complexes are used to either activate (in case of the complex consisting of luxR and AHL) or repress (in case of the complexes consisting of LacI and IPTG as well as TetR and aTc) the DNA transciption of the proteins cI and p22cII.
+
==Final Design==
-
* Futhermore, p22cII and cI repress the DNA transciption of each other.
+
-
The final mechanism that is formed is working like this:
+
So far, we have presented all parts needed to model and simulate the behavior of the proposed system. By following the details presented in the previous section, we have all the necessary information to fully understand the interior of the black boxes that were presented in Fig. 2 and Fig. 3. Our overall system model is presented in Fig. 9.
-
* During the learning or training phase there is no cI or p22cII produced so far. If either IPTG or aTc is added, cI or p22cII are produced, respecively. Since no AHL is present the inner toggle switch is turned off.
+
-
* During the memory phase AHL is added and the IPTG and aTc are removed. That is why the inner toggle switch is turned off and depending on what is already present either the production of cI or p22cII is continued.
+
-
==== Reporters ====
+
[[Image:ETHZFullsystem.png|center|thumb|<b>Fig. 9</b>: Final design of the educatETH <i>E.coli</i> system.|900px]]
-
Figure 6 gives an overview about the reporter system. Reporter proteins are expressed depending on the inducer concentrations and the concentrations of cI and p22cII.
+
In contrast to this biological implementation, an alternative implementation using an engineering approach can be found on our [[ETHZ/FlipFlop | 'Engineer's View' page]].
-
[[Image:Model03b.png|center|thumb|Fig. 6: The production of the florescence reporter proteins depends on the memory content (cI or p22cII) and the current input (aTc or IPTG).|560px]]
+
==Mathematical Model==
-
== Equations ==
+
Based on the modeling done so far, we can derive the equations that govern the behavior of our system. The model is given by sets of coupled [[ETHZ/Modeling_Basics | ordinary differential equations]] which are presented below. We use a simple notation for the different elements of the equations. Namely:
 +
* All concentrations are given in brackets (for example [CI]).
 +
* All decay constants are described by a variable d followed by the name of the protein they refer to.
 +
* The production of the proteins is described by a basic constant production level named 'a' that models the leak of the production system, and a factor of l and c<sub>max</sub> that describe the maximum production of a protein, given in [M/min].
 +
* Depending on whether the DNA for a protein is implemented on a low or a high copy plasmid, we distinguish between l<sub>lo</sub> and l<sub>hi</sub>, respectively.
 +
* Dissociation constants are given by 'K' followed by the name of the protein they refer to.
 +
* The Hill cooperativities are described by the constants 'n' followed by the name of the protein they refer to.
-
To perform simulations we descibe our system with the help of simple differential equations. We use a simple notation. All concentrations are given in brackets like [cI]. All decay constants are described by a variable d followed by the name of the protein they refer to. We descibe the production of the proteins by a basic constant production level that models the leak of the production system and a factor of l and c<sub>max</sub> that descibe the maximum production of a protein given in [M]. Depending on if the DNA for a protein is implemented on a low or a high copy plasmid we distinguish between l<sub>lo</sub> and l<sub>hi</sub>, respectively.
+
For a more detailed introduction into how we transferred our model into equations, see the section [[ETHZ/Modeling_Basics|Modeling Basics]].<br>
-
For a more basic introduction into the definition of that equations please see ADD LINK TO BASIC EXPLANATIONS
+
===Allosteric regulation===
-
==== Constitutively produced proteins ====
+
These equations describe the formation of complexes between the inducers and sensor proteins. We do not use differential equations, but describe directly the concentrations of the complexes. This is a valid assumption, provided that we always wait a sufficient time, and the system reaches a steady state.
 +
We describe the total amount of proteins with the index 't', while we use the index '*' for proteins that build a complex with their respective inducer. For example:
 +
* [TetR]<sub>t</sub> describes the total concentration of TetR that is available. It is the sum of the free TetR proteins and the TetR proteins that form a complex with ATC.
 +
* [TetR<sub>*</sub>] describes the proteins that are available as a complex with ATC, and
 +
* [TetR] gives the concentration of free TetR proteins.
-
The equations for the constitutively produced proteins are very simple since we do not have to consider any dependence on other proteins. They are designed so that the protein concentration tends to l<sub>hi</sub>*c<sub>max</sub>/d for steady state.
+
[[Image:Eq04.png|238px]]
-
REPLACE FIGURE
+
===Constitutively produced proteins===
-
[[Image:Eq01.png|171px]]
+
-
==== Allosteric regulation ====
+
The differential equations for the constitutively produced proteins are very simple, since there is no dependence on other proteins. They are designed so that the protein concentration reaches the value l<sub>hi</sub>*c<sub>max</sub>/d at steady state.
-
These equations descibe the formation of complexes between the inducers and sensor proteins. We do not use differential equations but by directly descibing the concentrations of the complexes which is a valid assumption if we alway wait until steady state.
+
[[Image:Constitutive_braced.png|330px]]
-
REPLACE FIGURE
+
===Learning and memory subsystem===
-
[[Image:Eq04.png|208px]]
+
-
==== Learning system ====
+
The learning and memory subsystem is the core of the system that we are trying to model and implement. It is characterized by the feedback between its state variables/proteins CI and P22CII. Its behavior is further complicated by the variation of the production of the aforementioned proteins because of the inputs. The following equations describe the concentrations of the memory proteins as a system of coupled differential equations. The equations consist of two major production parts and a decay part.
 +
* The first production part models the production of either CI or P22CII during the learning phase, and corresponds to the model in Fig. 5.
 +
* The second production part describes the inner toggle switch that was shown in Fig. 6.
-
The learning system is the most complicated part of our system due to the feedback between the state variables and proteins cI and p22cII as well as the variation of the production of these proteins depending on the inputs.
+
[[Image:Toggle_braced.png|770px]]
-
DESCIBE IN MORE DETAIL ONCE THERE ARE THE RIGHT PICTURES
+
===Reporting subsystem===
-
REPLACE FIGURE
+
The equations for the reporting subsystem finally describe the production of the florescence proteins depending on the inputs and memory proteins as modeled in Figure 8. Note that both the free constitutively produced proteins and the memory proteins repress the production of the florescence proteins. So e.g. YFP is only produced when there is both no CI and all TetR is bound in a complex together with ATC.
-
[[Image:Eq02.png|475px]]
+
-
==== Reporter system ====
+
[[Image:Reporter_braced.png|778px]]
-
The equations for the reporter system finally descibe the production of the florescence proteins depending on the inputs and memory proteins. Note that both inputs and memory proteins act repressively on the production of the florescence proteins. So e.g. YFP is only produced when there is both no
+
The systems of equations presented above describe and predict the behavior of our system. We have simulated the behavior of our system at steady states, and the results can be seen in the section [[ETHZ/Simulation|Simulations]]. In order to increase the accuracy of our results, we conducted an extensive literature survey, in order to isolate and find the parameters of our system. Since this is a burden for every team undertaking a complicated project in synthetic biology, we are presenting our full table of parameters in the [[ETHZ/Parameters|Parameters]] page.
-
REPLACE FIGURE
+
== References ==
-
[[Image:Eq03.png|586px]]
+
<p>
 +
[http://www.nature.com/nature/journal/v403/n6767/abs/403339a0.html &#91;1&#93; Gardner TS, Cantor CR and Collins JJ] <i>"Construction of a genetic toggle switch in Escherichia coli"</i>, Nature 403:339–342, 2000<br />

Latest revision as of 19:29, 26 October 2007

ETHZ banner.png

 


Introduction

As previously discussed in the main page, we are interested in designing a system that is able to adapt to its environment. Our ideas are based on discussions about neural networks, and how we can create a biological system that exhibits the behavior of learning without having to resort to evolutionary processes.

Fig. 1: Flow diagram. This figure shows the protocol with which the final system should be tested as well as the test results in form of the reported colors. The protocol is divided into three phases: (1) a training or learning phase in which the system learns an input and stores it in its memory, (2) a memory phase in which the system keeps the content of its memory, and finally (3) a recognition phase where the output of the system depends on the content of its memory as well as the current input.

Learning can be considered as a switching of behavior, based on some external stimuli. Thus, it comes naturally to work on existing ideas of toggle switches and finite state machines.

The proposed system is able to distinguish between two chemicals. It represents a minimal test system that is intended as a proof of concept. By introducing the ability to distinguish more than two chemical and thereby introducing new system states, the power of the system or its "intelligence" can be scaled. A protocol depicting how the system should react according to an input is shown in Fig. 1.

The idea behind this protocol is that:

  • The system will be able to learn one of two input signals - ATC or IPTG - during a learning phase, while a "learning signal" (AHL) is not yet present. Depending on the input it will report by producing either cyan or yellow florescence.
  • Once the system has learned, the inputs - ATC or IPTG - can be removed and the system goes into a memory state in the presence of AHL. In this state, no output color is reported. Memorizing is guaranteed by removing the input chemicals.
  • During the recognition phase, the inputs ATC or IPTG are (re-)inserted. The system reports by changing its color depending on the input and its current memory state. This is why the system has different florescence properties even in the presence of the same input. The recognition phase takes place in the presence of AHL, to keep the memory enabled and avoid another learning phase. Since we would like to separate four different end states, we use four different fluorescent proteins to encode them.

Model Overview

The model for the proposed system is developed using a top-down approach. We start with a black box as shown in Fig. 2.

Fig. 2: Black box

The system is sketched in Fig. 3. It can be summarized as follows:

  • There are two inputs to be learned/detected/adapted to.
  • There is one separate input to switch on the memory.
  • The system has to alternate between at least three states. Hence, we decided to use two state variables - CI and P22CII (when interpreted as binary variables, in principle allowing for up to four different states).
  • There are four different output signals (synthesis of four fluorescent proteins). One could also decide to take six output signals into account to further distinguish the learning phase from the recognition phase. However, we restricted ourselves to four outputs to reduce the number of genes that are needed to implement the signals.
Fig. 3: System overview. AHL, IPTG and ATC pass the cell membrane where they build complexes with the sensor proteins LuxR, LacI and TetR. These sensor proteins and/or complexes are used to control the internal system state: the memory represented by the proteins CI and P22CII (mutually repressing their synthesis) and the sensed input (IPTG, ATC). CFP, RFP, GFP and YFP stand for yellow, red, cyan and green florescent protein, respectively.

However, we had to keep in mind that the proposed system should be implemented in DNA, and that it would be sensitive to noise. As a result, we took several actions to achieve better experimental results and easier DNA construction:

  • To be more robust against perturbations, we coupled the state variables CI and P22CII like it is known from toggle switches [1]. Based on this approach, one state variable is depressing the other one, and the system's internal toggle has the possibility of reaching two stable states.
  • Since - due to their size - proteins can only hardly pass the cell membrane (if they are not actively transported through the cell membrane), we decided to use the much smaller inducer molecules AHL, IPTG and ATC as inputs. However, since these inducers cannot directly act on the transcription of the DNA nor on the production of proteins, we need to produce the sensor proteins LuxR, LacI and TetR that build complexes with AHL, IPTG and ATC, respectively.
  • The sensor proteins and complexes are used to control the memory formation and the production of the florescent reporter proteins CFP, RFP, GFP and YFP.

Detailed Model

In order to test our ideas, we came up with a detailed model of all the interactions in the system. After defining the desired behavior of our system (as shown in the introduction) and a formalized description of the system we identified necessary biological components and their interactions. As we can observe in Fig. 3, our system is composed from three basic subparts:

  • sensors,
  • memory, and
  • reporters.

Sensors

The first part contains the sensors. Our sensors are the proteins LacI, luxR and TetR, which are constitutively produced. The sensing subsystem is shown in Fig. 4.

Fig. 4: The proteins that act as sensors are constitutively produced.

Memory

The second subsystem implements the memory. The memory control is based on the following underlying mechanisms:

  • The sensor proteins form complexes together with the inducers. These complexes are used to activate the transcription of the genes for the proteins CI and P22CII.
  • P22CII and CI repress the DNA transcription of each other, so that the closed loop system behaves as a toggle; a dynamic system with only two possible steady states (see Fig. 6).
Fig. 5: Learning system: Depending on the inputs IPTG or ATC the proteins CI and P22CII are produced.
  • Fig. 5 shows the protein production system that is used during the learning phase. During the learning phase, there is still no CI or P22CII produced. They are produced, only if either IPTG or ATC is added, respectively. Since no AHL is present, the inner toggle switch (see Figure 6) is turned off.
Fig. 6: Memory system. If AHL is present the production of either CI or P22CII is continued.
  • During the memory phase, AHL is added and the IPTG and ATC are removed. This is why only the inner toggle switch (see Fig. 6) is turned on while the protein production systems shown in Fig. 5 are deactivated. Depending on what was produced during the learning phase, the production of either CI or P22CII is continued. This is why the system can act as memory, effectively storing the information it is exposed to.

Based on all the above, we present the final assembly of the memory subsystem in Fig. 7.

Fig. 7: Final interaction of the learning and memory system. The memory content is represented by the concentrations of the proteins CI and P22CII.

Reporters

Fig. 8 gives an overview of the reporter subsystem. Florescent reporter proteins are expressed depending on the inducer concentrations, and the concentrations of CI and P22CII. For example, the presence of either TetR or CI will repress the production of YFP. However, if the inducer ATC is present, ATC will bind to TetR which can no longer block the production of YFP. We are using four fluorescent proteins, to encode the steady states of our system at the final recognition stage. In this way, we are able to distinguish between all the different transition paths in the system.

Fig. 8: The production of the florescent reporter proteins depends on the memory content (CI or P22CII) and the current input (ATC or IPTG).

Final Design

So far, we have presented all parts needed to model and simulate the behavior of the proposed system. By following the details presented in the previous section, we have all the necessary information to fully understand the interior of the black boxes that were presented in Fig. 2 and Fig. 3. Our overall system model is presented in Fig. 9.

Fig. 9: Final design of the educatETH E.coli system.

In contrast to this biological implementation, an alternative implementation using an engineering approach can be found on our 'Engineer's View' page.

Mathematical Model

Based on the modeling done so far, we can derive the equations that govern the behavior of our system. The model is given by sets of coupled ordinary differential equations which are presented below. We use a simple notation for the different elements of the equations. Namely:

  • All concentrations are given in brackets (for example [CI]).
  • All decay constants are described by a variable d followed by the name of the protein they refer to.
  • The production of the proteins is described by a basic constant production level named 'a' that models the leak of the production system, and a factor of l and cmax that describe the maximum production of a protein, given in [M/min].
  • Depending on whether the DNA for a protein is implemented on a low or a high copy plasmid, we distinguish between llo and lhi, respectively.
  • Dissociation constants are given by 'K' followed by the name of the protein they refer to.
  • The Hill cooperativities are described by the constants 'n' followed by the name of the protein they refer to.

For a more detailed introduction into how we transferred our model into equations, see the section Modeling Basics.

Allosteric regulation

These equations describe the formation of complexes between the inducers and sensor proteins. We do not use differential equations, but describe directly the concentrations of the complexes. This is a valid assumption, provided that we always wait a sufficient time, and the system reaches a steady state. We describe the total amount of proteins with the index 't', while we use the index '*' for proteins that build a complex with their respective inducer. For example:

  • [TetR]t describes the total concentration of TetR that is available. It is the sum of the free TetR proteins and the TetR proteins that form a complex with ATC.
  • [TetR*] describes the proteins that are available as a complex with ATC, and
  • [TetR] gives the concentration of free TetR proteins.

Eq04.png

Constitutively produced proteins

The differential equations for the constitutively produced proteins are very simple, since there is no dependence on other proteins. They are designed so that the protein concentration reaches the value lhi*cmax/d at steady state.

Constitutive braced.png

Learning and memory subsystem

The learning and memory subsystem is the core of the system that we are trying to model and implement. It is characterized by the feedback between its state variables/proteins CI and P22CII. Its behavior is further complicated by the variation of the production of the aforementioned proteins because of the inputs. The following equations describe the concentrations of the memory proteins as a system of coupled differential equations. The equations consist of two major production parts and a decay part.

  • The first production part models the production of either CI or P22CII during the learning phase, and corresponds to the model in Fig. 5.
  • The second production part describes the inner toggle switch that was shown in Fig. 6.

Toggle braced.png

Reporting subsystem

The equations for the reporting subsystem finally describe the production of the florescence proteins depending on the inputs and memory proteins as modeled in Figure 8. Note that both the free constitutively produced proteins and the memory proteins repress the production of the florescence proteins. So e.g. YFP is only produced when there is both no CI and all TetR is bound in a complex together with ATC.

Reporter braced.png

The systems of equations presented above describe and predict the behavior of our system. We have simulated the behavior of our system at steady states, and the results can be seen in the section Simulations. In order to increase the accuracy of our results, we conducted an extensive literature survey, in order to isolate and find the parameters of our system. Since this is a burden for every team undertaking a complicated project in synthetic biology, we are presenting our full table of parameters in the Parameters page.

References

[http://www.nature.com/nature/journal/v403/n6767/abs/403339a0.html [1] Gardner TS, Cantor CR and Collins JJ] "Construction of a genetic toggle switch in Escherichia coli", Nature 403:339–342, 2000