Difference between revisions of "Team:Calgary/Model/InSilicoEmulsionSystemVerification"

(Added Future Directions)
 
(18 intermediate revisions by 4 users not shown)
Line 1: Line 1:
 
{{:Template:Calgary/Layout}}
 
{{:Template:Calgary/Layout}}
 
<html>
 
<html>
<head>
+
  <head>
  
</head>
+
  </head>
<body>
+
  <body>
  
<div class="container-fluid">
+
    <div class="container-fluid">
  
<div class = "fixed" id="fixed-content">
+
      <div class = "fixed" id="fixed-content">
+
  
<div class="mobile-banner-back" id="banner">
 
<div class="page-banner">
 
<h2 class="page-subtitle">Section &nbsp;&nbsp;/&nbsp;&nbsp; <span class="emphasis">Page</span></h2>
 
<h2 class="toggle-button">+ Press for Menu</h2>
 
</div>
 
</div>
 
  
<div class="progress-container">
+
        <div class="section-menu section-menu-up" id="section-menu">
<progress value="0" max="100" id="bar"></progress>
+
          <div class="sections" id="sections">
</div>
+
          </div>
 +
          <div class="back-to-top">
 +
          </div>
 +
        </div>
 +
      </div>
  
<div class="section-menu section-menu-up" id="section-menu">
 
<div class="sections" id="sections">
 
</div>
 
<div class="back-to-top">
 
<a class="goto-top" href="#">Back to Top</a>
 
</div>
 
</div>
 
</div>
 
  
<div class="desktop-banner-back">
+
      <div class="desktop-banner-back">
<div class="text-area">
+
        <div class="text-area">
<div class="page-banner">
+
          <div class="page-banner">
<h2 class="page-subtitle"><a href='https://2019.igem.org/Team:Calgary/Model'>MODELLING</a></h2>
+
            <h2 class="page-subtitle">Modelling</h2>
<h1 class="page-title">In Silico Emulsion System Verification</h1>
+
            <h1 class="page-title">In Silico Emulsion System Verification</h1>
</div>
+
          </div>
</div>
+
        </div>
<div class="overlap-area" id="overlap"></div>
+
        <div class="overlap-area" id="overlap"></div>
</div>
+
      </div>
  
<div class="interface-group">
+
      <div class="interface-group" id="interface">
<div class="desktop-section-menu" id="desktop-section-menu">
+
        <div class="menu-container" id="menu-container">
<div class="sections" id="desktop-sections">
+
          <div class="desktop-section-menu" id="desktop-section-menu">
</div>
+
            <div class="sections" id="desktop-sections">
<div class="back-to-top" id="go-top">
+
            </div>
<a class="goto-top" href="#">Back to Top</a>
+
            <div class="back-to-top" id="go-top">
</div>
+
            </div>
</div>
+
          </div>
<div class="content-area" id="textual-content">
+
        </div>
  
<div class="header-area">
 
<h1>Inspiration</h1>
 
<h2></h2>
 
</div>
 
  
<p>Using the water soluble chlorophyll binding protein(6GIX) for purification in an oil based product forced the team to understand the system and how we can mitigate the stress placed on the protein. To understand the stress placed on 6GIX the team aimed to observe the nanoscale dynamics of the protein within different systems. These observations are incredibly difficult and expensive to obtain within the laboratory, thus forcing us to look towards in silico measurements and observations instead. Through the use of molecular dynamic simulations and a supercomputer iGEM Calgary was able to look deeper into our proteins nanoscale struggles.  
+
        <div class="content-area" id="textual-content">
</p>
+
          <div class="header-area">
 +
            <h1>Inspiration</h1>
 +
          </div>
 +
          <p>In order for us to know whether it would be possible to use a water soluble chlorophyll-binding protein (6GIX) in an emulsion, it was necessary for us to assess the stress placed on the hydrophilic protein by being in an emulsion, which contains hydrophobic elements. To understand the stress placed on 6GIX, we investigated the nanoscale dynamics of the protein within different systems. These observations are incredibly difficult and expensive to obtain within the laboratory, so we turned towards in silico measurements and observations instead. Through the use of molecular dynamic simulations and a supercomputer, we were able to assess the nanoscale interactions of 6GIX with different components of the emulsion.  
 +
          </p>
  
<h1>Questions</h1>
+
          <h1>Questions</h1>
<br>
+
  
To guide our Molecular dynamic modelling we generated the four questions that would have the largest impact on our project.
+
          To guide our molecular dynamic modelling, we generated four questions that would have the largest impact on our project design.
<br>
+
          <br>
<br>
+
          <br>
<h2>Question 1: What are the dynamics of the protein in an aqueous environment?</h2>
+
          <h2>Question 1: </h2> <p>What are the dynamics of the protein in an aqueous environment?</p>
<br>
+
          <br>
<h2>Question 2: Would the individual monomers of the 6GIX protein aggregate to form the tetramer structure?</h2>
+
          <h2>Question 2:</h2>
<br>
+
          <p>Will the individual monomers of the 6GIX protein aggregate to form a tetramer structure?</p>
<h2>Question 3: How will the protein hold up in a non-aqueous environment?</h2>
+
          <br>
<br>
+
          <h2>Question 3:</h2>
<h2>Question 4: How comfortable is 6GIX in the emulsion on the nanoscale?</h2>
+
          <p>How will the protein hold up in a non-aqueous environment?</p>
 +
          <br>
 +
          <h2>Question 4:</h2>
 +
          <p>How comfortable is 6GIX in an emulsion on the nanoscale?</p>
 
<br>
 
<br>
 +
<h2>Question 5:</h2>
 +
          <p>How does the inclusion of secretion tags affect our protein folding and stability?</p>
  
<div class="header-area">
 
<h1>Methodology</h1>
 
<h2>How were these models generated?</h2>
 
</div>
 
 
<p>All simulations were conducted in GROMACS 18.1 on the cpu2019 partition of the ARC computing cluster at the University of Calgary.
 
Along with utilizing the same hardware over all simulations  a similar model generating method was used.
 
<br>
 
<b>Step 1. Converting Structure files into GROMACS files.</b>
 
The first step is to take the pdb files generated from structure prediction modelling or provided by the protein data bank. Then after converting the previous file to a .gro file we place it in a theoretical cube with the dimensions set to fit around the protein with at least a nanometer of padding in all dimensions.
 
<br>
 
<br>
 
<b>Step 2. Solvate the Box.</b>
 
Now that the protein is in a theoretical box we can systematically place solvent molecules to fill it. In the case of two solvents they are completed one at a time.
 
<br>
 
<br>
 
<b>Step 3. Run Energy Minimization.</b>
 
Now that the box has been solvated we will now run energy minimization to ensure a stable system for the future steps. This is done through a relaxation stage known as energy minimization to ensure no catastrophic issues with the system. This step also makes sure that solvents are realistically aligned for the next steps. This step usually takes 2 hours
 
<br>
 
<br>
 
<b>Step 4. Running Isothermal-Isochoric Equilibration.</b>
 
This step looks at ensuring that the solvent and our protein are stable together at the temperature we are looking to simulate. This step usually takes 4 hours
 
<br>
 
<br>
 
<b>Step 5. Running Isothermal-Isobaric Equilibration.</b>
 
This step is much like the previous step but it accounts for the pressure and density instead of temperature. This step usually takes 4 hours
 
<br>
 
<br>
 
<b>Step 6. Atom by Atom Molecular Dynamics.</b>
 
Now that the system has been equilibrated we will run molecular dynamics. This simulates the atom by atom movement of the solvated box for a given amount of time. This step will generate a file containing the trajectories for every atom of the simulation. Due to the extreme computational requirements of this step, molecular dynamics are computed for nanoseconds at a time. This step takes 23 hours per nanosecond.
 
<br>
 
<br>
 
<b>Step 7.Visualize the Trajectory.</b>
 
For visualizing the dynamics of the protein we have to take the large numeric files and make them understandable to our wetlab. To make these visualizations our team used VMD and Pymol. For other teams using this strategy, Pymol is easier to install and the more user friendly option. Though it being harder to use and install VMD also is very powerful for visualization and has many other applications.
 
<br>
 
<br>
 
      </p>
 
  
<div class="header-area">
+
          <div class="header-area">
<h1>Results and Wetlab Integration</h1>
+
            <h1>Methodology</h1>
<h2>Where and how our models were applied</h2>
+
            <h2>How were these models generated?</h2>
</div>
+
          </div>
<p>
+
 
<br>
+
          <p>All simulations were conducted in GROMACS 18.1 on the cpu2019 partition of the ARC computing cluster at the University of Calgary.
The models generated were able to answer the key questions our wetlab had about the nanoscale properties of the protein.
+
            A similar model generating method was used utilizing the same hardware.
<br><br>
+
          </p>
<b>Question 1. </b>
+
          <b>Step 1. Converting Structure files into GROMACS files.</b>
The tetramer structure was maintained and the protein was stable throughout the nanosecond simulation.  
+
          The pdb files generated from structure prediction modelling or provided by the protein data bank were converted to a .gro file. Afterwards, we placed it in a theoretical cube with the dimensions set to fit around the protein with at least a nanometer of padding in all dimensions.
<br>
+
          <br>
<img style="width: 100%" src="
+
          <br>
https://static.igem.org/mediawiki/2019/8/86/T--Calgary--Question1.gif"></img>
+
          <b>Step 2. Solvate the Box.</b>
 +
          Once the protein was in a theoretical box we systematically placed solvent molecules to fill the box. If two solvents are necessary, they can be completed one at a time.
 +
          <br>
 +
          <br>
 +
          <b>Step 3. Run Energy Minimization.</b>
 +
          Once the box had been solvated we ran energy minimization to ensure a stable system for the future steps. This is done through a relaxation stage known as energy minimization to ensure no catastrophic issues with the system. This step also makes sure that solvents are realistically aligned for the next steps. This step usually takes 2 hours.
 +
          <br>
 +
          <br>
 +
          <b>Step 4. Running Isothermal-Isochoric Equilibration.</b>
 +
          This step looks at ensuring that the solvent and our protein are stable together at the temperature we are looking to simulate. This step usually takes 4 hours.
 +
          <br>
 +
          <br>
 +
          <b>Step 5. Running Isothermal-Isobaric Equilibration.</b>
 +
          This step is similar to the previous step, but it accounts for the pressure and density instead of temperature. This step usually takes 4 hours.
 +
          <br>
 +
          <br>
 +
          <b>Step 6. Atom by Atom Molecular Dynamics.</b>
 +
          Once the system was equilibrated, we ran molecular dynamics. This simulates the atom by atom movement of the solvated box for a given amount of time. This step will generate a file containing the trajectories for every atom of the simulation. Due to the extreme computational requirements of this step, molecular dynamics are computed for nanoseconds at a time. This step takes 23 hours per nanosecond.
 +
          <br>
 +
          <br>
 +
          <b>Step 7.Visualize the Trajectory.</b>
 +
          In order to visualize the dynamics of the protein we had to take the large numeric files and make them understandable to our wetlab. To make these visualizations, our team used VMD and Pymol. We recommend using Pymol, as it is easier to install and more user-friendly. Although it is harder to install and use, VMD also is very powerful for visualization and has many other applications.
 +
          <br>
 +
          </p>
 +
        <br><p><b>Assumptions</b><br><br>
 +
        Assumption 1. We assume that the water molecule model(spc216) is representative of what would happen in the real world.<br><br>
 +
        Assumption 2. In modelling 6GIX we consistently make the assumption about its structure. Particularly we assume that the structure provided on the protein data bank sufficiently constitutes the actual nano scale structure of the protein. <br><br>
 +
        Assumption 3. Other assumptions made in these models are associated with the use of Ewald Electrostatics and Verlet cutoff schemes. These determine how we model the flow of force in the system. Due to these being commonly used in industry we were comfortable with including these assumptions within the model.<br><br>
 +
        Assumption 4. The largest assumption for our models was the time frame in which we simulated for. At times within the nanosecond range we truly see a very small snapshot of the total dynamics for the protein. Due to the nanoscale size of the protein however we are comfortable with the use of these time frames.
 +
        </p>
 +
 
 +
        <div class="header-area">
 +
          <h1>Results and Wet-lab Integration</h1>
 +
          <h2>Where and how our models were applied</h2>
 +
        </div>
 +
        <p>
 +
          <br>
 +
          The models generated were able to answer the key questions our wetlab had about the nanoscale properties of the protein.
 +
          <br>
 +
        <p><b>Question 1. </b>What are the dynamics of the protein in an aqueous environment?</p>
 +
        The tetramer structure was maintained and the protein was stable throughout the nanosecond simulation.  
 +
        <br></p>
 +
      <img style="width: 100%" src="
 +
                                    https://static.igem.org/mediawiki/2019/8/86/T--Calgary--Question1.gif"></img>
 +
    <p style="text-align: center ;">Figure 1: 6GIX in water</p>
  
<br>
+
    This enabled our team to understand the stability of our protein in water and the stability of the tetramer structure. This model was also the easiest to accomplish, which allowed our dry lab to become more familiar with the software
This enabled our team to understand the stability of our protein in water and the stability of the tetramer structure. Along with the gained understanding this model is also the easiest to accomplish, allowing for a teams drylab to gain familiarity with the software.  
+
    <br><br>
 +
    <p><b>Question 2.</b>Will the individual monomers of the 6GIX protein aggregate to form a tetramer structure?</p>
 +
    This model showed that the protein monomer's long-range forces allowed for the formation a tetramer when individual monomers came close enough in contact. We also found that the protein was less likely to form a tetramer without chlorophyll binding.
 +
    <br></p>
 +
  <img style="width: 100%" src="https://static.igem.org/mediawiki/2019/3/31/T--Calgary--Question2.gif
 +
                                "></img>
 +
<p style="text-align: center ;">Figure 2: 6GIX monomers in a 20nm cube of water. Monomers in this range will aggregate over time to form a tetramer.</p>
 +
 
 +
This model reassured our team that it would function to bind chlorophyll in an emulsion, because it would not immediately aggregate upon protein folding, thereby inhibiting chlorophyll binding. This model was extremely complex and required over 80 hours of computing time.  
 
<br><br>
 
<br><br>
<b>Question 2.</b>
+
<p><b>Question 3.</b>How will the protein hold up in a non-aqueous environment?</p>
This model showed that the protein monomers long range forces did form a tetramer when close but was less likely to form long range without chlorophyll.
+
From this model, we discovered that 6GIX in oleic acid, a non-polar solvent, loses its tetramer structure, and each individual monomer denatures. </p>
<br>
+
<img style="width: 100%" src="https://static.igem.org/mediawiki/2019/7/72/T--Calgary--Question3.gif
<img style="width: 100%" src="https://static.igem.org/mediawiki/2019/3/31/T--Calgary--Question2.gif
+
                              "></img><p style="text-align: center ;">Figure 3: 6GIX denatured in oleic acid, a nonpolar solvent</p>
"></img>
+
<br>
+
  
This model ensured the teams use of the protein was founded as it would not immediately aggregate, thereby inhibiting chlorophyll binding. This model was extremely complex and required over 80 hours of computing time. For this reason we recommend that this model be completed after gaining experience with the software for others attempting it.
+
This confirmed our expectations that a nonpolar solvent, such as canola oil, can be harmful to 6GIX's structure. This model emphasizes the importance of keeping 6GIX out of the oil phase of the emulsion for proper function.  
 
<br><br>
 
<br><br>
<b>Question 3.</b>
+
<p><b>Question 4. </b>How comfortable is 6GIX in an emulsion on the nanoscale?</p>
From this model we discovered that when placed in a nonpolar solvent the 6GIX protein is ripped apart from its tetramer structure and the monomers are observed to denature.
+
The bisolvent system simulation showed that the 6GIX protein was comfortable in the emulsion at the nanoscale. The protein remains folded and fully functional within the aqueous layer.  
<br><img style="width: 100%" src="https://static.igem.org/mediawiki/2019/7/72/T--Calgary--Question3.gif
+
</p>
"></img>
+
<img style="width: 100%" src="https://static.igem.org/mediawiki/2019/e/e8/T--Calgary--Question4.gif
<br>
+
                              "></img><p style="text-align: center ;">Figure 4: 6GIX in an emulsion. The protein remains in the aqueous center of the model.</p>
This demonstrated for our wetlab how problematic nonpolar solvent can be to 6GIX. This model really put how important it was for the team to ensure we protect the proteins from any unnecessary contact with the canola oil.  
+
 
<br><br>
 
<br><br>
<b>Question 4. </b>
+
<p><b>Question 5. </b>How does the inclusion of excretion tags affect our protein folding and stability?</p>
The bisolvent system simulation showed that the 6GIX protein was comfortable in the emulsion at the nanoscale. This means that for our system the 6GIX protein should be comfortable within the emulsion bubbles.
+
To determine if the inclusion of a secretion tag would be acceptable for our protein we generated a dynamics simulation for each of our secretion tags. From the generated models we looked at the root mean square deviation(RMSD) for the entire simulation. This gave a way for us to understand how the tagged protein stacked up against the 6GIX wild type. The proteins with the secretion tag was observed to have lower deviation when undergoing dynamic simulation as seen below. Due to this indication of stability we were comfortable with the use of a secretion tag for our 6GIX protein.
<br>
+
</p>
<img style="width: 100%" src="https://static.igem.org/mediawiki/2019/e/e8/T--Calgary--Question4.gif
+
<img style="width: 50%" src="https://static.igem.org/mediawiki/2019/8/86/T--Calgary--RMSDTAT.png"></img><p style="text-align: center ;">Figure 5: Root mean square deviation (RMSD) for 6GIX versus 6GIX with the Pho secretion tag.</p>
"></img>
+
<br>
+
  
From the emulsion bubble model our teams plan to use the emulsion system for chlorophyll degradation was verified.
+
All together, these models helped us understand the response 6GIX has to its given environment, and helped verify the molecular-scale viability of 6GIX in an emulsion.  
  
All together these models assisted the team in understanding the response a protein may have to its given environment. These knowledge then integrated into every facet of the project design. With a nanoscale understanding of our system iGEM Calgary was able to design and carryout yOIL in such a way to get the maximum out of our protein.
 
  
 
</p>
 
</p>
  
</div>
+
<p> The impact of this modelling can be felt within multiple aspects of our project. The parts that directly benefitted from this modelling are our 6GIX protein<a href="http://parts.igem.org/Part:BBa_K3114006" target="_blank">(BBa_K3114006)</a>, our ModGIX protein<a href="http://parts.igem.org/Part:BBa_K3114007" target="_blank">(BBa_K3114007)</a>, and our 6GIX protein with several secretion tags<a href="http://parts.igem.org/Part:BBa_K3114016" target="_blank">(BBa_K3114016)</a> <a href="http://parts.igem.org/Part:BBa_K3114017" target="_blank">(BBa_K3114017)</a> <a href="http://parts.igem.org/Part:BBa_K3114018" target="_blank">(BBa_K3114018)</a> <a href="http://parts.igem.org/Part:BBa_K3114019" target="_blank">(BBa_K3114019)</a> <a href="http://parts.igem.org/Part:BBa_K3114020" target="_blank">(BBa_K31140020)</a> <a href="http://parts.igem.org/Part:BBa_K3114021" target="_blank">(BBa_K3114021)</a> <a href="http://parts.igem.org/Part:BBa_K3114022" target="_blank">(BBa_K3114022)</a> <a href="http://parts.igem.org/Part:BBa_K3114023" target="_blank">(BBa_K3114023)</a>. Taken in full these models informed the generation of several proteins and the system in which they were implemented.
</div>
+
 
</div>
+
</p>
 
+
 
<div class="footer">
+
<div class="header-area">
        <div class="wiki-section">
+
  <h1>Future Directions</h1>
            <h2 class="section-title">Team</h2>
+
 
            <ul class="section-links">
+
</div>
                <a href="">Members</a>
+
<p>
                <a href="">Attributions</a>
+
When looking to model the dynamics of our system in the future, the two largest criteria for growth are in the time simulated for, and the size of the system we are simulating. To accomplish growth in these areas we will need to gain access to larger computational resources and gain more familiarity with the software.
                <a href="">Sponsors</a>
+
<br><br>
                <a href="">Members</a>
+
In addition to the increased time and size of our simulations, there is another simulation that we believe would greatly inform the project. This simulation would be to determine the ligand binding potential for 6GIX with chlorophyll. This simulation is achievable with our current resources and will be possible as we grow more confidence with GROMACS.
            </ul>
+
</p>
        </div>
+
 
        <div class="wiki-section">
+
 
            <h2 class="section-title">Journal</h2>
+
 
            <ul class="section-links">
+
 
                <a href="">Entries</a>
+
<div class="header-area">
                <a href="">Protocols</a>
+
  <h1>References</h1>
            </ul>
+
 
        </div>
+
</div>
        <div class="wiki-section">
+
 
            <h2 class="section-title">Project</h2>
+
<p class="hangingindent">Palm, D. M., Agostini, A., Averesch, V., Girr, P., Werwie, M., Takahashi, S., . . . Paulsen, H. (2018). Chlorophyll a/b binding-specificity in water-soluble chlorophyll protein. Nature Plants,4(11), 920-929. doi:10.1038/s41477-018-0273-z</p>
            <ul class="section-links">
+
<p class="hangingindent">M.J. Abraham, D. van der Spoel, E. Lindahl, B. Hess, and the GROMACS development team, GROMACS User Manual version 2018, www.gromacs.org(2018)</p>
                <a href="">Description</a>
+
<p class="hangingindent">The PyMOL Molecular Graphics System, Version 2.0 Schrödinger, LLC, 2019, www.pymol.org </p>
                <a href="">Modelling</a>
+
<p class="hangingindent">Humphrey, W., Dalke, A. and Schulten, K., "VMD - Visual Molecular Dynamics", J. Molec. Graphics, 1996, vol. 14, pp. 33-38.</p>
                <a href="">Software</a>
+
 
                <a href="">Experiments</a>
+
</div>
            </ul>
+
</div>
        </div>
+
</div>
        <div class="wiki-section">
+
            <h2 class="section-title">Results</h2>
+
            <ul class="section-links">
+
                <a href="">Demonstration</a>
+
                <a href="">Improvements</a>
+
            </ul>
+
        </div>
+
        <div class="wiki-section">
+
            <h2 class="section-title">Human Practices</h2>
+
            <ul class="section-links">
+
                <a href="">Education</a>
+
                <a href="">Integrated Practices</a>
+
            </ul>
+
        </div>
+
        <div class="wiki-section">
+
            <h2 class="section-title">Connect</h2>
+
            <ul class="section-links">
+
                <a href="">Facebook</a>
+
                <a href="">Instagram</a>
+
                <a href="">LinkedIn</a>
+
                <a href="">Twitter</a>
+
            </ul>
+
        </div>
+
    </div>
+
  
 
</body>
 
</body>
 
</html>
 
</html>
 +
{{Calgary/Footer}}

Latest revision as of 23:34, 21 October 2019

Modelling

In Silico Emulsion System Verification

Inspiration

In order for us to know whether it would be possible to use a water soluble chlorophyll-binding protein (6GIX) in an emulsion, it was necessary for us to assess the stress placed on the hydrophilic protein by being in an emulsion, which contains hydrophobic elements. To understand the stress placed on 6GIX, we investigated the nanoscale dynamics of the protein within different systems. These observations are incredibly difficult and expensive to obtain within the laboratory, so we turned towards in silico measurements and observations instead. Through the use of molecular dynamic simulations and a supercomputer, we were able to assess the nanoscale interactions of 6GIX with different components of the emulsion.

Questions

To guide our molecular dynamic modelling, we generated four questions that would have the largest impact on our project design.

Question 1:

What are the dynamics of the protein in an aqueous environment?


Question 2:

Will the individual monomers of the 6GIX protein aggregate to form a tetramer structure?


Question 3:

How will the protein hold up in a non-aqueous environment?


Question 4:

How comfortable is 6GIX in an emulsion on the nanoscale?


Question 5:

How does the inclusion of secretion tags affect our protein folding and stability?

Methodology

How were these models generated?

All simulations were conducted in GROMACS 18.1 on the cpu2019 partition of the ARC computing cluster at the University of Calgary. A similar model generating method was used utilizing the same hardware.

Step 1. Converting Structure files into GROMACS files. The pdb files generated from structure prediction modelling or provided by the protein data bank were converted to a .gro file. Afterwards, we placed it in a theoretical cube with the dimensions set to fit around the protein with at least a nanometer of padding in all dimensions.

Step 2. Solvate the Box. Once the protein was in a theoretical box we systematically placed solvent molecules to fill the box. If two solvents are necessary, they can be completed one at a time.

Step 3. Run Energy Minimization. Once the box had been solvated we ran energy minimization to ensure a stable system for the future steps. This is done through a relaxation stage known as energy minimization to ensure no catastrophic issues with the system. This step also makes sure that solvents are realistically aligned for the next steps. This step usually takes 2 hours.

Step 4. Running Isothermal-Isochoric Equilibration. This step looks at ensuring that the solvent and our protein are stable together at the temperature we are looking to simulate. This step usually takes 4 hours.

Step 5. Running Isothermal-Isobaric Equilibration. This step is similar to the previous step, but it accounts for the pressure and density instead of temperature. This step usually takes 4 hours.

Step 6. Atom by Atom Molecular Dynamics. Once the system was equilibrated, we ran molecular dynamics. This simulates the atom by atom movement of the solvated box for a given amount of time. This step will generate a file containing the trajectories for every atom of the simulation. Due to the extreme computational requirements of this step, molecular dynamics are computed for nanoseconds at a time. This step takes 23 hours per nanosecond.

Step 7.Visualize the Trajectory. In order to visualize the dynamics of the protein we had to take the large numeric files and make them understandable to our wetlab. To make these visualizations, our team used VMD and Pymol. We recommend using Pymol, as it is easier to install and more user-friendly. Although it is harder to install and use, VMD also is very powerful for visualization and has many other applications.


Assumptions

Assumption 1. We assume that the water molecule model(spc216) is representative of what would happen in the real world.

Assumption 2. In modelling 6GIX we consistently make the assumption about its structure. Particularly we assume that the structure provided on the protein data bank sufficiently constitutes the actual nano scale structure of the protein.

Assumption 3. Other assumptions made in these models are associated with the use of Ewald Electrostatics and Verlet cutoff schemes. These determine how we model the flow of force in the system. Due to these being commonly used in industry we were comfortable with including these assumptions within the model.

Assumption 4. The largest assumption for our models was the time frame in which we simulated for. At times within the nanosecond range we truly see a very small snapshot of the total dynamics for the protein. Due to the nanoscale size of the protein however we are comfortable with the use of these time frames.

Results and Wet-lab Integration

Where and how our models were applied


The models generated were able to answer the key questions our wetlab had about the nanoscale properties of the protein.

Question 1. What are the dynamics of the protein in an aqueous environment?

The tetramer structure was maintained and the protein was stable throughout the nanosecond simulation.

Figure 1: 6GIX in water

This enabled our team to understand the stability of our protein in water and the stability of the tetramer structure. This model was also the easiest to accomplish, which allowed our dry lab to become more familiar with the software.

Question 2.Will the individual monomers of the 6GIX protein aggregate to form a tetramer structure?

This model showed that the protein monomer's long-range forces allowed for the formation a tetramer when individual monomers came close enough in contact. We also found that the protein was less likely to form a tetramer without chlorophyll binding.

Figure 2: 6GIX monomers in a 20nm cube of water. Monomers in this range will aggregate over time to form a tetramer.

This model reassured our team that it would function to bind chlorophyll in an emulsion, because it would not immediately aggregate upon protein folding, thereby inhibiting chlorophyll binding. This model was extremely complex and required over 80 hours of computing time.

Question 3.How will the protein hold up in a non-aqueous environment?

From this model, we discovered that 6GIX in oleic acid, a non-polar solvent, loses its tetramer structure, and each individual monomer denatures.

Figure 3: 6GIX denatured in oleic acid, a nonpolar solvent

This confirmed our expectations that a nonpolar solvent, such as canola oil, can be harmful to 6GIX's structure. This model emphasizes the importance of keeping 6GIX out of the oil phase of the emulsion for proper function.

Question 4. How comfortable is 6GIX in an emulsion on the nanoscale?

The bisolvent system simulation showed that the 6GIX protein was comfortable in the emulsion at the nanoscale. The protein remains folded and fully functional within the aqueous layer.

Figure 4: 6GIX in an emulsion. The protein remains in the aqueous center of the model.



Question 5. How does the inclusion of excretion tags affect our protein folding and stability?

To determine if the inclusion of a secretion tag would be acceptable for our protein we generated a dynamics simulation for each of our secretion tags. From the generated models we looked at the root mean square deviation(RMSD) for the entire simulation. This gave a way for us to understand how the tagged protein stacked up against the 6GIX wild type. The proteins with the secretion tag was observed to have lower deviation when undergoing dynamic simulation as seen below. Due to this indication of stability we were comfortable with the use of a secretion tag for our 6GIX protein.

Figure 5: Root mean square deviation (RMSD) for 6GIX versus 6GIX with the Pho secretion tag.

All together, these models helped us understand the response 6GIX has to its given environment, and helped verify the molecular-scale viability of 6GIX in an emulsion.

The impact of this modelling can be felt within multiple aspects of our project. The parts that directly benefitted from this modelling are our 6GIX protein(BBa_K3114006), our ModGIX protein(BBa_K3114007), and our 6GIX protein with several secretion tags(BBa_K3114016) (BBa_K3114017) (BBa_K3114018) (BBa_K3114019) (BBa_K31140020) (BBa_K3114021) (BBa_K3114022) (BBa_K3114023). Taken in full these models informed the generation of several proteins and the system in which they were implemented.

Future Directions

When looking to model the dynamics of our system in the future, the two largest criteria for growth are in the time simulated for, and the size of the system we are simulating. To accomplish growth in these areas we will need to gain access to larger computational resources and gain more familiarity with the software.

In addition to the increased time and size of our simulations, there is another simulation that we believe would greatly inform the project. This simulation would be to determine the ligand binding potential for 6GIX with chlorophyll. This simulation is achievable with our current resources and will be possible as we grow more confidence with GROMACS.

References

Palm, D. M., Agostini, A., Averesch, V., Girr, P., Werwie, M., Takahashi, S., . . . Paulsen, H. (2018). Chlorophyll a/b binding-specificity in water-soluble chlorophyll protein. Nature Plants,4(11), 920-929. doi:10.1038/s41477-018-0273-z

M.J. Abraham, D. van der Spoel, E. Lindahl, B. Hess, and the GROMACS development team, GROMACS User Manual version 2018, www.gromacs.org(2018)

The PyMOL Molecular Graphics System, Version 2.0 Schrödinger, LLC, 2019, www.pymol.org

Humphrey, W., Dalke, A. and Schulten, K., "VMD - Visual Molecular Dynamics", J. Molec. Graphics, 1996, vol. 14, pp. 33-38.