Gromacs/C2/Creating-Input-Files/English

From Script | Spoken-Tutorial
Revision as of 19:30, 21 October 2021 by Ranipv076 (Talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search
Visual Cue Narration
Slide Number 1

Title Slide

Welcome to the spoken tutorial on Creating Input Files.
Slide Number 2

Learning Objectives

In this tutorial, we will learn about,
  • Flow chart to setup a MD run for a protein in a solvent box
  • Delete water molecules present in the starting pdb file
  • pdb2gmx command and syntax
  • Generate .gro and .top files
Slide Number 3

Learning Objectives

* About force field and selecting a force field
  • Generate the topology file
  • editconf command


Slide Number 4

System and Software Requirement

To record this tutorial, I am using
  • Ubuntu Linux v20.04 OS
  • Firefox web browser v92
  • gedit 3.36
  • Gromacs v2021.2
Slide Number 5

Pre-requisites

https://spoken-tutorial.org

To follow this tutorial,
  • Learner must be familiar with basic computer skills.
  • For pre-requisite tutorials, please visit this site.
Slide Number 6

Code Files

* Files used in this tutorial are provided in the code files link.
  • Please download and extract the files.
  • Make a copy and then use them while practising.


Open a web browser and go to https://manual.gromacs.org/archive/5.0/online/flow.html . Open a web browser and go to the Gromacs link as seen here.

The flow chart here, shows the procedure for MD and data analysis.

Cursor on the flow chart. I will demonstrate parts of this procedure till the addition of a bounding box.
Press Ctrl, Alt and T keys together. Open a terminal by pressing Ctrl, Alt and T keys together.

Windows users can open the Ubuntu 20.04 app.

Type mkdir firstmd and press Enter. Type mkdir space firstmd to create a new directory.
Type cd firstmd and press Enter. Change path to the newly created firstmd directory with the cd command.
Open the File Manager. Open the File Manager and go to the firstmd directory.
Let’s consolidate all our input files here.

From now on, I will not mention this separately.

Slide Number 7

FIle path for Windows Users

* Windows users must find the path of this directory in Windows.
  • Detailed instructions are provided with the tutorial.
  • Use this path when locating, opening or moving files, which are saved to here.
Copy 1AKI.pdb to the firstmd directory. Copy the provided 1AKI.pdb input file to this directory.
Open the 1AKI.pdb file in a text editor. Open the 1AKI.pdb file in a text editor.
Scroll to the end of the file. Scroll to the bottom of the file.

In many files downloaded from the PDB website, water molecules will be present.

Show water molecules HOH from 1AKI.pdb file. They are denoted as HOH and present at the end, before the model ends.
Highlight the HOH from 1AKI.pdb. If so, be sure to delete them and save the file, before proceeding further.
select all the HOH containing rows molecules and delete them. Let’s select all the lines starting with HOH molecules and delete them.

Lysozyme is well studied, stable and a model protein.

Hence we will also use it for learning the steps of protein MD for beginners.

Show absence of H atom in the file. Notice that hydrogens atoms are absent in this structure file.

We will not add them.

The Gromacs commands will add the hydrogens for the user later on.

Save the file and quit the text editor. Now, save the file and quit the text editor.
Structures determined by NMR methods may not contain the water molecules.
Annotation: Windows users use appropriate path in your system. Let’s learn a few terminal commands to generate input files for MD.
Type gmx pdb2gmx -f 1AKI.pdb -o first.gro . All the commands start with the phrase gmx which calls Gromacs.

Enter the commands as seen and retain the syntax.

Highlight pdb2gmx. The second word tells Gromacs, what command is to be performed.

Here, it is pdb2gmx, which will convert the pdb file to gro file input file format.

Highlight -f 1AKI.pdb . The hyphen f flag denotes the input file and it is 1AKI.pdb.
Press Enter. The hyphen o flag refers to the output file and I will name it first.gro.

Press the Enter key and we will enter an interactive mode.

Cursor on the force field and publication details selection. Selection of force field and solvent addition is interactive in this command.

First, several forcefieds are listed, for the user to pick from.

Cursor on the details page. The details off publication for each of the force field is also listed.

To learn further, learner is referred to the publication listed.

Type 4 as seen here and press the Enter key. Learner must know their system to select the number for the desired force field.

Lysozyme, is a protein without unnatural modifications.

Hence I want to select AMBER94.

To select AMBER, type 4 as seen here and press the Enter key.

Cursor and highlight the response on the terminal. Gromacs returns the force field that will be used as amber99.ff.

By convention, .ff denotes force field files and the path of file is also seen.

Highlight solvent environment options.


Type 2 and press Enter.

Next, the user is prompted to add a water model to the system.

Type 2 for tip4p and press the Enter key.

Sometimes, users desire for different solvent system to be added.

If so, create a topology file and add it to the library.

Show file manager with the newly created files. Open the file manager and notice the newly created files.
Cursor on the file manager showing the newly generated files. Three files, with .gro, .top and .itp extension are created with this command.

.top is the topology file and .itp is include topology file.

Open first.gro in text editor. Let’s open the generated first.gro file that gets created in a text editor.

Notice that, this command has also added hydrogens to the file.

Cursor on 4-6 columns. The 4th, 5th and 6th columns are the atomic coordinate positions of the atom.
Close the firstmd.gro file.

Press Ctrl+L.

Let’s close the firstmd.gro file.

I will clear the terminal screen for clarity in the video.

Show Gromacs manual website on getting started.

https://manual.gromacs.org/archive/5.0.4/online/getting_started.html#setup

Learner may go through the Gromacs website on getting started.

The link is shown here.

Type gmx editconf -f first.gro -o box-added.gro -c -d 1 -bt cubic. Next, enter the editconf command, which adds a solvent box to the system.

This is to define the volume where the simulation will be performed.

Editconf does not create a new conformational state for the protein.

Cursor on -f and -o flag. This command also generates .gro file like the editconf command.

Again -f denotes input file and -o denotes output file.

Type -c . Cubic and hexagonal shapes are popularly used for the bounding box.

The hyphen c flag places the center of the molecule in the center box.

Type -d 1 . The -d flag denotes distance from the

center of the box to the boundary.

Here 1 denotes 1nm, which is half of the length of the box.

Increasing the box size will considerably lengthen the MD run time.

If using lysozyme, the max length for lysozyme can be of the order of 90 A.

So the box length is a little bigger than the molecule under study.

Type -bt cubic . Next, enter the shape of the box, which I will enter to be of cubic shape.

If another shape is desired, use the -bt flag with the shape name.

Press Enter to run the command. Let’s run the command.

This generates another input .gro file, with box size and sets file names.

Cursor on the details. The box volume that is created is displayed in cubic nanometers.
The extent of shift of the coordinates is also mentioned in nanometer.
Show screenshot of box-added.gro. Let’s open the box-added.gro in a text editor.

Notice that, the coordinates have shifted to accommodate aligning in the box.

Close the text editor. Learner may pause the video, and explore the options further.

Close the text editor window.

Slide Number

Summary 8

Now let’s summarize. In this tutorial, we learned about,
  • Steps in a MD run for a protein in a solvent box
  • The pdb2gmx command and syntax
  • About adding a force field
  • Generated .gro and .top files
Slide Number

Summary 9

  • About editconf command
  • Define the size of a bounding box
  • Set the position of the molecule in the box
Slide Number

Assignment 10

For assignment activity, please do the following.
  • Use a PDB ID of another protein of your choice.
  • Practice the commands learnt in this tutorial.
  • Read publications of your choice which use Gromacs.
Slide Number

Assignment 11

http://www.mdtutorials.com/gmx/

* Go to the website seen below.
  • Choose a tutorial of your choice and read it
Slide Number 12

Spoken Tutorial Project

This video summarises the Spoken Tutorial Project.

Please download and watch it.

Slide Number 13

Spoken Tutorial workshops

We conduct workshops using spoken tutorials and give certificates.

Please write to us.

Slide Number 14

Forum for questions

Post your timed queries in this forum.
Slide Number 15

Acknowledgment

Spoken Tutorial Project is funded by MoE, Government of India.
This is Rani from IIT, Bombay. Thank you for joining.

Contributors and Content Editors

Ranipv076, Snehalathak