| GRID manual | ||
|---|---|---|
| <<< Previous | Next >>> | |
Chapter 55. Tutorial 01
This is a detailed tutorial for Users when they first start to use the Molecular Discovery Programmes. It contains worked examples of Grid computations. We suggest that all Users should also read the original paper about Programme GRID, which was published in the Journal of Medicinal Chemistry (1985). Volume 28. pp 849-857. Of course all the energy functions and parameters have changed many times since that paper was published, but it still gives the best overall view of the Grid method. We suggest all Users to read also the paper published in Nature, Volume 363, pp 418-423 (1993).
55.1. How to start
Programme GRIN is used first, in order to prepare and check the input data for the main Programme GRID. It is essential to use GRIN before GRID. There are two alternative ways to get started:
1. You can run GRIN and GRID individually, or
2. You can run Programmes GRIN and GRID under the control of a supervising Programme called GREAT.
We suggest that you start with Programme GREAT. The organisation of operations will then be:

The primary input for Programmes GREAT and GRIN will be the xyz coordinate positions of the atoms in the "Target" (macro)molecule. This is the molecule or collection of molecules which you wish to study. Your input file containing the coordinates of the Target can be converted by Programme GREAT into the internationally recognised format specified for the Protein Data Bank (PDB) at Brookhaven. The converted input file is called a Protein Data Bank file (a PDB file in pdb format). The coordinates must be in PDB format before they can be used as input for Programme GRIN.
To begin with we have supplied a Protein Data Bank file called PDB.pdb on the Tutorial01 directory. You therefore do not have to worry about format conversions before doing your first trial run. Use your page editor to inspect the PDB.pdb file now. It is the 1LZ1.pdb protein structure determined by Artymiuk and Blake. Please note that the sequence of lines in 1LZ1.pdb is different, and the water coordinates are not there. More information on the structure is contained in the file pdb_header.txt.
You are going to begin with Programme GREAT which works like this:
It converts your input coordinates into pdb format (if they are not already in that format).
It checks the pdb file.
It instructs Programme GRIN to give two output files grinkout.kout and grinlout.lout
Programme GREAT gives the necessary instructions to Programme GRIN by writing a directive file grin.in
If anything unexpected happens during the GRIN run, a message will be sent to the lineprinter output file grinlout.lout
The other output file grinkout.kout is ready to be used as input for the main Programme GRID. The command file grin.in is a record of what you have done.
55.2. WORKED EXAMPLES
Go from the your working directory in the Tutorial01 directory:
cd Tutorial01
Your computer should be showing its usual prompt on the left-hand margin of the screen. You begin by typing:
Great (or /your_grid_directory/great)
and hitting the RETURN key. You will see this sort of menu:
WELCOME TO THE MOLECULAR DISCOVERY PROGRAMMES
*********************************************
MENU A
****** Do you want to use:-
(1) Programme GRIN (GRIN)
(2) Programme GRID (GRID)
(3) Reserved for future use (****)
(4) or do you want to prepare a pdb FILE (FILE)
(5) or do you want HELP or (HELP)
(6) do you want to use the operating system (SPAWN)
(7) or return to the operating system (ABANDON)
(8) or read the Regular MD Licence Terms (TERMS)
(9) or do you want to switch off this BELL (BELL)
ANSWER NOW: |
Programme GREAT works by asking you questions, and waiting for your answers. Most of the questions are collected into Menus, and you are looking at the first which is MENU A.
You must begin each new job with Programme GRIN. This is item 1 in Menu A. So hit 1 on the keyboard and then hit RETURN.
1 (press Return)
The screen will clear and you will be offered three main choices like this:
How do you want to start? Do you prefer to begin with:
(1) The original default filenames and parameter
values as described in the User Manual, or
(2) The filenames and parameter values which have been
saved in a previous command file, or
(3) The filenames and parameter values which are saved in
the latest copy of the command file named "grin.in" |
Programme GRIN is always controlled by a set of instructions in a File of Directives which is called grin.in. Programme GREAT is now going to prepare (or update) grin.in for you. In fact, you have been supplied with a copy of grin.in on the distribution, and it is already in your Tutorial01 directory as a template. It contains all the instructions for running Programme GRIN to do this first worked example. It is the latest copy of the command file named grin.in.
Therefore give the answer 3 and hit the RETURN key.
3 (press Return)
You will promptly see the next menu:
MAIN MENU B Preparing a command file: grin.in
*********** Hit a row number (and RETURN) to choose from the menu
Do you want to alter a filename, or do something else:
(1) PDB.pdb The input file with atom COORDINATES
(2) grinlout.lout The line-printer output file (GRINLOUT)
(3) grinkout.kout The coordinate output file (GRINKOUT)
(4) grin The name of the executable PROGRAM Grin
(5) The current directory will be used unless names start with /
(6) grub.dat The name of data-file GRUB
(7) grin.in The NAME of the new command file
(8) Do you want to alter something ELSE
(9) INSPECT or PREPARE grin.com or SUBMIT it ... or STOP
ANSWER NOW: |
Please study Main Menu B very carefully. If you find it confusing type HELP and hit RETURN. The lines numbered 1, 2, 3, 4, 6 and 7 should be quite straightforward. PDB.pdb and grin.in are files which are already in your Tutorial01 directory. We have already discussed them. grinlout.lout and grinkout.kout are the names of the output files which will be generated by Programme GRIN.
Insert the path of GRIN in Item 4, i.e. /your_grid_directory/grin.
Item 5 in MAIN MENU B means that all the files in the other Items will be taken from your current directory. That is what you want, because that is where the files like PDB.pdb and grin.in are now. That is also where you want the output in files grinlout.lout and grinkout.kout to appear.
Item 6 refers to the location of the grub.dat file, which contains the GRID parametrisation. Insert the appropriate path if necessary.
You are now completely ready to do your first GRIN run. This first GRIN run will use file PDB.pdb for the xyz coordinates of the Target, together with the appropriate Energy Parameters. At this stage you simply type the word SUBMIT and hit RETURN.
submit (press Return)
When you 'SUBMIT' a job, the directive file grin.in will tell Programme GRIN exactly what to do. A new version of the directive file must therefore be prepared for the new job, but the original version will not be thrown away unless you want that to happen.
You will be prompted about this. We suggest that you answer "YES" in order to keep a copy of the current directive file, because it is the original as supplied on the Tutorial01 directory. You will be asked to give it a new name, and you could rename it: grin.in.original.
As soon as you have saved the original file of directives, a new version will be prepared and written to disk. The new version will now be called grin.in and will have replaced the original file of that name. However that does not matter, because you have saved a copy of the original as grin.in.original.
You may also be asked if you want to keep a copy of the original version of the input file PDB.pdb.
Then you may be asked again if you are CERTAIN that your pdb file is absolutely correct, and in this case you can answer CERTAIN.
certain
The job will be submitted as a batch job in background mode as soon as you have answered all the questions. Then you will be asked to hit RETURN once or twice, and Menu A will reappear.
55.2.1. ASSESSING THE RESULTS FROM PROGRAMME GRIN
Now use your page editor (e.g. vi, jot, nedit, gedit) in order to study the output files grinlout.lout and grinkout.kout which have been produced by your first run with Programme GRIN. There may be a Message in grinlout.lout telling you the total electrostatic charge of the Target. In this case it has a positive charge, because the pdb file does not contain any counter-ions. (Programme GRIN tells you the charge of the Target, in case you want to add counter-ions before doing your GRID runs). Then grinlout.lout ends with the words:
*** THANK YOU FOR USING PROGRAM GRIN *** |
This shows that everything has worked correctly, because the message would have been less polite if you had made a mistake, or if there had been significant errors in PDB.pdb (In fact another pdb file with errors has been supplied on the tape. It is call PDB.dud and you can try using it later in order to see how Programme GRIN deals with input errors. Do not use it now while you are doing the example computations).
Notice in particular that:
The sequence of some lines in PDB.pdb has been shuffled, so that the lines in grinkout.kout are in the sequence defined by the conventions of the Protein Data Bank. For instance the first ATOM record in PDB.pdb was for a C-alpha atom CA, but protein chains really start with a nitrogen. This anomaly has therefore been corrected in the GRINKOUT file.
That some numbers in GRUB have been modified before transfer to grinkout.kout For example, the charge on the N-terminal nitrogen (first row of grinkout.kout) is much more positive than the charge specified by GRUB. This is because the GRUB charge is (by default) appropriate for a mid-chain amide-type nitrogen, but the N-terminal nitrogen of a protein has a positive charge since it is a cationic amine-type atom.
You have now completed you first GRIN run, and are ready to use Programme GRID. Well done!!
55.2.2. WORKED EXAMPLE WITH PROGRAMME GRID
It was necessary to use Programme GRIN before GRID, and by this time you should have completed the previous worked example. You should have generated a grinkout.kout file of about 1553 lines and more than 120 columns. That file will be the primary input for your first GRID run.
Begin by typing Great as before.
Great (or /your_grid_directory/great)
Then in menu A you should type 2 in order to use Programme GRID.
2 (press Return)
Give the answer 3 in the intermediate menu, in order to call up the filenames and parameter values which are saved in the latest copy of the directive file grid.in
3 (press Return)
Amongst other things it is necessary for the command file grid.in to specify the type of Probe which is to be used for your Grid run. The file is set-up for a water Probe (OH2), and you should now give the reply 'YES' in order to confirm that you do want to use water for your first Grid run.
Yes (press Return)
As soon as you have typed YES the Main Menu J will be displayed.
Make sure you understand items 1 to 6 in Menu J. Then start your first GRID run by typing SUBMIT.
submit (press Return)
The next thing is to give a title to your computation. You could give the answer: "First trial with GRID cage round a binding site for water" and hit RETURN.
First trial with GRID cage round a binding site for water (press Return).
Then give the final confirmation that you want the job to proceed by typing "YES".
yes (press Return)
You will be reminded that the original copy of grid.in may be overwritten and lost, and you should answer YES or NO. (If you just hit the RETURN key the original version of grid.in will be saved to a file called temp.saved).
As soon as you have answered the questions, grid.in will be submitted to the computer to run Programme GRID as a batch job in background mode. You will be asked to hit RETURN once or twice, and MENU A will be displayed again. You should then type the number 7 in Menu A in order to return to the operating system, and in a few seconds your GRID run will be completed.
55.2.3. ASSESSING THE RESULTS FROM PROGRAMME GRID
The supervising Programme GREAT will have created a new command file grid.in and you can use your page editor to inspect this command file while the GRID computation is running. These lines should appear near the end of the new grid.in file:
OH2
TOPX 28.000
TOPY 9.500
TOPZ 21.000
BOTX 24.000
BOTY 5.500
BOTZ 17.000
IEND
First trial with GRID cage round a binding site for water
0 1 |
They are interpreted like this:
You have been studying a complete protein, but your GRID map will be restricted to a relatively small part of the whole protein structure. This part is defined by the six directives TOPX .....BOTZ. These define a Grid CAGE whose top right back corner is at the position x=28.0 y=9.5 z=21.0 and whose bottom left front corner is at x=24.0 y=5.5 z=17.0. The positions are measured in Angstrom on the same coordinate axes which were used to define the atom positions in the pdb file. For your first job the Grid map will be computed for this restricted region alone.
Notice that there is quite a long list of directives in grid.in, starting with INPT and continuing through TOPX and ending with IEND. Of these, IEND is particularly important because it tells the computer that the list of directives has completely finished. The directive list must always finish with IEND as shown above. All the other directives (with the exception of the Probe) are optional, and you can read about them later in the MD User Manual.
"First trial with GRID cage round a binding site for water" is the title you typed in.
The next line contains 0 and 1. These are values for variables NZ1 and NZ2 which are described in the MD User Manual. You can always use 0 and 1 unless you are studying a very large Target on a small or slow computer.
OH2 is the symbol specifying the choice of Probe, which is water in this first Grid run. This symbol OH2 defines the properties of a water Probe, which can alternatively be defined by a set of eight or nine Energy Parameters. The Parameters are described in more detail below. The list of Probes is reported in the MD User Manual.
So you have been studying the interaction of water with an interesting part of a protein.
You have generated a Grid Map in file gridkont.kont and we will look at the map later.
The map is supplemented by some detailed tables of results in file gridlont.lont and we will look at them with the page editor as soon as the GRID job has finished.
Go to the end of file gridlont.lont and notice the most negative interaction energy.
tail gridlont.lont (press Return)
It is about -17 Kcal/mole and shows that water would be very strongly attracted to this place on the protein. Such a big value is not typical, because this region of the Target was specially selected for a demonstration run with the Programmes.
55.2.4. OUTPUT FROM GRID TO GRAPHICS – Programme GVIEW
From GRID version 19 Molecular Discovery Limited provides a graphic interface to display the output from Grid. The programme is called Gview. More information on Gview programme is reported in the MD User Manual - Gview.
Begin to use Gview by typing gview with the name of the grid kont file:
Gview gridkont.kont (or /your_grid_directory/gview gridkont.kont)
The Molecular Interaction Field produced by water probe will be reported on the screen together with the protein structure.The yellow region refers to interaction energies between water probe and the protein. This contour level has been automatically selected by Gview programme.
To change the energy level, select:
Edit->Field style
Move the cursor of the interaction energy levels up to -5.8, select cyan colour and then press Exit. Move the image pressing the left button of the mouse and moving the mouse at the same time. To import the macromolecule structure together with the water MIF, use the commands:
File->Open and select the PDB.pdb from the dialogue. Then press Open.
The water molecules will be displayed on the 3Dplot with red crosses. Please refer to the Gview manual page for more information about Gview settings.

Graphic visualisation and interpretation shows that a water molecule is trapped in a funnel-shape potential. The most probable position for water molecule is close to the narrow part at the bottom, but all the positions into the blue region can be considered potentially water-populated. Thus, the yellow region could also represent a dynamic image of a water-protein interaction in this particular location of the macromolecule.
The most negative energy occurs at grid point coordinates which are close to 25.750, 7.500 and 19.750 Angstrom. Use your page editor to search through the file gridlont.lont and you will find a detailed table of results.
This file lists, plane by plane, the atoms of the Target which are nearest to the most favourable position for the Probe. Search the plane 12. The asociated Table shows that the nearest atom is water molecule number 1067* which is actually 0.401 Angstrom from the best probe position. The asterisk in 1067* shows that the water molecules was NOT in fact considered as part of the Target when the Grid run was done. The X-ray crystallographer observed a water molecule at x=25.454 y=7.770 z=19.767 but Programme Grid took no account of that particular X-ray observation. Quite independently, the Grid computation found that the grid point at x=25.750 y=7.500 z=19.750 would be the best position when the grid points were spaced at 0.33 Angstrom.
With a closer Grid spacing of 0.1 Angstrom the best point is at x=25.700 y=7.600 z=19.800. Of course more computer time is required to compute the close-packed points, but the distance between the observed and the predicted positions is now only 0.30 Angstrom. This close agreement shows that GRID has made a satisfactory prediction. GRID predicts that there should be a water molecule very near the place where the crystallographers actually found one.
In this directory we have supplied a Grid kont file called gridkont_almd.kont. This has an extended format containing extra information on the energy contribution made by each individual protein atom. A new directive ALMD was used in order to produce this special output, and if you want to know more about ALMD, click here.
The extended results obtained by using ALMD can be inspected graphically. Typing Gview with the name of the grid kont file:
Gview gridkont_almd.kont
To change the energy levels, select:
Edit->Field style
Move the cursor of the energy levels up to -12 Kcal. To display the contribution to the Molecular Interaction Field produced by individual protein atoms, click on the "Style" tab and select "atom contribution" (atom type color). You can move the image by pressing the left button of the mouse and moving the mouse at the same time.
The negative blue region of water-protein attractive interaction will be coloured in red. This is due to the oxygens O 920 of Gln and O 884 of Arg residues (red coloured) that contribute most to the energetic of the interaction with water molecule in the reported region. The nitrogens 941 of the Arg residue nearby is contributing less than the previous oxygen atoms. To prove that both the oxygens of residues Gln and Arg are contributing to the total energy of interaction in this region, open again the Rendering window and select "atom contribution"(contrast color).
The region of water-protein attractive interaction will be coloured in different colours now, showing that the two oxygens are giving a co-operative interaction with water molecule. However, N 941 is also contributing to the total energy of interaction in this region (see Figure below).

Change the MIF contour using symbol. Activate the toggle mode pressing ESC or selecting
View->Toggle Mode
The distances between atom-pairs, MIF field points and/or MIF points and protein atom can be now reported on the screen. Press SHIFT button and click on any atom of the protein, then press SHIFT and click again in another atom of the protein. Their distances will be reported on the screen. To show the MIF point - ATOM distance press on any of the MIF symbol and on the selected protein atom.
You have now completed you first Tutorial. Well done!! We look forwards to hearing from you if we can help in any way.
Please, continue with the Tutorial 02.
| <<< Previous | Home | Next >>> |
| Greater tutorial 07 | Up | Tutorial 02 |