Chapter 28. Control directives

These directives control the way in which the Programme runs. Their use is shown in Section K.6. Input is through channel IDIR (Default value IDIR = 5; see above). The Control Directives consist of one or more free format lines in the command file, and they must follow the Channel assignments. Each four-letter directive must have appropriate numerical value. The free format input expects integer values (no decimal point) for Directives starting with the letters I, J, K, L, M, and N. Do not use Tabs (the "TAB" character) in order to format your input.

Default values are supplied for all the following directives except POSI. New Users will often find that these default values are acceptable. Quite soon, however, they may like to try the effects of TOPX......BOTZ which are described in detail below.

The directives on the following pages should not be used until they are needed. Remember that the list of directives must always finish with IEND on a line by itself, before the job title is input (See Section K.6). The symbol defining the Probe should normally be entered (with the Control Directives) before IEND, but there are other ways of defining the Probe as described below.

Directives BOTX, BOTY, BOTZ, TOPX, TOPY, and TOPZ may be specified in order to restrict the computation to a selected region of the Target molecule. This region is defined as a rectangular box. TOPX TOPY TOPZ are the xyz coordinates in Angstrom of the top right back corner of the box, and BOTX BOTY BOTZ define the bottom left front corner. TOPX....BOTZ are real variables.

The TOP value along any coordinate axis must NOT be smaller than the BOT value. i.e. TOP-BOT must not be negative.

If you do not specify any values for these six variables, the size of the grid will be adjusted so that it is a little larger than the whole Target. The clearance of the GRID around the Target will normally be determined by the value of directive CLER. CLER has a default value of 5.0 so the GRID will extend 5.0 Angstrom beyond the Target in each dimension.

If the six values BOTX, BOTY, BOTZ, TOPX, TOPY, and TOPZ are all specified then the GRID is constructed so that it just encompasses the rectangular box. In this case it is appropriate but not essential to use values of TOPX....BOTZ, so that the GRID can fit the box exactly. The closeness of the grid points is determined as usual by directive NPLA (see below), but the value of directive CLER is reduced to 0.0. However, new Users need not worry about these details.

If you are studying several Targets one after the other as a Set, then the size of the grid will be adjusted so that it surrounds the whole Set of Targets. The clearance depends on the value of Directive CLER (see below) if Directive KWIK = 0. However the clearance is automatically determined taking account of atom and Probe charges and the dielectric environment if KWIK = 3 or KWIK = 4, and in this case you should not set values for TOPX ... BOTZ. See Diagram 6 or see Chapter 36 for more information.

Superposition of targets: The grid will surround all the Targets, when you are studying a Set. All the Target molecules should therefore be in approximately the same x,y,z coordinate position. If the Targets are dispersed, the Grid will be correspondingly large, and this is inconvenient.

28.1. Observations at one grid point only

If TOPX=BOTX and TOPY=BOTY and TOPZ=BOTZ then the energy of inter- action is computed for the Probe at one single position. Results are output by lineprinter to GRIDLONT as usual, but there is no output to GRIDKONT. In this special case it is not necessary to specify both TOP and BOT values, but BOTX BOTY BOTZ alone will suffice.

Do not use TOPX...BOTZ in order to define a single Grid point, if you are studying several Targets one after the other as a Set. Use Directive POSI (see below) in this case, and make sure that you never use POSI at the same time as TOPX ... BOTZ. POSI will normally over-ride directives TOPX ... BOTZ if a clash occurs, but POSI must not be used with Multi-Atom Probes.

Directive POSI (see below) may be used repeatedly if the energy of interaction is wanted at more than one specified position. In fact POSI is the only Directive which may be used more than once on successive lines of a Command File for Grid. In some cases the Command File may have thousands of POSI lines, for example to computer Grid energies at points on the surface of a molecule.

28.2. Timings with programme GRID

GRID is a fast programme, and runs reasonably quickly on modern computers and workstations. Timing is not usually a problem, unless an older machine is being used or large Targets are being studied with close-spaced grids.

Of course shorter quicker computations can be performed when TOPX etc are used, because fewer grid points are calculated. Note well, however, that the whole Target is still considered during the computations for each individual grid point, i.e.nbsp; interactions between the Probe and any the Target atoms which are outside the rectangular box are still taken into account.

When first studying a new macromolecular Target, the GRID Points should normally be spaced at 1 Angstrom apart. With this spacing all significant features of the GRID map will normally be displayed, although the contour surfaces may appear rather rough.

When a feature of special interest has been identified, TOPX .... BOTZ should be used in order to define a smaller region in which to make a more detailed map. The grid points should be more closely spaced on this new map, so that smaller features can be observed. 0.5 or 0.25 or even 0.1 Angstrom spacing would be appropriate, and this spacing may be set with Directive NPLA (see below).

Since the cpu time for a GRID run depends on the number of grid points, a given Target will take 64 times longer with 0.25 Angstrom spacing than with 1 Angstrom spacing!! Similarly, with a given spacing, a Target of 10x10x10 Angstrom dimensions will run 64 times faster than a Target of 40x40x40. In practice the User should judge the spacing and the size of the Target, and set suitable directives to achieve an appropriate cpu time with the available processor.

28.3. Control directives for GRID

28.3.1. Directive ALMD

From version 19 the GRID output can be controlled by a new directive ALMD (which is short for Almond). The default value is ALMD=0.0 and this leave the output as it was in the previous release. The output is also unchanged if ALMD is omitted altogether from the command file. If ALMD is set to the alternative value 1.0 there should be no effect in a regular grid run; ie: there should be no effect in a run with one Probe, and with input from a single .KOUT file as the Target. In this case the .KONT output should be a regular binary matrix for one Probe on one Target.

However extra information is written to the .KONT output file, when the input is defined by a file.list and when ALMD has been set to 1. In this case there may be several different targets as defined by the file.list, and extra information is printed. .KONT output from a file.list normally begins with the coordinates of the grid points, and these are printed as usual to the start of the .KONT file like this:

 1     -9.000 -6.000
-5.000  2     -9.000
-5.000 -5.000  3
-9.000 -4.000 -5.000
.......................................
and so on until one comes to the next section of output which would have been like this from the previous release of grid:

phenolate       -1.5     HEADER                           CMPD 1
   0.000
  -0.001
  -0.007
..........

and so on. These numbers are the energies at successive grid points, so the third grid point at (-9.000 -4.000 -5.000) has an energy of -0.007 Kcal/mole in this example. The same output would be obtained when ALMD was set to its default value 0.

However, when ALMD is set to 1 the energies are printed like this:

phenolate       -1.5     HEADER                           CMPD 1
   0.000       0
  -0.001       4
  -0.007       7
..........
and so on. This means that atom 4 in the .KOUT input file interacts more strongly with the Probe than any other Target atom, when the Probe is at position 2. Atom 4 makes the bigest contribution to the total energy of -0.001 at this position -9.000 -5.000 -5.000. Similarly atom 7 of the Target makes the strongest interaction when the Probe is at the third grid point (-9.000 -4.000 -5.000). (The number 0 is written against the first energy, because the interaction is so weak (0.000) that no "best" atom can be selected). Note that only the heavy atoms of the Target are considered when the best atom is chosen. Therefore, if the best atom were a hydrogen, it would be ignored.

Gview program can be used to display this extra information in the 3Dplot of Molecular Interaction Fields. This directive can be very useful in MIFs interpretation.

When a Target molecule contains negative charge concentrated on a certain atom, this is always the most strongly interacting atom when the Probe is positively charged (like N3+).At every grid point the best Target atom can be the charged atom when N3+ interacts with this target structure. The output will shows no selectivity because the electrostatic interaction always dominates, and gives the same result at every grid point.

A possible way of dealing with this problem is to set ALMD directive to 0.1 (which is acceptable because ALMD is a real variable ranging between 0.0 and 1.0). When ALMD is set to 0.1, then the electrostatic contribution is diminished by a factor of 10 when the most strongly interacting atom is being selected. The output energy itself is not changed, but the selection of "best atom" is made with a reduction of the electrostatic component.

Summing up: when 1.0 <= ALMD < 0.0 the extra information is printed in GRID .KONT file showing the Target atom which interacts most strongly at each grid point. Gview programme can be used to display this extra information in the 3Dplot of Molecular Interaction Fields.

28.3.2. Directives BOTX, BOTY, BOTZ

Directives BOTX, BOTY, BOTZ, TOPX, TOPY, and TOPZ may be specified in order to restrict the computation to a selected region of the Target molecule. This region is defined as a rectangular box. TOPX TOPY TOPZ are the xyz coordinates in Angstrom of the top right back corner of the box, and BOTX BOTY BOTZ define the bottom left front corner. TOPX....BOTZ are real variables.

The TOP value along any coordinate axis must NOT be smaller than the BOT value. i.e. TOP-BOT must not be negative.

If you do not specify any values for these six variables, the size of the grid will be adjusted so that it is a little larger than the whole Target. The clearance of the GRID around the Target will normally be determined by the value of directive CLER. CLER has a default value of 5.0 so the GRID will extend 5.0 Angstrom beyond the Target in each dimension.

If the six values BOTX, BOTY, BOTZ, TOPX, TOPY, and TOPZ are all specified then the GRID is constructed so that it just encompasses the rectangular box. In this case it is appropriate but not essential to use values of TOPX....BOTZ, so that the GRID can fit the box exactly. The closeness of the grid points is determined as usual by directive NPLA (see below), but the value of directive CLER is reduced to 0.0.

28.3.3. Directive CLER

CLER is the clearance in Angstrom by which the grid extends beyond the Target. This ensures that the whole Target molecule is well contained within the array of grid points. However, the assigned value of CLER is sometimes changed by the Programme at run time, and a message is printed to GRIDLONT when this happens. For example CLER is reduced to 0.0 when TOPX.....BOTZ are used, so that the GRID can fit the defined box exactly.

If you are studying several Targets one after the other as a Set, then the size of the grid will be adjusted so that it is big enough to surround all the Targets with a clearance equal to CLER if directive KWIK = 0. However the value of CLER will be ignored if KWIK = 3 or KWIK = 4 and the clearance will be automatically determined taking account of atom and Probe charges and the dielectric environment. See Chapter 36 for more information.

The initial default value is CLER=5.0 Angstrom and the acceptable range of values for CLER is normally:

Default: CLER = 5.0

2.0 < CLER < 100.0

28.3.4. Directive DEEP

DEEP is a directive that influences the amount of information which is printed to the output file GRIDLONT. GRIDLONT contains a series of Tables which describe the environment around the energy minima of the grid map. However, some of these minima may be shallow, only occurring at insignificant energy levels. For instance a deep minimum at -10 Kcal/mole might well be of functional importance, while another shallow minimum at only -0.5 Kcal/mole would be less significant although it was still a true minimum in its region of the energy map.

DEEP is an energy level measured in Kcal/mole. If a minimum occurs at a more negative energy than DEEP, then the expected Table of information will be printed to GRIDLONT. On the other hand if the bottom of the minimum is above DEEP (i.e. more positive than DEEP) then no Table will be printed. If DEEP was equal to -5.0 Kcal/mole in the above example, then a Table would be printed for the minimum at -10.0 Kcal/mole but not for the minimum at -0.5. Thus DEEP may be used to control the size of the gridLONT.DAT output file.

The default value of DEEP is set equal to EMAX (see below). This has the effect of incapacitating DEEP unless it is reset by the User to an appropriate negative value such as -1.0 Kcal/mole.

If the User wants several Tables to be printed for each z-plane, then he or she will set Directive NUMB to a value greater than 1. In this case the default for DEEP will be reset to -1.0 Kcal/mole in order to prevent the printout of Tables about shallow minima near the edges of the grid. (The User can still set DEEP to another value if this altered default is unacceptable).

DEEP is automatically reset to -0.001 when the hydrophobic Probe is used, in order to prevent the generation of redundant information about grid points where there is no hydrophobicity.

DEEP is switched off when LEAU=3 because the Programme is detecting grid points at which the Grid Energy is positive.

Default: DEEP = EMAX or DEEP = -1.0 when NUMB>1 or DEEP = -0.001 when the hydrophobic or amphipathic Probes are used.

EMAX > DEEP > -20.0

28.3.5. Directive DPRO

DPRO is the effective dielectric constant of the macro-molecular matrix, which is treated as a uniform region of low dielectric constant from which solvent molecules are partly excluded. For example, DPRO might define the effective dielectric constant in the centre of a globular protein.

Default: DPRO = 4.0

1.0 < DPRO < 120.0

28.3.6. Directive DWAT

DWAT is the effective dielectric constant of the environment surrounding the Target molecule. In biological systems this is usually water, and the default value is therefore set to 80.0

Default: DWAT = 80.0

1.0 < DWAT < 160.0

28.3.7. Directive EACH

EACH is a directive which influences the selection of Target atoms for printing to the tables in the output file GRIDLONT. One table applies to a single grid point, and the atoms in that table are normally those nearest to the chosen point, starting with the nearest Target atom and moving outwards. This is shown by the "actual distance" column of the table.

EACH is an energy measured in Kcal/mole. By default it has a positive value and is therefore switched off. However, it may be reset to a small negative energy, and if the total attraction between the Probe (at its grid point) and a Target atom would be more positive than EACH, then the line corresponding to that atom will not be printed to the table. Thus EACH may be used in order to reduce the overall size of the tables in GRIDLONT, in such a way that the most significant atom-Probe attractions ar reported but weaker interactions are not.

An example will make this clear. If the closest atoms are respectively at 2.9 3.0 3.1 3.2 and 3.3 Angstrom distance from the grid point, and directive LENG=5 and directive EACH=5.0, then the GRIDLONT table will contain five lines each reporting the strength of one Probe-Atom interaction. If the interaction energies of these atoms (as measured in the ETOT column of the GRIDLONT file) were respectively +1.0 -0.5 -0.01 -0.3 and -0.2 Kcal/mole, and if directive EACH=-0.1, then the line corresponding to the third atom (which is at 3.1 Angstrom with an attraction energy of only -0.01 Kcal/mole) would be omitted from the GRIDLONT table.

Note that the first atom in this example had a positive interaction energy, indicating a repulsion. Although its energy of +1.0 was more positive than the value of EACH (-0.1), the line corresponding to this atom would still be printed to the table, because EACH has no effect when there is a close contact causing repulsion between a target atom and the Probe. Note also that EACH has no effect when one is doing a Grid run with several selected Probes at several selected grid points, because in this case the individual Probes may all have different interaction energies at the same grid point.

The default value of EACH is set equal to EMAX (see below). This has the effect of incapacitating EACH unless it is reset by the User to an appropriate negative value such as -0.5 Kcal/mole.

EACH is switched off when LEAU=3 because the Programme is detecting grid points at which the Grid Energy is positive.

Default: EACH = EMAX

EMAX > EACH > -2.0

28.3.8. Directive EMAX

EMAX is a positive energy value in Kcal/Mole; i.e a repulsion energy. It is used as an upper cut-off energy, for example when the Probe happens to be positioned on a grid point which is very close to a Target atom.

If the calculated energy at the grid point is greater than EMAX, then it will be cut back to the EMAX value before it is printed to GRIDKONT. This feature permits the User to control the overall range of energy values in GRIDKONT, which may be useful if he or she wishes to rescale the data.

All energies upto EMAX will be fully and correctly computed. The default value of EMAX is +5 Kcal/mole, so that the overall range of values in a typical Grid run will be more or less symmetrical about zero. The recommended range of values is 2.0 to 500.0 Kcal/mole, but positive values outside this range will not be rejected.

EMAX should normally be switched off when LEAU=3, because the Programme is then detecting grid points at which the Grid Energy is positive. EMAX is therefore reset to +400 Kcal/mole if directive LEAU=3.

Default: EMAX = 5.0

2.0 < EMAX < 500.0

28.3.9. Directive FARH

FARH is the distance in Angstrom beyond which hydrogen bond interactions are neglected.

Default: FARH = 5.0

3.0 < FARH < 10.0

28.3.10. Directive FARR

FARR is the distance in Angstrom beyond which Lennard-Jones interactions are neglected. Note that there is no distance cut-off for electrostatic interactions; they are all computed.

Default: FARR = 8.0

3.0 < FARR < 100.0

28.3.11. Directive IEND

IEND is used to on a line by itself after any other directives, to show that the list of directives has finished. No numerical value is given to IEND.

28.3.12. Directive KWIK

KWIK can be used to alter the speed or size of the computation. It is also used in conjunction with the exclamation mark '!' when the Target is symmetrical (see Section 24.3.1). Its default value is zero, and KWIK then has no effect. The grid will surround the Target with a clearance of CLER. Directive VALU is disabled (see below).

28.3.12.1. KWIK with a single target

In a normal Grid run with a single Target KWIK can be set to 1, 2, 5, 6, or 7 with the following results (Also see Diagram 5 below):

  • It should be noted that a complete grid is normally a rectangular box. Some of the grid points lie in the corners of the box, where they may be far from the Target molecule. However, a full energy calculation is normally made for each of these grid points, although the information may not actually be required by the User for such distant positions. Setting the value KWIK=1 forces the Programme to omit any grid point which is more than 4.0 Angstrom away from the Target. This makes the Programme run more quickly. When KWIK=1 the energy values for the omitted grid points are set to zero, and this can sometimes produce a sharp energy jump at the 4.0 Angstrom cut-off distance. Anomalies may also occur if you are studying a system in which the Probe is always repelled from the Target at every point on the grid. Care must therefore be taken when interpreting results with KWIK=1. Note also that bigger values of directive CLER will automatically be reduced to 4.0 whenever KWIK=1. The value KWIK =1 may not be used if you are studying several Target molecules one after the other as a Set.

  • Setting KWIK=2 has the same effect as KWIK=1. In addition, however, the detailed geometry of the Probe group as defined by JTYPE is neglected. JTYPE is reset equal to zero, and this further speeds up the computation (see Diagram 5 below). The value KWIK = 2 may not be used if you are studying several Target molecules one after the other as a Set.

  • Setting KWIK=5 has the same effect of KWIK=1, but the process of definition of distant-points has been improved. Therefore, when KWIK=5 the Programme Grid will omit any grid point which is more than 5 Angstrom away from the Target (in the case of Uncharged Probes) or 7 Angstrom (in the case of Charged Probes). The energy values for the omitted grid points are set to zero. Directive KWIK is automatically reset equal to zero when LEAU=3.

  • If you set KWIK=6 grid points which are inside the Target are easily found and Programme Grid will omit any grid point which is more than 5.0 Angstrom away from the Target (in the case of Uncharged Probes) or 7 Angstrom (in the case of Charged Probes). The effect of setting KWIK=5 is maintained when setting KWIK=6; thereby, two combined factors speed up the calculations. The energy values for the further omitted grid points are set to the GRID energy cut-off (+5 kcal/mol). KWIK=6 is optimal for macromolecules. Directive KWIK=6 is automatically reset equal to zero when LEAU=3.

  • Setting KWIK=7 all the effects of KWIK=5&6 are maintained. In addition, the regular grid cube around the (macro)molecule will be modified into a regular ellipsoid inscribed inside the cube. The eights corners of the cube are held out from the computation. Again, the energy values for the further omitted grid points are set to zero. Directive KWIK is automatically reset equal to zero when LEAU=3.

Note that any value of KWIK>0 may not be used unless Directive MOVE=0. This value of MOVE does not permit atoms of the Target to move in response to the Probe. If they were allowed to move, they might go to places near grid points at which KWIK was going to abort the computation. Important results at such points might then be lost if KWIK>0, and this is why these KWIK values are only allowed with MOVE=0. The value KWIK=5&6 can be used if you are studying several Target molecules one after the other as a Set, since the cage dimensions are not modified. KWIK=7 is optimized for macromolecules, when the active site shape is almost regular and quite-spherical.

28.3.12.2. KWIK with a Set of Targets

The normal input to Programme GRID is a GRINKOUT file previously prepared by Programme GRIN, but an alternative procedure is available. Instead of reading the actual GRINKOUT file into GRID through channel INPT, one can read a list of pointers to several GRINKOUT files which are then processed as a Set. See also Chapter 36. KWIK should not have the values 1 or 2 when this is done, but may be 3, 4, 5, 6, or 7 instead.

  • If you set KWIK=3 then the size of the grid will be automatically adjusted so that it surrounds all the Targets in the Set. It is essential for all the Targets to be in the same position and arranged in the same orientation; i.e. they must be more or less superimposed. Then (if KWIK has the value 3) the size of the grid will be increased appropriately, and will take account of the fact that some of the compounds may be charged and therefore have a far-reaching electrostatic field. The final size of the grid will be determined according to the superposition arrangement; the charges of each compound; the dielectric environment; and the charges of the Probes. Directive CLER is without effect. See Diagram 6 below. The value KWIK=3 may not be used if you are studying one single Target in the usual way. It may only be used when you are processing several molecules one after the other as a Set.

  • If you set KWIK=4 then the size of the grid will be adjusted as described for KWIK=3 but the output to the GRIDKONT file will either be shortened by the elimination of 'wasted' grid points, or the 'wasted' points will be filled by a predefined "missing value". By a 'wasted' grid point is meant one that is distant from all the Target molecules, in a region of space where the energy is mostly electrostatic, and where that energy is small and only varies gradually from point to point. Exactly what happens is determined by Directive VALU like this:

    • If VALU=0.0 you will get a shorter output, because the 'wasted' points will be eliminated from the output to GRIDKONT. Of course, you will no longer get results for all the points of a complete rectangular grid, and so it may not be easy to display the results graphically.

    • If VALU has a non-zero value you will get the full set of grid points, but all the 'wasted' points will have the numerical value of VALU. For instance if you set VALU=-99.99 then every wasted point will have the value -99.99 and you could write a statistics program that would treat this as a "missing value".

    With KWIK equal to 4 it is again appropriate for the Targets to be superimposed (See Diagram 6 below). The value KWIK=4 may not be used if you are studying one single Target in the usual way. It may only be used when you are processing several molecules one after the other as a Set.

  • Setting KWIK=5 has the same effect of KWIK=1, but the process of definition of distant-points has been improved. Therefore, when KWIK=5 the Programme Grid will omit any grid point which is more than 5 Angstrom away from the Target (in the case of Uncharged Probes) or 7 Angstrom (in the case of Charged Probes). The energy values for the omitted grid points are set to zero. Differently from KWIK=1, Directive KWIK=5 works also when BOTX, ... , TOPZ are used to define the grid cage; on the contrary, KWIK is automatically reset equal to zero when LEAU=3.

  • If you set KWIK=6 grid points which are inside the Target are easily found and Programme Grid will omit any grid point which is more than 5.0 Angstrom away from the Target (in the case of Uncharged Probes) or 7 Angstrom (in the case of Charged Probes). The effect of setting KWIK=5 is maintained when setting KWIK=6; thereby, two combined factors speed up the calculations. The energy values for the omitted grid points are set to the GRID energy cut-off (+5 kcal/mol). KWIK=6 is optimal for macromolecules. Differently from KWIK=1, Directive KWIK=6 works also when BOTX, ... , TOPZ are used to define the grid cage; on the contrary, KWIK is automatically reset equal to zero when LEAU=3.

  • Setting KWIK=7 all the effects of KWIK=5&6 are maintained. In addition, the regular grid cube around the (macro)molecule will be modified into a regular ellipsoid inscribed inside the cube. The eights corners of the cube are held out from the computation. Again, the energy values for the omitted grid points are set to zero. Differently from KWIK=1, Directive KWIK=7 works also when BOTX, ... , TOPZ are used to define the grid cage; on the contrary, KWIK is automatically reset equal to zero when LEAU=3.

The default value KWIK=0 is normally recommended, unless the available time is particularly short, or the processor is rather slow, or the Target is very big, or a Set of Targets is being studied.

Note that KWIK must not have the value 1, 2, 5, 6, nor 7 when a multi-atom Probe is being used.

Default: KWIK=0.

KWIK=0, 1, 2, 5, 6, or 7 for a single Target.

KWIK=0, 3, 4, 5, 6, or 7 for a Set of Targets.

28.3.12.3. Using KWIK with symmetrical targets

Some very big Targets are symmetrical. For example there is a perfect axis of two-fold rotational symmetry down the centre of the haemoglobin tetramer, and there is a receptor cleft precisely on this axis. The receptor has two identical halves, and a Grid map of the receptor region would also be symmetrical. In such a case as this it might only be necessary to compute the map for half of the Target, since the other half of the Grid map would be exactly the same.

One method of restricting the size of the map to fit the required region would be to use the directives TOPX..... BOTX. An alternative approach would be to set directive KWIK=1 (or 2) as described above. A third possibility would be to mark the atoms of the Target structure so that only one half of it would be considered. This last approach is always combined with the use of KWIK.

In practice, one should not study a Set of large Targets, and directive KWIK should be set equal to 1 (or 2) when a single large symmetrical Target is being studied. Exclamation marks should then be used to flag half the atoms of the Target. KWIK will ignore the flagged atoms when it is determining the the 4 Angstrom cut-off. For example, if all the atoms in one half of the haemoglobin tetramer were flagged with an !, then the cut-off would surround the unflagged half. It would extend 4 Angstroms into the flagged moiety but no further, so that the final Grid map would cover rather more than the flagged half of the Target.

The exclamation mark can be used to flag any unwanted part of the Target. For instance the insulin hexamer which may be described as an (alpha/beta) trimer, and one could flag out two-thirds of the structure. The computation would then only be made over one third of the grid points, but the whole hexamer structure would be taken into account while that one third was computed; i.e. the result would be different to that which would have been obtained if a Grid map had been computed for an isolated alpha/beta pair alone, because the isolated pair would have been treated as if the rest of the hexamer was missing altogether.

The exclamation mark should be placed against appropriate atoms in the ALT column of the PDB input file, before running the first Programme GRIN. Alternatively, it may be placed in the corresponding column of the GRINKOUT file. The exclamation mark is a reserved character in this position, with the specific function of controlling the Target area when KWIK has been set.

Note that KWIK must not have the value 1 or 2 when a multi-atom Probe is being used, and the exclamation mark may not be used to flag symmetrical atoms with multi-atom Probes. The interactions between Directives KWIK and LIST are summarised in Diagrams 5 and 6 (see Directive LIST).

28.3.13. Directive LEAU

LEAU determines the way in which water is treated during the computation (LEAU is the French word for water). In this Version of the programmes it can be used in two completely different ways.

28.3.13.1. Bridging waters

In some circumstances a water molecule may act as a Hydrogen Bonding Bridge between Target and Probe. For example, Matthews et al found such a water between a threonine hydroxyl group of dihydrofolate reductase, and an NH2 substituent of its ligand methotrexate. The presence of this water was an essential feature in the binding pocket.

Programme GRID will take explicit account of such a Water molecule when LEAU = 1 The User does not need to provide any water coordinates, although waters may be included in the input PDB and GRINKOUT files if their positions are known. The main GRID computation for Target and Probe is started as usual. Then, if LEAU = 1, GRID will find and take account of any sites for Bridging Waters whilst it does the main computation for the interaction between Target and Probe. A conventional lineprinter output GRIDLONT is produced, showing favoured sites for the Probe as usual. If a Bridging Water contributes to the Probe:Target interaction, then an extra line for that Bridging Water will be printed to the GRIDLONT file. The energy value which is written to the GRID map generated in the GRIDKONT file, will also include an appropriate contribution from the bridging water.

The value LEAU = 1 may only be used with certain Probe Types. The Probe must be able to make at least two hydrogen bonds, one to the Bridging Water molecule and one direct to the Target. Probe Types with more hydrogen bonds can of course be used. The geometry of the Probe must be defined by JTYPE, and so JTYPE = 0 is not an acceptable value.

The value LEAU = 2 may only be used with a few Probe Types. The Probe must be able to make at least three hydrogen bonds, two to the Bridging Water molecules and one direct to the Target. Water, which can make four hydrogen bonds may of course be used. The geometry of the Probe must be defined by JTYPE, and so JTYPE = 0 is not an acceptable value.

If the main Probe is itself water, then two possibilities are considered when LEAU = 2:

  • The water forms one hydrogen bond direct to the Target, and one or two to bridging waters.

  • The water forms two hydrogen bonds to the Target, and one or two to bridging waters.

Thus, when the main Probe is water, a network of water molecules across the surface of the Target can be considered, each forming sideways hydrogen bonds to its water neighbours.

When Directive MOVE = 1 or 2 or 3 or -1 certain atoms of the Target are allowed to move under the influence of the Probe. In these cases the values LEAU=1 and LEAU=2 must not be used.

28.3.13.2. Competiton by the probe

It sometimes happens that the chosen Probe may interact favourably with the Target at a certain position, but that a water molecule would interact better at the same place. The net interaction energy of that particular Probe at that grid point would then be unfavourable, and Grid will detect regions where this happens if directive LEAU is set as LEAU=3. More detailed information will be found in Chapter 34.

It is important to know if specifically unfavourable places for certain Probes exist on a Target, because they can influence the affinity and selectivity of drug-receptor interactions, and the orientation of a ligand at its receptor site. A well-known example occurs with the enzyme dihydrofolate reductase, which binds the purine ring of its substrate in one orientation, and the corresponding ring of its inhibitor methotrexate (MTX) after an 180 degree ring rotation. The significant structural difference between these two ligands is the replacement of an oxygen atom in the substrate by a nitrogen in MTX, and the inhibitor was originally designed on the assumption that it would bind exactly like the substrate itself.

The bound orientation of MTX was observed many year's ago by X-ray crystallography. This Version of Grid shows (when LEAU has been reset to 3) that an oxygen of the substrate would be precisely located at a place in the enzyme cleft which is maximally unfavourable to oxygen, if the substrate were to bind in the same way as MTX. Indeed it appears as if two backbone carbonyl oxygens of the enzyme are an important structural feature which ensures that the substrate does not bind like MTX, because dihydrofolate reductase can only give a stereochemically correct product if the substrate ring is rotated 180 degrees away from the MTX orientation. See Chapter 60 for more information.

Results such as these (with LEAU=3) suggest that ligand selectivity may be critically influenced by pre-bound water molecules, because water is not easily displaced by polar but inappropriate ligand atoms. One misplaced polar atom such as carbonyl may give such a big positive energy that it forces the ligand into a completely different orientation, and causes several otherwise apparently acceptable hydrogen bonds to be misaligned.

Default: LEAU = 0

Alternative Values: LEAU = 1 or 2 or 3

28.3.14. Directive LENG

LENG controls the length of the output Tables printed in GRIDLONT by the lineprinter. Each Table describes the environment around a low energy position; i.e. a grid point at which the Probe group would interact favourably with the Target molecule. Every line in the Table describes the interaction between the Probe and one of the Target atoms.

When LENG has a positive value, the Target atoms in the table are listed according to their distance from the grid point. The table begins with the nearest atom to the Probe position itself, and LENG is the number of atoms listed. All nearby atoms are considered including HETATMS and hydrogens.

If LENG=0 and NUMB=0 the printout of Tables will be abolished, and the GRIDLONT output file will be very much shorter.

When LENG has a negative value, the Target atoms in the table are listed according to their interaction energies with the Probe. The table begins with the Target atoms which interact most strongly, and these are usually atoms which are attracted to the Probe.

28.3.14.1. Details about negative values for LENG

Sometimes there are Target atoms which have strong positive interaction energies (indicating repulsion of the Probe), and they will be brought to the top of the table when LENG is negative and the repulsion energy is substantial. This is done because it could be misleading to look at all the attractions and ignore significant repulsions. For instance, the charge on a carbonyl carbon is typically +0.3, and the charge on the carbonyl oxygen -0.3, so that one might be strongly attracted and the other strongly repelled by an anionic Probe. In this situation it would then be misleading to consider the attraction alone.

In other situations there may be lots of Target atoms which show a weak attraction, and lots more which show weak repulsion. These weak energies may not be so interesting, and such atoms will all be consigned towards the end of the table when LENG is negative.

Default: LENG = 30

-251 < LENG < 251

28.3.15. Directive LGND

LGND can be used to provide an extra input channel for the eight (or nine) Energy Variables which describe the properties of the Probe ligand. Unlike the other directives in this list, LGND may be used with the second method of input described above.

In order to use LGND effectively a number of Probe Files must be prepared, each with the Energy Variables for one type of ligand. The regular user can then call these variables through channel LGND without having to look them up and type them in and check them on each occasion. Sample files are provided for commonly used Probes (see FIXME ). Please note, however, that the remaining input information (i.e. the directives, IEND, the job-title and the values of NZ1 and NZ2) must still be input through channel IDIR as usual.

Default LGND=IDIR

28.3.16. Directive LEVL

LEVL controls the amount of additional information printed in GRIDLONT by the lineprinter. Bigger values give more information.

The value of LEVL should be 1 or more, when one is using a File List to study a set of several compounds with several Probes. It will be raised to 1, and message D1815 will be printed to GRIDLONT.DAT if a File List is being used and LEVL has been set to a smaller value.

Default: LEVL = 1

-2 < LEVL < 5

28.3.17. Directive LIST

LIST controls output to file GRIDKONT. The default value is LIST = 1

28.3.17.1. Using directive LIST with a single target

When a single Target is being studied in a normal Grid run, an extra record will be prefixed to the start of the computer-readable GRIDKONT file if LIST=1 which is the default value. This record will be absent if LIST=0

The extra record contains additional information which can be used, for example, when a graphics program is being initialised in order to display the Grid map.

28.3.17.2. Using directive LIST with hypermolecules

Directive LIST is provided because it may not be possible to generate the whole GRIDKONT file in a single computer run, when a very large macromolecule is being studied on a slow computer. The individual planes of grid points (i.e. the z-planes; see below) can then be calculated in batches, and finally merged to give the complete GRIDKONT file.

You can achieve this by setting directive LIST=1 when you calculate the first batch of z-planes, and having LIST=0 for the remaining batches. Then, when you merge the output files the extra information will only be provided once at the start of the merged GRIDKONT file.

28.3.17.3. Using directive LIST with a set of targets

The situation is somewhat different when a Set of Targets is being studied, and LIST has its default value of 1. In this case LIST forces the GRIDKONT output to begin with a list of the x,y,z coordinates of the grid points. These are followed by the Grid energies for each Target molecule, one after the other. The energies are printed for all the Targets with the first (or only) Probe, and the whole printout is repeated if there are subsequent Probes. Thus the layout of the GRIDKONT file will be arranged like this:

 The set of coordinates for Probe A 
   Energies for Target 1 with Probe A 
   Energies for Target 2 with Probe A 
   . 
   . 
   Energies for last Target with Probe A 
 The set of coordinates for Probe B 
   Energies for Target 1 with Probe B 
   Energies for Target 2 with Probe B 
   . 
   . 
   Energies for last Target with Probe B 
 The set of coordinates for the last Probe
   Energies for Target 1 with last Probe
   Energies for Target 2 with last Probe 
   . 
   . 
   Energies for last Target with last Probe

You may set LIST = -1 only when studying a Set of Targets. This value of LIST then shortens the GRIDKONT printout. The coordinates and energies are printed as usual for the first Probe, but the coordinates are not repeated before the set of energies generated by the second or later Probes :

 One set of coordinates for all Probes
   Energies for Target 1 with Probe A
   Energies for Target 2 with Probe A
   . 
   .  
   Energies for last Target with Probe A 
   Energies for Target 1 with Probe B 
   Energies for Target 2 with Probe B 
   . 
   . 
   Energies for last Target with Probe B 
   Energies for Target 1 with last Probe 
   Energies for Target 2 with last Probe 
   . 
   . 
   Energies for last Target with last Probe

Note that LIST should not be set equal to zero if several Targets are being studied as a Set. If you do set LIST = 0 for a Set, then the value of LIST will be changed from 0 to 1.

Setting LIST = 2 is only permitted when a Set of Targets are being studied one after the other. The output is arranged as described above for a Set with LIST = 1, but is printed in computer readable binary code. This gives a smaller output file which can be written more quickly, and can be read more quickly by the computer on which it was written. On the other hand it cannot be read by eye, and may not be compatible with other computers.

Setting LIST = -2 is only permitted when a Set of Targets are being studied one after the other. The output is arranged as described above for a Set with LIST = -1 but is printed in computer readable binary code. This gives a smaller output file which can be written more quickly, and can be read more quickly by the computer on which it was written. On the other hand it cannot be read by eye, and may not be compatible with other computers.

28.3.17.4. Subdividing the gridkont file

When a Set is being studied and LIST = 1, then the GRIDKONT file may be relatively large. However it will be easy to sub-divide the file into a number of sections, one for each Probe and all having the same layout of coordinates followed by Grid energies. On the other hand when LIST = -1 the GRIDKONT file will be shorter because the coordinates will not be repeated for each Probe. In this case, however, it may not be so easy to sub-divide the file into similar sections with the same layout in each.

28.3.17.5. Summary for directive LIST

In summary, the available values of LIST are: Default: LIST=1; LIST=0 or 1 for a single Target; LIST=2 or 1 or -1 or -2 for a Set.

>DIAGRAM 5: SUMMARY OF THE EFFECTS OF DIRECTIVES KWIK AND LIST WHEN ONE SINGLE TARGET IS BEING STUDIED BY ITSELF. (NOTE THAT DIFFERENT EFFECTS ARE PRODUCED WHEN A SET OF TARGETS ARE BEING STUDIED ONE AFTER THE OTHER. IN THAT CASE LIST MAY NOT HAVE THE VALUE 0.)

KWIKLIST = 0LIST = 1
0The grid size will depend on Directive CLER.The grid size will depend on Directive CLER.
1CLER will be set to 4.0 Angstrom. The values of the grid energies will be set to zero at points near the corners of the Grid.CLER will be set to 4.0 Angstrom. The values of the grid energies will be set to zero at points near the corners of the Grid.
2* CLER will be set to 4.0 Angstrom. The values of the grid energies will be set to zero at points near the corners of the Grid. Several other shortcuts will make Grid run quicker.* CLER will be set to 4.0 Angstrom. The values of the grid energies will be set to zero at points near the corners of the Grid. Several other shortcuts will make Grid run quicker.
3* KWIK will be reset to zero* KWIK will be reset to zero
4* KWIK will be reset to zero* KWIK will be reset to zero
5The values of the Grid energies will be set to zero at points which are more than 5 to 7 Amstrong away from the Target, independently from the value of CLER.The values of the Grid energies will be set to zero at points which are more than 5 to 7 Amstrong away from the Target, independently from the value of CLER.
6In addition to KWIK=5, the points inside the Target are set to the Grid energy cut-off (+5 kcal/mol).In addition to KWIK=5, the points inside the Target are set to the Grid energy cut-off (+5 kcal/mol).
7In addition to KWIK=5&6, the cube-cage is modified into an ellipsoid-cage, and the points at the eight corners are omitted from the calculation.In addition to KWIK=5&6, the cube-cage is modified into an ellipsoid-cage, and the points at the eight corners are omitted from the calculation.

Combinations marked with a star * are NOT recommended.

>DIAGRAM 6: SUMMARY OF THE EFFECTS OF DIRECTIVES KWIK AND LIST WHEN SEVERAL TARGETS ARE BEING STUDIED AS A SET. (NOTE THAT DIFFERENT EFFECTS ARE PRODUCED WHEN ONE SINGLE TARGET IS BEING STUDIED BY ITSELF.)

* If you set LIST = 0 it will be reset to LIST = 1

KWIKLIST = -1 or -2LIST = 1 or 2
0The grid size will depend on Directive CLER. The x,y,z coordinates will be output once, before the energies generated by the first Probe. VALU is disabled.The grid size will depend on Directive CLER. The x,y,z coordinates will be output before the energies generated by each and every Probe. VALU is disabled.
1* KWIK will be reset to zero* KWIK will be reset to zero
2* KWIK will be reset to zero* KWIK will be reset to zero
3The size of the grid will be adjusted to surround all the Targets with a clearance depending on their charges, on the charges of the Probes and on the dielectric. The coordinates will be output once, before the energies generated by the first Probe. VALU is disabled.The size of the grid will be adjusted to surround all the Targets with a clearance depending on their charges, on the charges of the Probes and on the dielectric. The coordinates will be output before the energies generated by each and every Probe. VALU is disabled.
4The size of the grid will be adjusted as for KWIK=3 and "wasted" points will then be eliminated if VALU is zero, or they will be replaced by missing values if VALU has been set. The coordinates will be output once, before the energies generated by the first Probe.The size of the grid will be adjusted as for KWIK=3 and "wasted" points will then be eliminated if VALU is zero, or they will be replaced by missing values if VALU has been set. The coordinates will be output before the energies generated by each and every Probe.
5The values of the Grid energies will be set to zero at points which are more than 5 to 7 Amstrong away from the Target, independently from the value of CLER.The values of the Grid energies will be set to zero at points which are more than 5 to 7 Amstrong away from the Target, independently from the value of CLER.
6In addition to KWIK=5, the points inside the Target are set to the Grid energy cut-off (+5 kcal/mol).In addition to KWIK=5, the points inside the Target are set to the Grid energy cut-off (+5 kcal/mol).
7In addition to KWIK=5&6, the cube-cage is modified into an ellipsoid-cage, and the points at the eight corners are omitted from the calculation.In addition to KWIK=5&6, the cube-cage is modified into an ellipsoid-cage, and the points at the eight corners are omitted from the calculation.

Combinations marked with a star * are NOT recommended.

28.3.18. Directive MOVE

MOVE controls the behaviour of the Target. There are two quite separate directives called MOVE; one in Programme GRIN and another in GRID. The default value in Programme GRID is MOVE=0 and this allows lone pairs and tautomeric hydrogens to move in response to the Probe. It also allows the torsion angle of groups like sp3 hydroxyl or sp2 amine to alter, so that they can twist and make the most favourable hydrogen bonds to the Probe. This is the traditional Grid model of the Target.

The Target responds more flexibly to the Probe when MOVE=1 in Programme GRID. Flexible side chains such as the -CH2-CH2-CH2-CH2-NH3+ of lysine respond as the Probe is moved from one grid point to the next. For instance, if a carbonyl oxygen Probe was being used, then the cationic NH3+ of a lysine in the Target would tend to move towards it, so that more favourable hydrogen-bond and electrostatic interactions could occur.

Some atoms (such as the cationic NH3+ of lysine in this example) may be repositioned further away from the Core of the Target when they move in response to the Probe. The overall dimensions of the Target molecule will then be increased, and parts of the Target may actually move outside the grid. As a result of this adjustment you may need to use a bigger grid than you initially expected, and the grid size will be extended appropriately if you set directive MOVE=3 in Programme Grid.

When you set MOVE=3 you may actually get a much bigger grid than you wanted, so that the computation requires too much cpu time and the GRIDKONT file is too large. We therefore recommend that you consider using the value MOVE=2 in Programme GRID before using MOVE=3. This will initialise the GRID run and determine the grid size and the number of grid points. However a GRID run with MOVE=2 will be aborted automatically after a short time, and the User can then assess the lineprinter output. Directives CLER; TOPX ... BOTZ; NPLA and MOVE can then be altered to more appropriate values if need be, before the main Grid run is restarted with MOVE=3.

A special value MOVE=-99 may be used when Probe Molecules are being studied.

A more detailed description of the grid Force-Field model when MOVE>0 is given in Section 21.4. Several practical points should also be noted:

  • Directive MOVE was first introduced with Version 15 of the Programmes. It cannot be used with earlier Versions of GRID before Version 15.

  • The preceding Programme GRIN must compute additional information about the conformational flexibility of the Target, and must then encode the extra data into GRINKOUT.DAT. GRIN does this when Directive MOVE = 1 which is the default value for MOVE in GRIN. GRINKOUT.DAT is the input file for GRID, and the extra information is used by GRID when MOVE>0.

  • We have attempted to maintain backwards compatibility, but we strongly recommend that Versions should be used compatibly. Ie: Use Version 14 of GRIN with Version 14 of datafile GRUB and Version 14 of GRID; use Version 15 of GRIN with Version 15 of datafile GRUB and Version 15 of GRID; and so on. Versions should not be mixed.

  • Note in particular that the format for GRINKOUT files was changed with Version 15 of the Programmes in order to cope with the extra data for flexible Targets, and the newer GRINKOUT formats are not backwards compatible. GRINKOUT files produced by Version 15 of GRIN (or any later Version) cannot be used as input to Version 14 of GRID!

  • Directive MOVE is not compatible with the values LEAU=1 or LEAU=2 or LEAU=3.

  • The conformation initially assigned by the User to the flexible side chains of the Target is not critically important when MOVE>0 in GRID, because the Programme will not use those initial torsions when processing a flexible chain. However, care must be taken that flexible atoms are not placed where they are too close to each other, within Van der Waals touching distance! For instance, a phospholipid molecule with long side chains could be almost "tied into knots" by computer graphics, and the algorithms in GRIN and GRID would not be able to unravel it! It is therefore important to choose a "sensible" starting conformation for the Target, and we recommend that any flexible side chains should be extended so they are clear of the Core and clear of each other.

  • Although the initial conformation of the flexible side chains is not normally important when MOVE>0, there are some situations in which it can have a noticable effect. For instance the hydroxyl oxygen atom of a benzyl alcohol HETATM Target (C6H5-CH2-OH) can undergo torsional rotation around the axis which links the methylene CH2 group to the benzene ring. During this rotation the hydrogens of the methylene would also rotate, but the present Version of the Programme does not take this hydrogen rotation into account. The Probe at a certain grid point might then clash with a methylene hydrogen when one starting conformation of the Target was used, but not clash if one began with a different conformation. This effect only occurs with the hydrogens of methylene groups which are the linkers of flexible side chains (ie: groups directly bonded to the Core of the Target, like the methylene of benzyl alcohol). It is only significant at grid points where there is a steric clash between the Probe and the hydrogen. It does not occur with the atoms of the flexible chain itself, and does not occur with protein Targets in which the methylene group would be represented by an extended ATOM record.

  • The GRIDLONT.DAT output file from GRID shows the coordinates of the Target atoms. These are always the coordinates printed in the PDB file used as input for GRIN. They are not the positions to which the Target atoms may have been moved in response to the Probe.

  • GRIDLONT.DAT also has a column showing the distance from each Target atom to the Probe, and these distances are also measured from the original position of the Target atom as shown in the PDB file. For instance you might be surprised to see a particularly strong interaction energy when the "ORIGINAL DISTANCE" from Target atom to Probe was 10 Angstroms. This would happen if Programme GRID had moved the Target atom from its tabulated position, so that its interaction energy was actually calculated when that Terget atom was at a more appropriate distance; ie: when it was closer to the Probe and therefore more strongly attracted.

  • KWIK=1 and KWIK=2 may not be used if you have set directive MOVE>0. These values of KWIK prevent Programme GRID from calculating the energies at points which are reasonably far from any Target atom. The energies at those remote points would, of course, tend to be small so long as the Target atoms did not move. However, when MOVE>0 the Target atoms can move, and a grid point which was originally far from the Target might suddenly have a Target atom close by. The calculated energy at such a point might then be large, and it would be inappropriate to neglect it by setting KWIK=1 or KWIK=2. The GRID run would be aborted in this case, and Error Message D1065 would be issued. The settings KWIK=3 and KWIK=4 work differently (See above under KWIK), but these values of KWIK are also forbidden when MOVE>0. Finally, also KWIK=5, KWIK=6, KWIK=7 are forbidden when MOVE>0.

  • MOVE>0 may not be used if you are studying a Set of Targets, because each Target molecule would respond differently to the Probe and so the outputs would not be comparable.

  • MOVE=1 may be used if you are running GRID in order to prepare an input file for Programme GLUE, but this is not recommended. It is also important to remember that the same Probe may not be used twice by GLUE, and you may have to make a decision. For instance, if you wanted to use the sp3 aliphatic hydroxyl Probe (O1) to generate input for GLUE, you could either have a GRIDKONT file that had been prepared with the recommended value MOVE=0, or a file prepared with MOVE=1. However, these two files may not be used together as simultaneous inputs for the same run of GLUE.

28.3.18.1. The regular effects of direct MOVE

When directive MOVE=1 in GRIN, and MOVE=1 or 2 or 3 in GRID, the effects are like this:

  • Programme GRIN computes extra information about the Target, and prints this to the GRINKOUT file which becomes the input to GRID.

  • Programme GRID assigns each Target atom to one of three main classes: either it is in the Core where it cannot move in response to the Probe; or it can move as part of a flexible chain; or it can move as part of a Bead. See Section 21.4 for more detailed information.

  • Programme GRID uses the traditional Grid algorithms to deal with atoms in the Core of the Target. It treats Target atoms and Beads which can move as described in Section 21.4.5.

28.3.18.2. The special value MOVE = -99

A special value MOVE=-99 may be used when Probe Molecules are being studied (See Section 41.5). Grid normally tries to adjust the position of a Probe Molecule in its binding cleft, in order to relax close contacts. This relaxation is recommended, but will not take place if MOVE=-99 in the Grid run.

28.3.19. Directive NETA

NETA is the Number of Extra Target Atoms. New Users should use the default values which are automatically assigned, as described at the end of this Section on NETA. The following notes are for more experienced Users.

28.3.19.1. Using NETA on a single target

Programme Grid is normally run on one single job with one single Target. This single Target may be just one molecule, or it may contain several proteins, ligands, counter-ions and solvent molecules making a more complex Target. Alternatively, Grid may be used to study several Targets one after another as a Set. This Set Mode is used, for example, if a set of Grid Maps are required for a set of related drug molecules, so that Statistical analyses can be performed on the individual energy values generated by Grid.

When there is only one Target (Macromolecule Mode; or Small Molecule Mode; or Mixed Mode) the following arrangements for NETA will apply. The arrangements are different when a Set of Targets is being studied (see below).

In order to understand the usefulness of assigning a non-default value to NETA, it is necessary to study the input file GRINKOUT. This file is a list of the ATOMS in the Target, and these ATOMS may be followed by a list of HETATMS and finally a list of computed hydrogen positions. When the Target is a protein or other macromolecule, the HETATMS may include bound water, other solvent molecules, ligands, substrates and so on.

Programme GRID would not normally treat the HETATMS as a part of the macromolecular Target, if ATOMS and HETATMS were both present in the GRINKOUT file; i.e. if Programme GRID was being used in Mixed Mode. In this case the default would be to ignore the HETATMS, and only treat the ATOMS of the macromolecule as the Target. The default value is NETA = 0. This default value is used, so that the HETATMS which represent water molecules will not be considered as part of the macromolecular Target in Mixed Mode. These water molecules may be present in large numbers in the original PDB file for a protein. If the User wishes to treat some of the HETATMs as a part of the Target, then Directive NETA must be assigned an appropriate value as described below.

Macromolecule Mode deals with Targets which contain only ATOMS. Small Molecule Mode deals with Targets which contain only HETATMS. The default value of NETA=0 is also appropriate in both these modes. In this case, however, Programme GRID will automatically change NETA to the correct value (if necessary) like this:

28.3.19.2. Using NETA in macro-molecule mode

In Macromolecule Mode, when there is only one Target which contains ATOMS but no HETATMS, the value of NETA will be zero; ie: no change will be made from the default.

28.3.19.3. Using NETA in small molecule mode

In Small Molecule Mode, with one Target which contains HETATMS but no ATOMS, the value of NETA will be increased to the number of HETATMS in the Target.

28.3.19.4. Using NETA in mixed mode

NETA should be set by the User when Mixed Mode is required; ie: when the Target contains both ATOMS and HETATMS. In this case NETA may be used in one of four different ways if a single Target is being studied. Different arrangements are made if several Targets are being studied together as a Set (see Chapter 36).

Users need only study Method 1 initially. This method is the most straightforward and can often be used. At a later stage most Users also try the other methods, and then choose the approach they prefer. The four methods are:

  1. Consider by way of example a macromolecule which has two metal groups firmly embedded in the macromolecular matrix, notwithstanding that the metals are actually hetero groups; i.e. they are HETATMS with Energy Variables from the HET section at the end of datafile GRUB. Assume, for the sake of simplicity, that these two metal groups are the only HETATMS in the Target. By default these two HETATMS would be excluded from the Target because ATOMS are also present (Mixed Mode). However the metal groups are firmly embedded in the protein, and the User might NOT want to treat them as displaceable ligands but rather as a part of the macro- molecule. In this case an instruction would be needed to tell Programme GRID that the two HETATMS should be used in the computation just like normal Target atoms. This would be done by setting NETA=2 It must be emphasised that this first method would not work correctly unless the two metal groups were the only HETATMS in the Target. Moreover they must come at the end of the PDB file, immediately after any ATOMS. (Note, however, that a TER record would be required just before the HETATMS, at the end of the ATOM list, if the ATOMS constituted a protein chain ending with a terminal carboxy group. The TER record would come after the OXT ATOM at the end of the protein chain).

  2. Secondly consider a PDB input file containing many ATOMS and many HETATMS. To use NETA effectively in this case it would first be necessary to edit the PDB file so that the wanted HETATMS (i.e. the HETATMS which were to be treated as part of the Target) came IMMEDIATELY after any Target ATOMS. They must be moved to the top of the HETATM list in the original PDB file (This is most easily done with a page editor) before Programme GRIN is used to generate the GRINKOUT file for input to GRID. This GRINKOUT file will start with Target ATOMS which have been assigned Energy Variables from the Recognised Molecules at the start of datafile GRUB. It will then contain the HETATMS which are to be part of the Target, and these will be followed by any other HETATMS in the PDB file. Finally there will be a list of hydrogens. The HETATMS and hydrogens will be marked with a star * or a cross X. When running Programme GRID, directive NETA is the number of HETATMS which will be included with the ATOMS of the Target molecule. These are the Extra Target Atoms, and NETA is the "Number of Extra Target Atoms". In effect, NETA moves the pointer between ATOMS and HETATMS, so that some HETATMS at the top of the HETATM list are reclassified with the ATOMS of the Target, and are therefore used in the GRID computation. It is therefore necessary to set NETA equal to the number of required HETATMS which were moved with the page editor to the top of the HETATM list in the original PDB file. Note that the first method of using NETA, described above, is a special case of this second method. In the first method it is assumed that all the HETATMS are wanted. Another way of including all the HETATMS is to set NETA to a very large value, in which case Programme GRID will reduce it appropriately to include exactly all the HETATMS.

  3. The third method is similar to the above, but the page editor is used to remove all the unwanted HETATMS from the PDB file. A new GRINKOUT file is then created with Programme GRIN, and this consists of ATOMS followed by HETATMS and hydrogens. All the remaining HETATMS are wanted in the Target, and the User should set directive NETA to a large positive value. Programme GRID will automatically reduce it to the exact number of HETATMS present.

  4. The fourth procedure is to begin with the standard GRINKOUT file, and set NETA to a large value so that all the HETATMS will be included with the ATOMS of the Target. NETA will by default be reduced by Programme GRID to the exact number of HETATMS and a message will be sent to the lineprinter file GRIDLONT at run-time. Before starting the GRID run, however, the cross X is used to exclude any unwanted ATOMS or HETATMS from the input file GRINLOUT. Thus Programme GRID will only use for its Target the remaining ATOMS and HETATMS which are uncrossed. A possible problem with this fourth method, is that the hydrogens bonded to a heavy atom must also be crossed off the GRINKOUT file at the same time as their heavy atom itself. This will usually call for careful book-keeping.

28.3.19.5. Using NETA in set mode

When a set of Targets is being studied all together, one after the other as a Set, the following arrangements for NETA apply. The User should first read the above Sub-Section describing how to use NETA on a single Target, and should also look at the main Section on the use of Grid in order to study a Set of Targets (see Chapter 36). Then look at the following description.

The individual Targets of a Set are normally individual small molecules which are being studied one after the other. Each molecule is composed of HETATMS. The default value of NETA is set equal to Parameter MAXDIM, which is normally 24000. Programme Grid then reduces this default to the appropriate value for each successive Target, and so each Target is processed correctly.

Other Users may wish to study a Set of Targets in which each individual Target contains both ATOMs and HETATMs. For example each Target might be a protein (ATOMs) and the counter-ions (HETATMs) which are required in order to have overall electro-neutrality. Once again the default value of NETA is set equal to MAXDIM, and this is reduced by Programme Grid to the appropriate value for each Target; ie all the counter-ions would be included as part of the Target.

This default arrangement is different from the default in Mixed Mode when a single Target of ATOMs and HETATMs is being studied. This is the difference:

  • By default with one Target in Mixed Mode, none of the HETATMs are included with the ATOMs of the Target. The default value is NETA = 0 and this default must be changed if some HETATMs are needed.

  • By default in Set Mode, all of the HETATMs are included with the ATOMs of their Target. The default is NETA = 24000 and this default cannot be changed by the User. Thus, as each successive Target of the Set is processed, it will be processed by Programme Grid with all its ATOMs and all its HETATMs. Therefore, if you are working in Set Mode, and only want to have ATOMs in the Target, you must prepare the input file for Programme Grid, so that it does not contain any HETATMs.

The reason for this different default is as follows. New Users should have a simple default in Mixed Mode, so that the receptor cleft of the enzyme which they are studying is not completely blocked by solvent molecules. Their default is therefore NETA = 0, and they can increase this to the appropriate value if they wish to include some counter-ions or other HETATMs as part of the Target in Mixed Mode.

In Set Mode, however, each successive Target might need a different value of NETA. For example, the first protein Target might have six counter-ions and the second might have ten. If the default value was NETA = 0 then the User would have to assign an individual value of NETA for each Target of the Set, and this would be cumbersome. Therefore the default is NETA = 24000 which is automatically reduced to the appropriate number for each successive Target of the Set.

One important difference between the default arrangements should be noted. None of the HETATMs is included by default with the ATOMs of the single Target in Mixed Mode. On the other hand all of the HETATMs are included by default with their Target, as the successive Targets are studied one after the other in Set Mode.

WATER MOLECULES

In some computations the User may be able to delete all water molecules from the PDB input file, before it is submitted to GRIN and GRID. Deleting the waters like this will give faster runs, than setting NETA=0 while leaving the waters as HETATMS in the PDB file. Whether the waters can be omitted of course depends on the objective of the particular computation.

TER RECORDS

When the ATOMS of the Target constitute a protein chain ending with an OXT atom and a carboxy terminal group, there should always be a TER Record after the OXT carboxy oxygen. This is particularly important if HETATMS are also present and Directive NETA is used.

Defaults when there is only one Target:

NETA = 0

if the Target only contains ATOMS. All the ATOMS will be included in the Target (unless they are marked with a cross).

NETA = 0

if it contains ATOMS and HETATMS. With this default any HETATMS will be ignored unless the User resets NETA him/herself.

NETA = 0

if the Target only contains HETATMS, but this default value will be altered automatically to the total number of HETATMS unless the User resets NETA him/herself.

Defaults when there are several Targets which are being studied one after the other as a Set. The User cannot reset these defaults for NETA:

NETA = 24000

if a Target only contains ATOMS. All ATOMS will be included in the Target (unless they are marked with a cross).

NETA = 24000

if a Target has ATOMS and HETATMS. All the HETATMs will be included with the ATOMs of their Target (unless they are marked with a cross).

NETA = 24000

if a Target only contains HETATMS. This default value will be altered automatically to the correct number of HETATMS in each successive Target.

Limits: 0 <= NETA <= Total number of hetero atoms.

28.3.20. Directive NPLA

NPLA is the Number of Planes of grid points per Angstrom, and determines the resolution of the computation. Thus the GRID points will be 1.0 Angstrom apart if NPLA=1, but if NPLA=3 the GRID points will be 0.333 Angstrom apart. In this case the calculation will take about 27 times longer to run, unless TOPX.......BOTZ are used in order to restrict the computation to a smaller region of the Target.

Directive NPLA must normally be a positive integer, and integer values are always recommended. However, very large output files can be generated if you are studying several Targets one after the other as a Set, even if you set NPLA = 1. In this special case therefore, NPLA may be assigned a fractional value which causes the points to be more widely spaced, and reduces the size of the output file to a smaller size.

Default: NPLA = 1

0 < NPLA

28.3.21. Directive NUMB

NUMB is the number of tables printed in GRIDLONT by the lineprinter at each stage of the computation. The computation is performed on one z-plane of grid points at a time, and at least one table is printed for each plane. This table describes the environment around the minimum energy position on that particular z-plane.

NUMB controls the number of Tables describing the environment around other favoured positions (i.e. low energy positions but not the absolute minimum) on each z-plane. In many cases the number of Tables actually printed may be less than NUMB. This will happen, for example, if no grid points on the z-plane have energies below EMAX.

Remember that a Table can be calculated for any required position by doing a GRID run with TOPX=BOTX etc,etc or by using Directive POSI.

If NUMB=1 and LENG=0 the Table headings will still be printed, but the data will be suppressed and the GRIDLONT output file will be much shorter. When a set of compounds is being studied one after the other, and the input comes from a FILE.LIST, you can set NUMB=0 The Table headings will then be suppressed and the GRIDLONT file will be shorter still.

The default value of Directive DEEP will be altered if NUMB has been set to a value greater than 1 (see DEEP above).

Default: NUMB = 1 0 < NUMB < 10 with GRINKOUT input -1 < NUMB < 10 with FILE.LIST input

28.3.22. Directive POSI

POSI is used to define the position of specific grid points at which the Probe will be placed and the interaction energy computed. The x, y, z coordinates of each required point are entered on a separate line thus:

POSI 12.3 4.56 7.890

with free format for the numbers. x, y and z may take any reasonable values. Do not use Tabs (the "TAB" character) in order to format your input.

More than one POSI directive may be entered in the file of directives "grid.in". The maximum acceptable number of POSI directives is defined by Parameter MAXPOS. (MAXPOS = 100000 when Programme Grid is supplied.)

Note that Directive POSI will over-ride Directives TOPX ... BOTZ. The latter Directives should not be used together with POSI.

28.3.22.1. HETATM records as POSI directives

Brookhaven Protein Data Bank HETATM records may be used instead of POSI directives. Each HETATM record must exactly comply with the HETATM format as specified by Brookhaven. In particular the 'H' of HETATM must be the first character on the line. A whole list of HETATM records can be included in the command file for GRID, in exactly the same way as a list of POSI records.

This is particularly convenient because many programs give output in PDB format. For instance the output from Programme MINIM is a list of all the favourable minima in a Grid map. If such a map had been generated by a water Probe, the list of minima would be a list of favourable water positions. It could then be used as a POSI list, in order to run GRID and obtain a GRIDLONT lineprinter output table describing the environment of each water position.

28.3.22.2. Output when POSI has been set

28.3.22.2.1. Lineprinter output

There is output to the lineprinter file GRIDLONT when POSI has been set, with one output Table per POSItion. The length of this Table may be adjusted as usual by Directive LENG. If LENG and NUMB are both set to Zero, then no Tables will be printed, and this is recommended if many POSI directives are being used one after the other.

28.3.22.2.2. Gridkont output

No map is written to GRIDKONT, because the defined POSItions are not in general on a regularly mapped GRID. Instead of a map, the binary GRIDKONT output file contains a single record for each POSItion. This record contains the unformatted binary values of x, y, z and E for the POSItion, and it is written like this:

WRITE (KONT) X,Y,Z,E The GRIDKONT file may then be used as input for further computations.

28.3.22.2.3. Programme b2p

When POSI values have been set, the GRIDKONT output is a computer-readable binary file. Programme B2P may be used to translate this output into PDB format which can be read by eye. Each binary record is turned into a HETATM line in which the name of the atom is shown as ATM, and the name of the molecule is MOL. The first record will look like this:

 HETATM    1 ATM  MOL  -999      66.000   3.000   7.000 -1.76

Note that a distinctive number (-999) is used for the molecule number of this record which corresponds to the first POSI position in the command file.

Subsequent records (corresponding to subsequent POSI positions) will have the molecule numbers -998 -997 and so on. The Grid energy for each position is shown in the column which is normally reserved in a PDB file for the occupancy, and so the energy of the chosen Probe at the first POSI position in the above record is -1.76 Kcal/mole. Further information is given below under the heading: PROGRAMME B2P.

PROGRAMME B2P WITH SEVERAL PROBES

Several different Probes may be listed in the command file for Grid when the positions of the individual grid points are defined by POSI records. In this case B2P will only write the energy of the first Probe to each HETATM record, and the energies of the other Probes will not be transcribed.

28.3.22.3. Generating POSI values

The output from many Programmes is a list of atom positions written in Brookhaven HETATM format, and these can be used as a batch of POSI values. In other circumstances the User may want to write a jiffy program which will generate a list of the POSI positions at which Grid energies are required. For instance he or she might want grid points distributed at spherically or cylindrically defined positions, or over the Van der Waals surface of the Target, instead of an orthogonal Grid.

Any distribution of grid points can be achieved, and pasted as a set of POSItions to the command file for Grid. It should be noted, however, that some graphics programs may not be able to display the results unless the Grid map is conventionally arranged.

28.3.23. Directives TOPX, TOPY AND TOPZ

TOPX, TOPY, TOPZ. See above at the start of this Section.

28.3.24. The symbol defining the probe

The SYMBOL defining the Probe is usually included somewhere in the list of Control directives as described below.

28.3.25. Directive VALU

VALU is a missing value which may be used if you are studying several Targets one after the other as a Set, and if the value of directive KWIK has been set to 4 (See above under KWIK). If VALU=0.0 no missing value is used, and KWIK=4 has the effect of causing 'wasted' grid points to be omitted from the GRIDKONT output file. If VALU has a non-zero value, then that value is inserted into the GRIDKONT file at each "wasted" point (See above under KWIK for the explanation of a 'wasted' point).

If VALU is given a small value such as 0.00001 this will be rounded down to zero if the GRIDKONT output file is printed as an eye-readable ASCII file. A missing value of 0.0 will then be entered into the output which may or may not be what you wanted. We recommend -99.99 as a missing value, since this is a physically impossible number for the actual Grid energy in Kcal/mole.

If directive LIST is 2 or -2 the GRIDKONT file will be written in binary, and an exact value such as 0.00001 can then be output (see above under Directive LIST). However 0.0001 could actually be a physically real value, and we therefore recommend -99.99 as before.

VALU may only be used if KWIK=4

Default: VALU=0.0

Note: IEND must always appear on a line by itself after all the other directives. If no directives are used, IEND must still be input immediately before the job title, to show that the run may begin. No numerical value is given to IEND. See Section K.6.

Latest versions

Login

Username

Password

Register | Lost password?