next up previous contents index
Next: Locally Enhanced Sampling Up: Additional Simulation Parameters Previous: Adaptive Biasing Force Calculations   Contents   Index

Subsections


Alchemical Free Energy Perturbation Calculations

This feature has been contributed to NAMD by the following authors:

Surjit B. Dixit and Christophe Chipot

Equipe de dynamique des assemblages membranaires
Institut nancéien de chimie moléculaire,
UMR CNRS/UHP 7565,
Université Henri Poincaré,
BP 239,
54506 Vand\oeuvre-lès-Nancy cedex, France

Introduction and theoretical background

A method to perform alchemical free energy perturbation (FEP)  [28,4,27,25,16,12,17,9] within NAMD has now been implemented. In FEP, the free energy difference between two states, $a$ and $b$, is expressed by:


\begin{displaymath}
\Delta A_{a \rightarrow b} = -k_B T \ \ln
\left< \exp\left[-...
...}) -
{\cal H}_a({\bf r}, {\bf p})}
{k_B T}\right]
\right>_a
\end{displaymath} (7)

wherein $k_B$ is the Boltzmann constant, $T$ is the temperature, and ${\cal H}_a({\bf r}, {\bf p})$ and ${\cal H}_b({\bf r}, {\bf p})$ are the Hamiltonians characteristic of states $a$ and $b$, respectively. $\left< \cdots \right>_a$ denotes an ensemble average over configurations representative of the initial state, $a$. In practice, the transformation between the two thermodynamic states is replaced by a series of transformations between non-physical, intermediate states along a pathway that connects $a$ to $b$. This pathway is characterized by a variable, referred to as ``coupling parameter'', [4,17,15] $\lambda $, that makes the free energy a continuous function of this parameter between $a$ and $b$:


\begin{displaymath}
\Delta A_{a \rightarrow b} = -k_B T \ \sum_{k = 1}^N \ln
\le...
...cal H}({\bf r}, {\bf p}; \lambda_k)}
{k_B T}\right]
\right>_k
\end{displaymath} (8)

Here, $N$ stands for the number of intermediate states, or ``windows'' between the initial and the final states.

In a typical FEP setup, that involves the transformation of one chemical species into another one, the atoms in the molecular topology can be separated into three groups: (i) a group of atoms that do not change during the simulation -- e.g.the environment, (ii) those atoms describing the initial state, $a$, of the system, and (iii) those that correspond to the final state, $b$, at the end of the alchemical transformation. The atoms representative of state $a$ do not interact with those of state $b$ throughout the entire molecular dynamics simulation. Such a setup, in which atoms pertaining to both the initial and the final states of the system are present in the molecular topology file -- i.e. the psf file -- is referred to as ``dual topology'' paradigm. [2,19] The hybrid Hamiltonian of the system, which is a function of the coupling parameter $\lambda $, that smoothly connects state $a$ to state $b$, is evaluated as:


\begin{displaymath}
{\mathcal H}(\lambda) = {\mathcal H}_0
+ \lambda {\mathcal H}_a
+ (1-\lambda) {\mathcal H}_b
\end{displaymath} (9)

where ${\mathcal H}_a$ is the Hamiltonian for the group of atoms representative of the initial state, $a$, and ${\mathcal H}_b$ characterizes the final state, $b$. ${\mathcal H}_0$ is the Hamiltonian for those atoms that do not undergo any transformation during the MD simulation.

For instance, in a transformation involving the mutation of an alanine side chain into that of glycine, using the FEP methodology, the topology of both the methyl group of alanine and the hydrogen borne by the C$_\alpha$ in glycine co-exist throughout the simulation (see Figure 5).

Figure 5: Dual topology description for an alchemical simulation. Case example of the mutation of alanine into glycine. The lighter color denotes the non-interacting, alternate state.
\includegraphics[width=14.5cm]{figures/dual_top}

The energy and forces are defined as a function of $\lambda $, in such a fashion that the interaction of the methyl group of alanine with the rest of the protein is effective at the beginning of the simulation, i.e.$\lambda $ = 0, while the glycine C$_\alpha$ hydrogen does not interact with the rest of the protein, and vice versa at the end of the simulation, i.e.$\lambda $ = 1. For intermediate values of $\lambda $, both the alanine and the glycine side chains participate in the non-bonded interactions with the rest of the protein, scaled on the basis of the current value of $\lambda $. It should be emphasized that these side chains, however, do not interact with each other.

It is, therefore, necessary to exclude explicitly in the setup those atoms that are created from those that will be annihilated in the course of the FEP calculation (see ``A tutorial to set up alchemical free energy perturbation calculations in NAMD'' available from the NAMD website).

It is also worth noting that the free energy calculation does not alter intramolecular potentials, i.e.bond stretch, valence angle deformation, torsions etc, during the simulation. In calculations targetted at the estimation of free energy differences between two states characterized by distinct environments -- e.g.a ligand bound to a protein in the first simulation, and solvated in water, in the second -- as is the case for most free energy calculations that make use of a thermodynamic cycle, perturbation of intramolecular terms, e.g.chemical bonds, can be safely avoided. [5]

Implementation of free energy perturbation in NAMD

The procedure implemented in NAMD is particularly adapted for performing free energy calculations that split the reaction path into a number of non-physical, intermediate $\lambda $-states, or ``windows''. Separate simulations can be started for each window. Alternatively, the TCL scripting ability of NAMD can be employed advantageously to perform the complete simulation in a single run. An example making use of such script is supplied at the end of this section.

The following keywords can be used to control free energy calculations aimed at alchemical transformations.

Note: Free energy calculations that rely upon equation (7) make use of an average temperature, which, in principle, should coincide with the value of the thermostat. Rather than employing the computed average of $T$, $\Delta A_{a \rightarrow b}$ is estimated with the target value of the temperature defined by the user. It is, therefore, necessary to activate some constant-temperature scheme to carry out FEP calculations.

Example of an input file for running FEP alchemical transformations

The following example illustrates the use of TCL scripting for running the alchemical FEP feature of NAMD:

fep		on  
fepfile		ion.fep
fepCol		X
fepOutfile	ion.fepout
fepOutFreq	5
fepEquilSteps	5000

set step 0.0
set dstep 0.1

while {$step <= 0.9} {
 lambda $step
 set step [expr $step+$dstep]
 lambda2 $step
 run  10000
}

Here, the pdb file read by NAMD to extract the information about perturbed atoms is biotin.fep. The pertinent information is present in the X column. The output file of the free energy calculation is biotinr.fepout, in which energies are written every 5 steps. $\delta \lambda $, the width of the windows, is set to 0.1. 5000 MD steps are performed in each window to equilibrate the system. In this particular instance, the current value of $\lambda $ is controlled by the statement set step. The FEP calculation is run until $\lambda $ reaches the value 0.9. In every window, 10000 MD steps are performed.

Description of FEP simulation output

The fepOutFile contains electrostatic and van der Waals energy data calculated at $\lambda $ and $\lambda2$, written every fepOutFreq steps. The column dE is the instantaneous energy difference for the current configuration. dE_avg and dG are the accumulated energy ensemble average and the corresponding free energy at the current time step, respectively. The temperature is specified in the penultimate column. Upon completion of fepEquilSteps steps, the calculation of dE_avg and dG is restarted. The accumulated net free energy change is output at each $\lambda $-value and at the end of the simulation. The cumulative average energy dE_avg value may be summed using, for instance, the trapezoidal rule, or a Gaussian quadrature, to obtain an approximate TI estimate for the free energy change during the run.


next up previous contents index
Next: Locally Enhanced Sampling Up: Additional Simulation Parameters Previous: Adaptive Biasing Force Calculations   Contents   Index
namd@ks.uiuc.edu