skip to main content
10.1145/2975167.2975214acmconferencesArticle/Chapter ViewAbstractPublication PagesbcbConference Proceedingsconference-collections
research-article

Multiscale Approximation with Graphical Processing Units for Multiplicative Speedup in Molecular Dynamics

Published: 02 October 2016 Publication History

Abstract

The timescales and structure sizes accessible via simulations of atomistic molecular dynamics (MD) can be advanced substantially by two independent techniques: (1) many-core parallelization with graphics processing units (GPUs) and (2) multiscale approximation with hierarchical charge partitioning (HCP). Achieving efficient many-core parallelization on the GPU generally requires highly synchronized and regular computation across the GPU. However, multiscale methods can result in highly asynchronous and irregular processing. Thus, one might expect that realizing such multiscale algorithms on the GPU would result in an overall loss of performance and that the total speedup obtained would be less than the product of the individual speedups for the two techniques separately, i.e., less than multiplicative speedup.
To test this expectation in the context of atomistic MD, we designed and implemented our HCP multiscale method on NVIDIA GPU platforms. The HCP code was implemented in NAB, short for nucleic acid builder, and tested using the distance-dependent-dielectric, implicit solvent model. (NAB is the molecular dynamics module in the open-source Amber-Tools v1.4.) We show that for the HCP multiscale approximation and the common MD simulation model considered here, the degradation in performance due to asynchronous and irregular processing is mostly offset by a corresponding reduction in other asynchronous operations and slow global memory accesses. As a result, we realize near multiplicative speedups. For example, for a 475,000-atom virus capsid we were able to achieve a 11,071-fold combined speedup, only slightly less than the 11,706-fold multiplicative limit speedup -- 48.0-fold from the parallelization on the GPU times 243.9-fold from the multiscale approximation. The overall speedup depends on structure size, with smaller structures having lower speedups. An additional benefit of the HCP implementation on the GPU is the reduced memory requirement, which allows the processing of much larger structures that would otherwise be impossible on the limited memory GPU platform.

References

[1]
G. G. Dodson, D. P. Lane, and C. S. Verma. Molecular simulations of protein dynamics: new windows on mechanisms in biology. EMBO reports, 9:144--150, 2008.
[2]
M. Karplus and J. Kuriyan. Molecular dynamics and protein function. Proc Natl Acad Sci USA, 102(19):6679--6685, May 2005.
[3]
J. L. Klepeis, K. Lindorff-Larsen, R. O. Dror, and D. E. Shaw. Long-timescale molecular dynamics simulations of protein structure and function. Curr Opin Struct Biol, 19(2):120--127, April 2009.
[4]
C. C. Gruber and J. Pleiss. Systematic benchmarking of large molecular dynamics simulations employing GROMACS on massive multiprocessing facilities. J. Comput. Chem., 32(4):600--606, 2011.
[5]
R. Schulz, B. Lindner, L. Petridis, and J. C. Smith. Scaling of Multimillion-Atom Biological Molecular Dynamics Simulation on a Petascale Supercomputer. J Chem Theory Comput, 5(10):2798--2808, October 2009.
[6]
M. S. Friedrichs, P. Eastman, V. Vaidyanathan, M. Houston, S. Legrand, A. L. Beberg, D. L. Ensign, C. M. Bruns, and V. S. Pande. Accelerating molecular dynamic simulation on graphics processing units. J Comput Chem, 30(6):864--872, 2009.
[7]
T. Darden, D. York, and L. Pedersen. Particle mesh Ewald: An N.log(N) method for Ewald sums in large systems. J Chem Phys, 98(12):10089--10092, 1993.
[8]
U. Essmann, L. Perera, M. L. Berkowitz, T. Darden, H. Lee, and L. G. Pedersen. A smooth particle mesh Ewald method. J Chem Phys, 103(19):8577--8593, 1995.
[9]
G. D. Hawkins, C. J. Cramer, and D. G. Truhlar. Parametrized models of aqueous free energies of solvation based on pairwise descreening of solute atomic charges from a dielectric medium. J. Phys. Chem., 100:19824--19836, 1996.
[10]
A. Onufriev, D. Bashford, and D. A. Case. Modification of the generalized Born model suitable for macromolecules. J Phys Chem B, 104(15):3712--3720, 2000.
[11]
J. A. Grant, B. T. Pickup, M. J. Sykes, C. A. Kitchen, and A. Nicholls. The gaussian generalized born model: application to small molecules. Phys. Chem. Chem. Phys., 9(35):4913--4922, 2007.
[12]
M. Feig. Kinetics from implicit solvent simulations of biomolecules as a function of viscosity. J. Chem. Theory Comput., 3(5):1734--1748, September 2007.
[13]
I. Buck, T. Foley, D. Horn, J. Sugerman, K. Fatahalian, M. Houston, and P. Hanrahan. Brook for GPUs: stream computing on graphics hardware. In International Conference on Computer Graphics and Interactive Techniques, pages 777--786. ACM New York, NY, USA, 2004.
[14]
Jen-Hsun Huang. Opening Keynote, NVIDIA GTC 2010. http://livesmooth.istreamplanet.com/nvidia100921/.
[15]
The Top500 Supercomputer Sites. http://www.top500.org/list/2011/11/100.
[16]
J.D. Owens, M. Houston, D. Luebke, S. Green, J.E. Stone, and J.C. Phillips. GPU Computing. Proceedings of the IEEE, 96(5):879--899, May 2008.
[17]
D. J. Hardy, J. E. Stone, and K. Schulten. Multilevel summation of electrostatic potentials using graphics processing units. Parallel Comput, 35(3):164--177, 2009.
[18]
J. A. Baker and J. D. Hirst. Molecular Dynamics Simulations Using Graphics Processing Units. Mol. Inf., 30(6-7):498--504, 2011.
[19]
N. Ganesan, B. A. Bauer, T. R. Lucas, S. Patel, and M. Taufer. Structural, dynamic, and electrostatic properties of fully hydrated DMPC bilayers from molecular dynamics simulations accelerated with graphical processing units (GPUs). J. Comput. Chem., 32:2958--2973, 2011.
[20]
R. Anandakrishnan, T.Scogland, A.T. Fenley, J.C. Gordon, W. Feng, and A.V. Onufriev. Accelerating electrostatic surface potential calculation with multiscale approximation of graphical processing units. J Mol Graphics Model, 28:904--910, 2010.
[21]
R. Anandakrishnan and A.V. Onufriev. An n log n approximation based on the natural organization of biomolecules for speeding up the computation of long range interactions. Journal of Computational Chemistry, 31(4):691--706, 2010.
[22]
The Amber Molecular Dynamics Package, 2011. http://www.ambermd.org.
[23]
J. Ramstein and R. Lavery. Energetic coupling between dna bending and base pair opening. Proc. Natl. Acad. Sci. U.S.A., 85(19):7231--7235, 1988.
[24]
J.C. Gordon, A. T. Fenley, and A. Onufriev. An analytical approach to computing biomolecular electrostatic potential. II. Validation and applications. J Chem Phys, 129:075102, 2008.
[25]
W. M. Brown, P. Wang, S. J. Plimpton, and A. N. Tharrington. Implementing Molecular Dynamics on Hybrid High Performance Computers - Short Range Forces. J Comput Phys Comm, 182:898--911, 2012.
[26]
R. Anandakrishnan, M. Daga, and A. V. Onufriev. An n log n Generalized Born Approximation. J. Chemical Theory and Computation, 7(3):544--559, March 2011.
[27]
H. Wang and E. Nogales. Nucleotide-dependent bending flexibility of tubulin regulates microtubule assembly. Nature, 435(7044):911--915, June 2005.
[28]
V. Hornak, R. Abel, A. Okur, B. Strockbine, A. Roitberg, and C. Simmerling. Comparison of multiple Amber force fields and development of improved protein backbone parameters. Proteins, 65(3):712--725, 2006.
[29]
AMBER 11 NVIDIA GPU Acceleration Support, 2011. http://www.ambermd.org/gpus.
[30]
S. Ryoo, C. I. Rodrigues, S. S. Stone, S. S. Baghsorkhi, S. Ueng, J. A. Stratton, and W. W. Hwu. Program optimization space pruning for a multithreaded gpu. In CGO '08: Proceedings of the 6th IEEE/ACM International Symposium on Code Generation and Optimization, pages 195--204, New York, NY, USA, 2008. ACM.
[31]
A. M. Aji, M. Daga, and W. Feng. CampProf: A Visual Performance Analysis Tool for Memory Bound GPU Kernels. Technical report, Virginia Tech, 2010.
[32]
A. M. Aji, M. Daga, and W. Feng. Bounding the Effect of Partition Camping in GPU Kernels. In ACM International Conference on Computing Frontiers, Ischia, Italy, May 2011.
[33]
Amber (PMEMD) NVIDIA GPU Support - Benchmarks, 2011. http://ambermd.org/gpus/benchmarks.htm#Benchmarks.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
BCB '16: Proceedings of the 7th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics
October 2016
675 pages
ISBN:9781450342254
DOI:10.1145/2975167
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 02 October 2016

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Biomolecular electrostatics
  2. graphical processing unit (GPU)
  3. molecular dynamics
  4. multiscale approximation

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

BCB '16
Sponsor:

Acceptance Rates

Overall Acceptance Rate 254 of 885 submissions, 29%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 60
    Total Downloads
  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 14 Sep 2024

Other Metrics

Citations

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media