 
 
PSA: A Method for Quantifying Macromolecular Pathways
Diverse classes of proteins function through large-scale conformational changes and various sophisticated computational algorithms have been proposed to enhance sampling of these macromolecular transition paths. Because such paths are curves in a high-dimensional space, it has been difficult to quantitatively compare multiple paths, a necessary pre-requisite to, for instance, assess the quality of different algorithms. In the paper Path Similarity Analysis: A Method for Quantifying Macromolecular Pathways [PLoS Comp Biol 11 (2015), e1004568] we introduce a method named Path Similarity Analysis (PSA) that enables us to quantify the similarity between two arbitrary paths and extract the atomic-scale determinants responsible for their differences.1
Implementation, Availability and Documentation
PSA is implemented as the MDAnalysis.analysis.psa module in the MDAnalysis Python library.
A tutorial is available at github.com/Becksteinlab/PSAnalysisTutorial, including example Jupyter notebooks such as the example for comparing multiple path sampling algoritms.
Summary
PSA utilizes the full information available in 3N-dimensional configuration space trajectories by employing the Hausdorff or Fre?chet metrics (adopted from computational geometry) to quantify the degree of similarity between piecewise-linear curves. It thus completely avoids relying on projections into low dimensional spaces, as used in traditional approaches.
To elucidate the principles of PSA, we quantified the effect of path roughness induced by thermal fluctuations using a toy model system with variable dimensionality. Temperature has a much more pronounced effect than dimensionality and the “double barrel” potential system shows a transition from two distinct groups of paths at lower temperatures to paths that are indistinguishable at higher temperatures.
Using, as an example for a realistic macromolecular transition, the closed-to-open transitions of the enzyme adenylate kinase (AdK) in its substrate-free form, we compared a range of protein transition path-generating algorithms. Molecular dynamics-based dynamic impor- tance sampling (DIMS) MD and targeted MD (TMD) and the purely geometric FRODA (Framework Rigidity Optimized Dynamics Algorithm) were tested along with seven other methods publicly available on servers, including several based on the popular elastic network model (ENM). PSA with clustering revealed that paths produced by a given method are more similar to each other than to those from another method and, for instance, that the ENM-based methods produced relatively similar paths. PSA applied to ensembles of DIMS MD and FRODA trajectories of the conformational transition of diphtheria toxin, a particularly challenging example, showed that the geometry-based FRODA occasionally sampled the pathway space of force field-based DIMS MD.
For the AdK transition, the new concept of a Hausdorff-pair map enabled us to extract the molecular structural determinants responsible for differences in pathways, namely a set of conserved salt bridges whose charge-charge interactions are fully modelled in DIMS MD but not in FRODA. PSA has the potential to enhance our understanding of transition path sampling methods, validate them, and to provide a new approach to analyzing conformational transitions.
References
- a Seyler SL, Kumar A, Thorpe MF, Beckstein O (2015) Path Similarity Analysis: A Method for Quantifying Macromolecular Pathways. PLoS Comput Biol 11 (10): e1004568. doi: 10.1371/journal.pcbi.1004568









Discuss: “PSA: A Method for Quantifying Macromolecular Pathways”