.. -*- coding: utf-8 -*- .. _multiseq: =========================================== Structural Bioinformatics with VMD MultiSeq =========================================== Open :program:`VMD`. With the `MultiSeq plugin`_ it provides a convenient interface to do structural bioinformatics. We will manually select one PDB code from each organism. (Normally, one would do a more careful selection, e.g., taking resolution into account). Open :menuselection:`Extensions --> Tk Console` and type .. code-block:: tcl set pdbcodes {4jzk 1ak2 2c9y 1aky 2ak3 1zd8 2ar7 1p3j 3gmt 1ake 4pzl 1zin 4k46 1s3g 3be4 3fb4 3tlx} foreach pdb $pdbcodes {puts "Loading $pdb..."; mol new $pdb} (4np6 excluded because it does not parse easily, and I did not have time to check what was wrong.) In VMD, load :menuselection:`Extensions --> Analysis --> MultiSeq`. (This can take a moment when it downloads updates.) Manually delete all chains B, C, ... (highlight and :kbd:`delete`.) .. Delete 4np6 (STAMP complains). Hide rep. Perform a STAMP structural alignment: In Multiseq choose :menuselection:`Tools --> Stamp Structural Alignment`. .. For changing all representations to Tube: .. 1. go to top molecule .. 2. change active rep to Tube (and color) .. 3. :menuselection:`Extensions --> Visualization --> Clone Representation`: .. From Top to All Structural conservation ======================= :math:`Q` (``Qres``) is a measure of structural similarity. :math:`Q` is a parameter that indicates structural identity. :math:`Q` accounts for the fraction of similar native contacts between the aligned residues in two proteins [Eastwood2001]_. :math:`Q=1` implies that structures are identical. When :math:`Q` has a low score (0.1-0.3), structures are not aligned well, i.e., only a small fraction of the Cα atoms superimpose. :math:`Q` per residue is the contribution from each residue to the overall :math:`Q` value of aligned structures. 1. In Multiseq window, choose :menuselection:`View --> Coloring --> Qres`. 2. Observe the coloring in the sequence alignment and the graphics window (projected on structures) Note that the CORE domain has high ``Qres``. This indicates that it superimposes well in all structures. Sequence conservation ===================== Color by *Sequence Identity*. Note the residues that are 100% conserved (:menuselection:`Search --> Select Residues...`: Where Sequence Idenity >= 100): - R, K - G, P Switch to *1ake* and create a new rep for ``chain A and resname AP5`` (use *CPK* or *VDW* and color by *name*) What is the role of the conserved R (Arg) and K (Lys)? (MultiSeq :menuselection:`View --> Highlight Color --> ResType`). Phylogenetic tree ================= You need an alignment to create a tree. A phylogenetic tree displays evolutionary relationships. :menuselection:`Tools --> Phylogenetic Tree`. - using Percent Identity - label with full organism name Note that this tree is based on the structural alignment and the conformational change that is visible obscures some of the evolutionary relationships. References ========== .. [Eastwood2001] Eastwood, M.P., C. Hardin, Z. Luthey-Schulten, and P.G. Wolynes. “Evaluating the protein structure-prediction schemes using energy landscape theory.” IBM J. Res. Dev. 45: 475-497, 2001. URL: http://www.research.ibm.com/journal/rd/453/eastwood.pdf .. _`MultiSeq plugin`: http://www.ks.uiuc.edu/Research/vmd/plugins/multiseq/