Read Section I Background, parts A-C of the annotated, on-line "Anatomy & Taxonomy of Protein Structure", which include explanation of the web version, Intro, Amino Acid & Backbone, and Levels of Error. Links to the Anatax, software, and kinemage files can all be found on the BCH222 course web site at http://kinemage.biochem.duke.edu/teaching/bch222/.

If you are not already somewhat familiar with polypeptide geometry, please study Fig. 5 in the Anatax. You may also want to explore kinemage 1 in the download arrow c1Basics.kin (132KB) file, which shows the same information in 3D. Open it in KiNG, left-drag to move the viewpoint around, and turn on & off the angle and atom labels and the H atoms with buttons on the right-hand button panel. Turn on the "spheres", to get a feeling for the bulk of the atoms and their bonded interpenetration. With spheres off, enable bond rotations under the Tools menu by choosing Specialty -> Suite rotation, which gives you a rotation widget with a choice of possible angles to rotate. Try PHI and PSI, then CHI1 and CHI2. [Don't use the "end" angles, which are a temporary artifact of the current process of updating KiNG to work with these lessons.] Left-button drag around the circle edge changes the angle quickly, right-button drag changes more slowly, and the + and - boxes very slowly. Quit out of KiNG when you're done.

Worksheet:  Getting Comfortable with Protein Geometry

  1. Finding direction and chain termini
    1. Open kinemage file download arrow 1a6m.kin.zip (464KB) in KiNG, which shows the mainchain from a crystal structure of oxy-myoglobin refined at 1Å resolution, with color-coded balls for the non-C atoms (O red, N blue). In the overview, drag to rotate the molecule slowly back & forth, to see in 3D the arrangement of helices that enclose the heme group (in pink). Turn on sidechains and sc atoms. The central Fe atom (orange) of the heme has an O2 ligand bound on one side; what is the amino-acid type and the residue number of the sidechain ligand on the other side? [You can click on an atom in the sidechain and read its pointID at bottom left of the graphics window, or figure it out by the shape and atom types of the sidechain.]
      ___________________________
    2. With just mainchain and mc atoms on, choose "helix side" from the Views pulldown menu. Given that the backbone carbonyl (CO) groups in a helix point toward the C-terminus and splay outward slightly, determine the N to C direction for the helix. In this view, is the beginning of the helix at the top or the bottom of the screen?  ___________________
      Then turn off mc atoms, and get used to identifying the carbonyl oxygens and their directionality without the red balls.
    3. Zoom out or choose the overview, and use the directionality of the helices to identify the N and C termini of the molecule. "Click" on selected atoms to check that you are correct, but one objective of this lesson is to learn chain direction and atom identity just from the geometry of the mainchain.
      What are the residue types and numbers for the first and last residues displayed?
      ___________________________        ___________________________
    4. Remembering that N precedes and CO follows the C-alpha in each residue, practice determining the local chain direction in loops and ends as well as helices. Move around in the molecule by a right-click to center on an atom, and zoom in or out as needed. [if you get confused about which is the C-alpha, briefly turn on either the sidechains, or the 'mc atoms' button for O and N atom markers.]
  2. Backbone geometry
    1. Choose the "helix end" view, and observe how each peptide forms a straight line seen down the helix axis, while the Calphas are at the corners between peptides. For several examples, turn to look along the peptide plane to see which atoms are planar and how accurately. In a given peptide, which 5 atoms are co-planar?   ______ , ______ , ______ , ______ , ______
    2. Without the sidechains or the mc atoms, practice identifying Calpha, N, and CO just from their relative geometries. Successive Calpha atoms should be very close to 3.8Å apart if the peptide bond is trans (there are no cis peptides in myoglobin). Measure 3 successive Calpha-Calpha distances by picking each Calpha in turn, and noting the distance between each atom pair, as reported at screen bottom center. The 3 distances are:   ______Å,  ______Å,  ______Å
      *Successive Calphas for most peptides are "trans" = "across" the CO-NH amide bond: C-alpha on opposite sides
      but some (almost always preceding a Pro) are "cis" = "same-side":
    3. KiNG and Mage have further tools to help study geometrical features. Pull down the "Tools" menu and turn on the "Measure" function; the information line at the bottom of the screen will now give you successive identity, distance, angle, and dihedral for the last 1, 2, 3, and 4 atoms you picked.
      Try it out to get dihedral angles phi, psi, and omega along the backbone (if you start with a carbonyl carbon and go forward along the main polypeptide chain, the first dihedral you get will be phi). In a helical region you should get phi, psi, and omega values near -60, -40, 180.
  3. Sidechains
    1. Turn on sidechains and sc atoms, choose the Pro face or Pro side view, and rotate between them. Note that the Pro ring is not planar. Two atoms are in both the mainchain and the sidechain of the Pro ring; identify which is Calpha and which is N.
      Decide which of the 5 ring atoms (Cbeta, Cgamma, etc.) is most out of the plane defined by the other 4:  _______________ ,
      [NB: the consensus is that Pro should pucker at the Cgamma atom.]

    2. Practice identifying amino-acid residues by looking just at the sidechain shapes (e.g., Asp vs. Leu, Met vs Lys, even Glu vs Gln). Note especially the branch points, and whether those branches are tetrahedral or planar.
      What are two pairs of residues that cannot be told apart this way?
        _____________   vs   _____________   and   _____________   vs   _____________ .
    3. Check yourself either by clicking on an atom to get its ID, or by turning on 'atoms' to mark the non-carbons.
  4. Levels of error: the good parts at high resolution
    1. The original myoglobin map shown in the introduction to "Anatomy and Taxonomy of Protein Structures" was at 6Å resolution, where only the molecular shape and the position of helices can be seen. This 1a6m structure is at 1Å resolution, where almost every individual atom can be seen. With both mainchain, sidechains, and their atoms on, as well as the "het groups", choose the heme view and turn on "heme" under the "ED map" group. Notice how thoroughly the map shape outlines all the atoms of the heme group. The gray contours are at 1.2 sigma (or about 0.6 electrons per cubic Å), while the purple contours are at 3 sigma (or about 1.6 electrons per cubic Å). Turn off the gray contours ("1.2 sigma" master button) temporarily to see that each atom has a separate density peak ("atomic resolution", where neighboring atoms appear distinct, is about 1.2Å or better). Which atom has the biggest peak?  _____________
    2. Choose the Phe 123 view, and turn on just the "Pro and helix" electron density, with both contour levels. Note the hole in the ring and the clear peaks for individual atoms. Turn sidewise to see the flatness of the ring and the tetrahedral "elbow bend" between backbone and sidechain. At what atom does the structure make the elbow bend?  _____________
    3. Choose the "Pro face" view. Note that the peaks for N atoms are a bit larger than for C, and O peaks a bit larger still; that's because C has 6 electrons, N has 7, and O has 8. This difference is clear in the good parts at atomic resolution, but cannot be distinguished at lower resolutions. Choose the Pro side view, or rotate around, to see the ring pucker. Which atom is out of the plane of the other 4?  _____________     is there any doubt about the location or direction of pucker?  ________
    4. Choose the Arg 139 view, and turn on just the "Arg" electron density. Note that the Arg sidechain is very non-planar but still clear and well-ordered -- it has a favorable rotamer conformation and strong interactions with its surroundings. The 3 N atoms of the planar "guanidinium" end-group have 5 H-bond-donating hydrogen atoms attached, and in this case (not unusually), all 5 H-bonds are satisfied by H-bond acceptor atoms. One of those is the Ogamma of a nearby Ser sidechain; of what residue number?  _________   Two others are isolated density peaks; turn on the "waters" button to see that they are water O atoms. The last two are in the density to the right of the Arg, with no model inside it; those are part of a neighboring myoglobin molecule in the crystal. Drag to rotate around, to see that the peaks are elongated for those atoms - they are somewhat disordered. In solution, H-bonds to them would be replaced by H-bonds to solvent; however, the Arg would still be well-ordered, because the Ser O and the 2 waters would stay fixed, since the waters also H-bond to another sidechain O - can you identify the sidechain of that other O?  _______________________
      Next week you will learn how to reliably identify H-bonds.
  5. Levels of error: the bad parts with disorder
    1. Even at very high resolution, it is usual to have partly-disordered regions at the chain ends and for some of the surface sidechains; at lower resolution loop ends are often disordered, and occasionally entire domains. Choose the "high-B ends" view and turn on just the "ends & alts" density under the ED map group. Lys 140 is at the upper left in this view; its density is well-ordered out to the  ________ atom, very weak at   ________ , and absent at  ________ . The chain N-terminus is at the upper right. Note that the density is weak or absent for the Val 1 Cgamma atoms, and the shape of the density that includes N, Ca, Cb is strange (drag to see it in 3D). In both cases, the problem is almost certainly due to the sidechain or chain-terminus adopting multiple conformations, but it was not possible to see those alternates and fit them.
    2. However, sometimes 2 or more conformations can be identified confidently. Turn on the "altb" master button, to see two versions for the sidechains of Lys   _______  and Glu   _______ . The Glu is moving locally within one rotamer, while the Lys occupies two quite distinct rotamers. At lower resolution the density would appear smeared-out or would disappear altogether, although a single conformation is often fit. The key to identifying these uncertain regions is either to look at the electron density, or to pay attention to the B-factors of the atoms, which are larger the more the density is smeared out. In these kinemages, both the occupancy (when there are alternates) and the B-factor are given in the pointID. Click on each atom in the sidechain of Lys 140; the B-factor for Ca is   _______ , for Cb is  _______ , for Cg is   _______ , for Cd is  _______ , for Ce is   _______ , and for Nz (you'll need to turn of "sc atoms") the B is   _______ . For Lys 133, the occupancy of the "a" alternate is   _______  and for the "b" alternate is   _______ .
      Often the conformation fit by the crystallographer is misleading only because it is not actually unique. However, multiple, smeared, or missing density can often lead to an erroneous fit that does not correspond to any possible local conformation. Later in the course you will learn more about how to identify and even correct such local errors.
      Quit out of KiNG when you're done (otherwise you accumulate too much verbiage in the text window!).
  6. Cis peptides
    1. Launch KiNG and open the file download arrow c6FldFlx.kin (796KB), and go to Kinemage 3 (use either the kinemage list at top of the button panel, or the *{ }* hypertext link in the text-window table of contents). Click the "Animate" button or hit the "a" keyboard key to switch back and forth between a trans peptide in myoglobin and a cis peptide in a related protein (erythrocruorin). Scroll down in the text window to read about this kinemage, and drag the image to view the cis/trans difference from many angles. Measure the distance from Calpha to Calpha across the trans peptide:   _______ Å, and then across the cis peptide:   _______ Å.
    2. Go to kinemage 4, which shows a proline isomerase enzyme, and read the text about it. Look especially at the closeup of the cis Ala-Pro substrate (view 5), and turn on "contacts" to see how the enzyme binds it tightly in order to catalyze the isomerization of the peptide bond between cis and trans forms. Identify the Calphas of the Ala and the Pro, and verify that the bond between them is indeed cis.