Anatomy of protein pockets and cavities: Measurement of binding site geometry and implications for ligand design

Identification and size characterization of surface pockets and occluded cavities are initial steps in protein structure‐based ligand design. A new program, CAST, for automatically locating and measuring protein pockets and cavities, is based on precise computational geometry methods, including alpha shape and discrete flow theory. CAST identifies and measures pockets and pocket mouth openings, as well as cavities. The program specifies the atoms lining pockets, pocket openings, and buried cavities; the volume and area of pockets and cavities; and the area and circumference of mouth openings. CAST analysis of over 100 proteins has been carried out; proteins examined include a set of 51 monomeric enzyme‐ligand structures, several elastase‐inhibitor complexes, the FK506 binding protein, 30 HIV‐1 protease‐inhibitor complexes, and a number of small and large protein inhibitors. Medium‐sized globular proteins typically have 10‐20 pockets/cavities. Most often, binding sites are pockets with 1‐2 mouth openings; much less frequently they are cavities. Ligand binding pockets vary widely in size, most within the range 102‐103 Å3. Statistical analysis reveals that the number of pockets and cavities is correlated with protein size, but there is no correlation between the size of the protein and the size of binding sites. Most frequently, the largest pocket/cavity is the active site, but there are a number of instructive exceptions. Ligand volume and binding site volume are somewhat correlated when binding site volume is < 700 Å3, but the ligand seldom occupies the entire site. Auxiliary pockets near the active site have been suggested as additional binding surface for designed ligands (Mattos C et al., 1994, Nat Struct Bid 1:55‐58). Analysis of elastase‐inhibitor complexes suggests that CAST can identify ancillary pockets suitable for recruitment in ligand design strategies. Analysis of the FK506 binding protein, and of compounds developed in SAR by NMR (Shuker SB et al., 1996, Science 274:1531‐1534), indicates that CAST pocket computation may provide a priori identification of target proteins for linked‐fragment design. CAST analysis of 30 HIV‐1 protease‐inhibitor complexes shows that the flexible active site pocket can vary over a range of 853‐1,566 Å3, and that there are two pockets near or adjoining the active site that may be recruited for ligand design.

[1]  B. Lee,et al.  The interpretation of protein structures: estimation of static accessibility. , 1971, Journal of molecular biology.

[2]  A. Shrake,et al.  Environment and exposure to solvent of protein atoms. Lysozyme and insulin. , 1973, Journal of molecular biology.

[3]  J L Finney,et al.  Volume occupation, environment and accessibility in proteins. The problem of the protein surface. , 1975, Journal of molecular biology.

[4]  F M Richards,et al.  Areas, volumes, packing and protein structure. , 1977, Annual review of biophysics and bioengineering.

[5]  J L Finney,et al.  Volume occupation, environment, and accessibility in proteins. Environment and molecular area of RNase-S. , 1978, Journal of molecular biology.

[6]  J M Blaney,et al.  A geometric approach to macromolecule-ligand interactions. , 1982, Journal of molecular biology.

[7]  J L Finney,et al.  Calculation of protein volumes: an alternative to the Voronoi procedure. , 1982, Journal of molecular biology.

[8]  Carl W David,et al.  Voronoi polyhedra as a tool for studying solvation structure , 1982 .

[9]  M. L. Connolly Analytical molecular surface calculation , 1983 .

[10]  T. Richmond,et al.  Solvent accessible surface area and excluded volume in proteins. Analytical equations for overlapping spheres and implications for the hydrophobic effect. , 1984, Journal of molecular biology.

[11]  F. M. Richards,et al.  Calculation of molecular volumes and areas for structures of known geometry. , 1985, Methods in enzymology.

[12]  B Honig,et al.  Internal cavities and buried waters in globular proteins. , 1986, Biochemistry.

[13]  K. D. Gibson,et al.  Exact calculation of the volume and surface area of fused hard-sphere molecules with unequal atomic radii , 1987 .

[14]  I. Kuntz,et al.  Using shape complementarity as an initial screen in designing ligands for a receptor binding site of known three-dimensional structure. , 1988, Journal of medicinal chemistry.

[15]  G. Vriend,et al.  A very fast program for visualizing protein surfaces, channels and cavities. , 1989, Journal of molecular graphics.

[16]  M. Jaskólski,et al.  Conserved folding in retroviral proteases: crystal structure of a synthetic HIV-1 protease. , 1989, Science.

[17]  M. Navia,et al.  Three-dimensional structure of aspartyl protease from human immunodeficiency virus HIV-1 , 1989, Nature.

[18]  Chris M. W. Ho,et al.  Cavity search: An algorithm for the isolation and display of cavity-like binding regions , 1990, J. Comput. Aided Mol. Des..

[19]  Shoshana J. Wodak,et al.  Detection of cavities in a set of interpenetrating spheres , 1991 .

[20]  D. Levitt,et al.  POCKET: a computer graphics method for identifying and displaying protein cavities and their surrounding amino acids. , 1992, Journal of molecular graphics.

[21]  Herbert Edelsbrunner,et al.  Three-dimensional alpha shapes , 1992, VVS.

[22]  J. Delaney Finding and filling protein cavities using cellular logic operations. , 1992, Journal of Molecular Graphics.

[23]  Incremental topological flipping works for regular triangulations , 1992, SCG '92.

[24]  Piero Procacci,et al.  A General Algorithm for Computing Voronoi Volumes - Application to the Hydrated Crystal of Myoglobin , 1992 .

[25]  B. Wallace,et al.  The pore dimensions of gramicidin A. , 1993, Biophysical journal.

[26]  A. Wlodawer,et al.  Structure-based inhibitors of HIV-1 protease. , 1993, Annual review of biochemistry.

[27]  R F Standaert,et al.  Atomic structures of the human immunophilin FKBP-12 complexes with FK506 and rapamycin. , 1993, Journal of molecular biology.

[28]  Dagmar Ringe,et al.  Analogous inhibitors of elastase do not always bind analogously , 1994, Nature Structural Biology.

[29]  G J Kleywegt,et al.  Detection, delineation, measurement and display of cavities in macromolecular structures. , 1994, Acta crystallographica. Section D, Biological crystallography.

[30]  D Fischer,et al.  Molecular surface representations by sparse critical points , 1994, Proteins.

[31]  J. Thornton,et al.  Buried waters and internal cavities in monomeric proteins , 1994, Protein science : a publication of the Protein Society.

[32]  P Argos,et al.  Intramolecular cavities in globular proteins. , 1994, Protein engineering.

[33]  A Wlodawer,et al.  Energy calculations and analysis of HIV-1 protease-inhibitor crystal structures. , 1994, Protein engineering.

[34]  Frederick P. Brooks,et al.  Computing smooth molecular surfaces , 1994, IEEE Computer Graphics and Applications.

[35]  Michael A. Facello,et al.  Implementation of a randomized algorithm for Delaunay and regular triangulations in three dimensions , 1995, Comput. Aided Geom. Des..

[36]  Herbert Edelsbrunner,et al.  Measuring proteins and voids in proteins , 1995, Proceedings of the Twenty-Eighth Annual Hawaii International Conference on System Sciences.

[37]  H. Edelsbrunner The union of balls and its dual shape , 1995 .

[38]  R A Sayle,et al.  RASMOL: biomolecular graphics for all. , 1995, Trends in biochemical sciences.

[39]  G A Petsko,et al.  Structural analysis of the active site of porcine pancreatic elastase based on the X-ray crystal structures of complexes with trifluoroacetyl-dipeptide-anilide inhibitors. , 1995, Biochemistry.

[40]  R. Laskowski SURFNET: a program for visualizing molecular surfaces, cavities, and intermolecular interactions. , 1995, Journal of molecular graphics.

[41]  M. Levitt,et al.  The volume of atoms on the protein surface: calculated from simulation, using Voronoi polyhedra. , 1995, Journal of molecular biology.

[42]  M. Swindells,et al.  Protein clefts in molecular recognition and function. , 1996, Protein science : a publication of the Protein Society.

[43]  K Schulten,et al.  VMD: visual molecular dynamics. , 1996, Journal of molecular graphics.

[44]  C. Frömmel,et al.  The automatic search for ligand binding sites in proteins of known three-dimensional structure using only geometric criteria. , 1996, Journal of molecular biology.

[45]  D. Ringe,et al.  Locating and characterizing binding sites on proteins , 1996, Nature Biotechnology.

[46]  Herbert Edelsbrunner,et al.  Triangulating the Surface of a Molecule , 1996, Discret. Appl. Math..

[47]  P. Hajduk,et al.  Discovering High-Affinity Ligands for Proteins: SAR by NMR , 1996, Science.

[48]  Iosif I. Vaisman,et al.  Delaunay Tessellation of Proteins: Four Body Nearest-Neighbor Propensities of Amino Acid Residues , 1996, J. Comput. Biol..

[49]  S Subramaniam,et al.  Computation of molecular electrostatics with boundary element methods. , 1997, Biophysical journal.

[50]  Database of HIV proteinase structures. , 1997, Trends in biochemical sciences.

[51]  E A Merritt,et al.  Raster3D: photorealistic molecular graphics. , 1997, Methods in enzymology.

[52]  J Liang,et al.  Chemical complementation identifies a proton acceptor for redox-active tyrosine D in photosystem II. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[53]  Frederick P. Brooks,et al.  Linearly Scalable Computation of Smooth Molecular Surfaces , 1997 .

[54]  Alexander Wlodawer,et al.  Database of three-dimensional structures of HIV proteinases , 1997, Nature Structural Biology.

[55]  P. Munson,et al.  Statistical significance of hierarchical multi‐body potentials based on Delaunay tessellation and their application in sequence‐structure alignment , 1997, Protein science : a publication of the Protein Society.

[56]  M. McGee,et al.  Hydration structure of antithrombin conformers and water transfer during reactive loop insertion. , 1998, Biophysical journal.

[57]  M. McGee,et al.  Effective electrostatic charge of coagulation factor X in solution and on phospholipid membranes: implications for activation mechanisms and structure-function relationships of the Gla domain. , 1998, The Biochemical journal.

[58]  Herbert Edelsbrunner,et al.  On the Definition and the Construction of Pockets in Macromolecules , 1998, Discret. Appl. Math..

[59]  H Edelsbrunner,et al.  Analytical shape computation of macromolecules: II. Inaccessible cavities in proteins , 1998, Proteins.

[60]  S Subramaniam,et al.  Analytical shape computation of macromolecules: I. molecular area and volume through alpha shape , 1998, Proteins.