7
Todd J.Taylor, Iosif I.Vaisman [email protected], [email protected] Abstract: A method of protein structural domain assignment using an Ising/Potts-like model on a lattice derived from the Delaunay tessellation of a protein structure is described. The method is very simple and agrees well with previously published methods. Protein Structural Domain Assignment with a Delaunay Tessellation Derived Lattice

Todd J.Taylor, Iosif I.Vaisman todd.taylor@nist, ivaisman@gmu

  • Upload
    hewitt

  • View
    44

  • Download
    2

Embed Size (px)

DESCRIPTION

Protein Structural Domain Assignment with a Delaunay Tessellation Derived Lattice. - PowerPoint PPT Presentation

Citation preview

Page 1: Todd J.Taylor, Iosif I.Vaisman todd.taylor@nist, ivaisman@gmu

Todd J.Taylor, Iosif [email protected], [email protected]

Abstract: A method of protein structural domain assignment using an Ising/Potts-like model on a lattice derived from the Delaunay tessellation of a protein structure is described. The method is very simple and agrees well with previously published methods.

Protein Structural Domain Assignment with a Delaunay Tessellation Derived Lattice

Page 2: Todd J.Taylor, Iosif I.Vaisman todd.taylor@nist, ivaisman@gmu

Protein structures have been analyzed with a technique from computational geometry known as Delaunay tessellation (DT). Each amino acid is abstracted to a point and the points are then joined by edges to form a set of non-overlapping, irregular, space-filling tetrahedra each having the property that the sphere on the surface of which all four vertices reside does not contain a vertex from any other tetrahedron.

The union of the surface faces of the tessellated protein forms the convex hull of the Cα point set. Surface irregularities are ‘paved over’ by long edges (20Å+) which form contacts between residue pairs that are too far apart to be ‘true’ neighbors. It is sometimes expedient therefore to impose an edge length cutoff in the DT analysis.

Cα Delaunay tessellation of phosphoglycerate kinase (16pk) with no edge cutoff and with a 10Å cutoff

Page 3: Todd J.Taylor, Iosif I.Vaisman todd.taylor@nist, ivaisman@gmu

Structural domains: Wetlaufer (1973), Definition - continuous segment(s) of the main chain that form a compact, stable structure with a hydrophobic core and potentially could fold and function independently from the rest of the structure

Delaunay-Potts: Sequence of domain labels is S={s1,s2, …, sN} , initialized to residue numbers.

sit+1 = si

t + U[∑ J(sit ,sjt ) ] , i =1, …, N , where j varies over the Delaunay neighbors

of i and U(x) = x/|x|

Pick residue at random and immediately update (asynchronous updating). Iterate until shape of domain label profile meets ending 'stairstep' criteria.

1 if sj > si and dij ≤ r

J(sit ,sj

t ) = -1 if sj < si and dij ≤ r cutoff distance r, typically 8.5-12Å

0 if dij > r

Smooth in a window around residue i, replacing the label at i with the median in the window. Post-processing fine tunes assignment: no domains smaller than 40 residues, no domain boundary cuts a beta sheet.

Protein domain assignment and DePot

Page 4: Todd J.Taylor, Iosif I.Vaisman todd.taylor@nist, ivaisman@gmu

domain 1 domain 2

Schematic of Delaunay-Potts (DePot) procedure

Page 5: Todd J.Taylor, Iosif I.Vaisman todd.taylor@nist, ivaisman@gmu

Example assignments and evolution of domain labels2lao domain1 domain2Expert 1-90,191-238 91-190DALI 1-89,193-240 90-192CATH 1-90,192-238 91-191PDP 1-90,192-238 91-191DomainParser2 1-89,193-240 90-1923DEE 1-89,193-238 90-192DDBASE 5-91,188-237 92-185Islam 1-88,196-238 89-195SCOP 1-238DOMS 1-90,192-238 91-191DePot 1-91,186-238 92-185

1avhA domain1 domain2 domain3 domain4Expert 3-87 88-167 168-246 247-320DALI 3-86,247-320 87-145 146-246CATH 14-86 87-160 161-246 247-318PDP 3-140,247-320 141-246DomainParser2 3-89,247-320 90-145 146-2463DEE 14-86 87-160 161-246 247-320DDBASE 3-87 88-157 158-246 247-320Islam 3-87 88-245 246-320SCOP 3-320DOMS 3-73 74-159 160-223 224-320DePot 3-87 88-160 161-247 248-320

Page 6: Todd J.Taylor, Iosif I.Vaisman todd.taylor@nist, ivaisman@gmu

0.590.520.580.590.610.810.560.630.760.56same #

0.980.960.970.950.940.970.970.940.960.97overlapVIRand

0.620.75DOMS0.550.78SCOP

0.530.80DePot

0.530.81Islam0.610.79DDBASE0.320.913DEE0.530.80Domain Parser0.540.81PDP0.380.88CATH0.530.80DALI

Depot along with several other methods was tested on a set of 100 structures from three previously published domain assignment papers. The overlap score (used before in the literature) was used to measure similarity wrt expert assignments as well as two other scoring schemes, not applied to domain assignment before from the clustering literature.

Performance on combined Jones, Taylor, and Veretnik test set wrt expert assignment

Page 7: Todd J.Taylor, Iosif I.Vaisman todd.taylor@nist, ivaisman@gmu

[1] Singh RK, Tropsha A, Vaisman II (1996) Delaunay tessellation of proteins: four body nearest-neighbor propensities of amino acid residues. J Comput Biol 3(2):213-21.

[2] Taylor TJ, Vaisman II (2006) Protein structural domain assignment with a Delaunay tessellation derived lattice, Proceedings of the 3rd International Symposium on Voronoi Diagrams in Science and Engineering.

[3] Taylor WR (1999) Protein structural domain identification. Protein Eng 12: 203-16.

[4] Veretnik S, Bourne PE, Alexandrov NN, Shindyalov IN (2004) Toward consistent assignment of protein domains in proteins. J Mol Biol 339: 647-678.

[5] Holland TA, Veretnik S, Shindyalov IN, Bourne PE. (2006) Partitioning protein structures into domains: why is it so difficult? J Mol Biol. 361(3):562-590.[6] Jones S, Stewart M, Michie A, Swindells MB, Orengo C, Thornton JM (1998) Domain assignment for protein structures using a consensus approach: characterization and analysis. Protein Sci 7: 233-242.

[7] Okabe A (2000) Spatial tessellations : concepts and applications of Voronoi diagrams. Wiley

Assignment serverhttp://proteins.binf.gmu.edu/iv-software.html

AcknowledgementsW.R. Taylor for the DOMS method and code.

Stella Veretnik for discussions regarding her work with domain assignment.

NSF for funding.

Selected references