Transcript
Page 1: Swarming Binding Funnels: Ranking Native over False ......Swarming Binding Funnels: Ranking Native over False Docking Poses a d R L Crowding Crowding TP docking funnel FP docking Funnel

Swarming Binding Funnels: Ranking

Native over False Docking Poses

a

d

R

L

Crowding

Crowding TP docking funnel

FP docking Funnel

Mutation space more robust?

Mutational space less robust?

Motivation I: Docking Mechanisms

Page 2: Swarming Binding Funnels: Ranking Native over False ......Swarming Binding Funnels: Ranking Native over False Docking Poses a d R L Crowding Crowding TP docking funnel FP docking Funnel

Motivation II: Model Refinement (MD)

T39: Cluster 44 --> Cluster 9

Erik Pfeiffenberger’s poster

Page 3: Swarming Binding Funnels: Ranking Native over False ......Swarming Binding Funnels: Ranking Native over False Docking Poses a d R L Crowding Crowding TP docking funnel FP docking Funnel

FP TP FP

Binding Funnels

Decreasing Energy 1. Find the correct funnel? 2. Confidence in finding TP funnel in the top 10? 3. Get to the bottom of the correct funnel? 4. Do the proteins dock at all?

SwarmDock solutions

SwarmDock solutions

MD (constrained)

Page 4: Swarming Binding Funnels: Ranking Native over False ......Swarming Binding Funnels: Ranking Native over False Docking Poses a d R L Crowding Crowding TP docking funnel FP docking Funnel

Score_set: Decoy Statistics (Lensink & Wodak Proteins(2014))

T29

2083 2 78 87 1916

T30

1343 0 0 2 1341

T32

599 0 3 12 584

T35

499 0 0 3 496

T36

309 0 0 0 309

T37

1500 11 46 42 1401

T38

899 0 0 0 899

T39

1400 0 3 1 1396

T40

2180 193 206 189 1592

T41

1200 2 120 249 829

T46

1699 0 0 24 1675

T47

1051 278 307 26 440

T50

1451 0 36 97 1318

T53

1400 0 17 113 1270

T54

1400 0 1 18 1381

Target Total High Medium Acceptable Incorrect

*

*

Page 5: Swarming Binding Funnels: Ranking Native over False ......Swarming Binding Funnels: Ranking Native over False Docking Poses a d R L Crowding Crowding TP docking funnel FP docking Funnel

T40 (easy)

T39 (hard)

T47 (easy)

T32 (hard)

Docking Poses Scored with DCOMPLEX (Liu et al Proteins (2004))

Page 6: Swarming Binding Funnels: Ranking Native over False ......Swarming Binding Funnels: Ranking Native over False Docking Poses a d R L Crowding Crowding TP docking funnel FP docking Funnel

Methodology Score_Set dataset (Lensink & Wodak; Proteins 2014)

Removing models with clashes

Feature calculation and normalization

Cluster (10 Angstrom)

Aggregation of feature values for each cluster

Construction of pairwise cluster comparision matrix for classification learning

Page 7: Swarming Binding Funnels: Ranking Native over False ......Swarming Binding Funnels: Ranking Native over False Docking Poses a d R L Crowding Crowding TP docking funnel FP docking Funnel

Number of Models per Cluster

Cluster

Nu

mb

er

of

Mo

de

ls

0

100

200

300 T40 250 PSO Particles

Page 8: Swarming Binding Funnels: Ranking Native over False ......Swarming Binding Funnels: Ranking Native over False Docking Poses a d R L Crowding Crowding TP docking funnel FP docking Funnel

10 Angstrom Clusters and PSO Ensembles

T39 Cluster 1: 95

T39 Cluster 1: PSO 250 Particles

T39 Cluster 44: 5

T39 Cluster 1: PSO 250 Particles

Page 9: Swarming Binding Funnels: Ranking Native over False ......Swarming Binding Funnels: Ranking Native over False Docking Poses a d R L Crowding Crowding TP docking funnel FP docking Funnel

SwarmDock

• Swarm / Flock / Colony

– Group of Individuals

• Individual

– Follows Simple Rules

• Collective Behaviour

– Solve Complex Problems

Page 10: Swarming Binding Funnels: Ranking Native over False ......Swarming Binding Funnels: Ranking Native over False Docking Poses a d R L Crowding Crowding TP docking funnel FP docking Funnel

Particle Swarm Optimisation

i

x

y

z

x

y

z

M1

M2

Mm

A Particle Position

Vi

x

y

z

x

y

z

M1

M2

Mm

A Particle Velocity

Initialise Swarm

Calculate scores

Update Velocities

Update Positions

Repeat Until

Convergence

Return Best Structure

Page 11: Swarming Binding Funnels: Ranking Native over False ......Swarming Binding Funnels: Ranking Native over False Docking Poses a d R L Crowding Crowding TP docking funnel FP docking Funnel

PSO Simulation for T32 Cluster 6

Page 12: Swarming Binding Funnels: Ranking Native over False ......Swarming Binding Funnels: Ranking Native over False Docking Poses a d R L Crowding Crowding TP docking funnel FP docking Funnel

Cluster A Cluster B

Comparison

H_BOND ELEC DCOMPLE

X NSC VDW SOLV ...

Compute Molecular Descriptors for each member of each cluster

110 molecular descriptors

Molecular descriptors of all models for a cluster are represented as distributions

Min, Q1, Mean, Median, Q3, Max

Cluster A

Cluster B

1100 Features to compare two clusters

Extremely Randomized Trees

Cluster A or B Better?

Machine Learning Approach

Page 13: Swarming Binding Funnels: Ranking Native over False ......Swarming Binding Funnels: Ranking Native over False Docking Poses a d R L Crowding Crowding TP docking funnel FP docking Funnel

T29

Min LRMSD (nm)

0 2 4 6 8 10

Nu

mb

er

of

tim

es

be

tte

r

Predicted Actual

Cluster Comparison Score

Page 14: Swarming Binding Funnels: Ranking Native over False ......Swarming Binding Funnels: Ranking Native over False Docking Poses a d R L Crowding Crowding TP docking funnel FP docking Funnel

Cluster Comparison Profiles for 11 Targets

Page 15: Swarming Binding Funnels: Ranking Native over False ......Swarming Binding Funnels: Ranking Native over False Docking Poses a d R L Crowding Crowding TP docking funnel FP docking Funnel

1100 Features

Re

lati

ve Im

po

rtan

ce

ZRANK(#96)

DCOMPLEX(#241)

Pierce & Weng: Proteins(2007)

Liu et al Proteins (2004))

Tobi Potential: Residue Level(#1) (Tobi & Bahar Proteins (2006)

Ranked of Features

C1_MIN_N_CP_TB C2_Q1N_CP_D1 C2_MIN_N_CP_TB C1_Q1_N_CP_TB C2_MIN_N_CP_D1 C2_AVG_N_CP_D1 C1_MIN_N_CP_D1 C1_AVG_N_ODA C2_Q1_N_CP_TB C1_MAX_N_CP_PIE

Top 10

Page 16: Swarming Binding Funnels: Ranking Native over False ......Swarming Binding Funnels: Ranking Native over False Docking Poses a d R L Crowding Crowding TP docking funnel FP docking Funnel

Pearson Product-Moment Correlation Coefficient (Many features calculated using the CCharPPI server; Moal et al Bioinormatics (2015))

Molecular Descriptor

Mo

lecu

lar

De

scri

pto

r

Positively correlated

Not correlated

Negatively correlated

dDFIRE

CG

_PP

C

P_P

IE

CP

_M

J2h

Page 17: Swarming Binding Funnels: Ranking Native over False ......Swarming Binding Funnels: Ranking Native over False Docking Poses a d R L Crowding Crowding TP docking funnel FP docking Funnel

Transforming the Feature Space (PCA)

Minimised False negative rate

Accuracy

Precision Recall

F1

Pe

rce

nta

ge

Dimensions

Page 18: Swarming Binding Funnels: Ranking Native over False ......Swarming Binding Funnels: Ranking Native over False Docking Poses a d R L Crowding Crowding TP docking funnel FP docking Funnel

Final Ranking of Clusters with at Least One Acceptable Docking Pose

T29 1 1 7

T30 8 11 19

T32 10 6 9

T35 9 11 12

T39 9 3 29

T40 8 2 13

T41 1 3 4

T46 5 13 2

T47 1 1 3

T53 1 2 3

T54 5 10 2

Target Pred (ML) ZRANK DCOMPLEX

1st Top 10 P

erc

en

tage

of

Targ

ets

Top 5

Page 19: Swarming Binding Funnels: Ranking Native over False ......Swarming Binding Funnels: Ranking Native over False Docking Poses a d R L Crowding Crowding TP docking funnel FP docking Funnel

CAPRI 2016 SwarmDock

Iain Moal

Erik Pfeiffenberger Raphael Chaleil