Computational Techniques in Support of Drug Discovery October 2, 2002 Jeffrey Wolbach, Ph. D

Preview:

Citation preview

Computational Techniques in Support of Drug Discovery

October 2, 2002

Jeffrey Wolbach, Ph. D.

Who Is Tripos?

Core Science & Technology Software

Consulting Services

Chemistry Products& Services

Discovery Research& Process Implementation

Discovery Software& Methods Research

Sequential Drug Discovery

Many cycles of synthesis/testing to identify and optimize lead Role of molecular modeling

o unrealistic to jump from validated target to optimized leado useful to reduce the number of synthesis/testing cycleso enables “first to file”o enlarge number of targets

ChooseDisease

LeadIdentification

LeadValidation

LeadOptimization

ADMECandidateto Clinic

TargetIdentification

TargetValidation

Choose a Disease

Lead Identification

Lead Validation

Lead Optimization

ADME

Candidate to Clinic

Target Identification

Target Validation

Drug Discovery in Parallel

• Knowledge-sharing environment: genomics, HTS, chemistry, ADME,

toxicology

• Collect more data, on more compounds, more quickly

• Apply predictive models of “developability” early

• Enhanced understanding & predictive model building

• Increase share of patented time on market

Ligand-Based Design

Ligand Structures w/ActivitiesLigand Structures w/ActivitiesNo Target StructureNo Target Structure

PharmacophorePharmacophoreAnalysisAnalysis QSARQSAR

Database SearchingDatabase Searching

New Candidate Structures for Synthesis/Testing

Discern Similarities and Differences in Active Structures

• Assume active molecules share a binding modeo Search for common chemical features of active molecules

• Don’t know binding mode, so active molecules are considered flexible

o Search set of pre-determined conformerso Allow molecules to flex during search

Pharmacophore Analysis

• Typical features:o H-Bond Donorso H-bond acceptorso Hydrophobic groups

Pharmacophore Models• Chemical features in 3-D space

• Distance constraints between chemical features

QSAR• Relates bioactivity differences to molecular structure

differences

• Structure represented by numerical descriptorso Traditional (2D) QSARo 3D QSAR - CoMFA

• Statistical techniques relate

descriptors to activity

Activity

Descriptor

+

++

+

++

+

+

+

+

++

Activity = D0 + 0.5 D1 + 0.17 D2 + ...

QSAR - Traditional (2D)• Descriptors are molecular properties

o logP, dipole moment, connectivity indices ...

Structures + Activity Descriptors

logP = 1.9 = 2.8Estate = 7.2

logP = 2.1 = 3.5Estate = 5.5

logP = 1.7 = 2.3Estate = 6.7

Predictive Model (QSAR Equation)pKi=5.3

pKi=3.7

pKi=2.9

pKi=A + B(logP) + C() + D(Estate) + ...

PLSMLR

.

.

QSAR - 3D QSAR - CoMFA

• Comparative Molecular Field Analysis

• Descriptors are field strengths around molecules - electrostatic, steric, H-bond ..

• Fields can have easy physical interpretation

pKi=A + B(D1) + C(D2) + ...

QSAR/CoMFA - Interpretation• High Coefficient (important) lattice points can be

plotted around molecular structures

N

S

OHO

O

N

O N

OO

O

010110010010101

2D Database Searching

• Searches often performed on bit-stringso “Fingerprints” (many types)o Fingerprints display neighborhood behavior

• Also includes substructure searching

• Can search for similarity or dissimilarity

• Query is a collection of features in 3-D spaceo Pharmacophoreo Lead compound / specific atomic groups

3D Database Searching

• Search a database of flexible, 3-D molecules

o Molecules can’t be stored in every possible conformation

o Allow molecules to flex to fit the query

Example of Structure-Based Design

• Not restricted to ligand-based design

• Information about target can be included in the query

o Can define steric hindrances

o Additional interaction sites

o Serves to filter hits from the search

3D Database Searching

Identification of Novel Matrix Metalloproteinase (MMP) Inhibitors

A fibroblast collagenase-1 complexed with a diphenyl-ether sulphone-based hydroxamic acid

MMPs•Zinc-dependent proteases•Involved in the degradation and remodeling of the extracellular matrix

They are important therapeutic targets with indications in:•Cancer•Arthritis•Autoimmunity•Cardiovascular disease

Objectives

Design high affinity MMP inhibitors based on the diketopiperazine scaffold by:

•Creating a virtual combinatorial library of candidate inhibitors

•Using virtual screening tools to identify candidates with the highest predicted affinity

•Perform R-group and binding mode analysis to guide library design

Synthesis ofDKP-MMP inhibitors

DKP-I DKP-II

1.) Esterification of the solid support (HO-) with an amino acid2.) Reductive alkylation of the amino acid and acylation of the resulting secondary amine 3.) Deprotection of the N-alkylated dimer followed by cyclic cleavage from the resin yielding diketopiperazine (DKP)

Finding & Filtering Reagents

UNITY 2D structure search of the ACD

Filtered out:•Metals•MW > 400•RB > 15

Filtered out:•Metals•MW > 250•RB > 8

73 Boc protected amino acids1154 aldehydes

Selecting Reagents & Building the Virtual Library

Selector™ Diverse selection of amino acids (R1) and aldehydes (R2) using:

•2D Finger Prints•Atom Pairs•Hierarchical Clustering

Legion™ Model the reaction and create virtual combinatorial library

55 amino acids (R1)x 95 aldehydes (R2)x 14 amino acids (R3)= 73,150 compounds (~75k, 8.5 MB)

Randomly selected 14 amino acids for R3

The CombiFlexX Protocol•Select a diverse subset of compounds using OptiSim

•Dock and score the compounds in the diverse subset using FlexX

•Select unique core placements using OptiSim

•Hold each core placement fixed in the binding site as each R-group is independently attached, docked, and scored.

•Sum the scores of the "R-cores" and subtract the score of the common core

Computation times scale as the sum of the number of R-groups rather than as the product of the number of R-groups

Virtual Screeningof DKP-MMP Inhibitors

• ~75k compound library

• MMP target structure collagenase-1 (966c.pdb)

• 150 diverse compounds selected and docked

• 39 non-redundant core placements based on RMSD > 1.5 Å.

Virtual Screening Results

•Docked 91% of the library

•36 compounds/minute

•331 compounds predicted to be more active than those published

Consensus Scoring Results

•Extracted the top 1000 library compounds based on Flex-X score

•Ranked the top library compounds and published “highly actives” using CScore

•10 compounds predicted by all scoring functions to be more active than “highly actives”

Frequency of R-group use among 331 active virtual compounds

R1 (55 reagents)

R2 (95 reagents)

R3 (14 reagents)

R-Group Analysis in HiVol

R1

R-Group Analysis in HiVol, con’t.Frequency of R-group use among 331 active virtual compounds

SummaryUsed CombiFlexX and HiVol to:

• Identify highly promising candidates• Perform R-group analysis & Binding mode analysis to guide further computational design of libraries

Further Work

• Diversity/similarity analysis of the published and virtual libraries• Use docking results for library design in Diverse Solutions• SAR development

Binding Mode Analysis

Frequency of core placement use

Frequency of R-group use among 331 active virtual compounds

R1 (55 reagents)

R2 (95 reagents)

R3 (14 reagents)

R-Group Analysis in HiVol

Recommended