44
Fast and Accurate Enumeration of Combinatorial Libraries Keith A Harrington, Julian Hayward & Roger Upton

Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

Fast and Accurate Enumeration of Combinatorial Libraries

Keith A Harrington, Julian Hayward & Roger Upton

Page 2: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

Pre-Accelrys CombiChem History

• Synopsys– Accord Chemistry Software

• Components, Toolkits, Applications

– Enumeration capabilities

– Reaction Databases

• Selective, Thematic

– Solid-Phase Synthesis Database

• MSI– Enumeration within MedChem explorer as critical part of library

design

Page 3: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

CombiChem

• The chemists answer to the challenges posed by the HTS revolution

• The purpose being to increase productivity by several orders of magnitude

• Chemists now 100 times more productive

• Millions of new compounds are created per year

• Has software kept pace with requirements?

Page 4: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

Scope of Talk

• ‘Combichem’ encompasses:– Molecular Diversity

– Library Design* and Library Analysis

– Library Synthesis and Robotics

– Library Registration*

– Quality Control

– High Throughput Screening

• Focus today on ‘Enumeration’ of discrete structures and libraries

Page 5: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

Fast and Accurate?

• Accuracy Vital– Over enumeration e.g. symmetry

– Structures simply wrong e.g. stereochemistry

– Presentation issues e.g. bond overlaps

• Fast– Enumeration of small, medium and large libraries

• What is fast?

– Use the same enumeration engine in either case

• Advantages all around

Page 6: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

Software Requirements

• Difficult to imagine CombiChem without software– Would it be possible?

• Software needs to grow to meet the needs

• Chemical representation to the fore– Get the chemical structure correct first time, every time

• Integration standards critical– Oracle, Microsoft, third-party software

• Simple to use applications for the end-user chemist

• Component code base highly desirable for more complex, bespoke applications

Page 7: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

Enumeration: Principle

• Compound generation, based on user input and common features

NH

O

NH

O

NH

O

NH

O

X NH

Y

O

X =

Y =

Page 8: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

End-user Interaction

• How does the end-user interact with the system?– Applications must be intuitive

• Wizard approach rather than rules?

– Must utilize existing data

• Reading and writing of standard file formats

• To what extent does the software need to be ‘intelligent’– Does it need to ‘understand’ chemistry?

– or simply ‘do as it is told’?

Page 9: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

Reaction-based Enumeration

NH2

NH2

NH2

NH

O

NH

O

NH

O

NH

O

NH

O

NH

O

(96 amides)

(8 amines)

(12 acids)

OH

O

OH

O

OH

O

R1 OH

O

+ R2NH2

R1 NH

O

R2

Page 10: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

Reaction-based Enumeration

• Advantages– Obviates the need to pre-process (clip) the reactants

• more intuitive to the chemist

– Flexibility to enumerate the individual reactions as well as molecules if desired

– Complete ‘experiment’ may be subsequently stored generically within a database

• Designed into all Accord solutions

Page 11: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

More Complex Issues (1)

• “Intramolecular” examples

R1-NH2 =NH2

OH

NH2

OH NH2

O NH2

NH2

OH

O

R1NH2

R2 OH

O

R2 NH

O

R1+

NH2

Page 12: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

More Complex Issues (2)

• Multifunctional substrates

R1-NH2 =NH2

NH2

NH2

NH2

NH2

ONH2

NH2

R1NH2

R2 OH

O

R2 NH

O

R1+

NH2

NH2

NH2

Page 13: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

More Complex Issues (3)

• Competing functionality

R1-NH2 =NH2

OH

NH2

NH2 NH2

ONH2

NH2

NH

NH2

O

R1NH2

R2 OH

O

R2 NH

O

R1+

NH2

Page 14: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

‘Diverse’ Reactions

R1SH

R2 Cl

O

R2 S

O

R1+

R1NH2 + R2

SCl

O

O

R2S

NH

O

OR1

R1NH2 R2

N

O+ R2

NH

NH

O

R1

Page 15: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

Chemical Integrity

• Enumeration must be chemically reliable:– No ‘over-enumeration’

• no ‘impossible’ solutions

• No multiple, identical solutions for symmetrical molecules

– Stereochemistry:

• Absolute & Relative configurations dealt with correctly

• Diastereomers

– Double-bond Geometry

• Olefins, Allenes etc.

• The software must adequately deal with these issues

Page 16: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

‘Impossible’ ProductsR1

R2R1

R2

+

Br

Cl

F

Br

Cl

F

F

Cl

BrX

Page 17: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

Symmetry Perception (1)

R1 OH

O

R1 OH

OH

OH

OH

OH

O

OH

OH

O

OH

O

OH

OX

Page 18: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

Symmetry Perception (2)

R1 R2

O

R1 R2

OH

O OH

18-fold ‘symmetry’ 1 solution

Page 19: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

Stereochemistry

• Absolute & Relative configurations supported at the atom & bond level:

OH

NH

NH

O

OH

OH

ClS

Rel-βAbs

Rel-α Rel-α

Page 20: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

Control of Inversions

Ph

OTs

Ph

OTs

Ph

Cl

Ph

Cl

R1 Ph

OTs

R1 Ph

Cl

Page 21: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

Control of Diastereomers

R1 OH

O

R1 NH

O

R2+NH2

R2

OH

O

S

OH

Chiral

NH2

OH

OH

S

O

NH

OH

Chiral

OH

S

O

NH

OH

Chiral

Page 22: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

Presentation

• Look & Layout of Library– Suitable for visual inspection/comparison?

– Adherence to corporate registration standards?

• ‘Redraw’ programs– Often fail for bridged systems etc.

– Will fail to maintain a ‘standard’ orientation

• Could this be template-based?

Page 23: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

Cl

O

NH2

NH

O

NH

O

NH

O

Swivel & Rotate Features

R1 Cl

O

R1 N

O

R2N

R2+

Cl

O

Cl

O

N

N

Page 24: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

Chemical Intelligence

• e.g. Presentation– “But steroids are always this way up!”

O

H

H

H

H

H

O

O

H

H

H

H

H

O

Page 25: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

Scalability

• How do we differentiate between ‘project level’ and ‘large’ libraries

– Desktop software?

• Single PC

• Up to10,000 compounds?

– Server software?

• Parallel processing, Multithreading…

• Millions of compounds?

• The ‘chemistry engine’ should be based upon the same components

Page 26: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

Scalability and Flexibility

• Multi platform, component based– Windows, Sun/Solaris, SGI/Irix, NT, Linux

Accord SDK

Accord Chemistry Engine

Driv

ers

Cal

cula

tors

Page 27: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

Example Application

• Accord for Excel Combichem add-on– Desktop enumeration tool

– Project-level libraries

– Utilizes workbook concept to adopt a workflow approach

– Enumerates compounds or reactions directly within Excel

– CLogP, MW calculation environment

Page 28: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial
Page 29: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial
Page 30: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial
Page 31: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial
Page 32: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial
Page 33: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial
Page 34: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial
Page 35: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial
Page 36: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial
Page 37: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial
Page 38: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial
Page 39: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial
Page 40: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial
Page 41: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

Accelrys Accord Solutions

• Manual pre-clipping step can be avoided

• Simple wizard type interface in Excel

• No over-enumeration problems– no duplicate or ‘impossible’ structures

• No random inversions of stereochemistry or double-bond geometry

• Correctly enumerates all valid diastereomers

• Presentation greatly enhanced by ‘swivel & rotate’ functions

Page 42: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

Accelrys Accord Solutions

• Able to integrate with other ‘best of breed’ applications– Inherent in Accord’s component architecture

• Do not place unreasonable constraints on the chemist– User interface, accuracy & speed

• Are flexible enough to adapt to rapidly changing requirements

• Are scalable from client (small/medium libraries) to server (large libraries)

Page 43: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

Acknowledgements

• Osman Guner and ACS Committee

• The audience today

• Tony Cook

• Dan Thomas

• Jim Clarke

• For demonstrations and further discussions– Booths 829 & 837

[email protected]

– www.accelrys.com

Page 44: Fast and Accurate Enumeration of Combinatorial Librariesacscinf.org/docs/meetings/222nm/presentations/222nm49.pdf · 2013-02-21 · Fast and Accurate Enumeration of Combinatorial

Science, faster

[email protected]