141
Cerius 2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax: 619/458-0136

Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Cerius2Hypothesis & Receptor Models

Release 4.0April 1999

Molecular Simulations Inc.9685 Scranton Road

San Diego, CA 92121-3752

619/458-9990 Fax: 619/458-0136

Page 2: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:
Page 3: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Copyright*

This document is copyright © 1999, Molecular Simulations Inc., a subsidiary of Pharma-copeia, Inc. All rights reserved. Except as permitted under the United States Copyright Act of 1976, no part of this publication may be reproduced or distributed in any form or by any means or stored in a database retrieval system without the prior written permission of Molec-ular Simulations Inc.The software described in this document is furnished under a license and may be used or copied only in accordance with the terms of such license.

Restricted Rights LegendUse, duplication, or disclosure by the Government is subject to restrictions as in subpara-graph (c)(1)(ii) of the Rights in Technical Data and Computer Software clause at DFAR 252.227–7013 or subparagraphs (c)(1) and (2) of the Commercial Computer Software—Restricted Rights clause at FAR 52.227-19, as applicable, and any successor rules and regula-tions.

Trademark AcknowledgmentsCatalyst, Cerius2, Discover, Insight II, and QUANTA are registered trademarks of Molecular Simulations Inc. Biograf, Biosym, Cerius, CHARMm, Open Force Field, NMRgraf, Polygraf, QMW, Quantum Mechanics Workbench, WebLab, and the Biosym, MSI, and Molecular Sim-ulations marks are trademarks of Molecular Simulations Inc. IRIS, IRIX, and Silicon Graphics are trademarks of Silicon Graphics, Inc. AIX, Risc System/6000, and IBM are registered trademarks of International Business Machines, Inc. UNIX is a registered trademark, licensed exclusively by X/Open Company, Ltd. PostScript is a trade-mark of Adobe Systems, Inc. The X-Window system is a trademark of the Massachusetts Institute of Technology. NSF is a trademark of Sun Microsystems, Inc. FLEXlm is a trademark of Highland Software, Inc.

Permission to Reprint, Acknowledgments, and ReferencesMolecular Simulations usually grants permission to republish or reprint material copy-righted by Molecular Simulations, provided that requests are first received in writing and that the required copyright credit line is used. For information published in documentation, the format is “Reprinted with permission from Document-name, Month Year, Molecular Simu-lations Inc., San Diego.” For example:

Reprinted with permission from Cerius2 Hypothesis & Receptor Models, April 1999, Molecular Simulations Inc., San Diego.

Requests should be submitted to MSI Scientific Support, either through electronic mail to [email protected] or in writing to:

*U.S. version of Copyright Page

Page 4: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

MSI Scientific Support and Customer Service9685 Scranton RoadSan Diego, CA 92121-3752

To print photographs or files of computational results (figures and/or data) obtained using Molecular Simulations software, acknowledge the source in the format:

Computational results obtained using software programs from Molecular Simu-lations Inc.—dynamics calculations were done with the Discover® program, using the CFF91 forcefield, ab initio calculations were done with the DMol pro-gram, and graphical displays were printed out from the Cerius2 molecular mod-eling system.

To reference a Molecular Simulations publication in another publication, no author should be specified and Molecular Simulations Inc. should be considered the publisher. For example:

Cerius2 Hypothesis and Receptor Models, April 1999. San Diego: Molecular Simula-tions Inc., 1999.

Page 5: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Cerius2 Hypothesis and Receptor Models/April 1999 v

Contents

How to Use This Book ixPreparing to work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xHow to find information . . . . . . . . . . . . . . . . . . . . . . . . . . . xiUsing other Cerius2 books. . . . . . . . . . . . . . . . . . . . . . . . . . xiTypographical conventions . . . . . . . . . . . . . . . . . . . . . . . . . xii

1. Introduction to Cerius2•Hypothesis and Receptor Models1

Understanding the modules . . . . . . . . . . . . . . . . . . . . . . . . .2Before you begin . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .3

Accessing C2 modules. . . . . . . . . . . . . . . . . . . . . . . . . . .3

PART 1: ALIGNMENT

2. Aligning Models 9

Aligning models using field alignment . . . . . . . . . . . . . . . . .9Aligning models using Moments alignment . . . . . . . . . . . .12

Alignment of non-rigid models . . . . . . . . . . . . . . . . . .13Multiple model alignment strategies . . . . . . . . . . . . . .14

RMS alignment . . . . . . . . . . . . . . . . . . . . . . . . . . . .17 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .19

3. Incorporating Align Data in QSAR 21

Preparing models for generating alignment similarity descriptors21

Page 6: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

vi Cerius2 Hypothesis and Receptor Models/April 1999

Adding alignment similarity descriptors to the QSAR Study Table22

Using the study table to analyze a consensus alignment . . 24Using alignment similarity descriptors for QSAR analysis 24Recalculating alignment similarity descriptors . . . . . . . . . 25

4. Theory: Alignment 27

Model Alignment Strategies and Methods . . . . . . . . . . . . . 27RMS Superposition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28Electrostatic Potential Similarity. . . . . . . . . . . . . . . . . . . . . 30Steric Shape Similarity . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31Field Alignment (Electrostatic & Steric Similarity) . . . . . . . 32Steric Clash (Bump) Checking . . . . . . . . . . . . . . . . . . . . . . 32Implementation: Gaussian Approximation of Overlap Functions

33

PART 2: Receptor

5. Introduction to Receptor 39

Accessing Receptor . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40Using Receptor . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40

6. Receptor QuickStart 43

QuickStart. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43

7. Tutorial: Receptor Surface Analysis (RSA) 47

Before you begin . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47Load Models and add them to the Study Table . . . . . . . . . 48Generate a Receptor Surface Model (RSM). . . . . . . . . . . . . 49Add Receptor Surface energies to the Study Table. . . . . . . 50Calculate a QSAR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52

8. Generating a Receptor Model 53

Generating a receptor surface model . . . . . . . . . . . . . . . . . 54Setting preferences . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56

Page 7: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Cerius2 Hypothesis and Receptor Models/April 1999 vii

Saving and restoring a receptor model . . . . . . . . . . . . . . . .58Listing a receptor model . . . . . . . . . . . . . . . . . . . . . . . . . . .58Mapping property values . . . . . . . . . . . . . . . . . . . . . . . . . .59

9. Evaluating Structures Against a Receptor Model 61

Evaluating structures . . . . . . . . . . . . . . . . . . . . . . . . . . . . .62Setting preferences . . . . . . . . . . . . . . . . . . . . . . . . . . . .64

Viewing the results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .66Incorporating receptor data in QSARs . . . . . . . . . . . . . . . .67

Building a QSAR using receptor data . . . . . . . . . . . . . .68Using a QSAR with receptor data to estimate activity of a structure

69

10. Theory: Receptor Models 71

Receptor surface models . . . . . . . . . . . . . . . . . . . . . . . . . . .71Mapping properties. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .72Molecule-Receptor model interactions . . . . . . . . . . . . . . . .73RSA (Receptor Surface Analysis) . . . . . . . . . . . . . . . . . . . .75

PART 3: Pharmacophore

11. Introduction to Generating a Pharmacophoric Hypothesis79

Using ConFirm and HipHop. . . . . . . . . . . . . . . . . . . . . . . .79

12. Generating Conformations 83

Running ConFirm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .84

13. Generating and Using Alignment Hypotheses 87

Aligning common molecular features . . . . . . . . . . . . . . . . .88Setting preferences using the Preferences control panel . . .89Incorporating activity data into a hypothesis . . . . . . . . . . .91Using aligned structures to generate receptor models . . . .91

Page 8: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

viii Cerius2 Hypothesis and Receptor Models/April 1999

PART 4: Database Query

14. Performing a Database Query 95

Available database searches . . . . . . . . . . . . . . . . . . . . . . . . 96Preparing to work with Database Query . . . . . . . . . . . . . . 97Methodology . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97Constructing a query . . . . . . . . . . . . . . . . . . . . . . . . . . . . 100

Constructing an atom query . . . . . . . . . . . . . . . . . . . 101Constructing a bond query . . . . . . . . . . . . . . . . . . . . 102Constructing a centroid query . . . . . . . . . . . . . . . . . . 104Constructing a distance query . . . . . . . . . . . . . . . . . . 106

Deleting all query features. . . . . . . . . . . . . . . . . . 107Submitting a query . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107

Submitting a Catalyst query. . . . . . . . . . . . . . . . . . . . 108Submitting a SHAPE query . . . . . . . . . . . . . . . . . . . . 109Submitting an ISIS query . . . . . . . . . . . . . . . . . . . . . . 111

Defining a Database Domain from Prior Query Hit List112

Retrieving and browsing database hits . . . . . . . . . . . . . . 114Manipulating a hit list . . . . . . . . . . . . . . . . . . . . . . . . 115

APPENDICES

A. References 121

Page 9: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Cerius2 Hypothesis and Receptor Models/April 1999 ix

How to Use This Book

Cerius2 Hypothesis and Receptor Models is a guide to the Cerius2TM modules C2•Alignment, C2•Receptor, C2•DBAccess, C2•Active Site Viewer, and the Cerius2 interfaces to Catalyst ConFirmTM, HipHopΤΜ, and HypoTM Topics in this book describe module tasks and their associated procedures, combining conceptual, procedural, and reference material in one book.

Because these modules are part of the highly modular Cerius2 prod-uct line, you perform some tasks using functions located in other Cerius2 modules. Detailed descriptions of these tasks are in other Cerius2 documentation and are cross-referenced throughout this book. (See Using other Cerius2 books.)

What is in this book This book is divided into four parts:

♦ Part One, Alignment, describes tools used to superimpose mole-cules to satisfy various alignment conditions. These tools permit alignment of multiple molecules using least square fitting of specified atoms or field fitting, by employing the overlap similar-ity of molecular forcefields.

This section will be of interest to users who want to superimpose molecules that they are studying to satisfy various alignment conditions.

♦ Part Two, Receptor, describes the 3D visual environment that Receptor provides for receptor hypothesis exploration. It includes information on how to generate receptor surface models from the overlay of active compounds and how to use pseudo-receptors to evaluate new, potentially active compounds.

This section will be of interest to users who want to use a 3D visual environment for receptor hypothesis exploration and for new compound evaluation.

♦ Part Three, Pharmacophore, describes the pharmacophoric hypotheses that are generated through the interfaces to Catalyst ConFirm, HipHop, and Hypo. It includes information on how to use the interfaces to generate conformers, align structures by

Page 10: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

x Cerius2 Hypothesis and Receptor Models/April 1999

common chemical features, generate hypotheses, and use aligned structures to generate receptor surfaces in Receptor.

This section will be of interest to users who want to generate pharmacophoric hypotheses using the common chemical fea-tures of a set of molecules.

♦ Part Four, Database Query, describes how to use Catalyst/Info and MDL ISIS to construct queries to search databases and to retrieve, examine, and save structures from the databases that fit your criteria.

This section will be of interest to users who want to construct queries to search databases and to retrieve, examine, and save the structures from the databases that certain criteria.

Preparing to work

To use the software as described in this book, you should be familiar with:

You should already be familiar with

♦ The Cerius2•Visualizer and graphical user interface

♦ Basic Cerius2 facilities for model manipulation

You need the following resources to use the Drug Design Work-bench (DDW) software described in this book:

Your workstation should have

♦ A licensed copy of Cerius2

♦ Licensed copies of some or all of the following modules: Align-ment, Receptor, DBAccess, Catalyst/Info, ConFirm, HipHop, and Hypo, along with the appropriate Cerius2 interfaces

♦ A home directory in which you can create subdirectories

Page 11: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

How to find information

Cerius2 Hypothesis and Receptor Models/April 1999 xi

How to find information

Using other Cerius2 books

You can find additional information about Cerius2 in several other books.

♦ Cerius2 QSAR+ — Describes the C2•QSAR+ module and its regression and analysis technologies integrated in a chemically aware molecular spreadsheet

♦ Cerius2 Diversity — Describes the C2•Diversity module, which analyzes chemical diversity to design and evaluate compound libraries and reagent sets for combinatorial chemistry

If you want to know about… Read…

The C2 modules available for activ-ity prediction and their related tasks

Chapter 1, Introduction to Cerius2•Hypothesis and Receptor Models

Performing alignment operations Chapter 2, Aligning ModelsChapter 3, Incorporating Align Data in QSARChapter 4, Theory: Alignment

Performing receptor operations Chapter 5, Introduction to ReceptorChapter 6, Receptor QuickStartChapter 8, Generating a Receptor ModelChapter 9, Evaluating Structures Against a

Receptor ModelChapter 10, Theory: Receptor Models

Performing pharmacophoric hypothesis generation

Chapter 11, Introduction to Generating a Pharmacophoric Hypothesis

Chapter 12, Generating ConformationsChapter 13, Generating and Using Alignment

HypothesesPerforming database searches Chapter 14, Performing a Database QueryBackground for the topics in this

bookChapter A, References

Page 12: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

xii Cerius2 Hypothesis and Receptor Models/April 1999

♦ Cerius2 Modeling Environment —Describes the integrated set of tools for session management and atomistic modeling that form the core of the Cerius2 modeling environment

♦ Cerius2 Builders — Discusses the specialized builder modules (that is, the Analog Builder, Crystal Builder, Surface Builder, Interface Builder, Polymer Builder, and Amorphous Builder modules) that can be added to supplement the basic model sketching capabilities provided by the Cerius2 model environ-ment

♦ Cerius2 Simulation Tools — Discusses the Open Force Field, Force Field Editor, Charges, Minimizer, Dynamics Simulation and Analysis

♦ Cerius2 Conformational Search and Analysis — Discusses Con-former Search, Conformer Analysis, and Field Calculation

♦ Cerius2 Command Scripts Guide — Shows how to capture and replay a script of Cerius2 commands, and how to enhance your command scripts with the features of the Tool Command Lan-guage (Tcl)

♦ Cerius2 Installation and Administration Guide — Provides step-by-step instructions for installing and administering Cerius2 in your operating environment.

Typographical conventions

Unless otherwise noted in the text, this book uses the typographical conventions described below:

♦ Words in italic represent variables. For example:

Pred_dependent_variable

In this example, the name of an appropriate dependent variable replaces the value dependent_variable. (For example, if the name of the dependent variable were “Activity”, QSAR+ would display a value of Pred_Activity.)

♦ Names of Cerius2 objects and elements that appear in the user interface are presented in bold type. For example:

Page 13: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Typographical conventions

Cerius2 Hypothesis and Receptor Models/April 1999 xiii

Access the QSAR card deck, then choose View Equations from the Equation Viewer menu card.

♦ Items that you type are presented in bold type. For example:

To name your file, enter testset.qsar in the text entry box.

♦ Messages that appear on your screen and excerpts of input/out-put files appear in courier font:

When written to the textport, the listed data forms columns:

Surface Coordinatespoint X Y Z mol atomchargeesp hbond hydrophob

Page 14: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

xiv Cerius2 Hypothesis and Receptor Models/April 1999

Page 15: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Cerius2 Hypothesis and Receptor Models/April 1999 1

1 Introduction to Cerius2•Hypothesis and Receptor Models

This book contains descriptions of a set of tools that are useful in some phases of drug design and in the search for novel active com-pounds. With the Cerius2•Align tools described in this book you may choose to perform spatial alignments of entire sets of molecules to look for common elements that may predict their usefulness as new drugs.

Alternatively, you can use the Cerius2 interface to Catalyst Confirm and HipHop to generate pharmacophoric hypotheses. You create these by first producing conformations for a set of molecules, then using these to find and align functional groups common to all the molecules in the set. The tools available allow you to select hypoth-eses that do the best job of predicting activity for the set of study molecules, then use these aligned structures to generate a model of an active site in Cerius2•Receptor.

The Cerius2•Receptor tool can use models generated by ConFirm and HipHop, or initial models focusing on the active site of an enzyme or antibody can be generated by Cerius2•Receptor itself. Whatever the source of these initial models, you can then use them to predict 3D structure of active sites, which can aid the medicinal chemist in designing novel compounds.

The Database tools described in this book allow you to store these models, and to retrieve and examine them at a later time for various structural and activity criteria that you define using the Cerius2•Database Query software. With DBAccess, you can also search ISIS/MDL, CatShape, and Catalyst databases, giving you added flexibility in how you store your hypothesis and receptor model data.

Page 16: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

2 Cerius2 Hypothesis and Receptor Models/April 1999

1. Introduction to Cerius2•Hypothesis and Receptor Models

Understanding the modules

All Cerius2 modules discussed here are based on the common con-trol and presentation platform of the Cerius2•Visualizer. The Cerius2 •Visualizer is an integrated collection of tools for session management and atomistic modeling that is the core of the Cerius2 modeling environment. These tools perform functions associated with model management, 3D graphical representation, session log-ging, environment customization including tables and graphs, and visualization of molecular models. Please see the Cerius2 Modeling Environment for more detailed information on Cerius2•Visualizer.

These are the Cerius2 software modules documented in this manual:

♦ Alignment (Cerius2•Align), which provides tools to superim-pose molecules to satisfy various alignment conditions. These tools permit alignment of molecules using least square fitting with atom equivalencies specified either by automatic atom matching algorithms or by manual atom matching. In addition to rigid body superpositioning, the module provides tools for flex-ibly aligning one molecule over another using a fit optimizer algorithm.

♦ Field Fitting (Cerius2•Field Fit), a module within Alignment, which employs the overlap similarity of molecular forcefields.

♦ Receptor (Cerius2•Receptor), which provides a 3D visual envi-ronment for receptor hypothesis exploration. The module creates receptor surface models using information generated from the overlay of active compounds. The receptor models can be used to evaluate new compounds and evaluate conformations and con-straints on compounds in the receptor site.

♦ Database Query (Cerius2•DBAccess), which provides access to the search facilities of Catalyst and ISIS to search molecular data-bases, retrieve, examine, and save structures that fit your search criteria.

♦ Catalyst is an MSI product that performs hypothesis generation, database searches, and activity information. The database search functions, Cat/Info and CatShape, are accessed through Cerius2 interfaces. Please see the Catalayst online help for more detailed information about Catalyst.

Page 17: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Before you begin

Cerius2 Hypothesis and Receptor Models/April 1999 3

♦ Interfaces for Catalyst ConFirm and Catalyst HipHop, which access Catalyst applications that provide tools to generate phar-macophoric hypotheses. The hypotheses are generated by first generating conformations for a set of study molecules and then using the conformations to find and align chemically important functional groups common to the molecules in the study set. The DRUG DISCOVERY card deck includes interface menu cards to both of these applications.

Before you begin

Before you use Cerius2 for activity prediction, you must have installed on your system a properly licensed copy of the Cerius2

software containing the modules you have purchased. If you have any questions about your system setup or your software license, talk to your system administrator.

You should become familiar with the Cerius2 environment. The Cerius2 Modeling Environment introduces you to the Cerius2 user interface and to the Visualizer tools. The book also provides detailed information about standard elements of the modeling environment. Cerius2 software runs on workstations using the UNIX operating system.

To start Cerius2 If you installed Cerius2 with the default startup command, then start the software using the following command in a UNIX shell window:

> cerius2

Note

Please see you system administrator if this default command does not start the software.

Accessing C2 modules

The tools described in this book are found in the following card decks:

You must be in a directory where you have write access.

Page 18: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

4 Cerius2 Hypothesis and Receptor Models/April 1999

1. Introduction to Cerius2•Hypothesis and Receptor Models

♦ DRUG DISCOVERY

♦ HYPOTHESIS MODELS

This section describes the card decks for the modules covered in this book. Each module is accessed by making the appropriate selection from a menu card. (Note that depending on which modules are installed, you may notice a difference between the menu cards and card decks described here and those you see displayed on your screen.)

If the full set of Life Sciences modules is installed, you can see the card decks described below on the Main panel by selecting them from the popup menu in the Deck Selector.

DRUG DISCOVERY deck

The DRUG DISCOVERY card deck includes the QSAR, MODEL RECEPTOR, ALIGN MOLECULES, and QUERY DATABASE cards. The menu cards for this deck are illustrated below:

This deck includes tools to generate and apply QSARs, to align mol-ecules, to generate and evaluate receptor models, and to construct molecular queries for searching databases and retrieving structures.

The QSAR menu selections are mainly described in the Cerius2•QSAR+ book. However, the Study Table is used to incorpo-

Page 19: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Before you begin

Cerius2 Hypothesis and Receptor Models/April 1999 5

rate alignment data into QSAR. This procedure is described in Incor-porating Align Data in QSAR.

The ALIGN MOLECULES selections are described in three chap-ters in this book: Aligning Models, Incorporating Align Data in QSAR, and Theory: Alignment.

The MODEL RECEPTOR card contains items that are discussed in: Introduction to Receptor, Receptor QuickStart, Generating a Receptor Model, Evaluating Structures Against a Receptor Model, and Theory: Receptor Models.

The QUERY DATABASE selections are described in: Performing a Database Query.

DATABASES deck The DATABASES card deck includes the CATALYST, CAT-SHAPE, and MDL INTERFACE menu cards, as illustrated:

This deck includes tools to construct molecular queries for database searches and structure retrieval using either Catalyst or MDL ISIS and the databases associated with their search facilities. These instruments are described in more detail in Performing a Database Query.

HYPOTHESIS MOD-ELS deck

The HYPOTHESIS MODELS card deck includes the CONFIRM INTERFACE, HIPHOP INTERFACE, and MODEL RECEPTOR menu cards. The menu cards for this deck are illustrated below:

Page 20: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

6 Cerius2 Hypothesis and Receptor Models/April 1999

1. Introduction to Cerius2•Hypothesis and Receptor Models

:

These cards provide interfaces to the Catalyst tools ConFirm and HipHop through the Cerius2 interface. The MODEL RECEPTORS card is included here to allow you to easily work with Receptor models within the ConFirm and HipHop software environments.

Page 21: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Part 1

Alignment

Aligning Models

Incorporating Align Data in QSAR

Theory: Alignment

Page 22: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:
Page 23: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Cerius2 Hypothesis and Receptor Models/April 1999 9

2 Aligning Models

You can perform an alignment as soon as you have two or more models loaded or built and imported into the Align Models table. With just two models you can perform an alignment immediately using either the Field Fitting or Moments alignment methods. You may also use RMS alignment by specifying atom matches, either directly or using the automated subgraph mapping tool in the Align Models panel. In addition, you may choose to align the models rigidly or to use a specified set of flexible torsions. With more than two models imported into the Align Models table you may choose between Consensus and Target Model alignment strategies to define how the models should be aligned to each other.

Sections in this chapter

The topics covered in this chapter are:

Aligning models using field alignment

Aligning models using Moments alignment

Performing a target model alignment to a single model

RMS alignment

Aligning models using field alignment

The Field Align method aligns models by calculating the steric and electrostatic forcefields about two models (using a potential probe) then orienting the models to achieve a maximum similarity over-lap of those forcefields. This means that the models are aligned so that similar regions in terms of shape and electrostatic charge dis-tribution are overlapped. Hence, one immediate advantage that Field Align has over RMS Align is that you are not required to specify how atoms from one model are matched to those in another.

Page 24: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

10 Cerius2 Hypothesis and Receptor Models/April 1999

2. Aligning Models

Importing models for alignment

To perform an alignment you must first import your models. Bring up the ALIGN MOLECULES card by going to the DRUG DIS-COVERY deck of cards and clicking on ALIGN MOLECULES. Click Align Models to bring up the Align Models panel.

At the top of this panel is the ADD button. This button is used to import models from the model manager into the Align Models table. Next to the ADD button is a popup which specifies which models to add, either the Current model, the Selected models or All models. Click the ADD button to append your currently-selected model to the table. Select a second model and click the ADD button again to add the second model to the table.

The table now contains a row for each model you have added. In addition to the model name, each row of the table now contains four columns. The #Tor column reports the number of movable torsions that are currently set for the model. The Use column spec-ifies whether this model is to be included in a subsequent align-ment. This property is also useful for temporarily excluding a model from the set of models to align, without actually deleting this model from the Align Models table. The columns M1 and M2 actually refer to the corresponding rows of the table and are used to report how many RMS atom matches are defined between pairs of models. In this case there are zero matches specified between your two models. (Please see the RMS alignment section below for more information on atom matches.)

Performing a Field align-ment

Now that you have imported your models, you can perform a Field alignment. Go to the ALIGN MOLECULES card and click Align to bring up the Align panel. This panel is used to specify and perform a new alignment on the models specified in the Align Models table.

On the control panel, set the Align Method value to Field and the Align Type value to Rigid. (This specifies a Field alignment treat-ing the align models as rigid.) Click the ALIGN button on the panel to perform the alignment. Your model display window should now show the two models superimposed on each other, although only the second model is actually reoriented.

Align UNDO and REDO Notice that after the alignment has finished the UNDO button is no longer grayed out. Click this button to revert your models back into their original orientations. (Note that you may not be able to see both of them, since one model be translated out of view.) The

Page 25: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Aligning models using field alignment

Cerius2 Hypothesis and Receptor Models/April 1999 11

UNDO button now becomes a REDO button. Click this button again to revert the models back to the aligned orientations.

The UNDO/REDO tool is very useful for examining the effect of alignment, for reverting back after a mistake, or for adjusting the alignment parameters. These buttons only become gray again if you edit an alignment model or remove it from the Align Models table. You may move or reorient the models, but the UNDO/REDO buttons will always return the orientations to those imme-diately present before/after you performed the last alignment.

Customizing Field align-ment

You can customize Field alignment by either clicking the Prefs... button at the bottom of the Align panel, or clicking Align Prefer-ences... on the ALIGN MOLECULES card. Both of these actions open the Align Preferences panel. The middle section of this panel deals with parameters used in the Field alignment process.

By default, the Pre-align Moments check box is checked, indicat-ing that the models should first be pre-aligned using the Moments align method before executing Field alignment. This is mainly so that the models are moved into a decent relative orientation and position before performing the field similarity calculations. You will probably want this option checked most of the time.

The Steric Field value specifies the ratio of steric to electrostatic components in the overall field calculation. At 100%, models are aligned purely on the similarity of there shapes/volumes, while at 0% models are aligned purely on the distribution of their electro-static charges.

The Potential Probe popup specifies the neutral probe atom that the individual molecular forcefields are calculated with respect to before their similarity is determined. The probe atom only affects the steric field component of the field fitting, and usually has little effect on the overall alignment of a set of models. (If you think of steric field fitting as aligning models based on their respective shapes, then choosing a smaller probe atom, for example, hydro-gen, will result in the surface of these calculated shapes being bumpier.)

Page 26: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

12 Cerius2 Hypothesis and Receptor Models/April 1999

2. Aligning Models

Note

Aligning models using Moments alignment

The Moments alignment method is the fastest, but least accurate, method for aligning models. It is typically employed for aligning a large number of models quickly (for example, to place models in the active site of a protein relative to the position and orientation of a ligand already docked in that site).

Performing a Moments alignment

To perform a Moments alignment, you must first import a couple of models into the Align Model table and bring up the Align panel as described in the previous section, Importing models for alignment. To perform a Moments alignment, set the Align Method option to Moments and the Align Type option to Rigid. Move and reorient your models relative to each other if you have just performed an alignment, then click the ALIGN button on the Align panel to per-form a Moments alignment.

Your models should now be approximately aligned in terms of centers of mass and relative orientation.

Customizing Moments Align

If you wish to customize the Moments alignment, do so by open-ing the Align Preferences panel (by clicking either clicking the Prefs... button at the bottom of the Align panel, or clicking Align Preferences... on the ALIGN MOLECULES card). This is the same panel used in the previous section to customize the Field align-ment. The top section of this panel contains options used in the Moments alignment process.

There are two Moments alignment methods you can choose from: Inertia and Electrostatic. Inertia refers to principle moments of inertia. Electrostatic refers to dipole and quadrupole electrostatic moments. These are calculated for each model you align and are used directly to translate and orient models in the alignment. Typ-ically, you would employ moments of inertia since these are well-defined for all models. However, if your models are highly polar

Field alignment employs parameters which are specific to the current forcefield loaded into Cerius2. Changing this forcefield may have a significant effect on the overall alignment of your models.

Page 27: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Aligning models using Moments alignment

Cerius2 Hypothesis and Receptor Models/April 1999 13

or have peculiar charge distributions you may find electrostatic alignment gives a better result.

Moments alignment also employs field calculations to choose the relative direction that the moments of two models are aligned. Hence, changing the Steric Field factor could affect the resulting aligned moments orientations.

Alignment of non-rigid models

So far only rigid alignment of models has been discussed, that is, the models have been aligned purely on the basis of translating and re-orienting one model relative to another. However, a better alignment is usually achieved when one model is allowed to be flexible so that it may adopt a conformation that is closer to that of the second model.

Align models may be flexible with respect to torsion angles, (that is, bond dihedral angles). Torsions are not defined automatically since you may wish to have some models remain flexible (with tor-sions specified) while others are rigid (with no torsions specified) during the same (flexible) alignment. If you try to perform a flexi-ble alignment on a set of models that have no torsions defined, Align will warn you that it is proceeding with a Rigid alignment. If you perform a Rigid alignment of models for which torsions are defined, however, these torsions are ignored. Align will always choose one model to treat as rigid when performing a flexible alignment. This will usually be the first model in the Align Models table or the first model it finds for which no torsions are currently defined. (For the target model Align strategy the first N models are always considered to be rigid—please see Multiple model alignment strategies for more information on this topic.)

Selecting torsion angles To quickly specify which torsion angles to vary during flexible alignment, click the Find Torsions button. This defines all the movable torsions using the defaults in the Search Torsions panel. The setting affects all models in the Align Model table which do not already have torsions defined. To manually specify individual torsions for a model, use the Search Torsion control panel.

To open the Search Torsions control panel, go to the ALIGN MOL-ECULES menu card and select Torsions. This control panel is also used by the Conformer Search module (see the chapter on con-

Page 28: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

14 Cerius2 Hypothesis and Receptor Models/April 1999

2. Aligning Models

former search in Cerius2• Conformational Search and Analysis). Click the FIND button to specify all the movable torsions on the current model. When you look back at the Align Models table you will notice that the value in the #Tor column has been updated to reflect the numbers of torsions currently defined on your align models.

Flexible alignment Once you have two models in the Align Models table and have defined torsions for at least one of them you are ready to perform a Flexible alignment. Choose the Align Method you wish to use (Field or Moments) in the Align panel, select Flexible as the Align Type and click the ALIGN button.

Customizing flexible align-ment

To customize the flexible alignment, open the Align Preferences panel (by either clicking the Prefs... button at the bottom of the Align panel, or clicking Align Preferences... on the ALIGN MOL-ECULES card). The bottom section of this panel contains options you can adjust for the Flexible alignment process.

There is only one option you can adjust for Flexible alignment: Bump Check. By default this option is unchecked, indicating that there is no bump checking during the part of the alignment that optimizes the movable torsions. If you select Bump Check, then another option, Bump Factor, appears. Bump checking ensures that the conformations resulting from torsional optimization are reasonable (that is, that the conformation is not invalid due to large van der Waals overlaps of atoms). The Bump Factor indicates how important these overlaps are—higher values mean you are less likely to produce aligned conformations with atom overlaps. Since at least one model is always treated as a rigid reference for the alignment, if your models start with valid conformations they will usually have valid conformations after the alignment. Hence, since bump checking can significantly reduce the performance of an alignment, the default setting is no bump checking.

Multiple model alignment strategies

When aligning more than two models you have additional options for how you wish to perform the alignment. In version 3.5 of Cerius2•Align, you would specify just one of your models as the target model to which all others would be aligned. However, the default alignment strategy for Align is now Consensus alignment.

Page 29: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Aligning models using Moments alignment

Cerius2 Hypothesis and Receptor Models/April 1999 15

The consensus strategy is to align all models to each other, such that the overall alignment value (the similarity or RMS) is opti-mized. This does not necessarily give the best alignment for any given pair of models in a set of consensus-aligned models but can be thought of as an alignment of all models to a virtual model which is the “average” conformation of the aligned models.

The other alignment strategy available to you is Target Model. In this strategy all models are aligned separately to one or more spec-ified target models.

Performing a Consensus alignment of four models.

So far you have been using consensus alignment by default. This was not mentioned previously because with only two models the results are the same for consensus and target model alignments.

To perform a Consensus alignment of four models, import two more models into the Align Models table using the method described in Importing models for alignment, so that you now have four align models. On the Align panel select Consensus as the Align Strategy, Field as the Align Method, and Rigid as the Align Type parameters. (You may use Moments and/or Flexible align if you wish.) Click the ALIGN button to perform the Consensus alignment. Your view should be updated to show all four models aligned and superimposed.

Performing a target model alignment to a sin-gle model

To perform a target model alignment, open up the Align Models panel if it is not still in view. Notice that there is a option called Tar-get Models which currently has the value of 1. This specifies that there is currently one model specified as a target model—in fact it is always the first model in the table. Change the Align Strategy option on the Align panel to Target Model and then click the ALIGN button. Your models should now be aligned to the first model in your table and results for the individual alignments should appear in your text window.

Now you should change the target model in the Align Models table. To do this select a table row, (for example, row 3), and click the Target Model icon. This moves the row to the top of the table and designates the model in this row as the new target model. Click the ALIGN button in the Align panel again to verify that your models now align to this new target model. (You may wish to move your models around before you align them to see the effect more clearly.)

Page 30: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

16 Cerius2 Hypothesis and Receptor Models/April 1999

2. Aligning Models

Performing a Consensus Alignment on a subset of Align models

To perform a Consensus alignment on a subset of Align models, go to the Align Models table and click on the cell in the Use column for the last model in the table. This should toggle the value from Yes to No. Next you must mark all models as used in the Align Models table to be presented in the Model View window. To do this, click the Update Overlay Display button. (You may also select the last row and click on the Remove Model button. How-ever, this totally removes the model from the Align Models table and you must then re-import it to follow the subsequent steps.)

On the Align panel set the Align Strategy back to Consensus and click the ALIGN button. This time just the three models are con-sensus aligned.

Performing a target model alignment to multi-ple target models

Using the target model Align strategy you can now take your fourth model and align it to the three consensus aligned models in the last step. First you must specify the first three models as target models. There are two ways to do this:

1. Set the Target Models value in the Align Models table to 3.

or

2. Select three table rows and click the Target Model button. (This also moves these models to the top of the table if they are not already there.)

Now include the last model in the align models set by clicking on the Use table cell for that model to set it back to Yes. On the Align panel set the Align Strategy option to Target Model and click the ALIGN button. Your fourth model should now be aligned relative to the other three, which will not move since they are specified as align target models. To see this more clearly you may wish to move the target models apart and click the ALIGN button again. Your fourth model should now be aligned in the center of your target models.

If you had more than four models in your Align Model table then each non-target model would be aligned with the target models set individually. Since target models do not move in the alignment, any torsions defined on those models are ignored. You are also unable to change the value in the Use column for target models from Yes while Target Model is specified as the Align Strategy.

Page 31: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Aligning models using Moments alignment

Cerius2 Hypothesis and Receptor Models/April 1999 17

RMS alignment

RMS alignment aligns models by producing the best superposi-tion of two or more models using the collective RMS of their atomic coordinates (please see Theory: Alignment for more infor-mation on this topic). the RMS method is generally faster and more accurate than forcefield-based alignment but requires that for each pair of models, the corresponding atoms be identified. This is referred to as RMS atom matching.

Preparing models for RMS alignment: atom match-ing

To match atoms between pairs of models, use the Align Models panel (open it by clicking Align Models on the ALIGN MOLE-CULES card). There are two ways to perform atom matching between models: automatically and manually. Usually you should use a combination of both methods.

Click the Common Subgraph button in the Align Models table. When the dialog box appears, read what it says and then click OK. After a moment you will see that the table columns under Number of Align Atom Matches, M1, and M2 fill up with integer values. These values represent the number of current atom matches spec-ified between pairs of models (for example, for the first model in row 1 the value in the M3 column is the number of atom matches to the third model in the table, in row 3). To see all the models in the Align Models table with green dashed lines between atom pairs, showing the current atom matches specified, click Update Overlay Display. (You may wish to drag one of the models around to see this more clearly.) You can click on the Display Atom Matches check box to hide or redisplay these align atom matching monitors.

This last operation creates atom matches between each pair of models in the Align Models table. This is often necessary when you are preparing to generate QSAR Alignment Similarity descriptors based on RMS alignments (refer to next chapter). However, you usually do not need to specify atom matches between each pair of models. For example, to specify that four atoms named "A" on four models are to be superimposed during RMS alignment you could simply select three atom match pairs: (Model1:A,Model2:A), (Model2:A,Model3:A), and (Model3:A,Model4:A). This implicitly defines the atom matches (Model1:A,Model3:A), (Model1:A,Model4:A), (Model1:A,Model3:A), (and so on), which are necessary for the alignment calculation. Alternatively, the same set of atom matches

Page 32: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

18 Cerius2 Hypothesis and Receptor Models/April 1999

2. Aligning Models

can be defined by specifying atom matches between the atoms of one model to each of the other align models, for example: (Model1:A,Model2:A), (Model1:A,Model3:A), and (Model1:A,Model4:A). This is the recommended procedure when setting up atom matches between multiple models from scratch and simply involves selecting one model (row) in the Align Mod-els table and clicking the Common Subgraph button. (When set-ting up for targeted alignment you should choose one of the target models as your selection.)

Note

Performing an RMS align-ment

To perform an RMS alignment, go to the Align panel and set the Align Strategy option to Consensus and the Align Method to RMS. Click the ALIGN button. The models should be realigned by RMS value, and the consensus RMS value should be reported in the text window. Like field and moments alignment methods, you may also use RMS alignment with the target model strategy and flexible or rigid alignment options.

Editing atom matches The Common Subgraph tool produces atom matches based on the common topologies of the atoms in each pair of align models. It then chooses between equivalent best topology matches by the RMS of superposition of the these atoms sets using the current align model conformations. This tool works in combination with selected rows in the table: With one row selected, atom matches are made between this align model and all others in the table. With two or more rows selected, atom matches are made between all

If you explicitly specify an atom match, (for example: Model2:A,Model3:B), using an atom which is already implicitly matched to another atom in the same model, (for example: Model2:A,Model3:A), then your overall alignment may be affected. Here you are effectively specifying that the atom Model2:A is to be superimposed for RMS alignment for both atoms Model3:A and Model3:B—in fact Model2:A will be superimposed on the geometric center of atoms Model3:A and Model3:B. Due to the implicit atom matches, this means all atoms matched to Model3:A are also matched to Model3:B. This situation can occur quite easily when using common subgraph matching to generate atom matches between multiple align models, as a result of the ambiguity in choosing the best common subgraph matches (when equivalent topological mappings of atoms exist due to local symmetry in models).

Page 33: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Aligning models using Moments alignment

Cerius2 Hypothesis and Receptor Models/April 1999 19

these align models. Above you used this tool with no rows selected, which has the same effect as selecting all rows.

The Clear Atom Matches tool works with selected rows in exactly the same way as the Common Subgraph tool, except that it clears the atom matches between align models.

When your Align models have similar chemical features, the Com-mon Subgraph atom matching often gives you exactly the atom matching you want. More often, however, you will want to adjust the resulting matches by adding or subtracting individual atom matches or even specifying all the atom matches by hand.

To manually edit the atom matches, you must first have the Align models in the same view (use Overlay Viewing Mode) and then click on the Pick Atom Matches icon to set the global atom picking mode to atom matching. To add a new atom match, simply pick one atom in one Align model, then pick its match in a second Align model. A new atom match monitor appears and the corresponding number of atom matches in the Align Models table are updated. To remove an existing atom match pick a pair of atoms on two Align models for which an atom match monitor already exists. (You may wish to employ the Use column of the Align Models table and the Update Overlay Display button to temporarily hide other Align models from the view to make the atom picking easier.) You can also pick atom matches between displayed models which are not already in the Align Models table. When you do this, these models are added to the table with the atom match you picked. (Note that this is an alternative way to import models into the Align Models table.)

Page 34: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

20 Cerius2 Hypothesis and Receptor Models/April 1999

2. Aligning Models

Page 35: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Cerius2 Hypothesis and Receptor Models/April 1999 21

3 Incorporating Align Data in QSAR

The Align module functionality is designed to physically align sets of models in space. Align only returns a small amount of numeric data for analysis to the text window: either the overall consensus alignment similarity/RMS or the similarity/RMS values for each model aligned to the target model(s).

You can also perform alignment similarity/RMS calculations using the alignment similarity descriptors through the QSAR Study Table. In this case more statistics are available for an align-ment calculation but no model’s position or configuration is actu-ally altered.

This chapter discusses: The topics covered in this chapter are:

Preparing models for generating alignment similarity descriptors

Adding alignment similarity descriptors to the QSAR Study Table

Using the study table to analyze a consensus alignment

Using alignment similarity descriptors for QSAR analysis

Recalculating alignment similarity descriptors

Preparing models for generating alignment similarity descriptors

Alignment Similarity descriptors are calculated for models in the Study Table with respect to a specific model or alignment of mod-els. These target models are the same ones specified in the Align Models table. Thus before you can add Alignment Similarity descriptors to the Study Table, you must first import models into the Align Models table and denote them as target models (please refer to Chapter 2, Aligning Models). If you are using more than

Page 36: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

22 Cerius2 Hypothesis and Receptor Models/April 1999

3. Incorporating Align Data in QSAR

one target model, you should also perform a consensus alignment on these models.

Adding alignment similarity descriptors to the QSAR Study Table

With one or more target models loaded into the Align Models table you can add alignment descriptors to the QSAR Study Table.

To add these descriptors, go to the Study Table pulldown menu commands and select Descriptors/Select... to open the Descriptors control panel. Use the Descriptors in Family popup to select the Align family. Now click the Preferences button to open the Align-ment Similarity control panel. This panel allows you to choose descriptors to add to the Study Table. You may add up to six descriptors: three types of qualitative calculations based on align-ments using two different alignment strategies. A descriptor (col-umn) of each of the types specified here is added to the Study Table for each target model.

When a descriptor is actually calculated for a Study Table model, this model is first aligned to the target models using the specified method (although the model’s coordinates are not actually changed as they would be if an Align command was used). Elec-trostatic Field Similarity, Steric Field Similarity, and Matched Atom RMS values are then calculated, based on this alignment, for each Align target model.

Page 37: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Adding alignment similarity descriptors to the QSAR Study Table

Cerius2 Hypothesis and Receptor Models/April 1999 23

Note

The method of aligning QSAR (Study Table) models to target mod-els when calculating descriptor values is largely determined by current settings in the Align panel. For example, the Align Method, Align Preferences, and Align Type effect whether or not the models are treated as flexible during alignment. However, the choice of Align Strategy is controlled by the type of descriptor cal-culated:

♦ Consensus target alignment specifies descriptors calculated by aligning a model with the consensus alignment of the target models. This may be thought of as aligning a model with a vir-tual model which represents the average (consensus) alignment of all the target models, although the descriptor values are still calculated with respect to each target model individually.

♦ Pairwise targets alignment specifies descriptors calculated by aligning a model with each individual target model. Hence the current alignment of the target models with respect to each other target model is not relevant when calculating pairwise targets alignment descriptors.

Once you have selected which descriptors to add to the Study Table, select the Alignment Similarity row in the Descriptors table and click the ADD button. After a few moments the new descrip-tor columns appear in the Study Table. Since these descriptors are relative to specific target models the names of the these target models are incorporated into the descriptor names.

Matched Atom RMS values can only be calculated if atom matches between a QSAR model and the Align target model(s) are specified in the Align Models table. You normally only need calculate these descriptor values if you are also using RMS as your Align Method.

In this case, all align models must have atom matches defined for each of the alignment target models, since each alignment descriptor value (table cell entry) represents a RMS value for a pair of models. (This is also the case if you wish to create pairwise field similarity descriptors when the Align Method in the Align control panel is set to RMS.)

Page 38: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

24 Cerius2 Hypothesis and Receptor Models/April 1999

3. Incorporating Align Data in QSAR

Using the study table to analyze a consensus alignment

When you perform a consensus alignment on a set of models using the Align module only an overall alignment similarity or RMS is reported to the text window. If you wish to examine this overall alignment in more detail, (that is, to look at each pair of models involved in the overall alignment), you must calculate alignment similarity descriptors in the Study Table.

To calculate alignment similarity descriptors:

1) Change all consensus aligned models in the Align Models table to target models.

2) Choose and add the alignment similarity descriptors you wish to examine (as described above).

3) Add the aligned models into the Study Table. The descriptor values are calculated for the models as they are added to the table.

Note

Using alignment similarity descriptors for QSAR analysis

Alignment similarity descriptors can be useful for drug discovery when combined with activity data for your target models. Typi-cally you first align a small number of active compounds and use these as your target models to generate alignment similarity descriptors.

Alignment similarity descriptor values calculated for (non-target) models in the Study Table represent a qualitative measurement of the structural and forcefield similarities between these models and

Before adding the models to the Study table you should turn the energy minimization option OFF within the Molecule Preferences control panel of the Study Table. Minimizing the models is not desirable because it affects the current alignment of the models.

Page 39: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Recalculating alignment similarity descriptors

Cerius2 Hypothesis and Receptor Models/April 1999 25

the active/lead/target molecules. To generate these values you need only add these models to the Study Table — the descriptor values are then calculated automatically.

Note

Recalculating alignment similarity descriptors

Alignment similarity descriptors are specific to a set of (aligned) target models, as specified in the Align Models table. Like other complex descriptors, the calculated values are saved with models as user data which is also used to indicate whether the displayed values are up-to-date with respect to all variables used to calculate them. (This user data is written for these models when the models are saved to a MSI formatted file and is loaded back into the Study Table when the models are added and these descriptors are present.)

This user data is marked as invalid for alignment similarity descriptors when:

♦ Align models are changed, (for example, atoms edited or moved)

♦ Align models are moved with respect to relative orientations (for example, because of re-alignment)

♦ Alignment options and/or preferences are changed in the Align module. (When you make such a change for an align tar-get model then all alignment similarity descriptors are marked as not current; if you change a non- align target model, only the descriptors for this model are marked as not being current.)

Alignment Similarity descriptors may be recalculated by issuing the Recalc Descriptors command from the Study Table’s Descrip-tors command menu. Recalculation does not happen automati-cally whenever the current values for the descriptors are outdated,

Creating these descriptors for non- align target models actually requires that these models also be present in the Align Models table. However, these models are automatically added to this table if they are not already present when you check the option ON in the Alignment Similarity control panel

Page 40: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

26 Cerius2 Hypothesis and Receptor Models/April 1999

3. Incorporating Align Data in QSAR

since you may still want to view the old values or you may wish to make several changes to your alignment set up before performing a costly recalculation of all your QSAR descriptors. On the other hand, issuing the Recalc Descriptors command when your descriptors are already updated takes only a moment.

You may remove alignment similarity descriptors from the Study Table at any time by simply selecting the descriptor column and deleting it using the Cut button. You can add these descriptors back, or add new ones, by simply re-adding the alignment similar-ity descriptors via the Descriptors control panel. If you change the target models in the Align Models table and re-add the descriptors to the Study Table, you may end up with alignment similarity descriptors (columns) which are no longer valid. Invalid descrip-tor values simply show up as empty table cells.

Please refer to the Cerius2•QSAR+ documentation for more gen-eral information on manipulating descriptors and the Study Table.

Page 41: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Cerius2 Hypothesis and Receptor Models/April 1999 27

4 Theory: Alignment

Model Alignment Strategies and Methods

Align offers two strategies for aligning a set of models:

1. Consensus alignment aligns a set of specified models to a vir-tual model which represents the best average alignment between each pair of models. In this iterative alignment strat-egy all models are initially aligned to the first model to move the models into a reasonable starting alignment. Each model is then fit to the rest of the models as if these represented one large model but with the fitting parameter (that is, RMS (Root Mean Square Superposition) or field similarity) averaged over the number of models. This process is repeated for each model in turn (at least once per model) until the overall fitting parameter converges (for example, until the change in RMS over succes-sive iterations is small enough).

2. Targeted alignment aligns a set of specified models to one or more specified target models. With just a single target model the resulting alignment is a set of pair-wise alignments of the specified models to the target model (which does not move). This alignment strategy is equivalent to that employed in pre-vious versions of Cerius2•Align.

When you specify multiple models as the target for an alignment these models are not re-aligned, but instead represent a consensus target (virtual model) for other models to be aligned against. For example, you may wish to perform a consensus alignment on a set of active analog models to find an active configuration. Using this active configuration as your alignment target, you may align other models to investigate their ability to adopt a similar configuration.

For either of the above two multiple-model alignment strategies you may employ one of three Align Methods:

Page 42: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

28 Cerius2 Hypothesis and Receptor Models/April 1999

4. Theory: Alignment

1. RMS Atoms alignment performs pair-wise model alignments based on the superposition of a set of matching atoms. You must define these for each pair of models in the Align Models panel. This method is relatively fast and accurate for models where a strong correspondence between specific sets of atoms (for example, functional groups and backbone) can be readily iden-tified.

2. Moments alignment aligns models using either electrostatic moments or principal moments of inertia. A field similarity cal-culation is made to determine the best alternative orientation for these moment vectors. This is the fastest, but least accurate of the alignment methods. It is useful for tasks such as placing models in protein cavities.

3. Field alignment aligns models by maximizing the overlap between the steric and electrostatic fields calculated about the models using a probe potential (for example, a point positive charge). This method may be slower and less accurate than RMS but has the advantage of not requiring definition of any matching atoms. Non-optimal alignment may occur due to the fitting finding local minima when optimizing the field overlap. This effect can usually be reduced by decreasing the number of flexible torsions and/or choosing a more appropriate probe potential for your models.

Non-target models may be aligned in two ways: rigidly, using their current configurations; or flexibly, using a set of specified torsion bonds which are rotated about during the alignment procedure. Generally, using flexible models produces a better alignment but takes a little longer.

RMS Superposition

With RMS Atoms selected as the Align Method, models are aligned using superposition of the matching atoms between each pair of models. The function to be minimized is the sum of squares of the distances between all atoms to be superimposed, as in Eq. 1

Page 43: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

RMS Superposition

Cerius2 Hypothesis and Receptor Models/April 1999 29

Eq. 1

where:

M = the number of molecules;

N(p,q) = the number of atoms between the pth and the qth molecules to be aligned;

and

= the transformed atomic coordinate of the ith atom in the pth molecule. This is described as:

Eq. 2

where:

= the original atomic coordinate;

= the rotation angles of the molecule;

= the translation vector of the molecule;

and

= the angles of all torsions in the molecule to be optimized.

Eq. 1can be minimized by conventional nonlinear least-squares methods.

The RMS value reported by the alignment calculation is:

Eq. 3

where F is calculated using Eq. 1, and

F r ′ip r′iq–( )2

i 1=

N p q,( )

∑q p 1+=

M

∑p 1=

M 1–

∑=

r ′ip

r ′ f r0 θ t φ, , ,( )=

r0

θ

t

φ

RMSF

Npair-------------=

Page 44: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

30 Cerius2 Hypothesis and Receptor Models/April 1999

4. Theory: Alignment

Eq. 4

Electrostatic Potential Similarity

A number of different techniques have been proposed and applied to electrostatic potential similarity calculation, which is becoming a well-established modeling technique (Good, 1992).

Different formulas for similarity determination have been pro-posed. Here, we use the Hodgkin index as discussed by Good for electrostatic potential similarity calculation:

Eq. 5

In Eq. 5, Pa and Pb are the electrostatic potentials for molecules a and b, which are dependent on the atomic charges and distance according to Eq. 6:

Eq. 6

where:

n = the number of atoms in the molecule;

r = the coordinate where electrostatic potential is to be evalu-ated;

Ri = the coordinate position of atom i;

and

Npair N p q,( )

q p 1+=

M

∑p 1=

M 1–

∑=

SF a b,( )

2 PaPb vd∫Pa

2vd∫ Pb

2vd∫+

----------------------------------------=

Pr

Qi

r Ri–-------------

i 1=

n

∑=

Page 45: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Steric Shape Similarity

Cerius2 Hypothesis and Receptor Models/April 1999 31

Qi = the charge assigned to atom i.

The value of the function, SF, ranges from -1, maximum dissimilar-ity, to 1, indicating identical potentials. A value of 0 corresponds to two molecules with zero electrostatic potential overlap, either because the molecules are far apart or because the value of the pos-itive overlap equals the value of the negative overlap.

For multiple molecules, Eq. 7 is used for the similarity calculation and optimization:

Eq. 7

where M = the number of molecules.

Here, the SF function again ranges from -1 (inverse) to 1 (identi-cal).

Steric Shape Similarity

The molecular steric similarity of two molecules is calculated with Eq. 5, where Pa and Pb are the steric functions for molecules a and b, which are Lennard-Jones potentials as in Eq. 8 ("9-6" potential) or Eq. 9 ("12-6" potential), depending on forcefield choice.

Eq. 8

Eq. 9

where:

n = the number of atoms in the molecule;

SF2

M M 1–( )⋅---------------------------- SF a b,( )

b a 1+=

M

∑a 1=

M 1–

∑=

Prek--

sr--

9 sr--

6––

n

i=1

∑=

Prek--

sr--

12 sr--

6––

n

i=1

∑=

Page 46: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

32 Cerius2 Hypothesis and Receptor Models/April 1999

4. Theory: Alignment

r = the coordinate where steric potential is to be evaluated;

k = a constant;

e = epsilon for the atom type;

and

s = sigma for the atom type.

The function SF ranges from 0, meaning zero steric overlap (mole-cules are too far apart), to 1 indicating identity.

Field Alignment (Electrostatic & Steric Similarity)

The combined similarity is calculated for field (and moments) alignment using Eq. 10.

Eq. 10

where:

w = user-specified weighting factor, ranging from 0 to 1.

The combined similarity function SF ranges from -1, meaning maximum dissimilarity, to 1, indicating identity.

Steric Clash (Bump) Checking

To avoid van der Waals clashes in flexible fitting, a penalty func-tion is added to the similarity function or the RMS function during the optimization process, as in Eq. 11.

Eq. 11

where:

SF = the similarity function or RMS function.

The following penalty function is used:

SF w SF steric( )⋅ 1 w–( ) SF electrostatic( )⋅+=

F Optimize( ) SF w F penalty( )⋅+=

Page 47: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Implementation: Gaussian Approximation of Overlap Functions

Cerius2 Hypothesis and Receptor Models/April 1999 33

Eq. 12

Eq. 13

where:

n = the number of atom pairs between rotatable segments in the molecule;

ro2 = the sum of the van der Waals radii;

r2 = the distance between the two atoms.

To provide a degree of "softness" in the penalty function, scaling factors are applied to the van der Waals radii of atoms. The scaling factors are:

♦ 1-4 interactions: 0.85

♦ H-bond candidates: 0.65

♦ Others: 0.95

These scaling factors may not be altered but an overall scaling of the contribution of van der Waals clash penalty function can be altered using the Align Preferences panel.

Implementation: Gaussian Approximation of Overlap Functions

In the electrostatic calculation, Pr in Eq. 6 is replaced by a Gaussian function approximation of two terms, as in Eq. 14. The integrals in Eq. 5 have a simple form based on exponent values and the dis-tance between atom centers (Good et al., 1992).

Eq. 14

F penalty( ) 1ro

r----

2

–2

n

i=1

∑=

F penalty( ) 0 when r2

ro2≥=

1r--- 0.8150e

0.3052r2

–0.2908e

0.0130r2

–+∼

Page 48: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

34 Cerius2 Hypothesis and Receptor Models/April 1999

4. Theory: Alignment

The Lennard-Jones potential in Eq. 8 or Eq. 9 is approximated by the following Gaussians over a range of r covering just inside the repulsive part of the potential, as follows:

Eq. 15

LJ9 6– ε,σ,r( ) 261.238ε exp 6.81 r σ⁄( )2–( )∼

LJ12 6– ε,σ,r( ) 1272.79ε exp 7.5298 r σ⁄( )2–( )∼

Page 49: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Implementation: Gaussian Approximation of Overlap Functions

Cerius2 Hypothesis and Receptor Models/April 1999 35

Page 50: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

36 Cerius2 Hypothesis and Receptor Models/April 1999

4. Theory: Alignment

Page 51: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Part 2

ReceptorIntroduction to Receptor

Receptor QuickStart

Generating a Receptor Model

Evaluating Structures Against a Receptor Model

Theory: Receptor Models

Page 52: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:
Page 53: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Cerius2 Hypothesis and Receptor Models/April 1999 39

5 Introduction to Receptor

The Receptor application creates hypothetical models, called recep-tor surface models, that characterize the active site of an enzyme or antibody based on the construction of surfaces to represent spatial and electrostatic properties of a receptor active site.

Molecular models are minimized within the receptor surface model and interaction energies are calculated, which allows the evaluation of new candidate compounds.

Receptor site models are inferred from activity data of a set of com-pounds considered to act upon the active site of an enzyme or anti-body whose 3D structure is unknown. We assume an underlying complementarity between the shape and properties (for example, hydrophobicity, hydrogen bonding proclivity) of the receptor and the set of compounds that bind to it.

This chapter provides an overview of Receptor. The chapter dis-cusses:

Where to learn about Receptor

Accessing Receptor (next section)

Using Receptor — Describes a typical flow of activity in the module. The discussion directs you to appropriate chapters in Cerius2 books where more detailed information about each activity can be found.

You can get a feel for Receptor’s capabilities and interface by working through the Receptor QuickStart.

See the other Receptor chapters for details of:

Generating a Receptor Model, Chapter 8

Evaluating Structures Against a Receptor Model, Chapter 9

Theory: Receptor Models, Chapter 10

Page 54: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

40 Cerius2 Hypothesis and Receptor Models/April 1999

5. Introduction to Receptor

Accessing Receptor

This section describes the procedure for starting Receptor and the things you need to work with the software.

Before you begin You must have one or more aligned structures displayed in the model window (see Part One, Alignment or Part Three, Pharma-cophore in this book, or see Molecular Shape Analysis in Cerius2 QSAR+).

To start Receptor 1. Select Drug Discovery from the Deck Selector popup. The Drug Discovery card deck appears.

2. Choose the Model Receptor menu card from the card deck.

Using Receptor

The typical flow of tasks in Receptor is outlined in the following steps:

1. Begin with a single molecule or a set of structures that have been aligned using Alignment, ConFirm, or the alignment capabilities of Molecular Shape Analysis. If you have a lot of data it is better to use a subset of the most active compounds.

Note that you can generate structures for alignment in many ways, including these:

♦ Build one or more structures using the Cerius2 3D Sketcher (see the Cerius2 Modeling Environment and the Cerius2 Build-ers)

♦ Generate a series of analogous structures using the Analog Builder (see the Cerius2 Modeling Environment)

♦ Import previously built structures using the Load Model control panel accessed from the File pulldown menu on the menu bar (see the Cerius2 Modeling Environment)

2. Generate a receptor model after you have identified the solvent region (if any) of the molecule and the structures you want to

Page 55: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Using Receptor

Cerius2 Hypothesis and Receptor Models/April 1999 41

use as templates. Alternatively, load a receptor model saved from previous work (Saving and restoring a receptor model).

Before you generate a model, you can adjust generation prefer-ences by accessing the Preferences control panel (Setting prefer-ences).

3. After the receptor model is generated, you may map properties such as charge or hydrophobicity onto the surface of the model (Mapping property values).

4. Orient new candidate structures within the receptor model to evaluate the quality of the fit between the structure and the receptor surface. Evaluating structures inside a receptor model provides quantitative information that you can use to rate the fit.

Before you evaluate a model, you can adjust evaluation prefer-ences by accessing the Preferences control panel (Setting prefer-ences).

Note that you can automatically orient candidates to a model.

5. Map favorable and unfavorable interactions between struc-tures and the surface of the receptor model (Viewing the results).

6. Incorporate the receptor data into a QSAR to further evaluate and to estimate the biological activities of your structures (Incorporating receptor data in QSARs).

7. At this point, you can choose either of two iterative processes that use the steps described above:

Modify (optimize) candidate structures within the receptor model (see the Cerius2 Modeling Environment and the Cerius2 Builders) and re-evaluate them (Evaluating structures).

Generate refined receptor models using new alignments of the template structures you have been using, or use new template structures. Alternatively, you can load saved receptor models from previous work. New receptor models are available for evaluating new or previously evaluated structures (Generating a receptor surface model).

Page 56: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

42 Cerius2 Hypothesis and Receptor Models/April 1999

5. Introduction to Receptor

Page 57: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Cerius2 Hypothesis and Receptor Models/April 1999 43

6 Receptor QuickStart

This chapter allows you to work in Receptor to solve a simple problem.

QuickStart

You begin by reading in a set of compounds, the dopamine beta-hydroxylase inhibitors.

1. Load beta hydroxylase inhibitor model files from Cerius2-Resources directory

Note that the structures are pre-aligned.

2. Add the models to the Study Table

Starting from a new Cerius2 session, load all beta hydroxy-lase inhibitors from the directory Cerius2-Resources/EXAMPLES/DBH: files dbh49.msi through dbh52.msi. Make all the loaded structures visible and set Cerius2 into Overlay mode.

Open the Study Table and select Molecules… from the Pref-erences pulldown. When the Molecule Preferences panel appears, click the Add Hydrogens, Minimize Energy, and Calculate Charges buttons off, then select Add All from the Molecules pulldown menu.

Page 58: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

44 Cerius2 Hypothesis and Receptor Models/April 1999

6. Receptor QuickStart

All four models are added to the Study Table, without being mini-mized, which might disturb the alignment.

3. Enter biological activity data

4. Create a receptor surface model

This tells Receptor to weight the contribution of each model by its bio-logical activity. Compounds that are highly active contribute more to the receptor surface model than less active ones by default (although this can be reversed).

This creates a receptor surface model. Note the Activity weight infor-mation messages in the textport.

The way the shape of the receptor surface model reflects the shapes of

Enter the following values into the Activity column:

dbh49 5.82dbh50 5.92dbh51 6.17dbh52 7.13

Select DRUG DISCOVERY from the card deck, then click MODEL RECEPTOR and then Generate Receptor Model to start the Receptor module.

Click Preferences… to bring up the Preferences control panel and then click the Use QSAR Activity Data button.

Next click the GENERATE button of the Generate Receptor Model control panel.

Page 59: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

QuickStart

Cerius2 Hypothesis and Receptor Models/April 1999 45

the contributing models is probably obvious, but there is also an elec-trostatic component to the receptor model. The potential exerted by each molecule (atomic point charge model) is calculated at each point on the receptor model surface. These potentials are summed and may be displayed by clicking the ELECTROSTATIC Map Property but-ton. You can make the colors more vivid by resetting the Min and Max values to -0.04 and 0.04.

5. Construct a new lead compound

Next you will construct a hypothetical hydroxylase inhibitor.

6. Evaluate the new compound in the receptor model

Now you can evaluate the new molecule in your receptor model.

The energies of interaction (Einteract) between each molecule and the receptor surface model are displayed in the table of the Evaluate Receptor Model panel. Your new lead interacts with an energy of approximately -9 kcal, which makes it a little less active than the two

Load the model Cerius2-Resources/EXAMPLES/DBH/dbh33.msi. Make all the models invisible except this new model, dbh33. Using the 3D-Sketcher, change both of the hydrogen atoms that are ortho to the fluorine atom into flu-orine atoms, creating a 3,4,5-trifluoride.

Clean up the structure with the Clean button on the Sketcher control panel.

Click Evaluate Molecules. When the Evaluate Receptor Model control panel appears, click TOTAL ENERGY.

Page 60: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

46 Cerius2 Hypothesis and Receptor Models/April 1999

6. Receptor QuickStart

most active compounds in the series.

The interaction between the new lead and the surface is mostly favor-able (purple areas) with only one region of unfavorable interaction: the part of the surface colored green complements a phenolic proton in most of the contributing models (including three of the four most active compounds). The fluorine atom is even more electronegative than the phenolic oxygen atom so it interacts favorably with the part of the surface nearest to it, causing an intensely purple area.

We can show that these interactions are primarily electrostatic by col-oring the surface by VDW energy.

Note that there are no green or purple areas near the para substituent, since F is about the same size as O. Sterically, our new lead is a good fit, but electrostatically the absence of a positively charged hydrogen atom causes unfavorable interactions which reduce biological activity.

7. Finish up

The QuickStart lesson is now over.

Make the receptor surface model visible once more by click-ing the Visible buttons of the Cerius2 Model Manager.

Click the VDW ENERGY Map Property button.

To end the Cerius2 session, close all open panels and select File/Exit from the Visualizer menu bar.

If you want to go on to another tutorial, or use Cerius2 to run an experiment, first close all panels and select File/New Session from the Visualizer menu bar.

Page 61: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Cerius2 Tutorials—Life Science/April 1997 47

7 Tutorial: Receptor Surface Analysis (RSA)

This chapter shows how to use the new Receptor Surface Analysis functionality in QSAR.

♦ Construct a Receptor Surface Model provides specific instruc-tions on how to select a set of Independant Variables based on Varianced a Receptor Surface Model from a set of aligned bio-logically active models.

♦ Use the new QSAR descriptor Receptor_RSA provides spe-cific instructions on how to add energies evaluated at each ver-tex of the Receptor Surface Model to the Study Table.

♦ Construct a QSAR relationship tells the user how to select a subset of the added surface points on the basis of highest vari- ance and use these points to derive a QSAR equation which identifies the most significant points on the surface.

Before you begin

To complete this tutorial, you need a licensed copy of Cerius2 that includes these modules:

♦ QSAR+

♦ Receptor

Page 62: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

48 Cerius2 Tutorials—Life Science/April 1997

Load Models and add them to the Study Table

In this lesson you will add molecules to the QSAR Study Table, and enter activity data.

This will optimize the models with the Receptor clean utility.

Load beta hydroxylase inhibitor model files from Cerius-Resources directory.

Starting from a new Cerius2 session, load all 47 beta hydrox-ylase inhibitors from the directory Cerius-Resources/EXAMPLES/DBH/, files dbh02.msi through dbh52.msi.

To clean up the models with the Receptor clean method, select DRUG DISCOVERY from the application pulldown menu, then pick MODEL RECEPTOR. Click on Evaluate Molecules to bring up the Evaluate Receptor Model control panel. Click on the EVALUATE button.

To add the models to the Study Table, bring up the Study Table and select Molecules... from the Preferences pull-down. When the Molecule Preferences panel appears dese-lect the Add Hydrogens, Minimize Energy and Calculate Charges buttons, then select Add All from the Molecules pulldown menu.

Page 63: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Cerius2 Tutorials—Life Science/April 1997 49

All 47 models will be added to the Study Table.

Generate a Receptor Surface Model (RSM)

Now we will create a Receptor Surface Model using a subset of the most active models. All the models except the last one loaded (dbh52) should be invisible at the moment.

To import biological activity data, select Import... from the File pulldown menu. When the Import Table panel appears select File Contains Column Labels. Then read in the dbhactivity.dat file from the Cerius2-Resources/EXAM-PLES/DBH directory.

Select the column of activity data by clicking on the column heading -log(IC50). Copy this column of data by clicking on the COPY button in the Table Manager.

To transfer this column to the Study Table, click on the col-umn heading Activity, then select Paste from the Edit pull-down of the Study Table. Insert the -log(IC50) column before the blank Activity column, then select Delete from the Edit menu to delete the blank.

Make models dbh51 through dbh49 visible by clicking on the Visible buttons in the Model Manager.

Page 64: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

50 Cerius2 Tutorials—Life Science/April 1997

This identifies the -log(IC50) column as Independant Y variables. Receptor will use this activity data to weight the contribution of each molecule to the Receptor Surface Model.

Add Receptor Surface energies to the Study Table.

The energy of interaction between the active models and each point on the surface will now be added to the Study Table. This allows us to identify places on the surface which are related to biological activity, and use these to calculate a QSAR.

Choose Select... from the Descriptors pulldown. When the Descriptors panel appears, select Receptor from the family popup menu, then click on the preferences button Recep-tor....

Two panels will appear which control the addition of energies of inter-action between the receptor surface model and the set of molecules to

Next, click on Generate Receptor Model in the deck of cards menu for MODEL RECEPTOR. When the Generate Receptor Model control panel appears, click on PREFER-ENCES... and select the Use QSAR Activity Data button.

Now click on the -log(IC50) column header in the Study Table, then select Set Y under the Variables pulldown.

We want to build the RSM from only the most active com-pounds so set the apply To Molecule(s) popup menu to All Visible. Finally, click on the GENERATE button of the Gen-erate Receptor Model panel to create the Receptor Surface Model (RSM).

Page 65: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Cerius2 Tutorials—Life Science/April 1997 51

the Study Table.

The second panel deals with the surface - molecule interactions at each point on the surface, and the first panel deals with the sum of these interactions.

For example, if selected, the button Nonbonded TOT Energy between Molecule and Receptor will add sum the VDW and elec-trostatic energies of interaction between each point on the surface for each molecule to the Study Table, i.e., one number for each molecule. If the TOT button in the RSA Preferences is slected, then the VDW and electrostatic energies at each point will be added. This may be thousands of columns of data, one for each point on the surface.

Unless we perform some kind of filtering process to this, there will be a great many columns of data added, probably too much for the statis-tical methods we may want to apply to this data later. Therefore the Filter Surface Points popup menu in the RSA Preferences panel lets us reduce input to the Study Table by adding only every nth point, variance, and correlation. The default method for filtering Study Table input is to enter only ten percent of surface points, choosing those with highest variance.

It is hard to decide before running the calculation what filtering method should be used, so a good place to start is to select the ’Add Every Nth surface point’ option.

The Receptor Surface energies will be added to the Study Table and you will see columns added on the right (TOT/1, TOT/11, etc).

Next, click on the button under the ADD button in the Descriptors panel, and then click on row 23 Receptor_RSA to select the RSA descriptor. Finally, click on the ADD but-ton.

We can see which points on the surface have been used by selecting Equation Viewer... and when the panel appears, click on the Show selected points More... button.

Page 66: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

52 Cerius2 Tutorials—Life Science/April 1997

The chosen Independant X variables show up as crosses. Since we chose the ’Every Nth surface point’ option they crosses should be evenly distributed over the surface.

Calculate a QSAR

The most important point will have the largest Loading Coefficient (the number in square brackets).

This concludes the tutorial.

Change the 3D-QSAR Labels popup menu from MFA to RSA, then click on Label Independant Variables. If you have not already done so, put Cerius2 Visualizer into Over-lay mode so that all the objects are superimposed in the graphics window.

Select the GFA statistical method, and increase the number of cross-overs by selecting Statistical Method from the Pref-erences pulldown of the Study Table. Increase the parame-ter Generations to 20000.

Next we must include non-linear terms in the expression, so click on the Configure GFA... button and when the Config-ure GFA panel appears, select the Spline, Quadratic, Offset Quad and Quad Spline buttons.

Click on RUN to start the calculation. After a few moments the Equation Viewer will appear. Use the 3D-QSAR Labels menu to show the surface points selected by the GFA method as crucial in the QSAR by clicking on the Label Cur-rent equation.

Page 67: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Cerius2 Hypothesis and Receptor Models/April 1999 53

8 Generating a Receptor Model

To evaluate new structures for potential biological activity, you must first generate a receptor surface model from the overlay of known active compounds.

This chapter explains Generating a receptor surface model

Saving and restoring a receptor model

Listing a receptor model

Mapping property values

Before you begin You must have one or more aligned structures displayed in the model window (see Part One, Alignment, Part Three, Pharmacoph-ore, or the Cerius2 QSAR+ discussions of Molecular Shape Analy-sis). If you have a large set of active compounds, it is probably best to use a subset of the most active compounds to build your recep-tor surface model.

Now you can display the properties stored at each vertex (electro-static potential, partial charge, hydrophobicity, and hydrogen bonding propensity).

You can generate structures to align and use in creating a receptor surface model through one of several methods:

♦ Build one or more structures using the Cerius2•3D Sketcher or a Cerius2•Builder (see the Cerius2 Modeling Environment and the Cerius2 Builders)

♦ Use the Analog Builder to generate a series of analogous struc-tures (see the Cerius2 Builders)

♦ Import previously built structures using the Load Model con-trol panel accessed from the File pulldown on the menu bar (see the Cerius2 Modeling Environment)

Page 68: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

54 Cerius2 Hypothesis and Receptor Models/April 1999

8. Generating a Receptor Model

To begin the receptor generation process

Choose Generate Receptor Model from the Model Receptor menu card in the Drug Discovery card deck. The Generate Recep-tor Model control panel appears.

Generating a receptor surface model

When you generate a receptor surface model, the receptor surface that you see in the model window is a semi-transparent surface that surrounds the template structures. The molecules within the receptor are visible and can be edited, transformed, or otherwise manipulated using standard Cerius2 tools. The model and the structures within it also can be manipulated together.

To generate a receptor surface model

1. Select the structures that you want to use as templates for gen-erating a receptor surface model. To make this selection, use the Apply To Molecule(s) popup on the Generate Receptor Model control panel. You can select all the structures listed in the model table (All), the models that are visible in the viewing

Page 69: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Generating a receptor surface model

Cerius2 Hypothesis and Receptor Models/April 1999 55

window (All Visible), or only the current structure (Current). All is the default selection.

2. If you want to adjust generation preferences, click Preferences. The Preferences control panel is displayed. For more informa-tion, see Setting preferences.

3. Use the Solvent Atoms popup to specify any structures or parts of structures that are exposed to solvent, You can choose no exposure (None), exposure of selected atoms (Selected), or exposure of a group (Group) of atoms that you have defined. None is the default selection. For information on making atom selections, see the Cerius2 Modeling Environment.

If parts of a structure are accessible to solvent, no surface is gen-erated in that region and the receptor model is open.

4. Make adjustments to the model display. Use the Attributes of Receptor Model popup to specify the model that you want to affect.

5. If appropriate, increase or decrease the tightness of fit of the model with respect to the template structures. Make adjust-ments using the Surface Fit slider bar. Surface fit is defined by the distance in angstroms of the model from the van der Waals surface of the template molecules. The default distance is 0.10 Å. The slider can be adjusted from -0.50 (closer fit) to 1.00 Å (looser fit). Fitness can be adjusted anytime after the receptor model is generated by moving the slider.

When you change the fit of the receptor model, you affect the outcome of new structure evaluation. The geometry of a new structure in the receptor surface model and the interaction score for a new structure in the receptor can be affected. For example, if you generate a receptor model using the boat geometry of cyclohexane, the chair geometry can sit within the receptor if the surface fit is greater than or equal to zero. However, the interaction score will reflect the strain of this fit.

If you lower the surface fit to less than zero, the molecule changes from the chair geometry to the boat geometry in order to fit in the receptor with the tolerance you have specified. The interaction score reflects the change to the more favorable geometry.

Page 70: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

56 Cerius2 Hypothesis and Receptor Models/April 1999

8. Generating a Receptor Model

6. Adjust the transparency of the receptor surface model by mov-ing the Transparency slider bar. The percentage transparency is indicated in the field to the right of the slider bar. The default value is 50.0 percent.

7. Specify the property you want to map to the receptor surface by selecting one of the Map Property choices (see Mapping property values).

8. Click Generate. The receptor surface model is generated and displayed in the model window as a smooth semi-transparent surface contoured around the template molecules. If any prop-erty is mapped, the surface is colored according to the mapped property rather than the default color.

The receptor model name is listed in the Model table and in the Using Receptor Model popup on the Evaluate Receptor Model control panel.

Molecules are visible within the receptor when Model Receptor is in overlay mode and transparency is greater than zero. If you have no atoms accessible to solvent, the receptor surface com-pletely encloses the molecules. If some atoms of the molecules are solvent-accessible, the receptor surface is open in the area of these atoms.

Setting preferences

This section describes additional preferences that you can set for generating receptor surface models.

To set preferences 1. Click Preferences on the Generate Receptor Model control panel. The Preferences control panel is displayed.

Page 71: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Generating a receptor surface model

Cerius2 Hypothesis and Receptor Models/April 1999 57

2. In the Generate Preferences section, check the Use QSAR Activity Data check box if you want Receptor to use activity data from the Study Table for the molecules that you have spec-ified as templates to generate a receptor model. The activity data from QSAR is used to weight the relative importance of each template structure in generating the receptor model.

If the box is not checked, all template molecules are assumed to have equal activity when the receptor model is generated. This is the default.

Note that the Set Independent column (with Activity Data) must be set first, before checking this box.

3. Check the Include Solvation Correction if you want Receptor to add a penalty function when polar atoms are placed in hydrophobic regions of the receptor surface model. This box is not checked by default.

4. Choose the type of receptor surface you want Receptor to gen-erate by making a selection in the Receptor Surface Type sub-panel. The choices are:

Page 72: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

58 Cerius2 Hypothesis and Receptor Models/April 1999

8. Generating a Receptor Model

van der Waals — This is the default selection. When you make this selection, the receptor surface characterizes the van der Waals shape of the molecules. Atom positions of the aligned molecules are clearly defined.

Soft — When you make this selection, the receptor surface is a more abstract representation of shape. The surface is much smoother and hides individual atom details. Genera-tion of this receptor surface is based on the Wyvill field func-tion for soft objects.

Saving and restoring a receptor model

Receptor models can be saved and reloaded at a later time for fur-ther work with the same or different structures.

To save a receptor model With the model you want to save set in the Model Manager as cur-rent and visible, select Save Receptor Model from the Model Receptor menu card. Use the File Selector tools to select and save the model. When you select a model from the browser box, the name of that model is entered in the filename field. Because recep-tor surface model files can be quite large, it may take some time for the file to be saved.

To restore a receptor model

Select Load Receptor Model from the Model Receptor menu card. A File Selector appears. Use the File Selector tools to select and load the model you want to display. Because receptor surface model files can be quite large, it may take some time for the model to be restored and displayed.

Listing a receptor model

The property values stored at each point on the receptor surface model may be listed to the textport, to a table, or to a file.

To list a receptor model Select which receptor surface model you wish to list by selecting the List receptor model parameter. After choosing the destination (TEXTPORT, TABLE, or FILE), click LIST.

When written to the textport, the listed data forms columns.

Page 73: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Mapping property values

Cerius2 Hypothesis and Receptor Models/April 1999 59

Surface Coordinatespoint X Y Z mol atom charge esphbond hydrophob

Column 1 is the number of the surface point, followed by its carte-sian coordinates in columns 2, 3, and 4. The number of the mole-cule (in the bundle of aligned molecular models used to construct the receptor surface) followed by the number of the atom in the molecule follow in columns 5 and 6. The next four columns hold the properties partial charge, electrostatic potential, hydrogen bonding strength, and hydrophobicity.

Mapping property values

This section describes the properties of the receptor surface model that can be mapped, and the procedure for generating a map.

Property values of the receptor are mapped onto the displayed receptor surface. One property can be mapped at a time. When a different property is selected, the map is automatically updated. Property maps are displayed as color regions of the receptor sur-face. These properties reflect the anticipated characteristics of the receptor that is being modeled. The intensity of the color reflects the magnitude of the mapped property at a particular location.

Properties that can be mapped include the following:

♦ Nothing — No property is mapped. The receptor surface is transparent or translucent white.

♦ Electrostatic potential — When this property is mapped, each surface vertex is colored according to the potential value at the vertex position. Red areas have negative electrostatic potential, blue areas have positive potential, and white areas have neutral potential.

♦ Charge — When this property is mapped, the surface color is based on the average of the charges of the template atoms clos-est to the receptor surface. Red areas are positively charged, blue are negatively charged, and white areas are neutral.

Page 74: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

60 Cerius2 Hypothesis and Receptor Models/April 1999

8. Generating a Receptor Model

♦ Hydrogen Bonding — When this property is mapped, the color indicates the tendency for specific areas of the surface to act as hydrogen bond donors (purple) or acceptors (light blue). Areas of the model with no hydrogen bonding activity are colored white.

♦ Hydrophobicity — When this property is mapped, the surface is colored brown to map the hydrophobic areas of the model. Areas that are not hydrophobic relative to the scale you have set on the panel are white.

To map a property 1. Generate a receptor model (see the previous section).

2. Specify the property you want to map to the surface of the receptor by selecting one of the display choices from the Map Property box on the Generate Receptor Model control panel (that is, Nothing, Electrostatic, Charge, H Bonding, or Hydro-phobic). The model automatically displays the map. There is no need to click the Generate button again.

3. If you want to adjust the value range of the property map, enter values in the entry boxes of the color legend located to the right of the property list in the control panel, then click on the radio button for that property.

The box on the left side of the legend displays the minimum value for the property. The box to the right displays the maxi-mum value. The range you set is relative, and simply adjusts the color display relative to that range. For hydrogen bonding, the acceptor box is on the left and the donor box is on the right.

If you want to map another property, select another display choice. A new mapping is automatically displayed using the current min-imum and maximum values.

Page 75: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Cerius2 Hypothesis and Receptor Models/April 1999 61

9 Evaluating Structures Against a Receptor Model

Receptor models are used to evaluate the biological activity of new molecular models.

This chapter explains ♦ Evaluating structures

♦ Viewing the results

♦ Incorporating receptor data in QSARs

Before you begin You should have a receptor model generated for evaluating struc-tures. However, you can evaluate structures without a receptor model by running the energy evaluation algorithm alone .

You also must have structures created, aligned, and loaded into the model table. You can generate structures using several meth-ods, for example:

♦ Build one or more structures using the Cerius2• 3D Sketcher or a Cerius2•Builder (see the Cerius2 Modeling Environment and the Cerius2 Builders)

♦ Use the Analog Builder to generate a series of analogous struc-tures (see the Cerius2 Builders)

♦ Import previously built structures using the Load Model con-trol panel accessed from the File pulldown on the menu bar (see the Cerius2 Modeling Environment book)

The evaluation process attempts to fit structures to the receptor model. It adjusts the geometry of structures to fit the model in the orientation in which they are placed within the model.

When the evaluation process is complete, you can examine several aspects of the results. Visual examination gives information on structural changes that may improve the fit into the receptor site. Property interaction maps give you additional visual information.

Page 76: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

62 Cerius2 Hypothesis and Receptor Models/April 1999

9. Evaluating Structures Against a Receptor Model

You also can evaluate the quality of the fit for each structure in the receptor site by analyzing the numerical data that is generated.

Since the evaluation process is a fitting algorithm and does not search for the best orientation of the molecule, you must correctly set the gross orientation of a new structure relative to the receptor model. The PreAlign checkbox in Preferences automatically aligns molecules to a surface prior to evaluation, or you can use alignment procedures to orient your structures properly.

To start the evaluation Choose Evaluate Molecules from the Model Receptor menu card in the DRUG DISCOVERY card deck. The Evaluate Receptor Model control panel is displayed.

Evaluating structures

Evaluate aligned structures with respect to a receptor model you have generated and selected. If the Minimize Molecules During Evaluation check box is checked, the evaluation procedure also adjusts the geometry of the structures into best-fit configurations based on the constraints imposed by the receptor model. If the Pre-

Energy data is displayed here

Interaction property map choices

Page 77: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Evaluating structures

Cerius2 Hypothesis and Receptor Models/April 1999 63

Align Molecules to Surface box is checked the molecules are first oriented into the surface.

To evaluate a structure 1. Select one or more structures that you want to evaluate. To make this selection, use the Apply To Molecule(s) popup on the Evaluate Receptor Model control panel. You can select all the structures listed in the model table (All), the structures that are visible in the model window (All Visible), or only the cur-rent structure (Current).

2. Select the receptor model that you want to use in the evaluation process. To make this selection, use the Using Receptor Model popup on the Evaluate Receptor Model control panel. If you want to run the energy evaluation algorithm without the influ-ence of a receptor model, select None.

3. Indicate if you want model constraints imposed on each struc-ture. To do this, make sure that Minimize Molecules During Evaluation is checked. This is the default setting. If it is not checked, evaluation is of the static conformation of the struc-ture. Einside is the energy of the static structure.

4. If you want to adjust additional evaluation preferences, click Preferences. The Preferences control panel is displayed. For more information, Setting preferences.

5. Specify your choice of structure/receptor interactions to map on the surface of the receptor model.

6. Interaction values of the receptor with the structure are mapped in color onto the displayed receptor surface. The map indicates favorable (magenta) and unfavorable (green) interac-tions.

One property is mapped at a time. When you select a new prop-erty from the control panel, the map is not automatically updated. To update the model, click Evaluate.

Properties that can be mapped include: electrostatic energy, van der Waals energy, and total energy (electrostatic plus VDW). You can also display no interaction map (Nothing). Addition-ally, you can adjust the range of mapped values by entering minimum and maximum values in the color legend to the right of the interactions list. Use this feature to highlight small differ-ences in energy.

Page 78: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

64 Cerius2 Hypothesis and Receptor Models/April 1999

9. Evaluating Structures Against a Receptor Model

7. Click Evaluate. If you have checked Minimize Molecules Dur-ing Evaluation, the geometry of the structure is adjusted to fit the constraints of the model. If the checkbox is not checked, no adjustment is made.

The property interaction you have chosen is mapped as colored regions on the receptor surface. Also, on the Evaluate Receptor Model control panel, values for Energy, Estrain, and Einteract are listed along with the molecule name in the table.

Setting preferences

This section describes additional preferences that you can set for generating receptor models.

To set preferences 1. Click Preferences on the Generate Receptor Model control panel. The Preferences control panel is displayed.

The Preferences panel is used to change generation and evalu-ation defaults. (See Hahn and Rogers (1995b) for a description of van der Waals and soft surfaces.)

Page 79: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Evaluating structures

Cerius2 Hypothesis and Receptor Models/April 1999 65

2. Check Include Solvation Correction if you want Receptor to add a penalty function when polar atoms are placed in hydro-phobic regions of the receptor surface model. This box is not checked by default.

When the box is checked, if the fraction of receptor surface hydrophobic points to total points in proximity to a polar atom is greater than 0.90, then a correction energy of 0.3 kcal/Mol Å2 is added. This energy is proportional to the exposed surface area of the polar atom. The term is added to the total energy after minimization is complete.

Use of a solvation correction term is based on the assumption that any electrostatic interactions between a molecule and sur-rounding water molecules will be replaced by similar electro-static interactions when the molecule is bound.

3. Choose the type of receptor surface you want Receptor to gen-erate by making a selection in the Receptor Surface Type sub-panel. The choices are:

van der Waals — This is the default selection. When you make this selection, the receptor surface characterizes the van der Waals shape of the molecules. Atom positions of the aligned molecules are clearly defined.

Soft — When you make this selection, the receptor surface is a more abstract representation of shape. The surface is much smoother and hides individual atom details. Generation of this receptor surface is based on the Wyvill field function for soft objects.

4. Adjust the parameters used to evaluate energy by making a selection in the Evaluate Energy Using subpanel. The choices are:

Partial Charge Complementarity — Computes electrostatic energy assuming that the receptor surface has a charge that is equal to and opposite in charge to the partial charges of the atoms used in the template molecules.

Electrostatic Charge Complementarity — Computes electro-static energy by giving surface charges corresponding to the electrostatic potential (ESP) expected on the surface, assuming that the ESP value will be equal and opposite the ESP of the template molecules. This is the default selection.

Page 80: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

66 Cerius2 Hypothesis and Receptor Models/April 1999

9. Evaluating Structures Against a Receptor Model

You have completed this activity.

Viewing the results

When the evaluation process is complete, you can examine several aspects of the results. The energy evaluation algorithm attempts to alter the geometry of the structure into a conformation consistent with the surface model. Visual examination of the structures gives information on the structural changes that improve the fit into the receptor site. Property interaction maps generated for each prop-erty give you additional visual information about where favorable and unfavorable contacts are located.

You also can evaluate the quality of the fit for each structure in the receptor site by analyzing the numerical data that is generated and displayed in the Evaluate Receptor Model control panel.

Energy evaluation algo-rithm

The energy evaluation algorithm uses a fast, approximate force field to produce reasonable geometries and energies quickly. The force field is general and does not rely on specific force field atom types. Only element type, hybridization, and bond type are used to describe the energy of the system. For more information about the energy evaluation algorithm, see Hahn and Rogers (1995b).

The Evaluate Receptor Model control panel includes a data table which lists all the structures that have been evaluated and the fol-lowing data points for each molecule:

Energy — This term reports the internal strain energy of a mol-ecule as it sits in and is constrained by the receptor site.

Estrain — This term reports the internal strain energy of a mole-cule as it sits in the receptor site without being subject to recep-tor model constraints. This value should always be less than or equal to the Energy term.

Einteract — This term reports the interaction energy of the mol-ecule with the receptor. The more negative the value, the greater the interaction between the molecule and the receptor. Einteract is the sum of van der Waals and electrostatic interac-tions.

Page 81: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Incorporating receptor data in QSARs

Cerius2 Hypothesis and Receptor Models/April 1999 67

The difference between Energy and Estrain indicates the amount of strain imposed on the candidate structure by the receptor con-straints. The value of this difference and the value of Einteract together can be used to judge the quality of the fit of each structure in the receptor site. Note that reported values reflect the surface fit that you chose when you generated your receptor model. If the fit that you chose was loose (greater than 0.1 Å), a candidate structure may appear less strained and show less interaction. If the fit was tight (less than 0.1 Å), strain and interaction of a candidate struc-ture both may appear higher. You can obtain reasonable energies by running an evalutation with the Surface Fit parameter set to 0.10 Å. This is the default setting.

Next steps From this point, there are several options open to you to generate additional information on potentially useful structures. For exam-ple, you can:

♦ Change the interaction property that is mapped on the receptor surface to get more information for analyzing the interaction between the structure and the receptor.

♦ Use the 3D Sketcher to modify an evaluated candidate struc-ture outside the receptor model (unconstrained), then run an evaluation procedure on the new structure.

♦ Use other Cerius2 tools to modify a candidate structure (change orientation, add atoms or functional groups) as it sits in the receptor site (constrained) and then reevaluate the structure.

♦ Use the receptor model to build new, reasonable, optimized structures.

♦ Generate another receptor model using new or modified struc-tures. You can then evaluate the group of candidate structures against the new model.

Incorporating receptor data in QSARs

The interaction energy calculated by Receptor between a molecule and a receptor surface model may be used in the QSAR+ applica-tion to develop a QSAR, that is, the goodness of fit between a can-didate structure and a receptor surface model may be correlated with activity data.

Page 82: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

68 Cerius2 Hypothesis and Receptor Models/April 1999

9. Evaluating Structures Against a Receptor Model

Before you begin Align all the models that you intend to use in receptor surface modeling. You must know the biological activity of all the mole-cules you intend to use. Use the most active molecules as tem-plates for the receptor model. Use the model with highest activity as the target model for alignment.

Start the process by generating a receptor surface model. For infor-mation on this procedure, see Generating a Receptor Model. When you have generated a receptor surface model, save it. For informa-tion on saving, see Saving and restoring a receptor model.

Building a QSAR using receptor data

When you have generated a receptor surface model and have aligned the models you want to study, you can proceed to build a QSAR using data from the receptor-structure iterations. This sec-tion describes the procedure.

To build a QSAR using Receptor data

1. If it is not displayed in the Cerius2 Models window, load and display the receptor model that you have calculated and the most active model that you used for a template. Make sure that the molecule orientation is the same as that used to construct the receptor surface model.

2. Evaluate all models relative to the receptor surface model. For information on this procedure, see Evaluating structures.

3. Create a Study Table and add the structures you are studying in Receptor. For information on this procedure, see Cerius2 QSAR+.

4. Add receptor data to the Study Table by selecting the descriptor Receptor_energies. For information on this procedure, see Cerius2 QSAR+.

5. Calculate a QSAR using the Genetic statistical method (this procedure generates the most reliable QSAR data). For infor-mation on generating a QSAR, see Cerius2 QSAR+.

Assuming the receptor model is predictive, the significant QSAR equations that are generated contain one or more of the receptor energy descriptors in them.

6. Save the QSAR equation. For information on this procedure, see Cerius2 QSAR+.

Page 83: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Incorporating receptor data in QSARs

Cerius2 Hypothesis and Receptor Models/April 1999 69

Using a QSAR with receptor data to estimate activity of a structure

You can use a QSAR in which you have incorporated receptor data to estimate the biological activity of a structure relative to the receptor surface model you have built. This section describes the procedure. It assumes that you have generated a receptor surface model, that you have used evaluation data from the model to gen-erate a QSAR equation, and that the QSAR has been saved. Both the receptor surface model and the QSAR equation are used in the procedure.

To estimate biological activity

1. If it is not displayed in the Model window, load and display the receptor surface model that you have calculated and the most active model that you used for a template. Make sure that the model orientation is the same as that used to construct the receptor model.

2. Create or load the model you want to use to predict activity.

3. Align the new model using the most active analog as the target for the alignment. The orientation of the target model should be the same as the orientation used to construct the original recep-tor model.

4. Evaluate the new model in relation to the receptor surface model. Make sure that the new model is the current model, then set the Apply To Molecule(s) field of the Evaluate Recep-tor Model control panel to Current. Make sure the Minimize Molecules During Evaluation checkbox is toggled on, then click the EVALUATE button so that the interaction energy is added to the Energy table of the Evaluate Receptor Model con-trol panel.

5. Add the new model to the Study Table.

Make your new structure the current model in the Model Man-ager and select Add Current from the Molecules pulldown of the Study Table. To determine the predicted activity of the new model, click the ADD button of the Descriptors panel. The energies and predicted activity of the new model are added to the Study Table.

Page 84: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

70 Cerius2 Hypothesis and Receptor Models/April 1999

9. Evaluating Structures Against a Receptor Model

Page 85: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Cerius2 Hypothesis and Receptor Models/April 1999 71

10 Theory: Receptor Models

It is common to have biological activity data for a set of com-pounds that bind to an enzyme or antibody but not to have any firm knowledge of the 3D structure of the active site. One can deduce a hypothetical model of the binding site known as a recep-tor site model, and this model ought to be predictive and sufficiently reliable to guide the medicinal chemist in the design of novel com-pounds.

Receptor surface models

Receptor site models are different from pharmacophore models, which postulate a 3D arrangement of atoms recognizable by the active site in terms of the similarity of functional groups common to the set of binding molecules. In contrast, receptor site models do not contain atoms, but try to directly represent the essential fea-tures of an active site by assuming complementarity between the shape and properties of the receptor site and the set of binding compounds.

The Receptor application uses 3D surfaces that define the shape of the receptor site by enclosing the most active members (after appropriate alignment) of a series of compounds. Note that errors in alignment can lead to incorrect, poorly predictive receptor sur-face models.

The surface is generated from a “Shape Field”. The atomic coordi-nates of the contributing models are used to compute field values on each point of a 3D grid. The two functions used to calculate the field values are a van der Waals function (Eq. 16) and a Wyvill soft object function (Eq. 17).

Eq. 16V r( ) r VDWr–=

Page 86: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

72 Cerius2 Hypothesis and Receptor Models/April 1999

10. Theory: Receptor Models

Where r = distance from the atomic coordinate to the grid point and VDWr = van der Waals radius of the atom. This allows a sur-face to be constructed at any arbitrary isovalue of density on the grid. For example, at the van der Waals surface, V(r) = 0.

Eq. 17

The Wyvill function (Eq. 17) is also a function of r but it is bounded and decays completely within the distance R, that is, V(R) = 0. R is set to be twice the van der Waals radius of each atom type. The Wyvill shape function gives smoother surfaces, whereas the van der Waals function produces a molecule-shaped surface.

The isovalue chosen is called the surface fit and the Marching Cubes algorithm is used to create a triangulated surface fit with an average surface density of about six points per square angstrom, and average distance between points of about 0.5 angstroms.

Mapping properties

Properties are stored with each surface point of the receptor model. The first property stored is the partial charge, which would be desirable in that position in the receptor so as to maximize elec-trostatic attraction between the model and surface, based on the assumption that the charge at any point on the surface is comple-mentary to the partial atomic charge of any atom in contact with the surface. If the receptor model is constructed from a single mol-ecule, each surface point is given a charge that is equal but oppo-site in sign to the closest atom in the molecule. If the receptor model is constructed from an aligned bundle of models, each sur-face point is given a charge that is equal and opposite in sign to the average partial charge of the closest atoms in each molecule. This also assumes that each model contributes equally.

The second property stored at each vertex is an electrostatic poten-tial. Again the assumption of complementarity is made. If the receptor model is calculated from a single molecule, each surface point is given a potential that is equal but opposite to the potential calculated from summing:

V r( ) 4r2

9R6---------

17r4

9R4-----------

22r2

9r2-----------– 1+ +–=

Page 87: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Molecule-Receptor model interactions

Cerius2 Hypothesis and Receptor Models/April 1999 73

Eq. 18

r = distance between point and atomQa = charge on atomQb = charge on point

for all atoms. As before, if a surface is constructed over a bundle of molecules, the complementary potential is calculated as the aver-age of the potentials exerted by each molecule in the bundle.

Third, a hydrogen bonding value is stored at each surface point, found by projecting a cone away from each hydrogen bond donor or acceptor atom. There is no manipulation of proton positions to take into account O-H rotameric conformations. Acceptors are defined at either O or N atoms with a free pair of electrons, and donors are any hydrogens attached to oxygen or nitrogen.

Fourth, each point is classified as being either hydrophobic or hydrophilic, based on the other properties. A hydrophobic point is defined as a point with low partial charge (absolute value less than 0.15), a low electrostatic potential (absolute value less than 0.01), and a low hydrogen bond-donating or accepting propensity (abso-lute value less than 0.1).

In the Cerius2 Model Manager, these four properties can be mapped onto the surface and colored to show regions of potential, partial charge, hydrophobicity, and hydrogen bonding potential.

Molecule-Receptor model interactions

Molecular models can be minimized inside the receptor surface model—not just the models that were used to build the surface, but also new models. The receptor surface and the model are opti-mized by the fast, approximate forcefield “Clean”. You can use Clean on models without a receptor surface model.

The properties discussed above are used to calculate non-bonded interaction energies between the atoms of a molecular model and all the points of the receptor surface (with a 6-Å cutoff), by means of a van der Waals term (1) and (2), an electrostatic term (3), and a desolvation energy correction term.

QaQb

r2--------------

Page 88: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

74 Cerius2 Hypothesis and Receptor Models/April 1999

10. Theory: Receptor Models

Eq. 19

Eq. 20

where:

RA is the hybridization-corrected van der Waals radius of the atom

VDWr is the van der Waals radius

Ch is the hybridization factor—

for sp2, Ch = 0.95; for sp3, Ch = 0.90

r is the distance between the atom and the surface point

K is the well-depth constant (set to 0.1 for all atom-point inter-actions)

D is an empirically derived point density scaling factor

D scales the vdW energy and forces so that ideal atom/surface interactions yield a typical value of 0.0125 kcal/Å2 of surface con-tact.

The electrostatic term is a monopole-monopole Coulombic func-tion:

Eq. 21

r is the separation between the atom and the surface point, Qa is the charge on the atom, Qp is either the potential or the charge at the point. D is the same density correction factor used in the van der Waals term, and S(r) is an atom based switching function employed in CHARMM.

Eq. 22

for ron < r < roff

Evdw KRAr

------- 12

2DRAr

------- 6

– =

where RA VDWr Ch×=

Eele322.1 Qa Qp××

r----------------------------------------

D S r( )××=

S r( )r2

off r2–( )2 r2off 2r2 3r2

on–+( )r2

off r2on–( )3

-------------------------------------------------------------------------------=

Page 89: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

RSA (Receptor Surface Analysis)

Cerius2 Hypothesis and Receptor Models/April 1999 75

This gives a continuous potential energy and force. The current settings are as follows:

ron = 7 Å, roff = 8 Å

All pairs of atoms with a separation greater than this cutoff are neglected in the nonbonded calculation.

The desolvation energy correction term is a penalty function that attempts to account the loss of solvation when polar atoms are forced into hydrophobic regions of the receptor surface. If the frac-tion of hydrophobic points to total points in proximity to a polar atom is greater than 90%, then the desolvation correction energy is added. The magnitude of the term is proportional to the exposed surface area of the atom by the constant 0.3 kcal/mol Å2. We make the assumption that any electrostatic interaction between a mole-cule and the surrounding solvent will be replaced by similar elec-trostatic interactions when the molecule is bound. The desolvation term is added to the total energy after the minimization of the model in the receptor model is complete.

Evaluation of molecule-receptor model energy interactions allows us to construct a receptor surface model and then calculate the energy of interaction between the receptor model and a molecular model. Medicinal chemists can evaluate compounds whose bio-logical activity has not yet been measured, or entirely new hypo-thetical structures. Moreover the energies can be incorporated into the Study Table (descriptor Receptor_energies) as indepen-dent Y variables to construct a QSAR relationship.

RSA (Receptor Surface Analysis)

The CoMFA formalism calculates probe interaction energies on a rectangular grid around a bundle of active molecules. Receptor calculates molecule-receptor model interaction energies on a receptor surface, and, like CoMFA, these energies can serve as input for the calculation of a QSAR relationship.

Just as each energy associated with an MFA grid point can be used, so can each point on the surface of a receptor model. Hopefully, the

Page 90: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

76 Cerius2 Hypothesis and Receptor Models/April 1999

10. Theory: Receptor Models

receptor surface is better able to sample the environment of the molecule than a rectangular grid, leading to better results.

Page 91: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Part 3

Pharmacophore

Introduction to Generating a Pharmacophoric Hypothesis

Generating Conformations

Generating and Using Alignment Hypotheses

Page 92: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:
Page 93: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Cerius2 Hypothesis and Receptor Models/April 1999 79

11 Introduction to Generating a Pharmacophoric Hypothesis

Catalyst’s ConFirm, Hypo, and HipHop and C2•Receptor are applications that provide tools to generate pharmacophoric hypotheses. The hypotheses are created by generating conforma-tions for a set of study molecules, then using the conformations to find and align chemically important functional groups common to the molecules in the study set. Each hypothesis can also incorpo-rate data on the biological activities of the study molecules.

This chapter provides an overview of the Cerius2 interfaces to the Catalyst applications ConFirm, and HipHop, and Hypo, and the activities that can be performed using them. For detailed informa-tion about using these interfaces, see Chapters 12 through 14. For detailed information about ConFirm and HipHop, see the online documents Catalyst ConFirm User’s Reference and the Catalyst HipHop User’s Reference, supplied on your MSI CD when you buy these products.

Using ConFirm and HipHop

The HYPOTHESIS MODELS card deck in Cerius2 provides menu cards that contain interface functions for running ConFirm, HipHop, and Hypo.

Before you begin To use ConFirm or HipHop to generate hypotheses, you must have one or more structures loaded into the Model Manager. Mol-ecule input files for this interface are in the standard .msi file for-mat. ConFirm typically handles molecules up to pentapeptide size. Output files are generated in Catalyst CPD format.

You can create or import structures for conformation generation using one of several methods:

Page 94: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

80 Cerius2 Hypothesis and Receptor Models/April 1999

11. Introduction to Generating a Pharmacophoric Hypothesis

♦ Build one or more structures using the Cerius2 3D Sketcher or a Cerius2 builder (see the Cerius2 Modeling Environment book or the Cerius2 Builders book)

♦ Use the Analog Builder to generate a series of analogous struc-tures (see the Cerius2 Builders book)

♦ Import previously built structures using the Load Model con-trol panel accessed from the File pulldown on the menu bar (see the Cerius2 Modeling Environment)

To access ConFirm or HipHop

Choose the HYPOTHESIS MODELS card deck from the deck selector. The deck appears:

The four activities that you can perform using the Cerius2 Con-Firm, HipHop, and Hypo interfaces and C2•Receptor are described briefly in the following paragraphs.

Generate conformations The Cerius2 interface to ConFirm is used to generate conforma-tions for a single molecule or a set of molecules. The number of conformations needed to produce a good representation of a com-pound’s conformational space depends on the molecule. Both con-formation-generating algorithms available in ConFirm (Best and Fast) are adjusted to produce a diverse set of conformations, avoiding repetitious groups of conformations all representing local minima. For detailed information on generating conforma-tions, see Chapter 12, Generating Conformations.

The conformations generated by ConFirm can be used as input into HipHop and Hypo to align common molecular features and generate pharmacophoric hypotheses. Currently the conforma-tions cannot be viewed in Cerius2.

Page 95: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Using ConFirm and HipHop

Cerius2 Hypothesis and Receptor Models/April 1999 81

Align common molecular features to generate a hypothesis

HipHop and Hypo use conformations generated in ConFirm to align chemically important functional groups common to the mol-ecules in the study set. A pharmacophoric hypothesis can then be generated from these aligned structures. For detailed information on this alignment activity, see Aligning common molecular features.

Incorporate biological activity data into a hypothesis

The Cerius2 interface to HipHop and Hypo is also used to incorpo-rate biological activity data into the hypothesis-generating pro-cess. Each hypothesis is tested by regression techniques to compare estimated activity with actual activity data. The software uses the data from these tests to select the hypotheses that do the best job predicting activity for the set of study molecules. This capability is provided by Catalyst/Hypo. For detailed information on this alignment activity, see Incorporating activity data into a hypothesis.

Generate receptor mod-els using HipHop- aligned structures

The MODEL RECEPTOR menu card is included in the HYPOTH-ESIS MODELS card deck so that you can use structures that have been aligned in HipHop to generate a receptor surface model. Since structures used in HipHop are aligned by common chemical features, the receptor surface model that is generated for them can be significantly different from a receptor surface model generated from template-aligned structures. For detailed information on per-forming this activity, please see Using aligned structures to generate receptor models.

Page 96: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

82 Cerius2 Hypothesis and Receptor Models/April 1999

11. Introduction to Generating a Pharmacophoric Hypothesis

Page 97: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Cerius2 Hypothesis and Receptor Models/April 1999 83

12 Generating Conformations

The Cerius2 interface to ConFirm provides a way for you to gener-ate conformations for a single molecule or a set of molecules. The number of conformations needed to produce a good representa-tion of a compound’s conformational space depends on the mole-cule. Both conformation-generating algorithms available in ConFirm (Best and Fast) are adjusted to produce a diverse set of conformations, avoiding repetitious groups of conformations all representing local minima. For information on conformations gen-erated using these algorithms, see Smellie, Kahn, and Teig.

You can use the conformations generated by ConFirm as input into HipHop to align common molecular features and generate phar-macophoric hypotheses.

This chapter describes how to use the Cerius2 ConFirm interface to generate conformations for a set of study molecules. These confor-mations can then can be used as input into HipHop to generate pharmacophoric hypotheses.

Output files for HipHop and Catalyst/Hypo are generated in Cat-alyst CPD format.

Before you begin You must have one or more structures loaded into the Model Man-ager. You can generate or import structures for conformation gen-eration through one of several methods:

♦ Build one or more structures using the Cerius2 3D Sketcher or a Cerius2 builder (see the Cerius2 Modeling Environment or the Cerius2 Builders)

♦ Use the Analog Builder to generate a series of analogous struc-tures (see the Cerius2 Builders)

♦ Import previously built structures using the Load Model con-trol panel accessed from the File pulldown on the menu bar (see the Cerius2 Modeling Environment)

Page 98: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

84 Cerius2 Hypothesis and Receptor Models/April 1999

12. Generating Conformations

Running ConFirm

To access ConFirm Display the CONFIRM menu card in the HYPOTHESIS MOD-ELS card deck and select Conformer Generation to display the Conformer Generation panel.

To generate conforma-tions

You can generate a set of conformations automatically for the cur-rent model in the Model Manager by accepting the default settings on the control panel and clicking Generate. ConFirm will generate 255 or fewer conformations and deposit them in files in your cur-rent directory using the name format compoundname.cpd where compoundname is the name of the input molecule.

To change default settings You can change a number of settings for the conformation genera-tion process by changing one or more entries and popup choices in the Conformer Generation control panel:

♦ Models used for conformer generation — Specify the models you want to use. The popup choices are Current, Selected and All. The default selection is Current.

♦ Conformer search method — Choose the method you want to use. The popup contains the following selections:

Fast — Takes a quick sampling of the conformational space of a molecule. It quickly produces a set of conformations that do not cover conformational space as completely as the Best method. This is the default selection

Best — Generates the most comprehensive set of conforma-tions. The Best method incorporates a technique in its pro-cess for promoting conformational variation. This method can be used to refine and expand a set of conformations found using the Fast method. It is the method you should use for generating conformations to be used in HipHop.

♦ Stereochemical treatment of asymmetric centers — Choose from the following popup options:

Absolute — Generates conformations that are based on the known absolute stereochemistry of each model. This choice can also be used for molecules with relative stereochemistry. ConFirm treats any relative stereocenter in a starting mole-cule as absolute. This is the default selection.

Page 99: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Running ConFirm

Cerius2 Hypothesis and Receptor Models/April 1999 85

Relative — Generates conformations that maintain the rela-tive stereochemistry of the starting molecule. If the molecule contains only relative stereocenters, ConFirm generates con-formations of enantiomers (that is, the starting molecule and its mirror image).

Undetermined — Attempts to generate all possible stereoi-somers and conformations for each molecule when no stere-ochemical information is available. Because the number of stereoisomers can exceed the maximum number of allow-able conformations, ConFirm may not generate a complete set of conformations for molecules with multiple stereo-centers. Therefore, use this choice judiciously.

♦ Energy range — Specifies the acceptable energy range within which conformations should be generated. The default value is 15 kcal.

♦ Maximum conformers — Specifies the maximum number of conformations you want to generate. Enter a number by typing in the data entry box or by clicking the up- and down-arrows associated with the box. The default number, 255, is the maxi-mum number of conformations for each model that can be han-dled by HipHop.

♦ Save results in — Enter the name of the file where you want results saved. If no file is specified, conformations are saved to the file compoundname.cpd where compoundname is the name of the molecule for which conformations have been generated. You can also specify the directory where you want the file stored. The default directory is the current directory (that is, ./).

♦ Ignore Existing Conformers — Indicates whether conforma-tions that were previously generated for the starting molecule should be saved or not. When the box is checked, all existing conformations are discarded. When the box is unchecked, new conformations are added to existing ones. The default condi-tion is checked.

Page 100: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

86 Cerius2 Hypothesis and Receptor Models/April 1999

12. Generating Conformations

Page 101: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Cerius2 Hypothesis and Receptor Models/April 1999 87

13 Generating and Using Alignment Hypotheses

HipHop and Hypo use conformations generated in ConFirm to align chemically important functional groups common to the mol-ecules in a study set. Biological activity data can be incorporated into these hypotheses so that the best hypotheses for predicting activity are generated and selected. Additionally, you can use structures that have been aligned in these programs to generate a receptor surface model in Cerius2•Receptor.

This chapter describes This chapter discusses the following tasks:

Aligning common molecular features

Setting preferences using the Preferences control panel

Incorporating activity data into a hypothesis

Using aligned structures to generate receptor models

Before you begin Before you can generate a pharmacophoric hypothesis, you must have conformation files generated by ConFirm or by Catalyst in Catalyst CPD format. Each input file must contain a conforma-tional model that spans the energetically accessible conforma-tional space. The maximum number of molecules allowed in the input set is 32. To avoid consuming large amounts of time and memory, use molecules with fewer than 30 features per conforma-tion.

To access HipHop Display the HIPHOP menu card in the HYPOTHESIS MODELS card deck and select Generate Alignments.

The Generate Hypothesis control panel is displayed.

Page 102: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

88 Cerius2 Hypothesis and Receptor Models/April 1999

13. Generating and Using Alignment Hypotheses

Aligning common molecular features

If you want to use the default settings for the alignment process, set the Alignments/Hypotheses popup to Alignments and click Generate. HipHop aligns common chemical features, generating 10 alignments for the complete set of molecules listed in the Model Manager. The the alignment information is output to MDL struc-ture-data (SD) files stored in your current directory.

You can change the default settings of the Generate Hypotheses control panel as described in the following paragraphs:

♦ Alignment — You can focus on generating alignment files by selecting Alignments or on generating hypotheses by selecting Hypotheses from the popup. If you select Hypotheses, the Cre-ate…Output Alignment Files field is not displayed.

♦ Models — Generate alignments for all or selected models listed in the Model Manager, as listed in the popup.

♦ Generate Alignments — Enter a number up to 999 in the data entry field to set the number of alignments you want HipHop to generate.

♦ Output Alignment Files — Choose the type of output file you want for generated alignments. The choices listed in the popup are the MDL structure-data (SD) or MOL2 files. HipHop pro-duces files containing the coordinates of concatenated com-pounds according to the file type you specify. These files can be used as input for other modeling modules.

♦ Results directory — Enter the name of the directory where you want to save the alignment files. If you do not put a name in the data entry box, the files are saved to your current directory (/.).

♦ Generate input files — If you want template input files cre-ated, check this box. Template files are the input files used in the alignment process. If you have used ConFirm or Catalyst to generate conformations, the template files are generated auto-matically. Template files can be edited.

Page 103: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Setting preferences using the Preferences control panel

Cerius2 Hypothesis and Receptor Models/April 1999 89

Setting preferences using the Preferences control panel

This section describes the variety of parameters for which you can set values in HipHop and Hypo. These parameters regulate the program’s execution and its output. The software tries to satisfy all of the constraints you impose when you set preferences. Each pref-erence contributes to the results. If it is not possible for HipHop or Hypo to meet all of the criteria you specify, the software produces no generated hypotheses and no aligned molecules. If your speci-fications conflict, error messages are issued and processing halts.

To set preferences Display the HIPHOP menu card in the HYPOTHESIS MODELS card deck and select Preferences to display The HipHop Prefer-ences control panel.

Select the options and adjust parameter values on this control panel to specify the constraints you want to use in generating alignments and hypotheses.

The options included on this control panel are briefly described in the following paragraphs.

Total Hypothesis Features Specifies minimum and maximum number of features allowed per generated hypothesis. The default value for the minimum number of features is one. The default value for the maximum number of features is 10.

Pharmacophore Size Specifies the minimum number of points in a pharmacophore. The default value is 3. Note that some chem-ical features contain more than one point, for example, a hydrogen bond.

Candidate Pharmacophore Size Specifies the minimum number of points in a pharmacophore to be considered as a candidate in the construction of a larger hypothesis. The default value is 3. Note that some chemical features (for example, a hydrogen bond) con-tain more than one point.

Unmapped Molecules Specifies the maximum number of mole-cules that miss more than one feature of a hypothesis. Such mole-cules are termed complete misses. The default value is 0.

Page 104: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

90 Cerius2 Hypothesis and Receptor Models/April 1999

13. Generating and Using Alignment Hypotheses

Partially Mapped Molecules Specifies the maximum number of partially mapped (one feature missing) and unmapped (two or more missing features) molecules in the input set of molecules. The default value is 0.

Molecules Missing Some Feature Specifies the maximum num-ber of molecules that can miss the same feature of a hypothesis. The default value is 0. A small value tends to distribute missed fea-tures evenly throughout the input set of molecules.

Superposition Check Specifies the RMS fit of molecules to gen-erated hypotheses. This parameter is more precise than a distance check. The default condition for this checkbox is checked.

RMS Error Sets the allowed deviation in RMS fit of mapped features in each mapped molecule. The default value is 1.00. Reducing this value from its default tightens the fit.

Location Constraint Tolerance Scaling Specifies the factor applied to each location tolerance, constraining each of the fea-tures in a hypothesis. The default value is 1.00. The unscaled toler-ances are 1.6 Å for all ligand features and 2.2 Å for all projected points.

Inter-feature Distance Specifies the minimum distance between feature locations in input molecules. Only configurations of fea-tures in the molecules with at least this distance between the actual feature locations are considered when identifying common fea-tures for a hypotheses. The default value is 2.97 Å. Small feature-poor molecules may require a value about 1.5 Å.

Maximum Memory (fast algorithm) Specifies the maximum memory to be used by the Fast alignment algorithm. The default value is 60 MB. Total accessible memory is twice the set value.

Ideal Hydrogen Bond Geometries Only Specifies that only pro-jected points at positions that reflect ideal hydrogen bond geome-tries be considered. HipHop considers only staggered hydrogen donor/acceptor interactions ions for a hydroxyl and two lone-pair positions for a carbonyl oxygen. The default condition for this box is checked.

Generate Hypothesis (.chm) Files Generates a sequential hypothesis file for each alignment hypothesis with the name HypoOutput.n.chm where n is the sequential number of the file.

Page 105: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Incorporating activity data into a hypothesis

Cerius2 Hypothesis and Receptor Models/April 1999 91

These files can be used by Catalyst/Info. The default condition for this checkbox is checked.

Incorporating activity data into a hypothesis

The Cerius2 interface to HipHop and Hypo is also used to incorpo-rate biological activity data into the hypothesis-generating pro-cess. Each hypothesis is tested by regression techniques to compare estimated activity with actual activity data. The software uses the data from these tests to select the hypotheses that do the best job predicting activity for the set of study molecules. This capability is provided by Catalyst/Hypo.

To generate hypotheses for predicting activity

Select Generate Alignments from the HIPHOP card to open the Generate Hypothesis control panel. Select Hypotheses from the first popup on the control panel, then click Generate.

The software identifies common features among the study mole-cules, aligns them into a set of alignment hypotheses, compares estimated activity (from the hypotheses) with actual activity, then reports the lowest cost/best fit hypotheses with respect to predica-tion of activity for the set of study molecules.

Using aligned structures to generate receptor models

The MODEL RECEPTOR menu card is included in the HYPOTH-ESIS MODELS card deck so that you can use structures that have been aligned in HipHop to generate a receptor surface model. Since structures used in HipHop are aligned by common chemical features, the receptor surface model that is generated for them can contain significant data that differs from data for a receptor surface model generated from template-aligned structures.

To display HipHop aligned structures

You must import the .SD or .mol file that has been generated by HipHop or Hypo into the Model Manager using the following pro-cedure:

Page 106: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

92 Cerius2 Hypothesis and Receptor Models/April 1999

13. Generating and Using Alignment Hypotheses

1. Select Load Models from the Visualizer File menu. The Load Models control panel is displayed.

2. Select MACCS from the File Format popup.

3. Change the extension listed in the file data entry box from .m[do]l to .sd or .mol to correspond to the type of files you want to import. A list of files with the appropriate extension appears in the file browser.

4. Highlight the files you want to load and click the Load button. The files are loaded into the Model Manager.

Page 107: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Part 4

Database Query

Database Query

Page 108: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:
Page 109: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Cerius2 Hypothesis and Receptor Models/April 1999 95

14 Performing a Database Query

C2•DBAccess enables you to construct molecular queries to search specified databases and to retrieve, examine, and save structures that fit your criteria. Cerius2•DBAccess is an interface to various third-party databases and database search engines. It allows you to search your existing in-house databases from within Cerius2 and retrieve the resulting hit list directly into Cerius2. Therefore, it allows you to maintain your corporate and/or project databases in your choice of format and with your choice of chemical informa-tion management system (in this release, these choices are MDL's ISIS™, and MSI's Catalyst and catShape systems) and provides you the ability to retrieve information from these databases directly into Cerius2.

This chapter explains ♦ Available database searches

♦ Preparing to work with Database Query

♦ Methodology

♦ Constructing a query

♦ Submitting a query

♦ Retrieving and browsing database hits

DBAccess benefits The most apparent benefits to you:

1. Cerius2•DBAccess adds value to your existing investment in activity prediction software tools since it now has a more seam-less integration with ISIS and Catalyst.

2. You are not required to convert your corporate database into another format since you can access the database in its native format with Cerius2•DBAccess.

3. With Cerius2•DBAccess, it is now easy for you to incorporate corporate structural information into your modeling and anal-ysis environment since DBAccess integrates your corporate

Page 110: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

96 Cerius2 Hypothesis and Receptor Models/April 1999

14. Performing a Database Query

structural databases with the Cerius2 visualization and analysis tools.

Cerius2•DBAccess does not itself perform any database searches; instead, it uses the searching capabilities of Catalyst and ISIS to retrieve compounds.

All types of structure-based searches are available with Cerius2•DBAccess that includes 2D substructure and 3D pharma-cophoric searches (FAST and BEST algorithms), and 3D shape searches with Catalyst, and 2D substructure and similarity, and 3D pharmacophoric searches (rigid and flexible search algorithms) with ISIS.

The ISIS searches are performed through the ISIS/Host API. Once a search is complete, the hit list is written out as an SD file and automatically brought into the Hit List Browser.

Available database searches

ISIS ISIS has the ability to perform searches on 2D and 3D structural databases based on a structural query. The structural query can be developed to mimic a Catalyst Hypothesis, or it can be based on a pharmacophore model generated with C2•ASearch. Catalyst hypotheses can also be exported as ISIS search queries.

With a query in hand, ISIS performs searches on databases and retrieves only those compounds that satisfy the geometric and structural constraints. If the constraints are 3D, then the search can optionally be performed taking the conformational flexibility of the molecules into consideration.

The hit list can then be exposed to various analysis capabilities within Cerius2. For example, you can import a hit list (SD file) into a Study Table and calculate various descriptor properties; then the hit list can be clustered with respect to these properties.

Catalyst/SHAPE Catalyst/SHAPE (catShape) performs 3D shape searching of a flexible 3D database to retrieve compounds that have similar shapes. It also calculates and reports multi-conformational shape descriptors for use in combinatorial library design, analysis, and comparison.

Page 111: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Preparing to work with Database Query

Cerius2 Hypothesis and Receptor Models/April 1999 97

Catalyst Catalyst is a Molecular Simulations product that performs hypoth-esis generation, database searches, and activity prediction. The database search function, Catalyst/Info, is accessed through the Cerius2 interface and allows 2D or 3D queries to be generated for searching any Catalyst database. Databases available from MSI include the MDL’s Available Chemical Database (ACD), Der-went’s World Drug Index (WDI), BioByteMaster File, National Cancer Institute, and Maybridge databases.

Preparing to work with Database Query

Before you begin Before you start Database Query, you must:

1. Have licensed and installed at least one of the following: ISIS 2.0 or Catalyst 2.2 or higher. You also must have the appropriate Cerius2 modules.

Please note that both Catalyst and ISIS/Host must run on an SGI platform either locally or networked to the machine where you are running Cerius2. For ISIS searches, you also need to have a copy of your ISIS Hview files in your home directory.

Creation of the databases in query-accessible form should be completed as part of installation and administration of each database. Additionally, your workstation environment must be set up for access to these databases. If you cannot access the databases you want, see your database system administrator.

2. Be familiar with the Cerius2 user interface and tools. For an introduction to the user interface and the Visualizer tools, see the book, Cerius2 Modeling Environment, and the Cerius2 Tutori-als— Basics.

3. Displays the current model, in the model window, the target structure for constructing your query.

Methodology

To open C2•DBAccess, select the DATABASES card deck from the main Cerius2 menu. It displays three cards: MDL INTER-

Page 112: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

98 Cerius2 Hypothesis and Receptor Models/April 1999

14. Performing a Database Query

FACE, CATALYST INTERFACE, and CATSHAPE INTERFACE. Each interface has at least the following three options: Submit Query, Browse Hits, and Reset. The CATSHAPE INTERFACE card has two additional options: Show Study Table and Receptor Model. The Receptor Model option provides a Generate Model and Evaluate Molecules pulldown menu. The Catalyst and ISIS interfaces have an Edit Query option that allows you to develop a search query from within Cerius2.

The Submit Query panel allows you to select the database that you want to search by selecting *.bdb files for Catalyst searches, and the *.hvd files for ISIS searches.

Other options in Submit Query include an option to save files in a specific SD file. The Query source can be defined as:

♦ for ISIS: Current Model, MDL Clipboard, and Mol File

♦ for Catalyst: Current Model or Mol File

♦ for CatShape: Current and Selected Model

Each of the three interfaces allows you to limit the number of hits to a specific maximum. As for the search options, the Catalyst interface provides options for Fastest or Best fit; ISIS interface has options for Substructure, Similarity and 3D searches. The similar-ity search requires a similarity value to be entered (between 0 – 100), and the 3D search requires you to select from Rigid or Flexi-ble search options. The CatShape interface has options to Gener-ate the shape descriptors for the database (this option must be run once), Statistics which provides statistics on shape descriptors for the database, Descriptors, which writes out the descriptor values into a user specified file and display them in a table, and finally the Search option performs the Shape-based search on the specified database.

Both the Catalyst and CatShape interface also allows you to select a host machine on your network on which to perform the search. Finally, the CatShape interface allows you to set the Tolerances for your Shape search.

You can develop your 3D search queries using the Edit Query panel for both Catalyst and ISIS searches. This interface allows you to develop ISIS queries that include distances between pairs of atoms and ring centroids. In addition, all of the ISIS options for bond and atom definitions are available.

Page 113: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Methodology

Cerius2 Hypothesis and Receptor Models/April 1999 99

Bond types:

Bond topology:

Atom types (include or exclude):

The queries are generated in ISIS molfile format and both Catalyst and ISIS searches can be performed using the same queries.

The Hit Browser is activated automatically upon completion of a Catalyst, CatShape, or ISIS search and is loaded with the structures in the search’s output SD file. The browser allows you to rapidly browse compounds. A table lists the structures, the compound names, and the number of atoms and bonds. You can perform standard table operations with this hit list table such as sorting, selecting, exporting, etc. Finally, you can select the Display In Model Window option to display the hits in the Model Window using your choice of display options. Browsing in the Hit List Browser automatically updates the image in the Model Window.

To start Database Query 1. If the DRUG DISCOVERY card deck is not already displayed, select DRUG DISCOVERY from the Deck Selector popup.

2. Choose the QUERY DATABASE menu card from the deck. The card is displayed.

single double triple

up down up/down

cis/trans aromatic any

single/double double/aromatic single/aromatic

ring chain

none any element any element except H

any element except C & H

any halogen common organic elements

user specified list of elements

Page 114: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

100 Cerius2 Hypothesis and Receptor Models/April 1999

14. Performing a Database Query

or

Select DATABASES from the Deck Selector popup. The CAT-ALYST, CATSHAPE, and MDL (ISIS) INTERFACE menu cards appear:

Constructing a query

Queries are constructed using the Query Editor. Four types of que-ries can be defined: atom, bond, distance, and centroid. Defining each query type involves specifying parameters specific to the query. The appearance of the Query Editor control panel changes depending on the query type you select.

The simplest type of query is simply a molecular fragment as rep-resented by a Cerius2 model. The current model, which you have loaded into Cerius2 or created via the C2•Builder, is the usual starting point for a query.

Page 115: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Constructing a query

Cerius2 Hypothesis and Receptor Models/April 1999 101

When the current model is used as the query, the default behavior is to build the query using the exact atoms, bonds, etc., as they cur-rently exist in the model. If a 3D query is submitted, the relative coordinates of the atoms in your model are used.

More commonly, you will want to edit the query to make it more general or more specific to your search. For example, you may wish to specify that an atom currently represented in the model by a carbon atom could also be a nitrogen or sulfur atom for the pur-pose of the search, or that it could be any atom except carbon. You can specify any combination of specified atom, bond, and centroid types for a Catalyst or ISIS database query. For 3D queries, you can specify desired intra-atomic distances.

When you submit a query via the Submit Query command, the query you have defined on the current model is first written to an MDL-format MOL file which is then used by Catalyst or ISIS for the search. You may also submit a query using a previously cre-ated query (MOL) directly. You can also export Catalyst hypothe-ses as ISIS search queries.

The following four sections describe how to edit each of the four types of query definitions: atom, bond, centroid, and distance.

Constructing an atom query

An atom query can be constructed for any atom in your target structure. The atom you choose in the target is identified by ele-ment and number.

To construct an atom query

1. Click the Atom icon in the Query Editor control panel.

2. Pick an atom in your target model. The subpanel displays the pertinent information for the atom you selected.

To specify the same query for multiple atoms, hold down the <Shift> key while you pick additional atoms. Note that only the query for the first atom you selected is displayed.

3. Specify the criteria for the query either by selecting a pre-defined set of elements (for example, Any Halogen) from the popup, or by using the entry field to type the elements you want to specify. If you type the data, your action overrides the

Page 116: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

102 Cerius2 Hypothesis and Receptor Models/April 1999

14. Performing a Database Query

predefined information and the popup displays the term User Specified.

4. If the element list is to be used to exclude structures, check the Exclude checkbox.

Note that if hydrogens are included in the structures, they are included in the query.

5. If you want to remove data from the data entry box, click Delete This Feature. The current query information is cleared from the panel. If you want to remove all atom queries, click Delete All Atom Queries.

6. To save the query, click Define. You can now select another atom and repeat the process of constructing another atom query.

Constructing a bond query

A bond query can be constructed for any bond in your target struc-ture. The bond you choose in the target is identified by the atoms in the bond. This means you cannot click a bond to select it.

To construct a bond query 1. Click the Bond icon in the Query Editor control panel.

2. Use the mouse to pick two atoms in your target model to define a bond. A rubber band indicator stretches from the first atom to the second. When you pick the second atom, the bond is high-lighted. A Query Editor panel (as illustrated) associated with this bond appears.

Page 117: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Constructing a query

Cerius2 Hypothesis and Receptor Models/April 1999 103

3. The Bond Type popup displays only the currently selected bond query type, for example, Single. Use the popup to select the bond type you wish to use in the search, for example, Dou-ble Aromatic. You may <Shift>-select further atoms connected to this bond to specify the same type over multiple bonds.

An alternative way to perform this selection, which is very use-ful when, for example, you wish to define all the bonds in part of the model to be aromatic for the query, is to first select Bond from the Feature Type popup. With standard picking mode selected (the arrow icon highlighted), use standard selection techniques (for example, rubber-band selection) to select the atoms defining the bonds you wish to modify for the search. Click the Bond icon— note that the Bond Type displayed is the last one used (since the bond type cannot be generally dis-played when multiple bonds are selected). However, if you then select a bond type, for example, Double Aromatic, and

Page 118: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

104 Cerius2 Hypothesis and Receptor Models/April 1999

14. Performing a Database Query

then click the Define button, all these bonds will become of the specified query type.

Note

You can use Bond Topology to further specify your bond query. This further qualifies which bonds will match your bond query during a search and allows you to specify whether or not the bonds are allowed to be in ring systems.

4. When you are satisfied with your selections, click Define to define the query.

5. You can remove the current query by clicking Delete This Fea-ture. The current query information is cleared from the panel. If you want to remove all bond queries, click Delete All Bond Queries.

Constructing a centroid query

With respect to database queries, a centroid is defined as the geo-metric average of a set of atoms that are used in a substructure search. Hence the centroid itself does not represent a query but rather a property of a query. For example, you may wish to find all models from a 3D database search with a certain distance between a given metal atom and the center (centroid) of a phenyl group. Your query would be composed of a six-membered aromatic car-bon ring, a metal atom, the centroid of the ring, and a distance query defined between the metal atom and the centroid.

To construct a centroid query:

1. Click the Centroid icon in the Query Editor panel.

2. Pick an atom you want included in the centroid definition. The Query Editor panel will change to display information the number of atoms used in the centroid’s current definition. To add more atoms to the definition simply use the mouse to pick the desired atoms in the Cerius2 Models window.

This methodology also works for the other query features; Atom, Distance, and Centroid.

Page 119: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Constructing a query

Cerius2 Hypothesis and Receptor Models/April 1999 105

3. When you have specified all the atoms in your centroid defini-tion click Define. The centroid is now defined and will appear as a “+” at the geometric center of the atoms you specified. The “+” centroid atom may later be selected for the purpose of deleting or using this centroid in a Distance query.

4. To delete the currently selected centroid, click Delete This Fea-ture. Clicking Clear Selection deletes all the atoms in the cur-rent centroid definition so that you can edit an existing centroid.

5. Click Delete All Centroid Features to remove all centroid fea-tures from the current query.

Page 120: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

106 Cerius2 Hypothesis and Receptor Models/April 1999

14. Performing a Database Query

Constructing a distance query

A distance query feature acts as a query filter on the set of models that would normally be returned for your model query. It does this by specifying a distance inclusion range between a pair of atoms in your query. Hence a distance query is only employed during 3D database queries. Multiple distance queries may be defined for your model query (Atoms and Bonds), in which case all the dis-tances must be satisfied when a database model is matched to your query.

To construct a distance query:

1. Click the Distance icon in the Query Editor panel.

2. Pick the first atom of your distance query specification. A “rub-ber” line will now trail your cursor so that you know that you

Page 121: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Submitting a query

Cerius2 Hypothesis and Receptor Models/April 1999 107

need to pick another atom. Pick the second atom of your speci-fication. The Query Editor panel will now show distance fea-ture information, including the distance range for your current selection. If this has not already been defined this will show val-ues based on the distance between these atoms (± 0.5 Å).

3. Type in the distance range minimum and maximum values you wish to use with this distance feature. These represent the min-imum and maximum distances allowed between these atoms in any model retrieved from a 3D database query. Click the Define button to define this distance feature query.

4. To delete an individual Distance Feature query use the Delete This Feature button after selecting the distance query you wish to delete. To delete all currently defined distance queries, use the Delete All Distance Features button.

Deleting all query features

To reset/delete all query features defined on the current model, use the Clear All Features button on the Query Editor panel. It is a good idea to do this whenever you wish to start defining a new query. This will delete all Distance and Centroid features and reset all Atom and Bond features to their defaults (that is, the atoms/bonds of the current model as displayed).

Submitting a query

Cerius2 does not act on query information until a query is submit-ted. Queries are submitted using the Submit Query control panel, which displays different subpanels depending on the search facil-ity and database type you want to use. The default selection for database is None.

Accessing the Submit Query control panel

To access the Submit Query control panel, select Submit Query from the QUERY DATABASE menu card.

Page 122: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

108 Cerius2 Hypothesis and Receptor Models/April 1999

14. Performing a Database Query

Select the database type you want and continue as described in the following database-specific procedures.

Submitting a Catalyst query

To submit a Catalyst query:

1. Select Catalyst from the popup in the Database Type field or select Submit Query from the CATALYST INTERFACE menu card to open the Submit Query control panel.

2. Select the database (.bdb) file that you want to search from the Database to Search browser on the left of the control panel, then click Set. The name of the database appears below the data entry field.

3. Name the file to be used to save results either by selecting it from the Save Results In browser on the right of the control panel or by entering a name in the data entry field; click Set. The name of the file appears below the data entry field.

Page 123: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Submitting a query

Cerius2 Hypothesis and Receptor Models/April 1999 109

4. Use the Use... Fit popup to choose the way that you want Cat-alyst to perform flexible fitting. FASTEST is the default selec-tion but you may also choose BEST.

5. If you want to put an upper limit on the number of structures that can be retrieved in the search, enter the number in the Limit Hits To data entry box. The default value is 999. Check the Limit Hits check box. The default condition is unchecked.

6. Select the model you wish to use as the query using the Use Query popup. You may either specify the current model or a previously defined query stored in a .mol or .chm file.

7. Select the host on which the search is to run by using the Search Database on Host popup. The default host is installation-spe-cific.

8. Click Submit Query to start the search. Information during the search is reported to the text window. When the search is com-plete, the number of hits is also reported in the text window.

Submitting a SHAPE query

To submit a SHAPE query:

1. Select SHAPE from the popup in the Database Type field or select Submit Query from the CATSHAPE INTERFACE menu card to open the Submit Query control panel.

2. Select the database (.bdb) file that you wish to search from the Database to Search browser on the left of the control panel, then click Set. The name of the database appears below the data entry field.

3. Choose a name for the output of your database search (.sd) using the Save Results In browser. The name of the output file appears below the data entry field. Note: if a file by this name already exists this will be overwritten by the search.

4. Choose the CatShape search type by clicking the CatShape Run popup. There are four search methods available for Cat-Shape:

a. Generate: Searching the database in this mode will not actu-ally submit a query to search. When SEARCH is selected,

Page 124: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

110 Cerius2 Hypothesis and Receptor Models/April 1999

14. Performing a Database Query

CatShape will produce a shape index database from the database you selected, which is required for subsequent queries on this database. A file db_name.bdb.id.4bdb is pro-duced in your local directory. If this file does not already exist, other CatShape Run type searches will automatically generate this file at the start of the search.

b. Statistics: Searching the database in this mode will not actu-ally submit a query to search. When SEARCH is selected, CatShape will produce statistics to the text output window from the shape index database.

c. Descriptors: Searching the database in this mode will not actually submit a query to search. When SEARCH is selected, CatShape will produce a table of statistics (descrip-tors) for each molecule in the database from the shape index database. These are also recorded to an output file that you must specify in the Descriptors Output File parameter in the Submit Query panel. This file can also be loaded into the QSAR+ module. The descriptors produced are based on properties, such as moments of inertia, averaged over all conformations for a molecule in the database.

d. Search: This is the value required to actually run a CatShape search. The query in this case will be generalized shape data calculated from your query model, such as molecular vol-ume, principle axes of inertia, etc. Which model to use as the query is set by the Construct Query From popup, where you specify either the current or selected model.

Clicking the Search Tolerances button brings up the Cata-lyst Shape Search Preferences panel where you can adjust the limits to the accuracy of your shape query. For example, Max and Min values for percentage tolerance on principal axes specify the percentage difference that is allowed between a database conformer’s axes and the query mole-cule axes for that molecule to be considered a possible match.

You can also set a limit on the maximum number of hits returned by a search by checking the Limit Hits check but-ton and specifying the maximum number of hits allowed.

Page 125: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Submitting a query

Cerius2 Hypothesis and Receptor Models/April 1999 111

5. Choose a Host machine to run your search on. Usually you would only need to change the value from localhost if your Catalyst server is on another machine.

6. When you have all the search parameters set, click the SEARCH button to initiate the database query search. When a query search completes, the results will be loaded into the Browse Hits browser.

Submitting an ISIS query

To submit an ISIS query:

1. Select ISIS from the popup in the Database Type field or Sub-mit Query from the MDL INTERFACE menu card to open the Submit Query control panel.

2. Select the database (.hvd) file that you wish to search from the Database to Search browser on the left of the control panel, then click Set. The name of the database appears below the data entry field. Note: this may take a couple of minutes because the interface looks at the database to find what properties (data-base fields) exist.

3. Select the properties you wish to retrieve from the database for your query matches. To do this click the Retrieve Fields button. This brings up the Retrieve Fields control panel (yours may look different from experiment to experiment, depending on the database you specify).

To select fields, you simply click the desired field’s name in the list. Any field name that is highlighted (inverse text colors) will be retrieved when the query is issued. The Select All and Unse-lect All buttons on this panel allow you to select all or de-select all fields for retrieval.

4. Choose a name for the output of your database search (.sd) using the Save Results In browser. The name of the output file appears below the data entry field.

Note: If a file by this name already exists it will be overwritten by the search.

5. Choose the source of the query for the search using the Use Query popup. The default value, Current Model, means that

Page 126: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

112 Cerius2 Hypothesis and Receptor Models/April 1999

14. Performing a Database Query

the search query will be formed from the current model with all the query features you added using the Edit Query command. A value of MDL Clipboard specifies that the query be taken from the clipboard of an MDL query editor that is running in the background. This is only recommended for experienced ISIS users. A value of MOL File will bring up a browser from which you can specify the name of an existing MOL file to use as your query source.

6. If you want to put an upper limit on the number of structures that can be retrieved by the search, check the Limit Hits check box and enter the maximum number of hits you wish to be retrieved. If you just want to see how many structures would be retrieved from your search, check the Count only check box.

7. Choose which type of ISIS database query you wish to perform from the Type popup. Substructure and Similarity type searches perform 2D searches. Choose 3D search if your query contains distance features or if you want to retrieve 3D data fields from the database for your query.

Substructure search is an exact search, meaning that the query must be exactly present in the hit. Similarity search is a fuzzy search and the hits are retrieved based on a score of how close they are topologically to the query. The Value parameter for Similarity search represents the threshold value for including a match in the search output. For example, if your query is pipe-ridine, the substructure search will retrieve any compound that contains a piperidine; however, similarity search will also retrieve pyridines, etc. Similarity search retrieves the hits based on a Tanimoto calculation using topological keys (fingerprints).

8. When you have all the search parameters set, click the SEARCH button to initiate the database query search. When a query search completes, the results will be loaded into the Browse Hits browser.

Defining a Database Domain from Prior Query Hit List

ISIS database searches produced by a Cerius2 query automatically produce a hitlist (.hlist) file in addition to the MACCS SD file. The hitlist file contains the corresponding models and selected data-base fields data. A hitlist file contains only the references to hits found in a prior query and is specific to the hview (.hvd) file

Page 127: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Submitting a query

Cerius2 Hypothesis and Receptor Models/April 1999 113

employed. A hitlist file always contains all the hits that matched a query regardless of the maximum number of hits that were speci-fied for retrieval (which governs how many models are retrieved and loaded into the hitlist browser).

There are three primary uses for hitlist files:

♦ Retrieving structures and data for the hits of a prior query.

You may wish to retrieve more or less models than originally specified for viewing in Hit List Browser and/or retrieve a dif-ferent set of properties (field data) without having to repeat the same database query.

♦ Performing a subset (domain) search.

You may wish to refine the results of your first query using a second query.

♦ Defining a database domain using combined hitlist files.

You may wish to retrieve models which were, for example, hits common to two or more independent database queries.

To define a Database Domain for a query:

1. Ensure that the hview file selected from the Database to Search browser is one previously employed to produce a hitlist (that is, a query search performed using Cerius2 version 4.0). If not, you must redo your original query to produce a hitlist.

2. Click the Define Domain button to bring up the Define Data-base Subset panel. This panel contains a list of hitlists you have referenced during the current session of Cerius2. To add to this list, click the Add Hit List button to open the Add Prior Hit Lists panel. This panel is a file browser where you can locate a previously generated hitlist.

A hitlist file name will have the form <hview>@<date>-<#hits>.hlist, unless it was generated using a name you speci-fied in the Save Hit list In value on the Submit Query panel.

3. After adding hitlists to the Define Database Subset listbox you should click the prior hitlists you want to use. The Hit list com-bination logic value gives you three ways to define the search domain from more than one hit list:

UNION means that hits in all the selected hitlists are used.

Page 128: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

114 Cerius2 Hypothesis and Receptor Models/April 1999

14. Performing a Database Query

INTERSECTION means that only hits common to all the selected hitlists define the database domain.

SUBTRACT means that only hits in the first hitlist selected, which are not in any of the other selected hitlists, define the database domain.

4. With the database domain defined using prior hitlists, you can now perform the query as if you were querying the whole data-base. You may also choose to not perform a query and instead retrieve only the models and database field data from this domain. To do this set the Query: selector on the Submit Query panel to Retrieve Domain.

Note

Database queries in Cerius2, by default, output both a hitlist and an SD file containing retrieved model data. Generating the SD file can take a considerable time unless you limit the number of results. To generate only a hitlist from a query, you may set the out-put file name for Save Results In to “–”. If you do not wish to gen-erate a hitlist because, for example, you are only retrieving data from a prior hitlist, uncheck the Save Hit list In option.

Retrieving and browsing database hits

When a search is complete, you can retrieve and examine the hits (that is, structures) that satisfy your criteria by using the Hit Browser control panel.

1. The database query will fail unless all of the prior hitlists selected originated from queries using the same (or equivalent) hview (.hvd) file.2. The “first” selected hitlist when using the SUBTRACT option refers to the top-most selection. You can change the order of the hitlists in the Define Database Subset panel by adding and removing hitlists using the Add Hit List and Remove Selected buttons.3. You specify a database domain search by selecting one or more hitlists in the Define Database Subset panel. To revert back to querying the entire database, simply deselect all hitlists. You may use the Unselect all button to do this.

Page 129: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Retrieving and browsing database hits

Cerius2 Hypothesis and Receptor Models/April 1999 115

Accessing the Hit Browser control panel

To access the Hit Browser control panel, select Browse Hits from the QUERY DATABASE menu card or from the MDL (ISIS), CATSHAPE, or CATALYST INTERFACE menu cards. This opens the Hit Browser control panel.

The Hit Browser consists of a table to display the list of hits and a pictograph area to display a stick model of the current hit. The name of the current structure is listed as a label in the pictograph area. In addition, tools are supplied to manipulate the hit list.

If hits are found, they are sorted according to the classification scheme you have specified. Each row of the hit list includes the fol-lowing information:

♦ Hit name — The name of the molecule that has been identified as a hit.

♦ Atoms — The number of atoms in the structure.

♦ Bonds — The number of bonds in the structure.

In addition to the stick model in the pictograph area, the Hit Browser creates a model of the currently displayed hit. The model is listed in the Model Manager and behaves as a standard model.

If your search generates no hits, the Hit Browser displays no picto-graph and no listing. The message No Hits Found is displayed in the middle of the pictograph area.

Manipulating a hit list

If you want to examine structures in the current hit list or a hit list from a previous search, use the Hit Browser control panel to com-plete the following procedures:

To browse the hit list Use the buttons above the pictograph area to move sequentially through the hit list or to go to the first or last structure on the list. When you click a button, the structure that is displayed in the pic-tograph area depends on the button you have clicked as indicated in the illustration:

Page 130: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

116 Cerius2 Hypothesis and Receptor Models/April 1999

14. Performing a Database Query

The current structure number is displayed in the data box in the middle of the scroll buttons field.

The Total Hits field above the pictograph table tallies the total number of structures retrieved from a search.

To display a specific struc-ture

Highlight a row in the hit list table, enter a structure number in the data entry box above the pictograph, or scroll through the list using the tools above the pictograph until you reach the desired structure number. The structure you have specified is displayed as a pictograph.

To change structures listed in the table

Make a selection from the View popup. You can choose to list All Hits or only leaders (that is, structures with the highest score in each group). Leaders is the default selection.

To display a structure in the Models window

If you want the structure currently selected and displayed in the pictograph area to also be displayed in the Models window, check the Display in Model Window check box.

To display a different hit list 1. Click Select Hit List…. The Select Hit List control panel is dis-played:

First hit

One lower than current hit

Last hit

One higher than current hit

Current structure number

Page 131: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Retrieving and browsing database hits

Cerius2 Hypothesis and Receptor Models/April 1999 117

2. Specify the type of list to load from the Hit Results popup. The options are Hitlist (created from a Cerius2 session) and MDL (MACCS SD file created by Catalyst, CatShape, or ISIS).

a. If you select Hitlist, highlight the list you want to use in the browser and click View. The list is loaded into the Hit Browser control panel.

b. If you select MDL, the Select Hit List control panel changes slightly to display a standard Cerius2 file browser. Locate the file you want to use and click Load. The MDL file is loaded in the Hit Browser control panel.

To delete hits Click Delete All Hits to delete the hits from the current hit list.

In addition to the tools described in the preceding paragraphs, you can use the standard buttons in the table tool bar of the Hit Browser control panel to manipulate the data in the hit list table. For more information on these tools, see the discussion of tables in the Cerius2 Modeling Environment.

Page 132: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

118 Cerius2 Hypothesis and Receptor Models/April 1999

14. Performing a Database Query

Page 133: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Retrieving and browsing database hits

Cerius2 Hypothesis and Receptor Models/April 1999 119

Page 134: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

120 Cerius2 Hypothesis and Receptor Models/April 1999

14. Performing a Database Query

Page 135: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Cerius2 Hypothesis and Receptor Models/April 1999 121

A References

Barnum, D., Greene, J., Smellie, A., and Sprague, P., “Identification of Common Functional Configurations Among Molecules,” J. Chem. Inf. Comput. Sci. (in press)

Good, A. C.; Hodgkin, E. E.; and Richards, W. G. “Utilization of Gaussian Function for the Rapid Evaluation of Molecular Simi-larity”, J. Chem. Inf. Comput. Sci., 32, 188-191 (1992).

Good, A. C. “The Calculation of Molecular Similarity: Alternative Formu-las, Data Manipulation, and Graphical Display”, J. Mol. Graphics, 10, 144-151 (1992).

Hahn, M., “Receptor Surface Models. 1. Definition and Construc-tion,” J. Med. Chem., 38 2091 (1995a)

Hahn, M., and Rogers, D., “Receptor Surface Models. 2. Application to Quantitative Structure Activity Relationship Studies,” J. Med. Chem., 38 2091 (1995b)

Smellie, A., Kahn, S.D., and Teig, S.L., “Analysis of Conformational Coverage 1. Validation and Estimation of Coverage,” J. Chem. Info. and Computer Sci. 35 285 (1995).

Smellie, A., Kahn, S.D., and Teig, S.L., “Analysis of Conformational Coverage 2. Applications of Conformational Models,” J. Chem. Info. and Computer Sci. 35 295 (1995).

Smellie, A., Teig, S.L., and Towbin, P., “Poling: Promoting Confor-mational Coverage,” J. Comp. Chem. 16 171-187 (1995)

Smellie, A., Kahn, S.D., and Teig, S.L., “An Analysis of Conforma-tional Coverage. 1. Validation and Estimation of Coverage,” J. Chem. Inf. Comput. Sci. 35 285-294 (1995a)

Smellie, A., Kahn, S.D., and Teig, S.L., “An Analysis of Conforma-tional Coverage. 2. Applications of Conformational Models,” J. Chem. Inf. Comput. Sci. 35 295-304 (1995b)

Page 136: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

122 Cerius2 Hypothesis and Receptor Models/April 1999

A. References

Page 137: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

Cerius2 Hypothesis and Receptor Models/April 1999 123

.bdb files 98

.hvd files 983D Sketcher 53, 61, 80, 83

AAlign

changing models 25family 22methods 9, 27models 9, 10, 11, 12, 13, 16, 21moving models 25non-target models 16preferences 11, 12, 14, 33QSAR 21, 22specifying atom matches 9, 10, 17, 18strategies 13, 14, 15, 27subgraph 9, 18target model 14, 15, 22target models 25theory 27

Align Molecules menu card 4Align Redo 10Align Undo 10alignment

bump checking 14, 32calculating similarity descriptors 24consensus 9, 14, 15, 16, 23, 27consesnsus 22, 24descriptors 21flexible 14, 18, 23, 28, 32for QSAR analysis 24forcefield 9, 12, 17, 31invalid data 25methods

Field 9, 12, 22, 28, 32Moments 9, 11, 12, 15, 18, 28, 32RMS 27

non-rigid 13non-target models 28recalculating descriptors 25removing descriptors from Study Table 26

rigid 10, 12, 13, 14, 18, 28similarity descriptors 22strategies 16targeted 9, 13, 23, 27torsions 9, 10, 13, 14, 16, 28, 29viewing 10, 15, 16, 17, 19

Alignment module 2Analog Builder 53, 61, 80, 83ASearch

and DBAccess 96atom matches

editing 18in QSAR 23RMS 22specifying 9, 10, 17

atom matching 17atom query

constructing 101

Bbond query

constructing 102

Ccard decks

Databases 5Drug Discovery 4, 99Hypothesis Models 5

Catalystand DBAccess 95, 97query options 98

Catalyst ConFirm interface 3Catalyst environment 2Catalyst HipHop interface 3Catalyst interface 98Catalyst menu card 5Catalyst/SHAPE. See CatShape.CatShape

and DBAccess 96

Index

Page 138: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

124 Cerius2 Hypothesis and Receptor Models/April 1999

D

catShapeand DBAccess 95

CatShape interface 98CatShape menu card 5centroid query

constructing 104Cerius2 modeling environment 2Cerius2 Visualizer 2common subgraph 17, 18Confirm Interface menu card 5Consensus alignment of four models 15Consensus Alignment on a subset of Align

models 16control panel

Preferences 55control panel, Preferences 64control panels

Edit Query 98Evaluate Receptor Model 62Generate Receptor Model 54Hit Browser 114Load Model 53, 61, 80, 83Query Editor 99Submit Query 98, 107

customization, see preferences

DDatabase Query 2

requirements 97database query

Catalyst, submitting 108constructing 99deleting features 107editing 101hits, retrieving 114ISIS, submitting 109, 111SHAPE, submitting 109submitting 107

database searching 98Databases card deck 5DBAccess

opening 97descriptors

recalculating 25distance query

constructing 106

Drug Discovery card deck 4, 40, 54, 62

EEinside 63, 66Einteract 64, 66Electrostatic Charge Complementarity 65electrostatic potential

receptor model 72energy evaluation algorithm 66Estrain 64Evaluate button. 63Evaluate Receptor Model control panel 56, 62,

69–??Evaluate Receptor Model control panel data

table 66evaluation process

additional options 67examining data 66fitting structures to receptor model 61imposing model constraints 63mapping interactions 63running 62–66selecting receptor model 63selecting structures 63viewing results 66–??

FField alignment 9, 12, 22field similarity 11, 22File menu 53, 80, 83File Selector 58Fit slider bar 55Fit to Receptor Model 62flexible alignment 14, 18, 28, 32

alignmentflexible 13

bump checking 14forcefield

alignment 9, 12, 17, 24, 31

GGaussian function

in the electrostatic calculation 33

Page 139: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

H

Cerius2 Hypothesis and Receptor Models/April 1999 125

Generate Receptor Model control panel 54Apply to Molecule(s) 54Map Property 60Solvent Atoms 55

HHiphop Interface menu card 5Hit Browser

DBAccess 99hit list

browsing 115changing lists 116deleting hits 117view a structure 116

hitsretrieving 114

hydrogen bondingreceptor model 73

hydrophilicityreceptor model 73

hydrophobicityreceptor model 73

Hypothesisand DBAccess 96

Hypothesis Models card deck 5

IImporting models for alignment 10interaction property

changing 67electrostatic energy 63favorable 63mapping 61, 63maximum 63minimum 63total energy 63unfavorable 63van der Waals energy 63

interaction score 55ISIS

and DBAccess 95, 96query options 98

LLoad Model control panel 53, 61, 80, 83

MMDL interface 97MDL Interface menu card 5menu cards 4–??Model Receptor

accessing 40preparing to use 40starting 40

Model Receptor menu card 4, 5, 40Evaluate Receptor Model 62Generate Receptor Model 54Load Receptor Model 58Save Receptor Model 58

Model Receptor module 2model table 54, 56MOL file 101molecular orientation 62Moments alignment 9, 11, 12, 15, 18, 32

Nnon-rigid alignment 13non-target model alignment 16, 24, 28

Ppartial charge

receptor model 72Partial Charge Complemetarity 65Potential Probe 11potential probe 9preferences

Align 14align 11alignment 12descriptors 22effect on descriptor calculation 23effect on descriptor data 25energy evaluation 65for descriptor analysis 24

Preferences control panel 55, 64preferences, setting 55, 64

Page 140: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

126 Cerius2 Hypothesis and Receptor Models/April 1999

Q

propertymapping 59

property valueadjusting 60

QQSAR

aligning models to target models 23analysis 24atom matches 23incorporating Align data in 21recalculating descriptors 26Study Table 22

QSAR activity data, using 57QSAR menu card 4quality of fit 66Query Database menu card 4

Rreceptor model

evaluating structures 41, 61–??generating 40, 54–??, 61mapping properties 56, 59–60molecule interactions 73restoring 58saving 58shape field 71specifying solvent exposure 55specifying tightness of fit 55

receptor model propertiescharge 59electrostatic potential 59hydrogen bonding 60hydrophobicity 60nothing 59

receptor surfacechoosing a type 57, 65soft type 58, 65van der Waals type 58, 65

receptor surface analysis. See RSA.receptor template structures 54rigid alignment 10, 12, 13, 14, 18, 28RMS, see alignment methodsRSA 75

Sselecting torsion angles

torsion angles, selecting 13Shape Field 71solvation, making a correction 57, 65solvation, making correction 57, 65static conformation 63structures

adjusting geometry 61aligning 40, 53building 40, 53, 61displaying 53evaluating 41, 61–64modifying 41

surface fitreceptor model 72

Ttarget model

activity data 24aligning to QSAR models 23and alignment descriptors 21, 22and and similarity descriptors 25atom matches 23effect on descriptor data 25non-align 25pairwise alignment 23

target model alignment strategy 13target model alignment to a single model 15Target Model alignment to multiple Target

modelsAlign

target model 16torsion angles

selecting 13torsions

alignment 9, 10, 13, 14, 16, 28, 29Transparency slider bar 56

Vvan der Waals energy 63VDW energy, see van der Waals energyviewing

alignment 10, 17

Page 141: Cerius2 - ESI · 2001-06-08 · Cerius2 Hypothesis & Receptor Models Release 4.0 April 1999 Molecular Simulations Inc. 9685 Scranton Road San Diego, CA 92121-3752 619/458-9990 Fax:

V

Cerius2 Hypothesis and Receptor Models/April 1999 127

viewing alignment 15, 16, 19