Protein Structure & Modeling

Biology 224

Instructor: Tom Peavy

Nov 18 & 23, 2009

Classical structural biology

Determine biochemical activity

Purify protein

Determine structure

Understand mechanism, function

Structural genomics

Determine genomic DNA sequence

Predict protein

Determine structure or analyze in silico

Understand mechanism, function

Protein function and structure

Function is often assigned based on homology. However, homology based on sequence identity may be subtle.

Consider RBP and OBP: these are true homologs (they are both lipocalins, sharing the GXW motif). But they are distant relatives, and do not share significant amino acid identity in a pairwise alignment.

Protein structure evolves more slowly than primary amino acid sequence. RBP and OBP share highly similar three-dimensional structures.

Principles of protein structure

Primary amino acid sequence

Secondary structure: helices, sheets

Tertiary structure: from X-ray, NMR

Quaternary structure: multiple subunits

Protein secondary structure

Protein secondary structure is determined by the amino acid side chains.

Myoglobin is an example of a protein having many α-helices. These are formed by amino acid stretches 4-40 residues in length.

Thioredoxin from E. coli is an example of a protein with many β sheets, formed from strands composed of 5-10 residues. They are arranged in parallel or antiparallel orientations.

Myoglobin (John Kendrew, 1958)

Thioredoxin

Secondary structure prediction

Chou and Fasman (1974) developed an algorithm based on the frequencies of amino acids found in α-helices, β-sheets, and turns.

Proline: occurs at turns, but not in helices.

GOR (Garnier, Osguthorpe, Robson): related algorithm

Modern algorithms: use multiple sequence alignments and achieve a higher success rate (about 70-75%)
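The frequency idea behind this family of methods can be sketched in a few lines. This is a minimal illustration, not the published Chou-Fasman algorithm or its parameter tables: the training sequences and state strings below are hypothetical toy data, and the window size and threshold are arbitrary assumptions.

```python
from collections import Counter

# Hypothetical toy training data (NOT real structures): each sequence is
# paired with a per-residue state string, H = helix, E = strand, C = coil/turn.
TRAINING = [
    ("AEELLKKAGPSG", "HHHHHHHHCCCC"),
    ("VTVIVGNPDG", "EEEEECCCCC"),
    ("MAELLEKVYVTV", "HHHHHHHEEEEE"),
]


def propensities(training, state):
    """Return P(aa | state) / P(aa) for every amino acid in the training set."""
    all_counts = Counter()
    state_counts = Counter()
    for seq, states in training:
        for aa, s in zip(seq, states):
            all_counts[aa] += 1
            if s == state:
                state_counts[aa] += 1
    total = sum(all_counts.values())
    in_state = sum(state_counts.values())
    return {
        aa: (state_counts[aa] / in_state) / (all_counts[aa] / total)
        for aa in all_counts
    }


def predict(seq, prop, window=4, threshold=1.0, symbol="H"):
    """Mark every residue covered by a window whose mean propensity exceeds
    the threshold. Residues absent from the training set get a neutral 1.0."""
    marks = ["-"] * len(seq)
    for start in range(len(seq) - window + 1):
        segment = seq[start:start + window]
        mean = sum(prop.get(aa, 1.0) for aa in segment) / window
        if mean > threshold:
            for i in range(start, start + window):
                marks[i] = symbol
    return "".join(marks)


helix_prop = propensities(TRAINING, "H")
print(predict("AEELLKKAGPSG", helix_prop))
```

A propensity above 1 means the residue is seen in that state more often than expected by chance. In this toy set proline never occurs in a helix, so its helix propensity is 0.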

Secondary structure prediction

Web servers:

GOR4, Jpred, NNPREDICT, PHD, Predator, PredictProtein, PSIPRED, SAM-T99sec

Tertiary protein structure: protein folding

Three main approaches:

[1] experimental determination (X-ray crystallography, NMR)

[2] Comparative modeling (based on homology)

[3] Ab initio (de novo) prediction

Experimental approaches to protein structure

[1] X-ray crystallography
-- Used to determine 80% of structures
-- Requires high protein concentration
-- Requires crystals
-- Able to trace amino acid side chains
-- Earliest structure solved was myoglobin

[2] NMR
-- Magnetic field applied to proteins in solution
-- Largest structures: 350 amino acids (40 kD)
-- Does not require crystallization

Access to PDB through NCBI

Molecular Modeling DataBase (MMDB)

Cn3D (“see in 3D” or three dimensions): structure visualization software

Vector Alignment Search Tool (VAST): view multiple structures

Additional web-based sites to visualize structures

Swiss-PDB Viewer

RasMol

Structural Classification of Proteins (SCOP)

SCOP describes protein structures using a hierarchical classification scheme:

Classes
Folds
Superfamilies (likely evolutionary relationship)
Families
Domains
Individual PDB entries

http://scop.mrc.lmb.cam.ac.uk/scop/
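One way to read the hierarchy is that two proteins are more closely related the deeper the level at which their classifications still agree. The sketch below illustrates that reading using only the upper four levels; the classification labels are placeholders invented for the example, not actual SCOP entries.

```python
# Upper levels of the hierarchy, ordered from broadest to most specific.
SCOP_LEVELS = ("class", "fold", "superfamily", "family")


def deepest_shared_level(a, b):
    """Return the most specific level at which two classifications agree,
    or None if they already differ at the class level."""
    shared = None
    for level in SCOP_LEVELS:
        if a[level] != b[level]:
            break
        shared = level
    return shared


# Placeholder labels (hypothetical, for illustration only).
protein_1 = {"class": "class-A", "fold": "fold-1",
             "superfamily": "sf-1", "family": "fam-1"}
protein_2 = {"class": "class-A", "fold": "fold-1",
             "superfamily": "sf-1", "family": "fam-2"}
protein_3 = {"class": "class-B", "fold": "fold-9",
             "superfamily": "sf-9", "family": "fam-9"}

print(deepest_shared_level(protein_1, protein_2))
print(deepest_shared_level(protein_1, protein_3))
```

Agreement at the superfamily level but not the family level corresponds to the "likely evolutionary relationship" noted in the list.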

There are more than 20,000 structures in PDB, and about 1 million protein sequences in SwissProt/TrEMBL. For most proteins, structural models derive from computational biology approaches, rather than experimental methods.

The most reliable method of modeling and evaluating new structures is by comparison to previously known structures. This is comparative modeling.

An alternative is ab initio modeling.

Approaches to predicting protein structures

obtain sequence (target)

fold assignment

comparative modeling

ab initio modeling

build, assess model

Approaches to predicting protein structures

[1] Perform fold assignment (e.g. BLAST, CATH, SCOP); identify structurally conserved regions

[2] Align the target (unknown protein) with the template. This is generally done when the two share >30% amino acid identity over a sufficient length

[3] Build a model

[4] Evaluate the model
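The identity check in step [2] can be sketched as follows. This assumes the two sequences are already aligned (equal length, "-" for gaps) and computes identity over columns where neither sequence has a gap, which is only one of several conventions in use. The 30% cutoff comes from the step above; the `min_length` default of 50 is an illustrative assumption, since the slides do not say what a sufficient length is.

```python
def percent_identity(aligned_target, aligned_template):
    """Return (percent identity, number of aligned columns) for two
    gapped sequences of equal length. Columns with a gap are skipped."""
    if len(aligned_target) != len(aligned_template):
        raise ValueError("aligned sequences must have the same length")
    matches = 0
    aligned = 0
    for a, b in zip(aligned_target, aligned_template):
        if a == "-" or b == "-":
            continue
        aligned += 1
        if a == b:
            matches += 1
    identity = 100.0 * matches / aligned if aligned else 0.0
    return identity, aligned


def choose_approach(identity, aligned_length, min_identity=30.0, min_length=50):
    """Suggest a modeling route. min_length is a hypothetical cutoff."""
    if identity > min_identity and aligned_length >= min_length:
        return "comparative modeling"
    return "ab initio modeling"


identity, aligned = percent_identity("ACDE-FG", "ACDQWFG")
print(round(identity, 1), aligned, choose_approach(identity, aligned))
```

In the toy alignment the identity is high but the aligned region is far too short, so the sketch does not suggest comparative modeling.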

Comparative modeling of protein structures

Errors may occur for many reasons

[1] Errors in side-chain packing

[2] Distortions within correctly aligned regions

[3] Errors in regions of target that do not match template

[4] Errors in sequence alignment

[5] Use of incorrect templates

Errors in comparative modeling

Many web servers offer comparative modeling services.

Examples are SWISS-MODEL (ExPASy), Predict Protein server (Columbia), and WHAT IF (CMBI, Netherlands)

Comparative modeling

Ab initio prediction can be performed when a protein has no detectable homologs.

Protein folding is modeled based on global free-energy minimum estimates.

The “Rosetta” method was applied to sequence families lacking known structures. For 80 of 131 proteins, one of the top five ranked models successfully predicted the structure within 6.0 Å RMSD (Bonneau et al., 2002).
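RMSD (root-mean-square deviation) measures how far a model lies from a reference structure: the square root of the mean squared distance between matched atoms. The sketch below assumes the two coordinate sets are already optimally superimposed and list the same atoms in the same order; the superposition step is omitted, and the coordinates are made-up values.

```python
import math


def rmsd(coords_a, coords_b):
    """RMSD in the input units (e.g. Å) between two equal-length lists of
    (x, y, z) tuples. Assumes the structures are already superimposed."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("need two non-empty coordinate lists of equal length")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))


# Made-up coordinates for illustration only.
model = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (3.0, 0.0, 0.0)]
reference = [(0.0, 1.0, 0.0), (1.5, 1.0, 0.0), (3.0, 1.0, 0.0)]

value = rmsd(model, reference)
print(value, value <= 6.0)  # compare against the 6.0 Å cutoff quoted above
```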

Ab initio protein structure prediction
