Near-atomic resolution 3-D reconstruction of viruses · 1 exposure, on film, Nikon scanner, 2-D...

Preview:

Citation preview

Near-atomic resolution 3-D reconstruction of viruses

Wen Jiang

EMAN Workshop, 2008

Markey Center for Structural Biology

Viruses

Fields Virology, 2006

Bacteriophages

•! “Parasites” that infect cells

•! Bacterial viruses are also called phages

•! 102-103 Å in diameter

•! Composition: protein shell + DNA/RNA genome. Some have lipid envelope

•! Many viruses have icosahedral shell

Review by Crowther, 2004

Viruses as Early 3-D EM Targets (1960s) T

MV

adenovirus T4

TBSV

Adrian, M. et al. (1984). Cryo-electron microscopy of viruses. Nature 308, 32.

Ice Embedded Adenovirus Virions

Cryo-EM: Frozen Hydrated Sample

Tailed dsDNA Bacteriophages

portal

scaffold core

terminase Procapsid

Infectious Phage

Bacterial Cell

•! ~1031 tailed dsDNA phage particles ! most abundant life form on Earth

and enormous genetic diversity

•! Some of the most complex macromolecular machines known

•! Host specificity is determined by the tail fibers

Tailed dsDNA Phage: Atomic Models

infectious phage

HK97: recombinant shell

(2000)

?

T4: vertex protein

(2005)

Epsilon15 Phage

•! short tail dsDNA phage (Podoviridae)

•! infects Salmonella

anatum

•! ~40kb genome

•! ~700Å in diameter

•! ~20 MDa in mass

align & average

segmentation

Epsilon15: Icosahedral Shell at 9.5 Å

!hexon average

!T=7 asymmetric unit

Jiang et al. Nature 2006

Shell Proteins of Epsilon15 Phage and HK97 Phage Have Common Fold

HK97 Epsilon15

(Wikoff et al. Science 2000) (Jiang et al. Nature 2006)

phi29

Morais et al. Mol. Cell 2005

Jiang et al. NSB 2003

P22

Common Shell Protein Fold for Tailed dsDNA Phages and Herpesvirus

Jiang et al. Nature 2006

Epsilon15

T4

Fokine et al. PNAS 2005

HK97

Wikoff et al. Science 2000

Baker et al. J.Virol. 2005

Herpes

>15 Å <4 Å 6 Å 9 Å

2BTV

Helices Sheets

Strands Domains Size

Shape Backbone Side Chain

2BTV VP3A

Towards Near-Atomic Resolution

4.5Å Icosahedral Structure of Epsilon15

•! Liquid Helium microscope

•! Large dataset (~3000 micrograph)

•! Image processing (EMAN)

•! Large scale computation (million hours) Jiang et al. Nature 2008

C! Model of Epsilon15: Asymmetric Unit

T = 7L

GP7: Conformation Variance

Capsid Model of Epsilon15

Density Map gp7 Model

2006 (Jiang et al. Nature)

9.5Å

2008 (Jiang et al. Nature)

4.5Å

gp7

gp10

gp10: a molecular staple

gp10

gp7

3.7 Å map: gp7 !

Near-atomic Resolution 3-D Reconstruction

Tasks Requirements/Methods

Sample Stable, homogeneous

Imaging Good microscope

(LHe or LN2 ?)

Image processing Accurate alignment (orientation,

center, defocus)

Computing Cluster, Condor, Grid

High Resolution Images

JEOL3200FSC: 300 kV, N2, Energy filter ~1500 micrographs, 50k mag

1 exposure, on film, Nikon scanner

JEOL3000SFF: 300 kV, Helium ~3000 micrographs, 60k mag

1 exposure, on film, Nikon scanner,

2-D Alignment

JEOL3000SFF JEOL3200FSC

Orienta

tion V

ariatio

n (

degre

e)

2-D Alignment O

rienta

tion V

ariatio

n (

degre

e)

2-D Alignment O

rie

nta

tion V

ariatio

n (

degre

e)

Cen

ter

Variatio

n (

pix

el)

Focus Variation

!15 70

ice

!15

!15 !15

!15

!15 700Å

ice

!15 !15 !15 !15

Focus Variation

Focus Variation

Defocus (µm)

Focus Variation

Defo

cus

Variatio

n (

µm

)

Focus Variation

Defo

cus

variatio

ns

(Angst

rom

)

250

500

-500

-250

Grid 1 Grid 2

0

Detecting CTF Fitting Errors

Large Scale Distributed Computing: Condor

Campus B

Campus A

Developed by U Wisconsin - Madison

Greater than 50% available " 25% - 50% available " Less than 25% available "

Large Scale Computing: Condor

Parallelizing 2-D Alignment

0 1 2 n

0 M00 M01 M02 M0n

1 M10 M11 M12

2 M20 M21 M22

Mij

m Mm0 Mmn

Cry

o-E

M p

art

icle

images

(m)

reference projections of current 3-D model (n)

•!All to all comparison

•!Comparisons are independent

•!m=102-105

•!n=102-103

•!Image size: 100-10002

•!3 variables (2 center, 1 rotation) for comparison pair

•!Block parallelization

EMAN on Condor •! Implemented EMAN on Condor

•! Works well with Condor on Linux (32bit and 64bit X86, PPC64) and Windows

•! Simultaneously run on all Purdue Condor pools (BoilerGrid)

•! Purdue main campus

•! Purdue Calumet

•! Indiana state university

•! Notre Dame University

•! Hundreds to thousands CPUs can be used

Remaining Issues:

•! Data IO/image file transfer bottleneck ?

•! Distributed checkpointing ?

Large Scale Computing: Condor 100 " 1000 CPUs

Num

ber

of R

unnin

g J

obs

8AM Time (hours)

Windows 32bit

All

Linux 32 bit

Linux 64 bit

Purdue Condor Pool 2007

Purdue Condor Pool 2008:

Condor Is Growing

7PM

Condor Is Growing

Total Owner Claimed Unclaimed Matched IA64/LINUX 4 0 0 4 0 INTEL/LINUX 818 346 218 225 29 INTEL/WINNT51 5142 2887 1 2254 0 INTEL/WINNT52 1 0 0 1 0 INTEL/WINNT60 2373 250 0 2123 0 PPC/OSX 1 0 0 1 0 PPC64/LINUX 8 0 0 8 0 SUN4u/SOLARIS28 1 0 0 1 0 SUN4u/SOLARIS5.10 15 5 0 10 0 X86_64/LINUX 10739 7738 2857 144 0 Total 19102 11226 3076 4771 29

Linux clusters

Research labs, Teaching Labs, Offices, …

Computing Resources for Image Processing

Projection

2-D Alignment

3-D Reconstruction

Class Averaging

Image Processing Tasks

Desktop

Group of Desktops

Cluster

Condor

Computing Resources

?

Our dream:

Super-computer scale resource for complete image processing from individual’s desktop without worrying about complex computer issues for a biologist user

Tailed dsDNA Phage: Atomic Models

Infectious Phage (2008)

HK97: recombinant shell

(2000)

T4: vertex protein

(2005)

Can we solve the structure of non-icosahedral components: portal, tail, genome ?

Complete structure (20 Å)

Jiang et al. Nature 2006

Icosahedral shell (4.5 Å)

Jiang et al. Nature 2008

Beyond Icosahedral Averaging

Icosahedral Reconstruction ! Non-Icosahedral Reconstruction

particle images

icosahedral reconstruction

icosahedral orientation

C1 orientation 1

C1 orientation 2

C1 orientation 60 best C1 orientation

C1 reconstruction

projection matching

asymmetric reconstruction icosahedral

Asymmetric Reconstruction

1

2

34

5

6

side view

top view

“Virus Anatomy”

shell

tail spikes

tail hub

portal

core

dsDNA

dsDNA terminus

700Å

850Å

dsDNA Packing: Coaxial Spooling solenoid spiral

liquid crystal folded toroid

thebacteriophages.org

Epsilon15 vs. P22

Epsilon15 P22

Acknowledgments

MIT

Jonathan King Peter Weigele

Baylor College of Medicine

Wah Chiu Matthew Baker Jonita Jakana Qinfen Zhang Steve Ludtke Juan Chang

Funding

Purdue, NIH/NIAID

Purdue

Weimin Wu Ping-An Fang Min Su Teepanis Chachiyo

Rosen Center of Advanced Computing

New Structural Biology Building (2009)

Recommended