24
Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function

Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function

Embed Size (px)

Citation preview

Page 1: Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function

Genomics II:The Proteome

Using high-throughput methods to identify proteins and to

understand their function

Page 2: Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function

Subcellular localization of the yeast proteome

• Complete genome sequences allow each ORF to be precisely tagged with a reporter molecule

• Tagged ORF proteins indicate subcellular localization– Useful for the following:

• Correlating to regulatory modules• Verifying data on protein–protein interactions• Annotating genome sequence

Page 3: Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function

Attaching a GFP tag to an ORF

Fusion protein

Chromosome

PCR product

COOHNH2

Homologousrecombination

GFP HIS3MX6

ORF1 ORF2

protein GFP

Page 4: Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function

FlyTrap Screen for

Protein Localization

http://flytrap.med.yale.edu/

Page 5: Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function

Patterns of protein localization

Page 7: Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function

Identification of unpredicted ORFs

Page 8: Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function

Protein-protein interactions“The Interactome”

• Yeast two-hybrid analysis

• Protein chips

• Biochemical purification/Mass spectrometry

• Protein complementation

Page 9: Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function

Yeast two-hybrid method• Goal: Determine how proteins interact with each other

• Method– Use yeast transcription factors

– Gene expression requires the following:• A DNA-binding domain• An activation domain• A basic transcription apparatus

– Attach protein1 to DNA-binding domain (bait)

– Attach protein2 to activation domain (prey)

– Reporter gene expressed only if protein1 and protein2 interact with each other

Page 10: Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function

A schematic of the yeast two-hybrid method

m

n

Page 11: Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function

Results from a yeast two-hybrid experiment

• Goal: To characterize protein–protein interactions among 6,144 yeast ORFs– 5,345 were successfully cloned into yeast as

both bait and prey– Identity of ORFs determined by DNA

sequencing in hybrid yeast– 692 protein–protein interaction pairs– Interactions involved 817 ORFs

Page 12: Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function

Yeast two-hybrid results for flies & worms

• Worms:– Created >3000 bait constructs– Tested against two AD libraries– Mapped 4000 interactions

Flies:Flies: Screened 10,000 predicted transcriptsScreened 10,000 predicted transcripts Found 20,000 interactionsFound 20,000 interactions

Statistically assigned 4800 as “high quality” Statistically assigned 4800 as “high quality” interactionsinteractions

Page 13: Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function

Caveats associated with the yeast two-hybrid method

• There is evidence that other methods may be more sensitive

• Some inaccuracy reported when compared against known protein–protein interactions– False positives– False negatives

Page 14: Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function

Purification of interacting proteins

• Immunoprecipitation– Impractical on large scale (identification of

unknowns)

• Affinity purification– Biochemically practical, but too dirty

• Tandem affinity purification– Sufficient yield & purity for identification of

unknown proteins

Page 15: Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function

TAP Purification

Strategy

Page 16: Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function

Identification of Interacting Proteins

ProteolyticDigestion(Trypsin)

MassSpectrometricAnalysis

Page 17: Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function

Identifying proteins with mass spectrometry

• Preparation of protein sample– Extraction from a gel– Digestion by proteases — e.g., trypsin

• Mass spectrometer measures mass-charge ratio of peptide fragments

• Identified peptides are compared with database– Software used to generate theoretical peptide mass

fingerprint (PMF) for all proteins in database– Match of experimental readout to database PMF

allows researchers to identify the protein

Page 18: Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function

Mass spectrometry

• Measures mass-to-charge ratio

• Components of mass spectrometer– Ion source– Mass analyzer– Ion detector– Data acquisition unit

A mass spectrometer

Page 19: Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function

Principle of mass spectrometry

Page 20: Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function

Ion sources used for proteomics

• Proteomics requires specialized ion sources

• Electrospray Ionization (ESI)– With capillary

electrophoresis and liquid chromatography

• Matrix-assisted laser desorption/ionization (MALDI)– Extracts ions from sample

surface

ESI

MALDI

Page 21: Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function

Mass analyzers used for proteomics

• Ion trap– Captures ions on the basis

of mass-to-charge ratio– Often used with ESI

• Time of flight (TOF)– Time for accelerated ion to

reach detector indicates mass-to-charge ratio

– Frequently used with MALDI

• Also other possibilities

Ion Trap

Time of Flight

Detector

Page 22: Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function

A mass spectrum

Page 23: Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function

Identifying proteins with mass spectrometry

• Preparation of protein sample– Extraction from a gel– Digestion by proteases — e.g., trypsin

• Mass spectrometer measures mass-charge ratio of peptide fragments

• Identified peptides are compared with database– Software used to generate theoretical peptide mass

fingerprint (PMF) for all proteins in database– Match of experimental readout to database PMF

allows researchers to identify the protein

Page 24: Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function

Limitations of mass spectrometry

• Not very good at identifying minute quantities of protein

• Trouble dealing with phosphorylated proteins

• Doesn’t provide concentrations of proteins

• Improved software eliminating human analysis is necessary for high-throughput projects