27
IBiS 403. The Human Proteome: Defining Variations and Modifications of Protein Molecules Lecture 1 Tuesday Jan. 6, 2015 1 INTERACTION PARTNERS SEQUENCE VARIANTS COVALENT MODIFICATIONS CANONICAL SEQUENCE

1 Intro to Proteomics WQ 2015 v2

Embed Size (px)

DESCRIPTION

proteins

Citation preview

IBiS 403. The Human Proteome: Defining Variations and Modifications

of Protein Molecules

Lecture 1Tuesday Jan. 6, 2015 1

INTERACTION PARTNERSSEQUENCE VARIANTSCOVALENT MODIFICATIONS

CANONICAL SEQUENCE

Some Logistics

Syllabus overview

• Distribution of assignments

• Reading expectations

• Course reserves

2

NU New CMS: Canvas

(https://canvas.northwestern.edu/)

3

https://canvas.northwestern.edu/

4

Goal of Proteomics Course

To teach:

1. the value of proteomics

2. how such measurements are made

3. what questions can be answered

4. and current developments in the field of proteomics.

5. How proteomics could be used in (your) research

5

Course Sections (Modules)

A. Mass Spectrometry

• Fundamentals and data collection

B. Bottom-Up Proteomics

• Data analysis and verification

C. Quantitative Proteomics

• Relative and absolute means

D. Top-Down Proteomics

E. Protein Variation

6

Assumed Knowledge

Basic Eukaryotic Cell Biology• Central Dogma• Cell cycle• Signaling pathways

Basic Molecular Biology and Biochemistry• Molecular manipulations of DNA: PCR, cloning, transfection

etc.• Northern, Southern and Western blots

Basic Protein Chemistry and processing in vivo (a Must !)

mRNA processing, tRNA

ORFs, exons, introns, polymorphisms

7

What is the proteome?

The complete set of unique protein molecules within a cell.

8

From One Gene, Many Proteoforms

DNA

SNPs; translocations; gene families

mRNA proteins

~20,300 human genes

>100,000 mRNA >1,000,000 distinct proteoforms

9

Splice variants Covalent modifications; proteolytic processing

Genome size of eukaryotes is

highly variable

10

Genome size and gene number(C-value Paradox/Enigma)

The human genome has

minimal context

dependence; the human

proteome is the opposite

The genome is complicated

13

The genome is complicated

14

WHAT WE (MAINLY) CARE ABOUT IN THIS

COURSE

Sequence Variability

The Era of Functional Genomics

Human Genome Sequence

Drugs & Diagnostics

Expression Variability

Phenotypic variation

Mapping of Human Disease Loci

(Chromosomes 9-12)

http://www.ncbi.nlm.nih.gov/cgi-bin/SCIENCE96/chr?1

Nuclear membrane

Chromatin

fiber

}Trans-acting

factors

H1

H1

Nuclear matrix

Nuclear

pore

Interphase

nucleus

Nucleosomes

DNA

Chromatin fiber

(30 nm dia.)

By weight, the human genome is half DNA and half protein

Packaging of DNA into Chromatin

From Lodish et al. 2000.

Mosammaparast & Shi, 2010

Histone Modifications: Position- and State-specific

Eukaryotic gene structure (simplified)

23

Alternative splice forms yield different

protein molecules

24

Origins of Complexity in the Human Proteome

End processing

XPi

Polymorphism/`Mutations

CN

UnknownModifications

EnzymaticModifications

Variable Splicing

Ac Me

Ac

Key Concept: sources of protein variability result in a large, but finite number of protein forms (~106 / cell type), resulting in a measurement challenge.

What questions can proteomics answer?

What proteins are expressed?

How are these proteins modified?• Any covalent modifications?

• Alternative splice forms?

Where are these proteins located?• Which cell types?

• Which organelles?

When are these proteins expressed?• At what points in the cell cycle?

• In the presence of phamaceuticals?

26

Mass Spectrometry is the workhorse for

untargeted proteomics

• High-throughput

• Confident identification of peptides/proteins

• Can achieve complete protein characterization

27