25
Part 2: Molecular Biology IGEM, 20 June 2006

Part 2: Molecular Biology IGEM, 20 June 2006. Proteins Structural and catalytic components of cells are mainly proteins. Linear polymers of 20 different

Embed Size (px)

Citation preview

Page 1: Part 2: Molecular Biology IGEM, 20 June 2006. Proteins Structural and catalytic components of cells are mainly proteins. Linear polymers of 20 different

Part 2: Molecular Biology

IGEM, 20 June 2006

Page 2: Part 2: Molecular Biology IGEM, 20 June 2006. Proteins Structural and catalytic components of cells are mainly proteins. Linear polymers of 20 different

Proteins

• Structural and catalytic components of cells are mainly proteins.

• Linear polymers of 20 different amino acids joined by amide linkages.

• Spontaneously fold into 3D shapes (tertiary structure).

• Conventionally written from N to C terminus using three-letter or one-letter codes.

• May have multiple identical or different subunits.

(H2N)-Met-Pro-Asp-Thr-Ser-Phe-Ser-Asn-Pro-Gly-Leu-(COO-)

MPDTSFSNPGLFTPLQ…

Ribbon diagram of protein structure

Page 3: Part 2: Molecular Biology IGEM, 20 June 2006. Proteins Structural and catalytic components of cells are mainly proteins. Linear polymers of 20 different

Structure of DNA

• Protein sequences are encoded in DNA.• Linear polymer of four types of base: G, A, T, C.• Strands directional: written from 5’ to 3’ end.• Generally two antiparallel strands, hydrogen-bonded.• G on one strand binds C on other; likewise A binds T.• E. coli DNA forms a circle about 4.5 million base pairs

(Mbp) in length, encoding about 4000 proteins.

5’-AAGGAGGCAGAATGCCGGACACGTCCTTCTCCACCCCAGGCCTG-3’3’-TTCCTCCGTCTTACGGCCTGTGCAGGAAGAGGTGGGGTCCGGAC-5’

Page 4: Part 2: Molecular Biology IGEM, 20 June 2006. Proteins Structural and catalytic components of cells are mainly proteins. Linear polymers of 20 different

Encoding of proteins in DNA

• Protein structure is determined by amino acid sequence.• Encoded in DNA in ‘open reading frames’ (ORFs).• Three DNA bases (one codon) specify an amino acid.• There are 64 possible codons.• Some amino acids are specified by a single codon,

others by 2, 3, 4 or 6 different codons (degenerate).• The beginning of an ORF is indicated by the start codon

ATG (rarely, GTG) which specifies N-formyl methionine.• The end of an ORF is indicated by a stop codon: TAA,

TGA or TAG.

Page 5: Part 2: Molecular Biology IGEM, 20 June 2006. Proteins Structural and catalytic components of cells are mainly proteins. Linear polymers of 20 different

Transfer of sequence information from DNA to protein

• TRANSCRIPTION: one strand of DNA (sense or coding strand) is copied as a single stranded messenger RNA (mRNA).

• TRANSLATION: the sequence information encoded in the mRNA is used to generate a protein.

DNAtranscription

mRNAtranslation

protein

Page 6: Part 2: Molecular Biology IGEM, 20 June 2006. Proteins Structural and catalytic components of cells are mainly proteins. Linear polymers of 20 different

Transcription

• DNA sequence is copied to RNA by RNA Polymerase (RPol).

• RPol initially binds to DNA at a promoter sequence, which specifies which strand is to be copied.

• RPol binds to the promoter sequence and begins to make an RNA copy of the specified strand, from 5’ to 3’, using the other strand (template or antisense strand) as a template.

• RNA structure is similar to DNA except that T (thymine) is replaced by U (uracil).

• Transcription continues until a termination sequence is reached, at which point RPol dissociates from the DNA.

Page 7: Part 2: Molecular Biology IGEM, 20 June 2006. Proteins Structural and catalytic components of cells are mainly proteins. Linear polymers of 20 different

Translation

• Sequence information in mRNA is translated into protein sequence by ribosomes.

• Ribosomes bind to mRNA at ribosome binding sites (rbs), which partially match AAGGAGG.

• If a start codon AUG (or GUG) is within 5 to 10 bp of a rbs, the ribosome will begin making a protein.

• Translation continues until a stop codon is reached, at which point the ribosome dissociates from the protein and mRNA.

• The protein spontaneously folds into its final shape.• Multiple proteins can be encoded by one mRNA if each

has its own rbs. A group of ORFs expressed on a single mRNA is an operon.

Page 8: Part 2: Molecular Biology IGEM, 20 June 2006. Proteins Structural and catalytic components of cells are mainly proteins. Linear polymers of 20 different

Control of expression

• At any given time, only a subset of the 4000 ORFs are being expressed.

• This is mainly controlled by switching promoters on and off in response to various stimuli.

• Some promoters are controlled by activator proteins, which must bind near the promoter to recruit RPol.

• Some promoters are controlled by repressor proteins, which can bind to the promoter to prevent RPol binding.

Page 9: Part 2: Molecular Biology IGEM, 20 June 2006. Proteins Structural and catalytic components of cells are mainly proteins. Linear polymers of 20 different

Positive Control• The promoter is switched on in response to the presence

of a small molecule (inducer).• The inducer may bind to an activator protein, causing it

to bind to the promoter, or to a repressor protein, causing it to dissociate from the promoter.

RPolactivator

RPolinducer

RPol

repressor

Inducer-activator complex binds promoter and recruits RPol.

RPolrepressor

inducer

Repressor binds promoter only when inducer is not present.

Page 10: Part 2: Molecular Biology IGEM, 20 June 2006. Proteins Structural and catalytic components of cells are mainly proteins. Linear polymers of 20 different

Negative control• A promoter is switched off in response to the presence of

a small molecule (corepressor).• The corepressor may bind to an activator protein,

causing it to dissociate from the promoter, or to a repressor protein, causing it to bind the promoter.

RPolactivator

RPol corepressor

Corepressor-activator complex does not bind to promoter and recruit RPol.

RPolrepressor

RPol

repressorcorepressor

Repressor-corepressor complex binds promoter and blocks RPol.

Page 11: Part 2: Molecular Biology IGEM, 20 June 2006. Proteins Structural and catalytic components of cells are mainly proteins. Linear polymers of 20 different

Sequence of a gene/operon

• Promoter (eg TTGACA … TATA…).• Operator site (binds activator or repressor protein).• Ribosome binding site (eg AAGGAGG).• Start codon (ATG).• Open reading frame.• Stop codon (TAG, TGA, TAA).• May be other {rbs-start-ORF-stop} combinations.• Transcription termination sequence.

promoter + operator site

start codon stop codon

ORF termination sequence

Page 12: Part 2: Molecular Biology IGEM, 20 June 2006. Proteins Structural and catalytic components of cells are mainly proteins. Linear polymers of 20 different

Genetic Engineering

• Techniques for transfer of genes from one organism to another. Considerations include:

• Host (background)• Vector• Selectable marker• Restrictions sites for DNA insertion• Reporter genes

Page 13: Part 2: Molecular Biology IGEM, 20 June 2006. Proteins Structural and catalytic components of cells are mainly proteins. Linear polymers of 20 different

Host strains

• Many specialized host strains of E. coli are available.• Disabled: cannot colonise intestines.• Mutation in recA: DNA cannot recombine.• Other specialized mutations for particular purposes.• Most common host strains (eg DH5, JM109) are

derived from strain K12.

Page 14: Part 2: Molecular Biology IGEM, 20 June 2006. Proteins Structural and catalytic components of cells are mainly proteins. Linear polymers of 20 different

Vectors

• A way to carry DNA into the cell.• Usually plasmids: loops of DNA which are replicated

independently of the main chromosome.• Require an origin of replication (ori).• Two plasmids with the same ori cannot be stably

maintained in the same cell.• Usually carry one or more antibiotic resistance genes to

select for cells which have the plasmid. Most commonly ampicillin resistance (apR, bla, beta lactamase).

Page 15: Part 2: Molecular Biology IGEM, 20 June 2006. Proteins Structural and catalytic components of cells are mainly proteins. Linear polymers of 20 different

Restriction sites

• DNA can be cut at specific 6 bp sequences by restriction enzymes (type 2 restriction endonucleases).

• May leave ‘sticky’ or ‘blunt’ ends.• Sites are usually palindromic (dyad symmetry).• Eg EcoRI: GAATTC; HindIII: AAGCTT

5’-NNNGAATTCNNN-3’3’-NNNCTTAAGNNN-5’

5’-NNNG-3’ 5’-AATTCNNN-3’3’-NNNCTTAA-5’ 3’-GNNN-5’

Page 16: Part 2: Molecular Biology IGEM, 20 June 2006. Proteins Structural and catalytic components of cells are mainly proteins. Linear polymers of 20 different

Ligation

• Compatible sticky ends tend to stick together by base pairing, especially at low temperatures.

• Ends can be covalently joined using T4 DNA ligase.

5’-NNNGAATTCNNN-3’3’-NNNCTTAAGNNN-5’

5’-NNNGAATT-3’ 5’-CNNN-3’3’-NNNC-5’ 3’-TTAAGNNN-5’

Page 17: Part 2: Molecular Biology IGEM, 20 June 2006. Proteins Structural and catalytic components of cells are mainly proteins. Linear polymers of 20 different

Inserting a new gene into a plasmid

• Digest plasmid and source DNA with the same pair of restriction enzymes.

• Mix the DNA together and add DNA ligase.• A fraction of the resulting product molecules will contain

the plasmid with the new gene inserted.

EcoRI HindIII EcoRI HindIIIEcoRI HindIII

digest mix and ligate

Page 18: Part 2: Molecular Biology IGEM, 20 June 2006. Proteins Structural and catalytic components of cells are mainly proteins. Linear polymers of 20 different

Introducing a plasmid into a host

• E. coli cells will take up DNA after being treated to make them competent.

• Mix competent cells with plasmid or ligation to allow DNA uptake (transformation).

• Spread cells on an agar plate containing the correct antibiotic (eg ampicillin).

• Only cells with the plasmid will survive exposure to the antibiotic and grow to produce colonies.

• Each colony is assumed to grow from a single transformed cell.

Colonies on a plate

Page 19: Part 2: Molecular Biology IGEM, 20 June 2006. Proteins Structural and catalytic components of cells are mainly proteins. Linear polymers of 20 different

Selecting clones with the recombinant plasmid.

• Many modern vectors use ‘blue-white’ selection.• Insertion of a new piece of DNA into the multi-cloning

site disrupts a chromogenic gene (lacZ’).• Addition of IPTG (inducer) and X-gal (chromogenic

substrate) to the plates results in blue colonies if lacZ’ is intact, white if it has been disrupted.

• More generally, prepare plasmid DNA (miniprep) from each clone, digest with restriction enzymes, and analyse by agarose gel electrophoresis.

Page 20: Part 2: Molecular Biology IGEM, 20 June 2006. Proteins Structural and catalytic components of cells are mainly proteins. Linear polymers of 20 different

DNA synthesis

• In contrast to RNA synthesis, DNA synthesis requires a primer: short piece of DNA (oligonucleotide, oligo) which binds to the template strand.

• DNA Polymerase extends the primer, using the template strand as a basis for choosing which base to add. The new DNA has the same sequence as the other strand.

5’-GAATCCTGTAATCGATCGGATACGGCGATTACGA

3’-CTTAGGACATTAGCTAGCCTATGCCGCTAATGCTAATCGATCATCGGCCA-5’5’-GAATCCTGTAATCGATCGGATACGGCGATTACGA AGTAGCCGGT-3’

TTGCT

DNA polprimer new DNA

Page 21: Part 2: Molecular Biology IGEM, 20 June 2006. Proteins Structural and catalytic components of cells are mainly proteins. Linear polymers of 20 different

DNA sequencing• Sanger dideoxynucleotide chain termination method.• A primer is extended in the presence of fluorescently

labelled chain-terminating ddATP, ddCTP, ddGTP and ddTTP substrates.

• Prematurely terminated strands of defined lengths are detected by fluorescence following capillary electrophoresis and this information is used to assemble the sequence.

• About 800 bases can be read from one primer.

3’-CTTAGGACATTAGCTAGCCTATGCCGCTAATGCTAATCGATCATCGGCCA-5’5’-GAATCCTGTAATCGATCGGATACGGCGATTACGA

DNA polprimer new DNA

Green product 34 bases long ending in A.

Page 22: Part 2: Molecular Biology IGEM, 20 June 2006. Proteins Structural and catalytic components of cells are mainly proteins. Linear polymers of 20 different

Polymerase Chain Reaction• Can specifically amplify the region of DNA between two

short stretches of known sequence.• Can modify the sequence at the ends.• Requires two primers binding opposite strands.• Exponential amplification process.

Original DNA

Primers

New DNA (first round)

New DNA (subsequent)

Page 23: Part 2: Molecular Biology IGEM, 20 June 2006. Proteins Structural and catalytic components of cells are mainly proteins. Linear polymers of 20 different

Reporter genes

• Genes whose expression can be easily monitored.• lacZ – produces blue colour with X-gal.• idoA: produces blue colour.• xylE: produces yellow colour with catechol.• luxAB: glows if decanal added.• luxCDABE: glows.• luc: glows brightly if luciferin added.• gfp, rfp, cfp, yfp, ofp: cells fluorescent when viewed

under UV.• None of these systems is easy to turn off.

Page 24: Part 2: Molecular Biology IGEM, 20 June 2006. Proteins Structural and catalytic components of cells are mainly proteins. Linear polymers of 20 different

Safety Considerations

• No genetically modified organisms may be produced without a valid risk assessment approved in advance by SBS Genetic Modification and Biological Safety Committee.

• Work using disabled E. coli hosts and harmless transgenes is category 1 and may be performed in category 1 laboratories.

• Best practice must be followed to ensure that GMO do not leave the laboratory unless properly contained.

• All potentially contaminated material must be sterilised prior to disposal. All working surfaces must be disinfected and proper protective equipment worn.

Page 25: Part 2: Molecular Biology IGEM, 20 June 2006. Proteins Structural and catalytic components of cells are mainly proteins. Linear polymers of 20 different

The End

• This concludes your introduction to the wonderful world of Genetic Modification.