72
Drew Endy Stanford Department of Bioengineering The BioBricks Foundation 9 March 2009 Computing the Future of Biology & Biotechnology

Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

Drew EndyStanford Department of BioengineeringThe BioBricks Foundation

9 March 2009

Computing the Future of Biology & Biotechnology

Page 2: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

http://i.pbase.com/u47/hodero/large/30691408.jungle.jpg

Questions

What is this stuff?

Where did it come from?

How does it work?

What is it doing?

Where is it going?

Page 3: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

Image c/o http://www.rps.psu.edu/probing/graphics/earth2.jpg

How much DNA on Earth?

Page 4: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

~1E10 tons bacteria (5% DNA)~5E9 tons plants (1% DNA, made up number)~1E9 tons animals (1% DNA, made up number)~1E31 viruses (50% DNA)

How much DNA on Earth?

The pangenome concept: a unifying view of genetic information.1: Med Sci Monit. 2005 Jul;11(7):HY24-9. Epub 2005 Jun 29.

Page 5: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

How much DNA on Earth?

4E35 bp bacterial DNA4E34 bp plant DNA1E34 bp animal DNA5E30 bp virus DNA

5E35 bp DNA on the planet

Page 6: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

How quickly can we sequence?

1E10 bp per sequencing center per month

1E12 bp per month per Earth

5E23 months to sequence all DNA on earth

Page 8: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

8Carlson, Pace & Proliferation of Biological Technologies, Biosec. & Bioterror. 1(3):1 (2003)

Page 9: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

Typical NIH R01 c.2090

Specific Aim #1:

Sequence all DNA on earth in first month of project

Reviewer Comments:

Data storage & sharing plans not adequate

Functional annotation tools insufficient

Page 10: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

10

Questions.

Can we finish genetics?

Could we finish early?

Page 11: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

11

Page 12: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

12

1. Genomes have finite lengths.

2. Fixation rates of mutants in populations are finite.

3. Atrophy rates of functional genetic elements are greater than zero.

Genetics is bounded?!

Page 13: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

1. Collect or make libraries of mutants; characterize or screen.

13

Pre-Sequencing Genetics

2. Map mutants to loci.

Underlying Math = Logic

xxxxx xxxxxxxxx

xxx

xxx x

Page 14: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

1. Read DNA.

14

Post-Sequencing Genetics

2. Identify putative functional genetic elements via models.

Underlying Math = Pattern Recognition

xxxxx xxxxxxxxx

xxx

xxx x

Page 15: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

1. Design and construct any mutant or combinations thereof; characterize.

15

Post-Synthesis Genetics?

2. Reveal / confirm functions or demonstrate absence thereof.

Underlying Math =? Perturbation Design

Page 16: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

Bacteriophage

16http://www.bact.wisc.edu/themicrobialworld/Phage.html

Page 17: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

17

Key Features - self assembling - nanoscale - reproducing machine - programmed via DNA

Page 18: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

Ecological importance

18

http://svs.gsfc.nasa.gov/vis/a000000/a002400/a002497/index.html

Page 19: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

Technical importance

19

http://dspace.mit.edu/handle/1721.1/39677

Page 20: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

20

Phage T7: Pre-Sequencing

“19 genes have been identified in T7 by isolation & characterization of amber mutants... ... it is estimated that not more than another 10 or so genes remain to be discovered.”

Page 21: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

21

Phage T7: Post-Sequencing

“...coding sequences for 50 genes...”

“...evidence for expression of only 38...”

~100 genetic elements total

Page 22: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

22

Page 23: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

23

Page 24: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

24

TCTCACAGTGTACGGACCTAAAGTTCCCCCATAGGGGGTACCTAAAGCCCAGCCAATCACCTAAAGTCAACCTTCGGTTGACCTTGAGGGTTCCCTAAGGGTTGGGGATGACCCTTGGGTTTGTCTTTGGGTGTTACCTTGAGTGTCTCTCTGTGTCCCTATCTGTTACAGTCTCCTAAAGTATCCTCCTAAAGTCACCTCCTAACGTCCATCCTAAAGCCAACACCTAAAGCCTACACCTAAAGACCCATCAAGTCAACGCCTATCTTAAAGTTTAAACATAAAGACCAGACCTAAAGACCA

GACCTAAAGACACTACATAAAGACCAGACCTAAAGACGCCTTGTTGTTAGCCATAAAGTGATAACCTTTAATCATTGTCTTTATTAATACAACTCACTATAAGGAGAGACAACTTAAAGAGACTTAAAAGATTAATTTAAAATTTATCAAAAAGAGTATTGACTTAAAGTCTAACCTATAGGATACTTACAGCCATCGAGAGGGACACGGCGAATAGCCATCCCAATCGACACCGGGGTCAACCGGATAAGTAGACAGCCTGATAAGTCGCACGAAAAACAGGTATTGACAACATGAAGTAACATGC

AGTAAGATACAAATCGCTAGGTAACACTAGCAGCGTCAACCGGGCGCACAGTGCCTTCTAGGTGACTTAAGCGCACCACGGCACATAAGGTGAAACAAAACGGTTGACAACATGAAGTAAACACGGTACGATGTACCACATGAAACGACAGTGAGTCACCACACTGAAAGGTGATGCGGTCTAACGAAACCTGACCTAAGACGCTCTTTAACAATCTGGTAAATAGCTCTTGAGTGCATGACTAGCGGATAACTCAAGGGTATCGCAAGGTGCCCTTTATGATATTCACTAATAACTGCACG

AGGTAACACAAGATGGCTATGTCTAACATGACTTACAACAACGTTTTCGACCACGCTTACGAAATGCTGAAAGAAAACATCCGTTATGATGACATCCGTGACACTGATGACCTGCACGATGCTATTCACATGGCTGCCGATAATGCAGTTCCGCACTACTACGCTGACATCTTTAGCGTAATGGCAAGTGAGGGCATTGACCTTGAGTTCGAAGACTCTGGTCTGATGCCTGACACCAAGGACGTAATCCGCATCCTGCAAGCGCGTATCTATGAGCAATTAACGATTGACCTCTGGGAAGAC

GCAGAAGACTTGCTCAATGAATACTTGGAGGAAGTCGAGGAGTACGAGGAGGATGAAGAGTAATGTCTACTACCAACGTGCAATACGGTCTGACCGCTCAAACTGTACTTTTCTATAGCGACATGGTGCGCTGTGGCTTTAACTGGTCACTCGCAATGGCACAGCTCAAAGAACTGTACGAAAACAACAAGGCAATAGCTTTAGAATCTGCTGAGTGATAGACTCAAGGTCGCTCCTAGCGAGTGGCCTTTATGATTATCACTTTACTTATGAGGGAGTAATGTATATGCTTACTATCGGTCTA

CTCACCGCTCTAGGTCTAGCTGTAGGTGCATCCTTTGGGAAGGCTTTAGGTGTAGCTGTAGGTTCCTACTTTACCGCTTGCATCATCATAGGAATCATCAAAGGGGCACTACGCAAATGATGAAGCACTACGTTATGCCAATCCACACGTCCAACGGGGCAACCGTATGTACACCTGATGGGTTCGCAATGAAACAACGAATCGAACGCCTTAAGCGTGAACTCCGCATTAACCGCAAGATTAACAAGATAGGTTCCGGCTATGACAGAACGCACTGATGGCTTAAAGAAAGGTTATATGCC

CAATGGCACACTATACGCTGCAAATCGGCGAATAGTGAGAACTTGGCGAGAGAACAACCTCGAACGCCGCAAGGACAAGAGAGGGCGGCGTGGCATAGACGAAAGGAAAAGGTTAAAGCCAAGAAACTCGCCGCACTTGAACAGGCACTAGCCAACACACTGAACGCTATCTCATAACGAACATAAAGGACACAATGCAATGAACATTACCGACATCATGAACGCTATCGACGCAATCAAAGCACTGCCAATCTGTGAACTTGACAAGCGTCAAGGTATGCTTATCGACTTACTGGTCG

AGATGGTCAACAGCGAGACGTGTGATGGCGAGCTAACCGAACTAAATCAGGCACTTGAGCATCAAGATTGGTGGACTACCTTGAAGTGTCTCACGGCTGACGCAGGGTTCAAGATGCTCGGTAATGGTCACTTCTCGGCTGCTTATAGTCACCCGCTGCTACCTAACAGAGTGATTAAGGTGGGCTTTAAGAAAGAGGATTCAGGCGCAGCCTATACCGCATTCTGCCGCATGTATCAGGGTCGTCCTGGTATCCCTAACGTCTACGATGTACAGCGCCACGCTGGATGCTATACGGTGG

TACTTGACGCACTTAAGGATTGCGAGCGTTTCAACAATGATGCCCATTATAAATACGCTGAGATTGCAAGCGACATCATTGATTGCAATTCGGATGAGCATGATGAGTTAACTGGATGGGATGGTGAGTTTGTTGAAACTTGTAAACTAATCCGCAAGTTCTTTGAGGGCATCGCCTCATTCGACATGCATAGCGGGAACATCATGTTCTCAAATGGAGACGTACCATACATCACCGACCCGGTATCATTCTCGCAGAAGAAAGACGGTGGCGCATTCAGCATCGACCCTGAGGAACTCATCA

AGGAAGTCGAGGAAGTCGCACGACAGAAAGAAATTGACCGCGCTAAGGCCCGTAAAGAACGTCACGAGGGGCGCTTAGAGGCACGCAGATTCAAACGTCGCAACCGCAAGGCACGTAAAGCACACAAAGCTAAGCGCGAAAGAATGCTTGCTGCGTGGCGATGGGCTGAACGTCAAGAACGGCGTAACCATGAGGTAGCTGTAGATGTACTAGGAAGAACCAATAACGCTATGCTCTGGGTCAACATGTTCTCTGGGGACTTTAAGGCGCTTGAGGAACGAATCGCGCTGCACTGG

CGTAATGCTGACCGGATGGCTATCGCTAATGGTCTTACGCTCAACATTGATAAGCAACTTGACGCAATGTTAATGGGCTGATAGTCTTATCTTACAGGTCATCTGCGGGTGGCCTGAATAGGTACGATTTACTAACTGGAAGAGGCACTAAATGAACACGATTAACATCGCTAAGAACGACTTCTCTGACATCGAACTGGCTGCTATCCCGTTCAACACTCTGGCTGACCATTACGGTGAGCGTTTAGCTCGCGAACAGTTGGCCCTTGAGCATGAGTCTTACGAGATGGGTGAAGCACGCTT

CCGCAAGATGTTTGAGCGTCAACTTAAAGCTGGTGAGGTTGCGGATAACGCTGCCGCCAAGCCTCTCATCACTACCCTACTCCCTAAGATGATTGCACGCATCAACGACTGGTTTGAGGAAGTGAAAGCTAAGCGCGGCAAGCGCCCGACAGCCTTCCAGTTCCTGCAAGAAATCAAGCCGGAAGCCGTAGCGTACATCACCATTAAGACCACTCTGGCTTGCCTAACCAGTGCTGACAATACAACCGTTCAGGCTGTAGCAAGCGCAATCGGTCGGGCCATTGAGGACGAGGCTCG

CTTCGGTCGTATCCGTGACCTTGAAGCTAAGCACTTCAAGAAAAACGTTGAGGAACAACTCAACAAGCGCGTAGGGCACGTCTACAAGAAAGCATTTATGCAAGTTGTCGAGGCTGACATGCTCTCTAAGGGTCTACTCGGTGGCGAGGCGTGGTCTTCGTGGCATAAGGAAGACTCTATTCATGTAGGAGTACGCTGCATCGAGATGCTCATTGAGTCAACCGGAATGGTTAGCTTACACCGCCAAAATGCTGGCGTAGTAGGTCAAGACTCTGAGACTATCGAACTCGCACCTGAATA

CGCTGAGGCTATCGCAACCCGTGCAGGTGCGCTGGCTGGCATCTCTCCGATGTTCCAACCTTGCGTAGTTCCTCCTAAGCCGTGGACTGGCATTACTGGTGGTGGCTATTGGGCTAACGGTCGTCGTCCTCTGGCGCTGGTGCGTACTCACAGTAAGAAAGCACTGATGCGCTACGAAGACGTTTACATGCCTGAGGTGTACAAAGCGATTAACATTGCGCAAAACACCGCATGGAAAATCAACAAGAAAGTCCTAGCGGTCGCCAACGTAATCACCAAGTGGAAGCATTGTCCGGTC

GAGGACATCCCTGCGATTGAGCGTGAAGAACTCCCGATGAAACCGGAAGACATCGACATGAATCCTGAGGCTCTCACCGCGTGGAAACGTGCTGCCGCTGCTGTGTACCGCAAGGACAAGGCTCGCAAGTCTCGCCGTATCAGCCTTGAGTTCATGCTTGAGCAAGCCAATAAGTTTGCTAACCATAAGGCCATCTGGTTCCCTTACAACATGGACTGGCGCGGTCGTGTTTACGCTGTGTCAATGTTCAACCCGCAAGGTAACGATATGACCAAAGGACTGCTTACGCTGGCGAAAG

GTAAACCAATCGGTAAGGAAGGTTACTACTGGCTGAAAATCCACGGTGCAAACTGTGCGGGTGTCGATAAGGTTCCGTTCCCTGAGCGCATCAAGTTCATTGAGGAAAACCACGAGAACATCATGGCTTGCGCTAAGTCTCCACTGGAGAACACTTGGTGGGCTGAGCAAGATTCTCCGTTCTGCTTCCTTGCGTTCTGCTTTGAGTACGCTGGGGTACAGCACCACGGCCTGAGCTATAACTGCTCCCTTCCGCTGGCGTTTGACGGGTCTTGCTCTGGCATCCAGCACTTCTCCGC

GATGCTCCGAGATGAGGTAGGTGGTCGCGCGGTTAACTTGCTTCCTAGTGAAACCGTTCAGGACATCTACGGGATTGTTGCTAAGAAAGTCAACGAGATTCTACAAGCAGACGCAATCAATGGGACCGATAACGAAGTAGTTACCGTGACCGATGAGAACACTGGTGAAATCTCTGAGAAAGTCAAGCTGGGCACTAAGGCACTGGCTGGTCAATGGCTGGCTTACGGTGTTACTCGCAGTGTGACTAAGCGTTCAGTCATGACGCTGGCTTACGGGTCCAAAGAGTTCGGCTTCCGTC

AACAAGTGCTGGAAGATACCATTCAGCCAGCTATTGATTCCGGCAAGGGTCTGATGTTCACTCAGCCGAATCAGGCTGCTGGATACATGGCTAAGCTGATTTGGGAATCTGTGAGCGTGACGGTGGTAGCTGCGGTTGAAGCAATGAACTGGCTTAAGTCTGCTGCTAAGCTGCTGGCTGCTGAGGTCAAAGATAAGAAGACTGGAGAGATTCTTCGCAAGCGTTGCGCTGTGCATTGGGTAACTCCTGATGGTTTCCCTGTGTGGCAGGAATACAAGAAGCCTATTCAGACGCGCTTG

AACCTGATGTTCCTCGGTCAGTTCCGCTTACAGCCTACCATTAACACCAACAAAGATAGCGAGATTGATGCACACAAACAGGAGTCTGGTATCGCTCCTAACTTTGTACACAGCCAAGACGGTAGCCACCTTCGTAAGACTGTAGTGTGGGCACACGAGAAGTACGGAATCGAATCTTTTGCACTGATTCACGACTCCTTCGGTACCATTCCGGCTGACGCTGCGAACCTGTTCAAAGCAGTGCGCGAAACTATGGTTGACACATATGAGTCTTGTGATGTACTGGCTGATTTCTACGACCA

GTTCGCTGACCAGTTGCACGAGTCTCAATTGGACAAAATGCCAGCACTTCCGGCTAAAGGTAACTTGAACCTCCGTGACATCTTAGAGTCGGACTTCGCGTTCGCGTAACGCCAAATCAATACGACTCACTATAGAGGGACAAACTCAAGGTCATTCGCAAGAGTGGCCTTTATGATTGACCTTCTTCCGGTTAATACGACTCACTATAGGAGAACCTTAAGGTTTAACTTTAAGACCCTTAAGTGTTAATTAGAGATTTAAATTAAAGAATTACTAAGAGAGGACTTTAAGTATGCGTAACTTC

GAAAAGATGACCAAACGTTCTAACCGTAATGCTCGTGACTTCGAGGCAACCAAAGGTCGCAAGTTGAATAAGACTAAGCGTGACCGCTCTCACAAGCGTAGCTGGGAGGGTCAGTAAGATGGGACGTTTATATAGTGGTAATCTGGCAGCATTCAAGGCAGCAACAAACAAGCTGTTCCAGTTAGACTTAGCGGTCATTTATGATGACTGGTATGATGCCTATACAAGAAAAGATTGCATACGGTTACGTATTGAGGACAGGAGTGGAAACCTGATTGATACTAGCACCTTCTACCACCACG

ACGAGGACGTTCTGTTCAATATGTGTACTGATTGGTTGAACCATATGTATGACCAGTTGAAGGACTGGAAGTAATACGACTCAGTATAGGGACAATGCTTAAGGTCGCTCTCTAGGAGTGGCCTTAGTCATTTAACCAATAGGAGATAAACATTATGATGAACATTAAGACTAACCCGTTTAAAGCCGTGTCTTTCGTAGAGTCTGCCATTAAGAAGGCTCTGGATAACGCTGGGTATCTTATCGCTGAAATCAAGTACGATGGTGTACGCGGGAACATCTGCGTAGACAATACTGCTAACAGTTA

CTGGCTCTCTCGTGTATCTAAAACGATTCCGGCACTGGAGCACTTAAACGGGTTTGATGTTCGCTGGAAGCGTCTACTGAACGATGACCGTTGCTTCTACAAAGATGGCTTTATGCTTGATGGGGAACTCATGGTCAAGGGCGTAGACTTTAACACAGGGTCCGGCCTACTGCGTACCAAATGGACTGACACGAAGAACCAAGAGTTCCATGAAGAGTTATTCGTTGAACCAATCCGTAAGAAAGATAAAGTTCCCTTTAAGCTGCACACTGGACACCTTCACATAAAACTGTACGCTATCC

TCCCGCTGCACATCGTGGAGTCTGGAGAAGACTGTGATGTCATGACGTTGCTCATGCAGGAACACGTTAAGAACATGCTGCCTCTGCTACAGGAATACTTCCCTGAAATCGAATGGCAAGCGGCTGAATCTTACGAGGTCTACGATATGGTAGAACTACAGCAACTGTACGAGCAGAAGCGAGCAGAAGGCCATGAGGGTCTCATTGTGAAAGACCCGATGTGTATCTATAAGCGCGGTAAGAAATCTGGCTGGTGGAAAATGAAACCTGAGAACGAAGCTGACGGTATCATTCAGGGTC

TGGTATGGGGTACAAAAGGTCTGGCTAATGAAGGTAAAGTGATTGGTTTTGAGGTGCTTCTTGAGAGTGGTCGTTTAGTTAACGCCACGAATATCTCTCGCGCCTTAATGGATGAGTTCACTGAGACAGTAAAAGAGGCCACCCTAAGTCAATGGGGATTCTTTAGCCCATACGGTATTGGCGACAACGATGCTTGTACTATTAACCCTTACGATGGCTGGGCGTGTCAAATTAGCTACATGGAGGAAACACCTGATGGCTCTTTGCGGCACCCATCGTTCGTAATGTTCCGTGGCACCGAG

GACAACCCTCAAGAGAAAATGTAATCACACTGGCTCACCTTCGGGTGGGCCTTTCTGCGTTTATAAGGAGACACTTTATGTTTAAGAAGGTTGGTAAATTCCTTGCGGCTTTGGCAGCTATCCTGACGCTTGCGTATATTCTTGCGGTATACCCTCAAGTAGCACTAGTAGTAGTTGGCGCTTGTTACTTAGCGGCAGTGTGTGCTTGCGTGTGGAGTATAGTTAACTGGTAATACGACTCACTAAAGGAGGTACACACCATGATGTACTTAATGCCATTACTCATCGTCATTGTAGGATGCCTT

GCGCTCCACTGTAGCGATGATGATATGCCAGATGGTCACGCTTAATACGACTCACTAAAGGAGACACTATATGTTTCGACTTCATTACAACAAAAGCGTTAAGAATTTCACGGTTCGCCGTGCTGACCGTTCAATCGTATGTGCGAGCGAGCGCCGAGCTAAGATACCTCTTATTGGTAACACAGTTCCTTTGGCACCGAGCGTCCACATCATTATCACCCGTGGTGACTTTGAGAAAGCAATAGACAAGAAACGTCCGGTTCTTAGTGTGGCAGTGACCCGCTTCCCGTTCGTCCGTCTGT

TACTCAAACGAATCAAGGAGGTGTTCTGATGGGACTGTTAGATGGTGAAGCCTGGGAAAAAGAAAACCCGCCAGTACAAGCAACTGGGTGTATAGCTTGCTTAGAGAAAGATGACCGTTATCCACACACCTGTAACAAAGGAGCTAACGATATGACCGAACGTGAACAAGAGATGATCATTAAGTTGATAGACAATAATGAAGGTCGCCCAGATGATTTGAATGGCTGCGGTATTCTCTGCTCCAATGTCCCTTGCCACCTCTGCCCCGCAAATAACGATCAAAAGATAACCTTAGGTGAAAT

CCGAGCGATGGACCCACGTAAACCACATCTGAATAAACCTGAGGTAACTCCTACAGATGACCAGCCTTCCGCTGAGACAATCGAAGGTGTCACTAAGCCTTCCCACTACATGCTGTTTGACGACATTGAGGCTATCGAAGTGATTGCTCGTTCAATGACCGTTGAGCAGTTCAAGGGATACTGCTTCGGTAACATCTTAAAGTACAGACTACGTGCTGGTAAGAAGTCAGAGTTAGCGTACTTAGAGAAAGACCTAGCGAAAGCAGACTTCTATAAAGAACTCTTTGAGAAACATAAGGATAA

ATGTTATGCATAACTTCAAGTCAACCCCACCTGCCGACAGCCTATCTGATGACTTCACATCTTGCTCAGAGTGGTGCCGAAAGATGTGGGAAGAGACATTCGACGATGCGTACATCAAGCTGTATGAACTTTGGAAATCGAGAGGTCAATGACTATGTCAAACGTAAATACAGGTTCACTTAGTGTGGACAATAAGAAGTTTTGGGCTACCGTAGAGTCCTCGGAGCATTCCTTCGAGGTTCCAATCTACGCTGAGACCCTAGACGAAGCTCTGGAGTTAGCCGAATGGCAATACGTTCCGG

CTGGCTTTGAGGTTACTCGTGTGCGTCCTTGTGTAGCACCGAAGTAATACGACTCACTATTAGGGAAGACTCCCTCTGAGAAACCAAACGAAACCTAAAGGAGATTAACATTATGGCTAAGAAGATTTTCACCTCTGCGCTGGGTACCGCTGAACCTTACGCTTACATCGCCAAGCCGGACTACGGCAACGAAGAGCGTGGCTTTGGGAACCCTCGTGGTGTCTATAAAGTTGACCTGACTATTCCCAACAAAGACCCGCGCTGCCAGCGTATGGTCGATGAAATCGTGAAGTGTCACGA

AGAGGCTTATGCTGCTGCCGTTGAGGAATACGAAGCTAATCCACCTGCTGTAGCTCGTGGTAAGAAACCGCTGAAACCGTATGAGGGTGACATGCCGTTCTTCGATAACGGTGACGGTACGACTACCTTTAAGTTCAAATGCTACGCGTCTTTCCAAGACAAGAAGACCAAAGAGACCAAGCACATCAATCTGGTTGTGGTTGACTCAAAAGGTAAGAAGATGGAAGACGTTCCGATTATCGGTGGTGGCTCTAAGCTGAAAGTTAAATATTCTCTGGTTCCATACAAGTGGAACACTGCT

GTAGGTGCGAGCGTTAAGCTGCAACTGGAATCCGTGATGCTGGTCGAACTGGCTACCTTTGGTGGCGGTGAAGACGATTGGGCTGACGAAGTTGAAGAGAACGGCTATGTTGCCTCTGGTTCTGCCAAAGCGAGCAAACCACGCGACGAAGAAAGCTGGGACGAAGACGACGAAGAGTCCGAGGAAGCAGACGAAGACGGAGACTTCTAAGTGGAACTGCGGGAGAAAATCCTTGAGCGAATCAAGGTGACTTCCTCTGGGTGTTGGGAGTGGCAGGGCGCTACGAACAATAAA

GGGTACGGGCAGGTGTGGTGCAGCAATACCGGAAAGGTTGTCTACTGTCATCGCGTAATGTCTAATGCTCCGAAAGGTTCTACCGTCCTGCACTCCTGTGATAATCCATTATGTTGTAACCCTGAACACCTATCCATAGGAACTCCAAAAGAGAACTCCACTGACATGGTAAATAAGGGTCGCTCACACAAGGGGTATAAACTTTCAGACGAAGACGTAATGGCAATCATGGAGTCCAGCGAGTCCAATGTATCCTTAGCTCGCACCTATGGTGTCTCCCAACAGACTATTTGTGATATACGC

AAAGGGAGGCGACATGGCAGGTTACGGCGCTAAAGGAATCCGAAAGGTTGGAGCGTTTCGCTCTGGCCTAGAGGACAAGGTTTCAAAGCAGTTGGAATCAAAAGGTATTAAATTCGAGTATGAAGAGTGGAAAGTGCCTTATGTAATTCCGGCGAGCAATCACACTTACACTCCAGACTTCTTACTTCCAAACGGTATATTCGTTGAGACAAAGGGTCTGTGGGAAAGCGATGATAGAAAGAAGCACTTATTAATTAGGGAGCAGCACCCCGAGCTAGACATCCGTATTGTCTTCTCAAGC

TCACGTACTAAGTTATACAAAGGTTCTCCAACGTCTTATGGAGAGTTCTGCGAAAAGCATGGTATTAAGTTCGCTGATAAACTGATACCTGCTGAGTGGATAAAGGAACCCAAGAAGGAGGTCCCCTTTGATAGATTAAAAAGGAAAGGAGGAAAGAAATAATGGCTCGTGTACAGTTTAAACAACGTGAATCTACTGACGCAATCTTTGTTCACTGCTCGGCTACCAAGCCAAGTCAGAATGTTGGTGTCCGTGAGATTCGCCAGTGGCACAAAGAGCAGGGTTGGCTCGATGTGGGATAC

CACTTTATCATCAAGCGAGACGGTACTGTGGAGGCAGGACGAGATGAGATGGCTGTAGGCTCTCACGCTAAGGGTTACAACCACAACTCTATCGGCGTCTGCCTTGTTGGTGGTATCGACGATAAAGGTAAGTTCGACGCTAACTTTACGCCAGCCCAAATGCAATCCCTTCGCTCACTGCTTGTCACACTGCTGGCTAAGTACGAAGGCGCTGTGCTTCGCGCCCATCATGAGGTGGCGCCGAAGGCTTGCCCTTCGTTCGACCTTAAGCGTTGGTGGGAGAAGAACGAACTGGTCA

CTTCTGACCGTGGATAATTAATTGAACTCACTAAAGGGAGACCACAGCGGTTTCCCTTTGTTCGCATTGGAGGTCAAATAATGCGCAAGTCTTATAAACAATTCTATAAGGCTCCGAGGAGGCATATCCAAGTGTGGGAGGCAGCCAATGGGCCTATACCAAAAGGTTATTATATAGACCACATTGACGGCAATCCACTCAACGACGCCTTAGACAATCTCCGTCTGGCTCTCCCAAAAGAAAACTCATGGAACATGAAGACTCCAAAGAGCAATACCTCAGGACTAAAGGGACTGAGTTGGA

GCAAGGAAAGGGAGATGTGGAGAGGCACTGTAACAGCTGAGGGTAAACAGCATAACTTTCGTAGTAGAGATCTATTGGAAGTCGTTGCGTGGATTTATAGAACTAGGAGGGAATTGCATGGACAATTCGCACGATTCCGATAGTGTATTTCTTTACCACATTCCTTGTGACAACTGTGGGAGTAGTGATGGGAACTCGCTGTTCTCTGACGGACACACGTTCTGCTACGTATGCGAGAAGTGGACTGCTGGTAATGAAGACACTAAAGAGAGGGCTTCAAAACGGAAACCCTCAGGAGGTA

AACCAATGACTTACAACGTGTGGAACTTCGGGGAATCCAATGGACGCTACTCCGCGTTAACTGCGAGAGGAATCTCCAAGGAAACCTGTCAGAAGGCTGGCTACTGGATTGCCAAAGTAGACGGTGTGATGTACCAAGTGGCTGACTATCGGGACCAGAACGGCAACATTGTGAGTCAGAAGGTTCGAGATAAAGATAAGAACTTTAAGACCACTGGTAGTCACAAGAGTGACGCTCTGTTCGGGAAGCACTTGTGGAATGGTGGTAAGAAGATTGTCGTTACAGAAGGTGAAATCGAC

ATGCTTACCGTGATGGAACTTCAAGACTGTAAGTATCCTGTAGTGTCGTTGGGTCACGGTGCCTCTGCCGCTAAGAAGACATGCGCTGCCAACTACGAATACTTTGACCAGTTCGAACAGATTATCTTAATGTTCGATATGGACGAAGCAGGGCGCAAAGCAGTCGAAGAGGCTGCACAGGTTCTACCTGCTGGTAAGGTACGAGTGGCAGTTCTTCCGTGTAAGGATGCAAACGAGTGTCACCTAAATGGTCACGACCGTGAAATCATGGAGCAAGTGTGGAATGCTGGTCCTTGGATT

CCTGATGGTGTGGTATCGGCTCTTTCGTTACGTGAACGAATCCGTGAGCACCTATCGTCCGAGGAATCAGTAGGTTTACTTTTCAGTGGCTGCACTGGTATCAACGATAAGACCTTAGGTGCCCGTGGTGGTGAAGTCATTATGGTCACTTCCGGTTCCGGTATGGGTAAGTCAACGTTCGTCCGTCAACAAGCTCTACAATGGGGCACAGCGATGGGCAAGAAGGTAGGCTTAGCGATGCTTGAGGAGTCCGTTGAGGAGACCGCTGAGGACCTTATAGGTCTACACAACCGTGTCCG

ACTGAGACAATCCGACTCACTAAAGAGAGAGATTATTGAGAACGGTAAGTTCGACCAATGGTTCGATGAACTGTTCGGCAACGATACGTTCCATCTATATGACTCATTCGCCGAGGCTGAGACGGATAGACTGCTCGCTAAGCTGGCCTACATGCGCTCAGGCTTGGGCTGTGACGTAATCATTCTAGACCACATCTCAATCGTCGTATCCGCTTCTGGTGAATCCGATGAGCGTAAGATGATTGACAACCTGATGACCAAGCTCAAAGGGTTCGCTAAGTCAACTGGGGTGGTGCTGGTC

GTAATTTGTCACCTTAAGAACCCAGACAAAGGTAAAGCACATGAGGAAGGTCGCCCCGTTTCTATTACTGACCTACGTGGTTCTGGCGCACTACGCCAACTATCTGATACTATTATTGCCCTTGAGCGTAATCAGCAAGGCGATATGCCTAACCTTGTCCTCGTTCGTATTCTCAAGTGCCGCTTTACTGGTGATACTGGTATCGCTGGCTACATGGAATACAACAAGGAAACCGGATGGCTTGAACCATCAAGTTACTCAGGGGAAGAAGAGTCACACTCAGAGTCAACAGACTGGTCCAAC

GACACTGACTTCTGACAGGATTCTTGATGACTTTCCAGACGACTACGAGAAGTTTCGCTGGAGAGTCCCATTCTAATACGACTCACTAAAGGAGACACACCATGTTCAAACTGATTAAGAAGTTAGGCCAACTGCTGGTTCGTATGTACAACGTGGAAGCCAAGCGACTGAACGATGAGGCTCGTAAAGAGGCCACACAGTCACGCGCTCTGGCGATTCGCTCCAACGAACTGGCTGACAGTGCATCCACTAAAGTTACCGAGGCTGCCCGTGTGGCAAACCAAGCTCAACAGCTTTCC

AAATTCTTTGAGTAATCAAACAGGAGAAACCATTATGTCTAACGTAGCTGAAACTATCCGTCTATCCGATACAGCTGACCAGTGGAACCGTCGAGTCCACATCAACGTTCGCAACGGTAAGGCGACTATGGTTTACCGCTGGAAGGACTCTAAGTCCTCTAAGAATCACACTCAGCGTATGACGTTGACAGATGAGCAAGCACTGCGTCTGGTCAATGCGCTTACCAAAGCTGCCGTGACAGCAATTCATGAAGCTGGTCGCGTCAATGAAGCTATGGCTATCCTCGACAAGATTGATAACT

AAGAGTGGTATCCTCAAGGTCGCCAAAGTGGTGGCCTTCATGAATACTATTCGACTCACTATAGGAGATATTACCATGCGTGACCCTAAAGTTATCCAAGCAGAAATCGCTAAACTGGAAGCTGAACTGGAGGACGTTAAGTACCATGAAGCTAAGACTCGCTCCGCTGTTCACATCTTGAAGAACTTAGGCTGGACTTGGACAAGACAGACTGGCTGGAAGAAACCAGAAGTTACCAAGCTGAGTCATAAGGTGTTCGATAAGGACACTATGACCCACATCAAGGCTGGTGATTGGGTTAA

GGTTGACATGGGAGTTGTTGGTGGATACGGCTACGTCCGCTCAGTTAGTGGCAAATATGCACAAGTGTCATACATCACAGGTGTTACTCCACGCGGTGCAATCGTTGCCGATAAGACCAACATGATTCACACAGGTTTCTTGACAGTTGTTTCATATGAAGAGATTGTTAAGTCACGATAATCAATAGGAGAAATCAATATGATCGTTTCTGACATCGAAGCTAACGCCCTCTTAGAGAGCGTCACTAAGTTCCACTGCGGGGTTATCTACGACTACTCCACCGCTGAGTACGTAAGCTACCGT

CCGAGTGACTTCGGTGCGTATCTGGATGCGCTGGAAGCCGAGGTTGCACGAGGCGGTCTTATTGTGTTCCACAACGGTCACAAGTATGACGTTCCTGCATTGACCAAACTGGCAAAGTTGCAATTGAACCGAGAGTTCCACCTTCCTCGTGAGAACTGTATTGACACCCTTGTGTTGTCACGTTTGATTCATTCCAACCTCAAGGACACCGATATGGGTCTTCTGCGTTCCGGCAAGTTGCCCGGAAAACGCTTTGGGTCTCACGCTTTGGAGGCGTGGGGTTATCGCTTAGGCGAGAT

GAAGGGTGAATACAAAGACGACTTTAAGCGTATGCTTGAAGAGCAGGGTGAAGAATACGTTGACGGAATGGAGTGGTGGAACTTCAACGAAGAGATGATGGACTATAACGTTCAGGACGTTGTGGTAACTAAAGCTCTCCTTGAGAAGCTACTCTCTGACAAACATTACTTCCCTCCTGAGATTGACTTTACGGACGTAGGATACACTACGTTCTGGTCAGAATCCCTTGAGGCCGTTGACATTGAACATCGTGCTGCATGGCTGCTCGCTAAACAAGAGCGCAACGGGTTCCCGTTTGAC

ACAAAAGCAATCGAAGAGTTGTACGTAGAGTTAGCTGCTCGCCGCTCTGAGTTGCTCCGTAAATTGACCGAAACGTTCGGCTCGTGGTATCAGCCTAAAGGTGGCACTGAGATGTTCTGCCATCCGCGAACAGGTAAGCCACTACCTAAATACCCTCGCATTAAGACACCTAAAGTTGGTGGTATCTTTAAGAAGCCTAAGAACAAGGCACAGCGAGAAGGCCGTGAGCCTTGCGAACTTGATACCCGCGAGTACGTTGCTGGTGCTCCTTACACCCCAGTTGAACATGTTGTGTTTAAC

CCTTCGTCTCGTGACCACATTCAGAAGAAACTCCAAGAGGCTGGGTGGGTCCCGACCAAGTACACCGATAAGGGTGCTCCTGTGGTGGACGATGAGGTACTCGAAGGAGTACGTGTAGATGACCCTGAGAAGCAAGCCGCTATCGACCTCATTAAAGAGTACTTGATGATTCAGAAGCGAATCGGACAGTCTGCTGAGGGAGACAAAGCATGGCTTCGTTATGTTGCTGAGGATGGTAAGATTCATGGTTCTGTTAACCCTAATGGAGCAGTTACGGGTCGTGCGACCCATGCGTTCCC

AAACCTTGCGCAAATTCCGGGTGTACGTTCTCCTTATGGAGAGCAGTGTCGCGCTGCTTTTGGCGCTGAGCACCATTTGGATGGGATAACTGGTAAGCCTTGGGTTCAGGCTGGCATCGACGCATCCGGTCTTGAGCTACGCTGCTTGGCTCACTTCATGGCTCGCTTTGATAACGGCGAGTACGCTCACGAGATTCTTAACGGCGACATCCACACTAAGAACCAGATAGCTGCTGAACTACCTACCCGAGATAACGCTAAGACGTTCATCTATGGGTTCCTCTATGGTGCTGGTGATGA

GAAGATTGGACAGATTGTTGGTGCTGGTAAAGAGCGCGGTAAGGAACTCAAGAAGAAATTCCTTGAGAACACCCCCGCGATTGCAGCACTCCGCGAGTCTATCCAACAGACACTTGTCGAGTCCTCTCAATGGGTAGCTGGTGAGCAACAAGTCAAGTGGAAACGCCGCTGGATTAAAGGTCTGGATGGTCGTAAGGTACACGTTCGTAGTCCTCACGCTGCCTTGAATACCCTACTGCAATCTGCTGGTGCTCTCATCTGCAAACTGTGGATTATCAAGACCGAAGAGATGCTCGTAG

AGAAAGGCTTGAAGCATGGCTGGGATGGGGACTTTGCGTACATGGCATGGGTACATGATGAAATCCAAGTAGGCTGCCGTACCGAAGAGATTGCTCAGGTGGTCATTGAGACCGCACAAGAAGCGATGCGCTGGGTTGGAGACCACTGGAACTTCCGGTGTCTTCTGGATACCGAAGGTAAGATGGGTCCTAATTGGGCGATTTGCCACTGATACAGGAGGCTACTCATGAACGAAAGACACTTAACAGGTGCTGCTTCTGAAATGCTAGTAGCCTACAAATTTACCAAAGCTGGGTAC

ACTGTCTATTACCCTATGCTGACTCAGAGTAAAGAGGACTTGGTTGTATGTAAGGATGGTAAATTTAGTAAGGTTCAGGTTAAAACAGCCACAACGGTTCAAACCAACACAGGAGATGCCAAGCAGGTTAGGCTAGGTGGATGCGGTAGGTCCGAATATAAGGATGGAGACTTTGACATTCTTGCGGTTGTGGTTGACGAAGATGTGCTTATTTTCACATGGGACGAAGTAAAAGGTAAGACATCCATGTGTGTCGGCAAGAGAAACAAAGGCATAAAACTATAGGAGAAATTATTATGGCTAT

GACAAAGAAATTTAAAGTGTCCTTCGACGTTACCGCAAAGATGTCGTCTGACGTTCAGGCAATCTTAGAGAAAGATATGCTGCATCTATGTAAGCAGGTCGGCTCAGGTGCGATTGTCCCCAATGGTAAACAGAAGGAAATGATTGTCCAGTTCCTGACACACGGTATGGAAGGATTGATGACATTCGTAGTACGTACATCATTTCGTGAGGCCATTAAGGACATGCACGAAGAGTATGCAGATAAGGACTCTTTCAAACAATCTCCTGCAACAGTACGGGAGGTGTTCTGATGTCTGACTAC

CTGAAAGTGCTGCAAGCAATCAAAAGTTGCCCTAAGACTTTCCAGTCCAACTATGTACGGAACAATGCGAGCCTCGTAGCGGAGGCCGCTTCCCGTGGTCACATCTCGTGCCTGACTACTAGTGGACGTAACGGTGGCGCTTGGGAAATCACTGCTTCCGGTACTCGCTTTCTGAAACGAATGGGAGGATGTGTCTAATGTCTCGTGACCTTGTGACTATTCCACGCGATGTGTGGAACGATATACAGGGCTACATCGACTCTCTGGAACGTGAGAACGATAGCCTTAAGAATCAACTAAT

GGAAGCTGACGAATACGTAGCGGAACTAGAGGAGAAACTTAATGGCACTTCTTGACCTTAAACAATTCTATGAGTTACGTGAAGGCTGCGACGACAAGGGTATCCTTGTGATGGACGGCGACTGGCTGGTCTTCCAAGCTATGAGTGCTGCTGAGTTTGATGCCTCTTGGGAGGAAGAGATTTGGCACCGATGCTGTGACCACGCTAAGGCCCGTCAGATTCTTGAGGATTCCATTAAGTCCTACGAGACCCGTAAGAAGGCTTGGGCAGGTGCTCCAATTGTCCTTGCGTTCACCGAT

AGTGTTAACTGGCGTAAAGAACTGGTTGACCCGAACTATAAGGCTAACCGTAAGGCCGTGAAGAAACCTGTAGGGTACTTTGAGTTCCTTGATGCTCTCTTTGAGCGCGAAGAGTTCTATTGCATCCGTGAGCCTATGCTTGAGGGTGATGACGTTATGGGAGTTATTGCTTCCAATCCGTCTGCCTTCGGTGCTCGTAAGGCTGTAATCATCTCTTGCGATAAGGACTTTAAGACCATCCCTAACTGTGACTTCCTGTGGTGTACCACTGGTAACATCCTGACTCAGACCGAAGAGTCCGC

TGACTGGTGGCACCTCTTCCAGACCATCAAGGGTGACATCACTGATGGTTACTCAGGGATTGCTGGATGGGGTGATACCGCCGAGGACTTCTTGAATAACCCGTTCATAACCGAGCCTAAAACGTCTGTGCTTAAGTCCGGTAAGAACAAAGGCCAAGAGGTTACTAAATGGGTTAAACGCGACCCTGAGCCTCATGAGACGCTTTGGGACTGCATTAAGTCCATTGGCGCGAAGGCTGGTATGACCGAAGAGGATATTATCAAGCAGGGCCAAATGGCTCGAATCCTACGGTTCAACGA

GTACAACTTTATTGACAAGGAGATTTACCTGTGGAGACCGTAGCGTATATTGGTCTGGGTCTTTGTGTTCTCGGAGTGTGCCTCATTTCGTGGGGCCTTTGGGACTTAGCCAGAATAATCAAGTCGTTACACGACACTAAGTGATAAACTCAAGGTCCCTAAATTAATACGACTCACTATAGGGAGATAGGGGCCTTTACGATTATTACTTTAAGATTTAACTCTAAGAGGAATCTTTATTATGTTAACACCTATTAACCAATTACTTAAGAACCCTAACGATATTCCAGATGTACCTCGTGCAACCG

CTGAGTATCTACAGGTTCGATTCAACTATGCGTACCTCGAAGCGTCTGGTCATATAGGACTTATGCGTGCTAATGGTTGTAGTGAGGCCCACATCTTGGGTTTCATTCAGGGCCTACAGTATGCCTCTAACGTCATTGACGAGATTGAGTTACGCAAGGAACAACTAAGAGATGATGGGGAGGATTGACACTATGTGTTTCTCACCGAAAATTAAAACTCCGAAGATGGATACCAATCAGATTCGAGCCGTTGAGCCAGCGCCTCTGACCCAAGAAGTGTCAAGCGTGGAGTTCGGTGGGTC

TTCTGATGAGACGGATACCGAGGGCACCGAAGTGTCTGGACGCAAAGGCCTCAAGGTCGAACGTGATGATTCCGTAGCGAAGTCTAAAGCCAGCGGCAATGGCTCCGCTCGTATGAAATCTTCCATCCGTAAGTCCGCATTTGGAGGTAAGAAGTGATGTCTGAGTTCACATGTGTGGAGGCTAAGAGTCGCTTCCGTGCAATCCGGTGGACTGTGGAACACCTTGGGTTGCCTAAAGGATTCGAAGGACACTTTGTGGGCTACAGCCTCTACGTAGACGAAGTGATGGACATGTCTG

GTTGCCGTGAAGAGTACATTCTGGACTCTACCGGAAAACATGTAGCGTACTTCGCGTGGTGCGTAAGCTGTGACATTCACCACAAAGGAGACATTCTGGATGTAACGTCCGTTGTCATTAATCCTGAGGCAGACTCTAAGGGCTTACAGCGATTCCTAGCGAAACGCTTTAAGTACCTTGCGGAACTCCACGATTGCGATTGGGTGTCTCGTTGTAAGCATGAAGGCGAGACAATGCGTGTATACTTTAAGGAGGTATAAGTTATGGGTAAGAAAGTTAAGAAGGCCGTGAAGAAAGTCAC

CAAGTCCGTTAAGAAAGTCGTTAAGGAAGGGGCTCGTCCGGTTAAACAGGTTGCTGGCGGTCTAGCTGGTCTGGCTGGTGGTACTGGTGAAGCACAGATGGTGGAAGTACCACAAGCTGCCGCACAGATTGTTGACGTACCTGAGAAAGAGGTTTCCACTGAGGACGAAGCACAGACAGAAAGCGGACGCAAGAAAGCTCGTGCTGGCGGTAAGAAATCCTTGAGTGTAGCCCGTAGCTCCGGTGGCGGTATCAACATTTAATCAGGAGGTTATCGTGGAAGACTGCATTGAATGGA

CCGGAGGTGTCAACTCTAAGGGTTATGGTCGTAAGTGGGTTAATGGTAAACTTGTGACTCCACATAGGCACATCTATGAGGAGACATATGGTCCAGTTCCAACAGGAATTGTGGTGATGCATATCTGCGATAACCCTAGGTGCTATAACATAAAGCACCTTACGCTTGGAACTCCAAAGGATAATTCCGAGGACATGGTTACCAAAGGTAGACAGGCTAAAGGAGAGGAACTAAGCAAGAAACTTACAGAGTCAGACGTTCTCGCTATACGCTCTTCAACCTTAAGCCACCGCTCCTTAGGAG

AACTGTATGGAGTCAGTCAATCAACCATAACGCGAATACTACAGCGTAAGACATGGAGACACATTTAATGGCTGAGAAACGAACAGGACTTGCGGAGGATGGCGCAAAGTCTGTCTATGAGCGTTTAAAGAACGACCGTGCTCCCTATGAGACACGCGCTCAGAATTGCGCTCAATATACCATCCCATCATTGTTCCCTAAGGACTCCGATAACGCCTCTACAGATTATCAAACTCCGTGGCAAGCCGTGGGCGCTCGTGGTCTGAACAATCTAGCCTCTAAGCTCATGCTGGCTCTATTCC

CTATGCAGACTTGGATGCGACTTACTATATCTGAATATGAAGCAAAGCAGTTACTGAGCGACCCCGATGGACTCGCTAAGGTCGATGAGGGCCTCTCGATGGTAGAGCGTATCATCATGAACTACATTGAGTCTAACAGTTACCGCGTGACTCTCTTTGAGGCTCTCAAACAGTTAGTCGTAGCTGGTAACGTCCTGCTGTACCTACCGGAACCGGAAGGGTCAAACTATAATCCCATGAAGCTGTACCGATTGTCTTCTTATGTGGTCCAACGAGACGCATTCGGCAACGTTCTGCAAATG

GTGACTCGTGACCAGATAGCTTTTGGTGCTCTCCCTGAGGACATCCGTAAGGCTGTAGAAGGTCAAGGTGGTGAGAAGAAAGCTGATGAGACAATCGACGTGTACACTCACATCTATCTGGATGAGGACTCAGGTGAATACCTCCGATACGAAGAGGTCGAGGGTATGGAAGTCCAAGGCTCCGATGGGACTTATCCTAAAGAGGCTTGCCCATACATCCCGATTCGGATGGTCAGACTAGATGGTGAATCCTACGGTCGTTCGTACATTGAGGAATACTTAGGTGACTTACGGTCCCTT

GAAAATCTCCAAGAGGCTATCGTCAAGATGTCCATGATTAGCTCTAAGGTTATCGGCTTAGTGAATCCTGCTGGTATCACCCAGCCACGCCGACTGACCAAAGCTCAGACTGGTGACTTCGTTACTGGTCGTCCAGAAGACATCTCGTTCCTCCAACTGGAGAAGCAAGCAGACTTTACTGTAGCTAAAGCCGTAAGTGACGCTATCGAGGCTCGCCTTTCGTTTGCCTTTATGTTGAACTCTGCGGTTCAGCGTACAGGTGAACGTGTGACCGCCGAAGAGATTCGGTATGTAGCTTCTG

AACTTGAAGATACTTTAGGTGGTGTCTACTCTATCCTTTCTCAAGAATTACAATTGCCTCTGGTACGAGTGCTCTTGAAGCAACTACAAGCCACGCAACAGATTCCTGAGTTACCTAAGGAAGCCGTAGAGCCAACCATTAGTACAGGTCTGGAAGCAATTGGTCGAGGACAAGACCTTGATAAGCTGGAGCGGTGTGTCACTGCGTGGGCTGCACTGGCACCTATGCGGGACGACCCTGATATTAACCTTGCGATGATTAAGTTACGTATTGCCAACGCTATCGGTATTGACACTTCTGGTA

TTCTACTCACCGAAGAACAGAAGCAACAGAAGATGGCCCAACAGTCTATGCAAATGGGTATGGATAATGGTGCTGCTGCGCTGGCTCAAGGTATGGCTGCACAAGCTACAGCTTCACCTGAGGCTATGGCTGCTGCCGCTGATTCCGTAGGTTTACAGCCGGGAATTTAATACGACTCACTATAGGGAGACCTCATCTTTGAAATGAGCGATGACAAGAGGTTGGAGTCCTCGGTCTTCCTGTAGTTCAACTTTAAGGAGACAATAATAATGGCTGAATCTAATGCAGACGTATATGCATCTT

TTGGCGTGAACTCCGCTGTGATGTCTGGTGGTTCCGTTGAGGAACATGAGCAGAACATGCTGGCTCTTGATGTTGCTGCCCGTGATGGCGATGATGCAATCGAGTTAGCGTCAGACGAAGTGGAAACAGAACGTGACCTGTATGACAACTCTGACCCGTTCGGTCAAGAGGATGACGAAGGCCGCATTCAGGTTCGTATCGGTGATGGCTCTGAGCCGACCGATGTGGACACTGGAGAAGAAGGCGTTGAGGGCACCGAAGGTTCCGAAGAGTTTACCCCACTGGGCGAGACTCCA

GAAGAACTGGTAGCTGCCTCTGAGCAACTTGGTGAGCACGAAGAGGGCTTCCAAGAGATGATTAACATTGCTGCTGAGCGTGGCATGAGTGTCGAGACCATTGAGGCTATCCAGCGTGAGTACGAGGAGAACGAAGAGTTGTCCGCCGAGTCCTACGCTAAGCTGGCTGAAATTGGCTACACGAAGGCTTTCATTGACTCGTATATCCGTGGTCAAGAAGCTCTGGTGGAGCAGTACGTAAACAGTGTCATTGAGTACGCTGGTGGTCGTGAACGTTTTGATGCACTGTATAACCACCT

TGAGACGCACAACCCTGAGGCTGCACAGTCGCTGGATAATGCGTTGACCAATCGTGACTTAGCGACCGTTAAGGCTATCATCAACTTGGCTGGTGAGTCTCGCGCTAAGGCGTTCGGTCGTAAGCCAACTCGTAGTGTGACTAATCGTGCTATTCCGGCTAAACCTCAGGCTACCAAGCGTGAAGGCTTTGCGGACCGTAGCGAGATGATTAAAGCTATGAGTGACCCTCGGTATCGCACAGATGCCAACTATCGTCGTCAAGTCGAACAGAAAGTAATCGATTCGAACTTCTGATAGAC

TTCGAAATTAATACGACTCACTATAGGGAGACCACAACGGTTTCCCTCTAGAAATAATTTTGTTTAACTTTAAGAAGGAGATATACATATGGCTAGCATGACTGGTGGACAGCAAATGGGTACTAACCAAGGTAAAGGTGTAGTTGCTGCTGGAGATAAACTGGCGTTGTTCTTGAAGGTATTTGGCGGTGAAGTCCTGACTGCGTTCGCTCGTACCTCCGTGACCACTTCTCGCCACATGGTACGTTCCATCTCCAGCGGTAAATCCGCTCAGTTCCCTGTTCTGGGTCGCACTCAGGCAGC

GTATCTGGCTCCGGGCGAGAACCTCGACGATAAACGTAAGGACATCAAACACACCGAGAAGGTAATCACCATTGACGGTCTCCTGACGGCTGACGTTCTGATTTATGATATTGAGGACGCGATGAACCACTACGACGTTCGCTCTGAGTATACCTCTCAGTTGGGTGAATCTCTGGCGATGGCTGCGGATGGTGCGGTTCTGGCTGAGATTGCCGGTCTGTGTAACGTGGAAAGCAAATATAATGAGAACATCGAGGGCTTAGGTACTGCTACCGTAATTGAGACCACTCAGAACAAGGC

CGCACTTACCGACCAAGTTGCGCTGGGTAAGGAGATTATTGCGGCTCTGACTAAGGCTCGTGCGGCTCTGACCAAGAACTATGTTCCGGCTGCTGACCGTGTGTTCTACTGTGACCCAGATAGCTACTCTGCGATTCTGGCAGCACTGATGCCGAACGCAGCAAACTACGCTGCTCTGATTGACCCTGAGAAGGGTTCTATCCGCAACGTTATGGGCTTTGAGGTTGTAGAAGTTCCGCACCTCACCGCTGGTGGTGCTGGTACCGCTCGTGAGGGCACTACTGGTCAGAAGCACGTC

TTCCCTGCCAATAAAGGTGAGGGTAATGTCAAGGTTGCTAAGGACAACGTTATCGGCCTGTTCATGCACCGCTCTGCGGTAGGTACTGTTAAGCTGCGTGACTTGGCTCTGGAGCGCGCTCGCCGTGCTAACTTCCAAGCGGACCAGATTATCGCTAAGTACGCAATGGGCCACGGTGGTCTTCGCCCAGAAGCTGCTGGTGCAGTGGTTTTCAAAGTGGAGTAATGCTGGGGGTGGCCTCAACGGTCGCTGCTAGTCCCGAAGAGGCGAGTGTTACTTCAACAGAAGAAACCTTAA

CGCCAGCACAGGAGGCCGCACGCACCCGCGCTGCTAACAAAGCCCGAAAGGAAGCTGAGTTGGCTGCTGCCACCGCTGAGCAATAACTAGCATAACCCCTTGGGGCCTCTAAACGGGTCTTGAGGGGTTTTTTGCTGAAAGGAGGAACTATATGCGCTCATACGATATGAACGTTGAGACTGCCGCTGAGTTATCAGCTGTGAACGACATTCTGGCGTCTATCGGTGAACCTCCGGTATCAACGCTGGAAGGTGACGCTAACGCAGATGCAGCGAACGCTCGGCGTATTCTCAACAA

GATTAACCGACAGATTCAATCTCGTGGATGGACGTTCAACATTGAGGAAGGCATAACGCTACTACCTGATGTTTACTCCAACCTGATTGTATACAGTGACGACTATTTATCCCTAATGTCTACTTCCGGTCAATCCATCTACGTTAACCGAGGTGGCTATGTGTATGACCGAACGAGTCAATCAGACCGCTTTGACTCTGGTATTACTGTGAACATTATTCGTCTCCGCGACTACGATGAGATGCCTGAGTGCTTCCGTTACTGGATTGTCACCAAGGCTTCCCGTCAGTTCAACAACCGATTCT

TTGGGGCACCGGAAGTAGAGGGTGTACTCCAAGAAGAGGAAGATGAGGCTAGACGTCTCTGCATGGAGTATGAGATGGACTACGGTGGGTACAATATGCTGGATGGAGATGCGTTCACTTCTGGTCTACTGACTCGCTAACATTAATAAATAAGGAGGCTCTAATGGCACTCATTAGCCAATCAATCAAGAACTTGAAGGGTGGTATCAGCCAACAGCCTGACATCCTTCGTTATCCAGACCAAGGGTCACGCCAAGTTAACGGTTGGTCTTCGGAGACCGAGGGCCTCCAAAAGCGTCC

ACCTCTTGTTTTCTTAAATACACTTGGAGACAACGGTGCGTTAGGTCAAGCTCCGTACATCCACCTGATTAACCGAGATGAGCACGAACAGTATTACGCTGTGTTCACTGGTAGCGGAATCCGAGTGTTCGACCTTTCTGGTAACGAGAAGCAAGTTAGGTATCCTAACGGTTCCAACTACATCAAGACCGCTAATCCACGTAACGACCTGCGAATGGTTACTGTAGCAGACTATACGTTCATCGTTAACCGTAACGTTGTTGCACAGAAGAACACAAAGTCTGTCAACTTACCGAATTACAAC

CCTAATCAAGACGGATTGATTAACGTTCGTGGTGGTCAGTATGGTAGGGAACTAATTGTACACATTAACGGTAAAGACGTTGCGAAGTATAAGATACCAGATGGTAGTCAACCTGAACACGTAAACAATACGGATGCCCAATGGTTAGCTGAAGAGTTAGCCAAGCAGATGCGCACTAACTTGTCTGATTGGACTGTAAATGTAGGGCAAGGGTTCATCCATGTGACCGCACCTAGTGGTCAACAGATTGACTCCTTCACGACTAAAGATGGCTACGCAGACCAGTTGATTAACCCTGTGACC

CACTACGCTCAGTCGTTCTCTAAGCTGCCACCTAATGCTCCTAACGGCTACATGGTGAAAATCGTAGGGGACGCCTCTAAGTCTGCCGACCAGTATTACGTTCGGTATGACGCTGAGCGGAAAGTTTGGACTGAGACTTTAGGTTGGAACACTGAGGACCAAGTTCTATGGGAAACCATGCCACACGCTCTTGTGCGAGCCGCTGACGGTAATTTCGACTTCAAGTGGCTTGAGTGGTCTCCTAAGTCTTGTGGTGACGTTGACACCAACCCTTGGCCTTCTTTTGTTGGTTCAAGTATTA

ACGATGTGTTCTTCTTCCGTAACCGCTTAGGATTCCTTAGTGGGGAGAACATCATATTGAGTCGTACAGCCAAATACTTCAACTTCTACCCTGCGTCCATTGCGAACCTTAGTGATGACGACCCTATAGACGTAGCTGTGAGTACCAACCGAATAGCAATCCTTAAGTACGCCGTTCCGTTCTCAGAAGAGTTACTCATCTGGTCCGATGAAGCACAATTCGTCCTGACTGCCTCGGGTACTCTCACATCTAAGTCGGTTGAGTTGAACCTAACGACCCAGTTTGACGTACAGGACCGAGCGA

GACCTTTTGGGATTGGGCGTAATGTCTACTTTGCTAGTCCGAGGTCCAGCTTCACGTCCATCCACAGGTACTACGCTGTGCAGGATGTCAGTTCCGTTAAGAATGCTGAGGACATTACATCACACGTTCCTAACTACATCCCTAATGGTGTGTTCAGTATTTGCGGAAGTGGTACGGAAAACTTCTGTTCGGTACTATCTCACGGGGACCCTAGTAAAATCTTCATGTACAAATTCCTGTACCTGAACGAAGAGTTAAGGCAACAGTCGTGGTCTCATTGGGACTTTGGGGAAAACGTACAGG

TTCTAGCTTGTCAGAGTATCAGCTCAGATATGTATGTGATTCTTCGCAATGAGTTCAATACGTTCCTAGCTAGAATCTCTTTCACTAAGAACGCCATTGACTTACAGGGAGAACCCTATCGTGCCTTTATGGACATGAAGATTCGATACACGATTCCTAGTGGAACATACAACGATGACACATTCACTACCTCTATTCATATTCCAACAATTTATGGTGCAAACTTCGGGAGGGGCAAAATCACTGTATTGGAGCCTGATGGTAAGATAACCGTGTTTGAGCAACCTACGGCTGGGTGGAATAGCG

ACCCTTGGCTGAGACTCAGCGGTAACTTGGAGGGACGCATGGTGTACATTGGGTTCAACATTAACTTCGTATATGAGTTCTCTAAGTTCCTCATCAAGCAGACTGCCGACGACGGGTCTACCTCCACGGAAGACATTGGGCGCTTACAGTTACGCCGAGCGTGGGTTAACTACGAGAACTCTGGTACGTTTGACATTTATGTTGAGAACCAATCGTCTAACTGGAAGTACACAATGGCTGGTGCCCGATTAGGCTCTAACACTCTGAGGGCTGGGAGACTGAACTTAGGGACCGGACAAT

ATCGATTCCCTGTGGTTGGTAACGCCAAGTTCAACACTGTATACATCTTGTCAGATGAGACTACCCCTCTGAACATCATTGGGTGTGGCTGGGAAGGTAACTACTTACGGAGAAGTTCCGGTATTTAATTAAATATTCTCCCTGTGGTGGCTCGAAATTAATACGACTCACTATAGGGAGAACAATACGACTACGGGAGGGTTTTCTTATGATGACTATAAGACCTACTAAAAGTACAGACTTTGAGGTATTCACTCCGGCTCACCATGACATTCTTGAAGCTAAGGCTGCTGGTATTGAGCCGA

GTTTCCCTGATGCTTCCGAGTGTGTCACGTTGAGCCTCTATGGGTTCCCTCTAGCTATCGGTGGTAACTGCGGGGACCAGTGCTGGTTCGTTACGAGCGACCAAGTGTGGCGACTTAGTGGAAAGGCTAAGCGAAAGTTCCGTAAGTTAATCATGGAGTATCGCGATAAGATGCTTGAGAAGTATGATACTCTTTGGAATTACGTATGGGTAGGCAATACGTCCCACATTCGTTTCCTCAAGACTATCGGTGCGGTATTCCATGAAGAGTACACACGAGATGGTCAATTTCAGTTATTTACAA

TCACGAAAGGAGGATAACCATATGTGTTGGGCAGCCGCAATACCTATCGCTATATCTGGCGCTCAGGCTATCAGTGGTCAGAACGCTCAGGCCAAAATGATTGCCGCTCAGACCGCTGCTGGTCGTCGTCAAGCTATGGAAATCATGAGGCAGACGAACATCCAGAATGCTGACCTATCGTTGCAAGCTCGAAGTAAACTTGAGGAAGCGTCCGCCGAGTTGACCTCACAGAACATGCAGAAGGTCCAAGCTATTGGGTCTATCCGAGCGGCTATCGGAGAGAGTATGCTTGAAGGTTC

CTCAATGGACCGCATTAAGCGAGTCACAGAAGGACAGTTCATTCGGGAAGCCAATATGGTAACTGAGAACTATCGCCGTGACTACCAAGCAATCTTCGCACAGCAACTTGGTGGTACTCAAAGTGCTGCAAGTCAGATTGACGAAATCTATAAGAGCGAACAGAAACAGAAGAGTAAGCTACAGATGGTTCTGGACCCACTGGCTATCATGGGGTCTTCCGCTGCGAGTGCTTACGCATCCGGTGCGTTCGACTCTAAGTCCACAACTAAGGCACCTATTGTTGCCGCTAAAGGAACCAA

GACGGGGAGGTAATGAGCTATGAGTAAAATTGAATCTGCCCTTCAAGCGGCACAACCGGGACTCTCTCGGTTACGTGGTGGTGCTGGAGGTATGGGCTATCGTGCAGCAACCACTCAGGCCGAACAGCCAAGGTCAAGCCTATTGGACACCATTGGTCGGTTCGCTAAGGCTGGTGCCGATATGTATACCGCTAAGGAACAACGAGCACGAGACCTAGCTGATGAACGCTCTAACGAGATTATCCGTAAGCTGACCCCTGAGCAACGTCGAGAAGCTCTCAACAACGGGACCCTTCTG

TATCAGGATGACCCATACGCTATGGAAGCACTCCGAGTCAAGACTGGTCGTAACGCTGCGTATCTTGTGGACGATGACGTTATGCAGAAGATAAAAGAGGGTGTCTTCCGTACTCGCGAAGAGATGGAAGAGTATCGCCATAGTCGCCTTCAAGAGGGCGCTAAGGTATACGCTGAGCAGTTCGGCATCGACCCTGAGGACGTTGATTATCAGCGTGGTTTCAACGGGGACATTACCGAGCGTAACATCTCGCTGTATGGTGCGCATGATAACTTCTTGAGCCAGCAAGCTCAGAAGGGC

GCTATCATGAACAGCCGAGTGGAACTCAACGGTGTCCTTCAAGACCCTGATATGCTGCGTCGTCCAGACTCTGCTGACTTCTTTGAGAAGTATATCGACAACGGTCTGGTTACTGGCGCAATCCCATCTGATGCTCAAGCCACACAGCTTATAAGCCAAGCGTTCAGTGACGCTTCTAGCCGTGCTGGTGGTGCTGACTTCCTGATGCGAGTCGGTGACAAGAAGGTAACACTTAACGGAGCCACTACGACTTACCGAGAGTTGATTGGTGAGGAACAGTGGAACGCTCTCATGGTCAC

AGCACAACGTTCTCAGTTTGAGACTGACGCGAAGCTGAACGAGCAGTATCGCTTGAAGATTAACTCTGCGCTGAACCAAGAGGACCCAAGGACAGCTTGGGAGATGCTTCAAGGTATCAAGGCTGAACTAGATAAGGTCCAACCTGATGAGCAGATGACACCACAACGTGAGTGGCTAATCTCCGCACAGGAACAAGTTCAGAATCAGATGAACGCATGGACGAAAGCTCAGGCCAAGGCTCTGGACGATTCCATGAAGTCAATGAACAAACTTGACGTAATCGACAAGCAATTCCAGA

AGCGAATCAACGGTGAGTGGGTCTCAACGGATTTTAAGGATATGCCAGTCAACGAGAACACTGGTGAGTTCAAGCATAGCGATATGGTTAACTACGCCAATAAGAAGCTCGCTGAGATTGACAGTATGGACATTCCAGACGGTGCCAAGGATGCTATGAAGTTGAAGTACCTTCAAGCGGACTCTAAGGACGGAGCATTCCGTACAGCCATCGGAACCATGGTCACTGACGCTGGTCAAGAGTGGTCTGCCGCTGTGATTAACGGTAAGTTACCAGAACGAACCCCAGCTATGGATGCTC

TGCGCAGAATCCGCAATGCTGACCCTCAGTTGATTGCTGCGCTATACCCAGACCAAGCTGAGCTATTCCTGACGATGGACATGATGGACAAGCAGGGTATTGACCCTCAGGTTATTCTTGATGCCGACCGACTGACTGTTAAGCGGTCCAAAGAGCAACGCTTTGAGGATGATAAAGCATTCGAGTCTGCACTGAATGCATCTAAGGCTCCTGAGATTGCCCGTATGCCAGCGTCACTGCGCGAATCTGCACGTAAGATTTATGACTCCGTTAAGTATCGCTCGGGGAACGAAAGCATGG

CTATGGAGCAGATGACCAAGTTCCTTAAGGAATCTACCTACACGTTCACTGGTGATGATGTTGACGGTGATACCGTTGGTGTGATTCCTAAGAATATGATGCAGGTTAACTCTGACCCGAAATCATGGGAGCAAGGTCGGGATATTCTGGAGGAAGCACGTAAGGGAATCATTGCGAGCAACCCTTGGATAACCAATAAGCAACTGACCATGTATTCTCAAGGTGACTCCATTTACCTTATGGACACCACAGGTCAAGTCAGAGTCCGATACGACAAAGAGTTACTCTCGAAGGTCTGGAGT

GAGAACCAGAAGAAACTCGAAGAGAAAGCTCGTGAGAAGGCTCTGGCTGATGTGAACAAGCGAGCACCTATAGTTGCCGCTACGAAGGCCCGTGAAGCTGCTGCTAAACGAGTCCGAGAGAAACGTAAACAGACTCCTAAGTTCATCTACGGACGTAAGGAGTAACTAAAGGCTACATAAGGAGGCCCTAAATGGATAAGTACGATAAGAACGTACCAAGTGATTATGATGGTCTGTTCCAAAAGGCTGCTGATGCCAACGGGGTCTCTTATGACCTTTTACGTAAAGTCGCTTGGACAG

AATCACGATTTGTGCCTACAGCAAAATCTAAGACTGGACCATTAGGCATGATGCAATTTACCAAGGCAACCGCTAAGGCCCTCGGTCTGCGAGTTACCGATGGTCCAGACGACGACCGACTGAACCCTGAGTTAGCTATTAATGCTGCCGCTAAGCAACTTGCAGGTCTGGTAGGGAAGTTTGATGGCGATGAACTCAAAGCTGCCCTTGCGTACAACCAAGGCGAGGGACGCTTGGGTAATCCACAACTTGAGGCGTACTCTAAGGGAGACTTCGCATCAATCTCTGAGGAGGGACGT

AACTACATGCGTAACCTTCTGGATGTTGCTAAGTCACCTATGGCTGGACAGTTGGAAACTTTTGGTGGCATAACCCCAAAGGGTAAAGGCATTCCGGCTGAGGTAGGATTGGCTGGAATTGGTCACAAGCAGAAAGTAACACAGGAACTTCCTGAGTCCACAAGTTTTGACGTTAAGGGTATCGAACAGGAGGCTACGGCGAAACCATTCGCCAAGGACTTTTGGGAGACCCACGGAGAAACACTTGACGAGTACAACAGTCGTTCAACCTTCTTCGGATTCAAAAATGCTGCCGAAGC

TGAACTCTCCAACTCAGTCGCTGGGATGGCTTTCCGTGCTGGTCGTCTCGATAATGGTTTTGATGTGTTTAAAGACACCATTACGCCGACTCGCTGGAACTCTCACATCTGGACTCCAGAGGAGTTAGAGAAGATTCGAACAGAGGTTAAGAACCCTGCGTACATCAACGTTGTAACTGGTGGTTCCCCTGAGAACCTCGATGACCTCATTAAATTGGCTAACGAGAACTTTGAGAATGACTCCCGCGCTGCCGAGGCTGGCCTAGGTGCCAAACTGAGTGCTGGTATTATTGGTGCTGG

TGTGGACCCGCTTAGCTATGTTCCTATGGTCGGTGTCACTGGTAAGGGCTTTAAGTTAATCAATAAGGCTCTTGTAGTTGGTGCCGAAAGTGCTGCTCTGAACGTTGCATCCGAAGGTCTCCGTACCTCCGTAGCTGGTGGTGACGCAGACTATGCGGGTGCTGCCTTAGGTGGCTTTGTGTTTGGCGCAGGCATGTCTGCAATCAGTGACGCTGTAGCTGCTGGACTGAAACGCAGTAAACCAGAAGCTGAGTTCGACAATGAGTTCATCGGTCCTATGATGCGATTGGAAGCCCGTG

AGACAGCACGAAACGCCAACTCTGCGGACCTCTCTCGGATGAACACTGAGAACATGAAGTTTGAAGGTGAACATAATGGTGTCCCTTATGAGGACTTACCAACAGAGAGAGGTGCCGTGGTGTTACATGATGGCTCCGTTCTAAGTGCAAGCAACCCAATCAACCCTAAGACTCTAAAAGAGTTCTCCGAGGTTGACCCTGAGAAGGCTGCGCGAGGAATCAAACTGGCTGGGTTCACCGAGATTGGCTTGAAGACCTTGGGGTCTGACGATGCTGACATCCGTAGAGTGGCTATCGA

CCTCGTTCGCTCTCCTACTGGTATGCAGTCTGGTGCCTCAGGTAAGTTCGGTGCAACAGCTTCTGACATCCATGAGAGACTTCATGGTACTGACCAGCGTACTTATAATGACTTGTACAAAGCAATGTCTGACGCTATGAAAGACCCTGAGTTCTCTACTGGCGGCGCTAAGATGTCCCGTGAAGAAACTCGATACACTATCTACCGTAGAGCGGCACTAGCTATTGAGCGTCCAGAACTACAGAAGGCACTCACTCCGTCTGAGAGAATCGTTATGGACATCATTAAGCGTCACTTTGACA

CCAAGCGTGAACTTATGGAAAACCCAGCAATATTCGGTAACACAAAGGCTGTGAGTATCTTCCCTGAGAGTCGCCACAAAGGTACTTACGTTCCTCACGTATATGACCGTCATGCCAAGGCGCTGATGATTCAACGCTACGGTGCCGAAGGTTTGCAGGAAGGGATTGCCCGCTCATGGATGAACAGCTACGTCTCCAGACCTGAGGTCAAGGCCAGAGTCGATGAGATGCTTAAGGAATTACACGGGGTGAAGGAAGTAACACCAGAGATGGTAGAGAAGTACGCTATGGATAAGGCT

TATGGTATCTCCCACTCAGACCAGTTCACCAACAGTTCCATAATAGAAGAGAACATTGAGGGCTTAGTAGGTATCGAGAATAACTCATTCCTTGAGGCACGTAACTTGTTTGATTCGGACCTATCCATCACTATGCCAGACGGACAGCAATTCTCAGTGAATGACCTAAGGGACTTCGATATGTTCCGCATCATGCCAGCGTATGACCGCCGTGTCAATGGTGACATCGCCATCATGGGGTCTACTGGTAAAACCACTAAGGAACTTAAGGATGAGATTTTGGCTCTCAAAGCGAAAGCTGAG

GGAGACGGTAAGAAGACTGGCGAGGTACATGCTTTAATGGATACCGTTAAGATTCTTACTGGTCGTGCTAGACGCAATCAGGACACTGTGTGGGAAACCTCACTGCGTGCCATCAATGACCTAGGGTTCTTCGCTAAGAACGCCTACATGGGTGCTCAGAACATTACGGAGATTGCTGGGATGATTGTCACTGGTAACGTTCGTGCTCTAGGGCATGGTATCCCAATTCTGCGTGATACACTCTACAAGTCTAAACCAGTTTCAGCTAAGGAACTCAAGGAACTCCATGCGTCTCTGTTCG

GGAAGGAGGTGGACCAGTTGATTCGGCCTAAACGTGCTGACATTGTGCAGCGCCTAAGGGAAGCAACTGATACCGGACCTGCCGTGGCGAACATCGTAGGGACCTTGAAGTATTCAACACAGGAACTGGCTGCTCGCTCTCCGTGGACTAAGCTACTGAACGGAACCACTAACTACCTTCTGGATGCTGCGCGTCAAGGTATGCTTGGGGATGTTATTAGTGCCACCCTAACAGGTAAGACTACCCGCTGGGAGAAAGAAGGCTTCCTTCGTGGTGCCTCCGTAACTCCTGAGCAGAT

GGCTGGCATCAAGTCTCTCATCAAGGAACATATGGTACGCGGTGAGGACGGGAAGTTTACCGTTAAGGACAAGCAAGCGTTCTCTATGGACCCACGGGCTATGGACTTATGGAGACTGGCTGACAAGGTAGCTGATGAGGCAATGCTGCGTCCACATAAGGTGTCCTTACAGGATTCCCATGCGTTCGGAGCACTAGGTAAGATGGTTATGCAGTTTAAGTCTTTCACTATCAAGTCCCTTAACTCTAAGTTCCTGCGAACCTTCTATGATGGATACAAGAACAACCGAGCGATTGACGCT

GCGCTGAGCATCATCACCTCTATGGGTCTCGCTGGTGGTTTCTATGCTATGGCTGCACACGTCAAAGCATACGCTCTGCCTAAGGAGAAACGTAAGGAGTACTTGGAGCGTGCACTGGACCCAACCATGATTGCCCACGCTGCGTTATCTCGTAGTTCTCAATTGGGTGCTCCTTTGGCTATGGTTGACCTAGTTGGTGGTGTTTTAGGGTTCGAGTCCTCCAAGATGGCTCGCTCTACGATTCTACCTAAGGACACCGTGAAGGAACGTGACCCAAACAAACCGTACACCTCTAGAGAG

GTAATGGGCGCTATGGGTTCAAACCTTCTGGAACAGATGCCTTCGGCTGGCTTTGTGGCTAACGTAGGGGCTACCTTAATGAATGCTGCTGGCGTGGTCAACTCACCTAATAAAGCAACCGAGCAGGACTTCATGACTGGTCTTATGAACTCCACAAAAGAGTTAGTACCGAACGACCCATTGACTCAACAGCTTGTGTTGAAGATTTATGAGGCGAACGGTGTTAACTTGAGGGAGCGTAGGAAATAATACGACTCACTATAGGGAGAGGCGAAATAATCTTCTCCCTGTAGTCTCTTAGA

TTTACTTTAAGGAGGTCAAATGGCTAACGTAATTAAAACCGTTTTGACTTACCAGTTAGATGGCTCCAATCGTGATTTTAATATCCCGTTTGAGTATCTAGCCCGTAAGTTCGTAGTGGTAACTCTTATTGGTGTAGACCGAAAGGTCCTTACGATTAATACAGACTATCGCTTTGCTACACGTACTACTATCTCTCTGACAAAGGCTTGGGGTCCAGCCGATGGCTACACGACCATCGAGTTACGTCGAGTAACCTCCACTACCGACCGATTGGTTGACTTTACGGATGGTTCAATCCTCCGCGC

GTATGACCTTAACGTCGCTCAGATTCAAACGATGCACGTAGCGGAAGAGGCCCGTGACCTCACTACGGATACTATCGGTGTCAATAACGATGGTCACTTGGATGCTCGTGGTCGTCGAATTGTGAACCTAGCGAACGCCGTGGATGACCGCGATGCTGTTCCGTTTGGTCAACTAAAGACCATGAACCAGAACTCATGGCAAGCACGTAATGAAGCCTTACAGTTCCGTAATGAGGCTGAGACTTTCAGAAACCAAGCGGAGGGCTTTAAGAACGAGTCCAGTACCAACGCTACGAACA

CAAAGCAGTGGCGCGATGAGACCAAGGGTTTCCGAGACGAAGCCAAGCGGTTCAAGAATACGGCTGGTCAATACGCTACATCTGCTGGGAACTCTGCTTCCGCTGCGCATCAATCTGAGGTAAACGCTGAGAACTCTGCCACAGCATCCGCTAACTCTGCTCATTTGGCAGAACAGCAAGCAGACCGTGCGGAACGTGAGGCAGACAAGCTGGAAAATTACAATGGATTGGCTGGTGCAATTGATAAGGTAGATGGAACCAATGTGTACTGGAAAGGAAATATTCACGCTAACGGGCG

CCTTTACATGACCACAAACGGTTTTGACTGTGGCCAGTATCAACAGTTCTTTGGTGGTGTCACTAATCGTTACTCTGTCATGGAGTGGGGAGATGAGAACGGATGGCTGATGTATGTTCAACGTAGAGAGTGGACAACAGCGATAGGCGGTAACATCCAGTTAGTAGTAAACGGACAGATCATCACCCAAGGTGGAGCCATGACCGGTCAGCTAAAATTGCAGAATGGGCATGTTCTTCAATTAGAGTCCGCATCCGACAAGGCGCACTATATTCTATCTAAAGATGGTAACAGGAATAACT

GGTACATTGGTAGAGGGTCAGATAACAACAATGACTGTACCTTCCACTCCTATGTACATGGTACGACCTTAACACTCAAGCAGGACTATGCAGTAGTTAACAAACACTTCCACGTAGGTCAGGCCGTTGTGGCCACTGATGGTAATATTCAAGGTACTAAGTGGGGAGGTAAATGGCTGGATGCTTACCTACGTGACAGCTTCGTTGCGAAGTCCAAGGCGTGGACTCAGGTGTGGTCTGGTAGTGCTGGCGGTGGGGTAAGTGTGACTGTTTCACAGGATCTCCGCTTCCGCAATATCTG

GATTAAGTGTGCCAACAACTCTTGGAACTTCTTCCGTACTGGCCCCGATGGAATCTACTTCATAGCCTCTGATGGTGGATGGTTACGATTCCAAATACACTCCAACGGTCTCGGATTCAAGAATATTGCAGACAGTCGTTCAGTACCTAATGCAATCATGGTGGAGAACGAGTAATTGGTAAATCACAAGGAAAGACGTGTAGTCCACGGATGGACTCTCAAGGAGGTACAAGGTGCTATCATTAGACTTTAACAACGAATTGATTAAGGCTGCTCCAATTGTTGGGACGGGTGTAGCAGATG

TTAGTGCTCGACTGTTCTTTGGGTTAAGCCTTAACGAATGGTTCTACGTTGCTGCTATCGCCTACACAGTGGTTCAGATTGGTGCCAAGGTAGTCGATAAGATGATTGACTGGAAGAAAGCCAATAAGGAGTGATATGTATGGAAAAGGATAAGAGCCTTATTACATTCTTAGAGATGTTGGACACTGCGATGGCTCAGCGTATGCTTGCGGACCTTTCGGACCATGAGCGTCGCTCTCCGCAACTCTATAATGCTATTAACAAACTGTTAGACCGCCACAAGTTCCAGATTGGTAAGTTGCAG

CCGGATGTTCACATCTTAGGTGGCCTTGCTGGTGCTCTTGAAGAGTACAAAGAGAAAGTCGGTGATAACGGTCTTACGGATGATGATATTTACACATTACAGTGATATACTCAAGGCCACTACAGATAGTGGTCTTTATGGATGTCATTGTCTATACGAGATGCTCCTACGTGAAATCTGAAAGTTAACGGGAGGCATTATGCTAGAATTTTTACGTAAGCTAATCCCTTGGGTTCTCGCTGGGATGCTATTCGGGTTAGGATGGCATCTAGGGTCAGACTCAATGGACGCTAAATGGAAACAGG

AGGTACACAATGAGTACGTTAAGAGAGTTGAGGCTGCGAAGAGCACTCAAAGAGCAATCGATGCGGTATCTGCTAAGTATCAAGAAGACCTTGCCGCGCTGGAAGGGAGCACTGATAGGATTATTTCTGATTTGCGTAGCGACAATAAGCGGTTGCGCGTCAGAGTCAAAACTACCGGAACCTCCGATGGTCAGTGTGGATTCGAGCCTGATGGTCGAGCCGAACTTGACGACCGAGATGCTAAACGTATTCTCGCAGTGACCCAGAAGGGTGACGCATGGATTCGTGCGTTACAGGAT

ACTATTCGTGAACTGCAACGTAAGTAGGAAATCAAGTAAGGAGGCAATGTGTCTACTCAATCCAATCGTAATGCGCTCGTAGTGGCGCAACTGAAAGGAGACTTCGTGGCGTTCCTATTCGTCTTATGGAAGGCGCTAAACCTACCGGTGCCCACTAAGTGTCAGATTGACATGGCTAAGGTGCTGGCGAATGGAGACAACAAGAAGTTCATCTTACAGGCTTTCCGTGGTATCGGTAAGTCGTTCATCACATGTGCGTTCGTTGTGTGGTCCTTATGGAGAGACCCTCAGTTGAAGATACT

TATCGTATCAGCCTCTAAGGAGCGTGCAGACGCTAACTCCATCTTTATTAAGAACATCATTGACCTGCTGCCATTCCTATCTGAGTTAAAGCCAAGACCCGGACAGCGTGACTCGGTAATCAGCTTTGATGTAGGCCCAGCCAATCCTGACCACTCTCCTAGTGTGAAATCAGTAGGTATCACTGGTCAGTTAACTGGTAGCCGTGCTGACATTATCATTGCGGATGACGTTGAGATTCCGTCTAACAGCGCAACTATGGGTGCCCGTGAGAAGCTATGGACTCTGGTTCAGGAGTTCGCTG

CGTTACTTAAACCGCTGCCTTCCTCTCGCGTTATCTACCTTGGTACACCTCAGACAGAGATGACTCTCTATAAGGAACTTGAGGATAACCGTGGGTACACAACCATTATCTGGCCTGCTCTGTACCCAAGGACACGTGAAGAGAACCTCTATTACTCACAGCGTCTTGCTCCTATGTTACGCGCTGAGTACGATGAGAACCCTGAGGCACTTGCTGGGACTCCAACAGACCCAGTGCGCTTTGACCGTGATGACCTGCGCGAGCGTGAGTTGGAATACGGTAAGGCTGGCTTTACGCTAC

AGTTCATGCTTAACCCTAACCTTAGTGATGCCGAGAAGTACCCGCTGAGGCTTCGTGACGCTATCGTAGCGGCCTTAGACTTAGAGAAGGCCCCAATGCATTACCAGTGGCTTCCGAACCGTCAGAACATCATTGAGGACCTTCCTAACGTTGGCCTTAAGGGTGATGACCTGCATACGTACCACGATTGTTCCAACAACTCAGGTCAGTACCAACAGAAGATTCTGGTCATTGACCCTAGTGGTCGCGGTAAGGACGAAACAGGTTACGCTGTGCTGTACACACTGAACGGTTACATCTA

CCTTATGGAAGCTGGAGGTTTCCGTGATGGCTACTCCGATAAGACCCTTGAGTTACTCGCTAAGAAGGCAAAGCAATGGGGAGTCCAGACGGTTGTCTACGAGAGTAACTTCGGTGACGGTATGTTCGGTAAGGTATTCAGTCCTATCCTTCTTAAACACCACAACTGTGCGATGGAAGAGATTCGTGCCCGTGGTATGAAAGAGATGCGTATTTGCGATACCCTTGAGCCAGTCATGCAGACTCACCGCCTTGTAATTCGTGATGAGGTCATTAGGGCCGACTACCAGTCCGCTCGTGAC

GTAGACGGTAAGCATGACGTTAAGTACTCGTTGTTCTACCAGATGACCCGTATCACTCGTGAGAAAGGCGCTCTGGCTCATGATGACCGATTGGATGCCCTTGCGTTAGGCATTGAGTATCTCCGTGAGTCCATGCAGTTGGATTCCGTTAAGGTCGAGGGTGAAGTACTTGCTGACTTCCTTGAGGAACACATGATGCGTCCTACGGTTGCTGCTACGCATATCATTGAGATGTCTGTGGGAGGAGTTGATGTGTACTCTGAGGACGATGAGGGTTACGGTACGTCTTTCATTGAGTGGT

GATTTATGCATTAGGACTGCATAGGGATGCACTATAGACCACGGATGGTCAGTTCTTTAAGTTACTGAAAAGACACGATAAATTAATACGACTCACTATAGGGAGAGGAGGGACGAAAGGTTACTATATAGATACTGAATGAATACTTATAGAGTGCATAAAGTATGCATAATGGTGTACCTAGAGTGACCTCTAAGAATGGTGATTATATTGTATTAGTATCACCTTAACTTAAGGACCAACATAAAGGGAGGAGACTCATGTTCCGCTTATTGTTGAACCTACTGCGGCATAGAGTCACCTACCGAT

TTCTTGTGGTACTTTGTGCTGCCCTTGGGTACGCATCTCTTACTGGAGACCTCAGTTCACTGGAGTCTGTCGTTTGCTCTATACTCACTTGTAGCGATTAGGGTCTTCCTGACCGACTGATGGCTCACCGAGGGATTCAGCGGTATGATTGCATCACACCACTTCATCCCTATAGAGTCAAGTCCTAAGGTATACCCATAAAGAGCCTCTAATGGTCTATCCTAAGGTCTATACCTAAAGATAGGCCATCCTATCAGTGTCACCTAAAGAGGGTCTTAGAGAGGGCCTATGGAGTTCCTATAGG

GTCCTTTAAAATATACCATAAAAATCTGAGTGACTATCTCACAGTGTACGGACCTAAAGTTCCCCCATAGGGGGTACCTAAAGCCCAGCCAATCACCTAAAGTCAACCTTCGGTTGACCTTGAGGGTTCCCTAAGGGTTGGGGATGACCCTTGGGTTTGTCTTTGGGTGTTACCTTGAGTGTCTCTCTGTGTCCCT

Page 25: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

25

1 4 5 6 8 91

012 15 16 17

1

9

Functional depiction of T7 genome

Page 26: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

26

Dunn & Studier (1983)Journal of Molecular Biology v166 p477

Page 27: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

27

1 4 5 6 8 91

012 15 16 17

1

9

Functional depiction of T7 genome

Page 28: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

28

14 56 891

012 1516 17

1

9

I have no idea what this genetic program will do

Page 29: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

29

Wild-type T7 genes 2.8-3

-------------------2.8----------------->

...acgcaaagggaggcgacatggcaggttacggcgctaaaggaatccgaaa...

<--3-RBS---><----------------3-----------------

Natural DNA sequence

Page 30: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

30

J.R. Koza et al.Automated Synthesis of Computational Circuits using GeneticProgramming,1997 IEEE International Conference on Evolutionary Computation

Evolved Circuit:

1. What does it do?2. How does it work?3. Why this design?

Page 31: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

31

Design & build new genome?

Page 32: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

32

Phage T7: Post-Synthesis

Page 33: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)
Page 34: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

1. Design and construct any mutant or combinations thereof; characterize.

34

Post-Synthesis Genetics?

2. Reveal / confirm functions or demonstrate absence thereof.

Page 35: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

1. Synthesize and assemble DNA fragments of length, L (say 1.5x max CDS length), offset so as to provide 3x coverage, only including known elements, deleting, scrambling, or otherwise destroying all unannotated sequence (i.e., simplons).

35

2. Recombine simplons back onto native genome, or into generic testbed (perhaps a generic deletion collection), screen.

0. Given a newly sequenced & annotated genome...

Page 36: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

36

4. As a biproduct of preceding, produce full collection of native genetic elements: CDSs, regulatory elements, and so on.

Page 37: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

37

Page 38: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)
Page 39: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)
Page 40: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

40

“I think you're just ignoringthe facts.

I'm a geneticist, I write code, okay?

A, G, T, P in different combinations; hacking the human genome, okay?

I choose what I choose. Either your kidneys work or you grow six fingers.

I do that.”

Tom Sizemore, Red Planet (2000).

Page 41: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

- Primary sequence assembly, annotation, analysis

- Functional primary networks (graphs)

- Secondary structure (2D physics)

- Tertiary structures, networks, dynamics (3D physics)

- Reverse engineering tools (for reproducing machines; time-scale decoupling of control systems; pattern forming systems)

- Post-synthesis biology tools (perturbation design)

Proposed Computing Priorities

Page 42: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

http://i.pbase.com/u47/hodero/large/30691408.jungle.jpg

ChallengeMake biology easy

to engineer.

OpportunityEnable all of biotechnology.

Page 43: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

43

People

Tools

Platforms

Translation

Page 44: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)
Page 45: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

45

People

Tools

Platforms

Translation

Page 46: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

Over 3500 BioBrick DNA parts

Freely available*

Today

Page 47: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

47

People

Tools

Platforms

Translation

Page 48: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

Today, each project is Herculean

1) Malaria is a global problem,artemisinin offers a cure.

2) Jay Keasling’s team spent $25M to make artemisinin via biotechnology.

3) But artemisinin resistance is already occurring.

48

Must we always spend many years and $25M for each pressing

biotech project?

Page 49: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

!"#$%&'"

(&

)*+#,&'-#.%/0#"*

12&12&3"

4!

5%&2"6$ 3$6,

!3""6$ 3$6,

78

7)

+690%&:3"$'0'";

7<

=-' ;#>/0%;6;:

?&',2$/;

@*+#,&'-#*

?&'?6'>3/%

A3$/6$

B&'?6'>6$ 3$6,

C3"'>6$ 3$6,

4%&6>%

42$$6>6$ 3$6,

52.3&6$ 3$6,

C3"6$ 3$6,

@*+#,&'-#*

D2/#&'"3$/'>%

($%/'6>

E/3$'>6$:3$6,

52&12&3"

A%F2"6>6$ 3$6,

!"2/3.6$:3$6,

G#"'>6$ 3$6,

G#"6/'"H(&3D6/'"

+<

C%/03>'"

C6-%,:3"$'0'";I6'D3;%,

4#> !3;

J6&%$/

B'"#.%&;:K:92.;

L%-/6"%;!"#$%&'()*+,%#'()-",#+.'()-",#+.)

./"&+01'()-/"2).3'4+/0'()3$4/5'&%#6()

7#"$%'()56.#"()'$"07%8

431%:5'',:42??"#*//7)$".9"1+01()$#%'%#:"&+:%'()

-%#&+5+;%#'()$%'&+.+7%'(),%:%#"1%)

,/&&5%'()"$$5+"0.%'(),%:%#"1%)."0)

./"&+01'():+&"2+0'

L&3>;?'&/3/6'>*3%5'()/861%0"&%'()"0&+<-#%%;%()=+$%#)

-53+7')2/57%7)$5"'&+.'()."#)'%"&'(),%5&')

4/'%'(),32$%#'()./##/'+/0)+04+,+&/#'

+'2;6>9>"+0&'()#%'+0'()'+7+01()+0'35"&+/0()

.%2%0&'()./"&+01'():"#0+'4%'()-5"2%)

#%&"#7%0&'()"74%'+:%'()."#$%&+01

M%$&%3/6'>*//&1%"#()$#/&%.&+:%)%?3+$2%0&()

."2%#")"07)-+52(),+.6.5%)$"#&')@)&+#%'()

=%&)'3+&'()&"$%'<!AB'<ACAB'()1/5-)

%?3+$2%0&()."2$+01)1%"#(),/"&'

+%3"/0:3>,:+#96%>%>5"'&+.)%6%15"''%'()./'2%&+.'()

7%&%#1%0&'()$4"#2".%3&+."5'()'30&"0)

5/&+/0()2%7+."5<7%0&"5)$#/73.&'()

7+'+0-%.&"0&'()"'$+#+0

E;'*;#>/0%;6;:

?&',2$/;

56;$0%&*L&'?;$0

A6N26,;

7@

7O (;?3&/6$:3$6,

76/&6$H($'>6/6$ 3$6,

A#;6>%

!"2$'>6$ 3$6,

4'&D6/'"

!"2$3&6$ 3$6,

(..'>63:;#>/0%;6;P:0#,&'9%>3/6'>:?&',2$/;

A6>%3&:3>,:D&3>$0%,:QR:3"$'0'";P:3>,:.6-%,:0690%&:3"$'0'";

C%/0#":%;/%&;P:5'&.3",%0#,%P:($%/6$:3$6,P:J6.%/0#"%/0%&P:

J6.%/0#"$3&D'>3/%P::C%/0#":3.6>%;P:CLISP:'"%16>;P:93;'"6>%

="%16>:0#,&'1'&.#"3/6'> ?&',2$/;T:3",%0#,%;P:3"$'0'";P:3$6,;

E;'*7O:.'"%$2"%;P:6;'D2/%>%:3>,:6/;:,%&6F3/6F%;

!*'"%16>;P:93;'"6>%P:U3-%;P:,6%;%"

5%&.%>/3/6'>:?&',2$/;P:B&'?#"%>%:9"#$'"P:.3"'>6$P:QP@*BJ=P:,63$6,;P:

?&'?#" 3"$'0'"P:,63",%0#,%P:%?'-6,%;:

($&#"3/%;P:A*B&'?#"%>%:9"#$'"P:J6'-3>%;P:B'"#%;/%&;P:A3$/6,%

($&#"3/%;P:($&#"3.6,%;P:S;/%&;P:QP@*B&'?3>%,6'"P:C3"'>6$ 3$6,:3>,:

'/0%&;:

L+5P:QPO*I2/3>%,6'"P:"*D2/#&'"3$/'>%P:?#&&'"6,'>%;P:%;/%&;P:,63.6>%;P:

OPO*I6'>%""%P:0#,&'-#D2/#&6$ 3$6,

V>;3/2&3/%,:;2$$6>3/% ,%&6F3/6F%;:W;%%:3D'F%X

+#,&'-#D2/#&3/%;P:%?'-#*"*D2/#&'"3$/'>%P:D2/%>'6$ 3$6,

C%/0#":;2$$6>3/% ,%&6F3/6F%;:W;%%:3D'F%XP:2>;3/2&3/%,:%;/%&;

C3>#:12&3>:,%&6F3/6F%;

#*3.6>'"%F2"6>3/%P:<*C%/0#":L+5P:QPO*,6'";P:%;/%&;P:;2$$6>3/%

S!P:B!P:9"#$%&'"P:"3$/3/%P:0#,&'-# 12&3>;P:;293&:3$6,;

(.6>':,6'";P:9"2/3&6$ 3$6,P;2D;/6/2/%,:?#&&'"6,'>%;:

QP)*?%>/3>%,6'"P:6/3$'>6$ ,%&6F3/6F%;P:?#&&'"6,'>%;P:%;/%&;

Y2.%&'2;:12&3>:,%&6F3/6F%;P:;2$$6>3/%P:%;/%&;P:"%F2"6>6$ 3$6,

73?&'"3$/3.P:,63.6>' 3"$'0'";P:QP)*,63.6>'?%>/3>%

J6"3$/'>%;P:.'>'"3$/'>%;P:'/0%&:?&',2$/;

!"2$'>'"3$/'>%;P:%;/%&;

!"#$'";:WS!P:B!XP:9"#$%&'"P:"3$/3/%P:6;';'&D6,%

B0%>'"6$;P:1'',:3,,6/6F%;

?'"#0#,&'-#3"Z'>'3/%;

?'#3.6>'3$6,;

?'"#;3$$03&6,%;

+#,&'-# ;2$$6>3/% ,%&6F3/6F%;:W3D'F%XP:0#,&'-#D2/#&'"3$/'>%

(.6>':;2$$6>3/% ,%&6F3/6F%;:W;%%:3D'F%X

I2/3>%,6'";P:D2/%>'";

J6'";P:Z%/'>% ,%&6F3/6F%;P:6>,%/%&.6>3>/

A3$/'>%;P:%;/%&;

(>/61&%%[%:3>,:,%6$%&;

52%":'-#9%>3/%;

!&%%>:;'"F%>/;

B0/03"3/%:?'"#%;/%&;

B"3;/6$6[%&;

B'"#3$&#"3/%;

B'"#3$&#"3.6,%;

B0%>'"*1'&.3",%0#,%:&%;6>;

BSEL:?'"#.%&

B'"#0#,&'-#?'"#%;/%&;

B'"#0#,&'-#?'"#3.6,%;:

70%"3/6>9:39%>/;

(.6>%;

4'"F%>/;

Y#"'>;:W?'"#3.6,%;X

B'"#2&%/03>%;

I6;?0%>'" (:&%?"3$%.%>/:

B'"#$3&D'>3/%;:

M%39%>/;*D26",6>9:2>6/

S>F6&'>.%>/D"&%#).4%2+."5'()-5/..35"0&'()

.4%5"&/#'().5%"0%#')"07)7%&%#1%0&'

M%39%>/P:?&'?6'>'"P:3$&#"3/%

B03&.3\:E>/%&.%,63/%;

B'"#F6>#":3$%/3/%

B'"#F6>#":3"$'0'"

4?%$63"/#:$0%.6$3":6>/%&.%,63/%

B'"#%/0%&;

B'"#?#&&'"6,'>%;

M%;6>;P:$&';;"6>Z%&;

?+:$'>/&'":39%>/;

S.2";616%&;

7'..2>6$3/6'>E/57%7)$5"'&+.'()./2$3&%#)."'+01'()

/$&+."5)-+,%#)./"&+01'()5+?3+7).#6'&"5)

7+'$5"6'()$%0'()$%0.+5'()+09'()76%'()

$"$%#)$#/73.&'

<*3.6>'*QP@*BJ=P:<*3.6>'.3"'>6$P:W3.6>'*@+BX

L0&%'>6>%

E>,2;/&63"!/##/'+/0)+04+,+&/#'()73'&)./0&#/5()

,/+5%#)="&%#)&#%"&2%0&()1"')

$3#+-+."&+/0()%2+''+/0)","&%2%0&()

'$%.+"5&6)53,#+."0&'()4/'%'()'%"5'

E>/%&.%,63/%:

B"3/1'&.;

I26",6>9:

I"'$Z;

4%$'>,3&#

70%.6$3";

B&',2$/;HV;%;E>/%&.%,63/%;I6'.3;;

5%%,;/'$Z;

4293&;F53./'%

*#3.&/'%

G65/'%

H#",+0/'%

I".&/'%

J3.#/'%

J&"#.4

4/3&$0

7%""2"';%

A69>6>

=6"

B&'/%6>

+%.6$%""2"';%

5692&%:@:] (>3"'9'2;:C',%":'1:3 I6'D3;%, B&',2$/:5"'U*$03&/:1'&:I6'.3;; 5%%,;/'$Z;

Food Energy Environment Agriculture Health Chemicals Security

2004 DOE report lists 120 highvalue chemicals for biomanufacturing

Page 50: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

Biotechnology is everywhereFo

od

Ener

gy

Env

ironm

ent

Agr

iculture

Hea

lth

Chem

ical

s

Secu

rity

$20B $14B$91B$3B$2.1B$2B$2.5B

NIH spends $1.5B just on manipulating DNA

AAAS 2008 Federal R&D Budget Analysis; R&D Magazine 2008; John Mulligan, Blue Heron

Total U.S. R&D spending by sector

50

Investment in open technology platforms non-existent

Page 51: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

We need new tools

Cohen et al., PNAS, 1973

2006Production of the antimalarial drug

precursor artemisinic acid in engineered yeast

Ro et al., Nature, 2006

1973

Construction of biologically functional

bacterial plasmids in vitro

Lin et al., PNAS,1985

1985

Cloning and expression of the human

erthropoietin gene

51Genetic engineering basics unchanged past 30+ years

Page 52: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

52

Synthetic biology = tools revolutionDNA Sequencing

Read Out the Genetic Code

Recombinant DNA

Basic “Cut” & “Paste”

Polymerase Chain Reaction

Amplify & Make Simple Changes

First

Gen.

Biotech=

...

Next

Gen.

Biotech

Adds

New

Tools

=

Page 53: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

53

A public-benefit facility producing the parts, tools & standards powering biotechnology.

What is the BioFAB?

“Off-the-shelf genetic parts cut our project development times and overall costs by half.”

- Jack Newman, Founder & SVP Research, Amyris Biotechnologies

Page 54: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

Canton, Labno & Endy

Nature Biotechnology v26:7, 787-93 July 2008

Page 55: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)
Page 56: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

Canton, Labno & Endy

Nature Biotechnology v26:7, 787-93 July 2008

Page 57: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

57

Mechanism & Function

A transcription factor (LuxR) that is active in the presence

of a cell-cell signaling molecule (3OC6HSL) is controlled by

a regulated operator (PLtetO-1). Device input is 3OC6HSL.

Device output is PoPS from a LuxR-regulated operator. If

used in a cell containing TetR then a second input such as

aTc can be used to produce a Boolean AND function.

BBa_F26203OC6HSL PoPS Receiver

Registry of Standard Biological Parts making life better, one part at a time

License: Public

Conditions (abridged)

Output: PoPS measured via BBa_E0240

Culture: Supplemented M9, 37ºC

Plasmid: pSB3K3

Chassis: MG1655

*Equipment: PE Victor3 multi-well fluorimeter

**Equipment: BD FACScan cytometer

1E01E1

1E21E3

1E4

GFP (arbitrary units)

Doublin

gs

20

38

56

74

92

High Input (1E -7 M 3OC6HSL)

92

1E01E1

1E21E3

1E4

Doublin

gs

20

38

56

74

Low Input(0 M 3OC6HSL)

GFP (arbitrary units)

Reliability**

Genetic: >92/>56 culture doublingsPerformance: >92/>56 culture doublings

(low/high input during propagation)

Transcriptional Output Demand (low/high input)

Nucleotides: 0 / 6xNt nucleotides cell-1 s-1

Polymerases: 0 / 1.5E-1xNt RNAP cell-1

(Nt = downstream transcript length)

Chassis: MC4100, MG1655, and DH5

Plasmids: pSB3K3 and pSB1A2

Devices: E0240, E0430 and E0434

BBa_F2620 Response Time: <1 minBBa_T9002 Response Time: 6±1 minInputs: 0 M (Low), 1E-07 M (High) 3OC6HSL

Pmax: 6.6 PoPS cell-1

K: 1.5E-09 M 3OC6HSLn: 1.6

Static Performance*

Pout =Pmax [3OC6HSL]n

Kn+ [3OC6HSL]n

0E+00 1E!10 1E!09 1E!08 1E!07 1E!06 1E!05 1E!04

0

100

200

300

400

500

600

[3OC6HSL] (M)

GF

P s

yn

the

sis

ra

te (

mo

lec

ule

s c

ell!1 s!1)

Population Mean

Colony Range

Hill Equation

0

1

2

3

4

5

6

7

8

Po

PS

ce

ll!1

Part Compatibility (qualitative)

Authors: Barry Canton Ania Labno

Updated: March 2008

Sig

nalin

g D

evic

es

http://p

art

s.m

it.e

du/r

egis

try/index.p

hp/P

art

:BB

a_F

2620

BBa_F2620

R0040 B0034 C0062 B0015 R0062

PLtetO-1 RBS luxR Term. Plux,R

Component Parts

!10 0 10 20 30 40 50

0

100

200

300

400

500

600

+ 3OC6HSL

Time (min)

GF

P s

yn

the

sis

ra

te (

mo

lec

ule

s c

ell!1 s!1)

GFP synthesis rate (Low Input)

GFP synthesis rate (High Input)

Polynomial Fit (High Input)

PoPS (High Input)

0

1

2

3

4

5

6

7

8

Po

PS

ce

ll!1

0E+00 1E!10 1E!09 1E!08 1E!07 1E!06 1E!05 1E!04

0

100

200

300

400

500

600

[AHL] (M)

GF

P s

yn

the

sis

ra

te (

mo

lec

ule

s c

ell!1 s!1)

C4HSL

C6HSL

3OC6HSL

C7HSL

C8HSL

3OC8HSL

C10

HSL

C12

HSL

0

1

2

3

4

5

6

7

8

Po

PS

ce

ll!1

Input Compatibility*

Dynamic Performance*

Canton, Labno & Endy

Nature Biotechnology v26:7, 787-93 July 2008

Page 58: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

UnknownOur Lord's Candle (Yucca whipplei), Lakeview Mountains, 22 Apr 2006Smugmug.com

Page 59: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

Unknownhttp://www.rootsweb.com/~usgenweb/mn/stearns/postcards/quarry.jpg

Page 60: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

Washington National Cathedralhttp://www.cathedral.org/cathedral/discover/1930photos/5.shtml

Page 61: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)
Page 66: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

Apps

Page 67: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

Tools

Page 68: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

68

1. Info. theory & signal proc.2. Device design

7. Control & dyn. systems

5. Fab, CAD & EDA

6. Reverse engineering

3. Languages & grammars4. Standards & abstraction

Page 69: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

69

Page 70: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

70

Nagpal, Kondacs, Chang, Programming Methodology for Biologically-Inspired Self-Assembling Systems, in the AAAI Spring Symposium on Computational Synthesis: From Basic Building Blocks to High Level Functionality, March 2003.

Movie online via http://www.swiss.csail.mit.edu/projects/amorphous/Robust/cross.mpeg

Page 71: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

71

1. Recombinant DNA

5. Abstraction

2. Polymerase chain reaction

3. DNA sequencing4. DNA construction

6. Standardization

Page 72: Computing the Future of Biology & Biotechnologyarchive2.cra.org/ccc/files/docs/Endy_CCC_lite.pdf · ~5E9 tons plants (1% DNA, made up number) ~1E9 tons animals (1% DNA, made up number)

- Advocacy for tools!!!

- Advocacy for open technology platforms.

- More tools, including: - Languages (e.g., for programming dynamic shapes) - CAD and EDA (what’s going into the synthesizers?) - Coding schemes (signatures, fail fast, et cetera)

- Theory supporting of design of understandable reproducing machines

- Tools that enable humanity (e.g., Games, Community)

Proposed Computing Priorities