71
1 Dan Graur Rates of Rates of Nucleotide Nucleotide Substitution Substitution

1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

Embed Size (px)

Citation preview

Page 1: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

1

Dan Graur

Rates of Rates of Nucleotide Nucleotide

SubstitutionSubstitution

Page 2: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

2

r K

2T

r = Rate of substitution per site per year K = Number of substitutions per site per year

Page 3: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

3

Less than 10-9

substitutions/site/year

Mean Rate of Nucleotide

Substitution in Mammalian Nuclear

GenomesEvolution is a very Evolution is a very slowslow process at the molecular process at the molecular level.level.

Not much happens in Not much happens in evolution. evolution.

Page 4: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

4

The rate of synonymoussynonymous substitution is much largerlarger than the nonsynonymousnonsynonymous rate.

Substitutions Rates in Protein-Coding Regions

Page 5: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

5

Page 6: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

6

A lot

A little

Page 7: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

7

SynonymousSynonymous substitutions are more frequent than nonsynonymousnonsynonymous ones.

Page 8: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

8

Mean nonsynonymous rate = 0.75 10–9 substitutions per site per year

Mean synonymous rate = 3.65 10–9 substitutions per site per year

Coefficient of variation of nonsynonymous rate = 95%

Coefficient of variation of synonymous rate = 31%

The synonymous substitution rate is 5 times higher than

the nonsynonymous substitution rate

Page 9: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

9

The distribution of KA to KS ratios in >13,000 orthologous protein-coding genes from human and chimpanzee

Page 10: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

10

58 nucleotide differences3 amino acid differences

In a comparison of human and yeast ubiquitin genes, the inferred number of synonymous substitutions per synonymous site is ~6 (almost certainly indicative of saturation). The inferred number of nonsynonymous substitutions per nonsynonymous site is 0.03. Thus, synonymous substitutions have accumulated at least 200 times faster than nonsynonymous substitutions.

Page 11: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

11

Ratio

1.5

4.4

1.1

Page 12: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

12

Substitution Rates of in Noncoding Regions

Page 13: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

13

Page 14: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

14

Divergence between cow and goat - and -globin genes and between cow and goat -globin pseudogenes

______________________________________________Region K______________________________________________5’ Flanking region 5.3 1.25’ Untranslated region 4.0 2.04-fold degenerate sites 8.6 2.5Introns 8.1 0.73’ Untranslated region 8.8 2.23’ Flanking region 8.0 1.5Pseudogenes 9.1 0.9______________________________________________

5’5’

3’3’

Page 15: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

15

Page 16: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

16

Coding Coding regionsregions evolve slowerslower than noncodingnoncoding regionsregions.

Page 17: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

17

Evolutionary Rate Profiles

Page 18: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

18

Xenopus MALWMQCLP-LVLVLLFSTPNTEALANQHLBos MALWTRLRPLLALLALWPPPPARAFVNQHL **** : * *.*: *:..* :. *:****

Xenopus CGSHLVEALYLVCGDRGFFYYPKIKRDIEQBos CGSHLVEALYLVCGERGFFYTPKARREVEG ***************:***** ** :*::*

Xenopus AQVNGPQDNELDG-MQFQPQEYQKMKRGIVBos PQVG---ALELAGGPGAGGLEGPPQKRGIV .**. ** * * *****

Xenopus EQCCHSTCSLFQLENYCNBos EQCCASVCSLYQLENYCN **** *.***:*******

Alignment preproinsulin

Page 19: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

19

Page 20: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

20

Functional Functional regionsregions evolve slowerslower than nonfunctionalnonfunctional regionsregions.

Page 21: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

21

Page 22: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

22

Page 23: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

23

Rates of amino acid replacement in different

proteins

Page 24: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

Fibrinogen to Fibrin

• Fibrinogen consists of 6 chains: 2, 2, 2• Fibrinopeptides are very negatively charged• Fibrinopeptides A are cleaved first (to allow polymerization of fibrins)• Fibrinopeptides B are cleaved second (to enhance

crosslinking)

Page 25: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

25

Page 26: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

26

Important Important proteins proteins evolve slowerslower than unimportantunimportant onesones.

Page 27: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

27

Page 28: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year
Page 29: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

29

Can we explain the different rates of substitution by the selectionist model?

1.Mutations can be either deleteriousdeleterious

or advantageousadvantageous. 2.If the fractionfraction of advantageousadvantageous

mutations is largelarge, the raterate of evolution will be highhigh. If the fractionfraction of advantageousadvantageous mutations is smallsmall, the raterate of evolution will be lowlow.

3.A mutation occurring at a functional functional sitesite has a higher probability of being advantageous than a mutation occurring at a nonfunctional sitenonfunctional site.

Expectation: Expectation: Important entities Important entities should evolve faster than less less important onesimportant ones.

Page 30: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

30

Can we explain the different rates of substitution by the selectionist model?

1.Mutations can be either deleteriousdeleterious

or advantageousadvantageous. 2.If the fractionfraction of advantageousadvantageous

mutations is largelarge, the raterate of evolution will be highhigh. If the fractionfraction of advantageousadvantageous mutations is smallsmall, the raterate of evolution will be lowlow.

3.A mutation occurring at a functional functional sitesite has a higher probability of being advantageous than a mutation occurring at a nonfunctional sitenonfunctional site.

Expectation: Expectation: Important entities Important entities should evolve faster than less less important onesimportant ones.

Page 31: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

31

Can we explain the different rates of substitution by the

neutralist model?

1.Mutations can be either deleteriousdeleterious or neutralneutral.

2.If the fractionfraction of deleteriousdeleterious mutations is largelarge, the raterate of evolution will be lowlow. If the fractionfraction of deleteriousdeleterious mutations is smallsmall, the raterate of evolution will be highhigh.

3.A mutation occurring at a functional functional sitesite has a higher probability of being deleterious than a mutation occurring at a nonfunctional sitenonfunctional site.

Expectation: Expectation: Important entities Important entities should evolve slower than less less important onesimportant ones.

Page 32: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

32

Can we explain the different rates of substitution by the

neutralist model?

1.Mutations can be either deleteriousdeleterious or neutralneutral.

2.If the fractionfraction of deleteriousdeleterious mutations is largelarge, the raterate of evolution will be lowlow. If the fractionfraction of deleteriousdeleterious mutations is smallsmall, the raterate of evolution will be highhigh.

3.A mutation occurring at a functional functional sitesite has a higher probability of being deleterious than a mutation occurring at a nonfunctional sitenonfunctional site.

Expectation: Expectation: Important entities Important entities should evolve slower than less less important onesimportant ones.

Page 33: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

33

Kimura’s First Law of Molecular EvolutionKimura’s First Law of Molecular Evolution

Page 34: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

34

Functional entities Functional entities evolve slower than evolve slower than entities devoid of entities devoid of function.function.

Page 35: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

35

Functional constraintFunctional constraint = = Degree of intolerance Degree of intolerance towards mutations at a towards mutations at a genomic location.genomic location.The functional The functional constraint defines constraint defines the the range of alternative range of alternative residuesresidues that are that are acceptable at a site acceptable at a site without affecting without affecting negatively the fitness negatively the fitness of the organism.of the organism.

Page 36: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

36

K = v For neutral mutations:

Rate of substitution Mutation rate

Page 37: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

37

Kimura’s model of functional constraint

Suppose that a fraction, f0, of all mutations are selectively neutral and the rest (1 − f0) are deleterious.

Advantageous mutations are assumed to occur only very rarely, such that their relative frequency is effectively zero.

If we denote by vT the total mutation rate per unit time, then the rate of neutral mutation, v0, is

v0 vT f0

Page 38: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

38

According to the neutral theory, the rate of substitution is:

Hence,

The highest substitution rate is expected in sequences that do not have any function, such that all mutations are neutral

K v0

K vT f0

f0 1

v0 vT f0

Page 39: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

39

Page 40: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

40

An evolutionary An evolutionary experimentexperiment

Spalax Spalax ehrenberghiehrenberghi

Page 41: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

41

A-crystallinA-crystallin

Page 42: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

42

InIn Spalax Spalax,, A-crystallin A-crystallin lost its functional role lost its functional role more than more than 25 million years 25 million years agoago, when the mole rat , when the mole rat became subterranean and became subterranean and presumably lost use of its presumably lost use of its eyes. eyes. The The A-crystallin of A-crystallin of SpalaxSpalax evolves evolves 20 times faster20 times faster than than the the A-crystallins in other A-crystallins in other rodents, such as rats, mice, rodents, such as rats, mice, hamsters, gerbils and hamsters, gerbils and squirrel. squirrel.

Page 43: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

43

Additional Facts:Additional Facts:

(1) The (1) The A-crystallin of A-crystallin of SpalaxSpalax possess all the possess all the prerequisites for normal prerequisites for normal function and expressionfunction and expression, , including the proper signals including the proper signals for alternative splicing.for alternative splicing.

(2) The (2) The A-crystallin of A-crystallin of SpalaxSpalax evolves evolves slowerslower than than pseudogenes.pseudogenes.

Page 44: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

44

Explanation 1: The A-crystallin gene

may not have lost all of its vision-related functions, such as photoperiod perception and adaptation to seasonal changes.

Contradicting evidence: The atrophied eye of

Spalax does not respond to light.

Page 45: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

45

Explanation 2: The blind mole rat lost its vision more recently than 25 million years ago. The rate of nonsynonymous substitution after nonfunctionalization has been underestimated. Contradicting evidence: The A-crystallin gene is still an intact gene as far as the essential molecular structures for its expression are concerned.

Page 46: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

46

Explanation 3: The A-crystallin-gene product serves another function (unrelated to that of the eye). A-crystallin is a multifunctional protein

Supporting evidence: 1.A crystallin has been found in other tissues. 2.A crystallin also functions as a chaperonin that binds denaturing proteins and prevents their aggregation. 3.The regions within A crystallin

responsible for chaperonin activity are conserved in the mole rat.4.The protein has viable secondary and quarternary structures as well as normal thermostability.

Page 47: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

47

Genetic Genetic nonfunctionalization nonfunctionalization or partial or partial nonfunctionalization nonfunctionalization accelerates evolution.accelerates evolution. Most evolutionary Most evolutionary “action” occurs after “action” occurs after death. death.

Page 48: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

The Concept of Functional Constraint

The intensity of purifying selection is determined by the degree of intolerance characteristic of a site or a genomic region towards mutations.

The functional or selective constraint defines the range of alternative nucleotides that is acceptable at a site without affecting negatively the function or structure of the gene or the gene product.

DNA regions, in which a mutation is likely to affect function, have a more stringent functional constraint than regions devoid of function

Page 49: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

The stronger the functional constraints on a macromolecule are, the slower its rate of substitution will be.

Page 50: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

Functional density (Zuckerkandl 1976)

The functional density, F, of a gene is defined as ns/N, where ns is the number of sites committed to specific functions and N is the total number of sites. F, therefore, is the proportion of amino acids that are subject to stringent functional constraints.

Page 51: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

Functional density (Zuckerkandl 1976)

The higher the functional density, the lower the rate of substitution is expected to be.

Thus, a protein in which the active sites constitute only 1% of its sequence will be less constrained, and therefore will evolve more quickly than a protein that devotes 50% of its sequence to performing specific biochemical or physiological tasks.

Page 52: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

According to the neutral theory of evolution, the rate of substitution (as inferred from between-species comparisons) should positively correlate with the degree of genetic polymorphism (as inferred from comparisons among individuals within one species).

An interesting corollary of this hypothesis is that we should observe very little or no variation at the population level at evolutionary conserved positions.

The variation observed at conserved positions should be mostly deleterious (i.e., associated with disease).

Page 53: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

Gaucher disease is an autosomal recessive lysosomal storage disorder due to deficient activity of an enzyme called acid -glucosidase. There are many subtypes of Gaucher disease with fitness effects ranging from slight reduction in fitness to perinatally lethal, in which death occurs during the period between 154 days of gestation to seven days after birth.

Substitution rates and disease: The case of Gaucher disease

Page 54: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

We aligned the amino acid sequences of acid -glucosidase from nine placental mammals (human, chimpanzee, Sumatran orangutan, bovine, pig, dog, horse, rat, and mouse). The length of the alignment (excluding one gap due to a codon deletion in the ancestor of mouse and rat) was 496 amino-acids, of which 387 (78%) were identical in all nine species and 109 (22%) were variable..

-glucosidase

Page 55: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

Thirty-six single amino-acid replacements (at 34 amino-acid positions) resulting in Gaucher disease are described in the literature. Perinatal lethal mutations are shown in red.

Page 56: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

All 36 deleterious mutations occur at completely conserved sites (below asterisks). The expectation under a random model is that only 36 × 0.78 = 28 mutations should occur at completely conserved sites. This statistically significant non-random association between disease and evolutionary conservation (p = 0.0002) indicates that invariable sites are conserved because they evolve under extremely stringent functional constraints and cannot tolerate change.

Page 57: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

Q: What determines functional constraint?A: Many factors.

Q: Example?A: Interactions.

Page 58: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

A network (or graph) is an abstract representation of a set of objects, where some objects are connected to one another. The objects are represented by vertices (or nodes), and the links that connect the vertices are called edges (or branches). Edges can be polarized

Page 59: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

Edges can be polarized to indicate directionality and type of interaction (e.g., activation, inhibition). Edges can also be quantified to denoted extent of effect.

Page 60: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

Protein-protein interaction networks

(a) A simple example of a protein-protein interaction network consisting of five proteins (A-E), represented by the nodes, each of which interacts with at least one other protein. There are five interactions, denoted by the links.

In biological networks, three variables are usually studied:

(b) degree centrality or connectedness = the number of interactions for a protein.(c) betweenness centrality = the number of times that a node appears on the shortest path between all pairs of nodes.(d) closeness centrality = the mean number of links connecting a protein to all other proteins in the network.

Page 61: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

Proteins with high connectedness evolve slowly.Proteins with low connectedness evolve fast.

Proteins with high betweenness evolve slowly.Proteins with low betweenness evolve fast.

Proteins with high closeness evolve slowly.Proteins with low closeness evolve fast.

Page 62: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

Why do the rates of synonymous substitution vary from gene to gene?

(1) The variation represents stochastic fluctuations.(2) The variation is due to deterministic factors on top of stochastic fluctuations.

(2.1) Variation in the rate of mutation among different regions of the genome.

(2.2) Selection operating on synonymous mutations.

Page 63: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

Fact: There is a positive correlation between synonymous and nonsynonymous substitution rates in a gene.

Explanations:

(1)The rate of mutation varies along the genome and among genes (and hence some genes will have both high synonymous and nonsynonymous rates of substitution)

(2) The extent of selection at synonymous sites is affected by the nucleotide composition at adjacent nonsynonymous positions.

(3) (1) and (2).

Page 64: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

In the absence of positive Darwinian selection, the universal observation is that important sequences tend to evolve slower than less important ones.

The opposite, however, is not always true. That is, conserved regions in the genome may not always be important.

Defining “importance” is not a trivial undertaking.

Page 65: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

Hurst and Smith (1999) tested the relationship between rate of substitution and dispensability (a proxy for importance).

Approximately two thirds of all knockouts of individual mouse genes give rise to viable fertile mice. These genes have been termed “non-essential,” in contrast to “essential” genes, the knockouts of which result in death or infertility.

It is predicted that non-essential genes will subject to lesser intensities of purifying selection, and should therefore evolve faster than essential genes.

Page 66: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

In a comparison of 74 non-essential genes with 64 essential ones, the rate of substitution was found not to correlate with the severity of the knockout phenotype.

To account for differences in function, Hurst and Smith (1999) restricted their analysis exclusively to neuron-specific genes, which have significantly lower rates of substitution than other genes.

They could find no difference in the rate of substitution between 16 essential neuron-specific genes and 18 non-essential ones.

Page 67: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

The functional role (if any) of ~98% of mammalian genomes remains undetermined.

Nóbrega et al. (2004) deleted ~2 Mb-long sequences from the mouse genome, a 1,817,000 region mapping to mouse chromosome 3 and a 983,000 region mapping to chromosome 19. (Orthologous regions of about the same size are present on human chromosomes 1 and 10, respectively.)

Viable mice homozygous for the deletions were generated and were indistinguishable from wild-type littermates with regard to morphology, reproductive fitness, growth, longevity, and general homeostasis. Further analysis of the expression of multiple genes bracketing the deletions revealed only minor expression differences between homozygous-deletion mice and wild-type mice.

Page 68: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

The two deleted segments harbor 1,243 non-coding sequences conserved between humans and rodents (more than 100 base pairs, 70% identity). Yet, the deletion of so many sequences that have been conserved for such long period of time (mouse-human divergence ≈ 100 million years) resulted in no reduction in fitness.

Conclusion I: There are potentially ‘disposable DNA’ in the genomes of mammals.

Conclusion II: Sequence conservation may not necessarily indicate constraint.

Page 69: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

Ahituv et al. (2007) removed from the mouse genome four ultraconserved elements—sequences of 200 base pairs or longer that are 100% identical among human, mouse, and rat.

Page 70: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

Remarkably, lines of mice homozygous for the four deletions were viable and fertile, and failed to reveal any developmental or phenotypic abnormalities.

Page 71: 1 Dan Graur Rates of Nucleotide Substitution. 2 r = Rate of substitution per site per year K = Number of substitutions per site per year

These results indicate that extreme sequence conservation may not necessarily reflect extreme evolutionary constraint. There must be forces other than selection that promote sequence conservation.