View
217
Download
1
Embed Size (px)
Citation preview
“Evolutionary speculation constitutes a kind of metascience, which has the same intellectual fascination for some biologists that metaphysical speculation possessed for some mediaeval scholastics. It can be considered a relatively harmless habit, like eating peanuts, unless it assumes the form of an obsession; then it becomes a vice” (Stanier, 1970)
Linnaean classification• Two major characteristics
• Kingdom Animalia• Phylum Chordata• Class Mammalia• Order Primates• Family Hominidae• Genus Homo• Species sapiens
BIOL E-127 – 10/01/07
Microbial systematics
• Formerly Pseudomonas (partial list): Ralstonia, Burkholderia, Hydrogenophaga, Sphingomonas, Methylobacterium, Cellvibrio, Xanthomonas, Acidovorax, Hydrogenophillus, Brevundimonas, Pandoraea
multi-C C1(& all use methanol)
Molecular phylogenetics• Zuckerkandl & Pauling. 1965. Molecules as documents of
evolutionary history. J Theor Biol. 8:357-366.
• Neutral theory (Motoo Kimura, 1968)
Good Dataset
[A1, A2, A3, A4] [A1, B2, A3, A4]
Bad Dataset
A B
species 1 species 2 species 3 species 4
A1B1
A2B2 A4
B4A3
B3
1. Collect Sequence DataOrtholog vs. paralog?
Example: Neighbor Joining (NJ)
4. Choose Methods
Taxa CharactersSpecies A ATGGCTATTCTTATAGTACGSpecies B ATCGCTAGTCTTATATTACASpecies C TTCACTAGACCTGTGGTCCASpecies D TTGACCAGACCTGTGGTCCGSpecies E TTGACCAGTTCTCTAGTTCG
A
B
C
DE
Choose methods: distance-based
A B C D E Species A ---- 4 10 9 8Species B ---- 8 11 10Species C ---- 3 8Species D ---- 5Species E ----
A B C D E Species A ---- 4 10 9 8Species B -19.3 ---- 8 11 10Species C -10 -14.7 ---- 3 8Species D -10.7 -11.3 -16 ---- 5Species E -12.7 -13.3 -12 -14.7 ----
M(AB)=d(AB) -[(r(A) + r(B)]/(N-2)
4. Choose Methods
Maximum Parsimony (MP): Model: Evolution goes through the least number of changes
Maximum Likelihood (ML): L (data| model)
Bayesian Inference
Pr(data)
Pr(model)model)|Pr(datadata)|Pr(model
Markov chain Monte Carlo (MCMC) method for sampling from posterior probability distribution
Discrete character methods
5. Assess Reliability
I. Bootstrap
Re-sampling to produce pseudo-dataset (random weighting)
II. Jacknife
Sampling with replacement
III. Permutation test
Random deletion of sub-dataset
Randomize dataset to build null likelihood distribution
CGATCGTTA
CAATGATAG
CGCTGATAA
CGCTGATCG
taxa1
taxa2
taxa3
taxa4
123456789
Dataset1: 729338554Dataset2: 631981282
…
Dataset1: 1-3-56789Dataset2: 12-45678-
…
10073
Assess reliability