16
Nuclear Loci and Coalescent Methods Support Ancient Hybridization as Cause of Mitochondrial Paraphyly between Gadwall and Falcated Duck (Anas SPP.) Author(s): Jeffrey L. Peters, Yuri Zhuravlev, Igor Fefelov, Anne Logie and Kevin E. Omland Source: Evolution, Vol. 61, No. 8 (Aug., 2007), pp. 1992-2006 Published by: Society for the Study of Evolution Stable URL: http://www.jstor.org/stable/4627014 . Accessed: 23/10/2014 17:43 Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at . http://www.jstor.org/page/info/about/policies/terms.jsp . JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship. For more information about JSTOR, please contact [email protected]. . Society for the Study of Evolution is collaborating with JSTOR to digitize, preserve and extend access to Evolution. http://www.jstor.org This content downloaded from 128.123.94.166 on Thu, 23 Oct 2014 17:43:53 PM All use subject to JSTOR Terms and Conditions

Nuclear Loci and Coalescent Methods Support Ancient ...lithornis.nmsu.edu/~phoude/Peters_et_al_2007_Nuclear_loci_and... · original article doi: 10. 1111/j. 1558-5646.2007.00149.x

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Nuclear Loci and Coalescent Methods Support Ancient ...lithornis.nmsu.edu/~phoude/Peters_et_al_2007_Nuclear_loci_and... · original article doi: 10. 1111/j. 1558-5646.2007.00149.x

Nuclear Loci and Coalescent Methods Support Ancient Hybridization as Cause of MitochondrialParaphyly between Gadwall and Falcated Duck (Anas SPP.)Author(s): Jeffrey L. Peters, Yuri Zhuravlev, Igor Fefelov, Anne Logie and Kevin E. OmlandSource: Evolution, Vol. 61, No. 8 (Aug., 2007), pp. 1992-2006Published by: Society for the Study of EvolutionStable URL: http://www.jstor.org/stable/4627014 .

Accessed: 23/10/2014 17:43

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at .http://www.jstor.org/page/info/about/policies/terms.jsp

.JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range ofcontent in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new formsof scholarship. For more information about JSTOR, please contact [email protected].

.

Society for the Study of Evolution is collaborating with JSTOR to digitize, preserve and extend access toEvolution.

http://www.jstor.org

This content downloaded from 128.123.94.166 on Thu, 23 Oct 2014 17:43:53 PMAll use subject to JSTOR Terms and Conditions

Page 2: Nuclear Loci and Coalescent Methods Support Ancient ...lithornis.nmsu.edu/~phoude/Peters_et_al_2007_Nuclear_loci_and... · original article doi: 10. 1111/j. 1558-5646.2007.00149.x

ORIGINAL ARTICLE

doi: 10. 1111/j. 1558-5646.2007.00149.x

NUCLEAR LOCI AND COALESCENT METHODS SUPPORT ANCIENT HYBRIDIZATION AS CAUSE OF MITOCHONDRIAL PARAPHYLY BETWEEN GADWALL AND FALCATED DUCK (ANAS SPP.) Jeffrey L. Peters,1,2,3 Yuri Zhuravlev,4 Igor Fefelov,5 Anne Logie,1'6 and Kevin E. Omland1'7 'Department of Biological Sciences, University of Maryland Baltimore County, Baltimore, Maryland 21250 2E-mail: [email protected]

4lnstitute of Biology and Soil Science, Far East Branch, Russian Academy of Sciences, Vladivostok 690022, Russia E-mail: [email protected]

5Research Institute of Biology, Irkutsk State University, Irkutsk 664003, Russia E-mail: [email protected] 6E-mail: alogie [email protected] 7E-mail: [email protected]

Received July 24, 2006

Accepted March 21, 2007

Many species have mitochondrial DNA lineages that are phylogenetically intermixed with other species, but studies have rarely tested the cause of such paraphyly. In this study, we tested two hypotheses that could explain mitochondrial paraphyly of Holarctic

gadwalls (Anas strepera) with respect to Asian falcated ducks (A. falcata). First, hybridization could have resulted in falcated duck mitochondrial DNA (mtDNA) introgressing into the gadwall gene pool. Second, gadwalls and falcated ducks could have diverged so

recently that mtDNA lineages have not sorted to reciprocal monophyly. We used coalescent analyses of three independent loci to

distinguish between these two hypotheses. Two lines of evidence support introgression. First, analyses of the three loci combined show that some introgression is necessary to explain current genetic diversity in gadwalls. Second, we generated alternative

predictions regarding time since divergence estimated from mtDNA: falcated ducks and gadwalls would have diverged between 65,000 and 700,000 years before present (ybp) under the introgression hypothesis and between 11,000 and 76,000 ybp under the incomplete lineage sorting hypothesis. The two independent nuclear introns indicated that these species diverged between 210,000 and 5,200,000 ybp, which did not overlap the predicted time for incomplete lineage sorting. These analyses also suggested that ancient introgression (-14,000 ybp) has resulted in the widespread distribution and high frequency of falcated-like mtDNA (5.5% of haplotypes) in North America. This is the first study to use a rigorous quantitative framework to reject incomplete lineage sorting as the cause of mitochondrial paraphyly.

KEY WORDS: Anatidae, ancient introgression, coalescence, Holarctic, incomplete lineage sorting, mitochondrial paraphyly, nuclear introns.

In a survey of published phylogenetic and phylogeographic stud- ies, Funk and Omland (2003) documented that 23% of surveyed

animal species had mitochondrial DNA (mtDNA) that was not

monophyletic. Yet, few studies have attempted to differentiate between possible causes of such gene-tree paraphyly and poly- phyly (paraphyly hereafter). Two major causes of paraphyly are

introgression and incomplete lineage sorting (Avise 2000; Funk 3Current address: Institute of Arctic Biology, University of Alaska Fairbanks, 909 Koyukuk Dr., Fairbanks, Alaska 99775.

? 2007 The Author(s). Journal compilation ? 2007 The Society for the Study of Evolution. 1 992 Evolution 61-8: 1992-2006

This content downloaded from 128.123.94.166 on Thu, 23 Oct 2014 17:43:53 PMAll use subject to JSTOR Terms and Conditions

Page 3: Nuclear Loci and Coalescent Methods Support Ancient ...lithornis.nmsu.edu/~phoude/Peters_et_al_2007_Nuclear_loci_and... · original article doi: 10. 1111/j. 1558-5646.2007.00149.x

TESTING CAUSES OF PARAPHYLY

and Omland 2003). Introgression can occur when interspecific hybridization results in DNA crossing species boundaries; the re-

cipient species will have DNA that is paraphyletic with respect to the other. Alternatively, incomplete lineage sorting can result in

paraphyly if species divergence was too recent for ancestral poly- morphisms to have sorted to reciprocal monophyly. Mitochondrial

paraphyly is particularly important because mtDNA should sort more rapidly than nuclear DNA owing to its smaller effective pop- ulation size (Avise et al. 1987; Moritz et al. 1987; Moore 1995). Distinguishing between introgression and incomplete lineage sort-

ing is difficult because both can produce similar topologies in gene trees (Wakeley 1996; Holder et al. 2001), and most studies that have addressed the cause of paraphyly gave ad hoc explanations without statistically rejecting alternative hypotheses. Three main

approaches have been used to date: examining the topology of the mtDNA tree, comparing mtDNA with nuclear DNA, and compar- ing the geographic distributions of different clades.

The topology of the mtDNA tree has been used as a heuris- tic approach to determining the cause of paraphyly. For exam-

ple, a shallow genealogy that lacks distinct clades suggests that

species have diverged too recently for mtDNA to have sorted to

monophyly (e.g., Klicka et al. 1999; Baker et al. 2003). This in-

terpretation is especially supported when shared haplotypes are ancestral whereas species-specific haplotypes are derived (Baker et al. 2003; Donnelly et al. 2004; Omland et al. 2006). Other stud- ies have used the more rigorous method of nested clade analysis (Templeton 1998) to objectively evaluate these genealogical sig- natures (e.g., Liebers et al. 2004; Morando et al. 2004), but this method has been criticized because it does not provide a statis- tical framework for rejecting alternative scenarios (Knowles and Maddison 2002). Although the topology of a single gene tree can

provide some evidence regarding incomplete lineage sorting or

introgression, the randomness of mutation and genetic drift pre- vents any strong conclusions if analyses do not account for this stochastic variance (Knowles and Maddison 2002; Rosenberg and

Nordborg 2002; Knowles 2004). A second commonly used approach has been to compare

mtDNA to nuclear DNA (nDNA). The absence of shared nuclear

polymorphisms (i.e., differences were fixed between species) has

been interpreted as evidence of a deep species divergence and

introgression of mtDNA (e.g., Tegelstrom 1987; Ballard 2000; Weckstein et al. 2001; Redenbach and Taylor 2002). In contrast, extensive sharing of nDNA polymorphisms has been interpreted as evidence for a recent divergence and the retention of ances- tral polymorphisms (e.g., Wilson et al. 2000; Sefc et al. 2005). However, these comparisons implicitly assume that mtDNA has

higher potential for moving across species boundaries and that nDNA will better reflect phylogenetic history (see Brumfield et al. 2001; Funk and Omland 2003). If hybridization is common, then nuclear polymorphisms can be shared, even when species di-

verged a long time ago. Alternatively, species might exhibit fixed differences at some loci due to chance, even when species diver-

gence was recent (Palumbi et al. 2001; Hudson and Turelli 2003). Although comparing mtDNA to nDNA is a promising method of

determining the cause of mitochondrial paraphyly, more rigorous methods are needed to test alternative hypotheses (see Buckley et al. 2006).

Finally, a third approach has been to compare sympatric and

allopatric populations. Under the introgression hypothesis, lin-

eage sharing might be more common where the two species are

sympatric, because there are more opportunities for hybridization (e.g., Hare and Avise 1998; Masta et al. 2002; Donnelly et al.

2004; Kulikova et al. 2004, 2005; Morando et al. 2004; Sefc et al.

2005). In contrast, incomplete lineage sorting does not necessar-

ily make any predictions regarding the geographic distribution of

lineages. This method has merit if hybridization is ongoing and

dispersal is low, but it may not be applicable to all study systems. If species have high dispersal capabilities that facilitate gene flow between distant populations, then introgressed DNA might not be restricted to sympatry. A range expansion or positive selection can also cause introgressed DNA to spread beyond areas of sympatry. As such, finding shared lineages between allopatric populations does not necessarily reject hybridization as the cause of paraphyly.

Rigorous tests of explicit hypotheses are needed to distin-

guish between causes of paraphyly, and these tests need to ac- count for the stochastic variance of genetic processes (Knowles and Maddison 2002; Rosenberg and Nordborg 2002; Knowles

2004). The isolation-with-migration (IM) coalescent method ac- counts for genetic stochasticity by sampling from the range of genealogies that are consistent with the data (Nielsen and

Wakeley 2001; Hey and Nielsen 2004). Isolation-with-migration estimates six parameters, including effective population sizes of the ancestral population and each daughter population, im-

migration rates into each daughter population, and time since

divergence (Hey and Nielsen 2004). Using IM, the posterior distributions of migration rates can be examined to determine whether gene flow is necessary to explain phylogenetic patterns. Posterior distributions that include the lowest value of migra- tion examined suggest that the data are consistent with com-

plete isolation (e.g., Hey 2005; Peters et al. 2005a; Omland et al. 2006). In this case, paraphyly is consistent with incomplete lineage sorting. In contrast, posterior distributions that do not in- clude the lowest value examined (e.g., Won and Hey 2005) suggest that gene flow is necessary to explain phylogenetic patterns. Thus, IM provides a quantitative framework for evaluating the likelihood of gene flow in species exhibiting paraphyletic DNA.

STUDY TAXA-GADWALL AND FALCATED DUCK

In this study, we tested hypotheses regarding mitochondrial para- phyly of gadwalls (Anas strepera) with respect to falcated ducks

EVOLUTION AUGUST2007 1 9 9 3

This content downloaded from 128.123.94.166 on Thu, 23 Oct 2014 17:43:53 PMAll use subject to JSTOR Terms and Conditions

Page 4: Nuclear Loci and Coalescent Methods Support Ancient ...lithornis.nmsu.edu/~phoude/Peters_et_al_2007_Nuclear_loci_and... · original article doi: 10. 1111/j. 1558-5646.2007.00149.x

JEFFREY L. PETERS ET AL.

A mtDNA 1 1 B nDNAo 2-14

6 (CHD, LDH of ....e 1 5 6 1,0 ,2] ,2c; 1 351,2 ,4tfantit, 1 54 12Atlantic , , 2,

Vir,'a n)Ocea n Ocean

"

cto!Oe

,, cean.. 4O acean

n.. Ocean

40 65,18

ga1 d falcated 8 gadwall dc , 3 ~~duck 51

Sbreeding - 1 2 .

.;k...... %.," .jll, ?wintering lJ

[_- .'...i 3:r. . ..;,. sam pling .... . .- ,'i

Figure 1. Breeding and wintering ranges of gadwall and falcated ducks. Sampling location and sample sizes for (A) mtDNA and (B) introns (CHD, LDH) are shown in ellipses. (An additional mtDNA haplotype and CHD allele were sampled from a captive falcated duck.)

(A. falcata). The gadwall has a Holarctic distribution that extends across Europe, Asia, and North America, whereas the falcated duck is restricted to East Asia (Fig. 1). Within North America, 5.5% of 348 gadwalls had mtDNA (clade B) that was more sim- ilar to falcated ducks than to the majority of gadwalls (clade A), rendering gadwall paraphyletic with respect to falcated duck

(Peters and Omland 2007); none of the clade B haplotypes were shared between species. Clade B was geographically widespread throughout North America, but was not found in a small sample of gadwalls from Eurasia (Peters et al. 2005b). The relatively high frequency and widespread distribution of clade B in North Amer- ica seem consistent with incomplete lineage sorting, especially given the Asian distribution of falcated ducks. Furthermore, the mtDNA tree was consistent with a monophyletic falcated duck nested within gadwall (Peters and Omland 2007); widespread species having paraphyletic mtDNA relative to restricted-range species is consistent with expectations under models of recent

speciation via peripheral isolation (Harrison 1991; Omland 1997). However, gadwall and falcated ducks are known to hybridize in the wild (Johnsgard 1960), and introgression of mtDNA from fal- cated ducks into gadwalls could explain the pattern of paraphyly. Because these two species are each other's closest extant relatives

(Johnson and Sorenson 1999; Peters et al. 2005b), both incomplete lineage sorting and hybridization are viable hypotheses.

To test the cause of mitochondrial paraphyly between gad- walls and falcated ducks, we applied the approaches reviewed

above, but we made explicit predictions and tested those pre- dictions using coalescent models. Following species divergence, DNA progresses through various stages of polyphyly and para-

phyly before ultimately sorting to reciprocal monophyly (Neigel and Avise 1986; Omland et al. 2006). Thus if hybridization with falcated ducks resulted in the introgression of clade B haplotypes into gadwalls (i.e., mtDNA has otherwise sorted to reciprocal monophyly), then theory predicts a deeper species divergence (tINT; Fig. 2A) than predicted if clade B is an ancestral polymor- phism that has incompletely sorted (tiLs; Fig. 2B). The probability of monophyly, however, is also a function of effective population sizes (smaller populations will sort to monophyly more rapidly) and stochastic processes. Therefore, we first estimated the range of possible divergence times consistent with mtDNA sorting to

A Introgression B Incomplete lineage sorting

Present Clade A Clade B Clade A Clade B G G F G G F

TMRCA - -

Figure 2. Alternative hypotheses to explain mitochondrial para- phyly of gadwalls (G) with respect to falcated ducks (F). Thick lines show gene trees within species trees. (A) Clade B haplotypes in- trogressed into the gadwall gene pool after species divergence (tINT). (B) Clade B is found in both species as a result of incomplete lineage sorting following a more recent divergence (tlLS). TMRCA is the time to most recent common ancestor of all mtDNA haplo- types.

1 994 EVOLUTION AUGUST 2007

This content downloaded from 128.123.94.166 on Thu, 23 Oct 2014 17:43:53 PMAll use subject to JSTOR Terms and Conditions

Page 5: Nuclear Loci and Coalescent Methods Support Ancient ...lithornis.nmsu.edu/~phoude/Peters_et_al_2007_Nuclear_loci_and... · original article doi: 10. 1111/j. 1558-5646.2007.00149.x

TESTING CAUSES OF PARAPHYLY

reciprocal monophyly (tINT) and those consistent with incomplete sorting of ancestral polymorphisms (tiLS). Specifically, we used IM to jointly estimate effective population sizes and time since di-

vergence, while accounting for stochastic variance, to determine whether estimates of time since divergence differed between these two models. We then tested these predicted divergence times using two independent nuclear loci. Second, combining all loci to better account for stochastic among-locus variation, we estimated rates of gene flow to determine whether some introgression was nec-

essary to explain the observed phylogenetic patterns. Finally, we

compared falcated ducks to allopatric and to sympatric gadwalls to test the prediction that introgression rates would be higher in

sympatry.

Materials and Methods LABORATORY METHODS We sampled 42 gadwalls and 47 falcated ducks from areas of sym- patry in East Asia (Fig. lA) for comparisons with the 348 gad- walls sampled from North America by Peters and Omland (2007; GenBank accession numbers are given in online Supplementary Material). DNA was extracted from muscle or feathers using the

DNeasy tissue kit (Qiagen, Valencia, CA). For each individual, we sequenced 658-659 bp of the mtDNA control region using primers L78 and H774 (Sorenson and Fleischer 1996; Sorenson et al. 1999) following standard protocols (see McCracken et al.

2001). (Internal primers were used for some individuals; see on- line Supplementary Material.) Two haplotypes from Asian gad- walls and 10 haplotypes from falcated ducks had been published previously (Peters et al. 2005b); we also included nine published sequences from allopatric Eurasian gadwalls (Donne-Gouss6 et al. 2002; Peters et al. 2005b). Introgressed haplotypes shared with Eurasian wigeons (A. penelope) and an American black duck

(Anas rubripes) were excluded from this study (Peters et al. 2005b; Peters and Omland 2007).

For a subset of individuals, we sequenced a sex-linked intron from the chromo-helicase-DNA binding gene (CHD) located on the Z-chromosome (Fig. IB). We used primers 3007 and 3112 (Ellegren and Fridolfsson 1997) following published protocols (Peters et al. 2005b) or CHDb.F (5'-AACT- GTTCTACATACAGGTTCTACTCC-3') and CHDb.R (5'- AGTTCCTCTGCACCAAACTTTA-3') with an annealing temperature of 60?C; the latter primers yielded more consistent

amplifications. The overlapping portions of the two CHD frag- ments included a 270-271 bp intron and 43 bp of flanking exon. CHD sequences included 50 alleles from 34 North American

gadwalls, 50 alleles from 32 East Asian gadwalls, 14 alleles from 9 European/West Asian gadwalls, and 66 alleles from 40 falcated ducks (Fig. IB; 32 gadwall and 16 falcated duck alleles were published in Peters et al. 2005b). Because females carry

a single Z-chromosome, an unambiguous allele was amplified and sequenced. The gametic phase of alleles for males that were either homozygous or polymorphic at a single position could be resolved without ambiguity. However, five male falcated ducks and three male gadwalls were polymorphic at multiple positions, and gametic phases were ambiguous. For those males, we used PHASE version 2.1 (Stephens et al. 2001; Stephens and Donnelly 2003) to infer gametic phase (unambiguous alleles were included in analyses). Inferred alleles for four falcated ducks and two

gadwalls matched known alleles, but PHASE indicated novel alleles for one individual of each species. We used allele-specific primers to preferentially amplify and sequence one allele from those two individuals (see Peters et al. 2005b), and we subtracted this allele from the polymorphic sequence to resolve the phase of the other allele. (Allele-specific primer sequences are provided in online Supplementary Material.)

Seven falcated ducks contained a polymorphic indel in CHD. Locations of indels were easy to recognize, because the 5'-ends of sequences were unambiguous, whereas 3'-ends contained nu- merous double peaks (Jennings and Edwards 2005; also see on- line Supplementary Material). We compared the unambiguous 5'- ends to ambiguous 3'-ends between forward and reverse strands to determine indel length (see online Supplementary Material). Furthermore, because gaps resulted in shifted peaks in chro-

matograms, we could resolve gametic phase for other polymor- phisms within the sequence (see online Supplementary Material). We found one novel allele using this method; the remaining 13 alleles matched known alleles from unambiguous sequences.

We also sequenced intron 4 from the lactate dehydro- genase gene (LDH) using primers LDH4.F (5'-ATGGGAGC TTGTTCCTTCAA-3') and LDH4.R (5'-CTTCCTGCTGACG AACACCT-3') (Peters 2006). LDH sequences included 458-469

bp of intron sequence, including a variable T repeat (7-10 bp in length), and 69 bp of flanking exon. One falcated duck and four gadwalls were heterozygous for the T repeat, and resolv-

ing alleles as described above, all 10 alleles matched known alleles. One gadwall had two polymorphic sites, and we used allele specific primers to resolve alleles (online Supplementary Material). We sequenced LDH for 15 North American gad- walls (30 alleles), 12 Asian gadwalls (24 alleles), and 10 fal- cated ducks (20 alleles) (Fig. IB). All new sequences have been

deposited in GenBank (Accession Nos. EF537650-EF537858; EF541024).

SUMMARY STATISTICS

We tested whether the frequency of clade B haplotypes differed between North American and East Asian gadwalls using a two- tailed Fisher's exact test. For each locus, we calculated nucleotide

diversity (Tr) and haplotype diversity (h) in Arlequin version 2.001 (Schneider et al. 2000). We conducted analyses of molecular

EVOLUTION AUGUST2007 1 9 9 5

This content downloaded from 128.123.94.166 on Thu, 23 Oct 2014 17:43:53 PMAll use subject to JSTOR Terms and Conditions

Page 6: Nuclear Loci and Coalescent Methods Support Ancient ...lithornis.nmsu.edu/~phoude/Peters_et_al_2007_Nuclear_loci_and... · original article doi: 10. 1111/j. 1558-5646.2007.00149.x

JEFFREY L. PETERS ET AL.

variance (AMOVA, Excoffier et al. 1992) in Arlequin to examine

population structure between falcated ducks and gadwalls and be- tween North American and Asian gadwalls. We constructed hap- lotype networks for each locus using the median-joining algorithm in FLUXUS (Bandelt et al. 1999, www.fluxus-engineering.com). For all analyses, we treated each indel as a single polymorphism, regardless of its size.

COALESCENT ANALYSES- ISOLATION-WITH-MIGRATION We used a Bayesian MCMC method in the IM (Hey and Nielsen

2004) coalescent program to generate testable predictions and to test those predictions by simultaneously estimating six param- eters scaled to the neutral mutation rate: u: 0cG (effective pop- ulation size of gadwalls), 0 FD (effective population size of fal- cated ducks), 0A (effective size of the ancestral population at the time of divergence), t (time since divergence), mG (the immi-

gration rate for gadwalls; i.e., the rate at which falcated duck alleles enter the gadwall gene pool), and mFD (the immigration rate for falcated ducks). Although migration typically refers to movements of individuals between populations, the m parameter is better interpreted as the rate of DNA introgression for the pur- poses of this study. Fundamentally, there is no difference between these interpretations because both result in DNA moving between

gene pools, but for clarity, we hereafter refer to m as the rate of

introgression. Isolation-with-migration assumes no recombination within

loci and that loci are selectively neutral. There was no evidence for recombination in either CHD or LDH (four-gametes test; Hud- son and Kaplan 1985). Furthermore, an HKA test (Hudson et al.

1987) indicated that polymorphisms within and between gadwall and falcated ducks across the three loci fit expectations under neu-

trality. This result was robust to including or excluding gadwall clade B haplotypes in analyses (X 2 = 2.138, df = 4, P = 0.71 and

X 2 = 1.219, df = 4, P = 0.87, respectively). Following the approach used by Won and Hey (2005), we

first ran IM using large, flat priors for each parameter. Based on those runs, we defined narrower upper bounds that encompassed the full posterior distribution of each parameter. However, esti- mates of t and 0A sometimes contained distinct peaks, but the tails were flat and did not approach zero. In these cases, we defined up- per bounds based on other information obtained from preliminary runs (see below). We used a uniform (flat) prior distribution be- tween zero and the defined upper bound, which we assumed to be uninformative. Using those priors, we repeated runs with a burnin of at least 200,000 steps. To check that the program was converg- ing, we assessed autocorrelations and effective sample size (ESS) for each parameter throughout the run (Hey and Nielsen 2004). We ran the program until the lowest ESS was at least 150. We also repeated each run using a different random number seed to check

Table 1. Summary of data partitions used for generating and test- ing predictions for introgression and incomplete lineage sorting in IM.

Models of Falcated Gadwall divergence duck

Generating predictions Deep divergence mtDNA All populations- (introgression) mtDNA (clade

A only) Shallow mtDNA All populations- divergence mtDNA (clades (incomplete A & B) lineage sorting)

Testing predictions Isolation-with- CHD & LDH All populations- migration- CHD & nDNA LDH Isolation-with- mtDNA, CHD, & East Asia-mtDNA migration LDH (clades A & B), (sympatry) CHD, & LDH Isolation-with- mtDNA, CHD, & North America- migration LDH mtDNA (clades (allopatry) A & B), CHD,

& LDH

that multiple runs gave similar results. Details of each analysis are

provided below and are summarized in Table 1.

Generating predictions with mtDNA We first generated predictions regarding time since divergence based on two alternative hypotheses: mtDNA introgression and

incomplete lineage sorting. Under the introgression hypothesis, clade B became incorporated into the gadwall's gene pool af- ter they diverged from falcated ducks, but mtDNA has otherwise sorted to reciprocal monophyly. We used IM to estimate a range of

possible divergence times (tiNT; Fig. 2A) consistent with this hy- pothesis by comparing gadwall clade A and falcated duck mtDNA

(we excluded clade B gadwalls) using a model of isolation (only four parameters were estimated: 0FD, 0G, 0A, and t). Doing so caused the Markov chain to sample from parameter space con- sistent with mtDNA having sorted to reciprocal monophyly (the- oretically, deep divergence times and small effective population sizes). Because the posterior distribution of 0A was flat, we de- fined an upper bound for the prior by assuming that the ancestral

population size was not greater than the combined size of the two

daughter populations (OFD + OG); we summed the upper 95%

highest posterior distribution (HPD) for OFD

and 0G to set OA (upper bound = 110).

We estimated a range of possible divergence times under the incomplete lineage sorting hypothesis (tILS; Fig. 2B) by

1 9 9 6 EVOLUTION AUGUST 2007

This content downloaded from 128.123.94.166 on Thu, 23 Oct 2014 17:43:53 PMAll use subject to JSTOR Terms and Conditions

Page 7: Nuclear Loci and Coalescent Methods Support Ancient ...lithornis.nmsu.edu/~phoude/Peters_et_al_2007_Nuclear_loci_and... · original article doi: 10. 1111/j. 1558-5646.2007.00149.x

TESTING CAUSES OF PARAPHYLY

analyzing all gadwall (clades A and B) and falcated duck mtDNA with a model of strict isolation. Excluding the possibility of gene flow caused the Markov chain to sample from parameter space consistent with species still sharing ancestral polymorphisms (the- oretically, shallow divergence times and large effective population sizes). Using this model, the full posterior distribution for each pa- rameter was contained within the priors.

Testing predictions with nuclear introns To test predictions generated from mtDNA, we obtained indepen- dent estimates of time since divergence using the nuclear introns CHD and LDH (tnDNA). We excluded flanking exons from analyses because coding regions are more likely to be under direct selec- tion. We treated introns as HapSTRs, which are loci that have

single nucleotide polymorphisms (SNPs) linked to short-tandem

repeats (STR) that vary in length (Hey et al. 2004). We defined inheritance scalars in the input files to reflect modes of inheritance for CHD (0.75, sex-linked on the Z-chromosome) and LDH (1.0, biparentally inherited). By doing so, parameters estimated from CHD were readjusted by the program to be on the same scale as autosomal loci (Hey and Nielsen 2004). We analyzed nDNA

using a model of strict isolation and a model of isolation with

migration to evaluate the sensitivity of results to assumptions re-

garding gene flow. Using a model of isolation with migration, the

posterior distributions of 0A and t contained flat tails. We defined an upper bound for 0A by summing the upper 95% HPD of OFD and 0G as described above (upper bound = 2.5). To set an upper prior for t, we assumed that time since divergence could not be older than TMRCA (time to most recent common ancestor of all

sequences). Averaging the posterior distributions of TMRCA for the two loci, we used the upper 95% HPD of TMRCA as the upper bound for t (upper bound = 1.25). Using a model of isolation, the full posterior distributions of all parameters were encompassed by the priors.

We compared the posterior distributions of tnDNA to tINT and

tILS to determine whether we could reject one of the divergence times estimated from mtDNA. Because mtDNA and nDNA evolve at different rates, we converted estimates of t to biologically in- formative values, using the equation t = Tu, where T is time in

units of years and u is the neutral mutation rate for the locus. Because our mutation rates were calibrated as the number of sub-

stitutions per year (see below), no corrections were needed for

generation time. For the intron dataset, u was the geometric mean of the mutation rates for the two loci. HapSTR loci require two mutation rate calibrations: one for the STR portion and one for the sequence portion. Because we did not have calibrations for the STR portion, we used mutation rate scalars estimated in IM

(Hey and Nielsen 2004; Won et al. 2005). For conversions, we

multiplied t by the geometric mean of the mutation rate scalars for the sequence portions of loci, and then divided this value

by the geometric mean of calibrated mutation rates (Won et al.

2005).

Testing predictions using all loci Isolation-with-migration can better account for the stochasticity of mutation and lineage sorting when all loci are examined simul-

taneously. Therefore, we further tested predicted divergence times

using a model of isolation with migration that included mtDNA

(both clade A and clade B haplotypes), CHD, and LDH (Table 1). We also made two predictions regarding the migration parameter, m, under a hypothesis of mtDNA introgression. First, the 95% HPD of m would exclude zero and hence reject the possibility of

"no introgression." Second, we predicted that introgression rates would be higher where gadwalls and falcated ducks are sympatric. We treated allopatric North American gadwalls and sympatric East Asian gadwalls in separate analyses. We defined inheritance scalars as 0.25 for mtDNA, 0.75 for CHD, and 1.0 for LDH to reflect their modes of inheritance (Hey and Nielsen 2004). In both

analyses, we used the sum of the upper 95% HPDs of OFD and G

to set upper bounds for 0A (upper bound = 4) and the upper 95% HPD of TMRCA (averaged across loci) for t (upper bound = 1.9) (see above).

Isolation-with-migration can also record the time of any intro-

gression events detected within genealogies sampled throughout the run ("time of migration," Won and Hey 2005). We examined time of migration to determine whether any introgression was more consistent with recent or past hybridization.

MUTATION RATE CALIBRATIONS For mtDNA, we used the control region mutation rate calibrated

by Peters et al. (2005a) of 4.8 x 10-8 substitutions/site/year (s/s/y; range = 3.1 x 10-8 -6.9 x 10-8 s/s/y). This calibration was based on relative rates of control region to mtDNA coding regions in

multiple duck taxa. The rate for coding regions was calibrated by Paxinos et al. (2002) using fossil evidence that the goose genera Anser and Branta diverged during the Pliocene (four to five million

years ago [mya]). We calibrated mutation rates for CHD and LDH introns based

on the goose-duck split (subfamilies Anserinae and Anatinae, re-

spectively); hence our intron calibrations are independent of the mtDNA calibration. Fossil records suggest that these subfamilies

likely diverged during the Oligocene (23-38 mya; Howard 1964). Fossils assigned to each subfamily have been recovered from the

Oligocene and afterwards (Howard 1964; Livezey 1996, 1997b; Louchart et al. 2005), whereas earlier fossils have been assigned to lineages sister to Anatidae (Livezey 1997a; Ericson 1997; Dyke 2001; Clarke et al. 2005). We used d = 2u7r for calibrations, where d is the average percent sequence divergence between ducks and

geese (Subfamilies Anserinae and Anatinae), u is the mutation

EVOLUTION AUGUST 2007 1 9 9 7

This content downloaded from 128.123.94.166 on Thu, 23 Oct 2014 17:43:53 PMAll use subject to JSTOR Terms and Conditions

Page 8: Nuclear Loci and Coalescent Methods Support Ancient ...lithornis.nmsu.edu/~phoude/Peters_et_al_2007_Nuclear_loci_and... · original article doi: 10. 1111/j. 1558-5646.2007.00149.x

JEFFREY L. PETERS ET AL.

rate, and r is the time of divergence. We estimated d between Anserinae and Anatinae for 20 pairwise comparisons for CHD

(n = 2 Anserinae, n = 10 Anatinae; J. L. Peters et al. 2005b; Humphries and Peters unpubl. data, this study) and 40 pairwise comparisons for LDH (n = 4 Anserinae, n = 10 Anatinae; Kraft et al. 1993, St. John et al. 2005; A. Logie and J. L. Peters, un- publ. data, this study; see online Supplementary Material). We corrected d using the best-fit mutation model that was selected

using Akaike Information Criterion (AIC) in ModelTest version 3.7 (Posada and Crandall 1998). For CHD, the TVM model was

selected, which includes a general time reversible model of substi- tution (A-C, A-G, A-T, C-G, C-T, and G-T = 0.6, 2.4, 0.5, 2.8, 2.4, and 1.0, respectively). For LDH, the TrN model was selected, which includes different rates for transversions (1.0) and for A-G and C-T transitions (3.9 and 6.1, respectively). When multiple sequences were available for a single species, we calculated the

average value of d. As a best approximation to mutation rates, we defined T as the midpoint of the Oligocene (30.5 mya) and used the average d among all pairwise comparisons between subfami- lies. To set a lower bound, we used the lowest value of d and the

beginning of the Oligocene (38 mya); to set an upper bound, we used the maximum value of d and the end of the Oligocene (23 mya). Thus, our calibrations incorporate uncertainty in the exact

divergence date between subfamilies and stochastic variance in mutations.

To test the reliability of rate calibrations, we compared time since divergence estimated from introns and mtDNA coding re-

gions. In those comparisons, we grouped gadwalls and falcated

ducks, because divergence between those species is the parameter of interest in this study. For both CHD and LDH, we estimated time since divergence for 36 nonindependent pairwise comparisons within Anatinae (Kraft et al. 1993; Peters et al. 2005b, unpubl. data, this study). For species that had multiple available sequences, we calculated average sequence divergences. For mtDNA, we es- timated time since divergence using published cytochrome b and ND2 sequences (Johnson and Sorenson 1999; see online Supple- mentary Material) and a divergence rate of 2.1 % per million years (Paxinos et al. 2002). We used PAUP* (Swofford 1999) to correct

pairwise divergences for mtDNA using the TIM + I + F model chosen using AIC in ModelTest version 3.7 (Posada and Crandall 1998). This model includes a general time reversible substitution model (A-C, A-G, A-T, C-G, C-T, and G-T = 3.0, 106.1, 2.0, 0.2, 30.3, and 1.0, respectively), invariant sites (I = 0.71), and a

gamma distribution (F = 5.1). We plotted time since divergence estimated from introns against time since divergence estimated from mtDNA to determine whether there was general agreement between markers. We also performed a Mantel test with 100,000 permutations in the program ZT (Bonnet and de Peer 2002) to determine whether there was a significant positive correlation be- tween those estimates.

A mtDNA B nDNA - CHD

Clade 8

12 substitutions & 1 indel

C nDNA- LDH

Clade A/ * 1bp*

,.- , l 7bp

70 I -

falcated duck - nucleotide substitution gadwall - North America - - insertion/deletion

* gadwall - East Asia hypothetical unsampled * gadwall - Europe/West Asia haplotype/allele

Figure 3. Haplotype networks for (A) mtDNA, (B) CHD, and (C) LDH. Circle area is proportional to the number of shared sequences. Indel sizes are shown next to the corresponding branch; an asterisk (*) indicates STR portions that were included in coalescent analy- ses. Note that one clade B haplotype and one CHD allele (sampled from different individuals) from Asian gadwalls were shared with falcated ducks, whereas none of the clade B haplotypes sampled from North America were shared with falcated ducks.

Results PATTERNS OF GENETIC DIVERSITY Within North America, 19 of 348 gadwalls (5.5%) had clade B mtDNA haplotypes that grouped with falcated ducks to the exclu- sion of gadwall clade A haplotypes (Fig. 3A; Peters and Omland

2007). All 47 falcated ducks had clade B haplotypes (h = 0.68, ar =

0.0020), none of which were shared with North American gad- walls (Fig. 3A). Clade B haplotypes were strongly structured be- tween falcated ducks and North American gadwalls (PST = 0.47; P < 0.00001). One of the 42 gadwalls from East Asia also had a clade B haplotype (2.4% of haplotypes; the frequency of clade B did not differ significantly between Asia and North America; P =

0.71). The Asian clade B haplotype differed from North American

clade B by 1-6 bp and was identical to the most common falcated duck haplotype (Fig. 3A). The remaining 41 East Asian gadwalls

1998 EVOLUTION AUGUST 2007

This content downloaded from 128.123.94.166 on Thu, 23 Oct 2014 17:43:53 PMAll use subject to JSTOR Terms and Conditions

Page 9: Nuclear Loci and Coalescent Methods Support Ancient ...lithornis.nmsu.edu/~phoude/Peters_et_al_2007_Nuclear_loci_and... · original article doi: 10. 1111/j. 1558-5646.2007.00149.x

TESTING CAUSES OF PARAPHYLY

had clade A haplotypes (h = 0.89, rr = 0.0028). (Within clade A, mtDNA was significantly structured between hemispheres

[srT = 0.12; P < 0.00001].) For CHD, we found 19 different alleles among the 66 fal-

cated duck (h = 0.74, - = 0.0043) and 114 gadwall (h = 0.43,

T = 0.0018) sequences. Ten alleles were unique to falcated ducks and eight alleles were unique to gadwalls. Only one allele was shared between species-one allele from an East Asian gadwall was identical to 12 falcated duck alleles (Fig. 3B). CHD alle- les were strongly structured between falcated ducks and gadwalls (Fig. 3B; (ST =

-0.71; P < 0.00001). For LDH, we found eight dif-

ferent alleles among the 20 falcated duck (h = 0.19, w = 0.00043) and 54 gadwall (h = 0.49, a = 0.0012) sequences. Two fixed in- dels between the three falcated duck and the five gadwall alleles resulted in LDH being strongly structured (Fig. 3C; srT = 0.81; P < 0.00001). (Both CHD and LDH were also structured between North American and Asian gadwalls [CHD: OSTr = 0.088; P = 0.004; LDH: (sr = 0.23; P < 0.00001].)

INTRON MUTATION RATE CALIBRATIONS On average, CHD alleles from ten species of Anatinae were 8.9%

divergent from Anserinae alleles (range = 7.5% to 10.2%). Using the midpoint of the Oligocene, 30.5 mya, we estimated a mutation rate for CHD of 1.5 x 10-9 s/s/y. Average divergence for LDH alleles was 12.1% (range = 10.9% to 14.1%) between subfami- lies suggesting a mutation rate of 2.0 x 10-9 s/s/y. Incorporating possible sources of error, we defined a range of plausible mutation rates for CHD (1.0 x 10-9-2.2 x 10-9 s/s/y) and LDH (1.4 x

10-9 -3.1 x 10-9 s/s/y). Despite using a different calibration point (goose-duck split) than that used for mtDNA (Anser-Branta split),

S 16- CHD -- c o 14- LDH ---

1- 2 -? S.- 12

a ) i(A 0-

U c 0 6 Ce o 4 - g ' O

0 2 4 6 8 10 12 14 16 time since divergence (millions of years) -

mtDNA

Figure 4. Correlation between time since divergence estimated from mtDNA coding regions and two nuclear introns. The solid line is the expected line for a perfect correlation. Dashed lines show the correlation between mtDNA and introns with the origin forced through zero. There is a general agreement between divergence times estimated from different markers.

there was general agreement between divergence times estimated from nuclear introns and mtDNA (Fig. 4). We found significant positive correlations between mtDNA and CHD (r = 0.80, P =

0.00008), between mtDNA and LDH (r = 0.92, P = 0.0001), and between CHD and LDH (r = 0.62, P = 0.0006).

COALESCENT ANALYSES-ISOLATION-WITH-MIGRATION Time since divergence Estimates of time since divergence differed and 95% HPDs did not overlap between the two mtDNA models (Fig. 5A; see Tables 2 and 3 for the full sets of parameters estimated from the various IM models). Given mutation rate estimates, gadwalls and falcated ducks most likely diverged about 290,000 ybp (tiNT = 9.1) under a model that assumed mtDNA has sorted to reciprocal monophyly

TILS (lineage sorting) 0.005 A mtDNA 0.004 - TINT (introgression)

0.003- 0.002 ' i 0.001

C, 0

0- 1 10 100 1000 10000

C I TnDNA(I) a 0.007 BnDNA DBnDNA

0.006 0.005

cn 0.004 T S 0.003 TnDNA(IM) 0.002 0.001

1 10 110 1000 10000

0005 T 0.005 C mtDNA + nDNA Tallopatry 0.004

, sympatry

0.003 - 0.002 - 0.001

0? w1

1 10 100 1000 10000 Time since divergence

(T; thousands of years; log scale)

Figure 5. Time since divergence (T; t = Tu): posterior distributions

(thick, solid lines). Lower and upper 95% HPDs adjusted using fast and slow mutation rate calibrations, respectively, are shown by the thinner lines. (A) T predicted from mtDNA under hypotheses of in- complete lineage sorting (TILS) and mtDNA introgression (TINT). (B) T estimated from nDNA under models of strict isolation (TnDNA(i)) and isolation with migration (TnDNA(IM)). (C) T estimated from the three loci combined, comparing falcated ducks to Asian (Tsympatry) and North American (Tallopatry) gadwalls using a model of isolation- with-migration. The vertical dashed line indicates the threshold for rejecting a hypothesis of incomplete lineage sorting (upper 95%

HPDadj for TB). (See online Supplementary Material for posterior distributions of other parameters.)

EVOLUTION AUGUST 2007 1 999

This content downloaded from 128.123.94.166 on Thu, 23 Oct 2014 17:43:53 PMAll use subject to JSTOR Terms and Conditions

Page 10: Nuclear Loci and Coalescent Methods Support Ancient ...lithornis.nmsu.edu/~phoude/Peters_et_al_2007_Nuclear_loci_and... · original article doi: 10. 1111/j. 1558-5646.2007.00149.x

4 m 3111 ~1 3a m ?e r ?b c3 12~ 5a v,

m c3

r(

0CD 0- 0 0 o" 0 H0 o0 ?o o st -oI Or~o y

N ot l?

3 O("D (D (D r5 CD 3 C <0- p C o CD' a bC C-D C-D nDCc v DCDt, =t O 5 C V s : D r r a 0 z CD P-CD V C) f o- -cm " ? ~ a a p CD NE f of D ; O s rD t O O CPr lO , 3 p~ rv Yl cr l 3 C

CD P) 00 ? CD P.C

rD P o0, 55 > I CD z C, . :z (D CD CD CD~nw C ~O D

0D WCD 00

~< 0~ 0CD P- CDSo ~ ~ <

CD Q 0--0 CD CDE CD n CD CD a~Dc ~ r;c

0- 0 a~ J CD C0

p n ? 3 CD CD ?z c-, ,3cDCQ~~ SCD p) rD CD C). CD r),r r Ct QV CD P.(0 (D CD CD 3 o a o s9 C "CI E 3

l I I -

CD E3

0 5 -1 C. a 0~~0 ~ '0 C0L~ 0'

kC tj ?Z q 0 =1 *" Cd C C?, r. Vv

0 CL 0-0 f Y 000 a 0 00 E3 CD CD CD CD V, (D~?CD CD Q p CDDo 0 C CD , t. "r b 5aCC CD E3 ctP P CD l ?zr Ef` cr cr n t fi bCTI mc CJ)CD cr cr.c e~P)O3 ~ OO CDO CD ( t CC 0

CL 0-. f-+ ?z r- CDd ~ a aCD CD p CDv Cf CD rD CD; I s o C cr ~=t Y "vi cr CDE3 CD , p~

rD CD CD CD C?Q v Cr Os pr oz, CD ?7n1 Eo3 p p a v ua g ?7 o 1 - CD E3 Cd ,3 O CD nJ

Table 2. Parameters scaled to the neutral mutation rate u estimated from the various IM models (95% HPD).

Models of divergence 0FD 0G 0A t mED MG Mutation scalar1 1

Deep divergence (introgression 22 43 0.06 9.1 - - - 3.2x 10 of clade B)-mtDNA (11-40) (31-61) (-0-101) (2.6-14) (2.0x 105-4.5x 10-)

Shallow divergence (incomplete 25 59 17 0.88 - - - 3.2x 10- lineage sorting)-mtDNA (11-53) (40-89) (7.2-43) (0.51-1.6) (2.0x 10-5-4.5x 10-)

Isolation with 0.49 0.28 0.001 0.27 0.004 0.39 1.84 6.1 x 10- migration-nDNA (0.11-1.7) (0.08-0.78) (--0-2.2) (0.11-1.2) (--0-1.8) (--0-3.5) (4.2x 10-7-9.2x 10-7

Isolation-nDNA 1.2 0.39 0.16 0.19 - - 1.62 6.1 x 107 (0.35-4.4) (0.16-0.99) (0.004-1.1) (0.09-0.40) (4.2 x 10-7-9.2x 10-7

Isolation with migration 0.95 0.63 0.15 0.44 0.0035 0.47 2.10 2.3x 10-6

(sympatry)-all loci (0.31-2.1) (0.20-1.5) (--0-3.5) (0.17-1.7) ('--0-0.84) (0.046-2.7) (1.5x 10-6-3.4x 10-6 Isolation with migration 1.1 0.89 0.34 0.42 0.0035 0.30 2.59 2.3 x 10-6

(allopatry)-all loci (0.47-2.2) (0.39-1.7) ('-0-3.1) (0.17-1.0) (-0-0.64) (0.011-1.4) (1.5x 10-6-3.4x 10-6

1 Geometric mean for the sequence portions of loci in models that included multiple loci; u is the mutation rate per locus, not per site.

NJ 0 0

0

z

This content downloaded from 128.123.94.166 on Thu, 23 Oct 2014 17:43:53 PMAll use subject to JSTOR Terms and Conditions

Page 11: Nuclear Loci and Coalescent Methods Support Ancient ...lithornis.nmsu.edu/~phoude/Peters_et_al_2007_Nuclear_loci_and... · original article doi: 10. 1111/j. 1558-5646.2007.00149.x

z

C12

0

Table 3. Parameters estimated from the various IM models converted using calibrated mutation rates1.

Models of divergence Ne-effective number of individuals T-ybp M-migrants per generation O=4Neu2 t=Tu M=Om/2

Falcated duck Gadwall Ancestral Falcated duck Gadwall

Deep divergence (introgression 58,000 110,000 160 290,000 of clade B)-mtDNA (20,000-160,000) (57,000-250,000) (~0-410,000) (65,000-700,000)

Shallow divergence (incomplete 65,000 160,000 46,000 28,000 lineage sorting)-mtDNA (20,000-220,000) (74,000-360,000) (13,000-180,000) (11,000-76,000)

Isolation with 120,000 70,000 330 820,000 0.001 0.05 migration-nDNA (18,000-640,000) (13,000-290,000) (--0-810,000) (220,000-5,300,000) (~0-1.6) (~0-1.4)

Isolation-nDNA 270,000 86,000 35,000 510,000 (51,000-1,400,000) (24,000-320,000) (590-360,000) (150,000-1,600,000)

Isolation with migration 73,000 49,000 12,000 410,000 0.002 0.15 (sympatry)-all loci (16,000-240,000) (10,000-170,000) (~0-400,000) (110,000-2,300,000) (-~0-0.88) (0.005-2.0)

Isolation with migration 100,000 85,000 32,000 480,000 0.002 0.21 (allopatry)-all loci (30,000-310,000) (25,000-240,000) (-0-440,000) (130,000-1,800,000) (~0-0.70) (0.009-2.3)

1 For Ne and T, uncertainty in mutation rates was incorporated into conversions by using the upper bound of mutation rates (fast rate) to convert lower 95% HPDs and using the lower bound (slow rate) to convert upper 95% HPDs (see Table 2); 95% HPDs for M were adjusted using the 95% HPD of 0. 2Estimates of Ne assume an average generation time of three years (Peters 2006). This same formula was used for converting estimates from mtDNA because defining an inheritance scalar of 0.25 resulted in 0 being adjusted to the same scale as autosomal loci.

0

z

oJ 0

This content downloaded from 128.123.94.166 on Thu, 23 Oct 2014 17:43:53 PMAll use subject to JSTOR Terms and Conditions

Page 12: Nuclear Loci and Coalescent Methods Support Ancient ...lithornis.nmsu.edu/~phoude/Peters_et_al_2007_Nuclear_loci_and... · original article doi: 10. 1111/j. 1558-5646.2007.00149.x

JEFFREY L. PETERS ET AL.

0.010 A mG 0.008

< 0.006 . 0.004 -o 0.002 .-

0.000 0 1 2 3 4 5 6

o 0.040 B mFD

-;-- allopatry c 0.030 - sympatry 0.020

0.010

0.000 0 1 2 3 4 5 6

introgression rates (m)

Figure 6. Introgression rates scaled to the neutral mutation rate, u, estimated using all three loci. (A) The introgression rate of fal- cated duck DNA into gadwalls (mG), and (B) the introgression rate of gadwall DNA into falcated ducks (mFD).

Discussion MITOCHONDRIAL PARAPHYLY-INTROG RESSION OR INCOMPLETE LINEAGE SORTING? Mitochondrial paraphyly is common among animal taxa, but few studies have attempted to quantitatively differentiate between

possible causes of paraphyly, especially by accounting for the stochastic variance of lineage sorting (Funk and Omland 2003). In this study, we accounted for this stochasticity, as well as uncer- tainties in mutation rates, and rejected incomplete lineage sorting as the cause of mitochondrial paraphyly of gadwalls with respect to falcated ducks. Two lines of evidence support introgression. First, analyzing three independent loci with an isolation-with-migration

0.18 0.1 -- allopatry: mtDNA

0.16 - sympatry: mtDNA C 0.14 0.14 sympatry: CHD -0 " 0.12 o

o0.10o o 0.08 CL 76 0.06

.-0.04 2 0.02

0.00 0.001 0.01 0.1 1

time of migration (log scale)

Figure 7. Posterior distributions of time of migration (i.e., in- trogression; scaled to the neutral mutation rate) for those loci that showed evidence of introgression. Both mtDNA and CHD are consistent with recent and/or ongoing hybridization in sympatry, whereas mtDNA is more indicative of ancient introgression from falcated ducks to allopatric gadwalls in North America.

coalescent model indicated that some gene flow is necessary to

explain patterns of genetic diversity in both North American and Eurasian gadwalls. Specifically, the 95% HPD of introgression rates did not overlap zero in either comparison. Importantly, this result is not sensitive to mutation rates, and simultaneously ana-

lyzing three loci at least partially accounted for stochastic among- locus variance.

Second, lineage sorting is nearly complete for the two nu- clear introns examined, despite mitochondrial paraphyly. LDH contained two fixed differences between species, and CHD con- tained one nearly fixed difference (only one allele was shared; Fig 3). Qualitatively, these among-locus comparisons suggest in-

trogression of mtDNA as the cause of paraphyly, because lineage sorting should proceed more rapidly in mtDNA. Formally testing this hypothesis using coalescent methods resulted in quantitatively rejecting incomplete lineage sorting for mtDNA. Specifically, the

strong between-species differences in nDNA suggested that gad- walls and falcated ducks diverged between 220,000 and 5,300,000 ybp (Middle Pleistocene or earlier). However, for incomplete lin-

eage sorting to explain the patterns in mtDNA, then these species likely diverged between 11,000 and 76,000 ybp (Late Pleistocene), a date that does not overlap that estimated from nDNA. In contrast, a model that assumed introgression of clade B into the gadwall gene pool suggested that these species diverged between 65,000 and 700,000 ybp (Middle-Late Pleistocene), a date that is consis- tent with that estimated from nDNA. In other words, even after

accounting for the stochastic variance of genetic processes (and uncertainties in mutation rates), nDNA is too deeply structured for the pattern observed in mtDNA to be explained by incomplete lineage sorting.

Although comparing estimates of time since divergence de-

pends on accurate mutation rates, three observations suggest that our conclusions were not sensitive to possible calibration errors.

First, we adjusted our converted parameters by incorporating a

range of plausible mutation rates that incorporate both uncer-

tainty in outgroup divergence times and stochastic variance in mutations. Second, we found a strong agreement in divergence times estimated from mtDNA and from introns among waterfowl

species, despite using different fossil data for rate calibrations.

Third, rejecting a hypothesis of no gene flow (see above) was not dependent on mutation rates. Therefore, rejecting incomplete lineage sorting seems to be well supported despite depending on mutation rates for some analyses.

DNA INTROGRESSION There was no evidence that introgressed DNA was more abun- dant in Asia where gadwalls and falcated ducks are sympatric. Peters and Omland (2007) estimated that between 77,000 and 190,000 North American gadwalls had clade B haplotypes and those haplotypes were widely distributed (i.e., Alaska, California,

2002 EVOLUTION AUGUST 2007

This content downloaded from 128.123.94.166 on Thu, 23 Oct 2014 17:43:53 PMAll use subject to JSTOR Terms and Conditions

Page 13: Nuclear Loci and Coalescent Methods Support Ancient ...lithornis.nmsu.edu/~phoude/Peters_et_al_2007_Nuclear_loci_and... · original article doi: 10. 1111/j. 1558-5646.2007.00149.x

TESTING CAUSES OF PARAPHYLY

Utah, Alberta, North Dakota, Arkansas, Quebec, Delaware, and

Maryland). A combination of factors likely contributed to the rela-

tively high frequency and widespread distribution of clade B. First, estimates of time of migration suggested that clade B in North America resulted from ancient introgression ('-14,000 ybp). Al-

though closely related, none of the clade B haplotypes were shared between falcated ducks and North American gadwalls, and these data are inconsistent with recent, ongoing hybridization. Second, mtDNA supported a rapid historical population expansion in North America (Peters and Omland 2007), and observational data have revealed substantial population and range expansions during the

past century (LeSchack et al. 1997). Therefore, a likely explana- tion for the strong incorporation of clade B in North America is that introgression occurred when the gadwall population size was much smaller than it is today and periods of expansion facilitated the spread of clade B.

In sympatry, hybridization seems to have been recent and/or ongoing, yet we found no evidence of introgression from gadwalls to falcated ducks. All analyses that included the migration param- eter suggested asymmetrical gene flow (Table 2); DNA seemingly has introgressed from falcated ducks to gadwalls more often than the reverse (although 95% HPDs for m were broadly overlap- ping). One hypothesis is that this asymmetry resulted from differ- ences in species' range sizes, coupled with male-biased sex ratios

(Kulikova et al. 2004). Because of their restricted distribution, va-

grant female falcated ducks sometimes might randomly disperse to areas where male falcated ducks are absent. In contrast, gad- walls are widely distributed, and by random chance, female gad- walls will be more likely to disperse to areas where conspecific males are present. Waterfowl sex ratios tend to be male-biased

(Baldassarre and Bolen 1994), so unpaired males may mate with females of either species rather than forego breeding. Under this

scenario, most hybridization would involve female falcated ducks and male gadwalls, and also as a consequence of differences in

population size and geographic distributions, hybrid offspring would be more likely to mate with gadwalls. If this scenario is

correct, then introgressed DNA should also be present in other Eurasian populations of gadwalls, especially those that are para- patric with falcated ducks.

ISOLATION-WITH-MIGRATION ASSUMPTIONS

Isolation-with-migration makes a number of implicit assumptions, including selective neutrality, no recombination, that the two pop- ulations being compared are each panmictic and randomly sam- pled, that those populations are not exchanging genes with other

populations or species, and that population sizes and migration rates have been constant over time (Hey and Nielsen 2004; Won et al. 2005). These assumptions will be violated to some extent in most (if not all) systems, but the sensitivity of IM results to these violations needs additional study (see Hey 2005). In this study,

Asian and North American gadwalls were significantly structured at all three loci suggesting at least three distinct populations of interest: falcated duck, Asian gadwall, and North American gad- wall. There also exists significant mtDNA structure within North America, suggesting the possibility of additional populations (Pe- ters and Omland 2007). However, IM can only accommodate two

populations, leaving two options for handling structure: (1) lump- ing multiple populations as a single population, which violates the assumption of panmixia, or (2) treating each population in

separate analyses, which would likely violate the assumption of no gene flow with other populations. We obtained similar results

regarding time since divergence when lumping North American and Asian gadwalls in the same analyses (i.e., nDNA) and when

treating them in separate analyses (i.e., analyzing all three loci to-

gether). Therefore, the main conclusions of this study do not seem

overly sensitive to violations of assumptions regarding population structure.

The assumption of constant population sizes was also likely violated in this study. All three populations likely experienced recent population growth (Table 2; Peters and Omland 2007). Be- cause lineage sorting is a function of effective population size, as-

suming constant population sizes could potentially bias parameter estimates when species experienced more complex demographies. Isolation-with-migration can incorporate population growth into models of divergence, but this model assumes a constant rate of

exponential growth (Hey 2005), which may also be unrealistic for

many species. However, allowing for growth in one of our mod- els (Asian gadwalls vs. falcated ducks) suggested that gadwalls and faclated ducks most likely diverged about 380,000 ybp (95% HPDadj = 110,000 to 1,100,000 ybp; data not shown), which is similar to time since divergence estimated when assuming con- stant population sizes. Therefore, time since divergence, in this

case, is robust to changing the assumption of past population de-

mography. Isolation-with-migration assumes that introgression rates

have been constant since species divergence, but hybridization between gadwalls and falcated ducks may be a recent phe- nomenon that occurred after secondary contact. Indeed, estimates of "time of migration" suggested that introgression has occurred

recently relative to species divergence, probably after the last

glacial maximum. If migration has not been constant, but instead these species were strictly isolated during most of their history and only recently came into secondary contact, then current in-

trogression rates might be much higher than estimated in this

study.

USING nDNA TO TEST HYPOTHESES OF MITOCHONDRIAL PARAPHYLY

Comparing nDNA to mtDNA offered strong resolution for dis-

tinguishing between incomplete lineage sorting and introgression

EVOLUTION AUGUST2007 2003

This content downloaded from 128.123.94.166 on Thu, 23 Oct 2014 17:43:53 PMAll use subject to JSTOR Terms and Conditions

Page 14: Nuclear Loci and Coalescent Methods Support Ancient ...lithornis.nmsu.edu/~phoude/Peters_et_al_2007_Nuclear_loci_and... · original article doi: 10. 1111/j. 1558-5646.2007.00149.x

JEFFREY L. PETERS ET AL.

in this study. Furthermore, two nuclear introns were sufficient to

reject a hypothesis of incomplete lineage sorting, despite being a smaller sample size than other similar studies (e.g., Won and

Hey 2005; Won et al. 2005). The signal in those introns was in- creased by using information provided by the STR portions of

sequences (Won et al. 2005). For example, LDH contained two fixed indels (one of which was part of an STR) between species but no fixed differences in nucleotide substitutions (Fig. 3C). The

deep species divergence time relative to the recent introgression of mtDNA also likely contributed to the strong resolution provided by only two introns. Because the stochastic variance of genetic pro- cesses results in large confidence intervals, especially when few loci are studied, systems exhibiting smaller differences in diver-

gence times or more complex patterns of paraphyly will require more loci to reject alternative hypotheses (Edwards and Beerli

2000). Our approach of using multiple loci and coalescent methods

can be applied to any paraphyletic system. However, it is impor- tant to appreciate that introgression and incomplete lineage sort-

ing are not necessarily mutually exclusive hypotheses, and both

might have contributed to paraphyly within some systems (e.g., Kulikova et al. 2005; Sefc et al. 2005; Won et al. 2005). Under such a scenario, it might be difficult to generate and test predic- tions that unequivocally demonstrate that both have contributed to paraphyly.

In conclusion, despite some potentially limiting assumptions of IM, our approach provides a quantitative framework to bet- ter understand causes of gene tree paraphyly between closely re- lated species. Furthermore, this is the first study to incorporate the stochastic variance of genetic processes and reject incomplete lineage sorting as the cause of paraphyly. Finally, this study illus- trates a compelling case of ancient hybridization that has lead to DNA crossing species boundaries and becoming strongly incor-

porated into the gene pool of the recipient.

ACKNOWLEDGMENTS We thank J. Bell and B. Alemi of the UMBC Department of Mathe- matics and Statistics and the UMBC Bioinformatics Research Center for providing computer resources. S. Freeland and N. Keulman, provided ad- ditional computer resources and consultation. The Academy of Natural Sciences, the University of Washington Burke Museum, and the Delaware Museum of Natural History contributed tissues from museum specimens. M. Maystrenko, P. Yung, and N. Yakovchits assisted with field work in Russia, and E. Humphries helped with laboratory work. T. Roberts, R. Fleischer, J. Leips, P. Sokolove, S. Freeland, B. Kondo, K. McCracken, and five anonymous reviewers provided valuable feedback during vari- ous stages of this study. We also thank the US Fish and Wildlife Service for issuing import permits. This research was funded by Delta Waterfowl Foundation; KEO was supported by National Science Foundation DEB- 0347083. Alaska EPSCoR (EPS-0346770) provided salary for JLP during the final stages of revisions.

LITERATURE CITED Avise, J. C. 2000. Phylogeography: the history and formation of species. Har-

vard Univ. Press, Cambridge, MA. Avise, J. C., J. Arnold, R. Ball, E. Bermingham, T. Lamb, J. E. Niegel, C. A.

Reeb, and N. C. Saunders. 1987. Intraspecific phylogeography: the mi- tochondrial bridge between population genetics and systematics. Annu. Rev. Ecol. Syst. 18:489-522.

Baker, J. M., E. L6pez-Medrano, A. G. Navarro-Sigtienza, O. R. Rojas-Soto, and K. E. Omland. 2003. Recent speciation in the Orchard Oriole group: divergence of Icterus spurius spurius and Icterus spurius fuertesi. Auk 120:848-859.

Baldassarre, G. A., and E. G. Bolen. 1994. Waterfowl ecology and manage- ment. John Wiley & Sons, Inc. New York.

Ballard, J. W. 2000. When one is not enough: introgression of mitochondrial DNA in Drosophila. Mol. Biol. Evol. 17:1126-1130.

Bandelt, H.-J., P. Forster, and A. R6hl. 1999. Median-joining networks for

inferring intraspecific phylogenies. Mol. Biol. Evol. 16:37-48. Bonnet, E., and Y. V. de Peer. 2002. ZT: a software tool for simple and partial

Mantel tests. J. Stat. Software 7:1-12. Brumfield, R. T., R. W. Jernigan, D. B. McDonald, and M. J. Braun. 2001.

Evolutionary implications of divergent clines in an avian (Manacus: Aves) hybrid zone. Evolution 55:2070-2087.

Buckley, T. R., M. Cordeiro, D. C. Marshall, and C. Simon. 2006. Differ- entiating between hypotheses of lineage sorting and introgression in New Zealand alpine cicadas (Maoricicada Dugdale). Syst. Biol. 55:411- 425.

Clarke, J. A., C. P. Tambussi, J. I. Noriega, G. M. Erickson, and R. A. Ketcham. 2005. Definitive fossil evidence for the extant avian radiation in the Cretaceous. Nature 433:305-308.

Donne-Gouss6, C., V. Laudet, and C. Hanni. 2002. A molecular phylogeny of anseriformes based on mitochondrial DNA analysis. Mol. Phylogenet. Evol. 23:339-356.

Donnelly, M. J., J. Pinto, R. Girod, N. J. Besansky, and T. Lehmann. 2004. Revisiting the role of introgression vs shared ancestral polymorphisms as key processes shaping genetic diversity in the recently separated sibling species of the Anopheles gambiae complex. Heredity 92:61- 68.

Dyke, G. J. 2001. The fossil waterfowl (Aves: Anseriformes) from the Eocene of England. Am. Mus. Novitates No. 3354:15 Pp.

Edwards, S. V., and P. Beerli. 2000. Perspective: gene divergence, population divergence, and the variance in coalescence time in phylogeographic studies. Evolution 54:1839-1854.

Ellegren, H., and A. K. Fridolfsson. 1997. Male-driven evolution of DNA

sequences in birds. Nat. Genet. 17:182-184. Ericson, P. G. P. 1997. Systematic relationships of the palaeogene family Pres-

byornithidae (Aves: Anseriformes). Zool. J. Linn. Soc.-Lond. 121:429- 483.

Excoffier, L., P. E. Smouse, and J. M. Quattro. 1992. Analysis of molecular variance inferred from metric distances among DNA haplotypes: appli- cation to human mitochondrial DNA restriction data. Genetics 131:479- 491.

Funk, D. J., and K. E. Omland. 2003. Species-level paraphyly and polyphyly: frequency, causes, and consequences, with insights from animal mito- chondrial DNA. Annu. Rev. Ecol. Evol. Syst. 34:397-423.

Hare, M. P., and J. C. Avise. 1998. Population structure in the American oyster as inferred by nuclear gene genealogies. Mol. Biol. Evol. 15:119- 128.

Harrison, R. G. 1991. Molecular changes at speciation. Annu. Rev. Ecol. Syst. 22:281-308.

Hey, J. 2005. On the number of New World founders: a population genetic portrait of the peopling of the Americas. PLoS Biol. 3:e193.

2004 EVOLUTION AUGUST 2007

This content downloaded from 128.123.94.166 on Thu, 23 Oct 2014 17:43:53 PMAll use subject to JSTOR Terms and Conditions

Page 15: Nuclear Loci and Coalescent Methods Support Ancient ...lithornis.nmsu.edu/~phoude/Peters_et_al_2007_Nuclear_loci_and... · original article doi: 10. 1111/j. 1558-5646.2007.00149.x

TESTING CAUSES OF PARAPHYLY

Hey, J., and R. Nielsen. 2004. Multilocus methods for estimating popula- tion sizes, migration rates and divergence time, with applications to the divergence of Drosophila pseudoobscura and D. persimilis. Genetics 167:747-760.

Hey, J., Y. J. Won, A. Sivasundar, R. Nielsen, and J. A. Markert. 2004. Us- ing nuclear haplotypes with microsatellites to study gene flow between recently separated Cichlid species. Mol. Ecol. 13:909-919.

Holder, M. T., J. A. Anderson, and A. K. Holloway. 2001. Difficulties in detecting hybridization. Syst. Biol. 50:978-982.

Howard, H. 1964. Fossil Anseriformes. Pp. 233-326 in J. Delacour, ed. The waterfowl of the world. Country Life Limited, London.

Hudson, R. R., and N. L. Kaplan. 1985. Statistical properties of the number of recombination events in the history of a sample of DNA sequences. Genetics 111:147-164.

Hudson, R. R., and M. Turelli. 2003. Stochasticity overrules the "three-times rule": genetic drift, genetic draft, and coalescence times for nuclear loci versus mitochondrial DNA. Evolution 57:182-190.

Hudson, R. R., M. Kreitman, and M. Aguad6. 1987. A test of neutral molecular evolution based on nucleotide data. Genetics 116:153-159.

Jennings, W. B., and S. V. Edwards. 2005. Speciational history of Aus- tralian grass finches (Poephila) inferred from thirty gene trees. Evolution 59:2033-2047.

Johnsgard, P. A. 1960. Hybridization in the Anatidae and its taxonomic im- plications. Condor 62:25-33.

Johnson, K. P., and M. D. Sorenson. 1999. Phylogeny and biogeography of dabbling ducks (genus: Anas): a comparison of molecular and morpho- logical evidence. Auk 116:792-805.

Klicka, J., R. M. Zink, J. C. Barlow, W. B. McGillivray, and T. J. Doyle. 1999. Evidence supporting the recent origin and species status of the Timberline Sparrow. Condor 101:577-588.

Knowles, L. L. 2004. The burgeoning field of statistical phylogeography. J. Evolution. Biol. 17:1-10.

Knowles, L. L., and W. P. Maddison. 2002. Statistical phylogeography. Mol. Ecol. 11:2623-2635.

Kraft, H. J., W. Hendriks, W. W. de Jong, N. H. Lubsen, and J. G. Schoenmak- ers. 1993. Duck lactate dehydrogenase B/epsilon-crystallin gene. Lens recruitment of a GC-promoter. J. Mol. Biol. 229:849-859.

Kulikova, I. V., Y. N. Zhuravlev, and K. G. McCracken. 2004. Asymmetric hy- bridization and sex-biased gene flow between Eastern spot-billed ducks (Anas zonorhyncha) and mallards (A. platyrhynchos) in the Russian Far East. Auk 121:930-949.

Kulikova, I. V., S. V. Drovetski, D. D. Gibson, R. J. Harrigan, S. Rohwer, M. D. Sorenson, K. Winker, Y. N. Zhuravlev, and K. G. McCracken. 2005. Phylogeography of the Mallard (Anas platyrhynchos): hybridization, dis- persal, and lineage sorting contribute to complex geographic structure. Auk 122:949-965.

LeSchack, C. R., S. K. McKnight, and G. R. Hepp. 1997. Gadwall (Anas strepera) in A. Poole and F. Gill, eds. The birds of North America, No. 283. The Academy of Natural Sciences, Philadelphia, and The American Ornithologists' Union, Washington, DC.

Liebers, D., P. de Knijff, and A. J. Helbig. 2004. The herring gull complex is not a ring species. Proc. R. Soc. Lond. B 271:893-901.

Livezey, B. C. 1996. A phylogenetic analysis of geese and swans (Anseri- formes: Anserinae), including selected fossil species. Syst. Biol. 45:415- 450.

S1997a. A phylogenetic classification of waterfowl (Aves: Anseri- formes), including selected fossil species. Ann. Camrnegie Mus. 66:455- 496.

. 1997b. A phylogenetic analysis of basal Anseriformes, the fossil Presbyornis, and the interordinal relationships of waterfowl. Zool. J. Linn. Soc.-Lond. 121:361-428.

Louchart, A., P. Vignaud, A. Likius, H. T. Mackaye, and M. Brunet. 2005. A new swan (Aves: Anatidae) in Africa, from the latest Miocene of Chad and Libya. J. Vertebr. Paleontology 25:384-392.

Masta, S. E., B. K. Sullivan, T. Lamb, and E. J. Routman. 2002. Molecular

systematics, hybridization, and phylogeography of the Bufo americanus complex in Eastern North America. Mol. Phylogenet. Evol. 24:302-314.

McCracken, K. G., W. P. Johnson, and F. H. Sheldon. 2001. Molecular popula- tion genetics, phylogeography, and conservation biology of the Mottled Duck (Anasfulvigula). Conserv. Biol. 2:87-102.

Moore, W. S. 1995. Inferring phylogenies from mtDNA variation: mitochondrial-gene trees versus nuclear-gene trees. Evolution 49:718- 726.

Morando, M., L. J. Avila, J. Baker, and J. W. Sites Jr. 2004. Phylogeny and phylogeography of the Liolaemus darwinii complex (Squamata: Lio- laemidae): evidence for introgression and incomplete lineage sorting. Evolution 58:842-861.

Moritz, C., T. E. Dowling, and W. M. Brown. 1987. Evolution of animal mitochondrial DNA: relevance for population biology and systematics. Annu. Rev. Ecol. Syst. 18:269-292.

Neigel, J. E., and J. C. Avise. 1986. Phylogenetic relationships of mitochonial DNA under various demographic models of speciation. Pp. 515-534 in E. Nevodr and S. Karlin, eds. Evolutionary processes and theory. Academic Press, New York.

Nielsen, R., and J. Wakeley. 2001. Distinguishing migration from isolation: a Markov chain Monte Carlo approach. Genetics 158:885-896.

Omland, K. E. 1997. Examining two standard assumptions of ancestral re- constructions: the history of plumage dimorphism in dabbling ducks (Anatini). Evolution 51:1636-1646.

Omland, K. E., J. M. Baker, and J. L. Peters. 2006. Genetic signatures of inter- mediate divergence: population history of old and new world Holarctic ravens (Corvus corax). Mol. Ecol. 15:795-808.

Palumbi, S. R., F. Cipriano, and M. P. Hare. 2001. Predicting nuclear gene coalescence from mitochondrial data: the three-times rule. Evolution 55:859-868.

Paxinos, E. E., H. F. James, S. L. Olson, M. D. Sorenson, J. Jackson, and R. C. Fleischer. 2002. mtDNA from fossils reveals a radiation of Hawaiian

geese recently derived from the Canada goose (Branta canadensis). Proc. Natl. Acad. Sci. USA 99:1399-1404.

Peters, J. L. 2006. Controlling for random genetic processes in studies of

evolutionary history: phylogeography of the Holarctic gadwall (Anas strepera). Ph.D. diss. University of Maryland Baltimore County, Balti- more, MD.

Peters, J. L., W. Gretes, and K. E. Omland. 2005a. Late Pleistocene divergence between eastern and western populations of wood ducks (Aix sponsa) inferred by the 'isolation with migration' coalescent method. Mol. Ecol. 14:3407-3418.

Peters, J. L., K. G. McCracken, Y. N. Zhuravlev, Y. Lu, R. E. Wilson, K. P. Johnson, and K. E. Omland. 2005b. Phylogenetics of wigeons and allies (Anatidae: Anas): the importance of sampling multiple loci and multiple individuals. Mol. Phylogenet. Evol. 35:209-224.

Peters, J. L., and K. E. Omland. 2007. Population structure and mitochondrial polyphyly in North American gadwalls (Anas strepera). Auk 124:444- 462.

Posada, D., and K. A. Crandall. 1998. Modeltest: testing the model of DNA substitution. Bioinformatics 14:817-818.

Redenbach, Z., and E. B. Taylor. 2002. Evidence for historical introgression along a contact zone between two species of char (Pisces: Salmonidae) in northwestern North America. Evolution 56:1021-1035.

Rosenberg, N. A., and M. Nordborg. 2002. Genealogical trees, coalescent the- ory and the analysis of genetic polymorphisms. Nat. Rev. Genet. 3:380- 390.

EVOLUTION AUGUST 2007 2005

This content downloaded from 128.123.94.166 on Thu, 23 Oct 2014 17:43:53 PMAll use subject to JSTOR Terms and Conditions

Page 16: Nuclear Loci and Coalescent Methods Support Ancient ...lithornis.nmsu.edu/~phoude/Peters_et_al_2007_Nuclear_loci_and... · original article doi: 10. 1111/j. 1558-5646.2007.00149.x

JEFFREY L. PETERS ET AL.

Sefc, K. M., R. B. Payne, and M. D. Sorenson. 2005. Genetic continuity of brood-parasitic indigobird species. Mol. Ecol. 14:1407-1419.

Schneider, S., J.-M. Keuffer, D. Roessli, and L. Excoffier. 2000. Arlequin: a software package for population genetics. Univ. of Geneva, Switzerland.

Sorenson, M. D., and R. C. Fleischer. 1996. Multiple independent transpo- sitions of mitochondrial DNA control region sequences to the nucleus. Proc. Natl. Acad. Sci. USA 93:15239-15243.

Sorenson, M. D., J. C. Ast, D. E. Dimcheff, T. Yuri, and D. P. Mindell. 1999. Primers for a PCR-based approach to mitochondrial genome sequencing in birds and other vertebrates. Mol. Phylogenet. Evol. 12:105-114.

St. John, J. J.-P. Cotter, and T. W. Quinn. 2005. A recent chicken repeat 1 retro- transposition confirms the Coscoroba-Cape Barren goose clade. Mol. Phylogenet. Evol. 37:83-90.

Stephens, M., N. J. Smith, and P. Donnelly. 2001. A new statistical method for haplotype reconstruction from population data. Am. J. Hum. Genet. 68:978-989.

Stephens, M., and P. Donnelly. 2003. A comparison of Bayesian methods for haplotype reconstruction from population genotype data. Am. J. Hum. Genet. 73:1162-1169.

Swofford, D. L. 1999. PAUP*: Phylogenetic analysis using parsimony (and other methods), version 4.0. Sinauer Associates, Inc., Sunderland, MA.

Tegelstrom, H. 1987. Transfer of mitochondrial DNA from the northern red- backed vole (Clethrionomys rutilus) to the bank vole (C. glareolus). J. Mol. Evol. 24:218-227.

Templeton, A. R. 1998. Nested clade analyses of phylogeographic data: testing hypotheses about gene flow and population history. Mol. Ecol. 7:381- 397.

Wakeley, J. 1996. Distinguishing migration from isolation using the variance of pairwise differences. Theor. Popul. Biol. 49:369-386.

Weckstein, J. D., R. M. Zink, R. C. Blackwell-Rago, and D. A. Nelson. 2001. Anomalous variation in mitochondrial genomes of white-crowned (Zonotrichia leucophrys) and golden-crowned (Z. atricapilla) spar- rows: pseudogenes, hybridization, or incomplete lineage sorting? Auk 118:231-236.

Wilson, P. J., S. Grewal, I. D. Lawford, J. N. M. Heal, A. G. Granacki, D. Pennock, J. B. Theberge, M. T. Theberge, D. R. Voigt, W. Waddell, R. E. Chambers, P. C. Paquet, G. Goulet, D. Cluff, and B. N. White. 2000. DNA profiles of the eastern Canadian wolf and the red wolf provide evidence for a common evolutionary history independent of the gray wolf. Can. J. Zool. 78:2156-2166.

Won, Y. J., and J. Hey. 2005. Divergence population genetics of chimpanzees. Mol. Biol. Evol. 22:297-307.

Won, Y.-J., A. Sivasundar, Y. Wang, and J. Hey. 2005. On the origin of Lake Malawi cichlid species: a population genetic analysis of divergence. Proc. Natl. Acad. Sci. USA 102:6581-6586.

Associate Editor: K. Crandall

Supplementary Material The following supplementary material is available for this article:

Table S1. GenBank accession numbers for falcated ducks and gadwalls. Table S2. Additional numbers used for amplifying and/or sequencing smaller fragments of the mtDNA control region and allele

specific primers used for introns with multiple polymorphic positions. Table S3. GenBank accession numbers of sequences used for intron clock calibrations and for comparing divergence times estimated from introns and mtDNA. Table S4. List of vouchered specimens with GenBank accession numbers.

Figure St. Resolving simple indels.

Figure S2. Resolving indels with nucleotide polymorphisms. Figure S3. Marginal posterior densities of theta (effective population size scaled to the neutral mutation rate) estimated from the various models used.

Figure S4. Marginal posterior densities of introgression rates (scaled to the neutral mutation rate) estimated from nuclear introns

(CHD and LDH) using a model of Isolation with Migration.

This material is available as part of the online article from:

http://www.blackwell-synergy.com/doi/abs/10. 1111/j.1558-5646.2007.00149.x (This link will take you to the article abstract).

Please note: Blackwell Publishing is not responsible for the content or functionality of any supplementary materials supplied by the authors. Any queries (other than missing material) should be directed to the corresponding author for the article.

2006 EVOLUTION AUGUST2007

This content downloaded from 128.123.94.166 on Thu, 23 Oct 2014 17:43:53 PMAll use subject to JSTOR Terms and Conditions