11
Evolutionary placement of Xanthomonadales based on conserved protein signature sequences Ania M. Cutiño-Jiménez a , Marinalva Martins-Pinheiro b , Wanessa C. Lima b,1 , Alexander Martín-Tornet c , Osleidys G. Morales a , Carlos F.M. Menck b, * a Department of Biology, Facultad de Ciencias Naturales, Universidad de Oriente, Ave. Patricio Lumumba s/n., Santiago de Cuba, CP 90 500, Cuba b Department of Microbiology, Institute of Biomedical Sciences, University of São Paulo, Ave. Prof. Lineu Prestes, 1374, São Paulo, SP, Brazil c Department of Informatics, Centro de Ingeniería Genética y Biotecnología (CIGB), Ave. 31 e/158 y 190, Playa, P.O. Box 6162, Habana 10600, Cuba article info Article history: Received 14 May 2009 Revised 11 September 2009 Accepted 21 September 2009 Available online xxxx Keywords: Xanthomonadales Xanthomonas Xylella DNA repair INDELs abstract Xanthomonadales comprises one of the largest phytopathogenic bacterial groups, and is currently clas- sified within the gamma-proteobacteria. However, the phylogenetic placement of this group is not clearly resolved, and the results of different studies contradict one another. In this work, the evolutionary posi- tion of Xanthomonadales was determined by analyzing the presence of shared insertions and deletions (INDELs) in highly conserved proteins. Several distinctive insertions found in most of the members of the gamma-proteobacteria are absent in Xanthomonadales and groups such as Legionelalles, Chromati- ales, Methylococcales, Thiotrichales and Cardiobacteriales. These INDELs were most likely introduced after the branching of Xanthomonadales from most of the gamma-proteobacteria and provide evidence for the phylogenetic placement of the early gamma-proteobacteria. Moreover, other proteins contain insertions exclusive to the Xanthomonadales order, confirming that this is a monophyletic group and pro- vide important specific genetic markers. Thus, the data presented clearly support the Xanthomonadales group as an independent subdivision, and constitute one of the deepest branching lineage within the gamma-proteobacteria clade. Ó 2009 Elsevier Inc. All rights reserved. 1. Introduction With the advent of the genomic era, new molecular approaches to identify evolutionary relationships among organisms have come to light, and since then extensive work has been done in bacterial taxonomic classification, mainly focusing in the use of conserved protein sequences to reconstruct evolutionary histories. However, the use of single gene or protein sequences to reconstruct organism evolution has been contested (Korbel et al., 2002; Ochman, 2001), mainly due to inherent methodological problems during phyloge- netic reconstruction and analysis or to systematic errors from the sequences. Moreover, phylogenetic trees based on rRNA or protein sequences are unable to determine how certain divisions within Bacteria are related to each other and how they have evolved from a common ancestor (Gupta and Griffiths, 2002). New sequence- based approaches for assessing evolutionary relationships have re- duced some of the biases in earlier methodology that can lead to the misinterpretation of phylogenetic results. Detection of signa- ture sequences in the primary structure of a protein is a reliable and intuitive method for examining evolutionary relationships. Specific changes in protein residues, such as amino acid substitu- tions or specific deletions or insertions, observed in all members of one or more taxa but not in others, make it possible to establish common ancestry and identify major groups that share the same signature (Brocchieri, 2001; Gupta, 1997). There are numerous mechanisms by which specific insertions and deletions (herein referred to as INDELs) may be formed, such as, for example, DNA recombination, expansion of repetitive DNA sequences and insertion sequence (IS)-mediated events. As they are, in general, the product of one specific event, INDELs were pro- posed as reliable signature sequences with certain advantages over traditional phylogenetic analyses based on gene or protein se- quences (Griffiths and Gupta, 2007; Gupta, 1998, 2006; Gupta and Mok, 2007). Phylogenetic relationships assume constant muta- tion frequencies and this may be incorrect over long periods of time, ultimately leading to the identification of incorrect species relationships (Philippe et al., 2005). In contrast, conserved INDELs of defined sizes are not greatly affected by such differences in evo- lutionary rates (Gupta, 1998). 1055-7903/$ - see front matter Ó 2009 Elsevier Inc. All rights reserved. doi:10.1016/j.ympev.2009.09.026 * Corresponding author. Address: Ave. Prof. Lineu Prestes, 1374, Institute of Biomedical Sciences, ICB II/USP, Cidade Universitária, São Paulo, CEP 05508-900, Brazil. Fax: +55 11 3091 7357. E-mail address: [email protected] (C.F.M. Menck). 1 Present address: Department Cell Physiology and Metabolism, University of Geneva, R. Michel Servet 1, 1211, Geneva, CH, Switzerland. Molecular Phylogenetics and Evolution xxx (2009) xxx–xxx Contents lists available at ScienceDirect Molecular Phylogenetics and Evolution journal homepage: www.elsevier.com/locate/ympev ARTICLE IN PRESS Please cite this article in press as: Cutiño-Jiménez, A.M., et al. Evolutionary placement of Xanthomonadales based on conserved protein signature se- quences. Mol. Phylogenet. Evol. (2009), doi:10.1016/j.ympev.2009.09.026

Molecular Phylogenetics and Evolution - USPgenoma.ib.usp.br/sites/default/files/publicacoes/evolutionary... · cDepartment of Informatics, Centro de Ingeniería Genética y Biotecnología

Embed Size (px)

Citation preview

Molecular Phylogenetics and Evolution xxx (2009) xxx–xxx

ARTICLE IN PRESS

Contents lists available at ScienceDirect

Molecular Phylogenetics and Evolution

journal homepage: www.elsevier .com/ locate /ympev

Evolutionary placement of Xanthomonadales based on conserved proteinsignature sequences

Ania M. Cutiño-Jiménez a, Marinalva Martins-Pinheiro b, Wanessa C. Lima b,1, Alexander Martín-Tornet c,Osleidys G. Morales a, Carlos F.M. Menck b,*

a Department of Biology, Facultad de Ciencias Naturales, Universidad de Oriente, Ave. Patricio Lumumba s/n., Santiago de Cuba, CP 90 500, Cubab Department of Microbiology, Institute of Biomedical Sciences, University of São Paulo, Ave. Prof. Lineu Prestes, 1374, São Paulo, SP, Brazilc Department of Informatics, Centro de Ingeniería Genética y Biotecnología (CIGB), Ave. 31 e/158 y 190, Playa, P.O. Box 6162, Habana 10600, Cuba

a r t i c l e i n f o a b s t r a c t

Article history:Received 14 May 2009Revised 11 September 2009Accepted 21 September 2009Available online xxxx

Keywords:XanthomonadalesXanthomonasXylellaDNA repairINDELs

1055-7903/$ - see front matter � 2009 Elsevier Inc. Adoi:10.1016/j.ympev.2009.09.026

* Corresponding author. Address: Ave. Prof. LineuBiomedical Sciences, ICB II/USP, Cidade UniversitáriaBrazil. Fax: +55 11 3091 7357.

E-mail address: [email protected] (C.F.M. Menck)1 Present address: Department Cell Physiology an

Geneva, R. Michel Servet 1, 1211, Geneva, CH, Switzerl

Please cite this article in press as: Cutiño-Jiméquences. Mol. Phylogenet. Evol. (2009), doi:10.1

Xanthomonadales comprises one of the largest phytopathogenic bacterial groups, and is currently clas-sified within the gamma-proteobacteria. However, the phylogenetic placement of this group is not clearlyresolved, and the results of different studies contradict one another. In this work, the evolutionary posi-tion of Xanthomonadales was determined by analyzing the presence of shared insertions and deletions(INDELs) in highly conserved proteins. Several distinctive insertions found in most of the members ofthe gamma-proteobacteria are absent in Xanthomonadales and groups such as Legionelalles, Chromati-ales, Methylococcales, Thiotrichales and Cardiobacteriales. These INDELs were most likely introducedafter the branching of Xanthomonadales from most of the gamma-proteobacteria and provide evidencefor the phylogenetic placement of the early gamma-proteobacteria. Moreover, other proteins containinsertions exclusive to the Xanthomonadales order, confirming that this is a monophyletic group and pro-vide important specific genetic markers. Thus, the data presented clearly support the Xanthomonadalesgroup as an independent subdivision, and constitute one of the deepest branching lineage within thegamma-proteobacteria clade.

� 2009 Elsevier Inc. All rights reserved.

1. Introduction

With the advent of the genomic era, new molecular approachesto identify evolutionary relationships among organisms have cometo light, and since then extensive work has been done in bacterialtaxonomic classification, mainly focusing in the use of conservedprotein sequences to reconstruct evolutionary histories. However,the use of single gene or protein sequences to reconstruct organismevolution has been contested (Korbel et al., 2002; Ochman, 2001),mainly due to inherent methodological problems during phyloge-netic reconstruction and analysis or to systematic errors from thesequences. Moreover, phylogenetic trees based on rRNA or proteinsequences are unable to determine how certain divisions withinBacteria are related to each other and how they have evolved froma common ancestor (Gupta and Griffiths, 2002). New sequence-based approaches for assessing evolutionary relationships have re-

ll rights reserved.

Prestes, 1374, Institute of, São Paulo, CEP 05508-900,

.d Metabolism, University ofand.

nez, A.M., et al. Evolutionary p016/j.ympev.2009.09.026

duced some of the biases in earlier methodology that can lead tothe misinterpretation of phylogenetic results. Detection of signa-ture sequences in the primary structure of a protein is a reliableand intuitive method for examining evolutionary relationships.Specific changes in protein residues, such as amino acid substitu-tions or specific deletions or insertions, observed in all membersof one or more taxa but not in others, make it possible to establishcommon ancestry and identify major groups that share the samesignature (Brocchieri, 2001; Gupta, 1997).

There are numerous mechanisms by which specific insertionsand deletions (herein referred to as INDELs) may be formed, suchas, for example, DNA recombination, expansion of repetitive DNAsequences and insertion sequence (IS)-mediated events. As theyare, in general, the product of one specific event, INDELs were pro-posed as reliable signature sequences with certain advantages overtraditional phylogenetic analyses based on gene or protein se-quences (Griffiths and Gupta, 2007; Gupta, 1998, 2006; Guptaand Mok, 2007). Phylogenetic relationships assume constant muta-tion frequencies and this may be incorrect over long periods oftime, ultimately leading to the identification of incorrect speciesrelationships (Philippe et al., 2005). In contrast, conserved INDELsof defined sizes are not greatly affected by such differences in evo-lutionary rates (Gupta, 1998).

lacement of Xanthomonadales based on conserved protein signature se-

2 A.M. Cutiño-Jiménez et al. / Molecular Phylogenetics and Evolution xxx (2009) xxx–xxx

ARTICLE IN PRESS

Based on a 16S rRNA tree, the proteobacteria phylum has beendivided into five phylogenetically distinct groups (gamma, beta, al-pha, delta and epsilon subdivisions or classes) (Olsen et al., 1991;Stackebrandt and Woese, 1984). These subdivisions constitutewell-defined clades, their member species being clearly distin-guishable one from the other and from the other bacterial divisions(Eisen, 1995; Ludwig and Schleifer, 1994; Stoffels et al., 1999). Insupport of the use of signature sequences as evolutionary descrip-tors, all proteobacterial subdivisions contain specific INDELs insome of their protein sequences, confirming the evolutionary rela-tionships inside the proteobacteria group (Gupta, 2000, 2005,2006; Gupta and Mok, 2007).

The gamma subdivision is a well-defined group that probablydiverged from the rest of proteobacteria as early as 500 millionyears ago (Clark et al., 1999). It is one of the largest groups withinBacteria and includes at least 14 different subgroups, or orders(Acidithiobacillales, Aeromonadales, Alteromonadales, Cardiobac-teriales, Chromatiales, Enterobacteriales, Legionellales, Methylo-coccales, Oceanospirillales, Pasteurellales, Pseudomonadales,Thiotrichales, Vibrionales and Xanthomonadales), where we findfree-living and commensal species, intracellular symbionts andplant and animal pathogens, as well as species of medical and agri-cultural relevance. In one of the most basal group of gamma-prote-obacteria, the order Xanthomonadales, resides many significantpathogenic bacteria affecting humans, animals and plants. Thisclade, though it contains a single family (Xanthomonadaceae),comprises a diverse set of species differing both in their phenotypictraits and habitat. The group has considerable economic impact onagriculture, worldwide, and more than 350 different plant diseasescaused by xanthomonads have been reported (Leyns et al., 1984;Swings and Civerolo, 1993). Xanthomonas and Xylella are generawithin Xanthomonadales for which complete genome sequencesare available for several species and strains (da Silva et al., 2002;Lee et al., 2005; Qian et al., 2005; Simpson et al., 2000; Thiemeet al., 2005; Van Sluys et al., 2003). The genus Xanthomonas com-prises a diverse group of Gram-negative, obligate aerobic, non-fer-mentative rods, in which all reported strains are plant-associatedand most are reported as being pathogenic to a particular planthost (Van Sluys et al., 2002). Similarly, Xylella fastidiosa (Xf) isresponsible for causing economically important diseases in grape-vines, citrus fruits and many other plant species. Foremost amongthese are Pierces’s disease (PD) of grapevines, citrus variegatedchlorosis (CVC), and the leaf scorch diseases of almonds (ALS)and oleanders (OLS) (Davis et al., 1978; Purcell and Hopkins, 1996).

Although the gamma subdivision is a well-defined clade, theclassification of Xanthomonadales inside the group is quite prob-lematic. The 16S rRNA phylogenetic tree, by far the most fre-quently used sequence to assess phylogenetic placement of agiven bacterial group, places Xanthomonadales at the root of thegamma-proteobacteria (Lima et al., 2008). However, very oftenphylogenetic trees of highly conserved genes place this group inthe same branch as the beta subdivision (Martins-Pinheiro et al.,2004) or even as an outgroup of the beta/gamma-proteobacteria(Schneider et al., 2007). Moreover, these studies are complicatedby the high frequency of genes that are proposed to be acquiredby LGT, especially within the Xanthomonas genus. Due to frequentLGT, Xanthomonadales represents an extreme case of genomicmosaicism that prevents it from being assigned to any one of themajor proteobacterial clades (Comas et al., 2006; Lima et al.,2005, 2008). On considering the disagreements on the phyloge-netic placement of Xanthomonadales, the use of other reliable,non-phylogenetic approaches may allow us to better position thisgroup within the proteobacteria.

In this work, we were able to identify several specific INDELs,based on a sequence alignment of several highly conserved pro-teins, mostly those linked to DNA metabolism, that confirm the

Please cite this article in press as: Cutiño-Jiménez, A.M., et al. Evolutionary pquences. Mol. Phylogenet. Evol. (2009), doi:10.1016/j.ympev.2009.09.026

Xanthomonadales as a monophyletic, early-branching group with-in the gamma-proteobacteria subdivision.

2. Methodology

2.1. Phylogenetic analyses

Protein sequences were aligned by using Clustal X 2.0.9 pro-gram (Larkin et al., 2007), and regions of the alignments that wereambiguous, hypervariable or containing gaps were excluded fromsubsequent analysis (GeneDoc program, Nicholas et al., 1997).ProtTest was used to assess the best-fit amino acid substitutionmodel for maximum likelihood-based tree reconstruction (Abascalet al., 2005). Maximum likelihood (ML) trees were set up with RAx-ML 7.0.4 (Stamatakis, 2006), with the WAG (Whelan–Goldman)model and parameters for invariable sites (+I) and gamma-distrib-uted rate heterogeneity (+G, 4 categories). One-thousand bootstrapreplicates were executed and bootstrap values drawn up on thebest-scoring ML-tree. Trees were visualized using the TreeViewprogram (Page, 1996), and were arbitrarily rooted at midpoint(although trees should be fundamentally viewed as unrooted).The concatenated tree was based on the alignment of the concate-nated sequences of 11 evolutionarily conserved genes: arginyl-tRNA synthetase; LigA; DNA gyrase subunit A; Hsp70; isoleucyl-tRNA synthetase; DNA polymerase I; DNA polymerase III subunitalpha; DNA polymerase subunit epsilon; RecA; ribosomal L2 andS3 proteins. The proteins employed in phylogenetic analyses areshown in Table S01 (Supplementary material).

2.2. Identification of conserved signature sequences

DNA repair and replication-related protein sequences were ob-tained by similarity searches using the BlastP program (Altschulet al., 1997) as implemented in the NCBI server. The list of organ-isms used in this study, as well as the seed sequences used to per-form the similarity searches, is shown in Table S02. Sequenceswere aligned using Clustal X (Thompson et al., 1997), and align-ment parameters suggested by Hall (2005) were implemented(gap opening: 35.00 and gap extension: 0.75).

The identification of potential INDELs was performed as pro-posed before (Gupta, 1998). Briefly, this methodology is based onthe study of insertions and deletions present in a set of orthologousproteins. INDELs are detected as regions of defined length and se-quence, and flanked by highly conserved regions (which excludesmisleading INDEL identification due to improper alignment orsequencing errors) and found at precisely the same position inorthologs from different species (INDELs must be positional homo-logs). All sequence alignments are available upon request.

3. Results

Phylogenetic analyses of proteobacterial groups were con-ducted with the concatenated sequences of 11 conserved proteins,including the sequences employed in the search for conserved IN-DELs (Fig. 1). The use of concatenated sequences for tree genera-tion gave more consistent trees and better bootstrap supportthan single genes. In this concatenated tree, the Xanthomonadalesform a monophyletic group within the same clade as the beta-pro-teobacteria and at the base of the gamma-proteobacteria, close toother orders that are also difficult to correctly classify, such asthe Legionelalles, Chromatiales, Methylococcales, Thiotrichalesand Cardiobacteriales (Fig. 1). Phylogenetic trees for individual se-quences (including 16S rRNA, RecA, TopA and all the 11 genes em-ployed in the concatenated tree; see MM for the list) were alsogenerated. As a general trend, proteobacterial groups are clearly

lacement of Xanthomonadales based on conserved protein signature se-

Fig. 1. Maximum likelihood tree of the proteobacteria based on the concatenated sequences of 11 conserved proteins. Branch lengths are proportional to the number ofamino acid substitutions per site. Numbers at the nodes represent the percentage of bootstrap support (only values > 50% are shown). See Table S01 for the accession codes,and Table S02 for abbreviations of species names.

A.M. Cutiño-Jiménez et al. / Molecular Phylogenetics and Evolution xxx (2009) xxx–xxx 3

ARTICLE IN PRESS

distinguished (betas and gammas always forming a monophyleticgroup with high bootstrap support; data not shown). Nevertheless,depending on the sequence analyzed, the gamma-proteobacteriamay or may not form a monophyletic group (as in the case of TopA,where the group is split into at least two, Fig. 1S, in Supplementarymaterial) and the branching order for betas, gammas and Xantho-monadales, as seen in the topology obtained for the concatenatedtree, is supported in most of the cases, but not always (7 out of

Please cite this article in press as: Cutiño-Jiménez, A.M., et al. Evolutionary pquences. Mol. Phylogenet. Evol. (2009), doi:10.1016/j.ympev.2009.09.026

11 single gene trees have the same topology as the concatenatedtree; data not shown).

These results confirm that the evolutionary history of these bac-teria is quite complex and underline the need to use novel tools toinvestigate the taxonomic position of Xanthomonadales. Therefore,we sought additional protein signature sequences, the INDELs. Theassumption behind this approach is that when a conserved INDEL(conserved in terms of length, sequence and residual position, and

lacement of Xanthomonadales based on conserved protein signature se-

4 A.M. Cutiño-Jiménez et al. / Molecular Phylogenetics and Evolution xxx (2009) xxx–xxx

ARTICLE IN PRESS

flanked by a region sharing high levels of sequence similarity overall organisms analyzed) is found in an alignment, the simplest andmost parsimonious explanation for its presence is that it was intro-duced only once during the course of evolution into a hypotheticalancestral organism and then kept in all its progeny. Thus, INDELsshared by two organisms indicate a common ancestry. To assessthe phylogenetic relationship of the Xanthomonadales group with-in the proteobacteria, we selected evolutionarily conserved pro-teins with a role in DNA metabolism processes. The proteinsselected participate in informational processes and their functionis dependent upon interaction with other proteins, thereby mini-mizing the chance of LGT (Jain et al., 1999). Table 1 summarizesthe major INDELs that characterize distinct proteobacterial groups,especially the Xanthomonadales. Each protein containing an INDELused in this study is further described below.

3.1. Gamma-proteobacteria specific signature sequences

3.1.1. DNA polymerase III, alpha subunitThe DNA polymerase III holoenzyme is a complex that contains

10 different types of subunits. The alpha subunit, encoded by thepolC gene, is a 130-kDa polypeptide endowed with 50 to 30 DNApolymerase activity but none for proofreading (Patel and Loeb,2001; Steitz, 1999). A signature sequence gene that consists of a5-amino acid insert is shared by homologs from gamma-proteo-bacteria, but is absent from Xanthomonadales, Methylococcales,Legionellales, Cardiobacteriales, part of the Thiotrichales (Francisel-la novicida), and from other proteobacterial subdivisions (Fig. 2). Aswith many other INDELs, this separates the Xanthomonadales (andother groups) from the core group of gamma-proteobacteria.

Table 1Major distinctive INDELs of proteobacterial groups and, in special, the Xanthomonadales o

Protein Description Present

DNA polymerase IIIalpha subunit

5-amino acid lengthinsertion (Fig. 2)

Enterobacteriales, VibrionalesPseudomonadales, AlteromonOceanospirillales, ChromatialThiomicrospira crunogena (fro

DNA Topoisomerase I 29–41-amino acid lengthinsertion (Fig. 2S)

Enterobacteriales, VibrionalesAlteromonadales, Aeromonadand Pseudomonadales

4–5-amino acid insertion(Fig. 3)

Xanthomonadales, LegionellaChromatiales, Cardiobacterial(Francisella novicida specie), aspecies belonging to the betaand a small number of delta-(Desulfuromonadales group)

DNA polymerase I 7-amino acid lengthinsertion (Fig. 3S)

Enterobacteriales, VibrionalesAlteromonadales, Aeromonadand Pseudomonadales orders

1-amino acid lengthinsertion (Fig. 3S)

Pseudomonadaceae family (rePseudomonas and Azotobacter

DNA polymerase IIIepsilon subunit

4–5-amino acid lengthinsertion (Fig. 4S)

Enterobacterialles, PasteurellaAeromonadales, CardiobacterShewanellaceae family (Alter

DNA ligase NAD-dependent

57–65-amino acid lengthinsertion (Fig. 4)

Xanthomonadales

MutS 5-amino acid lengthinsertion (Fig. 5S)

Xanthomonadales

RecA 2-amino acid lengthinsertion (Fig. 6S)

Xanthomonadales

DNA polymerase IIIalpha subunit

4-amino acid lengthinsertion (Fig. 7S)

Xanthomonadales

Please cite this article in press as: Cutiño-Jiménez, A.M., et al. Evolutionary pquences. Mol. Phylogenet. Evol. (2009), doi:10.1016/j.ympev.2009.09.026

3.1.2. DNA topoisomerase IDNA topoisomerase I plays essential roles in DNA transcription

and replication, by altering the supercoiling of the double-strandedDNA molecule (Feinberg et al., 1999). It possesses several interest-ing INDELs, which indicates an atypical evolution and confirms anearly separation of two distinct branches among proteobacteria(Fig. 1S). One of them is a conserved 29–41-amino acid insertion(Fig. 2S, Supplementary material) shared by various speciesbelonging to the gamma-proteobacteria (Enterobacteriales, Vibrio-nales, Pasteurellales, Alteromonadales, Aeromonadales, Oceanospi-rillales and Pseudomonadales), but absent in homologs fromXanthomonadales, Methylococcales, Legionellales, Chromatiales,Cardiobacteriales, Thiotrichales, as well as the other proteobacteri-al groups (beta, alpha, delta and epsilon).

Another 4–5-amino acid INDEL in this protein is absent in mostof the gamma-proteobacteria, but present in Xanthomonadales,Legionellales, Methylococcales, Chromatiales, Cardiobacteriales,Thiotrichales (Francisella novicida), as well as proteobacteria ofthe beta and alpha subdivisions, and a small number of delta-pro-teobacteria (Fig. 3). Curiously, this INDEL is absent from most ofthe delta subdivision and is completely absent from the epsilonsubdivision. This heterogeneity suggests that delta-proteobacteriamay not be a monophyletic group, as part of it is more closely re-lated to alpha-proteobacteria, as already previously suggested byGupta (2005). This insertion is also absent outside of the proteo-bacteria clade (represented here by some firmicutes and chlamy-diae, Fig. 3). Interestingly, these inferences from signaturesequence are supported by phylogenetic analyses based on Topoi-somerase I (Fig. 1S): the beta-proteobacteria, most of the deepbranching gamma-proteobacteria groups, and Geobacter branch to-gether and independently from the other proteobacteria. Thus, the

rder.

Absent

, Pasteurellales,adales, Aeromonadales,es orders andm Thiotrichales group)

Xanthomonadales, Methylococcales, Legionellales,Cardiobacteriales, Francisella novicida; and in the othergroups of bacteria

, Pasteurellales,ales, Oceanospirillales

Xanthomonadales, Legionellales, Methylococcales,Chromatiales, Cardiobacteriales, Thiotrichales andhomologs from other bacteria

les, Methylococcales,es and Thiotrichaless well as proteobacteriaand alpha subdivisions,

proteobacteria

Other gamma-proteobacteria orders andThiomicrospira crunogena (from Thiotrichalesgroup).This insertion is also absent in most of thedelta, and completely absent from epsilon subdivision

, Pasteurellales,ales, Oceanospirillales

Other gamma-proteobacteria orders(Xanthomonadales, Chromatiales, Methylococcales,Legionellales, Cardiobacteriales and Thiotrichales).This insertion is also absent in some members fromPseudomonadales (Acinetobacter sp.) and other groupsof bacteria

presenting by)

In all other bacteria

les, Vibrionales,iales and members fromomonadales)

Other gamma-proteobacteria orders(Xanthomonadales, Methylococcales, Legionellales,Chromatiales, Thiotrichales, Oceanospirillales,Pseudomonadales and in the other Alteromonadales:Pseudoalteromonadaceae and Alteromonadaceaefamilies); and in all other bacteriaIn all other bacteria

In all other bacteria

In all other bacteria

In all other bacteria

lacement of Xanthomonadales based on conserved protein signature se-

Fig. 2. Partial sequence alignment of the DNA polymerase III epsilon subunit showing a 5-amino acid insert specific for the core gamma-proteobacteria groups, but absent inXanthomonadales, several deep branching bacteria of this class, and all other bacteria. Dots (.) in this and other alignments denote identity with the amino acid on the top lineand gaps are indicated by empty spaces. Accession numbers are given after each species name. See Table S02 for abbreviations of species names. The numbers on the leftindicate the amino acid position for each protein sequence. Sequence information for only representative species is shown, but the sequences from other available speciesfrom these groups behave similarly.

A.M. Cutiño-Jiménez et al. / Molecular Phylogenetics and Evolution xxx (2009) xxx–xxx 5

ARTICLE IN PRESS

Please cite this article in press as: Cutiño-Jiménez, A.M., et al. Evolutionary placement of Xanthomonadales based on conserved protein signature se-quences. Mol. Phylogenet. Evol. (2009), doi:10.1016/j.ympev.2009.09.026

6 A.M. Cutiño-Jiménez et al. / Molecular Phylogenetics and Evolution xxx (2009) xxx–xxx

ARTICLE IN PRESS

evolutionary history of this gene is not typical and indicates anearly divergence of the two groups (one of them with the inser-tion). The possibility that this gene underwent an ancient LGTevent cannot be excluded.

Fig. 3. Partial sequence alignment of Topoisomerase I, showing a 4–5-amino acid insert imost of the other deep branching bacteria of this class, as well as in the beta and alphaproteobacteria and also other bacteria (the Firmicutes/Chlamydiae group are presented

Please cite this article in press as: Cutiño-Jiménez, A.M., et al. Evolutionary pquences. Mol. Phylogenet. Evol. (2009), doi:10.1016/j.ympev.2009.09.026

The atypical grouping for Thiotrichales (represented here byFrancisella novicida and Thiomicrospira crunogena, which are dis-criminated here by both signatures) is in agreement with earlierwork by the group of Gupta, who found the two genera sepa-

s absent from the core gamma-proteobacteria, but present in Xanthomonadales and-proteobacteria, but only a few of the delta-proteobacteria. For most of the delta-as outgroups), the insert is absent. Details as described in Fig. 2.

lacement of Xanthomonadales based on conserved protein signature se-

A.M. Cutiño-Jiménez et al. / Molecular Phylogenetics and Evolution xxx (2009) xxx–xxx 7

ARTICLE IN PRESS

rated in protein signature and phylogenetic analyses (Gao et al.,2009).

3.1.3. DNA polymerase IIn addition to 50 to 30 DNA polymerase activity, DNA polymerase

I exhibits 30 to 50 exonuclease activity, which mediates proofread-ing, and 50 to 30 nick translation during DNA repair (Patel et al.,2001). A 7–8-amino acid insertion is present in homologs fromthe core gamma-proteobacteria, but is not found in homologs fromXanthomonadales, Chromatiales, Methylococcales, Legionellales,Cardiobacteriales and Thiotrichales, neither in homologs from thebeta, alpha, delta and epsilon subdivisions (Fig. 3S, Supplementarymaterial).

Moreover, a small 1-amino acid insertion in the Pseudomona-dales order is also observed. This INDEL is present as well in theOceanospirillales Chromohalobacter salexigens, thus indicating aclose relationship with the Pseudomonadales, as already demon-strated in previous studies (Quillaguaman et al., 2004). The ab-sence of both INDELs in two Acinetobacter species (belonging tothe Pseudomonadales group) points to an ancient divergence ofthese bacteria within the Pseudomonadales.

3.1.4. DNA polymerase III, epsilon subunitThe DNA polymerase III epsilon subunit protein, encoded by the

dnaQ gene, is involved in DNA replication through its proofreading30 to 50 exonuclease activity (Patel and Loeb, 2001; Steitz, 1999). Asignature sequence consisting of a 4–5-amino acid insert is presentin dnaQ homologs of the gamma-proteobacteria, but absent inhomologs from Xanthomonadales, Methylococcales, Legionellales,Chromatiales, Thiotrichales, Oceanospirillales, Pseudomonadalesand Alteromonadales, as well in the other proteobacterial groups(Fig. 4S, Supplementary material). This indicates a late origin forthis insertion, and adds support to the assertion that groups notcarrying this insertion are related but diverge from the other gam-ma-proteobacteria. A smaller 1-amino acid insertion is also ob-served and shared by homologs from the Vibrionales andAlteromonadales (Shewanellaceae family). The fact that the Car-diobacteriales also carry this insertion points to a potential LGTevent among these bacteria.

3.2. Xanthomonadales specific signature sequences

3.2.1. NAD-dependent DNA ligase LigATwo distinct families of DNA ligase are found, one that is typical

of Bacteria and uses NAD+ as a cofactor, and the other that usesATP and is common to Archaea and Eukaryotes; both families ofDNA ligase, however, are present in Xanthomonas (Wilkinsonet al., 2001; Martins-Pinheiro et al., 2004). The NAD-dependentDNA ligase bears a distinctive 65-amino acid insertion, only foundin the Xanthomonadales group (Fig. 4). Recursive BLAST searchesusing this striking long INDEL as a seed sequence were performed,but only sequences from the Xanthomonadales group showedsimilarity.

3.2.2. Mismatch repair protein MutSThe MutS protein is involved in the repair of mismatches in

DNA. This protein forms a dimer (MutS2) that recognizes the mis-matched base on the daughter-strand and binds the mutated DNAwith feeble ATPase activity (Kunkel and Erie, 2005). This proteinreveals a conserved 5-amino acid insertion, exclusive to membersof the Xanthomonadales group (Fig. 5S, Supplementary material).

3.2.3. RecARecA plays an essential role in at least three distinct cellular

processes: homologous recombination, gene control of the SOSregulon and DNA damage induced mutagenesis (Cox, 2003; Eisen,

Please cite this article in press as: Cutiño-Jiménez, A.M., et al. Evolutionary pquences. Mol. Phylogenet. Evol. (2009), doi:10.1016/j.ympev.2009.09.026

1995). Moreover, this protein has been widely used for phyloge-netic reconstruction in order to assess relationships within Bacte-ria (Eisen, 1995; Martins-Pinheiro et al., 2004). A 2-amino acidinsertion in a highly conserved region has been shown to distin-guish the Xanthomonadales order (Fig. 6S, Supplementarymaterial).

Other interesting signatures were also observed, such as a 4-amino acid insertion exclusive to Xanthomonadales in the alphasubunit of DNA polymerase III (Fig. 7S, Supplementary material),a 4-amino acid insertion in valyl-tRNA synthetase exclusive toXanthomonadales and a 37-amino acid insertion, also observedin valyl-tRNA synthetase, present in all gamma and beta-proteo-bacteria, but absent in the remaining groups, previously described(Gupta, 2000).

4. Discussion

An accurate classification of xanthomonads is not only impor-tant for the basic knowledge of the phylogeny of this group, butalso has practical implications for global regulation of severe dis-eases caused by these plant pathogens. Misidentification and/orunclear classification of xanthomonads may result in unnecessaryquarantine regulations and economic loss. In previous studiesusing different gene and protein sequences, the classification ofXanthomonadales was contradictory to the placement of thesebacteria as gamma-proteobacteria. For example, when analyzingthe phylogeny of the RecA protein (Martins-Pinheiro et al., 2004)or 16S rRNA (Lima et al., 2005, 2008), independent results placedthe Xanthomonadales in a branch close to the beta subdivision(as also shown herein, Fig. 1). Different genes and different meth-ods may sometimes result in contradicting trees. These problemsare typical in molecular phylogeny studies, even in those cases inwhich LGT can be ruled out. Other authors have previously high-lighted the incongruity between gene and presumed organismphylogeny (Poptsova and Gogarten, 2007).

Comas et al. (2006) analyzed whether incongruities apparent inprotein and/or gene trees from Xanthomonadales could have re-sulted from phylogenetic noise and/or LGT. Phylogenetic noise af-fects basal groups whose positions may have changed due tolimitations in phylogenetic reconstruction methods. They con-cluded that extensive transfer among Xanthomonadales and alpha,beta and even gamma-proteobacteria is the main source of incon-gruity in the proteobacterial tree. The results of Lima et al. (2008)corroborate this hypothesis through the identification of genomeislands as products of early and late LGT events, thus confirmingthe mosaic structure of Xanthomonas genomes. Phylogenetic treesof proteins not normally subject to LGT may help to solve thisproblem. However, as observed for 16S rRNA, RecA and other phy-logenetic reconstructions (Figs. 1 and 1S), the Xanthomonadalesgroup appears as paraphyletic with respect to the beta-proteobac-teria, branching at the base of the gamma-proteobacteria clade.

The identification of specific signature sequences is a reliableand consistent approach for determining relative branching andtaxonomic relationships, and has been used with success to deter-mine both broad and specific evolutionary relationships (Gupta,1997, 1998, 2000; Gupta and Griffiths, 2002). The main idea ofthese studies is based on the assumption that these signatures,mainly INDELs, are unique events within certain genes, and pro-vide powerful tools for inferring evolutionary relationships. How-ever, some of these INDELs do present some variation, whichappears as several small gaps within the insertion presented inFig. 2S. Although most of these gaps are clearly specific to certainclades, their origin indicates that the region of the protein affectedby the INDEL has some structural flexibility due to low selectivepressures.

lacement of Xanthomonadales based on conserved protein signature se-

Fig. 4. Partial sequence alignment of the DNA ligase NAD-dependent protein, showing a 65-amino acid insert specific for Xanthomonadales. Details as described in Fig. 2.

8 A.M. Cutiño-Jiménez et al. / Molecular Phylogenetics and Evolution xxx (2009) xxx–xxx

ARTICLE IN PRESS

In the present work, the search for INDELs was applied mainlyto uncover taxonomic relationships of the Xanthomonadales orderwithin the proteobacteria, by using a set of conserved proteins re-lated to core metabolic functions such as DNA repair and replica-tion. The main results of these analyses are summarized in aschematic genetic evolutionary flow of beta and gamma-proteo-bacteria, as shown in Fig. 5.

Interestingly, most of the described INDELs distinguish thegamma-proteobacteria group from other proteobacteria groups,as these INDELs are absent in beta, alpha or delta/epsilon subdivi-sions; however, the same profile of absence is shared by earlydivergent groups inside the gamma-proteobacteria, notably Xan-thomonadales, Chromatiales, Methylococcales, Legionellales, Car-diobacteriales and Thiotrichales. This strongly suggests a closerelationship among these groups, in agreement with the phyloge-netic trees shown in Figs. 1 and 1S. In fact, these INDELs were mostlikely to have been introduced after these groups branched fromthe rest of the gamma-proteobacteria. The profile of presence ingamma-proteobacterial species and absence in other proteobacte-rial groups indicates that these signatures constitute a synapomor-phy for this subdivision.

Please cite this article in press as: Cutiño-Jiménez, A.M., et al. Evolutionary pquences. Mol. Phylogenet. Evol. (2009), doi:10.1016/j.ympev.2009.09.026

A detailed phylogenomic analysis of the gamma-proteobacteriawas recently published (Gao et al., 2009). Basically, a phylogenetictree based on concatenated protein sequences from conservedgenes placed Xanthomonadales as a deep branch (as well as Thio-trichales, Cardiobacteriales, Legionellales, Chromatialles and Meth-ylococcales) in the phylogenies of gamma-proteobacteria.Although the beta-proteobacteria were not included in those anal-yses, a specific 2-amino acid INDEL, in AIACR-transformylase(PurH), could differentiate all the other proteobacteria (includingbeta-proteobacteria) from Xanthomonadales and gamma-proteo-bacteria. This INDEL was included in the schematic genetic evolu-tionary flow shown in Fig. 5. Moreover, four conserved proteins, ofunknown functions, were identified as exclusive to the gamma-proteobacteria, including Xanthomonadales. These results are goodindications that Xanthomonadales share important features withthe gamma-proteobacteria, but similar searches for genes sharedexclusively between Xanthomonadales and beta-proteobacteriawould be important to understand the relationships among thesegroups. In fact, the INDEL found for topoisomerase I (Fig. 3) andits interesting phylogeny (Fig. 1S) place the Xanthomonadales (aswell as Legionellales, Chromatialles, Methylococcales, Cardiobacte-

lacement of Xanthomonadales based on conserved protein signature se-

Fig. 5. Simplified evolutionary scheme of gamma-proteobacteria, based on INDEL identification. Groups were placed in schematic representation according to sharedsignature sequences. INDELs (white circles) are depicted in the branches where they probably arose. Question marks indicate phylogenetic positions that are not clear,comprising the deep branching gamma-proteobacteria orders, including Xanthomonadales.

A.M. Cutiño-Jiménez et al. / Molecular Phylogenetics and Evolution xxx (2009) xxx–xxx 9

ARTICLE IN PRESS

riales and some of the Thiotrichales, Francisella genus) in the samebranch of beta-proteobacteria, but, in this case, independent ofgamma-proteobacteria. This can be explained by a duplication ofthis gene, followed by loss of one of the paralogues. The evolutionof the two paralogues place the Xanthomonadales as an outgroupfrom the core gamma-proteobacteria. An alternative explanationis that this insert was introduced in a common ancestor of the var-ious alpha (and a few delta), beta and gamma-proteobacteria, fol-lowed by a second genetic event leading to loss of this insert from acommon ancestor of the Enterobacteriales, Pasteurellales, Vibrio-nales, Alteromonadales Aeromonadales, Oceanospirillales, Pseudo-monadales and some from Thiotrichales group (Thiomicrospiragenus).

On the other hand, the INDELs exclusively shared by homologsfrom Xanthomonadales, and absent in any homolog from otherproteobacteria, are suggestive of the independent evolution of thisgroup and provide molecular signatures to distinguish and defineit. These signatures are likely recent molecular events in the Xan-thomonadales, which occurred after its divergence from the mainbranches of gamma- and beta-proteobacteria. In view of the highlyconserved and well-defined nature of these signatures, they clearlydefine the Xanthomonadales as monophyletic. Combining the dataof INDEL and phylogeny, these bacteria are more likely to haveevolved independently soon after, or concomitant to, the diver-gence of the beta and gamma-proteobacteria.

Molecular approaches are being used increasingly not only inevolutionary and taxonomic studies, but also in epidemiologicalscrutinies. Because of their specificity and conservation, signa-ture sequences for Xanthomonadales may provide a novel meansfor identifying known or new species within Xanthomonadalesby means of PCR-based and immunological methods. Althoughconserved inside Xanthomonas, Xylella and Stenotrophomonas,the insertion found in the DNA ligase NAD-dependent is hetero-geneous among the genera (Fig. 4). This characteristic is veryimportant and may be of practical significance, as PCR-basedprotocols could possibly contribute to pathogen detection andidentification at the genus level. Thus, knowledge of such spe-

Please cite this article in press as: Cutiño-Jiménez, A.M., et al. Evolutionary pquences. Mol. Phylogenet. Evol. (2009), doi:10.1016/j.ympev.2009.09.026

cific INDELs may provide the means for developing specific, ra-pid and easy diagnostic protocols to identify the presence orabsence of Xanthomonadales in DNA samples from potentiallyinfected plants or animals.

5. Conclusions

The alignment of conserved protein sequences has led to theidentification of conserved INDELs, useful for assessing phyloge-netic placement of the Xanthomonadales group. The data pre-sented here delineate the evolution of the gamma lineage andprovide further evidence for the early gamma placement, indicat-ing that Xanthomonadales constitute one of the deeper branchinglineage within gamma-proteobacteria Moreover, the results con-firm the monophyly of Xanthomonadales and constitute interest-ing genetic markers to identify these bacteria.

Acknowledgments

The authors thank Dr. Radhey S. Gupta (McMaster University,Ontario, Canada) for helpful discussion. The authors are also indebt to the anonymous reviewers for their comments, criticismsand excellent suggestions. Financial support was obtained fromFAPESP (São Paulo, Brazil) and CNPq (Brasília, Brazil). During thiswork, A.M.C.J. received a fellowship from CAPES (Brasília, DF, Bra-zil), and M.M.P. and W.C.L. from FAPESP.

Appendix A. Supplementary data

ML phylogenetic tree of topoisomerase I (Fig. 1S) and partialprotein alignments (Figs. 2S–7S). List of proteins numbers usedin phylogenetic analyses (Table S01) and of organisms used to per-form similarity searches (Table S02). Supplementary data associ-ated with this article can be found, in the online version, atdoi:10.1016/j.ympev.2009.09.026.

lacement of Xanthomonadales based on conserved protein signature se-

10 A.M. Cutiño-Jiménez et al. / Molecular Phylogenetics and Evolution xxx (2009) xxx–xxx

ARTICLE IN PRESS

References

Abascal, F., Zardoya, R., Posada, D., 2005. ProtTest: selection of best-fit models ofprotein evolution. Bioinformatics 21, 2104–2105.

Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W., Lipman, D.J.,1997. Gapped BLAST and PSI-BLAST: a new generation of protein databasesearch programs. Nucleic Acids Res. 25, 3389–3402.

Brocchieri, L., 2001. Phylogenetic inferences from molecular sequences: review andcritique. Theor. Popul. Biol. 59, 27–40.

Clark, M.A., Moran, N.A., Baumann, P., 1999. Sequence evolution in bacterialendosymbionts having extreme base compositions. Mol. Biol. Evol. 16, 1586–1598.

Comas, I., Moya, A., Azad, R.K., Lawrence, J.G., Gonzalez-Candelas, F., 2006. Theevolutionary origin of Xanthomonadales genomes and the nature of thehorizontal gene transfer process. Mol. Biol. Evol. 23, 2049–2057.

Cox, M.M., 2003. The bacterial RecA protein as a motor protein. Annu. Rev.Microbiol. 57, 551–577.

da Silva, A.C., Ferro, J.A., Reinach, F.C., Farah, C.S., Furlan, L.R., Quaggio, R.B.,Monteiro-Vitorello, C.B., Van Sluys, M.A., Almeida, N.F., Alves, L.M., do Amaral,A.M., Bertolini, M.C., Camargo, L.E., Camarotte, G., Cannavan, F., Cardozo, J.,Chambergo, F., Ciapina, L.P., Cicarelli, R.M., Coutinho, L.L., Cursino-Santos, J.R.,El-Dorry, H., Faria, J.B., Ferreira, A.J., Ferreira, R.C., Ferro, M.I., Formighieri, E.F.,Franco, M.C., Greggio, C.C., Gruber, A., Katsuyama, A.M., Kishi, L.T., Leite, R.P.,Lemos, E.G., Lemos, M.V., Locali, E.C., Machado, M.A., Madeira, A.M., Martinez-Rossi, N.M., Martins, E.C., Meidanis, J., Menck, C.F., Miyaki, C.Y., Moon, D.H.,Moreira, L.M., Novo, M.T., Okura, V.K., Oliveira, M.C., Oliveira, V.R., Pereira, H.A.,Rossi, A., Sena, J.A., Silva, C., de Souza, R.F., Spinola, L.A., Takita, M.A., Tamura,R.E., Teixeira, E.C., Tezza, R.I., Trindade dos Santos, M., Truffi, D., Tsai, S.M.,White, F.F., Setubal, J.C., Kitajima, J.P., 2002. Comparison of the genomes of twoXanthomonas pathogens with differing host specificities. Nature 417, 459–463.

Davis, M.J., Purcell, A.H., Thomson, S.V., 1978. Pierce’s disease of grapevines:isolation of the causal bacterium. Science 199, 75–77.

Eisen, J.A., 1995. The RecA protein as a model molecule for molecular systematicstudies of bacteria: comparison of trees of RecAs and 16S rRNAs from the samespecies. J. Mol. Evol. 41, 1105–1123.

Feinberg, H., Changela, A., Mondragon, A., 1999. Protein–nucleotide interactions inE. Coli DNA topoisomerase I. Nat. Struct. Biol. 6, 961–968.

Gao, B., Mohan, R., Gupta, R.S., 2009. Phylogenomics and protein signatureselucidating the evolutionary relationships among the Gammaproteobacteria.Int. J. Syst. Evol. Microbiol. 59, 234–247.

Griffiths, E., Gupta, R.S., 2007. Identification of signature proteins that aredistinctive of the Deinococcus–Thermus phylum. Int. Microbiol. 10, 201–208.

Gupta, R.S., 1997. Protein phylogenies and signature sequences: evolutionaryrelationships within prokaryotes and between prokaryotes and eukaryotes.Antonie Van Leeuwenhoek 72, 49–61.

Gupta, R.S., 1998. Protein phylogenies and signature sequences: a reappraisal ofevolutionary relationships among archaebacteria, eubacteria, and eukaryotes.Microbiol. Mol. Biol. Rev. 62, 1435–1491.

Gupta, R.S., 2000. The phylogeny of proteobacteria: relationships to othereubacterial phyla and eukaryotes. FEMS Microbiol. Rev. 24, 367–402.

Gupta, R.S., 2005. Protein signatures distinctive of alpha proteobacteria and itssubgroups and a model for alpha-proteobacterial evolution. Crit. Rev. Microbiol.31, 101–135.

Gupta, R.S., 2006. Molecular signatures (unique proteins and conserved indels) thatare specific for the epsilon proteobacteria (Campylobacterales). BMC Genomics7, 167.

Gupta, R.S., Griffiths, E., 2002. Critical issues in bacterial phylogeny. Theor. Popul.Biol. 61 (4), 423–434.

Gupta, R.S., Mok, A., 2007. Phylogenomics and signature proteins for the alphaproteobacteria and its main groups. BMC Microbiol. 7, 106.

Hall, B.G., 2005. Comparison of the accuracies of several phylogenetic methodsusing protein and DNA sequences. Mol. Biol. Evol. 22, 792–802.

Jain, R., Rivera, M.C., Lake, J.A., 1999. Horizontal gene transfer among genomes: thecomplexity hypothesis. Proc Natl. Acad. Sci. USA 96, 3801–3806.

Korbel, J.O., Snel, B., Huynen, M.A., Bork, P., 2002. SHOT: a web server for theconstruction of genome phylogenies. Trends Genet. 18, 158–162.

Kunkel, T.A., Erie, D.A., 2005. DNA mismatch repair. Annu. Rev. Biochem. 74, 681–710.

Larkin, M.A., Blackshields, G., Brown, N.P., Chenna, R., McGettigan, P.A., McWilliam,H., Valentin, F., Wallace, I.M., Wilm, A., Lopez, R., Thompson, J.D., Gibson, T.J.,Higgins, D.G., 2007. Clustal W and Clustal X version 2.0. Bioinformatics 23,2947–2948.

Lee, B.M., Park, Y.J., Park, D.S., Kang, H.W., Kim, J.G., Song, E.S., Park, I.C., Yoon, U.H.,Hahn, J.H., Koo, B.S., Lee, G.B., Kim, H., Park, H.S., Yoon, K.O., Kim, J.H., Jung, C.H.,Koh, N.H., Seo, J.S., Go, S.J., 2005. The genome sequence of Xanthomonas oryzaepathovar oryzae KACC10331, the bacterial blight pathogen of rice. Nucleic AcidsRes. 33, 577–586.

Leyns, F., De Cleene, M., Swings, J.G., De Ley, J., 1984. The host range of the genusXanthomonas. Bot. Rev. 50, 308–356.

Lima, W.C., Van Sluys, M.A., Menck, C.F., 2005. Non-gamma-proteobacteria geneislands contribute to the Xanthomonas genome. OMICS J. Int. Biol. 9, 160–172.

Lima, W.C., Paquola, A.C., Varani, A.M., Van Sluys, M.A., Menck, C.F., 2008. Laterallytransferred genomic islands in Xanthomonadales related to pathogenicity andprimary metabolism. FEMS Microbiol. Lett. 281, 87–97.

Please cite this article in press as: Cutiño-Jiménez, A.M., et al. Evolutionary pquences. Mol. Phylogenet. Evol. (2009), doi:10.1016/j.ympev.2009.09.026

Ludwig, W., Schleifer, K.H., 1994. Bacterial phylogeny based on 16S and 23S rRNAsequence analysis. FEMS Microbiol. Rev. 15, 155–173.

Martins-Pinheiro, M., Galhardo, R.S., Lage, C., Lima-Bessa, K.M., Aires, K.A., Menck,C.F., 2004. Different patterns of evolution for duplicated DNA repair genes inbacteria of the Xanthomonadales group. BMC Evol. Biol. 4, 29.

Nicholas, K.B., Nicholas, H.B.J., Deerfield, D.W.I., 1997. GeneDoc: analysis andvisualization of genetic variation. EMB News 4, 14.

Ochman, H., 2001. Lateral and oblique gene transfer. Curr. Opin. Genet. Dev. 11,616–619.

Olsen, G.J., Larsen, N., Woese, C.R., 1991. The ribosomal RNA database project.Nucleic Acids Res. 19 (Suppl.), 2017–2021.

Page, R.D., 1996. TreeView: an application to display phylogenetic trees on personalcomputers. Comput. Appl. Biosci. 12, 357–358.

Patel, P.H., Loeb, L.A., 2001. Getting a grip on how DNA polymerases function. Nat.Struct. Biol. 8, 656–659.

Patel, P.H., Suzuki, M., Adman, E., Shinkai, A., Loeb, L.A., 2001. Prokaryotic DNApolymerase I: evolution, structure, and ‘‘base flipping” mechanism fornucleotide selection. J. Mol. Biol. 308, 823–837.

Philippe, H., Zhou, Y., Brinkmann, H., Rodrigue, N., Delsuc, F., 2005. Heterotachy andlong-branch attraction in phylogenetics. BMC Evol. Biol. 5, 50.

Poptsova, M.S., Gogarten, J.P., 2007. The power of phylogenetic approaches to detecthorizontally transferred genes. BMC Evol. Biol. 7, 45.

Purcell, A.H., Hopkins, D.L., 1996. Fastidious xylem-limited bacterial plantpathogens. Annu. Rev. Phytopathol. 34, 131–151.

Qian, W., Jia, Y., Ren, S.X., He, Y.Q., Feng, J.X., Lu, L.F., Sun, Q., Ying, G., Tang, D.J., Tang,H., Wu, W., Hao, P., Wang, L., Jiang, B.L., Zeng, S., Gu, W.Y., Lu, G., Rong, L., Tian,Y., Yao, Z., Fu, G., Chen, B., Fang, R., Qiang, B., Chen, Z., Zhao, G.P., Tang, J.L., He, C.,2005. Comparative and functional genomic analyses of the pathogenicity ofphytopathogen Xanthomonas campestris pv. campestris. Genome Res. 15, 757–767.

Quillaguaman, J., Delgado, O., Mattiasson, B., Hatti-Kaul, R., 2004. Chromohalobactersarecensis sp. nov., a psychrotolerant moderate halophile isolated from thesaline Andean region of Bolivia. Int. J. Syst. Evol. Microbiol. 54, 1921–1926.

Schneider, A., Dessimoz, C., Gonnet, G.H., 2007. OMA Browser – exploringorthologous relations across 352 complete genomes. Bioinformatics 23, 2180–2182.

Simpson, A.J., Reinach, F.C., Arruda, P., Abreu, F.A., Acencio, M., Alvarenga, R., Alves,L.M., Araya, J.E., Baia, G.S., Baptista, C.S., Barros, M.H., Bonaccorsi, E.D., Bordin, S.,Bove, J.M., Briones, M.R., Bueno, M.R., Camargo, A.A., Camargo, L.E., Carraro,D.M., Carrer, H., Colauto, N.B., Colombo, C., Costa, F.F., Costa, M.C., Costa-Neto,C.M., Coutinho, L.L., Cristofani, M., Dias-Neto, E., Docena, C., El-Dorry, H.,Facincani, A.P., Ferreira, A.J., Ferreira, V.C., Ferro, J.A., Fraga, J.S., Franca, S.C.,Franco, M.C., Frohme, M., Furlan, L.R., Garnier, M., Goldman, G.H., Goldman,M.H., Gomes, S.L., Gruber, A., Ho, P.L., Hoheisel, J.D., Junqueira, M.L., Kemper,E.L., Kitajima, J.P., Krieger, J.E., Kuramae, E.E., Laigret, F., Lambais, M.R., Leite, L.C.,Lemos, E.G., Lemos, M.V., Lopes, S.A., Lopes, C.R., Machado, J.A., Machado, M.A.,Madeira, A.M., Madeira, H.M., Marino, C.L., Marques, M.V., Martins, E.A.,Martins, E.M., Matsukuma, A.Y., Menck, C.F., Miracca, E.C., Miyaki, C.Y.,Monteriro-Vitorello, C.B., Moon, D.H., Nagai, M.A., Nascimento, A.L., Netto, L.E.,Nhani Jr., A., Nobrega, F.G., Nunes, L.R., Oliveira, M.A., de Oliveira, M.C., deOliveira, R.C., Palmieri, D.A., Paris, A., Peixoto, B.R., Pereira, G.A., Pereira Jr., H.A.,Pesquero, J.B., Quaggio, R.B., Roberto, P.G., Rodrigues, V., de, M.R.A.J., de Rosa Jr.,V.E., de Sa, R.G., Santelli, R.V., Sawasaki, H.E., da Silva, A.C., da Silva, A.M., daSilva, F.R., da Silva Jr., W.A., da Silveira, J.F., Silvestri, M.L., Siqueira, W.J., deSouza, A.A., de Souza, A.P., Terenzi, M.F., Truffi, D., Tsai, S.M., Tsuhako, M.H.,Vallada, H., Van Sluys, M.A., Verjovski-Almeida, S., Vettore, A.L., Zago, M.A., Zatz,M., Meidanis, J., Setubal, J.C., 2000. The genome sequence of the plant pathogenXylella fastidiosa. The Xylella fastidiosa Consortium of the organization fornucleotide sequencing and analysis. Nature 406, 151–159.

Stackebrandt, E., Woese, C.R., 1984. The phylogeny of prokaryotes. Microbiol. Sci. 1,117–122.

Stamatakis, A., 2006. RAxML-VI-HPC: maximum likelihood-based phylogeneticanalyses with thousands of taxa and mixed models. Bioinformatics 22, 2688–2690.

Steitz, T.A., 1999. DNA polymerases: structural diversity and common mechanisms.J. Biol. Chem. 274, 17395–17398.

Stoffels, M., Ludwig, W., Schleifer, K.H., 1999. RRNA probe-based cell fishing ofbacteria. Environ. Microbiol. 1, 259–271.

Swings, J.G., Civerolo, E.L., 1993. Xanthomonas. Chapman & Hall, London.Thieme, F., Koebnik, R., Bekel, T., Berger, C., Boch, J., Buttner, D., Caldana, C., Gaigalat,

L., Goesmann, A., Kay, S., Kirchner, O., Lanz, C., Linke, B., McHardy, A.C., Meyer, F.,Mittenhuber, G., Nies, D.H., Niesbach-Klosgen, U., Patschkowski, T., Ruckert, C.,Rupp, O., Schneiker, S., Schuster, S.C., Vorholter, F.J., Weber, E., Puhler, A., Bonas,U., Bartels, D., Kaiser, O., 2005. Insights into genome plasticity andpathogenicity of the plant pathogenic bacterium Xanthomonas campestris pv.vesicatoria revealed by the complete genome sequence. J. Bacteriol. 187, 7254–7266.

Thompson, J.D., Gibson, T.J., Plewniak, F., Jeanmougin, F., Higgins, D.G., 1997. TheCLUSTAL_X windows interface: flexible strategies for multiple sequencealignment aided by quality analysis tools. Nucleic Acids Res. 25, 4876–4882.

Van Sluys, M.A., Monteiro-Vitorello, C.B., Camargo, L.E., Menck, C.F., Da Silva, A.C.,Ferro, J.A., Oliveira, M.C., Setubal, J.C., Kitajima, J.P., Simpson, A.J., 2002.Comparative genomic analysis of plant-associated bacteria. Annu. Rev.Phytopathol. 40, 169–189.

Van Sluys, M.A., de Oliveira, M.C., Monteiro-Vitorello, C.B., Miyaki, C.Y., Furlan, L.R.,Camargo, L.E., da Silva, A.C., Moon, D.H., Takita, M.A., Lemos, E.G., Machado,

lacement of Xanthomonadales based on conserved protein signature se-

A.M. Cutiño-Jiménez et al. / Molecular Phylogenetics and Evolution xxx (2009) xxx–xxx 11

ARTICLE IN PRESS

M.A., Ferro, M.I., da Silva, F.R., Goldman, M.H., Goldman, G.H., Lemos, M.V., El-Dorry, H., Tsai, S.M., Carrer, H., Carraro, D.M., de Oliveira, R.C., Nunes, L.R.,Siqueira, W.J., Coutinho, L.L., Kimura, E.T., Ferro, E.S., Harakava, R., Kuramae, E.E.,Marino, C.L., Giglioti, E., Abreu, I.L., Alves, L.M., do Amaral, A.M., Baia, G.S.,Blanco, S.R., Brito, M.S., Cannavan, F.S., Celestino, A.V., da Cunha, A.F., Fenille,R.C., Ferro, J.A., Formighieri, E.F., Kishi, L.T., Leoni, S.G., Oliveira, A.R., Rosa Jr.,V.E., Sassaki, F.T., Sena, J.A., de Souza, A.A., Truffi, D., Tsukumo, F., Yanai, G.M.,

Please cite this article in press as: Cutiño-Jiménez, A.M., et al. Evolutionary pquences. Mol. Phylogenet. Evol. (2009), doi:10.1016/j.ympev.2009.09.026

Zaros, Civerolo, E.L., Simpson, A.J., Almeida Jr., N.F., Setubal, J.C., Kitajima, J.P., .Comparative analyses of the complete genome sequences of Pierce’s diseaseand citrus variegated chlorosis strains of Xylella fastidiosa. J. Bacteriol. 185,1018–1026.

Wilkinson, A., Day, J., Bowater, R., 2001. Bacterial DNA ligases. Mol. Microbiol. 40,1241–1248.

lacement of Xanthomonadales based on conserved protein signature se-