195
Genome wide identification of NBS-LRR genes in Brassica and their association with disease resistance in Brassica napus Salman F. Alamery BSC (Biochemistry) MSC (Molecular Biology and Biotechnology) A thesis submitted for the degree of Doctor of Philosophy at The University of Queensland in 2015 School of Agriculture and Food Sciences

Genome wide identification of NBS-LRR genes in Brassica …360384/s4255… ·  · 2017-01-18caused by the fungal pathogen Leptosphaeria maculans, is one of the most devastating diseases

  • Upload
    vungoc

  • View
    213

  • Download
    0

Embed Size (px)

Citation preview

Genome wide identification of NBS-LRR genes in Brassica and

their association with disease resistance in Brassica napus

Salman F. Alamery

BSC (Biochemistry)

MSC (Molecular Biology and Biotechnology)

A thesis submitted for the degree of Doctor of Philosophy at

The University of Queensland in 2015

School of Agriculture and Food Sciences

i

Abstract

Brassica napus (canola/rapeseed/oilseed rape) is an important commercial oilseed crop in Australia

with an annual production of approximately 1.6 million tons of oil. Canola is an important source of

edible vegetable oil and has a broad range of industrial purposes. Blackleg disease (stem canker)

caused by the fungal pathogen Leptosphaeria maculans, is one of the most devastating diseases in

B. napus. This disease causes significant yield losses with an annual average loss of 15 to 48 %,

although losses can reach up to 80% worldwide, mainly in Europe, Australia and Canada.

In order to develop an effective strategy to control the blackleg disease, there is a need to identify

blackleg resistance genes in Brassica species and understand the genetic interaction between plant

resistance genes and the pathogen avirulence genes. Thus, identification of Nucleotide Binding Site-

Leucine Rich Repeat (NBS-LRR) resistance genes is one of the most important objectives of

understanding resistance. NBS-LRR resistance genes have been extensively studied because they

represent the largest class of disease resistance genes and play a critical role in defending plants

from pathogens.

The objective of this study was to perform comprehensive analysis on identification and

characterization of NBS-LRR genes in the B. napus genome and to study the synteny and

conservation of NBS-LRR genes between Brassica species. In this study, a total of 641, 249 and

443 NBS-LRR encoding genes in B. napus, B. rapa and B. oleracea, respectively, were identified.

The comparative analysis between B. napus and its progenitor species indicated that NBS-LRR

genes exhibited similar gene structure, genomic location, arrangement in clusters and syntenic

relationships. The results provide evidence that there was a selective advantage to maintaining

similar features of NBS-LRR genes in B. napus to both B. rapa and B. oleracea following

polyploidization. More than 60% of NBS LRR genes from the progenitor species were conserved.

The differences in NBS-LRR gene conservation could be attributed to gene losses or selection

pressure to offer species-specific or cultivars-specific resistance.

This study found that the NBS-LRR resistance genes are physically clustered and individual genes

involved in clusters were more polymorphic and subject to evolutionary process than singleton

genes. These clusters, which have been described in many other species, provide a reservoir of

genetic variation influenced by tandem duplication and selection pressure. In addition, there was a

significant correlation and co-localization between the number of NBS-LRR genes within the

disease QTL intervals and the number of genes involved in gene clusters or duplication. This

ii

correlation provides evidence that NBS-LRR are distributed and clustered throughout the genome

and tends to be linked and associated with disease QTL intervals.

Genetic studies have identified the gene for gene interactions between avirulence (Avr) genes in L.

maculans and their corresponding Rlm (Resistance to Leptosphaeria maculans) genes in B. napus.

In addition, genetic mapping studies have shown that there are five major resistance genes on

chromosome A7: Rlm1, Rlm3, Rlm4, Rlm7, and Rlm9. At present, none of the genetically mapped

Rlm genes on chromosome A7 have been sequenced and validated in B. napus.

A total of 12 NBS-LRR and 18 LRR-containing resistance genes were on B. napus chromosome A7

located within the Rlm QTL region of interest. The comparative analysis of these resistance genes

between Brassica species confirmed the gene synteny and conservation. However, there was

considerable variation; either gene presence/absence or substantial differences in the protein

sequence. The comparative analysis allowed making an initial prediction and prioritization for

targeting these identified genes for further characterization and validation. In this study, a candidate

gene approach, combined with comparative analysis, was exploited for identification of Rlm9

candidate genes in B. napus. The candidate gene approach identified six NBS-LRR and eight LRR-

containing genes associated with the Rlm9 QTL region. The NBS-LRR genes were selected for

further analysis as highest priority candidate genes.

These results provide the first in-depth molecular characterization of NBS-LRR genes in B. napus

providing potential candidate gene for disease resistance trait in B. napus. More importantly, this

work has significantly increased our understanding about blackleg resistance in B. napus and major

disease resistance genes have been identified.

iii

Declaration by author

This thesis is composed of my original work, and contains no material previously published or

written by another person except where due reference has been made in the text. I have clearly

stated the contribution by others to jointly-authored works that I have included in my thesis.

I have clearly stated the contribution of others to my thesis as a whole, including statistical

assistance, survey design, data analysis, significant technical procedures, professional editorial

advice, and any other original research work used or reported in my thesis. The content of my thesis

is the result of work I have carried out since the commencement of my research higher degree

candidature and does not include a substantial part of work that has been submitted to qualify for

the award of any other degree or diploma in any university or other tertiary institution. I have

clearly stated which parts of my thesis, if any, have been submitted to qualify for another award.

I acknowledge that an electronic copy of my thesis must be lodged with the University Library and,

subject to the policy and procedures of The University of Queensland, the thesis be made available

for research and study in accordance with the Copyright Act 1968 unless a period of embargo has

been approved by the Dean of the Graduate School.

I acknowledge that copyright of all material contained in my thesis resides with the copyright

holder(s) of that material. Where appropriate I have obtained copyright permission from the

copyright holder to reproduce material in this thesis.

iv

Publications during candidature

Peer reviewed journal publications

Raman, H., Dalton-Morgan, J., Diffey, S., Raman, R., Alamery, S., Edwards, D. & Batley, J. 2014.

SNP markers-based map construction and genome-wide linkage analysis in Brassica

napus. Plant Biotechnology Journal, 12, 851-860.

Dalton-Morgan, J., Hayward, A., Alamery, S., Tollenaere, R., Mason, A., Campbell, E., Patel, D.,

Lorenc, M., Yi, B., Long, Y., Meng, J., Raman, R., Raman, H., Lawley, C., Edwards, D. &

Batley, J. 2014. A high-throughput SNP array in the amphidiploid species Brassica

napus shows diversity in resistance genes. Functional & Integrative Genomics, 1-13.

Chalhoub, B., Denoeud, F., Liu, S., Parkin, I. A. P., Tang, H., Wang, X., Chiquet, J., Belcram, H.,

Tong, C., Samans, B., Corréa, M., Da Silva, C., Just, J., Falentin, C., Koh, C. S., Le

Clainche, I., Bernard, M., Bento, P., Noel, B., Labadie, K., Alberti, A., Charles, M., Arnaud,

D., Guo, H., Daviaud, C., Alamery, S., Jabbari, K., Zhao, M., Edger, P. P., Chelaifa, H.,

Tack, D., Lassalle, G., Mestiri, I., Schnel, N., Le Paslier, M.-C., Fan, G., Renault, V., Bayer,

P. E., Golicz, A. A., Manoli, S., Lee, T.-H., Thi, V. H. D., Chalabi, S., Hu, Q., Fan, C.,

Tollenaere, R., Lu, Y., Battail, C., Shen, J., Sidebottom, C. H. D., Wang, X., Canaguier, A.,

Chauveau, A., Bérard, A., Deniot, G., Guan, M., Liu, Z., Sun, F., Lim, Y. P., Lyons, E.,

Town, C. D., Bancroft, I., Wang, X., Meng, J., Ma, J., Pires, J. C., King, G. J., Brunel, D.,

Delourme, R., Renard, M., Aury, J.-M., Adams, K. L., Batley, J., Snowdon, R. J., Tost, J.,

Edwards, D., Zhou, Y., Hua, W., Sharpe, A. G., Paterson, A. H., Guan, C. & Wincker, P.

2014. Early allopolyploid evolution in the post-Neolithic Brassica napus oilseed

genome. Science, 345, 950-953.

Salman Alamery, Reece Tollenaere, Philipp E. Bayer, Boulos Chalhoub, David Edwards and

Jacqueline Batley (2015) Genome wide identification and characterization of NBS-LRR

resistance genes in Brassica napus, Plant Biotechnology journal (submitted).

Salman Alamery, David Edwards and Jacqueline Batley (2015) Association and localization of

NBS-LRR genes with disease resistance QTL in Brassica napus, international Journal of

Genomics (submitted).

v

Publications included in this thesis

Chalhoub, B., Denoeud, F., Liu, S., Parkin, I. A. P., Tang, H., Wang, X., Chiquet, J., Belcram, H.,

Tong, C., Samans, B., Corréa, M., Da Silva, C., Just, J., Falentin, C., Koh, C. S., Le

Clainche, I., Bernard, M., Bento, P., Noel, B., Labadie, K., Alberti, A., Charles, M., Arnaud,

D., Guo, H., Daviaud, C., Alamery, S., Jabbari, K., Zhao, M., Edger, P. P., Chelaifa, H.,

Tack, D., Lassalle, G., Mestiri, I., Schnel, N., Le Paslier, M.-C., Fan, G., Renault, V., Bayer,

P. E., Golicz, A. A., Manoli, S., Lee, T.-H., Thi, V. H. D., Chalabi, S., Hu, Q., Fan, C.,

Tollenaere, R., Lu, Y., Battail, C., Shen, J., Sidebottom, C. H. D., Wang, X., Canaguier, A.,

Chauveau, A., Bérard, A., Deniot, G., Guan, M., Liu, Z., Sun, F., Lim, Y. P., Lyons, E.,

Town, C. D., Bancroft, I., Wang, X., Meng, J., Ma, J., Pires, J. C., King, G. J., Brunel, D.,

Delourme, R., Renard, M., Aury, J.-M., Adams, K. L., Batley, J., Snowdon, R. J., Tost, J.,

Edwards, D., Zhou, Y., Hua, W., Sharpe, A. G., Paterson, A. H., Guan, C. & Wincker, P.

2014. Early allopolyploid evolution in the post-Neolithic Brassica napus oilseed

genome. Science, 345, 950-953 (accepted). Included in Chapter 3

Contributor Statement of contribution

Salman Alamery (Candidate) NBS-LRR gene analysis, identification and

comparative analysis 40%

Jacqueline Batley NBS-LRR gene analysis, group leader 10%

Reece Tollenaere NBS-LRR gene analysis, identification and

comparative analysis 10%

Philipp E. Bayer

NBS-LRR gene analysis, bioinformatics 40% Chuchuan Fan

Yong Pyo Lim

Sahana Manoli

vi

Salman Alamery, Reece Tollenaere, Philipp E. Bayer, Boulos Chalhoub, David Edwards and

Jacqueline Batley (2015) Genome wide identification and characterization of NBS-LRR

resistance genes in Brassica napus, Plant Biotechnology journal (submitted). Included as Chapter

2 and 3

Contributor Statement of contribution

Salman Alamery (Candidate) Designed experiments (60%)

Conducted experiments (100%)

Sequence data analysis (60%)

Wrote the paper (100%)

Jacqueline Batley Designed experiments (40%)

Edited paper (50%)

David Edwards Edited paper (30%)

Boulos Chalhoub Edited paper (20%)

Philipp E. Bayer Sequence data analysis (20%)

Reece Tollenaere Sequence data analysis (20%)

Salman Alamery, David Edwards and Jacqueline Batley (2015) Association and localization of

NBS-LRR genes with disease resistance QTL in Brassica napus, International Journal of

Genomics (submitted). Included as Chapter 4

Contributor Statement of contribution

Salman Alamery (Candidate) Designed experiments (60%)

Conducted experiments (100%)

Wrote the paper (100%)

Jacqueline Batley Designed experiments (40%)

Edited paper (50%)

David Edwards Edited paper (50%)

vii

Contributions by others to the thesis

Prof Jacqueline Batley contributed to the conception and supervision of the research and assisted

with editing of writing. Illumina read mapping of Skipton and Ag-Spectrum reads and running of

the MAST/MEME pipeline was performed by Prof Edwards’ bioinformatics group.

Statement of parts of the thesis submitted to qualify for the award of another degree

None

viii

Acknowledgements

I am pleased to dedicate this thesis to all people who have helped and supported me during my PhD

study.

First of all, I would like to acknowledge and extend my deep gratitude to my principal advisor, Prof

Jacqueline Batley for supervision and providing me with the opportunity to conduct my PhD

research. She has been constantly supportive and encouraging throughout my PhD study. I have

learned a lot from her expertise and without her, this thesis wouldn’t have been completed. I greatly

appreciated her advices, friendship and the time taken to read my papers and thesis. I look forward

to further collaboration with in the future.

I also wish to thank my co-advisor, Prof David Edwards and his bioinformatics group at the

University of Queensland for assisting with bioinformatics analysis.

I would also like to extend my gratitude to all the members of the Batley group and the staff and

students at CILR at the University of Queensland, who I have been lucky enough to share a lab with

them. In particular, I would like to thank Reece Tollenaere for his friendship, helpful discussions.

I would like to thank A/Prof Elizabeth Aitken and Dr Brett Ferguson for their contributions to the

university assessment process and constructive advice during candidature reviews.

I would like to acknowledge King Saud University, Saudi Arabia, for a PhD Scholarship and

financial support during my study. Acknowledge is also made to the University of Queensland,

School of Agricultural and Food Sciences for study courses and research facilities to achieve my

PhD degree.

Above all, I would like to thank my wife, Alya and my daughters, Eman and Lean for their support,

great patience, love and encouragement at all times.

ix

Keywords

Brassica, Leptosphaeria maculans, blackleg disease, comparative genomics, NBS-LRR resistance

gene, gene cluster, candidate genes, disease resistance QTL

Australian and New Zealand Standard Research Classifications (ANZSRC)

ANZSRC code: 060702 Plant Cell and Molecular Biology, 40%

ANZSRC code: 060408 Genomics, 50%

ANZSRC code: 060704 Plant Pathology, 10%

Fields of Research (FoR) Classification

FoR code: 0607, Plant Biology, 80%

FoR code: 0601, Biochemistry and Cell Biology, 20%

x

Table of Contents

Abstract ............................................................................................................................................ i

Declaration by author ................................................................................................................... iii

Acknowledgements ...................................................................................................................... viii

Chapter 1 : Literature review ........................................................................................................... 2

1.1 Introduction ........................................................................................................................... 2

1.2 Brassica napus and related species ....................................................................................... 2

1.3 Brassica genome sequencing: an overview ........................................................................... 3

1.4 Identifying and characterizing the structure of NBS-LRR genes in Brassica ...................... 4

1.5 Genetics of disease resistance in Brassica ............................................................................ 6

1.6 The fungal pathogen Leptosphaeria maculans: Biology and blackleg disease ..................... 8

1.7 Molecular interaction between Brassica and L. maculans .................................................... 9

1.8 Genetic mapping of blackleg resistance genes in Brassica species .................................... 12

1.9 Introgression of blackleg resistance genes into the B. napus genome: attempts and

challenges ....................................................................................................................................... 17

1.10 Genomic approaches for identification of candidate resistance genes ................................ 19

1.10.1 Comparative analysis based candidate gene identification .......................................... 20

1.10.2 Sequence based candidate gene identification ............................................................. 21

1.10.3 SNP genotyping and association with candidate genes ............................................... 22

1.11 Research aims and significance ........................................................................................... 26

Chapter 2 : Genome wide identification and characterization of NBS-LRR resistance genes in

Brassica napus .................................................................................................................................. 30

2.1 Abstract ............................................................................................................................... 30

2.2 Introduction ......................................................................................................................... 31

2.3 Methods ............................................................................................................................... 33

2.3.1 Brassica genome and gene prediction ......................................................................... 33

2.3.2 Identification of NBS-LRR genes in B. napus by MEME/ MAST analysis................ 33

2.3.3 Identification of NBS-LRR genes in B. napus by CNLs and TNLs for validation ..... 33

2.3.4 Correlation between the MAST output and CNL/TNL BLAST results ...................... 34

2.3.5 Manual annotation check and characterization of candidate NBS-LRR genes ........... 34

2.3.6 Multiple alignment and phylogenetic analysis............................................................. 34

2.3.7 NBS-LRR genes cluster and duplication analyses ....................................................... 35

2.4 Results ................................................................................................................................. 36

2.4.1 Genome wide identification of B. napus NBS-LRR genes .......................................... 36

xi

2.4.2 Genomic distribution and organisation of NBS-LRR encoding genes in the B. napus

genome ………………………………………………………………………………………..40

2.4.3 NBS-LRR gene clustering ........................................................................................... 44

2.4.4 Alignment and phylogenetic analysis .......................................................................... 45

2.4.5 NBS-LRR gene duplication ......................................................................................... 49

2.5 Discussion ........................................................................................................................... 53

2.5.1 Genome wide identification of B. napus NBS-LRR genes .......................................... 53

2.5.2 Alignment and phylogenetic analysis .......................................................................... 55

2.5.3 NBS-LRR gene clustering ........................................................................................... 56

2.5.4 NBS-LRR gene duplication ......................................................................................... 57

Chapter 3 : Comparative and conservation analysis of NBS-LRR genes in Brassica ............... 61

3.1 Abstract ............................................................................................................................... 61

3.2 Introduction ......................................................................................................................... 62

3.3 Methods ............................................................................................................................... 64

3.3.1 Brassica reference genomes ......................................................................................... 64

3.3.2 Identification of NBS-LRR genes in B. rapa and B. oleracea .................................... 64

3.3.3 Reciprocal Best BLAST (RBB) of NBS-LRR for comparative analysis between B.

napus, B. rapa and B. oleracea................................................................................................... 64

3.4 Results ................................................................................................................................. 65

3.4.1 Comparison of the distribution of B. napus NBS-LRR genes to its related diploid

species B. rapa and B. oleracea .................................................................................................. 65

3.4.2 Conservation analysis of B. napus NBS-LRR genes to its related diploid species B.

rapa and B. oleracea ................................................................................................................... 75

3.4.3 Comparative analysis of chromosome A7 and C6 in B. napus .................................... 76

3.5 Discussion ........................................................................................................................... 79

3.5.1 Comparison of the distribution of B. napus NBS-LRR genes to its related diploid

progenitor species B. rapa and B. oleracea ................................................................................ 79

3.5.2 Conservation analysis of B. napus NBS-LRR genes to its related diploid species B.

rapa and B. oleracea ................................................................................................................... 80

3.5.3 Comparative analysis of chromosomes A7 and C6 in B. napus .................................. 82

Chapter 4 : Association and localization of NBS-LRR genes with disease resistance QTL in

Brassica napus .................................................................................................................................. 84

4.1 Abstract ............................................................................................................................... 84

4.2 Introduction ......................................................................................................................... 85

4.3 Methods ............................................................................................................................... 88

4.3.1 Brassica reference genome sequences ......................................................................... 88

xii

4.3.2 Identification of QTL intervals in B. napus by marker homology BLAST search ...... 88

4.4 Results ................................................................................................................................. 89

4.4.1 Identification and localization of QTL intervals in B. napus ....................................... 89

4.4.2 Co-localization and association of NBS-LRR genes and QTL intervals ..................... 99

4.5 Discussion ......................................................................................................................... 105

4.5.1 Identification and localization of QTL intervals in B. napus ..................................... 105

4.5.2 Co-localization and association of NBS-LRR genes and QTL intervals ................... 107

Chapter 5 : Comparative analysis enabled the identification of candidate Rlm9 blackleg

resistance genes in Brassica napus ................................................................................................ 110

5.1 Abstract ............................................................................................................................. 110

5.2 Introduction ....................................................................................................................... 111

5.3 Methods ............................................................................................................................. 113

5.3.1 Brassica genome and gene prediction ....................................................................... 113

5.3.2 Identification of NBS-LRR and LRR- containing genes in B. napus ........................ 113

5.3.3 Comparative analysis of NBS-LRR and LRR-containing gene on the A7 and C6

chromosome of B. napus .......................................................................................................... 113

5.3.4 Genetic mapping of the Rlm9 QTL ............................................................................ 113

5.3.5 SNP genotyping of B. napus cultivars ....................................................................... 114

5.3.6 Extraction of genomic DNA ...................................................................................... 116

5.3.7 PCR amplification and sequencing of Rlm9 candidate genes .................................... 116

5.3.8 DNA Sequencing and Sequence Analysis ................................................................. 118

5.4 Results ............................................................................................................................... 119

5.4.1 Identification and characterization of NBS LRR and LRR-containing genes on B.

napus chromosome A7 ............................................................................................................. 119

5.4.2 Comparative analysis of NBS-LRR and LRR-containing genes on chromosome A7 of

B. napus ………………………………………………………………………………………122

5.4.3 Haplotype analysis of Rlm9 and mapping potential SNPs in B. napus...................... 127

5.4.4 The identification of candidate Rlm9 blackleg resistance genes in B. napus ............ 130

5.4.4.1 Candidate gene BnDRG 13..................................................................................... 132

5.4.4.2 Candidate gene BnDRG 14..................................................................................... 132

5.4.4.3 Candidate gene BnDRG 17..................................................................................... 132

5.4.4.4 Candidate gene BnDRG19...................................................................................... 133

5.4.4.5 Candidate gene BnDRG20...................................................................................... 133

5.4.4.6 Candidate gene BnDRG 23..................................................................................... 133

5.4.5 Amplification and sequencing of Rlm9 candidate genes ........................................... 134

5.5 Discussion ......................................................................................................................... 145

xiii

5.5.1 Identification and characterization of NBS-LRR and LRR-containing genes on B.

napus chromosome A07 ........................................................................................................... 145

5.5.2 Comparative analysis of NBS-LRR and LRR-containing genes on chromosome A7

………………………………………………………………………………………145

5.5.3 Haplotype analysis of Rlm9 and mapping potential SNPs on B. napus ..................... 147

5.5.4 The physical location and identification of candidate Rlm9 blackleg resistance gene in

B. napus ………………………………………………………………………………………148

5.5.5 Amplification and sequencing Rlm9 candidate genes ................................................ 149

Chapter 6 : General discussion and future directions ................................................................ 153

6.1 Genome wide identification of NBS-LRR resistance genes in Brassica .......................... 153

6.2 Identification of candidate blackleg resistance genes in B. napus .................................... 155

6.3 Future directions ................................................................................................................ 158

Bibliography................................................................................................................................ 160

xiv

List of Figures

Figure ‎1.1: The genomic relationships between the six cultivated Brassica species indicating the

genome composition (U, 1935). ........................................................................................................... 3

Figure ‎1.2: The domain structure of NBS-LRR proteins in plants. Taken from (McHale et al.,

2006). ................................................................................................................................................... 5

Figure ‎1.3: Blackleg infection symptoms on B. napus caused by L. maculans .................................. 9

Figure ‎1.4: The mode of activation the plant immune responses by PAMP-triggered immunity

(PTI) and effector-triggered immunity (ETI). .................................................................................... 10

Figure ‎1.5: The infection outcomes between Brassica R genes and L. maculans Avr genes

according to the gene for gene interaction.. ....................................................................................... 11

Figure ‎1.6: Integration the partial genetic map of Rlm4 QTL on chromosome A7 of B. napus

against the physical map of B. rapa. .................................................................................................. 17

Figure ‎1.7: Sequence based identification of Rlm4 candidate gene using NGS data.. ..................... 22

Figure ‎1.8: The principles of whole genome genotyping using the Illumina infinium assay. .......... 25

Figure ‎1.9: Schematic representation of genomics based identification of candidate resistance

gene(s) used in this study. .................................................................................................................. 28

Figure ‎2.1: Example of representation of motif patterns in NBS-LRR genes in B. napus.. ............. 39

Figure ‎2.2: Physical locations of NBS-LRR genes on the chromosome of the A genome of B.

napus.. ................................................................................................................................................ 42

Figure ‎2.3: Physical locations of NBS-LRR genes on the chromosome of the C genome of B.

napus.. ................................................................................................................................................ 43

Figure ‎2.4: A phylogenetic tree of 176 CNL encoding genes of B. napus for NBS domains based

on the neighbour-joining method using MEGA 6.0 software.. .......................................................... 47

Figure ‎2.5: A phylogenetic tree of 366 TNL encoding genes of B. napus for NBS domains based on

the neighbour-joining method using MEGA 6.0 software. ................................................................ 48

Figure ‎2.6: Inter-genomic and intra-genomic duplication relationship of NBS-LRR -genes between

the two sub-genomes of B. napus: the A and C genomes.. ................................................................ 50

Figure ‎3.1: The distribution of NBS-LRR genes in the A genome of B. napus and B. rapa

(Chalhoub et al., 2014). ...................................................................................................................... 69

Figure ‎3.2: The distribution of NBS-LRR genes in the C genome of B. napus and B. oleracea

(Chalhoub et al., 2014). ...................................................................................................................... 69

Figure ‎3.3: The proportion of TNL and CNL genes to the total number of NBS-LRR genes in B.

napus, B. rapa and B. oleracea. ......................................................................................................... 70

Figure ‎3.4: Comparative organization of NBS-LRR genes between the An genome of B. napus and

Ar genome of B. rapa (chromosomes A1 to A5).). ........................................................................... 71

Figure ‎3.5: Comparative organization of NBS-LRR genes between the An genome of B. napus and

Ar genome of B. rapa (chromosomes A6 to A10).. ........................................................................... 72

Figure ‎3.6: Comparative organization of NBS-LRR genes between the Cn genome of B. napus and

Co genome of B. oleracea (chromosomes C1 to C5).. ...................................................................... 73

Figure ‎3.7: Comparative organization of NBS-LRR genes between the Cn genome of B. napus and

Co genome of B. oleracea (chromosomes C6 to C9).. ...................................................................... 74

Figure ‎3.8: Comparative chromosomal map of NBS-LRR genes on chromosomes A7 and C6

between B. napus (Bn), B. rapa (Br) and B. oleracea (Bo). .............................................................. 77

xv

Figure ‎3.9: A phylogenetic tree of TNL and CNL genes located on chromosomes A7 and C6 in B.

napus and the diploid species B. rapa and B. oleracea based on a neighbour-joining method using

MEGA 6.0 software. .......................................................................................................................... 78

Figure ‎4.1: The location of disease resistance loci QTL in the chromosomal map of B. napus. ..... 98

Figure ‎4.2: The correlation between the number of NBS-LRR genes within the 2Mbp flanking

regions of QTL and the number of NBS-LRR genes involved in gene clusters .............................. 103

Figure ‎4.3: The correlation between the number of NBS-LRR genes within the 2Mbp flanking

regions of QTL and the number of duplicated NBS-LRR genes. .................................................... 104

Figure ‎5.1: Distribution of NBS-LRR and LRR-containing genes on chromosome A7 of B. napus.

.......................................................................................................................................................... 121

Figure ‎5.2: Comparative chromosomal map of NBS-LRR genes on chromosomes A7 and C6

between B. napus (Bn), B. rapa (Br) and B. oleracea (Bo). ............................................................ 125

Figure ‎5.3: Comparative chromosomal map of LRR-containing genes on chromosomes A7 and C6

between B. napus (Bn), B. rapa (Br) and B. oleracea (Bo). s ......................................................... 126

Figure ‎5.4: Mapping based identification of the Rlm9 candidate gene using marker physical

mapping, NBS LRR genome wide identification and SNP haplotypes.. ......................................... 131

Figure ‎5.5: Comparison of deduced amino acid sequences of BnDRG 13 in B. napus Darmor A7 to

its homologous genes in B. napus Tapidor and B. rapa. ................................................................. 137

Figure ‎5.6: Comparison of deduced amino acid sequences of BnDRG 14 in B. napus Darmor A7 to

its homologous genes in B. napus Tapidor A7 and B. rapa............................................................. 138

Figure ‎5.7: Comparison of deduced amino acid sequences of BnDRG 17 in B. napus Darmor A7 to

its homologous genes in B. napus Darmor C6 and B. rapa. ............................................................ 139

Figure ‎5.8: Comparison of deduced amino acid sequences of BnDRG 19 in B. napus Darmor A7 to

its homologous genes in B. napus Darmor C6, B. napus Tapidor A7 and B. rapa. ........................ 140

Figure ‎5.9: Comparison of deduced amino acid sequences of BnDRG 20 in B. napus Darmor A7 to

its homologous genes in B. napus Darmor C6 and B. rapa. ............................................................ 142

Figure ‎5.10: Comparison of deduced amino acid sequences of BnDRG 23 in B. napus Darmor A7

to its homologous genes in B. napus Tapidor A7. ........................................................................... 143

Figure ‎5.11: Assembly of B. napus Skipton and Ag-Spectrum sequence short reads to B. napus

Darmor reference genome within the Rlm9 candidate resistance genes: BnDRG 12, BnDRG 13 ,

BnDRG 18 and BnDRG 19............................................................................................................... 144

xvi

List of Tables

Table 1.1: Resistance genes to L. maculans that have been identified in Brassica species based on

genetics and mapping studies. ............................................................................................................ 15

Table ‎2.1: The total number of predicted NBS-LRR genes identified in the B. napus genome. The

comparison between methods used: MEME / MAST and CNL and TNL BLAST .......................... 37

Table ‎2.2: NBS-LRR specific motifs sequence identified with MEME/MAST. .............................. 38

Table ‎2.3: The distribution and organisation of candidate NBS-LRR genes in B. napus ................. 41

Table ‎2.4: The number of NBS-LRR genes classified into different classes and their subfamilies

and domain compositions................................................................................................................... 44

Table ‎2.5: Conserved and consensus sequence of NBS domain motifs found in the NBS-LRR genes

identified in B. napus ......................................................................................................................... 46

Table ‎2.6: The correlation between B. napus NBS-LRR intra-genomic and inter-genomic

duplications and conserved blocks found between B. napus and Arabidopsis .................................. 52

Table ‎3.1: The total number of predicted NBS-LRR genes identified in the three Brassica genomes.

The table shows comparison between the methods used: MEME / MAST and CNL and TNL

BLAST ............................................................................................................................................... 65

Table ‎3.2: The number of candidate NBS-LRR genes in B. napus. The total numbers of genes in

each chromosome with their classification into TNL or CNL as well as the reciprocal best BLAST

results are presented ........................................................................................................................... 66

Table ‎3.3: The number of candidate NBS-LRR genes in B. rapa and B. oleracea. The total numbers

of genes in each chromosome with their classification into TNL or CNL as well as the reciprocal

best BLAST results are presented. ..................................................................................................... 67

Table 3.4: Comparative analysis of synteny of NBS-LRR genes in B. napus to its progenitor species

............................................................................................................................................................ 68

Table ‎3.5: Comparative analysis of the number of NBS-LRR genes in the three Brassica species

and the reciprocal best BLAST hits ................................................................................................... 75

Table ‎4.1: Marker homology search of clubroot resistance QTL flanking or linked markers for

identification of QTL intervals in B. napus ....................................................................................... 92

Table ‎4.2: Marker homology search of downy mildew resistance QTL flanking or linked markers

for identification of QTL intervals in B. napus .................................................................................. 94

Table ‎4.3: Marker homology search of blackleg resistance QTL flanking or linked markers for

identification of QTL intervals in B. napus ....................................................................................... 96

Table ‎4.4: Correlation between the number of NBS-LRR resistance genes and disease resistance

QTL intervals in chromosomes of B. napus .................................................................................... 100

Table ‎5.1: List of B. napus cultivars genotyped with the SNP genotyping assay........................... 115

Table ‎5.2: List of the B. napus cultivars and their Rlm complement used for the PCR amplification

.......................................................................................................................................................... 117

Table ‎5.3: List of primer pairs used for long range PCR amplification of Rlm9 candidate genes . 117

Table ‎5.4: List of characterized NBS-LRR and LRR-containing genes on chromosome A7 of B.

napus cv. Darmor. Candidate Rlm9 genes located within the Rlm9 QTL are highlighted .............. 120

Table ‎5.5: Comparative analysis of the number of NBS-LRR and LRR-containing genes on

chromosomes A7 and C6 of the three Brassica species: B. napus, B. rapa and B. oleracea .......... 122

Table ‎5.6: The potential SNPs on chromosome A7 with a good correlation to the Rlm9 gene and

association with disease and susceptibility in B. napus cultivars. ................................................... 128

Table ‎5.7: Mapping and annotation of the potential SNPs identified in B. napus cv. Darmor ....... 129

xvii

Table ‎5.8: Comparative analysis of NBS-LRR Rlm9 candidate genes between B. napus cv. Darmor,

B. napus cv. Tapidor and B. rapa. The sequence identity is based on protein sequence alignment 135

Table ‎5.9: The coverage of Skipton and Ag-Spectrum sequence short reads of candidate Rlm9

genes on B. napus. The identity is based on gene sequence alignment ........................................... 136

xviii

List of Abbreviations

Avr gene Avirulence gene

AvrLm Avirulence to Leptosphaeria maculans

BnDRG Brassica napus Disease Resistance Gene

CC Coiled-Coil domain

CNL CC-NBS-LRR

CNV Copy Number Variation

LRR-RLKs Leucine Rich Repeat -Receptor-Like Kinase

LRR-RLPs Leucine Rich Repeat -Receptor-Like Proteins

MAST Motif Alignment Search Tool

Mbp Millions Base Pairs

MEME Multiple Em for Motif Elicitation

NBS-LRR Nucleotide Binding Site- Leucine Rich Repeat protein

NGS Next-Generation Sequencing

QTL Quantitative Trait Loci

R gene Resistance gene

RBB Reciprocal Best BLAST

RGA Resistance Gene Analogues

Rlm Resistance to Leptosphaeria maculans

SCAR Sequence-Characterized Amplified Region

SNP Single Nucleotide Polymorphism

SSR Simple Sequence Repeat Marker

SV Structural Variation

TIR Toll/ Interleukin-1 Receptor domain

TNL TIR-NBS-LRR

WGD Whole Genome Duplication

1

Chapter 1

Literature Review

2

Chapter 1 : Literature review

1.1 Introduction

Brassica napus (canola/rapeseed/oilseed rape) is an important commercial oilseed crop in Australia

with an annual production of approximately 3 million tons of oil (AOF, 2014). Canola is an

important source of edible vegetable oil and has a broad range of industrial purposes. However,

blackleg disease (stem canker) caused by the fungal pathogen Leptosphaeria maculans, is one of

the most devastating diseases in B. napus. This disease causes significant yield losses with an

annual average losses of 15 to 48% although losses can reach up to 80% worldwide (Marcroft and

Bluett, 2008). An effective strategy to control blackleg disease is to develop resistant B. napus

varieties via interspecific hybridization or genetic transformation. This can be facilitated by the

identification of blackleg resistance genes in B. napus and application of these genes in the breeding

programs. The recent completion of the B. napus, B. rapa and B. oleracea genome sequences

provides invaluable genomics resources to accelerate this research. Furthermore, it allows the study

of the conservation of these genes between Brassica species. This literature review provides insight

about the genome wide NBS-LRR comparison and scope for identification of candidate resistance

genes associated with disease resistance QTL in Brassica. It also highlights significant progress that

has recently been made in mapping and identification of resistance genes in B. napus against L.

maculans.

1.2 Brassica napus and related species

Brassica is one of the most important genera in the Brassicaceae family due to its agricultural and

economic importance. Six of the important Brassica species are: B. juncea, B. napus, B. nigra, B.

oleracea, B. rapa and B. carinata. The genetic relationships among these different diploid and

amphidiploid Brassica species are described according to their genome composition (termed “A” ,

“B” and “C”) by U’s triangle (Figure 1.1) (U, 1935).

3

Figure ‎1.1: The genomic relationships between the six cultivated Brassica species indicating the

genome composition (U, 1935).

Brassicas are important sources of fresh and preserved vegetables and condiments. For instance, B.

oleracea and B. rapa represent many of the vegetables in our daily diet such as cauliflower,

broccoli and cabbage (all B. oleracea), Chinese cabbage (B. rapa) and various mustards (B. nigra)

(Paterson et al., 2001).

B. napus originated from interspecific hybridisation between the diploids B. rapa (A genome) and

B. oleracea (C genome) resulting in an amphidiploid genome (AACC, n=19). B. napus is cultivated

worldwide as the most productive Brassica oilseed species and is grown primarily for its seed

which yield approximately 40% oil making it the second important oilseed crop in the world after

soybean (Snowdon et al., 2007). Total oilseed production in 2014/15 was estimated to be 530

million metric tons, with rapeseed contributing 71.8 million metric tons (USDA, 2014). Canola oil

is used for cooking, in margarine and other edible products in the food processing industry as

healthy edible oil.

1.3 Brassica genome sequencing: an overview

Sequencing of Brassica genomes has been the main focus of Brassica researchers worldwide

because Brassicas are an excellent model species for studying the evolution of polyploid crop

plants. Brassica species contain many triplicated genomic segments and subsequent rearrangements

4

such as inversions, insertions, deletions, and substitutions (Jiang et al., 2011). Genome sequencing

projects for Brassica species, especially B. rapa and B. oleracea, were initiated after the

establishment of the multinational Brassica genome project in 2002 (http://www.brassica.info)

(Hong et al., 2008, Wang et al., 2011).

The rapid development of second-generation sequencing technology and advanced bioinformatics

programs has resulted in the completion of genome sequences of B. napus (Chalhoub et al., 2014),

B. rapa (Wang et al., 2011) and B. oleracea (Liu et al., 2014, Parkin et al., 2014). These Brassica

genome projects also aimed to produce reference genomes for genetic and genomic studies

(Hayward et al., 2011). Illumina whole genome sequencing has been adopted for Brassica species

resulting in the generation of large datasets of short read sequence information for different B.

napus cultivars (Imelfort and Edwards, 2009). Illumina sequencing technology (HiSeq2000)

provides high throughput sequence information generating hundreds of millions of paired short read

sequences (100 - 150 bp) in a single run. If there is a reference genome available, these short reads

can be physically mapped to the reference genome and used to search for sequence variations in

candidate genes underlying QTL regions. Bioinformatics tools such as genome browsers provide a

convenient way to visualise the short reads against a reference genome to identify sequence

coverage, SNPs, coding regions and other annotations (Batley and Edwards, 2009).

1.4 Identifying and characterizing the structure of NBS-LRR genes in Brassica

Plants have developed sophisticated defence mechanisms to recognise and respond to a wide range

of pathogens. Resistance (R) genes are involved in direct or indirect interaction with avirulence

(Avr) genes in order to trigger a defence response. Thus, understanding the molecular structure and

function of R genes has been crucial for plant resistance research. Over the last few decades, a vast

number of R genes have been cloned from a wide range of plant species. These R genes are diverse

in terms of their structure, function and evolution, but they can be grouped into five different

classes, based on structural similarities of their predicted protein products (Staskawicz et al., 1995,

Liu et al., 2007). The largest class includes proteins with putative nucleotide binding site (NBS) and

leucine-rich repeats (LRR). These NBS-LRR genes can be subdivided into two distinct types based

on the structure of their N-terminal domain either a coiled-coil (CC) motif or a Toll/ Interleukin-1

Receptor (TIR) domain (Figure 1.2) (Meyers et al., 2003). The NBS region is thought to be

important for ATP binding activity and comprises of a P-loop, Kinase 2a, Kinase 3 and GLPL

motifs. The LRRs may be the main determinant in recognition specificity of the avirulence gene

product and as components of a signal transduction pathway (Ellis et al., 2000b).

5

Figure ‎1.2: The domain structure of NBS-LRR proteins in plants. Taken from (McHale et al.,

2006).

Genome wide analysis of NBS-LRR genes in many plant species including rice (Monosi et al.,

2004) sorghum (Paterson et al., 2009), Arabidopsis (Meyers et al., 2003, Tan et al., 2007) and

papaya (Porter et al., 2009) showed that they are widely distributed throughout the genome with

approximately 0.6–1.8% of genes encoding NBS-LRRs at a density of 0.3–1.6 genes per megabase.

A study of NBS-LRR genes in the B. rapa genome (550Mbp) predicted a lower number of NBS-

encoding genes than in other sequenced crops (Mun et al., 2009). However, at the time of the study

the B. rapa genome had not been fully sequenced. Now this is completed (Wang et al., 2011), a

comprehensive assessment of NBS-LRR genes can be undertaken. Theoretically, the number of

NBS-LRR genes in the B. rapa genome should be higher than in Arabidopsis because the B. rapa

genome has undergone genome triplication since divergence from the common ancestor of

Arabidopsis. Another significant finding was that almost 50% of NBS family members were

detected as tandem arrays within homogenous clusters suggesting tandem duplication in

combination with polyploidy played an important role in the expansion of NBS-LRR genes in the

Brassica genome (Fourmann et al., 2001, Vicente and King, 2001, Mun et al., 2009). Overall, B.

rapa can be used as a source to identify NBS-LRR genes for common pathogens of Brassica crops

(Mun et al., 2009).

A more recent study (Yu et al., 2014) has identified 206, 176 and 157 NBS-LRR genes in B. rapa,

B. oleracea and Arabidopsis, respectively. They found that the number of NBS-LRR genes in these

three species was very close despite the genome size and complexity. This is surprising as B. rapa

and B. oleracea genomes would be expected to contain higher numbers of NBS-LRR genes than in

6

Arabidopsis because the B. rapa and B. oleracea genomes have undergone extensive genome

triplication events.

Resistance genes can occur as a single gene in allelic series or as tightly linked genes (within a gene

cluster) conferring different resistance specificities (Leister, 2004). Thus, race-specific resistance

might have evolved and be present in clusters resulting from tandem duplications of paralogous

sequences (Meyers et al., 2005, McDowell and Simon, 2006, Kaur et al., 2009). Indeed, previous

studies revealed that R genes are often present as tightly linked genes with high homology and are

prone to gene duplication and recombination, and thus evolve more rapidly than the rest of genome

(Grant et al., 1998). For this reason, functional polymorphism at R loci could maintain multiple

alleles or linked genes with different recognition capabilities that would recognize novel pathogen

variants (Grant et al., 1998).

The molecular basis of the resistance responses to pathogens in plants depends on the specificities

of the interactions between the Avr and R gene. The genetic analysis of plant NBS-LRR genes has

shown that they are highly related in sequence and present as multiple alleles of a simple locus or

several closely related resistance genes. Multiple resistance gene specificities can be encoded by

different allelic variants of resistance genes by which each allele can provide resistance to different

pathogen types (Ellis et al., 2000a). An example of allelic forms at the NBS-LRR locus is RPP

(recognition of peronospora parasitica) in Arabidopsis (McDowell et al., 1998) and several alleles

of the L gene in flax encode different flax rust resistance specificities (Ellis et al., 1999).

1.5 Genetics of disease resistance in Brassica

There are two types of disease resistance in Brassica species: qualitative and quantitative. The exact

type of resistance is controlled by different genetic interactions and determines resistance during

specific stages of plant growth: seedling and adult stage resistance (Ferreira et al., 1995b, Ansan-

Melayah et al., 1998, Pilet et al., 1998).

Seedling resistance is qualitative, single-gene race specific and expressed during the cotyledon

stage (Delourme et al., 2006). This resistance typically depends on the presence of a resistance (R)

gene in the plant and a corresponding avirulence (Avr) gene in the pathogen, where if the Avr gene

does not correspond to the R gene in the plant, the plant is susceptible to disease. This is a very

effective resistance and operates through R gene activity when a pathogen infects the cotyledons of

the seedling, subsequently preventing infection spread to the whole plant (Johnson and Lewis,

1994). By contrast, adult resistance is a partial resistance, known as quantitative resistance, and is

controlled by multiple genes. This is race non-specific resistance. It was suggested that quantitative

7

resistance genes work together in a complex interaction with many pathogen Avr genes. Adult plant

infection is the most damaging in terms of quality and yield losses (Zhu et al., 1993, Delourme et

al., 2008).

In spite of the fact that seedling and adult resistance are under separate genetic control: monogenic

and polygenic, respectively, and they are differentially expressed in the plant (Ferreira et al.,

1995a), the genes involved at both stages might be closely linked in quantitative trait loci (QTL)

(Salisbury et al., 1995, Delourme et al., 2006). For instance, two closely linked but distinct loci

mediating resistance at the seedling and adult plant stage of B. napus breeding lines were mapped to

chromosome A7 within 5-10 cM of each other (Ferreira et al., 1995b).

As with many crops, Brassicas are susceptible to many serious diseases including fungi, bacteria

and viruses which cause a major impact on canola production worldwide. In the three major

cultivated Brassica species, B. napus B. rapa, and B. oleracea, clubroot (caused by

Plasmodiophora brassicae), downy mildew (caused by Hyaloperonospora Parasitica) and blackleg

(caused by Leptosphaeria maculans) diseases have become an increasing problem causing heavy

yield losses to Brassica crops world-wide and have been a major focus in the Brassica disease

resistance research (Delourme et al., 2011).

Clubroot disease occurs worldwide to all cruciferous vegetable and oil crops, including Brassica

species. Different sources of resistance to clubroot have been found in B. oleracea, B. rapa and B.

napus (Hirai, 2006, Suwabe et al., 2006, Diederichsen et al., 2009, Piao et al., 2009, Verma et al.,

2014). These different studies suggested that the interaction between Plasmodiophora brassicae and

Brassica are classified as race-specific resistance. To date, two major resistance genes: Crr1a and

CRa, which confer resistance to clubroot in B. rapa, have been identified and cloned. They both

encode a TIR-NBS-LRR protein (Ueno et al., 2012, Hatakeyama et al., 2013).

The downy mildew disease causes damage to production of Brassica species worldwide, including

Australia. While it causes severe destruction on young seedlings, the disease still causes yield and

quality reduction at adult-plant stages. For instance, in Australia, outbreaks of downy mildew

disease at the early seedling stage have caused losses of canola production (Ge et al., 2008). Several

major qualitative resistance loci to downy mildew have been identified in B. rapa and B. oleracea

(Farinhó et al., 2007, Yu et al., 2009, Kim et al., 2011, Carlier et al., 2012). In addition, multiple

sources of resistance have been reported in B. napus (Nashaat et al., 1997, Ge et al., 2008). They

showed that the resistance is more likely to be controlled by a major resistance gene.

8

1.6 The fungal pathogen Leptosphaeria maculans: Biology and blackleg disease

Leptosphaeria maculans is the causal agent of blackleg disease (also termed stem canker or phoma

disease) of Brassica crops. L. maculans has a broad host range within the Brassicaceae infecting

numerous cruciferous species including Raphanus sativus (radish), R. raphanistrum (wild radish),

Sinapis alba (white mustard) and cultivated Brassica crops; such as B. napus, B. rapa, B. juncea

and B. oleracea (Johnson and Lewis, 1994, Williams and Fitt, 1999). Moreover, the model plant

Arabidopsis, also a member of the Brassicaceae, is a host for L. maculans (Rouxel and Balesdent,

2005).

Blackleg is the most serious fungal disease of Brassicas worldwide, and can lead to drastic losses of

up to 80% of canola production (Fitt et al., 2006). Blackleg disease is primarily of economic

importance in the main canola growing areas of Europe, Australia and North America (West et al.,

2001). For instance, the value of Canola production in Australia is around AUD $500 million

annually (Wang et al., 2009) with an average 15% yield loss occurring due to L. maculans infection.

Infection symptoms occur at various stages of crop development causing cotyledon and leaf lesions

prominent during the vegetative phase of the plant, progressing to canker development in the

susceptible cultivars (Figure 1.3) (Delourme et al., 2008). This latter stage results from the fungus

invading cells in the stem cortex, resulting in a blackened canker; hence the name “blackleg” and

causing lodging of the plants (Guo and Fernando, 2005, Fernando et al., 2007). The severity of

blackleg has increased in recent years due to intensification of cultivation and production, rapid

evolution of the pathogen population and lack of sustainable management strategies. Thus, there is

an urgent need to establish effective management of blackleg.

9

Figure ‎1.3: Blackleg infection symptoms on B. napus caused by L. maculans. (A) blackleg lesions

on leaves (source: Canola Council of Canada) (B) blackleg lesions from multiple infections

(Courtesy C. Bradley, American Phytopathological Society) (C) blackleg stem lesion (source:

Canola Council of Canada) (D) interior of stem infected with blackleg(source: Canola Council of

Canada) (E) interior of stem infected with different levels of blackleg severity (source: Canola

Council of Canada) (F) black root form of blackleg as compared to healthy root (source: DEPI,

Victoria).

L. maculans is a haploid fungus, with a small genome size of about 45 Mbp, predicted to encode

10,000-13,000 genes within 16 chromosomes (Rouxel et al., 2011). The Leptosphaeria Genome

Consortium was established in 2004 by Genoscope (CEA) (http://www.genoscope. cns.fr) and the

genome was completed and published in 2011. The genome sequence has been annotated,

assembled and publically released on NCBI (Howlett, 2004, Rouxel et al., 2011).

1.7 Molecular interaction between Brassica and L. maculans

Traditionally, the gene for gene hypothesis is used to explain genetic interactions between the host

plants and their pathogens. This concept proposes that for each resistance gene in the host there is a

specific gene corresponding to avirulence in the pathogen (Dangl and Jones, 2001). Basically, the

gene for gene interaction suggests direct or indirect recognition of pathogen Avr-encoded effectors

by the protein product encoded by the corresponding R gene. Often R genes encode nucleotide

binding site- Leucine rich repeat (NBS-LRR) proteins leading to the induction of a signalling

cascade and subsequent downstream defence responses (Bent, 1996).

10

Plants operate a defence response against pathogens at two levels: pattern-triggered immunity (PTI)

and effector-triggered immunity (ETI) (Dodds and Rathjen, 2010) (Figure 1.4). PTI involves the

recognition of pathogen associated molecular patterns (PAMPs) and trigger low levels of defence

activation. The pathogen is able to suppress PTI by effector proteins delivered into the plant.

However, PTI results in initiation of ETI. The ETI is gene for gene resistance in which the

interaction is between disease resistance proteins and specific effectors (Avr proteins). This is

mediated largely by nucleotide binding-site-leucine rich repeat (NBS-LRR) proteins. ETI triggers a

stronger resistance response and often causes localized cell death described as a hypersensitive

response (HR) and prevents further infection (Collier and Moffett, 2009, Tsuda and Katagiri, 2010).

Figure ‎1.4: The mode of activation of the plant immune responses by PAMP-triggered immunity (PTI) and

effector-triggered immunity (ETI). Adopted from (Alcázar and Parker, 2011).

In the Brassica–L. maculans pathosystem, a typical gene-for-gene interaction has been reported,

where the outcome of the infection (resistance or susceptibility) depends on the presence of an R

gene in the plant and one corresponding (Avr ) gene in the pathogen (Figure 1.5) (Ansan-Melayah

et al., 1998, Balesdent et al., 2001, Balesdent et al., 2002). A set of differential interactions between

Brassica species and L. maculans were utilized at the seedling stage using a cotyledon inoculation

test (Williams and Delwiche, 1979). This determined the presence of the Avr gene required for race

specific resistance in various Brassica species and cultivars with known R gene complements based

on genetic mapping (Balesdent et al., 2002).

11

Figure ‎1.5: The infection outcomes between Brassica R genes and L. maculans Avr genes

according to the gene for gene interaction. Only when the R gene is corresponding to the Avr gene,

is the plant resistant against the pathogen.

Genetic studies have identified the gene for gene interactions between Avr genes in L. maculans and

their corresponding Rlm (Resistance to Leptosphaeria maculans) genes in B. napus. For example,

Rlm1/AvrLm1 and Rlm2/ AvrLm2 interactions were found in the B. napus cultivars, Quinta and

Glacier, respectively (Ansan-Melayah et al., 1998).

Effector-triggered immunity (ETI), a pathogen race-specific resistance, is a well-studied example of

host-resistance and has been observed in most plant species, including B. napus (Jones & Dangl,

2006; Poland et al., 2009). The existence of ETI between B. napus and L. maculans has been

described between the Rlm1 and LepR 3 genes and AvrLm1 gene (Raman et al., 2012a, Larkan et

al., 2013) and between the Rlm 2 gene and AvrLm2 gene (Larkan et al., 2014).

Genetic mapping of avirulence genes revealed four unlinked genomic regions of L. maculans

associated with host specificity: the AvrLm1–AvrLm2–AvrLm6 cluster, the AvrLm3–AvrLm4–

AvrLm7 cluster, AvrLm5, AvrLm8 and AvrLm11 (Balesdent et al., 2002, Fernando et al., 2007,

Balesdent et al., 2013). This cluster of Avr genes in L. maculans has been described as the first

example of Avr gene clustering in fungi. It was suggested that there might be a relationship between

Avr gene clustering in L. maculans and a counterpart clustering of R genes in B. napus (Delourme et

al., 2004).

12

To date, several avirulence genes in L. maculans have been cloned: AvrLm1 (Gout et al., 2006),

AvrLm2 (Ghanbarnia et al., 2015), AvrLm6 (Fudal et al., 2007) and AvrLm4-7 (Parlange et al.,

2009) confers avirulence towards two different resistance genes, Rlm4 and Rlm7. AvrLm11

(Balesdent et al., 2013) and AvrLmJ1 (Van de Wouw et al., 2014).

The availability of genome sequences has led to the identification of candidate resistance genes and

provides the opportunity to validate the interaction between R genes and Avr genes using sequence

information, genetic transformation, phenotyping and gene expression analysis. However,

Agrobacterium -mediated transient expression (Agroinfiltration) is an alternative and quick

approach for validating a candidate R gene (Vaghchhipawala et al., 2011, Ma et al., 2012). It is a

rapid and simple technique to study gene function and host-pathogen interaction and protein-protein

interaction (Bendahmane et al., 2000, Wroblewski et al., 2005, Dodds et al., 2006). It is based on

infiltration of Agrobacterium tumefaciens cultures into intact plant leaves (tobacco) to co-express a

candidate R gene with its corresponding Avr gene. This triggers a defence response that results in

localized cell death at the site of infection termed a hypersensitive response (HR) indicating that

these are functional R-Avr gene interaction. In this study, the validation can be done if both R and

Avr gene sequences are known. For instance, candidate blackleg resistance genes such as Rlm4 can

be validated due to the presence of the cloned AvrLm4 gene. Whereas, candidate Rlm9 genes cannot

be validated as AvrLm9 has not been cloned or sequenced.

1.8 Genetic mapping of blackleg resistance genes in Brassica species

All the blackleg resistance genes identified to date have been found on the A genome of B. napus

with none found on the C genome (Ananga et al., 2006, Leflon et al., 2007). This is supported by

the absence of Rlm1, Rlm2 or Rlm4 in the diploid C genome species, B. oleracea when screened

against L. maculans isolates harbouring AvrLm1, AvrLm2 and AvrLm4 alleles (Rouxel et al.,

2003b).

To date, ten race specific Rlm genes have been genetically mapped and identified in B. napus. By

screening B. napus cultivars with a differential set of L. maculans isolates, Rlm1 was identified in

the cultivar Quinta (Ansan-Melayah et al., 1998), LEM1, LmR1 and cRLMm, are present in the

cultivars Major, Shiralee and Maluka, respectively, Rlm2 and Rlm3 in the cultivar Glacier

(Balesdent et al., 2002), Rlm4 in the cultivars Jet Neuf (Balesdent et al., 2001) and Skipton (Raman

et al., 2012), Rlm7 in non-commercial lines (Balesdent et al., 2002), and Rlm9 in the cultivar

Darmor (Delourme et al., 2004).

13

Genetic mapping studies have showed that all resistance genes in B. napus are organised in clusters

in two genomic regions: the Rlm2 resistance gene on chromosome A10 is associated with adult

resistance (Pilet et al., 2001) and five specific resistance genes: Rlm1, Rlm3, Rlm4, Rlm7, and Rlm9

are on chromosome A7 (Delourme et al., 2004) (Table 1.1).

Rlm1 was suggested to be distinct from Rlm3 because they both occur in one cultivar and they

genetically map to different positions. In addition, Rlm1 and Rlm4 are linked, not allelic and can be

present in the same cultivar. Rlm3 and Rlm4 are found in many rapeseed cultivars and are rarely

present together in a single cultivar (Delourme et al., 2006). Indeed, the existence of Rlm1, Rlm3,

Rlm4, Rlm7 and Rlm9 as a cluster of tightly linked genes has never been observed. Thus, it is

impossible to judge if these genes are a cluster of tightly linked genes, or a single gene with

different alleles, or a combination of both (Delourme et al., 2004).

Rlm4 provides effective resistance against isolates with the avirulence gene AvrLm4 (Rouxel et al.,

2003b, Raman et al., 2012b) and is present in the French cultivars Major and Jet Neuf, as well as

Australian cultivars including Maluka, Dunkeld and Skipton. However, the cultivar Westar is

highly susceptible to blackleg (lack of any Rlm4 genes) and shows a reciprocal translocation break

point close to the Rlm4 locus in A7, making fine mapping extremely difficult (Rimmer, 2006).

Interestingly, an AvrLm4-7 locus in L. maculans has dual interaction specificity to resistance genes

Rlm4 and Rlm7 in B. napus, yet it is unclear if Rlm4 and Rlm7 represent allelic forms of the same

resistance gene. It was suggested that deletion of AvrLm4-7 may be the major event leading to

acquisition of virulence towards Rlm7 resistance whilst mutation of a single nucleotide is sufficient

to avoid Rlm4- mediated recognition of AvrLm4-7 (Parlange et al., 2009).

The genes LEM1, LmR1, cRLMm and cRLMrb present in different B. napus cultivars were also

mapped onto chromosome A7 (Table 1.1) (Mayerhofer et al., 1997, Rimmer, 2006). LEM1 was

found to confer seedling resistance to an isolate containing Avr1-2-4-7 (Ferreira et al., 1995a) and to

be located within an extensive intra-chromosomal tandem duplication region (Mayerhofer et al.,

2005). However, LmR1, cRLMm and cRLMrb are likely to be identical to Rlm4, and are present in

the cultivars Major and Maluka (Rouxel et al., 2003b).

Four dominant race specific resistance genes were also identified in the A genome of B. rapa: Rlm8

(Balesdent et al., 2002), LepR1 a single dominant gene mapped on chromosome A2, LepR2 which

maps to chromosome A10 (Yu et al., 2005) and LepR3 mapped to chromosome A10, 15–20 cM

from LepR2 (Li and Cowling, 2003). It is also proposed that LepR3 is not identical to LepR2 (Yu et

al., 2008) and LepR3 possibly corresponds to Rlm1 (Van De Wouw et al., 2009). Another two

14

resistance genes BLMR1 and BLMR2 have been identified. The former may provide cotyledonary

stage resistance whereas the latter has a role in preventing infection development (Long et al.,

2011). Interestingly, LepR3 was located between BLMR1 and BLMR2 and represents the resistance

combination of BLMR1 and BLMR2 (Table 1.1) (Yu et al., 2008, Long et al., 2011). LepR3 was

first blackleg resistance gene in Brassica to be cloned in B. napus lines with introgressions from B.

rapa ssp. Sylvestris (Larkan et al., 2013). It belongs to a receptor-like protein (RLP) family.

The LepR4 resistance gene was mapped to chromosome A6 and two different alleles LepR4a and

LepR4b were mapped to the same position. LepR4 was found as a recessive gene conferring a wide

spectrum of resistance to the fungus isolates (Yu et al., 2012). In addition, the B. rapa genome

might harbour genes previously identified in B. napus such as Rlm1, Rlm2, Rlm4 and Rlm7 as

previous studies have (Leflon et al., 2007) suggested that these genes are orthologous and are

located in the same position in B. rapa and B. napus (Rana et al., 2004, Delourme et al., 2006).

The B genome Brassica species B. nigra contains two resistance genes: Lm1 and Rlm10 (Table 1.1)

(Chèvre et al., 1996, Wretblad et al., 2003). Plants containing Lm1 have improved resistance to L.

maculans in the cotyledon and adult stage when it is overexpressed in B. napus (Wretblad et al.,

2003). Significantly, the Lm1 gene belongs to the serine/theronine kinase family (Howlett, 2004). It

is expected that B. nigra carries a number of different resistance genes in the B genome (Zhu et al.,

1993, Chèvre et al., 1996), which are yet to be reported. Zhu et al. (1993) reported that the

resistance in B. nigra is polygenic, controlled by a small number of genes located on chromosome

B3. The Rlm10 gene introgression location into B. napus is thought to be on A7 (Chèvre et al.,

1997, Dixelius, 1999).

In the amphidiploid AABB species B. juncea, resistance to L. maculans was suggested to be

mediated by two genes called Rlm5 and Rlm6 (Table 1.1) (Rimmer and van den Berg, 1992, Keri et

al., 1997, Balesdent et al., 2002, Leflon et al., 2007). Additionally, the lm2 gene identified in B.

napus hybrids, derived from B. juncea, was highly effective against a wide range of L. maculans

isolates at the cotyledon stage (Saal et al., 2004). Sequence-characterized amplified regions (SCAR)

markers were developed based on amplification of resistance gene analogues (RGA) in which they

are isolated by PCR with degenerate primers designed from the highly conserved motifs of

resistance genes. The SCAR marker was strongly associated to the resistance locus lm2 in B. napus,

B. rapa and B. oleracea. Sequence analysis of this gene revealed significant homology to two

putative R genes on a resistance gene cluster on chromosome 5 of Arabidopsis (Saal and Struss,

2005).

15

Table 1.1: Resistance genes to L. maculans that have been identified in Brassica species based on

genetics and mapping studies.

Species Resistance gene

Corresponding avirulence gene

Gene location in B. napus

Comments

B. napus

LEM1 Avr1-2-4-7 A7 May be identical to Rlm4

LmR1 AvrLm4 A7 May be identical to Rlm4

cRLMm AvrLm4 A7 May be identical to Rlm4

cRLMb AvrLm4 A7 May be identical to Rlm4

Rlm1 Avrlm1 A7 Distinct from Rlm3 and Rlm4

Rlm2 Avrlm2 A10 Associated with adult resistance

Rlm3 Avrlm3 A7 Linked , not allelic to Rlm1.

Rlm4 AvrLm4 A7 Linked, not allelic to Rlm1. rarely present together with Rlm3

Rlm7 AvrLm7 A7 This might be allelic variant of Rlm4

Rlm9 AvrLm9 A7 Rlm3, Rlm4, Rlm7 and Rlm9 as cluster of tightly linked genes

B. rapa

Rlm8 AvrLm8 -- --

LepR1 -- A2 Introgressed into B. napus from B. rapa

LepR2 -- A10 Introgressed into B. napus from B. rapa

LepR3 -- A10

Mapped close to LepR2, not identical to LepR2 LepR3 is possibly corresponding to Rlm1 Introgressed into B. napus from B. rapa

LepR4 A6 A recessive gene, two alleles

BLMR1 -- A10 LepR3 was located in between BLMR1 and BLMR2

BLMR2 -- A10 --

B. nigra Lm1 -- A3

Cotyledon and adult resistance

Rlm10 -- A7 --

B. juncea

Rlm5 AvrLm5 A8 --

Rlm6 AvrLm6 A8 May be identical to r j lm2

r j lm2 -- --

Identified in B. napus and B. juncea hybrids effective to different L. maculans isolates at cotyledon stage

16

Arabidopsis was recently found to be resistant to a wide range of L. maculans isolates. It is believed

that a dominant resistance (R) gene encoding a nucleotide binding site - leucine-rich repeats (NBS-

LRR) is required to confer resistance. To date, three genes have been identified as Arabidopsis

thaliana Resistance to Leptosphaeria maculans named AtRlm1, AtRlm2 and AtRlm3 (Staal et al.,

2006, Staal et al., 2008). Two TIR-NBS- LRR genes: AtRlm1 and AtRlm2 are present in the

different genotypes Col and Ler and are mapped on chromosomes 1 and 4, respectively. AtRLM1

was found to share a similarity to a homologous region on chromosome 4 that corresponds to

AtRLM2 indicating that they are closely related and may have evolved through gene duplication

(Staal et al., 2006). AtRLM3 encodes an NBS-LRR and is another candidate that appears to act as a

critical downstream component of AtRlm1 (Meyers et al., 2003, Staal et al., 2006, Staal and

Dixelius, 2008, Staal et al., 2008). Since it has been shown that blackleg resistance genes in B.

napus (LepR1, LmR1, CLmR1, Rlm1, Rlm3, Rlm7 and Rlm9) are mapped to genomic loci that

correspond to the chromosome segment on Arabidopsis chromosome 1 harbouring AtRlm1

(Mayerhofer et al., 2005), it could be hypothesized that L. maculans resistance genes are related to

the NBS-LRR protein family. Also of importance is to study the level of sequence conservation in

blackleg resistance genes in Brassica species and Arabidopsis since little is known about sequence

information of resistance genes identified so far.

According to the most recent study (Raman et al., 2012b), simple sequence repeat (SSR) markers

have been linked to a Rlm4 blackleg resistance QTL region in a segregating population derived

from the B. napus cultivars: Skipton and Ag-Spectrum. This region has been physically mapped to

scaffold 3 on chromosome A7 of the sequenced B. rapa genome. Therefore, it could be

hypothesized Rlm4 and other resistance genes mapped to B. napus chromosome A7 are present on

intervals homologous to the B. rapa chromosome A7 regions with considerable sequence variations

due to gene duplication and recombination events (Figure 1.6).

Taken together, although many studies have identified numerous blackleg disease resistance QTL

and qualitative genes, it is impossible to compare or even use homology searches between these

studies because of different marker systems being used, along with differences in the cultivars used,

blackleg isolates and inoculum concentrations present at each location. Thus, the use of map-based

cloning to identify a gene underlying the QTL region is relatively difficult through map based

cloning because the region is still large and often not tightly linked to the resistance gene(s). This is

often further complicated by the complexity of the genomic region due to duplication and

rearrangement events. As a consequence, it is difficult to determine if these genes are truly different

genes or allelic variants.

17

Figure ‎1.6: Integration the partial genetic map of Rlm4 QTL on chromosome A7 of B. napus

against the physical map of B. rapa. This is by using 12 linked SSR markers (Raman et al., 2012b).

1.9 Introgression of blackleg resistance genes into the B. napus genome: attempts and

challenges

Attempts have been made to transfer blackleg resistance genes in B. napus breeding programs

through interspecific hybridization (Rimmer and van den Berg, 1992, Chèvre et al., 1996). A

resistance gene from B. juncea (later described as Rlm6 by (Chèvre et al., 1997, Chèvre et al., 2008)

as well as the gene, rjLM2 (Saal et al., 2004) were introduced into B. napus. The resistance was

effective at the seedling stage in spring and winter type genetic backgrounds of B. napus against

various L. maculans isolates. However, subsequent transfer from the B genome has been

unsuccessful, presumably due to a low level of homology between the two genomes or lack of

introgression of the B genome genes into B. napus (Salisbury et al., 1995).

The cultivar Surpass 400 was developed as a blackleg resistant cultivar by crossing B. napus and B.

rapa (Crouch et al., 1994). The resistance obtained was assumed to be as a result of introgresion of

the genes LepR1, LepR2 and LepR3 from B. rapa (Yu et al., 2005, Yu et al., 2008). Li and Cowling

(2003) reported a complete resistance to blackleg in ‘Surpass 400’ when tested against Australian

isolates at the time of its release in 2001. However, there are evidences of resistance breakdown in

blackleg resistant varieties in France (Rouxel et al., 2003b, Brun et al., 2010) and Australia

18

(Sprague et al., 2006). The breakdown of the Rlm1 resistance in France occurred within three years

of release due to L. maculans population changes from avirulence (AvrLm1) to virulence

(avrLm1)(Rouxel et al., 2011). In 2001, the cultivar surpass 400 was released commercially and

showed a highly resistant response to blackleg in the field. However, this resistance become

similarly ineffective after only three years due to a rapid increase in the local pathogen populations

of isolates virulent on LepR3 (Li and Cowling, 2003, Rouxel et al., 2003b). There is also evidence

of sequential breakdown of major resistances genes Rlm9, Rlm2 and Rlm4 in France, as well as

Rlm4 in Australia after widespread use of cultivars containing these genes (Rouxel and Balesdent,

2005).

Breeding for B. napus blackleg resistance has faced challenges with respect to the stability and

durability of resistance genes. The loss of resistance stability and rapid breakdown of race specific

genes is most likely associated with a major shift in populations of L. maculans towards altered Avr

composition (Balesdent et al., 2001, Fitt et al., 2006, Sprague et al., 2006).

Crop rotation, of cultivars containing different resistance genes, could be a suitable approach to

maximize the durability and stability of the resistance and minimize the selection pressure on

avirulence isolates to overcome the resistance genes. The rotation is aimed to decrease the incidence

of blackleg through reduced exposure to inoculum and not imposing a selection pressure on the

fungus to rapidly evolve towards virulence.

The breakdown of resistance seen in France in 2000 (Rouxel et al., 2003a) and Australia in 2003

(Sprague et al., 2006) was a result of rapid change of the frequency of virulence alleles of

avirulence genes. The frequency of virulence alleles of avirulence genes changed significantly

depending on the resistance gene present in the cultivar. Hence, rotation provides a way to reduce

the change of the frequency of virulent isolates towards a particular resistance gene upon exposure

to difference complements of resistance genes in the canola cultivars. For example, when a cultivar

with Rlm1 is grown, the frequency of virulence towards Rlm1 will be selected for and increase in

number. However, if a cultivar with Rlm4 is grown the following year, the frequency of virulence

will be shifted towards Rlm4. In this case, rotation of cultivars with different resistance genes

should reduce the selection pressure on fungus isolate to evolve (Marcroft et al., 2012b).

It is recommended that the cultivars containing a different combination of resistance genes be

rotated every four years, in order to minimize selection pressure on the fungus and increase

durability of cultivars (Marcroft et al., 2004). However, it was found that not all resistance genes

can be rotated with each other. For instance, cultivars with Rlm1 resistance should not be rotated

19

with cultivars with Rlm6 resistance (Marcroft et al., 2012b). This is because the selection towards

AvrLm1 in the pathogen also results in increase the frequency of virulence of AvrLm6 as these two

genes are tightly linked in the pathogen genome.

Thus, identification of these resistance genes and understanding of the genetic interaction between

resistance gene and avirulence genes is required for development of specific resistance gene

rotation.

1.10 Genomic approaches for identification of candidate resistance genes

Candidate resistance genes are putatively genes that are essential components of plant immune

systems and can be broadly classified as those potential genes involved in the initial recognition or

interaction with pathogen avirulence gene to trigger a defence response. Quantitative trait locus

(QTL) mapping is frequently used to identify genomic regions associated with a phenotypic trait of

interest. The association of candidate resistance genes or major resistance genes and QTL has been

widely noted (Gebhardt and Valkonen, 2001, Quint et al., 2003, Wisser et al., 2005). The nucleotide

binding site–leucine rich repeat (NBS-LRR) class of R genes has been found to be potential

candidate resistance genes co-localized with disease resistance QTL such as in barley, potato and

apple (Ramalingam et al., 2003, Calenge et al., 2005, Bakker et al., 2011). However, although

numerous disease resistance QTL have been identified, isolating and cloning a gene underlying the

QTL/gene mapped region is relatively difficult through map based cloning because the region is

still large and contains hundreds of genes that often not tightly linked to the resistance gene(s). This

is often further complicated by the complexity of the genomic region due to duplication and

rearrangement events.

An alternative approach, the candidate genes approach was exploited in this project to analyse the

disease resistance QTL using a homology search. One of the major steps in the candidate gene

approach is the physical identification and characterization of QTL region using genomics analysis.

The physical gene mapping determines the gene location and its relative distance to the QTL region

and other adjacent genes on the chromosome. This allows narrowing and prioritizing the number of

associated candidate resistance genes for further sequencing and functional studies. Thus, candidate

resistance gene identification becomes much faster and less expensive for rapid cloning and

sequencing of the gene/s responsible for plant-pathogen interactions.

20

1.10.1 Comparative analysis based candidate gene identification

Genome comparative analysis is a powerful tool for genetic studies to investigate the resistance

gene organisation, conservation and evolution of related species. It allows the study of conserved

genomic segments with gene content, synteny, collinearity and conservation between related

species at the micro-level in selected chromosomal regions such as QTL region or at the macro-

level in the whole genome. Determining orthologous and syntenic genes within genomic regions

among closely related species is important to identify structural or functional similarities/

differences. This also enables the confirmation of potential inversion, insertion, gene

presence/absence and gene sequence homology (Lyons et al., 2008, Tang et al., 2008).

The availability of a reference genome sequence allows the identification of position of the QTL in

the chromosome and prediction of the full complement of genes that encode the resistance gene.

When a QTL of the species under study is located in a syntenic region on its related species, the

resistance genes are likely to be found in a syntenic relationship and show high or divergent

sequence conservation (Paterson et al., 2000, Cheng et al., 2012a).

Several studies have shown aspects of genome evolution and organization between Brassica

genomes and the genome of A. thaliana (Parkin et al., 2005, Cheung et al., 2009, Trick et al., 2009).

The close relationship of the genus Brassica to A. thaliana was exploited in identifying the

candidate gene for a major flowering time QTL in B. nigra (Osterberg et al., 2002) and seed weight

in Brassica napus (Cai et al., 2012). In addition, this strategy has been used to identify blackleg

resistance genes in B. napus (Mayerhofer et al., 2005, Fopa Fomeju et al., 2014) and clubroot

resistance gene in B. rapa (Suwabe et al., 2006, Suwabe et al., 2012).

In the comparative analysis strategy, candidate resistance genes are likely to be structurally and

functionally conserved between related species providing insight into the evolution of these genes.

Therefore, the comparative analysis of the resistance gene sequences not only revealed the level of

conservation but also identified potential target genes showing or associated with disease resistance

QTL.

Candidate genes can be further studied via analysis of gene expression. The level of R genes

expression is regulated to ensure correct resistance responses. They are expressed constitutively at a

very low level which changes slightly following pathogen infection (Graham et al., 2000, Joshi and

Nayak, 2011). However, structurally and functionally conserved R genes are likely to show changes

in expression pattern contributing to the R gene diversity and resistance specificities. These related

R genes with different recognition specificities can be found within or between plant species as

21

single or a multiple alleles recognizing a particular pathogen race. The gene duplication followed by

sequence divergence in polyploid species such as B. napus is a primary mechanism for altering

expression patterns and causing phenotypic variations (Hanada et al., 2009, Wang et al., 2012).

Differential expression levels of orthologous or paralogous NBS-LRR genes reflect the expression

divergence and contribute to functional diversification (Blanc and Wolfe, 2004).

1.10.2 Sequence based candidate gene identification

Next-generation sequencing (NGS) technologies have advanced rapidly in the past few years,

providing the ability to sequence reference genomes as well as re-sequence cultivars of interest

(Duran et al., 2010, Malory et al., 2011). The combination of resistance genes profiling with next-

generation sequencing offers enormous advantages for genetic studies. Re-sequencing is used to

identify genetic variation between individuals, which can provide molecular genetic markers and

insights into gene function (Imelfort et al., 2009). Thus, the increasing use of next-generation

sequencing technologies has impact on disease gene discovery by speeding up the identification of

potentially candidate genes by their probability of being shown disease-relevant mutations.

These NGS strategies produce millions of sequencing reads in a single run, producing large

amounts of short-sequencing reads that are aligned/mapped to a reference genome sequence. This

way, the sequence reads were anchored to comparable sites in the target sequences, which allow the

alignment of the reads directly without the need of a prior assembly step (Batley and Edwards,

2009). Once this has been achieved, it is possible to determine read coverage and variation and

identify and characterise polymorphisms for candidate genes between individuals. Thus, the NGS

reads that mapped to the gene regions of interest can be used to assess complete sequence

divergence between the contrasting genotypes.

The approach was recently utilized to identify a candidate Rlm4 resistance gene. By taking

advantage of the B. rapa reference genome sequence (A genome) and next generation sequence

data (short reads) for B. napus cultivars, including the cultivars Skipton and Ag-Spectrum which

differed in resistance to Rlm4, the putative Rlm4 blackleg resistance genes can be identified among

candidate NBS-LRR genes located within the physically mapped Rlm4 QTL region on chromosome

A7. Preliminary data suggests that duplication and clustering of tightly linked genes in the QTL

region containing R genes on A7 indicate that the cloning of Rlm4 may also identify additional

blackleg resistance genes (Tollenaere et al., 2012) (Figure 1.7).

22

Figure ‎1.7: Sequence based identification of Rlm4 candidate gene using NGS data. Illumina short

reads: Skipton (in red arrows) and Ag-Spectrum (green arrows). Bold line shaded by grey

represents known sequence from Illumina reads (there is sequence coverage) compared with the

reference genome. Whereas, black and blue represent sequence obtained from PCR amplification

and sequencing.

1.10.3 SNP genotyping and association with candidate genes

A single nucleotide polymorphism (SNP) is a single base pair change in genomic DNA and the

most abundant type of DNA sequence variation in plant genomes. They are found in high density

across plant species. SNP variation between different alleles in contrasting genotypes has a direct

association with disease resistance and susceptibility (Heffner et al., 2009). In the model plant

Arabidopsis, more than 37,000 SNPs have been identified between the ecotypes Columbia (Col)

and Landsberg erecta (Ler) providing essential elements for a genotyping assay (Jander et al.,

2002). SNPs are invaluable genetic markers for developing genetic maps and genotyping

populations for QTL mapping and association mapping of important agricultural traits. In cases

where SNPs in genic regions have a significant change in amino acid causing the altered phenotype,

23

these SNPs are known as perfect markers. In B. napus, existing genomic resources for SNP

discovery and the ability to associate SNPs variation with particular traits creates an efficient

approach for gene identification (Westermeier et al., 2009, Edwards and Batley, 2010).

Genotyping is the process of identifying polymorphism between individuals in a genome or

genomic region. SNP genotyping has become extremely important to study the distribution of SNPs

and the relationship between SNPs and phenotypic variation in a populations (Syvanen, 2005). SNP

genotyping currently has many applications in plant genomics: identification of candidate resistance

genes, haplotype determination, quantitative trait loci (QTL) mapping, association mapping and

comparative genomics. SNP genotyping can be further used for the positional cloning of the

candidate gene underlying QTL and as a diagnostic marker (Kaur et al., 2012).

Genome wide association of SNPs and their correlation to a segregating population would

significantly identify regions that are associated with a particular phenotype gene (Rafalski, 2002).

The identification of SNPs for specific candidate genes might provide a direct correlation of allele

function with specific trait variation. Thus, SNP genotyping technologies provide an effective

approach for fine genetic mapping narrowing QTL region and its association with disease and

susceptibility. They are now routinely used to identify loci that are associated with many complex

traits (McCarthy et al., 2008).

The identification of candidate genes for flowering time in Brassica using EST-based SNP markers

has been possible (Li et al., 2009). There are many successful examples of identification of closely

linked SNP markers underlying QTL region to enhance resolution within the target locus and

narrow the number of candidate genes. For example, in rice, a number of SNPs had been

functionally associated with variation for the yield (Yu et al., 2011a). In soybean, SNP genotyping

improved narrowing the QTL for the aphid resistance gene, Rag1 with two linked SNP markers

and additional candidate genes were identified (Kim et al., 2010).

SNP genotyping involves selecting very large numbers of SNP markers, precisely located on a

genome, for detection of alleles and their association with a phenotype. Then, SNPs within the QTL

of interest are physically mapped to the reference genome and the genotypic data is compared

across different species, cultivars or segregating lines. Ultimately, a group of SNPs located within

the functional gene or at a small physical distance from the gene will be associated and identified.

This strategy provides a good solution to the problems associated with positional cloning and

physical mapping.

24

High-throughput genotyping assays (Infinium assay ) developed by Illumina have proven successful

and efficient for many crops (Chagné et al., 2007, Kaur et al., 2012) and could serve as the platform

of choice for SNP validation as well as for genotyping large populations (Durstewitz et al., 2010,

Hu et al., 2012)

The Infinium assay is a whole genome SNP genotyping assay which is capable of multiplexing

from 3,000 up to 1 million SNPs across a large number of individuals. It enables large-scale

analysis spanning the entire genome offering an ideal method to narrow down candidate

polymorphic SNPs for the whole genome. The Infinium assay develops BeadChips with customized

SNP content enabling virtually unlimited multiplexing in SNP genotyping. These BeadChips are

commercially available with a variable number of fixed or custom SNPs. Illumina BeadChips can

process up to 24 samples simultaneously, with four able to be processed at once, providing high

genome coverage, array efficiency and genotype call accuracy. Each SNP locus is assayed and

analysed independently for each sample yielding highly intensity genotype calls (Illumina,

www.illumina.com).

The Infinium assay consists of four steps (Figure 1.8), a whole-genome amplification and

fragmentation step, hybridisation to immobilised SNP-specific primers on BeadChips, followed by

single base extension with labelled nucleotides. The primers hybridise adjacent to the SNP and are

extended with a single fluorescently labelled dideoxynucleotide corresponding to the SNP allele.

The intensities of the bead’s fluorescence are detected by the iScan Reader (Illumina) to generate

the genotype calls (Illumina, www.illumina.com).

Allelic discrimination occurs through single-base primer extension reactions using a two-colour

assay. Data consists of two intensity values (X, Y) for each SNP, with intensity for each one of the

fluorescent dyes associated with the two alleles of the SNP. After normalization, the alleles

measured by the X colour are called the A alleles, whereas the alleles measured by the Y colour are

called the B alleles. Then, clustering and GeneCall (call rate) algorithms are used to determine the

genotype call depending on the clustering of tested samples at each SNP locus (Illumina,

www.illumina.com).

25

Figure ‎1.8: The principles of whole genome genotyping using the Illumina infinium assay.

modified from (Gunderson et al., 2006).

The availability of the Brassica A genome sequence (Wang et al., 2011) and NGS has made it

possible to develop high density SNP arrays the Illumina Infinium™ platform for Brassica

(Appleby et al., 2009). B. napus 6K Infinium™ chip comprised of 5306 of SNPs marker was

developed and was applied to genotype B. napus cultivars and mapping populations. The results

showed that the SNPs assay was successful with > 94% of these producing a reproducible,

polymorphic SNP alleles (Dalton-Morgan et al., 2014). The 6K chip was also used to genotype a

doubled haploid population derived from a cross between B. napus Skipton and Ag-Spectrum for

Rlm4 blackleg resistance association (Raman et al., 2014). The QTL analysis revealed the location

of Rlm4 locus was consistent with previous studies (Raman et al., 2012b, Tollenaere et al., 2012). A

number of SNP markers were identified linked to the Rlm4 locus. In this project, a 60K chip was

used to study the association to Rlm9 blackleg resistance in B. napus Darmor.

26

1.11 Research aims and significance

In order to understand disease resistance in Brassicas, the identification and characterization of

NBS-LRR genes in Brassica is important to provide comprehensive analysis into the resistance

genetics and genomic structure of NBS-LRR in Brassica species and disease resistance gene

conservation and evolution. Thus, the identification of NBS-LRR genes further leads to

identification and validation of potential sources of resistance genes involved in blackleg disease

resistance or other diseases in Brassica. In this project, the utilization and integration of the

information of genomics resources such as availability of Brassica genome sequences, NGS data,

SNP genotyping and QTL genetic maps are the foundation for identification of a candidate NBS-

LRR gene linked to disease traits allowing rapid cloning of functional resistance genes (Figure 1.9).

My primary motivation for the present study was to address the need for genome wide NBS-LRR

gene identification and characterization to identify candidate resistance genes within disease

resistance QTL interval. In particular, the identification of candidate Rlm9 blackleg resistance genes

in B. napus because B. napus cv. Darmor, which contains the Rlm9 resistance gene, was used as the

reference for B. napus genome sequencing.

The genetic studies have mapped all the resistance genes for the devastating pathogen

Leptosphaeria maculans to the A genome with none mapped to the C genome in Brassica species.

Thus, the blackleg resistance genes should be retained and conserved in the A genome of B. napus

and B. rapa, whereas, they are probably absent or highly divergent in the C genome of B. napus and

B. oleracea. By characterizing and comparing these NBS-LRR genes in these species, it would be

possible to identify candidate blackleg resistance genes and assess their genetic diversity between

Brassica species.

The primary objectives are:

1- A comprehensive study of NBS-LRR genes can be undertaken to provide new insights into

the genetic diversity of the R genes in Brassica species and to understand the genomic

architecture of this gene family. This involves identification and characterization of

candidate NBS-LRR genes and determines their distribution, domain structure, gene

clustering and duplication. Thus, the advances in Brassica genome sequencing enable

identification of large number of disease resistance genes and their characterization and

evolutionary relationships.

2- Comparative analysis of NBS-LRR genes between B. napus compared to B. rapa and B.

oleracea. The identification of candidate genes in Brassica species and studying the genome

27

organization and genetic diversity through comparative genomics will help to understand

NBS-LRR conservation and evolution. The better understanding of the syntenic structure of

NBS-LRR genes in ancestral genomes is required to study the genomic rearrangements such

as duplication and translocation within the B. napus genome that have occurred during

evolution. Previously, mapping and comparative studies have demonstrated that Brassica

species had undergone genome duplications and rearrangements. It would be interesting to

find how NBS LRR genes have been affected by the genome duplication and rearrangement

events in Brassica.

3- Study the correlation or association between a given diseases QTL regions and the candidate

NBS-LRR genes. This would allow identification of the physical location of QTL regions

into genome chromosomal maps and association of the QTL intervals with the genome wide

identification of disease resistance NBS LRR genes.

4- To identify candidate gene(s) for blackleg resistance in B. napus using all available

genomics resources such as QTL mapping, comparative studies and SNP genotyping. It

should be possible to tighten the QTL region and further narrow the list of candidate genes.

Then, the potential candidate resistance genes are to be identified and further subject to

sequence comparison studies.

This project is the first to conduct comprehensive analysis of NBS- LRR genes in the B. napus

genome and present comparative analysis of NBS-LRR genes between B. napus and its progenitor

species: B. rapa and B. oleracea. It will also enable identification of a large number of resistance

genes in Brassica species. In addition, the project will enrich the Brassica database with resistance

genes for several disease resistance traits, in particular, blackleg disease. The identification of

blackleg resistance genes in Brassica will provide some understanding of the key molecular

components of disease and enhance development and breeding of resistant varieties of B. napus.

28

Figure ‎1.9: Schematic representation of genomics based identification of candidate resistance gene(s) used in this study.

29

Chapter 2

Genome wide identification and

characterization of NBS-LRR

resistance genes in Brassica napus

30

Chapter 2 : Genome wide identification and characterization of NBS-LRR

resistance genes in Brassica napus

*This chapter contributed towards a paper being submitted

2.1 Abstract

Plant disease resistance genes play a critical role in providing resistance against pathogens. The

largest family of resistance genes are the nucleotide binding site (NBS) and leucine rich repeat

(LRR) genes. They are classified into two major subfamilies, TIR-NBS-LRR (TNL) proteins and

CC-NBS-LRR (CNL) proteins. The availability of the genome sequence of Brassica napus provides

an important resource to study the NBS-LRR structure and genomic organization.

This chapter presents a comprehensive overview of the genome wide NBS-LRR identification and

gene diversity in B. napus. A total of 641 NBS-LRR genes were identified in B. napus comprising

of a ratio of 1:2 of TNL to CNL genes. Further domain structure analysis revealed that 60% of the

NBS LRR genes are typical resistance gene with all three domains and the remaining are partially

deleted or truncated. Fifty nine percent of the NBS-LRR genes were found to be physically

clustered and individual genes involved in clusters were more polymorphic and subject to

evolutionary selection pressure. Fifty percent of the total NBS-LRR genes in B. napus were

identified as duplicates, suggesting a high level of genomic duplication and rearrangement.

This chapter provides a valuable resource for identification and association of candidate NBS-LRR

genes linked to disease resistance QTL allowing rapid cloning of functional NBS-LRR genes and

understanding the functional diversity.

31

2.2 Introduction

Plant disease resistance genes (R genes) are important components of the genetic resistance defence

mechanisms in plants. R genes are involved in a direct or indirect interaction with avirulence (Avr)

genes in order to trigger a defence response. Thus, understanding the molecular structure and

function of R genes has been crucial for plant resistance research. Genome sequencing efforts of

plant species have facilitated genome wide analysis of NBS-LRR genes in many plant species. For

examples, 438 genes in potato (Jupe et al., 2012), 535 genes in rice (Zhou et al., 2004), 333 genes in

Medicago truncatula (Ameline-Torregrosa et al., 2008), 402 genes in Populus trichocarpa (Kohler

et al., 2008), 221 genes in sorghum (Paterson et al., 2009), 167 genes in Arabidopsis (Yu et al.,

2014) and 355 genes in cotton (Wei et al., 2013). These studies showed that the NBS-LRR class of

genes is abundant and widely distributed throughout the genome. Moreover, genetic and genomics

studies demonstrated that the majority of NSB-LRR genes are present in gene clusters in plant

genomes (Hulbert et al., 2001). The clustered arrangement of these genes may be a critical attribute

of the generation of novel resistance specificities via gene duplication or recombination (Meyers et

al., 2003).

Sequence analyses revealed that NBS domains share a high degree of homology and have a number

of conserved motifs (Wan et al., 2012). To date, eight conserved motifs have been identified in the

NBS domains of NBS LRR classes including P-loop, kinase-2, kinase-3a, GLPL, RNBS-A-TIR,

RNBS-D-TIR, RNBS-A-non-TIR and RNBS-D-non-TIR (Wan et al., 2013). The conserved motifs

of NBS domains assist in cloning NBS-LRR genes via PCR amplification, alignment for

phylogenetic analyses and classification of NBS-LRR genes (Meyers et al., 2003).

The NBS-LRR proteins contain three domains: a NBS domain, a C-terminal LRR domain and an

N-terminal domain of either Toll Interleukin-1 receptor (TIR) or a coiled coil (CC). The NBS-LRR

genes can be subdivided into two distinct types based on the structure of their N-terminal domain:

either a coiled-coil (CC) motif or a Toll Interleukin-1 Receptor (TIR) domain (Meyers et al., 2003).

These form the two major subfamilies of plant NBS-LRR proteins: TIR-NBS-LRR proteins (TNLs)

and CC-NBS-LRR proteins (CNLs). The TNL group of genes has been observed only in dicot plant

species (Meyers et al., 1999, Goff et al., 2002). Whereas, CNLs (sometimes called non-TNL) are

found in both dicot and monocot plant species. Both families are involved in defence mechanisms

against pathogens in different signalling pathways. They are distinct in their NBS conserved

sequence motifs and therefore are found in separate clusters in phylogenetic analyses (McHale et

al., 2006).

32

A comprehensive study of NBS-LRR genes was undertaken to provide new insights into the genetic

diversity of the resistance genes in Brassica species and to understand the genomic architecture of

this gene family. This study is the first to conduct genome wide identification of NBS-LRR genes in

the B. napus genome (Chalhoub et al., 2014).

The main objective of this chapter is to identify NBS-LRR genes, characterize the functional

domain and motifs, and determine the distribution of NBS-LRR genes on the B. napus

chromosomes. It also aims to study the conservation, gene clustering and duplication of the NBS-

LRR genes. As a result, this will provide a valuable resource for identification of candidate NBS

LRR genes linked to disease traits allowing rapid cloning of functional NBS LRR genes.

33

2.3 Methods

2.3.1 Brassica genome and gene prediction

The genome sequence and gene annotation of B. napus cv. Darmor was used as the reference

genome for genomic studies on B. napus (Chalhoub et al., 2014).

2.3.2 Identification of NBS-LRR genes in B. napus by MEME/ MAST analysis

The MAST/MEME (Motif Alignment Search Tool/Multiple Em for Motif Elicitation) suite of

software was used to identify predicted genes that contain motif homology to known disease

resistance genes (Bailey et al., 2009). NBS-LRR “positive” and non NBS-LRR “negative” sequence

training sets which were consensus of 20 amino acid motifs derived from MEME analysis (Jupe et

al., 2012) were used as queries in a MAST search against the predicted genes of the B. napus

genome. Predicted genes were considered to be candidate CNL or TNL if the reported MAST E-

value were ≤ E-24

.

2.3.3 Identification of NBS-LRR genes in B. napus by CNLs and TNLs for validation

To further validate the results from the MAST output, consensus sequences of plant CNLs and

TNLs (Cannon et al., 2002, Cannon et al., 2004) were obtained from a previous study (Ameline-

Torregrosa et al., 2008). Candidate genes containing NBS domains were identified using tBLASTn

and BLASTp (maximum E-value 1E-5) (Altschul et al., 1997) performed using the consensus

sequences below against the B. napus cv. Darmor genome. Candidate NBS-LRR proteins were

provisionally assigned to either the CNL or TNL groups on the basis of similarity.

Consensus of CNLs used for BLAST search was as follows:

erpsestiVGletmleklwnrLledndvgivgiyGMGGVGKTTLatqifNdfdvkgehFdrviWVvVSkefnvekiqqdIl

ekLglgdeewlekteeekaaeienLfqlLegKkfLLvLDDvWekevdLdkigvpfPdrenGsKvlfTTRsesvavcgdmgv

dxmevecLtpeeAWeLFqkkvfentlksdpeieelaKevvkkCgGLPLAlkVlGgllacKrtvqEWkraievlssslaaefsg

messilpvLklSYdnLppelKsCFLYcalFPEDykIekekLieyWiaEGfideseggetaedvGyeylgeLVrrsLleegdk

tdnetsrketVkMHDvvREmALwiaseegfkeviiVraGvglreipnvkswntvrRmSlmnneieelldspenpklrsLltlllq

snsh.

Consensus of TNLs used for BLAST search was as follows:

RDFddlVGiEaHlekmksLLcLdsdeVrMVGIwGPaGIGKTTIARALfsqLSssFqlsaFmenlrgsyStrpaglD

eYsmKLhLQeqfLSkILnqkDikIhHLGvieERLkdqKVLIiLDDVDdleQLdALAketqWFGpGSRIIVT

TeDkqLLkaHgInhIYeVgfPSkeeALqIFCrsAFgQnsPpdGFeeLAreVtkLaGnLPLGLrVlGSsLRGkske

eWedmLpRLrtsLDgkIekvLrvsYDgLhekDqaLFLhIACfFNgekvdyVkalLadsnLDVrqGLkvLadKSLI

hisplgdgtieMHnLLqqLGReIVrkQsidePgKRqFLvDaeeIcdVLtdnTGtgsVlGIslDtseieeelnIsekAFeg

MrNLqFLriykksfrddgk.

34

2.3.4 Correlation between the MAST output and CNL/TNL BLAST results

The NBS-LRR candidate genes identified from the MAST output were compared with CNL and

TNL BLAST results to validate the NBS LRR genes and to classify them according to the BLAST

similarity to either the CNL or TNL groups. Furthermore, any candidate NBS-LRR that did not

correlate between the two approaches was subject to further search against NCBI non-redundant

nucleotide database.

2.3.5 Manual annotation check and characterization of candidate NBS-LRR genes

The annotation of all identified NBS-LRR gene were manually checked using gene prediction

programs to obtain information on complete open reading frames (ORFs). Every individual gene

sequence was manually annotated and the annotation compared between all gene prediction

programs. The prediction programs used were FGENSH (http://www.softberry.com), Genscan

(http://genes.mit.edu/GENSCAN.html) and AUGUSTUS (http://bioinf.uni-

greifswald.de/augustus/).

All predicted NBS-LRR genes were surveyed to further determine their domain structure and

whether they contained TIR, CC, NBS, and/or LRR motifs. The predicted genes were searched for

similarity to known proteins using BLASTp (Altschul et al., 1997) with a threshold of E-value < -

10 against the NCBI non-redundant (nr) protein database. The structural information on protein

domains was used to classify the NBS-LRR genes into subfamilies.

2.3.6 Multiple alignment and phylogenetic analysis

Alignment and phylogenetic analyses were performed with all NBS-LRR proteins. The NBS region

was defined as the region extending from the P-loop to the MHDV motif, which contains about 300

amino acids, using the corresponding conserved motifs from B. rapa (Mun et al., 2009).

To extract the NBS domain, CNL and TNL consensus sequences were aligned with NBS-LRR

genes using alignment in Geneious Pro v6.1.2 (Kearse et al., 2012) and then the NBS domain

manually extracted from the P loop motif to the MHDV motif or in case there is a missing motif,

the NBS domain extracted to get other internal motifs. In addition, a search comparison in NCBI

non-redundant (nr) protein database was used to extract the NBS domain and also to determine the

type and length of NBS motifs within the NBS domain.

35

Genes with ≤ 50% of total NBS domain length, such as genes with partial NBS domains with length

less than 150 aa or genes lacking NBS domains such as TIR genes, TIR-LRR genes and linker LRR

genes were excluded from phylogenetic analysis.

The NBS sequences were then aligned using CLUSTAL W (Thompson et al., 1994). A

phylogenetic tree was constructed using the neighbour-joining method (Saitou and Nei, 1987) with

bootstrap multiple alignment resampling set at 1000. These functions were performed using

Molecular Evolutionary Genetics Analysis (MEGA) software version 6.0 (Tamura et al., 2013).

2.3.7 NBS-LRR genes cluster and duplication analyses

NBS-LRR gene clusters were determined by their physical position on each chromosome according

to criteria previously published and widely used (Holub, 2001, Richly et al., 2002, Meyers et al.,

2003). The parameter to define a cluster was three or more NBS-LRR genes that occurred within a

maximum of eight non-NBS genes. Furthermore, the distance between neighbouring NBS-LRRs

was required to be ≤ 200 kb.

Gene duplication was defined in accordance with the criteria previously published and widely used

(Zhou et al., 2004): (1) the alignment covered more than 70% of the longer gene, and (2) the

aligned region had an identity >70%. A block of duplications was defined if more than one gene in

a gene cluster was involved in the duplication. Homologous or paralogous genes were confirmed by

the BLASTp comparison of all the predicted NBS-LRR proteins against each other with E-value of

E-20. This stringent E-value of E-20 was used to ensure the correct definition of singleton genes as

those having only one copy in the genome.

36

2.4 Results

2.4.1 Genome wide identification of B. napus NBS-LRR genes

The genome wide identification of NBS-LRR resistance genes has become an essential component

for better understanding of the genomics and evolution of R genes and resistance genetics. This

study is the first to perform comprehensive analysis on identification and characterization of NBS-

LRR genes in the B. napus genome taking advantages of the recent sequence completion of the

genome. Two methods were used to identify candidate NBS-LRR genes: MEME/MAST analysis

and CNL and TNL consensus sequences. MEME was used in combination with a positive sequence

set of characterised NBS-LRR protein sequences, from diverse plant species, and a negative

sequence set, containing diverse nucleotide binding protein sequences, to create a list of 20

sequence motifs putatively characteristic of NBS-LRR proteins which were then used as queries in

a MAST search against the predicted genes of the B. napus genome. Subsequently, predicted genes

were considered to be candidate CC or TIR NBS LRRs if the reported MAST E-value were ≤ E-24

.

In addition, candidate genes containing NBS domains were identified based on a BLASTp and

tBLASTn search of CNL and TNL consensus sequences as a query at a cut-off value of E -5 to

ensure that there were no other related genes missing from the gene prediction.

The initial classification of candidate NBS-LRR genes into CNLs or TNLs were mostly based on

CNL/TNL consensus BLAST results, as well as taking into account the presence of the TIR domain

motifs among motifs composition in the MAST output. However, the MEME/MAST analysis was

found to be highly dependent on the gene prediction. If the gene prediction is very accurate, an

improved output is obtained. For instance, the MAST E-value could drop down below the cut off

due to the gene annotation splitting one gene into multiple small genes. Furthermore, manual

checking of gene annotation revealed some inaccuracies and there were some genes with predicted

amino acid sequence that was still not complete due to sequence errors leading to insertion of false

introns or false start or stop codons. These were mainly the genes in the MAST output with E-value

close to the cut off E-24. Therefore, in our analysis, less stringent criteria was applied in order to

avoid underestimation of the total number of NBS-LRR genes in B. napus.

Furthermore, an essential component of our analysis was the manual re-annotation of individual

NBS-LRR genes that had been partially mis-annotated during the automated annotation process due

to minor errors such as incorrect start codon predictions, splicing errors, missed or extra exons,

fused genes or split genes. This step assisted to obtain reliable downstream analysis in domain

structure phylogenetic relation and to define gene duplications.

37

In total, 641 candidate NBS-LRR genes were identified in B. napus consisting of 615 genes from

the MAST output with reported E-value less -24 plus additional genes identified from consensus

sequences of TNL and CNL (Table 2.1). Approximately 80% of the identified NBS-LRR genes

were identified by both two methods (Table 2.1).

Table ‎2.1: The total number of predicted NBS-LRR genes identified in the B. napus genome. The

comparison between methods used: MEME / MAST and CNL and TNL BLAST

Initial analysis using BLAST searches for a number of genes that were not correlated between the

two methods showed that most of them were likely to be pseudogenes. They had very strong

identity with other full NBS protein but their sequence was truncated by either a premature stop

codon or a frameshift mutation causing them to fall under the level of significance of MAST or

CNL/TNL analysis.

In order to examine the motif compositions in the CNLs and TNLs they were subjected to domain

and motif analyses using the program MEME/MAST. The 20 putative conserved motifs were found

among them as shown in Table 2.2 and Figure 2.1. It can be seen that the 20 major motifs and

particularly eight motifs in the NBS domain have distinct patterns depending on whether they are

present in the TNL or CNL groups. They also differed in their divergence within and between the

CNL and TNL groups.

Species B. napus

Genome size (Mbp) 1284

Gene content 111,479

Total candidate NBS-LRR genes 641

Proportion of NBS-LRR genes 0.58%

Identified in MAST output 615

Common to MAST and CNL/TNL analysis 509

Not common to MAST and CNL/TNL analysis 136

Agreement % 79%

38

Table ‎2.2: NBS-LRR specific motifs sequence identified with MEME/MAST.

Motif No. Length

(bp) Motif domain

Group or class

Motif sequence

1 21 P-loop kinase 1 NB-ARC CNL/TNL PIWGMGGVGKTTLARAVYNDP

2 21 GLPL NB-ARC CNL/TNL CGGLPLAIKVWGGMLAGKQKT

3 15 Kinase 2 NB-ARC CNL/TNL YLVVLDDVWDTDQWD

4 15 RNBS-B NB-ARC CNL/TNL GSRIIITTRNKHVAN

5 15 RNBS-D NB-ARC CNL LKPCFLYCAIFPEDY

6 21 MHDV NB-ARC CNL/TNL CRMHDMMHDMCWYKAREQNFV

7 15 RNBS-A NB-ARC CNL HFDCRAWVCVSQQYD

8 21 -- Linker CNL/TNL MEDVGEYYFNELINRSMFQPI

9 21 LRR LRR CNL/TNL LIHLRYLNLSGTNIKHLPASI

10 19 RNBA-C NB-ARC CNL/TNL YHMQFLSHEESWQLFHKHA

11 19 LRR LRR CNL/TNL MPNLETLDIRNCPNLEEIP

12 21 -- Pre-NB CNL IDRNKLIWLWMAEGFVPHENG

13 15 -- NB-ARC CNL/TNL NEIMPILRLSYHHLP

14 25 TIR 3 TIR TNL QIVIPIFYDVDPSDVRHQTGSFGEA

15 17 -- Pre-NB CNL DAAYDAEDVIDSFKYHA

16 50 -- Monocot -- AIKDIQEQLQKVADRRDRNKVFVPHPTRPIAIDPCLRALYAEATELVGIY

17 21` TIR 2 TIR TNL KNYATSRWCLNELVKIMECKE

18 50 EDVID Monocot CNL ETSSFELMDLLGERWVPPVHLREFKSFMPSQLSALRGWIQRDPSHLSNLS

19 27 LRR LRR -- FLDIACFFRGRKKDYVMQILESCDFGA

20 21 TIR 1 TIR TNL KYDVFLSFRGPDTRKTFTSHL

39

Figure ‎2.1: Example of representation of motif patterns in NBS-LRR genes in B. napus. The different coloured boxes indicated distinct motifs

as shown in boxes below identified by MEME/MAST program. Refer to Table 2.2 for the type of each motif.

40

2.4.2 Genomic distribution and organisation of NBS-LRR encoding genes in the B. napus

genome

To find the distribution of NBS-LRR genes in the B. napus genome, the genes were plotted on a

chromosomal map based on their physical position in the genome. The NSB-LRR genes were found

to be distributed randomly and unevenly across the genome. The TNLs and CNLs are present on all

19 chromosomes in B. napus. However, TNL genes are more widely distributed across the B. napus

chromosomes (Figures 2.2 and 2.3).

The comparison of the total number of NBS-LRRs and whether they were classified as TNLs or

CNLs in both A and C genomes showed that there was a predominant presence of TNLs over CNLs

with about 70% of the genes comprising TNLs. This represents a ratio of 1:2.5 of CNLs to TNLs

(Table 2.3).

Approximately 40% of the NBS-LRR genes in the A genome of B. napus are located on

chromosomes A9 and A2 harbouring 58 and 44 genes, respectively. The chromosomes A5 and A10

contain the lowest number of NBS-LRR genes in the A genome (Figure 2.2). In the C genome,

chromosome C9 has the highest number of NBS-LRR genes followed by chromosome C3 (Figure

2.3).

All predicted NBS-LRR genes were surveyed to further determine their domain structure order to

classify these NBS genes with their predicted domains into subfamilies accordingly. The analysis of

the 641 candidate NBS-LRR genes against the non-redundant protein database and MEME/MAST

identified 366 (57%) genes were typical or regular NBS-LRR genes: 124 CNLs and 242 TNLs

(Table 2.4). These 366 genes show highly conserved NBS regions and complete open reading

frames. The remaining 188 genes (29%) were classified as non–regular genes because of the lack of

specific domains. These genes were classified into three distinct groups for TNL and CNL. Non-

regular TNLs were TN (74), N (9) and NL (43). Non-regular CNLs were classified as: CN (24), N

(8) and NL (23). These non-regular genes were described as partial or truncated within the N-

terminal domains and/or absence of LRR domains. In addition, 87 TIR genes which lack both NBS

and LRR domain were also identified in B. napus.

Furthermore, 75 NBS-LRR genes (44 CNL and 31 TNL (21 TIR + 10 TNL) are encoded by a single

reading frame without introns (single exon). The average number of exons in the total NBS-LRR is

4 exons where the average in CNLs is 3 exons, which was lower than that in TNLs which have an

average of 5 exons.

41

Table ‎2.3: The distribution and organisation of candidate NBS-LRR genes in B. napus

Ch

rom

oso

me

Tota

l

TNL

CN

L

No

. of

gen

e

clu

ste

r

No

. of

gen

e in

clu

ste

rs

No

. of

du

plic

ate

d

gen

es

(TN

L+C

NL)

No

. of

du

plic

ate

d

gen

es in

cl

ust

ers

A g

eno

me

B. n

ap

us

Dar

mo

r

A1 28 17 11 2 7 18(10+8) 4

A2 44 33 11 5 31 22(20+2) 15

A3 24 20 4 5 18 16(13+3) 6

A4 13 12 1 1 7 5(5+0) 3

A5 10 4 6 1 4 9(3+6) 5

A6 21 7 14 2 6 16(5+11) 5

A7 19 16 3 2 10 14(14+0) 8

A8 24 17 7 3 12 13(9+4) 3

A9 58 38 20 10 43 38(23+5) 24

A10 11 7 4 0 0 6(4+2) 0

Total 252 171 81 31 138

(54%) 157 (106+51)

(62%) 73 (28%)

C g

en

om

e B

. na

pu

s D

arm

or

C1 37 24 13 3 21 16(9+7) 10

C2 51 47 4 5 34 21(18+3) 13

C3 56 46 10 7 35 20(14+6) 12

C4 18 12 6 1 4 5(3+2) 2

C5 19 11 8 4 13 13(8+5) 7

C6 44 33 11 4 30 28(20+8) 17

C7 38 30 8 5 25 14(10+4) 8

C8 39 21 18 4 23 19(9+10) 11

C9 79 62 17 8 55 26(19+7) 18

Total 381 286 95 41 240

(62%) 162 (110+52)

(42%) 98 (25%)

Un assigned

8 4 4 0 0 0 0

Total 641 461 180 72 378

(59%) 319

(50%) 171

(53%)

42

Figure ‎2.2: Physical locations of NBS-LRR genes on the chromosome of the A genome of B. napus. Closed circles and triangles above and below

each chromosome (grey bars) with different colour system are designated for class: TNL and CNL, and their subfamily. The black straight line

connecting the NBS-LRR genes represents tandemly duplicated genes. Genes inside red rectangles represent homogenous cluster, whereas the blue

rectangles represent heterogeneous cluster. Numbers above the rectangles represent cluster size in kbp. Chromosome lengths are shown in megabase

pairs on the scale at top.

43

Figure ‎2.3: Physical locations of NBS-LRR genes on the chromosome of the C genome of B. napus. Closed circles and triangles above and below

each chromosome (grey bars) with different colour system are designated for class: TNL and CNL, and their subfamily. The black straight line

connecting the NBS-LRR genes represents tandemly duplicated genes. Genes inside red rectangles represent homogenous cluster, whereas the blue

rectangles represent heterogeneous cluster. Numbers above the rectangles represent cluster size in kbp. Chromosome lengths are shown in megabase

pairs on the scale at top.

44

Table ‎2.4: The number of NBS-LRR genes classified into different classes and their subfamilies

and domain compositions

2.4.3 NBS-LRR gene clustering

The gene cluster was defined as a cluster of three or more NBS-LRR genes located within the 200

kb or the presence of no more than eight non- NBS-LRR genes between the two adjacent NBS-LRR

genes. Using these parameters, 59% (378) of the total NBS-LRR genes identified in B. napus reside

in a gene cluster and are unevenly distributed over the chromosomes. Interestingly, the A and C

sub-genome in B. napus exhibited similar patterns of gene clustering. Moreover, some of these

NBS-LRR genes are located closely to clustered genes or even create a rich region of NBS-LRR

genes. A total of 72 clusters were observed in B. napus where the highest number was 10 clusters

found in A9 followed by 8 clusters in C9 and 7 clusters in C3 (Figures 2.2 and 2.3). The

chromosome A10 does not have any NBS-LRR clusters. The highest gene number in a cluster was

found in C6 and C9 with 11 genes followed by 10 genes in C9 and 9 genes in A2. The average

number of genes in a cluster was 5 genes where the average in the A genome (average 4 genes) was

found to be smaller than in the C genome (average 6 genes). With respect to TNLs and CNLs

clustering, there were more TNL clusters (23 clusters across the A genomes and 28 clusters on the

C genome) than CNL clusters (6 clusters on the A genome and 8 clusters on the C genome).

Although CNLs are rarely mixed with TNL genes, there are two mixed clusters of TNLs and CNLs

in the A genome and 6 mixed clusters in the C genome.

In addition, cluster size was determined by the sequence length between the two NBS-LRR located

on both side of the gene cluster. The cluster size varied in the A genome between 10 kbp (cluster in

A8) and 488 kbp (cluster in A9) (Figure 2.2). For the C genome, the cluster size varied between

14kbp (cluster in C8) and 837kbp (cluster in C9) (Figure 2.3).

When combining phylogenetic analyses and physical clustering of NBS-LRR genes, out of the total

72 clusters, 32 clusters were located in monophyletic clades and shared recent common ancestor.

CNL 180 TNL 461 Total 641

CNL - NBS - LRR 124 TIR - NBS - LRR 242 Regular

NBS-LRR gene 366(57%)

NBS - LRR 23 NBS - LRR 43

None Regular

NBS-LRR gene 188(29%)

NBS 8 NBS 9

CC - NBS 24 TIR - NBS 74

LINKER-LRR 1 LINKER-LRR 3

TIR - LRR 3

TIR 87 87(14%)

45

These clusters were considered to be homogeneous (highly similar to other NBS genes in the same

genomic cluster). The remaining 40 clusters are heterogeneous, as they contained more distantly

related NBS-LRR genes.

2.4.4 Alignment and phylogenetic analysis

To assess the sequence diversity and evolutionary relationships among the predicted NBS-LRR

genes, a phylogenetic tree was estimated from the protein alignment of the conserved NBS

domains. NBS sequences were used because they are present in both CNL and TNL proteins and

contain numerous conserved motifs that assist proper alignment. The alignment and phylogenetic

analysis of 176 CNL genes and 366 TNL genes were performed separately to identify the similarity

and relationship between genes. There were many genes excluded from the analysis due to more

than 50% deletion in the NBS domain or lack of an NBS domain, such as TIR genes. The multiple

alignments showed that the eight major motifs differed in their conserved residue and there was

divergence between TNLs and CNLs (Table 2.5).

Two phylogenetic trees were constructed separately using a neighbour-joining (NJ) method from

NBS sequences of CNLs and TNLs (CNL, Figure 2.4 and TNL, Figure 2.5). Bootstrap values

(1000 replicates) were used as a tree reliability test to estimate the confidence for each branch point.

From the values obtained in the bootstrap analysis, it was apparent that most of tree branches had a

value of more than 50% confidence. There were branches with low bootstrap scores in which it

does not necessarily indicate unreliable branching, but low confidence of assigning a particular

group.

The analysis revealed different evolutionary patterns of CNL and TNL genes. The phylogenetic

trees were divided into clades on the basis of clade rooting. This resulted in 16 clades for CNLs and

20 clades for TNLs (Figures 2.4 and 2.5).

The phylogenetic trees of TNLs and CNLs with their chromosome of origin provide a relationship

between genes from homeologous chromosome. Therefore, the TNL genes located in the

homeologous chromosomes of B. napus sub-genomes were likely found in the same clade. For

example, TNLs on chromosome A7 are often present in the same clade as TNLs on chromosome

C6 (Figure 2.5). There were 26 TNLs on chromosome C6 located next to 14 TNLs on chromosome

A7. This further confirms that the genes involved in formation of homogeneous clusters in

chromosomes A7 and C6 tend to be located in the same clade.

46

Table ‎2.5: Conserved and consensus sequence of NBS domain motifs found in the NBS-LRR genes identified in B. napus

Class Motif Consensus sequence* Class Motif Consensus sequence*

TNL

P-loop Kinase 1 VRMIGIWGPAGIGKTTIARALFNQLS

CNL

P-loop Kinase 1 VGIVGIYGMGGVGKTTLLRQINN

RNBS-A DDYNLKGIXXLLHLQEELLSKILNQK RNBS-A VIWVVVSQXFXDXXXGLDXXKKIQQ

Kinase 2 HLGVAXERLKDKKVLVVLDDVDDLXQLE

Kinase 2 LXXKKFLLVLDDIWXKVDLDWDLIGVPF

RNBS-B WFGPGSRIIITTRDK RNBS-B NGCKVVFTTRSXEVCGRMGXDDPM

RNBS-C HIYEVXFPSXXEALQIFCXYAFGQKSPP

RNBS-C VXCLTXEEAWELFXKKVGE

GLPL LAXEVVXLAGGLPLGLRVLGSXLRGKSXEEW

GLPL VAXKCGGLPLALNVIGE

RNBS-D LRXSYDGLXXEDKXLFLHIACFFNG

RNBS-D KSCFLYCALFPEDYKIXKEXLIEYWIAEG

MHDV MHXLLQQXGREIVR MHDV VKMHDVVREMALWIAS

* Conserved and Consensus amino acid sequence was derived from ClustalW multiple alignments of all predicted TNL or CNL genes. Bold residues indicate conserved sequence

identical to the motifs detected in Brassica rapa (Mun et al., 2009) and CNL/TNL consensus sequence (Ameline-Torregrosa et al., 2008). There is high agreement in conserved

sequence of NBS motifs between B. rapa and CNL/TNL consensus sequences

47

Figure ‎2.4: A phylogenetic tree of 176 CNL encoding genes of B. napus for NBS domains based

on the neighbour-joining method using MEGA 6.0 software. The numbers on the branches indicate

the percentage of 1000 bootstrap replicates. The tree only shows values > 50%. Gene names are

intended to represent chromosome and domain configurations. The blue and red coloured branches

are to separate the clades. The closed green triangle represent a homogenous cluster on

chromosome C6 where all genes sequence are related and located within a clade as well as their

relationship with genes on homologous chromosome A7 as represented by unfilled green triangles.

48

Figure ‎2.5: A phylogenetic tree of 366 TNL encoding genes of B. napus for NBS domains based on

the neighbour-joining method using MEGA 6.0 software. The numbers on the branches indicate the

percentage of 1000 bootstrap replicates. The tree only shows values > 50%. Gene names are

intended to represent chromosome and domain configurations. The blue and red coloured branches

are to separate the clades. The closed green triangles represent a homogenous cluster on

chromosome C6 where all genes sequence are related and located within a clade as well as their

relationship with genes on homologous chromosome A7, as represented by unfilled green triangles.

49

2.4.5 NBS-LRR gene duplication

The gene duplications of B. napus NBS-LRRs were confirmed by the BLAST comparison of all the

predicted proteins against each other. A total of 319 (50%) of the total NBS-LRR genes were

identified as duplicates (Table 2.3). There were 216 TNL duplicates and 103 CNL. The number of

duplicated genes was consistent between the two sub-genomes. Among the 19 chromosomes,

chromosome A9, C6 and C9 contain the highest number of duplicated NBS-LRR genes.

The duplicated genes were distributed randomly and 53% (171 duplicated genes) of total duplicate

genes were involved in formation of gene clusters, which may explain that tandem duplication plays

a role in gene cluster expansion (Table 2.3). Gene clusters containing tandemly duplicated genes

are shown in Figures 2.2 and 2.3. For instance, the gene cluster in chromosome A1 consists of

three TNL duplicates and one gene outside the cluster. Another example is the gene cluster in

chromosome A5 where all CNL genes are duplicates and form a cluster.

Two types of genomic duplication were observed (Figure 2.6). Intra-genomic duplications within

the B. napus A or C sub-genome are involved in duplication events at the level of an intra-

chromosomal region of the sub-genome. The A and C genomes had 16 and 24 duplication events,

respectively, involved in intra-chromosomal duplications. Moreover, chromosome A4 and A7 did

not show intra-genomic duplications. In contrast, inter-genomic duplication between the B. napus

genome (between A and C genome) involved a duplication between homeologous chromosomes of

the A and C genomes. There were 41 duplication events involved in duplicate loci between A and C

genome. The duplicated genes showed collinearity between homeologous chromosomes. For

example, 14 genes on chromosome A1 were collinear with chromosome C1 (Figure 2.6). However,

there are examples where they had a collinear relationship with a gene in non-homeologous

chromosomes. For example, five genes in chromosome A1 were collinear with a gene on

chromosome C3, C5 and C7 (Figure 2.6).

Furthermore, block duplication is considered as segmental duplication and defined as duplication of

a chromosomal region that includes more than one gene. Eleven duplication blocks were observed

between the B. napus homeologous chromosomes in which a block contained four to five genes

together in the duplication.

50

Figure ‎2.6: Inter-genomic and intra-genomic duplication relationship of NBS-LRR -genes between

the two sub-genomes of B. napus: the A and C genomes. Coloured lines represent the relationship

of paralogous gene pairs between homeologous chromosomes. Red lines represent the gene

duplication between chromosomes within a sub-genome. Black lines represent the gene duplication

between homologous chromosomes with collinearity. Blue lines represent the gene duplication

between non homeologous chromosomes. The green block represents a duplicated block where

there is more than one gene involved in duplication. Chromosomes are scaled based on the number

of NBS-LRR genes.

51

A study identified 21 conserved blocks within the Arabidopsis genome which showed synteny to

almost 90% of the B. napus genome (Parkin et al., 2005). This ancestral karyotype (AK) block

system was used to find a correlation between the B. napus NBS-LRR duplications and the

conserved blocks.

According to the conserved AK blocks relationships with B. napus chromosomes, the duplication

between NBS-LRR genes was defined as duplication between conserved genome regions within B.

napus chromosomes similar to the pattern of conserved blocks. Hence, the duplications between B.

napus chromosomes can be predicted based on the conserved blocks patterns between different

chromosomes. For example, conserved block 3A appeared in six chromosomes: A1, A3, A5, C1,

C3 and C5 (Table 2.6) suggesting that duplication may occur between them. This results in

comparison of the genome duplications based on NBS-LRR genes within and between B. napus A

and C sub-genomes.

In this study, the NBS-LRR genes duplication patterns in all B. napus chromosomes would be

expected to be correlated with the conserved AK blocks (Parkin et al., 2005). For instance, there

were two intra-genomic duplications in A1 with A6 and A8. The correlation indicated that the

duplication could occur between A1 and A8 but not with A6, because the conserved blocks in A1

have similar conserved blocks in A8 whereas there are no matching blocks between A1 and A6.

The correlation between NBS-LRR gene duplication and conserved blocks were significant as I

found that duplication pattern matching was about 70% and 85% in intra-genomic and inter-

genomic duplications, respectively (Table 2.6).

52

Table ‎2.6: The correlation between B. napus NBS-LRR intra-genomic and inter-genomic duplications and conserved blocks found between B. napus

and Arabidopsis

chr Arabidopsis Conserved blocks1 NBS-LRR

Intra-genomic duplication 2

Duplicated patterns

matching3

NBS-LRR Inter-genomic Duplication2

Duplicated patterns matching3

A1 4B 3A

A6 , A8 A8 C1 , C3 , C5, C7 , C9 C1 , C3 , C5 , C7

A2 5A 5E 1E 5D 3B 5B 5F

A10 A10 C1 , C2 , C7 C2 , C7

A3 5A 5E 2C 2B 4A 3A 4B

A9 A9 C2 , C3 , C7 C2, C3 , C7

A4 3D 4B 5C 2B 2C

--- --- C3 , C4 C3 , C4

A5 2C 1B 1C 3A

A9 --- C4 , C5 , C6 , C9 C4 , C6 , C5

A6 1C 1A 1B 5E 5F 5B 5C 3B 5D

A1 , A8 , A9 A8 , A9 C1 , C2 , C5, C6 , C7 , C8 , C9 C2 , C5, C6 , C7 , C8 , C9

A7 2A 1B 2B 3D 1E

--- --- C6 C6

A8 1C 4B 1B 1A

A1 , A6 , A9 A6 , A9 C1 , C3 , C5 , C6 , C7 , C8 C1 , C3 , C5 , C6 , C7 , C8

A9 4A 5B 1D 5D 4A 1B 3D 2B 1A

A3 , A5 , A6 , A8 A3 , A6 , A8 C2 , C3 , C4 , C5 , C7 , C8 , C9 C2 , C3 , C4 , C5 , C7 , C8 , C9

A10 1A 5E 5A

A2 A2 C2 , C6 , C9 C1 , C2

16 11(68%) 41 35(85%)

C1 4B 3A

C2 , C3 , C5 , C6 , C7 C3 , C5 A1 , A2 , A6 , A8 A1 , A8

C2 5A 5E 1E 5D 3B 5B 5F

C7 , C9 C7 , C9 A2 , A3 , A6 , A9 , A10 A2 , A3 , A6 , A9 , A10

C3 5A 5E 2C 2B 4A 3A 3B 3C 1B 4B C1 , C6 , C7 , C9 C1 , C7 , C9 A3 , A8 , A9 A3 , A8 , A9

C4 2C 2B 1D 3D 4B 5C 2B 2C

C9 C9 A4 , A5 , A9 A4 , A5 , A9

C5 1A 1B 3A

C1 , C8 , C9 C1 , C8 A1 , A5 , A6 , A8 , A9 A1 , A5 , A6 , A8 , A9

C6 1C 1E

C3 --- A1 , A5, A6 , A7 , A8 , A10 A5, A6 , A7 , A8

C7 2A 1B 5D 3B 5C 5B 4B

C1, C2 , C3 C1 , C2 , C3 A1 , A2 , A3 , A6 , A9 A1 , A2 , A3 , A6 , A9

C8 1C 4B 1A 1B 3D 1A

C1 , C5 C1 , C5 A6 , A8 , A9 A6 , A8 , A9

C9 4A 5B 5F 1D 5D 4A 5E 5A

C2 , C4 , C5 C2 , C4 A1 , A2 , A3 , A5 , A6 , A9 , A10 A2 , A3 , A6 , A9 , A10

24 17(70%) 41 35(85%)

1. Identified genome conserved blocks showing conservation of loci content and order between Arabidopsis and B. napus genome (Parkin et al., 2005). Blocks are arranged in

order from the top to bottom of the B. napus chromosomes.

2. The NBS-LRR gene intra-genomic or inter-genomic duplication confirmed by BLAST comparison of each individual NBS-LRR against all NBS-LRR genes in B. napus.

Then, the chromosome was determined based on the location of NBS-LRR gene in B. napus chromosomes

3. Based on the correlation between conserved blocks and NBS-LRR duplications on B. napus chromosomes.

53

2.5 Discussion

2.5.1 Genome wide identification of B. napus NBS-LRR genes

The recent whole genome sequencing of B. napus enables the genome wide identification, mapping

and characterization of candidate NBS-LRR genes in economically important plants such as

Brassica species. The genome wide NBS-LRR resistance genes have been extensively studied

because they represent the largest class of disease resistance gene and play a critical role in

defending plants from pathogens. Thus, identification of NBS-LRR resistance genes is one of the

most important objectives of understanding the resistance genetics. Here, this study aimed to

analyse the Brassica genomes for identification and characterization of candidate NBS-LRR genes

and to determine their distribution, domain structure, gene clustering, duplication and conservation.

In this study, I have characterized the complete set of 641 NBS-LRR genes in the Darmor B. napus

genome with about 70% of the genes comprising TNLs. This represents a ratio of 1:2.5 of CNLs to

TNLs. The TNL genes make the largest proportion of the NBS LRR genes and show large domain

diversity compared with the CNLs genes, as was also previously reported in B. rapa (Mun et al.,

2009, Yu et al., 2014), Arabidopsis (Meyers et al., 2003, Yu et al., 2014), Medicago truncatula

(Ameline-Torregrosa et al., 2008), poplar (Kohler et al., 2008) and linseed (Kale et al., 2012). The

high number of NBS-LRR resistance genes identified in this study could be explained mainly by the

complexity of the B. napus genome duplication and recombination

The predominance of TNL or CNL genes in a genome may be contributed by the nature of

pathogens that infect the plant species, or may be driven by another evolutionary force that could be

related to the success of one or another type of NBS-LRR genes (Leister, 2004, Lozano et al.,

2012). Moreover, TNL types are found widely in dicot genomes but not in monocot genomes,

whereas the CNL type is found in both monocot and dicot genomes (Bai et al., 2002).

The NBS-LRR genes were found to be distributed randomly and unevenly across the genomes. This

uneven distribution of NBS-LRR genes on chromosomes appears to be common in plants such as in

Arabidopsis, rice and poplar (Meyers et al., 2003, Zhou et al., 2004, Kohler et al., 2008, Yang et al.,

2008).

Most NBS-LRRs (59%) were found to be in clustered in the B. napus genome. This compares to

61% of all NBS genes in clusters in Arabidopsis (Meyers et al., 2003), 50% in Medicago truncatula

(Ameline-Torregrosa et al., 2008) and 58% in potato (Jupe et al., 2012). Large numbers of TNL

54

genes occur in clusters and tend to form large cluster, whereas, most of the CNLs were not in

clusters (singletons). This attributes to the predominance of TNLs to CNLs in B. napus.

Several identification approaches are available for NBS-LRRs, such as Pfam, mining ESTs, or

cloning resistance gene analogues (RGAs) using degenerate PCR. However, these methods each

have disadvantages. For example, RGAs has the limitation that the number of R genes identified is

affected by PCR and sequencing qualities. Pfam only finds gene models existing in the search

database, which might cause underestimation of the number of R genes. In this study, the MEME

and MAST analysis used in Jupe et al (2012) is a web based tool and the most common motif-

finding tool. The analysis is feasible and very efficient in identifying common specific motifs

between homologous protein sequences that show different motif compositions. By utilizing NBS-

LRR specific motifs from wide range of plant species, it enables 20 NBS-LRR specific motifs to be

distinguished between NBS-LRR subfamilies: TNLs and CNLs. In contrast, the domain based

identification of large NBS-LRR gene family members is very laborious and time-consuming. It

involves conserved domain searches followed by generating a hidden Markov models (HMMs)

profile. The domain based search does not identify homologous protein sequences if a domain has

diverged to another family.

Here, a combination of MAST and CNL/TNL analyses were utilized to have a more feasible and

accurate identification of potential candidate NBS-LRRs. The MEME/MAST analysis relies on 20

sequence motifs, putatively characteristic of NBS-LRR proteins, as queries against the predicted

genes of the B. napus genome to search for genes containing patterns matching the motifs giving an

E-value for each hit. Therefore, it is highly dependent on the gene prediction and annotation.

As expected, MEME/MAST analyses not only identified the number of NBS-LRR gene but, also

identified specific consensus sequence motifs for B. napus NBS-LRR genes showing similar motifs

sequence and patterns to those in other plants species such as Arabidopsis, potato, papaya, poplar

and Medicago. In contrast, consensus CNL and TNL analysis using either tBLASTn against

nucleotide sequence of the genome or BLASTp against amino acid sequence of the genome were

straightforward approaches to identify any gene or protein sequence with characteristics of NBS-

LRRs or even pseudogenes. The TNL and CNL consensus sequence BLAST analyses are suitable

for any plant genomes that have not yet been completed or assembled. In addition, CNL and TNL

consensus sequence were useful to ensure no related NBS-LRR genes are missing from the gene

prediction and mis-annotated. Thus, this analysis shows a good correlation between the two

methods indicating that it is deep enough for the genome wide identification of NBS-LRR genes

across the Brassica genomes.

55

The B. napus NBS-LRR genes were further divided into regular and non-regular NBS-LRR

encoding genes. The regular NBS-LRR genes have all domains and a complete open reading frame

whereas the non-regular NBS-LRR resistance genes have partial motifs of the NBS domain or lack

CC, TIR or LRR domains. They are probably considered as truncated or pseudogenes. For example,

TIR, TIR-NBS, and CC-NBS and NBS- LRR genes have been described in plant genomes and were

often distributed unevenly and surrounded regular NBS-LRR genes, for example in Arabidopsis

(Meyers et al., 2003), Medicago truncatula (Ameline-Torregrosa et al., 2008) and potato (Jupe et

al., 2012). One possible explanation for the presence of non-regular NBS-LRR genes is the partial

deletion of redundant NBS-LRR genes after whole genome duplication occurred. Interestingly, the

non-regular CNL or TNL genes were dispersed with regular genes among various clades throughout

the phylogenetic tree, indicating a diverse rather than a monophyletic origin due to domain deletion.

Thus, I anticipate that these non-regular NBS-LRR genes might show significant polymorphism

between B. napus cultivars.

Additionally, genes that encode for only a TIR domain lacking both NBS and LRR domains in B.

napus were reported. These genes might be truncated NBS LRR genes or a part of TIR containing

proteins as they have been previously identified in Arabidopsis and B. rapa (Meyers et al., 2002, Yu

et al., 2014). The majority of TIR genes were found residing in a cluster mixed with TNL and TN

suggesting they might have derived from TNs or TNLs and co-evolved (Yang et al., 2008). Thus,

TIR genes could interact and function together with TNL or NL proteins for their involvement in a

disease resistance response (Nandety et al., 2013).

The exon structure in TNLs is more complex than CNLs with an average of five exons in TNLs and

three exons in CNLs. The average number of exons in the total NBS-LRR is four exons. This result

is consistent with previous findings in B. rapa and A. thaliana. In B. rapa, the average exon number

in NBS-LRR genes was 4.2, but CNLs and TNLs have 2.3 and 5.2, respectively. Similarly, In A.

thaliana, NBS-LRR genes contain 4.2 exons, but CNLs and TNLs have 2.2 and 5.3 exons per gene

on average, respectively (Meyers et al., 2003, Mun et al., 2009).

2.5.2 Alignment and phylogenetic analysis

In the phylogenetic analysis using NBS domains, the NBS domain sequences in CNLs formed

distinct clades separated by higher genetic distances than in the TNLs. In addition, the average

number of genes per clades was similar for both TNLs and CNLs, despite there being a much larger

number of TNLs. This suggests that the NBS domains in CNLs were relatively more diverged in

sequence than TNLs due to greater selective pressure on the CNLs. This phenomenon was also

56

observed in previous studies in Fabaceae, Brassicaceae, Poaceae, and Solanaceae species (Bai et al.,

2002, Cannon et al., 2002, Wan et al., 2013). The results here concur with the hypothesis that the

CNL family is highly diverse and originated prior to the split between gymnosperms and

angiosperms while, the TNL family is more homogeneous and are found only in dicotyledons,

suggesting that the family arose after the divergence of monocotyledons and dicotyledons. In

addition, it can be attributed to the exon structure where the CNL proteins are mostly encoded by a

single exon, unlike TNLs that usually are encoded by a complex exon structure (Tarr and

Alexander, 2009). In this study, I also found that the TNL family of genes showed complex

structures and various domains (TIR, NBS, and LRR) which were often separated by introns and

usually consisted of many short exons in the LRR region.

Phylogenetic relationships also showed that the NBS-LRR genes located in the homeologous

chromosomes of the B. napus sub-genomes were likely to be found in the same clade. For instance,

the phylogenetic and sequence comparison between most of TNL genes in chromosomes A7 and C6

were found to be related to each other and located in the same clade suggesting that these genes

have been maintained without substantial divergence or selection pressure during B. napus

formation from its progenitor species. In addition, I found mixed clades dominated by sequences

from one chromosome (and usually from one or a small number of genomic clusters), but many also

contained small numbers of sequences from other chromosomes. This could arise in several ways:

by chromosomal rearrangements or by large-scale genomic duplication. Accordingly, this further

confirmed that gene duplication in B. napus might play a major role in the expansion and diversity

of NBS-LRR encoding genes.

2.5.3 NBS-LRR gene clustering

The combination of phylogenetic analyses and physical clustering of NBS-LRR genes was used to

study the nature of genes involved in clusters and whether they share homology. More

homogeneous than heterogeneous cluster were found (Figure 2.2 and 2.3). The existence of highly

related homologies in NBS-LRR gene clusters implies that NBS-LRR gene duplication can lead to

gain of resistance function (Grant et al., 1998). This is compared to heterogeneous clusters where

genes involved in the cluster are distantly related. These clusters of mixed unrelated genes could

have arisen as a result of either selective pressures or arranged and co-localized randomly in the

genome (Richly et al., 2002, Meyers et al., 2003). Moreover, clusters contain genes from non-

regular NBS LRR genes often located adjacent to regular NBS-LRR genes in gene clusters and this

is consistent with phylogenetic lineages. Therefore, the clustered organisation and arrangement of

these genes may be a critical attribute allowing the generation of novel resistance specificities via

57

intergenic rearrangement, duplication (especially tandem duplication) or gene conversion (Hulbert

et al., 2001).

Previously, it was hypothesized (Friedman and Baker, 2007) that R-gene distribution and quantity is

dependent on the size and cluster complexity of these genes so that these NBS-LRR genes keep

growing in number, complexity and expand the different functions of these genes. Thus, there could

be a correlation between the gene number and the size in the gene clusters as the large cluster size is

often associated with a higher gene number.

In addition, it has been suggested (Guo et al., 2011) that analyses of individual genes involved in

clusters provided evidence of gene diversity and selection pressure, suggesting NBS-LRR genes

within clusters to be more polymorphic and prone to evolutionary process while, singleton genes

remain a simple locus with no relatives in the genome reflect a relatively stable region. This is

supported by the finding that the gene clusters contained tandemly duplicated genes and formed

homogeneous clusters of many related genes.

2.5.4 NBS-LRR gene duplication

Since allotetraploid B. napus was formed from two ancestral diploid genomes of B. rapa and B.

oleracea via a polyploidization process, B. napus has undergone extensive genome rearrangement

and duplication. The level of gene duplication in the B. napus genome is thought to be significant

and there are high levels of genomic redundancy (Parkin et al., 2003, Rana et al., 2004, Parkin et al.,

2005). Thus, gene duplication, including tandem and whole-genome duplication (WGD) could have

resulted in a substantial increase or substitution of the gene number to compensate the gene losses

during formation of B. napus.

In this study, I found that 50% of B. napus NBS-LRR were duplicated, or in pairs of paralogs. The

distribution of duplicate genes was random which could be as a result of genome rearrangements or

gene loss/gain occurring after whole genome duplication events.

Intra-genomic and inter-genomic duplication events between B. napus NBS-LRR genes has been

observed suggesting that these genes possibly originated via tandem duplication where duplicated

genes are contiguous with its original copy. There was considerable collinearity between duplicated

genes in homeologous chromosomes, with the exception of some showing duplication between non

homeologous chromosomes due to possible translocation or genome mis-assembly.

58

In addition, block duplication between the B. napus homeologous chromosomes were observed and

considered as segmental duplication and defined as duplication of a chromosomal region that

includes more than one gene. It can be assumed that the duplication block might suggest gene losses

or rearrangement in B. napus. However, the level of genomic duplication reported based on NBS-

LRR genes is insufficient to provide a better resolution about the segmental duplication. Therefore,

this could be determined by analysis of sequence information flanking the NBS LRR genes.

A study by (Parkin et al., 2005) identified almost 90% of the B. napus mapped into conserved

syntenic blocks relative to Arabidopsis. In this study, I utilized the comparative duplication analysis

with conserved blocks to find the correlation with the intra-genomic and inter-genomic duplication

of NBS-LRR genes. This showed that these conserved blocks can be correlated with NBS-LRR

duplication, with about 70 to 85%. It was found that conserved blocks within Arabidopsis had

similar chromosomal rearrangements between A and C genome. This further supports my finding

that the NBS-LRR duplication between B. napus sub-genomes exhibited similar duplication

patterns suggesting that the NBS-LRR genes in B. napus might have experienced tandem gene

duplications and chromosomal rearrangements between the A and C genomes. However, the

duplications that did not correlate may be due to genome mis-assembly, or it may be due to location

of NBS-LRR genes in a region not covered by conserved blocks.

The duplication of NBS-LRR genes may be related to its physical location or cluster and occur

between chromosomes in two sub-genomes on B. napus. Theoretically, NBS-LRR derived from

gene duplications would share high similarity to NBS-LRR in the same or different genomic

clusters. It been suggested that duplication in combination with divergent selection acting on

duplicated genes, influences the generation of gene diversity within existing gene clusters (Parniske

and Jones, 1999, Richly et al., 2002, Wei et al., 2013). Approximately, 53% of duplicated genes

were involved in formation of gene clusters. This indicates that gene clusters could be chromosomal

hot spots in which the NBS genes get duplicated and then gene clusters may have been expanded by

tandem duplication followed by selection pressure (Wei et al., 2013). Indeed, previous studies

revealed that NBS-LRR genes are often present as tightly linked genes with high homology and are

prone to gene duplication and recombination, and thus evolve more rapidly than the rest of genome

(Grant et al., 1998). Therefore, the clustered distribution of NBS-LRR genes provides a reservoir of

genetic variation by which new specificities to pathogens can evolve via gene duplication

(Michelmore and Meyers, 1998, Yang et al., 2008). For instance, duplicated genes could be allelic

variants that show specificity for different pathogen strains. Together, these findings suggested that

59

tandem duplication in combination with recombination and selection pressure played a major role in

NBS gene expansion and diversity in B. napus.

60

Chapter 3

Comparative and conservation

analysis of NBS-LRR genes in

Brassica

*This chapter contributed to the science paper being published (Chalhoub et al., 2014)

61

Chapter 3 : Comparative and conservation analysis of NBS-LRR genes in

Brassica

3.1 Abstract

NBS-LRR genes are one of the most dominant gene families in plant genomes. The recent

completion of the genome sequences of B. napus and its related species: B. rapa and B. oleracea

provide an important opportunity for identification and comparative analysis of disease resistance

genes. The comparative analysis enables the understanding of the genomic structure and

conservation between Brassica species.

This chapter presents a comprehensive NBS-LRR comparative analysis between the three Brassica

species: B. napus, B. rapa and B. oleracea. From the genome wide NBS-LRR identification, a total

of 641, 249 and 443 NBS-LRR genes have been identified in B. napus, B. rapa and B. oleracea,

respectively. The distribution and synteny of NBS-LRR genes in the A genomes of B. napus and B.

rapa and C genomes of B. napus and B. oleracea are consistent and highly conserved in size and

orientation, particularly regions containing major gene clusters. This provided evidence that there

was a selective advantage to maintaining similar features of NBS-LRR genes in B. napus to both B.

rapa and B. oleracea following polyploidization. However, the differences in NBS-LRR gene

numbers could be attributed to gene losses or selection pressure to offer novel resistance genes in

the three species. More than 60% of NBS-LRR genes from the progenitor species were conserved

and maintained in B. napus. The differences in NBS-LRR content and conservation between B.

napus and its progenitors suggest differential selection for disease resistance that is either shared or

divergent.

This chapter provides an insight into NBS-LRR genome organisation and evolution between

Brassica species and enables understanding the candidate genes similarities and differences for

disease resistance traits.

62

3.2 Introduction

Comparative genomics is the study of sequence similarity/differences and chromosomal distribution

of gene homologues in related species. Comparative analysis provides an insight into genome

organisation, rearrangement and evolution between related species. The degree to which genome

comparison can facilitate cross-species analysis of gene evolution and function depends on the

conservation of gene content and synteny, collinearity between homologous chromosomes or

genomes.

The conservation of the NBS-LRR genes has been used to study the genomic architecture because

they are abundant, widely distributed throughout the genome and present in gene clusters. Genome

wide analyses of NBS-LRR genes in many genomes have showed that NBS-LRR genes are diverse

in number, structure and organization and suggested that these genes do not distribute randomly

within a genome (Baumgarten et al., 2003, Plocik et al., 2004, Yang et al., 2008, Zhang et al., 2011,

Gu et al., 2014, Yu et al., 2014). They often occur in a cluster at a specific location on the

chromosome. NBS-LRR genes and their syntenic positions were found conserved across the

genomes of tomato and potato (Pan et al., 2000).

Genomic comparison of NBS-LRR sequence similarity and chromosomal distribution in related

species is important to understand different perspectives towards NBS-LRR gene evolution. The

variation in NBS-LRR gene diversity within a species may be a fundamentally important strategy to

produce novel resistance specificities (Li et al., 2010). Evolutionary mechanisms generating

resistance gene diversity such as duplication, homeologous recombination and diversifying

selection have been proposed to contribute to the conservation and evolution of resistance genes

(Meyers et al., 2003, Ameline-Torregrosa et al., 2008, Ratnaparkhe et al., 2011). Comparative

analyses in soybean showed that the polyploidization did not result in increasing the number of

NBS-LRR genes. However, divergent selection acting on duplicated genes could result in creating

NBS-LRR number/cluster variation (Richly et al., 2002, Ashfield et al., 2012). NBS-LRR clustering

due to duplication and recombination may also be important factors that influence the evolution of

resistance genes.

Allotetraploid B. napus was formed from two ancestral diploid genomes of B. rapa and B. oleracea

via a polyploidization process. Previous studies have shown extensive genome collinearity and

structural conservation between Brassica species (Parkin et al., 2005, Cheung et al., 2009, Jiang et

al., 2011, Cheng et al., 2012a). However, considerable genome changes may have occurred via

duplication events (tandem duplication of genes), genomic rearrangements and homeologous

63

recombination between the A and C genomes. This often causes novel variations in B. napus

relative to its progenitors.

Polyploidization is considered to be the major mechanism of duplication in plants genomes. The

duplicated genes are produced by duplications arising from the alloploidy nature and post-

speciation duplications. The duplication arising by polyploidization originated from the ancestral

species, whereas the post speciation duplications take place independently after the polyploidization

event (Lawton-Rauh, 2003, Carretero-Paulet and Fares, 2012). In comparative genomics, it is

possible to examine the origin of duplications in polyploidy of B. napus through orthology,

paralogy and syntenic relationships to its ancestral species: B. rapa and B. oleracea. However, this

can be challenging in B. napus due the homeologous relationships between the progenitor genomes

as it is requires extensive genome wide analysis.

The completed genome sequences of the three Brassica species: B. napus (Chalhoub et al., 2014),

B. rapa (Wang et al., 2011) and B. oleracea (Parkin et al., 2014) offer a great opportunity for

functional and comparative analysis of genetic diversity and evolution between these Brassica

species. For instance, the effect of genomic variations on important genes such as disease resistance

genes can be studied.

Understanding the syntenic structure of NBS-LRR genes in ancestral genomes is required for

informative synteny analysis between different genomes. It also helps to study the genomic

rearrangements, such as duplication and translocation that might have occurred within the B. napus

genome. The comparative analysis serves as an invaluable resource for the identification and

cloning of the candidate genes and establishing an evolutionary relationship between species to

address the question of how NBS-LRR genes orthologous are highly conserved and evolved in B.

napus when compared to its two ancestral species in regards to disease resistance genetics.

The aim of this chapter is to find the NBS-LRR gene orthology and collinearity relationships

between B. napus and its diploid progenitor species: B. rapa and B .oleracea.

64

3.3 Methods

3.3.1 Brassica reference genomes

The genome sequence and annotation of B. napus cv. Darmor was used as the reference genome

(Chalhoub et al., 2014). The genome sequences of B. rapa v3.0 (Wang et al., 2011) and B. oleracea

cv. To1000 v1.0 (Parkin et al., 2014) were also used for comparative analysis.

3.3.2 Identification of NBS-LRR genes in B. rapa and B. oleracea

NBS-LRR candidate genes in B. rapa and B. oleracea were identified using the same methods in B.

napus as described in Chapter 2 section 2.3.2 and 2.3.4.

3.3.3 Reciprocal Best BLAST (RBB) of NBS-LRR for comparative analysis between B.

napus, B. rapa and B. oleracea

A reciprocal best BLAST (RBB) analysis was used to compare which of the NBS-LRR genes

predicted across the three genomes: B. napus, B. rapa and B. oleracea, were conserved. The NBS

LRR genes from B. rapa (v3.0) were compared with the identified NBS LRR genes from the B.

napus cv. Darmor (A genome) and vice versa using BLAST. The identified NBS-LRR genes from

B. oleracea (To1000 v1.0) were also compared with identified NBS-LRR genes from the B. napus

cv. Darmor (C genome), and vice-versa using BLAST.

The NBS-LRR genes were used to determine the chromosomal synteny between B. napus and B.

rapa as well as B. napus and B. oleracea according to their chromosome positions. Genes showing

orthology and collinearity were regarded as syntenic genes.

65

3.4 Results

3.4.1 Comparison of the distribution of B. napus NBS-LRR genes to its related diploid

species B. rapa and B. oleracea

The identification of NBS-LRR gene indicates that there are 641 NBS-LRR encoding genes in B.

napus. In the diploid species, a total 249 genes in B. rapa and 443 genes in B. oleracea were

identified (Table 3.1). The proportion of all genes encoding NBS-LRR in B. napus, B. rapa and B.

oleracea are 0.58%, 0.60% and 0.75%, respectively (Table 3.1).

Table ‎3.1: The total number of predicted NBS-LRR genes identified in the three Brassica genomes.

The table shows comparison between the methods used: MEME / MAST and CNL and TNL

BLAST

The number and distribution of NBS-LRR genes in the A and C genome of B. napus compared to

the A genome of B. rapa or C genome of B. oleracea are consistent across the chromosomes and

exhibit similar patterns (Tables 3.2 and 3.3 and Figures 3.1 and 3.2). The comparison of the total

number of NBS-LRRs and whether they were classified as TNL or CNL in both A and C genomes

also showed that the proportion of CNLs and TNLs was consistent across the three species, with

approximately 70% of NBS-LRR genes comprising TNLs (Figure 3.3). This represents a ratio of

1:2 of TNLs and CNLs in the A genome and 1:2.5 in C genome. The NBS-LRR in B. rapa and B.

oleracea were organised in gene clusters with approximately 53% and 61%, respectively. This is

very similar to the gene cluster percentage in B. napus (54% for A genome and 62% for C genome,

an average of 59%).

Species B. napus B. oleracea B. rapa

Genome size (Mbp) 1284 650 552

Gene content 111,479 59,225 41,019

Total candidate NBS-LRR genes 641 443 249

Proportion of NBS-LRR genes 0.58% 0.74% 0.60%

Identified in MAST output 615 437 248

Common to MAST and CNL/TNL analysis

509 379 208

Agreement % 79% 85% 83%

66

Table ‎3.2: The number of candidate NBS-LRR genes in B. napus. The total numbers of genes in

each chromosome with their classification into TNL or CNL as well as the reciprocal best BLAST

results are presented

Chromosome Total TNL CNL Reciprocal best BLAST

total conserved

TNL CNL A

gen

om

e B

. na

pu

s D

arm

or

A1 28 17 11 17 9 8

A2 44 33 11 26 24 2

A3 24 20 4 15 11 4

A4 13 12 1 6 6 0

A5 10 4 6 5 2 3

A6 21 7 14 11 4 7

A7 19 16 3 12 10 2

A8 24 17 7 12 8 4

A9 58 38 20 40 29 11

A10 11 7 4 5 5 0

Total in A genome

252 171

(68%) 81

(32%) 149 108 41

C g

en

om

e B

. na

pu

s D

arm

or

C1 37 24 13 25 14 11

C2 51 47 4 32 29 3

C3 56 46 10 33 27 6

C4 18 12 6 11 7 4

C5 19 11 8 14 7 7

C6 44 33 11 33 27 6

C7 38 30 8 32 25 7

C8 39 21 18 24 13 11

C9 79 62 17 54 42 12

Total in C genome

381 286

(75%) 95

(24%) 258 191 67

Unassigned 8 4 4 2 1 1

Total 641

461 (71%)

180 (28%)

409 300 109

67

Table ‎3.3: The number of candidate NBS-LRR genes in B. rapa and B. oleracea. The total numbers

of genes in each chromosome with their classification into TNL or CNL as well as the reciprocal

best BLAST results are presented.

Chromosome Total TNL CNL Reciprocal best BLAST

Total conserved

TNL CNL

A g

eno

me

B. r

ap

a

A1 18 10 8 16 8 8

A2 39 36 3 24 22 2

A3 29 25 4 16 12 4

A4 8 8 0 6 6 0

A5 11 2 9 5 1 4

A6 25 7 18 12 6 6

A7 19 16 3 13 11 2

A8 22 14 8 11 7 4

A9 59 40 19 41 29 12

A10 9 6 3 4 4 0

Unassigned 10 4 6 1 1 0

Total in B. rapa 249 168

(68%) 81

(32%) 149 107 42

C g

en

om

e B

. ole

race

a

C1 33 18 15 23 14 9

C2 52 43 9 30 27 3

C3 52 43 9 30 23 7

C4 24 15 9 10 5 5

C5 18 9 9 14 7 7

C6 45 32 13 31 24 7

C7 49 40 9 34 29 5

C8 41 23 18 27 12 15

C9 87 66 21 51 38 13

Unassigned 42 30 12 10 8 2

Total in B. oleracea

443 319

(72%) 124

(28%) 260 187 73

68

Table ‎3.4: Comparative analysis of synteny of NBS-LRR genes in B. napus to its progenitor

species

Chro Total Conserved Syntenic

genes

Syntenic genes in cluster

No. of conserved genes in clusters

No. of conserved genes not in clusters

A g

eno

me

B. n

ap

us

Dar

mo

r

A1 28 17 13 5 4 13

A2 44 26 20 16 17 9

A3 24 15 13 10 6 9

A4 13 6 3 2 2 4

A5 10 5 4 1 1 4

A6 21 11 7 2 3 8

A7 19 12 10 4 5 6

A8 24 12 10 5 5 6

A9 58 40 35 30 32 8

A10 11 5 4 1 0 5

Total in A genome

252 149 119 76 75 72

C g

en

om

e B

. na

pu

s D

arm

or

C1 37 25 21 12 13 12

C2 51 32 15 12 21 11

C3 56 33 26 20 20 13

C4 18 11 8 5 6 5

C5 19 14 14 10 10 4

C6 44 33 29 23 22 10

C7 38 32 31 25 21 11

C8 39 24 19 18 14 10

C9 79 54 42 33 33 20

Total in C genome

381 258 205 158 160 96

Unassigned

8 -- -- -- -- --

Total 641 409 324

(50%) 234

(72%) 235 168

69

Figure ‎3.1: The distribution of NBS-LRR genes in the A genome of B. napus and B. rapa

(Chalhoub et al., 2014).

Figure ‎3.2: The distribution of NBS-LRR genes in the C genome of B. napus and B. oleracea

(Chalhoub et al., 2014).

70

Figure ‎3.3: The proportion of TNL and CNL genes to the total number of NBS-LRR genes in B.

napus, B. rapa and B. oleracea.

Further NBS-LRR physical location of B. napus when compared to its diploid progenitors of B.

rapa and B. oleracea was performed to assess the NBS-LRR genes synteny and clustering (Figures

3.4, 3.5, 3.6 and 3.7). Comparison of the NBS-LRR genes of B. napus chromosomes with their

homeologous counterparts in the B. rapa and B. oleracea genomes indicated that the NBS-LRR

distribution showed clear syntenic and collinearity relationships between the A genome of B. napus

and B. rapa and the C genome of B. napus and B. oleracea. Approximately 324 (50%) of NBS-

LRR in the A and C genomes of B. napus were found to be syntenic and collinear to their respective

progenitor genomes (Table 3.4). They were highly consistent in size and orientation between

homeologous chromosomes in B. napus and its related species especially at the gene clustering

level (Figures 3.4, 3.5, 3.6 and 3.7). In addition, 72% of these syntenic and collinear genes in B.

napus were located within a gene cluster (Tables 3.4). However, there was evidence from local

gene clusters that the genes have been formed or lost between corresponding chromosomes of the

three Brassica species. In total, there were eight gene clusters formed in B. napus (four clusters in

the A genome and four clusters in the C genome) whereas there were ten gene clusters lost from the

B. napus (three clusters in the A genome and seven clusters in the C genomes) compared to the

diploid progenitors. For example, there are two gene clusters present on B. napus chromosome A1,

but absent in B. rapa. Likewise, two gene clusters are absent from B. napus chromosome C2 and C3

compared to B. oleracea.

71

Figure ‎3.4: Comparative organization of NBS-LRR genes between the An genome of B. napus and

Ar genome of B. rapa (chromosomes A1 to A5). Circles and triangles designate TNL and CNL

genes, respectively. Solid triangles and circles represent where the genes have orthologous copies,

blue and red symbols represent where genes were found to be in a cluster and grey symbols

represent where genes were not in a cluster. Unfilled symbols show genes without orthologs. A

solid line between genes represents genes that have a syntenic relationship in clusters, dotted lines

between genes show genes with syntenic relationships, but not in clusters. Chromosome lengths are

shown in Mb pairs. Published in (Chalhoub et al., 2014).

72

Figure ‎3.5: Comparative organization of NBS-LRR genes between the An genome of B. napus and

Ar genome of B. rapa (chromosomes A6 to A10). Circles and triangles designate TNL and CNL

genes, respectively. Solid triangles and circles represent where the genes have orthologous copies,

blue and red symbols represent where genes were found to be in a cluster and grey symbols

represent where genes were not in a cluster. Unfilled symbols show genes without orthologs. A

solid line between genes represents genes that have a syntenic relationship in clusters, dotted lines

between genes show genes with syntenic relationships, but not in clusters. Chromosome lengths are

shown in Mb pairs. Published in (Chalhoub et al., 2014).

73

Figure ‎3.6: Comparative organization of NBS-LRR genes between the Cn genome of B. napus and

Co genome of B. oleracea (chromosomes C1 to C5). Circles and triangles designate TNL and CNL

genes, respectively. Solid triangles and circles represent where the genes have orthologous copies,

blue and red symbols represent where genes were found to be in a cluster and grey symbols

represent where genes were not in a cluster. Unfilled symbols show genes without orthologs. A

solid line between genes represents genes that have a syntenic relationship in clusters, dotted lines

between genes show genes with syntenic relationships, but not in clusters. Chromosome lengths are

shown in Mb pairs. Published in (Chalhoub et al., 2014).

74

Figure ‎3.7: Comparative organization of NBS-LRR genes between the Cn genome of B. napus and

Co genome of B. oleracea (chromosomes C6 to C9). Circles and triangles designate TNL and CNL

genes, respectively. Solid triangles and circles represent where the genes have orthologous copies,

blue and red symbols represent where genes were found to be in a cluster and grey symbols

represent where genes were not in a cluster. Unfilled symbols show genes without orthologs. A

solid line between genes represents genes that have a syntenic relationship in clusters, dotted lines

between genes show genes with syntenic relationships, but not in clusters. Chromosome lengths are

shown in Mb pairs. Published in (Chalhoub et al., 2014).

75

3.4.2 Conservation analysis of B. napus NBS-LRR genes to its related diploid species B. rapa

and B. oleracea

In order to identify orthologous genes between each pair of genomes, the NBS-LRR genes

identified in the B. napus genome were individually compared to genes in both B. rapa and B.

oleracea by reciprocal best BLAST (RBB) to determine the number of reciprocal best hits for each

pairwise comparison.

In total 260 and 149 B. napus NBS-LRR have orthologous genes in B. oleracea and B. rapa,

respectively. NBS-LRR gene conservation was evident among the Brassica species representing

more than 60% of the genes in B. napus, B. rapa and B. oleracea which are conserved (retained)

(Tables 3.4 and 3.5). In contrast, 100 NBS-LRR genes were lost in the B. napus A genome and 183

in the C genome. There were also NBS-LRR genes present in B. napus which are absent in B. rapa

and B. oleracea; 103 and 121, respectively (Table 3.5).

In addition, the level of conservation (presence/absence and homology) tends to be higher within

gene clusters. There was high conservation in genes involved in gene cluster than singleton genes.

Approximately 80 (53%) and 160 (61%) of conserved genes in the A and C genome, respectively,

were within gene clusters. This compared to 96 (37%) and 72 (48%) of singleton genes were

conserved in A and C genome, respectively (Tables 3.4 and 3.5). There was conservation bias

towards TNLs in the A genome with 63% of TNLs conserved compared to 53% of CNLs

conserved. The conservation patterns do not seem to be consistent within the C genome (Tables 3.4

and 3.5).

Table ‎3.5: Comparative analysis of the number of NBS-LRR genes in the three Brassica species

and the reciprocal best BLAST hits

Genome B. oleracea C genome

B. napus Darmor

C genome

B. rapa A genome

B. napus Darmor

A genome

Total NBS LRR 443 381 249 252

Total TNL and CNL 319 , 124 286 , 95 168 , 81 171 , 81

Reciprocal BLAST hits conserved genes “retained”

260 (60%) 260 (75%) 149 (60%) 149 (60%)

Conserved TNL and CNL 187 , 73 192 , 68 107 , 42 108 , 41

% Conserved TNL and CNL 58% , 58% 67% , 71% 63% , 52% 63% , 52%

No. of conserved genes in cluster 160 (61%) 75 (50%)

No. of conserved genes not in cluster 96 (37%) 72 (48%)

Not conserved 183 121 100 103

Lost 183 0 100 0

Gained 0 121 0 103

76

3.4.3 Comparative analysis of chromosome A7 and C6 in B. napus

The genome wide NBS-LRR analysis enabled us to study the synteny and conservation comparison

of NBS-LRR genes on homeologous chromosomes A7 and C6 with their homologous counterparts

in B. rapa and B. oleracea. The gene content was similar with 19 NBS-LRR genes on chromosome

A7 of both B. rapa and B. napus, and 44 NBS-LRR genes on chromosome C6 in B. napus

compared to 45 genes in B. oleracea. There were 14 NBS-LRR genes involved in inter-genomic

duplications between B. napus A7 and C6 (Figure 3.8). The duplication events showed consistent

patterns with syntenic orthologous relationship. The comparison between chromosomes A7 genes in

B. napus and B. rapa showed 15 genes were orthologous and showed a clear syntenic relationship.

Similarly, 33 genes from chromosomes C6 in B. napus and B. oleracea were orthologous and

showed a clear syntenic relationship.

The comparison indicated that 15 and 33 genes on chromosomes A7 and C6, respectively, in B.

napus were produced by polyploidization. In addition, there were two tandemly duplicated genes on

chromosome C6 of B. napus produced by post-speciation (gene duplication) (Figure 3.8).

Phylogenetic analysis was further performed to study the relationships between the NBS-LRR

genes on A7 and C6. Figure 3.9 shows the phylogenetic analysis of NBS-LRR genes on

chromosomes A7 and C6 in B. napus and their homologs in its diploid progenitor species: B. rapa

and B. oleracea. As expected, the phylogenetic analysis separates the TNL and CNL gene products

into two distinct clades (Figure 3.9). TNL and CNL groups were further divided into distinct

subgroups represented by conserved and orthologous genes from all the three species grouped

together. The number of NBS-LRR genes for the three species in each subgroup was almost

identical. For example, two TNL subgroups were found to be the largest, containing 19 NBS-LRR

genes in total. A greater part in this subgroup was from B. napus (nine genes) compared to five

genes from B. rapa and five genes from B. oleracea.

77

Figure ‎3.8: Comparative chromosomal map of NBS-LRR genes on chromosomes A7 and C6

between B. napus (Bn), B. rapa (Br) and B. oleracea (Bo). The squares and triangles represent

NBS-LRR type: TNL and CNL, respectively. Genes in syntenic relationships are connected by

lines. Red lines show inter-genomic duplications between B. napus A7 and C6. Closed bars are

orthologous genes. Genes within green squares are in a gene cluster.

78

Figure ‎3.9: A phylogenetic tree of TNL and CNL genes located on chromosomes A7 and C6 in B.

napus and the diploid species B. rapa and B. oleracea based on a neighbour-joining method using

MEGA 6.0 software. The numbers on branches indicate the percentage of 1000 bootstrap replicates.

The coloured squares indicate the gene origin: red from B. napus, blue from B. rapa and green from

B. oleracea.

79

3.5 Discussion

3.5.1 Comparison of the distribution of B. napus NBS-LRR genes to its related diploid

progenitor species B. rapa and B. oleracea

In this study, a total of 249 and 443 NBS-LRR genes in B. rapa and B. oleracea, respectively, were

identified. This is compared to 641 NBS-LRR genes in the B. napus cv. Darmor genome. This

shows a gene reduction in B. napus compared to the diploid progenitors, which meets the

expectation that the number of NBS-LRR genes is reduced in B. napus compared to B. rapa and B.

oleracea due to gene losses during the polyploidization events.

A recent study (Yu et al., 2014) has identified 206, 176 and 157 NBS-LRR genes in B. rapa, B.

oleracea and Arabidopsis, respectively. The NBS-LRR genes were identified through Hidden

Markov Model (HMM) profile corresponding to the Pfam NBS family domain. The number of

NBS-LRR identified in B. rapa was close to what is identified in here, but, the number of NBS-

LRR in B. oleracea was underestimated compared to this study. They also found that the number of

NBS-LRR genes in these three species was very close despite the differences in genome size and

complexity. This is surprising as B. rapa and B. oleracea genomes would be expected to contain

higher numbers of NBS-LRR genes than Arabidopsis because the B. rapa and B. oleracea genomes

have undergone extensive genome triplication events since divergence from the common ancestor

of Arabidopsis. The difference may be due to a poorer quality genome sequence assembly than was

used in this study.

The comparison of the total number of NBS-LRRs and whether they were classified as TNLs or

CNLs in both the A and C genomes showed that the proportion of CNLs and TNLs was consistent

across the three species, with about 70% of NBS-LRR genes comprising TNLs. This represents a

ratio of 1:2 of TNL and CNL in the A genome and a ratio of 1:2.5 in the C genome. This is

consistent with findings in Arabidopsis (Meyers et al., 2003).

The proportion of all genes encoding NBS-LRR in B. napus, B. rapa and B. oleracea are 0.58%,

0.60% and 0.75%, respectively, which is in line with estimates for other plant species that range

between 0.6- 1.8% (Meyers et al., 2003, Monosi et al., 2004, Ameline-Torregrosa et al., 2008,

Kohler et al., 2008, Yang et al., 2008). Thus, there does not seem to be correlation between genome

size and the number of NBS LRR genes. This is consistent with a previous study which suggested

that the relationship between total predicted gene number and total NBS-LRR gene number is likely

to be non-linear and not proportional to genome size (Porter et al., 2009).

80

The distribution and synteny of NBS-LRR genes in the A genomes of B. napus and B. rapa and C

genomes of B. napus and B. oleracea are consistent. Indeed, they are highly conserved in size and

orientation particularly region containing major gene clusters since 72% of these syntenic and

collinear genes in B. napus were located within gene cluster. The gene conservation and syntenic

relationship between sub-genomes has been found previously in other plants (Lagercrantz, 1998,

Grant et al., 2000, McCouch, 2001). However, loss of synteny for a gene cluster or presence of non-

orthologous genes in a local cluster suggests gene deletion, translocation or divergence in the B.

napus genome. This study also confirmed the absence or presence at the DNA level of gene clusters

by which a gene cluster is formed in B. napus, whereas there is no corresponding gene cluster in the

diploid genome of either B. rapa or B. oleracea in syntenic regions in the chromosomes.

Overall, these comparisons do not reveal significant deletion of NBS-LRR in B. napus as compared

to the diploid species. Thus, B. napus may have been subjected to less evolutionary selection or

pressure leading to maintaining NBS-LRR properties with its diploid progenitors.

3.5.2 Conservation analysis of B. napus NBS-LRR genes to its related diploid species B. rapa

and B. oleracea

The reciprocal best BLAST between the three genomes indicated that about 60% of NBS-LRR

genes were conserved and retained in the three Brassica species. These conserved orthologous

genes indicate that they have been maintained after polyploidisation. However, there could be an

underestimation in conserved orthologous genes due to the fact that reciprocal best BLAST counts

only once if a gene is duplicated. Previous studies on the fate of duplicated genes in polyploids

found that the duplicated resistance genes are preferentially lost after genome duplication, most

likely via a deletion mechanism (Nobuta et al., 2005, Yu et al., 2005, Yang et al., 2008). After

hybridisation of B. napus from B. rapa and B. oleracea, NBS-LRR genes in Brassica species

experienced species-specific gene deletion resulting in gene losses. A similar phenomenon of

redundant NBS-LRR gene loss has also been reported in other plant genomes presumably because

the duplicates would have redundant functions (Bancroft, 2001, Rana et al., 2004).

The duplicate genes were classified as resulting from polyploidization or post-speciation. Based on

their conservation and synteny, it can be difficult to distinguish among orthologs and paralogs

without extensive genome wide analysis. However, the results gave indication that a large number

of duplicated genes were originated via polyploidization from the ancestral species: B. rapa and B.

oleracea, while the tandemly duplicated genes were more likely produced by post speciation

duplications.

81

Gene conservation in syntenic region between B. napus and its related species: B. rapa and B.

oleracea suggests that orthologous copies are likely to retain their original function whilst, non-

orthologous genes in B. napus may have become specific or gained novel disease resistance. A

study by Yang et al. (2008) suggested that species specific NBS-LRR genes tend to be chromosome

specific and arranged in one or a small number of genomic cluster. In my study, the conservation

analysis revealed that there were more conserved NBS-LRR genes in gene clusters than singletons

suggesting that the chromosomal locations of NBS-LRR gene clusters show significant level of

conservation between B. napus and its related species: B. rapa and B. oleracea. In addition, this

also may suggest different evolutionary patterns for singleton and clustered NBS-LRR genes.

Further studies would be required to investigate whether the orthologous genes maintained in B.

napus show similar or divergent gene expression. In previous studies, gene expression data has

suggested that most duplicated NBS-LRR genes exhibited different expression patterns (Tan et al.,

2007, Cheng et al., 2012b).

The differences in NBS-LRR genes between the B. napus genome and its diploid species: B. rapa

and B. oleracea represent genes losses compared to B. rapa and B. oleracea as well as genes gained

in B. napus. The NBS-LRR genes reduction is expected in B. napus compared to B. rapa and B.

oleracea mainly due to gene losses via deletion during the polyploidization events. At the same

time, gene duplication and recombination have also created novel NBS-LRR resistance genes

specific to B. napus and have resulted in substitution of the gene number to compensate the gene

losses during formation of B. napus. A study by Yue et al. (2012) suggested that the NBS-LRR

gene number difference within species is either driven by gene losses or expansion through

duplication or divergence. Moreover, previous studies have shown evidence of gene loss and gain in

B. napus (Cho et al., 2010, Chalhoub et al., 2014). It could be speculated that the NBS-LRR genes

lost in B. napus suggested that these genes in B. rapa and B. oleracea might be species or disease

specific and have undergone different selection pressure to offer a novel resistance to same

pathogen or some pathogens for B. napus.

Together, the variation in NBS-LRR gene number between B. napus and the diploid species

indicates that only a single copy of some NBS-LRR genes may be conserved in the B. napus

genome, whilst the presence of resistance genes in B. napus and not the diploids may suggest

exposure to different pathogen stresses between the species, which has driven NBS-LRR genes

evolution. Furthermore, the differences in NBS-LRR gene numbers and conservation between B.

82

napus and its progenitors could be also attributed to genome mis-assembly and/or annotation

differences. Moreover, the cultivar specific differences are another factor affecting the NBS-LRR

variation as some NBS-LRR genes were present in one genotype but absent in another.

3.5.3 Comparative analysis of chromosomes A7 and C6 in B. napus

The number of NBS-LRR genes on chromosomes A7 and C6 between the three species were

consistent suggesting that they might have been maintained and not affected much during the

formation of B. napus.

Moreover, a comparative phylogenetic relationship of the NBS-LRR genes located on

chromosomes A7 and C6 in B. napus B. oleracea and B. rapa represents two major groups of TNL

and CNL genes from the three species. The phylogenetic comparison revealed that NBS-LRR

sequences from the three species are grouped together indicating that these are highly conserved

and suggests gene duplication from a common ancestral source of NBS-LRR genes. It also showed

that orthologous genes located in the homeologous chromosomes were likely found in the same

subgroups. This further confirmed that tandem duplication plays a major role in the duplication of

NBS-LRR genes. Thus, it appears that these NBS-LRR genes in B. rapa and B .oleracea may have

undergone similar selection pressure to offer novel resistance to same or different pathogen for B.

napus.

This study found several candidate NBS-LRR genes in B. napus that could confer resistance to

blackleg disease caused by L. maculans, the most devastating fungal pathogen of B. napus. These

genes were conserved orthologous genes in B. rapa and shared a duplicated copy in both

chromosomes A7 and C6 suggesting chromosomal duplication or rearrangements of this region. An

interesting NBS-LRR cluster on B. napus A7 and B. rapa A7 associated with candidate resistance

genes was identified. Further comparison analyses are discussed in more details later in chapter 5.

83

Chapter 4

Association and localization of NBS-

LRR genes with disease resistance

QTL in Brassica napus

84

Chapter 4 : Association and localization of NBS-LRR genes with disease

resistance QTL in Brassica napus

*This chapter contributed towards a paper being submitted

4.1 Abstract

The availability of the genome sequence and disease resistance NBS-LRR genes, together with

linked or flanking markers and QTL genetic maps, provide an efficient strategy to examine the

characteristics of QTL intervals. The association and co-localization between NBS-LRRs and

disease resistance QTL is a common feature of plant genomes. Here, this chapter presents the study

of the association and correlation between disease resistance QTL intervals and the NBS-LRR

resistance genes in Brassica napus.

A total of 27 QTL intervals for qualitative resistance to clubroot, downy mildew and blackleg

disease resistance traits in B. napus were identified and classified into primary and homeologous

QTL intervals. It also indicated that the QTL intervals in B. napus were originally contributed by

the parental species: B. rapa and B. oleracea. The identified QTL intervals appeared to co-localize

with NBS-LRR genes. There was a significant correlation between the number of NBS-LRR genes

within the QTL intervals and the number of genes involved in gene clusters. These gene clusters

and duplicated genes were likely to be associated with QTL.

The results provide evidence that NBS-LRR gene clusters are associated with disease resistance

QTL. The comparative analysis of disease resistance QTL between B. napus and its related species

reveal the structural conservation of NBS-LRR genes and provide a set of candidate resistance

genes. Further investigation of the similarity/divergence in sequence and gene content of these

QTLs will help elucidate their conservation and evolution.

85

4.2 Introduction

In the three major cultivated Brassica species, B. napus B. rapa, and B. oleracea, clubroot (caused

by Plasmodiophora brassicae), downy mildew (caused by Hyaloperonospora Parasitica) and

blackleg (caused by Leptosphaeria maculans) diseases have become an increasing problem causing

heavy yield losses to Brassica crops world-wide and have been a major focus in Brassica disease

resistance research (Delourme et al., 2011).

Genetic studies have identified the gene for gene interactions between Avr genes in L. maculans and

their corresponding Rlm (Resistance to Leptosphaeria maculans) genes in B. napus. All the

blackleg resistance genes identified to date have been found on the A genome of B. napus, with

none found on the C genome (Rouxel et al., 2003b, Ananga et al., 2006, Leflon et al., 2007). In

addition, genetic mapping studies have shown that all blackleg resistance genes in B. napus are

organised in clusters in two genomic regions: the Rlm2 resistance gene on chromosome A10 (Pilet

et al., 2001) and five specific resistance genes: Rlm1, Rlm3, Rlm4, Rlm7, and Rlm9 on chromosome

A7 (Delourme et al., 2004, Mayerhofer et al., 2005). It remains unclear whether these genes are a

cluster of tightly linked genes, or a single gene with different alleles, or a combination of both

(Delourme et al., 2004). Other blackleg resistance genes have also been identified on the A genome

of B. rapa: LepR1 on chromosome A2 and LepR2 on chromosome A10 (Yu et al., 2005) , LepR3 on

chromosome A10 (Li and Cowling, 2003, Larkan et al., 2014) and BLMR1 and BLMR2 on

chromosome A10 (Long et al., 2011). Recently, the first blackleg resistance gene in Brassica,

LepR3, has also been cloned in B. napus lines with introgressions from B. rapa ssp. Sylvestris

(Larkan et al., 2013).

Different sources of resistance to clubroot have been found in B. oleracea, B. rapa and B. napus

(Hirai, 2006, Diederichsen et al., 2009). At least eight clubroot resistance loci have been identified

in B. rapa and allocated to five different chromosomes: Crr1 on A8, Crr2 on A1, Crr4 on A6

(Suwabe et al., 2006), CRc on A2 (Sakamoto et al., 2008), CRa, CRb , CRk and Crr3 on A3 (Piao

et al., 2004, Saito et al., 2006, Hayashida et al., 2008, Sakamoto et al., 2008). In B. oleracea,

various QTL have also been reported (Landry et al., 1992, Figdore et al., 1993, Grandclément and

Thomas, 1996, Voorrips et al., 1997, Moriguchi et al., 1999, Rocherieux et al., 2004, Nomura et al.,

2005). Recently, two major QTL: PbBo(Anju)1 and PbBo(Anju)3 on chromosomes C2 and C3,

respectively, and other QTLs with minor effects have been identified (Nagaoka et al., 2010). This

study also showed that the B. rapa chromosomal regions harbouring B. rapa clubroot resistance

gene specific markers are homologous to the corresponding region of B. oleracea.

86

Several major qualitative resistance loci to downy mildew have been identified in Brassica species.

In B. oleracea, four resistance loci have been identified: Pp523 on chromosome C8(Farinhó et al.,

2004, Farinhó et al., 2007, Carlier et al., 2011, Carlier et al., 2012), the dominant single locus Ppa3

(Singh et al., 2012), downy mildew resistance locus Dm (Giovannelli et al., 2002) and

PpALG1(Alabaça, 2009). In B. rapa, three resistance loci were found, BraDM on chromosome A8

(Yu et al., 2009, Li et al., 2011, Yu et al., 2011b), the BrRHP1 locus on chromosome A1 (Kim et

al., 2011) and candidate resistance gene locus RPP13MK on chromosome A9 (Yu et al., 2010). In

addition, multiple sources of resistance have been reported in B. napus (Nashaat et al., 1997, Ge et

al., 2008).

Although numerous disease resistance QTL and qualitative genes mapped as QTL have been

identified, isolating and cloning a gene underlying the QTL/gene mapped region is relatively

difficult through map based cloning because the QTL region is still large and often not tightly

linked to the candidate gene(s). This is often further complicated by the complexity of the genomic

region due to duplication and rearrangement events. The availability of functional genome

annotations makes it possible to investigate the association and distribution between NBS-LRR

resistance genes and disease resistance QTL. The association of qualitative resistance (R) genes or

major resistance genes and QTL has been widely noted (Gebhardt and Valkonen, 2001, Quint et al.,

2003, Wisser et al., 2005). Candidate NBS-LRR resistance genes were previously found to be co-

localized with disease resistance QTL such as barley, potato and apple (Ramalingam et al., 2003,

Calenge et al., 2005, Bakker et al., 2011). Therefore, the significant co-localization of NBS-LRR

and QTL provide an excellent template for further studies of the involvement of NBS-LRR with

disease resistance QTL and genetic mapping of qualitative genes in B. napus.

The genetic maps are an important component representing sets of inherited DNA markers in

relation to the position of a gene conferring a trait. In contrast, the physical gene mapping

determines the location and function of a gene on a chromosome and its relative distance to the

QTL. Thus, genomic approaches can be used to analyse the disease resistance QTL using a

homology search. The sequence information and genome position of markers linked to, or flanking,

the previously reported QTL can be used to identify the physical boundaries of the QTL region onto

genome chromosomal maps. This allows narrowing down the size and position of the QTL to a

small genomic region in which candidate genes can be found based on the genome wide

identification of disease resistance NBS-LRR genes. Once the physical distance of the QTL and

flanking markers are determined, the correlation or association between a given disease’s QTL

regions and the associated NBS-LRR genes is further studied to enable prioritization and selection

of potential NBS-LRR candidate genes. Additionally, NBS-LRR genes associated with the QTL

87

physical and flanking markers can be used as a comparative approach to study the conservation and

syntenic regions between B. napus and its progenitor species, B. rapa and B. oleracea.

This chapter describes the physical location identification of disease resistance QTL intervals for

three important disease resistance traits in B. napus: blackleg, clubroot and downy mildew. It also

presents the physical association between QTL intervals and the numbers of NBS-LRR genes to

address the question whether the clustered and duplicated NBS-LRR genes are structurally and

functionally relevant to disease resistance QTL characteristics. If the clustering of functionally

related disease resistance genes occurred within QTL as a result of evolutionary mechanisms such

as duplication and divergent selection pressure, it is possible that the NBS-LRR gene/cluster

associated with QTL contributes to genetic variation and is under selection for the functional

conservation.

88

4.3 Methods

4.3.1 Brassica reference genome sequences

The genome sequence and annotation of B. napus cv. Darmor v6 was used as the reference genome

(Chalhoub et al., 2014). The genome sequence of B. rapa v3.0 (Wang et al., 2011) and B. oleracea

cv. To1000 v1.0 (Parkin et al., 2014) were also used for comparative analysis. The NBS-LRR gene

annotation and identification was described previously in chapters 2 and 3.

4.3.2 Identification of QTL intervals in B. napus by marker homology BLAST search

To identify the location of QTL intervals on the genome, available marker data and sequences were

retrieved from previously published studies (Tables 4.1, 4.2 and 4.3). These sequences were used

to perform homology searches against the reference genomes using BLAST (maximum E-value 1E-

1) (Altschul et al., 1997). Markers with 100% identical sequence alignments, with over 90%

coverage and that each marker of a QTL region were placed on the same chromosome

corresponding to the genetic map were selected. If the markers could not be placed on the

chromosome, or had no BLAST hit, they were discarded from the analysis.

The physical position of each marker was assigned based on the BLAST hit position of the marker

sequences against the reference genomic sequence. The genetic locations of associated QTL

intervals were estimated based on the distance between linked or flanking markers. For each QTL

interval, any NBS-LRR genes located within the markers flanking the QTL interval were

considered associated and included in the analysis. The correlation between the number of NBS-

LRR genes and disease QTL intervals was analysed using linear regression in Microsoft Excel.

For co-mapping the QTL intervals between B. napus and its progenitor species, B. rapa and B.

oleracea, the BLAST searches of the QTL linked or flanking markers with respect to their physical

hit locations, order and coverage were compared between them. A syntenic region in B. rapa and B.

oleracea was characterized by marker locations present in the same chromosome and in the same

order as in B. napus.

89

4.4 Results

4.4.1 Identification and localization of QTL intervals in B. napus

The genetic mapping studies (Piao et al., 2009, Delourme et al., 2011) indicated the presence of

several resistance genes for clubroot and downy mildew in the Brassica A and C genomes with

most of these genes showing race specificity. Similarly, blackleg resistance is controlled by a major

gene for gene interaction and the resistance is contributed by the Brassica A genome. Thus, these

identified QTL intervals are for qualitative resistance controlled by major resistance genes that have

been identified in B. napus, B. rapa or B. oleracea.

A number of flanking, or linked, markers of these disease resistance QTL for qualitative genes in

the three species: B. napus, B. rapa, and B. oleracea, were utilized to determine the physical

location of QTL intervals on the B. napus genome. The QTL flanking or linked markers were

initially screened by BLAST homology searches to locate their physical positions on the genome,

then, the location of QTL intervals on the genome were estimated.

A total of 27 QTL intervals for qualitative resistance to clubroot, downy mildew and blackleg in B.

napus were identified. The number of QTL detected ranged from 12 QTL intervals for clubroot and

downy mildew and three QTL intervals for blackleg (Tables 4.1, 4.2 and 4.3). The distribution of

these QTL was found across all 19 of the B. napus chromosomes except A4, A5 and C6. The length

of the QTL intervals ranged from 0.7 to 23 Mbp with the majority (66 %) of the identified QTL

intervals less than 2.8 Mbp in length (Table 4.4).

Twelve QTL intervals involved in clubroot resistance were identified in B. napus (Table 4.1 and

Figure 4.1). These QTL intervals corresponded to the eight and two clubroot resistance loci which

have been identified in B. rapa and B. oleracea, respectively. These 12 QTL intervals were

classified into eight primary QTL intervals corresponding to their respective genetic maps and four

homeologous QTL intervals due to the amphidiploid nature of B. napus. The eight primary QTL

intervals in B. napus are Crr1 on chrA8, Crr2 on chrA1, Crr3 and CRk on chrA3, CRa and CRb on

chrA3, Crr4 on chrA6, CRc on chrA2, pbBo(Anju)1 on chrC2 and PbBo(Anju)3 on chrC3 (Table

4.1). The four homeologous QTL intervals in B. napus are Crr3 on chrC3, Crr4 on chrC6, CRa and

CRb on chrC7 and pbBo(Anju)1 on chrA2 (Table 4.1). The primary Crr3 QTL is overlapping with

the primary CRk QTL, so they are considered as one primary QTL for both Crr3 and CRk locus.

The primary and homeologous CRa QTL are overlapping with the primary and homeologous CRb

QTL, so they are considered as one primary QTL and one homeologous QTL for both CRa and CRb

locus.

90

Likewise, 12 QTL intervals involved in downy mildew resistance were identified in B. napus

(Table 4.2 and Figure 4.1). These QTL intervals were corresponding to the three mildew

resistance loci which have been identified each in B. rapa and B. oleracea. These QTL intervals

were also classified into seven primary QTL intervals and five homeologous QTL intervals. The

seven primary QTL intervals in B. napus are BraDM on chrA8, RPP13MK on chrA9, BrRHP1 on

chrA1, two Pp523 on chrC8 and PpALG1 on chrC4 and Dm on chrC2 (Table 2). The five

homeologous QTL intervals in B. napus are RPP13MK on chrC9, two BrRHP1 on chrC1, two

Pp523 on chrA9 (Table 4.2). The BrRHP1 locus had two homeologous QTL intervals separated by

a region of ~14 Mbp, so they are considered as two separate QTL intervals. The Pp523 locus had

two primary QTL intervals and two homeologous QTL intervals separated by a 19 Mbp region, so

they are considered as two separate QTL intervals.

The flanking markers search and QTL intervals for blackleg resistance was restricted only to the A

genome of B. napus because genetic mapping studies showed that all blackleg resistance genes are

found on the A genome (Rouxel et al., 2003b, Ananga et al., 2006, Leflon et al., 2007). In addition,

the homology search of flanking, or linked markers for the blackleg Rlm QTL revealed that the

flanking markers showed good correspondence between the A genome of B. napus and B. rapa.

However, the flanking markers did not show sequence and collinear similarity between the A and C

genome of B. napus and some markers even failed to be located in the C genome. As a result, the

flanking marker analysis in the C genome of B. napus was not considered in this study.

Hence, three QTL intervals involved in blackleg resistance were identified in the A genome of B.

napus (Table 4.3 and Figure 4.1). These three QTL intervals were small QTL intervals on

chromosome A2 for resistance loci, LepR1, major QTL on chromosome A7 for resistance genes:

Rlm1, Rlm3, Rlm4, Rlm7, and Rlm9 and major QTL on chromosome A10 for resistance genes:

LepR3, BLMR1 and BLMR2. The Rlm2 and LepR2 genes could not be mapped in this study because

there was no available markers sequence. The LepR2, LepR3, BLMR1 and BLMR2 genes were

identified in B. rapa. However, the B. napus genome might harbour orthologous genes for LepR2,

LepR3, BLMR1 and BLMR2 located in the same position in B. rapa.

The overlapping QTL intervals were identified as the QTL flanking markers were found to overlap

by their physical position and order on the genome. For example, clubroot primary and

homeologous QTL intervals for resistance loci CRa and CRb were overlapping in the A and C

genome of B. napus. Furthermore, clubroot QTL intervals for resistance loci Crr3 and CRk were

overlapping in the A genome but only the Crr3 locus showed homeologous QTL intervals in the C

genome.

91

The QTL intervals for downy mildew BrRHP1 had primary QTL intervals in the A genome of B.

napus and two adjacent homeologous QTL intervals separated by a region in the C genome.

Similarly, QTL intervals for downy mildew for Pp523 had two adjacent primary QTL intervals in

the C genome of B. napus and two adjacent homeologous QTL intervals separated by a region in

the A genome.

The co-mapping of the flanking markers of QTL intervals were performed to compare the syntenic

relationship between B. napus and its progenitor species. The physical distance of flanking markers

and location of the QTL intervals between B. napus and B. rapa and B. oleracea were mostly

consistent. Almost all QTL intervals identified in B. napus were co-mapped and contributed by the

progenitor species. However, there were several QTL intervals only co-mapped to either the A or C

sub-genome B. napus and its corresponding genome in the related species. For example, the

clubroot loci, Crr1 and Crr2 had only primary QTL intervals in the A genome of B. napus and they

co-mapped to the A genome of B. rapa.

92

Table ‎4.1: Marker homology search of clubroot resistance QTL flanking or linked markers for identification of QTL intervals in B. napus

Origin species

Gene or locus

Flanking marker

Marker homology search, Chromosome and QTL interval region (Mbp)

References B. napus v6

primary B. napus v6

Homeologous B. rapa v3 B. oleracea v1

B. rapa

Crr1

BRMS-088 BRMS-297 BRMS-173

chrA8 (9,157,719 -12,043,127)

No XA8 (9,066,423 -11,186,261)

No (Suwabe et al., 2003, Suwabe et al., 2006, Suwabe et al., 2012)

Crr2

BRMS-096 BRMS-056 BRMS-100 BSA3

chrA1 (4,791,456 -6,335,486 )

No XA1 (4,934,358 -6,345,669)

No (Suwabe et al., 2003, Suwabe et al., 2006, Nagaoka et al., 2010)

Crr3*

BRMS058 BrSTS-54 BrSTS-20 BrSTS-44 BrSTS-41 BrSTS-33 OPC11-2S BrSTS-61 BrSTS-78 BrSTS-26 BN308CAPS BN142CAPS OPC11-1S BRMS206

chrA3 (14,411,852 -18,264,109)

chrC3 (22,088,816 -24,102,086)

XA3 (14,571,901 -18,141,486)

C3 (21,404,891 -23,862,433)

(Hirai et al., 2004, Saito et al., 2006, Nagaoka et al., 2010)

Crr4 BRMS-125 BRMS-101 BRMS-027

chrA6 (100,821 - 2,122,156)

chrC6 (9,621,029 -3,923,239)

XA6 (322,565 -1,915,926 )

C6 (31,002,743 -35,909,761)

(Suwabe et al., 2006, Sakamoto et al., 2008)

CRa**

HC352b-SCAR GC2360 SC2930

chrA3 (24,281,973 -25,363,967)

chrC7 (43,472,000 -44,774,737)

XA3 (8,639,804 -7,449,703)

C7 (41,356,755-42,582,200)

(Sakamoto et al., 2008, Matsumoto et al., 2012)

93

HC181

CRb**

TCR05 KBrH059N21F KB59N07 B1210 B1005 KBrH129J18R TCR09 KBrB091M11R TCR02-F

chrA3 (24,657,890 – 26,478,686)

chrC7 (43,811,909 – 45,640,146)

XA3 ( 6,596,824- 8,359,700)

C7 (41,700,554 – 43,396,450)

(Piao et al., 2004, Kato et al., 2012, Kato et al., 2013)

CRc

PW155

BRMS-082

m6R

BRAS011

B50

chrA2 (12,021,130 – 23,656,219)

No XA2 (11,817,414-21,197,733)

No (Sakamoto et al., 2008, Matsumoto et al., 2012)

CRk* HC688

BRMS-158

chrA3 (16,540,113 -19,198,438)

No XA3 (17,852,750 -15,564,520)

No (Sakamoto et al., 2008, Matsumoto et al., 2012)

B. oleracea pbBo(Anju)1

pW176

KBrH059L13R

BRMS-228

chrC2 (51,280,390-54,799,276)

chrA02 (26,445,222-28,289,273)

XA2 (24,048,282-25,751,675)

C2 (47,638,250-50,630,426)

(Nagaoka et al., 2010)

PbBo(Anju)3

BRMS-330

KBrH008J04R

BRMS-008

chrC3 (2,031,456 -3,680,316)

No XA3 (30,530,122 -32,176,756)

C3 (2,048,895 -3,655,161)

(Nagaoka et al., 2010)

(*)The primary Crr3 QTL is overlapping with the primary CRk QTL, so they are considered as one primary QTL for both the Crr3 and CRk locus.

(**) The primary and homeologous CRa QTL are overlapping with the primary and homeologous CRb QTL, so they are considered as one

primary QTL and one homeologous QTL for both the CRa and CRb locus.

94

Table ‎4.2: Marker homology search of downy mildew resistance QTL flanking or linked markers for identification of QTL intervals in B. napus

Origin species

Gene or locus

Flanking marker

Marker homology search, Chromosome and QTL interval region (Mbp)

References

B. napus v6 primary

B. napus v6 Homeologous

B. rapa v3 B. oleracea

v1

B. rapa BraDM SCAR SCK14-825 K14-1030 kbrb058m10-1 Brh019-SNP137 Brb062-Indel230

Brb094-DraⅠ787 kbrb006c05-2 Ol12G04

Brb043-BglⅡ715

Brb094-AatⅡ666

chrA8 (17,655,647 – 18,527,339)

No XA8 (3,003,191 – 3,715,750 )

No (Yu et al., 2009, Li et al., 2011, Yu et al., 2011b)

RPP13MK RPP13MK marker chrA9 3,964,312

chrC9 5,472,171

XA9 35,017,366

C9 6,297,979

(Yu et al., 2010)

BrRHP1* A03 SCAR N18 SCAR M05 SCAR BrPERK15A SCAR BrPERK15B SCAR J11 SCAR G17 SCAR M22 SCAR]

chrA1 (11,636,716 -28,936,570)

chrC1 (18,207,518 – 23,807,810) and chrC01 (37,139,336 -46,454,126)

XA1 (9,119,600 – 24,979,656)

C1 (17,054,176–17,059,862) and (32,805,430- 42,695,072)

(Kim et al., 2011)

B. oleracea Pp523** CB10139 159G23_R 33N5_R 24H17_R

chrC8 (23,172,112) and chrC08

chrA9 (30,451,080 ) and chrA09 (42,648,154 –

XA9 (149,523- 1,595,774)

C8 (38,878,775 – 41,519,284)

(Farinhó et al., 2004, Farinhó et al., 2007, Carlier et al., 2011, Carlier et al., 2012)

95

AGC.CAT_300(SCAFB1) AAG.CTA_113 CB10028 OPR15_920 (SCR15) OPJ19_550 (SCJ19) OPK17_980 SCAR AT.CTA 133/134

(42,851,417 – 44,412,664)

43,351,988)

PpALG1 OPU20_1307 ScJ18_530 ScK04_434 OPK04 667 OPU20_970y OPT08_1112y ScAN12

chrC4 (14,179,370 – 37,643,262)

No No C4 (13,684,813- 24,945,714)

(Alabaça, 2009)

Downy mildew resistance locus Dm

UBC359620

OPM16750 chrC2 (54,044,677 -54278743)

No No C2 (50,021,049-50,193,751)

(Giovannelli et al., 2002)

(*) The BrRHP1 locus had two homeologous QTL intervals separated by a region of ~14 Mbp, so, they are considered as two separate QTL intervals.

(**) The Pp523 locus had two primary QTL intervals and two homeologous QTL intervals separated by a 19 Mbp region, so they are considered as

two separate QTL intervals.

96

Table ‎4.3: Marker homology search of blackleg resistance QTL flanking or linked markers for identification of QTL intervals in B. napus

Origin species

Gene or locus

Flanking marker

Marker homology search Chromosome and QTL interval region (Mbp)

References B. napus v6 A genome

B. rapa v3

B. rapa

LepR1 BnGMS254 BRMS-215

chrA2 (808,646 -2,314,919)

XA2 (3,976,187-3,052,782)

(Yu et al., 2005, Yu et al., 2008)

LepR3 Ind10-04 Ind10-17 Ind10-12 Ind10-13 Ind10-08 Ind10-07 Ind10-03

chrA10 (16,001,480 -16,265,343)

XA10 (3,096,287 -3,422,354)

(Larkan et al., 2013)

BLMR1 1128BG275 87B10a 9B23c 9B23a 9B23b 80A08a 80E24

chrA10 (13,961,320 – 16,241,031)

XA10 (2,600,133- 4,586,316)

(Long et al., 2011)

BLMR2 G278 10B23a 12D09a 1J13a 1J13b R278

chrA10 (10,781,132 – 13,515,501)

XA10 (6,989,137- 9,650,582)

(Long et al., 2011)

B. napus

Rlm gene cluster Rlm9, Rlm4, Rlm7 and Rlm1

chrA7 (10,985,714 – 25,262,138)

XA7 (6,721,636 – 22,178,571)

(Delourme et al., 2004, Leflon et al., 2007, Raman et al., 2012b)

Rlm9 BnGMS665 CB10450

chrA7 (12,963,123– 14,389,391)

XA7 (8,889,906 – 9,867,760)

97

BRAS023

Rlm4 BRMS075 Na12-E11b BRMS040 BRMS005b

chrA7 (14,214,108 -15,000,000)

XA7 (9,728,451 – 10,819,936)

Rlm7 na14B03 chrA7 15,894,933

XA7 13,033,236

Rlm1 kbrh143h15 bn204 na12-a02 ra2-a05

chrA7 (22,005,183- 25,262,138)

XA7 (16,180,246 – 19,196,267)

98

Figure ‎4.1: The location of disease resistance loci QTL in the chromosomal map of B. napus. The

disease resistance loci QTL indicated clubroot (red), downy mildew (black) and blackleg (green).

The number within brackets shows the QTL interval size in Mbp. The “h” means that the QTL

intervals are homeologous. Brackets are not to scale the distance but to indicate the relative location

of genes within QTL interval.

99

4.4.2 Co-localization and association of NBS-LRR genes and QTL intervals

The co-localization of the disease resistance NBS-LRR genes and the QTL intervals on each

chromosome were examined and correlated with the NBS-LRR genes clusters and duplications.

Based on the findings by (Kang et al., 2012) that QTL and NBS-LRR genes tended to be co-

localized and NBS-LRR genes are often positioned in the 2 Mbp flanking regions of a QTL, all the

NBS-LRR genes located in the QTL, including the 2 Mbp flanking regions were considered

associated NBS-LRR genes and included in the analysis.

The number of NBS-LRR genes associated with the QTL intervals ranged from 1 to 19, from a total

number of 232 genes in all the QTL intervals (Table 4.4). Approximately 40% of the associated

NBS-LRR genes are involved in a gene cluster, 58% are duplicates and 62% are conserved between

B. napus and its related species (Table 4.4). The number of associated NBS-LRR genes was

variable between primary and homeologous QTL intervals. For instance, downy mildew BrRHP1

contained 19 NBS-LRR genes in the primary QTL intervals and 12 NBS-LRR genes in the

homeologous QTL intervals.

Regression analysis showed that there are significant correlations between the number of NBS-LRR

genes within the QTL intervals and the number of genes involved in gene clusters or duplication

(Figure 4.2 and 4.3). The R2

value of 0.84 and 0.80 for the correlation of number of genes involved

in gene clusters or duplications was observed, respectively.

100

Table ‎4.4: Correlation between the number of NBS-LRR resistance genes and disease resistance QTL intervals in chromosomes of B. napus

Chromosome

Total No. of NBS LRR

genes

Number of QTL

QTL Disease resistance loci

in B. napus

Physical interval, Size and location (Mbp)

No. of NBS LRR

within 2 Mbp

flanking region of QTL

Gene cluster

No. of genes in cluster

No. of duplicate

genes

No. of conserved

genes

chrA1 28 2 clubroot Crr2

1 Mbp (4,791,456 -6,335,486)

3 0 0 3 3

downy mildew BrRHP1

17 Mbp (11,636,480 -28,936,570)

19 1 4 11 9

chrA2 44 3

blackleg lepR1 1.5 Mbp

(808,646 -2,314,919) 10 1 7 5 7

clubroot Crc 11.5 Mbp

(12,021,130 -23,656,219) 19 2 14 11 12

homeologous clubroot

pbBo(Anju)1

1.8 Mbp (26445222 -28289273)

14 1 7 6 6

chrA3 24 2

clubroot Crr3 and CRk

4.7 Mbp (14411852 -19198438)

8 1 4 5 4

clubroot CRa and CRb

2 Mbp (24281973 -26478686)

7 0 0 5 4

chrA6 21 1 clubroot Crr4 2 Mbp

(100,821 -2,122,156) 8 0 0 7 4

chrA7 19 4

Blackleg gene cluster Rlm3 Rlm9 Rlm4 Rlm7 Rlm1

14 Mbp (10,985,714 -25,262,138)

12 1 4 7 8

chrA8 24 2 clubroot Crr1 2.8 Mbp

(9,157,719- 12,043,127) 5 0 0 4 3

101

downy mildew BraDM

809 kbp (17,718,157- 18,527,908)

9 1 3 5 4

chrA9 58 2

downy midlew RPP13MK

3,964,312 3 1 3 3 2

homeologous downy mildew

Pp523

30,451,080 And

703kbp (42,648,154- 43,351,988)

15 2 10 13 10

chrA10 11 1 blackleg BLMR2

, BLMR1 and lepR 3

6 Mbp (10,781,132- 16,770,124)

10 0 0 5 5

chrC1 37 2 homeologous downy mildew

BrRHP1

5 Mbp (18,781,127- 23,808,081)

and 9 Mbp

(37,139,336 – 46,454,793)

12 1 4 6 8

chrC2 51 2

Clubroot pbBo(Anju)1

51,280,390 1 0 0 1 1

Downy mildew resistance locus Dm

54,278,743 1 0 0 1 1

chrC3 56 2

clubroot PbBo(Anju)3

1.6 Mbp (2,031,456- 3,680,316)

9 1 5 4 6

homeologous region to Crr3

2 Mbp (22,088,816- 24,102,086)

1 0 0 1 1

chrC4 18 1 downy mildew

PpALG1 23 Mbp

(14,179,370- 37,643,262) 10 0 0 2 7

chrC6 44 1 homeologous region to Crr4

5 Mbp (3,923,239- 9,621,029)

12 1 7 10 7

chrC7 38 1 homeologous region to CRa

and CRb

1.3 Mbp (43,472,000- 44,774,737)

13 1 6 6 12

chrC8 39 2 downy mildew

Pp523 23,172,112

and 27 3 16 12 17

102

837 kbp (42,851,417- 44,412,664)

chrC9 79 1 homeologous

RPP13MK 5,472,171 4 0 0 3 3

Total 591 29 -------- ----------------- 232 18 94 (40%) 136(58%) 144(62%)

103

Figure ‎4.2: The correlation between the number of NBS-LRR genes within the 2Mbp flanking

regions of QTL and the number of NBS-LRR genes involved in gene clusters

104

Figure ‎4.3: The correlation between the number of NBS-LRR genes within the 2Mbp flanking

regions of QTL and the number of duplicated NBS-LRR genes.

105

4.5 Discussion

4.5.1 Identification and localization of QTL intervals in B. napus

Several mapping studies have previously identified QTL for diseases such as blackleg, clubroot and

downy mildew in B. napus, B. rapa and B. oleracea. The availability of linked or flanking markers

and QTL genetic maps, together with the genome sequence and disease resistance NBS-LRR genes,

provide an efficient strategy to examine the characteristics of QTL intervals. This allows

identification and determination of the physical locations of these QTL intervals, based on the

physical location of marker information against the B. napus genome. Ultimately, this results in a

comprehensive co-localization of disease resistance QTL intervals and their association with NBS-

LRR genes in B. napus.

A total of 27 QTL intervals for major resistance genes for clubroot, downy mildew and blackleg

disease resistance traits were identified in B. napus. These QTL intervals are distributed throughout

the genomes. The number of identified QTL intervals in B. napus was 12, 12 and three for

resistance for clubroot, downy mildew and blackleg, respectively.

The QTL intervals identified in B. napus were classified into primary QTL intervals and

homeologous QTL intervals. The primary QTL intervals correspond to similar locations on the

respective genetic maps whereas homeologous QTL intervals are due to the polyploid genome in B.

napus and genome duplication (Li et al., 2013). Given that B. napus originated from two highly

conserved diploid genomes of B. rapa and B. oleracea, the relationships between homologous

regions of QTL or major resistance genes can support the inheritance pattern of the resistance traits.

Therefore, it could be hypothesized that B. napus has retained the homeologous QTL intervals

inherited from the ancestral species: B. rapa and B. oleracea. This is also seen by (Lestari et al.,

2013) showing that the major QTL for soybean seed protein and oil in Chr20 had duplicated

homeologous regions in Chr10 that occurred due to duplication event. Therefore, the primary and

homeologous QTL intervals on different chromosomes might result from genome duplication and

then undergo rapid divergence and reorganization. The primary and homeologous QTL in B. napus

could also correspond to a homeologous region in either B. rapa or B. oleracea. For this reason, the

present study assumed that these two QTL are produced by polyploidization.

Interestingly, there were differences in the number of associated NBS-LRR genes and the QTL

interval sizes between primary and homeologous QTL intervals. This emphasizes that the

polyploidy and genome rearrangements might have a major impact on the structure of QTL interval

and the contribution to disease resistance.

106

Overlapping QTL intervals for several resistance loci were found. For instance, four overlapping

QTL intervals for blackleg resistance genes Rlm9, Rlm7, Rlm4 and Rlm1 were on chromosome A7

of B. napus (Delourme et al., 2004). In addition, these QTL are located within an extensive intra-

chromosomal tandem duplication region (Raman et al., 2012b, Tollenaere et al., 2012). If a QTL

overlapped with other QTL identified from different research, this region could contain a potential

cluster of genes or allelic variants with resistance to different pathogen races.

Co-mapping of QTL intervals identified in B. napus to its related species, B. rapa and B. oleracea

were performed for identifying genomic syntenic relationships. I found that most QTL intervals co-

mapped between B. napus and the corresponding species. This is an indication that the QTL

intervals in B. napus were originally contributed by the parental species. Synteny and conservation

between the B. napus, B. rapa and B. oleracea genome has been previously established (Parkin et

al., 2003, Rana et al., 2004, Jiang et al., 2011, Chalhoub et al., 2014). The comparative QTL

intervals between B. napus, B. rapa and B. oleracea revealed the majority of NBS-LRR genes in

associated QTL intervals are conserved suggesting that the QTL underlying these resistance traits

might show some structural and functional conservation. A study by Jiang et al. (2011) also found

that the QTL region for seed oil, total seed glucosinolates, and seed protein content were conserved

between the A sub-genome of B. rapa and B. napus.

The primary and homeologous QTL intervals in B. napus which existed in syntenic region between

the sub-genomes were originally derived from the genomic region of the ancestral genome and

distributed into two different genomic regions following genome hybridization. Werner et al. (2008)

has suggested that a resistance locus controlled by a major gene in the progenitor genome might

have been duplicated to give rise to two functionally different genes. Alternatively, a resistance

locus containing a cluster of genes within the progenitor genome may have separated into two

different genomic regions due to major chromosomal rearrangements. This further supports our

finding that the primary and homeologous QTL intervals on different chromosomes of B. napus

might result from genome rearrangement and duplication.

I also observed several QTL intervals in the B. napus sub-genome that had no counterpart QTL

intervals in the corresponding sub-genome of either A or C genome. However, they were

corresponding to either the A genome of B. rapa or C genome of B. oleracea. This indicates that the

two ancestral species are not likely to share disease resistance QTL. These could be species-specific

QTL.

107

Taken together, it is interesting to observe how the syntenic QTL intervals regions between B.

napus and its related species B. rapa and B. oleracea show similar gene content, sequence and

structural similarities of NBS-LRR genes. This would further assist in understanding their

conservation and evolution on the diversity of NBS-LRR genes.

4.5.2 Co-localization and association of NBS-LRR genes and QTL intervals

After the identification of QTL intervals, the association and co-localization between the QTL

intervals for disease resistance loci and NBS-LRR genes was analyzed. In this study, I found that

the identified QTL intervals for the three resistance traits appeared to co-localize with NBS-LRR

genes. The total number of NBS-LRR genes associated with the QTL intervals was 239 genes

representing approximately 37% of the total NBS-LRR genes in B. napus. This suggests that, as

expected, localization of QTL intervals for major resistance genes are correlated with the presence

of disease resistance genes. A similar co-localization between disease resistance QTL and NBS-

LRR resistance genes has been found in other plants (Kang et al., 2012, Wei et al., 2013). For

example, QTL resistance loci were found to be co-localized with NBS-LRR resistance genes and

resistance gene analogues (RGAs) in common bean (Geffroy et al., 2000), apple (Calenge et al.,

2005), Barley (Madsen et al., 2003) and potato (Bakker et al., 2011). Therefore, the association and

co-localization between NBS-LRRs and QTL disease resistance loci is a common feature of plant

genomes.

Furthermore, the correlation between NBS-LRR gene cluster and duplication within the 2 Mbp

flanking region of QTL intervals was studied. The majority of QTL intervals harbor NBS-LRR

gene clusters. There was a significant correlation and co-localization between the number of NBS

LRR genes within the QTL intervals and the number of genes involved in gene clusters or

duplication. This is consistent with previous studies which found QTL located in a resistance gene-

rich region harbouring NBS-LRR gene clusters which, confer resistance to a number of different

pathogens (Jeong et al., 2001, Wang et al., 2010).

A clustered region of NBS-LRR genes were found to be co-located with different QTL conferring

fungal resistance in soybeans (Kang et al., 2012). Moreover, synteny analysis between B. rapa and

Arabidopsis revealed that two clubroot resistance loci: Crr1 and Crr2, overlap and align to a region

in chromosome 4 of Arabidopsis (Suwabe et al., 2006). This region in Arabidopsis was previously

identified as containing clusters of disease resistance genes (Speulman et al., 1998). For example,

RPP5 for resistance to downy mildew pathogen Peronospora parasitica and RPS2 for resistance to

Pseudomonas syringae (Bevan et al., 1998, Suwabe et al., 2012).

108

The correlation and association of NBS-LRR genes in the B. napus genome were apparent between

gene clusters and QTL intervals to a particular disease resistance. The duplicated NBS-LRR genes

are also retained in disease resistance QTL suggesting that genes involved in tandem duplication

raise the likelihood of creating redundant NBS-LRR genes underlying QTL resistance between

recently duplicated regions. This explains why the presence of gene clusters and duplications in a

major disease resistance QTL can play a role in the resistance evolution. In addition, this suggests

that the NBS-LRR genes are clustered and associated with particular QTL resistance loci in a

manner that reflects their functional relevance to disease resistance.

In this study, there was a correlation between the number of NBS-LRR genes within the QTL

intervals and the number of genes involved in gene clusters or duplication. Complex structures of

QTL can be as a result of resistance gene clustering and duplication leading to closely linked genes

which differ in their pathogen race specificity (Michelmore and Meyers, 1998). Thus, the clustering

of resistance genes in QTL intervals is advantageous by maintaining resistance while allowing

novel specificities to evolve. The results here provide evidence that NBS-LRR are distributed and

clustered throughout the genome and tend to be linked and associated with QTL intervals. These

QTL intervals containing gene clusters are likely to be major components of the resistance.

In order to validate whether the associated NBS-LRR genes with the identified QTL intervals are

related to any disease resistance genes that have been identified and sequenced in Brassica species.

I confirmed that B. rapa Cra clubroot resistance gene (Ueno et al., 2012) was orthologous and

conserved with a TIR-NBS-LRR gene in the reported CRa primary QTL intervals on chromosome

A3 in B. napus. Similarly, another B. rapa clubroot resistance gene Crr1a (Hatakeyama et al.,

2013) was also orthologous and conserved with a TIR-NBS-LRR in the primary Crr1a QTL

intervals on chromosome A8 in B. napus. Moreover, the blackleg LepR3 resistance gene (Larkan et

al., 2013) and Rlm2 candidate gene (Larkan et al., 2014) were found to be located within identified

LepR3 QTL intervals on chromosome A10 in B. napus. In addition, a Rlm4 blackleg candidate gene

was identified within the Rlm4 QTL intervals (Tollenaere et al., 2012).

109

Chapter 5

Comparative analysis enabled the

identification of candidate Rlm9

blackleg resistance genes in Brassica

napus

110

Chapter 5 : Comparative analysis enabled the identification of candidate Rlm9

blackleg resistance genes in Brassica napus

5.1 Abstract

The availability of genome sequences of Brassica species have made it possible to study and

understand the structure of disease resistance genes in Brassica napus. This provides an invaluable

resource for the identification and comparative analysis of disease resistance genes. A total of 58

disease resistance genes on chromosome A7 of B. napus: 19 NBS-LRRs, 15 LRR-RLPs and 24

LRR-RLKs were identified. The gene content and order between B. napus and B. rapa was

consistent and further confirmed the chromosome microsynteny and conservation.

Twelve NBS-LRR and 18 LRR-containing genes on B. napus chromosome A7 were located within

a Rlm (Resistance to Leptosphaeria maculans) QTL region of interest. Through previous genetic

mapping studies, this region is known to harbour a cluster of genes: Rlm9, Rlm4, Rlm7, Rlm3 and

Rlm1 that confer resistance to blackleg disease (Leptosphaeria maculans), the most devastating

fungal pathogen of B. napus. The comparative analysis of resistance genes in the Rlm QTL revealed

that there was higher functional conservation of resistance genes in chromosome A7 of B. napus

and B. rapa than in chromosome C6 of B. napus and B. oleracea.

In this study, a candidate gene approach combined with comparative analysis was exploited for

identification of Rlm9 candidate genes in B. napus. Six NBS-LRR and eight LRR-containing genes

were associated with the Rlm9 QTL region. The NBS-LRR genes were selected for further analysis

as the highest priority candidate genes. It was possible to amplify and sequence these genes in B.

napus cultivars that differ in Rlm9-based resistance. This confirmed that three genes can be

excluded as no polymorphism was observed while another three genes became strong candidates for

further investigation. This chapter provides a detailed analysis of the identification of candidate

blackleg resistance genes and further understating of the structural conservation.

111

5.2 Introduction

Genetic mapping studies showed that all Leptosphaeria maculans resistance genes in B. napus, the

most devastating fungal pathogen of B. napus, are found on the A genome of B. napus (Ananga et

al., 2006, Leflon et al., 2007) and are organised in clusters in two genomic regions: the Rlm2 and

LepR3 resistance genes on chromosome A10 (Pilet et al., 2001, Larkan et al., 2013, Larkan et al.,

2014) and five specific resistance genes: Rlm1, Rlm3, Rlm4, Rlm7, and Rlm9 are on chromosome

A7 (Delourme et al., 2004). A cluster of NBS-LRR genes on B. napus chromosome A7

corresponding to the mapped genetic resistance genes to L. maculans were identified based on a

marker homology based approach and genome wide NBS-LRR gene identification (described in

chapters 2 and 3). At present, none of these Rlm genes on chromosome A7 have been sequenced

and validated in B. napus.

A QTL for the Rlm9 resistance gene has been identified from a mapping population derived from B.

napus cultivars Darmor and Yudal (Delourme et al., 2004). This QTL is located on chromosome A7

of B. napus, which also contains other resistance genes: Rlm1, Rlm3, Rlm4 and Rlm7 (Delourme et

al., 2004). The identification of candidate blackleg resistance genes in B. napus has proven difficult

by map-based cloning approaches due to the complex genomic structure of this polyploid species

(Mayerhofer et al., 2005, Tollenaere et al., 2012).

An alternative approach called the candidate genes approach can be used to solve the problems

faced by map based cloning, in which QTL length intervals may be in mega base pairs (Mbp) and

include hundreds of genes. This approach is used to eliminate the need for fine mapping and

narrowing down the genomic regions by searching to discover the genes of interest all at once using

gene annotation and prediction. The rationale of the candidate gene approach is based on the

hypothesis that candidate genes should correspond to the loci controlling traits of interest if a

relationship between candidate gene and disease phenotype exist then significant polymorphism

between contrasting species or cultivars should be observed. Thus, comparison of disease resistance

genes between related species may provide insight into the conservation of these genes. Also,

comparison of syntenic loci known to harbour resistance genes may indicate candidate genes

governing the trait of interest that is yet to be identified. In a previous study, the collinearity

between B. napus and B. rapa was exploited to identify a number of candidate genes associated

with the Rlm4 blackleg resistance gene (Raman et al., 2012b, Tollenaere et al., 2012). In addition,

LepR3 gene identification and cloning was facilitated by the comparative collinearity of genetic

markers between B. napus and Arabidopsis (Larkan et al., 2013).

112

A common feature of plant resistance genes is the presence of leucine rich repeats (LRR). The LRR

domain plays a major role in recognition of pathogen effectors by facilitating protein–protein

interactions (McDowell and Woffenden, 2003). The LRR domain is found in NBS-LRR, as well as

LRR-containing proteins. The NBS-LRR genes are the largest class of plant resistance genes and

function predominantly as disease resistance proteins. They are located intracellularly and may

interact directly or indirectly with the effector proteins encoded by Avr genes. In contrast, the LRR-

containing proteins, classified into LRR-receptor-like proteins (LRR-RLPs) and LRR-receptor-like

kinases (LRR-RLKs) have an extracellular LRR domain and are anchored to the plasma membrane

(Kruijt et al., 2005, Yang et al., 2012). The LRR-RLPs differ from LRR-RLKs in that they lack the

cytoplasmic kinase domain. LRR-RLPs and LRR-RLKs are involved in both defence and

developmental pathways and play a role in perceiving extracellular signals to initiate an

intracellular response (Fritz-Laylin et al., 2005, Kemmerling et al., 2011)

The availability of sequenced Brassicas genomes makes candidate genes discovery more effective

by offering genomics resources. These genomics resources can be combined together with

information from QTL mapping to enable disease resistance gene identification (NBS-LRR and

LRR containing genes) and in this study for comparative analysis to investigate the Rlm9 QTL

region for candidate genes that can have potential for future sequencing and validation.

In this chapter, identification and characterization of Rlm9 candidate genes was attempted using

genomics approaches. Firstly, the localization of disease resistance genes such as NBS-LRR and

LRR-containing genes in chromosome A7 of B. napus cv. Darmor were identified and association

with Rlm9 resistance performed (chapter 4). Secondly, SNP genotyping (Illumina Infinium assay)

was used to genotype a number of B. napus cultivars carrying different complements of Rlm genes,

including Rlm9-positive and Rlm9-negative B. napus cultivars for genome wide association studies

(GWAS) with the Rlm9 QTL. Then, the comparative analysis of disease resistance genes between

B. napus and other Brassica species can be performed to help to identify any functional sequence

variations.

113

5.3 Methods

5.3.1 Brassica genome and gene prediction

The genome sequence and annotation of B. napus cv. Darmor v6 was used as the reference genome

(Chalhoub et al., 2014). The genome sequence of B. rapa v3.0 (Wang et al., 2011) and B. oleracea

cv. To1000 v1.0 (Parkin et al., 2014) were also used for comparative analysis. The NBS-LRR gene

annotation and identification was described previously chapters 2 and 3.

5.3.2 Identification of NBS-LRR and LRR- containing genes in B. napus

NBS-LRR candidate genes in B. napus were identified as described in chapter 2.

The identification of LRR containing proteins identification on A7 and C6 were based on

MAST/MEME output (chapter 2). All B. napus predicted genes were sorted by the MAST E-value.

Predicted genes containing more than 10 LRR motifs to be considered as LRR containing genes

with an E-value between -24 to 10 were selected. Then, they were further verified by BLASTp with

a threshold of E < -10 against the NCBI non-redundant (nr) protein database to check if they were

LRR-RLP or LRR-RLK.

5.3.3 Comparative analysis of NBS-LRR and LRR-containing gene on the A7 and C6

chromosome of B. napus

A reciprocal best BLAST (RBB) analysis of genes in B. napus was used to identify orthologous

genes in the B. rapa and B. oleracea genomes (chapter 3). The identification of homeologous or

paralogous genes were confirmed by the BLASTp comparison between genes between B. napus A7

and C6. In addition, the gene sequences were used to determine the syntenic positions between

chromosome A7 in B. napus and B. rapa, as well as chromosome C6 in B. napus and B. oleracea.

The genes showing homology and collinearity were regarded as syntenic genes. Then, the gene

sequences were aligned using CLUSTAL W in Geneious Pro v6.1.2 (Kearse et al., 2012).

5.3.4 Genetic mapping of the Rlm9 QTL

The genetic map position for Rlm9 was determined in a B. napus Darmor x Yudal mapping

population, segregating for blackleg resistance (Delourme et al., 2004). The markers underlying the

QTL on chromosome A7 were physically assigned to the B. napus Darmor A genome using

BLAST (Altschul et al., 1990) in Geneious Pro v6.1.2 (Kearse et al., 2012). The gene prediction

and genome wide identification of NBS LRR disease resistance across B. napus cv. Darmor

genome, particularly on chromosome A7, was used to identify the putative NBS-LRR or LRR-

114

containing genes to be selected as potential candidate genes for further analysis. Furthermore, all

Rlm9 candidate genes were subject to comparative sequence analysis between B. napus cv. Darmor,

B. napus cv. Tapidor and B. rapa.

Illumina short-sequence reads for B. napus cv. Skipton and cv. Ag-Spectrum, generated on the

Illumina Genome Analyser IIx (GAIIx) system, by the Batley lab, in collaboration with Prof

Edwards’ bioinformatics group, UQ, were mapped to the B. napus cv. Darmor genome by Prof

Edwards’ group. Given that Skipton is known to contain Rlm4 and Rlm9 genes, while Ag-Spectrum

contains Rlm9 (Raman et al., 2012), they were selected in this analysis. It is expected that there

would be no sequence variation in candidate Rlm9 genes when compared to cv. Darmor (Rlm9), but

they should show sequence variations if the candidate gene is Rlm4. The reads mapping in the

Rlm9 candidate genes underlying the Rlm9 QTL were visualized using Geneious Pro v6.1.2 (Kearse

et al., 2012) to assess sequence coverage and variation between the cultivars: Skipton and Ag-

Spectrum when compared to Darmor. Consensus sequences from short-sequence reads were

obtained for B. napus cultivars Skipton and Ag-Spectrum for each individual gene.

5.3.5 SNP genotyping of B. napus cultivars

Total genomic DNA was extracted from different B. napus cultivars and genotyped using a B.

napus custom array Infinium HD 60K assay with 52,157 SNPs throughout the B. napus genome as

implemented in the Illumina Infinium assay (Illumina Inc., San Diego, CA). The Infinium assay

was performed as described in the Illumina manual protocol.

SNP data processing, clustering and genotype calling were performed using the genotyping module

in the GenomeStudio software (Illumina, San Diego, CA) with default parameters. The software

assigns three clusters on a graph based on the fluorescence obtained. The clusters were reviewed

and manually edited when necessary. Limiting the analysis in this study to SNPs present on

chromosome A7, genotype calls for all SNPs loci were imported into Microsoft Excel 2013

(Microsoft) for further analysis.

For genotyping analysis, 26 B. napus cultivars including Darmor and Yudal were classified as Rlm9

resistant and susceptible cultivars based on the Rlm gene composition obtained from published

literature (Marcroft et al., 2012a) (Table 5.1). The cultivar Darmor (Rlm9 resistant) was selected as

a reference for the resistant individuals and Yudal (Rlm9 susceptible) for the susceptible

individuals. To aid in the comparison, all Darmor genotype calls were renamed as ‘AA’ and all

115

Yudal calls as ‘BB’, while the calls in the other lines varied depending on whether they carried the

Darmor or Yudal genotype at that position.

Table ‎5.1: List of B. napus cultivars genotyped with the SNP genotyping assay

B. napus cultivars

Rlm gene composition

Darmor Resistant reference Rlm9

Yudal Susceptible reference rlm9

Skipton

Positive Rlm9 cultivars

Rlm4 & Rlm9

Bravo-TT Rlm4 & Rlm9

Tornado-TT Rlm4 & Rlm9

AV-Sapphire Rlm4 & Rlm9

ATR-Stingray Rlm3 & Rlm9

ATR_Cobbler Rlm4(H) & Rlm9(H)

Ag-Spectrum Rlm9(H)

ATR-Stingray Rlm3 & Rlm9

Dunkeld

Negative Rlm9 cultivars

Rlm4

Nurendra Rlm4

Telfer Rlm4

Thunder-TT Rlm4

ATR-Summit Rlm4

Monty_028DH Rlm4

Taparoo Rlm3

Carousel Rlm1 & Rlm2

Surpass400 Rlm1 & RlmS

Argyle Rlm1 & RlmS

B2 B. napus "Lynx_037DH"

Unknown

C2 B. napus "Ningyou7"

RSO

Trilogy

Trigold

Boomer

“H” Indicates partial resistance.

116

5.3.6 Extraction of genomic DNA

Seeds for B. napus cultivars used in this study were obtained from a seed collection at the Batley

lab at the University of Queensland. They were germinated and planted in small polyvinyl pot filled

with potting mix in a temperature controlled glasshouse under 28o

C with natural lighting. Young

leaf tissue was collected for DNA extraction.

Genomic DNA was extracted from the different B. napus cultivars, according to a DNeasy Plant

Mini Kit (QIAGEN). Approximately 200 mg of leaf tissue was ground (dry) with a pestle and

mortar, sitting in liquid nitrogen, into a fine powder, then, transferred into a 1.5 mL microcentrifuge

tube. Then DNA was extracted following the manufacturer’s instructions. DNA concentration and

purity were measured using the NanoDrop (ND-1000 Spectrophotometer, Thermo Scientific). DNA

was also electrophoresed on a 1% TAE agarose gel containing ethidum bromide at 100 V for 30

min to check the DNA quality.

5.3.7 PCR amplification and sequencing of Rlm9 candidate genes

The selected Rlm9 candidate genes were amplified from genomic DNA of B. napus cv. Darmor and

B. napus cv. Yudal as well as other B. napus cultivars (Table 5.2) using PCR with the primers

listed in Table 5.3.

The primers pairs were designed based on the B. napus Darmor reference genome (Chalhoub et al.,

2014). The primer pair specificity was checked by BLAST analysis against the whole genome to

avoid non-specific binding as a result of duplications.

Long range PCR was carried out using the Expand Long Template PCR kit (Roche) following the

manufacturer’s instructions. A 50 ng aliquot of genomic DNA was amplified in a reaction mix

containing 1x PCR buffer, 1 U DNA polymerase mix, 500 μM dNTPs, and 400 nM forward and

reverse primers. The thermal cycling conditions were: 92 °C for 2 min; followed by 10 cycles of 92

°C for 10 sec, Tm °C for 15 sec, 68 °C for 60 sec/1Kbp; followed by 25 cycles of 92 °C for 10 sec,

Tm °C for 15 sec, 68 °C for 60sec/1Kbp +20 sec increment each cycle; then 68 °C for 7 min. PCR

products were resolved by electrophoresis in a 1 % agarose gel stained with ethidium bromide.

117

Table ‎5.2: List of the B. napus cultivars and their Rlm complement used for the PCR amplification

B. napus cultivar Rlm composition Remarks

Darmor Rlm9 Resistant Rlm9 reference. It is the reference genome for B. napus.

Yudal rlm9 Susceptible Rlm9 reference

Tapidor Rlm2 The genome sequence was available in-house.

Skipton Rlm4 and Rlm9 They have short sequence reads available. They both

contain the Rlm9 gene, but differ in Rlm4 Ag-Spectrum Rlm9

Jet Neuf Rlm4 and Rlm1

Westar No Rlm Used as negative control because it has no Rlm gene

Table ‎5.3: List of primer pairs used for long range PCR amplification of Rlm9 candidate genes

Target gene

sequence 5' --------> 3' Product size bp

PCR and sequencing

Results

BnDRG 13 and 14

F. TCCACGCTTACAACGGATAATTCTAGCC 14,900 Successful

R. ACAGCAGATGGATTACACGTACAACACC

BnDRG 17

F. GTTCTAAGGCTCTGACCAGTGT 3,000

Not Successful

R. CGGTGGCAGTGTTTCGAAACAT

F. TTACGGTTAGTACTTAGTAGCGGTGG

4,200 Not

Successful R. GCTTGTGCAGTTGAAGAATGGATAAC

F. ACTGCTATATTGTTATAGGTCGGCCG 2,000

Not Successful

R. TTGATGTTCTAAGGCTCTGACCAGTG

BnDRG 19

F. CGGTTCGTAAGCCACCTAATCTAAGACC 17,500

Not Successful

R. GAATCGGAATTGAGGAGAGGCTTAGAGG

F. GTAGCTATGTTCTCTCCGAGTATCCAC 11,800

Not Successful

R. GAAGGAAGTAGTAGAGCCAAGCATGAC

F. ATGCCAAATTTCTGCGTTTCG 1,740

Successful but not

sequenced R. TCAGGTCGAAATGTTGGAGGA

F. TTGGCCATATTATTCGGAACCG 6,400

Successful for all

cultivars except Ag-

R. GTAGTCACCTATGTACATCCGC

118

Spectrum

BnDRG 20

F.CTAAGCCTCTCCTCAATTCCGATTCAGC 18,000

Not Successful

R. GTCTACAGGTTGAAGTACACGATCTGGC

F. TGAAGACATTACGGTCGACTATGC 18,900

Not Successful

R. GAGAAGATCGTCGGTGCATTAACG

F. AGTATAGGATACGGTGTTGTAGTGCC 18,900

Not Successful

R. CTTGAGAAGATCGTCGGTGCATTAAC

F. TTACAGTGATGACTAGTTCATGGCCG

6,400

Successful for all

cultivars except Ag-Spectrum

R. TTGCTAGCTCATTACTGATCTCGGAG

F. ATGGCTTCTTCTTCTCACCGG ATA AGG

4,000

Successful for all

cultivars except Ag-Spectrum

R. CTAATCCATTGCGCTGCTACTACT CTC

BnDRG 23 F. CTTCAGGGTTGGTCCATTGGCACATGTC

16,000 Successful R. GTCATCTTCCTCACAGTCATCGTCAAGG

5.3.8 DNA Sequencing and Sequence Analysis

Purified PCR products were sequenced using the Sanger sequencing method via the Australian

Genome Research Facility (AGRF, http://www.agrf.org.au). Sequencing samples were prepared and

submitted according to AGRF guidelines to be sequenced in both forward and reverse directions.

Raw sequences (ABI sequence chromatograms), obtained from AGRF, were analyzed using

Geneious Pro v6.1.2 (Kearse et al., 2012). The sequence analysis included mapping to reference

genome or de novo assemblies and sequenced candidate gene from cultivars, nucleotide and protein

alignments. Then, full gene sequence and sequence variation were confirmed.

119

5.4 Results

5.4.1 Identification and characterization of NBS LRR and LRR-containing genes on B.

napus chromosome A7

Based on the genome wide NBS-LRR identification, 19 NBS-LRR genes on chromosome A7 of B.

napus were identified (Table 5.4). Further sequence analysis and characterization of these candidate

NBS LRR genes indicated that 10 genes contained a typical motif pattern characteristic of NB-LRR

proteins: three CNLs and seven TNLs. The other genes were truncated. Four of them were truncated

with the TIR-NBS domain, but lacking the LRR domain. Truncated genes lacking a portion of the

NBS domain or the LRR region are often associated with typical NBS-LRR genes, but, due to

frameshift mutation or deletion, they become truncated. Five genes were TIR-containing proteins.

In addition, LRR-containing genes were identified by MEME/MAST analysis. This identified 15

Leucine-rich repeat receptor-like proteins (LRR-RLPs) genes and 24 Leucine-rich repeat receptor-

like kinase (LRR-RLKs) genes (Table 5.4).

There was correlation in the number of genes between the NBS-LRR and LRR-containing genes.

The majority of disease resistance genes are distributed towards the distal end of the short arm of

chromosome A7. The LRR-RLPs and LRR-RLKs genes were found to be clustered or dispersed

with the NBS-LRR genes. There were four gene clusters. The largest gene cluster consisted of five

NSB-LRR genes and five LRR-containing genes. Two gene clusters were located within the Rlm

QTL region (Figure 5.1).

Annotation of these NBS-LRR and LRR-containing genes on chromosome A7 show they share

high homology to known disease resistance genes in other plant species. They were designated

BnDRG 1 (Brassica napus Disease Resistance Gene 1) to BnDRG 58 (Table 5.4).

120

Table ‎5.4: List of characterized NBS-LRR and LRR-containing genes on chromosome A7 of B.

napus cv. Darmor. Candidate Rlm9 genes located within the Rlm9 QTL are highlighted

Given name GenBank AC Gene Position on B. napus

Darmor V6 Gene class

BnDRG 1 CDY31987 GSBRNA2G00050797001 412,213 LRR-RLP

BnDRG 2 NA GSBRNA2G00067320001 1,179,872 LRR-RLP

BnDRG 3 CDY31412 GSBRNA2T00047634001 3,824,209 LRR-RLP

BnDRG 4 CDY32057 GSBRNA2T00051369001 4,843,786 LRR-RLP

BnDRG 5 CDX77472 GSBRNA2T00128337001 8,533,320 LRR-RLP

BnDRG 6 CDY43181 GSBRNA2G00076179001 10,010,633 LRR-RLK

BnDRG 7 CDY43183 GSBRNA2G00076181001 10,028,308 LRR-RLK

BnDRG 8 CDY43185 GSBRNA2G00076183001 10,039,885 LRR-RLK

BnDRG 9 CDY43187 GSBRNA2G00076186001 10,054,452 LRR-RLK

BnDRG 10 CDY44292 GSBRNA2G00079920001 10,354,023 LRR-RLK

BnDRG 11 CDY15668 GSBRNA2T00089880001 11,054,163 LRR-RLK

BnDRG 12 CDY01430 GSBRNA2T00112430001 13,928,484 LRR-RLK

BnDRG 13 CDY01454 GSBRNA2T00112460001 14,052,790 NBS-LRR (CNL)

BnDRG 14 CDY01455 GSBRNA2T00112461001 14,057,850 NBS-LRR (CNL)

BnDRG 15 CDY04476 GSBRNA2G00118634001 14,670,288 LRR-RLK

BnDRG 16 CDY57686 GSBRNA2T00022534001 15,944,110 LRR-RLP

BnDRG 17 CDY67955 GSBRNA2T00068325001 16,055,043 NBS-LRR (TNL)

BnDRG 18 CDX67490 GSBRNA2G00098602001 16,703,378 LRR-RLK

BnDRG 19 NA chrA07-snap.4041 16,765,735 NBS-LRR (TNL)

BnDRG 20 CDX67500 GSBRNA2T00098616001 16,783,435 NBS-LRR (TNL)

BnDRG 21 CDX67564 GSBRNA2G00098700001 17,128,516 LRR-RLP

BnDRG 22 CDX67632 GSBRNA2G00098789001 17,581,237 LRR-RLK

BnDRG 23 CDX67660 GSBRNA2T00098829001 17,769,448 NBS-LRR (CNL)

BnDRG 24 CDX67736 GSBRNA2T00098923001 18,186,595 LRR-RLK

BnDRG 25 CDX98379 GSBRNA2T00098994001 18,459,810 LRR-RLP

BnDRG 26 CDX68199 GSBRNA2G00099459001 20,542,818 LRR-RLP

BnDRG 27 CDX68246 GSBRNA2T00099523001 20,755,045 LRR-RLK

BnDRG 28 CDX68251 GSBRNA2T00099529001 20,789,302 NBS-LRR (TNL)

BnDRG 29 CDX68253 GSBRNA2T00099532001 20,800,824 NBS-LRR (TNL)

BnDRG 30 CDX68254 GSBRNA2T00099533001 20,801,471 NBS-LRR (TNL)

BnDRG 31 CDX68276 GSBRNA2T00099557001 20,914,840 LRR-RLK

BnDRG 32 CDX68299 GSBRNA2G00099588001 21,048,465 LRR-RLK

BnDRG 33 CDX68312 GSBRNA2T00099607001 21,137,684 LRR-RLP

BnDRG 34 CDX68313 GSBRNA2T00099609001 21,141,755 LRR-RLP

BnDRG 35 CDY07346 GSBRNA2T00124303001 21,937,031 NBS-LRR (TNL)

BnDRG 36 CDY07347 GSBRNA2T00124305001 21,947,475 NBS-LRR (TNL)

BnDRG 37 CDY56524 GSBRNA2T00019706001 22,890,711 LRR-RLP

BnDRG 38 CDX95990 GSBRNA2G00102016001 23,418,593 LRR-RLP

BnDRG 39 CDX96066 GSBRNA2G00102107001 23,822,389 LRR-RLK

BnDRG 40 CDX96121 GSBRNA2T00102173001 24,103,221 NBS-LRR (TNL)

121

BnDRG 41 CDY38804 GSBRNA2G00066610001 24,695,717 LRR-RLP

BnDRG 42 CDX96320 GSBRNA2T00102468001 25,026,741 LRR-RLP

BnDRG 43 NA GSBRNA2G00102501001 25,150,024 LRR-RLK

BnDRG 44 CDX96381 GSBRNA2T00102534001 25,311,436 LRR-RLK

BnDRG 45 CDX96394 GSBRNA2T00102550001 25,376,215 LRR-RLK

BnDRG 46 CDX96437 GSBRNA2T00102605001 25,584,103 NBS-LRR (TNL)

BnDRG 47 CDX96438 GSBRNA2T00102606001 25,588,007 NBS-LRR (TNL)

BnDRG 48 CDX96439 GSBRNA2T00102607001 25,594,415 NBS-LRR (TNL)

BnDRG 49 NA GSBRNA2T00102609001 25,598,597 NBS-LRR (TNL)

BnDRG 50 CDX96441 GSBRNA2T00102611001 25,603,196 NBS-LRR (TNL)

BnDRG 51 CDX96448 GSBRNA2T00102618001 25,628,804 LRR-RLK

BnDRG 52 CDX96522 GSBRNA2T00102705001 25,942,706 LRR-RLK

BnDRG 53 CDX96537 GSBRNA2T00102721001 25,990,167 LRR-RLK

BnDRG 54 CDY47508 GSBRNA2G00087566001 26,443,758 LRR-RLK

BnDRG 55 CDX87373 GSBRNA2T00146342001 27,009,762 NBS-LRR (TNL)

BnDRG 56 CDX87396 GSBRNA2T00146370001 27,122,969 NBS-LRR (TNL)

BnDRG 57 CDX87418 GSBRNA2G00146395001 27,231,134 LRR-RLK

BnDRG 58 CDX87661 GSBRNA2T00146668001 28,378,921 LRR-RLK

Figure ‎5.1: Distribution of NBS-LRR and LRR-containing genes on chromosome A7 of B. napus.

Different colours represent different gene types: blue (TNL), red (CNL), green (LRR-RLKs) and

yellow (LRR-RLPs). Genes inside the black rectangles are in a gene cluster. The Genes inside the

red rectangle are within the Rlm QTL.

122

5.4.2 Comparative analysis of NBS-LRR and LRR-containing genes on chromosome A7 of

B. napus

In order to study the gene conservation between chromosome A7 and its homeologous chromosome

C6 in the Brassica species: B. napus, B. rapa and B. oleracea, the protein sequences and gene

collinearity were compared. The total numbers of NBS-LRR and LRR-containing genes were

consistent (Table 5.5). Genes in B. napus showed good orthology and collinearity relationships

with genes in B. rapa and B. oleracea (Figures 5.2 and 5.3). The B. napus chromosome A7 is

collinear and syntenic around gene clusters to the same chromosome of B. rapa which includes non-

collinear genes. Likewise, the B. napus chromosome C6 also exhibited the same gene pattern to B.

oleracea. The majority of the NBS-LRR and LRR-containing genes of B. napus chromosomes A7

and C6 have orthologs in B. rapa and B. oleracea, although a number of genes showed presence

and absence between species.

Table ‎5.5: Comparative analysis of the number of NBS-LRR and LRR-containing genes on

chromosomes A7 and C6 of the three Brassica species: B. napus, B. rapa and B. oleracea

Species

Gene

The whole chromosome Within Rlm QTL

B. n

ap

us

A0

7

B. n

ap

us

C0

6

B. r

ap

a

A0

7

B. o

lera

cea

C

06

B. n

ap

us

A0

7

B. n

ap

us

C0

6

B. r

ap

a

A0

7

B. o

lera

cea

C

06

NBS-LRR 19 44 18 45 12 18 9 18

LRR-RLP 15 10 14 9 9 6 9 6

LRR-RLK 24 25 23 25 9 8 8 7

Total 58 79 55 79 30 32 26 31

The comparison revealed that 14/19 (73%) and 34/39 (87%) genes of NBS-LRR and LRR-

containing genes, respectively, between A7 B. napus and B. rapa were highly conserved with high

protein sequence similarity. Six and 12 genes of NBS-LRR and LRR-containing genes,

respectively, were not collinear. Four and three genes of NBS-LRR and LRR-containing genes,

respectively, showed substantial protein sequence differences. Three NBS-LRR genes showed

presence and absence between the species (Figures 5.2 and 5.3).

The comparison also revealed that 27/44 (61%) and 33/35 (68%) genes of NBS-LRR and LRR-

containing genes, respectively, between chromosome C6 of B. napus and B. oleracea were highly

conserved with high protein sequence similarity and collinearity. Two and 10 genes of NBS-LRR

123

and LRR-containing genes, respectively, were not collinear. Six NBS-LRR genes showed

substantial protein sequence differences. No NBS-LRR and LRR-containing genes showed

presence and absence variation between species (Figures 5.2 and 5.3).

Furthermore, the resistance genes between A7 and C6 B. napus are considered homeologous

(duplicates). The comparison between chromosomes A7 and C6 B. napus revealed that 14 of each

NBS-LRR and LRR-containing genes between A7 B. napus and B. rapa were homeologous

between chromosomes A7 and C6 of B. napus. There were 6/14 (42%) and 14/14 (100%) genes of

NBS-LRR and LRR-containing genes, respectively, highly conserved with high protein sequence

similarity. One and five genes of NBS-LRR and LRR-containing genes, respectively, were not

collinear. Six NBS-LRR genes showed substantial protein sequence differences (Figures 5.2 and

5.3).

Taken together, the comparison revealed that the LRR-containing genes showed higher sequence

conservation than the NBS-LRR genes. The NBS-LRR genes show protein sequence divergence

and presence and absence while, the LRR-containing genes show non-collinearity.

In order to be more specific, the NBS-LRR and LRR-containing genes within the Rlm QTL were

the great of interest for the comparison. Twelve NBS-LRR and 18 LRR-containing genes were

identified to be located within the Rlm QTL. There were slight differences in the gene number

between the chromosomes A7 of B. napus and B.rapa as well as chromosomes A7 and C6 of B.

napus (Table 5.5). These identified resistance genes could be Rlm9, Rlm4, Rlm, Rlm3 or Rlm1

candidates for blackleg disease resistance. However, because B. napus cv. Darmor is carrying Rlm9

gene, it is expected that the comparison should be informative within the Rlm9 QTL rather than

other Rlm QTL.

The comparison of NBS-LRR and LRR-containing genes within the Rlm QTL between

chromosome A7 in B. napus and B. rapa showed that 10 and 17 genes of NBS-LRR and LRR-

containing genes in B. napus were orthologous and conserved with B. rapa. Three NBS-LRR genes

showed substantial differences with 85, 84 and 87% amino acid identity, but one LRR-containing

genes showed substantial differences with 98% amino acid identity and 50% sequence aligned

coverage. Two NBS-LRR genes were absent from the A7 B. rapa (Figure 5.2 and 5.3).

To find the homeologous relationship between genes on chromosomes A7 and C6, the NBS-LRR

and LRR-containing genes within the Rlm QTL between the two homeologous chromosomes: A7

and C6 in B. napus were compared. Seven NBS-LRR genes were homeologous. Four NBS-LRR

124

gene were conserved with 96, 95, 93, 95% amino acid identity. Three genes showed substantial

differences with 88, 84 and 80% amino acid identity. There were five genes on chromosome A7

that were not homeologous. Only one gene was not collinear (Figure 5.2). Likewise, Seven LRR-

containing genes were homeologous with two genes on chromosome A7 that were corresponding to

two duplicates copies on chromosome C6. These genes showed high conservation with >94%

amino acid identity and 90% amino acid identity. Two genes were not collinear (Figure 5.3). There

were 11 genes on chromosome A7 that were not homeologous.

Above all, there were considerable variations, collinearity, presence/absence or substantial

differences in the protein sequence of several NBS-LRR and LRR-containing genes within the Rlm

QTL region, particularly in the Rlm9 QTL of interest. This comparison confirmed that the genes in

the Rlm9 QTL did not show good conservation and chromosome collinearity. For example, the

observed differences within the Rlm QTL between chromosome A7 of B. rapa and B. napus

indicated that there were three NBS-LRR genes and five LRR-containing genes, located in the Rlm9

QTL of interest that showed non-collinearity. There was also one NBS-LRR gene that was absent

from B. rapa and three NBS-LRR genes that showed substantial protein sequence differences.

Similarly, within the Rlm QTL between chromosome A7 and C6 in B. napus, there was one NBS-

LRR gene and three LRR-containing genes showing non-collinearity. Three and four NBS-LRR

LRR-containing genes, respectively, were not homeologous.

125

Figure ‎5.2: Comparative chromosomal map of NBS-LRR genes on chromosomes A7 and C6

between B. napus (Bn), B. rapa (Br) and B. oleracea (Bo). The squares and triangles represent

NBS-LRR; TNL and CNL, respectively. Orthologous genes are connected by lines. Genes within

circles are clustered. The numbers represent the amino acid alignments with alignment coverage in

bracket. Different colours represent different domain structures of TNL and CNL genes: blue (TIR-

NBS-LRR), red (CC-NBS-LRR), yellow (TIR- NBS or CC-NBS), green (NBS-LRR), orange

(NBS) and purple (TIR).

126

Figure ‎5.3: Comparative chromosomal map of LRR-containing genes on chromosomes A7 and C6

between B. napus (Bn), B. rapa (Br) and B. oleracea (Bo). The yellow and green semicircles

represent LRR-RLP and LRR-RLK genes, respectively. Orthologous genes are connected by lines.

Genes within circles are clustered. The numbers represent the alignment coverage with amino acid

alignments in brackets

127

5.4.3 Haplotype analysis of Rlm9 and mapping potential SNPs in B. napus

A 60 K Illumina Infinium SNP genotyping assay was performed for GWAS analysis to determine

the SNP association between the B. napus Darmor (Rlm9 resistant) genotype and B. napus Yudal

(Rlm9 susceptible) for blackleg resistance. This allows identification of any haplotypes associated

with Rlm9 resistance genes located in the QTL region on chromosome A7. A total of 26 B. napus

cultivars with different Rlm gene composition were also included in the assay, phenotypic data for

all cultivars was based on resistance and susceptibility to the blackleg disease reported in the

literature (Marcroft et al., 2012a).

The haplotype analysis of the 60K SNPs genotyping data was analysed based on the Rlm9 resistant

and susceptible B. napus cultivars, Darmor and Yudal, respectively. A total of 813 out of 2467

SNPs on chromosome A7 were polymorphic between the cultivars Darmor and Yudal. The majority

of SNPs on chromosome A7 did not show significant correlation with the resistance genotype

between different B. napus cultivars with different Rlm genes. However, a set of 10 SNPs with good

correlation patterns were observed to distinguish the resistance and susceptible phenotype within a

region on chromosome A7 (Table 5.6).

These SNPs showed 100% association to the Rlm9 resistant phenotype in all cultivars predicted to

carry the Rlm9 gene, except for the cultivar Ag-Spectrum. Ag-Spectrum, one of cultivars predicted

to be carrying Rlm9, showed association with the Rlm9 susceptible phenotype (Table 5.6). In

contrast, these ten SNPs did not show 100% association to the Rlm9 susceptible phenotype between

all cultivars predicted not to be carrying the Rlm9 gene. For example, Dunkeld, Argyle, Narendra,

ATR-Summit, and Taparoo showed the SNP genotype associated with the Rlm9 resistant

phenotype, even though they are classified as susceptible cultivars (Table 5.6).

Physical location and annotation of the SNP positions on the B. napus Darmor revealed that these

SNPs were located on chromosome A7 within the region between 12,606,759 bp to 14,533,279 bp

(Table 5.7). This region contained 85 SNPs being assayed, but only these ten SNPs identified in

this study were showing a good association to Rlm9 resistant phenotype. Two of them, SNPs (Bn-

A07-p9079474 and Bn-A07-p9791002), were located in predicted genes on A7. These were

annotated as zinc finger protein transcription factors and receptor protein kinase-like protein (LRR-

RLK), respectively.

128

Table ‎5.6: The potential SNPs on chromosome A7 with a good correlation to the Rlm9 gene and association with disease and susceptibility in B. napus

cultivars. AA allele refers to Darmor resistant allele and BB allele refers to Yudal susceptible allele. NA indicates a no genotype call

Resistance ( Rlm9 positive)

Susceptibale ( Rlm9 negative)

Unknown

SNP ID

Dar

mo

r

Skip

ton

Bra

vo-T

T

Torn

ado

-TT

ATR

-Sti

ngr

ay

ATR

_Co

bb

ler

Ag-

Spe

ctru

m

Y

ud

al

Du

nke

ld

Nu

ren

dra

Telf

er

Thu

nd

er-

TT

ATR

-Su

mm

it

Mo

nty

_02

8D

H

Tap

aro

o

Car

ou

sel

Surp

ass4

00

_0

24

DH

Arg

yle

B

2 B

. nap

us

"Lyn

x_0

37

DH

"

C2

B. n

apu

s "N

ingy

ou

7"

RSO

Trilo

gy

Trig

old

Bo

om

er

Bn-A07-p8531068 AA AA AA AA AA AA BB BB AA AA BB BB AA AA AA BB AA AA BB AA AA BB BB BB

Bn-A07-p8854569 AA AA AA AA AA AA BB BB AA AA BB BB AA BB AA BB BB AA AA AA AA BB BB BB

Bn-A07-p8745024 AA AA AA AA AA AA BB BB AA AA BB BB AA AA AA BB BB AA BB BB AA BB BB BB

Bn-A07-p8775069 AA AA AA AA AA AA BB BB AA AA BB BB AA BB AA BB BB AA BB NC AA BB BB BB

Bn-A07-p8941260 AA AA AA AA AA AA BB BB AA AA BB BB AA BB AA BB BB AA AA AA AA BB BB BB

Bn-A07-p9020570 AA AA AA AA AA AA BB BB AA AA NA NA AA BB AA NA NA AA AA AA AA BB BB NA

Bn-A07-p9079474 AA AA AA AA AA AA BB BB AA AA BB BB AA AA AA BB BB AA AA AA AA BB BB BB

Bn-A07-p9791002 AA AA AA AA AA AA BB BB AA AA BB BB AA BB AA BB BB AA AA AA BB BB BB BB

Bn-A07-p9831545 AA AA AA AA AA AA BB BB AA AA BB BB AA BB AA BB BB AA AA BB BB BB BB BB

Bn-A07-p10347794 AA AA AA AA AA AA BB BB AA AA BB BB AA AA BB BB AA AA BB AA BB BB BB BB

129

Table ‎5.7: Mapping and annotation of the potential SNPs identified in B. napus cv. Darmor

SNP ID SNP B. napus Darmor v6

origin SNP

SNP in Darmor

location of SNP Predicted gene Blast hit chro

position top hit

Bn-A07-p8531068 [A/C] chrA07 12,606,759 A A Intergenic region GSBRNA2G00085931001 and GSBRNA2G00085930001

No hit

Bn-A07-p8854569 [T/C] chrA07 12,855,583 T C Intergenic region GSBRNA2G00030498001 and GSBRNA2G00030500001

No hit

Bn-A07-p8745024 [A/C] chrA07 13,039,112 A C Intron GSBRNA2G00112258001 or chrA07-snap.3025 and 3026

nodulin family protein

Bn-A07-p8775069 [T/C] chrA07 13,075,780 T T Intron GSBRNA2G00112264001 or chrA07-snap.3045

protein disulfide isomerase

Bn-A07-p8941260 [A/C] chrA07 13,262,125 A A 3’ UTR chrA07-snap.3091 putative pol polyprotein

Bn-A07-p9020570 [T/C] chrA07 13,325,277 T C 5’ UTR GSBRNA2G00112311001 or chrA07-snap.3110

small nuclear ribonucleoprotein helicase-like

Bn-A07-p9079474 [A/G] chrA07 13,391,183 A A Exon GSBRNA2G00112326001 or chrA07-snap.3130

zinc finger protein transcription factors

Bn-A07-p9791002 [T/C] chrA07 13,925,780 T T 5’ UTR chrA07-snap.3302 and 3303 receptor protein kinase-like protein

Bn-A07-p9831545 [T/C] chrA07 13,956,273 T T Intergenic region GSBRNA2G00112433001 and GSBRNA2G00112434001 or chrA07-snap.3307 and 3308

No hit

Bn-A07-p10347794 [A/G] chrA07 14,533,279 A G 5’ UTR chrA07-snap.3457 No hit

130

5.4.4 The identification of candidate Rlm9 blackleg resistance genes in B. napus

An Rlm9 QTL was identified on chromosome A7 through a marker based approach where linked

markers were physically mapped to the reference B. napus genome. The Rlm9 QTL was roughly

spanning 5 Mbp in the B. napus Darmor chromosome A7 from 13,488,533 Mbp to 19,031,520

Mbp. In addition, the potential ten SNPs, identified from haplotype analysis, were physically

mapped to B. napus Darmor chromosome A7 located near or within the Rlm9 QTL region (Figure

5.4). The genome wide identification of disease resistance genes enabled the identification of six

NBS-LRR and eight LRR-containing genes co-localized with the Rlm9 QTL region.

131

Figure ‎5.4: Mapping based identification of the Rlm9 candidate gene using marker physical mapping, NBS LRR genome wide identification and SNP

haplotypes. Different colours represent different gene types: blue (TNL), red (CNL), green (LRR-RLKs) and yellow (LRR-RLPs). The orange triangles

represent the identified SNPs associated with Rlm9 resistance. Genes inside the black rectangles are in a gene cluster. The genes inside red rectangle

are within the Rlm QTL.

132

The following results describe the comparative analysis of six NBS-LRR candidate genes (Table

5.8).

5.4.4.1 Candidate gene BnDRG 13

The BnDRG 13 gene has a coding region consisting of seven exons of 2613 bp in length encoding a

870 aa protein This gene is a typical CC-NBS-LRR (CNL) consisting of the three domains: CC,

NBS and LRR. It has no homeologous copy in the C genome of B. napus cv. Darmor. The gene is

orthologous and in a collinear position in B. rapa chromosome A7 and showed 99% amino acid

identity (Table 5.8 and Figure 5.5). The corresponding gene in B. rapa is lacking the CC domain.

The gene is also present and in a collinear position in B. napus cv. Tapidor and showed 99% amino

acid identity. The gene in B. napus cv. Tapidor lacks half of the CC domain (about the first 60 aa)

(Table 5.8 and Figure 5.5). The gene had high coverage of Ag-Spectrum sequence reads of 80%

with 98% DNA sequence identity, whereas Skipton showed low read coverage of about 5% (Table

5.9 and Figure 5.11).

5.4.4.2 Candidate gene BnDRG 14

The BnDRG 14 gene has a coding region consisting of five exons of 2244 bp in length encoding a

754 aa protein. This gene is a typical CNL. It has no homeologous copy in the C genome of B.

napus cv. Darmor. The gene is orthologous and in a collinear position in B. rapa chromosome A7

and showed 99% amino acid identity in the NBS and LRR domain but showed substantial sequence

variation at the CC domain (Table 5.8 and Figure 5.6). The gene is also present in a collinear

position in A7 B. napus cv. Tapidor. Most of the gene shared 100% amino acid identity except a

region of about 60 aa where there was substantial difference in the CC domain (Table 5.8 and

Figure 5.6). The gene had high coverage of Skipton sequence reads (78%) with 98% DNA

sequence identity, whereas Ag-Spectrum showed low read coverage of about 5% (Table 5.9 and

Figure 5.11).

5.4.4.3 Candidate gene BnDRG 17

The gene BnDRG 17 has a coding region consisting of two exons of 1296 bp in length encoding a

431 aa protein. This gene is a truncated TNL consisting of two domains TIR and NBS and lacking

the LRR domain. It has a homeologous copy as a duplicate on chromosome C6 of B. napus cv.

Darmor. This duplicate copy is also truncated and not in a collinear position with 95% amino acid

identity (Table 5.8 and Figure 5.8). The gene is orthologous and not in a collinear position on

chromosome A7 in B. rapa with 97% amino acid identity (Table 5.8 and Figure 5.7). It is missing

the first 23 aa of the TIR domain in B. napus Darmor compared to B. rapa. The gene seems to be

133

present in an unplaced chromosome of B. napus cv. Tapidor, but, it was not possible to compare the

entire sequence similarity because of presence of N-gapped sequence in B. napus Tapidor. There

were no Skipton or Ag-Spectrum sequence reads for this gene.

5.4.4.4 Candidate gene BnDRG19

The gene BnDRG19 has a coding region consisting of three exons of 2763 bp in length encoding a

920 aa protein. This gene is a truncated TNL lacking the TIR domain and contains the NBS and

LRR domains. It has a homeologous copy as duplicate on chromosome C6 of B. napus cv. Darmor.

This duplicate copy is also truncated and not in collinear position with 96% amino acid identity

(Table 5.8 and Figure 5.8). The gene is orthologous and not in a collinear position in B. rapa on

chromosome A2 with 97% amino acid identity (Table 5.8 and Figure 5.8). The corresponding

gene in B. rapa is a typical TNL. The gene is also present on chromosome A7 B. napus cv. Tapidor

and showed 95% amino acid identity with a deletion of 30 aa in the NBS domain (Table 5.8 and

Figure 5.8). There was high coverage of Skipton and Ag-Spectrum sequence reads of about 99%

and 86% of the gene with 100% and 98% DNA sequence identity, respectively (Table 5.9 and

Figure 5.11).

5.4.4.5 Candidate gene BnDRG20

The gene BnDRG20 has a coding region consisting of four exons of 2979 bp in length encoding a

992 aa protein. This gene is a typical TNL containing all three domains: TIR, NBS and LRR. It has

a homeologous copy as a duplicate on chromosome C6 of B. napus cv. Darmor. This duplicate copy

is also a typical TNL and not in a collinear position and showed 88% amino acid identity (Table 5.8

and Figure 5.9). The gene is orthologous and not in a collinear position on chromosome A2 in B.

rapa and showed 98% amino acid identity (Table 5.8 and Figure 5.9). This gene is not present in

B. napus cv. Tapidor. There was high coverage of Skipton sequence reads covering 96% with 100%

DNA sequence identity, whereas, the Ag-Spectrum sequence reads coverage was about 55% of the

gene with 99% DNA sequence identity (Table 5.9 and Figure 5.11).

5.4.4.6 Candidate gene BnDRG 23

The gene BnDRG23 has a coding region consisting of six exons of 2346 bp in length encoding a

781 aa protein. This gene is a truncated CNL lacking the CC domain and consisting of the NBS and

LRR domains. It has no homeologous copy in the C genome of B. napus cv. Darmor. This gene is

not present in B. rapa. The gene is present on chromosome A7 of B. napus cv. Tapidor and showed

81% amino acid identity (Table 5.8 and Figure 5.10). There was high coverage of Skipton and Ag-

134

Spectrum sequence reads of about 90% and 98% of the gene with 100% DNA sequence identity,

respectively (Table 5.9).

5.4.5 Amplification and sequencing of Rlm9 candidate genes

The comparative analysis of NBS-LRR genes gave an initial indication that these genes could be

candidate Rlm9 genes because of their conservation or sequence variation between B. napus Darmor

(Rlm9 resistant) and B. rapa.

In order to validate these candidate genes, individual genes were amplified and sequenced using

primer pairs designed based on B. napus cv. Darmor sequence as a reference. In this study,

sequencing of six Rlm9 candidate genes is being attempted in B. napus Darmor and Yudal, resistant

and susceptible cultivars, respectively. It was very difficult to develop gene specific primers

because the B. napus genome is polyploid, highly duplicated and sequence similarity between the A

and C genome exists extensively between homeologous chromosomes.

BnDRG 13, BnDRG 14 and BnDRG 23 were successfully amplified and completely sequenced the

gene from B. napus cv. Darmor and c.v Yudal and no polymorphism was observed. I did not test

these on other B. napus cultivars because no polymorphism was observed. The two potential genes:

BnDRG 19 and BnDRG 20 were successfully amplified and sequenced complete gene sequence

from different B. napus cultivars: Darmor, Yudal, Skipton, Tapidor, Jet Neuf and, but no

polymorphism was observed. These two genes: BnDRG 19 and BnDRG 20 failed to amplify from

B. napus Ag-Spectrum and Westar. The presence of the BnDRG 20 gene in B. napus cv. Tapidor

was confirmed and showed 100% identity even though the comparative analysis of BnDRG 20

showed that was not present in B. napus Tapidor genome (Table 5.8). The amplification of the

candidate BnDRG 17 gene was not successful with any of B. napus cultivars.

135

Table ‎5.8: Comparative analysis of NBS-LRR Rlm9 candidate genes between B. napus cv. Darmor, B. napus cv. Tapidor and B. rapa. The sequence

identity is based on protein sequence alignment

Rlm9 gene in B. napus Darmor

A genome v6 B. napus Darmor C genome B. rapa v1

B. napus Tapidor A genome

Gene position Gene Identity (aligned coverage)

position Gene Identity (aligned coverage)

position Gene Identity (aligned coverage)

Position in Tapidor v4

BnDRG 12 870aa

14,052,790 Not present XA07-Bra012116 693aa

99 (80)

9,918,855 chrA07-snap.3272 805aa

99(94)

8,509,720

BnDRG 14 754aa

14,057,850 Not present XA07-Bra012114 735

84(100) 9,930,135 chrA07- snap.3274 755aa

100(90)

8,504,784

BnDRG 17 431aa

16,055,043 chrC06 GSBRNA2T00-049133001 432aa

95(100)

30,798,375 XA07-Bra003864 454aa

97(94)

15,441,277 unplaced

98 % of aligned sequence (not possible to check the entire gene due to N-gapped sequence in Tapidor)

BnDRG 19 920aa

16,765,735 chrC06 GSBRNA2T00-057893001 918aa

96(100)

19,526,985 XA02-Bra028499 1077aa

97( 85)

696,153 chrA07- snap.3935 921aa

95(100)

10,3013,17

BnDRG 20 992aa

16,783,435 chrC06 GSBRNA2T00-057889001 933aa

88(90)

19,487,657 XA02-Bra028500 992aa

98(100)

714,674 Not present

BnDRG 23 781aa

17,769,448 Not present Not present chrA07- snap.4091 905aa

81(100)

10,9775,76

136

Table ‎5.9: The coverage of Skipton and Ag-Spectrum sequence short reads of candidate Rlm9 genes on B. napus. The identity is based on gene

sequence alignment

Rlm9 gene in

B. napus Darmor

Sequence Reads B. napus Skipton

Sequence Reads B. napus Ag-Spectrum

Gene Coverage % Identity% Coverage % Identity%

BnDRG 13 3635bp

Very low coverage 5% 80% 98

BnDRG 14 2942bp

78% 98 Very low coverage 5%

BnDRG 17 1296 bp

No reads mapped to this gene

BnDRG 19 3183bp

99% 100 86% 99%

BnDRG 20 4029bp

96% 100 55% 99

BnDRG 23 2655bp

90 100 98 100

137

Figure ‎5.5: Comparison of deduced amino acid sequences of BnDRG 13 in B. napus Darmor A7 to its homologous genes in B. napus Tapidor and B.

rapa.

138

Figure ‎5.6: Comparison of deduced amino acid sequences of BnDRG 14 in B. napus Darmor A7 to its homologous genes in B. napus Tapidor A7 and

B. rapa.

139

Figure ‎5.7: Comparison of deduced amino acid sequences of BnDRG 17 in B. napus Darmor A7 to its homologous genes in B. napus Darmor C6 and

B. rapa.

140

Figure ‎5.8: Comparison of deduced amino acid sequences of BnDRG 19 in B. napus Darmor A7 to its homologous genes in B. napus Darmor C6, B.

napus Tapidor A7 and B. rapa.

141

Figure‎5.8:‎continued‎….

142

Figure ‎5.9: Comparison of deduced amino acid sequences of BnDRG 20 in B. napus Darmor A7 to its homologous genes in B. napus Darmor C6 and

B. rapa.

143

Figure ‎5.10: Comparison of deduced amino acid sequences of BnDRG 23 in B. napus Darmor A7 to its homologous genes in B. napus Tapidor A7.

144

Figure ‎5.11: Assembly of B. napus Skipton and Ag-Spectrum sequence short reads to B. napus

Darmor reference genome within the Rlm9 candidate resistance genes: BnDRG 12, BnDRG 13 ,

BnDRG 18 and BnDRG 19.

145

5.5 Discussion

5.5.1 Identification and characterization of NBS-LRR and LRR-containing genes on B.

napus chromosome A07

The fully sequenced B. napus genome has provided an invaluable resource in the identification and

comparative analysis of disease resistance genes. In total, 58 disease resistance genes were

identified on chromosome A7 of B. napus: 19 NBS-LRRs, 15 LRR-RLPs and 24 LRR-RLKs. The

NBS-LRR and LRR-containing genes have been demonstrated to play a major role in resistance to a

wide range of plant pathogens (Hulbert et al., 2001). In addition, the association and correlation

between disease resistance genes and QTL regions has been widely noted (Gebhardt and Valkonen,

2001, Quint et al., 2003, Wisser et al., 2005). This is suggesting that candidate resistance genes, co-

localized within QTL, are probably major genes contributing to qualitative resistance. In this study,

30 out of 58 genes in chromosome A7 were located with Rlm QTL. This region is of interest

harbouring five race specific blackleg resistance genes, Rlm1, Rlm3, Rlm4, Rlm7 and Rlm9.

It has been shown that blackleg resistance genes in B. napus such as Rlm1, Rlm3, Rlm4, Rlm7 and

Rlm9 are mapped to a genomic loci that correspond to the chromosome segment on Arabidopsis

harbouring AtRlm1 (Mayerhofer et al., 2005). AtRlm1 and another two genes, AtRlm2 and AtRlm3

have been identified as Arabidopsis thaliana Resistance to Leptosphaeria maculans gene. These

genes encode NBS-LRR proteins (Staal et al., 2006, Staal et al., 2008). Therefore, it could be

hypothesized that L. maculans resistance genes are related to the NBS-LRR protein family.

5.5.2 Comparative analysis of NBS-LRR and LRR-containing genes on chromosome A7

The identification of NBS-LRR and LRR-containing genes enabled the study of synteny,

collinearity and conservation between genes on chromosomes A7 and C6 with their homologous

counterparts in B. rapa and B. oleracea. The number of NBS-LRR and LRR-containing genes

between the three species were consistent suggesting that they might have been maintained and not

affected much during the formation of B. napus.

This comparison showed that the majority of genes were conserved and collinear between the

chromosomes A7 and its homeologous chromosome C6 in Brassica species: B. napus, B. rapa and

B. oleracea. However, there were considerable variations, collinearity, presence/absence or

substantial differences in the protein sequence of several NBS-LRR and LRR-containing genes.

This might be due to genome mis-assembly or genomic rearrangements. The genomic

rearrangements may imply genome structural changes such as translocations.

146

The LRR-RLP and LRR-RLK genes were more highly conserved than the NBS-LRR genes. This is

because NBS-LRR genes are often under diversifying selection, producing highly divergent

sequences and structural variants with distinct recognition capacities (Bai et al., 2002, Meyers et al.,

2003). By contrast, LRR-containing genes are under evolutionary pressure to maintain a specific

function (Fritz-Laylin et al., 2005). It has been hypothesized that LRR containing genes can play a

role in plant defence, as well as other biological processes. This makes them less likely to be

undergo diversifying selection than NBS-LRR genes (Leister, 2004, Guo et al., 2011). However,

such high sequence conservation in the LRR-containing genes does not rule out that LRR-

containing genes could be a potential candidate resistance gene. There are several examples

involving the LRR containing proteins in the plant resistance defence. For example, Cf genes in

tomato confer resistance against leaf mould disease caused by Cladosporium fulvum (Wulff et al.,

2009). LepR3 in B. rapa confers resistance to blackleg disease caused by L. maculans carrying

AvrLm1 (Larkan et al., 2013)

Furthermore, comparative analysis of NBS-LRR and LRR-containing genes between chromosome

A7 of B. napus and B. rapa as well as chromosomes A7 and C6 of B. napus were utilized to find

out how the structural diversity of resistance genes are associated with blackleg disease resistance.

A total of 12 NBS-LRR and 18 LRR-containing genes on B. napus chromosome A7 were identified

and located within the Rlm QTL region of interest.

The comparative analysis of resistance genes in the Rlm QTL region between B. napus and B. rapa

chromosome A7, as well as between B. napus chromosomes A7 and C6 confirmed the nature of

gene conservation with regards to gene order, content and sequence similarity. Even though there

were considerable variations; either gene presence/absence or substantial differences in the protein

sequence, there was significant structural conservation in the majority of resistance genes between

chromosome A7 of B. napus and B. rapa. In contrast, there was not high structural conservation

between chromosomes A7 and C6 of B. napus. For instance, more than 90 % of the resistance genes

have been retained in chromosome A7, whereas, about 50% of the resistance genes were found to

be duplicated between chromosomes A7 and C6. A previous studies (Fopa Fomeju et al., 2014)

found that the genomic region involved in resistance to blackleg in B. napus is expected to be linked

to the significant structural conservation between B. napus and B. rapa, but not between B. napus

and B. oleracea, because the source of blackleg resistance is only contributed by the A genome and

none has been found on the C genome (Rouxel et al., 2003b, Ananga et al., 2006, Leflon et al.,

2007). Therefore, it can be hypothesized that the resistance gene involved in blackleg resistance is

supposed to be retained in B. napus and B. rapa, however, some genome rearrangements or

147

sequence variation might have occurred during the formation of B. napus. Several candidate

resistance genes were identified underlying the Rlm QTL region of blackleg resistance on B. napus

chromosomeA7. These genes were orthologous and conserved with chromosome A7 of B. rapa,

but, they were not homeologous or showed substantial protein differences on B. napus chromosome

C6.

This study revealed that there is high functional conservation of resistance genes on chromosome

A7 of B. napus and B. rapa compared to chromosomes A7 and C6 of B. napus by Rlm QTL

comparison. This information provides an invaluable resource to identify a number of candidate

genes associated with blackleg disease resistance and further characterisation of the Rlm QTL

region is likely to resolve whether the various Rlm genes are distinct or different allelic versions of

the same blackleg resistance genes.

5.5.3 Haplotype analysis of Rlm9 and mapping potential SNPs on B. napus

The Illumina Infinium SNP data was used to study the association between the SNP alleles and

blackleg resistance. This allows identification of or narrowing down the list of candidate genes

underlying the blackleg resistance QTL region. In this study, the haplotype analysis revealed that

the SNPs showed a random pattern of polymorphism and was not perfectly showing association

between SNPs and the Rlm9 resistance and susceptibility. However a set of ten SNPs with a good

correlation towards Rlm9 resistance were identified. Five of the six cultivars predicted to be Rlm9

resistance were shown to carry SNP alleles that strongly correlated to Rlm9 resistance allele of cv.

Darmor at the SNPs assayed. Ag-Spectrum, one of the predicted resistant lines, had no Darmor

resistant allele in any of the selected SNP. But, it revealed an allele that was consistent with the

susceptible lines. This is probably because Ag-Spectrum is heterozygous for resistance towards the

Rlm9 gene (Raman et al., 2012b). In contrast, five of the ten cultivars predicted to be Rlm9

susceptible were shown to carry alleles that associated with the Rlm9 susceptible allele in Yudal at

the SNP assayed. However, there were five cultivars showed a good correlation to the resistant line

by having a Darmor resistant allele even though they are predicted to be Rlm9 susceptible.

It is could be assumed that there was misinterpretation of the phenotypic data for these cultivars due

to subjectivity or errors during the phenotyping process. It is often difficult to determine how much

of phenotypic variation is explained in the literature. In addition, the Rlm9-mediated response is

heat-sensitive. Therefore, the phenotypic analysis is particularly difficult to analyse and often

conflicting results are found in the literature (J. Batley, personal communication). Thus, it is highly

recommended to re-phenotype these cultivars to determine the true results in the future.

148

A number of the identified SNPs were located in the 3’ UTR and 5’ UTR of predicted genes. This

might suggest the involvement of transcription factors or regulatory factors in Rlm9 resistance.

Furthermore, sequence analysis of the predicted genes with these potential SNPs is required to

reveal whether there are any predicted genes that could be involved or work together with the

candidate Rlm9 genes.

5.5.4 The physical location and identification of candidate Rlm9 blackleg resistance gene in

B. napus

An Rlm9 QTL has been identified on chromosome A7 through a marker based approach where

linked markers were physically mapped to the reference B. napus genome (Delourme et al., 2004).

Based on the 60K SNP haplotype analysis, the potential ten SNPs, identified were mapped and

positioned along the Rlm9 QTL region on the chromosome A7. This result further strengthens the

hypothesis that Rlm9 lies somewhere within the targeted QTL region.

The Rlm QTL region on chromosome A7 co-localized with 12 NBS-LRR and 18 LRR-containing

genes in B. napus. These resistance genes could be candidates for blackleg disease resistance.

However, The A7 chromosome harbours several Rlm QTL regions underlying blackleg disease

resistance, where a cluster of blackleg resistance genes reside such as Rlm9, Rlm4, Rlm7, Rlm3 and

Rlm1 as identified previously (Delourme et al., 2004). In this study, this comparative analysis

confirmed that there were genes within the Rlm QTL region, particularly in Rlm9 QTL of interest

showed considerable variations, collinearity, presence/absence or substantial differences in the

protein sequence.

Based on these results, therefore, six NBS-LRR and eight LRR-containing genes were identified

and associated with the Rlm9 QTL region. The NBS-LRR genes were selected for further analysis.

The genes are referred to as BnDRG 13 BnDRG 14, BnDRG 17, BnDRG 19, BnDRG 20 and

BnDRG 23. Comparative analysis has made it possible to further investigate the structural diversity

in terms of protein sequence conservation or presence/absence between B. napus cv. Darmor and

other related species such as B. napus cv.Tapidor and B. rapa. This allowed initial prediction and

prioritization for targeting these candidate genes. The BnDRG 13 and BnDRG 14 genes were

considered as possible candidate genes because they were highly conserved and showed structural

difference in the CC domain between B. rapa and B. napus. They are presents as singleton gene in

B. napus without a homeologous (duplicate copy) in the B. napus C genome.

149

Since blackleg resistance was found to be contributed by the A genome of B. napus and B. rapa,

this implies that the resistance gene might be present with sequence variation in both species. The

BnDRG 23 was not considered as strong candidate gene because it is not present in the A genome of

B. rapa. The genes BnDRG 17, BnDRG 19 and BnDRG 20 were prioritized as the most promising

candidate genes. The genes BnDRG 19, BnDRG 20 were not syntenic and non-collinear between B.

napus Darmor and B. rapa. The gene BnDRG 19 was highly conserved and showed structural

difference as the TIR domain is present in the B. rapa but not in the B. napus gene. The gene

BnDRG 19 was highly conserved in B. rapa and showed substantial sequence variations in the B.

napus C genome duplicate. This gene was initially not found in B. napus cv. Tapidor. The genes

BnDRG 19 and BnDRG 20 had high sequence read coverage in Skipton and low sequence reads

coverage in Ag-Spectrum. The low sequence reads coverage in Ag-Spectrum may be due to

significant sequence variation or loss. It may also be explained by the fact that Ag-Spectrum is

heterozygous for resistance towards Rlm9 gene.

The gene BnDRG 17 was highly conserved between B. napus Darmor and B. rapa but not collinear.

This gene was found in B. napus cv. Tapidor but due to genome assembly issue of presence of N-

gapped sequence, it was not possible to compare the gene sequence between the cultivars Darmor

and Tapidor. In addition, this gene had no sequence read coverage for both Skipton and Ag-

Spectrum. Preliminary analysis of the genomic region where BnDRG 17 resides is believed to be a

complex region in Darmor as the assembly is not accurate. However, since sequencing of the

BnDRG 17 gene is yet to be obtained, it is still as a strong candidate Rlm9 gene possible and efforts

to fully sequence the gene should be continued.

In conclusion, the comparison of NBS-LRR and LRR-containing genes between the four

chromosomes for the three species: B. napus, B. napus and B. oleracea not only confirmed genomic

microsynteny and conservation between Brassica species but, also provided an invaluable resource

in the identification and prioritization of candidate resistance genes

5.5.5 Amplification and sequencing Rlm9 candidate genes

In this study, sequencing of four Rlm9 candidate genes was attempted to identify polymorphisms in

the identified Rlm9 candidate genes between the resistant and the susceptible cultivars. The

complexity of the QTL region of interest due to genome duplication and rearrangement events has

complicated the PCR amplification. It may be worthwhile in future to use different PCR approaches

in amplification and sequencing the Rlm candidate genes such as RACE-PCR or even advanced

150

approaches such as targeted sequence capture (Salmon et al., 2012, Winfield et al., 2012) or

resistance gene enrichment sequencing (Jupe et al., 2013).

Despite that, BnDRG 13, BnDRG 14 and BnDRG 23 candidate genes were successfully amplified

which revealed that there was no sequence polymorphism between two contrasting Rlm9 cultivars,

Darmor and Yudal. This clearly indicates that these genes are not a candidate Rlm9 gene.

The sequence for BnDRG 19 and BnDRG 20 genes between two contrasting Rlm9 cultivars,

Darmor and Yudal and other B. napus cultivars such as Skipton, was successful and showed 100%

identity without any polymorphism observed. However, BnDRG 19 and BnDRG 20 failed to

amplify from B napus Ag-Spectrum. This is supported by the low sequence reads coverage

(presence gapped sequence) which might have caused failure of amplification of Ag-Spectrum. I

believe that it is too early to make a deduction either to eliminate or to strongly nominate these

genes BnDRG 19 and BnDRG 20 from the blackleg candidate gene list. These two genes might be

of interest to be further investigated between other Rlm cultivars such as Rlm4. The Rlm9 QTL

region is believed to overlap or be adjacent to the Rlm4 QTL. These results confirmed the findings

of Delourme et al., (2004) stating that the QTLs for Rlm4 and Rlm9 are located next to each other.

The BnDRG 20 gene was not initially present in B. napus cv. Tapidor but that may be due to an

assembly problem, making the mapping and sequence comparison impossible. However, it was

confirmed by PCR and sequencing that this gene is present in B. napus cv. Tapidor with 100%

DNA sequence identity to B. napus cv. Darmor. This indicates that there is missing sequence

information resulting from the genome mis-assembly.

There is still one candidate gene BnDRG 17 that amplification has been attempted for, but has been

unsuccessful. This may be due to the assembly issue. This gene is still strong candidate gene to be

considered for future sequencing.

While sequencing and validation of these candidates must still occur, this study has proven that

candidate gene comparative analysis is quick and is becoming more affordable due to the generation

sequencing getting faster and cheaper. This method should occur in the future discovery of genes of

interest in Brassicas and can be followed in other crop species. In conclusion, none of the Rlm9

NBS-LRR candidate genes, that have been sequenced, showed a significant polymorphism

associated with Rlm9. However, it appears that one of LRR-containing genes located on

chromosome A7 is likely to be a candidate. There are eight LRR containing genes identified within

the Rlm9 QTL region that could become a strong prospect for further analysis.

151

The list of disease resistance genes in B. napus chromosome A07 is certainly not complete. The B.

napus cv. Darmor genome sequence still contains unmapped contigs and the presence of N-gapped

sequences. This means that valuable sequence information is missing. I believe that one sequenced

genome does not sufficiently facilitate identification of the total number of disease resistance genes.

This is probably because the resistance genes are often located within unannotated regions of the

genome or they may be interrupted by gapped sequence leading to separation the gene into different

partial genes. There are artefacts from genome assembly and annotation that could potentially cause

identification of gene loss. For instance, the gene BnDRG 20 in the B. napus cv. Darmor genome

assembly was not present in the B. napus cv. Tapidor genome assembly in the comparative analysis.

However, the presence of the gene was confirmed in B. napus cv. Tapidor by PCR in this study.

Moreover, some resistance genes may be present in one genotype or cultivars but absent in another

exhibiting presence/absence variation (PAV). This causes differences in the NBS-LRR genes

number between cultivars. Therefore, the improved and revised genome assembly and re-

sequencing of other B. napus cultivars will definitely result in identification of additional resistance

genes.

152

Chapter 6

General discussion and future

directions

153

Chapter 6 : General discussion and future directions

This thesis aims to provide the foundation for study of the conservation and evolution of disease

resistance genes in Brassica napus. The availability of genome sequences and comparative analysis

together with disease resistance QTL mapping is an invaluable resource to study functional and

structural variations of NBS-LRR resistance genes between Brassica species. Hence, the

comparative analysis between Brassica species is likely to identify number of resistance genes that

affect important disease traits in Brassica. This study provides a good starting point in the effort to

identify and sequence candidate blackleg resistance (Rlm) genes in B. napus.

6.1 Genome wide identification of NBS-LRR resistance genes in Brassica

For the discovery of candidate genes involved in traits of interest, the genome wide identification of

NBS-LRR resistance genes has become an essential component for better understanding of the

genomics and evolution of resistance genes. Thus, the main focus was to characterize the NBS-LRR

genes with regards to their structural diversity, conservation, domain structure, chromosomal

locations, gene cluster and duplications. More importantly, it aimed to identify more genes with

NBS-LRR characteristics that could be potential candidates for disease resistance in Brassica.

In this study, a total of 641 NBS-LRR genes in the B. napus cv. Darmor were identified. This is

compared to 249 and 443 genes identified in B. rapa and B. oleracea, respectively. The

comparative analysis provided evidence that NBS-LRR genes in B. napus genome exhibited similar

distribution, structure and syntenic relationships to its progenitor species. The gene contents and

order on B. napus chromosomes are generally very similar to those on the corresponding

chromosomes of the diploid progenitors. Overall, these comparisons did not show massive

differences of NBS-LRR in B. napus as compared to its parental species.

To address the question as to what extent resistance genes are structurally conserved between

Brassica species, the NBS-LRR genes with their genomic structure was compared. There was

evidence that the 50% of NBS-LRR genes showed high synteny and collinearity between the A

genome of B. napus and B. rapa and the C genome of B. napus and B. oleracea. They were

positionally well conserved in size and orientation particularly regions containing major gene

clusters. Thus, it appears that these NBS-LRR genes in B. rapa and B .oleracea species may have

undergone similar selection pressure and they might have not been largely affected during the

formation of B. napus. In other words, B. napus might have been subjected to less evolutionary

selection or pressure leading to maintaining NBS-LRR properties with its diploid progenitors.

154

The majority of B. napus NBS-LRR genes have orthologues in the progenitor species providing

evidence of evolution and genome duplication. More than 60% of NBS-LRR genes from the

progenitor species were conserved and maintained in B. napus. The differences in NBS-LRR gene

conservation could be attributed to gene losses or selection pressure to offer species-specific

resistance gene. After hybridisation of B. rapa and B. oleracea to form B. napus, NBS-LRR genes

in Brassica species experienced species-specific gene deletion resulting in gene losses. Similar

phenomenon of redundant NBS-LRR gene loss have also been reported in other plant genomes,

presumably because the duplicates would have had redundant functions (Bancroft, 2001, Rana et

al., 2004). At the same time, the gene duplication and recombination have also created novel NBS-

LRR resistance genes specific to B. napus and have resulted in substitutes the gene number to

compensate the gene losses during formation of B. napus.

It has been shown that the level of gene duplication in the B. napus genome is significant and

increases the genomic complexity (Parkin et al., 2003, Rana et al., 2004, Parkin et al., 2005). This is

also seen by the finding that 50% of B. napus NBS-LRR genes were identified as duplicates. The

NBS-LRR duplications exhibited the similar patterns and homeologous collinear between the two

sub-genomes of B .napus.

Previous genetic and genomics studies demonstrated that the majority of NBS-LRR genes are

present in gene clusters in plant genomes (Hulbert et al., 2001). The clustered arrangement of these

genes may be a critical attribute of the generation of novel resistance specificities via gene

duplication or recombination (Meyers et al., 2003). Indeed, previous studies revealed that NBS-

LRR genes are often present as tightly linked genes with high homology and are prone to gene

duplication and recombination, and thus evolve more rapidly than the rest of the genome (Grant et

al., 1998, Wei et al., 2013). Thus, the clustering of NBS-LRR genes provides a reservoir of genetic

variation by which new specificities to pathogens can evolve and allows for rapid evolutionary

response to pathogen effectors. This study found that 59% of the NBS-LRR genes in Brassica

species were found to be physically clustered and individual genes involved in clusters were more

polymorphic and subject to evolutionary selection pressure. This was further supported by the

observation that there was high conservation in genes involved in gene cluster than singleton genes.

The relative contributions of gene clustering and duplication in generating diversity in resistance

gene specificities remain to be further determined. However, duplication and divergent selection

may apparently play a major role in the evolution of resistance to major pathogens. A complex

pattern of tandem duplications was identified in the B. napus genome particularly within a gene

cluster suggesting that gene clusters could be chromosomal hot spots in which the NBS-LRR genes

155

get duplicated and expanded by tandem duplication followed by selection pressure. In addition,

there was a significant correlation and co-localization between the number of NBS-LRR genes

within the QTL intervals and the number of genes involved in gene clusters or duplication. The

results provide evidence that NBS-LRR are distributed and clustered throughout the genome and

tend to be linked and associated with disease resistance QTL. However, because the genomic

duplication reported in this study based on NBS-LRR genes tend to be insufficient to provide a

better resolution about the genomic structure around the NBS-LRR. Therefore, it may be

worthwhile to conduct large-scale NBS-LRR duplication analysis by including predicted genes

flanking the NBS-LRR genes.

To conclude, evidence for resistance gene clustering and duplication within Brassica species should

now be further used to study the sequence and structural variation within disease resistance QTL. It

is possible that clustered and duplicated resistance genes underlying the QTL of interest are likely

to be major components of the resistance and cause resistance gene specificities.

6.2 Identification of candidate blackleg resistance genes in B. napus

Genetic studies have identified the gene for gene interactions between Avr genes in L. maculans and

their corresponding Rlm (Resistance to Leptosphaeria maculans) genes in B. napus. In addition,

genetic mapping studies have shown that in B. napus there are five specific major blackleg

resistance genes: Rlm1, Rlm3, Rlm4, Rlm7, and Rlm9 on chromosome A7 (Delourme et al., 2004,

Mayerhofer et al., 2005, Ananga et al., 2006). It remains unclear whether these genes are a cluster

of tightly linked genes, or a single gene with different alleles, or a combination of both. Even

though QTL mapping of quantitative resistance to L. maculans has been pursued in B. napus, none

of the genetically mapped Rlm genes in chromosome A7 have been sequenced and validated in B.

napus.

The chromosomal region of Rlm QTL for blackleg resistance shows genome complexity. The

physical location of disease resistance QTL or even a major resistance gene in complex genome

such as B. napus is highly challenging. This is complicated by genome duplications and tandem

intra-chromosomal duplication between homeologous chromosomes which make characterization

of genes underlying QTL of interest extremely difficult. For example, several overlapping QTL

intervals for blackleg resistance genes Rlm9, Rlm7, Rlm4, Rlm3 and Rlm1 were genetically

identified on chromosome A7 of B. napus (Delourme et al., 2004). In addition, these QTL are

located within an extensive intra-chromosomal tandem duplication region (Raman et al., 2012b,

Tollenaere et al., 2012). Thus, the identification of candidate blackleg resistance gene(s) in B. napus

156

might also identify other blackleg resistance genes and elucidate their conservation and evolution in

Brassica species.

The resistance gene mediated blackleg resistance is inherited qualitatively. In addition, the source of

blackleg resistance is only contributed by the A genome and none found on the C genome. This is

due to significant structural conservation between B. napus and B. rapa and absence of blackleg

resistance sources in B. oleracea. However, paralogous genes in B. napus should have specific

recognition created by selective pressure to maintain the resistance source origin in the A genome.

Therefore, it can be hypothesized that the resistance genes involved in blackleg resistance should be

retained and conserved in the A genome of B. napus and B. rapa, whereas the resistance gene is

probably absent or highly divergent in the C genome of B. napus and B. oleracea.

In this study, co-mapping of QTL intervals identified in B. napus to its related species, B. rapa and

B. oleracea indicates that the QTL intervals in B. napus were originally contributed by the parental

species. The comparative QTL between B. napus, B. rapa and B. oleracea revealed the majority of

NBS-LRR genes in associated QTL intervals are syntenic and conserved. However, there were

differences in the number of associated NBS-LRR genes and the disease QTL interval sizes

between QTL regions in B. napus compared to its parental species. This suggests that the QTL

underlying resistance traits might show more structural and functional conservation towards one of

the B. napus progenitor species. There was a significant correlation and co-localization between the

number of NBS-LRR genes within the QTL intervals and the number of genes involved in gene

clusters or duplication. This is consistent with previous studies (Jeong et al., 2001, Wang et al.,

2010). The complex structure of QTL can be as a result of resistance gene cluster and duplication

leading to closely linked genes which differ in their pathogen race specificity (Michelmore and

Meyers, 1998). This implies that gene clusters are a good starting point for studying functional

molecular mechanisms and evolution of NBS-LRR resistance genes underlying resistance traits.

The identification of candidate blackleg resistance genes in B. napus by map-based cloning

approaches is a laborious and time consuming effort. Therefore, another approach which can be

undertaken is direct identification of candidate genes through next generation sequencing and whole

genome NBS-LRR analysis. This approach, combined with genome comparative analysis, allows

studying the genomic region for the disease resistance QTL with respect to the NBS-LRR genes

conservation and syntenic relationships between related species. This could facilitate the rapid

selection of putative candidate NBS-LRR genes involving in the blackleg resistance in Brassica.

157

Since chromosome A7 in B. napus was the major of interest in this study carrying major sources of

blackleg resistance genes in Brassica, The comparative analysis enabled me to study the synteny,

collinearity and conservation between NBS-LRR and LRR-containing genes on A7 and C6 with

their homologous counterparts in B. rapa and B. oleracea. In general, there was significant

structural conservation in the majority resistance genes between chromosome A7 of the B. napus

and B. rapa. However, there were considerable variations either as gene presence/absence or

substantial sequence differences for several resistance genes.

Interestingly, of the total NBS-LRR and LRR-containing genes identified on the chromosome A7 of

B. napus, there were 12 NBS-LRR and 18 LRR-containing genes underlying blackleg resistance

Rlm QTL. These resistance genes could be Rlm9, Rlm4, Rlm7, Rlm3 or Rlm1 candidates for

blackleg disease resistance. The Rlm QTL comparison analysis revealed that these genes were

orthologous and conserved in A7 B. rapa, but, they were not homeologous or showed substantial

protein differences in B. napus C6. Furthermore, the comparative analysis confirmed that there were

genes within the Rlm QTL region, particularly in Rlm9 QTL of interest showed considerable

variations, collinearity, presence/absence or substantial differences in the protein sequence

In this study, identification and characterization of Rlm9 candidate genes was attempted. A QTL for

the Rlm9 gene has been identified from a mapping population derived from crossing B. napus cv.

Darmor to cv. Yudal (Delourme et al., 2004). This QTL is located on chromosome A7 of B. napus,

which also contains other resistance genes: Rlm1, Rlm3, Rlm4 and Rlm7. In this study, six NBS-

LRR and eight LRR-containing genes were associated with Rlm9 QTL region. The NBS-LRR genes

were selected for further analysis. These genes are referred to as BnDRG 13 BnDRG 14, BnDRG

17, BnDRG 19, BnDRG 20 and BnDRG 23. The comparative analysis between Brassica species and

cultivars has made it possible to make initial prediction and prioritization for targeting these

identified genes. PCR amplification and sequencing confirmed that five of these candidate genes

showed no polymorphisms associated with Rlm9 resistance/susceptibility. However, there is still

one candidate BnDRG 17 gene considered a potential candidate gene for future sequencing. In

addition, it appears that LRR-containing genes identified within the Rlm9 QTL region could

become a strong prospect for further analysis.

The results of this study have also been used to identify and sequence other blackleg resistance

genes on chromosome A7 in B. napus such as Rlm4 and Rlm1. The sequencing and validation of

these candidate genes is still underway in our laboratory.

158

Although there has been relatively little research on the isolation and sequencing of blackleg

resistance genes in Brassica, this work has significantly increased our understanding of the

resistance genes organization and association in Brassica where several members of NBS-LRR

resistance genes on A7 that could confer resistance to blackleg resistance have been identified.

Furthermore, the comparative analyses yield valuable insight into the understanding of the structure,

evolution and organization of NBS-LRR genes in Brassica genomes.

6.3 Future directions

The identification of blackleg resistance genes in B. napus remains a key objective. To date, the

complete genome sequence of only one B. napus cv. Darmor (carrying Rlm9 resistance gene) is

available and used as reference genome in this study. This is why progress has been steady in

detailed genetic analysis of blackleg resistance even though considerable progress has been made in

genomics analysis of resistance genes between B. napus and its progenitor species. Therefore, it

would be even more useful to sequence B. napus cultivars with different Rlm gene composition in

the near future. However, caution is required to overcome the sequencing and assembly problems

that cause complications in the analysis. I would recommend that future work should be carried out

which will increase the quality and quantity of genomic sequence information within Brassica

species or cultivars providing the opportunity to conduct extensive comparative studies. This

includes:

Chromosome-based sequencing

It has been shown that genomic sequencing approaches can result in an error during the sequencing

or assembly and mis-mapping contigs associated with the nature of B. napus polyploidy. Thus,

alongside application of the genome sequence and NGS, another approach is to be undertaken to

overcome the Brassica genome complexity and mis-assembly problems, such as chromosome-

based sequencing. The analysis of sequence at the single chromosome level is a convenient means

of verifying genome sequence assemblies and increases the quantity of genomic information to

conduct more comparative studies within or between related species (Ruperao et al., 2014) . The

chromosome genomics approach has been successfully adapted in the wheat genome sequencing

(The International Wheat Genome Sequencing Consortium, 2014). Thus, chromosome sequence

comparisons and functional analysis will allow analysing candidate genes organization and

evolution and dissection of the molecular basis of specificity and allelic variations.

159

Copy number variation

Copy number variation (CNV) refers to differences in the number of gene loci between different

genotypes either species or cultivars. This structural variation (SV) is often observed as large

deletions or duplications and occurs from single genes to multi-genic regions. It is likely that

structural variation may play a role in the resistance. There is evidence that structural variation such

as copy number variations is associated with disease resistance and this variation is co-localized

with QTL and resistance genes (Cook et al., 2012, Jamann et al., 2014). Thus, it is possible that

structural variation, such as copy number variation and tandem repeats, rather than the sequence

variation between Brassicas species could play an important role in blackleg

resistance/susceptibility. Therefore, I would suggest future efforts should be devoted to identify any

CNVs associated with resistance genes using re-sequencing approach where sequence reads from

individual B. napus cultivars are assembled to the reference genome. The reference B. napus

Darmor genome could now be used as a template for re-sequencing of NBS-LRR profile from

diverse B. napus cultivars. This, indeed, will provide further motivation to study the effect of gene

duplication or sequence tandem repeats on the disease resistance genes.

160

Bibliography

Alabaça, C. S. 2009. Identification of molecular markers linked to downy mildew resistance gene,

PpALG1 in Brassica oleracea var tronchuda ,. PhD Thesis, University of Algarve, Portugal.

Alcázar, R. & Parker, J. E. 2011. The impact of temperature on balancing immune responsiveness

and growth in Arabidopsis. Trends in Plant Science, 16, 666-675.

Altschul, S. F., Madden, T. L., Schäffer, A. A., Zhang, J., Zhang, Z., Miller, W. & Lipman, D. J.

1997. Gapped BLAST and PSI-BLAST: a new generation of protein database search

programs. Nucleic Acids Research, 25, 3389-3402.

Ameline-Torregrosa, C., Wang, B.-B., O'Bleness, M. S., Deshpande, S., Zhu, H., Roe, B., Young,

N. D. & Cannon, S. B. 2008. Identification and Characterization of Nucleotide-Binding

Site-Leucine-Rich Repeat Genes in the Model Plant Medicago truncatula. Plant Physiology,

146, 5-21.

Ananga, A. O., Cebert, E., Soliman, K., Kantety, R., Pacumbaba, R. P. & Konan, K. 2006. RAPD

markers associated with resistance to blackleg disease in Brassica species. African Journal

of Biotechnology, 5, 2041-2048.

Ansan-Melayah, D., Balesdent, M. H., Delourme, R., Pilet, M. L., Tanguy, X., Renard, M. &

Rouxel, T. 1998. Genes for race-specific resistance against blackleg disease in Brassica

napus L. Plant Breeding, 117, 373-378.

AOF 2014. Australian Oilseeds Federation, Crop Reports. available at

http://www.australianoilseeds.com/oilseeds_industry/crop_report_assets. Australian

Oilseeds Federation.

Appleby, N., Edwards, D. & Batley, J. 2009. New Technologies for Ultra-High Throughput

Genotyping in Plants. In: GUSTAFSON, J. P., LANGRIDGE, P. & SOMERS, D. J. (eds.)

Plant Genomics. Humana Press.

Ashfield, T., Egan, A. N., Pfeil, B. E., Chen, N. W. G., Podicheti, R., Ratnaparkhe, M. B., Ameline-

Torregrosa, C., Denny, R., Cannon, S., Doyle, J. J., Geffroy, V., Roe, B. A., Saghai Maroof,

M. A., Young, N. D. & Innes, R. W. 2012. Evolution of a Complex Disease Resistance

Gene Cluster in Diploid Phaseolus and Tetraploid Glycine. Plant Physiology, 159, 336-354.

Bai, J., Pennill, L. A., Ning, J., Lee, S. W., Ramalingam, J., Webb, C. A., Zhao, B., Sun, Q.,

Nelson, J. C., Leach, J. E. & Hulbert, S. H. 2002. Diversity in Nucleotide Binding Site–

Leucine-Rich Repeat Genes in Cereals. Genome Research, 12, 1871-1884.

Bailey, T. L., Boden, M., Buske, F. A., Frith, M., Grant, C. E., Clementi, L., Ren, J., Li, W. W. &

Noble, W. S. 2009. MEME Suite: tools for motif discovery and searching. Nucleic Acids

Research, 37, W202-W208.

Bakker, E., Borm, T., Prins, P., van der Vossen, E., Uenk, G., Arens, M., de Boer, J., van Eck, H.,

Muskens, M., Vossen, J., van der Linden, G., van Ham, R., Klein-Lankhorst, R., Visser, R.,

Smant, G., Bakker, J. & Goverse, A. 2011. A genome-wide genetic map of NB-LRR disease

resistance loci in potato. Theoretical and Applied Genetics, 123, 493-508.

Balesdent, M.-H., Fudal, I., Ollivier, B., Bally, P., Grandaubert, J., Eber, F., Chèvre, A.-M., Leflon,

M. & Rouxel, T. 2013. The dispensable chromosome of Leptosphaeria maculans shelters an

effector gene conferring avirulence towards Brassica rapa. New Phytologist, 198, 887-898.

Balesdent, M. H., Attard, A., Ansan-Melayah, D., Delourme, R., Renard, M. & Rouxel, T. 2001.

Genetic Control and Host Range of Avirulence Toward Brassica napus Cultivars Quinta and

Jet Neuf in Leptosphaeria maculans. Phytopathology, 91, 70-76.

Balesdent, M. H., Attard, A., Kühn, M. L. & Rouxel, T. 2002. New Avirulence Genes in the

Phytopathogenic Fungus Leptosphaeria maculans. Phytopathology, 92, 1122-1133.

Bancroft, I. 2001. Duplicate and diverge: the evolution of plant genome microstructure. Trends in

Genetics, 17, 89-93.

Batley, J. & Edwards, D. 2009. Genome sequence data: management, storage, and visualization.

BioTechniques, 46, 333–336.

161

Baumgarten, A., Cannon, S., Spangler, R. & May, G. 2003. Genome-Level Evolution of Resistance

Genes in Arabidopsis thaliana. Genetics, 165, 309-319.

Bendahmane, A., Querci, M., Kanyuka, K. & Baulcombe, D. C. 2000. Agrobacterium transient

expression system as a tool for the isolation of disease resistance genes: application to the

Rx2 locus in potato. The Plant Journal, 21, 73-81.

Bent, A. F. 1996. Plant Disease Resistance Genes: Function Meets Structure. The Plant Cell Online,

8, 1757-1771.

Bevan, M., Bancroft, I., Bent, E., Love, K., Goodman, H., Dean, C., Bergkamp, R., Dirkse, W., Van

Staveren, M., Stiekema, W., Drost, L., Ridley, P., Hudson, S. A., Patel, K., Murphy, G.,

Piffanelli, P., Wedler, H., Wedler, E., Wambutt, R., Weitzenegger, T., Pohl, T. M., Terryn,

N., Gielen, J., Villarroel, R., De Clerck, R., Van Montagu, M., Lecharny, A., Auborg, S.,

Gy, I., Kreis, M., Lao, N., Kavanagh, T., Hempel, S., Kotter, P., Entian, K. D., Rieger, M.,

Schaeffer, M., Funk, B., Mueller-Auer, S., Silvey, M., James, R., Montfort, A., Pons, A.,

Puigdomenech, P., Douka, A., Voukelatou, E., Milioni, D., Hatzopoulos, P., Piravandi, E.,

Obermaier, B., Hilbert, H., Dusterhoft, A., Moores, T., Jones, J. D. G., Eneva, T., Palme, K.,

Benes, V., Rechman, S., Ansorge, W., Cooke, R., Berger, C., Delseny, M., Voet, M.,

Volckaert, G., Mewes, H. W., Klosterman, S., Schueller, C. & Chalwatzis, N. 1998.

Analysis of 1.9 Mb of contiguous sequence from chromosome 4 of Arabidopsis thaliana.

Nature, 391, 485-488.

Blanc, G. & Wolfe, K. H. 2004. Functional Divergence of Duplicated Genes Formed by Polyploidy

during Arabidopsis Evolution. The Plant Cell, 16, 1679-1691.

Brun, H., Chèvre, A. M., Fitt, B. D., Powers, S., Besnard, A. L., Ermel, M., Huteau, V., Marquer,

B., Eber, F., Renard, M. & Andrivon, D. 2010. Quantitative resistance increases the

durability of qualitative resistance to Leptosphaeria maculans in Brassica napus. New

Phytologist, 185, 285-299.

Cai, G., Yang, Q., Yang, Q., Zhao, Z., Chen, H., Wu, J., Fan, C. & Zhou, Y. 2012. Identification of

candidate genes of QTLs for seed weight in Brassica napus through comparative mapping

among Arabidopsis and Brassica species. Bmc Genetics, 13, 105.

Calenge, F., Van der Linden, C., Van de Weg, E., Schouten, H., Van Arkel, G., Denance, C. &

Durel, C. 2005. Resistance gene analogues identified through the NBS-profiling method

map close to major genes and QTL for disease resistance in apple. Theoretical and Applied

Genetics, 110, 660 - 668.

Cannon, S., Mitra, A., Baumgarten, A., Young, N. & May, G. 2004. The roles of segmental and

tandem gene duplication in the evolution of large gene families in Arabidopsis thaliana.

BMC Plant Biology, 4, 10.

Cannon, S. B., Zhu, H. Y., Baumgarten, A. M., Spangler, R., May, G., Cook, D. R. & Young, N. D.

2002. Diversity, distribution, and ancient taxonomic relationships within the TIR and non-

TIR NBS-LRR resistance gene subfamilies. Journal of Molecular Evolution, 54, 548-562.

Carlier, J. D., Alabaça, C. S., Sousa, N. H., Coelho, P. S., Monteiro, A. A., Paterson, A. H. &

Leitão, J. M. 2011. Physical Mapping in a Triplicated Genome: Mapping the Downy

Mildew Resistance Locus Pp523 in Brassica oleracea L. G3: Genes, Genomes, Genetics, 1,

593-601.

Carlier, J. D., Alabaça, C. A., Coelho, P. S., Monteiro, A. A. & Leitão, J. M. 2012. The downy

mildew resistance locus Pp523 is located on chromosome C8 of Brassica oleracea L. Plant

Breeding, 131, 170-175.

Carretero-Paulet, L. & Fares, M. A. 2012. Evolutionary Dynamics and Functional Specialization of

Plant Paralogs Formed by Whole and Small-Scale Genome Duplications. Molecular Biology

and Evolution, 29, 3541-3551.

Chagné, D., Batley, J., Edwards, D. & Forster, J. 2007. Single Nucleotide Polymorphism

Genotyping in Plants. In: ORAGUZIE, N., RIKKERINK, E. A., GARDINER, S. & SILVA,

H. N. (eds.) Association Mapping in Plants. Springer New York.

162

Chalhoub, B., Denoeud, F., Liu, S., Parkin, I. A. P., Tang, H., Wang, X., Chiquet, J., Belcram, H.,

Tong, C., Samans, B., Corréa, M., Da Silva, C., Just, J., Falentin, C., Koh, C. S., Le

Clainche, I., Bernard, M., Bento, P., Noel, B., Labadie, K., Alberti, A., Charles, M., Arnaud,

D., Guo, H., Daviaud, C., Alamery, S., Jabbari, K., Zhao, M., Edger, P. P., Chelaifa, H.,

Tack, D., Lassalle, G., Mestiri, I., Schnel, N., Le Paslier, M.-C., Fan, G., Renault, V., Bayer,

P. E., Golicz, A. A., Manoli, S., Lee, T.-H., Thi, V. H. D., Chalabi, S., Hu, Q., Fan, C.,

Tollenaere, R., Lu, Y., Battail, C., Shen, J., Sidebottom, C. H. D., Wang, X., Canaguier, A.,

Chauveau, A., Bérard, A., Deniot, G., Guan, M., Liu, Z., Sun, F., Lim, Y. P., Lyons, E.,

Town, C. D., Bancroft, I., Wang, X., Meng, J., Ma, J., Pires, J. C., King, G. J., Brunel, D.,

Delourme, R., Renard, M., Aury, J.-M., Adams, K. L., Batley, J., Snowdon, R. J., Tost, J.,

Edwards, D., Zhou, Y., Hua, W., Sharpe, A. G., Paterson, A. H., Guan, C. & Wincker, P.

2014. Early allopolyploid evolution in the post-Neolithic Brassica napus oilseed genome.

Science, 345, 950-953.

Cheng, F., Wu, J., Fang, L. & Wang, X. 2012a. Syntenic gene analysis between Brassica rapa and

other Brassicaceae species. Frontiers in Plant Science, 3.

Cheng, Y., Li, X., Jiang, H., Ma, W., Miao, W., Yamada, T. & Zhang, M. 2012b. Systematic

analysis and comparison of nucleotide-binding site disease resistance genes in maize. FEBS

Journal, 279, 2431-2443.

Cheung, F., Trick, M., Drou, N., Lim, Y. P., Park, J.-Y., Kwon, S.-J., Kim, J.-A., Scott, R., Pires, J.

C., Paterson, A. H., Town, C. & Bancroft, I. 2009. Comparative Analysis between

Homoeologous Genome Segments of Brassica napus and Its Progenitor Species Reveals

Extensive Sequence-Level Divergence. The Plant Cell, 21, 1912-1928.

Chèvre, A. M., Eber, F., This, P., Barret, P., Tanguy, X., Brun, H., Delseny, M. & Renard, M. 1996.

Characterization of Brassica nigra chromosomes and of blackleg resistance in B. napus-B.

nigra addition lines. Plant Breeding, 115, 113-118.

Chèvre, A. M., Barret, P., Eber, F., Dupuy, P., Brun, H., Tanguy, X. & Renard, M. 1997. Selection

of stable Brassica napus-B. juncea recombinant lines resistant to blackleg (Leptosphaeria

maculans). 1. Identification of molecular markers, chromosomal and genomic origin of the

introgression. Theoretical and Applied Genetics, 95, 1104-1111.

Chèvre, A. M., Brun, H., Eber, F., Letanneur, J. C., Vallee, P., Ermel, M., Glais, I., Li, H.,

Sivasithamparam, K. & Barbetti, M. J. 2008. Stabilization of resistance to Leptosphaeria

maculans in Brassica napus-B. juncea recombinant lines and its introgression into spring-

type Brassica napus. Plant Disease, 92, 1208-1214.

Cho, K., O’Neill, C. M., Kwon, S.-J., Yang, T.-J., Smooker, A. M., Fraser, F. & Bancroft, I. 2010.

Sequence-level comparative analysis of the Brassica napus genome around two stearoyl-

ACP desaturase loci. The Plant Journal, 61, 591-599.

Collier, S. M. & Moffett, P. 2009. NB-LRRs work a “bait and switch” on pathogens. Trends in

Plant Science, 14, 521-529.

Cook, D. E., Lee, T. G., Guo, X., Melito, S., Wang, K., Bayless, A. M., Wang, J., Hughes, T. J.,

Willis, D. K., Clemente, T. E., Diers, B. W., Jiang, J., Hudson, M. E. & Bent, A. F. 2012.

Copy Number Variation of Multiple Genes at Rhg1 Mediates Nematode Resistance in

Soybean. Science, 338, 1206-1209.

Crouch, J. H., Lewis, B. G. & Mithen, R. F. 1994. The Effect of A Genome Substitution on the

Resistance of Brassica napus to Infection by Leptosphaeria maculans. Plant Breeding, 112,

265-278.

Dalton-Morgan, J., Hayward, A., Alamery, S., Tollenaere, R., Mason, A., Campbell, E., Patel, D.,

Lorenc, M., Yi, B., Long, Y., Meng, J., Raman, R., Raman, H., Lawley, C., Edwards, D. &

Batley, J. 2014. A high-throughput SNP array in the amphidiploid species Brassica napus

shows diversity in resistance genes. Functional & Integrative Genomics, 1-13.

Dangl, J. L. & Jones, J. D. G. 2001. Plant pathogens and integrated defence responses to infection.

Nature, 411, 826-833.

163

Delourme, R., Pilet-Nayel, M. L., Archipiano, M., Horvais, R., Tanguy, X., Rouxel, T., Brun, H.,

Renard, M. & Balesdent, M. H. 2004. A cluster of major specific resistance genes to

Leptosphaeria maculans in Brassica napus. Phytopathology, 94, 578-583.

Delourme, R., Chèvre, A. M., Brun, H., Rouxel, T., Balesdent, M. H., Dias, J. S., Salisbury, P.,

Renard, M. & Rimmer, S. R. 2006. Major gene and polygenic resistance to Leptosphaeria

maculans in oilseed rape (Brassica napus). European Journal of Plant Pathology, 114, 41-

52.

Delourme, R., Brun, H., Ermel, M., Lucas, M. O., Vallee, P., Domin, C., Walton, G., Li, H.,

Sivasithamparam, K. & Barbetti, M. J. 2008. Expression of resistance to Leptosphaeria

maculans in Brassica napus double haploid lines in France and Australia is influenced by

location. Annals of Applied Biology, 153, 259-269.

Delourme, R., Barbetti, M. J., Snowdon, R., Zhao, J. & Manzanares-Dauleux, M. J. 2011. Genetics

and Genomics of Disease Resistance. In: KOLE, C. (ed.) Genetics, Genomics and Breeding

of Oilseed Brassicas. USA: Science Publishers

Diederichsen, E., Frauen, M., Linders, E. A., Hatakeyama, K. & Hirai, M. 2009. Status and

Perspectives of Clubroot Resistance Breeding in Crucifer Crops. Journal of Plant Growth

Regulation, 28, 265-281.

Dixelius, C. 1999. Inheritance of the resistance to Leptosphaeria maculans of Brassica nigra and B.

juncea in near-isogenic lines of B. napus. Plant Breeding, 118, 151-156.

Dodds, P. N., Lawrence, G. J., Catanzariti, A.-M., Teh, T., Wang, C.-I. A., Ayliffe, M. A., Kobe, B.

& Ellis, J. G. 2006. Direct protein interaction underlies gene-for-gene specificity and

coevolution of the flax resistance genes and flax rust avirulence genes. Proceedings of the

National Academy of Sciences, 103, 8888-8893.

Dodds, P. N. & Rathjen, J. P. 2010. Plant immunity: towards an integrated view of plant–pathogen

interactions. Nature Reviews Genetics, 11, 539-548.

Duran, C., Eales, D., Marshall, D., Imelfort, M., Stiller, J., Berkman, P. J., Clark, T., McKenzie, M.,

Appleby, N., Batley, J., Basford, K. & Edwards, D. 2010. Future tools for association

mapping in crop plants. Genome, 53, 1017-1023.

Durstewitz, G., Polley, A., Plieske, J., Luerssen, H., Graner, E. M., Wieseke, R. & Ganal, M. W.

2010. SNP discovery by amplicon sequencing and multiplex SNP genotyping in the

allopolyploid species Brassica napus. Genome, 53, 948-956.

Edwards, D. & Batley, J. 2010. Plant genome sequencing: applications for crop improvement. Plant

Biotechnology Journal, 8, 2-9.

Ellis, J., Dodds, P. & Pryor, T. 2000a. The generation of plant disease resistance gene specificities.

Trends in Plant Science, 5, 373-379.

Ellis, J., Dodds, P. & Pryor, T. 2000b. Structure, function and evolution of plant disease resistance

genes. Current Opinion in Plant Biology, 3, 278-284.

Ellis, J. G., Lawrence, G. J., Luck, J. E. & Dodds, P. N. 1999. Identification of regions in alleles of

the flax rust resistance gene L that determine differences in gene-for-gene specificity. The

Plant Cell, 11, 495-506.

Farinhó, M., Coelho, P., Carlier, J., Svetleva, D., Monteiro, A. & Leitão, J. 2004. Mapping of a

locus for adult plant resistance to downy mildew in broccoli (Brassica oleracea convar.

italica). Theoretical and Applied Genetics, 109, 1392-1398.

Farinhó, M., Coelho, P., Monteiro, A. & Leitão, J. 2007. SCAR and CAPS markers flanking the

Brassica oleracea L. Pp523 downy mildew resistance locus demarcate a genomic region

syntenic to the top arm end of Arabidopsis thaliana L. chromosome 1. Euphytica, 157, 215-

221.

Fernando, W. G. D., Chen, Y. & Ghanbarnia, K. 2007. Breeding for Blackleg Resistance: The

Biology and Epidemiology. In: SURINDER KUMAR GUPTA, M. D. & KADER, J. C.

(eds.) Advances in Botanical Research. Academic Press.

164

Ferreira, M. E., Rimmer, S. R., Williams, P. H. & Osborn, T. C. 1995a. Mapping loci controlling

Brassica napus resistance to Leptosphaeria maculans under different screening conditions.

Phytopathology, 85, 213-217.

Ferreira, M. E., Williams, P. H. & Osborn, T. C. 1995b. Mapping of a locus controlling resistance

to Albugo candida in Brassica napus using molecular markers. Phytopathology, 85, 218-

220.

Figdore, S. S., Ferreira, M. E., Slocum, M. K. & Williams, P. H. 1993. Association of RFLP

markers with trait loci affecting clubroot resistance and morphological characters in

Brassica oleracea L. Euphytica, 69, 33-44.

Fitt, B., Brun, H., Barbetti, M. & Rimmer, S. 2006. World-Wide Importance of Phoma Stem

Canker ( Leptosphaeria maculans and L. biglobosa ) on Oilseed Rape (Brassica napus).

European Journal of Plant Pathology, 114, 3-15.

Fopa Fomeju, B., Falentin, C., Lassalle, G., Manzanares-Dauleux, M. & Delourme, R. 2014.

Homoeologous duplicated regions are involved in quantitative resistance of Brassica napus

to stem canker. BMC Genomics, 15, 498.

Fourmann, M., Charlot, F., Froger, N., Delourme, R. & Brunel, D. 2001. Expression, mapping, and

genetic variability of Brassica napus disease resistance gene analogues. Genome, 44, 1083-

1099.

Friedman, A. & Baker, B. 2007. The evolution of resistance genes in multi-protein plant resistance

systems. Current Opinion in Genetics & Development, 17, 493 - 499.

Fritz-Laylin, L. K., Krishnamurthy, N., Tör, M., Sjölander, K. V. & Jones, J. D. 2005.

Phylogenomic analysis of the receptor-like proteins of rice and Arabidopsis. Plant

Physiology, 138, 611-623.

Fudal, I., Ross, S., Gout, L., Blaise, F., Kuhn, M. L., Eckert, M. R., Cattolico, L., Bernard-Samain,

S., Balesdent, M. H. & Rouxel, T. 2007. Heterochromatin-Like Regions as Ecological

Niches for Avirulence Genes in the Leptosphaeria maculans Genome: Map-Based Cloning

of AvrLm6. Molecular Plant-Microbe Interactions, 20, 459-470.

Ge, X. T., Li, H., Han, S., Sivasithamparam, K. & Barbetti, M. J. 2008. Evaluation of Australian

Brassica napus genotypes for resistance to the downy mildew pathogen, Hyaloperonospora

parasitica. Australian Journal of Agricultural Research, 59, 1030-1034.

Gebhardt, C. & Valkonen, J. P. 2001. Organization of genes controlling disease resistance in the

potato genome. Annual review of phytopathology, 39, 79-102.

Geffroy, V., Sévignac, M., De Oliveira, J. C. F., Fouilloux, G., Skroch, P., Thoquet, P., Gepts, P.,

Langin, T. & Dron, M. 2000. Inheritance of Partial Resistance Against Colletotrichum

lindemuthianum in Phaseolus vulgaris and Co-localization of Quantitative Trait Loci with

Genes Involved in Specific Resistance. Molecular Plant-Microbe Interactions, 13, 287-296.

Ghanbarnia, K., Fudal, I., Larkan, N. J., Links, M. G., Balesdent, M.-H., Profotova, B., Fernando,

W. G. D., Rouxel, T. & Borhan, M. H. 2015. Rapid identification of the Leptosphaeria

maculans avirulence gene AvrLm2 using an intraspecific comparative genomics approach.

Molecular Plant Pathology.

Giovannelli, J. L., Farnham, M. W., Wang, M. & Strand, A. E. 2002. Development of Sequence

Characterized Amplified Region Markers Linked to Downy Mildew Resistance in Broccoli.

Journal of the American Society for Horticultural Science, 127, 597-601.

Goff, S. A., Ricke, D., Lan, T.-H., Presting, G., Wang, R., Dunn, M., Glazebrook, J., Sessions, A.,

Oeller, P., Varma, H., Hadley, D., Hutchison, D., Martin, C., Katagiri, F., Lange, B. M.,

Moughamer, T., Xia, Y., Budworth, P., Zhong, J., Miguel, T., Paszkowski, U., Zhang, S.,

Colbert, M., Sun, W.-l., Chen, L., Cooper, B., Park, S., Wood, T. C., Mao, L., Quail, P.,

Wing, R., Dean, R., Yu, Y., Zharkikh, A., Shen, R., Sahasrabudhe, S., Thomas, A.,

Cannings, R., Gutin, A., Pruss, D., Reid, J., Tavtigian, S., Mitchell, J., Eldredge, G., Scholl,

T., Miller, R. M., Bhatnagar, S., Adey, N., Rubano, T., Tusneem, N., Robinson, R.,

Feldhaus, J., Macalma, T., Oliphant, A. & Briggs, S. 2002. A Draft Sequence of the Rice

Genome (Oryza sativa L. ssp. japonica). Science, 296, 92-100.

165

Gout, L., Fudal, I., Kuhn, M.-L., Blaise, F., Eckert, M., Cattolico, L., Balesdent, M.-H. & Rouxel,

T. 2006. Lost in the middle of nowhere: the AvrLm1 avirulence gene of the Dothideomycete

Leptosphaeria maculans. Molecular Microbiology, 60, 67-80.

Graham, M. A., Marek, L. F., Lohnes, D., Cregan, P. & Shoemaker, R. C. 2000. Expression and

genome organization of resistance gene analogs in soybean. Genome, 43, 86-93.

Grandclément, C. & Thomas, G. 1996. Detection and analysis of QTLs based on RAPD markers for

polygenic resistance to Plasmodiophora brassicae Woron in Brassica oleracea L. Theoretical

and Applied Genetics, 93, 86-90.

Grant, D., Cregan, P. & Shoemaker, R. C. 2000. Genome organization in dicots: Genome

duplication in Arabidopsis and synteny between soybean and Arabidopsis. Proceedings of

the National Academy of Sciences, 97, 4168-4173.

Grant, M. R., McDowell, J. M., Sharpe, A. G., De Torres Zabala, M., Lydiate, D. J. & Dangl, J. L.

1998. Independent deletions of a pathogen-resistance gene in Brassica and Arabidopsis.

Proceedings of the National Academy of Sciences of the United States of America, 95,

15843-15848.

Gu, L., Si, W., Zhao, L., Yang, S. & Zhang, X. 2014. Dynamic evolution of NBS–LRR genes in

bread wheat and its progenitors. Molecular Genetics and Genomics, 1-12.

Gunderson, K., Kuhn, K., Steemers, F., Ng, P., Murray, S. & Shen, R. 2006. Whole-genome

genotyping of haplotype tag single nucleotide polymorphisms. Pharmacogenomics, 7, 641 -

648.

Guo, X. W. & Fernando, W. G. D. 2005. Seasonal and diurnal patterns of spore dispersal by

Leptosphaeria maculans from canola stubble in relation to environmental conditions. Plant

Disease, 89, 97-104.

Guo, Y., Fitz, J., Schneeberger, K., Ossowski, S., Cao, J. & Weigel, D. 2011. Genome-wide

comparison of nucleotide-binding site-leucine-rich repeat-encoding genes in Arabidopsis.

Plant Physiology, 157, 757 - 769.

Hanada, K., Kuromori, T., Myouga, F., Toyoda, T. & Shinozaki, K. 2009. Increased Expression and

Protein Divergence in Duplicate Genes Is Associated with Morphological Diversification.

PLoS Genetics, 5, e1000781.

Hatakeyama, K., Suwabe, K., Tomita, R. N., Kato, T., Nunome, T., Fukuoka, H. & Matsumoto, S.

2013. Identification and Characterization of Crr1a, a Gene for Resistance to Clubroot

Disease (Plasmodiophora brassicae Woronin) in Brassica rapa. PLoS ONE, 8, e54745.

Hayashida, N., Takabatake, Y., Nakazawa, N., Aruga, D., Nakanishi, H., Taguchi, G., Sakamoto, K.

& Matsumoto, E. 2008. Construction of a Practical SCAR Marker Linked to Clubroot

Resistance in Chinese Cabbage, with Intensive Analysis of HC352b Genes. Journal of the

Japanese Society for Horticultural Science, 77, 150-154.

Hayward, A., McLanders, J., Campbell, E., Edwards, D. & Batley, J. 2011. Genomic advances will

herald new insights into the Brassica: Leptosphaeria maculans pathosystem. Plant Biology,

14, 1-10.

Heffner, E. L., Sorrells, M. E. & Jannink, J.-L. 2009. Genomic Selection for Crop Improvement.

Crop Science, 49, 1-12.

Hirai, M., Harada, T., Kubo, N., Tsukada, M., Suwabe, K. & Matsumoto, S. 2004. A novel locus

for clubroot resistance in Brassica rapa and its linkage markers. Theoretical and Applied

Genetics, 108, 639-643.

Hirai, M. 2006. Genetic analysis of clubroot resistance in Brassica crops. Breeding science, 56, 223-

229.

Holub, E. B. 2001. The arms race is ancient history in Arabidopsis, the wildflower. Nature Reviews

Genetics, 2, 516-527.

Hong, C. P., Kwon, S. J., Kim, J. S., Yang, T. J., Park, B. S. & Lim, Y. P. 2008. Progress in

understanding and sequencing the genome of brassica rapa. International Journal of Plant

Genomics, 2008.

166

Howlett, B. J. 2004. Current knowledge of the interaction between Brassica napus and

Leptosphaeria maculans. Canadian Journal of Plant Pathology, 26, 245-252.

Hu, Z., Huang, S., Sun, M., Wang, H. & Hua, W. 2012. Development and application of single

nucleotide polymorphism markers in the polyploid Brassica napus by 454 sequencing of

expressed sequence tags. Plant Breeding, 131, 293-299.

Hulbert, S., Webb, C., Smith, S. & Sun, Q. 2001. Resistance gene complexes: evolution and

utilization. Annual review of phytopathology, 39, 285 - 312.

Imelfort, M., Duran, C., Batley, J. & Edwards, D. 2009. Discovering genetic polymorphisms in

next-generation sequencing data. Plant Biotechnology Journal, 7, 312-317.

Imelfort, M. & Edwards, D. 2009. De novo sequencing of plant genomes using second-generation

technologies. Briefings in Bioinformatics, 10, 609-618.

Jamann, T. M., Poland, J. A., Kolkman, J. M., Smith, L. G. & Nelson, R. J. 2014. Unraveling

Genomic Complexity at a Quantitative Disease Resistance Locus in Maize. Genetics, 198,

333-344.

Jander, G., Norris, S. R., Rounsley, S. D., Bush, D. F., Levin, I. M. & Last, R. L. 2002. Arabidopsis

Map-Based Cloning in the Post-Genome Era. Plant Physiology, 129, 440-450.

Jeong, S. C., Hayes, A. J., Biyashev, R. M. & Maroof, M. A. S. 2001. Diversity and evolution of a

non-TIR-NBS sequence family that clusters to a chromosomal ’’hotspot” for disease

resistance genes in soybean. Theoretical and Applied Genetics, 103, 406-414.

Jiang, C., Ramchiary, N., Ma, Y., Jin, M., Feng, J., Li, R., Wang, H., Long, Y., Choi, S., Zhang, C.,

Cowling, W., Park, B., Lim, Y. & Meng, J. 2011. Structural and functional comparative

mapping between the Brassica A genomes in allotetraploid Brassica napus and diploid

Brassica rapa. Theoretical and Applied Genetics, 123, 927-941.

Johnson, R. D. & Lewis, B. G. 1994. Variation in host range, systemic infection and epidemiology

of Leptosphaeria maculans. Plant Pathology, 43, 269-277.

Joshi, R. & Nayak, S. 2011. Functional characterization and signal transduction ability of

nucleotide-binding site-leucine-rich repeat resistance genes in plants. Genetics and

Molecular Research, 10, 2637-2652.

Jupe, F., Pritchard, L., Etherington, G., MacKenzie, K., Cock, P., Wright, F., Sharma, S. K., Bolser,

D., Bryan, G., Jones, J. & Hein, I. 2012. Identification and localisation of the NB-LRR gene

family within the potato genome. BMC Genomics, 13, 75.

Jupe, F., Witek, K., Verweij, W., Śliwka, J., Pritchard, L., Etherington, G. J., Maclean, D., Cock, P.

J., Leggett, R. M. & Bryan, G. J. 2013. Resistance gene enrichment sequencing (RenSeq)

enables reannotation of the NB‐LRR gene family from sequenced plant genomes and rapid

mapping of resistance loci in segregating populations. The Plant Journal, 76, 530-544.

Kale, S. M., Pardeshi, V. C., Barvkar, V. T., Gupta, V. S. & Kadoo, N. Y. 2012. Genome-wide

identification and characterization of nucleotide binding site leucine-rich repeat genes in

linseed reveal distinct patterns of gene structure. Genome, 56, 91-99.

Kang, Y., Kim, K., Shim, S., Yoon, M., Sun, S., Kim, M., Van, K. & Lee, S.-H. 2012. Genome-

wide mapping of NBS-LRR genes and their association with disease resistance in soybean.

BMC Plant Biology, 12, 139.

Kato, T., Hatakeyama, K., Fukino, N. & Matsumoto, S. 2012. Identificaiton of a clubroot resistance

locus conferring resistance to a Plasmodiophora brassicae classified into pathotype group 3

in Chinese cabbage (Brassica rapa L.). Breeding science, 62, 282-287.

Kato, T., Hatakeyama, K., Fukino, N. & Matsumoto, S. 2013. Fine mapping of the clubroot

resistance gene CRb and development of a useful selectable marker in Brassica rapa.

Breeding science, 63, 116-124.

Kaur, S., Cogan, N. O. I., Ye, G., Baillie, R. C., Hand, M. L., Ling, A. E., McGearey, A. K., Kaur,

J., Hopkins, C. J., Todorovic, M., Mountford, H., Edwards, D., Batley, J., Burton, W.,

Salisbury, P., Gororo, N., Marcroft, S., Kearney, G., Smith, K. F., Forster, J. W. &

Spangenberg, G. C. 2009. Genetic map construction and QTL mapping of resistance to

167

blackleg (Leptosphaeria maculans) disease in Australian canola (Brassica napus L.)

cultivars. Theoretical and Applied Genetics, 120, 71-83.

Kaur, S., Francki, M. G. & Forster, J. W. 2012. Identification, characterization and interpretation of

single-nucleotide sequence variation in allopolyploid crop species. Plant Biotechnology

Journal, 10, 125-138.

Kearse, M., Moir, R., Wilson, A., Stones-Havas, S., Cheung, M., Sturrock, S., Buxton, S., Cooper,

A., Markowitz, S., Duran, C., Thierer, T., Ashton, B., Meintjes, P. & Drummond, A. 2012.

Geneious Basic: An integrated and extendable desktop software platform for the

organization and analysis of sequence data. Bioinformatics, 28, 1647-1649.

Kemmerling, B., Halter, T., Mazzotta, S., Mosher, S. & Nürnberger, T. 2011. A genome-wide

survey for Arabidopsis leucine-rich repeat receptor kinases implicated in plant immunity.

Frontiers in Plant Science, 2.

Keri, M., Van Den Berg, C. G. J., McVetty, P. B. E. & Rimmer, S. R. 1997. Inheritance of

resistance to Leptosphaeria maculans in Brassica juncea. Phytopathology, 87, 594-598.

Kim, K.-S., Hill, C., Hartman, G., Hyten, D., Hudson, M. & Diers, B. 2010. Fine mapping of the

soybean aphid-resistance gene Rag2 in soybean PI 200538. Theoretical and Applied

Genetics, 121, 599-610.

Kim, S., Song, Y., Lee, J.-Y., Choi, S., Dhandapani, V., Jang, C., Lim, Y. & Han, T. 2011.

Identification of the BrRHP1 locus that confers resistance to downy mildew in Chinese

cabbage (Brassica rapa ssp. pekinensis) and development of linked molecular markers.

Theoretical and Applied Genetics, 123, 1183-1192.

Kohler, A., Rinaldi, C., Duplessis, S., Baucher, M., Geelen, D., Duchaussoy, F., Meyers, B.,

Boerjan, W. & Martin, F. 2008. Genome-wide identification of NBS resistance genes in

Populus trichocarpa. Plant Molecular Biology, 66, 619-636.

Kruijt, M., De Kock, M. J. D. & De Wit, P. J. G. M. 2005. Receptor-like proteins involved in plant

disease resistance. Molecular Plant Pathology, 6, 85-97.

Lagercrantz, U. 1998. Comparative mapping between Arabidopsis thaliana and Brassica nigra

indicates that Brassica genomes have evolved through extensive genome replication

accompanied by chromosome fusions and frequent rearrangements. Genetics, 150, 1217-

1228.

Landry, B. S., Hubert, N., Crete, R., Chang, M. S., Lincoln, S. E. & Etoh, T. 1992. A genetic map

for Brassica oleracea based on RFLP markers detected with expressed DNA sequences and

mapping of resistance genes to race 2 of Plasmodiophora brassicae (Woronin). Genome, 35,

409-420.

Larkan, N. J., Lydiate, D. J., Parkin, I. A. P., Nelson, M. N., Epp, D. J., Cowling, W. A., Rimmer,

S. R. & Borhan, M. H. 2013. The Brassica napus blackleg resistance gene LepR3 encodes a

receptor-like protein triggered by the Leptosphaeria maculans effector AVRLM1. New

Phytologist, 197, 595-605.

Larkan, N. J., Lydiate, D. J., Yu, F., Rimmer, S. R. & Borhan, M. H. 2014. Co-localisation of the

blackleg resistance genes Rlm2 and LepR3 on Brassica napus chromosome A10. BMC Plant

Biology, 14.

Lawton-Rauh, A. 2003. Evolutionary dynamics of duplicated genes in plants. Molecular

Phylogenetics and Evolution, 29, 396-409.

Leflon, M., Brun, H., Eber, F., Delourme, R., Lucas, M. O., Vallée, P., Ermel, M., Balesdent, M. H.

& Chèvre, A. M. 2007. Detection, introgression and localization of genes conferring specific

resistance to Leptosphaeria maculans from Brassica rapa into B. napus. Theoretical and

Applied Genetics, 115, 897-906.

Leister, D. 2004. Tandem and segmental gene duplication and recombination in the evolution of

plant disease resistance genes. Trends in Genetics, 20, 116-122.

Lestari, P., Van, K., Lee, J. E., Kang, Y. J. & Lee, S.-H. 2013. Gene divergence of homeologous

regions associated with a major seed protein content QTL in soybean. Frontiers in Plant

Science, 4.

168

Li, C. X. & Cowling, W. A. 2003. Identification of a single dominant allele for resistance to

blackleg in Brassica napus 'Surpass 400'. Plant Breeding, 122, 485-488.

Li, F., Kitashiba, H., Inaba, K. & Nishio, T. 2009. A Brassica rapa Linkage Map of EST-based SNP

Markers for Identification of Candidate Genes Controlling Flowering Time and Leaf

Morphological Traits. DNA Research, 16, 311-323.

Li, H., Yu, S.-C., ZHANG, F.-L., Yu, Y.-J., ZHAO, X.-Y., ZHANG, D.-S. & ZHAO, X. 2011.

Development of molecular markers linked to the resistant QTL for downy mildew in

Brassica rapa L. ssp. pekinensis [ in Chinese]. Hereditas, 33, 1271-1278

Li, J., Ding, J., Zhang, W., Zhang, Y., Tang, P., Chen, J.-Q., Tian, D. & Yang, S. 2010. Unique

evolutionary pattern of numbers of gramineous NBS–LRR genes. Molecular Genetics and

Genomics, 283, 427-438.

Li, X., Ramchiary, N., Dhandapani, V., Choi, S. R., Hur, Y., Nou, I.-S., Yoon, M. K. & Lim, Y. P.

2013. Quantitative Trait Loci Mapping in Brassica rapa Revealed the Structural and

Functional Conservation of Genetic Loci Governing Morphological and Yield Component

Traits in the A, B, and C Subgenomes of Brassica Species. DNA Research, 20, 1-16.

Liu, J., Liu, X., Dai, L. & Wang, G. 2007. Recent Progress in Elucidating the Structure, Function

and Evolution of Disease Resistance Genes in Plants. Journal of Genetics and Genomics,

34, 765-776.

Liu, S., Liu, Y., Yang, X., Tong, C., Edwards, D., Parkin, I. A. P., Zhao, M., Ma, J., Yu, J., Huang,

S., Wang, X., Wang, J., Lu, K., Fang, Z., Bancroft, I., Yang, T.-J., Hu, Q., Wang, X., Yue,

Z., Li, H., Yang, L., Wu, J., Zhou, Q., Wang, W., King, G. J., Pires, J. C., Lu, C., Wu, Z.,

Sampath, P., Wang, Z., Guo, H., Pan, S., Yang, L., Min, J., Zhang, D., Jin, D., Li, W.,

Belcram, H., Tu, J., Guan, M., Qi, C., Du, D., Li, J., Jiang, L., Batley, J., Sharpe, A. G.,

Park, B.-S., Ruperao, P., Cheng, F., Waminal, N. E., Huang, Y., Dong, C., Wang, L., Li, J.,

Hu, Z., Zhuang, M., Huang, Y., Huang, J., Shi, J., Mei, D., Liu, J., Lee, T.-H., Wang, J., Jin,

H., Li, Z., Li, X., Zhang, J., Xiao, L., Zhou, Y., Liu, Z., Liu, X., Qin, R., Tang, X., Liu, W.,

Wang, Y., Zhang, Y., Lee, J., Kim, H. H., Denoeud, F., Xu, X., Liang, X., Hua, W., Wang,

X., Wang, J., Chalhoub, B. & Paterson, A. H. 2014. The Brassica oleracea genome reveals

the asymmetrical evolution of polyploid genomes. Nature Communications, 5.

Long, Y., Wang, Z., Sun, Z., Fernando, D. W. G., McVetty, P. B. E. & Li, G. 2011. Identification

of two blackleg resistance genes and fine mapping of one of these two genes in a Brassica

napus canola cultivar 'Surpass 400'. Theoretical and Applied Genetics, 122, 1223-1231.

Lozano, R., Ponce, O., Ramirez, M., Mostajo, N. & Orjeda, G. 2012. Genome-Wide Identification

and Mapping of NBS-Encoding Resistance Genes in Solanum tuberosum Group Phureja.

PLoS ONE, 7, e34775.

Lyons, E., Pedersen, B., Kane, J., Alam, M., Ming, R., Tang, H., Wang, X., Bowers, J., Paterson,

A., Lisch, D. & Freeling, M. 2008. Finding and Comparing Syntenic Regions among

Arabidopsis and the Outgroups Papaya, Poplar, and Grape: CoGe with Rosids. Plant

Physiology, 148, 1772-1781.

Ma, L., Lukasik, E., Gawehns, F. & Takken, F. W. 2012. The Use of Agroinfiltration for Transient

Expression of Plant Resistance and Fungal Effector Proteins in Nicotiana benthamiana

Leaves. In: BOLTON, M. D. & THOMMA, B. P. H. J. (eds.) Plant Fungal Pathogens.

Humana Press.

Madsen, L. H., Collins, N. C., Rakwalska, M., Backes, G., Sandal, N., Krusell, L., Jensen, J.,

Waterman, E. H., Jahoor, A., Ayliffe, M., Pryor, A. J., Langridge, P., Schulze-Lefert, P. &

Stougaard, J. 2003. Barley disease resistance gene analogs of the NBS-LRR class:

identification and mapping. Molecular Genetics and Genomics, 269, 150-161.

Malory, S., Shapter, F. M., Elphinstone, M. S., Chivers, I. H. & Henry, R. J. 2011. Characterizing

homologues of crop domestication genes in poorly described wild relatives by high-

throughput sequencing of whole genomes. Plant Biotechnology Journal, 9, 1131-1140.

Marcroft, S. & Bluett, C. 2008. Blackleg of canola. Agriculture notes. State of Victoria, Department

of Primary Industries.

169

Marcroft, S. J., Sprague, S. J., Pymer, S. J., Salisbury, P. A. & Howlett, B. J. 2004. Crop isolation,

not extended rotation length, reduces blackleg Leptosphaeria maculans severity of canola

Brassica napus in south-eastern Australia. Australian Journal of Experimental Agriculture,

44, 601-606.

Marcroft, S. J., Elliott, V. L., Cozijnsen, A. J., Salisbury, P. A., Howlett, B. J. & Van de Wouw, A.

P. 2012a. Identifying resistance genes to Leptosphaeria maculans in Australian Brassica

napus cultivars based on reactions to isolates with known avirulence genotypes. Crop and

Pasture Science, 63, 338-350.

Marcroft, S. J., Van de Wouw, A. P., Salisbury, P. A., Potter, T. D. & Howlett, B. J. 2012b. Effect

of rotation of canola (Brassica napus) cultivars with different complements of blackleg

resistance genes on disease severity. Plant Pathology, 61, 934-944.

Matsumoto, E., Ueno, H. & Aruga, D. 2012. Accumulation of Three Clubroot Resistance Genes

through Marker-assisted Selection in Chinese Cabbage (Brassica rapa ssp. pekinensis).

Journal of the Japanese Society for Horticultural Science, 81, 184-190.

Mayerhofer, R., Bansal, V. K., Thiagarajah, M. R., Stringam, G. R. & Good, A. G. 1997. Molecular

mapping of resistance to Leptosphaeria maculans in Australian cultivars of Brassica napus.

Genome, 40, 294-301.

Mayerhofer, R., Wilde, K., Mayerhofer, M., Lydiate, D., Bansal, V. K., Good, A. G. & Parkin, I. A.

P. 2005. Complexities of chromosome landing in a highly duplicated genome: Toward map-

based cloning of a gene controlling blackleg resistance in Brassica napus. Genetics, 171,

1977-1988.

McCarthy, M., Abecasis, G., Cardon, L., Goldstein, D., Little, J., Ioannidis, J. & Hirschhorn, J.

2008. Genome-wide association studies for complex traits: consensus, uncertainty and

challenges. Nature Reviews Genetics, 9, 356-369.

McCouch, S. R. 2001. Genomics and Synteny. Plant Physiology, 125, 152-155.

McDowell, J. M., Dhandaydham, M., Long, T. A., Aarts, M. G. M., Goff, S., Holub, E. B. & Dangl,

J. L. 1998. Intragenic Recombination and Diversifying Selection Contribute to the Evolution

of Downy Mildew Resistance at the RPP8 Locus of Arabidopsis. The Plant Cell, 10, 1861-

1874.

McDowell, J. M. & Woffenden, B. J. 2003. Plant disease resistance genes: recent insights and

potential applications. Trends in biotechnology, 21, 178-183.

McDowell, J. M. & Simon, S. A. 2006. Recent insights into R gene evolution. Molecular Plant

Pathology, 7, 437-448.

McHale, L., Tan, X., Koehl, P. & Michelmore, R. 2006. Plant NBS-LRR proteins: adaptable

guards. Genome Biology, 7, 212.

Meyers, B., Dickerman, A., Michelmore, R., Sivaramakrishnan, S., Sobral, B. & Young, N. 1999.

Plant disease resistance genes encode members of an ancient and diverse protein family

within the nucleotide-binding superfamily. The Plant Journal, 20, 317 - 332.

Meyers, B., Kozik, A., Griego, A., Kuang, H. & Michelmore, R. 2003. Genome-wide analysis of

NBS-LRR-encoding genes in Arabidopsis. The Plant Cell, 15, 809 - 834.

Meyers, B. C., Morgante, M. & Michelmore, R. W. 2002. TIR-X and TIR-NBS proteins: two new

families related to disease resistance TIR-NBS-LRR proteins encoded in Arabidopsis and

other plant genomes. The Plant Journal, 32, 77-92.

Meyers, B. C., Kaushik, S. & Nandety, R. S. 2005. Evolving disease resistance genes. Current

Opinion in Plant Biology, 8, 129-134.

Michelmore, R. & Meyers, B. 1998. Clusters of resistance genes in plants evolve by divergent

selection and a birth-and-death process. Genome Research, 8, 1113 - 1130.

Monosi, B., Wisser, R. J., Pennill, L. & Hulbert, S. H. 2004. Full-genome analysis of resistance

gene homologues in rice. Theoretical and Applied Genetics, 109, 1434-1447.

Moriguchi, K., Kimizuka-Takagi, C., Ishii, K. & Nomura, K. 1999. A genetic map based on RAPD,

RFLP, isozyme, morphological markers and QTL analysis for clubroot resistance in

Brassica oleracea. Breeding science, 49, 257-265.

170

Mun, J., Yu, H., Park, S. & Park, B. 2009. Genome-wide identification of NBS-encoding resistance

genes in Brassica rapa. Molecular Genetics and Genomics, 282, 617 - 631.

Nagaoka, T., Doullah, M. A. U., Matsumoto, S., Kawasaki, S., Ishikawa, T., Hori, H. & Okazaki,

K. 2010. Identification of QTLs that control clubroot resistance in Brassica oleracea and

comparative analysis of clubroot resistance genes between B. rapa and B. oleracea.

Theoretical and Applied Genetics, 120, 1335-1346.

Nandety, R. S., Caplan, J. L., Cavanaugh, K., Perroud, B., Wroblewski, T., Michelmore, R. W. &

Meyers, B. C. 2013. The Role of TIR-NBS and TIR-X Proteins in Plant Basal Defense

Responses. Plant Physiology, 162, 1459-1472.

Nashaat, N. I., Heran, A., Mitchell, S. E. & Awasthi, R. P. 1997. New genes for resistance to downy

mildew (Peronospora parasitica) in oilseed rape (Brassica napus ssp. oleifera). Plant

Pathology, 46, 964-968.

Nobuta, K., Ashfield, T., Kim, S. & Innes, R. W. 2005. Diversification of Non-TIR Class NB-LRR

Genes in Relation to Whole-Genome Duplication Events in Arabidopsis. Molecular Plant-

Microbe Interactions, 18, 103-109.

Nomura, K., Minegishi, Y., Kimizuka-Takagi, C., Fujioka, T., Moriguchi, K., Shishido, R. &

Ikehashi, H. 2005. Evaluation of F2 and F3 plants introgressed with QTLs for clubroot

resistance in cabbage developed by using SCAR markers. Plant Breeding, 124, 371-375.

Osterberg, M. K., Shavorskaya, O., Lascoux, M. & Lagercrantz, U. 2002. Naturally Occurring Indel

Variation in the Brassica nigra COL1 Gene Is Associated With Variation in Flowering

Time. Genetics, 161, 299-306.

Pan, Q., Liu, Y.-S., Budai-Hadrian, O., Sela, M., Carmel-Goren, L., Zamir, D. & Fluhr, R. 2000.

Comparative Genetics of Nucleotide Binding Site-Leucine Rich Repeat Resistance Gene

Homologues in the Genomes of Two Dicotyledons: Tomato and Arabidopsis. Genetics, 155,

309-322.

Parkin, I., Koh, C., Tang, H., Robinson, S., Kagale, S., Clarke, W., Town, C., Nixon, J.,

Krishnakumar, V., Bidwell, S., Denoeud, F., Belcram, H., Links, M., Just, J., Clarke, C.,

Bender, T., Huebert, T., Mason, A., Pires, C., Barker, G., Moore, J., Walley, P., Manoli, S.,

Batley, J., Edwards, D., Nelson, M., Wang, X., Paterson, A., King, G., Bancroft, I.,

Chalhoub, B. & Sharpe, A. 2014. Transcriptome and methylome profiling reveals relics of

genome dominance in the mesopolyploid Brassica oleracea. Genome Biology, 15, R77.

Parkin, I. A. P., Sharpe, A. G. & Lydiate, D. J. 2003. Patterns of genome duplication within the

Brassica napus genome. Genome, 46, 291-303.

Parkin, I. A. P., Gulden, S. M., Sharpe, A. G., Lukens, L., Trick, M., Osborn, T. C. & Lydiate, D. J.

2005. Segmental structure of the Brassica napus genome based on comparative analysis with

Arabidopsis thaliana. Genetics, 171, 765-781.

Parlange, F., Daverdin, G., Fudal, I., Kuhn, M. L., Balesdent, M. H., Blaise, F., Grezes-Besset, B. &

Rouxel, T. 2009. Leptosphaeria maculans avirulence gene AvrLm4-7 confers a dual

recognition specificity by the Rlm4 and Rlm7 resistance genes of oilseed rape, and

circumvents Rlm4-mediated recognition through a single amino acid change. Molecular

Microbiology, 71, 851-863.

Parniske, M. & Jones, J. D. G. 1999. Recombination between diverged clusters of the tomato Cf-9

plant disease resistance gene family. Proceedings of the National Academy of Sciences, 96,

5850-5855.

Paterson, A. H., Bowers, J. E., Burow, M. D., Draye, X., Elsik, C. G., Jiang, C.-X., Katsar, C. S.,

Lan, T.-H., Lin, Y.-R., Ming, R. & Wright, R. J. 2000. Comparative Genomics of Plant

Chromosomes. The Plant Cell Online, 12, 1523-1539.

Paterson, A. H., Lan, T. H., Amasino, R., Osborn, T. C. & Quiros, C. 2001. Brassica genomics: A

complement to, and early beneficiary of, the Arabidopsis sequence. Genome Biology, 2.

Paterson, A. H., Bowers, J. E., Bruggmann, R., Dubchak, I., Grimwood, J., Gundlach, H., Haberer,

G., Hellsten, U., Mitros, T., Poliakov, A., Schmutz, J., Spannagl, M., Tang, H., Wang, X.,

Wicker, T., Bharti, A. K., Chapman, J., Feltus, F. A., Gowik, U., Grigoriev, I. V., Lyons, E.,

171

Maher, C. A., Martis, M., Narechania, A., Otillar, R. P., Penning, B. W., Salamov, A. A.,

Wang, Y., Zhang, L., Carpita, N. C., Freeling, M., Gingle, A. R., Hash, C. T., Keller, B.,

Klein, P., Kresovich, S., McCann, M. C., Ming, R., Peterson, D. G., Mehboob Ur, R., Ware,

D., Westhoff, P., Mayer, K. F. X., Messing, J. & Rokhsar, D. S. 2009. The Sorghum bicolor

genome and the diversification of grasses. Nature, 457, 551-556.

Piao, Z., Ramchiary, N. & Lim, Y. 2009. Genetics of Clubroot Resistance in Brassica Species.

Journal of Plant Growth Regulation, 28, 252-264.

Piao, Z. Y., Deng, Y. Q., Choi, S. R., Park, Y. J. & Lim, Y. P. 2004. SCAR and CAPS mapping of

CRb, a gene conferring resistance to Plasmodiophora brassicae in Chinese cabbage

(Brassica rapa ssp. pekinensis). Theoretical and Applied Genetics, 108, 1458-1465.

Pilet, M. L., Delourme, R., Foisset, N. & Renard, M. 1998. Identification of QTL involved in field

resistance to light leaf spot (Pyrenopeziza brassicae) and blackleg resistance (Leptosphaeria

maculans) in winter rapeseed (Brassica napus L.). Theoretical and Applied Genetics, 97,

398-406.

Pilet, M. L., Duplan, G., Archipiano, M., Barret, P., Baron, C., Horvais, R., Tanguy, X., Lucas, M.

O., Renard, M. & Delourme, R. 2001. Stability of QTL for field resistance to blackleg

across two genetic backgrounds in oilseed rape. Crop Science, 41, 197-205.

Plocik, A., Layden, J. & Kesseli, R. 2004. Comparative analysis of NBS domain sequences of NBS-

LRR disease resistance genes from sunflower, lettuce, and chicory. Molecular Phylogenetics

and Evolution, 31, 153-163.

Porter, B. W., Paidi, M., Ming, R., Alam, M., Nishijima, W. T. & Zhu, Y. J. 2009. Genome-wide

analysis of Carica papaya reveals a small NBS resistance gene family. Molecular Genetics

and Genomics, 281, 609-626.

Quint, M., Dussle, C., Melchinger, A. & Lübberstedt, T. 2003. Identification of genetically linked

RGAs by BAC screening in maize and implications for gene cloning, mapping and MAS.

Theoretical and Applied Genetics, 106, 1171-1177.

Rafalski, A. 2002. Applications of single nucleotide polymorphisms in crop genetics. Current

Opinion in Plant Biology, 5, 94 - 100.

Ramalingam, J., Vera Cruz, C. M., Kukreja, K., Chittoor, J. M., Wu, J. L., Lee, S. W., Baraoidan,

M., George, M. L., Cohen, M. B., Hulbert, S. H., Leach, J. E. & Leung, H. 2003. Candidate

Defense Genes from Rice, Barley, and Maize and Their Association with Qualitative and

Quantitative Resistance in Rice. Molecular Plant-Microbe Interactions, 16, 14-24.

Raman, H., Dalton-Morgan, J., Diffey, S., Raman, R., Alamery, S., Edwards, D. & Batley, J. 2014.

SNP markers-based map construction and genome-wide linkage analysis in Brassica napus.

Plant Biotechnology Journal, n/a-n/a.

Raman, R., Taylor, B., Lindbeck, K., Coombes, N., Barbulescu, D., Salisbury, P. & Raman, H.

2012a. Molecular mapping and validation of Rlm1 gene for resistance to Leptosphaeria

maculans in canola (Brassica napus L.). Crop and Pasture Science, 63, 1007-1017.

Raman, R., Taylor, B., Marcroft, S., Stiller, J., Eckermann, P., Coombes, N., Rehman, A.,

Lindbeck, K., Luckett, D., Wratten, N., Batley, J., Edwards, D., Wang, X. & Raman, H.

2012b. Molecular mapping of qualitative and quantitative loci for resistance to

Leptosphaeria maculans causing blackleg disease in canola (Brassica napus L.). Theoretical

and Applied Genetics, 125, 405-418.

Rana, D., van den Boogaart, T., O'Neill, C. M., Hynes, L., Bent, E., Macpherson, L., Park, J. Y.,

Lim, Y. P. & Bancroft, I. 2004. Conservation of the microstructure of genome segments in

Brassica napus and its diploid relatives. The Plant Journal, 40, 725-733.

Ratnaparkhe, M., Wang, X., Li, J., Compton, R., Rainville, L., Lemke, C., Kim, C., Tang, H. &

Paterson, A. 2011. Comparative analysis of peanut NBS-LRR gene clusters suggests

evolutionary innovation among duplicated domains and erosion of gene microsynteny. New

Phytologist, 192, 164 - 178.

172

Richly, E., Kurth, J. & Leister, D. 2002. Mode of Amplification and Reorganization of Resistance

Genes During Recent Arabidopsis thaliana Evolution. Molecular Biology and Evolution, 19,

76-84.

Rimmer, S. & van den Berg, C. 1992. Resistance of oilseed Brassica spp. to blackleg caused by

Lptosphaeria maculans Canadian Journal of Plant Pathology, 14, 56-66.

Rimmer, S. R. 2006. Resistance genes to Leptosphaeria maculans in Brassica napus. Canadian

Journal of Plant Pathology, 28, S288-S297.

Rocherieux, J., Glory, P., Giboulot, A., Boury, S., Barbeyron, G., Thomas, G. & Manzanares-

Dauleux, M. J. 2004. Isolate-specific and broad-spectrum QTLs are involved in the control

of clubroot in Brassica oleracea. Theoretical and Applied Genetics, 108, 1555-1563.

Rouxel, T., Penaud, A., Pinochet, X., Brun, H., Gout, L., Delourme, R., Schmit, J. & Balesdent, M.-

H. 2003a. A 10-year Survey of Populations of Leptosphaeria maculans in France Indicates a

Rapid Adaptation Towards the Rlm1 Resistance Gene of Oilseed Rape. European Journal

of Plant Pathology, 109, 871-881.

Rouxel, T., Willner, E., Coudard, L. & Balesdent, M. H. 2003b. Screening and identification of

resistance to Leptosphaeria maculans (stem canker) in Brassica napus accessions. Euphytica,

133, 219-231.

Rouxel, T. & Balesdent, M. H. 2005. The stem canker (blackleg) fungus, Leptosphaeria maculans,

enters the genomic era. Molecular Plant Pathology, 6, 225-241.

Rouxel, T., Grandaubert, J., Hane, J. K., Hoede, C., van de Wouw, A. P., Couloux, A., Dominguez,

V., Anthouard, V., Bally, P., Bourras, S., Cozijnsen, A. J., Ciuffetti, L. M., Degrave, A.,

Dilmaghani, A., Duret, L., Fudal, I., Goodwin, S. B., Gout, L., Glaser, N., Linglin, J., Kema,

G. H. J., Lapalu, N., Lawrence, C. B., May, K., Meyer, M., Ollivier, B., Poulain, J., Schoch,

C. L., Simon, A., Spatafora, J. W., Stachowiak, A., Turgeon, B. G., Tyler, B. M., Vincent,

D., Weissenbach, J., Amselem, J., Quesneville, H., Oliver, R. P., Wincker, P., Balesdent,

M.-H. & Howlett, B. J. 2011. Effector diversification within compartments of the

Leptosphaeria maculans genome affected by Repeat-Induced Point mutations. Nature

Communications, 2, 202.

Ruperao, P., Chan, C.-K. K., Azam, S., Karafiátová, M., Hayashi, S., Čížková, J., Saxena, R. K.,

Šimková, H., Song, C., Vrána, J., Chitikineni, A., Visendi, P., Gaur, P. M., Millán, T.,

Singh, K. B., Taran, B., Wang, J., Batley, J., Doležel, J., Varshney, R. K. & Edwards, D.

2014. A chromosomal genomics approach to assess and validate the desi and kabuli draft

chickpea genome assemblies. Plant Biotechnology Journal, 12, 778-786.

Saal, B., Brun, H., Glais, I. & Struss, D. 2004. Identification of a Brassica juncea-derived recessive

gene conferring resistance to Leptosphaeria maculans in oilseed rape. Plant Breeding, 123,

505-511.

Saal, B. & Struss, D. 2005. RGA- and RAPD-derived SCAR markers for a Brassica B-genome

introgression conferring resistance to blackleg in oilseed rape. Theoretical and Applied

Genetics, 111, 281-290.

Saito, M., Kubo, N., Matsumoto, S., Suwabe, K., Tsukada, M. & Hirai, M. 2006. Fine mapping of

the clubroot resistance gene, Crr3, in Brassica rapa. Theoretical and Applied Genetics, 114,

81-91.

Saitou, N. & Nei, M. 1987. The neighbor-joining method: a new method for reconstructing

phylogenetic trees. Molecular Biology and Evolution, 4, 406-425.

Sakamoto, K., Saito, A., Hayashida, N., Taguchi, G. & Matsumoto, E. 2008. Mapping of isolate-

specific QTLs for clubroot resistance in Chinese cabbage (Brassica rapa L. ssp. pekinensis).

Theoretical and Applied Genetics, 117, 759-767.

Salisbury, P. A., Ballinger, D. J., Wratten, N., Plummer, K. M. & Howlett, B. J. 1995. Blackleg

disease on oilseed Brassica in Australia: A review. Australian Journal of Experimental

Agriculture, 35, 665-672.

173

Salmon, A., Udall, J. A., Jeddeloh, J. A. & Wendel, J. 2012. Targeted Capture of Homoeologous

Coding and Noncoding Sequence in Polyploid Cotton. G3: Genes, Genomes, Genetics, 2,

921-930.

Singh, S., Sharma, S., Kalia, P., Deshmukh, R., Kumar, V., Sharma, P. & Sharma, T. 2012.

Molecular mapping of the downy mildew resistance gene Ppa3 in cauliflower (Brassica

oleracea var. botrytis L.). JOURNAL OF HORTICULTURAL SCIENCE &

BIOTECHNOLOGY, 87, 137-143.

Snowdon, R., Lühs, W. & Friedt, W. 2007. Oilseed Rape. In: KOLE, C. (ed.) Genome Mapping

and Molecular Breeding in Plants (Oilseeds). Springer Berlin Heidelberg.

Speulman, E., Bouchez, D., Holub, E. B. & Beynon, J. L. 1998. Disease resistance gene homologs

correlate with disease resistance loci of Arabidopsis thaliana. The Plant Journal, 14, 467-

474.

Sprague, S. J., Balesdent, M. H., Brun, H., Hayden, H. L., Marcroft, S. J., Pinochet, X., Rouxel, T.

& Howlett, B. J. 2006. Major gene resistance in Brassica napus (oilseed rape) is overcome

by changes in virulence of populations of Leptosphaeria maculans in France and Australia.

European Journal of Plant Pathology, 114, 33-40.

Staal, J., Kaliff, M., Bohman, S. & Dixelius, C. 2006. Transgressive segregation reveals two

Arabidopsis TIR-NB-LRR resistance genes effective against Leptosphaeria maculans,

causal agent of blackleg disease. The Plant Journal, 46, 218-230.

Staal, J. & Dixelius, C. 2008. RLM3, a potential adapter between specific TIR-NB-LRR receptors

and DZC proteins. communicative and intergrative biology, 1, 59-61.

Staal, J., Kaliff, M., Dewaele, E., Persson, M. & Dixelius, C. 2008. RLM3, a TIR domain encoding

gene involved in broad-range immunity of Arabidopsis to necrotrophic fungal pathogens.

The Plant Journal, 55, 188-200.

Staskawicz, B. J., Ausubel, F. M., Baker, B. J., Ellis, J. G. & Jones, J. D. G. 1995. Molecular

genetics of plant disease resistance. Science, 268, 661-667.

Suwabe, K., Tsukazaki, H., Iketani, H., Hatakeyama, K., Fujimura, M., Nunome, T., Fukuoka, H.,

Matsumoto, S. & Hirai, M. 2003. Identification of two loci for resistance to clubroot

(Plasmodiophora brassicae Woronin) in Brassica rapa L. Theoretical and Applied Genetics,

107, 997-1002.

Suwabe, K., Tsukazaki, H., Iketani, H., Hatakeyama, K., Kondo, M., Fujimura, M., Nunome, T.,

Fukuoka, H., Hirai, M. & Matsumoto, S. 2006. Simple Sequence Repeat-Based

Comparative Genomics Between Brassica rapa and Arabidopsis thaliana: The Genetic

Origin of Clubroot Resistance. Genetics, 173, 309-319.

Suwabe, K., Suzuki, G., Nunome, T., Hatakeyama, K., Mukai, Y., Fukuoka, H. & Matsumoto, S.

2012. Microstructure of a Brassica rapa genome segment homoeologous to the resistance

gene cluster on Arabidopsis chromosome 4. Breeding science, 62, 170.

Syvanen, A.-C. 2005. Toward genome-wide SNP genotyping. NATURE GENETICS, 378.

Tamura, K., Stecher, G., Peterson, D., Filipski, A. & Kumar, S. 2013. MEGA6: Molecular

Evolutionary Genetics Analysis Version 6.0. Molecular Biology and Evolution, 30, 2725-

2729.

Tan, X., Meyers, B. C., Kozik, A., West, M. A., Morgante, M., St Clair, D. A., Bent, A. F. &

Michelmore, R. W. 2007. Global expression analysis of nucleotide binding site-leucine rich

repeat-encoding and related genes in Arabidopsis. BMC Plant Biology, 7.

Tang, H., Bowers, J. E., Wang, X., Ming, R., Alam, M. & Paterson, A. H. 2008. Synteny and

Collinearity in Plant Genomes. Science, 320, 486-488.

Tarr, D. & Alexander, H. 2009. TIR-NBS-LRR genes are rare in monocots: evidence from diverse

monocot orders. BMC Research Notes, 2, 197.

The International Wheat Genome Sequencing Consortium 2014. A chromosome-based draft

sequence of the hexaploid bread wheat (Triticum aestivum) genome. Science, 345.

174

Thompson, J. D., Higgins, D. G. & Gibson, T. J. 1994. CLUSTAL W: improving the sensitivity of

progressive multiple sequence alignment through sequence weighting, position-specific gap

penalties and weight matrix choice. Nucleic Acids Research, 22, 4673-4680.

Tollenaere, R., Hayward, A., Dalton-Morgan, J., Campbell, E., Lee, J. R. M., Lorenc, M. T.,

Manoli, S., Stiller, J., Raman, R., Raman, H., Edwards, D. & Batley, J. 2012. Identification

and characterization of candidate Rlm4 blackleg resistance genes in Brassica napus using

next-generation sequencing. Plant Biotechnology Journal, 10, 709-715.

Trick, M., Kwon, S.-J., Choi, S., Fraser, F., Soumpourou, E., Drou, N., Wang, Z., Lee, S., Yang, T.-

J., Mun, J.-H., Paterson, A., Town, C., Pires, J. C., Pyo Lim, Y., Park, B.-S. & Bancroft, I.

2009. Complexity of genome evolution by segmental rearrangement in Brassica rapa

revealed by sequence-level analysis. BMC Genomics, 10, 539.

Tsuda, K. & Katagiri, F. 2010. Comparing signaling mechanisms engaged in pattern-triggered and

effector-triggered immunity. Current Opinion in Plant Biology, 13, 459-465.

U, N. 1935. Genome analysis in Brassica with special reference to the experimental formation of

B.napus and peculiar mode of fertilization Japanese journal of botany, 7, 389-452.

Ueno, H., Matsumoto, E., Aruga, D., Kitagawa, S., Matsumura, H. & Hayashida, N. 2012.

Molecular characterization of the CRa gene conferring clubroot resistance in Brassica rapa.

Plant Molecular Biology, 80, 621-629.

USDA 2014. Oilseeds: World markets and trade. available at

http://www.fas.usda.gov/data/oilseeds-world-markets-and-trade. United States Department

of Agriculture

Vaghchhipawala, Z., Rojas, C., Senthil-Kumar, M. & Mysore, K. 2011. Agroinoculation and

Agroinfiltration: Simple Tools for Complex Gene Function Analyses. In: PEREIRA, A.

(ed.) Plant Reverse Genetics. Humana Press.

Van De Wouw, A. P., Marcroft, S. J., Barbetti, M. J., Hua, L., Salisbury, P. A., Gout, L., Rouxel,

T., Howlett, B. J. & Balesdent, M. H. 2009. Dual control of avirulence in Leptosphaeria

maculans towards a Brassica napus cultivar with 'sylvestris-derived' resistance suggests

involvement of two resistance genes. Plant Pathology, 58, 305-313.

Van de Wouw, A. P., Lowe, R. G. T., Elliott, C. E., Dubois, D. J. & Howlett, B. J. 2014. An

avirulence gene, AvrLmJ1, from the blackleg fungus, Leptosphaeria maculans, confers

avirulence to Brassica juncea cultivars. Molecular Plant Pathology, 15, 523-530.

Verma, S. S., Rahman, M. H., Deyholos, M. K., Basu, U. & Kav, N. N. V. 2014. Differential

Expression of miRNAs in Brassica napus Root following Infection with Plasmodiophora

brassicae. PLoS ONE, 9, e86648.

Vicente, J. G. & King, G. J. 2001. Characterisation of disease resistance gene-like sequences in

Brassica oleracea L. Theoretical and Applied Genetics, 102, 555-563.

Voorrips, R. E., Jongerius, M. C. & Kanne, H. J. 1997. Mapping of two genes for resistance to

clubroot (Plasmodiophora brassicae) in a population of doubled haploid lines of Brassica

oleracea by means of RFLP and AFLP markers. Theoretical and Applied Genetics, 94, 75-

82.

Wan, H., Yuan, W., Ye, Q., Wang, R., Ruan, M., Li, Z., Zhou, G., Yao, Z., Zhao, J., Liu, S. &

Yang, Y. 2012. Analysis of TIR- and non-TIR-NBS-LRR disease resistance gene analogous

in pepper: characterization, genetic variation, functional divergence and expression patterns.

BMC Genomics, 13, 502.

Wan, H., Yuan, W., Bo, K., Shen, J., Pang, X. & Chen, J. 2013. Genome-wide analysis of NBS-

encoding disease resistance genes in Cucumis sativus and phylogenetic study of NBS-

encoding genes in Cucurbitaceae crops. BMC Genomics, 14, 109.

Wang, H., Waller, L., Tripathy, S., St. Martin, S. K., Zhou, L., Krampis, K., Tucker, D. M., Mao,

Y., Hoeschele, I., Saghai Maroof, M. A., Tyler, B. M. & Dorrance, A. E. 2010. Analysis of

Genes Underlying Soybean Quantitative Trait Loci Conferring Partial Resistance to

Phytophthora sojae. The Plant Genome, 3, 23-40.

175

Wang, J., Kaur, S., Cogan, N. O. I., Dobrowolski, M. P., Salisbury, P. A., Burton, W. A., Baillie,

R., Hand, M., Hopkins, C., Forster, J. W., Smith, K. F. & Spangenberg, G. 2009.

Assessment of genetic diversity in Australian canola (Brassica napus L.) cultivars using SSR

markers. Crop and Pasture Science, 60, 1193-1201.

Wang, X., Wang, H., Wang, J., Sun, R., Wu, J., Liu, S., Bai, Y., Mun, J.-H., Bancroft, I., Cheng, F.,

Huang, S., Li, X., Hua, W., Wang, J., Wang, X., Freeling, M., Pires, J. C., Paterson, A. H.,

Chalhoub, B., Wang, B., Hayward, A., Sharpe, A. G., Park, B.-S., Weisshaar, B., Liu, B., Li,

B., Liu, B., Tong, C., Song, C., Duran, C., Peng, C., Geng, C., Koh, C., Lin, C., Edwards,

D., Mu, D., Shen, D., Soumpourou, E., Li, F., Fraser, F., Conant, G., Lassalle, G., King, G.

J., Bonnema, G., Tang, H., Wang, H., Belcram, H., Zhou, H., Hirakawa, H., Abe, H., Guo,

H., Wang, H., Jin, H., Parkin, I. A. P., Batley, J., Kim, J.-S., Just, J., Li, J., Xu, J., Deng, J.,

Kim, J. A., Li, J., Yu, J., Meng, J., Wang, J., Min, J., Poulain, J., Wang, J., Hatakeyama, K.,

Wu, K., Wang, L., Fang, L., Trick, M., Links, M. G., Zhao, M., Jin, M., Ramchiary, N.,

Drou, N., Berkman, P. J., Cai, Q., Huang, Q., Li, R., Tabata, S., Cheng, S., Zhang, S.,

Zhang, S., Huang, S., Sato, S., Sun, S., Kwon, S.-J., Choi, S.-R., Lee, T.-H., Fan, W., Zhao,

X., Tan, X., Xu, X., Wang, Y., Qiu, Y., Yin, Y., Li, Y., et al. 2011. The genome of the

mesopolyploid crop species Brassica rapa. NATURE GENETICS, 43, 1035-1039.

Wang, Y., Wang, X. & Paterson, A. H. 2012. Genome and gene duplications and gene expression

divergence: a view from plants. Annals of the New York Academy of Sciences, 1256, 1-14.

Wei, H., Li, W., Sun, X., Zhu, S. & Zhu, J. 2013. Systematic Analysis and Comparison of

Nucleotide-Binding Site Disease Resistance Genes in a Diploid Cotton Gossypium

raimondii. PLoS ONE, 8, e68435.

Werner, S., Diederichsen, E., Frauen, M., Schondelmaier, J. & Jung, C. 2008. Genetic mapping of

clubroot resistance genes in oilseed rape. Theoretical and Applied Genetics, 116, 363-372.

West, J. S., Kharbanda, P. D., Barbetti, M. J. & Fitt, B. D. L. 2001. Epidemiology and management

of Leptosphaeria maculans (phoma stem canker) on oilseed rape in Australia, Canada and

Europe. Plant Pathology, 50, 10-27.

Westermeier, P., Wenzel, G. & Mohler, V. 2009. Development and evaluation of single-nucleotide

polymorphism markers in allotetraploid rapeseed (Brassica napus L.). Theoretical and

Applied Genetics, 119, 1301-1311.

Williams & Fitt 1999. Differentiating A and B groups of Leptosphaeria maculans, causal agent of

stem canker (blackleg) of oilseed rape. Plant Pathology, 48, 161-175.

Williams, P. & Delwiche, P. 1979. screening for resistance to blackleg of crucifers in the seedling

stage. Proceedings of Cruciferous Crops, 164-170.

Winfield, M. O., Wilkinson, P. A., Allen, A. M., Barker, G. L., Coghill, J. A., Burridge, A., Hall,

A., Brenchley, R. C., D’Amore, R. & Hall, N. 2012. Targeted re‐sequencing of the

allohexaploid wheat exome. Plant Biotechnology Journal, 10, 733-742.

Wisser, R. J., Sun, Q., Hulbert, S. H., Kresovich, S. & Nelson, R. J. 2005. Identification and

Characterization of Regions of the Rice Genome Associated With Broad-Spectrum,

Quantitative Disease Resistance. Genetics, 169, 2277-2293.

Wretblad, S., Bohman, S. & Dixelius, C. 2003. Overexpression of a Brassica nigra cDNA gives

enhanced resistance to Leptosphaeria maculans in B. napus. Molecular Plant-Microbe

Interactions, 16, 477-484.

Wroblewski, T., Tomczak, A. & Michelmore, R. 2005. Optimization of Agrobacterium-mediated

transient assays of gene expression in lettuce, tomato and Arabidopsis. Plant Biotechnology

Journal, 3, 259-273.

Wulff, B. B., Heese, A., Tomlinson-Buhot, L., Jones, D. A., de la Peña, M. & Jones, J. D. 2009.

The major specificity-determining amino acids of the tomato Cf-9 disease resistance protein

are at hypervariable solvent-exposed positions in the central leucine-rich repeats. Molecular

Plant-Microbe Interactions, 22, 1203-1213.

176

Yang, S., Zhang, X., Yue, J.-X., Tian, D. & Chen, J.-Q. 2008. Recent duplications dominate NBS-

encoding gene expansion in two woody species. Molecular Genetics and Genomics, 280,

187-198.

Yang, X., Deng, F. & Ramonell, K. 2012. Receptor-like kinases and receptor-like proteins: keys to

pathogen recognition and defense signaling in plant innate immunity. Frontiers in Biology,

7, 155-166.

Yu, F., Lydiate, D. J. & Rimmer, S. R. 2005. Identification of two novel genes for blackleg

resistance in Brassica napus. Theoretical and Applied Genetics, 110, 969-979.

Yu, F., Lydiate, D. J. & Rimmer, S. R. 2008. Identification and mapping of a third blackleg

resistance locus in Brassica napus derived from B. rapa subsp. sylvestris. Genome, 51, 64-

72.

Yu, F., Gugel, R., Kutcher, H., Peng, G. & Rimmer, S. 2012. Identification and mapping of a novel

blackleg resistance locus LepR4 in the progenies from Brassica napus × B. rapa subsp.

sylvestris. Theoretical and Applied Genetics, 1-9.

Yu, H., Zhong, X., Li, B. & Gu, H. 2010. A Molecular Marker Linked to Downy Mildew Resistant

Gene in Heading Chinese Cabbage (Brassica rapa L.ssp .pekinensis (Lour) Olsson) ( in

chinese). CHINESE AGRICULTURAL SCIENCE, 26, 66-70

Yu, H., Xie, W., Wang, J., Xing, Y., Xu, C., Li, X., Xiao, J. & Zhang, Q. 2011a. Gains in QTL

Detection Using an Ultra-High Density SNP Map Based on Population Sequencing Relative

to Traditional RFLP/SSR Markers. PLoS ONE, 6, e17595.

Yu, J., Tehrim, S., Zhang, F., Tong, C., Huang, J., Cheng, X., Dong, C., Zhou, Y., Qin, R., Hua, W.

& Liu, S. 2014. Genome-wide comparative analysis of NBS-encoding genes between

Brassica species and Arabidopsis thaliana. BMC Genomics, 15, 3.

Yu, S., Zhang, F., Yu, R., Zou, Y., Qi, J., Zhao, X., Yu, Y., Zhang, D. & Li, L. 2009. Genetic

mapping and localization of a major QTL for seedling resistance to downy mildew in

Chinese cabbage (Brassica rapa ssp. pekinensis). Molecular Breeding, 23, 573-590.

Yu, S., Zhang, F., Zhao, X., Yu, Y. & Zhang, D. 2011b. Sequence-characterized amplified region

and simple sequence repeat markers for identifying the major quantitative trait locus

responsible for seedling resistance to downy mildew in Chinese cabbage (Brassica rapa ssp.

pekinensis). Plant Breeding, 130, 580-583.

Yue, J., Meyers, B., Chen, J., Tian, D. & Yang, S. 2012. Tracing the origin and evolutionary history

of plant nucleotide-binding site-leucine-rich repeat (NBS-LRR) genes. New Phytologist 193,

1049 - 1063.

Zhang, X., Feng, Y., Cheng, H., Tian, D., Yang, S. & Chen, J.-Q. 2011. Relative evolutionary rates

of NBS-encoding genes revealed by soybean segmental duplication. Molecular Genetics

and Genomics, 285, 79-90.

Zhou, T., Wang, Y., Chen, J. Q., Araki, H., Jing, Z., Jiang, K., Shen, J. & Tian, D. 2004. Genome-

wide identification of NBS genes in japonica rice reveals significant expansion of divergent

non-TIR NBS-LRR genes. Molecular Genetics and Genomics, 271, 402-415.

Zhu, J. S., Struss, D. & Robbelen, G. 1993. Studies on resistance to Phoma lingam in Brassica

napus-Brassica nigra addition lines. Plant Breeding, 111, 192-197.