6
Transposable elements drive rapid phenotypic variation in Capsella rubella Xiao-Min Niu a,b,1 , Yong-Chao Xu a,b,1 , Zi-Wen Li a , Yu-Tao Bian a,b , Xing-Hui Hou a,b , Jia-Fu Chen a,b , Yu-Pan Zou a,b , Juan Jiang a,b , Qiong Wu a , Song Ge a,b , Sureshkumar Balasubramanian c , and Ya-Long Guo a,b,2 a State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, 100093 Beijing, China; b University of Chinese Academy of Sciences, 100049 Beijing, China; and c School of Biological Sciences, Monash University, VIC 3800, Australia Edited by Ian T. Baldwin, Max Planck Institute for Chemical Ecology, Jena, Germany, and approved February 19, 2019 (received for review July 4, 2018) Rapid phenotypic changes in traits of adaptive significance are crucial for organisms to thrive in changing environments. How such phenotypic variation is achieved rapidly, despite limited genetic variation in species that experience a genetic bottleneck is unknown. Capsella rubella, an annual and inbreeding forb (Bras- sicaceae), is a great system for studying this basic question. Its distribution is wider than those of its congeneric species, despite an extreme genetic bottleneck event that severely diminished its genetic variation. Here, we demonstrate that transposable ele- ments (TEs) are an important source of genetic variation that could account for its high phenotypic diversity. TEs are (i ) highly enriched in C. rubella compared with its outcrossing sister species Capsella grandiflora, and (ii ) 4.2% of polymorphic TEs in C. rubella are associated with variation in the expression levels of their ad- jacent genes. Furthermore, we show that frequent TE insertions at FLOWERING LOCUS C (FLC) in natural populations of C. rubella could explain 12.5% of the natural variation in flowering time, a key life history trait correlated with fitness and adaptation. In particular, we show that a recent TE insertion at the 3UTR of FLC affects mRNA stability, which results in reducing its steady- state expression levels, to promote the onset of flowering. Our results highlight that TE insertions can drive rapid phenotypic var- iation, which could potentially help with adaptation to changing environments in a species with limited standing genetic variation. Brassicaceae | Capsella rubella | natural variation | rapid phenotypic variation | transposable elements R apid phenotypic changes in traits of adaptive significance provide organisms with potentials for local adaptation, which is critical for both survival and range expansion of organisms. This becomes even more crucial in the context of global climate change (1). Species often experience a genetic bottleneck after speciation or introduction into a new area that diminishes their genetic variation, but rapidly adapt and become invasive there- after, which is referred to as the genetic paradox of invasion(24). However, the underlying mechanisms mediating these rapid diversification phenotypes, in traits that are correlated with adaptation, are unclear. Transposable elements (TEs) are among the most variable components of the genome, which can replicate and integrate into new positions. Changes in the environment, such as climate change, can alter both the copy number of TEs and their effects on gene regulation, generating novel genetic and phenotypic variation of potential adaptive significance (5). Therefore, TEs have the potential to quickly create abundant genetic diversity and thus be agents of rapid adaptation (610). Despite extensive studies on their phenotypic effects, the extent to which TEs can contribute to the process of rapid adaptation is largely unknown (1113). To explore whether TEs could drive rapid phenotypic di- versification in species with limited genetic variation, we focused on Capsella rubella (Brassicaceae), an annual and inbreeding forb that experienced a genetic bottleneck during speciation (14, 15). C. rubella provides a typical example of the genetic paradox of invasion with a much wider distribution than that of its out- crossing sister species Capsella grandiflora (16). C. rubella provides an excellent opportunity to unravel the underlying mechanisms, in addition to its transition to selfing that helps with rapid coloni- zation, for the genetic paradox of invasion (14, 15, 17). In this study, we reasoned that, if TEs are critical for rapid phenotypic variation and adaptation, their distribution in the C. rubella genome might reflect this. Through population genomics analyses, we show that TEs are highly enriched in promoters and downstream regions of genes in C. rubella compared with C. grandiflora. TEs are also highly polymorphic in natural pop- ulations of C. rubella, and 4.2% of polymorphic TE insertions are associated with significant changes in expression levels of their adjacent genes. In particular, we demonstrate that frequent TE insertions at the FLOWERING LOCUS C (FLC) locus in natural populations of C. rubella affect its expression and could explain 12.5% of the variation in flowering time, one of the most important life history traits correlated with adaptation. We also reveal that a TE insertion in the 3UTR affects mRNA stability. Overall, our results indicate that TEs play a crucial role in rapid phenotypic variation, which could potentially promote adaptation to changing Significance The mechanisms underlying rapid adaptation to changing en- vironments in species with reduced genetic variation, referred to as the genetic paradox of invasion,are unknown. We re- port that transposable elements (TEs) are highly enriched in the gene promoter regions of Capsella rubella compared with its outcrossing sister species Capsella grandiflora. We also show that a number of polymorphic TEs in C. rubella are associated with changes in gene expression. Frequent TE insertions at FLOWERING LOCUS C of C. rubella affect flowering-time vari- ation, an important life history trait correlated with fitness. These results indicate that TE insertions drive rapid phenotypic variation, which could potentially help adapting to novel en- vironments in species with limited genetic variation. Author contributions: Y.-L.G. designed research; X.-M.N., Y.-C.X., Z.-W.L., Y.-T.B., X.-H.H., J.-F.C., and J.J. performed research; Y.-L.G. contributed new reagents/analytic tools; X.-M.N., Y.-C.X., Z.-W.L., Y.-P.Z., and Q.W. analyzed data; and X.-M.N., S.G., S.B., and Y.-L.G. wrote the paper. The authors declare no conflict of interest. This article is a PNAS Direct Submission. This open access article is distributed under Creative Commons Attribution-NonCommercial- NoDerivatives License 4.0 (CC BY-NC-ND). Data deposition: The data described in the paper have been deposited in the National Center for Biotechnology Information Sequence Read Archive under accession nos. PRJNA392709, PRJNA392711, and PRJNA511520 and in GenBank under accession nos. MF422379MF422413, MF422414MF422449, MF422450MF422525, MF142149MF142156, and MG492014MG492020. 1 X.-M.N. and Y.-C.X. contributed equally to this work. 2 To whom correspondence should be addressed. Email: [email protected]. This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10. 1073/pnas.1811498116/-/DCSupplemental. www.pnas.org/cgi/doi/10.1073/pnas.1811498116 PNAS Latest Articles | 1 of 6 EVOLUTION Downloaded by guest on March 21, 2021

Transposable elements drive rapid phenotypic variation in ... · 3/13/2019  · Transposable elements drive rapid phenotypic variation in Capsella rubella Xiao-Min Niua,b,1, Yong-Chao

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Transposable elements drive rapid phenotypic variation in ... · 3/13/2019  · Transposable elements drive rapid phenotypic variation in Capsella rubella Xiao-Min Niua,b,1, Yong-Chao

Transposable elements drive rapid phenotypicvariation in Capsella rubellaXiao-Min Niua,b,1, Yong-Chao Xua,b,1, Zi-Wen Lia, Yu-Tao Biana,b, Xing-Hui Houa,b, Jia-Fu Chena,b, Yu-Pan Zoua,b,Juan Jianga,b, Qiong Wua, Song Gea,b, Sureshkumar Balasubramanianc, and Ya-Long Guoa,b,2

aState Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, 100093 Beijing, China; bUniversity ofChinese Academy of Sciences, 100049 Beijing, China; and cSchool of Biological Sciences, Monash University, VIC 3800, Australia

Edited by Ian T. Baldwin, Max Planck Institute for Chemical Ecology, Jena, Germany, and approved February 19, 2019 (received for review July 4, 2018)

Rapid phenotypic changes in traits of adaptive significance arecrucial for organisms to thrive in changing environments. Howsuch phenotypic variation is achieved rapidly, despite limitedgenetic variation in species that experience a genetic bottleneckis unknown. Capsella rubella, an annual and inbreeding forb (Bras-sicaceae), is a great system for studying this basic question. Itsdistribution is wider than those of its congeneric species, despitean extreme genetic bottleneck event that severely diminished itsgenetic variation. Here, we demonstrate that transposable ele-ments (TEs) are an important source of genetic variation that couldaccount for its high phenotypic diversity. TEs are (i) highlyenriched in C. rubella compared with its outcrossing sister speciesCapsella grandiflora, and (ii) 4.2% of polymorphic TEs in C. rubellaare associated with variation in the expression levels of their ad-jacent genes. Furthermore, we show that frequent TE insertions atFLOWERING LOCUS C (FLC) in natural populations of C. rubellacould explain 12.5% of the natural variation in flowering time, akey life history trait correlated with fitness and adaptation. Inparticular, we show that a recent TE insertion at the 3′ UTR ofFLC affects mRNA stability, which results in reducing its steady-state expression levels, to promote the onset of flowering. Ourresults highlight that TE insertions can drive rapid phenotypic var-iation, which could potentially help with adaptation to changingenvironments in a species with limited standing genetic variation.

Brassicaceae | Capsella rubella | natural variation | rapid phenotypicvariation | transposable elements

Rapid phenotypic changes in traits of adaptive significanceprovide organisms with potentials for local adaptation, which

is critical for both survival and range expansion of organisms.This becomes even more crucial in the context of global climatechange (1). Species often experience a genetic bottleneck afterspeciation or introduction into a new area that diminishes theirgenetic variation, but rapidly adapt and become invasive there-after, which is referred to as the “genetic paradox of invasion”(2–4). However, the underlying mechanisms mediating theserapid diversification phenotypes, in traits that are correlated withadaptation, are unclear.Transposable elements (TEs) are among the most variable

components of the genome, which can replicate and integrateinto new positions. Changes in the environment, such as climatechange, can alter both the copy number of TEs and their effectson gene regulation, generating novel genetic and phenotypicvariation of potential adaptive significance (5). Therefore, TEshave the potential to quickly create abundant genetic diversityand thus be agents of rapid adaptation (6–10). Despite extensivestudies on their phenotypic effects, the extent to which TEs cancontribute to the process of rapid adaptation is largely unknown(11–13).To explore whether TEs could drive rapid phenotypic di-

versification in species with limited genetic variation, we focusedon Capsella rubella (Brassicaceae), an annual and inbreedingforb that experienced a genetic bottleneck during speciation (14,15). C. rubella provides a typical example of the genetic paradox

of invasion with a much wider distribution than that of its out-crossing sister species Capsella grandiflora (16). C. rubella providesan excellent opportunity to unravel the underlying mechanisms, inaddition to its transition to selfing that helps with rapid coloni-zation, for the genetic paradox of invasion (14, 15, 17).In this study, we reasoned that, if TEs are critical for rapid

phenotypic variation and adaptation, their distribution in the C.rubella genome might reflect this. Through population genomicsanalyses, we show that TEs are highly enriched in promoters anddownstream regions of genes in C. rubella compared with C.grandiflora. TEs are also highly polymorphic in natural pop-ulations of C. rubella, and 4.2% of polymorphic TE insertions areassociated with significant changes in expression levels of theiradjacent genes. In particular, we demonstrate that frequent TEinsertions at the FLOWERING LOCUS C (FLC) locus in naturalpopulations of C. rubella affect its expression and could explain12.5% of the variation in flowering time, one of the most importantlife history traits correlated with adaptation. We also reveal that aTE insertion in the 3′ UTR affects mRNA stability. Overall, ourresults indicate that TEs play a crucial role in rapid phenotypicvariation, which could potentially promote adaptation to changing

Significance

The mechanisms underlying rapid adaptation to changing en-vironments in species with reduced genetic variation, referredto as the “genetic paradox of invasion,” are unknown. We re-port that transposable elements (TEs) are highly enriched in thegene promoter regions of Capsella rubella compared with itsoutcrossing sister species Capsella grandiflora. We also showthat a number of polymorphic TEs in C. rubella are associatedwith changes in gene expression. Frequent TE insertions atFLOWERING LOCUS C of C. rubella affect flowering-time vari-ation, an important life history trait correlated with fitness.These results indicate that TE insertions drive rapid phenotypicvariation, which could potentially help adapting to novel en-vironments in species with limited genetic variation.

Author contributions: Y.-L.G. designed research; X.-M.N., Y.-C.X., Z.-W.L., Y.-T.B., X.-H.H.,J.-F.C., and J.J. performed research; Y.-L.G. contributed new reagents/analytic tools;X.-M.N., Y.-C.X., Z.-W.L., Y.-P.Z., and Q.W. analyzed data; and X.-M.N., S.G., S.B., andY.-L.G. wrote the paper.

The authors declare no conflict of interest.

This article is a PNAS Direct Submission.

This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND).

Data deposition: The data described in the paper have been deposited in the NationalCenter for Biotechnology Information Sequence Read Archive under accession nos.PRJNA392709, PRJNA392711, and PRJNA511520 and in GenBank under accession nos.MF422379–MF422413, MF422414–MF422449, MF422450–MF422525, MF142149–MF142156,and MG492014–MG492020.1X.-M.N. and Y.-C.X. contributed equally to this work.2To whom correspondence should be addressed. Email: [email protected].

This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1811498116/-/DCSupplemental.

www.pnas.org/cgi/doi/10.1073/pnas.1811498116 PNAS Latest Articles | 1 of 6

EVOLU

TION

Dow

nloa

ded

by g

uest

on

Mar

ch 2

1, 2

021

Page 2: Transposable elements drive rapid phenotypic variation in ... · 3/13/2019  · Transposable elements drive rapid phenotypic variation in Capsella rubella Xiao-Min Niua,b,1, Yong-Chao

environments in plants and could explain the genetic paradoxof invasion.

ResultsTEs Are Highly Enriched in C. rubella and Affect the Expression Levelsof Its Adjacent Genes.We hypothesized that if TEs are among thekey determinants of phenotypic variation, their distribution maydiffer between C. rubella and its outcrossing congeneric sisterspecies C. grandiflora. To test this, we compared the distributionof TEs in 19,795 orthologous gene pairs between C. rubella andC. grandiflora. TEs are highly enriched in different genic regionsof the C. rubella compared with the corresponding regions of C.grandiflora (Fig. 1A). Our analysis with orthologous regionsunmasked these divergent distribution patterns compared withthe previous study (18), since the genomes of these two specieshave varied coverages/assemblies. In addition, TEs are highlyenriched in the promoter and downstream regions of genes in C.rubella compared with C. grandiflora (Fig. 1B), indicating thatTEs could potentially contribute to the diversification of geneexpression in C. rubella.To assess whether TEs are polymorphic in C. rubella, we

scanned 28 C. rubella genomes, which included 27 accessions (18,19) (SI Appendix, Table S1), with MTE accession as a reference(20). A phylogenetic tree based on whole-genome sequencessuggested that these 28 C. rubella accessions are closely related (SIAppendix, Fig. S1). We identified 3,808 polymorphic TEs amongthese genomes (SI Appendix, Fig. S2A). As reported earlier (18),these polymorphic TEs are enriched (89.84%) in intergenic

regions (SI Appendix, Fig. S2B). The number of polymorphic TEsin these accessions ranged from 1,024 to 1,688, and 1,731 in thereference MTE (SI Appendix, Fig. S3 and Table S1). The poly-morphic TEs that are present in at least two accessions wereconsidered as common. In total, of the 3,808 polymorphic TEs,2,558 (67.17%) were present in at least two accessions, suggestinga common allelic variation (Fig. 1C).To assess the functional implications of this variation, we

performed transcriptome sequencing for three representativeaccessions of C. rubella, MTE, 86IT1, and 879 (SI Appendix,Table S2), based on their positions on the phylogenetic tree of 28C. rubella accessions (SI Appendix, Fig. S1). In these three ac-cessions there are 1,309 polymorphic TEs, of which 5.45% werelocated in genic regions and 94.55% were in intergenic regions.We found that 55 of 1,309 polymorphic TE loci in these threeaccessions (4.2%) were associated with significant changes in theexpression levels of their adjacent genes [Wilcoxon sum test,false discovery rate (FDR) corrected, P < 0.05, SI Appendix,Table S3]. Among these TEs that are associated with changes ingene expression, retrotransposons are more common (36.37%,SI Appendix, Fig. S4). A Gene Ontology enrichment analysis (21)of the genes that are affected by the TEs using the Arabidopsisorthologous genes suggested that these genes are involved inlateral root formation and defense response (SI Appendix, Fig.S5). Taken together, these findings suggest that polymorphicTEs can diversify gene expression, which has the potential toshape the phenotypic variation in C. rubella as in other plants (8,22, 23).

Fig. 1. Transposable elements cause phenotypicdiversification and changes in gene expression in C.rubella. (A) Number of TEs in different genic regionsof the C. rubella and C. grandiflora genomes. (B)Enrichment of TEs in 100-bp bins in the regions 2 kbupstream of the start codon and 1 kb downstream ofthe stop codon between orthologous gene pairs ofC. rubella and C. grandiflora. The x axis indicates thedistance from TEs to the start codon (stop codon) ofthe closest gene; the y axis indicates the number ofgenes with TEs at different distances to the startcodon (stop codon). (C) Frequency distribution ofpolymorphic TEs in the genomes of 27 C. rubellanatural accessions with MTE as a reference. The xaxis indicates the number of accessions with thesame TE insertions. (D) Insertion of TEs in FLC in C.rubella populations. Thick lines in different colorsand lengths indicate the classification and length ofthe inserted TEs. Numbers in parentheses indicatethe sequence lengths of inserted TEs; the namesbefore the parentheses indicate the accessions withthe inserted TEs; when there are two accessions,accession names are separated by a backslash (/). (E)Flowering-time variation between accessions with orwithout a TE insertion at the CrFLC locus. (F) Relativeexpression levels of CrFLC, CrFT, and CrSOC1 in ac-cessions with TE insertions. Four biological replicates,each with three technical replicates, were analyzed.Expression data were normalized with CrTubulin andthen to the levels of 879 plants and are shown asmeans ± SD; *P < 0.05; **P < 0.01; NS, insignificant.

2 of 6 | www.pnas.org/cgi/doi/10.1073/pnas.1811498116 Niu et al.

Dow

nloa

ded

by g

uest

on

Mar

ch 2

1, 2

021

Page 3: Transposable elements drive rapid phenotypic variation in ... · 3/13/2019  · Transposable elements drive rapid phenotypic variation in Capsella rubella Xiao-Min Niua,b,1, Yong-Chao

Frequent TE Insertions at CrFLC Affect Natural Variation in FloweringTime. The geographic distribution of C. grandiflora is much nar-rower than that of C. rubella, which has colonized most areas ofsouthern and western Europe with varied environmental conditions(24). Flowering time is an important trait for adaptation to changingclimates and novel environments (25, 26). Interestingly, we foundthat FLC, a determinant of natural variation in flowering time of C.rubella (27), has TE insertions in its genic region (27). To assess thefrequency of TE insertions at CrFLC, we sequenced the CrFLCgenomic region (from 2 kb upstream of the start codon to the endof 3′UTR) in eight accessions by Sanger sequencing and comparedthem with 27 other previously published CrFLC genomic sequences(SI Appendix, Table S4). We found five different TE insertions in 7of 35 accessions (20%). We found a TE insertion in the 3′ UTR intwo accessions (879 and 1208) and four additional TE insertionsin intron 1 in five accessions (39.1, 690, 1204, 844, and TAAL) (Fig.1D). The TE insertions ranged from 498 to 1,958 bp, and most wereHelitron, except one insertion in the TAAL accession, which was aDNA transposon. All these seven accessions with TE insertions areearly flowering compared with the reference accession MTE andaccessions without TEs (Fig. 1E and SI Appendix, Table S4). Theexpression levels of CrFLC are lower (P < 0.05, Student’s t test)than MTE in all of the other six accessions except for 1204 (Fig.1F). This is also reflected in the inversely correlated expressionlevels of the CrFLC-downstream genes FT and SOC1 (Fig. 1F).Furthermore, 12.5% of the variation in flowering time could beexplained by allelic variation of the TE insertions at CrFLC in C.rubella (Pearson’s correlation coefficient, r2 = 0.125, P < 0.05).These results revealed that TE insertions in CrFLC are associatedwith early flowering in C. rubella.Given that each TE insertion is present in only a few C. rubella

accessions at different regions of the FLC locus, we concluded thatthese TE insertions originated recently and independently (SI Ap-pendix, Fig. S1). In the Brassicaceae family, there are frequent TEinsertions at FLC (SI Appendix, Fig. S6A). An analysis of a few ac-cessions from C. grandiflora and C. orientalis and 76 accessions ofArabidopsis thaliana (SI Appendix, Table S5) also suggested that the 3′UTR TE insertion in accession 879 is a recent mutation that ispresent only in two C. rubella accessions (SI Appendix, Fig. S7). If theTE insertions have any potential to be of adaptive significance, onemight expect that these accessions could show some geographicpatterns. To assess this, we compared the origin of these accessions.Intriguingly, accessions with TE insertions clustered in the south-ernmost habitat of this species around the Mediterranean region (SIAppendix, Fig. S6B and Table S4). This region harbors a Mediter-ranean climate, which is characterized by high precipitation, warmwinters, and dry and hot summers (28–31). Plants in the area areusually early flowering to avoid dry and hot summers (28–31). Thehigh TE insertion frequency, associated with early flowering in C.rubella, is consistent with this hypothesis. To assess whether there issignificant niche divergence between C. rubella accessions withor without the TE insertion, we performed ecological nichemodeling and found significant niche divergence between thetwo groups (100 random permutations, one-sample t test, P <0.001, SI Appendix, Fig. S8), consistent with the possibility thatTEs in C. rubella show the potential to cause phenotypic variationof adaptive significance.To test whether TE insertions in the 3′ UTR and intron 1 are

indeed the causal mutations for early flowering, we performedgenetic mapping. First, we analyzed an F2 population (879 ×MTE) derived from a cross between the late-flowering, se-quenced accession MTE (20) and the early flowering accession879 that harbors a TE in the 3′ UTR. Flowering time correlatessignificantly with FLC expression in the Brassicaceae family (32–34). Here, we found that the distribution of flowering time isconsistent with the distribution of CrFLC expression in F2 pop-ulations (Fig. 2 A and B). The F2 individuals with lower CrFLCexpression belong to the 879 genotype (Fig. 2B). Furthermore,

the CrFLC expression level was strongly correlated with the TEinsertion in the 3′ UTR in 2-wk-old seedlings of the F2 pop-ulation (Pearson’s correlation coefficient, r2 = 0.462, P < 0.01).These findings suggested that the 3′ UTR TE potentially af-fected CrFLC expression in this cross.Sequencing of a pool of early flowering plants from the F2

population mapped the causal loci to an ∼863-kb region onchromosome 6 (from position 2.78–3.64 Mb) (Fig. 2C). Wefurther used 825 individuals from the BC2F2 population andnarrowed the causal region down to a 130-kb interval (SI Ap-pendix, Table S6). This mapping interval contained 39 ORFs,including CrFLC (Fig. 2D). Sanger sequencing revealed thatCrFLC has one deletion of 2 bp in the promoter, two SNPs in itsintron, one SNP, and a 1,958-bp Helitron TE insertion in the 3′UTR that differentiated MTE from 879 (Fig. 2 D and E). TheCrFLC expression level in 879 was a quarter of that in MTE (Fig.1F), making it a prime candidate for the observed flowering-timevariation.To assess whether the TE insertion in intron 1 of CrFLC is

also the causal mutation for early flowering, we constructed anF2 population of 844 × MTE. Using a similar sequencing ap-proach, we identified an ∼650-kb region on chromosome 6 basedon 501 F2 individuals (SI Appendix, Fig. S9 A and B). We nar-rowed down the mapping interval to a 210-kb region using 561more F2 individuals and additional markers, which included theCrFLC locus (SI Appendix, Fig. S9 C and D and Table S7). Thelocation of TEs in intron 1 was roughly similar to the position ofLer in A. thaliana, in which a TE insertion regulates histonemodification via siRNA, thereby regulating flowering time (35).Therefore, it is likely that a similar mechanism may mediate theonset of early flowering in 844.However, the TE insertion at the 3′ UTR in accession 879 is

unique (SI Appendix, Fig. S6). To confirm whether the TE in-sertion in the 3′ UTR modulates flowering time, we generatedconstructs with genomic CrFLC from MTE and 879 and an ad-ditional construct of MTE-CrFLC in which the Helitron TE from879 was introduced (MTE-Ins) and compared their ability tocomplement FRIsf2 flc-3 A. thaliana plants (Fig. 3A). CrFLCexpression was higher in transgenic plants harboring the MTEconstruct than in those with 879 or MTE-Ins constructs (Fig. 3B;see SI Appendix, Fig. S10, for 10 randomly selected independenttransgenic lines). Consistent with these findings, levels of theCrFLC downstream genes CrFT and CrSOC1 were inverselycorrelated with the CrFLC expression level (Fig. 3B). Floweringtime was correlated with CrFLC expression. Transgenic plantswith the MTE genomic sequence were late flowering, whiletransgenic plants with either 879 or MTE with the TE insertionwere earlier flowering (all P < 0.001, Student’s t test; Fig. 3C).Taken together, these results suggested that the allelic variationcaused by the TE insertion at the 3′ UTR of CrFLC was re-sponsible for the flowering-time difference between 879 andMTE accessions of C. rubella.

TE in 3′ UTR of CrFLC Causes Alternative Polyadenylation and AffectsIts mRNA Stability. TE insertions can change gene expression byaltering transcription and/or by modulating mRNA stability (8).To assess whether the differential CrFLC expression betweenMTE and 879 is caused by transcriptional differences, wequantified transcription rates by measuring unspliced mRNAabundance (18, 36). The abundance of unspliced CrFLC nascenttranscript is similar between MTE and 879 (Fig. 4A). Consistentwith this, in both accessions, low levels of DNA methylation weredetected around the CrFLC transcription start site and the endof intron 1, two regions important for the regulation of FLCexpression (37) (SI Appendix, Fig. S11). These findings suggestthat transcriptional differences do not account for the observedchanges in gene expression.

Niu et al. PNAS Latest Articles | 3 of 6

EVOLU

TION

Dow

nloa

ded

by g

uest

on

Mar

ch 2

1, 2

021

Page 4: Transposable elements drive rapid phenotypic variation in ... · 3/13/2019  · Transposable elements drive rapid phenotypic variation in Capsella rubella Xiao-Min Niua,b,1, Yong-Chao

TE insertions at the 3′ UTR can give rise to alternative pol-yadenylation (APA), which in turn could modulate mRNA sta-bility (38, 39). The Helitron insertion in 879 introduces acanonical poly(A) signal (AAUAAA). The 3′ RACE assaysrevealed that there is APA in 879-CrFLC (Fig. 4B). To assesswhether APA could fully account for the observed phenotypicdifferences, we generated an additional 879-CrFLC construct in

FRIsf2 flc-3 A. thaliana plants, in which the poly(A) signal wasmutated (879-PAS-mut) (SI Appendix, Fig. S12). These plants werealso early flowering like transgenic plants of 879-CrFLC comparedwith transgenic plants of MTE-CrFLC, which suggests that, whilethe TE insertion at the 3′UTR causes APA, other mechanisms alsocontribute to the reduction in gene expression.To assess mRNA stability, we measured CrFLC expression in the

presence of cordycepin, which inhibits transcription (40), in parentallines. These experiments revealed that 879-CrFLC mRNA de-graded faster than MTE-CrFLC mRNA (Fig. 4C). To furtherconfirm the effect of the 3′ UTR, we generated GUS-reporterconstructs in FRIsf2 flc-3 A. thaliana plants driven by the CrFLCpromoter from MTE with either MTE or 879 3′ UTRs (Fig. 4D).We found that the MTE 3′UTR led to an almost threefold increasein expression compared with that of 879. These results are consis-tent with findings from Arabidopsis, indicating that the 3′ UTRcould affect the stability of FLC mRNA (36).MTE and 879 also differ in AU-rich elements (AREs) and

GU-rich elements (GREs) before their poly(A) tails (Fig. 4B).These elements, which are present only in 879, have the potentialto affect RNA stability (41). To assess whether AREs or GREscould mediate changes in gene expression, we compared ex-pression levels of GUS after specifically mutagenizing these el-ements in transgenic lines (Fig. 4D). While the loss of the firstARE had a significant impact on gene expression, the loss of theGRE and the second ARE did not, suggesting that the first AREat the 3′ UTR of 879 is critical for modulating mRNA expression

Fig. 2. QTL mapping analysis of flowering time. (A) Distribution of flow-ering time of the 557 individuals of the 879 × MTE F2 population. Averagesand ranges of flowering times for the two parental lines are shown above.(B) Histogram of CrFLC expression levels variations among F2 population (n =122) with TE insertion (879 genotype, shown in yellow), without TE insertion(MTE genotype, purple), and heterozygote (gray) at its 3′ UTR. Averages andranges of CrFLC expression levels for the two parental lines are shownabove. Expression data were normalized against CrTubulin. (C) SHOREmapanalysis of flowering time. The homozygosity estimator is 0 for even allelefrequencies for both parents, 1 when homozygous for the late-floweringaccession MTE, and −1 when homozygous for the early-flowering acces-sion 879. (D) Fine mapping by a genetic-linkage analysis and sequence var-iation in the candidate gene CrFLC. The number of recombinants betweenthe markers and the causal locus is indicated below the linkage map. CrFLC ismarked in red. Dashed red lines indicate sequence variation between MTEand 879; red line indicates deleted sequence. (E) Sequence variation in theCrFLC 3′ UTR and the inserted Helitron structure. The Helitron structure isillustrated at the top, composed of the 5′ TC and 3′ CTRR termini (shown inred), loop and two short palindromic sequences close to the 3′ terminus thatcould form a 20-bp hairpin (shown in blue and purple) and are insertedwithin a host dinucleotide AT (shown in green). The end sequences of theCrFLC 3′ UTR Helitrons are shown at the bottom in the same color. Fig. 3. TE insertion at CrFLC explains variation in flowering time. (A) Dia-

grams of constructs. Red vertical lines indicate variation between MTE and879. Pink rectangles indicate the TE insertion. (B) Expression of CrFLC andtwo downstream flowering regulators (FT, SOC1) in A. thaliana transgenicplants, as assayed using a pool of 25 independent transgenic lines. Expres-sion data were normalized against AtTubulin and then to the levels oftransgenic plants with 879-CrFLC and are shown as means ± SD; **P < 0.01.(C) Flowering time for transgenic plants. Flowering times for T1 transgeniclines with FRIsf2 and flc-3 alleles in the A. thaliana Col-0 background areshown; the number of independent transgenic lines scored for each con-struct is given above the bar graph. DTF, days to flowering.

4 of 6 | www.pnas.org/cgi/doi/10.1073/pnas.1811498116 Niu et al.

Dow

nloa

ded

by g

uest

on

Mar

ch 2

1, 2

021

Page 5: Transposable elements drive rapid phenotypic variation in ... · 3/13/2019  · Transposable elements drive rapid phenotypic variation in Capsella rubella Xiao-Min Niua,b,1, Yong-Chao

in these transgenic lines. An analysis of transgenic lines har-boring reporter constructs with partial UTRs from MTE sug-gested that the latter part of the MTE 3′ UTR is more critical asthe deletion of the first half of the MTE 3′ UTR (MTEΔ2)resulted in the same expression level as that of MTE, but thedeletion of the latter half (MTEΔ1) significantly reduced theexpression level (Fig. 4D). These results indicate that the pres-ence of AREs and the absence of the latter part of the MTE 3′UTR in 879-CrFLC might reduce CrFLC expression probably byaffecting mRNA stability (Fig. 4D).

DiscussionInvasive species with limited genetic variation often rapidly ac-quire diversification of phenotypes to adapt to changing envi-ronments and become successful in a process referred to as thegenetic paradox of invasion (2). How this process occurs is afundamental question in biology. C. rubella experienced an

extreme genetic bottleneck but has a wide distribution and istherefore a good model system to address this question. TEshave been suggested as an agent of rapid adaptation becausethey can produce abundant genetic variation in a limited time(5–9) and can affect phenotypic variation (11–13). For example,in peppered moths, a TE insertion within intron 1 of the cortexgene increases its expression level and induces melanization (13).However, the extent to which TEs contribute to rapid phenotypicdiversification and the mechanisms by which they influencephenotypes are unknown.Our results show that TEs play a crucial role in rapid phe-

notypic variation and suggest that they can potentially aid inadaptation to changing environments. Previous studies in othersystems have showed that TEs in 3′ UTRs can alter gene ex-pression via the regulation of methylation in Arabidopsis (42) andcan change the translation efficiency in rice (43). We have foundthat a TE insertion in the 3′ UTR affects mRNA stability. This

Fig. 4. TE insertion-induced alternative polyadenylation regulates CrFLC expression by modulating RNA stability. (A) Expression levels of unspliced CrFLC.Five biological replicates, each with three technical replicates, were analyzed. Expression data were normalized against CrTubulin and are shown as means ±SD. (B) Alternative polyadenylation at the 3′ UTR between 879 and MTE and a schematic model. The MTE 3′ UTR is shown in dark gray. Pink rectanglesindicate part of Helitron. The ARE (purple rectangles) and GRE (blue rectangles) sites and the polyadenylation signal (PAS, orange rectangles) domain areshown. Red line indicates variation. (C) RNA stability of CrFLC. CrATEXLA1 was used as a control, representing an unstable transcript. Three biologicalreplicates, each with three technical replicates, were analyzed. (D) Various deletion constructs with different 3′ UTRs and their effects on gene expression intransgenic lines. The MTE promoter is shown in light gray; GUS genes are in white; the same 3′ UTR region is shown in dark gray, and the latter half of theMTE 3′ UTR is shown in gray; the deleted regions are shown in dashed lines. The vertical red line indicates the mutation of the ARE motif, and the vertical blueline indicates the mutation of the GRE motif. The pooled number of independent transgenic lines is given next to the bar graphs. Expression data werenormalized against AtTubulin and are shown as means ± SD; *P < 0.05; **P < 0.01; NS, insignificant.

Niu et al. PNAS Latest Articles | 5 of 6

EVOLU

TION

Dow

nloa

ded

by g

uest

on

Mar

ch 2

1, 2

021

Page 6: Transposable elements drive rapid phenotypic variation in ... · 3/13/2019  · Transposable elements drive rapid phenotypic variation in Capsella rubella Xiao-Min Niua,b,1, Yong-Chao

mechanism also has the potential to affect gene expression andphenotypes. While our study hints at a correlation with potentialadaptive significance, future studies should explore whether suchphenotypic diversification ultimately confers an adaptive ad-vantage and whether TEs can drive adaptation.

Materials and MethodsMore detailed information on the materials and methods used in this study isprovided in SI Appendix, Supplementary Information Text. The C. rubellaaccessions were described in our previous study (14, 27, 32) or this study (SIAppendix, Table S4). Flowering time was assayed as days to flowering afterplanting. TEs in C. grandiflora (20) and C. rubella (20) were annotated usingRepeatMasker v.4.0.6 (www.repeatmasker.org). The polymorphic TEswere identified using TEPID (44) based on the raw reads of 27 resequencedgenomes of C. rubella accessions (18, 19) (SI Appendix, Table S1) with theMTE genome as the reference. A neighbor-joining tree of 28 C. rubella ac-cessions was constructed using PHYLIP (version 3.696) (45). Ecological nichemodeling was performed to measure niche similarity using ENMTools 1.3(46). Pooled leaves of early-flowering 879 × MTE (844 × MTE) F2 plants weresequenced to map the causal loci using SHOREmap (47). For the comple-mentation assay and the poly(A) signal mutation assay of CrFLC in A. thali-ana, the fragments with (or without) mutations were cloned into thepCAMBIA1300 vector using the KpnI and SalI sites and introduced into A.thaliana with FRIsf2 and flc-3 alleles in the Col-0 background. For the GUS-reporter assay, the promoter was amplified from MTE and cloned into the

pBI121 vector using the SbfI and XmaI sites, and the 3′ UTR was cloned usingthe SacI and EcoRI sites.

RNA-seq was performed on total RNA samples extracted from flower budtissues with three biological repeats for each accession (SI Appendix, TableS2). Expression levels of genes were estimated by fragments per kilobase ofexon per million reads mapped. RT-qPCR was performed to estimate RNAexpression levels. RNA stability was measured using RT-qPCR to quantifygene expression levels at different time points after incubation of 3-deoxyadenosine (cordycepin). DNA after bisulfite conversion was se-quenced and analyzed to measure DNA methylation levels with the onlinetool Kismeth (48). All primers are listed in SI Appendix, Table S8. All statis-tical analyses were performed in R (www.r-project.org/).

The data described in this paper have been deposited in the NCBI SequenceReadArchive under accession nos. PRJNA392709, PRJNA392711, and PRJNA511520and inGenBankunder accessionnos.MF422379–MF422413,MF422414–MF422449,MF422450–MF422525, MF142149–MF142156, and MG492014–MG492020.

ACKNOWLEDGMENTS. We thank Detlef Weigel, John Bowman, XiaofengCao, and Wolfgang Busch for valuable suggestions and discussions; Li Yangand Mao-Fu Li for assistance with the experiments; and especially theanonymous reviewers for their help in improving the manuscript. This studywas funded by National Natural Science Foundation of China Grant91731306 (to Y.-L.G.); Strategic Priority Research Program of the ChineseAcademy of Sciences Grant XDB27010305 (to Y.-L.G.); the InnovativeAcademy of Seed Design, Chinese Academy of Sciences (Y.-L.G.); ChineseAcademy of Sciences President’s International Fellowship for Visiting Scien-tists (S.B.); and Monash University Larkins Fellowship (to S.B.).

1. Scheffers BR, et al. (2016) The broad footprint of climate change from genes to bi-omes to people. Science 354:719.

2. Estoup A, et al. (2016) Is there a genetic paradox of biological invasion? Annu Rev EcolEvol Syst 47:51–72.

3. Kolbe JJ, et al. (2004) Genetic variation increases during biological invasion by a Cu-ban lizard. Nature 431:177–181.

4. Frankham R (2005) Resolving the genetic paradox in invasive species. Heredity (Edinb)94:385.

5. Stapley J, Santure AW, Dennis SR (2015) Transposable elements as agents of rapid adap-tation may explain the genetic paradox of invasive species. Mol Ecol 24:2241–2252.

6. Schrader L, et al. (2014) Transposable element islands facilitate adaptation to novelenvironments in an invasive species. Nat Commun 5:5495.

7. Casacuberta E, González J (2013) The impact of transposable elements in environ-mental adaptation. Mol Ecol 22:1503–1517.

8. Chuong EB, Elde NC, Feschotte C (2017) Regulatory activities of transposable ele-ments: From conflicts to benefits. Nat Rev Genet 18:71–86.

9. Barrón MG, Fiston-Lavier AS, Petrov DA, González J (2014) Population genomics oftransposable elements in Drosophila. Annu Rev Genet 48:561–581.

10. Li ZW, et al. (2018) Transposable elements contribute to the adaptation of Arabi-dopsis thaliana. Genome Biol Evol 10:2140–2150.

11. Studer A, Zhao Q, Ross-Ibarra J, Doebley J (2011) Identification of a functionaltransposon insertion in the maize domestication gene tb1. Nat Genet 43:1160–1163.

12. Martin A, et al. (2009) A transposon-induced epigenetic change leads to sex de-termination in melon. Nature 461:1135–1138.

13. Van’t Hof AE, et al. (2016) The industrial melanism mutation in British pepperedmoths is a transposable element. Nature 534:102–105.

14. Guo YL, et al. (2009) Recent speciation of Capsella rubella from Capsella grandiflora,associated with loss of self-incompatibility and an extreme bottleneck. Proc Natl AcadSci USA 106:5246–5251.

15. Foxe JP, et al. (2009) Recent speciation associated with the evolution of selfing inCapsella. Proc Natl Acad Sci USA 106:5241–5245.

16. Hurka H, Neuffer B (1997) Evolutionary processes in the genus Capsella (Brassicaceae).Plant Syst Evol 206:295–316.

17. Tang C, et al. (2007) The evolution of selfing in Arabidopsis thaliana. Science 317:1070–1072.18. Ågren JA, et al. (2014) Mating system shifts and transposable element evolution in

the plant genus Capsella. BMC Genomics 15:602.19. Wu Q, et al. (2017) Long-term balancing selection contributes to adaptation in Ara-

bidopsis and its relatives. Genome Biol 18:217.20. Slotte T, et al. (2013) The Capsella rubella genome and the genomic consequences of

rapid mating system evolution. Nat Genet 45:831–835.21. Huang da W, Sherman BT, Lempicki RA (2009) Systematic and integrative analysis of

large gene lists using DAVID bioinformatics resources. Nat Protoc 4:44–57.22. Hollister JD, et al. (2011) Transposable elements and small RNAs contribute to gene

expression divergence between Arabidopsis thaliana and Arabidopsis lyrata. ProcNatl Acad Sci USA 108:2322–2327.

23. Wei L, Cao X (2016) The effect of transposable elements on phenotypic variation:Insights from plants to humans. Sci China Life Sci 59:24–37.

24. Hurka H, Friesen N, German DA, Franzke A, Neuffer B (2012) ‘Missing link’ speciesCapsella orientalis and Capsella thracica elucidate evolution of model plant genusCapsella (Brassicaceae). Mol Ecol 21:1223–1238.

25. Weigel D (2012) Natural variation in Arabidopsis: From molecular genetics to eco-logical genomics. Plant Physiol 158:2–22.

26. Anderson JT, Inouye DW, McKinney AM, Colautti RI, Mitchell-Olds T (2012) Pheno-typic plasticity and adaptive evolution contribute to advancing flowering phenologyin response to climate change. Proc Biol Sci 279:3843–3852.

27. Yang L, et al. (2018) Parallel evolution of common allelic variants confers floweringdiversity in Capsella rubella. Plant Cell 30:1322–1336.

28. Kigel J, et al. (2011) Relationships between flowering time and rainfall gradientsacross Mediterranean-desert Transects. Isr J Ecol Evol 57:91–109.

29. Wolfe MD, Tonsor SJ (2014) Adaptation to spring heat and drought in northeasternSpanish Arabidopsis thaliana. New Phytol 201:323–334.

30. Shavrukov Y, et al. (2017) Early flowering as a drought escape mechanism in plants:How can it aid wheat production? Front Plant Sci 8:1950.

31. Exposito-Alonso M, et al. (2018) Genomic basis and evolutionary potential for ex-treme drought adaptation in Arabidopsis thaliana. Nat Ecol Evol 2:352–358.

32. Guo YL, Todesco M, Hagmann J, Das S, Weigel D (2012) Independent FLCmutations ascauses of flowering-time variation in Arabidopsis thaliana and Capsella rubella.Genetics 192:729–739.

33. Slotte T, Holm K, McIntyre LM, Lagercrantz U, Lascoux M (2007) Differential expres-sion of genes important for adaptation in Capsella bursa-pastoris (Brassicaceae). PlantPhysiol 145:160–173.

34. Lempe J, et al. (2005) Diversity of flowering responses in wild Arabidopsis thalianastrains. PLoS Genet 1:109–118.

35. Liu J, He Y, Amasino R, Chen X (2004) siRNAs targeting an intronic transposon in theregulation of natural flowering behavior in Arabidopsis. Genes Dev 18:2873–2878.

36. Csorba T, Questa JI, Sun Q, Dean C (2014) Antisense COOLAIR mediates the co-ordinated switching of chromatin states at FLC during vernalization. Proc Natl AcadSci USA 111:16160–16165.

37. Sheldon CC, Conn AB, Dennis ES, Peacock WJ (2002) Different regulatory regions arerequired for the vernalization-induced repression of FLOWERING LOCUS C and for theepigenetic maintenance of repression. Plant Cell 14:2527–2537.

38. Feschotte C (2008) Transposable elements and the evolution of regulatory networks.Nat Rev Genet 9:397–405.

39. Di Giammartino DC, Nishida K, Manley JL (2011) Mechanisms and consequences ofalternative polyadenylation. Mol Cell 43:853–866.

40. Golisz A, Sikorski PJ, Kruszka K, Kufel J (2013) Arabidopsis thaliana LSM proteinsfunction in mRNA splicing and degradation. Nucleic Acids Res 41:6232–6249.

41. Al-Ahmadi W, Al-Ghamdi M, Al-Haj L, Al-Saif M, Khabar KS (2009) Alternative poly-adenylation variants of the RNA binding protein, HuR: Abundance, role of AU-richelements and auto-regulation. Nucleic Acids Res 37:3612–3624.

42. Saze H, Kakutani T (2007) Heritable epigenetic mutation of a transposon-flanked Arabi-dopsis gene due to lack of the chromatin-remodeling factor DDM1. EMBO J 26:3641–3652.

43. Shen J, et al. (2017) Translational repression by a miniature inverted-repeat trans-posable element in the 3′ untranslated region. Nat Commun 8:14651.

44. Stuart T, et al. (2016) Population scale mapping of transposable element diversityreveals links to gene regulation and epigenomic variation. eLife 5:e20777.

45. Feisenstein J (1989) PHYLIP-phylogeny inference package (version 3.2). Cladistics 5:164–166.46. Warren DL, Glor RE, Turelli M (2010) ENMTools: A toolbox for comparative studies of

environmental niche models. Ecography 33:607–611.47. Schneeberger K, et al. (2009) SHOREmap: Simultaneous mapping and mutation

identification by deep sequencing. Nat Methods 6:550–551.48. Gruntman E, et al. (2008) Kismeth: Analyzer of plant methylation states through

bisulfite sequencing. BMC Bioinformatics 9:371.

6 of 6 | www.pnas.org/cgi/doi/10.1073/pnas.1811498116 Niu et al.

Dow

nloa

ded

by g

uest

on

Mar

ch 2

1, 2

021