12
The Plant Cell, Vol. 4, 839-849, August 1992 0 1992 American Society of Plant Physiologists Characterization of a Gene Encoding a DNA Binding Protein with Specificity for a Light-Responsive Element Philip M. Gilmartin,’ Johan Memelink,* Kazuyuki Hiratsuka, Steve A. K~Y,~ and Nam-Hai Chua4 Laboratory of Plant Molecular Biology, The Rockefeller University, 1230 York Avenue, New York, New York 10021-6399 The sequence element of box II (GTGTGGTTAATATG) is a regulatory component of a light-responsive element present within the upstream region of pea rbcS-3A. The nuclear protein GT-1 was defined previously as a DNA binding activity that interacts with box II. Here, we describe the isolation and characterization of cDNA sequences that encode a DNA binding protein with specificity for this element. The recombinant protein, tobacco GT-la, shows similar sequence re- quirements for DNA binding to nuclear GT-1, as assayed by its ability to interact with previously defined 2-bp scanning mutations of box II, and is shown to be immunologically related to nuclear GT-1. The predicted structure of the 43-kD protein derived from the cDNA sequence suggests the presence of a nove1 helix-helix-turn-helix (HHTH) motif. Compari- son between the predicted protein sequence encoded by the tobacco GT-la cDNA and that of another GT binding protein, rice GT-2, reveals strong amino acid conservation over the HHTH region; this motif appears to be involved in the interac- tion between the recombinant protein and box II. Genomic DNA gel blot analysis indicated the presence of a small gene family of related sequences within the tobacco nuclear genome. RNA gel blot analysis of tobacco mRNA using the iso- lated cDNA as a probe showed that transcripts are present in several tissues, including both light-grown and dark-adapted leaves. INTRODUCTION Light plays acritical role in regulating plant growth and devel- opment through the modulation of transcription levels of light-responsive genes. Following photoperception, by specific photoreceptors,the signal is subsequentlytransduced through as yet undefined pathways to mediate the transcriptional re- sponse (Kuhlemeier et al., 1987a; Silverthorneand Tobin, 1987; Jenkins, 1988; Nagy et al., 1988). Our studies on light-responsivetranscription have focused on the pea ribulose bisphosphate carboxylase small subunit 3A (rbcS-3A) gene. Light-responsiveexpression of rbcS-3A in tobacco is mediated by a complex array of cis-actingelements (Kuhlemeier et al., 1987b, 1988, 1989; Davis et al., 1990; Gilmartin et al., 1991). Severa1 nuclear protein factors that in- teract with these elements have been identified (see Gilmartin et al., 1990). One of these factors, GT-1, binds to six binding sites present in the upstream region of rbcS-3A (Green et al., 1987,1988a). Similar sequence motifs are also present within several other light-responsivegenes (Stockhaus et al., 1987; Manzara and Gruissem, 1988; Dean et al., 1989; Elliot et al., 1989; Kay et al., 1989; Dehesh et al., 1990; Schindler et al., Current address: Centre for Plant Biochemistry and Biotechnology, The University of Leeds, Leeds, LS2 9JT, U.K. * Current address: Department of Molecular Biology, Leiden Univer- sity, Lassenaarseweg 64, 2333AL, Leiden, The Netherlands. Current address: Center for Biological Timing, Department of Biol- ogy, University of Virginia, Charlottesville, VA 22901. To whom correspondence should be addressed. 1990; Kay, 1991; LaMon et al., 1991).The -166 deleted rbcS-34 promoter retains light responsiveness (Kuhlemeier et al., 1987); the light-responsiveelement (LRE) that mediatesthis response is located between positions -166 and -55 (Kuhlemeier et al., 1989; Davis et al., 1990). This LRE contains two binding sites for GT-1, box II (-151GTGTGGTTAATATG-138) and box III (-lZ5ATCATTTTCACT-ll4) (Green et al., 1987); both of these boxes are essential for the phytochrome-responsive activity conferred by this element (Kuhlemeier et al., 1988; Gilmartin and Chua, 1990a, 1990b). There is a strong correlation be- tween the affinity of GT-1 for box I1 and box III in vitro and the leve1of transcriptional activity conferred by these elements in vivo (Gilmartin and Chua, 199Oa). It has been shown that box 111 is a weak binding site for GT-1 as compared to box II. Replacement of box II by box III within the -166 promoter results in a 95% reduction in transcriptional activity as com- pared with wild-type levels (Gilmartin and Chua, 199Oa). lnteractions between box III and GT-1 may play only a minor role in the transcriptional activity conferred by the LRE. The critical role of box II as a regulatory sequence, as op- posed to a purely quantitative element, was established by gain-of-functionexperiments in which a synthetic tetramer of box II was fused to the -90 deleted cauliflower mosaic virus 35s promoter. In this context, the box II tetramer was able to confer light responsiveness upon the heterologous light- insensitive promoter (Lam and Chua, 1990). Further studies demonstrated a requirement for an interaction between the

Characterization of a Gene Encoding a DNA Binding … · Characterization of a Gene Encoding a DNA Binding Protein with Specificity for a Light-Responsive Element Philip M. Gilmartin,’

  • Upload
    lekhue

  • View
    220

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Characterization of a Gene Encoding a DNA Binding … · Characterization of a Gene Encoding a DNA Binding Protein with Specificity for a Light-Responsive Element Philip M. Gilmartin,’

The Plant Cell, Vol. 4, 839-849, August 1992 0 1992 American Society of Plant Physiologists

Characterization of a Gene Encoding a DNA Binding Protein with Specificity for a Light-Responsive Element

Philip M. Gilmartin,’ Johan Memelink,* Kazuyuki Hiratsuka, Steve A. K ~ Y , ~ and Nam-Hai Chua4

Laboratory of Plant Molecular Biology, The Rockefeller University, 1230 York Avenue, New York, New York 10021-6399

The sequence element of box II (GTGTGGTTAATATG) is a regulatory component of a light-responsive element present within the upstream region of pea rbcS-3A. The nuclear protein GT-1 was defined previously as a DNA binding activity that interacts with box II. Here, we describe the isolation and characterization of cDNA sequences that encode a DNA binding protein with specificity for this element. The recombinant protein, tobacco GT-la, shows similar sequence re- quirements for DNA binding to nuclear GT-1, as assayed by its ability to interact with previously defined 2-bp scanning mutations of box I I , and is shown to be immunologically related to nuclear GT-1. The predicted structure of the 43-kD protein derived from the cDNA sequence suggests the presence of a nove1 helix-helix-turn-helix (HHTH) motif. Compari- son between the predicted protein sequence encoded by the tobacco GT-la cDNA and that of another GT binding protein, rice GT-2, reveals strong amino acid conservation over the HHTH region; this motif appears to be involved in the interac- tion between the recombinant protein and box II. Genomic DNA gel blot analysis indicated the presence of a small gene family of related sequences within the tobacco nuclear genome. RNA gel blot analysis of tobacco mRNA using the iso- lated cDNA as a probe showed that transcripts are present in several tissues, including both light-grown and dark-adapted leaves.

INTRODUCTION

Light plays acritical role in regulating plant growth and devel- opment through the modulation of transcription levels of light-responsive genes. Following photoperception, by specific photoreceptors, the signal is subsequently transduced through as yet undefined pathways to mediate the transcriptional re- sponse (Kuhlemeier et al., 1987a; Silverthorne and Tobin, 1987; Jenkins, 1988; Nagy et al., 1988).

Our studies on light-responsive transcription have focused on the pea ribulose bisphosphate carboxylase small subunit 3A (rbcS-3A) gene. Light-responsive expression of rbcS-3A in tobacco is mediated by a complex array of cis-acting elements (Kuhlemeier et al., 1987b, 1988, 1989; Davis et al., 1990; Gilmartin et al., 1991). Severa1 nuclear protein factors that in- teract with these elements have been identified (see Gilmartin et al., 1990). One of these factors, GT-1, binds to six binding sites present in the upstream region of rbcS-3A (Green et al., 1987,1988a). Similar sequence motifs are also present within several other light-responsive genes (Stockhaus et al., 1987; Manzara and Gruissem, 1988; Dean et al., 1989; Elliot et al., 1989; Kay et al., 1989; Dehesh et al., 1990; Schindler et al.,

Current address: Centre for Plant Biochemistry and Biotechnology, The University of Leeds, Leeds, LS2 9JT, U.K. * Current address: Department of Molecular Biology, Leiden Univer- sity, Lassenaarseweg 64, 2333AL, Leiden, The Netherlands.

Current address: Center for Biological Timing, Department of Biol- ogy, University of Virginia, Charlottesville, VA 22901.

To whom correspondence should be addressed.

1990; Kay, 1991; LaMon et al., 1991). The -166 deleted rbcS-34 promoter retains light responsiveness (Kuhlemeier et al., 1987); the light-responsive element (LRE) that mediates this response is located between positions -166 and -55 (Kuhlemeier et al., 1989; Davis et al., 1990). This LRE contains two binding sites for GT-1, box II (-151GTGTGGTTAATATG-138) and box III (-lZ5ATCATTTTCACT-ll4) (Green et al., 1987); both of these boxes are essential for the phytochrome-responsive activity conferred by this element (Kuhlemeier et al., 1988; Gilmartin and Chua, 1990a, 1990b). There is a strong correlation be- tween the affinity of GT-1 for box I1 and box III in vitro and the leve1 of transcriptional activity conferred by these elements in vivo (Gilmartin and Chua, 199Oa). It has been shown that box 111 is a weak binding site for GT-1 as compared to box II. Replacement of box II by box III within the -166 promoter results in a 95% reduction in transcriptional activity as com- pared with wild-type levels (Gilmartin and Chua, 199Oa). lnteractions between box III and GT-1 may play only a minor role in the transcriptional activity conferred by the LRE.

The critical role of box II as a regulatory sequence, as op- posed to a purely quantitative element, was established by gain-of-function experiments in which a synthetic tetramer of box II was fused to the -90 deleted cauliflower mosaic virus 35s promoter. In this context, the box II tetramer was able to confer light responsiveness upon the heterologous light- insensitive promoter (Lam and Chua, 1990). Further studies demonstrated a requirement for an interaction between the

Page 2: Characterization of a Gene Encoding a DNA Binding … · Characterization of a Gene Encoding a DNA Binding Protein with Specificity for a Light-Responsive Element Philip M. Gilmartin,’

840 The Plant Cell

box II tetramer and an element present between positions -90 and -46 of the cauliflower mosaic virus 35s promoter, most likely as-7 (Davis et al., 1990). These studies illustrate that GT-1 binding is required but not necessarily sufficient for light- responsive transcription. In addition, they establish a regula- tory role for the GT-1 box II binding site in the transcriptional light response.

With the long-term aim of elucidating the signal transduc- tion pathway that links photoperception to the box Il-mediated transcriptional response of rbcS-3A outlined above, we have isolated cDNA sequences encoding a DNA binding protein with specificity for box II. Here, we describe the characteriza- tion of the cDNA sequences and the encoded protein.

RESULTS

lsolation of a cDNA Encoding a DNA Binding Protein with Specificity for rbcS-3A Box I 1

Definition of the critical role of rbcS-3A box II as a regulatory component of the LRE prompted us to attempt to isolate cDNA sequences that encode the nuclear proteins through which box Il-mediated light-responsive transcription is modulated. Based on the observation that GT-1 binding activity can be detected in nuclear extracts prepared from plants grown in both the light and the dark, we expected GT-1 transcripts to be pres- ent in RNA samples prepared from both light- and dark-grown tissue. Because light-inducible transcripts, for example, rbcS and chlorophyll alb binding (cab) transcripts, are reduced in abundance in dark-grown tissue, low-abundance mRNA se- quences present in both light-grown and dark-adapted plants would be relatively more highly represented in RNA prepared from dark-grown tissue.

Following a slightly modified version of the screening pro- tocols for the isolation of phage encoding DNA binding proteins (Singh et al., 1989; Katagiri et al., 1990), we screened 6 x 105 phages of an etiolated tobacco cDNA library constructed in h ZAP with a box II tetramer (Gilmartin et al., 1991). From this screen, we obtained a single recombinant phage that encoded a DNA binding protein with specificity for the box II tetramer. The 1.4-kb insert was excised from h ZAP, propagated in the plasmid pBluescript II SK+, and subjected to DNA sequence analysis; this sequence is referred to as clone A. This cDNA insert was used as a probe to screen 1 x 106 phages of a second cDNA expression library prepared from light-grown tobacco leaf mRNA (Stratagene). A recombinant phage was isolated from this screen that also contains a 1.4-kb insert; this sequence is referred to as clone 6. Comparison of the two cDNA sequences revealed that they are almost identical with the following exceptions. The second sequence, clone B, lacks 27 bp at the 5' end as compared to clone A but contains an additional 27 bp including a poly(A) tract at the 3'end. These differences can be accounted for by the extent of cDNA syn- thesis at the 5'end and by the presence of an EcoRl restriction

1

1 61 16 106

31 151 46 196 61 241 76 286 91 331

106 376 121 42 1 136 466

151 511 166 556 181 601 196 646 211 691 226 736 241 781 256 826

271 871 286 916 301 961 316 1006 331 1051 346 1096 361 1141 376 1186 1239 1298 1357 1416

GAATTCCCTGCAAAGGCCGAAGGCCAAAGTCCAAGCAAGTATTACCTATTTCTATCTTAC

Met Ser Asp Lys Leu Pro Thr Ser Ile Asn Leu Tyr Glu G I u Gln ATG TCT GAT AAA CTT CCA ACA TCT ATC AAC TTG TAC GAG GAG CAA G l u Ser Met Asp Gln M i s Ser Asn H i s Met Ile Ile G l u Val A l a GAA TCA ATG GAT CAA CAC AGT AAC CAC ATG ATC ATC GAA GTG GCC Thr Pro Asn Ala H i s Leu G l n Gln Pro G l n G l n Ile Leu Leu Pro ACC CCT AAT GCT CAT CTT CAA CAA CCC CAG CAA ATT CTC CTT CCC Gly Ile Ser Gly Gly Asp Thr Thr Ser Ser Gly Gly G l u Asp Asn GGA ATC AGT GGT GGT GAC ACC ACC AGC AGC GGT GGC GAG GAC AAC Asn Asn Asn Val LyS Leu Ala PrO LyS LyS Arg Ala G1u Thr Trp AAC AAC AAC GTA AAA TTA GCA CCC AAG AAG CGA GCA GAA ACA TGG Val Gln Glu Glu Thr Arg Ala Leu Ile Ser Leu Arq Arg Glu Leu GTT CAA GAA GAG ACA CGA GCG CTC ATT AGC CTC CGC AGA GAA CTC Asp Ser Leu Phe Asn Thr Ser Lys ser Asn Lys H i s Leu Trp Asp

t l t2

GAC TCA CTC TTC AAC ACT TCA AAA TCG AAC AAG CAC TTG TGG GAC

G l n Ile Ser Leu Lys Met Arg G1u Lys G l y Phe Asp Arq Ser Pro CAG ATC TCC TTG AAG ATG AGG GAA AAG GGG TTT GAT AGG TCC CCT Thr Met Cy5 Thr Asp Lys Trp Arg Asn Leu Leu Lys G1U Phe Lys ACC ATG TGT ACT GAT AAA TGG AGG AAC TTG TTA AAG GAG TTC AAA

Lys Ala Lys H i s Asn Gln Glu Pro Asn Gly Ser Ala Lys Met Ser AAG GCT AAA CAC AAT CAA GAA CCA AAT GGG TCT GCT AAG ATG TCT **(466,467) Tyr H i s Lys Glu Ile G l u GIu Ile Leu Lys Ala Arg Asn Lys Asn TAT CAT AAG GAG ATT GAG GAG ATT CTC AAG GCC AGA AAC AAA AAT Tyr Lys Asn Pro Thr Leu Lys Val Asp Thr Phe Met Gln Phe Ser TAT AAG AAC CCT ACG CTT AAA GTC GAT ACC TTT ATG CAG TTT TCT Gln Lys Gly Leu Asp Asp Thr Ser Ile Thr Phe Gly Pro Val Glu CAG AAG GGT CTT GAC GAT ACC AGT ATA ACA TTT GGA CCT GTC GAA Glu Asn Gly Arq Pro Thr Leu Asn Leu G1u Arg Gln Leu Asp H i s GAG AAT GGG AGG CCA ACT CTT AAC TTA GAA CGC CAG CTG GAT CAT Asp Gly H i s Pro Leu Ala Ile Thr Ala Ala Asp Ala Val Thr Ala GAT GGG CAT CCT CTT GCT ATA ACA GCT GCA GAT GCG GTC ACT GCA Ser Gly Ser Pro Trp Asn Trp Arg G l u Thr Pro Gly Asn Gly Glu AGT GGA TCA CCT TGG AAT TGG AGA GAG ACG CCT GGA AAT GGT GAG Gln Ser Asn Ser Ala Glu Gly Arg Val Ile Ser Val Lys Trp Gly CAA AGT AAC TCA GCT GAA GGT AGG GTC ATT TCA GTC AAG TGG GGT Asp Tyr Thr Lys Arg Ile Gly Ile Asp Gly Thr Ala Asp Ala Ile

*(349)

GAT TAC ACA AAA AGA ATT GGT ATC GAT GGG ACT GCA GAT GCC ATC * ( 8 5 5 )

Lys Gln Ala Ile Lys Ser Ala Phe Arg Leu Arg Thr Glu Arg Ala AAG CAA GCC ATT AAA TCT GCT TTT AGG TTG AGG ACA GAG CGT GCA Phe Trp Leu Glu Asp G l u Gln Asn Ile Val Arg Ala Leu Asp Arg TTT TGG TTA GAA GAT GAA CAG AAT ATT GTT CGA GCA CTT GAC AGG Asp Met Pro Leu Gly ser Tyr Ser Leu H i s Val Asp Glu Gly Leu GAT ATG CCA CTA GGG AGC TAC AGC TTG CAT GTT GAT GAA GGT CTG Thr Ile Lys Val Cys Met Tyr GIu G l u Ala Asp H i s Ser ser Val ACA ATA AAA GTT TGC ATG TAC GAG GAG GCA GAT CAC TCC TCA GTG H i s Thr GIu Asp Lys Thr Phe Tyr Thr Ala Asn Asp Phe Arg Asp CAC ACC GAG GAT AAA ACT TTC TAC ACT GCG AAT GAT TTT CGT GAC Phe Leu ser H i s Arg Gly Trp Thr Cys Leu Arg G l u Tyr Asn Gly TTC TTG TCC CAC AGA GGC TGG ACA TGT TTA AGA GAG TAC AAT GGG Tyr Arg H i s Val Asp Met Leu Asp Glu Leu Cys Pro Gly Ala Val TAT CGG CAT GTA GAT ATG TTG GAT GAG CTT TGC CCT GGT GCA GTC Tyr Arg Gly Val Asn * * * TAC CGA GGT GTC AAT TGA CGATAATAGGGGATACCCTGTCCATCCACGGCTCG TTGGGGTGACTGCATTTGCCATCTTTGTAAGTAACATGATGTCGGTGCCATCAGAGTGT GTTGATAATGCTGTTCTGGAAGTGTAAATGGTGCAGTGGTAACCTGATATAGTGATATA GTACTGAACAATTTAAATGTACAAGTCAATGAACATGAAACCTTGTTCATATTAGTGTT C C G G A T G G T G A A A T C T T G T T C T G W A T T T T C A T T A T T C

Figure 1. Nucleotide Sequence of Tobacco GT-la Derived from Two lndependent cDNA Clones and Partia1 Genomic Sequence Data.

The sequence is numbered from the first nucleotide of the EcoRl linker added following cDNA synthesis. The linker sequences at the 5' and 3 ends are shown in boldface type. The first nucleotides of clone A and clone B are indicated by t l and t2, respectively. Nucleotide differ- ences between the two cDNA sequences are identified by an asterisk (*). Nucleotides *349 and *855 are absent from cDNA cione B but present in clone A; nucleotides *466 and *467 are absent from clone A but present in clone 8. Sequences used as oligonucleotide primers for PCR amplification of the genomic sequence to confirm the sequence between nucleotides 262 and 567 are underlined. The interna1 EcoRl site that terminates clone A is shown in boldface type and is under- lined. The predicted amino acid sequence derived from the continuous open reading frame is shown starting with Met-1 and terminating at the stop codon following Asn-380 (**e). The GenBank accession num- ber for the nucleotide sequence of tobacco GT-la is M93436.

Page 3: Characterization of a Gene Encoding a DNA Binding … · Characterization of a Gene Encoding a DNA Binding Protein with Specificity for a Light-Responsive Element Philip M. Gilmartin,’

Characterization of Tobacco GT-la 841

ACIDIC BASIC

site at the 3'end that was probably cleaved in clone A during construction of the first library. The 5' and 3' untranslated regions of clones A and B are otherwise identical, suggesting that they are derived from the same gene.

One further difference between these two cDNA sequences lies within the open reading frame. In a comparison of clone A to clone E, 2 bp (positions 466 and 467) are missing from clone A, causing a frameshift mutation. In addition, compar- ing clone B with clone A, 1 bp at position 349 is missing from clone E, causing a frameshift mutation; clone B also has a single base pair deletion at position 855. The discrepancy be- tween these two cDNA sequences was resolved by polymerase chain reaction amplification of the corresponding region from tobacco genomic DNA. The amplified fragments were ligated into pBluescript II SK+, and several independent recombinant clones were subjected to sequence analysis. The genomic se- quence spanning this region contains the single base pair absent from clone 8, but present in clone A, as well as the 2 bp missing from clone A. This analysis resolved the differ- ences between the two sequences and demonstrated the presence of a continuous open reading frame within the gene. The sequence derived from the two cDNA clones and the genomic region spanning the frameshifts are shown in Figure 1. The frameshift mutations observed in both cDNA sequences are either a consequence of cDNA cloning artifacts or could reflect our observations that production of the re- combinant protein appears to be toxic to fscherichia coli. In

HELlX 1 HELlX 2 T HELlX 3 AClDlC DOMAlN

A 1 380

I I I I

DNA-BINDINGIDIMERIZATlON ACTlVATlON PUTATIVE PUTATIVE

DOMAlN DOMAlN

B GT-1 (75-138)

helix 1 helix 2 helix 3 = - - W V Q E E T R R L I S L R R E L D S L F N T S K S N K H L W D Q I S L K M R E K " t*t * * *.t ** t. *. * * ** f ,e* t

WPKTEVQRLIQLRMELDHRYQETGPKGPLWEEISSGMRRLGYNRSSKRCKEKWENINKYFKKVK -- helix 1 helix 2

GT-2 (87-150)

Figure 2. Predicted Secondary Structure of Tobacco GT-la.

(A) The acidic domains at the extreme 5' and 3'termini of the protein are indicated as is the basic domain that extends throughout the three indicated helices. The putative turn region between helix 2 and helix 3 is indicated by a T. The putative DNA binding/dimerization and acti- vation domains are shown. (6) Amino acid sequence comparisons between the conserved region of tobacco G?-la and rice GT-2. The standard one-letter code is used. The tobacco G%la sequence between amino acids 75 and 138 is shown; the GT-2 sequence (Dehesh et al., 1990) between amino acids 87 and 150 is shown. Helices 1, 2, and 3 of tobacco GT-la are indicated as are helices 1 and 2 of rice GT-2, as defined by Dehesh et al. (1990). Amino acid identities between tobacco GT-la and rice GT-2 are indi- cated by an asterisk.

agreement with these observations, we have been unable to either isolate an intact cDNA or reconstitute one from the two truncated sequences. It is possible that tobacco GT-la can recognize E. coli DNA sequences and therefore possibly in- terferes with normal bacterial functions.

The 1140-bp open reading frame encodes a protein of 380 amino acid residues with an M, of 43000 (Figure 1). Analysis of the predicted protein sequence reveals the presence of a basic region (Lys-65 to Lys-182) and acidic regions at the ex- treme amino (Met-1 to Asp-59) and carboxy (Glu-289 to Asn-380) termini. Two strong amphipathic helices (helix 1 and helix 3) and a weaker helix (helix 2) are predicted from the amino acid primary structure within the basic region. In addi- tion, a putative, proline residue-containing turn region is predicted between helix 2 and helix 3. The theoretical struc- ture of the protein including this predicted helix-helix-turn-helix (HHTH) motif is shown in Figure 2A.

Comparison of the amino acid sequence of tobacco GT-la with that of rice GT-2, which binds to a motif within the rice phytochrome promoter containing the sequence GGTAAT, re- veals a 48% amino acid sequence (27 of 64) identity over the predicted HHTH region (Figure 2B). The remainder of the two proteins do not show any sequence homology. The HHTH re- gion of tobacco GT-la does not have any striking amino acid homology to any other helix-containing DNA binding proteins such as helix-loop-helix (HLH) and homeodomain proteins.

Binding Site Specificity of Tobacco GT-la

To define the binding site specificity of tobacco GT-la, DNA- protein filter binding assays were performed with wild-type and mutant binding sites for the two nuclear proteins activation sequence factor 2 (ASF-2) (Lam and Chua, 1989) and GA Fac- tor 1 (GAF-1) (J. Memelink, I? M. Gilmartin, and N.-H. Chua, manuscript in preparation) in comparison with the GT-1 box II binding site and its mutant box Ilm derivative. Figure 3A shows these results as well as the sequence of the wild-type and mutant binding sites. These data demonstrate that tobacco GT-la shows specificity for box II but is unable to bind to its mutant derivative, box IP, or to any of the other four se- quences tested.

From our analyses of rbcS-M, as well as studies of GT-1 binding sites present within the upstream regions of the rice phytochrome A @hyA) gene (Kay et al., 1989; Kay, 1991) and similar sequences within other regulatory elements (Schindler et al., 1990; Lawton et al., 1991), it is apparent that nuclear GT-1 can interact with several distinct, yet closely related se- quence motifs. Dehesh et al. (1990) have demonstrated that recombinant rice GT-2, which binds to the ricephyA GT motif GCGGTAATT, interacts only weakly with both the rice phyA TAGGTTAAT motif and the similar rbcS-34 box II element. This ObSeNatiOn prompted us to assay the specificity of tobacco GT-la for different nuclear GT-1 binding sites.

These studies were performed by assaying phage contain- ing the tobacco GT-la cDNA clone A for the ability of the

Page 4: Characterization of a Gene Encoding a DNA Binding … · Characterization of a Gene Encoding a DNA Binding Protein with Specificity for a Light-Responsive Element Philip M. Gilmartin,’

842 The Plant Cell

encoded protein to bind to the previously defined rbcS-3A nu-clear GT-1 binding sites box II, its mutant derivative box IT,and box III. In addition, the rice GT-2 binding site was assayed.This last sequence was originally identified as a binding sitefor nuclear GT-1 (Kay et al., 1989) and subsequently shownalso to be the specific target sequence of the recombinant riceDMA binding protein GT-2 (Dehesh et al., 1990). The resultspresented in Figure 3B demonstrate that tobacco GT-1a showsdramatically reduced binding to box llm and to box III as wellas to the GT-2 binding site from the rice phytochrome promoter.The observation that tobacco GT-1a does not interact stronglywith rbcS-3A box III was confirmed by gel shift studies usingE. coli extracts containing tobacco GT-1a (Figure 3C). Thus,to summarize, nuclear GT-1 can interact with box II, box III (Fig-ure 3C; Green et al., 1987), and the rice GT-2 binding site (Kayet al., 1989; Kay, 1991); tobacco GT-1a shows strong bindingonly to box II.

Binding Specificity of Nuclear and Tobacco GT-1 a for Box II

Previous studies of 2-bp scanning mutations through nbcS-34box II (GTGTGGTTAATATG) demonstrated a critical core ofGGTTAA with some sequence requirements for the followingTA nucleotides (Green et al., 1988a). Having demonstrated thattobacco GT-1a is a box ll-specific binding protein, we wishedto define the critical nucleotides within box II required for bind-ing. Figure 4 shows the results obtained using the wild-typeand seven mutant sequences described previously (Green etal., 1988a). As a comparison, the binding specificity of nuclearGT-1 for the eight probes was assayed by gel shift studies (Fig-ure 4A). As shown previously (Green et al., 1988a), these datareveal the 6-bp GGTTAA core region to be critical for bindingof nuclear GT-1.

To determine the binding specificity of tobacco GT-1a cloneA, a phage containing the cDNA sequence was screened forthe ability of the encoded protein to interact with each of thesame eight probe preparations used in Figure 4A. As shownin Figure 4B, it is clear that tobacco GT-1a shares the sameGGTTAA core binding requirements as nuclear GT-1. The spec-ificity of tobacco GT-1a was confirmed by gel shift studies, againusing the same eight probe preparations with E coli extractsprepared from cells expressing tobacco GT-1a. Figure 4Cshows that tobacco GT-1a shares the same GGTTAA core bind-ing site as nuclear GT-1 when assayed by gel shift analysis.In combination, these data demonstrate the similarity of thecore binding site requirements for tobacco GT-1a and nuclearGT-1. Some differences are seen, however, between the bind-ing specificity of nuclear GT-1 and tobacco GT-1a, and theseare discussed below.

Nuclear GT-1 and Tobacco GT-1 a AreAntigenically Related

Having demonstrated that the isolated cDNA encodes a pro-tein with similar binding specificity to nuclear GT-1, we wanted

galm box II ga 1 m box II

box llm ga 1 I- -

tobacco GT-1 a

box llm

non-specificDMA binding protein

box box llm

box III GT-2 site box III GT-2 site

tobacco GT-1 a non-specificDMA binding protein

NGT-1 R GT-1 a

bound

free

Figure 3. Sequence Specificity for Binding of Tobacco GT-1a.

(A) Filter binding assay with phage carrying GT-1a clone A in compar-ison with a nonspecific DMA binding protein. The probes used areas follows: tetramers of box II, GTGTGGTTAATATG; box llm,GTGTCCTTAATATG; as-2, GTGGATTGATGTGATATCACC; as-2m,GTGGATTCATGTAATATCACC; ga-1, TATGATAAGGCTAG, ga-1m,TATCATAAGACTAG.(B) Filter binding assay with clone A and nonspecific DMA bindingprotein B9 using tetramers of box II, GTGTGGTTAATATG; box llm,GTGTCCTTAATATG; box III, ACTTTATCATTTTCACTATCT; ligatedmonomers of the rice GT-2 site, TTGGCGGTAATTAAC. The GT-2 sitepanel of the nonspecific DNA binding assay represents a lower ex-posure relative to all other panels, including the tobacco GT-1 a assay,due to the higher specific activity of this random prime labeled probe.(C) Gel-shift assay with tobacco nuclear GT-1 (N GT-1) and tobaccorecombinant GT-1a (R GT-1a), using tetramer probes of box II and boxIII. The free probe and bound complex are indicated.

to determine whether tobacco GT-1a and nuclear GT-1 are an-tigenically related. Oligopeptides were synthesized based onthe predicted amino acid sequence of tobacco GT-1a. TheseOligopeptides—Pep-1, Lys-65to He-84; Pep-2, Lys-98toAsp-117;

Page 5: Characterization of a Gene Encoding a DNA Binding … · Characterization of a Gene Encoding a DNA Binding Protein with Specificity for a Light-Responsive Element Philip M. Gilmartin,’

Characterization of Tobacco GT-1a 843

probe: WT GT GT GG TT AA TA TG

bound

free

TT GG

tobacco GT-1 a non-specificDNA binding protein

Pep-3, Lys-259 to Phe-278 (Figure 1)—were synthesized withtwo lysine residues at both the amino and carboxy termini tofacilitate subsequent conjugation to BSA carrier protein. Anti-bodies were raised to the conjugated oligopeptides in rabbits.The specificity of the resulting antisera was confirmed by pro-tein gel blotting. The antisera prepared following immunizationwith Pep-1, Pep-2, and Pep-3 contained antibodies with speci-ficity for these synthetic peptides. These antisera were assayedfor their ability to interact with nuclear GT-1 in gel shift studies.Figure 5 shows that the antibodies raised against Pep-1 cross-react with nuclear GT-1 in such an assay. Incubation of 2 \iLof antisera raised against Pep-1 in a gel shift reaction with thetetramer of box II as a probe and tobacco nuclear extractresulted in a super shift of the retarded fragment (lane 5) ascompared to that seen when either no antisera (lanes 2 and3) or preimmune sera (lane 4) were added. Antisera raised toPep-2 and Pep-3 showed weak interactions with the nuclearprotein (data not shown). Data presented in Figure 5 demon-strate that the protein encoded by the isolated cDNA sequencesis antigenically related to nuclear GT-1.

Genomic Organization of Tobacco GT-1 a Genes

The demonstration that tobacco GT-1a does not interact stronglywith other GT motifs and the identification of the related riceGT-2 protein (Dehesh et al., 1990) prompted us to examine the

probe: WT GT GT GG TT AA TA TG

1 2 3 4 5

boundnon-specific

free

Figure 4. Sequence Specificity for Binding of Nuclear GT-1 andTobacco GT-1 a to Box II.

(A) Gel shift study with nuclear GT-1 using tetramer oligonucleotidesof box II and its mutant derivatives. WT, GTGTGGTTAATATG; GT,CAGTGGTTAATATG; GT, GTCCGGTTAATATG; GG, GTGTCCTTAATATG;TT, GTGTGGGGAATATG; AA, GTGTGGTTCCTATG; TA, GTGTGGTTA-AGCTG; TG, GTGTGGTTAATAGC. The free probe and bound complexare indicated.(B) Filter binding assay using tobacco GT-1a clone A in comparisonwith a nonspecific DNA binding protein with the same eight box II deriva-tive probes as above.(C) Gel shift assay using tobacco GT-1a prepared from E. coli infectedwith phage carrying GT-1a clone A. The probes are as described above.The free probe and bound complex are indicated as are nonspecificcomplexes due presumably to E. coli proteins.

super shiftbound

free

Figure 5. Antibodies Raised to an Oligopeptide from Tobacco GT-1aInteract with Nuclear GT-1.

Radiolabeled box II tetramer was used as a gel shift probe with tobaccoleaf nuclear extract (0.25 ng/nL). l-ane 1, probe alone; lanes 2,4, and5, 0.5 ng nuclear extract; lane 3, 1 ng nuclear extract. Either 2 jiL ofundiluted preimmune sera (lane 4) or 2 nL of undiluted serum fromPep-1 (lane 5) was added to the incubations. Free probe, bound com-plex, and super shift are indicated.

Page 6: Characterization of a Gene Encoding a DNA Binding … · Characterization of a Gene Encoding a DNA Binding Protein with Specificity for a Light-Responsive Element Philip M. Gilmartin,’

844 The Plant Cell

organization of sequences related to GT-1a within the tobaccogenome. Figure 6 shows these results. DNA gel blot analysisof tobacco nuclear DNA revealed the presence of a small num-ber of restriction fragments that show homology to the cDNAsequence. These data suggest the presence of more than oneGT-1a-related gene within the tobacco genome. It is possiblethat these sequences could indicate the presence of a tobaccoequivalent of rice GT-2 as well as other GT binding proteins.On the other hand, it is also possible that they could be ac-counted for in part by the amphidiploid nature of tobacco.

Domain Definition of Tobacco GT-1a

From comparisons between tobacco GT-1a and rice GT-2 (Fig-ure 2B), it is apparent that these two DNA binding proteinsshare homology over the HHTH region. The similarity betweenthe core binding sites of tobacco GT-1a and GT-2, GGTTAAand GGTAAT, respectively, suggests the possibility that theconserved regions of tobacco GT-1a and GT-2 play a rolein interacting with the related target DNA elements. Thispossibility is consistent with the DNA binding function of helix-containing structures of other DNA binding proteins (Desplanet a!., 1988; Murre et al., 1989).

1 2 3 423kb9.4

6.6

4.4

I

0.6

Figure 6. DNA Gel Blot Analysis of Tobacco Genomic DNATobacco genomic DNA (10 ng) digested with EcoR1, BamHI, Hindlll,and Bglll were run in lanes 1, 2, 3, and 4, respectively, and probedwith radiolabeled cDNA clone A. The positions of the X Hindlll digestedmarkers are indicated.

To localize the DNA binding domain of tobacco GT-1a, wecompared the ability of the truncated polypeptides encodedby clones A and B to bind to box II. The longest possible trans-lation product predicted from the cDNA sequences is shownin both Figures 1 and 7A. The protein sequence shown startswith the first methionine of the open reading frame, which ispresumably the translation start site in the plant. However, bothcDNA sequences are in the same reading frame within thepolylinker of X ZAP, yet out of frame with the (5-galactosidasecoding region. The recombinant protein from E. coli is there-fore likely not produced as a fusion protein but probably resultsfrom internal initiation of translation from Met-1, Met-18, orMet-25. The most likely initiation point in E. coli is Met-18 be-cause of the presence of a Shine-Delgarno (AGGAG) sequencedirectly 5' to this ATG.

The initial GT-1a cDNA, clone A, was isolated by the abilityof the encoded protein to bind box II. The encoded polypep-tide must therefore necessarily contain the DNA bindingdomain of tobacco GT-1a. The second cDNA sequence, cloneB, was isolated by DNA sequence homology to clone A. Com-parison of the two cDNA sequences, assuming translation fromMet-1, reveals that the frameshift mutation in clone A resultsin a predicted polypeptide of 136 amino acids. The frameshiftmutation in clone B results in a predicted polypeptide of only108 amino acids, 27 amino acids shorter than that encodedby clone A (Figure 7A). By comparison of the two sequences,it is apparent that the shorter protein encoded by clone B lacksthe second and third helices of the HHTH region (Figure 7B).The ability of the polypeptide encoded by clone B to bind tobox II was assayed by both DNA-protein screening of a phagecontaining this cDNA (Figure 7C) and gel shift analysis ofE. coli extracts prepared from cells expressing tobacco GT-1a(Figure 7D). It is of interest to note that the truncated polypep-tide gives rise to a retarded band of a similar mobility to nuclearGT-1. From these data, it is apparent that polypeptide B thatlacks helix 2 and helix 3 is unable to bind box II. This is instriking contrast to polypeptide A that contains all threepredicted helices. These observations therefore suggest thatthe second and third helices of the HHTH motif are involvedin the interaction between tobacco GT-1 a and box II.

Expression of Genes Encoding Tobacco GT-1 a

Previous studies on both tobacco and pea nuclear GT-1 demon-strated that GT-1 activity is present in extracts prepared fromboth light-grown and dark-adapted plants. We wished, there-fore, to analyze the expression pattern of the gene from whichthe tobacco GT-1a cDNA was derived. From our screens ofthe two cDNA libraries, one derived from RNA isolated frometiolated tobacco seedlings and the other derived from RNAisolated from light-grown tobacco leaves, it is clear that tobaccoGT-1a mRNA is present in both populations. These observa-tions were confirmed by RNA gel blot analyses of poly(A)+

RNA isolated from various tissue samples. These data arepresented in Figure 8. GT-1a transcripts are present in both

Page 7: Characterization of a Gene Encoding a DNA Binding … · Characterization of a Gene Encoding a DNA Binding Protein with Specificity for a Light-Responsive Element Philip M. Gilmartin,’

Characterization of Tobacco GT-1a 845

Aa

B1

2121

4141

6161

8181

101101

121

M S D K L P T S I N L Y E E Q E S M D QM S D K L P T S I N L Y E E Q E S M D Q

H S N H M I I E V A T P N A H L Q Q P QH S N H M I I E V A T P N A H L Q Q P Q

Q I L L P G I S G G D T T S S G G E D NQ I L L P G I S G G D T T S S G G E D N

N N N V K L A P K K R A E T W V Q E E TN N N V K L A P K K R A E T W V Q E E T

R A L I S L R R E L D S L F N T S K S NR A L I S L R R E L D S L F N T Q N R T

K H L H D Q I S L K M R E K G F D R S PS T C O T R S P * 1 0 8

T M C T D K W R N L L K E F K G * 136

136| ACIDIC[ BASIC | HELIX 111 HELIX 21 T | HELIX 3 |

108

ACIDIC BASIC HELIX 1

clone A

clone B

DISCUSSION

Tobacco Nuclear GT-1 and Recombinant GT-1aAre Related

With the aim of dissecting the signal transduction pathway thatlinks photoperception by phytochrome to transcriptional acti-vation of pea rbcS-3A, we isolated cDNA sequences thatencode a DMA binding protein with specificity for the rbcS-3AGT-1 box II binding site. The protein encoded by the isolatedcDNA sequences is related to nuclear GT-1. This is demon-strated by two lines of evidence. First, antibodies raised tosynthetic peptides derived from the predicted amino acid se-quence of tobacco GT-1a can interact with the nuclear protein.Second, both nuclear and tobacco GT-1a have similar DNAbinding site requirements for the box II sequence element. Itis also evident that tobacco GT-1a shares some sequencehomology to rice GT-2 (Dehesh et al., 1990); however, the rela-tionship between tobacco GT-1a and the purified DNA bindingactivity of the nuclear silencer binding factor 1 (SBF-1), whichalso requires a GGTTAA core consensus (Harrison et al., 1991),remains to be determined.

clone A clone B A B N

bound

free

Figure 7. DNA Binding Domain Definition of Tobacco GT-1a.(A) Comparison between the predicted amino acid sequences derivedfrom the two independent GT-1a cDNA clones. Clone A is shown aboveclone B. The polypeptide encoded by cDNA clone A is 136 amino acidresidues; the polypeptide encoded by clone B is 108 amino acidresidues. The amino acids present within the polypeptides as a con-sequence of the frameshift mutations in clone A and B are indicatedin bold. The predicted helix 1, 2, and 3 regions are underlined; helix2 and helix 3 are absent from the polypeptide encoded by clone B.(B) Predicted structure of GT-1a clone A and clone B. The predictedacidic, basic, helix, and turn (T) regions are indicated.(C) Filter binding assay with phage carrying GT-1a clone A and GT-1aclone B with a tetramer of box II (GTGTGGTTAATATG) as probe.(D) Gel shift analysis of GT-1a clone A (A) and GT-1a clone B (B) ascompared with nuclear GT-1 (N) using a tetramer of box II as probe.The free probe and bound complex are indicated.

light-grown (lane 1) and dark-adapted (lane 2) leaf tissue. Inaddition, stem (lane 3), root (lane 4), and etiolated seedling(lane 5) tissues also contain GT-1a mRNA. The expression levelof the gene encoding the GT-1a mRNA sequence is low, re-quiring the use of poly(A)+ RNA and precluding quantitativecomparisons from these studies.

Binding Specificity of Tobacco GT-1 a

It is clear from the binding specificity data that tobacco GT-1abinds specifically to box II (GTGTGGTTAATATG) with sequencerequirements similar to those of nuclear GT-1. A core of GGTTAAwas previously defined as critical for binding of pea and tobacconuclear GT-1 to box II in vitro (Green et al., 1988a) and tran-scriptional activity conferred by this element in transgenictobacco plants (Kuhlemeier et al., 1988; Sarokin and Chua,1992). Tobacco GT-1a does, however, show some differencesin its binding specificity from nuclear GT-1. Nuclear GT-1 showssome requirement for the TA dinucleotide following the 6-bpcore sequence (Figure 4; Green et al., 1988a). However, mu-tation of this TA dinucleotide does not appear to affect bindingof tobacco GT-1a, as demonstrated by both filter binding as-says and gel shift studies. Mutation of the first GT dinucleotideof box II results in reduced binding of tobacco GT-1a. This same

1 2 3 4 5*mm m m

Figure 8. RNA Gel Blot Analysis of Tobacco GT-1a mRNA.Tobacco poly(A)+ RNA (3 ng) from light-grown leaf tissue (lane 1),3-day dark-adapted leaf tissue (lane 2), stem tissue (lane 3), root tis-sue (lane 4), and etiolated seedlings (lane 5) was probed with theradiolabeled GT-1a cDNA clone A.

Page 8: Characterization of a Gene Encoding a DNA Binding … · Characterization of a Gene Encoding a DNA Binding Protein with Specificity for a Light-Responsive Element Philip M. Gilmartin,’

846 The Plant Cell

mutation does not dramatically affect binding of nuclear GT-1 (Green et al., 1988a); however, a different two-base substitu- tion of this GT dinucleotide does affect binding of nuclear GT-1 (Green et al., 1988a), demonstrating a requirement for these nucleotides under certain conditions.

The inability of tobacco GT-la to interact with box 111 (ACTT- TATCATTTTCACTATCT) or with the rice phyA 3' GT motif (TTGGCGGTAATTAAC) is in contrast to the broader binding specificity of nuclear GT-1, which interacts with all three se- quences (Green et al., 1987; Kay et al., 1989). The tobacco protein GT-la has a subset of binding activities shown by tobacco nuclear GT-1. Three possible explanations for these differences are considered. First, the recombinant protein is produced as a truncated polypeptide and is composed of only the amino terminal 136 amino acids of the full-length protein, which may affect the specificity as determined by sequences flanking the core binding site. Efforts to assay the binding spec- ificity of the full-length polypeptide have proven unsuccessful because of our inability to recover full-length GT-la cDNA se- quences. Second, the €. coli-produced protein may not be faithfully modified as compared with nuclear GT-1. Third, it is possible that, whereas tobacco GT-la alone can bind to box II, additional proteins are required to stabilize such interac- tions with box Il-related sequences such as box 111 and the rice phyA 3' GT motif. The presence of such proteins in nu- clear extracts and their absence in €. coliextracts containing only tobacco GT-la could account for the more stringent se- quence specificity of the recombinant protein compared with nuclear GT-1. It is clear, however, that both the recombinant and nuclear proteins share a requirement for the 6-bp core sequence of box II. Earlier studies demonstrated that box 111 is a much weaker binding site for nuclear GT-1 as compared with box II (Gilmartin et al., 1990a). The difference in affinities of nuclear GT-1 for these two sequences may be heightened when the recombinant protein, as opposed to the nuclear pro- tein, is used in such studies.

A Family of GT Binding Proteins

It is as yet unclear whether tobacco GT-la can interact with other box Il-like elements present upstream of rbcS-3A (Green et al., 1987,1988a) and other light-responsive genes (Stockhaus et al., 1987; Manzara and Gruissem, 1988; Dean et al., 1989; Elliot et al., 1989; Schindler et al., 1990; Lawton et al., 1991) or whether additional GT binding proteins exist. The charac- terization of rice GT-2 (Dehesh et al., 1990) and the purification of SBF-1 (Harrison et al., 1991) suggest that GT binding pro- teins probably comprise a small family of related factors of which tobacco GT-la is one.

The observation that tobacco GT-ladoes not bind to the rice phyA GT-2 binding site is complementary to the observations of Dehesh et al. (1990). These authors showed that recom- binant rice GT-2 binds to the 3' rice GT motif (GGCGGTAATT) with a relative affinity two orders of magnitude higher than it

does to the pea rbcS-3A box II element (GTGTGGTTAATG). In addition, they note that binding of recombinant GT-2 to the rice phyA 5'GT motif (TAGGTTAATTA) is severely reduced. Our com- bined findings demonstrate reciprocal binding specificities for tobacco GT-la and rice GT-2 with reference to the binding sites containing the core motifs GGTTAA and GGTAAT, respectively.

Structure of Tobacco GT-la

The predicted structure of tobacco GT-la, derived from cDNA sequences and a polymerase chain reaction-amplified genom- ic DNA fragment, indicates the presence of an HHTH region. Sequence comparison between tobacco GT-la and rice GT-2 shows that this region is highly conserved between the two proteins. Comparisons between the ability of clone A and clone B to interact with box II suggest a role for this region in such an interaction. Similar helical structures are present within HLH and homeodomain DNA binding proteins (cf. Johnson and McKnight, 1989) and have been shown to be involved in DNA binding (Desplan et al., 1988; Murre et al., 1989). However, the absence of any striking amino acid homology between tobacco GT-la and such proteins suggests that tobacco GT-la and rice GT-2 may contain a nove1 DNA binding motif that shares predicted structures similar to those of HLH and homeo- domain proteins.

We have used the designation HHTH as opposed to HLH for the following reasons. First, sequence structure predictions for this region of tobacco GT-la suggest the presence of three helices as opposed to two. The second and third helices are separated by a short sequence that contains a potentially turn- inducing proline residue. Second, the term HHTH discriminates between the predicted structure of tobacco GT-la and other previously described helix-containing proteins. The predicted structure of tobacco GT-la is of interest in that few plant DNA binding proteins have been isolated from cDNA expression libraries that do not contain a basic leucine zipper (bZIP) mo- tif (Katagiri and Chua, 1992).

Comparisons of the DNA binding abilities of the proteins encoded by the two independent cDNA clones for tobacco GT-la show that the second and third helices are required in the DNA binding domain of the recombinant protein. Because this region is highly conserved between tobacco GT-la and rice GT-2, it is likely that this motif may also play a similar role in binding of rice GT-2 to its target sequence. The similarities between the two proteins over this region may explain their affinities for such similar DNA sequence elements. The lack of homology between other regions of tobacco GT-la and rice GT-2 likely reflects their proposed reciprocal roles in mediat- ing positive regulation of pearbcS-3A in the light and ricephyA in the dark, respectively. By analogy with other DNA binding proteins, the acidic domains within tobacco GT-la may com- prise part of the activation domain of the protein. The availability of the cDNA sequences will enable this question to be addressed.

Page 9: Characterization of a Gene Encoding a DNA Binding … · Characterization of a Gene Encoding a DNA Binding Protein with Specificity for a Light-Responsive Element Philip M. Gilmartin,’

Characterization of Tobacco GT-la 847

A Model for the Role of GT-1

We have previously proposed three possible models to recon- cile the presence of nuclear GT-1 in the light and the dark with the ability of the GT-1 binding sites to activate transcription specifically in the light (Lam and Chua, 1990). Our observa- tions from RNA gel blot data that genes encoding tobacco GT-la show constitutive expression are consistent with the pres- ente of GT-1 binding activity in light- and dark-grown plants (Green et al., 1988). Our present findings are consistent with these models, which are presented in Figure 9. Two of the models predict the presence of an additional protein, either as a dark repressor (Figure 9, model 1) or as an activator in the light (Figure 9, model2). The third possibility is that GT-1 undergoes a photoreversible modification that leads to its ac- tivation specifically in the light. In addition, the presence of GT-Ia transcripts in stem and root tissue raises the question of how the box II tetramer confers activity specifically in chloroplast-containing leaf cells (Lam and Chua, 1990).

The isolation of cDNA sequences that encode a protein which binds specifically to the regulatory element box II and is antigenically related to nuclear GT-1 will enable further studies to address these questions and may provide access to the signal transduction chain that links photoperception to a transcriptional response.

were screened with the box II tetramer (4 x GTGTGGTTAATATG) as a probe using the procedure of Singh et al. (1989) and Katagiri et al. (1990) and modified as described previously (Gilmartin et al., 1991). The library was made from mRNA isolated from etiolated tobacco seed- lings by oligo(dT) priming. The cDNA was methylated with EcoRl methylase, and EcoRl linkers were added and ligated into h ZAP (Stratagene).

DNA-Protein Filter Hybridization Assays

These were performed essentially as described by Gilmartin et al. (1991). The recombinant phage was plated at adensity of approximately 200 pfu per 75-cm-diameter Petri dish. Following growth of the plaques on nitrocellulose, the filters were cut into sectors and each one treated identically with separate probes. Oligonucleotide probes for the bind- ing sites were excised from plasmids, gel purified, and end labeled with the Klenow fragment of DNA polymerase I with the following ex- ception: GT-2 binding site monomer oligonucleotides (10 ng each) were heated to 100°C in 20 pL of 1 x kinase buffer and allowed to cool to room temperature. ATP, polynucleotide kinase, and T4 DNA ligase (Stratagene) were added, and the reaction was incubated at room tem- perature overnight. The ligated oligonucleotide mixture was random prime-labeled using T7 DNA polymerase (Stratagene) and passed once over a pushcolumn (Stratagene).

Screening of Tobacco cDNA Library METHODS

lsolation of Sequence-Specific DNA Binding Proteins

Approximately 600,000 plaque-forming units (pfu) of phage (20,000 pfu per 150-cm-diameter Petri dish) of a tobacco seedling cDNA library

1 2 3

LIGHT

DARK

22 box II

,@, box II

p, box II

box II

223 box II

P, box II

Figure 9. Models for the Role of Gf-I in Light-responsive Transcription.

Model 1, GT-1 is bound to box II in the light and the dark; an accessory negative regulator (N) binds to GT-1 to block transcriptional activation in the dark. Model 2, GT-1 is bound to box 11 in the light and the dark; a positive accessory protein (P) binds to GT-1 in the light to activate transcription. Model3, GT-1 is bound to box I1 in the light and the dark and undergoes a photoreversible modification so that it is activated in response to light.

Following the growth and isopropyl P-thiogalactoside induction of ap- proximately 1 x 106 pfu of a h ZAP tobacco leaf library (Stratagene) as outlined above, plaques were lifted and UV cross-linked in a Stratalinker (Stratagene) onto Amersham HyBond N nylon mem- branes. Filters were prehybridized for 1 hr at 42OC in 50% formamide, 5 x SSPE (1 x SSPE is 0.15 M NaCI, 10 mM sodium phosphate, 1 mM EDTA, pH 7.4), 5 x SDS. The probe was prepared by random prime labeling the isolated cDNA insert with a T7 DNA polymerase kit (Stratagene). The probe was added to the prehybridization mix, and incubation at 42OC continued for 16 hr. Filters were washed twice for 15 min in 0.1 x SDS, 0.5 x SSC (1 x SSC is 0.15 M NaCI, 0.015 M sodium citrate) at 65OC and exposed to Kodak X-Omat film. Positive plaques were purified by subsequent rounds of screening.

Gel Shift Studies

Tobacco nuclear extracts were prepared and gel shift studies performed as described previously (Green et ai., 1988b). Escherichia coliextracts containing tobacco GT-la were prepared by infecting 100 pL of com- petent €. coli XL-1 blue cells (Hanahan, 1983) with 106 to 107 pfu of phage. Following 30 min on ice and a subsequent 5-min incubation at V C , 800 pL of SOC medium (Sambrook et ai., 1989) and 100 pL of 100 mM isopropyl P-thiogalactoside were added. The mixture was incubated at V C for 5 hr, and the cells were pelleted subsequently in a microcentrifuge for 2 min. The pellet was resuspended in 30 pL of nuclear extract buffer (Green et al., 1987) containing 5 mM DTT and 5 pglmL each antipain and leupeptin; the suspension was lysed by freezing and thawing.

Page 10: Characterization of a Gene Encoding a DNA Binding … · Characterization of a Gene Encoding a DNA Binding Protein with Specificity for a Light-Responsive Element Philip M. Gilmartin,’

848 The Plant Cell

Antlbody Production

One milliliter of synthetic peptide (5 mg/mL) was mixed with 1 mL of BSA (10 mg/mL). Two milliliters of 0.2% glutaraldehyde was added step- wise, and the mixture was incubated at room temperature for 1 hr. One milliliter of 1 M glycine in PBS was added to give a final concentration of 200 mM. The mixture was incubated with rocking at room tempera- ture for 1 hr. The peptide-protein conjugate was separated from free peptide by dialysis against PBS (Harlow and Lane, 1988). Antibodies were raised in New Zealand white rabbits against the peptide-protein conjugate (Harlow and Lane, 1988).

General Molecular Biological Techniques

RNA and DNA gel blot analyses as well as other standard molecular biology procedures were performed as described previously (Sambrook et al., 1989). DNA sequencing was carried out using a United States Biochemicals Sequenase kit according to the manufacturer’s instruc- tions. DNA and protein sequence data were processed using the DNASIS and PROSE packages from Hitachi (Brisbane, CA).

ACKNOWLEDGMENTS

We are most grateful to Dr. Maria Deak for providing the etiolated tobacco seedling cDNA library, Dr. Steve McKnight, Carnegie Insti- tute of Washington, for providing clone 69, Dr. Laura Sarokin for the box 111 probe, and Wen-Shu Lin, lnstitute of Molecular and Cellular Biology, Singapore, for the synthetic peptides. We also thank the mem- bers of the Laboratory of Plant Molecular Biology for helpful discussions and comments during this work. P.M.G., J.M., and K.H. weresupported by fellowships from the Winston Foundation, the North Atlantic Treaty Organization, and the Japanese Society for the Promotion of Science, respectively. This work was supported by National lnstitutes of Health Grant No. GM44640.

Received March 18, 1992; accepted May 11, 1992.

REFERENCES

Davis, M.C., Yong, M.-H., Gilmartin, EM., Goyvaerts, E., Kuhlemeier, C., Sarokin, L., and Chua, N.-H. (1990). Minimal sequence require- ments for the regulated expression of rbcS-3A from Pisum sativum in transgenic tobacco plants. Photochem. Photobiol. 52, 43-50.

Dean, C., Pichersky, E., and Dunsmuir, P. (1989). Structure, evolu- tion and regulation of rbcSgenes in higher plants. Annu. Rev. Plant Physiol. 40, 415-439.

Dehesh, K., Bruce, W.B., and Quall, P.H. (1990). A trans-acting fac- tor that binds to a GT-mdif in a phytochrome gene promoter. Science

Desplan, C. Theis, J., and OFarrell, P.H. (1988). The sequence spec- ificity of homeodomain-DNA interaction. Cell 54, 1081-1090.

Elliott, R.C., Pedersen, T.J., Fristensky, B., White, M.J., Dickey, L.F., and Thompson, W.F. (1989). Characterization of a single copy gene encoding ferredoxin I from pea. Plant Cell 1, 681-690.

250, 1397-1399.

Gilmartin, P.M., and Chua, N.-H. (1990a). Localization of a phyto- chrome responsive element within the upstream region of pea rbcS-3A. MOI. Cell Biol. 10, 5565-5568.

Gilmartin, P.M., and Chua, N.-H. (1990b). Spacing between GT-1 bind- ing sites within a light-responsive element is critical for transcriptional activity. Plant Cell 2, 447-455.

Gilmartin, P.M., Sarokin, L., Memelink, J., and Chua, N.-H. (1990). Molecular light switches for plant genes. Plant Cell 2, 369-378.

Gilmartin, P.M., Memelink, J., and Chua, N.-H. (1991). Dissection of the light-responsive elements of rbcS-34. In Phytochrome Prop- erties and Biological Action, NATO AS1 Series, Vol. 50, B. Thomas and C.B. Johnson, eds (Berlin: Springer-Verlag), pp. 141-156.

Green, P.J., Kay, S.A., and Chua, N.-H. (1987). Sequence-specific interactions of a pea nuclear factor with light-responsive elements upstream of the rbcS-3.4 gene. EMBO J. 6, 2543-2549.

Green, P.J., Yong, M.-H., Cuozzo, M., Kano-Murakami, Y., Silverstein, P., and Chua, N.-H. (1988a). Binding site requirements for pea nuclear protein factor GT-1 correlate with sequences required for light-dependent transcriptional activation of the rbcS-3A gene.

Green, P.J., Kay, S.A., Lam, E., and Chua, N.-H. (1988b). ln vitro DNA footprinting. In Plant Molecular Biology Manual, S. Gelvin and R. Schilperoort, eds (Dordrecht, The Netherlands: Kluwer Academic Publishers), pp. 1-20.

Hanahan, D. (1983). Studies on transformation of E. coli with plas- mids. J. MOI. Biol. 166, 557.

Harlow, E., and Lane, D. (1988). Antibodies: A Laboratory Manual. (Cold Spring Harbor, NY Cold Spring Harbor Press).

Harrison, M.J., Lawton, M.A., Lamb, C.J., and Dixon R.A. (1991). Characterization of a nuclear protein that binds to three elements within the silencer region of a bean chalcone synthase gene pro- moter. Proc. Natl. Acad. Sci. USA 88, 2515-2519.

Jenkins, G. (1988). Photoregulation of gene expression in plants. Pho- tochem. Photobiol. 48, 821-832.

Johnson, P.F., and McKnight, S.L. (1989). Eukaryotic transcriptional regulatory proteins. Annu. Rev. Biochem. 58, 799-839.

Katagiri, F., Lam, E., and Chua, N.-H. (1989). Two tobacco DNA binding proteins with homology to the nuclear factor CREB. Nature 340,

Kay, S.A. (1991). In vitro protein-DNA interactions in the rice phytochrome promoter. In Phytochrome Properties and Biological Action, NATO AS1 Series, Vol. 50,B. Thomas and C.B. Johnson, eds (Berlin: Springer-Verlag), pp. 129-140.

Kay, S.A., Keith, B., Shinozaki, K., Chye, M.-L., and Chua, N.-H. (1989). The rice phytochrome gene: Structure, autoregulated expres- sion, and binding of (3-1 to aconsetved site in the 5’upstream region. Plant Cell 1, 351-360.

Kuhlemeier, C., Green, P.J., and Chua, NrH. (1987a). Regulation of gene expression in higher plants. Annu. Rev. Plant Physiol. 38, 221-257.

Kuhlemeier, C., Fluhr, R., Green, P.J., and Chua, N.-H. (1987b). Se- quences in the pea rbcS4A gene have homology to constitutive mammalian enhancers but function as negative regulatory elements. Genes Dev. 1, 247-255.

Kuhlemeier, C., Cuozzo, M., Green, P., Goyvaerts, E., Ward, K., and Chua, N.-H. (1988). Localization and conditional redundancy of regulatory elements in rbcS-34, a pea gene encoding the small

EMBO J. 7, 4035-4044.

727-730.

Page 11: Characterization of a Gene Encoding a DNA Binding … · Characterization of a Gene Encoding a DNA Binding Protein with Specificity for a Light-Responsive Element Philip M. Gilmartin,’

Characterization of Tobacco GT-la 849

subunit of ribulose-bisphosphate carboxylase. Proc. Natl. Acad. Sci. USA 85, 4662-4666.

Kuhlemeier, C., Strittmatter, G., Ward, K., and Chua, N.-H. (1989). The pea rbcS-3A promoter mediates light responsiveness but not organ specificity. Plant Cell 1, 471-478.

Lam, E., and Chua, N.-H. (1989). ASF-2: A factor that binds to the cauliflower mosaic virus 35s promoter and a conserved GATA mo- tif in Cab promoters. Plant Cell 1, 1147-1156.

Lam, E., and Chua, NrH. (1990). GT-1 binding site confers light- responsive expression in transgenic tobacco. Science 248,471-474.

Lawton, M.A., Dean, S.M., Dron, M., Kooter, J.M., Kragh, K.M., Harrison, M.J., Yu, L., Tanguay, L., Dixon, R.A., and Lamb, C.J. (1991). Silencer region of a chalcone synthase promoter contains multiple binding sites for a factor, SBF1, closely related to GT-1. Plant MOI. Biol. 16, 235-249.

Manzara, T., and Gruissem, W. (1988). Organization and expression of the genes encoding ribulose-l,5-bisphosphate carboxylase in higher plants. Photosynth. Res. 16, 117-139.

Murre, C., McCaw, P.S., and Baltlmore, D. (1989). A new DNA bind- ing and dimerization motif in immunoglobulin enhancer binding, daughterless, MyoD and Myc proteins. Cell 56, 777-783.

Nagy, F., Kay, S.A., and Chua, N.-H. (1988). Gene regulation by phytochrome. Trends Genet. 4, 37-42.

Sambrook, J., Fritsch, E.F., and Maniatis, T. (1989). Molecular Clon- ing: A Laboratory Manual, 2nd edition. (Cold Spring Harbor, NY Cold Spring Harbor Laboratory Press).

Sarokin, L.P., and Chua, N.-H. (1992). Binding sites for two nove1 phos- phoproteins, 3AF5 and 3AF3, are required for rbcS-3l expression. Plant Cell 4, 473-483.

Sllverthorne, J., and Tobin, E. (1987). Phytochrome regulation of nu- clear gene expression. BioEssays 7, 18-23.

Singh, H., Clere, R.G., and Lebowitz, J.H. (1989). Molecular cloning of sequence-specific DNA binding proteins using recognition site probes. Biotechniques 7, 252-261.

Stockhaus, J., Eckes, F?, RochaSosa, M., Schell, J., and Willmitzer, L. (1987). Analysis of cis-acting sequences involved in the leaf-specific expression of a potato gene in transgenic plants. Proc. Natl. Acad. Sci. USA 84, 7943-7947.

Tobin, E.M., and Silverthorne, J. (1985). Light regulation of gene ex- pression in higher plants. Annu. Rev. Plant. Physiol. 36, 569-593.

Page 12: Characterization of a Gene Encoding a DNA Binding … · Characterization of a Gene Encoding a DNA Binding Protein with Specificity for a Light-Responsive Element Philip M. Gilmartin,’

DOI 10.1105/tpc.4.7.839 1992;4;839-849Plant Cell

P M Gilmartin, J Memelink, K Hiratsuka, S A Kay and N H Chuaelement.

Characterization of a gene encoding a DNA binding protein with specificity for a light-responsive

 This information is current as of September 12, 2018

 

Permissions X

https://www.copyright.com/ccc/openurl.do?sid=pd_hw1532298X&issn=1532298X&WT.mc_id=pd_hw1532298

eTOCs http://www.plantcell.org/cgi/alerts/ctmain

Sign up for eTOCs at:

CiteTrack Alerts http://www.plantcell.org/cgi/alerts/ctmain

Sign up for CiteTrack Alerts at:

Subscription Information http://www.aspb.org/publications/subscriptions.cfm

is available at:Plant Physiology and The Plant CellSubscription Information for

ADVANCING THE SCIENCE OF PLANT BIOLOGY © American Society of Plant Biologists