9
269 Gene, 47 (1986) 269-211 Elsevier GEN 01776 Cyclodextrin-glycosyltransferase from Klebsielh pneumoniue M5al: cloning, nucleotide sequence and expression (Recombinant DNA; precursor protein; secretion; homologies to a-amylases) Florian Binder”, Otto Huberb and August B&k”* n Lehrstuhl fi3r Mikrobiologie der Universittit Miinchen. Maria- Ward-Strasse la, D-8000 Miinchen 19, Tel. 089/l 77084, and b Consortium ftir Elektrochemische Industrie GrnbH, Zielstattstrasse 20. D-8000 Miinchen 70 (F.R.G.) Tel. 089178741 (Received June 12th, 1986) (Accepted September lst, 1986) SUMMARY The structural gene encoding cyclodextrin-glycosyltransferase of Klebsiella pneumoniae strain M5al was cloned; it is expressed both in Escheridzia coli and in K. pneumoniae and the gene product is secreted into the extracellular space. Determination of the nucleotide sequence revealed an open reading frame coding for a single polypeptide of 655 amino acid (aa) residues. The enzyme is synthesized as a precursor with an N-terminal signal peptide of 30 aa residues, which is proteolytically processed between two alanine residues during export. The primary structure of CGT bears homology with the sequences of amylases from both prokaryotic and eukaryotic origins. INTRODUCTION Some microorganisms able to hydrolyze starch synthesize and export CGT (EC 2.4.1.19). This enzyme catalyzes the degradation of starch into CD (for review see Szejtli, 1982). * To whom correspondence and reprint requests should be addressed. Abbreviations: aa, amino acid(s); bp, base pair(s); CAP, catabolite activator protein; CD, cyclodextrin(s); CGT, cyclo- dextrin-glycosyltransferase; cgr, cyclodextrin-glycosyltransferase gene; Cm, chloramphenicol; Ig, immunoglobulin; kb, kilobase or 1000 bp; LB, Luria broth; nt, nucleotide(s); ORF, open reading frame: PAGE, polyacrylamide-gel electrophoresis; pFp, parafluorophenylalanine; RBS, ribosome-binding site; SDS, sodium dodecyl sulfate; [ 1, designates plasmid-carrier state. Cyclodextrins (CD) are cyclic, non-reducing oligo- saccharides composed of six, seven or eight glucose units linked by a-1,Cbonds. According to the num- ber of their glucose units they are termed a-, j?-, and y-CD. They are able to form inclusion complexes with a number of organic and inorganic molecules, thereby changing the physical and chemical proper- ties of the included compounds (for review see Szejtli, 1982). This ability to form inclusion com- plexes is of potential industrial importance (Horikoshi and Teruhiko, 1982). The CGT produced by K.pneumoniae M5al (Bender, 1977) is one of the few proteins which are exported by Gram-negative bacteria in high amounts. The formation of the enzyme is subject to strict regulation, namely by induction and by catabolite repression (Bender, 1977). In order to investigate the molecular basis of expression and of 0378-l 119/86/$03.50 0 1986 Elsevier Science Publishers B.V. (Biomedical Division)

Cyclodextrin-glycosyltransferase from Klebsiella pneumoniae M5a1: cloning, nucleotide sequence and expression

  • Upload
    august

  • View
    215

  • Download
    1

Embed Size (px)

Citation preview

Page 1: Cyclodextrin-glycosyltransferase from Klebsiella pneumoniae M5a1: cloning, nucleotide sequence and expression

269 Gene, 47 (1986) 269-211

Elsevier

GEN 01776

Cyclodextrin-glycosyltransferase from Klebsielh pneumoniue M5al: cloning, nucleotide sequence and expression

(Recombinant DNA; precursor protein; secretion; homologies to a-amylases)

Florian Binder”, Otto Huberb and August B&k”*

n Lehrstuhl fi3r Mikrobiologie der Universittit Miinchen. Maria- Ward-Strasse la, D-8000 Miinchen 19, Tel. 089/l 77084, and b Consortium ftir Elektrochemische Industrie GrnbH, Zielstattstrasse 20. D-8000 Miinchen 70 (F.R.G.) Tel. 089178741

(Received June 12th, 1986)

(Accepted September lst, 1986)

SUMMARY

The structural gene encoding cyclodextrin-glycosyltransferase of Klebsiella pneumoniae strain M5al was cloned; it is expressed both in Escheridzia coli and in K. pneumoniae and the gene product is secreted into the extracellular space. Determination of the nucleotide sequence revealed an open reading frame coding for a single polypeptide of 655 amino acid (aa) residues. The enzyme is synthesized as a precursor with an N-terminal signal peptide of 30 aa residues, which is proteolytically processed between two alanine residues during export. The primary structure of CGT bears homology with the sequences of amylases from both prokaryotic and eukaryotic

origins.

INTRODUCTION

Some microorganisms able to hydrolyze starch synthesize and export CGT (EC 2.4.1.19). This enzyme catalyzes the degradation of starch into CD (for review see Szejtli, 1982).

* To whom correspondence and reprint requests should be

addressed.

Abbreviations: aa, amino acid(s); bp, base pair(s); CAP,

catabolite activator protein; CD, cyclodextrin(s); CGT, cyclo-

dextrin-glycosyltransferase; cgr, cyclodextrin-glycosyltransferase

gene; Cm, chloramphenicol; Ig, immunoglobulin; kb, kilobase

or 1000 bp; LB, Luria broth; nt, nucleotide(s); ORF, open

reading frame: PAGE, polyacrylamide-gel electrophoresis; pFp,

parafluorophenylalanine; RBS, ribosome-binding site; SDS,

sodium dodecyl sulfate; [ 1, designates plasmid-carrier state.

Cyclodextrins (CD) are cyclic, non-reducing oligo- saccharides composed of six, seven or eight glucose units linked by a-1,Cbonds. According to the num- ber of their glucose units they are termed a-, j?-, and y-CD. They are able to form inclusion complexes with a number of organic and inorganic molecules, thereby changing the physical and chemical proper- ties of the included compounds (for review see Szejtli, 1982). This ability to form inclusion com- plexes is of potential industrial importance (Horikoshi and Teruhiko, 1982).

The CGT produced by K.pneumoniae M5al (Bender, 1977) is one of the few proteins which are exported by Gram-negative bacteria in high amounts. The formation of the enzyme is subject to strict regulation, namely by induction and by catabolite repression (Bender, 1977). In order to investigate the molecular basis of expression and of

0378-l 119/86/$03.50 0 1986 Elsevier Science Publishers B.V. (Biomedical Division)

Page 2: Cyclodextrin-glycosyltransferase from Klebsiella pneumoniae M5a1: cloning, nucleotide sequence and expression

270

~~quenc~~ of N-terminal aa r&dues was per- formed by the method described hy ~~~~- Liebold and Kimura (1984),

MATERLUS AND METRODS

(P) Reagents

Page 3: Cyclodextrin-glycosyltransferase from Klebsiella pneumoniae M5a1: cloning, nucleotide sequence and expression

271

K. pneumoniae strains: (i) K. pneumoniae KAY2026,

which is naturally devoid of CGT activity, and (ii) K. pneumoniae RClO, a mutant of K. pneumoniae

M5a1, whose cgt structural gene was deleted by UV mutagenesis (our unpublished results). After trans- formation with the recombinant plasmid, termed pCM100, both Klebsiella strains as well as E. coli RR28 gained the ability to hydrolyze starch.

(b) Expression and excretion of CGT by E. coli

Immunoblotting experiments were carried out in order to analyze whether the starch-hydrolyzing activity conferred by pCM100 is due to the produc- tion of CGT. Fig. 1 gives the result: Transformation of E. coli RR28 by plasmid pCM100 confers to E. coli the capacity to synthesize and secrete a poly- peptide cross-reacting with anti-CGT antibodies. This polypeptide exhibits the same apparent M, (of 68 kDa) as that produced by K. pneumoniae M5al.

1 A 0

2 3 4 5 6 7 6 kDo

- 170 --974

-, r CI -

-55

-36

-20

Fig. 1. Immunoblotting analysis of cgf expression by K.pneumoniae and E. co/i. Cultures of K.pneumoniae M5a1, E. coli RR28 and E. coli RR28[pCMlOO] were grown in minimal medium containing 1% partially hydrolyzed starch (Noredux 150B) to a density of 10s cells/ml. 0.5 ml of the total culture (cells plus supernatant) and of the supernatant alone, respectively, were lyophilized; the freeze-dried material was taken up in 50 ~1 sample buffer (Laemmli, 1970) and subjected to immunoblotting analysis using the method of Howe and Hershey (198 1) with the modifications given by Schmid and Bock (1984) alter SDS-PAGE in 12.5 % gels (Laemmli, 1970). (Panel A) Cells plus supernatant. Lanes: 1, K. pneumoniae MSal; 2, E. coli RR28; 3, E. coli RR28[pCMlOO]; 4, purified CGT. (Panel B) Super- natant alone. Lanes: 5, E. coliRR28[pCMlOO]; 6, purified CGT; 7, K. pneumoniae M5al; 8, E. coli RR28.

In view of the fact that E. coli does not normally secrete proteins this finding may be evidence that it is an intrinsic structural property of the CGT which directs the export. It is interesting in this context that the acquisition of CGT secretion by E. coli provides the organism with the ability to hydrolyze starch but not with the capability to grow on this carbon source. The CD which accumulate in the medium (un- published results) apparently either cannot be taken up into the cells and/or they cannot be linearized in the cytoplasm.

(c) The nucleotide sequence of the cgt structural gene

Fig. 2 gives the restriction map of plasmid pCM100; the size of the PstI insert in vector pHE3 is 5.4 kb.

To confirm that the insert of the recombinant plasmid pCMlO0 was derived from K. pneumoniae DNA, a 2.55-kb EcoRI-EcoRV fragment of pCMlO0 was labelled with [ 32P]dATP by nick- translation and used as a probe for hybridization experiments with genomic DNA digested with &I. A hybridization signal was obtained with a 5.4-kb fragment of K. pneumoniae M5al DNA. PstI digests of DNAs from E. coli RR28, K.pneumoniae

KAY2026 and K. pneumoniae RClO did not exhibit any hybridization signals.

The PstI fragment was shortened by subcloning with appropriate restriction enzymes. The shortest fragment which still promoted the expression of active CGT was a 2.9-kb HpaI-EcoRV fragment (Fig. 2). Recloning of the 5.4 kb PstI-fragment in vector pHE3 with the opposite direction showed that the expression of the gene is independent of the orientation of this insert indicating that transcription starts from the homologous promoter. Plasmid pCM100 was used for the determination of the nucleotide sequence of the cgt gene. The sequencing strategy and restriction sites used for labellmg of fragments are depicted in Fig. 2. Within the sequenced segment of 2809 bp there is an ORF of 1965 bp (Fig. 3) corresponding to a protein of 655 aa residues.

The putative CGT protein contains a highly biased aa composition (Table I); for example, the per- centage of phenylalanine plus tyrosine is lo%, and that of serine plus threonine is almost 16%. Many of

Page 4: Cyclodextrin-glycosyltransferase from Klebsiella pneumoniae M5a1: cloning, nucleotide sequence and expression

212

, lkb I

PstI Hpal EcoRl Bgll Rll

L

I

Cyclodextrin-glycosyl- transferase

Fig. 2. Restriction map of the 5.4-kb insert of pCM 100 encoding the cgr gene. The expanded map gives that part for which the nucleotidl sequence was determined. Arrows indicate starting point and direction of sequence readings. (0) 5’ labelling; (a) 3’ labelling. Position corresponding to the C-terminus and N-terminus of precursor-CGT are also given (aa pos. 1 and 655, respectively).

TABLE I

Codon usage in gene cgt”

F TTT 26 F TTC 12 L TTA 10 L TTG 7

(5.79)” s S

(6.55) S S

TCT TCC TCA

TCG

8 (7.62) Y 3 Y

10 *

4 *

TAT TAC TAA

TAG

22 (4.27) C 6 C 1 *

0 W

TGT

TGC TGA

TGG

3 (0.46) 0 0

13 (1.98)

CCT ccc CCA CCG

(3.35) H H

Q Q

CAT CAC CAA CAG

9 (1.52) R 1 R

12 (3.20) R 9 R

CGT 12 (3.96) CGC 3 CGA 1 CGG 4

L CTT 7 L CTC 4 L CTA 8 L CTG 7

11

2 5 4

I ATT 23 (6.10) T ACT I ATC 7 T ACC I ATA 10 T ACA M ATG 11 (1.68) T ACG

20 (8.08) N AAT 8 N AAC

15 K AAA

10 K AAG

53 (10.21) s AGT 17 14 S AGC 8 20 (3.96) R AGA 4

6 R AGG 2

V GTT 19 (5.34) A GCT V GTC 3 A GCC V GTA 8 A GCA V GTG 5 A GCG

18 (6.71) D GAT

10 D GAC 9 E GAA I E GAG

39 (7.16) G GGT 26 (7.93) 8 G GGC 6

18 (3.96) G GGA 13 8 G GGG 7

a Numbers in parentheses give the percentage of occurrences of the respective amino acid. Other numerals specify the number c occurrences.

Page 5: Cyclodextrin-glycosyltransferase from Klebsiella pneumoniae M5a1: cloning, nucleotide sequence and expression

273

1

39

134

229

324

419

AMTATAC

514

566

656

730

802

Ikt L 6 Ar Am Arg Pha Phe hsn Thr Scr Ala Ala Ile Ala Ile Sar 11s Ala Leu Am Thr Phe *e C l ATG AiA AOR MC CGT TFTTTT MT ACC TCG GCT GCTATT CCC ATTTCG ATTGCA TTA AAT ACTTTT- T&r

SW blot Gin Thr Ile Ala Ala Glu Pro Glu Glu Thr 'K

r Leu As! Phe As LJ; @ZZ; mh ;g T& z K AGCATGCAGACGAlTGCTGCTGAACCAGAAGAA ACTT TCTT GA TTTC

t

Phe L=" A6! B AT Phe Ser Asp Gly Asp Pro Sar hsn hsn hla Gly Phe Aan Sor Ala Thr Tf$ ;;g .P& fis

TTC CFTGA CG TTCAGCGATGGA GATCCA AGT AAT AATGCA GGG TCT MTTCTGCA ACC T

i%%i %ti %T ACT GGK GGi GAbTC CGG GGG %=I% ATTMT dA CTACCC %K:&:%= i%% I Thr G1 Gl As Leu Arq Gly Lsu Ile Asn L 6 Leu Pro

Val Thr Ser Ile Trp Ile Thr Pro Pro 11s Asp Asn Val Asn Asn Thr Asp Ala Ala Gly Am Thr Gly Tyr GTTA~TCAA~n;GA~A~CCCCCAA~GAThATGn:MTMTACFGAT~CCTCCCMT~OCATAT

074

946

::: ii% 2; % i% :a %,P %y: !i? iii8 ATA CAT GAA CAT TTT Gd 11~ h6p Glu ills Phe Gl Es g ;A$ As! g L& g &

Thr Ssr Lau Met His Ser Pro Asp Tyr Asn Mat Lys Leu Val Leu Asp Tyr Ala Pro Asn llis 6ar Asn Ala ACT AGT !lTG ATG CAT AGT CCT GAT TAT AAT An; Ahh Cl’G GTT CTT GAT TAT CCC CCC MT CAT TiX AAT GCT

1018

1090

It62

1234

Ann Asp Glu Asn Glu Phe Gl MTGATGAAAATGAATTT GG4

Ala Leu Tyr Arg Asp Gly Val Phc Xle Thr Asp TK

r Pro Thr Asn Vsl Ala GCA CPA TAT CGT GhT GGT GTG TTT ATT ACT CAT T T CCT ACG AAT GH GCC

Ala Am Thr Gly Tr Tyr Ilis His Asn Gly Gly Vd Thr hsn Trp Aan Asp Phe Phc Cln Val L 6 Aan ills GCCAATACG GGC d TAT CATCACAATGGT GGG GTh ACG MCTGG AATGAT lTC TTCCAA G'lG AXG MTCAT

Am Leu Phe Asn Leu Sor As L&u Asn Gln Scr Asn Thr As Val T r Asn MT CFA TFC AAT CTA TCA GA e CTC MTCAA TCC hhT ACT GA

$j 1 ~rbeuLeuA8,G&~~ GTCT CCAG T C TTG TTG GA

Phe Trp Ile Asp Ala Gly Val Asp Ala Ile Arg Ile Asp Ala Ile Lys 1118 hat Asp Lys Ser Phe Lie Gln TTTTGG ATC GATGCTGGTGTG GATGCTATC AGG ATTGATGCC ATCMC CAT An; GACAAGTCTTl'T ATACAG

1306

1376 Phe Gly Ala Ser Ala Asn Thr Thr Thr Gly Val hsp Gly Ann Ala Xle Asp Tyr Ala Asn Thr Ser Gly Ser 'ITTGGTGCCAGTGCG MT ACTACAACAGGTGTTGATGGTMTGCTATCGATTACGCCAACACTTCCUXWA

1450

1522

1594

f666

Ala Leu Leu As F GCG TTG CTG GA

Phe Gly Phe Arg Asp Thr Leu Glu Arg Val Leu Val Gly hrg Ser Gly Asn Thr Rut L 6 TTT GGA TTC CGC GAT ACT TTA GhA AGA GTT TTG GTA GGA CGT AGC GCA MT ACA An; x

Thr Leu Am Ser r Leu Ile L 6 At Gin Thr Val Phe Thr Ser As ACG TTA MThGT %TCTG ATA & Ad CAA ACA GTC TTF ACC AGTGAf'%f%:k% gPke$i&

Asn His As P MC CATGA

Het Ala Arg fle Gl Thr Ala Leu Arg Ser Asn Ala Thr Thr Phe Gly Pro Gly Am Asn Glu ATGGCACGCATT & ACCGCTCTG CGTTCAAACGCCACTACT~GGTCCTGGA MTMTGM

ACCd63; Thr Gl GI Ser Gin Scr Glu Ala Phe Ala Gln Lys Arg Ile Asp L+su Gly Leu Val Ala Thr Uet Thr Vsl

AGTCAG AGTGAA GCTTTTGCTCAG AAA CGTATA GACCTCGGTCTGGTTGCGACA ATCACTGTA

1736 CG! Ar Gly Ile Pro Ala Ile Tyr Tyr Gly Thr Glu His Tyr Ala Ala Asn Phe Thr Ser Asn Ser Phe Gly Gln

GGTAm CCTGCCAmTATTAT GGTACT GM CAT TATGCC GCTMC TTT ACC TCTAAC AGTYYYGGTCAA

1810

1882

Val Gl Ser As Pro Glu L B Met Pro Cl Phe As Thr Glu Ser Glu Ala Phe Scr fle Ile GTTGG$AGTGAkCd%~~~ GAG A ATG CCA GGjl T'lTGAg ACG GAAAGT GAG GCFlTCTCCAYTAR

Lys Thr Leu sly Asp teu Arq Lys Ssr Ser Pro Ala Ilc Gln Asn Gly Thr Tyr Thr Clu Leu 'Frp Val Asn AAAACA CTG GGT GAC CTAAGG AM AGTAGC CCG GCA ATT CM MTGGA ACTTAT ACl'GAA CTA XGGTTAAT

1954

2026

As GA P

Asp Ile Leu Val Phe Glu Ar Arg Ser Gly Asn Asp Ile Val Ile Val Ala X&u Asn Arg Cly Glu Ala GAT ATA TTA GTAlTTGAG CG! CGTTCT GGG AhC GATATTGTTATT GTTGCA ClTAATCGTGGTGAG GCT

Am Thr Ile Aan Val Lys Asn Ile Ala Val Pro Asn Gly Val Tyr Pro Ser ku Ile Cly Asn Asn Ser Val MC ACA ATTAATGTPAAA MTATA GCG GTTCCT MT GGGGTA TAT CCG AGTTTG ATTCGG MT MT AGTtXT

2090

2170

Ser Val Al6 Am Lys Ar Thr Thr Leu Thr Leu Met Gin Ann Glu Ala Val Val Ile Ar Ser Gin Sar Asp TCA GTAGCAAATAAA CG8 ACA ACA CTA ACA CTT ATG CM MTGAACCT GlTGTC ATNG&CACAA TCA GAT

Asp Ala Glu Ann Pro Thr Val Gln Ser Ile hen Phe Thr Cys Asn Aen Gly Tyr Thr Ile Ser GLy Gin Ser CAT GCG GAG MCCCTACA GTA CAAAGCATA AAC TTCACA TGT MTMCGGT TATACG A'rl'TCAGGTCiiA AGT

2242

2314

2386

2456

Val TK

r 110 11s 01 Asn Ile Pro Gln Leu Cl GTTTTATTATT Gd NTATACCTCAG lTA GG r

Gly Trp Asp Leu Thr Lys Ala Val I s Ile Sar Pro Thr GGT TGG GACTTA ACT MA GCG GTA AIA ATA TCA CCG ACA

Gln Tyr Pro Gln Trp Ser Ala Ser Leu Glu Leu Pro Ser Asp Leu hsn Val Glu Trp Lys Cya Val Lys Ar CAA TAT CCA CAA TGO AGT GCG AGC TTA GAG CTT CCT TCT GAC TTA MT GTTGM TGG MC TGTGTG MA td

Asn Glu Thr Asn Pro Thr Ala Asn Val Glu Trp Gln Ser Gly Ala Am Am Gln Phe Asn Ser Asn Asp Thr AAT CM ACC MT CCC ACG GCT MT CTT GAG TGG CAG TCT GGT GCA MT MC CAG TTC AAT AGC MT GAC ACA

Gln Thr Thr Asn Gly Ser Phe l ** TTCGThCTCCGGCCATMTT.~T CAAACA ACG MT GGCTCG Tl'TTAA

lTAAAA_ --

2546 TTTTGACTAATACTCTTACAAATTTTCAACCTAGGCTGATGTGhCTACATATTPl’TG CTGTACGCGGACCTGGTCGTCTGCGTCTATTThGAGTC

2641 AAGA~ATATCCA’ITTCACCCKIACGAAATA~MATAG~CGCChAhTAGTAGA~A~~~GAA~CAGCCCGGATTATGCChTCIY;A

2736 ‘FCAATCCCCAAACCMCCATCACCMCCGGnCTAnGCGATGTCGATCATAGCACCAATACCCCGTACCGhAC

Fig. 3. Nucleotide and amino acid sequences of the K. pneumoniae M5al cgr gene. The sixfold repeated hexanucleotide 5’-TTGTAG-3’

(mentioned in the text) is underlined, the CAP-binding site is marked by a dotted line. Transcription initiation and putative ribosome-

binding sites are indicated (boxed and S/D, respectively). The cleavage site of the signal sequence is given by an upward arrow. The

palindromic sequence downstream from the ORF is indicated by horizontal arrows.

Page 6: Cyclodextrin-glycosyltransferase from Klebsiella pneumoniae M5a1: cloning, nucleotide sequence and expression

214

the arvmatic aa residues are clustered. The refevance of these structural features for the export process is not yet knvwn.

The cgt gene has a codon composition in which rare cvdons frequently occur. The f. index (Ikemura, 1981) was calculated to be 0.47, which is that of a weakly expressed gene in E. cofi. It is also striking that the percentage of G + C within the cgt gene is 397; compared to 52 to 56% for total chrvmvsomal DNA from Kleb~iellu. The unusual codvn usage and G + C-content and the fact that the cga gene vnly occurs in strain M5ai of R. ~~~~~~~ffe may indicate that this gene could have been acquired by lateral transfer from some other vrganism.

The N-terminal aa of purified CGT from K. pneumoniae M5al were determined in order to correlate primary structures of the protein and of the nucleotide sequence derived polypeptide. The aa positions at the CGT N-terminus which could be determined unambiguously, were H,N-ala-glu-pro; this N-terminus is separated from the start point of

the ORF by 30 aa which bear all the characteristic features of a signal peptide (Perlman and Halvorson, 1983). This indicates that CGT is synthesized as an M, 72 916 precursor protein of 655 aa residues with a 30-aa signal peptide which is processed between two alanine residues. A precursor of this size can be detected by immunoblvtting experiments in strains in which cgt expression has been placed under the control of the hybrid fat promoter (unpublished results).

Mature native CGT from K. ~ne~rnon~~e as well as

SDS denatured enzyme display an apparent Mr of 68000 (Bender, 1977). This is consistent with a M, of 69029 calculated from the aa composition and demonstrates that the enzyme consists of a single polypeptide.

Nuclease Sl mapping of the start point of tran- scription was performed to provide the structural basis for regulation studies. As shown in Fig. 4, a signal was obtained only with RNA prepared from K. pneumoniae MSal grown at the expense of starch. Upon growth on glucose the transcriptivn of egg is repressed. Transcription starts within a region between nt 150 and 155 upstream from the ATG of the ORF (boxed in Fig. 3).

Fig. 4. happens of the in viva start point of ~~sc~pt~on of the cgl gene. Bulk RNA was prepared from cultures ofK. pnetcmoniae

MM grown in minimal medium at the expense of starch and glucose, respectively, as carbon sources. Cells were harvested in the late exponential growth phase. The RNA was hybridized (45°C) with 3 405bp BgflI-EcoRI fragment of plasmid pCUlO0 (Fig, 2), labelled with 32P at its 5’-BglII-end. The hybridization mixture was treated with nuclease Sl (With et al., 1986) and el~~ophore~ca~y analyzed by PAGE in a 64/, gel conta~ing 7 M urea. Lane 1: Labelled 405 bp-fragment; lane 2: RNA pre- pared from K. pneumoniue M5al cells grown at the expense of starch; lane 3: RNA prepared from K.pnemnoniae M5al cells grown on glucose.

Upstream From the start point of transcriptivn there is no sequence motif corresponding to the - 10 and -35 consensus boxes of E. co/i promoters (Rosenberg and Court, 1979). Within a distance of 90 nt, however, a classical ~AP-b~~g motif 5’-TGTGA-3’ (De Crombrugghe et al., 1984) occurs. The existence of an E. co&type CAP binding site and the fact that c@ expression is subject to cat&o&e repression both in E. co& (~pub~sh~ results) and K. pneumoniae, suggest that the repres- sion systems of the two organisms are identical.

Page 7: Cyclodextrin-glycosyltransferase from Klebsiella pneumoniae M5a1: cloning, nucleotide sequence and expression

CGT K.pn. 49 AMY B.st. 20 AMY B.am. 17 AMY B.su. 16 AMY A.or. 36 AMY Rat 15 AMY Hog 15 AMY H.s. 18 AMY Bar. 20

CGT K.pn. 103 AMY B.st. 76 AMY B.am. 73 AMY B.su. 72 AMY A.or. 92 AMY Rat 71 AMY Hog 71 AMY H.s. 74 AMY Bar. 76

CGT K.pn. 149 RDGVFITDYPTNVAANTGWYHHNGG AMY B.st. 130 ISGTYQIQAWTKFDFPGRGNTYSSFKWRWYHFDGVDWDESRKLSRIYKFRGIGKA AMY B.am. 127 TSEEYQIKAWTDFRFPGRGNTYSDFKWHWYHFDGADWDESRKISRIFKFRGEGKA AMY B.su. 117 IPNWTHGNTQ AMY A.or. 137 FKPFSSQDYFHPFCFIQNYE AMY Rat 116 GSYFNPMNREFSAVPYSAWYFNDNKC NGE AMY Hog 116 GSYCNPGNREFPAVPYSAWDFNCNGKKTASGG AMY H.s. 119 GSYFNPGSRDFPVVPYSGWDFNDGKCKTGSGD

AMY Bar. 130 CRDDTK

CGT K.pn. 174 VTNWNDFFQVKNHNLFNLSDLNQSNTDVYQYLLDGSKFWIDAGV AMY B.st. 185 !iDWEVDTENGNYDYLMYADLD?lDHPEWTELKSWGKWYVNTTNI AMY B.am. 182 WDWEVSSENGNYDYLMYADVDYDHPDWAETKKWGIWYANELSL AMY B.su. 127 IKNWSDRWDVTQNSLLGLYDWNTQNTQVQSYLKRFLDRALNDGA AMY A.or. 157 DQTQVEDCWLGDNTVSLPDLDTTKDVVKNEWYDWVGSLVSNYSI AMY Rat 145 INNYNDANQVRNCRLSGLLDLALDKDYVRTKVADYMNNLIDIGV AMY Hog 148 IESYNDPYQVRDGCQVLLLDLALEKDYVRSMIADYLNKLIDIGV AMY H.s. 151 IENYNDATQVRDCRLSGLLDLALGKDYVRSKIAEYMNHLIDIGV AMY Bar. 121 FEGGT AMY Bar. 144 YSDGTANLDTGADFAAAPDIDHLNDRVQRKLKEWLLWLKSDLGF

CGT K.pn. 231 SFIQKWTSDIYDYSKSIGREGFFFF SANTTTGVDGNAIDYANTSG AMY B.st. 242 SFFPDWLSDVRSQTGKPLFT V DINKLHNYIMKTNGTMSLFD AMY B.am. 239 SFLRDWVQAVRQATGKEMFT V NAGKLENYLNKTSFNQSVFD AMY B.su. 184 PDDGSYGSQFWPNITNTSAEFQ Y SASRDAAYANYMDVTASHYG AMY A.or. 213 KDFWPGYNKAAGVYC I DPAYTCPYQNVMDGVLNYPI AMY Rat 202 GDIKAVLDKLHNLNTKWFSQGSRPFI F GGEAIKGSEYFGNGRVTEFK AMY Hog 205 GDIKAVLDKLHNLNTNWFPAGSRPFI F GGEAIKGSEYFSNGRVTEFK AMY H.s. 208 GDIKAILDKLHNLNSNWFPEGSKPFI Y GGEPIKSSDYFGNGRVTEFK AMY Bar. 200 PEMAKVYIDGTSPSLA V MATGGDGKPNYDQDAHRQNL

CGT K.pn. 202 SALLDFGFRDTLERVLVGRSGNTMKTLNSYLIKRQTVFTSDDWQ AMY B.st. 289 APLHNKFYTASKSGGTF DMRTLMTNTLMKDQPTLA AMY B.am. 286 VPLHFNLQAASSQGGGY DMRRLLDGTVVSRHPEKA AMY B.su. 233 Ak?Y A.or; 254

HSIRSA LKNRN LGVSMISHYASDVSADKL YYPLLNAFKSTSGSMDD LYNMINTVKSDCPDSTLL

AMY Rat 255 YGAKLGTVIRKTNGEKM SYLKNWGEGWGFVPTDRA AMY Hog 258 YGAKLGTVVRKWSGEKM SYLKGPLKGWGLMPSDRA AflY H.s. 261 YGAKLGTVIRKWTGEKM SYLKNWEEGWGFMPSDRA AMY Bar. 243 VNWVDKVGGAASAGMVFDFTTKGILNAAVEGEL!~RLIDPQGKAPGVMGW~lPAKA

CGT K.pn. 326 AMY B.st. 323 AMY B.am. 321 ANY B.su. 262 Ab!Y A.or. 290 AMY Rat 290 ANY Hog 293 AMY H.s. 296 AMY Bar. 297

GGDLRGLINKLPYLKSLGVTSIbIITPPIDNWiNTDAAGNT GYHGYWGRDYFRID GTLWTKVANEANNLSSLGITALWL PPAYKGTSRSDVGYGVYDLYDLGEFNQKGAVR GQHWKRLQNDAEHLSDIGITAVWI PPAYKGLSQSDNGYGPYDLYDLGEFQQKGTVR NWSFNTLKHNMKDIHDAGYTAIQT SPINQVKEGNQGDKSMSNWYWLYQPTSYQIGN KYCGGTWQGIIDKLDYIQGMGFTA IWITPVTAQLPQDCAYGDAYTGYWQTDIYSLN HLFEWRWADIAKECERYLAPKGFG GVQVSPPNENIIINNPSRPWWERYQPISYKIC~ HLFEWRWVDIAKECERYLGPKGFG GVQVSPPNENVVTGNPSRPWWERYQPVSYKLC HLFEWRWVDIALECERYLAPKGFG GVQVSPPNENVAINNPFRPWWERYQPVSYKLC GFNWESWKQSGGWYNMMMGKVDDI AAAGVTHVWLPPPSHSVSNEGYMPGRLYDIDA

EHFGNLDDFKELT SLMHSPDYNMKLV NANDENEPGALY TKYGTKAQY LQ AIQAAHAAGMQVY TKYGTKSE LQDAIGSLHSRNVQVY RYLGTEQEF KE MCAAAEEYGIKVI ENYGTADDL KA LSSALHERGMYLM SRSGNENEF KD MVTRCNNVGVRIY TRSGNEDEF RD MVTRCNNVGVRIY TRSGNEDEF RN MVTRCNNVGVRIY CGNAVSAGTSSTC SKYGNAAEL KS LIGALHGKGVQAI CADYKDSRGIYCI

Fig. 5. Comparison of amino acid sequences of CGT and various a-amylases. Organisms are abbreviated as: K. pn., Klebsieilu

pneumoniae; B. St., Bacillus stearothennophilus; B. am., Batik amyloliquefaciens; B. su., Bacillw subtilis; A. or., Aspergillm oryzae; H.,

Human; s., saliva; Bar., Barley. Alignment of or-amylases is adopted from Nakajima et al. (1986). CGT sequence is aligned to give the best tit and intervals between neighbouring amino acids are extended, whenever necessary for this purpose. Only the region of high homology is shown, and regions l-4 containing highly conserved sequences are boxed.

275

Page 8: Cyclodextrin-glycosyltransferase from Klebsiella pneumoniae M5a1: cloning, nucleotide sequence and expression

216

Upstream from the initiation codon there is a putative E. c&type RBS (Stormo et al., 1982) which may contribute to the excellent expression of cgt in E. c&L

The most striking feature of the region flanking the cgc gene at the 5’ side is the .occurrence of a sixfold repeated 5’-TTGTAG-3’ sequence between the start point of ~~sc~ption and the Shine-Dalgarno motif, Its functian in the expression of cgt is not yet known.

Downstream from the ORF there is a palindromic sequence which displays the characteristics of a

p-independent transcription termination signal (Rosenberg and Court, 1979).

(e) Homology to ez-amylaws

The CGT sequence was compared with the pri- mary structures of eight diEerent cr-amyiases com- piled recently by Nakajima et al. (1986) (Fig. 5). Restricted homologies were found between aa 77 and 36.5 of CGT and sequence motifs of ot-amylases assumed to participitate in substrate binding and/or catalysis. Amongst g-amylases, homology occurs at four di&rent segments of the primary structure (Nakajima et al,, 1986); two of them - regions 2 and4- are clearly present in the CGT sequence (Fig. 5). The homology to regions I and 3 is limited, although the spacing of the four regions in the sequences of CGT and a-amylases is similar. The homology of sites important for catalysis in a-amylases, with equivalent regions in CGT, sug- gests a common evolutionary derivation of the two classes of enzymes.

After submission of this manuscript Takano et al, (1986) published the sequence of 3~~~il~ ~~~~~

CGT. A comparision with the primary structure of K. pneumoniue CGT showed an overall homology of about 30 % _ The highly conserved boxes described in this paper can also be seen in the aa sequence of the B. macerans enzyme.

(1) G + C-content and codon usage of the cgt gene of ~~~~e~~oni~e MSal indicate that it might have been acquired by Klebsielh via lateral gene transfer. Sequence homologies between CGT and

a-amylases indicate that they are derived from a common precursor.

(2) The level of expression of cgt in E. coli is

compar&&z to that in K. ~~e~~~~e, It is subject to catabolite repression in both organisms.

(3) CGT is secreted into the medium by E. coli,

although to a lower extent as by K. pneumoniae. The

CGT signal peptide does not display any con- spicuous differences to those of proteins exported into the periplasm or integrated into membranes. Secretion, therefore, seems to be mediated by an intrinsic property of the protein.

ACKNOWLEDGEMENT

We thank GA Heller for her excellent technical assistance and G. Sawers for critically reading the manuscript. We are greatly indebted to M. Geier for editorial work.

REFERENCES

Beauclerk, A.AD., Hummel, H., Holmes, D.J., Beck, A. and Cundliffe, E.: Studies of the GTPase domain of archae- bacterial ribosomes. Eur. J. Biochem. 15 1 (1985) 245-255.

Bender, H.: Cyciodextrin-Gluc~otrrursferase van Kfeb&e&

pneumoniae, 1. Synthese, i\einigung und Eigenschaen des Enzyms von Klebsiella pneumoniae MSal. Arch. Microbial. 111 (1977) 271-282.

De Crombruggbe, B., Busby, S, and But, H.: CycIic AMP receptor protein: role in transcription activation. Science 224

(1984) 831-838. Fraenkel, D.G. and Neidhardt, F.C.: Use of chloramphenicol to

study control of RNA synthesis in bacteria. B&him. Biophys. Acta 53 (1961) 96-100.

Garvey, J.S., Cremer, N.E. and Sussdorf, D.H.: Methods in Immunology. Benjamin, Readii, MA, 1977.

Gray, C.P., Sommer, R., Potke, C., Beck, E. and Schaher, H.: Structure of the origin of DNA rephcation of bacteriophage

fd. Proc. Natl. Acad. Sci. USA 75 (1978) 50-53. Hennecke, H., GBnther, I. and Binder, F.: A novel cloning vector

for the direct selection of re~ornb~n~t DNA in E. c&i.

Gene 19 (1982) 231-234. Horikoshi, K. and Teruhiko, A.: Alcalophihc Microorganisms.

Japan Scientific Societies Press, Tokyo, 1982. Howe, 3.G. and Hershey, J.W.B.: A sensitive imm~oblotti~g

method for measuring protein initiation factor levels in lysates of Escherichiu coli. J. Biol. Chem. 256 (1981) 12836-12839.

Page 9: Cyclodextrin-glycosyltransferase from Klebsiella pneumoniae M5a1: cloning, nucleotide sequence and expression

211

Ikemura, T.: Correlation between the abundance of Esc~e~c~i~

cob’ transfer RNAs and the occurrence of the respective codons in its protein genes: a proposal for a s~on~ous codon choice that is optimal for the E. wli translational system. J. Mol. Biol. 151 (1981) 389-409.

Johnson, D.A., Gantsch, J.W., Sporbman, JR. and Elder, J.H.: Improved technique utilizing nonfat dry milk for analysis of proteins and nucleic acids transferred to nitrocellulose. Gene Anal. Technol. 1 (1984) 3-8.

Laemmli, UK.: Cleavage of structural proteins during the assembly of the head of bacteriophage T4. Nature 227 (1970) 680-685.

Maniatis, T., Fritsch, E.F. and Sambrook, J.: Molecular Cloning. A Laboratory Manual. Cold Spring Harbor Laboratory, CoId Spring Harbor, NY, 1982,

Maxam, A.M. and Gilbert W.: Sequencing end labeled DNA with base-specific chemical cleavages. Methods Enzymol. 65 (1980) 499-560.

Miller, J.H.: Experiments in Molecular Genetics. Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 1972.

Nakajima, R., Imanaka, T. and Aiba, S.: Comparision of amino acid sequences of eleven different a-amylases. Appl. Micro- biol. Biotechnol. 23 (1986) 335-360.

Perlman, D. and Halvorson, H.O.: A putative signal peptidase recognition site and sequence in eukaryotic and prokaryotic signal peptides. J. Mol. Biol. 167 (1983) 391-409.

Piepersberg, A., Hennecke, H., EngeIhard, M., Nass, G. and Bock, A.: Cross-reactivity ofphenyl~anyltransfer ribonucleic acid ligases from different microorganisms. J. Bacterial. 124 (1975) 1482-1488.

Rosenberg, M. and Court, D.: Regulatory sequences involved in the promotion and termination of RNA transcription. Annu. Rev. Genet. 13 (1979) 319-353.

Saito, H. and Miura, K.J.: Preparation of transfo~ing deoxyribonucleic acid by phenol treatment. Biochim. Biophys. Acta 72 (1963) 619-629.

Schmid, G. and Buck, A.: Immunoblotting analysis of ribosomal proteins from archaebacteria. System. Appi. Microbial. 5 (1984) l-10.

Southern, E.M.: Detection of specific sequences among DNA fragments separated by gel electrophoresis. J. Mol. Biol. 98 (1975) 503-517.

Sprenger, G.A. and Lengeler, J.W.: L-Sorbose metabolism in Klebsiella pneumoniae and Sor + derivatives of Escherichia coli

K-12 and chemotaxis towards sorbose. J. Bacteriof. 157 ( 1984) 39-49.

Stormo, G.D., Schneider, T.D. and Gold, L.M.: Character- ization of translational initiation sites in E. co& Nucl. Acids Res. 10 (1982) 2971-2995.

Szejtli, 1.: Cyciodextrins and Their Inclusion Complexes. Akadimiai Kiadb, Budapest, 1982.

Takano, T., Fukuda, M., Monma, M., Kobayashi, S., Kainuma, K. and Yamane, K.: Molecular cloning, DNA nucleotide sequence, and expression in Bacillus subtilis cells of the Bacillus macerans cyclodextrin glucanotransferase gene. J. Bacterial. 166 (1986) 1118-I 122.

With, G., Hummel, H., Jarsch, M., Bar, U. and Bock, A.: Transcription signals for stable RNA genes in Metbanococcus.

Nucl, Acids Res. 14 (1986) 2459-2479.

W~ttm~n-Liebold, B. and Kimura, M.: Microsequencing of peptides and proteins with ~~~-dimethyl-~~o-~o- benzen-4’-isothiocyanate. Methods Mol. Biol. 1 (1984).

Communicated by K.F. Chater.