12
Copyright 0 1993 by the Genetics Society of America Structural Changes in the Antennapedia Complex of Drosophila pseudoobscura Filippo M. Randazzo,’ Mark A. Seeger,* Catherine A. HUSS, Margaret A. Sweeney, Jeffrey K. Cecil and Thomas C. Kaufman’ Howard Hughes Medical Institute, Institute for Cellular and Molecular Biology, and Programs in Genetics and Cellular, Molecular and Developmental Biology, Department of Biology, Indiana University, Bloomington, Indiana 47405 Manuscript received October 16, 1992 Accepted for publication January 27, 1993 ABSTRACT The discovery of the striking positional conservation between the Antennapedia and Bithorax homeotic gene complexes (ANT-C and BX-C) in Drosophila melanogaster and the murine Hox and human HOX clusters has had a substantial impact on our understanding of the evolution of development and its genetic regulation. Structural differences do exist among the mammalian Hox complexes and the ANT-C in D. melanogaster. To gain further insight into the evolutionary changes among these complexes, the ANT-C was cloned in the closely related species, Drosophila pseudoobscura. The overall structure of the ANT-C in D. pseudoobscura is highly similar to its D. melanogaster counterpart; however, two differences in the organization of the ANT-C have been identified. First, the 22 gene, a member of the ANT-C in D. melanogaster, is not present in the D. pseudoobscura ANT- C and is possibly absent from the D. pseudoobscura genome. Second, the orientation of the Deformed gene is inverted in D. pseudoobscura, providing it with a 5’to 3‘ direction of transcription identical to the remaining ANT-C homeobox genes with the exception of fushi tarazu. These differences demonstrate that subtle changes can occur in ANT-C structure during relatively short periods of evolutionary divergence, although the fundamental organization of the complex is conserved. These - - observations and others suggest that the complex is have maintained this organization of genes for some T HE discoveries of the Antennapedia Complex (ANT-C) and the Bithorax Complex (BX-C) in Drosophila melanogaster (KAUFMAN, LEWIS and WAKI- MOTO 1980; LEWIS 1978) and the subsequent isolation of cognate Hox complexes in mammals (DUBOULE and DOLLE 1989;GRAHAM, PAPLOPULU andKRUMLAUF 1989; BONCINELLI et al. 1988) have contributed sig- nificantly to an understanding of the evolution of the developmental process and its genetic control. It has become increasingly clear that not only are genes that encode metabolically important products conserved across phyla, but also loci that encode developmentally important gene products. Among the develop- mentally important genes, homeobox genes have been isolated from a large array of eucaryotic species (see SCOTT, TAMKUN, and HARTZELL 1989 for a review). Perhaps more significantly, clustered sets of homeo- box genes have been discovered in humans (BONCI- NELLI et al. 1988), mice (DUBOULE and DOLLE 1989; GRAHAM, PAPLOPULU and KRUMLAUF 1989; KAPPEN, SCHUGHART and RUDDLE 1989), chickens (WEDDEN, Avenue, Toronto, Ontario M5G 1x5, Canada. Present address: Mount Sinai Hospital Research Institute, 600 University * Present address: Program in Molecular and Cell Biology, Oklahoma Medical Research Foundation, 825 Northeast 13th Street, Oklahoma City, Oklahoma 73104. ’Present address: Howard Hughes Medical Institute, Department of Biology, Indiana University, Bloomington, Indiana 47405. Genetics 133: 319-330 (May, 1993) v not absolutely rigid but that selective pressures Functional reason that remains elusive. PANGand EICHELE 1989; IZPISUA-BELMONTE et al. 199 l), and the German cockroach Blattella germanica (Ross and TANAKA 1988) and the red flour beetle Tribolium castaneum (STUART et al. 1991). Genetic analysis of Tribolium has uncovered a combined ANT-C/BX-C-like gene complex referred to as the HOM-C (BEEMAN et al. 1989). Its member genes are functionally homologous and similarly aligned to the ANT-C and BX-C homeotic genes of Drosophila (BEE- MAN et al. 1989). Furthermore, four duplicated sets of genes with high sequence similarity to the ANT-C/ BX-C genes has been found in the mouse (Hox) and in humans (HOX) (see AKAM 1989 for review). These observations taken together indicate that the homeo- box must have been present early on in the evolution of eucaryotes, and a combinedANT-C/BX- C-like complex must have been present before protostomes and deuterostomes diverged over 500 million years ago. The structure of the ANT-C and the BX-C in D. melanogaster is characterized by the unusual proximity of a significant number of developmentally important genes and by the collinearity between homeotic gene position on the chromosome and their respective func- tional domains along the anterior-posterior axis ofthe animal. The numerous homeobox-containing genes

Structural Changes in the Antennapedia Complex of ... · Antennapedia Complex Evolution 32 1 1984). The probes were made using the random primer a’* P method (FEINBERG and VOGELSTEIN

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Structural Changes in the Antennapedia Complex of ... · Antennapedia Complex Evolution 32 1 1984). The probes were made using the random primer a’* P method (FEINBERG and VOGELSTEIN

Copyright 0 1993 by the Genetics Society of America

Structural Changes in the Antennapedia Complex of Drosophila pseudoobscura

Filippo M. Randazzo,’ Mark A. Seeger,* Catherine A. HUSS, Margaret A. Sweeney, Jeffrey K. Cecil and Thomas C. Kaufman’

Howard Hughes Medical Institute, Institute for Cellular and Molecular Biology, and Programs in Genetics and Cellular, Molecular and Developmental Biology, Department of Biology, Indiana University, Bloomington, Indiana 47405

Manuscript received October 16, 1992 Accepted for publication January 27, 1993

ABSTRACT The discovery of the striking positional conservation between the Antennapedia and Bithorax

homeotic gene complexes (ANT-C and BX-C) in Drosophila melanogaster and the murine Hox and human HOX clusters has had a substantial impact on our understanding of the evolution of development and its genetic regulation. Structural differences do exist among the mammalian Hox complexes and the ANT-C in D. melanogaster. To gain further insight into the evolutionary changes among these complexes, the ANT-C was cloned in the closely related species, Drosophila pseudoobscura. The overall structure of the ANT-C in D. pseudoobscura is highly similar to its D. melanogaster counterpart; however, two differences in the organization of the ANT-C have been identified. First, the 2 2 gene, a member of the ANT-C in D. melanogaster, is not present in the D. pseudoobscura ANT- C and is possibly absent from the D. pseudoobscura genome. Second, the orientation of the Deformed gene is inverted in D. pseudoobscura, providing it with a 5’to 3‘ direction of transcription identical to the remaining ANT-C homeobox genes with the exception of fushi tarazu. These differences demonstrate that subtle changes can occur in ANT-C structure during relatively short periods of evolutionary divergence, although the fundamental organization of the complex is conserved. These - - observations and others suggest that the complex is have maintained this organization of genes for some

T HE discoveries of the Antennapedia Complex (ANT-C) and the Bithorax Complex (BX-C) in

Drosophila melanogaster (KAUFMAN, LEWIS and WAKI- MOTO 1980; LEWIS 1978) and the subsequent isolation of cognate Hox complexes in mammals (DUBOULE and DOLLE 1989; GRAHAM, PAPLOPULU and KRUMLAUF 1989; BONCINELLI et al. 1988) have contributed sig- nificantly to an understanding of the evolution of the developmental process and its genetic control. It has become increasingly clear that not only are genes that encode metabolically important products conserved across phyla, but also loci that encode developmentally important gene products. Among the develop- mentally important genes, homeobox genes have been isolated from a large array of eucaryotic species (see SCOTT, TAMKUN, and HARTZELL 1989 for a review). Perhaps more significantly, clustered sets of homeo- box genes have been discovered in humans (BONCI- NELLI et al. 1988), mice (DUBOULE and DOLLE 1989; GRAHAM, PAPLOPULU and KRUMLAUF 1989; KAPPEN, SCHUGHART and RUDDLE 1989), chickens (WEDDEN,

Avenue, Toronto, Ontario M5G 1x5, Canada. ’ Present address: Mount Sinai Hospital Research Institute, 600 University

* Present address: Program in Molecular and Cell Biology, Oklahoma Medical Research Foundation, 825 Northeast 13th Street, Oklahoma City, Oklahoma 73104.

’Present address: Howard Hughes Medical Institute, Department of Biology, Indiana University, Bloomington, Indiana 47405.

Genetics 133: 319-330 (May, 1993)

v

not absolutely rigid but that selective pressures Functional reason that remains elusive.

PANG and EICHELE 1989; IZPISUA-BELMONTE et al. 199 l), and the German cockroach Blattella germanica (Ross and TANAKA 1988) and the red flour beetle Tribolium castaneum (STUART et al. 1991). Genetic analysis of Tribolium has uncovered a combined ANT-C/BX-C-like gene complex referred to as the HOM-C (BEEMAN et al. 1989). Its member genes are functionally homologous and similarly aligned to the ANT-C and BX-C homeotic genes of Drosophila (BEE- MAN et al. 1989). Furthermore, four duplicated sets of genes with high sequence similarity to the ANT-C/ BX-C genes has been found in the mouse (Hox) and in humans (HOX) (see AKAM 1989 for review). These observations taken together indicate that the homeo- box must have been present early on in the evolution of eucaryotes, and a combined ANT-C/BX- C-like complex must have been present before protostomes and deuterostomes diverged over 500 million years ago.

The structure of the ANT-C and the BX-C in D. melanogaster is characterized by the unusual proximity of a significant number of developmentally important genes and by the collinearity between homeotic gene position on the chromosome and their respective func- tional domains along the anterior-posterior axis of the animal. The numerous homeobox-containing genes

Page 2: Structural Changes in the Antennapedia Complex of ... · Antennapedia Complex Evolution 32 1 1984). The probes were made using the random primer a’* P method (FEINBERG and VOGELSTEIN

320 F. M. Randazzo et al.

of both these complexes presumably arose through a series of duplication and divergence events (LEWIS 1978). In contrast to the clustered configuration of these genes, inversion events normally tend to rear- range genes over evolutionary time (STURTEVANT and TAN 1937; DOBZHANSKY and STURTEVANT 1937). Thus, the extreme proximity of the homeobox genes infers some form of evolutionary constraint on change. At least two hypotheses could explain the existence of the homeotic gene complexes: (A) these genes cannot function o r do not function as well outside the domain of the complexes, e.g., because of possible chromatin structure requirements; (B) the juxtaposition of the member genes and their complex cis-regulatory regions makes separation by random genetic processes highly unlikely or even impossible, thereby maintaining the ancient configuration.

T h e unique characteristics of the ANT-C in D. melanogaster necessitate evolutionary analysis in more closely related species to determine the plasticity of the ANT-C structure. The ANT-C contains five hom- eotic genes that specify head and thoracic segmental identity, Antennapedia (Antp), Sex combs reduced (Scr), Deformed (Dfd ) , proboscipedia ( p b ) and labial (lab) [see MAHAFFEY and KAUFMAN 1987 or KAUFMAN, SEEGER and OLSEN 1990 for review]. The ANT-C differs from the BX-C [see DUNCAN 1987 and PEIFER, KARCH and BENDER 1987 for review] and from the murine Hox (DUBOULE and DOLLE 1989; GRAHAM, PAPLOPULU and KRUMLAUF 1989; KAPPEN, SCHUCHART and RUD- DLE 1989) and human HOX (BONCINELLI et al. 1988) complexes in that it carries some unique homeobox genes involved in establishing the embryonic body plan, namely zen,f tr and bcd. T h e pair rule genefushi tarazu ( f t z ) acts to divide the embryo into segments (WAKIMOTO, TURNER and KAUFMAN 1984; WEINER, SCOTT and KAUFMAN 1984), the maternal effect gene bicoid (bcd) encodes a morphogen important in estab- lishing anterior-posterior polarity (FRIGERIO et al. 1986; BERLETH et al. 1988), and the zygotic gene zerknullt (ren) specifies dorsal embryonic structures (WAKIMOTO, TURNER and KAUFMAN 1984; RUSHLOW et al. 1987). In addition, the z2 gene, which is located just proximal to Zen, contains a zen-like homeobox (RUSHLOW et al. 1987; PULTZ et al. 1988). It is ex- pressed in a pattern similar to that of zen, but has no detectable genetic function (PULTZ et al. 1988). The ANT-C also carries several other transcription units that do not contain homeoboxes- amalgam (ama) a member of the immunoglobulin superfamily (SEEGER, HAFFLEY and KAUFMAN 1988), and a cluster of eight genes that show sequence similarity to cuticle protein genes (FECHTEL et al. 1988; Fechtel, Fristrom and FRISTROM 1989; PULTZ et al. 1988; PULTZ 1988) . The presence of these nonhomeotic genes is one differen- tiating characteristic of the ANT-C from the BX-C,

the murine HOX complexes and the human HOX complex.

To address the questions pertaining to the rigidity of the ANT-C organization and the unique presence of nonhomeotic genes within its boundaries, the ANT-C in Drosophila pseudoobscura was cloned. Pre- vious studies have documented numerous chromo- somal changes in D. pseudoobscura when compared with D. melanogaster, indicating the potential for changes in the ANT-C (STURTEVANT and TAN 1937; DOBZHANSKY and STURTEVANT 1937; STEINEMANN, PINSKER and SPERLICH 1984). The cloning of ANT- C from a second Drosophilid species, estimated to have diverged approximately 46 million years ago (BEVERLY and WILSON 1984), and the previous clon- ing of homologous homeobox gene clusters in two mammalian species (mouse and human), estimated to have diverged roughly 68 million years ago (BENTON 1990), presents an opportunity to explore patterns in the more recently evolutionary history of homeobox gene complexes.

MATERIALS AND METHODS

Cloning ANT-C The chromosome walk was performed using techniques described in SCOTT et al. (1983). Both lambda EMBL-4 (gift of C. LANGLEY, University of Califor- nia, Davis) and lambda DASH-2 (gift of T. HAZELRIGG, Columbia University) D. pseudoobscura genomic libraries were used. The Ayala strain was used to make the EMBL-4 library. All clones except for p191, p195, p474, p475 and p488 were obtained from the EMBL-4 library. Prehybridi- zation and hybridization were performed using either 4X SSC, 5X Dendhardt's solution, 0.1% (w/v) SDS and 0.1% (w/v) carrier calf thymus DNA at 65" (interspecies and intraspecies hybridizations) or 5X SSC, 5X Denhardts, 0.1 % (w/v) SDS and 0.1 % (w/v) calf thymus DNA plus either 30% (w/v) formamide (intraspecies) or 50% (w/v) formamide (interspecies) at 42". Intraspecies hybridizations were fol- lowed by 0.1X SSC, 0.1 % (w/v) SDS, 0.1% (w/v) sodium py.rophosphate washes (IX wash at room temperature for 4 mln followed by 2 X washes for 40 min at 65'). The same conditions were used for Southern hybridizations. Probes were made by the nick translation method using Amersham kits (Arlington Heights, Illinois). All lambda genomic clones recovered in this study were mapped with five restriction enzymes (BurnH1, EcoRI, HindIII, Sal1 and XbaI). Cuticle- like genes were identified by Southern blot hybridization using a pIBI5 1 RX 1.5 (a known cuticle gene) subclones as probes (FECHTEL et al. 1988; FECHTEL, FRISTROM and FRIS- TROM 1989) WITH THE METHODS AND CONDITIONS AS DE- SCRIBED ABOVE.

Genomic Southern blot hybridizations: D. melanoguster Oregon R DNA was a gift of S. Horikami, while D. pseu- doobscura DNA was extracted using methods described in SCOTT et al. (1983). Genomic DNA was blotted on Nytran filters (Schlechier and Schuell, Keene, New Hampshire). Prehybridization and hybridization were performed using 5x SSC, 5x Denhardt's solution, 250 Pg/ml sonicated and boiled calf thymus DNA, 50 mM Nap04 pH 7.0, 0.1% (w/ V) SDS, and 43% (w/v) deionized formamide at 37" (Mc- GINNIS et al. 1984). Washing was performed first at room temperature for 5 min ( 3 X ) and then at 50" for 15 min (2X) using 2x SSC, 0.1% (w/v) SDS solution (MCGINNIS et al.

Page 3: Structural Changes in the Antennapedia Complex of ... · Antennapedia Complex Evolution 32 1 1984). The probes were made using the random primer a’* P method (FEINBERG and VOGELSTEIN

Antennapedia Complex Evolution 32 1

1984). The probes were made using the random primer a’* P method (FEINBERG and VOGELSTEIN 1983).

RESULTS

Cloning the ANT-C from D. pseudoobscura: The ANT-C in D. pseudoobscura (psANT-C) was cloned to determine the extent of evolutionary constraints on the structure of the ANT-C in another Drosophilid species. A number of separate regions of the psANT- C were cloned by hybridizing a D. pseudoobscura ge- nomic lambda phage library with D. melanogaster Antp, ftz, pb and lab derived probes under low stringency hybridization conditions (see MATERIALS AND METH- ODS). Standard chromosome walking techniques were used to fil l in the gaps between the cloned genomic entry points. Overlaps between steps in the walk were confirmed by Southern blot cross-hybridization and by restriction site mapping. All lambda genomic in- serts were mapped using five restriction enzymes- EcoRI, BamHI, HindIII, Sal1 and XbaI. A few restric- tion site polymorphisms among overlapping genomic clones were identified, and we suspect that they reflect heterogeneity in the nonisogenic flies used to generate the genomic libraries. The relative positions of D. pseudoobscura homologs of the ANT-C coding regions were first identified by Southern blot hybridization. This was done using D. melanogaster cDNAs and cor- responding genomic probes and low stringency con- ditions (final wash of 2x SSC, 65”; data not shown). The identity of restriction fragments similar to the D. melanogaster gene probes were then confirmed by sequencing selected D. pseudoobscura subclones. The DNA sequences were then compared with existing ANT-C gene sequences from D. melanogaster.

Gene order remains conserved for ANT-C in D. pseudoobscura: The ANT-C in D. pseudoobscura has been cloned from a position downstream of the labial gene to the Antp homeobox. Except for the cuticle- like genes that were identified by Southern blot ho- mology alone (data not shown, see MATERIALS AND

METHODS), diagnostic DNA sequences identified ANT-C gene homologs (Figure 1; SEECER and KAUF- MAN 1990 for bcd sequence) and their precise location within the psANT-C and the chromosomal positions of their D. melanogaster homologs is summarized in Figure 2. The results demonstrate that Antp,ftZ, Scr, Dfd, ama, bcd, zen, pb, the culticle-like gene cluster, and lab are clustered and identically ordered in both Drosophila species.

A 22 homolog is apparently absent from D. pseu- doobscura: One difference between the two species is the absence of z 2 from the bcd-pb interval in D. pseu- doobscura (Figure 3). Southern blots of D. pseudoob- scura genomic clones that encompass the pb-bcd inter- val, p300 and p3 10, were first hybridized with D. melanogaster genomic DNA clones containing either

the 3’ or 5‘ half of z 2 and then washed under low- stringency conditions (data not shown). The 3’ probe consists entirely of open-reading frame sequences lo- cated directly downstream of the homeobox, while the 5‘ probe contains 300 bp of coding sequence, including the homeobox, and 140 bp of transcribed, noncoding sequence, as determined from previous structural analysis (RUSHLOW et al. 1987). The 12 3’ probe hybridized to the predicted restriction frag- ments from the D. melanogaster genomic clones and to itself, but not to any DNA within the pb-bcd interval of D. pseudoobscura (data not shown). A Southern blot probed with the z2 5’ homeobox-containing fragment showed hybridization to the 22-containing restriction fragments of D. melanogaster genomic DNA and to the 22 5’ positive control. There were no strongly hybridized bands in the lanes containing the D. pseu- doobscura genomic phage (p300 and p3 10) from the pb-bcd interval. There were however several weakly hybridizing bands from these latter phages that rep- resent cross-hybridization to zen (data not shown). A z 2 homolog should hybridize more strongly to a z 2 homeobox-containing probe than the zen homolog. Thus, D. melanogaster z2 probes fail to detect a z 2 homolog in the pb-bcd interval of D. pseudoobscura.

The question remains, however, whether z2 is pres- ent elsewhere in the D. pseudoobscura genome. T o address this question, D. pseudoobscura genomic Southern blots were hybridized separately with (A) the zen cDNA, (B) the 3’ half of z2, and (C) the 5‘ half of z2 (all of which were derived from D. melano- gaster) and then washed under very low-stringency conditions (Figure 4). The zen cDNA probe, a positive control, hybridized to the expected zen-containing restriction fragments in D. melanogaster-7.2 kb BamHI, 1.2 kb BamHI, 5.9 kb HindIII and 1.4 kb HindIII (Figure 4A; see Figure 3 for restriction map). Consistent with our previous Southern hybridization and DNA sequence date in D. pseudoobscura, the zen cDNA probe hybridizes to restriction fragments that conform to the predicted sizes of zen-containing frag- ments in D. pseudoobscura-7.4 kb BamHI, 4.7 kb BamHI, 6.0 kb HindIII, 10.4 kb EcoRI and 1.2 kb EcoRI restriction fragments. Also evident are a num- ber of other DNA fragments with weaker homology in both D. melanogaster and D. pseudoobscura (Figure 4A). Due to the low stringency conditions used, hom- eobox cross-hybridization is the most likely explana- tion for the numerous secondary bands.

No apparent homology to the 3‘ portion of D. melanogaster 22 (22 3’) is detectable in D. pseudoobscura ( z 2 3’ contains the downstream 47% of the 22 open- reading frame (Figure 4B)). The z 2 3’ probe detects D. melanogaster z2, since the D. melanogaster control lanes display strong hybridization to the predicted 9.6 kb BamHI fragment. In the D. melanogaster HindIII

Page 4: Structural Changes in the Antennapedia Complex of ... · Antennapedia Complex Evolution 32 1 1984). The probes were made using the random primer a’* P method (FEINBERG and VOGELSTEIN

322 F. M . Randazzo et al.

~ n L e u A l ~ s P r o H i ~ G l ~ l n G l ~ r f f i l n V a l ~ l ~ l n G l ~ i ~ l ~ l n ~ H i f f i l n O l n ~ f f i l n H i ~ P ~ ~ ~ s ~ l ~ e r ~ ~ ~ a 1- A ~ ~ ~ C ~ ~ ~ C ~ ~ ~

I1 I I I I I I l l I I I IIIIIIII I I I I I I I I I I I I I I II I I I I I I l I I I I I I I I I I I I I I I I I I I I I I I I I I I I (*) D. M ~ C l C C C G M % C A ~ C A ~ ~ ~ ~

m r Ser H i s Leu HisProAnnProGln Val

!i exon-? begins

**o*o****;**A rProLyaLeuProAl~erOl~Il~l~er~et . HisAsp&GlnMet&ly Gl&uAspMetCyaArg 1. ~ C C l l C C A Q Z A ~ ~ ~ ~ ~ . . . C A C I ; A C T A C C . . . CAaXGGACA'KG'IGCCXC!. . . . . .

(') D. ~ ~ f X A M X T A ~ ~ ~ C A l G C A C C A ~ C T A C W ? G A A ~ ~ ~ III I II 1 1 1 1 1 1 1 I~IIIIIIIIII~I I II II I llir IIIIIIIIIIIIIIIIIIIII 1 1 1 1 1 I I IIIIIIIII

MnSar ASXI His Gln Glu SCiIval

G l y O ~ ~ ~ l y I 1 0 l ~ ~ ~ l y V a i P r o P l ~ ~ ~ r v a l ~ ~ t v a l ~ y ~ ~ ~ ~ ~ ~ ~ p L y ~ ~ ~ ~ ~ ~ ' -. 0 C G " l a " I l l l I I I IIIIIIII I I I I IIIIII IIIII I IIIIIIIIIIIIII I I IIIIIIIIIIIIIIIII 1 1 1 1 1 1 1 1 1 1 """ (c) D. am... ... GXAP

01- AlaSerAla Ala UetLeu AlaSer Ala

I. p G f X I ! A l W i T C P C"ClGCAC"

(') D - O C A A A G C M C A A A ~ . . .-m-. . . . G ~ y C l y P r ~ a G l y O l ~ ~ e ~ ~ l ~ ~ P r ~ e r ~ L y ~ i n O l ~ l ~ ~ V a l G l ~ ~ l ~ l n L e ~ i s ~ l ~ l ~ ~ l ~ ~

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I IIIII I I I I I I I I I I I I I I

AlaSer net

S e ~ l y H i ~ ~ s e i r y r A r m V a l ~ ~ V ~ l ~ ~ ~ e r I l l r L ~ ~ G 1 n A ~ p L . ~ 1 u l l ~ l ~ l a G l ~ a ~ e r L y s S e ~ L y s S ~ r ~ a

II 1 1 1 1 1 1 1 1 1 1 l 1 1 1 1 1 1 IIIII I II IIIIIIIIIIIII IIIIII IIIIIIIIII IIIIIIII IIIIIIIIIII II I I I

-1Y

I. . . . l Y x G c c c A c l a " ~ ~ M G c A G G A ~ c3umxm" (*) D. G c C ' R 1 T G C c C A C ' I R l l C ~ C " C C ~ l C A A A ~ ~ ~ ~ ~ ~

s e r ~ ~ p L y s ~ ~ ~ ~ n v a l ~ v ~ l ~ ~ l ~ l y ~ ~ L y ~ ~ r G l y s e r N ~ I l e T y ~ r P h e A r g I l e O i n A . n I l e O l ~ V a l S . r ~ p 1. A O T r ~ C C ~ ~ ~ ~ ~ C ~ ~ ~ C ~ ~ ~ ~ ~ ~ ~ C

D. A C 1 T C n ; C C C ~ ~ ~ ~ ~ ~ ~ T ~ ~ ~ m ~ ~ T (m I I I I I I I I I I I IIII I I I I I I I I I I 1 II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I

Ser Asp Ala Ann LysGln u n + exon4 begins haneobox begins

(Q1 -* ..""d AlaAsnGl&ema;lnk&lyliatCl c ~ c ~ C r o L y ~ ~ ~ l ~ r ~ a ~ ? L - i l h r A ; o H i s C l n I 1 e L e u G l u L e ~ 1 u L y s G l u P h e ~ i s ~ ~

D. MCGAATtXCltGTATCAACCGGGM ~ ~ C ~ C C G C C A C ~ ~ ~ C ~ C ~ ~ G ~ ~ A C I I I I I I I I I1 II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I

lexon-2 begins X y ~ u L y ; L e ~ ~ u L y ~ g I l d z l u G i ~ a ~ a ~ ~ ~ A ~ ~ l y O ~ ~ n ~ ~ ~ ~ i ~ ~ l ~ e ~ - ~ * ~ - ~ p ~ ~ ~ ~ y ~ * r

I. . .Gcc!TAMAC"~wcAA""mm~~ l I t I 1 1 1 1 1 1 1 IIIIIIII IIIIIIIIIIIlII1I1IlltIII1II I I I I I I I I1 II 1 1 1 1 1 1 1 1 1 1 l 1 1 1 1 1 1 l 1 1 1 1 1 1 I 1 1 1 (H)

D. h ~ ~ ~ ~ A C ~ ~ w ~ ~ ~ C ~ ~ ~ ~ ~ ~ ~ ~ ~

I. ~ . " ' " - - " " - - - - - - - - . I " C c ~ ~ T C ~ ~ w a % A G c A G IIIIII I 1 1 1 1 1 1 1 1 1 1 1 1 IIIIIIII IIIIIIIIII\IIIIII 1 1 1 1 1 1 1 1 l 1 1 1 1 1 1 1 1 1 1 1 IIIII 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IIIIII

* ~ ~ p ~ p C y s P h ~ l ~ t S e ~ ~ l n P h n

(I) D. ~ C ~ ~ T C C ~ C A ~ ~ ~ ~ C w ~ % C C C A ~ ~ ~ % ~

f exon-3 "gins . rThrValAsnA1 r S e r ' I y r ; h r ~ ~ r O i n T r L e ~ l ~ u G l ~ y ~ l ~ ~ H i s P ; l e A ~ ~ ;

I I I I I I I I I I I I I . 1. G T A " X G C C C A " C

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I (J) D. -CmlGAA?W C l ' C A T A C A C C T C A M C A C W X M C - l T C C A m m m c

I

f exon-3 "gins . rThrValAsnA1 r S e r ' I y r ; h r ~ ~ r O i n T r L e ~ l ~ u G l ~ y ~ l ~ ~ H i s P ; l e A ~ ~ ;

I I I I I I I I I I I I I . 1. G T A " X G C C C A " C

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I (J) D. -CmlGAA?W C l ' C A T A C A C C T C A M C A C W X M C - l T C C A m m m c

I exon-1 e n d intron-1

,.. II IllltlltllllllJll111111l11 I IJI I I I l l 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l 1 9. C~~~~~~~~~~

His Ala SerGlyValAlaAla ser

Gl~luIl~snHis~~IieValTh~AlaProA.nOly~ l~GiyA.pPh~~erHis;le01u01uI;lrIsuAlaSe; *).JL***** I. C C G A A ~ G . . . . . . C C X C G A T R C A A ~ T C G C A C A ' K G A G G M 2 A C " K ~ ~ T

\-a II I I f I l 1 1 1 1 1 1 1 1 1 1 ! ! i l ~

I. G C C C ' I G L U C C m ~ ~ ~ A ~ C't3WG-m D. G C C m a C - W T M G A - m A W - haneobox k m=- A l a L e u C y s ~ u T h r G l ~ ~ l n I l e L y ~ I l e T r p P h d ~ s ~ g ~ M e t L y s ~ L y ~ L y s G l ~ s ysTrLisGlyGluh-~lySerGly

(L) 1 1 1 1 1 I I I I I IIIIIIIIIIIIII IIIII I I I I I I I I I I I I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l l l l l l l l l l l ~ l 1 1 1 1 1 IIIII I1 1 1

ends 1 FIGURE 1.-DNA sequence comparisons between D. melanogaster (m.; above) and D. pseudoobscura (p.; below) from selected regions in the

D. pseudoobscura chromosome walk. The DNA sequence was aligned using the "bestfit" program of the GCG software package (DEVEREUX, HAEBERLI and SMITHIES 1984). All alignments use a gap weight of 5.00 and a gap length weight of 0.30. Vertical bars betwen D. melano- gaster and D. pseudoobscura DNA sequence indicate nucleotide identity. Vertical arrows indicate intron/extron junctions. The highly con- served AG sequence immediately preceding all 3' splice sites and the highly conserved G T sequence that immediately

Page 5: Structural Changes in the Antennapedia Complex of ... · Antennapedia Complex Evolution 32 1 1984). The probes were made using the random primer a’* P method (FEINBERG and VOGELSTEIN

Antennapedia Complex Evolution 323

lane, however, strong hybridization signals are ob- served at 12 kb and at 4.7 kb, but not at the predicted 6.7 kb HindIII fragment. This discrepancy is most likely due to a strain polymorphism. The absence of detectable z 2 3‘ homology in D. pseudoobscura indi- cates that the 3‘ half of z 2 has diverged beyond detectability or that z 2 does not exist in D. pseudoob- scura.

The 5’ z 2 probe (z2 5’), which contains a homeo- box, hybridizes weakly to an apparently single ge- nomic locus in D. pseudoobscura (Figure 4C) rather than zen (see below). The origin of this signal at 14 kb (BamHI) , 5.0 kb (HindI I I ) and 2.8 kb (EcoRI) is unclear (Figure 4C). No comparable band is observed when D. pseudoobscura genomic DNA, digested with EcoRI, is probed with the zen cDNA (Figure 4A). The z 2 5‘ probe also produces a weak signal at the pre- dicted zen positions in D. pseudoobscura, and this prob- ably derives from homeobox cross-hybridization (Fig- ure 4C). Taken together, the absence of detectable z 2 3’ homology and the presence of weak z2 5’ homology suggests at least two alternative possibilities. The first is that z2 is no longer present in the D. pseudoobscura genome, but another related gene with detectable homeobox homology to D. melanogaster z 2 resides within the genome. Alternatively, a 22 homolog is present outside of the pb-bcd interval of D. pseudoob- scura, but has diverged significantly in nearly half of its coding region. Given the high degree of DNA sequence conservation found outside of the homeo- box for the other ANT-C genes ( A n t p , f t z , Scr, Dfd, bcd, zen, pb and lab) in both protein coding and nontranslated regions (Figure 1 ; SEECER and KAUF- MAN 1990 for bcd) and given that homology to the 3’ half of 22 is undetectable in D. pseudoobscura, it seems more likely that a 22 homeobox subfamily member exists in D. pseudoobscura rather than a true z 2 hom-

The D. pseudoobscura Dfd gene is inverted with respect to its homolog in D. melanogaster: One inter- esting aspect of ANT-C organization in D. melano- gaster is the common 5’ to 3‘ orientation of each of the member homeobox genes except for Dfd and f t z .

olog.

However, in contrast, in D. pseudoobscura the 5’ to 3’ orientation of the Dfd gene is in phase with the other homeobox-containing genes of the psANT-C (Figure 2). It seems that an inversion between ama and Scr has occurred in one of these two species. This is demonstrated by Southern hybridization data using separate D. melanogaster exon-l/exon-2 and exon-3/ exon-4 probes derived from Dfd (data not shown) and by DNA sequence from p431 and p475.

In contrast to Dfd, the remaining psANT-C genes remain oriented as in D. melanogaster. DNA sequence analysis from phage p 15 reveals that D. pseudoobscura ftz remains in the same 5’ to 3’ orientation relative to the complex as its homolog in D. melanogaster (Figure 1). Additionally, DNA sequence data demonstrate that the orientations of the remaining genes in psANT-C are unchanged with respect to their homo- logs in D. melanogaster (Figure 1). Since only Southern blot analysis was performed on the cuticle-like genes in the psANT-C, their 5’ to 3’ orientations are un- known.

Intergenic distances and large intron lengths are conserved across species: One defining characteristic of the ANT-C is the proximity of its member genes. Has the distance between adjacent genes changed over 46 million years of evolution? Measurements of the distances between selected positions (sequenced re- gions are identified in Figure 2) within adjacent genes demonstrate that there is < lo% difference in the size of the complexes in the two species (data not shown). Even the distance between pb and zen has remained relatively constant despite the apparent absence of z2 from this interval in D. pseudoobscura. Additionally, the relative positions of the outer boundaries of the cuticle-like gene homology are similar to those in D. melanogaster (Figure 2).

An unusual property of the D. melanogaster hom- eotic genes is the large size of the transcription unit compared with the processed mRNA product. This discrepancy is due to the presence of one or more large introns within these genes (see Figure 2). T o determine whether intron length has changed be- tween these two species, the distances between se-

follows all 5’ splice sites are darkened. Asterisks overlie D. melanogaster intronic sequences. Amino acid sequences overlie the D. melanogaster open-reading frame (ORF). The conceptually translated D. pseudoobscura amino acid sequence is identical to D. melanogaster unless otherwise designated below. The D. melanogaster sequences were obtained from the following sources: Antp SCHNEUWLY et al. 1986 (from EMBL- DROANTPE5);ftz. LAUCHON and SCOTT 1984; Scr, LEMOTTE et al. 1989; Dfd RECULSKI et al. 1987 (from EMBL or GENEBANK- DROANTDFD); ren, RUSHLOW et al. 1987; pb, CRIBB~ et al. 1992; lab, DIEDERICH et al. 1989). The genomic clones and subclones from which the D. pseudoobscura sequences were obtained are indicated above each comparison set. The located of these sequences on the molecular map are indicated in Figure 2. All of the sequence comparisons show high DNA sequence identity and high amino acid identity. Most of the nucleotide changes occur at a third position within codons and therefore usually do not result in amino acid changes (A)-(K). Listed for each DNA sequence comparison is the origin of the D. melanogaster sequence, followed by the origin of the D. pseudoobscura seuqence: (A) lab (exon-1 region) X p543 1.1 kb EcoRIIBamHI; (B) lab (exon-2 region) X p503 0.6 kb EcoRI; (C) pb (exon-2 region) X p300 1.4 kb HindIII; (D) pb (exon-8 region) X p202 1.3 kb EcoRI; (E) ren (exon-2 region) X p3 10 1.2 kb BamHI/HindIII; (F) ama (exon-2 region) X p401 0.75 kb BamHI/EcoRI (this sequence from SEECER 1989); (C) Dfd (exon-4 region) X p 431 1.0 kb HincIII; (H) Dfd (exon-2 region) p 474 1.1 kb BamHI; (I) Scr (exon-2 region) X p69 3.5 kb EcoRI; (J) Scr (exon-3 region) X p73 2.3 kb EcoRI and p90 1.7 kb EcoRI; (K)& (exon-1 region) X p15 2.3 kb XbaI/ClaI; (L) Antp (exon-8 region) pl105 3.0 kb EcoRI.

Page 6: Structural Changes in the Antennapedia Complex of ... · Antennapedia Complex Evolution 32 1 1984). The probes were made using the random primer a’* P method (FEINBERG and VOGELSTEIN

324 F. M. Randazzo et al.

(4 3'

cuticlelike cluster

+5 5

FIGURE 2.-Molecular map of ANT-C in D. melunoguster with corresponding D. pseudoobscuru genomic clones. The molecular map of ANT-C in D. melunoguster, depicted above the coordinate line (Scorn et ul. 1983; SEEGER, HAFFLEY and KAUFMAN 1988; DIEDERICH et ul. 1989), is divided into four segments, (a)-(d). The first segment (a) starts at the telomere-proximal end of the complex and the last segment (d) ends after the 3' most exon of Antennupediu. Exons are depicted as shaded boxes. Shown below the coordinates (kb of DNA) are the D. pseudoobscuru genomic clones obtained from the chromosome walk. Hind111 restriction sites found within the genomic clones are depicted as thin vertical bars. The thicker vertical bars at the ends of each clone are lambda phage cloning sites (see MATERIALS AND METHODS for description). Lines connecting the D. melanoguster molecular map to the D. pseudoobscuru clones indicate locations of homology as determined by DNA sequence analysis. The hatched bar located directly below the coordinates in (a) delineate the region of Southern blot homology in D. pseudoobscuru to D. melunoguster (ANT-C cuticle gene-like probes). The bcd and umu positions are from SEECER and KAUFMAN (1990) and S E E G E R ( ~ ~ ~ ~ ) .

Page 7: Structural Changes in the Antennapedia Complex of ... · Antennapedia Complex Evolution 32 1 1984). The probes were made using the random primer a’* P method (FEINBERG and VOGELSTEIN

Antennapedia Complex Evolution 325

H

9.6 B 3.1 B 128 ?.6 B 3.8R 3.S R 2.4 R S2R 8.3 R

(b) D. pseudoobscunr

lkb - 7.1 R 10.4 R 1 2 R 4 2 R

1.6 H 1.0 H 8.7 H 28H &OH 1.SH H H H R H H R R HR H

8.5 B B a

6.6 B 4.7 B 7.4 B ”_ e

FIGURE 3.-Restriction maps of the pb-zen interval in D. rnelano- gaster and D. pseudoobscura. Restriction map of the pb-ren interval in (a) D. melanogaster and (b) D. pseudoobscura (SEEGER 1989), (B) BamHI, (H) Hind111 and (R) EcoRI. The stippled boxes represent exon-2 of pb, the striped box represents the 22 transcription unit, the black boxes represent Zen, and the white box represent bcd. Drawn, with arrows, above the restriction maps are the boundaries of the genomic library clones from the respective chromosome walks. Note that an artifical EcoRI restriction site lies at each of those boundaries.

quenced positions within exons that border large in- trons were compared (Figure 2). Dramatic changes are not observed for Scr (1 %), Dfd (2%) and lab (7%). On the other hand, a moderate change is observed for pb (1 3%). Clearly, gross changes in intron size and intergenic spacing have not occurred during the di- vergence of these two species.

DISCUSSION

We report here the cloning of the ANT-C from Drosophila pseudoobscura. The homologs to the D. melanogaster Antp, ftz, Scr, Dfd, ama, bcd, zen, pb, and lab genes, but not z2 are all present in the D. pseu- doobscura complex (psANT-C), in the same linear order and similarly spaced along the chromosome. The similar arrangement of ANT-C homeotic genes in organisms ranging from D. melanogaster to D. pseu- doobscura to beetles, mice and humans suggests a functional importance to this organization of genes that natural selection has maintained.

A small chromosomal inversion reveals some flexibility in ANT-C structure: The extraordinary positional conservation of homeotic genes within the murine Hox complex, Drosophila ANT-C and BX-C,

and the beetle HOM-C suggests that the structures of these complexes are relatively rigid and incapable of undergoing rearrangements that survive evolutionary selection. The observed inversions of the Dfd gene in D. pseudoobscura and the f t z transcription unit in Dro- sophila hydei (relative to D. melanogaster) demonstrate that subtle rearrangements can occur without com- promising gene function (this study and MAIER, PREISS and POWELL 1990). Whether these observed inver- sions indicate the potential for other rearrangements that would split the complex is more difficult to de- termine. It is quite possible that cis-regulatory se- quences are rearranged by these inversions, but these alterations have little effect due to the ability of en- hancer elements to work 5’ or 3’ of a gene. Thus, a translocation with an identical breakpoint could dis- rupt proper gene regulation and result in lethality. Since the cis-regulatory elements for Dfd have not been fully defined and the precise extents of these inversions are not known, the impact of these inver- sions on the rigidity of the ANT-C is difficult to assess completely.

Interestingly, the inversion of Dfd and ftz, in D. pseudoobscura and D. hydei, respectively, likely repre- sents the orientation of these genes in an ancestral complex. The homeobox genes of the ANT-C pre- sumably arose from a series of duplication and diver- gence events that must have occurred before the ancestral split between mammals and arthropods. Consistent with this gene duplication and divergence model, one characteristic of the mouse Hox-2 com- plex is the identical direction of transcription for all the homeobox-containing member genes (RUBOCK et al. 1990). In D. melanogaster all the homeobox con- taining genes are transcribed in the same 5’ to 3’ direction except for Dfd andftz (KAUFMAN, SEEGER and OLSEN 1990). After the duplication events that gave rise to Dfd andftz, separate inversion events most likely resulted in the reversal of orientation of these two genes somewhere in the D. melanogaster lineage. Whether there is a Drosophilid species that has all ANT-C homeobox containing genes with the same direction of transcription, as is seen in the mouse Hox- 2 complex, remains unknown and would require the analysis of additional species.

The nonhomeotic genes of the ANT-C are con- served in D. pseudoobscura: A number of nonho- meotic genes reside within the ANT-C of D. melano- gaster. These include the homeobox-containing gene Zen, bcd and f t z , and the nonhomeobox-containing genes ama and the cuticle-like gene cluster. The origin of the homeobox containing genes zen, z 2 (see below for discussion), bcd, and f t z presumably derives from duplication and divergence events within the ancestral complex. It has been postulated that the lab and Abd- B genes are among the original homeobox genes of

Page 8: Structural Changes in the Antennapedia Complex of ... · Antennapedia Complex Evolution 32 1 1984). The probes were made using the random primer a’* P method (FEINBERG and VOGELSTEIN

326 F. M. Randazzo el al.

(A) zen cDNA probe (B) z z 3' Probe (c) 22 5' probe 0. mel. 0. pso. 0. mel. 0. pso. D. me/. 0. pso. mG-2 mxm

1.2"

l=d 4.7- 5"> "1

2.0-->

the ancient complex (CRIBBS et al. 1992). If this hy- pothesis is correct, then zen, 22, bcd and f t z would represent more recent additions to the ANT-C.

All of these homeobox-containing genes (except 22 (see below)) are found in the psANT-C complex. In addition, bothftz and bcd have been identified in other Diptera: f t z in D. hydei (MAIER et al. 1990) and bcd in seven different Drosophilid species (MACDONALD 1990; SEEGER and KAUFMAN 1990) and in Musca domestica (SOMMER and TAUTZ 1991). The question whether any of these genes has a homolog outside the Diptera remains unanswered. Attempts to identify a

j z homolog in the red flour beetle and honeybees have been to date unsuccessful (STUART et al. 1991 ; WALLDORF, FLEIC and GEHRING 1989). It is also the case that clear homologs of these four genes have yet to be identified within the murine Hox and human HOX complexes (see MCGINNIS and KRUMLAUF 1992). Thus, it is tempting to speculate that these genes represent recent additions to the ANT-C and may be limited to the Diptera. However more exten- sive analyses of the nondipteran examples will be required to demonstrate this point.

The nonhomeobox containing genes of the ANT- C have been conserved in the psANT-C as well. Hom- ologs to ama and the cuticle-like gene cluster are found in the psANT-C at positions similar to their D. melanogaster counterparts. Although deficiencies re- moving the cuticle-like gene cluster or ama do not have major phenotypic consequences (PULTZ 1988; M. A. SEECER, T. C. KAUFMAN and C. GOODMAN, in preparation), the conservation of these genes in D.

FIGURE 4.-Genomic Southern hybridiza- tions of D. melanogasterand D. pseduoobscura DNA: searching for 22 homology. Southern blot of restriction enzyme digested genomic D. mel- anogaster (D. mel.) Oregon R and D. pseudoob- scura (D. pso) DNA-(B) BamHI. (H) Hindlll, (R) EcoRI. DNA size markers consisted of a 1 : l mixture of lambda phage DNA digested with Hind111 and with Hind111 and EcoRI. Random primed S2P-labeled D. melanogaster (A) Zen pB.2 cDNA (1.2 kb EcoRI), (B) 22 3' (0.262 kb EcoRI/ Sall) , and (C) 22 5' (0.445 kb Sall/XbaI) probes were hybridized to identical Southern blots for greater than 36 hr using the low stringency conditions of MCGINNIS et al. (1984). Specific activity of the probes: (A) 7.1 X IO8 cpm/pg DNA; (B) 5.2 X 10" cpm/pg DNA and (C) 5.6 X 10" cpm/pg DNA. After hybridization, the blots were washed under very low-stringency condi- tions (2X ssc, 50" (McCrNNlset a[. 1984). Low- stringency hybridization and wash conditions plus excess lambda marker DNA relative to any given genomic DNA sequence resulted in strong signals eminating from some of the size markers. Proper DNA transfer to the Nytran filters in the D. pseudoobscura lanes is evident from the equal and even smearing caused by nonspecific hybrid- ization of probe to the genomic DNA.

pseudoobscura may indicate their functional impor- tance. The presence of nonhomeobox genes has not been reported in the murine Hox or human HOX complexes or the beetle HOM-C. If they are truly absent from those complexes, it is likely that ama and the cuticle-like genes reside in the ANT-C as a result of recent insertion events and the insertion of these genes in the ANT-C may be another example of the disintegration of the homeotic gene complexes in Drosophila relative to other organisms (see MCGINNIS and KRUMLAUF 1992).

r2 gene evolution: The absence of 22 from the psANT-C, its apparent absence from the D. pseudoob- scura genome, and the lack of a detectable function for 22 in D. melanogaster raises some interesting ques- tions concerning the origin and function of the 22 gene. Because of the high degree of relatedness of Zen and 22 with respect to homeobox sequence (75% amino acid identity), gene structure (-1 0 10 bp vs. -1350 bp transcription unit with one small intron located just downstream of the ATG translation start site), and temporal and spatial regulation, these genes likely arose from a duplication of an ancestral gene followed by divergence in sequence (RUSHLOW et al. 1987). Since the nonhomeobox sequences of zen and 22 exhibit no amino acid sequence similarity, the divergence between these two genes has been substan- tial.

A number of possibilities could explain the apparent absence of 22 in D. pseudoobscura. The 22 gene could have existed before the divergence of the common ancestor to D. melanogaster and D. pseudoobscura, but

Page 9: Structural Changes in the Antennapedia Complex of ... · Antennapedia Complex Evolution 32 1 1984). The probes were made using the random primer a’* P method (FEINBERG and VOGELSTEIN

Antennapedia Complex Evolution 321

was then subsequently lost in the D. pseudoobscura lineage. Alternatively, 2 2 may have arisen as a zen duplication after the separation of the D. melanogaster and D. pseudoobscura lineages. Thus, 22 may never have been present in a D. pseudoobscura ancestor. This explanation is consistent with the absence of a detect- able function for z 2 in D. melanogaster. Clearly an analysis of other intermediate Drosophilid species will be necessary to more fully understand the origin of the z2-zen gene pair.

Besides the zen-z2 gene pair, two other examples of adjacent and highly related homeobox genes have been described in Drosophila. The gooseberry (gsb) locus consists of two adjacent genes, gooseberry-proxi- mal and gooseberry-distal, which possess highly related homeoboxes and other sequence similarities. Al- though the gsb-p gene product accumulates primarily in the central nervous system and the gsb-d gene product in the epidermis, the gsb genes do have some overlapping sites of expression. These genes presum- ably arose by a duplication and divergence event, as did zen and 22, but unlike the zen-22 situation, the gsb patterns of expression have diverged considerably. The engrailed and invected genes represent another set of adjacent homeobox genes that are highly related in sequence (COLEMAN et al. 1987). Similar to Len and z2, the functional contribution of each gene in these gene pairs has not been defined.

The creation of new genes to fulfill novel functions is an essential process in the evolution of more com- plicated organisms. The presence of these highly re- lated and adjacent genes, like zen and 2 2 , most likely represent a phase in the evolutionary development of new genes and functions. Over time some of these duplicated genes will acquire novel functions and be- come important or essential to the organisms while others will be lost. The zen-z2 gene pair offers an excellent opportunity to study this process. The loca- tion of zen and z 2 in a defined region, the ANT-C, our knowledge about their functions in D. melano- gaster, and the changes that have occurred during the divergence of D. melanogaster and D. pseudoobscura, make the analysis of the origin of zen and z 2 very accessible and attractive.

Why is the organization of the homeotic gene complexes conserved? The most striking feature of the homeotic gene complexes is the collinearity of gene order on the chromosome with their domains of expression along the anterior-posterior axis of the embryo. That this aspect of homeotic gene complex organization has been maintained from Drosophila to human, with over 500 million years of divergence between these species, is truiy remarkable. Experi- mental evidence that helps to explain this conserved organization is sparse. Several hypotheses, however, can be proposed.

First, important transcriptional regulatory elements might be shared or interspersed within the complex. These cis-acting elements could be traditional en- hancer sequences that are interspersed among and used by multiple genes, or they could represent ele- ments that organize chromatin structure throughout the complex (see MIRKOVITCH, MIRAULT and LAEM- MLI 1984; GASSER and LAEMMLI 1986; GYURKOVICS et al. 1990; KELLUM and SCHEDL 199 1). In the murine Hox complex, preliminary evidence suggests that reg- ulatory elements may be interspersed between genes, thereby potentially restricting the fixation of inversion events (RUBOCK et al. 1990; RENUCCI et al. 1992). In D. melanogaster, theftz gene is embedded within Scr regulatory regions (PATTATUCCI, OTTESON and KAUF- MAN 199 I ) , and there is evidence that abd-A and Abd- B of the BX-C share some enhancer-like regions (CEL- INKER et al. 1990). However, further evidence for interspersed regulatory elements is lacking for other ANT-C and BX-C genes. In fact, the three homeotic genes of the BX-C can be separated genetically with- out detectably compromising functions (STRUHL 1984; TIONG, WHITTLE and GRIBBIN 1987; however, see KARCH et al. 1985 for contradictory evidence), and there are numerous inversions in the ANT-C that break within one gene but fail to affect other adjacent genes, suggesting that genes of the BX-C and ANT- C may not share regulatory elements (ABBOTT and KAUFMAN 1986; PULTZ et al. 1988; DIEDERICH et al. 1989). Additionally, p b and Zab null mutations have been rescued using transgenes that do not contain DNA derived from adjacent functional genes (RAN- DAZZO, CRIBBS and KAUFMAN 199 I ; CHOUINARD and KAUFMAN 1991). These data indicate that inter- spersed regulatory elements, if present, are not essen- tial for proper gene functions, although they may contribute a fine-tuning role that increases the overall fitness of the organism.

The role of chromatin structure in the regulation of ANT-C or BX-C genes is not well defined. Chro- matin scaffolding attachment sites (SARS) have been experimentally defined for sequence surrounding the f t z locus (GASSER and LAEMMLI 1986), and consensus sequences characteristic of SARS have been identified in upstream regulatory regions of the labial locus (S. CHOUINARD and T. C. KAUFMAN, unpublished obser- vations). In addition, synapsis dependent regulation (transvection effects) has been observed for a number of homeotic genes, including Scr in the ANT-C (PAT- TATUCCI and KAUFMAN 1991). In certain genetic back- grounds, the homologous chromosomes must be able to pair for proper Scr transcriptional regulation. Re- arrangements that prevent pairing disrupt Scr regu- lation. These observations reveal a nonessential re- quirement for transcriptional regulation that is de-

Page 10: Structural Changes in the Antennapedia Complex of ... · Antennapedia Complex Evolution 32 1 1984). The probes were made using the random primer a’* P method (FEINBERG and VOGELSTEIN

328 F. M . Randazzo et al.

pendent on chromosome organization and possibly chromatin architecture.

The complexity of regulatory elements surrounding the homeotic genes in combination with the proximity of these cis-acting sequences to adjacent genes could contribute to the conservtion of the homeotic gene complexes. These two factors, complexity and prox- imity, would substantially increase the probability that a random chromosomal rearrangement would disrupt proper gene regulation and thus would be eliminated from the gene pool. The observation that in D. mela- nogaster Scr regulatory sequences extend minimally to the 3’ end and possibly within the Ant@ transcription unit, with the ftz locus also embedded in this region, is an excellent example of this complexity and prox- imity (PATTATUCCI, OTTESON and KAUFMAN 1991). Other regions of the ANT-C are not as “dense” (see Figure 2). For example, the distances from ama to Dfd and Dfd to Scr are seemingly quite extensive, although the extent of cis-regulatory sequences for these genes has not been precisely defined. A variation of this hypothesis, proposed by RANDAZZO, CRIBBS and KAUFMAN (1991) as the “prisoner effect,” suggests that rearrangement events that disrupt the complexes are unlikely due to potential deleterious position-ef- fects on homeotic loci. Ectopic expression of homeotic genes often results in lethality. Unlike the first hy- pothesis, where the complexes are conserved for ad- vantageous functional reasons, these latter structural hypotheses suggest that the conservation of the com- plexes is a function of their organization and the improbability of changing this organization without deleterious consequences.

What insights, with respect to these hypotheses, are gained from our knowledge of the structure of the ANT-C in D. pseudoobscura and the description of homeotic genes complexes in other more diverse spe- cies? Although the overall conservation of the hom- eotic complexes is consistent with the predictions of either hypothesis, the nature of this conservation does potentially suggest a distinction between the two ra- tionales. The feature of the homeotic complexes that is conserved in these different organisms is the colin- earity of homeotic gene order on the chromosome with their domains of expression in the embryo. Other aspects of the complexes, including the number of genes, types of genes, direction of transcription, gene size and spacing, are relatively dynamic features. Some of these features have changed in D. pseudoobscura ( e .g . , direction of transcription for Dfd and the absence of z2), wherreas others have changed only in more diverse species (e .g . , gene number and size in the Hox complexes). These dynamic aspects of the homeotic complexes are more consistent with the hypothesis that the complexes exist for some functional advan- tage and are less consistent with the suggestion that

the complexes are conserved as a consequence of their organizational complexity. Although it is likely that nonessential but advantageous regulatory elements are interspered within the homeotic complexes, offer- ing a selective advantage to those who maintain this organization, it is also possible that the complexity of cis-regulatory sequences and the deleterious conse- quences to ectopic expression contribute to the ob- served rigidity of the complexes as well. Of course, these evolutionary comparisons are merely suggestive, and the proof of these hypotheses will depend on further experimental analysis.

We are grateful to CHARLES LANGLEY and TULLE HAZELRIGG for the D. pseudoobscura genomic libraries. We thank DAVID CRIBBS for the pb exon-2 sequence in D. pseudoobscura sequence, DWAYNE JOHNSON for performinga number of phage preps, SANDI HORIKAMI for the D. melanogaster genomic DNA, KIM FECHTEL for the cuticle gene probes and DEE VEROSTKO for her priceless administrative work. We also thank the dissertation committee of F.M.R., a.k.a. THOMAS BLUMENTHAL, MARC MUSKAVITCH, ELIZABETH RAFF and SUSAN STROME, for critical reading of this work. This work was supported by a National Institutes of Health Predoctoral Fellowship (GMO7757) to F.M.R. and M.A.S. and an NIH grant (GM24299) to T.C.K. T.C.K. is an investigator of the Howard Hughes Medical Institute.

LITERATURE CITED

ABBOTT, M. K., a n d T . C. KAUFMAN, 1986 T h e relationship between the functional complexity and the molecular organi- ration of the Antennapedia locus of Drosophila melanogaster. Genetics 114: 919-942.

AKAM, M., 1989 HOX and HOM: homologous gene clusters in

BEEMAN, R. W., J. J. STUART, M. S. HAAS and R. E. DENELL, 1989 Genetic analysis of the homeotic gene complex (HOM- C) in the beetle Tribolium casteneum. Dev. Biol. 133: 196-209.

BENTON, M. J., 1990 Phylogeny of the major Tetrapod groups: morphological data and divergence dates. J. Mol. Evol. 30: 409-424.

BERLETH, T., M. BURRI, G. THOMA, D. BOPP, S. RICHSTEIN, G. FRIGERIO, M. NOLL and C. NUSSLEIN-VOLHARD, 1988 The role of localization of bicoid RNA in organizing the anterior pattern of the Drosophila embryo. EMBO J. 7: 1749-1 756.

BEVERLY, S. M . , and A. C. WILSON, 1984 Molecular evolution in Drosophila and the higher Diptera 11. A time scale for fly evolution. J. Mol. Evol. 21: 1-13.

BONCINELLI, E., R. SOMMA, D. ACADMPRA, M. PANNESE, M. D’ES- POSITO, A. FAIELLA and A. SIMONE, 1988 Organization of human homeobox genes. Hum. Reprod. 3: 880-886.

CELNIKER, S. E., S. SHARMA, D. J. KEELAN and E. B. LEWIS, I990 T h e molecular genetics of the bithorax complex of Drosophila: cis-regulation in the Abdominal43 domain. EMBO J. 9: 4277-4286.

CHOUINARD, S., and T. C. KAUFMAN, 1991 Control of the expres- sion of the homeotic labial (lab) locus of Drosophila melano- gaster: evidence for both positive and negative autogenous regulation. Development 113: 1267-1 280.

COLEMAN, K. G., S. J. POOLE, M. P. WEIR, W. C. SOELLER and T . KORNBERG, 1987 T h e inuected gene of Drosophila: sequence analysis and expression studies reveal a close kinship to the engrailed gene. Genes Dev. 1: 19-28.

CRIBBS, D. L., M. A. PULTZ, D. JOHNSON, M. MAZZULLA and T. c. KAUFMAN, 1992 Structural complexity and evolutionary con-

insects and vertebrates. Cell 57: 347-349.

Page 11: Structural Changes in the Antennapedia Complex of ... · Antennapedia Complex Evolution 32 1 1984). The probes were made using the random primer a’* P method (FEINBERG and VOGELSTEIN

Antennapedia Complex Evolution 329

servation of the Drosophila homeotic gene proboscipedia. EMBO

DEVEREUX, J., P. HAEBERLIand 0. SMITHIES, 1984 A comprehen- sive set of sequence analysis programs for the VAX. Nucleic Acids Res. 12: 387-395.

DIEDERICH, R. J., V. K. L. MERRILL, M. A. PULTZ and T . C. KAUFMAN, 1989 Isolation, structure, and expression of labial, a homeotic gene of the Antennapedia Complex involved in Drosophila head development. Genes Dev. 3: 399-41 4.

DOBZHANSKY, T. H. and A. H. STURTEVANT, 1937 Inversions in the chromosomes of Drosophila pseudoobscura. Genetics 23: 28- 64.

DUBOULE, D., and P. DOLLE, 1989 The structural and functional organization of the murine HOX gene family resembles that of Drosophila homeotic genes. EMBO J. 8: 1497-1505.

DUNCAN, I . , 1987 The bithorax complex. Ann. Rev. Genet. 21:

FECHTEL, K., D. K. FRISTROM and J. W. FRISTROM, 1989 Prepupal differentation in Drosophila: distinct cell types elaborate a shared structure, the pupal cuticle, but accumulate transcripts in unique patterns. Development 1 0 6 649-656.

FECHTEL, K., J. E. NATZLE, E. E. BROWN and J. W. FRISTROM, 1988 Prepupal differentation of Drosophila imaginal disks: identification of four genes whose transcripts accumulate in response to a pulse of 20-hydroxyecdysone. Genetics 120: 465- 474,

FEINBERG, A. P. and B. VOGELSTEIN, 1983 A technique for radi- olabeling DNA restriction endonuclease fragments to high specific activity. Anal. Biochem. 132: 6- 13.

FRIGERIO, G., M. BURRI, D. BOPP, S. BAUMGARTNER and M. NOLL, 1986 Structure of the segmentation gene paired and the Drosophila PRD gene set as part of a gene network. Cell 47:

GASSER, S. M., and U . K. LAEMMLI, 1986 Cohabitation of scaffold binding regions with upstream/enhancer elements of three developmentally regulated genes of D. melanogaster. Cell 46:

GRAHAM, A., N. PAPLOPULU and R. KRUMLAUF, 1989 The murine and Drosophila homeobox gene complexes have common fea- tures of organization and expression. Cell 57: 367-378.

GYURKOVICS, H., J. GAUSZ, J. KUMMER and F. KARCH, 1990 A new homeotic mutation in the Drosophila bithorax complex removes a boundary separating two domains of regulation. EMBO J. 9: 2579-2585.

IZPISUA-BELMONTE, J.-C., C. TICKLE, P. DOLLE, L. WOLPERT and D. DUBOULE, 199 1 Expression of the homeobox Hox-4 genes and the specification of position in chick wing development. Nature 3 5 0 585-589.

KAPPEN, C., K. SCHUGHART and F. H. RUDDLE, 1989 Two steps in the evolution of Antennapedia-class vertebrate homeobox genes. Proc. Natl. Acad. Sci. USA 86: 5459-5463.

KARCH F., B. WEIFFENBACH, M. PEIFER, W. BENDER, I. DUNCAN, S. CELNIKER, M. CROSBY and E. B. LEWIS, 1985 The abdominal region of the bithorax complex. Cell 43: 81-96.

KAUFMAN, T. C., R. LEWIS and B. WAKIMOTO, 1980 Cytogenetic analysis of chromosome 3 in Drosophila melanogaster. The hom- eotic gene complex in polytene chromosome interval 84A-B. Genetics 94: 115-133.

J. 11: 1437-1449.

285-3 19.

735-746.

521-530.

KAUFMAN, T. C., M. A. SEEGER and G. OLSEN, 1990 Molecular and genetic organization of the Antennapedia gene complex of Drosophila melanogaster, pp. 309-362 in Advances in Genetics, edited by T. WRIGHT. Academic Press, New York.

KELLUM, R., and P. SCHEDL, 1991 A position-effect assay for boundaries of higher order chromosomal domains. Cell 64:

LAUGHON, A., and M. P. SCOTT, 1984 Sequence of a Drosophila 941-950.

segmentation gene: protein structure homology with DNA- binding proteins. Nature 3 1 0 25- 31.

LEMOTTE, P. K., A. KUROIWA, L. I. FESSLER and W. J. GEHRING, 1989 The homeotic gene Sex combs reduced of Drosophila: gene structure and embryonic expression. EMBO J. 8: 219- 227.

LEWIS, E. B., 1978 A gene complex controlling segmentation in Drosophila. Nature 2 7 6 565-570.

MACDONALD, P. M., 1990 bicoid mRNA localization signal: phy- logenetic conservation of function and RNA secondary struc- ture. Development 110: 161- 171.

MAHAFFEY, J. W., and T. C. KAUFMAN, 1987 The homeotic genes of the Antennapdeia and bithorax complex of Drosophila mel- anogaster, pp. 329-359 in Developmental Genetics of Higher Organisms: A Primer in Developmental Biology, edited by G. M. Malacinski. Macmillan, New York.

MAIER, D., A. PREISS and J. R. POWELL, 1990 Regulation of the segmentation gene fushi tarazu has been functionally conserved in Drosophila. EMBO J. 9: 3957- 3966.

MCGINNIS, W., R. L. GARBER, J. WIRZ, A. KUROIWA and W. J. GEHRING, 1984 A homologous protein-coding sequence in Drosophila homeotic genes and its conservation in other me- tazoans. Cell 37: 403-408.

MCGINNIS, W., and R. KRUMLAUF, 1992 Homeobox genes and axial patterning. Cell 66: 283-302.

MIRKOVITCH, J., M. F. MIRAULT and U . K. LAEMMLI, 1984 Organization of the higher-order chromatin loop: specific DNA attachment sites on nuclear scaffold. Cell 39: 223-232.

PATTATUCCI, A. M.. and T. C. KAUFMAN, 1991 The homeotic gene Sex combs reduced of Drosophila melanogaster is differen- tially regulated in the embryonic and imaginal states of devel- opment. Genetics 129: 443-461.

PATTATUCCI, A. M., D. C. OTTESON and T. C. KAUFMAN, 1991 A functional and structural analysis of the Sex combs reduced locus of Drosophila melanogaster. Genetics 129: 423-441.

PEIFER, M., F. KARCH and W. BENDER, 1987 The bithorax cam- plex: control of segmental identity. Genes Dev. 1: 891-898.

PULTZ, M. A,, 1988 A molecular and genetic analysis of probos- cipedia and flanking genes in the Antennapedia Complex of Drosophila melanogaster. Ph.D. Thesis, Indiana University, Bloomington.

PULTZ, M. A., R. J. DIEDERICH, D. L. CRIBBS and T. C. KAUFMAN, 1988 The proboscipedia locus of the Antennapedia Complex: a molecular and genetic analysis. Genes Dev. 2: 901-920.

RANDAZZO, F. M., D. L. CRIBBS and T . C. KAUFMAN, 1991 Rescue and regulation ofproboscipedia: a homeotic genes of the Anten- napedia Complex. Development 113: 257-27 1.

REGULSKI, M., N. MCGINNIS, R. CHADWICK and W. MCGINNIS, 1987 Developmental and nlolecular analysis of Deformed: homeotic gene controlling Drosophila head development. EMBO J. 6: 767-777.

RENUCCI, A., V. ZAPPAVIGNA, J. ZAKANY, J.-C. IZPISUA-BELMONTE, K. BURKI and D. DUBOULE, 1992 Comparison of mouse and human HOX-4 complexes defines conserved sequences in- volved in the regulation ofHox-4.4. EMBO J. 11: 1459-1468.

Ross, M. H., and A. TANAKA, 1988 Genetic variability in the german cockroach XII: a third mutant that suggests chrom- some 9 carries a highly conserved group of closely linked genes. J. Hered. 79: 439-443.

RUBOCK, M. J., 2. LARIN, M. COOK, N. PAPALOPULU, R. KRUMLAUF and H. LEHRACH, 1990 A yeast artificial chromosome con- taining a mouse homeobox cluster Hox-2. Proc. Natl. Acad. Sci. USA 87: 4751-4755.

RUSHLOW, C., H . DOYLE, T . HOEY and M. LEVINE, 1987 Molecular chracterization of the zerknullt region of the Antennapedia gene complex in Drosophila. Genes Dev. 1:

SCHNEUWLY, S., A. KUROIWA, P. BAUMGARTNER and W. J. GEHR- ING, 1986 Structural organization and sequence of the hom-

1268-1279.

Page 12: Structural Changes in the Antennapedia Complex of ... · Antennapedia Complex Evolution 32 1 1984). The probes were made using the random primer a’* P method (FEINBERG and VOGELSTEIN

330 F. M . Randazzo et al.

eotic gene Antennapedia of Drosophila melanogaster. EMBO J. 5: 733-739.

SCOTT, M. P., J. W. TAMKUN and G . W. HARTZELL, 1989 The structure and function of the homeodomain. Biochim. Biophys. Acta 989: 25-48.

SCOTT, M. P., A. J. WEINER, T. I . HAZELRIGC, B. A. POLISKY, V. PIRROTTA, F. SCALENCHE andF T. C. KAUFMAN, 1983 The molecular organization of the Antennapedia locus of Drosoph- ila. Cell 35: 763-776.

SEECER, M . A,, 1989 Molecular and genetic analysis of the bicoid- zerknullt interval of the Antennapedia gene complex in Dro- sophila melanogaster. Ph.D. Thesis, Indiana University, Bloom- ington.

SEECER, M. A,, L. HAFFLEY and T. C. KAUFMAN, 1988 Characterization of amalgam: a member of the immunoglobulin superfamily from Drosophila. Cell 55: 589-600.

SEEGER, M. A., and T. C. KAUFMAN, 1990 Molecular analysis of the bicoid gene from Drosophila pseudoobscura: identification of conserved domains within coding and noncoding regions of the bicoid mRNA. EMBO J. 9: 2977-2987.

SOMMER, R. and D. TAUTZ, 1991 Segmentation gene expression in the housefly Musca domestica. Development 113: 419-430.

STEINEMANN, M., W. PINSKER and D. SPERLICH, 1984 Chro- mosome homologies within the Drosophila obscura group probed by i n situ hybridization. Chromosoma 91: 46-53.

STRUHL, G . , 1984 Splitting the bithorax complex of Drosophila. Nature 308: 454- 457.

STUART, J., S. J. BROWN, R. W. BEEMAN and R. E. DENELL, 1991 A deficiency of the homeotic complex of the beetle Tribolium. Nature 350: 72-74.

SrURTEVANT, A. H., and C. C. TAN, 1937 The comparative genes of Drosophila pseudoobscura and D. melanogaster. J. Genet.

TIONG, S. Y. K., J. R. S. WHITTLE and M. C. GRIBBIN, 1987 Chromosomal continuity in the abdominal region of the bi- thorax complex of Drosophila is not essential for its contribu- tion to metameric identity. Development 101: 135-142.

WAKIMOTO, B. T., F. R. TURNER and T. C. KAUFMAN, 1984 Defects in embryogenesis in mutants associated with the Anten- napedia gene complex of Drosophila melanogaster. Dev. Biol.

WALLDORF, U., R. FLEIC and W. J. GEHRING, 1989 Comparison of homeobox-containing genes of the honeybee and Drosoph- ila. Proc. Natl. Acad. Sci. USA 86: 9971-9975.

WEDDEN, S. E., K. PANG and G. EICHELE, 1989 Expression pattern of homeobox containing genes during chick embryogenesis. Development 105: 639-650.

WEINER, A. J., M. P. SCOTT and T . C. KAUFMAN, 1984 A molec- ular analysis of fushi tarazu, a gene in Drosophila melanogaster that encodes a product affecting segment number and cell fate. Cell 37: 843-851.

34: 4 15- 432.

102: 147-172.

Cornmunicating editor: R. E. DENELL