LOCUS HUMA1ACMB 559 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human alpha-1-antichymotrypsin gene, exon 1. ACCESSION M18035 KEYWORDS alpha-1-antichymotrypsin; plasma serine protease; serpin. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 559) AUTHORS Bao,J.-j., Sifers,R.N., Kidd,V.J., Ledley,F.D. and Woo,S.L.C. TITLE Molecular evolution of serpins: Homologous structure of the human alpha-1-antichymotrypsin and alpha-1-antitrypsin genes JOURNAL Biochemistry 26, 7755-7759 (1987) STANDARD full staff_entry FEATURES from to/span description pre-msg 401 > 559 alpha-1-antichymotrypsin, exon 1 /nomgen="AACT" /map="14q32.1" /hgml_locus_uid="LE0075G" IVS 458 > 559 AACT intron A BASE COUNT 172 A 104 C 125 G 158 T ORIGIN Chromosome 14q32.1. 1 CTGTCTCAAA ATAAAAATAA AAAATAAAAA GAAATAAAAA AGAAATATAC CAAAATGTTA 61 GCTGGGGTCT TCTCTGGGTA GTAAAGTGCT GGGGGATATT TTCCAAAGTC CTTCTTTACA 121 TTCTCTGAGT TTTTCCATGT TCTTCAATGA GTATTTAATA AGCAGATAAA AACTAATACA 181 ACAAAGGATT TTTTCTGTGT GCTTTTTTGA CCTTGGAGGA AGAGATTAGA GCTAGTCCCA 241 TAACCAGGTT ATTTGAGTAG GTCTAACAAG CCCTATTACC AGAAATTATC ATCTGGTCAT 301 TTCCAGTCCG AGAACAGAAC ACTTGGTTGT CCTGGCATTT CCCAAGCAGT GGGAGGAGTT 361 CTCTGCAGGA ATAAATAAGC CTCAGCATTC ATGAAAATCC ACTACTCCAG ACAGACGGCT 421 TTGGAATCCA CCAGCTACAT CCAGCTCCCT GAGGCAGGTA ATCCATGATG TTTTACATCC 481 TGGGAGCGGA GGAATCTGTT TTTCCAGGAG AGTTTTAGGC AGCAGCCTGG AGTGTGTGGA 541 GTGTGAGGGG TAAGCAGAG // LOCUS HUMA1AR1 890 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human alpha-1-antitrypsin-related protein gene, exon 2. ACCESSION M19685 J03044 KEYWORDS alpha-1-antitrypsin-related protein. SEGMENT 1 of 2 SOURCE Human (patient homozygous for a pi-null allele of AAT) lymphocyte DNA, clone cATN13. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 890) AUTHORS Bao,J.-j., Reed-Fourquet,L., Sifers,R.N., Kidd,V.J. and Woo,S.L.C. TITLE Molecular structure and sequence homology of a gene related to alpha-1-antitrypsin in the human genome JOURNAL Genomics 2, 165-173 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by L.Reed-Fourquet, 01-MAR-1988. FEATURES from to/span description pept 162 + 813 alpha-1-antitrypsin-related protein, exon 2 /nomgen="PI" /map="14q32.1" /hgml_locus_uid="LX0081X" pre-msg < 1 > 890 PI mRNA and introns IVS < 1 157 PI intron A IVS 814 > 890 PI intron B BASE COUNT 232 A 240 C 204 G 214 T ORIGIN About 2 kb after segment 1; chromosome 14q32.1. 1 TATCTTCTAT CTTTCAATAT ATGTGTAGAA ATGTTTGCAA TGCTGGCACA TGTGATGTTT 61 AATAATTCAT GGTTGCAATC ACACAATGTC AGTCCCTGGT CACTTGGGAG TCATCATGTG 121 CCCTGACTTG GGCCTGCCCT CTGCCTCTGT CTTGCAGGAC AATGCCATTC TCTGTCTCAT 181 GGGGCGTCCT CCTGCTGGCA GGCCTGTGCT GCCTGGTCCC CAGCTCCCTG GTTGAGGATC 241 CCCAGGGAGA TGCTGCCCAA AAGACGGATA CATCCCACCA TGATCAAGGG GACTGGGAGG 301 ACCTTGCTTG CCAGAAGATC TCCTATAACG TCACCGACCT CGCCTTTGAT TTGTACAAGA 361 GCTGGCTGAT CTATCACAAC CAGCATGTCT TAGTCACCCC AACAAGCGTG GCTATGGCCT 421 TTCGAATGCT CTCCCTGGGG ACCAAGGCTG ACACTCGCAC AGAGATCCTG GAAGGCCTGA 481 ATGTCAACCT CACAGAGACG CCTGAGGCCA AGATCCACGA ATGCTTCCAG CAAGTTCTCC 541 AAGCCCTCAG CAGGCCAGAC ACCCGGCTCC AGCTGACCAC CGGCAGTAGC CTGTTTGTTA 601 ACAAGAGTAT GAAGCTAGTG GACACGTTTT TGGAGGATAC CAAGAAGCTG TACCACTCAG 661 AAGCCTCTTC CATCAACTTC AGGGACACCG AGGAGGCCAA AGAGCAGATC AACAATTATG 721 TGGAGAAAAG AACTGGAAGA AAAGTAGTGG ATTTGGTCAA ACACCTGAAA AAAGACACAA 781 GTCTTGCCCT GGTGGATTAC ATTTCCTTTC ACGGTAAGGT AGCACTACCT GTCCAACGCC 841 GTTCCACTGA AATAATCCAA ACATATTCTC AAATTACTAG TTCTTAAGCT // LOCUS HUMA1AR2 3758 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human alpha-1-antitrypsin-related protein gene, exons 3, 4 and 5. ACCESSION M19684 J03044 KEYWORDS alpha-1-antitrypsin-related protein. SEGMENT 2 of 2 SOURCE Human (patient homozygous for a pi-null allele of AAT) lymphocyte DNA, clone cATN13. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3758) AUTHORS Bao,J.-j., Reed-Fourquet,L., Sifers,R.N., Kidd,V.J. and Woo,S.L.C. TITLE Molecular structure and sequence homology of a gene related to alpha-1-antitrypsin in the human genome JOURNAL Genomics 2, 165-173 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by L.Reed-Fourquet, 01-MAR-1988. FEATURES from to/span description pept + 408 678 alpha-1-antitrypsin-related protein, exon 3 /nomgen="PI" /map="14q32.1" /hgml_locus_uid="LX0081X" 2093 2240 alpha-1-antitrypsin-related protein, exon 4 3171 3362 alpha-1-antitrypsin-related protein, exon 5 pre-msg < 1 3362 PI mRNA and introns IVS < 1 407 PI intron B IVS 679 2092 PI intron C IVS 2241 3170 PI intron D BASE COUNT 914 A 953 C 1018 G 872 T 1 others ORIGIN About 2 kb after segment 1; chromosome 14q32.1. 1 AGTCTGCCTA GGCATCTGCA TCTCCCCTGC TGATCCAGGG AAAGGCCTCA GGAGAAGTAA 61 GCTCAATGCT CTCCTCGTAT CTCAACAGAA CCAGCCCTCT GCATGAGATC AGAAGCCTCC 121 AGCCAAAGGT GGCAAGGCAT AAGGAGTTCC GATTTTACTC ATTAACTTTC ATTAACCATC 181 CCCAAGCCGC TATCGAATAA ACAGAGCCAA ATATAAATGC TTCTTCTGTG GCACCAGGAT 241 TTTGCCAATT AGTTCAAGTA TATAGAAACT CTGGATTCCC ATTTTACAGA TGATAAAGCT 301 GAAGGAGAGG GAAGTGTCCT GCCAAAGGGG CTCAGGGCTC CTTTCATTCA CACACTTCCA 361 CCCTTTTATC CCACACCTGG CTGATGCCCA CCTTCCTTTT CTTCCAGGCA AGTGGAAAGA 421 TAAATTCAAG GCTGAGCGCA TTATGGTAGA GGGCTTCCAT GTGGATGATA AGACCATCAT 481 CAGAGTGCCT ATGATAAACC ACCTGGGTAG ATTTGACATC CACCGGGACA GGGAGTTATC 541 CAGCTGGGTG CTGGCACAGC ACTATGTGGG GAACGCCACT GCCTTCTTCA TCCTGCCCGA 601 TCCAAAGAAG ATGTGGCAGT TGGAAGAAAA ATTGACCTAC AGCCACCTTG AAAATATCCA 661 GAGAGCCTTT GACATAAGGT GATTTCCAGT TCAGGGGCTG AGTCAGAAGC TGCCCAGACC 721 TTTGAGCCAT GTCAGGACTG AGGCCCAACA GGACTGGCCA GAGGGCTGGG GAGGGTGGAC 781 CTCACATCCC TGGGTCACAG CTACTGCAGA GGCCACCACT GAGTTCAGGC GGACCATCCA 841 TGCTCCATGA GGAGGCAGTA CCCGGGGTGA GGAGGTAAGA ACTCGTCCCC TGGGACTTCC 901 CACTCCATGT GGAACACTGT CCCTTCCCAT ATACCATACC CAGGGAGGGA CGACGACACC 961 ACCACACATT ATCGGAGACC AGAGGACAGA TAGGGCTCCT TCGGAAGGCA ACAGTGGAGT 1021 GTGGAGTTCA AGGTAGCTAC CCGTCAGGGG ACGGGGGAAG AGCCTGTGTG GCAAGGCCCA 1081 GAAAAACAAA GTCGGAATTG GAACAGCCAG GCCATGTTAG CAGAAGGCTG CGTTCCTGCC 1141 GTATCGATGC TGTTAGATTG GGTGTCCTGT AGTAAATGAT ACTTGAGCAT AAGCCATACA 1201 TTAGTGTATG TGTGTGCATT CGCTGATTAT GCCCATGCCC TGCTGATCTA GTTTATTTTT 1261 ATTTTTACAC CGTAAAACCA AGATGAAAAT ACGAAAGTCT CAGGTTCDAT AATAGGAATC 1321 GAGGCTGGAA TTTCTCTGTC CCATGCAAAC GCCCTCCTGA GGTACCTGCT CCAGGGTTGA 1381 GAAAGAACAA GGAGGCGAGA GATAACTGAT CAGAGAGCCC AGAGCCAGGC TGCCCGCTCA 1441 CACAGACCGT GCCTTCGGGC GGCTCTGTCT TCCACGGAAA ACCAGAAGGG AGCATTCGGC 1501 CTGGTGTGAT CACTCTTCTC TCATCCCCTG AACGGGTGTC ACTGCGCACT GCTGCCCCAC 1561 GTTCCCCTGG GTGTACCCCC CTGGGGATGT TCCAGGCTGG GGGCCCGGTG ACCCAACACT 1621 ACACAGCAGG ATGAGACACG CTTCCAGGAC ACCTAGAAAT CTAGCCTGAA GGGCTTCTCA 1681 AAGGAGGTGG CATTTCAAGC TGCCTTAACA AGATGTTAGT GTCATACTAA CATTCTTTGA 1741 TTCTGGGTAT AGAGGAGCCT AAAGGAGAAC AAACCAAGGC TAACTTCTGA TTGTTTTATT 1801 TTTTTTTTTA ACTTTTCTTT TAGGTTCAGG GGTCCGTGTG TCAGGTTTGT TATATAGGTA 1861 AACCCTGGGG TTTTATTGTA TAGATTATTT TGTCACCCAG GTACTAAGCG TAGTACCCAA 1921 TAGTTATTTT CTCTGATCCT CTCCCTCTGA CTTCTGATCT TGATATGGGT GAGGACATGG 1981 CCACTAAGTG GGAGACGCAA TCAGAAGAGA AAGCAACTGG CCACTGTGGT GGGTTCCAGA 2041 AGAACAGGAG AAATGACTGC TCCATGCCTT GATTCTCTTT CCTGCATGGC AGGTCTATCA 2101 ATCTACATTT TCCCAAACTG TCCATTTCTG GAACCTACAA ACTCAAGAGA GTCCCTAGGA 2161 ATCTGGGCAT CACCAAGATC TTCAGCAACG AGGCCGACCT CTCTGGAGTC AGTCAGGAGG 2221 CACCCCTGAA GCTCTCCAAG GTGAGGTCAC CAGAAGACCG TGTTGTCCCC TGGTGTCGGC 2281 TGGCAGATTG TGTGGAGGTT GCAGTTCTGT CCTGAGGCTG AAGAAGGGGC TAAGAGACAC 2341 AAATGAAGGC CGAGCTCCTG AGGATGCCTG AGATTCACTG AGATGGGCCA TGGTCAGACG 2401 GCGGAGCTGG GGAGGCGCCT GGTGTGCAGC TGGCACTTTC AGGGCCTCCC CCGCTGTTGT 2461 GTCATTGACC CTTTGCCCAG AACCTTCCAG TTTGGGGCCT TAATTTATGC TATTGATACA 2521 AGAAAGCTTA GGCTCATTTC AAAAGACTTT CTGGCCCCTT TCATCTGTGA GGTTTTGAGT 2581 TTCTGGTGCA TCACCAAGCC CGAATCAATG AATGATTTGA GCTAAAGATG ACGCTAATTT 2641 AGGAAAACTA AAGGTGGCCA AAGGACAACT TTGGCAGTTC TGTGAATGCT GCGTTCACTT 2701 GGGGTCTGGG CACCATGAAG GACACTGTCA GAGGGTGGCC AAAGGACAAC TTTGGCAGTT 2761 CTGTGAATGG CTGCGTTCTC TTGGGGTCTG GGCACATGAA GGACATGTCA GAGTCCATGT 2821 CCCCAAGGAG CTTCAAGCTG TGTACAAGAA AGGAGAGAGC CCTGGAGGCA GACGTGCAGC 2881 GGGGATGCCC CTCCACTTTC TGAATTGTGG GTGCACCCTG AGCAGGGGAG AGGCTCTTAT 2941 AGGGAAGGAC AGACGGAGGG GAGCCAGCCC CATCAGCCAA AGCCTTGAGG GGGAGCAAGG 3001 CCTGTGTGAC AGGGAGGGAG AGGACGTGCA GGGTCAGTGC CACCGAGTGG AGAAAGGCTG 3061 GAGTGAGTTC CTCACCTCCT AGGATGAGGC TGTCCACATG GGCCTCGCTC CAGGCCTGGG 3121 CTGGGGCATG GCTCAGCCTC GGAACGTATC TCTGCTTCTC TCCCCTTCAG GCTGTGCATG 3181 TGGCTGTGCT CACCATTGAT GAGAAAGGGA CCGAAGCCAC CGGAGCCCCC CATCTGGAGG 3241 AGAAGGCCTG GTCTAAGTAT CAGACAGTCA TGTTCAACCG ACCCTTCCTG GTCATCATCA 3301 AGGAGTACAT CACCAACTTT CCGCTCTTCA TTGGAAAAGT GGTGAATCCC ACCCAAAAAT 3361 AACTGGCTGT CACTCCTCAG CCCCTCCCCT CCACCCCTGG CCCCCTCCCT GGATGACATT 3421 AAAGAAAGGC TGAATTGCCC TCTGCCTGCG TGTGACTGCA AACCCCTCCC CTGTTGTTTC 3481 TGAGTCTCCC TTTGTGCACT GAGGCTGTGT GTGGACTCCA GGTCACAGTG CTGTCTCCGG 3541 GCCTCCTGAA CTGTGTTCAT GGAGCACCTG ACTGGGCAGG CATGTGCTAG GCCAGGATGG 3601 AGGGGGCTGA ATCCTCAGTT TAAGGGCCTG GACCTGGGAG CTTATGGAAG AGCAAGGGGC 3661 TACCAGGAGC GGCCGCCAAG TGCGGGTCAC CAGAGAGAGC CAGGCCAGGG CCTGAGTGCT 3721 AGAACGCTGT CCGCATGTGT GACCCAGAAA ACCACTTT // LOCUS HUMA1AT2 101 bp ds-DNA PRI 30-JUN-1987 DEFINITION Human alpha-1-antitrypsin gene: exon 1 (partial). ACCESSION J00065 KEYWORDS alpha-1-antitrypsin; antitrypsin. SEGMENT 2 of 4 SOURCE human genomic dna. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 101) AUTHORS Leicht,M., Long,G.L., Chandra,T., Kurachi,K., Kidd,V.J., Mace,M.Jr., Davie,E.W. and Woo,S.L. TITLE sequence homology and structural comparison between the chromosomal human alpha-1-antitrypsin and chicken ovalbumin genes JOURNAL Nature 297, 655-659 (1982) STANDARD full staff_review COMMENT [1] compares the sequences around the junction of exon 1 and intron a of human alpha-1-antitrypsin with those around the junction of exon 6 and intron f of chicken ovalbumin (see chkoval). see other huma1at and mnka1at loci. FEATURES from to/span description pept < 1 + 61 alpha-1-antitrypsin (exon 1, AA at ?) /nomgen="PI" /map="14q32.1" /hgml_locus_uid="LX0001X" pre-msg < 1 > 101 alpha-1-antitrypsin mRNA IVS 62 > 101 alpha-1-antitrypsin intron a BASE COUNT 27 A 20 C 23 G 31 T ORIGIN about 387 bases after huma1at1. 1 TTGGTCAAGG AGCTTGACAG AGACACAGTT TTTGCTCTGG TGAATTACAT CTTCTTTAAA 61 GGTAAGGTTG CTCAACCAGC CTGAGCTGTT TCCCATAGAA A // LOCUS HUMA1AT3 69 bp ds-DNA PRI 01-SEP-1983 DEFINITION human alpha-1-antitrypsin gene: ivs a/ exon 2 junction. ACCESSION J00066 KEYWORDS alpha-1-antitrypsin; antitrypsin. SEGMENT 3 of 4 SOURCE human genomic dna. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 69) AUTHORS Leicht,M., Long,G.L., Chandra,T., Kurachi,K., Kidd,V.J., Mace,M.Jr., Davie,E.W. and Woo,S.L. TITLE sequence homology and structural comparison between the chromosomal human alpha-1-antitrypsin and chicken ovalbumin genes JOURNAL Nature 297, 655-659 (1982) STANDARD full staff_review COMMENT [1] compares the sequences around the junction of intron a and exon 2 of human alpha-1-antitrypsin with those around the junction of intron f and exon 7 of chicken ovalbumin (see chkoval). see other huma1at and mnka1at loci. FEATURES from to/span description pept + 41 > 69 alpha-1-antitrypsin (exon 2, partial) /nomgen="PI" /map="14q32.1" /hgml_locus_uid="LX0001X" pre-msg < 1 > 69 alpha-1-antitrypsin mRNA IVS < 1 40 alpha-1-antitrypsin intron a BASE COUNT 15 A 25 C 14 G 15 T ORIGIN about 1450 bases after huma1at2. 1 TCACTCACCC CTGGTGATGC CCACCTTCCC CTCTCTCCAG GCAAATGGGA GAGACCCTTT 61 GAAGTCAAG // LOCUS HUMA1ATA 322 bp ds-DNA PRI 15-JUN-1988 DEFINITION Human alpha-1-antitrypsin gene, exon 1 (unexpressed). ACCESSION M11288 KEYWORDS alpha-1-antitrypsin. SOURCE Human liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 322) AUTHORS Ciliberto,G., Dente,L. and Cortese,R. TITLE Cell-specific expression of a transfected human alpha-1- antitrypsin gene JOURNAL Cell 41, 531-540 (1985) STANDARD simple staff_entry FEATURES from to/span description pre-msg 135 > 322 alpha-1 AT mRNA and intron /nomgen="PI" /map="14q32.1" /hgml_locus_uid="LX0001X" IVS 180 > 322 intron A BASE COUNT 61 A 90 C 101 G 69 T 1 others ORIGIN 1 GACTCAGATC CCAGTGGACT TAGCCCCTGT TTGCTCCTCC GATAACTGGG GTGACCTTGG 61 TTAAATATTC ACCAGCAGCC TCCCCCGTTG CCCCTCTGGA TCCACTGCTT AAATACGGAC 121 GAGGACAGGG CCCTGTCTCC TCAGCTTCAG GCACCACCAC TGACCTGGGA CAGTGAATCG 181 TAAGTATGCC TTTCACTGCG AGGGGTTCTG GAGAGGCTTC TGAGCTCCCC ATGGCCCAGG 241 CAGGCAGCAG GTCTGGGGCA GGAGGGGGGT TGTGGAGTGG GTATCCNGCT GCTGAGGTCA 301 GGGCAGATGG AGCAGGCTGC AG // LOCUS HUMA1ATP 12222 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human alpha-1-antitrypsin gene (S variant), complete cds. ACCESSION K02212 KEYWORDS alpha-1-antitrypsin; antitrypsin. SOURCE Human: liver, cDNA to mRNA, clone pAT83; genomic DNA (S variant), clones pAT4.6 and pAT9.6. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 12222) AUTHORS Long,G.L., Chandra,T., Woo,S.L.C., Davie,E.W. and Kurachi,K. TITLE Complete sequence of the cDNA for human alpha-1-antitrypsin and the gene for the S variant JOURNAL Biochemistry 23, 4828-4837 (1984) STANDARD full staff_review FEATURES from to/span description pept 7316 7961 alpha-1-antitrypsin, exon 2 /nomgen="PI" /map="14q32.1" /hgml_locus_uid="LX0001X" 9412 9682 alpha-1-antitrypsin, exon 3 10939 11086 alpha-1-antitrypsin, exon 4 11910 12101 alpha-1-antitrypsin, exon 5 sigp 7316 7387 alpha-1-antitrypsin signal peptide matp 7388 7961 alpha-1-antitrypsin mature peptide 9412 9682 alpha-1-antitrypsin mature peptide 10939 11086 alpha-1-antitrypsin mature peptide 11910 12098 alpha-1-antitrypsin mature peptide pre-msg 1952 12177 A1A mRNA IVS 2002 7311 A1A mRNA intron A IVS 7962 9411 A1A cds intron B IVS 9683 10938 A1A cds intron C IVS 11087 11909 A1A cds intron D variant 7971 7971 c in one cDNA clone; t in another cDNA clone allele 9628 9628 a in cDNA clone; t in S variant genomic clone variant 11999 11999 c in one cDNA clone; g in another cDNA clone BASE COUNT 2984 A 3125 C 3155 G 2958 T ORIGIN 1 bp upstream of EcoRI site. 1 GAATTCCAGG TTGGAGGGGC GGCAACCTCC TGCCAGCCTT CAGGCCACTC TCCTGTGCCT 61 GCCAGAAGAG ACAGAGCTTG AGGAGAGCTT GAGGAGAGCA GGAAAGGTGG AACATTGCTG 121 CTGCTGCTCA CTCAGTTCCA CAGGTGGGAG GAACAGCAGG GCTTAGAGTG GGGGTCATTG 181 TGCAGATGGG AAAACAAAGG CCCAGAGAGG GGAAGAAATG CCTAGGAGCT ACCGAGGGCA 241 GGCGACCTCA ACCACAGCCC AGTGCTGGAG CTGTGAGTGG ATGTAGAGCA GCGGAATATC 301 CATTCAGCCA GCTCAGGGGA AGGACAGGGG CCCTGAAGCC AGGGGATGGA GCTGCAGGGA 361 AGGGAGCTCA GAGAGAAGGG GAGGGGAGTC TGAGCTCAGT TTCCCGCTGC CTGAAAGGAG 421 GGTGGTACCT ACTCCCTTCA CAGGGTAACT GAATGAGAGA CTGCCTGGAG GAAAGCTCTT 481 CAAGTGTGGC CCACCCCACC CCAGTGACAC CAGCCCCTGA CACGGGGGAG GGAGGGCAGC 541 ATCAGGAGGG GCTTTCTGGG CACACCCAGT ACCCGTCTCT GAGCTTTCCT TGAACTGTTG 601 CATTTTAATC CTCACAGCAG CTCAACAAGG TACATACCGT CACCATCCCC ATTTTACAGA 661 TAGGGAAATT GAGGCTCGGA GCGGTTAAAC AACTCACCTG AGGCCTCACA GCCAGTAAGT 721 GGGTTCCCTG GTCTGAATGT GTGTGCTGGA GGATCCTGTG GGTCACTCGC CTGGTAGAGC 781 CCCAAGGTGG AGGCATAAAT GGGACTGGTG AATGACAGAA GGGGCAAAAA TGCACTCATC 841 CATTCACTCT GCAAGTATCT ACGGCACGTA CGCCAGCTCC CAAGCAGGTT TGCGGGTTGC 901 ACAGCGGAGC GATGCAATCT GATTTAGGCT TTTAAAGGAT TGCAATCAAG TGGGACCCAC 961 TAGCCTCAAC CCTGTACCTC CCCTCCCCTC CACCCCCAGC AGTCTCCAAA GGCCTCCAAC 1021 AACCCCAGAG TGGGGGCCAT GTATCCAAAG AAACTCCAAG CTGTATACGG ATCACACTGG 1081 TTTTCCAGGA GCAAAAACAG AAACAGCCTG AGGCTGGTCA AAATTGAACC TCCTCCTGCT 1141 CTGAGCAGCC TAGGGGGCAG ACTAAGCAGA GGGCTGTGCA GACCCACATA AAGAGCCTAC 1201 TGTGTGCCAG GCACTTCACC CGAGGCACTT CACAAGCATG CTTGGGAATG AAACTTCCAA 1261 CTCTTTGGGA TGCAGGTGAA ACAGTTCCTG GTTCAGAGAG GTGAAGCGGC CTGCCTGAGG 1321 CAGCACAGCT CTTCTTTACA GATGTGCTTC CCCACCTCTA CCCTGTCTCA CGGCCCCCCA 1381 TGCCAGCCTG ACGGTTGTGT CTGCCTCAGT CATGCTCCAT TTTTCCATCG GGACCATCAA 1441 GAGGGTGTTT GTGTCTAAGG CTGACTGGGT AACTTTGGAT GAGCGGTCTC TCCGCTCCGA 1501 GCCTGTTTCC TCATCTGTCA AACGGGCTCT AACCCACTCT GATCTCCCAG GGCGGCAGTA 1561 AGTCTTCAGC ATCAGGCATT TTGGGGTGAC TCAGTAAATG GTAGATCTTG CTACCAGTGG 1621 AACAGCCACT AAGGATTCTG CAGTGAGAGC AGAGGGCCAG CTAAGTGGTA CTCTCCCAGA 1681 GACTGTCTGA CTCACGCCAC CCCCTCCACC TTGGACACAG GACGCTGTGG TTTCTGAGCC 1741 AGGTACAATG ACTCCTTTCG GTAAGTGCAG TGGAAGCTGT ACACTGCCCA GGCAAAGCGT 1801 CCGGGCAGCG TAGGCGGGCG ACTCAGATCC CAGCCAGTGG ACTTAGCCCC TGTTTGCTCC 1861 TCCGATAACT GGGGTGACCT TGGTTAATAT TCACCAGCAG CCTCCCCCGT TGCCCCTCTG 1921 GATCCACTGC TTAAATACGG ACGAGGACAG GGCCCTGTCT CCTCAGCTTC AGGCACCACC 1981 ACTGACCTGG GACAGTGAAT CGTAAGTATG CCTTTCACTG CGAGGGGTTC TGGAGAGGCT 2041 TCCGAGCTCC CCATGGCCCA GGCAGGCAGC AGGTCTGGGG CAGGAGGGGG GTTGTGGAGT 2101 GGGTATCCGC CTGCTGAGGT GCAGGGCAGA TGGAGAGGCT GCAGCTGAGC TCCTATTTTC 2161 ATAATAACAG CAGCCATGAG GGTTGTGTCC TGTTTCCCAG TCCTGCCCGG TCCCCCCTCG 2221 GTACCTCCTG GTGGATACAC TGGTTCCTGT AAGCAGAAGT GGATGAGGGT GTCTAGGTCT 2281 GCAGTCCTGG CACCCCAGGA TGGGGGACAC CAGCCAAGAT ACAGCAACAG CAACAAAGCG 2341 CAGCCATTTC TTTCTGTTTG CACAGCTCCT CTGTCTGTCG GGGGCTCCTG TCTGTTGTCT 2401 CCTATAAGCC TCACCACCTC TCCTACTGCT TGGGCATGCA TCTTTCTCCC CTTCTATAGA 2461 TGAGGAGGTT AAGGTTCAGA GAGGGGTGGG GAGGAACGCC GGCTCACATT CTCCATCCCC 2521 TCCAGATATG ACCAGGAACA GACCTGTGCC AGCCTCAGCC TTACATCAAA ATGGGCCTCC 2581 CCATGCACCG TGGACCTCTG GGCCCTCCTG TCCCAGTGGA GGACAGGAAG CTGTGAGGGG 2641 CACTGTCACC CAGGGCTCAA GCTGGCATTC CTGAATAATC GCTCTGCACC AGGCCACGGC 2701 TAAGCTCAGT GCGTGATTAA GCCTCATAAC CCTCCAAGGC AGTTACTAGT GTGATTCCCA 2761 TTTTACAGAT GAGGAAGATG GGGACAGAGA GGTGAATAAC TGGCCCCAAA TCACACACCA 2821 TCCATAATTC GGGCTCAGGC ACCTGGCTCC AGTCCCCAAA CTCTTGAACC TGGCCCTAGT 2881 GTCACTGTTT CTCTTGGGTC TCAGGCGCTG GATGGGGAAC AGGAAACCTG GGCTGAACTT 2941 GAGGCCTCTC TGATGCTCGG TGACTTCAGA CAGTTGCTCA ACCTCTCTGT TCTCTTGGGC 3001 AAAACATGAT AACCTTTGAC TTCTGTCCCC TCCCCTCACC CCACCCGACC TTGATCTCTG 3061 AAGTGTTGGA AGGATTTAAT TTTTCCTGCA CTGAGTTTTG GAGACAGGTC AAAAAGATGA 3121 CCAAGGCCAA GGTGGCCAGT TTCCTATAGA ACGCCTCTAA AAGACCTGCA GCAATAGCAG 3181 CAAGAACTGG TATTCTCGAG AACTTGCTGC GCAGCAGGCA CTTCTTGGCA TTTTATGTGT 3241 ATTTAATTTC ACAATAGCTC TATGACAAAG TCCACCTTTC TCATCTCCAG GAAACTGAGG 3301 TTCAGAGAGG TTAAGTAACT TGTCCAAGGT CACACAGCTA ATAGCAAGTT GACGTGGAGC 3361 AATCTGGCCT CAGAGCCTTT AATTTTAGCC ACAGACTGAT GCTCCCCTCT TCATTTAGCC 3421 AGGCTGCCTC TGAAGTTTTC TGATTCAAGA CTTCTGGCTT CAGCTTTGTA CACAGAGATG 3481 ATTCAATGTC AGGTTTTGGA GCGAAATCTG TTTAATCCCA GACAAAACAT TTAGGATTAC 3541 ATCTCAGTTT TGTAAGCAAG TAGCTCTGTG ATTTTTAGTG AGTTATTTAA TGCTCTTTGG 3601 GGCTCAATTT TTCTATCTAT AAAATAGGGC TAATAATTTG CACCTTATAG GGTAAGCTTT 3661 GAGGACAGAT TAGATGATAC GGTGCCTGTA AAACACCAGG TGTTAGTAAG TGTGGCAATG 3721 ATGGTGACGC TGAGGCTGTG TTTGCTTAGC ATAGGGTTAG GCAGCTGGCA GGCAGTAAAC 3781 AGTTGGATAA TTTAATGGAA AATTTGCCAA ACTCAGATGC TGTTCACTGC TGAGCAGGAG 3841 CCCCTTCCTG CTGAAATGGT CCTGGGGAGT GCAGCAGGCT CTCCGGGAAG AAATCTACCA 3901 TCTCTCGGGC AGGAGCTCAA CCTGTGTGCA GGTACAGGGA GGGCTTCCTC ACCTGGTGCC 3961 CACTCATGCA TTACGTCAGT TATTCCTCAT CCCTGTCCAA AGGATTCTTT TCTCCATTGT 4021 ACAGCTATGA AGCTAGTGCT CAAAGAAGTG AAGTCATTTA CCCCAGGCCC CCTGCCAGTA 4081 AGTGACAGGG CCTGGTCACA CTTGGGTTTA TTTATTGCCC AGTTCAACAG GTTGTTTGAC 4141 CATAGGCGAG ATTCTCTTCC CTGCACCCTG CCGGGTTGCT CTTGGTCCCT TATTTTATGC 4201 TCCTGGGTAG AAATGGTGCG AGATTAGGCA GGGAGTGGAC GCTTCCCTGT CCCTGGCCCC 4261 GCAAAGAGTG CTCCCACCTG CCCCGATCCC AGAAATGTCA CCATGAAGCC TTCATTCTTT 4321 TGGTTTAAAG CTTGGCCTCA GTGTCCGTAC ACCATGGGGT CCTTGGCCAG ATGGCGACTT 4381 TCTCCTCTCC AGTCGCCCTC CCAGGCACTA GCTTTTAGGA GTGCAGGGTG CTGCCTCTGA 4441 TAGAAGGGCC AGGAGAGAGC AGGTTTTGGA GACCTGATGT TATAAGGAAC AGCTTGGGAG 4501 GCATAATGAA CCCAACATGA TGCTTGAGAC CAATGTCACA GCCCAATTCT GACATTCATC 4561 ATCTGAGATC TGAGGACACA GCTGTCTCAG TTCATGATCT GAGTGCTGGG AAAGCCAAGA 4621 CTTGTTCCAG CTTTGTCACT GACTTGCTGT ATAGCCTCAA CAAGGCCCTG ACCCTCTCTG 4681 GGCTTCAAAC TCTTCACTGT GAAAGGAGGA AACCAGAGTA GGTGATGTGA CACCAGGAAA 4741 GATGGATGGG TGTGGGGGAA TGTGCTCCTC CCAGCTGTCA CCCCCTCGCC ACCCTCCCTG 4801 CACCAGCCTC TCCACCTCCT TTGAGCCCAG AATTCCCCTG TCTAGGAGGG CACCTGTCTC 4861 GTGCCTAGCC ATGGGAATTC TCCATCTGTT TTGCTACATT GAACCCAGAT GCCATTCTAA 4921 CCAAGAATCC TGGCTGGGTG CAGGGGCTCT CGCCTGTAAC CCCAGCACTT TGGGAGGCCA 4981 AGGCAGGCGG ATCAAGAGGT CAGGAGTTCA AGACCTGCCT GGCCAACACG GTGAAACCTC 5041 AGCTCTACTA AAAATACAAA AATTAGCCAG GCGTGGTGGC ACACGCCTGT AATCCCAGCT 5101 ATTTGGGAAG CTGAGACAGA AGAATTTCTT GAACCCGGGA GGTGGAGGTT TCAGTGAGCC 5161 GAGATCACGC CACTGCACTC CACCCTGGCG GATAAAGCGA GACTCTGTCT CAAAAAAAAC 5221 CCAAAAACCT ATGTTAGTGT ACAGAGGGCC CCAGTGAAGT CTTCTCCCAG CCCCACTTTG 5281 CACAACTGGG GAGAGTGAGG CCCCAGGACC AGAGGATTCT TGCTAAAGGC CAAGTGGATA 5341 GTGATGGCCC TGCCAGGCTA GAAGCCACAA CCTCTGGCCC TGAGGCCACT CAGCATATTT 5401 AGTGTCCCCA CCCTGCAGAG GCCCAACTCC CTCCTGACCA CTGAGCCCTG TAATGATGGG 5461 GGAATTTCCA TAAGCCATGA AGGACTGCAC AAAGTTCAGT TGGGAGTGAA AGAGAAATTA 5521 AAGGGAGATG GAAATATACA GCACTAATTT TAGCACCGTC TTCAGTTCTA ACAACACTAG 5581 CTAGCTGAAG AAAATACAAA CATGTATTAT GTAATGTGTG GTCTGTTCCA TTTGGATTAC 5641 TTAGAGGCAC GAGGGCCAAG GAGAAAGGTG GTGGAGAGAA ACCAGCTTTG CACTTCATTT 5701 GTTGCTTTAT TGGAAGGAAA CTTTTAAAAG TCCAAGGGGG TTGAAGAATC TCAATATTTG 5761 TTATTTCCAG CTTTTTTTCT CCAGTTTTTC ATTTCCCAAA TTCAAGGACA CCTTTTTCTT 5821 TGTATTTTGT TAAGATGATG GTTTTGGTTT TGTGACTAGT AGTTAACAAT GTGGCTGCCG 5881 GGCATATTCT CCTCAGCTAG GACCTCAGTT TTCCCATCTG TGAAGACGGC AGGTTCTACC 5941 TAGGGGGCTG CAGGCAGGTG GTCCGAAGCC TGGGCATATC TGGAGTAGAA GGATCACTGT 6001 GGGGCAGGGC AGGTTCTGTG TTGCTGTGGA TGACGTTGAC TTTGACCATT GCTCGGCAGA 6061 GCCTGCTCTC GCTGGTTCAG CCACAGGCCC CACCACTCCC TATTGTCTCA GCCCCGGGTA 6121 TGAAACATGT ATTCCTCACT GGCCTATCAC CTGAAGCCTT TGAATTTGCA ACACCTGCCA 6181 ACCCCTCCCT CAAAAGAGTT GCCCTCTCTA GATCCTTTTG ATGTAAGGTT TGGTGTTGAG 6241 ACTTATTTCA CTAAATTCTC ATACATAAAC ATCACTTTAT GTATGAGGCA AAATGAGGAC 6301 CAGGGAGATG AATGACTTGT CCTGGCTCAT ACACCTGGAA AGTGACAGAG TCAGATTAGA 6361 TCCTAGGTCT ATCTGAAGTT AAAAGAGGTG TCTTTTCACT TCCCACCTCC TCCATCTACT 6421 TTAAAGCAGC ACAAACCCCT GCTTTCAAGG AGAGATGAGC GTCTCTAAAG CCCCTGACAG 6481 CAAGAGCCCA GAACTGGGAC ACCATTAGTG ACCCAGACGG CAGGTAAGCT GACTGCAGGA 6541 GCATCAGCCT ATTCTTGTGT CTGGGACCAC AGAGCATTGT GGGGACAGCC CCGTCTCTTG 6601 GGAAAAAAAC CCTAAGGGCT GAGGATCCTT GTGAGTGTTG GGTGGGAACA GCTCCCAGGA 6661 GGTTTAATCA CAGCCCCTCC ATGCTCTCTA GCTGTTGCCA TTGTGCAAGA TGCATTTCCC 6721 TTCTGTGCAG CAGTTTCCCT GGCCACTAAA TAGTGGGATT AGATAGAAGC CCTCCAAGGG 6781 CTCCAGCTTG ACATGATTCT TGATTCTGAT CTGACCCGAT TCTGATAATC GTGGGCAGGC 6841 CCATTCCTCT TCTTGTGCCT CATTTTCTTC TTTTGTAAAA CAATGGCTGT ACCATTTGCA 6901 TCTTAGGGTC ATTGCAGATG AAAGTGTTGC TGTCCAGAGC CTGGGTGCAG GACCTAGATG 6961 TAGGATTCTG GTTCTGCTAC TTCCTCAGTG ACATTGAATA GCTGACCTAA TCTCTCTGGC 7021 TTTGGTTTCT TCATCTGTAA AAGAAGGATA TTAGCATTAG CACCTCACGG GATTGTTACA 7081 AGAAAGCAAT GAATTAACAC ATGTGAGCAC GGAGAACAGT GCTTGGCATA TGGTAAGCAC 7141 TACGTACATT TTGCTATTCT TCTGATTCTT TCAGTGTTAC TGATGTCGGC AAGTACTTGG 7201 CACAGGCTGG TTTAATAATC CCTAGGCACT TTCACGTGGT GTCAATCCCT GATCACTGGG 7261 AGTCATCATG TGCCTTGACT CGGGCCTGGC CCCCCCATCT CTGTCTTGCA GGACAATGCC 7321 GTCTTCTGTC TCGTGGGGCA TCCTCCTGCT GGCAGGCCTG TGCTGCCTGG TCCCTGTCTC 7381 CCTGGCTGAG GATCCCCAGG GAGATGCTGC CCAGAAGACA GATACATCCC ACCATGATCA 7441 GGATCACCCA ACCTTCAACA AGATCACCCC CAACCTGGCT GAGTTCGCCT TCAGCCTATA 7501 CCGCCAGCTG GCACACCAGT CCAACAGCAC CAATATCTTC TTCTCCCCAG TGAGCATCGC 7561 TACAGCCTTT GCAATGCTCT CCCTGGGGAC CAAGGCTGAC ACTCACGATG AAATCCTGGA 7621 GGGCCTGAAT TTCAACCTCA CGGAGATTCC GGAGGCTCAG ATCCATGAAG GCTTCCAGGA 7681 ACTCCTCCGT ACCCTCAACC AGCCAGACAG CCAGCTCCAG CTGACCACCG GCAATGGCCT 7741 GTTCCTCAGC GAGGGCCTGA AGCTAGTGGA TAAGTTTTTG GAGGATGTTA AAAAGTTGTA 7801 CCACTCAGAA GCCTTCACTG TCAACTTCGG GGACACCGAA GAGGCCAAGA AACAGATCAA 7861 CGATTACGTG GAGAAGGGTA CTCAAGGGAA AATTGTGGAT TTGGTCAAGG AGCTTGACAG 7921 AGACACAGTT TTTGCTCTGG TGAATTACAT CTTCTTTAAA GGTAAGGTTG CTCAACCAGC 7981 CTGAGCTGTT TCCCATAGAA ACAAGCAAAA ATATTTCTCA AACCATCAGT TCTTGAACTC 8041 TCCTTGGCAA TGCATTATGG GCCATAGCAA TGCTTTTCAG CGTGGATTCT TCAGTTTTCT 8101 ACACACAAAC ACTAAAATGT TTTCCATCAT TGAGTAATTT GAGGAAATAA TAGATTAAAC 8161 TGTCAAAACT ACTGACGCTC TGCAGAACTT TTCAGAGCCT TTAATGTCCT TGTGTATACT 8221 GTATATGTAG AATATATAAT GCTTAGAACT ATAGAACAAA TTGTAATACA CTGCATAAAG 8281 GGATAGTTTC ATGGAACATA CTTTACACGA CTCTAGTGTC CCAGAATCAG TATCAGTTTT 8341 GCAATCTGAA AGACCTGGGT TCAAATCCTG CCTCTAACAC AATTAGCTTT TGACAAAAAC 8401 AATGCATTCT ACCTCTTTGA GGTGCTAATT TCTCATCTTA GCATGGACAA AATACCATTC 8461 TTGCTGTCAG GTTTTTTTAG GATTAAACAA ATGACAAAGA CTGTGGGGAT GGTGTGTGGC 8521 ATACAGCAGG TGATGGACTC TTCTGTATCT CAGGCTGCCT TCCTGCCCCT GAGGGGTTAA 8581 AATGCCAGGG TCCTGGGGGC CCCAGGGCAT TCTAAGCCAG CTCCCACTGT CCCAGGAAAA 8641 CAGCATAGGG GAGGGGAGGT GGGAGGCAAG GCCAGGGGCT GCTTCCTCCA CTCTGAGGCT 8701 CCCTTGCTCT TGAGGCAAAG GAGGGCAGTG GAGGCAAGCC AGGCTGCAGT CAGCACAGCT 8761 AAAGTCCTGG CTCTGCTGTG GCCTTAGTGG GGGCCCAGGT CCCTCTCCAG CCCCAGTCTC 8821 CTCCTTCTGT CCAATGAGAA AGCTGGGATC AGGGGTCCCT GAGGCCCCTG TCCACTCTGC 8881 ATGCCTCGAT GGTGAAGCTC TGTTGGTATG GCAGAGGGGA GGCTGCTCAG GCATCTGCAT 8941 TTCCCCTGCC AATCTAGAGG ATGAGGAAAG CTCTCAGGAA TAGTAAGCAG AATGTTTGCC 9001 CTGGATGAAT AACTGAGCTG CCAATTAACA AGGGGCAGGG AGCCTTAGAC AGAAGGTACC 9061 AAATATGCCT GATGCTCCAA CATTTTATTT GTAATATCCA AGACACCCTC AAATAAACAT 9121 ATGATTCCAA TAAAAATGCA CAGCCACGAT GGCATCTCTT AGCCTGACAT CGCCACGATG 9181 TAGAAATTCT GCATCTTCCT CTAGTTTTGA ATTATCCCCA CACAATCTTT TTCGGCAGCT 9241 TGGATGGTCA GTTTCAGCAC CTTTTACAGA TGATGAAGCT GAGCCTCGAG GGATGTGTGT 9301 CGTCAAGGGG GCTCAGGGCT TCTCAGGGAG GGGACTCATG GTTTCTTATT CTGCTACACT 9361 CTTCCAAACC TTCACTCACC CCTGGTGATG CCCACCTTCC CCTCTCTCCA GGCAAATGGG 9421 AGAGACCCTT TGAAGTCAAG GACACCGAGG AAGAGGACTT CCACGTGGAC CAGGTGACCA 9481 CCGTGAAGGT GCCTATGATG AAGCGTTTAG GCATGTTTAA CATCCAGCAC TGTAAGAAGC 9541 TGTCCAGCTG GGTGCTGCTG ATGAAATACC TGGGCAATGC CACCGCCATC TTCTTCCTGC 9601 CTGATGAGGG GAAACTACAG CACCTGGTAA ATGAACTCAC CCACGATATC ATCACCAAGT 9661 TCCTGGAAAA TGAAGACAGA AGGTGATTCC CCAACCTGAG GGTGACCAAG AAGCTGCCCA 9721 CACCTCTTAG CCATGTTGGG ACTGAGGCCC ATCAGGACTG GCCAGAGGGC TGAGGAGGGT 9781 GAACCCCACA TCCCTGGGTC ACTGCTACTC TGTATAAACT TGGCTTCCAG AATGAGGCCA 9841 CCACTGAGTT CAGGCAGCGC CGTCCATGCT CCATGAGGAG AACAGTACCC AGGGTGAGGA 9901 GGTAAAGGTC TCGTCCCTGG GAACTTCCCA CTCCAGTGTG GACACTGTCC CTTCCCAATA 9961 TCCAGTGCCC AAGGCAGGGA CAGCAGCACC ACCACACGTT CTGGCAGAAC CAAAAAGGAA 10021 CAGATGGGCT TCCTGGCAAA GGCAGCAGTG GAGTGTGGAG TTCAAGGGTA GAATGTCCCT 10081 GGGGGGACGG GGGAAGAGCC TGTGTGGCAA GGCCCAGAAA AGCAAGGTTC GGAATTGGAA 10141 CAGCCAGGCC ATGTTCGCAG AAGGCTTGCG TTTCTCTGTC ACTTTATCGG TGCTGTTAGA 10201 TTGGGTGTCC TGTAGTAAGT GATACTTAAA CATGAGCCAC ACATTAGTGT ATGTGTGTGC 10261 ATTCGTGATT ATGCCCATGC CCTGCTGATC TAGTTCGTTT TGTACACTGT AAAACCAAGA 10321 TGAAAATACA AAAGGTGTCG GGTTCATAAT AGGAATCGAG GCTGGAATTT CTCTGTTCCA 10381 TGCCAGCACC TCCTGAGGTC TCTGCTCCAG GGGTTGAGAA AGAACAAAGA GGCTGAGAGG 10441 GTAACGGATC AGAGAGCCCA GAGCCAGCTG CCGCTCACAC CAGACCCTGC TCAGGGTGGC 10501 ATTGTCTCCC CATGGAAAAC CAGAGAGGAG CACTCAGCCT GGTGTGGTCA CTCTTCTCTT 10561 ATCCACTAAA CGGTTGTCAC TGGGCACTGC CACCAGCCCC GTGTTTCTCT GGGTGTAGGG 10621 CCCTGGGGAT GTTACAGGCT GGGGGCCAGG TGACCCAACA CTACAGGGCA AGATGAGACA 10681 GGCTTCCAGG ACACCTAGAA TATCAGAGGA GGTGGCATTT CAAGCTTTTG TGATTCATTC 10741 GATGTTAACA TTCTTTGACT CAATGTAGAA GAGCTAAAAG TAGAACAAAC CAAAGCCGAG 10801 TTCCCATCTT AGTGTGGGTG GAGGACACAG GAGTAAGTGG CAGAAATAAT CAGAAAAGAA 10861 AACACTTGCA CTGTGGTGGG TCCCAGAAGA ACAAGAGGAA TGCTGTGCCA TGCCTTGAAT 10921 TTCTTTTCTG CACGACAGGT CTGCCAGCTT ACATTTACCC AAACTGTCCA TTACTGGAAC 10981 CTATGATCTG AAGAGCGTCC TGGGTCAACT GGGCATCACT AAGGTCTTCA GCAATGGGGC 11041 TGACCTCTCC GGGGTCACAG AGGAGGCACC CCTGAAGCTC TCCAAGGTGA GATCACCCTG 11101 ACGACCTTGT TGCACCATGG TATCTGTAGG GAAGAATGTG TGGGGGCTGC AGCACTGTCC 11161 TGAGGCTGAG GAAGGGGCCG AGGGAAACAA ATGAAGACCC AGGCTGAGCT CCTGAAGATG 11221 CCCGTGATTC ACTGACACGG GACGGTGGGC AAACAGCAAA GCCAGGCAGG GGCTGCTGTG 11281 CAGCTGGCAC TTTCGGGGCC TCCCTTGAGG TTGTGTCACT GACCCTGAAT TTCAACTTTG 11341 CCCAAGACCT TCTAGACATT GGGCCTTGAT TTATCCATAC TGACACAGAA AGGTTTGGGC 11401 TAAGTTGTTT CAAAGGAATT TCTGACTCCT TCGATCTGTG AGATTTGGTG TCTGAATTAA 11461 TGAATGATTT CAGCTAAAGT GACACTTATT TTGGAAAACT AAAGGCGACC AATGAACAAC 11521 CTGCAGTTCC ATGAATGGCT GCATTATCTT GGGGTCTGGG CACTGTGAAG GTCACTGCCA 11581 GGGTCCGTGT CCTCAAGGAG CTTCAAGCCG TGTACTAGAA AGGAGAGAGC CCTGGAGGCA 11641 GACGTGGAGT GACGATGCTC TTCCCTGTTC TGAGTTGTGG GTGCACCTGA GCAGGGGGAG 11701 AGGCGCTTGT CAGGAAGATG GACAGAGGGG AGCCAGCCCC ATCAGCCAAA GCCTTGAGGA 11761 GGAGCAAGGC CTATGTGACA GGGAGGGAGA GGATGTGCAG GGCCAGGGCC GTCCAGGGGG 11821 AGTGAGCGCT TCCTGGGAGG TGTCCACGTG AGCCTTGCTC GAGGCCTGGG ATCAGCCTTA 11881 CAACGTGTCT CTGCTTCTCT CCCCTCCAGG CCGTGCATAA GGCTGTGCTG ACCATCGACG 11941 AGAAAGGGAC TGAAGCTGCT GGGGCCATGT TTTTAGAGGC CATACCCATG TCTATCCCCC 12001 CCGAGGTCAA GTTCAACAAA CCCTTTGTCT TCTTAATGAT TGAACAAAAT ACCAAGTCTC 12061 CCCTCTTCAT GGGAAAAGTG GTGAATCCCA CCCAAAAATA ACTGCCTCTC GCTCCTCAAC 12121 CCCTCCCCTC CATCCCTGGC CCCCTCCCTG GATGACATTA AAGAAGGGTT GAGCTGGTCC 12181 CTGCCTGCAT GTGATCTGTA AATCCCTGGG ATGTTTTCTC TG // LOCUS HUMA1GLY2 4944 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human alpha-1-acid glycoprotein 2 (AGP2) gene, complete cds. ACCESSION M21540 KEYWORDS alpha-1-acid glycoprotein; orosomucoid. SOURCE Human DNA, clones lambda-AGP-[1A,2A,6B]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4944) AUTHORS Merritt,C.M. and Board,P.G. TITLE Structure and characterisation of a duplicated human alpha-1 acid glycoprotein gene JOURNAL Gene 66, 97-106 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 1609 1722 alpha-1-acid glycoprotein 2, exon 1 /nomgen="ORM2" /map="9q31-qter" /hgml_locus_uid="LG0060L" 2138 2280 alpha-1-acid glycoprotein 2, exon 2 2499 2569 alpha-1-acid glycoprotein 2, exon 3 3274 3381 alpha-1-acid glycoprotein 2, exon 4 3532 3635 alpha-1-acid glycoprotein 2, exon 5 4776 4841 alpha-1-acid glycoprotein 2, exon 6 pre-msg 1594 > 4940 AGP2 mRNA and introns IVS 1723 2137 AGP2 intron A IVS 2281 2498 AGP2 intron B IVS 2570 3273 AGP2 intron C IVS 3382 3531 AGP2 intron D IVS 3636 4775 AGP2 intron E BASE COUNT 1206 A 1369 C 1292 G 1077 T ORIGIN 1 bp upstream of BamHI site; chromosome 9q31-qter. 1 GGATCCGCTG AAAAATGAAA CAGAAATGAG TCGTGATGGG CAGGGAGGGA GAAGCAAGGG 61 AGACGAGAAG TGGGGAACAT GGAAGGAAAA GCCACGTGAG GAAGAAACCA GAGGTCAAGA 121 GAAAAAGAAT CATGGAGGTA GAGGAAGCAA AAAACACACA TAACAAAGAA TGTGGACTTT 181 GGAGTCAAAC TAATGTGAGT CCAAACCCAG GCTCTCTCCC AAACCAGTTT GCGGCAGATG 241 GCCAGTGGAA CCTCACTCTC CTCATCAGTA AAAAGGGGGC AGAGTGAGGG TCCTGAGAGC 301 TAGTACAGGG ACTGTGTGAA GTAGACAATG CCCAGTGTTT AGCGTAAGAA TCAGGGTCCA 361 GCTGGTGCTC CCTAAACAGC AGCTGCTGTT CACTGTTGAA AGGCGCTCTG GAAGGCCAGG 421 CGCGGTGGCT CATGCTTGTA ATCCCAGCAC TGTGGGAGGC CGAGGTGGGC GGATCACCTG 481 AGGTAGGGAG TTCGAGACCA GCCTGACCAA CGTGGAGAAA CCCCATCTCT CCTAAAAATA 541 CAAAATTAGC CAGGCGTGGT AGCACATACC TGTAATCCCA GCGACTCGGG AGGCTGAGGC 601 AAGAGAATTG CTTGAAACCA GCAGGGGAGG TTGTGGTGAG CCAAGATCGA GCCATTGCAC 661 TCCAGCCAGG GCAACAAGAG GCAAAATGGC GAAACTCCAT CTCCGAGAAA AAAAAAAAAA 721 AAGAATACTT TCTGAAAGTA TTTATTCATA CAAATAAAGA CTTGACCCAT AAGGTAGGAA 781 CGCAAATGGG CCACGGAATC ACTCATTCCA CAGTATACAC CGAGTGCCCT TGAAGTGCTG 841 GGCACTGCTC CAGGATTGGG GGCATATTGG TGAAAAGAGA AGCAAGCCTG CCTGCTCAGA 901 TGGCAGGGAA TGGGGAAAAA CAGGGAGACA GTTTCCTGTT TGAGATGTTG GGAGTCTGCT 961 TCGAGTAGTA TATTTACTGG AAATAGACCA CTAACTTGGA TGTCCCTTTT TGGAAATGTG 1021 CCTGCGTCCA GGGCTGGGTT GGGGCCCCAA TGAACTTTGG CTCTGACATA GCTGTTGCCA 1081 CACTCAGTGG AACTGAATCC ATGTTTGCCT TCACCCGGCA TCCTTCACCC CAACTCTCCC 1141 CGCCACAACA TACATCCCAT GCCAGCCTGG GGACCCTCAA AGGTGCTTCA TCATTAGGTT 1201 TGTGGCTGGG TCCTACTGAA GTAAGTCTTG GCACTCAGAG GGATAGGAAT TGAATGAAGA 1261 CATGAGATTC CTCTGCGGGA GGCCTCTCTA GGAAATCTGT GGACTCACAC GTTTACTAAT 1321 GTTGCTGCAG CCCCGCACCC ACCTTGGCCT TGGGCAGCCA TACTCTAGGG CTTTTGTAAC 1381 CTCTCCATGT GAGGAACTCA AATTAGACCT GGGTTTGGAG GCGGTGCTCC GAGCTGGCCT 1441 TTGGGGGAGG TTTTGTGCGA GGCATTTCCC AAGTGCTGGC AGGATTGTGT CACAGACACA 1501 GAGTAAACTT TTGCTGGGCT CCAAGTGACC GCCCATAGTT TATTATAAAG GTGACTGCAC 1561 CCTGCAGCCA CCAGCACTGC CTGGCTCCAC GTGCCTCCTG GTCTCAGTAT GGCGCTGTCC 1621 TGGGTTCTTA CAGTCCTGAG CCTCCTACCT CTGCTGGAAG CCCAGATCCC ATTGTGTGCC 1681 AACCTAGTAC CGGTGCCCAT CACCAACGCC ACCCTGGACC GGGTGAGTGC CTGGGCTAGC 1741 CCTGTCCTGA GCACATGGGC AGCTGCCTCC CTTCTCTGGG CTTCCCTTTA CCTGCTGGCT 1801 GTGGTCGCAC CCCCACTCCC AGCTCTGCCT TTTTCTCTTC TGGGTCCCCA GGGTGAAATT 1861 CTCACCAGCC CAGGGGACTC TGGAGGCACC CCCTGCCTCC AAACACAGAA GCCTCACTGC 1921 AGAGTCCTTC ACGGAGGACG GTTCTGTGCT GGGCCTGGAG GGGCTGCCTG GGGGGCAATG 1981 ACTGATCCTC AGGGTGAGCT CCTGCATGCG CACTGCCCAC CAGGGGCCTC ATCTCCCCAT 2041 CTGCAAAATC AGGGAGAGAT CTGCCTGAGT CTCCTCCCAG CTGACAGTCA AAGATTCAGC 2101 ATCAAGCCCC CATCACCAGC TCCCCCCTTC TCCCCAGATC ACTGGCAAGT GGTTTTATAT 2161 CGCATCGGCC TTTCGAAACG AGGAGTACAA TAAGTCGGTT CAGGAGATCC AAGCAACCTT 2221 CTTTTACTTT ACCCCCAACA AGACAGAGGA CACGATCTTT CTCAGAGAGT ACCAGACCCG 2281 GTGAGAGCCC CCATTCCAAT GCACCCCCGA TCTCAGCTGT CTGGCCAGAA GACCTGAGCA 2341 AGTCCCTCCT TCTTCCTGGC CTTGGCCTTC CCATGGGTGG AACCGGGAGG GTTGGCTTTA 2401 ATCTCCACCA GAACTCTTGC CCCGGGACTG TGATGGGCGA TTGGCCACTT CTCCTCGATA 2461 ACATTACTGT TTTTCTTCCG CCTTCTGGTT GACTTTAGCC AGAACCAGTG CTTCTATAAC 2521 TCCAGTTACC TGAATGTCCA GCGGGAGAAT GGGACCGTCT CCAGATACGG TGAGGGCCAG 2581 CCCTCAGGCA GGAGGGTTCA CCGTGGGAAC AGGGCAGGCC AGCATAAGGT GGGGGCTGGA 2641 TGTAGAGCCC TGGAGGCTTT GGGCACAGAG AAATAACCAC TAACATTTTT GAGCTCTTAC 2701 CACGTGCTCA GAAAAAATCC CTAAGAAGAC ACTGAGAGAA TTAGATGAGG AAACATAAGA 2761 ACAGAGACCT CAAATAGTTT CCCCAAGGTC ACACAGCTTA TAATTAGAAC TAGAATTGGA 2821 ACTCCAGGCT GGCTTCAGAT CTGCCTCTCT CTCACGCCCT CTTTAAGATC CTTTGCAAAC 2881 CAATGGTAGA AGCCTGTATG TTGGAGAGGT GGTACCTTCA ACTATGTCCC CCATCACCGC 2941 AGAGGTGGCA CATGGCAGGG ATCTGATGGA GCTGAACTGA CATCATTTAG CATCCCGAGC 3001 CTCCTCTCTG GGCCTCATTT TCCTCCTCTG TAAAACGGGG AGAAAGGCCC TGACAGCCAC 3061 AGTCTGTGTG AGGCTCCTGA GATCTCATGT ACAGAAAGTG CTTGGCGTGG AGCTGGGCAC 3121 GCAGCAGGGG CTGGGCACAC GGTGGCCCAA AGGAGACCCG GGCCTTCACT GATGGGCTTT 3181 GTGGCCCCGG ACACACCTAG GACTCCTCAC CTGTAAGACA GGCACCATTG TGCCATCCCA 3241 TGTTCTCACC CAGAGGCTCT TTTTCTCTTC CAGAGGGAGG CCGAGAACAT GTTGCTCACC 3301 TGCTGTTCCT TAGGGACACC AAGACCTTGA TGTTTGGTTC CTACCTGGAC GATGAGAAGA 3361 ACTGGGGGCT GTCTTTCTAT GGTAGGCATG CTTAGCAGCC CCAAACTCAT GCCCCTCTCA 3421 GGCCTCACCC CCCATTCACC CACCCCTGGG CTGGCCCCTA GAACCCCAGC CCTCCCTGGC 3481 CTCCGCCGGG CCCCACCATG TCCCCAGTCA GTCTCCTTGC TCCCCCTGCA GCTGACAAGC 3541 CAGAGACGAC CAAGGAGCAA CTGGGAGAGT TCTACGAAGC TCTCGACTGC TTGTGCATTC 3601 CCAGGTCAGA TGTCATGTAC ACCGACTGGA AAAAGGTAAA CGCAAGGGAT TGGACATTGC 3661 CCACCTTGTC CATGGCCCAA CTTGGGCAGC CCCAGAGGCC CAGAGCAGGA AAGCTGCCAG 3721 GCAAGGCTGC ACAGCTAGGC AGATCTTCTG CTTTTAGGCA CCTGCCTCAC TGTAGGGACA 3781 GCTGAGCTCT ACAGAGGCCC AGGGGTGGTG GATGAGAGCC CAGGAGGGAG AAGTCCCTGT 3841 GAAACCAGGG AGGACCTGAA AGCTAACAGG AGGGAACAGC GTGAGCCACG GGGTTGGGGG 3901 ATTGGCAATT GGAGGGGACG TAATGCGGGG AGTTACCACC TACAGACGCG TCCCAAACCC 3961 CAGGCTTTCA CCCCAACCTC CACTCCCCGC TCATTTTTAA TACCCGTGCA GTGGGGAATT 4021 GATACTGTGG TTTTCAATGT CACCCACACT GCAGCACGGC CACAGTCACC ATCCCGATTT 4081 TTGCTACAAA TGAAAATTAC TGTATAATGA GCTCCTTAAC ACTTTTCTTT AAACCTGTGT 4141 TTGGAAGACT TGTGTTGGTG TGGCCCTGTG CCCTAATACC TGTGAAATCA CAGCACCGAT 4201 GAGCTGGTTC CAATTTTTAA AATATATACA TGCAGTACTT CCATGACTAT TCAAAGAAAA 4261 ACAATTCCTT CCATTTGCCA CCTGAGATGA CCACCAGGGA TGTGAACTAC CTCCTGCCCC 4321 ATCCCCAGCC CCAGGATCCT GGGACAGGGC TTATGAACGC AACCACTGTA GTCAGCTCAC 4381 TTGATCCACA GCCTGGCACC TCCACTGTCT GGCTAGGGAG CCTCGAATGG GTCCCAAGGC 4441 CACCCTGCTC CTCAGTTACA TCATCTGCAT AGTAGTGGTG GTTGTGAGGA ATTCAGGAGC 4501 TGCAGCATAA GGGCCCTGCA GGTACTATGT GCTCAGTAAA TGCCAGTGGT TCTTAAGGGT 4561 CTGAGCTCCC ATTGTAGAGG CAAGTAAGCT GAGGTTCAGA GACAGAAAAT GACTTGCCCA 4621 AGATCACCCA GCTGGGAAGT GACAGTGCCA GGGTTGGAGC CCTGGTTGAG CTGGTTCCAC 4681 AGGCCAGAGC TCATTCTGCC CTCTCCCCGG AAGACCTCCC ACCCTGTCCC CATGCCTCTG 4741 CTTCTCCCTC ACCCCAATTC CCCGCTGCCT TCTAGGATAA GTGTGAGCCA CTGGAGAAGC 4801 AGCACGAGAA GGAGAGGAAA CAGGAGGAGG GGGAATCCTA GCAGGACACA GCCTTGGATC 4861 AGGACAGAGA CTTGGGGGCC ATCCTGCCCC TCCAACCCGA CATGTGTACC TCAGCTTTTT 4921 CCCTCACTTG CATCAATAAA GCTT // LOCUS HUMA2PIG1 1658 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-2-plasmin inhibitor gene, exon 1. ACCESSION M20781 J03830 KEYWORDS alpha-2-plasmin inhibitor; plasmin inhibitor; serine protease inhibitor; serpin. SEGMENT 1 of 6 SOURCE Human placenta DNA, clones lambda-PI[1,2,6]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1658) AUTHORS Hirosawa,S., Nakamura,Y., Miura,O., Sumi,Y. and Aoki,N. TITLE Organization of the human alpha-2-plasmin inhibitor gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 6836-6840 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.Aoki, 28-SEP-1988. FEATURES from to/span description signal 1070 1075 TATA box pre-msg 1087 > 1658 A2PI mRNA and introns /hgml_locus_uid="LK0224R" /nomgen="PLI" /map="18p11.1-q11.2" IVS 1105 > 1658 A2PI intron A BASE COUNT 282 A 495 C 533 G 348 T ORIGIN 233 bp upstream of BamHI site. 1 CCATCCAGGC TAACATGGTA AAACCCCGTC TCTACCAAAA ATACAAAAAA TTAGCCAGGC 61 GTGGTGGTGG ACGCCTGTAG TCCCAGCTAT TCGGGCTGAG GCAAGAGAAT TGCTTGAACC 121 TGGGAGGTGG AGGTTGCAGT GAGCCGAGAT TGCACCACTG GCACCACTGC ACTCCAGCCT 181 GGGCGAAGCA ACTCCGTCTC AAAAAAAGAG AAACATCTTT AGCATTTTCT AAGGATCCCT 241 GGGGGACGGG AGGCAGGTGT GCGGTGAGTT GGGGGATTAG CTCCCAGGGC TCTTCCGTCA 301 GCTGCTGGGA CCCCAGATCC ACTGTGACTT TCCTTCCCAG GGAAGACCCT TCGCACAGTG 361 GAGCCGCTGG ACCGGGTGCC CCTGACTGCG GTGGCTGTCA TGCCCGCCCC CCACACCAGC 421 ATCACCATGG CCAGCTCTGA CTCTACCCTG CGCTTTGTGG ACTGCAAGCC TGGTCTGCAG 481 GTCAGGGGGG TCCAGTTCCC TGAGCACTCG CCTGGTTCTC TGGGGACCTG GCAAGGAGGA 541 GAGACTCCCC AAAAACAGGC CAGGATGTTG TTCTGGGGCC CTAGTTAGTT TCTCTTTGGT 601 GCTAGATCAC CCACAGCCAC ACATCCTGCG GGGCAGGACT CTGGCCTGTG AGGGTGGGGT 661 TCTGGCTTTT CATGCCCCCT GATGAGGGTC AGAGCTCAGG CCTTCCTGCT GTGTGGGCTT 721 GGGTGGTGGG CAGGGCCTTG GGGATGAGAT GGGAAGGTGG TGCCTCAGCT CAGCCGCCCT 781 CTCCCTGCAG CACGAGTTCC GACTGGGCGG TGGGCTGAAC CCTGGGCTTG TCCGTGCGGC 841 CATCAGCCCC AGTGGCCGTA GTGTCGTGGC CGGCTTCTCC TCAGGCTTCA TGGTGCTCCT 901 GGACACCCGC ACGGGCCTGG TTCTGCGAGC CCAGCCCACG AGGGGGACAG TTCTGCAGGT 961 CAAGGTGACG GGCCGGGTCT CCCTCCCCTT GCTGCCCAAC CCCCGCCCCT GTCCAGCCAT 1021 CACTGCTTAG GGTCTGCCTG CCCGGGTTCA GGCTCGAACT GGTCTGTCTT ATATACCTGG 1081 TCCAGGACTA ACTGGGCAGG GAGGGTAGCC CTCTCGGTCC ACCTTGGGAG CCAGTTGCAC 1141 TGCAGGGGTC TCAGCAGGAG GCAGGCCATG GGGGCGGGAG ACCGGGGCAG TGGGGCGGGA 1201 CGTATGGGGC AGGGCTGACC CTGTTGCTCA TGGCGATGTT CTAATGAGTA ACCCTTGTCC 1261 ATATTTGTCT TGCTTGGAGG ATCAGGGGTC AGGCCCTGTC CGTGACCTAG TTCAGGTTAA 1321 ATAAAGCTGA GCTGGGAGGC TGTTTACTGC CTGTAGACCG CTGAGCAGAG TCCATGGCCC 1381 TGCCGGGCTG CCCTGGCTGG GGGCAGGAGC ACGCCTGGGC ATGGGGATCC TGCCCCCGTC 1441 TCCTTCCCCT CCCCCTCCCT CCATTCTCAG TCACAGGCCT GGGGAGCTGC TCAGGGAGGC 1501 CCAGGAGGGA GGAACAGGGC TTCGTGGGAG GATTTCTGCA GGCAGCTGGG ACTTTCCTCC 1561 CCAGCTTGGC TGAGGCCAGG AGTCTTGCAT TGCCCTGCTG ACAGAGCTGT CGCTGGCTCC 1621 CTCCAGAGCC CCAGGGAACC CTCAGCTCAA GGTGCCCC // LOCUS HUMA2PIG2 2233 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-2-plasmin inhibitor gene, exons 2 to 5. ACCESSION M20782 J03830 KEYWORDS alpha-2-plasmin inhibitor; plasmin inhibitor; serine protease inhibitor; serpin. SEGMENT 2 of 6 SOURCE Human placenta DNA, clones lambda-PI[1,2,6]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2233) AUTHORS Hirosawa,S., Nakamura,Y., Miura,O., Sumi,Y. and Aoki,N. TITLE Organization of the human alpha-2-plasmin inhibitor gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 6836-6840 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.Aoki, 28-SEP-1988. FEATURES from to/span description pept 488 550 alpha-2-plasmin inhibitor precursor, exon 2 (first expressed exon) /hgml_locus_uid="LK0224R" /nomgen="PLI" /map="18p11.1-q11.2" 656 694 alpha-2-plasmin inhibitor precursor, exon 3 815 877 alpha-2-plasmin inhibitor precursor, exon 4 1188 + 1389 alpha-2-plasmin inhibitor precursor, exon 5 sigp 488 550 alpha-2-plasmin inhibitor signal peptide 656 694 alpha-2-plasmin inhibitor signal peptide 815 877 alpha-2-plasmin inhibitor signal peptide 815 829 alpha-2-plasmin inhibitor signal peptide matp 830 + 1389 alpha-2-plasmin inhibitor pre-msg < 1 > 2233 A2PI mRNA and introns IVS < 1 483 A2PI intron A IVS 551 655 A2PI intron B IVS 695 814 A2PI intron C IVS 878 1187 A2PI intron D IVS 1390 > 2233 A2PI intron E BASE COUNT 456 A 606 C 682 G 489 T ORIGIN About 6.0 kb downstream of segment 1. 1 AAAAATCCCA AAAAGACGGT CTTATTTTGG TCCTCACCAT GCATGTGAGA AGAGTGAGGG 61 ACTTGTGCCA CCGTTTTACA AGGTAAGGCC AAGCCTGGTG GAGTTACGGG AAATGCCAGG 121 TCCTTTGGCA AGAGGTAGCC TGGATTCAGA CACAGATCTG ATTCACAGCG CAGGGCCTTG 181 TAGAATGAGA ACGTTTTTGA TTTGGTATCT CCCTCCTATT CACCAAAACA CCCTCAGTGC 241 ATGAAATGCA TGAAATATGA AACACCAGAA ACTAAAAAGG GGGAGAAGCC AGGGCGGATG 301 TCCCAGCTGC AGGAGTGGGA GCCGCTGCTC GTGTGTTTGG GGTCTGTTCT GATTCTGAGC 361 CTGCTTCTTC CCCTTGGCAA TCATGACCCC AGGACTTGGC GTTATCTGTG ATCGCGTGGG 421 TAGGATTCCT GGCGGGCGTG GGGATGTGCA GATGGGAACA GAGCTTTCTG TCCCTGCCCA 481 CAGGAACATG GCGCTGCTCT GGGGGCTCCT GGTGCTCAGC TGGTCCTGCC TGCAAGGCCC 541 CTGCTCCGTG GTGAGCTGGT GAAGTGCAAG TGGGTGGGTG AGGGGAAGAA GAGGGCTTGG 601 CATGAGGAGG GCTTGGCTCC GAGGGGACCT CCTATCCTCA TCCCTTTCTC CACAGTTCTC 661 CCCTGTGAGC GCCATGGAGC CCTTGGGCCG GCAGGTACTG GGGAGTGAGG AGCCTGTGAT 721 GGGGGGAAGG TCCCGGGGGT CTCACTGGTG GCCTTGGGCA GGGTGGGGGG CCTGTGGGAA 781 GGGTCGGTCT CCATCTGCTT GCTCCTTTCC GCAGCTAACT AGCGGGCCGA ACCAGGAGCA 841 GGTGTCCCCA CTTACCCTCC TCAAGTTGGG CAACCAGGTA CAACCAGGTG GGGCTGGGGA 901 AGAGTGGGCG GGGCTAGAGG GAGGAGGGCC CATCGGCAGG GGTCGGGGGG TGGGGGCGCG 961 TGCTGAGGCT GAGGCTCTGG AGTCCAGAGG CCAGAAGGGA AAGGGTGGGG AGGACCGAAG 1021 GTGGGCGCCA GGCCCCAGAA TGCCAGTGCC CTCCGTCTGA CGCTCCCTCT TCCCTGGGGC 1081 TGGGACAAGG CCCTGCTGTC CTCAGGCACA GGGGCTGTGA CAAGGCCTTC AACACAGAAC 1141 CTGGAGCTGA CCCCTTGACC TCCCTGACCC CTGATCTGTC CCTGCAGGAG CCTGGTGGCC 1201 AGACTGCCCT GAAGAGTCCC CCAGGAGTCT GCAGCAGAGA CCCCACCCCA GAGCAGACCC 1261 ACAGGCTGGC CCGGGCCATG ATGGCCTTCA CTGCCGACCT GTTCTCCCTG GTGGCTCAAA 1321 CGTCCACCTG CCCCAACCTC ATCCTGTCAC CCCTGAGTGT GGCCCTGGCG CTGTCTCACC 1381 TGGCACTAGG TACCCTGGCA CCACTTGTCC AGACCAAGAG ACTGGGAGGC CAGGAACTCA 1441 GTACTCCAGT GGTTCTCCGC GGGCGTTCCT CCACCAGGGT CACGTGGCTG TTTGGTAAAA 1501 ATGCGAGATT CCTAGGCCGG GGCGGTGGCT CACGCCTGTA ATCCCAACAC TTTGGAGGCT 1561 GAGGCGGGTG GATCACGAGG TCAGGAGTTC AAGACCAGCC TGGCCAACAT GTGAAACTCT 1621 CTCTACTAAA AATACAAAAA ATTTAGCTGT GCGTGGTGGT GCGCACCTGT AATTCCAGCT 1681 ATTCAGGAGG CTGAGGCAGA GAACTGTTTG AACCTGGGAG TTGGAGGTTA CAGTGAGCCG 1741 AGATGGCGCC ACTGCACTCC AGCCTGGGTG ACAGAGCAAG ATTCCGTCTC AAACAACAAC 1801 AACAACAAAT GCAGATTCCT GGGCCCCCAC CCATCTGTCT ATGTGAATCA GATCTCTGGG 1861 CCGGGGAATC TGCTTATTTA CAAGTCCTCC TGGTGATTTT TTTTTTTTTT TTGAGACAGA 1921 GTCTTGCCTC GTCACCCAGG CTAGAGTGCA GTGGTGTGAT CTAGCTCACT GCAACCTCTG 1981 TCTCCCAGGT TCAAGCAATT CTCCTGCCTC AGCCTCCCAA ATAGCTGGGA TCACAGGCAC 2041 CAGCCACCAT GCACAGCTGA TTTTTGTATT TTTAGTAGTA GAGAGGGGTT TCACCATGTT 2101 TGGCCAGGGT GGTCTCGAAC TCTCGACCTA AGGTGATCAA CTGCCTAGCT CCCAAAGTGC 2161 TGGGATTACA GGCGTGCGAC GCGCCCGGCC CCCTCCTGGT GATTCTTATG CAAGAGTTTG 2221 CTAGCTAATT TCC // LOCUS HUMA2PIG3 756 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-2-plasmin inhibitor gene, exons 6 and 7. ACCESSION M20783 J03830 KEYWORDS alpha-2-plasmin inhibitor; plasmin inhibitor; serine protease inhibitor; serpin. SEGMENT 3 of 6 SOURCE Human placenta DNA, clones lambda-PI[1,2,6]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 756) AUTHORS Hirosawa,S., Nakamura,Y., Miura,O., Sumi,Y. and Aoki,N. TITLE Organization of the human alpha-2-plasmin inhibitor gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 6836-6840 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.Aoki, 28-SEP-1988. FEATURES from to/span description pept + 136 279 alpha-2-plasmin inhibitor precursor, exon 6 /hgml_locus_uid="LK0224R" /nomgen="PLI" /map="18p11.1-q11.2" 426 + 629 alpha-2-plasmin inhibitor precursor, exon 7 matp + 136 279 alpha-2-plasmin inhibitor 426 + 629 alpha-2-plasmin inhibitor pre-msg < 1 > 756 A2PI mRNA and introns IVS < 1 135 A2PI intron E IVS 280 425 A2PI intron F IVS 630 > 756 A2PI intron G BASE COUNT 140 A 217 C 253 G 146 T ORIGIN About 1.5 kb downstream of segment 2. 1 AGATCCGTCG GCTGTGGAAG GATGGCTGTG GTCCCTGGAC GTCCTCGTCA CGGGTATCCA 61 GGAGGGACTG GAGTGGGCAG TCGGGGGTGA GGAAAGGACC CGCAGCCGGG CCTCAGCCTG 121 TGCGGTGCCC TCCAGGTGCT CAGAACCACA CGTTGCAGAG GCTGCAACAG GTGCTGCACG 181 CAGGCTCAGG GCCCTGCCTC CCCCATCTGC TGAGCCGCCT CTGCCAGGAC CTGGGCCCCG 241 GCGCGTTCCG ACTGGCTGCC AGGATGTACC TGCAGAAAGG TAGGCGCTGA TGGCAGGGAG 301 CTCCCTCAGT CCTGCCCTGG GTGGAGGAGG GTGAGAGCAA GGGGCTGGGC CTCTGGTAGC 361 GAGTAGGGGC GTGTCTGGCT GTGGAGCCTG GAGCCCTGGG AACAGCTTGT GCTGCCTCCG 421 TGCAGGATTT CCCATCAAAG AAGATTTCCT GGAACAATCC GAACAGCTAT TTGGGGCAAA 481 GCCCGTGAGC CTGACGGGAA AGCAGGAAGA TGACCTGGCA AACATCAACC AATGGGTGAA 541 GGAGGCCACG GAGGGGAAGA TTCAGGAATT CCTCTCTGGG CTGCCGGAAG ACACCGTGTT 601 GCTTCTCCTC AACGCCATCC ACTTCCAGGG TGCGCTCCTC CTCCTCTCAG ATCCCCCACC 661 CTGTAGGCTG AGCTGGGACG TGCAGGCCTT TTTGTTTTTT GAGACAAGTC TCGCTCTGTC 721 ACCCAGGGTG GAGCGCACTG GCGCGATCTG GTCTCA // LOCUS HUMA2PIG4 227 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-2-plasmin inhibitor gene, exon 8. ACCESSION M20784 J03830 KEYWORDS alpha-2-plasmin inhibitor; plasmin inhibitor; serine protease inhibitor; serpin. SEGMENT 4 of 6 SOURCE Human placenta DNA, clones lambda-PI[1,2,6]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 227) AUTHORS Hirosawa,S., Nakamura,Y., Miura,O., Sumi,Y. and Aoki,N. TITLE Organization of the human alpha-2-plasmin inhibitor gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 6836-6840 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.Aoki, 28-SEP-1988. FEATURES from to/span description pept + 57 + 199 alpha-2-plasmin inhibitor precursor, exon 8 /hgml_locus_uid="LK0224R" /nomgen="PLI" /map="18p11.1-q11.2" matp + 57 + 199 alpha-2-plasmin inhibitor pre-msg < 1 > 227 A2PI mRNA and introns IVS < 1 56 A2PI intron G IVS 200 > 227 A2PI intron H BASE COUNT 38 A 78 C 61 G 50 T ORIGIN About 1.0 kb downstream of segment 3. 1 CCTCCTCTCC AACTGGTCCC CGTCGACGTG ACCCCTGACC CTCTGCTGGG TTTCAGGTTT 61 CTGGAGGAAC AAGTTTGACC CGAGCCTTAC CCAGAGAGAC TCCTTCCACC TGGACGAGCA 121 GTTCACGGTG CCCGTGGAAA TGATGCAGGC CCGCACGTAC CCGCTGCGCT GGTTCTTGCT 181 GGAGCAGCCT GAGATCCAGG TCACCCTTGG TTCTCCAGCA GGCTGCC // LOCUS HUMA2PIG5 341 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-2-plasmin inhibitor gene, exon 9. ACCESSION M20785 J03830 KEYWORDS alpha-2-plasmin inhibitor; plasmin inhibitor; serine protease inhibitor; serpin. SEGMENT 5 of 6 SOURCE Human placenta DNA, clones lambda-PI[1,2,6]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 341) AUTHORS Hirosawa,S., Nakamura,Y., Miura,O., Sumi,Y. and Aoki,N. TITLE Organization of the human alpha-2-plasmin inhibitor gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 6836-6840 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.Aoki, 28-SEP-1988. FEATURES from to/span description pept + 84 + 288 alpha-2-plasmin inhibitor precursor, exon 9 /hgml_locus_uid="LK0224R" /nomgen="PLI" /map="18p11.1-q11.2" matp + 84 + 288 alpha-2-plasmin inhibitor pre-msg < 1 > 341 A2PI mRNA and introns IVS < 1 83 A2PI intron H IVS 289 > 341 A2PI intron I BASE COUNT 62 A 107 C 95 G 77 T ORIGIN About 3.0 kb downstream of segment 4. 1 TGCCTTAGGA GCACCTGCTG GCCCCACCCC CACTTAGCTT CGGGCCTTTC TGTCCTCATG 61 CTCTTCCCTT CCCTTTTCTG TAGGTGGCTC ATTTCCCCTT TAAGAACAAC ATGAGCTTTG 121 TGGTCCTTGT ACCCACCCAC TTTGAATGGA ACGTGTCCCA GGTACTGGCC AACCTGAGTT 181 GGGACACCCT GCACCCACCT CTGGTGTGGG AGAGGCCCAC CAAGGTCCGG CTGCCTAAGC 241 TGTATCTGAA ACACCAAATG GACCTGGTGG CCACCCTCAG CCAGCTGGGT AAGGAGGAGG 301 GTGCGGGCGA GCCCCGAGGT CAGGCTGGGC AGGGCGGGTA A // LOCUS HUMA2PIG6 1298 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-2-plasmin inhibitor gene, exon 10. ACCESSION M20786 J03830 KEYWORDS alpha-2-plasmin inhibitor; plasmin inhibitor; serine protease inhibitor; serpin. SEGMENT 6 of 6 SOURCE Human placenta DNA, clones lambda-PI[1,2,6]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1298) AUTHORS Hirosawa,S., Nakamura,Y., Miura,O., Sumi,Y. and Aoki,N. TITLE Organization of the human alpha-2-plasmin inhibitor gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 6836-6840 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by N.Aoki, 28-SEP-1988. FEATURES from to/span description pept + 128 540 alpha-2-plasmin inhibitor precursor, exon 10 /hgml_locus_uid="LK0224R" /nomgen="PLI" /map="18p11.1-q11.2" matp + 128 537 alpha-2-plasmin inhibitor pre-msg < 1 1298 A2PI mRNA and introns IVS < 1 127 A2PI intron I signal 1253 1258 poly-adenylation signal BASE COUNT 231 A 402 C 397 G 268 T ORIGIN About 1.0 kb downstream of segment 5. 1 TAGGAATGAA GCGGTATCTG TGAGTTCAAG CTGTTCCCTG GCCAGGATCT CAGACACCCT 61 CCAAAGCACC TCCAGGAGCC TGTGACGCCA AGGGCAGCTC TGACCACGCA TCTCTGGCCC 121 TGGGCAGGCC TGCAGGAGTT GTTCCAGGCC CCAGACCTGC GTGGGATCTC CGAGCAGAGC 181 CTGGTGGTGT CCGGCGTGCA GCATCAGTCC ACCCTGGAGC TCAGCGAGGT CGGCGTGGAG 241 GCGGCGGCGG CCACCAGCAT TGCCATGTCC CGCATGTCCC TGTCCTCCTT CAGCGTGAAC 301 CGCCCCTTCC TCTTCTTCAT CTTCGAGGAC ACCACAGGCC TTCCCCTCTT CGTGGGCAGC 361 GTGAGGAACC CCAACCCCAG TGCACCGCGG GAGCTCAAGG AACAGCAGGA TTCCCCGGGC 421 AACAAGGACT TCCTCCAGAG CCTGAAAGGC TTCCCCCGCG GAGACAAGCT TTTCGGCCCT 481 GACTTAAAAC TTGTGCCCCC CATGGAGGAG GATTACCCCC AGTTTGGCAG CCCCAAGTGA 541 GGGGCCGTGG CTGTGGCATC CAGAGTCCCT GCCTGGACCA GCCTCTCCAC TCATGTGACT 601 CTTTCCAACC GGCTTTGTGG CACTGGGGCA GGGGCCGGGG GCAGTCTGAG AGAGGCCATT 661 CTTTCCCAAC ACCTCTTGGG GAGTTTAGGG TGGGGGGGGG CGCGGCTGGG AGGAGGGCAG 721 GCATCGGGGA GCCGGGAGCC TGACCCTCAT CTTTCTTCCA AACAGGCTCA GAGGGTGTCC 781 TGCACCGGGG CCTGGGCAGG AGGGAGGTGC TTCTAGTTCT GCCAGGAGAC AGGTTAGCTG 841 CTCCCCACGT CAGCTGGGAC ACCCCGACTT TTGTTTACCA GAGAAAAAGG GAGGGGGAGA 901 GGGCTGCCTT TGGACTTGTC CCGGGACACC TAGGCTAGGG TGGGGAGAGA CGGGCCCTGG 961 TGGTGGCTCG GGAGGCGAAG CGTTGTCCTC AGCCCCGCGT GGAACTCGTG TCTGGCACAG 1021 CCTGGCTGTG GCCTAACCTG CCGAGAGTCC ATCAGCCTCC ATCCTACCCC CTGTGCCTTG 1081 TCACGCCAGA CTTCCCACGG CTCCTCGAGA TCCCAACACT GCCAGCATTT CCCTTCCTTC 1141 CTCTCCTGTC TCCCTCCTCT GCCCGGGAGC TCAGGAACCG AGGCAGGGAA GGATCCCATG 1201 AGCTCCTTAA GGCTCTTTTG TAAGGTTTTT GTAGTGATTT TTATGCCACC TGAATAAAGA 1261 ATGAATGGGC CTGGCTGGTT TGATGTCACC GTTCTGGG // LOCUS HUMABL1B 2795 bp ds-DNA PRI 15-JUN-1989 DEFINITION DNA sequence of the region containing exon 1b of the human c-abl gene. ACCESSION M17310 KEYWORDS abl oncogene; cellular oncogene; oncogene; proto-oncogene; translocation. SOURCE Human DNA, clone pHRR7.5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2277) AUTHORS Bernards,A., Rubin,C.M., Westbrook,C.A., Paskind,M. and Baltimore,D. TITLE The first intron in the human c-abl gene is at least 200 kilobases long and is a target for translocations in chronic myelogenous leukemia JOURNAL Mol. Cell. Biol. 7, 3231-3236 (1987) STANDARD full staff_review REFERENCE 2 (bases 2278 to 2795) AUTHORS Paskind,M. JOURNAL Unpublished (1987) Whitehead Institute, Cambridge, Massachusetts. STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1],[2] kindly submitted by A.Bernards (23-OCT-1987). FEATURES from to/span description pept 2027 / 2162 c-abl coding sequence, exon 1b /nomgen="ABL" /map="9q34" /hgml_locus_uid="LY0005H" pre-msg 751 > 2795 mRNA and introns IVS 2163 > 2795 c-abl, intron A rpt 1 70 Alu repeat BASE COUNT 586 A 735 C 773 G 701 T ORIGIN About 0.1 kb downstream from XhoI site; chromosome 9q34. 1 CCTCAGGTGA TCCGCCCGTC TCGGCCTCCC AAAGTGCTGG GATTACCGGC GTGAGCCACC 61 GCGCCCGGCC CGCTAGTTCT AATATTTATG CTTTTTGGAG GGTGAAGGGC TTGAAAGGCA 121 TCAAAAACCT TGGTTTCTAG GAGCGAGCCA AATTTTCTCC CTTAGAGACT GAAGGAGGTT 181 ATCCGACTCC GCTTCCAGGA AAATCCAATC CGCAATTTTT CCCTCGCTTT GGCTCTTCCA 241 GGTCTCACTA CTCGCGCCGG GGAGACGCGG TCCGGCCTAC GCTAGGCCGG CAAGAATCTA 301 CCAGGTCCCA GGAGGGCGGG GCCGAGCCCC AGAAGCGCCG GGGCCCGGGC CAGGAGTCTT 361 GGTGCAGGCG GACTCGGGCG CGTGCCATGG GACCACAACT CCCGGCACGC ACCGCGGACT 421 GCGCCGAACA CGGTTGGCTG GGAGATGCCA ACCCTACTGC TCCCACCGGC CAATGGGACG 481 GAGGAAAGAG GCCTGACCCG GATTCTCGGA GGCCAATGGC AATACAGAGG AGGCGGGTGC 541 TTCCTTACCA CTCACCTTGC GCTCAGAGAA GCCACATTTA AAAGGTCGTG CCGGGCCGGG 601 CTTTCGCAGG CAGTGGGACC AGAGCCGGGA GGGGCGGCGG TGTCCCGGGA AAGCGGCTCT 661 TGGGGCATCC GGGTCCCCTC GAGGCGGGCT CCGGACGTCG CCGTGGGCGG GCCGAGGGCG 721 GGGCCTGGCC TCGTTGTGGA GCGGCTCGTA ATCCATCATG GCGGCCGCGG GGTTTGGTGT 781 CTGTGCCTGA GCAGCGCTGG AGCCGGAGCC GGTTCCCGGG TCCTGCGGCT GAGGAGCCCC 841 TCCGCTGTCC ACGGCCCCTA CCGGCGGGGG GCGGCTGGGT CCCTCGGCGG AGCTCGGGAG 901 ATGTGACTGC CTGAGGGCGG TGGTGGTGTC AGCGTCCGGG GCCGGGGGAG GGGGTGTCTC 961 GGGCAGAGAC CCCCGGGCTT GGGGCAGCTG AGGCGGCCGG GCCTCCTCTA CACGGGGCCC 1021 GCCTTCCGCT GTCTGGGCCG CGAGAGTCCT TCGTCCCTTA CAGCCCCGCC CCGGCTTTGG 1081 GACACTGCGG GTGGTCTGTT TCCCCCAGCT TGGGACACCC CGTTTTCTGA GGCGTGGAAG 1141 AGCGTCGCCC CGGAGTAAGC TGCCCGTGCC GCGCCCCGAC AGCTTCCCTC AGCCCCAAGC 1201 CGCCCCTTAT TCCGGATCCC GGCCCCAACT TTGGCCACGG AGCCTCCCAT TCAAATCCCT 1261 CCCTTGCTGT CAAGGGGTCT CCCCTTCCCC CAAGGTGGCT CCCGCGAGCC TCTAATGCCC 1321 TGACTTCTTC CAATGTCACC TACGGCCCCC TTAGTCTCAG CTCAGCCAAA AACTTTAATG 1381 CAAAGGAAAA GTCTGGATTG GCTCCACAGG CCTTTTAAAA AGCGGACTTA AAAGTTGCTG 1441 GCAATGCATT CCTTTTCGTC AGAGTCGAGG GCAAACTCGC TGAAATCTGG GTGACCCGTG 1501 TCCTTTTCCG GAGAGCAAAG CAGAGAAGCG AGAGCGGCCA CTAGTTCGGC AGGAAATTTG 1561 TTGGAAGATG AAGAAGCTAA GATAGGGGGT TGGTGACTTC CACAGGAAAA GTTCTGGAGG 1621 AGTAGCCAAA GACCATCAGC GTTTCCTTTA TGTGTGAGAA TTGAAATGAC TAGCATTATT 1681 GACCCTTTTC AGCATCCCCT GTGAATATTT CTGTTTAGGT TTTTCTTCTT GAAAAGAAAT 1741 TGTTATTCAG CCCGTTTAAA ACAAATCAAG AAACTTTTGG GTAACATTGC AATTACATGA 1801 AATTGATAAC CGCGAAAATA ATTGGAACTC CTGCTTGCAA GTGTCAACCT AAAAAAAGTG 1861 CTTCCTTTTG TTATGGAAGA TGTCTTTCTG TGATTGACTT CAATTGCTGA CTTGTGGAGA 1921 TGCAGCGAAT GTGAAATCCC ACGTATATGC CATTTCCCTC TACGCTCGCT GACCGTTCTG 1981 GAAGATCTTG AACCCTCTTC TGGAAAGGGG TACCTATTAT TACTTTATGG GGCAGCAGCC 2041 TGGAAAAGTA CTTGGGGACC AAAGAAGGCC AAGCTTGCCT GCCCTGCATT TTATCAAAGG 2101 AGCAGGGAAG AAGGAATCAT CGAGGCATGG GGGTCCACAC TGCAATGTTT TTGTGGAACA 2161 TGGTGAGTGC TTTTCAAAAT TTCTGCTCAT GGTTTTCCTC ATGCATTCAT CTTAGGCCTT 2221 CAAGGAACTT TGAACAACAG TACTTGCGAC AGTTCCTTCC AATTCCACTT AATAAATTTG 2281 TTACTGTAGT TATCTCTTAG TGGAACTTTC TTTGTATAAG AAAAAGTTAC TTCGTGACTT 2341 CGGCTTTATT CAAAATCTAT TTGAGTTGCT TATTTCTTGC CTGATTCCTC TGCTAATTCA 2401 CTGTTGACCT TGGACAAGCC ACTTTACTGT TCCAGGGCTT TAGTTTCCTG AGGGGCTGGA 2461 CTTCAGTGGT TTCTTAGGGT CCCTGGTTCT ATGATTCTGC TCTAACCGAA GTCGTGGGCT 2521 CTTCTGACAA GCATGGATTC TTTACTGCTT TGTCTACAGA GAGCTTTTAT CATATTCTCA 2581 AAGTGGTTTG TGTTCCTGAG AAAGTTAAGA ATCATAGATA CTGGTTGACG CTAATATCCT 2641 TGACCTTTTC CTTCGTAAGT AGGAACTTGA AAATACTCAC TTTGGAGCCA TGTGGGAAAA 2701 ATCAAGTGGG GAAGCAGCAT TCCTTGTGAA TTTTAGATAG ACAGCTTCTG TCTTACCTTC 2761 CATAAGGAGT AATCTCTTCC TCGTTGATGA AGCTT // LOCUS HUMABLBC 90 bp ss-mRNA PRI 15-DEC-1988 DEFINITION Human abl gene, intron preceding exon II. ACCESSION M14755 KEYWORDS oncogene; protein kinase. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 90) AUTHORS Shtivelman,E., Lifshitz,B., Gale,R.P., Roe,B.A. and Canaani,E. TITLE Alternative splicing of RNAs transcribed from the human abl gene and from the bcr-abl fused gene JOURNAL Cell 47, 277-284 (1986) STANDARD simple staff_entry FEATURES from to/span description pept / 74 > 90 abl protein (AA at 76) /nomgen="ABL" /map="9q34" /hgml_locus_uid="LY0005H" IVS < 1 73 abl intron BASE COUNT 16 A 28 C 8 G 38 T ORIGIN 1 GAGAATAAAA CTAATTTTTT CTCCCAATTT TCTCTTCCTT TTTCTTTTTT CTGTTCCCCC 61 CTTTCTCTTC CAGAAGCCCT TCAGCGGCCA // LOCUS HUMACCYBA 3657 bp ds-DNA PRI 04-AUG-1986 DEFINITION Human cytoplasmic beta-actin gene, complete cds. ACCESSION X00351 M10278 J00074 KEYWORDS actin; beta-actin; beta-cytoplasmic actin. SOURCE Human: cDNA to mRNA, cultured epidermal cell library [1], clones pHF[-beta-A-1,-5,-7] [2]; fetal DNA, clone ACTB [3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 2484 to 3470; mRNA only) AUTHORS Hanukoglu,I., Tanese,N. and Fuchs,E. TITLE Complementary DNA sequence of a human cytoplasmic actin: Interspecies divergence of 3' non-coding regions JOURNAL J. Mol. Biol. 163, 673-678 (1983) STANDARD full staff_review REFERENCE 2 (bases 284 to 3657; mRNA only) AUTHORS Ponte,P., Ng,S.-Y., Engel,J., Gunning,P. and Kedes,L. TITLE Evolutionary conservation in the untranslated regions of actin mRNAs: DNA sequence of a human beta-actin cDNA JOURNAL Nucleic Acids Res. 12, 1687-1696 (1984) STANDARD full staff_review REFERENCE 3 (bases 1 to 3066) AUTHORS Ng,S.-Y., Gunning,P., Eddy,R., Ponte,P., Leavitt,J., Shows,T. and Kedes,L. TITLE Evolution of the functional human beta-actin gene and its multi-pseudogene family: Conservation of noncoding regions and chromosomal dispersion of pseudogenes JOURNAL Mol. Cell. Biol. 5, 2720-2732 (1985) STANDARD full staff_review COMMENT There are at least 20 different beta-actin gene sequences in the human genome [3]. FEATURES from to/span description pept 1157 1279 cytoplasmic beta actin, exon 2 (first expressed exon) /nomgen="ACTB" /map="7pter-q22" /hgml_locus_uid="LU0150X" 1414 1653 cytoplasmic beta actin, exon 3 2095 2533 cytoplasmic beta actin, exon 4 2629 2810 cytoplasmic beta actin, exon 5 2923 3066 cytoplasmic beta actin, exon 6 pre-msg 241 3657 actin mRNA IVS 319 1150 actin cds intron A IVS 1280 1413 actin cds intron B IVS 1654 2094 actin cds intron C IVS 2534 2628 actin cds intron D IVS 2811 2922 actin cds intron E revision 2279 2279 c in DNA [3]; g in mRNA [2] revision 3189 3191 ttg in [2]; tg in [1] BASE COUNT 609 A 1129 C 1110 G 802 T 7 others ORIGIN 144 bp upstream of AvaI site; chromosome 7. 1 CCCAGCACCC CAAGGCGGCC AACGCCAAAA CTCTCCCTCC TCCTCTTCCT CAATNCTCGC 61 TCTCGCTCTT TTTTTTTTTC GCAAAAGGAG GGGAGAGGGG GTAAAAAAAT GCTGCACTGT 121 CGGCGAAGCC GGTGAGTGAG CGGCGCGGGG CCAATCGCGT GCGCCGTTCC GAAAGTTGCC 181 TTTTATGGCT CGAGCGGCCG CGGCGGCGCC CTATAAAACC CAGCGGCGCG ACGCGCCACC 241 ACCGCCGAGA CCGCGTCCGC CCCGCGAGCA CAGAGCCTCG CCTTTGCCGA TCCGCCGCCC 301 GTCCACACCC GCCGCCAGGT AAGCCCGGCC AGCCGACCGG GGCATGCGGC CGCGGCCCCT 361 TCGCCCGTGC AGAGCCGCCG TCTGGGCCGC AGCGGGGGGC GCATGGGGGG GGAACCGGAC 421 CGCCGTGGGG GGCGCGGGAG AAGCCCCTGG GCCTCCGGAG ATGGGGGACA CCCCACGCCA 481 GTTCGGAGGC GCGAGGCCGC GCTCGGGAGG CGCGCTCCGG GGGTGCCGCT CTCGGGGCGG 541 GGGCAACCGG CGGGGTCTTT GTCTGAGCCG GGCTCTTGCC AATGGGGATC GCAGGGTGGG 601 CGCGGCGTAG CCCCCGCCAG GCCCGGTGGG GGCTGGGGCG CCATGCGCGT GCGCGCTGGT 661 CCTTTGGGCG CTAACTGCGT GCGCGCYGGG AATTGGCGCT AATTGCGGCT GCGGCCYGGG 721 ACTCAAGGCG CTAATTGCGG CTGCGTTCTG GGGCCCGGGG TGCCGCGGCC YGGGCYGGGG 781 CGAAGGCGGG CTCGGTCGGA AGGGGTGGGG TCGCCGCGGC TCCCGGGCGC TTGCGCACTT 841 CCTGCCCGAG CCGCYGGCCG CCCGAGGGTG TGGCCGCTGC GTGCGCGCGC GCGACCCGGC 901 GCTGTTTGAA YCGGGCGGAG GCGGGGCTGG CGCCCGGTTG GGAGGGGGTT GGGGCCTGGC 961 TTCCTGCCGC GCGCCGCGGG GACGCCTCCG ACCAGTGTTT GCCTTTTATG GTAATAACGC 1021 GCCGGCCCGG CTTCCTTTAT CCCCAATCGT GCGCGCGCCG GCGCCCCCTA GCGGCCTAAG 1081 GACTCGGCGC GCCGGAAGTG GCCAGGGCGG GGGCGACTTC GGCTCACAGC GCGCCCGGCT 1141 ATTCTCGCAG CTCACCATGG ATGATGATAT CGCCGCGCTC GTCGTCGACA ACGGCTCCGG 1201 CATGTGCAAG GCCGGCTTCG CGGGCGACGA TGCCCCCCGG GCCGTCTTCC CCTCCATCGT 1261 GGGGCGCCCC AGGCACCAGG TAGGGGAGCT GGCTGGGTGG GGCAGCCCCG GGAGCGGGCG 1321 GGAGGCAAGG GCGCTTTCTC TGCACAGGAG CCTCCCGGTT TCCGGGGTGG GGGCTGCGCC 1381 GCTGCTCAGG GCTTCTTGTC CTTTCCTTCC CAGGGCGTGA TGGTGGGCAT GGGTCAGAAG 1441 GATTCCTATG TGGGCGACGA GGCCCAGAGC AAGAGAGGCA TCCTCACCCT GAAGTACCCC 1501 ATCGAGCACG GCATCGTCAC CAACTGGGAC GACATGGAGA AAATCTGGCA CCACACCTTC 1561 TACAATGAGC TGCGTGTGGC TCCCGAGGAG CACCCCGTGC TGCTGACCGA GGCCCCCCTG 1621 AACCCCAAGG CCAACCGCGA GAAGATGACC CAGGTGAGTG GCCCGCTACC TCTTCTGGTG 1681 GCCGCCTCCC TCCTTCCTGG CCTCCCGGAG CTGCGCCCTT TCTCACTGGT TCTCTCTTCT 1741 GCCGTTTTCC GTAGGACTCT CTTCTCTGAC CTGAGTCTCC TTTGGAACTC TGCAGGTTCT 1801 ATTTGCTTTT TCCCAGATGA GCTCTTTTTC TGGTGTTTGT CTCTCTGACT AGGTGTCTAA 1861 GACAGTGTTG TGGGTGTAGG TACTAACACT GGCTCGTGTG ACAAGGCCAT GAGGCTGGTG 1921 TAAAGCGGCC TTGGAGTGTG TATTAAGTAG GCGCACAGTA GGTCTGAACA GACTCCCCAT 1981 CCCAAGACCC CAGCACACTT AGCCGTGTTC TTTGCACTTT CTGCATGTCC CCCGTCTGGC 2041 CTGGCTGTCC CCAGTGGCTT CCCCAGTGTG ACATGGTGCA TCTCTGCCTT ACAGATCATG 2101 TTTGAGACCT TCAACACCCC AGCCATGTAC GTTGCTATCC AGGCTGTGCT ATCCCTGTAC 2161 GCCTCTGGCC GTACCACTGG CATCGTGATG GACTCCGGTG ACGGGGTCAC CCACACTGTG 2221 CCCATCTACG AGGGGTATGC CCTCCCCCAT GCCATCCTGC GTCTGGACCT GGCTGGCCCG 2281 GACCTGACTG ACTACCTCAT GAAGATCCTC ACCGAGCGCG GCTACAGCTT CACCACCACG 2341 GCCGAGCGGG AAATCGTGCG TGACATTAAG GAGAAGCTGT GCTACGTCGC CCTGGACTTC 2401 GAGCAAGAGA TGGCCACGGC TGCTTCCAGC TCCTCCCTGG AGAAGAGCTA CGAGCTGCCT 2461 GACGGCCAGG TCATCACCAT TGGCAATGAG CGGTTCCGCT GCCCTGAGGC ACTCTTCCAG 2521 CCTTCCTTCC TGGGTGAGTG GAGACTGTCT CCCGGCTCTG CCTGACATGA GGGTTACCCC 2581 TCGGGGCTGT GCTGTGGAAG CTAAGTCCTG CCCTCATTTC CCTCTCAGGC ATGGAGTCCT 2641 GTGGCATCCA CGAAACTACC TTCAACTCCA TCATGAAGTG TGACGTGGAC ATCCGCAAAG 2701 ACCTGTACGC CAACACAGTG CTGTCTGGCG GCACCACCAT GTACCCTGGC ATTGCCGACA 2761 GGATGCAGAA GGAGATCACT GCCCTGGCAC CCAGCACAAT GAAGATCAAG GTGGGTGTCT 2821 TTCCTGCCTG AGCTGACCTG GGCAGGTCAG CTGTGGGGTC CTGTGGTGTG TGGGGAGCTG 2881 TCACATCCAG GGTCCTCACT GCCTGTCCCC TTCCCTCCTC AGATCATTGC TCCTCCTGAG 2941 CGCAAGTACT CCGTGTGGAT CGGCGGCTCC ATCCTGGCCT CGCTGTCCAC CTTCCAGCAG 3001 ATGTGGATCA GCAAGCAGGA GTATGACGAG TCCGGCCCCT CCATCGTCCA CCGCAAATGC 3061 TTCTAGGCGG ACTATGACTT AGTTGCGTTA CACCCTTTCT TGACAAAACC TAACTTGCGC 3121 AGAAAACAAG ATGAGATTGG CATGGCTTTA TTTGTTTTTT TTGTTTTGTT TTGGTTTTTT 3181 TTTTTTTTTT GGCTTGACTC AGGATTTAAA AACTGGAACG GTGAAGGTGA CAGCAGTCGG 3241 TTGGAGCGAG CATCCCCCAA AGTTCACAAT GTGGCCGAGG ACTTTGATTG CACATTGTTG 3301 TTTTTTTAAT AGTCATTCCA AATATGAGAT GCATTGTTAC AGGAAGTCCC TTGCCATCCT 3361 AAAAGCCACC CCACTTCTCT CTAAGGAGAA TGGCCCAGTC CTCTCCCAAG TCCACACAGG 3421 GGAGGTGATA GCATTGCTTT CGTGTAAATT ATGTAATGCA AAATTTTTTT AATCTTCGCC 3481 TTAATACTTT TTTATTTTGT TTTATTTTGA ATGATGAGCC TTCGTGCCCC CCCTTCCCCC 3541 TTTTTGTCCC CCAACTTGAG ATGTATGAAG GCTTTTGGTC TCCCTGGGAG TGGGTGGAGG 3601 CAGCCAGGGC TTACCTGTAC ACTGACTTGA GACCAGTTGA ATAAAAGTGC ACACCTT // LOCUS HUMACCYBB 3646 bp ds-DNA PRI 04-AUG-1986 DEFINITION Human cytoplasmic beta-actin gene, complete cds. ACCESSION M10277 KEYWORDS actin; beta-cytoplasmic actin; cytoplasmic actin. SOURCE Human DNA library from HUT-14 cell line, clone lambda-Ha160. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3649) AUTHORS Nakajima-Iijima,S., Hamada,H., Reddy,P. and Kakunaga,T. TITLE Molecular structure of the human cytoplasmic beta-actin gene: Interspecies homology of sequences in the introns JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 6133-6137 (1985) STANDARD full staff_review COMMENT A potential cap site was found at position 239. Through interspecies sequence comparison, a second potential cap site was found at positions 242-244. FEATURES from to/span description pept 1093 1215 cytoplasmic beta actin, exon 2 (first expressed exon) /nomgen="ACTB" /map="7pter-q22" /hgml_locus_uid="LU0250X" 1348 1587 cytoplasmic beta actin, exon 3 2029 2467 cytoplasmic beta actin, exon 4 2563 2744 cytoplasmic beta actin, exon 5 2857 3000 cytoplasmic beta actin, exon 6 pre-msg < 319 3589 actin mRNA IVS 320 1086 actin intron A IVS 1216 1347 actin intron B IVS 1588 2028 actin intron C IVS 2468 2562 actin intron D IVS 2745 2856 actin intron E BASE COUNT 613 A 1117 C 1102 G 814 T ORIGIN 145 bp upstream of AvaI site. 1 GCCCAGCACC CCAAGGCGGC CAACGCCAAA ACTCTCCCTC CTCCTCTTCC TCAATCTCGC 61 TCTCGCTCTT TTTTTTTTTC GCAAAAGGAG GGGAGAGGGG GTAAAAAAAT GCTGCACTGT 121 GCGGCGAAGC CGGTGAGTGA GCGGCGCGGG GCCAATCAGC GTGCGCCGTT CCGAAAGTTG 181 CCTTTTATGG CTCGAGCGGC CGCGGCGGCG CCCTATAAAA CCCAGCGGCG CGACGCGCCA 241 CCACCGCCGA GACCGCGTCC GCCCGCGAGC ACAGAGCCTC GCCTTTGCCG ATCCGCCGCC 301 CGTCCACACC CGCCGCCAGG TAAGCCCGGC CAGCCGACCG GGGCATGCGG CCGCGGCCCT 361 TCGCCCGTGC AGAGCCGCCG TCTGGGCCGC AGCGGGGGGC GCATGGGGCG GAACCGGACC 421 GCCGTGGGGG GCGCGGGAGA AGCCCCTGGG CCTCCGGAGA TGGGGGACAC CCCACGCCAG 481 TTCGCAGGCG CGAGGCCGCG CTCGGGCGGG CGCGCTCCGG GGGTGCCGCT CTCGGGGCGG 541 GGGCAACCGG CGGGGTCTTT GTCTGAGCCG GGCTCTTGCC AATGGGGATC GCACGGTGGG 601 CGCGGCGTAG CCCCCGTCAG GCCCGGTGGG GGCTGGGGCG CCATGCGCGT GCGCGCTGGT 661 CCTTTGGGCG CTAACTGCGT GCGCGCTGGG AATTGGCGCT AATTGCGCGT GCGCGCTGGG 721 ACTCAATGGC GCTAATCGCG CGTGCGTTCT GGGGCCCGGG CGCTTGCGCC ACTTCCTGCC 781 CGAGCCGCTG GCGCCCGAGG GTGTGGCCGC TGCGTGCGCG CGCGCGACCC GGTCGCTGTT 841 TGAACCGGGC GGAGGCGGGG CTGGCGCCCG GTTGGGAGGG GGTTGGGGCC TGGCTTCCTG 901 CCGCGCGCCG CGGGGACGCC TCCGACCAGT GTTTGCCTTT TATGGTAATA ACGCGGCCGG 961 CCCGGCTTCC TTTGTCCCCA ATCTGGGCGC GCGCCGGCGC CCCCTGGCGG CCTAAGGACT 1021 CGGCGCGCCG GAAGTGGCCA GGGCGGGGGC GACTTCGGCT CACAGCGCGC CCGGCTATTC 1081 TCGCAGCTCA CCATGGATGA TGATATCGCC GCGCTCGTCG TCGACAACGG CTCCGGCATG 1141 TGCAAGGCCG GCTTCGCGGG CGACGATGCC CCCCGGGCCG TCTTCCCCTC CATCGTGGGG 1201 CGCCCCAGGC ACCAGGTAGG GGAGCTGGCT GGGTGGGGCA GCCCCGGGAG CGGGCGGGAG 1261 GCAAGGGCGC TTTCTCTGCA CAGGAGCCTC CCGGTTTCCG GGGTGGGCTG CGCCCGTGCT 1321 CAGGGCTTCT TGTCCTTTCC TTCCCAGGGC GTGATGGTGG GCATGGGTCA GAAGGATTCC 1381 TATGTGGGCG ACGAGGCCCA GAGCAAGAGA GGCATCCTCA CCCTGAAGTA CCCCATCGAG 1441 CACGGCATCG TCACCAACTG GGACGACATG GAGAAAATCT GGCACCACAC CTTCTACAAT 1501 GAGCTGCGTG TGGCTCCCGA GGAGCACCCC GTGCTGCTGA CCGAGGCCCC CCTGAACCCC 1561 AAGGCCAACC GCGAGAAGAT GACCCAGGTG AGTGGCCCGC TACCTCTTCT GGTGGCCGCC 1621 TCCCTCCTTC CTGGCCTCCC GGAGCTGCGC CCTTTCTCAC TGGTTCTCTC TTCTGCCGTT 1681 TTCCGTAGGA CTCTCTTCTC TGACCTGAGT CTCCTTTGGA ACTCTGCAGG TTCTATTTGC 1741 TTTTTCCCAG ATGAGCTCTT TTTCTGGTGT TTGTCTCTCT GACTAGGTGT CTGAGACAGT 1801 GTTGTGGGTG TAGGTACTAA CACTGGCTCG TGTGACAAGG CCATGAGGCT GGTGTAAAGC 1861 GGCCTTGGAG TGTGTATTAA GTAGGCGCAC AGTAGGTCTG AACAGACTCC CCATCCCAAG 1921 ACCCCAGCAC ACTTAGCCGT GTTCTTTGCA CTTTCTGCAT GTCCCCCGTC TGGCCTGGCT 1981 GTCCCCAGTG GCTTCCCCAG TGTGACATGG TGCATCTCTG CCTTACAGAT CATGTTTGAG 2041 ACCTTCAACA CCCCAGCCAT GTACGTTGCT ATCCAGGCTG TGCTATCCCT GTACGCCTCT 2101 GGCCGTACCA CTGGCATCGT GATGGACTCC GGTGACGGGG TCACCCACAC TGTGCCCATC 2161 TACGAGGGGT ATGCCCTCCC CCATGCCATC CTGCGTCTGG ACCTGGCTGG CCGGGACCTG 2221 ACTGACTACC TCATGAAGAT CCTCACCGAG CGCGGCTACA GCTTCACCAC CACGGCCGAG 2281 CGGGAAATCG TGCGTGACAT TAAGGAGAAG CTGTGCTACG TCGCCCTGGA CTTCGAGCAA 2341 GAGATGGCCA CGGCTGCTTC CAGCTCCTCC CTGGAGAAGA GCTACGAGCT GCCTGACGGC 2401 CAGGTCATCA CCATTGGCAA TGAGCGGTTC CGCTGCCCTG AGGCACTCTT CCAGCCTTCC 2461 TTCCTGGGTG AGTGGAGACT GTCTCCCGGC TCTGCCTGAC ATGAGGGTTA CCCCTCGGGG 2521 CTGTGCTGTG GAAGCTAAGT CCTGCCCTCA TTTCCCTCTC AGGCATGGAG TCCTGTGGCA 2581 TCCACGAAAC TACCTTCAAC TCCATCATGA AGTGTGACGT GGACATCCGC AAAGACCTGT 2641 ACGCCAACAC AGTGCTGTCT GGCGGCACCA CCATGTACCC TGGCATTGCC GACAGGATGC 2701 AGAAGGAGAT CACTGCCCTG GCACCCAGCA CAATGAAGAT CAAGGTGGGT GTCTTTCCTG 2761 CCTGAGCTGA CCTGGGCAGG TCAGCTGTGG GGTCCTGTGG TGTGTGGGGA GCTGTCACAT 2821 CCAGGGTCCT CACTGCCTGT CCCCTTCCCT CCTCAGATCA TTGCTCCTCC TGAGCGCAAG 2881 TACTCCGTGT GGATCGGCGG CTCCATCCTG GCCTCGCTGT CCACCTTCCA GCAGATGTGG 2941 ATCAGCAAGC AGGAGTATGA CGAGTCCGGC CCCTCCATCG TCCACCGCAA ATGCTTCTAG 3001 GCGGACTATG ACTTAGTTGC GTTACACCCT TTCTTGACAA AACCTAACTT GCGCAGAAAA 3061 CAAGATGAGA TTGGCATGGC TTTATTTGTT TTTTTTGTTT TGTTTTGGTT TTTTTTTTTT 3121 TTTTGGCTTG ACTCAGGATT TAAAAACTGG AACGGTGAAG GTGACAGCAG TCGGTTGGAG 3181 CGAGCATCCC CCAAAGTTCA CAATGTGGCC GAGGACTTTG ATTGCATTGT TGTTTTTTTA 3241 ATAGTCATTC CAAATATGAG ATGCATTGTT ACAGGAAGTC CCTTGCCATC CTAAAAGCCA 3301 CCCCACTTCT CTCTAAGGAG AATGGCCCAG TCCTCTCCCA AGTCCACACA GGGGAGGTGA 3361 TAGCATTGCT TTCGTGTAAA TTATGTAATG CAAAATTTTT TTAATCTTCG CCTTAATACT 3421 TTTTTATTTT GTTTTATTTT GAATGATGAG CCTTCGTGCC CCCCCTTCCC CCTTTTTGTC 3481 CCCCAACTTG AGATGTATGA AGGCTTTTGG TCTCCCTGGG AGTGGGTGGA GGCAGCCAGG 3541 GCTTACCTGT ACACTGACTT GAGACCAGTT GAATAAAAGT GCACACCTTA AAAATGAGGC 3601 CAAGTGTGAC TTTGTGGTGT GGCTGGGTTG GGGGCAGCAG AGGGTG // LOCUS HUMACHRA1 198 bp ds-DNA PRI 16-JUN-1986 DEFINITION Human acetylcholine receptor alpha-subunit gene, exon P1. ACCESSION X02502 KEYWORDS acetylcholine receptor. SEGMENT 1 of 7 SOURCE Human fetal liver DNA (library of T.Maniatis), clones lambda-hACR-alpha-[1,22,31]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 198) AUTHORS Noda,M., Furutani,Y., Takahashi,H., Toyosato,M., Tanabe,T., Shimizu,H., Kikyotani,S., Kayano,T., Hirose,T., Inayama,S. and Numa,S. TITLE Cloning and sequence analysis of calf cDNA and human genomic DNA encoding alpha-subunit precursor of muscle acetylcholine receptor JOURNAL Nature 305, 818-823 (1983) STANDARD full staff_review FEATURES from to/span description pept 46 + 88 acetylcholine receptor alpha-SU, exon P1 /nomgen="CHRNA" /map="2q24-q32" /hgml_locus_uid="LV0016B" sigp 46 + 88 acetylcholine receptor alpha-SU signal pept IVS 89 > 198 AChR intron A BASE COUNT 38 A 60 C 56 G 44 T ORIGIN 6 bp upstream of HaeIII site. 1 CACAGGCCAC CACTCTGCCC TGGTCCACAC AAGCTCCGGT AGCCCATGGA GCCCTGGCCT 61 CTCCTCCTGC TCTTTAGCCT TTGCTCAGGT AAGTGCTGGG GTCAGGGGTG GGGAGAGGGC 121 TCTCCTTTGA GGCTAAAGTG CTGGAGGCCC CCAAATCAAA GTCTTGCTTC TTTGATGAGA 181 CCACAGAATG GCAGAGCC // LOCUS HUMACHRA2 513 bp ds-DNA PRI 16-JUN-1986 DEFINITION Human acetylcholine receptor alpha-subunit gene, exons P2 and P3. ACCESSION X02503 KEYWORDS acetylcholine receptor. SEGMENT 2 of 7 SOURCE Human fetal liver DNA (library of T.Maniatis), clones lambda-hACR-alpha-[1,22,31]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 513) AUTHORS Noda,M., Furutani,Y., Takahashi,H., Toyosato,M., Tanabe,T., Shimizu,H., Kikyotani,S., Kayano,T., Hirose,T., Inayama,S. and Numa,S. TITLE Cloning and sequence analysis of calf cDNA and human genomic DNA encoding alpha-subunit precursor of muscle acetylcholine receptor JOURNAL Nature 305, 818-823 (1983) STANDARD full staff_review FEATURES from to/span description pept + 71 216 acetylcholine receptor alpha-SU, exon P2 /nomgen="CHRNA" /map="2q24-q32" /hgml_locus_uid="LV0016B" 328 + 372 acetylcholine receptor alpha-SU, exon P3 sigp + 71 87 acetylcholine receptor alpha-SU signal pept matp 88 216 acetylcholine receptor alpha-SU mature pept 328 + 372 acetylcholine receptor alpha-SU mature pept IVS < 1 70 AChR intron A IVS 217 327 AChR intron B IVS 373 > 513 AChR intron C BASE COUNT 130 A 122 C 132 G 129 T ORIGIN About 5 kb after segment 1. 1 TTCCTCATGT TCATGTGTTT GGAGGGTGGG GAGAAGAATC TGACAGTGTT GGTGTCATCT 61 GCTGTCTCAG CTGGCCTCGT CCTGGGCTCC GAACATGAGA CCCGTCTGGT GGCAAAGCTA 121 TTTAAAGACT ACAGCAGCGT GGTGCGGCCA GTGGAAGACC ACCGCCAGGT CGTGGAGGTC 181 ACCGTGGGCC TGCAGCTGAT ACAGCTCATC AATGTGGTGA GACAAGAGCA CTGGGGGCCT 241 GCCCACACAC CCACACCTCT CACCCCTGGG TTTCCTCCCC TGGGTCCCCT CTCTCCCCCA 301 TAAAAATGAT CCTTATTTTT TTTCCAGGAT GAAGTAAATC AGATCGTGAC AACCAATGTG 361 CGTCTGAAAC AGGTAACTTA GCCAGATAAT TGCATGTGTG TCGTTTTCAT TCACACTGAA 421 ATTAAACCTG GGCCCAGAGG AAATCAATAT GGGTAACAGG GATATATACC AAAATGTTGA 481 AAGTAGATAT TTTGAGATGG TGAAAGCATG GAT // LOCUS HUMACHRA3 229 bp ds-DNA PRI 16-JUN-1986 DEFINITION Human acetylcholine receptor alpha-subunit gene, exon P4. ACCESSION X02504 KEYWORDS acetylcholine receptor. SEGMENT 3 of 7 SOURCE Human fetal liver DNA (library of T.Maniatis), clones lambda-hACR-alpha-[1,22,31]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 229) AUTHORS Noda,M., Furutani,Y., Takahashi,H., Toyosato,M., Tanabe,T., Shimizu,H., Kikyotani,S., Kayano,T., Hirose,T., Inayama,S. and Numa,S. TITLE Cloning and sequence analysis of calf cDNA and human genomic DNA encoding alpha-subunit precursor of muscle acetylcholine receptor JOURNAL Nature 305, 818-823 (1983) STANDARD full staff_review FEATURES from to/span description pept + 87 + 196 acetylcholine receptor alpha-SU, exon P4 /nomgen="CHRNA" /map="2q24-q32" /hgml_locus_uid="LV0016B" matp + 87 + 196 acetylcholine receptor alpha-SU mature pept IVS < 1 86 AChR intron C IVS 197 > 229 AChR intron D BASE COUNT 62 A 43 C 53 G 71 T ORIGIN About 1.7 kb after segment 2. 1 TTCCCATGCA CATCAGGGCT CTTGATAACC TTAGATTTTA GAAAGTGTTT TTGGCTTTTT 61 AATTAGCTGT ATGTCTTTGT TTCTAGCAAT GGGTGGATTA CAACCTAAAA TGGAATCCAG 121 ATGACTATGG CGGTGTGAAA AAAATTCACA TTCCTTCAGA AAAGATCTGG CGCCCAGACC 181 TTGTTCTCTA TAACAAGTGA GCAAACTGCA TGGGTGGCGG AGGGTGCTC // LOCUS HUMACHRA4 363 bp ds-DNA PRI 16-JUN-1986 DEFINITION Human acetylcholine receptor alpha-subunit gene, exon 5. ACCESSION X02505 KEYWORDS acetylcholine receptor. SEGMENT 4 of 7 SOURCE Human fetal liver DNA (library of T.Maniatis), clones lambda-hACR-alpha-[1,22,31]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 363) AUTHORS Noda,M., Furutani,Y., Takahashi,H., Toyosato,M., Tanabe,T., Shimizu,H., Kikyotani,S., Kayano,T., Hirose,T., Inayama,S. and Numa,S. TITLE Cloning and sequence analysis of calf cDNA and human genomic DNA encoding alpha-subunit precursor of muscle acetylcholine receptor JOURNAL Nature 305, 818-823 (1983) STANDARD full staff_review FEATURES from to/span description pept + 80 + 275 acetylcholine receptor alpha-SU, exon P5 /nomgen="CHRNA" /map="2q24-q32" /hgml_locus_uid="LV0016B" matp + 80 + 275 acetylcholine receptor alpha-SU mature pept IVS < 1 79 AChR intron D IVS 276 > 363 AChR intron E BASE COUNT 85 A 97 C 81 G 100 T ORIGIN About 3.2 kb after segment 3. 1 TCTCCTGTTA ACGTCCCCAA GCAGAATGGA AGGCTCATCT GTCACTTGGA GCCATTTTCC 61 TCTTTTATCC CAATTGTAGT GCAGATGGTG ACTTTGCTAT TGTCAAGTTC ACCAAAGTGC 121 TCCTGCAGTA CACTGGCCAC ATCACGTGGA CACCTCCAGC CATCTTTAAA AGCTACTGTG 181 AGATCATCGT CACCCACTTT CCCTTTGATG AACAGAACTG CAGCATGAAG CTGGGCACCT 241 GGACCTACGA CGGCTCTGTC GTGGCCATCA ACCCGGTAGG TGGTGGCCAC ATATGATAAG 301 GGTGGTTTCA GGAAAATTGT GATTCGGGTC TCCAACTTTT CCTTGGACAA TCATAGGCTC 361 TCA // LOCUS HUMACHRA5 417 bp ds-DNA PRI 16-JUN-1986 DEFINITION Human acetylcholine receptor alpha-subunit gene, exon P6. ACCESSION X02506 KEYWORDS acetylcholine receptor. SEGMENT 5 of 7 SOURCE Human fetal liver DNA (library of T.Maniatis), clones lambda-hACR-alpha-[1,22,31]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 417) AUTHORS Noda,M., Furutani,Y., Takahashi,H., Toyosato,M., Tanabe,T., Shimizu,H., Kikyotani,S., Kayano,T., Hirose,T., Inayama,S. and Numa,S. TITLE Cloning and sequence analysis of calf cDNA and human genomic DNA encoding alpha-subunit precursor of muscle acetylcholine receptor JOURNAL Nature 305, 818-823 (1983) STANDARD full staff_review FEATURES from to/span description pept + 81 + 318 acetylcholine receptor alpha-SU, exon P6 /nomgen="CHRNA" /map="2q24-q32" /hgml_locus_uid="LV0016B" matp + 81 + 318 acetylcholine receptor alpha-SU mature pept IVS < 1 80 AChR intron E IVS 319 > 417 AChR intron F BASE COUNT 85 A 132 C 99 G 101 T ORIGIN About 500 bp after segment 4. 1 CTTGTCACTG CTGATTACAA AACGCTGCCC CTCATCAGGG TGACCTGCCT GGGATTTCCA 61 AACCTCACTT CCTTTCTCAG GAAAGCGACC AGCCAGACCT GAGCAACTTC ATGGAGAGCG 121 GGGAGTGGGT GATCAAGGAG TCCCGGGGCT GGAAGCACTC CGTGACCTAT TCCTGCTGCC 181 CCGACACCCC CTACCTGGAC ATCACCTACC ACTTCGTCAT GCAGCGCCTG CCCCTCTACT 241 TCATCGTCAA CGTCATCATC CCCTGCCTGC TCTTCTCCTT CTTAACTGGC CTGGTATTCT 301 ACCTGCCCAC AGACTCAGGT GGGTGTGGTT GCCATGACTG CTGCTGCTGA CGCTGATGGG 361 TCTCGTGCTG CTATTTTGGG GAAGCCAGAA ATAATCATGT GCTAACAAAC AGGCAGA // LOCUS HUMACHRA6 309 bp ds-DNA PRI 16-JUN-1986 DEFINITION Human acetylcholine receptor alpha-subunit gene, exon P7. ACCESSION X02507 KEYWORDS acetylcholine receptor. SEGMENT 6 of 7 SOURCE Human fetal liver DNA (library of T.Maniatis), clones lambda-hACR-alpha-[1,22,31]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 309) AUTHORS Noda,M., Furutani,Y., Takahashi,H., Toyosato,M., Tanabe,T., Shimizu,H., Kikyotani,S., Kayano,T., Hirose,T., Inayama,S. and Numa,S. TITLE Cloning and sequence analysis of calf cDNA and human genomic DNA encoding alpha-subunit precursor of muscle acetylcholine receptor JOURNAL Nature 305, 818-823 (1983) STANDARD full staff_review FEATURES from to/span description pept + 31 + 254 acetylcholine receptor alpha-SU, exon P7 /nomgen="CHRNA" /map="2q24-q32" /hgml_locus_uid="LV0016B" matp + 31 + 254 acetylcholine receptor alpha-SU mature pept IVS < 1 30 AChR intron F IVS 255 > 309 AChR intron G BASE COUNT 58 A 90 C 71 G 90 T ORIGIN About 3.2 kb after segment 5. 1 GGGATTTTTT TTTTTTTTTG GTCTTTTCAG GGGAGAAGAT GACTCTGAGC ATCTCTGTCT 61 TACTGTCTTT GACTGTGTTC CTTCTGGTCA TCGTGGAGCT GATCCCCTCC ACGTCCAGTG 121 CTGTGCCCTT GATTGGAAAA TACATGCTGT TCACCATGGT GTTCGTCATT GCCTCCATCA 181 TCATCACTGT CATCGTCATC AACACACACC ACCGCTCACC CAGCACCCAT GTCATGCCCA 241 ACTGGGTGCG GAAGGTGAGT GGGGAGGCCC CTGCAGACTT CACACGGAGG CACATCTACA 301 GGCCTCCCC // LOCUS HUMACHRA7 2318 bp ds-DNA PRI 16-JUN-1986 DEFINITION Human acetylcholine receptor alpha-subunit gene, exons P8 and P9. ACCESSION X02508 KEYWORDS acetylcholine receptor. SEGMENT 7 of 7 SOURCE Human fetal liver DNA (library of T.Maniatis), clones lambda-hACR-alpha-[1,22,31]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2318) AUTHORS Noda,M., Furutani,Y., Takahashi,H., Toyosato,M., Tanabe,T., Shimizu,H., Kikyotani,S., Kayano,T., Hirose,T., Inayama,S. and Numa,S. TITLE Cloning and sequence analysis of calf cDNA and human genomic DNA encoding alpha-subunit precursor of muscle acetylcholine receptor JOURNAL Nature 305, 818-823 (1983) STANDARD full staff_review COMMENT EMBL features not translated to GenBank features: key from to description SITE 1239 1244 pot. polyA signal RPT 1767 1815 49 bp direct repeat 1 RPT 1816 1864 49 bp direct repeat 1' FEATURES from to/span description pept + 41 280 acetylcholine receptor alpha-SU, exon P8 /nomgen="CHRNA" /map="2q24-q32" /hgml_locus_uid="LV0016B" 605 736 acetylcholine receptor alpha-SU, exon P9 matp + 41 280 acetylcholine receptor alpha-SU mature pept 605 733 acetylcholine receptor alpha-SU mature pept IVS < 1 40 AChR intron G IVS 281 604 AChR intron H BASE COUNT 667 A 515 C 472 G 664 T ORIGIN About 1 kb after segment 6. 1 GGACAATAAT TCTCATGCAT GGATTTCTTT TTTATGTTAG GTTTTTATCG ACACTATCCC 61 AAATATCATG TTTTTCTCCA CAATGAAAAG ACCATCCAGA GAAAAGCAAG ACAAAAAGAT 121 TTTTACAGAA GACATTGATA TCTCTGACAT TTCTGGAAAG CCAGGGCCTC CACCCATGGG 181 CTTCCACTCT CCCCTGATCA AACACCCCGA GGTGAAAAGT GCCATCGAGG GCATCAAGTA 241 CATCGCAGAG ACCATGAAGT CAGACCAGGA GTCTAACAAT GTAAGCTTCG TGGCTTGAAA 301 TTCATGCCTC TGGGTTTAAA TGGTCAAATG TATGAGCACA GGTGGTGAGT CTGACTTAAC 361 AAGTAGGTGG CATACAACAG GCAGCTACTG ACCTAAGATG AACCCTGGTG ACCGCCTCTA 421 GACCACCTTA GTAAGTTATC ACTTTGTAAA TAATCTATTC AAAAAATAAA GATGATGCCA 481 GCCAAGGGAT GGGAGGACTT TGAGATCTAC AAATTGAAGG GCCCATAGGT GATGCAAAGC 541 ACAAACGTTG TGAGTCCAGA AACCACCTTA GGGAGACTCT GTCTAACATT TTCCTCTCTC 601 CCAGGCGGCG GCAGAGTGGA AGTACGTTGC AATGGTGATG GACCACATAC TCCTCGGAGT 661 CTTCATGCTT GTTTGCATCA TCGGAACCCT AGCCGTGTTT GCAGGTCGAC TCATTGAATT 721 AAATCAGCAA GGATGAGCAG AAAATGAGCT GAGCTTAGCT CTGCCCTGGA ACCTACCAGA 781 GCAGAGAAGG GCAGGAGAGG AAGATTTGTC TACTTGCTCC ACTCGCACTT ATCAAACGTG 841 TTATATTCCA TACTTATTAT TGATGATAAG ATTTACCTTT ATGTAAGTTT ATGGCCTTGA 901 AGTGTTTTCA TATTGCTTCT CCCTTTAGTT CTGCTGTCTC CCTGAAGAGT GAACCCTCTT 961 TAGTAAATGA AACTAATCAC TAAAAAAAGT GTTCATTTCC AGTGTCTGGA AGAGTTTTTG 1021 CCAGGATAAC CGAGGTTTTC TGTTGCATTG TCATTGCTGT TATATTTATA TATTTACTTA 1081 TTTTGTCTAA CCACGAGACC ACTAGGGAGT TGCTGATGTT CTAAATGAAG TGTATTCTTT 1141 GCTTAAGAGT CAGTTTTGTA CCTACTAAGA AAACCCCATC TACCAGTCCT AAAGAGATAA 1201 GGCATTTGAT TTGGAGAGAG AGGAAGCAGT AGAAATGAAA TAAACTAAAA TGCAAACTGC 1261 ATCAAAAATC CATGGGTTTT TTTGTTTGTT TGATTGTTTT TTGAGACAGG GTCTTACTCT 1321 TATCACCCAG GCTAGAGTGC AGTGGCCCGA TCTCGTCTCA CTGCAGCCTC CACTTGCCAG 1381 ACTCAGGTGA TTCTTCTACC TCAGCCTCCC GGGGTAGCTG AGACTACAGG TGCATGTCAC 1441 CACACCCAAC TAACTTTTGT ATTTTTTGCA GAGACAGGGT TTTGCCACGT TGCCCAGGCT 1501 GGTCTTGAAC TCCTGGGCTC AAGTGATCCA CCCATCTTGG CCTCCCGAAG TGCTGGGATT 1561 ACAGCTGTGA GCCACTGCCC TTGACAAAAA AAATCCATGT TTTAAAGCAG CTTTCTAGAC 1621 CTGAGAAGTC AGAGAAGGTT TACCAGTCTC ATATGGCACC TGGGCTTGAT AAGGGTCTGA 1681 AAGGAAACAA GTCCAAACAA TAGGGCCTAG GCACTACGGA AATTGTCTTG CCTGTATGGG 1741 TTTACTTCAT CATCATCCCC TTCATAAAAT CCTAGAATGT CCCTATTTTA CAAATGAGAA 1801 AATTAAATTT CAGGGAAATC CTGTAATGTC CCTATTTTCC AAATGAGAAA ATTGAATTTC 1861 AGGGAGGTTA AGTTCTTTGC TCAAACTAGA AGAACTGGGA TTTGAATCCA GACTTTGAGA 1921 TTCCAAAGCA ACTCCAGGCT GGGTCAGTTC ATCTGTTCAC TGGCTCTCCC AGCCACCCCT 1981 GGCTCCATCT AGGCCATCCT GTAACTCTGC TCAGTGTTGT CAAGAATTTC AGTCTTAGCA 2041 AAAAACCATC TTTCTGTCTT CCCAATTAAA ATTTGTGCCT TCCACCCACC ACCAGCCCTT 2101 ATTCCAATCA TGTTTGAGGT TCAAGTCCTA TAAATGAAAC CAGAAATGTA TTACGCATCA 2161 TCACCATTTC TAAAATTCTG AAGAAGTAAG TAGAAGAGAA ATTGATTTCT TTTCACATTG 2221 TAAACTTTAA TCCACAGTAT AGTTTGCCCA AGAACTAGAA ACTGTGCAGC TACGGGCCAG 2281 GCTTAGTTGA TGGCCTCCAA ATCACCATAA CCCAACTG // LOCUS HUMACHRG1 709 bp ds-DNA PRI 16-JUN-1986 DEFINITION Human acetylcholine receptor gamma subunit gene, exons 1 and 2. ACCESSION X01715 KEYWORDS acetylcholine receptor. SEGMENT 1 of 8 SOURCE Human fetal liver DNA (library of T.Maniatis), clones lambda-hACR-gamma-[3,12] [1]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 709) AUTHORS Shibahara,S., Kubo,T., Perski,H.J., Takahashi,H., Noda,M. and Numa,S. TITLE Cloning and sequence analysis of human genomic DNA encoding gamma subunit precursor of muscle acetylcholine receptor JOURNAL Eur. J. Biochem. 146, 15-22 (1985) STANDARD full staff_review COMMENT EMBL features not translated to GenBank features: key from to description PRM 196 200 pot. TATA-box FEATURES from to/span description pept 281 335 acetylcholine receptor gamma subunit prepeptide, exon 1 /nomgen="CHRNG" /map="2q32-qter" /hgml_locus_uid="LJ0139Q" 525 + 664 acetylcholine receptor gamma subunit prepeptide, exon 2 sigp 281 335 acetylcholine receptor gamma subunit, signal peptide 525 535 acetylcholine receptor gamma subunit, signal peptide matp 536 + 664 acetylcholine receptor gamma subunit IVS 336 524 ACHR intron A IVS 665 > 709 ACHR intron B BASE COUNT 121 A 230 C 227 G 131 T ORIGIN 1 bp upstream of AvaI site. 1 CCCGAGTTGG GGCTGGAAGG TGGATGTCTC TTTTGGGGCA GACGGGGCCC CTGTCTCCCC 61 TCTCCAGCCC AGGTAACCTG AGCCCAGCAT TGTGTCCATC CTGGAACAGC TGACAACGCT 121 GTGGTCAGAC AGCTGGTGGG GCTGGGCCAG GCTGGCCGGG CTGGCTGGGC TGGCTGGGGT 181 GGGAGTGTAG GCTGTTATAT GACACCCAGA GCCCATCTCT CTCTGCCCCA GACCTTGGAG 241 CTGTTGTCCC ACCCCTGTCA CTGCAGAGAG CTGAGGCACC ATGCATGGGG GCCAGGGGCC 301 GCTGCTCCTC CTGCTGCTGC TGGCTGTCTG CCTGGGTGGG ACACAAAGGA ATCTCAGCCT 361 GGGGAGTCCC AGAGCTGGGG TCCACAGCCT CAGGGGATGG AGGGTCTGAG GGGTATTGGG 421 GCCTGCCCTG GACCCAGTTC CCTGAGTCCC CACTTCACAC CCCCAGGGCC TCCCCGCTCT 481 TTCCACCTCC AAGCTCCTGC TAGGCTCACG CCTGTCTATT GCAGGGGCCC AGGGCCGGAA 541 CCAGGAGGAG CGCCTGCTCG CAGACCTGAT GCAAAACTAC GACCCCAACC TGCGGCCCGC 601 GGAACGAGAC TCGGATGTGG TCAATGTCAG CCTGAAGCTA ACCCTCACCA ACCTCATCTC 661 CCTGGTAAGC CGCAGGACGG AGGAGGGGTC AGCGCACCAC GCCCTGGGA // LOCUS HUMACHRG2 480 bp ds-DNA PRI 16-JUN-1986 DEFINITION Human acetylcholine receptor gamma subunit gene, exons 3 and 4. ACCESSION X01716 KEYWORDS acetylcholine receptor. SEGMENT 2 of 8 SOURCE Human fetal liver DNA (library of T.Maniatis), clones lambda-hACR-gamma-[3,12] [1]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 480) AUTHORS Shibahara,S., Kubo,T., Perski,H.J., Takahashi,H., Noda,M. and Numa,S. TITLE Cloning and sequence analysis of human genomic DNA encoding gamma subunit precursor of muscle acetylcholine receptor JOURNAL Eur. J. Biochem. 146, 15-22 (1985) STANDARD full staff_review FEATURES from to/span description pept + 101 145 acetylcholine receptor gamma subunit precursor, exon 3 /nomgen="CHRNG" /map="2q32-qter" /hgml_locus_uid="LJ0139Q" 322 + 431 acetylcholine receptor gamma subunit precursor, exon 4 matp + 101 145 acetylcholine receptor gamma subunit 322 + 431 acetylcholine receptor gamma subunit IVS < 1 100 ACHR intron B IVS 146 321 ACHR intron C IVS 432 > 480 ACHR intron D BASE COUNT 88 A 147 C 159 G 86 T ORIGIN About 250 bp after segment 1. 1 TATCAAGAGG CTGGGGGATG CTTGGCCCCA TTGGTGGCCT GTGGGGACTG GCACTGAAGT 61 CGGGGGCTGA GCCCTCCATA CTACACCCTT GCACCCCCAG AACGAGCGAG AGGAAGCCCT 121 CACCACCAAT GTCTGGATAG AGATGGTAAG AGGCCACCCT GCCACCCTCC TTCCATCAGG 181 GGTCCCACCC CACCACCCCA AGGCCTCCTG AGAGTTGCCT GCCCCGTTCC TGCCTCTTCT 241 GTCCTCTTGG GCTGGATGCC CACTCCTAGG GCTGTGGTGC AGCAGAGGGC AGAGGCCTAG 301 CAACTGCCCC TCCCCCTGCA GCAGTGGTGC GACTATCGCC TGCGCTGGGA TCCGCGAGAC 361 TACGAAGGCC TGTGGGTGCT GAGGGTGCCG TCCACCATGG TGTGGCGGCC GGATATCGTG 421 CTGGAGAACA AGTGAGGAGG GGGTGCAGGC AGGGGTGTGG GGGACAAAGG ACACAGGGTC // LOCUS HUMACHRG3 240 bp ds-DNA PRI 16-JUN-1986 DEFINITION Human acetylcholine receptor gamma subunit gene, exon 5. ACCESSION X01717 KEYWORDS acetylcholine receptor. SEGMENT 3 of 8 SOURCE Human fetal liver DNA (library of T.Maniatis), clones lambda-hACR-gamma-[3,12] [1]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 240) AUTHORS Shibahara,S., Kubo,T., Perski,H.J., Takahashi,H., Noda,M. and Numa,S. TITLE Cloning and sequence analysis of human genomic DNA encoding gamma subunit precursor of muscle acetylcholine receptor JOURNAL Eur. J. Biochem. 146, 15-22 (1985) STANDARD full staff_review FEATURES from to/span description pept + 42 + 197 acetylcholine receptor gamma subunit precursor, exon 5 /nomgen="CHRNG" /map="2q32-qter" /hgml_locus_uid="LJ0139Q" matp + 42 + 197 acetylcholine receptor gamma subunit IVS < 1 41 ACHR intron D IVS 198 > 240 ACHR intron E BASE COUNT 36 A 77 C 61 G 66 T ORIGIN About 625 bp after segment 2. 1 GAGCCCACAG CCTCGTGGCC TGGCCTGTTC TGTGCATACA GCGTGGACGG TGTCTTCGAG 61 GTGGCCCTCT ACTGCAATGT GCTCGTGTCC CCTGACGGCT GTATCTACTG GCTGCCGCCT 121 GCCATCTTCC GTTCCGCCTG CTCTATCTCA GTCACCTACT TCCCCTTCGA CTGGCAGAAC 181 TGCTCCCTTA TCTTCCAGTG AGGCCATTTA TTGGGGAGGA TTAAGAGAGC TGCTCTCAGA // LOCUS HUMACHRG4 435 bp ds-DNA PRI 16-JUN-1986 DEFINITION Human acetylcholine receptor gamma subunit gene, exon 6. ACCESSION X01718 KEYWORDS acetylcholine receptor. SEGMENT 4 of 8 SOURCE Human fetal liver DNA (library of T.Maniatis), clones lambda-hACR-gamma-[3,12] [1]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 435) AUTHORS Shibahara,S., Kubo,T., Perski,H.J., Takahashi,H., Noda,M. and Numa,S. TITLE Cloning and sequence analysis of human genomic DNA encoding gamma subunit precursor of muscle acetylcholine receptor JOURNAL Eur. J. Biochem. 146, 15-22 (1985) STANDARD full staff_review FEATURES from to/span description pept + 174 + 271 acetylcholine receptor gamma subunit precursor, exon 6 /nomgen="CHRNG" /map="2q32-qter" /hgml_locus_uid="LJ0139Q" matp + 174 + 271 acetylcholine receptor gamma subunit IVS < 1 173 ACHR intron E IVS 272 > 435 ACHR intron F BASE COUNT 111 A 130 C 97 G 97 T ORIGIN About 875 bp after segment 3. 1 TAGGTCCTCC TGCTCTCCAT ATCTCGCCAG TGGGGTTTGA TAGAGAACTC AGAAGCGTGG 61 GGCTGCATTT TGTTGAAGAA AAGCTGCCCA CACTTGTCCC CAGAAGGTCA TCCCCATGCA 121 GTCGTGGCAG GTCCACCCGC TCACATTTAG CCTCTTTCCT TGGTGACTCC CAGGTCCCAG 181 ACTTACAGCA CCAATGAGAT TGATCTGCAG CTGAGTCAGG AAGATGGCCA GACCATCGAG 241 TGGATTTTCA TTGACCCTGA GGCCTTCACA GGTAACCCCC ACCCAAGGGC TCCCCAGGCA 301 GCCTCATCCA GGGCTCCTGC TGGACCCAGC TGTGGTCAAG GCTGGACCAA GGTCAAATCC 361 CTCCCATGTA ACTCAAAATG AAAACTACAG CAAACCATAA AATATGCTTT TTAAAACGTC 421 CAACAAAGCT CTGAC // LOCUS HUMACHRG5 714 bp ds-DNA PRI 16-JUN-1986 DEFINITION Human acetylcholine receptor gamma subunit gene, exons 7 and 8. ACCESSION X01719 KEYWORDS acetylcholine receptor. SEGMENT 5 of 8 SOURCE Human fetal liver DNA (library of T.Maniatis), clones lambda-hACR-gamma-[3,12] [1]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 714) AUTHORS Shibahara,S., Kubo,T., Perski,H.J., Takahashi,H., Noda,M. and Numa,S. TITLE Cloning and sequence analysis of human genomic DNA encoding gamma subunit precursor of muscle acetylcholine receptor JOURNAL Eur. J. Biochem. 146, 15-22 (1985) STANDARD full staff_review FEATURES from to/span description pept + 155 355 acetylcholine receptor gamma subunit precursor, exon 7 /nomgen="CHRNG" /map="2q32-qter" /hgml_locus_uid="LJ0139Q" 548 + 662 acetylcholine receptor gamma subunit precursor, exon 8 matp + 155 355 acetylcholine receptor gamma subunit 548 + 662 acetylcholine receptor gamma subunit IVS < 1 154 ACHR intron F IVS 356 547 ACHR intron G IVS 663 > 714 ACHR intron H BASE COUNT 124 A 242 C 199 G 149 T ORIGIN About 300 bp after segment 4. 1 TCCAATCTCC CACCCCACTT CTGTCCCTCA AGGGTGCCTC CCCTGCTGGT GCTCCTTAGG 61 GCACATGCTG CCCTTGCACC TGGGTCACTC GGCTGCAGGG ATCTGCCTAG CTCACGCTTC 121 TTGTGCCCAC TCCTGCCTGC CTGCCTGCCC GCAGAGAATG GGGAGTGGGC CATCCAGCAC 181 CGACCAGCCA AGATGCTCCT GGACCCAGCG GCGCCAGCCC AGGAAGCAGG CCACCAGAAG 241 GTGGTGTTCT ACCTGCTCAT CCAGCGCAAG CCCCTCTTCT ACGTCATCAA CATCATCGCC 301 CCCTGTGTGC TCATCTCCTC TGTCGCCATC CTCATCCACT TCCTTCCTGC CAAGGGTACC 361 TGGAGCCTAT GGGAAGGAGC CATCCAGTAG CACAGGGGAC ACCTGGGAGG CCGGGGTGGG 421 CCCTGCCTGG GGAACAGAGT GGCATTACGA CCCAGGACAG AGGCAGCGGG CTACTTCTGG 481 GGTAAGGGGT TCCTCTGTGG GTGGGGGAGG TAGGAACCTG CTCTGAGAGC CTCTCGGTCA 541 TGGATAGCTG GGGGCCAGAA GTGTACCGTC GCCATCAACG TGCTCCTGGC CCAGACTGTC 601 TTCCTCTTCC TTCTGGCCAA GAAGGTGCCT GAAACCTCCC AGGCGGTGCC ACTCATCAGC 661 AAGTAAGGCT GGTCTTCATG TCCACCCGCC TATGCCACTC TCCCTTCTTG GGAG // LOCUS HUMACHRG6 344 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human acetylcholine receptor gamma subunit gene, exon 9. ACCESSION X01720 KEYWORDS acetylcholine receptor. SEGMENT 6 of 8 SOURCE Human fetal liver DNA (library of T.Maniatis), clones lambda-hACR-gamma-[3,12] [1]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 344) AUTHORS Shibahara,S., Kubo,T., Perski,H.J., Takahashi,H., Noda,M. and Numa,S. TITLE Cloning and sequence analysis of human genomic DNA encoding gamma subunit precursor of muscle acetylcholine receptor JOURNAL Eur. J. Biochem. 146, 15-22 (1985) STANDARD full staff_review FEATURES from to/span description pept + 60 + 174 acetylcholine receptor gamma subunit precursor, exon 9 /nomgen="CHRNG" /map="2q32-qter" /hgml_locus_uid="LJ0139Q" matp + 60 + 174 acetylcholine receptor gamma subunit IVS < 1 59 ACHR intron H IVS 175 > 344 ACHR intron I (no splice consensus at 175) BASE COUNT 67 A 123 C 80 G 74 T ORIGIN About 125 bp after segment 5. 1 AAGGGTGGGG GTGGCATCAT GGTATGGGCT GCCAGCTCCT GCCCACCCCA CCCTGACAGG 61 TACCTGACCT TCCTCCTGGT GGTGACCATC CTCATTGTCG TGAATGCTGT GGTTGTGCTC 121 AATGTCTCCT TGCGGTCTCC ACACACACAC TCCATGGCCC GAGGGGTCCG CAAGGCAAGG 181 ACCCTCCCTG CCCACTTCAA CATCCCGCTG CCCACTCCCC TACGCCTCCC TCTCGCACGC 241 CCCGGCAGTA CTCACCTGTG GCATTCCACA GCACACCCAT CCTGGGCGTA TCTGGACGCA 301 TGGACCAAAA TCGATTACAG TAATACAGGA ATGAAATTGC TTCC // LOCUS HUMACHRG7 690 bp ds-DNA PRI 16-JUN-1986 DEFINITION Human acetylcholine receptor gamma subunit gene, exons 10 and 11. ACCESSION N00019 X01721 KEYWORDS acetylcholine receptor. SEGMENT 7 of 8 SOURCE Human fetal liver DNA (library of T.Maniatis), clones lambda-hACR-gamma-[3,12] [1]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 690) AUTHORS Shibahara,S., Kubo,T., Perski,H.J., Takahashi,H., Noda,M. and Numa,S. TITLE Cloning and sequence analysis of human genomic DNA encoding gamma subunit precursor of muscle acetylcholine receptor JOURNAL Eur. J. Biochem. 146, 15-22 (1985) STANDARD full staff_review FEATURES from to/span description pept + 84 297 acetylcholine receptor gamma subunit precursor, exon 10 /nomgen="CHRNG" /map="2q32-qter" /hgml_locus_uid="LJ0139Q" 489 + 619 acetylcholine receptor gamma subunit precursor, exon 11 matp + 84 297 acetylcholine receptor gamma subunit 489 + 619 acetylcholine receptor gamma subunit IVS < 1 83 ACHR intron I IVS 298 488 ACHR intron J IVS 620 > 690 ACHR intron K BASE COUNT 133 A 206 C 216 G 135 T ORIGIN About 650 bp after segment 6. 1 AGCAGCACAA GCCCTTCACA CCAACCTCTG GCTTCTGCTC TGAAGCTCGG CCTGCTGCCC 61 TAGTGAAGCC ACCCCCTCTC TAGGTGTTCC TGAGGCTCTT GCCCCAGCTG CTGAGGATGC 121 ACGTTCGCCC GCTGGCCCCG GCAGCTGTGC AGGACACCCA GTCCCGGCTA CAGAATGGCT 181 CCTCGGGATG GTCGATCACA ACTGGGGAGG AGGTGGCCCT CTGCCTGCCT CGCAGTGAAC 241 TCCTCTTCCA GCAGTGGCAG CGGCAAGGGC TGGTGGCGGC AGCGCTGGAG AAGCTAGGTG 301 AGACACACCA GGTGTGCCTG GGGACAGTCC TCCCCTGGGA CCCCAGCTGG GGAGCCAGGC 361 ACAGCAGATG AGTGCTGGAG AAGTGCCCAG GTCAGGGAGA GAGGAGCTGG GGTCCCTAAG 421 GAGAGGCCAT CTTCTCTGCC TGTTTCTCCT CCATTCTACT CCCAAACCTT ACCCTTTCTC 481 TTTATCAGAG AAAGGCCCGG AGTTAGGGCT GAGCCAGTTC TGTGGCAGCC TGAAGCAGGC 541 TGCCCCAGCC ATCCAGGCCT GTGTGGAAGC CTGCAACCTC ATTGCCTGTG CCCGGCACCA 601 GCAGAGTCAC TTTGACAATG TAAGCTGAGT CAGGGTGGGG TGGAGGTGGA GTGAGTACCT 661 GGGCTTGGAA CCGTGATAGA GACAGGATGA // LOCUS HUMACHRG8 703 bp ds-DNA PRI 16-JUN-1986 DEFINITION Human acetylcholine receptor gamma subunit gene, exon 12. ACCESSION M11811 X01721 KEYWORDS acetylcholine receptor. SEGMENT 8 of 8 SOURCE Human fetal liver DNA (library of T.Maniatis), clones lambda-hACR-gamma-[3,12] [1]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 703) AUTHORS Shibahara,S., Kubo,T., Perski,H.J., Takahashi,H., Noda,M. and Numa,S. TITLE Cloning and sequence analysis of human genomic DNA encoding gamma subunit precursor of muscle acetylcholine receptor JOURNAL Eur. J. Biochem. 146, 15-22 (1985) STANDARD full staff_review FEATURES from to/span description pept + 101 274 acetylcholine receptor gamma subunit precursor, exon 12 /nomgen="CHRNG" /map="2q32-qter" /hgml_locus_uid="LJ0139Q" matp + 101 271 acetylcholine receptor gamma subunit IVS < 1 100 ACHR intron K BASE COUNT 145 A 211 C 196 G 151 T ORIGIN About 600 bp after segment 7. 1 AATGGAGACA TGGGCCTGCT GGAAGCCCAA GGATGAGAAC AGGACCCAGG GAAGACCTGG 61 TGCCGCCGCT GGTTATCCCA CACCTGCCTC CCACCCTCAG GGGAATGAGG AGTGGTTCCT 121 GGTGGGCCGA GTGCTGGACC GCGTCTGCTT CCTGGCCATG CTCTCGCTCT TCATCTGTGG 181 CACAGCTGGC ATCTTCCTCA TGGCCCACTA CAACCGGGTG CCGGCCCTGC CATTCCCTGG 241 AGATCCACGC CCCTACCTGC CCTCACCAGA CTGAGCCAAC CAACCACTGT GGGGCATGTG 301 GGAGTCACAC ACGTGGGTCA CACTGAGTCT TATCAGCCAC GTTCTCCTAC TGAGGTCCTA 361 AGTGTGCTCT TTGGGAAGTG CCCTTCAGGA CTGTGTGAGC CAAACAGCCC TGAGAAAAGC 421 TGGGGAAACA GTCTGAGCTG GAGTCCGAGA GTGGTTGGGG GTGGGCCGTG GCTAGTGTCC 481 TGCTGCAGTC AGCACACACG TGGGATTGGC TAGCTCATCC TGGCACCAGC CACCCCTCCA 541 CTCAGTGCAC TCCCCTCACT TAGGCAAAGC ATTATTCATT CCCATCAGTC TGAAGCCCGA 601 AGGACTGTTT TGTATAATAC CTTCGGACTT GGGACTGGCT CCCCTTTTAC AAGTTCTCCC 661 TGAAAGAGGG CAGTCACAAG AGGTGTGAAG AGTAGCAGCC GAT // LOCUS HUMACHRM2 2210 bp ds-DNA PRI 15-JUN-1988 DEFINITION Human m2 muscarinic acetylcholine receptor gene. ACCESSION M16404 KEYWORDS acetylcholine receptor; m2 muscarinic acetylcholine receptor; neurotransmitter. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2210) AUTHORS Bonner,T.I., Buckley,N.J., Young,A.C. and Brann,M.R. TITLE Identification of a family of muscarinic acetylcholine receptor genes JOURNAL Science 237, 527-532 (1987) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence in [1] kindly provided by T.I.Bonner, 17-JUL-1987. FEATURES from to/span description pept 195 1595 muscarinic acetylcholine receptor m2 /nomgen="CHRM2" /map="UNASSIGNED" /hgml_locus_uid="LL0134T" pre-msg < 1 2171 ACHR-m2 pre-mRNA IVS < 1 148 ACHR-m2 intron A BASE COUNT 657 A 467 C 474 G 612 T ORIGIN 19 bp upstream of AccI site. 1 ACATGGGAAT TAGGCAGGTA GACACAGTAA TCATGCAGGG GAAGGGAGAT TTGGGAGAAA 61 ATAATGTGGT TTAAAAGGAG AAACAACATT ATGTATTTTA AACCAATGTT TATATTATGT 121 TTGTTAATTT TATTCTATTT CCTTGCAGGT TTAAATGTTT ATTTGCTACT TGGCTACTGA 181 TTAGAGAACG CAAAATGAAT AACTCAACAA ACTCCTCTAA CAATAGCCTG GCTCTTACAA 241 GTCCTTATAA GACATTTGAA GTGGTGTTTA TTGTCCTGGT GGCTGGATCC CTCAGTTTGG 301 TGACCATTAT CGGGAACATC CTAGTCATGG TTTCCATTAA AGTCAACCGC CACCTCCAGA 361 CCGTCAACAA TTACTTTTTA TTCAGCTTGG CCTGTGCTGA CCTTATCATA GGTGTTTTCT 421 CCATGAACTT GTACACCCTC TACACTGTGA TTGGTTACTG GCCTTTGGGA CCTGTGGTGT 481 GTGACCTTTG GCTAGCCCTG GACTATGTGG TCAGCAATGC CTCAGTTATG AATCTGCTCA 541 TCATCAGCTT TGACAGGTAC TTCTGTGTCA CAAAACCTCT GACCTACCCA GTCAAGCGGA 601 CCACAAAAAT GGCAGGTATG ATGATTGCAG CTGCCTGGGT CCTCTCTTTC ATCCTCTGGG 661 CTCCAGCCAT TCTCTTCTGG CAGTTCATTG TAGGGGTGAG AACTGTGGAG GATGGGGAGT 721 GCTACATTCA GTTTTTTTCC AATGCTGCTG TCACCTTTGG TACGGCTATT GCAGCCTTCT 781 ATTTGCCAGT GATCATCATG ACTGTGCTAT ATTGGCACAT ATCCCGAGCC AGCAAGAGCA 841 GGATAAAGAA GGACAAGAAG GAGCCTGTTG CCAACCAAGA CCCCGTTTCT CCAAGTCTGG 901 TACAAGGAAG GATAGTGAAG CCAAACAATA ACAACATGCC CAGCAGTGAC GATGGCCTGG 961 AGCACAACAA AATCCAGAAT GGCAAAGCCC CCAGGGATCC TGTGACTGAA AACTGTGTTC 1021 AGGGAGAGGA GAAGGAGAGC TCCAATGACT CCACCTCAGT CAGTGCTGTT GCCTCTAATA 1081 TGAGAGATGA TGAAATAACC CAGGATGAAA ACACAGTTTC CACTTCCCTG GGCCATTCCA 1141 AAGATGAGAA CTCTAAGCAA ACATGCATCA GAATTGGCAC CAAGACCCCA AAAAGTGACT 1201 CATGTACCCC AACTAATACC ACCGTGGAGG TAGTGGGGTC TTCAGGTCAG AATGGAGATG 1261 AAAAGCAGAA TATTGTAGCC CGCAAGATTG TGAAGATGAC TAAGCAGCCT GCAAAAAAGA 1321 AGCCTCCTCC TTCCCGGGAA AAGAAAGTCA CCAGGACAAT CTTGGCTATT CTGTTGGCTT 1381 TCATCATCAC TTGGGCCCCA TACAATGTCA TGGTGCTCAT TAACACCTTT TGTGCACCTT 1441 GCATCCCCAA CACTGTGTGG ACAATTGGTT ACTGGCTTTG TTACATCAAC AGCACTATCA 1501 ACCCTGCCTG CTATGCACTT TGCAATGCCA CCTTCAAGAA GACCTTTAAA CACCTTCTCA 1561 TGTGTCATTA TAAGAACATA GGCGCTACAA GGTAAAATAT CTTTGAAAAA GATAGAAGGT 1621 GGGCAAGGGG AGCTTGAGAA GAATAAAAGG GATAAACGAG CTCCTAGTTT TAAAATCTCT 1681 GCCATTGCAC TTTATAGTCT GATTACAAAA CGTGCAATTC AGGAGCCCAG CAGTGACACA 1741 CTTATCACGC CTAGGCTCCA GTTTGCAAAA ATTGCACCTT ATAAACTGTC AGTATTAGGA 1801 GCAATGAGAC AATGAAAGAA ACATGTTGGG ATCGTGGATT TAAGAAACTA TACACTGTTT 1861 CTCATAATCT CTTGAAGAAG GGCTTCTGAT TCTACAATTT TATCAGTCTC TGCACAAGAG 1921 GAATAACCTT GTTCCTTTTT TGTTACTTTT GTTGTTGTTG TTCTCATGTG TCCTTAAGAG 1981 AAGGAATGCC ACAGTTACAA GGTAAACATG GAGACTTAAA CATAAAGAAA TAGGCACTAT 2041 ACAATGGGGA CATAAAAAAA GAAAATGAAA GAAGGATGCA GAAATTTGTC TCCGGAGTGT 2101 TAAGCATATT TTATTCTTTT GTTACGGTCC TATTTAGAGG ATTGGAATGT AATAAATGCT 2161 TATTTTTTGC CTTTCTTTTT CCCACCATGA AGAGAAAGCA AACAAACAGA // LOCUS HUMACHRM4 2595 bp ds-DNA PRI 15-JUN-1988 DEFINITION Human m4 muscarinic acetylcholine receptor gene. ACCESSION M16405 KEYWORDS acetylcholine receptor; muscarinic acetylcholine receptor; neurotransmitter. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2595) AUTHORS Bonner,T.I., Buckley,N.J., Young,A.C. and Brann,M.R. TITLE Identification of a family of muscarinic acetylcholine receptor genes JOURNAL Science 237, 527-532 (1987) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence in [1] kindly provided by T.I.Bonner, 17-JUL-1987. FEATURES from to/span description pept 801 2237 muscarinic acetylcholine receptor m4 /nomgen="CHRM4" /map="UNASSIGNED" pre-msg < 1 2595 ACHR-m4 pre-mRNA IVS < 1 771 ACHR-m4 intron A BASE COUNT 528 A 839 C 674 G 552 T 2 others ORIGIN 1 bp upstream of XbaI site. 1 TCTAGACCAC CAGCCTGGAC AACATACCAA GACCCTGTCT CTACAAATAA ATAGATAAAT 61 AAATAGACAC TTTTTTTAAG TGTCAAAAGT GCTTGGCACT TAGTAGACCA TCAGTGTTAG 121 GTGCTCATAC ATACCCCGAT TATTGCCTTG TCCCAGTGTC TTGTACAGGG GTTGGAGAGN 181 AGGTGTTAAG AAATGACCGA ATGGGTAAAT GGATGAACAG AACACCTCCC TCCAGAGCCC 241 ACATGCTCGT GGGCCTCTGG GACCACTCTC CTCCTCCTCT TGCTTCCCTG AGCTCCCCCA 301 GCATGGCCTC TGTCCAGGCC TTGCGCTGCC TCCAGGCCTT TGCTGTGGCT ACTGCCCCTG 361 GAGCGCCATN TCCACAGCTC CTCCTGTGGC TGGCTCCTCA TCACCCAGAT GACCTGGTGG 421 GTGAGGCCAC CTAGCAAGGA GTCATGCCTG TCCTGCCTTC TGACTCACTC TCTCATCACC 481 CTGCCTTTTT TTTCTTTTGT GGCTCACGTG TTTGCATGTC TCCCCCCATG AGGCAGGGGG 541 CCATGTGTGT CTTATTCACT TCTGTAGCCA CAGCACCCTG AGCAATGCTT GCCACATAGT 601 AGGTGCTCAA TTAATGTTGA ATGAATGGGC AAAATGCGGG ATGGCGGGAC AGAGTTCTCT 661 CAAGGCATTC TGCCAGAGAA TGTCCCTCTG TCACCTTGAA TCCAGTGTAC CTCCAGATGA 721 CTCCCCCATT CCCTCCTGTA GTTCATGCTT TTCTCTCCCC TTCCTCCCCA GACACGGCCT 781 ACCCACCCCT GGCAACCAAC ATGGCCAACT TCACACCTGT CAATGGCAGC TCGGGCAATC 841 AGTCCGTGCG CCTGGTCACG TCATCATCCC ACAATCGCTA TGAGACGGTG GAAATGGTCT 901 TCATTGCCAC AGTGACAGGC TCCCTGAGCC TGGTGACTGT CGTGGGCAAC ATCCTGGTGA 961 TGCTGTCCAT CAAGGTCAAC AGGCAGCTGC AGACAGTCAA CAACTACTTC CTCTTCAGCC 1021 TGGCGTGTGC TGATCTCATC ATAGGCGCCT TCTCCATGAA CCTCTACACC GTGTACATCA 1081 TCAAGGGCTA CTGGCCCCTG GGCGCCGTGG TCTGCGACCT GTGGCTGGCC CTGGACTACG 1141 TGGTGAGCAA CGCCTCCGTC ATGAACCTTC TCATCATCAG CTTTGACCGC TACTTCTGCG 1201 TCACCAAGCC TCTCACCTAC CCTGCCCGGC GCACCACCAA GATGGCAGGC CTCATGATTG 1261 CTGCTGCCTG GGTACTGTCC TTCGTGCTCT GGGCGCCTGC CATCTTGTTC TGGCAGTTTG 1321 TGGTGGGTAA GCGGACGGTG CCCGACAACC ACTGCTTCAT CCAGTTCCTG TCCAACCCAG 1381 CAGTGACCTT TGGCACAGCC ATTGCTGCCT TCTACCTGCC TGTGGTCATC ATGACGGTGC 1441 TGTACATCCA CATCTCCCTG GCCAGTCGCA GCCGAGTCCA CAAGCACCGG CCCGAGGGCC 1501 CGAAGGAGAA GAAAGCCAAG ACGCTGGCCT TCCTCAAGAG CCCACTAATG AAGCAGAGCG 1561 TCAAGAAGCC CCGCCCGGGA GGCCGCCCGG GAGGACTGCG CAATGGCAAG CTGGAGGAGG 1621 CCCCCCCGCC AGCGCTGCCA CCGCCACCGC GCCCCGTGGC TGATAAGGAC ACTTCCAATG 1681 AGTCCAGCTC AGGCAGTGCC ACCCAGAACA CCAAGGAACG CCCAGCCACA GAGCTGTCCA 1741 CCACAGAGGC CACCACTCCC GCCATGCCCG CCCCTCCCCT GCAGCCGCGG GCCCTCAACC 1801 CAGCCTCCAG ATGGTCCAAG ATCCAGATTG TGACGAAGCA GACAGGCAAT GAGTGTGTGA 1861 CAGCCATTGA GATTGTGCCT GCCACGCCGG CTGGCATGCG CCCTGCGGCC AACGTGGCCC 1921 GCAAGTTCGC CAGCATCGCT CGCAACCAGG TGCGCAAGAA GCGGCAGATG GCGGCCCGGG 1981 AGCGCAAAGT GACACGAACG ATCTTTGCCA TTCTGCTAGC CTTCATCCTC ACCTGGACGC 2041 CCTACAACGT CATGGTCCTG GTGAACACCT TCTGCCAGAG CTGCATCCCT GACACGGTGT 2101 GGTCCATTGG CTACTGGCTC TGCTACGTCA ACAGCACCAT CAACCCTGCC TGCTATGCTC 2161 TGTGCAACGC CACCTTTAAA AAGACCTTCC GGCACCTGCT GCTGTGCCAG TATCGGAACA 2221 TCGGCACTGC CAGGTAGGCA GGCAGGAGTG CCCTAGGAGG TGCGGTGTGC GTGCGTGTGC 2281 TGGGGGACCA CACGGCTCAC TTGCTGTGGG GAAGAGTGCA GGCACCATTC TGCGTTCACG 2341 TTTGCTGAGG AGGAAGTTCA GAAGAGGCTC TGTGGCTGCA TTCAGAGACC AGATCTCTGC 2401 TCACCCGTGA GGAGGCTCAC CCCAGGGAGT GTCTGAACTG GGGCTGCCTG GCCCACCTCT 2461 GTGGCCCTGC TTCAGCGAGC TGCGGGGCAC TGGCCTGGGT GGGCACCTGC CCACTGTGAC 2521 CAACCATCAG CAGTGCTGGA AGAATGGAGA TCTGGATGGG GGCCGAAGCC CAGGGCCCCC 2581 TCAGGAAGAA CAAAG // LOCUS HUMACTCA1 232 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human alpha-cardiac actin gene, 5' flank and exon 1. ACCESSION J00070 KEYWORDS alpha-cardiac actin. SEGMENT 1 of 4 SOURCE Human: DNA (beta-thalassemic library of Fritsch et al.), clone lambda-HA-25 [1]; skeletal muscle, cDNA to mRNA, clone pHMCA-1 [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 232) AUTHORS Hamada,H., Petrino,M.G. and Kakunaga,T. TITLE Molecular structure and evolutionary origin of human cardiac muscle actin gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 79, 5901-5905 (1982) STANDARD full staff_review REFERENCE 2 (bases 104 to 158) AUTHORS Gunning,P., Ponte,P., Blau,H. and Kedes,L. TITLE alpha-skeletal and alpha-cardiac actin genes are coexpressed in adult human skeletal muscle and heart JOURNAL Mol. Cell. Biol. 3, 1985-1995 (1983) STANDARD full staff_review COMMENT [1] provides the following summary. There appear to be six isoforms of actin in mammals: cytoplasmic beta- and gamma-actin, co-expressed in non-muscle cells; two smooth muscle actins, co-expressed in smooth muscle (in varying ratios from cell type to cell type), but not detected elsewhere; and alpha-cardiac and alpha-skeletal actin, co-expressed in human adult striated muscle cells (though in varying ratios -- alpha-cardiac actin predominates in cardiac muscle, alpha-skeletal actin predominates in skeletal muscle). The initial Met-Cys of the alpha-cardiac coding sequence is post-translationally removed. The complete coding sequence consists of six exons; all intron boundaries obey the "gt" and "ag" consensus intron boundary rules. FEATURES from to/span description pept 30 + 158 alpha-cardiac actin propeptide, exon 1 /nomgen="ACTC" /map="15q11-qter" /hgml_locus_uid="LB0038B" matp 36 + 158 alpha-cardiac actin mature peptide pre-msg < 30 > 232 actin mRNA [2] IVS 159 > 232 actin intron I unsure 140 140 n in [2] BASE COUNT 39 A 85 C 69 G 39 T ORIGIN Chromosome 15q11-qter; 5 bp upstream of PstI site. 1 CTGCAGAAAC CCCCTGAAGC TGTGCCAAGA TGTGTGACGA CGAGGAGACC ACCGCCCTGG 61 TGTGCGACAA CGGCTCTGGG CTGGTGAAGG CCGGCTTTGC GGGCGATGAC GCGCCCCGCG 121 CTGTCTTCCC GTCCATCGTG GGCCGCCCGC GGCACCAGGT AAACTTCCCG CCGAGCCCCC 181 CGTCCCACTC GGGACCCCTT CAGTCCAGCG ATCTAGGAAA TGGCTCTCAC CT // LOCUS HUMACTCA2 1846 bp ds-DNA PRI 15-JUN-1989 DEFINITION human alpha-cardiac actin gene, exons 2, 3, and 4. ACCESSION J00071 KEYWORDS actin; alpha-cardiac actin; cardiac actin. SEGMENT 2 of 4 SOURCE human (homo sapiens): beta-thalassemic genomic library (fritsch et al.), in clone lambda-ha-25 [1]; cdna to skeletal muscle mrna (clone phmca-1) [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1846) AUTHORS Hamada,H., Petrino,M.G. and Kakunaga,T. TITLE molecular structure and evolutionary origin of human cardiac muscle actin gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 79, 5901-5905 (1982) STANDARD full staff_review REFERENCE 2 (bases 110 to 434; 1105 to 1266; 1393 to 1584) AUTHORS Gunning,P., Ponte,P., Blau,H. and Kedes,L. TITLE alpha-skeletal and alpha-cardiac actin genes are coexpressed in adult human skeletal muscle and heart JOURNAL Mol. Cell. Biol. 3, 1985-1995 (1983) STANDARD full staff_review COMMENT see segment 1 comment. FEATURES from to/span description pept + 110 434 alpha-cardiac actin propeptide (exon 2) /nomgen="ACTC" /map="15q11-qter" /hgml_locus_uid="LB0038B" 1105 1266 alpha-cardiac actin propeptide (exon 3) 1393 + 1584 alpha-cardiac actin propeptide (exon 4) matp + 110 434 alpha-cardiac actin mature peptide (exon 2) 1105 1266 alpha-cardiac actin mature peptide (exon 3) 1393 + 1584 alpha-cardiac actin mature peptide (exon 4) pre-msg < 1 > 1846 actin mRNA [2] IVS < 1 109 actin intron i IVS 435 1104 actin intron ii IVS 1267 1392 actin intron iii IVS 1585 > 1846 actin intron iv conflict 274 274 t in [1]; g in [2] conflict 1488 1490 ttg in [1]; cta in [2] conflict 1532 1532 c in [1]; t in [2] BASE COUNT 460 A 431 C 440 G 515 T ORIGIN About 1 kb after segment 1; chromosome 15q11-qter. 1 GGTATTTAAA TATGTTCCTT GACTTGGGCA GTTAGATATA AATGGACAAG ACACTGATTA 61 TATTCCTGAC ATGGTGAGAG CATGATTTTC TCATTTTTTC TTCTCATAGG GAGTTATGGT 121 GGGTATGGGT CAGAAGGACT CCTACGTAGG TGATGAAGCC CAGAGCAAGA GAGGCATCCT 181 GACCCTGAAG TATCCCATCG AGCATGGTAT CATCACCAAC TGGGACGACA TGGAGAAGAT 241 CTGGCACCAC ACCTTCTACA ATGAGCTCCG TGTTGCTCCC GAGGAGCACC CCACCCTGCT 301 CACAGAGGCC CCGCTGAACC CCAAGGCCAA CCGGGAGAAG ATGACTCAGA TCATGTTTGA 361 GACCTTCAAT GTCCCTGCCA TGTACGTGGC CATCCAGGCA GTGCTATCCC TGTATGCTTC 421 TGGCCGTACC ACAGGTATGC TGGGCTCTGG GGACAGTTAC TGATGAATCA CATTCCCAAG 481 TCACCGACCT TGCTGTGAAT CAGATCCCCC AGTTGAAAAA GGGATAATCC CTTTCCTCCC 541 ATTCCCTAGC AAGGTCTGTG CTAAGAGAAA GAGTTAACGG TAGTGCCCTG AGGTTAGTTT 601 CGGAGCACAA TTATTATTGT TGAGCTGATA GCTTGTGGAG GTGGGCCTTC CCTCATTTAA 661 AGCTCAGCGC AGTGTAGCAG CTTGGAGTGC AGCAGTCATT GTTATGTGTT TAAACCATCA 721 CATCACCTGG GCAAGCATCC CCAAGGAGAA TACATTCCAT ACAGGGTCTG ACTCAAAAGA 781 GAGAGAAACG TGTAAGTTCA ATAGGAGCAA AGAAAAACAC CCTTGGGTGC TTACATAATG 841 TGGCTGACAA GAAAGATGGT CATTTGAAAG TGTCCTCGGG AATTTTTTCT ACTATAATAG 901 TTAAAAAGAT GAGCTGCAGC TTGCTTCAGA TTTAGTATTC CTGATGCGCA TTTTTATTCT 961 TTGTGTGTAA GGAATCTAAT TTTATCTGGA TCAATGCCCA TTGCTAGCAT CTCTTAGCCA 1021 AGATTGGAAG CGGGCTTTGC CGTGGCTAGA GCAGTGGTGT TGTCCTCAGG AATTTACCTT 1081 GTTCTTGTGT ACTTCCCCGG GCAGGCATTG TTCTGGACTC TGGGGATGGT GTAACTCACA 1141 ATGTCCCCAT CTATGAGGGC TACGCTTTGC CCCATGCCAT CATGCGTCTG GATCTGGCTG 1201 GTCGGGACCT CACTGACTAC CTCATGAAGA TCCTCACTGA GCGTGGCTAC TCCTTTGTCA 1261 CCACTGGTGA GTGTGTGTGT CTCATCTGCC ACAGTGTGGG TCTGCTTTCC TCCTCTCTCA 1321 CTGAATCCGC CTACCTCCCT ATAATTGACT TCTTGCTTCA GAGCATGACT GTGATACTCT 1381 TTATTTCTGT AGCTGAACGT GAAATTGTCC GTGACATTAA AGAGAAGCTG TGCTATGTCG 1441 CCCTGGATTT TGAGAATGAG ATGGCCACAG CTGCCTCTTC CTCCTCCTTG GAGAAGAGCT 1501 ATGAACTGCC TGATGGCCAA GTCATCACTA TCGGCAATGA GCGCTTCCGC TGTCCTGAGA 1561 CACTCTTCCA GCCCTCCTTC ATTGGTGAGT TGTAGGGTCT GGTGTAGAGG CACGATTTTC 1621 CTGGAAATCT TAGGGTCTCC CAGAGTAAAA TCTAGAATTC TCAAGAAGCC CTTGAGTTAA 1681 AAGAAGTCAT TGTTTGGATT CCCACACAGC TCAACCTCAA GTCCTGATGG CTCGATTAAA 1741 GGGGAAGTCT ATGTTCTGCT CTAGTGATAT AATAAAGGAT GACACACTGA GTTCATCTCA 1801 AAATAAAGGA TGACACAGAG TAGTAACTGA ATAGCACTAT CTGCAG // LOCUS HUMACTCA3 442 bp ds-DNA PRI 15-JUN-1989 DEFINITION human alpha-cardiac actin gene, exon 5. ACCESSION J00072 KEYWORDS Z DNA; actin; alpha-cardiac actin; cardiac actin. SEGMENT 3 of 4 SOURCE human (homo sapiens): beta-thalassemic genomic library (fritsch et al.), in clone lambda-ha-25 [1],[2]; cdna to skeletal muscle mrna (clone phmca-1) [3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 41 to 193) AUTHORS Hamada,H. and Kakunaga,T. TITLE potential z-dna forming sequences are highly dispersed in the human genome JOURNAL Nature 298, 396-398 (1982) STANDARD full staff_review REFERENCE 2 (bases 1 to 442) AUTHORS Hamada,H., Petrino,M.G. and Kakunaga,T. TITLE molecular structure and evolutionary origin of human cardiac muscle actin gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 79, 5901-5905 (1982) STANDARD full staff_review REFERENCE 3 (bases 173 to 354) AUTHORS Gunning,P., Ponte,P., Blau,H. and Kedes,L. TITLE alpha-skeletal and alpha-cardiac actin genes are coexpressed in adult human skeletal muscle and heart JOURNAL Mol. Cell. Biol. 3, 1985-1995 (1983) STANDARD full staff_review COMMENT see segment 1 comment. [1] notes a potential z-dna sequence (poly (tg)) from bases 111 to 160. FEATURES from to/span description pept + 173 + 354 cardiac actin propeptide (exon 5) /nomgen="ACTC" /map="15q11-qter" /hgml_locus_uid="LB0038B" matp + 173 + 354 cardiac actin mature peptide (exon 5) pre-msg < 1 > 442 actin mRNA [2] IVS < 1 172 actin intron iv IVS 355 > 442 actin intron v conflict 246 246 c in [1]; g in [2] BASE COUNT 106 A 92 C 110 G 134 T ORIGIN About 400 bp after segment 2; chromosome 15q11-qter. 1 CTGCAGTGTG TCTTATAGGG GAACATATGT TTCAGAGACA AATGGTGACA GCTCCCCCAC 61 ACAAAGAAGT TCTGTTCTCT TCCCTCTACC TTGACCTGAA TGCACTGTGA TGTGTGTGTG 121 TGTGTGTGTG TGTGTGTGTG TGTGTGTGTG TGTGTGTGTG ACTCGTTCCC AGGTATGGAA 181 TCTGCTGGCA TCCATGAAAC AACTTACAAT AGCATCATGA AGTGTGACAT TGATATCCGC 241 AAGGACCTGT ATGCCAACAA TGTCTTATCT GGAGGCACCA CTATGTACCC TGGTATTGCT 301 GATCGTATGC AGAAGGAAAT CACTGCTCTG GCTCCTAGCA CCATGAAGAT TAAGGTAAAG 361 AACTTTTGTG AGTGGGAGAT CGAGGCAGGT CTTGGTATTC TAAGCGAACT ACGTTCCAAA 421 TTCCTTTTCC CTCATACTCA GT // LOCUS HUMACTCA4 749 bp ds-DNA PRI 15-JUN-1989 DEFINITION human alpha-cardiac actin gene, exon 6 and 3' flank. ACCESSION J00073 KEYWORDS actin; alpha-cardiac actin; cardiac actin. SEGMENT 4 of 4 SOURCE human (homo sapiens): beta-thalassemic genomic library (fritsch et al.), in clone lambda-ha-25 [1]; human cardiac muscle dna (clone phrl83-3'ut) and human cdna to skeletal muscle mrna (clone phmca-1) [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 604) AUTHORS Hamada,H., Petrino,M.G. and Kakunaga,T. TITLE molecular structure and evolutionary origin of human cardiac muscle actin gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 79, 5901-5905 (1982) STANDARD full staff_review REFERENCE 2 (bases 1 to 749) AUTHORS Gunning,P., Ponte,P., Blau,H. and Kedes,L. TITLE alpha-skeletal and alpha-cardiac actin genes are coexpressed in adult human skeletal muscle and heart JOURNAL Mol. Cell. Biol. 3, 1985-1995 (1983) STANDARD full staff_review COMMENT see segment 1 comment. [1] notes putative poly-adenylation signal at bases 531-536. FEATURES from to/span description pept + 233 376 cardiac actin propeptide (exon 6) /nomgen="ACTC" /map="15q11-qter" /hgml_locus_uid="LB0038B" matp + 233 376 cardiac actin mature peptide (exon 6) pre-msg < 1 557 actin mRNA [2] IVS < 1 232 actin intron v conflict 284 284 t in [1]; c in [2] conflict 438 438 c in [1]; t in [2] conflict 543 543 t in [1]; c in [2] BASE COUNT 194 A 151 C 148 G 254 T 2 others ORIGIN About 200 bp after segment 3; chromosome 15q11-qter. 1 TAGGTAAGTG GAGAGGTTCC ATTTTAATAA TAGAATATAT CTTTAATAAT ATAAATGATG 61 AAATAGAGGT GAGTTCATAT AACTTGATTG GCCATATTAT TTCGTGGTAT GACATATCCC 121 ACATTATAGC GAATTAATAT CTAATGGTTT TTCTGTGAAT CCTCCCAATG TGTTATTTGC 181 TCCCTTGCTT GGAACTTCAG AGTTCACTGG AAGTTTTTGT TTTCTTCTGC AGATTATTGC 241 TCCCCCTGAG CGTAAATACT CTGTCTGGAT TGGGGGCTCC ATCTTGGCCT CTCTGTCCAC 301 CTTCCAGCAA ATGTGGATTA GCAAGCAAGA GTACGATGAG GCAGGCCCAT CCATTGTCCA 361 CCGCAAATGC TTCTAAGATG CCTTCTCTCT CCATCTACCT TCCAGTCAGG ATGACGGTAT 421 TATGCTTCTT GGAGTCTCCC AAACCACCTT CCCTCATCTT TCATCAATCA TTGTACAGTT 481 TGTTTACACA CGTGCAATTT GTTTGTGCTT CTAATATTTA TTGCTTTATA AATAAACCAG 541 ACTAGGACTT GCAACCTATA AAAGCCTCTC GTTTGTTTTT GGGGTAGGCG TGGGGTGGGG 601 CAGGTGTTTG CTTTGACACC CTGAGCATTG TCAAAGTTCA GTAGCACAAY GTTCATCCAG 661 ATGAATTAAT ATGACAGTTA GCRGGGAGTT ATAATGCTAA CTTTGATTCA TATTTGGACA 721 GAATCATGAA TATATTCATA TCCGAAGCG // LOCUS HUMACTGA 3583 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human cytoskeletal gamma-actin gene, complete cds. ACCESSION M19283 KEYWORDS actin. SOURCE Human HeLa cell DNA, clone pgH-gamma-NMA-1, and cDNA to mRNA, clone pHF-gamma-A-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3583) AUTHORS Erba,H.P., Eddy,R., Shows,T., Kedes,L. and Gunning,P. TITLE Structure, chromosome location, and expression of the human gamma-actin gene: differential evolution, location, and expression of the cytoskeletal beta- and gamma-actin genes JOURNAL Mol. Cell. Biol. 8, 1775-1789 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable seqence for [1] kindly provided by H.P.Erba, 16-MAY-1988 FEATURES from to/span description pept 922 1044 gamma-actin, exon 2 (first expressed exon) /nomgen="ACTG" /map="17" /hgml_locus_uid="LD0166N" 1134 1373 gamma actin, exon 3 1651 2089 gamma actin, exon 4 2169 2350 gamma-actin, exon 5 2444 2587 gamma-actin, exon 6 pre-msg 475 > 3325 actin mRNA and introns IVS 543 915 actin intron A IVS 1045 1133 actin intron B IVS 1374 1650 actin intron C IVS 2090 2168 actin intron D IVS 2351 2443 actin intron E BASE COUNT 646 A 1036 C 1122 G 779 T ORIGIN 1 bp upstream of BamHI site. 1 GGATCCGCTT GGAGCCCGAG GGGAAAGATT GCAAGGGACG TCCCAGGGAG CAAAGCCCCG 61 CAGCGTGGCC GGGGACTCGG GGACCCACCC CAGCTCCGGC GCGGCCCCGC CCCGGGTCAG 121 GCAGTTTCGG GGCGCAGCAG GGCCCTCCCC ACGGCGCGGC AGTCCCGGGG AGGGCCAGGC 181 CCGTCAGCCC TTCGGCCCTC CCGGCGACCC TCGGAGGGCG CCCCAACTCA GACCCGCCCG 241 CCGGCCCGCG CAGGGTGCCG AGCGCGCGGG GGTGGCGCGG GCGGGGCCGT TGGGAGGCGG 301 TGCGGGCTGG GGGCGGGGCG GGCCGGGGCC CCACGTGTCC CTGCCCGGCG GGCCAATGGG 361 TGCCCGGCTT TCGGAAAGAT CGCCATATAT GGACATGTTC TGGGGCCGCG CGCGCCGCCG 421 GGCGCCGCGG GCGCGCCGCT TCCGCTTAAA TAACGGCGGG GGAGGCCGCG GTCGGTCTCA 481 GTCGCCGCTG CCAGCTCTCG CACTCTGTTC TTCCGCCCCT CCGCCGTCGC GTTTCTCTGC 541 CGGTGAGCGC CCCGCCCCGG GGCCTGAGCT GGACGTCGCA GGCCTGCGCC CCCCGACCCC 601 GGCTGGCCCC GCTTCCAGCT GCCGAGGCCT CGTCGCGCCT TCCCCGGGAA CAAAAGGCGG 661 GGTGGGCCGG GTCAGGCGGA CGGGGCTGGG GGGGCGCCGG GGGTCCAGGG CGGGGCCCGG 721 CCGCAGTGCA GACTTCCGAG CCGTGGACGT TACGTAAAAG GCCCGGCCTT CGGTTCCAGG 781 CGGTTGCGAT CTTGGAGGCC ACGGCCGCTG GGGTTGGGGC AGGGCCGAGG CTGCCCCGCG 841 GGAGGGCATT AGGTGGCTGT GGGGAGGGGA CCGTGTTACA GACGCGCCCG CCTGAGTCCA 901 TCCTTTTCCG GGCAGGTCGC AATGGAAGAA GAGATCGCCG CGCTGGTCAT TGACAATGGC 961 TCCGGCATGT GCAAAGCTGG TTTTGCTGGG GACGACGCTC CCCGAGCCGT GTTTCCTTCC 1021 ATCGTCGGGC GCCCCAGACA CCAGGTGAGT GGATGGCGCC GCGGGGCTCC TGGGTTCTGC 1081 GTTGCGGGGT GGGTTCGGTG TCGCCGGCGA GGCTGACGGA TCGTCCCCTG CAGGGCGTCA 1141 TGGTGGGCAT GGGCCAGAAG GACTCCTACG TGGGCGACGA GGCCCAGAGC AAGCGTGGCA 1201 TCCTGACCCT GAAGTACCCC ATTGAGCATG GCATCGTCAC CAACTGGGAC GACATGGAGA 1261 AGATCTGGCA CCACACCTTC TACAACGAGC TGCGCGTGGC CCCGGAGGAG CACCCAGTGC 1321 TGCTGACCGA GGCCCCCCTG AACCCCAAGG CCAACAGAGA GAAGATGACT CAGGTGAGGC 1381 TCGGCCGACG CCCGTGCTCC TCCCGTCCTT TCCCCAGTCA TTTTCTCGCC CGGCTTGATT 1441 TCTGACATTT AAGTGTTTCT TTCGCTGTTC CAGGCTCTGT TCCTCTCCCG GCATTTCCTC 1501 CCTGAAGCCT CCAGGTTTCT CATTTGGTTT CTGCCTGCGT TCTTTTCTTT TCTCCACACA 1561 TCACACTGGC ATGCAGCATG TTGTGGCGTG TGAGCATGGG GTGGCCGTGG GTCTCTGTCC 1621 CTGACTAAGC CGCCCCTTGT CCCTTCTCAG ATTATGTTTG AGACCTTCAA CACCCCGGCC 1681 ATGTACGTGG CCATCCAGGC CGTGCTGTCC CTCTACGCCT CTGGGCGCAC CACTGGCATT 1741 GTCATGGACT CTGGAGACGG GGTCACCCAC ACGGTGCCCA TCTACGAGGG CTACGCCCTC 1801 CCCCACGCCA TCCTGCGTCT GGACCTGGCT GGCCGGGACC TGACCGACTA CCTCATGAAG 1861 ATCCTCACTG AGCGAGGCTA CAGCTTCACC ACCACGGCCG AGCGGGAAAT CGTGCGCGAC 1921 ATCAAGGAGA AGCTGTGCTA CGTCGCCCTG GACTTCGAGC AGGAGATGGC CACCGCCGCA 1981 TCCTCCTCTT CTCTGGAGAA GAGCTACGAG CTGCCCGATG GCCAGGTCAT CACCATTGGC 2041 AATGAGCGGT TCCGGTGTCC GGAGGCGCTG TTCCAGCCTT CCTTCCTGGG TAGGTGTTGT 2101 GAGCTAAAGG TTTCTACTCT TTCATCCTCG GTGACACAGC ATCACTAAGG GAGGGCTCTG 2161 TCCCCTAGGT ATGGAATCTT GCGGCATCCA CGAGACCACC TTCAACTCCA TCATGAAGTG 2221 TGACGTGGAC ATCCGCAAAG ACCTGTACGC CAACACGGTG CTGTCGGGCG GCACCACCAT 2281 GTACCCGGGC ATTGCCGACA GGATGCAGAA GGAGATCACC GCCCTGGCGC CCAGCACCAT 2341 GAAGATCAAG GTGAGTCGAG GGGTTGGTGG CCCTCTGCCT GGCTCGGGAG AGCTGACTGG 2401 GGGGCGCTCT GCGAGCTGAA GCCGTGCCTG GCTGTCTTTG CAGATCATCG CACCCCCAGA 2461 GCGCAAGTAC TCAGTGTGGA TCGGCGGCTC CATCCTGGCC TCACTGTCCA CCTTCCAGCA 2521 GATGTGGATT AGCAAGCAGG AGTACGACGA GTCGGGCCCC TCCATCGTCC ACCGCAAATG 2581 CTTCTAGACG GACTCAGCAG ATGCGTAGCA TTTGCTGCAT GGGTTAATTG AGAATAGAAA 2641 TTTGCCCCTG GCAAATGCAC ACACCTCATG CTAGCCTCAC GAAACTGGAA TAAGCCTTCG 2701 AAAAGAAATT GTCCTTGAAG CTTGTATCTG ATATCAGCAC TGGATTGTAG AACTTGTTGC 2761 TGATTTTGAC CTTGTATTCA AGTTAACTGT TCCCCTTGGT ATTTGTTTAA TACCCTGTAC 2821 ATATCTTTGA GTTCAACCTT TAGTACGTGT GGCTTGGTCA CTTCGTGGCT GAGGTAAGAA 2881 CGTGCTTGTG GAAGACAAGT CTGTGGCTTG GTGAGTCTGT GTGGCCAGCA GCCTCTGATC 2941 TGTGCAGGGT ATTAACATGT CAGGGCTGAG TGTTCTGGGA TTTCTCTAGA GGCTGGCAAG 3001 AACCAGTTGT TTTTGTCTTG CGGGTCTGTC AGGGTTGGAA AGTCCAAGCC GTAGGACCCA 3061 GTTTCCTTTC TTAGCTGATG TCTTTGGCCA GAACACCGTG GGCTGTTAAC TTGCCTTGAG 3121 TTGGAAGCGG TTTGCATTTA CGCCTGTAAA TGTATTCATT CTTAATTTAT GTAAGGTTTT 3181 TTTTGTACGC AATTCTCGAT TCTTTAAAGA GATGACAACA AATTTTGGTT TTCTACTGTT 3241 ATGTGAGAAT ATTAGGCCCC AGCAACACGT CATTGTGTAA GGAAAAATAA AAGTGCTGCC 3301 GTAACCAATG AATGGCTCCT GTTTGGGGAA GTAGCAAGTG GGCTGGGAAA GACAGACCAA 3361 CCTGGAAGTA TTGGGTAGTC TTGGGGGTGG GGGCGGTGGC TGCTGCTCAG CTTGGCTTCG 3421 TGGGCTGGTG AGAAAACGGC TTAACAATAA ACTGAGCAGT TCGATTTCTT CCAAATCGAA 3481 AGTGCAAGAA CAAGGCAGCT AGTGGGTTTG GGTGTTGGAA ATAACTGAAG CAACATCATA 3541 AGCAGGCTGG GAGGGAAGTC CTGAGACGGC TTTTTCCGTT ATT // LOCUS HUMACTSM1 976 bp ds-DNA PRI 07-JAN-1985 DEFINITION Human smooth muscle actin gene (aortic): exon 1 & ivs flanks. ACCESSION K01741 KEYWORDS actin. SEGMENT 1 of 7 SOURCE Human DNA, clone lambda-Ha201. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 976) AUTHORS Ueyama,H., Hamada,H., Battula,N. and Kakunaga,T. TITLE Structure of a human smooth muscle actin gene (aortic type) with a unique intron site JOURNAL Mol. Cell. Biol. 4, 1073-1078 (1984) STANDARD simple staff_review COMMENT [1] the amino acid sequences deduced from these nucleotide sequences are identical to those reported for bovine aortic smooth muscle actin (Vandekerckhove,J., and Weber,K. (1979) 14, 123-133), except for codon 309, in which codon gcn for alanine is replaced by codon ftc for valine. FEATURES from to/span description pept 200 + 328 actin prepeptide matp 206 + 328 actin mature peptide exon 1 IVS 329 > 976 actin intron 1 BASE COUNT 295 A 205 C 193 G 283 T ORIGIN 1 base upstream of HindIII site. 1 AAGCTTATAG TGTAACTAAA GTACTTCCAG ATACATGCTA TATTAGATAA GGGTCAGAGG 61 ATAGCACTAG GACACAGAGG AGGGCCCCCA AACCCAGCCT TGGACGTGGG AGTTGTCAGG 121 TAAGTGCCAC AAATGCCCAA TTACAGCTGA GGCTGCTTCC TCCCTGTTTT CTATAGAATC 181 CTGTGAAGCA GCTCCAGCTA TGTGTGAAGA AGAGGACAGC ACTGCCTTGG TGTGTGACAA 241 TGGCTCTGGG CTCTGTAAGG CCGGCTTTGC TGGGGACGAT GCTCCCAGGG CTGTTTTCCC 301 ATCCATTGTG GGACGTCCCA GACATCAGGT GAGGAAAATT ATGATTGGAA CATAACAGGA 361 CACAGATTAG GTTCCTCTGT GTTCATGTTT ATCCCCAGAT TGTCTATCTC TTTCTGCCCA 421 GAAGTTATGT AACCTTGTTA CAAATAGACC TCTAATGATT AAGGTCCAGG AGTAAATGCC 481 AATTCACATA TAATGTAGAC AGATTATCTG ATGGGCATCT ATCAGATACA AAGTCTGCCC 541 CTTTTTCATG TCCTTTTTGT CTAAATATAG TCATTATCAT CATCATCATC ATCATCAAAT 601 CATTTCATCA CCATCAGAAA TGCTTATACA TTATCCTGAT GTATACCAAA GCTACTGTTT 661 GGAAAGAAAC TAAAATAAAA GTCCAGGTCA CTTAACCATA CAGGGCTGAT GTTAGATGAA 721 AGCAAGCATC GATACCAAAT GCAATTTTAC ATAATATTAC CTGTCAACAA AATATATTTG 781 GACAGCCGCA TGGTAATTTT ACACATTATG TGTAAACAAA GTATTGGTGG CATCACATGG 841 TAAAAACTCA GTAATTTCAC CTCAGAAATT CTCCTCCACA CCAGAAATGT AGTTTGTGCA 901 TTGAGGCTAT CTGATTGATG TTTATGCCTC TCTGCTTGGG ATATATTCAT GAGAATAAAT 961 AATAGAAACC TCTCCC // LOCUS HUMACTSM2 292 bp ds-DNA PRI 04-AUG-1986 DEFINITION Human smooth muscle actin gene (aortic): exon 2 and ivs flanks. ACCESSION K01742 KEYWORDS actin. SEGMENT 2 of 7 SOURCE Human DNA, clone lambda-Ha201. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 292) AUTHORS Ueyama,H., Hamada,H., Battula,N. and Kakunaga,T. TITLE Structure of a human smooth muscle actin gene (aortic type) with a unique intron site JOURNAL Mol. Cell. Biol. 4, 1073-1078 (1984) STANDARD simple staff_review FEATURES from to/span description pept + 150 + 278 actin exon 2 IVS < 1 149 actin intron 1 IVS 279 > 292 actin intron 2 BASE COUNT 88 A 54 C 78 G 72 T ORIGIN 45 bases upstream of MboII site. 1 CATAGTGGCA ATGAGACTCA ATATGTTTAT TGTTGCTGAT AACCAGAAGA GGAACTTCCC 61 ATCTCAATCA CTTAACCTTT TGTGGCTTGG CTGTAACTGA TTCACAAGTT TTGGGAGATG 121 CTGACTCATA ATGTGTTGAA CCTTTTTAGG GGGTGATGGT GGGAATGGGA CAAAAAGACA 181 GCTACGTGGG TCACGAAGCA CAGAGCAAAA GAGGAATCCT GACCCTGAAG TACCCGATAG 241 AACATGGCAT CATCACCAAC TGGGACGACA TGGAAAAGGT ATGGGAGGTT GC // LOCUS HUMACTSM3 223 bp ds-DNA PRI 04-AUG-1986 DEFINITION Human smooth muscle actin gene (aortic): exon 3 and ivs flanks. ACCESSION K01743 KEYWORDS actin. SEGMENT 3 of 7 SOURCE Human DNA, clone lambda-Ha201. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 223) AUTHORS Ueyama,H., Hamada,H., Battula,N. and Kakunaga,T. TITLE Structure of a human smooth muscle actin gene (aortic type) with a unique intron site JOURNAL Mol. Cell. Biol. 4, 1073-1078 (1984) STANDARD simple staff_review FEATURES from to/span description pept + 44 + 154 actin exon 3 IVS < 1 43 actin intron 2 IVS 155 > 223 actin intron 3 BASE COUNT 57 A 70 C 48 G 48 T ORIGIN 42 bases upstream of BglII site. 1 ACAGCACATC ATTGTGTTTC TCCTCTGTCC CCCAACTCAC TAGATCTGGC ACCACTCTTT 61 CTACAATGAG CTTCGTGTTG CCCCTGAAGA GCATCCCACC CTGCTCACGG AGGCACCCCT 121 GAACCCCAAG GCCAACCGGG AGAAAATGAC TCAAGTAAGT ATGCTACCCC AGCACACAAC 181 ACTGTGGGGT CTGGGGAAAC TTCTGTTAGA AGGAGGCTAT GCA // LOCUS HUMACTSM4 203 bp ds-DNA PRI 04-AUG-1986 DEFINITION Human smooth muscle actin gene (aortic): exon 4 and ivs flanks. ACCESSION K01744 KEYWORDS actin. SEGMENT 4 of 7 SOURCE Human DNA, clone lambda-Ha201. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 203) AUTHORS Ueyama,H., Hamada,H., Battula,N. and Kakunaga,T. TITLE Structure of a human smooth muscle actin gene (aortic type) with a unique intron site JOURNAL Mol. Cell. Biol. 4, 1073-1078 (1984) STANDARD simple staff_review FEATURES from to/span description pept + 75 + 159 actin exon 4 IVS < 1 74 actin intron 3 IVS 160 > 203 actin intron 5 BASE COUNT 35 A 53 C 56 G 59 T ORIGIN 50 bases upstream of the DdeI site. 1 CCAGGTGTGC TCTCCTGACC ATTCTTTGTC AGATGGGCAC CTTCACTAAC TCAGCTCTTT 61 GGATTGCTTT GCAGATTATG TTTGAGACTT TCAATGTCCC AGCCATGTAT GTGGCTATCC 121 AGGCGGTGCT GTCTCTCTAT GCCTCTGGAC GCACAACTGG TAGGTGGCTG GGCGGGACAG 181 CAAGCTGGTT GGAGCGCACT GCC // LOCUS HUMACTSM5 530 bp ds-DNA PRI 07-JAN-1985 DEFINITION Human smooth muscle actin gene (aortic): exon 5 and ivs flanks. ACCESSION K01745 KEYWORDS actin. SEGMENT 5 of 7 SOURCE Human DNA, clone lambda-Ha201. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 530) AUTHORS Ueyama,H., Hamada,H., Battula,N. and Kakunaga,T. TITLE Structure of a human smooth muscle actin gene (aortic type) with a unique intron site JOURNAL Mol. Cell. Biol. 4, 1073-1078 (1984) STANDARD simple staff_review FEATURES from to/span description pept + 314 + 475 actin exon 5 IVS < 1 313 actin intron 4 IVS 476 > 530 actin intron 6 BASE COUNT 106 A 153 C 113 G 158 T ORIGIN 48 bases upstream of the BglII site. 1 CGGACTGGAG TTGAGTTAGA ATGGACACGG CTGAACCCAG CCCTAAAAGA TCTTCTTCTA 61 GTGTTCAAAA TAAGCCCTTG ATCTCATATT TCTAGCAGAA CTTTTTTCTA CTGTTTTCCT 121 CTTCCTGTTC TTTTACTTTT TTTCATCTTC CCCGAGCCCT GTCCCACCTC ACCTCTCCTA 181 TCACATGCCC TGTTTTCTCT TTTTAAATAA GAGGCTCGGA AATTCCCTCT CCCCCAGGGC 241 TTCCCTCTAC TTGTCCAGCA AGTAACCACA TGCCAGCTGC CATGGTGACT TATCACGCCC 301 AATTGTCTCC CAGGCATCGT GCTGGACTCT GGAGATGGTG TCACCCACAA TGTCCCCATC 361 TATGAGGGCT ATGCCTTGCC CCATGCCATC ATGCGTCTGG ATCTGGCTGG CCGAGATCTC 421 ACTGACTACC TCATGAAGAT CCTGACTGAG CGTGGCTATT CCTTCGTTAC TACTGGTGAG 481 GCCTGTGGGA GATAAGGGGG AGAGGGGAGC AGTGTGTTGC TCAGTACCTC // LOCUS HUMACTSM6 269 bp ds-DNA PRI 07-JAN-1985 DEFINITION Human smooth muscle actin gene (aortic): exon 6 and ivs flanks. ACCESSION K01746 KEYWORDS actin. SEGMENT 6 of 7 SOURCE Human DNA, clone lambda-Ha201. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 269) AUTHORS Ueyama,H., Hamada,H., Battula,N. and Kakunaga,T. TITLE Structure of a human smooth muscle actin gene (aortic type) with a unique intron site JOURNAL Mol. Cell. Biol. 4, 1073-1078 (1984) STANDARD simple staff_review FEATURES from to/span description pept + 43 + 234 actin exon 6 IVS < 1 42 actin intron 5 IVS 235 > 269 actin intron 6 BASE COUNT 65 A 65 C 72 G 67 T ORIGIN 43 bases upstream of the DdeI site. 1 CAGACCCTAA TGTTTGCCAC AATGTTTGTA TCTGGTTGAC AGCTGAGCGT GAGATTGTCC 61 GGGACATCAA GGAGAAACTG TGTTATGTAG CTCTGGACTT TGAAAATGAG ATGGCCACTG 121 CCGCATGGTC ATCCTCCCTT GAGAAGAGTT ACGAGTTGCC TGATGGGCAA GTGATCACCA 181 TCGGAAATGA ACGTTTCCGC TGCCCAGAGA CCCTGTTCCA GCCATCCTTC ATCGGTAGGC 241 AACAAACGCC CTTTTCCTGG GAAGGGGAG // LOCUS HUMACTSM7 408 bp ds-DNA PRI 07-JAN-1985 DEFINITION Human smooth muscle actin gene (aortic): exon 7 and ivs flanks. ACCESSION K01747 KEYWORDS actin. SEGMENT 7 of 7 SOURCE Human DNA, clone lambda-Ha201. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 408) AUTHORS Ueyama,H., Hamada,H., Battula,N. and Kakunaga,T. TITLE Structure of a human smooth muscle actin gene (aortic type) with a unique intron site JOURNAL Mol. Cell. Biol. 4, 1073-1078 (1984) STANDARD simple staff_review FEATURES from to/span description pept + 98 / 279 actin peptide (exon 6) IVS 280 > 408 intron 7 BASE COUNT 110 A 102 C 97 G 99 T ORIGIN 43 bases upstream of the DdeI site. 1 CACTGCGAAG ATCATCCAGA TGTTCAAGTC TGGTCTTCCA TGTCTAGCCT AGGTTTTACT 61 CTCCACTTAA GGACCATGGC CTGTGTCTCT TTTCCAGGGA TGGAGTCTGC TGGCATCCAT 121 GAAACCACCT ACAACAGCAT CATGAAGTGT GATATTGACA TCAGGAAGGA CCTCTATGCT 181 AACAATGTCC TATCAGGGGG CACCACTATG TACCCTGGCA TTGTCGACCG AATGCAGAAG 241 GAGATCACGG CCCTAGCACC CAGCACCATG AAGATCAAGG TACTGTTCAG TGACAGACCC 301 AGTGGCAATG CCGCCAGCAG TGTCAGCCCA TAACCACAGA TAAGAACTCT TCAGAGACTG 361 TATTTTGGTG ACATGCAATT GTGGGTTTTA GGGGTAGGTA ATGACACA // LOCUS HUMADA01 272 bp ds-DNA PRI 22-DEC-1986 DEFINITION Human adenosine deaminase gene, exon 1. ACCESSION M11816 X02994 KEYWORDS adenosine deaminase; deaminase; hydrolase. SEGMENT 1 of 11 SOURCE Human placenta DNA, clones ADA cos[1.3,2.5,4.3] [3]; lymphoblastoid T-cell line MOLT-4, cDNA to mRNA, clones pADAc[2,30] and pLL [1]; cell line HPB-ALL, cDNA to mRNA, clone ADA 211 [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 166 to 259) AUTHORS Valerio,D., McIvor,R.S., Williams,S.R., Duyvesteyn,M.G.C., Van Ormondt,H., Van Der Eb,A.J. and Martin,D.W.Jr. TITLE Cloning of human deaminase cDNA and expression in mouse cells JOURNAL Gene 31, 147-153 (1984) STANDARD full staff_review REFERENCE 2 (bases 155 to 259) AUTHORS Adrian,G.S., Wiginton,D.A. and Hutton,J.J. TITLE Structure of adenosine deaminase mRNAs from normal and adenosine deaminase-deficient human cell lines JOURNAL Mol. Cell. Biol. 4, 1712-1717 (1984) STANDARD full staff_review REFERENCE 3 (bases 1 to 272) AUTHORS Valerio,D., Duyvesteyn,M.G.C., Dekker,B.M.M., Weeda,G., Berkvens,T.M., van der Voorn,L., Van Ormondt,H. and Van Der Eb,A.J. TITLE Adenosine deaminase: Characterization and expression of a gene with a remarkable promoter JOURNAL EMBO J. 4, 437-443 (1985) STANDARD full staff_review REFERENCE 4 (bases 1 to 272) AUTHORS Van Ormondt,H. JOURNAL Unpublished (1985) Submitted to the EMBL data library. STANDARD full staff_review FEATURES from to/span description pept 227 + 259 adenosine deaminase, exon 1 (EC 3.5.4.4) /nomgen="ADA" /map="20q13.11 or 20q13.2-qter" /hgml_locus_uid="LK0163R" pre-msg 132 > 272 ADA mRNA (alt.) pre-msg 135 > 272 ADA mRNA (alt.) IVS 260 > 272 ADA intron A variant 173 173 a in [2],[3]; c in clone ADA33 revision 195 195 g in [3],[4]; t in [1] revision 203 204 ag in [4]; ga in [1],[3] conflict 214 216 gca in [1],[3],[4]; ga in [2] BASE COUNT 45 A 93 C 122 G 12 T ORIGIN 127 bp upstream of SacII site; long arm of chromosome 20. 1 TCCAGGAAAT GCGCGATCCA GGCCGGCGGG CGGGGCGGGG GCTCCGGCGA GAGGGCGGGC 61 CCCGGGAACG GCGGCGGGCG GGGCGGGAGG CGGGGCCCGG CCCGTTAAGA AGAGCGTGGC 121 CGGCCGCGGC CACCGCTGGC CCCAGGGAAA GCCGAGCGGC CACCGAGCCG GCAGAGACCC 181 ACCGAGCGGC GGCGGAGGGA GCGACGCCGG GGCGCACGAG GGCACCATGG CCCAGACGCC 241 CGCCTTCGAC AAGCCCAAAG TGAGCGCGCG CG // LOCUS HUMADA02 153 bp ds-DNA PRI 22-DEC-1986 DEFINITION Human adenosine deaminase gene, exon 2. ACCESSION M11817 X02994 KEYWORDS adenosine deaminase; deaminase; hydrolase. SEGMENT 2 of 11 SOURCE Human placenta DNA, clones ADA cos[1.3,2.5,4.3] [3]; lymphoblastoid T-cell line MOLT-4, cDNA to mRNA, clones pADAc[2,30] and pLL [1]; cell line HPB-ALL, cDNA to mRNA, clone ADA 211 [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 46 to 107) AUTHORS Valerio,D., McIvor,R.S., Williams,S.R., Duyvesteyn,M.G.C., Van Ormondt,H., Van Der Eb,A.J. and Martin,D.W.Jr. TITLE Cloning of human deaminase cDNA and expression in mouse cells JOURNAL Gene 31, 147-153 (1984) STANDARD full staff_review REFERENCE 2 (bases 46 to 107) AUTHORS Adrian,G.S., Wiginton,D.A. and Hutton,J.J. TITLE Structure of adenosine deaminase mRNAs from normal and adenosine deaminase-deficient human cell lines JOURNAL Mol. Cell. Biol. 4, 1712-1717 (1984) STANDARD full staff_review REFERENCE 3 (bases 1 to 153) AUTHORS Valerio,D., Duyvesteyn,M.G.C., Dekker,B.M.M., Weeda,G., Berkvens,T.M., van der Voorn,L., Van Ormondt,H. and Van Der Eb,A.J. TITLE Adenosine deaminase: Characterization and expression of a gene with a remarkable promoter JOURNAL EMBO J. 4, 437-443 (1985) STANDARD full staff_review REFERENCE 4 (bases 1 to 153) AUTHORS Van Ormondt,H. JOURNAL Unpublished (1985) Submitted to the EMBL data library. STANDARD full staff_review FEATURES from to/span description pept + 46 + 107 adenosine deaminase, exon 2 /nomgen="ADA" /map="20q13.11 or 20q13.2-qter" /hgml_locus_uid="LK0163R" pre-msg < 1 > 153 ADA mRNA IVS < 1 45 ADA intron A IVS 108 > 153 ADA intron B BASE COUNT 30 A 51 C 29 G 43 T ORIGIN About 15 kb after segment 1. 1 TTCTCTCTCT CCCTTTCTCT CTCTCTTCCC CCTGCCCCCT TGCAGGTAGA ACTGCATGTC 61 CACCTAGACG GATCCATCAA GCCTGAAACC ATCTTATACT ATGGCAGGTA AGTCCATACA 121 GAAGAGCCCT CTCTCCCTGG GATTTGAGTG GGG // LOCUS HUMADA03 180 bp ds-DNA PRI 22-DEC-1986 DEFINITION Human adenosine deaminase gene, exon 3. ACCESSION M11818 X02994 KEYWORDS adenosine deaminase; deaminase; hydrolase. SEGMENT 3 of 11 SOURCE Human placenta DNA, clones ADA cos[1.3,2.5,4.3] [3]; lymphoblastoid T-cell line MOLT-4, cDNA to mRNA, clones pADAc[2,30] and pLL [1]; cell line HPB-ALL, cDNA to mRNA, clone ADA 211 [2]; cDNA to mRNA, clone pSV2 ADA(II) [4]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 46 to 168) AUTHORS Valerio,D., McIvor,R.S., Williams,S.R., Duyvesteyn,M.G.C., Van Ormondt,H., Van Der Eb,A.J. and Martin,D.W.Jr. TITLE Cloning of human deaminase cDNA and expression in mouse cells JOURNAL Gene 31, 147-153 (1984) STANDARD full staff_review REFERENCE 2 (bases 46 to 168) AUTHORS Adrian,G.S., Wiginton,D.A. and Hutton,J.J. TITLE Structure of adenosine deaminase mRNAs from normal and adenosine deaminase-deficient human cell lines JOURNAL Mol. Cell. Biol. 4, 1712-1717 (1984) STANDARD full staff_review REFERENCE 3 (bases 1 to 180) AUTHORS Valerio,D., Duyvesteyn,M.G.C., Dekker,B.M.M., Weeda,G., Berkvens,T.M., van der Voorn,L., Van Ormondt,H. and Van Der Eb,A.J. TITLE Adenosine deaminase: Characterization and expression of a gene with a remarkable promoter JOURNAL EMBO J. 4, 437-443 (1985) STANDARD full staff_review REFERENCE 4 (bases 92 to 115) AUTHORS Orkin,S.H., Goff,S.C., Kelley,W.N. and Daddona,P.E. TITLE Transient expression of human adenosine deaminase cDNAs: Identification of a nonfunctional clone resulting from a single amino acid substitution JOURNAL Mol. Cell. Biol. 5, 762-767 (1985) STANDARD full staff_review REFERENCE 5 (bases 1 to 180) AUTHORS Van Ormondt,H. JOURNAL Unpublished (1985) Submitted to the EMBL data library.ref. STANDARD full staff_review COMMENT The mutation annotated below results in a Gly to Val change in clone pSV2 ADA(II), which inactivates the protein and appears to be a change that arose during cDNA cloning [4]. FEATURES from to/span description pept + 46 + 168 adenosine deaminase, exon 3 /nomgen="ADA" /map="20q13.11 or 20q13.2-qter" /hgml_locus_uid="LK0163R" pre-msg < 1 > 180 ADA mRNA IVS < 1 45 ADA intron B IVS 169 > 180 ADA intron C mut 102 102 g in wt; t in clone pSV2 ADA(II) (see comment) BASE COUNT 34 A 58 C 47 G 41 T ORIGIN About 7 kb after segment 2. 1 GGATCCTGCC CCGGGTTGGT CCACCACTCA CTGTTTTGTT TCCAGGAGGA GAGGGATCGC 61 CCTCCCAGCT AACACAGCAG AGGGGCTGCT GAACGTCATT GGCATGGACA AGCCGCTCAC 121 CCTTCCAGAC TTCCTGGCCA AGTTTGACTA CTACATGCCT GCTATCGCGT GAGTTGCCCC // LOCUS HUMADA04 185 bp ds-DNA PRI 22-DEC-1986 DEFINITION Human adenosine deaminase gene, exon 4. ACCESSION M11819 X02994 KEYWORDS adenosine deaminase; deaminase; hydrolase. SEGMENT 4 of 11 SOURCE Human placenta DNA, clones ADA cos[1.3,2.5,4.3] [3]; lymphoblastoid T-cell line MOLT-4, cDNA to mRNA, clones pADAc[2,30] and pLL [1]; cell line HPB-ALL, cDNA to mRNA, clone ADA 211 [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 21 to 164) AUTHORS Valerio,D., McIvor,R.S., Williams,S.R., Duyvesteyn,M.G.C., Van Ormondt,H., Van Der Eb,A.J. and Martin,D.W.Jr. TITLE Cloning of human deaminase cDNA and expression in mouse cells JOURNAL Gene 31, 147-153 (1984) STANDARD full staff_review REFERENCE 2 (bases 21 to 164) AUTHORS Adrian,G.S., Wiginton,D.A. and Hutton,J.J. TITLE Structure of adenosine deaminase mRNAs from normal and adenosine deaminase-deficient human cell lines JOURNAL Mol. Cell. Biol. 4, 1712-1717 (1984) STANDARD full staff_review REFERENCE 3 (bases 1 to 185) AUTHORS Valerio,D., Duyvesteyn,M.G.C., Dekker,B.M.M., Weeda,G., Berkvens,T.M., van der Voorn,L., Van Ormondt,H. and Van Der Eb,A.J. TITLE Adenosine deaminase: Characterization and expression of a gene with a remarkable promoter JOURNAL EMBO J. 4, 437-443 (1985) STANDARD full staff_review REFERENCE 4 (bases 1 to 185) AUTHORS Van Ormondt,H. JOURNAL Unpublished (1985) Submitted to the EMBL data library. STANDARD full staff_review FEATURES from to/span description pept + 21 + 164 adenosine deaminase, exon 4 /nomgen="ADA" /map="20q13.11 or 20q13.2-qter" /hgml_locus_uid="LK0163R" pre-msg < 1 > 185 ADA mRNA IVS < 1 20 ADA intron C IVS 165 > 185 ADA intron D BASE COUNT 40 A 46 C 62 G 37 T ORIGIN About 2.5 kb after segment 3. 1 CCCCTTTCTT CCCTTCCCAG GGGCTGCCGG GAGGCTATCA AAAGGATCGC CTATGAGTTT 61 GTAGAGATGA AGGCCAAAGA GGGCGTGGTG TATGTGGAGG TGCGGTACAG TCCGCACCTG 121 CTGGCCAACT CCAAAGTGGA GCCAATCCCC TGGAACCAGG CTGAGTGAGT GATGGGCCTG 181 GAAGG // LOCUS HUMADA05 182 bp ds-DNA PRI 22-DEC-1986 DEFINITION Human adenosine deaminase gene, exon 5. ACCESSION M11820 X02994 KEYWORDS adenosine deaminase; deaminase; hydrolase. SEGMENT 5 of 11 SOURCE Human placenta DNA, clones ADA cos[1.3,2.5,4.3] [3]; lymphoblastoid T-cell line MOLT-4, cDNA to mRNA, clones pADAc[2,30] and pLL [1]; cell line HPB-ALL, cDNA to mRNA, clone ADA 211 [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 18 to 133) AUTHORS Valerio,D., McIvor,R.S., Williams,S.R., Duyvesteyn,M.G.C., Van Ormondt,H., Van Der Eb,A.J. and Martin,D.W.Jr. TITLE Cloning of human deaminase cDNA and expression in mouse cells JOURNAL Gene 31, 147-153 (1984) STANDARD full staff_review REFERENCE 2 (bases 18 to 133) AUTHORS Adrian,G.S., Wiginton,D.A. and Hutton,J.J. TITLE Structure of adenosine deaminase mRNAs from normal and adenosine deaminase-deficient human cell lines JOURNAL Mol. Cell. Biol. 4, 1712-1717 (1984) STANDARD full staff_review REFERENCE 3 (bases 1 to 182) AUTHORS Valerio,D., Duyvesteyn,M.G.C., Dekker,B.M.M., Weeda,G., Berkvens,T.M., van der Voorn,L., Van Ormondt,H. and Van Der Eb,A.J. TITLE Adenosine deaminase: Characterization and expression of a gene with a remarkable promoter JOURNAL EMBO J. 4, 437-443 (1985) STANDARD full staff_review REFERENCE 4 (bases 1 to 182) AUTHORS Van Ormondt,H. JOURNAL Unpublished (1985) Submitted to the EMBL data library. STANDARD full staff_review FEATURES from to/span description pept + 18 + 133 adenosine deaminase, exon 5 /nomgen="ADA" /map="20q13.11 or 20q13.2-qter" /hgml_locus_uid="LK0163R" pre-msg < 1 > 182 ADA mRNA IVS < 1 17 ADA intron D IVS 134 > 182 ADA intron E variant 45 45 g in [1],[3] and clone ADA33; a in [2] BASE COUNT 29 A 67 C 59 G 27 T ORIGIN About 0.7 kb after segment 4. 1 CTCCTCTCCT CACACAGAGG GGACCTCACC CCAGACGAGG TGGTGGCCCT AGTGGGCCAG 61 GGCCTGCAGG AGGGGGAGCG AGACTTCGGG GTCAAGGCCC GGTCCATCCT GTGCTGCATG 121 CGCCACCAGC CCAGTGAGTA GGATCACCGC CCTGCCCAGG GCGCCCGTCT CACCCTGGCC 181 CT // LOCUS HUMADA06 168 bp ds-DNA PRI 22-DEC-1986 DEFINITION Human adenosine deaminase gene, exon 6. ACCESSION M11821 X02994 KEYWORDS adenosine deaminase; deaminase; hydrolase. SEGMENT 6 of 11 SOURCE Human placenta DNA, clones ADA cos[1.3,2.5,4.3] [3]; lymphoblastoid T-cell line MOLT-4, cDNA to mRNA, clones pADAc[2,30] and pLL [1]; cell line HPB-ALL, cDNA to mRNA, clone ADA 211 [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 13 to 140) AUTHORS Valerio,D., McIvor,R.S., Williams,S.R., Duyvesteyn,M.G.C., Van Ormondt,H., Van Der Eb,A.J. and Martin,D.W.Jr. TITLE Cloning of human deaminase cDNA and expression in mouse cells JOURNAL Gene 31, 147-153 (1984) STANDARD full staff_review REFERENCE 2 (bases 13 to 140) AUTHORS Adrian,G.S., Wiginton,D.A. and Hutton,J.J. TITLE Structure of adenosine deaminase mRNAs from normal and adenosine deaminase-deficient human cell lines JOURNAL Mol. Cell. Biol. 4, 1712-1717 (1984) STANDARD full staff_review REFERENCE 3 (bases 1 to 168) AUTHORS Valerio,D., Duyvesteyn,M.G.C., Dekker,B.M.M., Weeda,G., Berkvens,T.M., van der Voorn,L., Van Ormondt,H. and Van Der Eb,A.J. TITLE Adenosine deaminase: Characterization and expression of a gene with a remarkable promoter JOURNAL EMBO J. 4, 437-443 (1985) STANDARD full staff_review REFERENCE 4 (bases 1 to 168) AUTHORS Van Ormondt,H. JOURNAL Unpublished (1985) Submitted to the EMBL data library. STANDARD full staff_review FEATURES from to/span description pept + 13 + 140 adenosine deaminase, exon 6 /nomgen="ADA" /map="20q13.11 or 20q13.2-qter" /hgml_locus_uid="LK0163R" pre-msg < 1 > 168 ADA mRNA IVS < 1 12 ADA intron E IVS 141 > 168 ADA intron F BASE COUNT 38 A 44 C 56 G 30 T ORIGIN About 1 kb after segment 5. 1 TCTCGCCCAC AGACTGGTCC CCCAAGGTGG TGGAGCTGTG TAAGAAGTAC CAGCAGCAGA 61 CCGTGGTGGC CATTGACCTG GCTGGAGATG AGACCATCCC AGGAAGCAGC CTCTTGCCTG 121 GACATGTCCA GGCCTACCAG GTGGGTCCTG TGAGAAGGAA TGGAGAGG // LOCUS HUMADA07 371 bp ds-DNA PRI 22-DEC-1986 DEFINITION Human adenosine deaminase gene, exons 7 and 8. ACCESSION M11822 X02994 KEYWORDS adenosine deaminase; deaminase; hydrolase. SEGMENT 7 of 11 SOURCE Human placenta DNA, clones ADA cos[1.3,2.5,4.3] [3]; lymphoblastoid T-cell line MOLT-4, cDNA to mRNA, clones pADAc[2,30] and pLL [1]; cell line HPB-ALL, cDNA to mRNA, clone ADA 211 [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 28 to 99; 176 to 277) AUTHORS Valerio,D., McIvor,R.S., Williams,S.R., Duyvesteyn,M.G.C., Van Ormondt,H., Van Der Eb,A.J. and Martin,D.W.Jr. TITLE Cloning of human deaminase cDNA and expression in mouse cells JOURNAL Gene 31, 147-153 (1984) STANDARD full staff_review REFERENCE 2 (bases 28 to 99; 176 to 277) AUTHORS Adrian,G.S., Wiginton,D.A. and Hutton,J.J. TITLE Structure of adenosine deaminase mRNAs from normal and adenosine deaminase-deficient human cell lines JOURNAL Mol. Cell. Biol. 4, 1712-1717 (1984) STANDARD full staff_review REFERENCE 3 (bases 1 to 371) AUTHORS Valerio,D., Duyvesteyn,M.G.C., Dekker,B.M.M., Weeda,G., Berkvens,T.M., van der Voorn,L., Van Ormondt,H. and Van Der Eb,A.J. TITLE Adenosine deaminase: Characterization and expression of a gene with a remarkable promoter JOURNAL EMBO J. 4, 437-443 (1985) STANDARD full staff_review REFERENCE 4 (bases 1 to 371) AUTHORS Van Ormondt,H. JOURNAL Unpublished (1985) Submitted to the EMBL data library. STANDARD full staff_review FEATURES from to/span description pept + 28 99 adenosine deaminase, exon 7 /nomgen="ADA" /map="20q13.11 or 20q13.2-qter" /hgml_locus_uid="LK0163R" 176 + 277 adenosine deaminase, exon 8 pre-msg < 1 > 371 ADA mRNA IVS < 1 27 ADA intron F IVS 100 175 ADA intron G IVS 278 > 371 ADA intron H BASE COUNT 79 A 110 C 116 G 66 T ORIGIN About 1.1 kb after segment 6. 1 CCCGTGACCT GGCTCTCCCC CTTCCAGGAG GCTGTGAAGA GCGGCATTCA CCGTACTGTC 61 CACGCCGGGG AGGTGGGCTC GGCCGAAGTA GTAAAAGAGG TGAGGGCCTG GGCTGGCCAT 121 GGGGTCCCTC CTCACTGCCT CCTCCCATAC TTGGCTCTAT TCTGCTTCTC TACAGGCTGT 181 GGACATACTC AAGACAGAGC GGCTGGGACA CGGCTACCAC ACCCTGGAAG ACCAGGCCCT 241 TTATAACAGG CTGCGGCAGG AAAACATGCA CTTCGAGGTA AGCGGGCCAG GGAGTGGGGA 301 GGAACCATCC CCGGCTGTCC CAACTTCCTG TATAGAGAGG CAGAAAGCAG GGCGGGTCCC 361 AGGAACTCGA G // LOCUS HUMADA08 95 bp ds-DNA PRI 22-DEC-1986 DEFINITION Human adenosine deaminase gene, exon 9. ACCESSION M11823 X02994 KEYWORDS adenosine deaminase; deaminase; hydrolase. SEGMENT 8 of 11 SOURCE Human placenta DNA, clones ADA cos[1.3,2.5,4.3] [3]; lymphoblastoid T-cell line MOLT-4, cDNA to mRNA, clones pADAc[2,30] and pLL [1]; cell line HPB-ALL, cDNA to mRNA, clone ADA 211 [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 21 to 85) AUTHORS Valerio,D., McIvor,R.S., Williams,S.R., Duyvesteyn,M.G.C., Van Ormondt,H., Van Der Eb,A.J. and Martin,D.W.Jr. TITLE Cloning of human deaminase cDNA and expression in mouse cells JOURNAL Gene 31, 147-153 (1984) STANDARD full staff_review REFERENCE 2 (bases 21 to 85) AUTHORS Adrian,G.S., Wiginton,D.A. and Hutton,J.J. TITLE Structure of adenosine deaminase mRNAs from normal and adenosine deaminase-deficient human cell lines JOURNAL Mol. Cell. Biol. 4, 1712-1717 (1984) STANDARD full staff_review REFERENCE 3 (bases 1 to 95) AUTHORS Valerio,D., Duyvesteyn,M.G.C., Dekker,B.M.M., Weeda,G., Berkvens,T.M., van der Voorn,L., Van Ormondt,H. and Van Der Eb,A.J. TITLE Adenosine deaminase: Characterization and expression of a gene with a remarkable promoter JOURNAL EMBO J. 4, 437-443 (1985) STANDARD full staff_review REFERENCE 4 (bases 1 to 95) AUTHORS Van Ormondt,H. JOURNAL Unpublished (1985) Submitted to the EMDL data library. STANDARD full staff_review FEATURES from to/span description pept + 21 + 85 adenosine deaminase, exon 9 /nomgen="ADA" /map="20q13.11 or 20q13.2-qter" /hgml_locus_uid="LK0163R" pre-msg < 1 > 95 ADA mRNA IVS < 1 20 ADA intron H IVS 86 > 95 ADA intron I BASE COUNT 17 A 34 C 24 G 20 T ORIGIN About 190 bp after segment 8. 1 CCACACACCT GCTCTTCCAG ATCTGCCCCT GGTCCAGCTA CCTCACTGGT GCCTGGAAGC 61 CGGACACGGA GCATGCAGTC ATTCGGTGAG CTCTG // LOCUS HUMADA09 144 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human adenosine deaminase gene, exon 10. ACCESSION M11824 X02994 KEYWORDS adenosine deaminase; deaminase; hydrolase. SEGMENT 9 of 11 SOURCE Human placenta DNA, clones ADA cos[1.3,2.5,4.3] [3]; lymphoblastoid T-cell line MOLT-4, cDNA to mRNA, clones pADAc[2,30] and pLL [1]; cell line HPB-ALL, cDNA to mRNA, clone ADA 211 [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 7 to 136) AUTHORS Valerio,D., McIvor,R.S., Williams,S.R., Duyvesteyn,M.G.C., Van Ormondt,H., Van Der Eb,A.J. and Martin,D.W.Jr. TITLE Cloning of human deaminase cDNA and expression in mouse cells JOURNAL Gene 31, 147-153 (1984) STANDARD full staff_review REFERENCE 2 (bases 7 to 136) AUTHORS Adrian,G.S., Wiginton,D.A. and Hutton,J.J. TITLE Structure of adenosine deaminase mRNAs from normal and adenosine deaminase-deficient human cell lines JOURNAL Mol. Cell. Biol. 4, 1712-1717 (1984) STANDARD full staff_review REFERENCE 3 (bases 1 to 144) AUTHORS Valerio,D., Duyvesteyn,M.G.C., Dekker,B.M.M., Weeda,G., Berkvens,T.M., van der Voorn,L., Van Ormondt,H. and Van Der Eb,A.J. TITLE Adenosine deaminase: Characterization and expression of a gene with a remarkable promoter JOURNAL EMBO J. 4, 437-443 (1985) STANDARD full staff_review REFERENCE 4 (bases 1 to 144) AUTHORS Van Ormondt,H. JOURNAL Unpublished (1985) Submitted to the EMBL data library. STANDARD full staff_review REFERENCE 5 (sites; site of mutation causing partial ADA deficiency) AUTHORS Hirschhorn,R., Tzall,S., Ellenbogen,A. and Orkin,S.H. TITLE Identification of a point mutation resulting in heat-labile adenosine deaminase (ADA) in two unrelated children with partial ADA deficiency JOURNAL J. Clin. Invest. 83, 497-501 (1989) STANDARD full staff_entryw COMMENT The substitution at position 51 of an "a" for a "c" results in replacement of a proline by a glutamine. The result is partial adenosine deaminase deficiency in persons who are phenotypically homozygous for this mutant ADA gene. FEATURES from to/span description pept + 7 + 136 adenosine deaminase, exon 10 /nomgen="ADA" /map="20q13.11 or 20q13.2-qter" /hgml_locus_uid="LK0163R" pre-msg < 1 > 144 ADA mRNA IVS < 1 6 ADA intron I IVS 137 > 144 ADA intron J mut 51 51 c in normal ADA; a in heat labile ADA BASE COUNT 43 A 37 C 35 G 29 T ORIGIN About 1.4 kb after segment 9. 1 CTGCAGGCTC AAAAATGACC AGGCTAACTA CTCGCTCAAC ACAGATGACC CGCTCATCTT 61 CAAGTCCACC CTGGACACTG ATTACCAGAT GACCAAACGG GACATGGGCT TTACTGAAGA 121 GGAGTTTAAA AGGCTGGTGA GTGG // LOCUS HUMADA10 155 bp ds-DNA PRI 22-DEC-1986 DEFINITION Human adenosine deaminase gene, exon 11. ACCESSION M11825 X02994 KEYWORDS adenosine deaminase; deaminase; hydrolase. SEGMENT 10 of 11 SOURCE Human placenta DNA, clones ADA cos[1.3,2.5,4.3] [3]; lymphoblastoid T-cell line MOLT-4, cDNA to mRNA, clones pADAc[2,30] and pLL [1]; cell line HPB-ALL, cDNA to mRNA, clone ADA 211 [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 20 to 122) AUTHORS Valerio,D., McIvor,R.S., Williams,S.R., Duyvesteyn,M.G.C., Van Ormondt,H., Van Der Eb,A.J. and Martin,D.W.Jr. TITLE Cloning of human deaminase cDNA and expression in mouse cells JOURNAL Gene 31, 147-153 (1984) STANDARD full staff_review REFERENCE 2 (bases 20 to 122) AUTHORS Adrian,G.S., Wiginton,D.A. and Hutton,J.J. TITLE Structure of adenosine deaminase mRNAs from normal and adenosine deaminase-deficient human cell lines JOURNAL Mol. Cell. Biol. 4, 1712-1717 (1984) STANDARD full staff_review REFERENCE 3 (bases 1 to 155) AUTHORS Valerio,D., Duyvesteyn,M.G.C., Dekker,B.M.M., Weeda,G., Berkvens,T.M., van der Voorn,L., Van Ormondt,H. and Van Der Eb,A.J. TITLE Adenosine deaminase: Characterization and expression of a gene with a remarkable promoter JOURNAL EMBO J. 4, 437-443 (1985) STANDARD full staff_review REFERENCE 4 (bases 1 to 155) AUTHORS Van Ormondt,H. JOURNAL Unpublished (1985) Submitted to the EMBL data library. STANDARD full staff_review FEATURES from to/span description pept + 20 + 122 adenosine deaminase, exon 11 /nomgen="ADA" /map="20q13.11 or 20q13.2-qter" /hgml_locus_uid="LK0163R" pre-msg < 1 > 155 ADA mRNA IVS < 1 19 ADA intron J IVS 123 > 155 ADA intron K BASE COUNT 33 A 44 C 38 G 40 T ORIGIN About 600 bp after segment 10. 1 GCCATTCTGG CCTTTCCAGA ACATCAATGC GGCCAAATCT AGTTTCCTCC CAGAAGATGA 61 AAAGAGGGAG CTTCTCGACC TGCTCTATAA AGCCTATGGG ATGCCACCTT CAGCCTCTGC 121 AGGTAGGTTC CTGTCTGGGC TTCTGGGCAG TTGCC // LOCUS HUMADAG 36741 bp ds-DNA PRI 25-SEP-1987 DEFINITION Human adenosine deaminase gene, complete cds. ACCESSION M13792 KEYWORDS Alu repetitive sequence; adenosine deaminase; long terminal repeat; repetitive sequence. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 36741) AUTHORS Wiginton,D.A., Kaplan,D., States,J.C., Akeson,A.L., Perme,C.M., Bilyk,I.J., Vaughn,A.J., Lattier,D.C. and Hutton,J.J. TITLE Complete sequence and structure of the gene for human adenosine deaminase JOURNAL Biochemistry 25, 8234-8244 (1986) STANDARD full automatic FEATURES from to/span description pept 4031 4063 adenosine deaminase, exon 1 /nomgen="ADA" /map="20q13.11 or 20q13.2-qter" /hgml_locus_uid="LK0163R" 19230 19291 adenosine deaminase, exon 2 26344 26466 adenosine deaminase, exon 3 28908 29051 adenosine deaminase, exon 4 29823 29938 adenosine deaminase, exon 5 31176 31303 adenosine deaminase, exon 6 32425 32496 adenosine deaminase, exon 7 32573 32674 adenosine deaminase, exon 8 32851 32915 adenosine deaminase, exon 9 34354 34483 adenosine deaminase, exon 10 35100 35202 adenosine deaminase, exon 11 35651 35664 adenosine deaminase, exon 12 pre-msg 3936 35975 ADA mRNA IVS 4064 19229 ADA cds intron 1 IVS 19292 26343 ADA cds intron 2 IVS 26467 28907 ADA cds intron 3 IVS 29052 29822 ADA cds intron 4 IVS 29939 31175 ADA cds intron 5 IVS 31304 32424 ADA cds intron 6 IVS 32497 32572 ADA cds intron 7 IVS 32675 32850 ADA cds intron 8 IVS 32916 34353 ADA cds intron 9 IVS 34484 35099 ADA cds intron 10 IVS 35203 35650 ADA cds intron 11 LTR 1025 1357 THE O family LTR LTR 1680 1717 THE O family LTR rpt 1362 1672 Alu repeat rpt 2357 2903 Alu repeat rpt 4907 5227 Alu repeat rpt 5606 5908 Alu repeat rpt 7582 8001 Alu repeat rpt 8179 8484 Alu repeat rpt 10005 10204 Alu repeat rpt 10257 10534 Alu repeat rpt 13452 13777 Alu repeat rpt 14837 15386 Alu repeat rpt 15806 16106 Alu repeat rpt 16913 17224 Alu repeat rpt 18414 18717 Alu repeat rpt 19605 19902 Alu repeat rpt 22523 22829 Alu repeat rpt 24481 24773 Alu repeat rpt 25143 25453 Alu repeat rpt 26949 27269 Alu repeat rpt 28032 28333 Alu repeat rpt 31460 31867 Alu repeat BASE COUNT 8165 A 9716 C 9721 G 9139 T ORIGIN 475 bp upstream of HindIII site; chromosome 20q13.2-qter. 1 GATCTGGGTA AAGGGTTTTC CAGGTGTCAG GATGGAAGTG ACTAAGGTGC AGAGGCTGGA 61 GGGCTGGGGC AGGTAGAAGC AAGCATTCCT GTTACCTACT GCTGTGTGAC AATCTCCCCC 121 TAAAACACAA TGGCTTAAAA TAACATCCAT TTCATTACAT ATCTCAATAC TATAGGTCAG 181 GAATTTGGGC TGGGCTTACT TGGGTAATTC TTCTGTCCCA CATGGCATTG ACCAAAGCCT 241 GGTTTTCAGT GGGCAGCTGG GCTGGATGGC CCAACACAGC TTCGCTAACA TGATTGCTGT 301 CTTCGTAGGG ATGGTGGAAG CCTGGGCTCA GTGGGACTGT CAACTGGAAT GGCCATATGT 361 GGACTCTCTT AGCATGATGG TCTCTTCTAG AAGCTTGGGT TCCCAGAGAG AATGTTCAAG 421 AGGCCCCAAA GGACACCACA AAGCTTCTTT ATGACCAAGG CTCGGAAATC CAGGAAGCTT 481 GCTCCCATCA CGCTCTATTA CTCCAACAAG TCACTCAGGC CAGCCCAGGT CCAAGAGGAG 541 GAAACCTAGA CTCCATCTTG CAATGTGAAG AATTGCAAAT AATTTGTGTC ACCCTTAAGC 601 AACCAGCAAC TCATCTAGGT TGATTGGCAT TTCAGCAATG TGGTGGGAAG TGGTGGGACT 661 GATGTTGAAG AGGGACTTGA ATGTCATGAG AGGCTGGGGA GGCAATAAGG TGGGGAGTGA 721 AGTTTCTCGA GTCAGATTCA AATTTAAACC CCAGTTTTGC CACTTACAAC CCATGAGCCA 781 AGCAGGCTGT CTCTCTATCT GAACCTCAGT GTCCTCATCT GTAAAATGAG GAGAACACCT 841 CCTACATCTG AGGATGACTG TAAAGATGAA ATGGGATGGG TGCTTATAAA GTGCTTCCCA 901 GTGTACCTGG CTCCAAACCT GTCTCAGTAA ATGGCAGCCC CTATTATTGA ACCCGAGTAA 961 CACAGAGAGC CAAGAAAGGA TCTTACAAAA AACTCCCCTG GCTTTGACAA TGTATGAGAC 1021 CCACTGATAG GGTTTGGCTT TGTGTCCTCA CCCAAATCTC ATCTAGTAGC TCCCATAATT 1081 CCTACATGTT GTGGGAGAGA CTCGGCGGGA GATAATTGAA TCATGGGGGA TGGTCTTTCC 1141 CATGCTGTTC TTGTGATAGT AAATAAGTCT CACAAGATCT GATGGTTTTA AAAATGGGAG 1201 TTTCCCTGCA GGCGCTCTCT CTTTGTCTAC TGCCATCCAT GTAAGACGTG ACTTGCTCCT 1261 CCTTTGCCTT CTGCCATGAT TGCAAGGCCT CCCCACCATT GTGGAACTGT AAGTCTATTA 1321 AAGCCTCTTT CTTTTGTAAA TTACCCAGTC TCAGGTATGT CTTTTTTTTT TTTTTCATGA 1381 GATGGAGTTT CGCTCTTGTT GCCCAGGCTG GAATGCAATG GTGTAATCTT GGCTCACCAC 1441 AACCTCCACC TCCCAGGTTC AAGCGATTCT CCTGCCTCAG CCTCCCGAGT AGCTGGGATT 1501 ACAGTCATAC ACCACCACGC CTGGCTAATT TTGTATTTTT TTTTTTTTTT TTAGTAGAGA 1561 CGGGGTTTCA CCATGTTGGT CAGGCTGGTC TCAAACTCCC GACCTCAGGT GATCCTCCTG 1621 CCTTGGCCTC CCAAAGTCCT GGGATTACAG GCATGAACCA CTGCGCCCAG GCTCGGGTAT 1681 GTCTTCATCA GTAGCATGAA AATAATGGAC TAATACAGCC ACCCTCTCCC TCACTCCCAC 1741 ATACAACCAA ACCCCAAATC CAGCTGATTT TACACCCTAA ATGCAGCTTG AATATGAGTT 1801 TCTCCACTTC CCCCACTGAC ATCACTATGC CCTACCCAGA CCATGGCAGT TGCCTCCTTC 1861 CTGGTATCCT GTCCTCCCTC ACCCCCGCTG GCCCCCTGTA ATGCCCTCCC CTCACAGCAG 1921 GGAGCCCAGG CTTCTCAAAG TGCCCTGTGG GTGCGAACCA CCTGGGGGTC CTGTTTGTAT 1981 AAAATACAGA TTCTACTTCA GTAGGTCTGG GATGGGGTCT GAAAGTCTGC ATTTGTAGTC 2041 AGCTCCCAGG TGATGTGGGT GCTGATGATC CCTGGATCAC ACTTTCAGTA GCTGGAGAAT 2101 ATTTTTTCCA AATAAAAGGG TGATTTTGTC TCGCCTCCAC TTAAAACACT CCACTGACTT 2161 CCTAGGAATC CCACACCATC GCTGGGTCCC ACATCCCTGG CAGGATTCAG CTCCCATCAG 2221 ACCTTCTAGC CCCTTGCTCT CCACTCTCCC ACTCTCTCTT TCCCCCTTGT TTATGGGTTT 2281 GTTAATTTAT TTATGATGAA ATGAAATGAA GCTACCATCC ACCCCAGTAC TGGAACATTA 2341 TCAATAACCT GTGTGTGGCC AGGCGTGGTG GCTCATGCCT GTAATCACGC CTTGGGAAGC 2401 CGAGGTGGGT GGATCATGTG AGGTCAGGTG TTCGAGACCA GCCTGGCCAA CATGGTGAAA 2461 CCCCGTCTCT ACTACAAATC CAAAACTTAG CAGGGCACGG TGCCACGCGC CTGTAATCCC 2521 AGCTACTCGG GACGCTGAGG CCGAGAACTG CTTAAAATCC AGGAGGTGGA GGTTGCAGTG 2581 AGCCGAGATT TCGCCACTGC ACTCCAGCCT GGGCGACAGA GCAAGAGTCC ATCTCAAAAA 2641 AACAAAAACA AAAACAAAAA AACAAAAAAC AAAAATTAGC CAGGCGTGGT TGTGGGCGCC 2701 TATAATCCCA GCTACTCGGG AGGCTGAGAC AGGAAAATCG CTTGAAACGC TGGGGGTGCG 2761 GGGGGGCGGT GGGGAGGAGG CGGGCCAGAG GGGCAGAGGT TGCAGTGAGC CCAGATCGCG 2821 CCACTTCACT GCAGCCTCCG CGAAAGAGCG AAACTCCGTC TCAGTAAATA AATAAATAAA 2881 TAAATAAATA AATAAATAAA TAACCTGTAC CCGCGTGTTA TTTCCCTCCG TCCTTACCTC 2941 CTCCCGGCTC CTTCCCTTTC ACCTGAGATA ACCACTCTTC TCGTATCTAT GCTCATCTTT 3001 CCCTTGCTTT ACATTTTTTC CACCGATGCA TGTGTCTAAA CATACATACT TTTGGTTTTG 3061 CTTTTACACA TTCTAAAAGT TGCACCATTG TATGCAGTTT TCCGCAACTT AGTTTTTTTC 3121 ACTCAACATT GTTTCTGAGA CATTGTTTCT GTTGTTGTCT GGCTGAAGTT CATTCCGTTT 3181 CACTGCTGTC TAACGTTTCA TGGTGTGAAT ATTCCGGTTT ATTTGCCCAC TCGCCCGTGG 3241 AGGGGCATTT GAGGGTGTTT CCAATGTTCC TGTTATTCGG AATAGCGCTG GTGTGAACAT 3301 TCTGCACAGG TCTCTGGCTG CGCCTGGGCG GGTTTCTTAA AGGTGAATGC CCAGGAGGGG 3361 ACTGTCTGTG TTCTCCCTCC CTCCGAGCTC CAGCCTTCCT CGCCTCCTTT CACTCCCAGC 3421 TCCCTGGAGT CTCTCACGTA GAATGTCCTC TCCACCCCCA CCCACCCCTG ATGAACTCCT 3481 GCAGGTTCTG CAGGCCACGG CTGGCCCCCC TCGAAAGTTC CTTAACTATA CAATTATGGT 3541 GTGTGTTTCT GCGACGAGCG TCCGTCTATC CGGTGGAAGG CACGCCGCTC GAGGCTTGCG 3601 ATGCTCCCGG GGTCCCCGCT TCTAGCTTGG GCCTGGCGCA CAGCAGCGCC CAGACTGCAG 3661 GGGGACGCTT GAAAGTTGCT GGAGGAGCCG GGGGGAAGGC AGCGCCCAGC GAGGCGGCTG 3721 GAGCGCGCGC CCACAGGTGG GTCCGGTCGG GCGCCGCGGG GCCGTAGTTT TCGGGTCGGC 3781 GGGCGAGGAC GCCGGGTCCA GAATTCCAGG AAATGCGCGA TCCAGGCCGG CGGGCGGGGC 3841 GGGGGCTCCG GCGAGAGGGC GGGCCCCGGG AACGGCGGCG GGCGGGGCGG GAGGCGGGGC 3901 CCGGCCCGTT AAGAAGAGCG TGGCCGGCCG CGGCCACCGC TGGCCCCAGG GAAAGCCGAG 3961 CGGCCACCGA GCCGGCAGAG ACCCACCGAG CGGCGGCGGA GGGAGCAGCG CCGGGGCGCA 4021 CGAGGGCACC ATGGCCCAGA CGCCCGCCTT CGACAAGCCC AAAGTGAGCG CGCGCGGGGG 4081 CTCCGGGGAC GGGGGTCCGG CGCCTGGGCG GCCCGAGGGG CTTAGCGGGG CCCAGCCCGG 4141 GGCGTCCAAA CCCTGGGAAC GAACGGGGGC TCCTGCAGGC GAGTTCTTCC TTCGGCTTAG 4201 GCCGTGGCTT GCTTGCGGGC TAATCAGGGA CAATGGGGCA GAGAAGGTCC AGAACCCGGA 4261 GGCCTCCAGA GTCTGCTTCT GCCCCTGACT TGACCCCTCT GGGTCTCAGT TTCGCTGTCT 4321 GTCAAGTGGG CATCCTAGCA CCGCTGAGCG CTGTGTGGGC CTGGGCAGGG ACTTGAGGTC 4381 TCTGAAGCTC AGCTGTATGA TCAGGCCCGA TGTCTACGCC GGATAGCGAC CTAGTGCTGT 4441 GCCCCGCGCC TACTGAGTGC TCAGTGAATG GAAGCAGCTT TGTACGCCAG CGTTATGGTG 4501 GTGAGCGCCA AGGAGCTCAG GTTTGTGGAT GCGCCCCGGG GAAGAACCGT GAGCCCTGCC 4561 AGAAAGGGGA GGGAGGGGAG CAGAGCACCC CCCTTCCCCC GCGCGGGAAG AACAGGAGCT 4621 AGGTAGGCCC TGGGTTTGGG GCCCTAGCAG GGTTCACTCG AGGCCAAGCC ATGGCCCACT 4681 GGCCCCAGGG GAGAATCCCC TTGTTTCTCC GCCCACCAGC TGTGGCGTCT TGGGACTGTT 4741 GGGGTCAGGG AGGGTCTGGA CCCCCTTGGC CTGTCTCAGA GTCCGAGAGG AGGGGCCCAG 4801 GAGTCTGCCA AGCAGGGTGA GTCAGCCAGT AGGGTGTGAG AGTGGTTGGG GAAGGAGTCA 4861 GCTGCAGTCA GCCTCAACTT ACCCTTCTAA GAAATAGGTG TGAGTGGCCC AGGAGGTTGG 4921 CTCACGCCTG TAATCCCAGC ACTTTGTGAG GCTGAGGCGG GAGGATCATT TGAGTCCAGG 4981 AGTTTGAGAC TAGCCTGGAC AACAAAACTA GACCCCGTCT CTCCAAAAAA TAAAAAAAGT 5041 TAGGGGAAGT GTGTGTGGTG GTGCACTCCC GTAGTCCCAG CTACTCAGGA GGCTGAGGCG 5101 GGAGGATCGC TTGAGCCCAG GAGGTTGAGG CTGCAGTGAG GTGTGATGGT GCCACTGACC 5161 TTCAGCCTGG GAGACAGAGC GAGACCCTGT CTCAAAAAAA AAGAGAAGAA AAAGAAAAGA 5221 AAAGAAATAG GTGTGAATGA TGATGACAGC TATCACAAAA GTGCCGGTGA GAATCCAGTG 5281 AGTGTGCATG TGTCAGTGAG GGAGACAGGC TGTGGAGAGC CCACCTACCT TCTGAGGAGG 5341 GTGAGGCCTG GCCCCCACTA CTGATGCCCC CAGCCCAGGG AAAATGCTCA GCTACTCCCC 5401 GTCAGAAGCT GGAACGACTG AGGTGCTGTA CAAGCCCTCC TACCCCCACC CCTGCCTCCT 5461 TCACGTCTTA CTGGAGCTGG GGCCCATGAT TGGCGCCTCC CCTTTGCAGT CTTTTTATTA 5521 AATGCTCTGG GCTCCCTCTG CCCTTGGGCT GGGGACCCAC TGTACCCTGA TGTGAATCCT 5581 ATGGCAGTAG CAAAGCTCTT TGATTGGCGG GGTGCAGTGG CTCACGCCTG TAATCCCAGC 5641 ACTTTGGGAG GCAAAGGTGG GTGGATCATG AGGCCAGGAG TTCGAGACCA GCCTGGCCAA 5701 CATGGCAAAA CCCCATTTCT ACTAAAAATA CAAAAAATTA GCTGGGCATG GTGCGGGCGC 5761 CTGTAGTCCC ACGTACGCAG AAGGCTGAGG CAGGAGAATG GCATAAACCC GGGAGGTGGA 5821 GCTTGCAGTG AGCCGAGATC TCGCCATTGC ACTCCAGCCT GGGTGACAGA GTGAGACTCT 5881 GTCTCAAAAA AAAAAAAAAA AAAAAAAAGG CTCCTTGATT GCGAACATGT TGGGAGTTAT 5941 GGAGAGAACA GCAGGGCCCA CTTCTAGAGC ACTTGTTGCA GACACCCATT GGATCCTTGC 6001 AGTTCTTCTG TAACAGCCCA TCAAGGGAGG GGCTCATATT ATTATCCCCA TTTTTTGGCC 6061 TTGCTCAGTC CTCCCATCTG ATTCAAGCTG GCAGATCATT TTCCCTATTG GGACCTCAGT 6121 GTCCACACCT GGAGGATGGA ACATCAGCTG CTTATGTGGG TGTCCCGTGT CCTGAGTCCC 6181 AAGGCCACAA GGTGATGCTT GAGAGTGAAG GTAGAATGTT ACCTGCCATG TGTTTGAGGC 6241 GTGACAAATC TTGTATGATT GTGAGGAGGA ACTTGTGTGA GCTGGCAGGA GAAGTGGGAA 6301 GGAGTGTGAA TCTCAGAGCC ACTGTGACCA GAGCCAGCTC CCTGCCCTCT TGTGGGAGGG 6361 ACAGATGACA GTTATAATTA TTAGCATTAC TAGCTGCAGC TAATGGAGTG TTGATGTTTC 6421 TGCCAGGCAC CGTTCTAAAC ACATTATCTG CATTTTTTAT TTAATCCAGG CACAGAGAGG 6481 TTAACTAGGC CCAAGATCAC ACAGCTAGGA AATGTCCAAC TCTGGGGTTT GAGTCCAAGG 6541 GAGGCTGGCT TCGAAATCCC ATGCCTCTAA CCATCTTTCC TAAACTACCT CTGCAGAAGC 6601 CTTTGGGGAT AGAGGTGCCA GTGCCCCAGG TGCAAACCTC CTGAGACAGG AGCCTTTGCT 6661 GTGTCCTTCA GCTTCTCATA CCTGCCACCA GCTGAGGCCT GGGACCTGGT CAGCTAGAAG 6721 AAAGCAGAGC AGGGCAGCGC TTTTCAAACT GCACTCAAGT GGCCTGACTT TTAATGTTCA 6781 CACTGTGATT CTGTGTGGGT CGGGTTGGGG CCTGCGATGC TGCACTGCTG ACCAGCTCCC 6841 AGGAAATGCT AATGTCAACG ATCCAGGAAC ACACTTTGCT TAGCAAGGCC CTAGGCAGCT 6901 GCCTTCTGTT GTGCGGGACC CCTATTGACT CCAATGGATA TAGCACCAGG TTCAAGAGGC 6961 TACCTTCTTT GGAAGAGGTA GCAAACAAGA TACGGGGTTT TACTGGGGGC TTAGACACAG 7021 GGAAGAGAGT CCAGTGGCGG CAGACTGAGC AGAAGAACCG CAACCACTTG CAAATCATGC 7081 AGTTTATGTA GCATTTTCAT TTAACACCTT CTCCCAACCA TCTCCACCTA GTAACCTTCA 7141 TTTAACCCAA AACAAAGGGC CTCGGTCCCT ATACCCCTGT ATGGTCAGTG TCCCGTGGGA 7201 ATGGGGTGGG GCTCAGATGT TCCTCATAGA TAACGACTGG ATCTCCAGGT TGGCCACTCT 7261 TGGATTCCTT CGCTCAGAAC TCTGAACACC CATTCAAGTG TGCCTGCCAT GCAGGGTCAT 7321 CGTCAGGGGA TGCCCAAGTC AAGTTTGCCT GTCGGGTGTG CCTCCCATAC CCCCACCTGG 7381 TTTGACTTAG CACCTGCTGG GCACTGGAAG AAGTGCAAAG GGGGGTTGCA GGGGTGGCCC 7441 TTATCAGCCT ATGTTCACAG GTGGCACCAG GCACTCAGGC ATTCTGCATC CTGGAGGCCA 7501 GTGCTGATCA CATGCCTGTT ACAATAATCA TAACAATAGC TGTCCTTGAA GTAGTCCTGG 7561 GTACCAGGTG CCTTCAGTGA CTTTTTCTTC TTTGCCAGAA TCTCACTCTG TCGCCCAAGC 7621 TGGAGTGCAG TGGCAAGATT TTGGGTCCCT GCAACCTCTG CCTCCTGGGT TCATGCGATC 7681 CTCCTGCCTC AGCCTCCCAA GTAGCTGGGA CTACAGGCGT GTGCCGCAGT CTCACTCTGT 7741 TGCCCAGGCT GGAGTGCAGT GGTGTGATCC TGGCTCACTA CAACCTCCAC CTCCCGAGTT 7801 CAAGCCATTC TTCTGCCTCA GCCTCCGGAG TAGCTGGGAT TACAGGCGTC CACCACCACG 7861 CCCGGCTAAT TTTTGTATTT TTAGTAGAGA CAGGGTTTCA CCACGTTAGC CAGCTGGTCT 7921 CGAACTCCTG ATCTCAGGTG ATCCTCCCAC CTTGGCTTCC CAAAGCGCTG GGATTACAGG 7981 TGTGAGCCAC TGTGCCCGGC TAGTAACTTT TATCTCACGG AATCCTCTGG ACGACTTGAC 8041 AAGGCATGGG TCTTCATCCC CATTTACAGA TGAAGAAACT GAAGCTTAGG GAGTGGAGGG 8101 ACTTGCCAGG GCTACACAAA ATCTGAGAGC CTTGAAGCTG TAGACTGGCA AGTGAACAGG 8161 TACAGGCTGG GACAGCAGTT TCTTTCTTTT TTTCTTTTTT TAGACAGAGT TTCGCTCTTG 8221 TTGCCCAGGC TGGAGTGCAA TGGCACGACC TCGGCTCACT GCAACCTTCG CCTCCCAGGT 8281 TCAAGTGATT CTTCTGCCTC AGCCTCCCAA GTAGCTGGAA TTACAGGCAT GCACCACCAT 8341 GCCCGGCTAA TTTTTTGTAT TTTTAGTAGA GACGGGGTTT CTCCTTGTTG GCCAGGCTGG 8401 TCTCGAACTC CCGACTTCAG GTGATCCGCC CACCTCAGCC TCCCAAAGTG CCGGGATTAC 8461 AGGCATGAGC CACCGCACCC GGCCAAGGGA CAGCAGTTTC TAAACTGTCC CTCTCTGATG 8521 CAGAGGGGAA TTGGGGCTAA ATCAGCAATG TGCCTTTTCT GTCTCATATT TGAATGTCTA 8581 CTCTGCACGA GGCGCTGTCC TGCTTTGCAT ACAGTGACTC ATTTAATGTT TATGTCAGCC 8641 CTCTGAGGAA GGTCCTGTCC TATTATTAAC TTCACTTATT ATGAGGAAAC TGAGACTCAG 8701 AGAGGGGAGG GAACTTGCCA AAGTCACACA GCTGGCAAGC AGCAGAGCTA GACTTGAACC 8761 CAGATCTGCC TGCACTCAAG TAGAAGCTGT TCATTGCTTT GCTCATTTGC CAATTCCACT 8821 TTATGCAAAA AAGAGGGGGC AGTGTGGGGG GAAGAGTTAG AATCAGGGTG GCAGGGTGGG 8881 CCAGTGCATT AGCCCTGGGC TTCAGATGTA CTGGGGTTGA ATTCCTGCCT GCCGCTTAGC 8941 AGCTAGGGTA CCTCAGGTAG ACAACTCCTG AAACTCAGCT TCCCCCTCTG TAAAATGGGG 9001 TGACAAAACC AAGATCTTGG GGTTCTTGGG GAAACTGACA TGCTGATTGG TTTTTGTACA 9061 GTGCCTGGCT GGTAACAGCA GGCCCTCAGG GGTGCGTTTC CTTCCTGGGG ACTGGAGTGG 9121 GGGTTGCAGT AGACTCTGGG AGGCCTCTCC AGCTGCAGAA TCTCCCTCCT CCCTCCTCCT 9181 TTTTGTCTTC CTGACACAAA ACCCACCAGC TGCACTTCTT TGGGCTTGCA GTGGCTTTCA 9241 GTTACCAGAG CCACCTGTTA AAACAAAAAT GTGCCTAGGA AGAGCCTGCC TTACCCATTT 9301 TGACTCACAT GGCAGTTGGT GGTGGAGGGG AACAAAGGAG ACTGAGTTTC ATCGAAGCCT 9361 TTTGCTTCGG AGGAGGAAGG GAGGATCAGA GAGAGGAAGT GGTCTGTGTT CACACAGGGA 9421 GGCAGGGGAG GCCAGGCAGC TTCCCAATCC TGCATTCAAC CTCAGGGTGG GCTTGACCTG 9481 GGTGGCTGGG GGCCCTGTGA TCCAGGAGAG ACTTGTCCAC CTGCTCAGGT GTCTTGAAGG 9541 GGTCCCTGTG GTACCCCCTG GGCGGGGCAA GGTAGTAGGA CCATGGTCTG GCTGGGGAGG 9601 TGGAGAGGAG CAGGCTGTGG GCGCAGAGTG AGGTTGGAAT CTGTATTTAC CCAAGGTGTT 9661 GGGGGTAGGC TTGCCCTCAG CCCTTAATGT TCTCAGGCCC CTGAGCAGTT GTGGGGGATA 9721 ACCTCTGCAC TCCTAGTGAC CAGGGAGCTA GAACAGCAAG GAATTTGAAC TTGGACACCA 9781 GCTGGGGTCA GGCTCTCTGG GTCTGAGTCC TGATTTCCCA CTTTCCAGCT AGAGGAGCTT 9841 GAATGAGTCA TTTAACTTCA CGGTGCCTCA GTTTCCCCTC TCTAAAATGA GAATTATACC 9901 CATACCCACC TCTCAAACAC CAAGTGCAGG CCTGGCTCAG AGCAGGTGCT GCAGCAATAG 9961 CTGCCATTGG TCAGCATCAT CATCATGGTT GGTAATGGTC CTACTTTGAC TTTTGAGACA 10021 GAGTCTCACT CTGTCGCCCA GGCTGGAGTG CAGTGGTGCA ATCTCGGCTC ACTACAACCT 10081 CTGCTCCCGG GTTCAAGTGA TTCTTCTGCC TCAGTCTCCC AAGTAGTTGG GATTACAGGT 10141 GTGCGCCACC ATGCCTGGCT AATTTTTGTG TTTTTAGTAG AGACAGGGTT TCACCATGTT 10201 GGCCATAACA ATGGCTGTCC TTGAAGTAGT CCTGGGTACC AGGTGCCTTC AGTGACTTTT 10261 TTTTTTTTTT TTTTTTTGAG CTGGAGTCTT CCTCTGTCAC CCAAGCTGGA GTGCAGTGGC 10321 ACGATTTTGG CTCACTGCAA CCTCTGCCTC CTGGGTTCAT GCGATCCTCC TGCCTCAGCC 10381 TCCCAAGTAG CTGGGACTTG GGATACACTT GCCCCCGCTG GTCCTCCCTT CCACCTCTGT 10441 GAAGAGGAGG TCTCAAACTC CTGGCCTCAA GTGATCCACC CACCTCAGCC TCCCAAAGTG 10501 CTGGGATTTC AAGAGTGAGC CACCGCACCT GGCCCCTGTT TAGATGTTAG CATCAGTGAC 10561 CCAGCACCTT GCTATGTGGC ATGCAGGGAG CGTGCTGCTA GACCTCCGGG TTTAGAGTCA 10621 AATAGCTTCC TGGCTGTGGT GTGCATTAGA CTTTCTAACT CAAGGTCCTC CCACTCTCTG 10681 AGCCTCAGTC TTGTTGCCTT TAAAACGAGT TTAAGTGTGC TGAGTCCCTA TGCTGTGGCT 10741 CCACAGGAAT TTCCCCAGGT GGAAGACACA TCTTGCCTTC TGTGAAACCT CTCAGCAGCA 10801 GAGCTGTCAG GCCCCGTCAG CAGGAGACAC TGTGGGGACT GCTCAGTCCC TTCCACTGTG 10861 TACCTCGGAG CTGGCGGAGC CTAGATGAGG CTGAGCATAG AGGGCTTCCT GGAGGAAGTG 10921 GAGCTGAAAC AGTTTCTCAG CCCAGGGCTG CTCTGTCTCC TGGCCTCACA CTAAAAGTCA 10981 GTTGAGAGGC CATAGTGGCA TAAGTCACTG ACCCTGGCAC TGCCCAGCTC ATCACCAAAA 11041 GCAGGGCTAG GGAGGGAGGG GACATTCGAT TGGCAGTGGG CACCTGTGGC TCATCTGGGT 11101 TCTGGCCACG GTGCTCAGGT TCTGTGAGCT GACCAGGCAG CCCTGGCTCC TCTGCCCCCG 11161 TGTGGGTTCT GCCAGGTCCC ATGGGGCAGG TCAGCCCCTT CCTTGTTGCA GGGAGAGCAC 11221 CCAGCATTGC TGACATGGGA CAGGGAAACG AGGAAATAAC GGTGTGGTCA TTGAACACAG 11281 AGAGCACTAG GTGCTGTGCG AGGTGCTGAG GACACGACAT GATGACACAG ACAAGGTCCC 11341 CCCTCTCAGC AAACGGCTCA TGAGGGAGAC AGACATGTTA CATACATGAA CCCAAAAAGT 11401 CAGACGAAAA CAAAACAGAG CGATGTGTTT GGGAGGCAAA CCCAACTGCC GGAGGGCGAG 11461 CAGTTGGGAA CGTGGAAACA TGAGTCAGAT CTGGGAGTAT CTGTCCCAGG AGTCCAAGAC 11521 CTGGGTCCTC ATGGTAGCTC TGCCACCGAC ACACTGAGTG ACCTTGGGTA AGTGAACCCA 11581 CCGCCCTGGA CCTCTCTGGC ACGCATCTCT TGAGAGCAGG GACTTAGTGC ATTTCCCGAG 11641 GGCCTCCACG GTGCCTGGCA CATAGTGGGG CTTAGTAAAT ATTTGTTGGT AACTGAGGAT 11701 GCTTCCTGTT CACATCAGCG CTGGGAGGAT TTCCTGCTGT TCAGACAAAT GCTGGGCTGG 11761 CTGTGAGTCA GCCTTGCAGA GAGCAAAGGC AGTGGGAAGG GGCGTGAGAT TCCCCTCTGG 11821 AGAGGTCAGG AGGCCAGGCA CTGTCTCGAC ATGAGTGCCA GGGAGGGGGT GTGGCCTGTG 11881 GGCAGGGCTT GGGCTGAGGC AGAGGGACTT GAGTTCCACC CTAGCTCTAC CACCATCAAT 11941 TTTGTGTAAC TCTGGACAGG CCACTGAACT TCTCCGGGCT TAGCCTGGCA AGTCCATTTC 12001 CCCATCTGTA ACATGGGCCG ATATGTACAT TGCCTAGGGA TTAAATGAGA TAAAGGGTCT 12061 GAAAACAGTA GGTAGCTGCT TTATCATTAT TATTATTTCT GTATTATTGA TGTCTGAGGC 12121 TAGGCCCACA GAGGCAGTAC AGTAGAGTGG TTAGGAGCTC AAGAATCAGA CTAGGGTTCA 12181 AATTCTGACT CCATCACTGA CTGTTTTGGG GTACTTCTTT GAACCTCAGT TTCTTCATCA 12241 GTAAAATGGG AGTGAAGTCT CTACCTTGCT GGTTGTAAGG ATGAAATAAG ATAATGCATA 12301 TAGATGGTCT AGCACATAGT AGATACTCAA AAGTTTGAGG CCACTGCTGA CCCTTTTCCC 12361 TGAAAGGAGA CAGGAGAGCG GGGTCGCCAC CCCATTGTCA TTGTCATCTG GAATAGGCTG 12421 ACAGACTTCC CATGGTGTGT TGCAGTTTTC TAGAAAATTC AGTAGGAGGC CTGCCTGAGC 12481 TTGAGCCACC TGTGGAGGTG CTTCCTGCCT CTGCTCCACA CCTGAAACGC GTCTGGGCCT 12541 CTTCTCAGGC AGCCGTGAGA AGGGATGAGT GCTACTGGTC ATGGTGGGCA GCTGGCTCTG 12601 CTTTCCCCCT TCCCAGAGGC GCTCCTGCCT CCTGCCCAGC TCCCTGAACC CCTAGCTTCT 12661 GCACCCCGGC ACTGTCTGGC TTCTGCCCCG CTGAGCACCC ACTGTCTCTG ACGCTGCCTT 12721 GAGTACTTCC CGCATGTTAT TCAAATCCCA ATCAGATCTT CCCTCCCCCA GTAGCTGGTC 12781 TTCTGTTCTG GCTTCCTGCC ATCCTGTCCT CCACACAGCA GCCGGGAAAG GTTTTTTTAA 12841 AGGGGACTCT CCGATTTAAC ACACTTGGGT GGAAAACCCT TTGCTTCGGC CTCTGCAATC 12901 TCCCTGCCCC CTCTCCACTT TGCCCTGGCC TCATTTCTCA CCACTAACCT CACTCTGCAC 12961 TCTGGCCAAC TCCCCGCCTG CTTCCTGATT CAGACACTAA GCACACGCAG CTCCCCTGCC 13021 TGGAGCCATT CTCCCTCTCC TTCTTTCTTC TCCCTGGAGA ACTCCCCCTT TAAGTGATCT 13081 TTTCCCAACA CACTTTCTAA ATTGCCCCCA CCCCAGTGTG ATTTTTCTTT ATCTCATAGC 13141 ACTTGGTCTG CTTCTTATCA CAGTTTGCAA GGCTGAGTTC AGAAAGGTGT GTTTGCTCAT 13201 TCTGAGGCAG GAGAGGCTAC CTTGTGCTGC TGTGGTAACA AACAGCCCCC AGGTCTGAGG 13261 GGTCTGCAGA GACCCAGGTT GACCTCATAC TGCTTGTCCC TCCAGGGCCT CCAGTGAGGT 13321 TTCGGCTCCT TGGATCACTC AGGGCCCCAG GCAGATGGGA AGATTCCACT CTGAACATTG 13381 CCAATTGTTG TGCCAGAGTA AAGCAGAGCT GGGAGGTGGG CTCTTGAATT GGCATTTAAA 13441 TACTTTTGCC AGGCAGGGTA AGGCAGCTCA CGCCTGTAAT CATAACACTT TGGGAGGCCT 13501 AGGTGGGTGG ATCACCTGAG GTCAGGAGTT CAAAACCAGC CTGGCCAACA TGGTGAAACC 13561 CTGTCTCTAC TAAAAGTACA AAAATTAGCC GGGCATGGTG GTGGGCGCCT GTAATCCCAG 13621 CTACTTGGGA GGCTGAGGCA CGAGAATCCC TTGAACCTGG GAGGCAGAGG CTGCAATGAG 13681 CTGAGATCTT GCCACTGCAC TCCAGCCTGG GCAACAGAGC CAGACTCCAT CTCAAAAAAA 13741 AAAAAACAAC AACAACAAAT AAATAAATGA ATAAATACTT TAGCCAGAAG TAGCCATGCA 13801 GACCTCCCCC CACCAGTCCC ACCCACAAGC GGACGTGACT ACCGCCCCCA TTCACTGCCT 13861 GATCCTCCTG TTCTCAGGGG CTCCAAGGCC AGGCCTGGTT TGACCTTCTG ACTTTCTGAC 13921 TTCCTCCTAC CTTCCCAGTA ACCTCATGCA ACTCCTTTCA CTCAGCCTCA ATCATCCCCA 13981 TGGGTGTTTA AACTTGCCCA AGACATGCCC CTTTGAAAAA GCCTGCCATT CTCTTGACCC 14041 ACATGCACGT CCTGCCCCCT CCAAGGCTGC TAGTTCCTTT AGGGGCAAAA TTGTGAAAGA 14101 GTAGTCTAAA CCTTCTTCCT CTTCTTACCT CCACTTCTTT CTTACCTTAT TCCCATGTGG 14161 ATTCTACCCT CACTCAGGCC TCTAGAACGG TTCCTCTACG GCAGTGGTTC CCAATCTTGA 14221 CTACGTGTTT TTTTAAAAAA AGTCCTCCAC CTGGGCCTGC CACCAAGGAT TTTTCTTTAA 14281 TTGACCTCAG ATGGGGTTGA GGCCTTGGGA ACTGGCCAGA ACTTCCCGTG CTCCTAACTT 14341 GCAGCCGGGG TTAAGAACTA CTCCTCTGAA GCCCCCAGTG CCTGCGCTTT TAGCCCGACG 14401 GACAAGTTTC TGCCCTTCCA TCCTGTGACC TCCAGCAGGG CCTGACCATG TGAGTTTTCT 14461 GTGGCTGCCG TGACAAGTTG CCACACCCTG CATGGCTTCA ACCAACAGAA ACGTGTGCCC 14521 TGGCAGTTCT GGGGGCCAGA AGTCCAACAT CAAGATATCA TCAGAGCCAC ATGCCCACTG 14581 AAGGCTCTCG GGGGAATCCA TTCCTTGCCT CTTCTGGTTG CTGGTGGCTC TAGGCATTCC 14641 TTGGCTTGTG GCTGCATCAT TCCAGTCTCT GCCTCTGAGG TCACGTTGCT GCTTCCTCTT 14701 GTGTGTGTTT CTCTTAAAAC TCTCTGCTTC TGTCTTATAA GGATACATGT GATTGCATCT 14761 AGGGCCCAAC CAGATAATCC AGGATAAACT CTTCCTGTCA AGACATTTAA TAATCACACT 14821 TTGCCATATA AGGTAATTTT TTTTTTTTTT TGAGGTGGAG TTTTGCACTT TCACCCAGGC 14881 TGGAGTAAAG TGATTTAATC TCGGCTCACT GGAATCTCTG CCCCCAGGTT CAAGCAATTC 14941 TCCTGCCTCA GCCTCCTGAG TAGCTGGGAT TATAGGTACC TGCCACCATG CCCAGCTAAC 15001 TTTTGTATTT TTAGTAGACA TGGGGTTTCA CCATGTTGGC CAGGCTGGTC TCGAACTCCT 15061 GACCTCAGGT GATCCACCCG CCATAAGTTA ATATTTTTTT TTTGAGAGGG AGTATTGCTC 15121 TGTTGCCCAG GCTGGAGTGC TAGTGGCTCA ATCTCGGCTC ACTGCAACCT CCGCCTCCCA 15181 GGTTCAAATG ATTCTCCTAC CTCAGTCTCC TGAGTAGCTG GGACTACAGA TGCATGCCAC 15241 CATGCCTGGC TGATTTTTGT ATTTTTAATA GAGAGGGGAT TTCACCATGT TGGCCAGGCT 15301 GGTGTTGAAC TCCTAACCTC AAGTGATCCA CCCACCTCAG CCTCCCAAAG TGTTGGGATT 15361 ACAGGCATGA ACCACCACGC CCGACCCATA TAAGGTAATA TTTACAGGTT CTGGGGATTA 15421 GGATTAGCAT GTAGACAGCT TTGTGGGGGC CACCATTCAG CCCACTATGC TAACCCTGTG 15481 AACCGTTGCT CGCTTCTCCT TGACATCTGA CGGCCTGGCC TTCTGCATAC CACACACCCT 15541 CCCACCTCTC TGGCCACAGT TCTGTAGGCT CAGCCTCCTC CGTAAGGCCA TTAAGTGCTT 15601 GTGCTGGTCA AAGTTTCATC CTAGGCCTTT TCCTTACCTC CCTTGATATT TTCTCCCTAG 15661 GTGAGCTCCT TCAAGCCCAC AGCTTCTGTG CTTACCCACA CTCCTACCTA CATTCCCAGC 15721 TTGGGCTTCT CAGGCCAGCT CTAGACTCTT GTATCCCACT GGGTTCTTCC ACTTACCTTT 15781 GGATATCTCA AAGGCATCTC CAGTTGGCTG GGCACGATGG TTCACACCTG TAACCCCAGC 15841 ACTTTGGGAG GCCGAGGTGG GCAGATCACT TGAGGTCAGG AGTTCAAGAC CAGCCTGGCC 15901 AATATGGTGA AACCCCATCT CTACTAAAAA TACAAAAATT AGCTGGGCAT GGTGGTGGGT 15961 GCCTGTAGTC CCAACTACTC GGGAGGCTGA GGCAGGAGAA TCGCTTGAAC CCGGGAGGTG 16021 GAGGTTTCCG TGAGCTGAGC TGGAGCCACT GCACTCCAGC CTGGGCAACA GAGTGAAACT 16081 CCGTCTTAAA AAAACAAAAA ACAAAAGGTG TCTCTAGTGT AACATAACTA AAACCAAACC 16141 AATCATGCCT CCCTCCCCCG CATCCTCCCT CCTGGAGGGA GCTCCAGGAC TTGGTCTTCT 16201 CTTCCAGAGT TCTCTGTCTC AAACTGCGGG AATTGCTCCC CACCCAGGCC TAACCTGAAG 16261 TGTGAGCCTT GGCATCTCTT TCTATCCACC TGTTTTTCCT CTATGCACCT CACAACCCTG 16321 GTCCAAGCCA CCGTCATCTT TCAAATGGCT GCAGTAGCCT CTAACTGGCC TTGGAGGAGC 16381 CATCCTCTTT CTCTAACCAG CTGCCAACCC TGCAATGGCC TCTGTGTGCT TTCCAGATAA 16441 AGCCTGACTC CTCGTGGCCC GCACAGCCCT GCCTGGGTGG TCCTATCCTG CAGCCTCTCC 16501 AGTACCATGA ACCCTCCCTT CTCTGAACCT CTATTTAATC CATTTCATAT ACCCCGTTTT 16561 CTCCTGCCAT AGGGCCTTGC ACATGCTGTT CCTTCTGCCT GGAATTTTCT TCCTGCCTCC 16621 CTCCGCACCC CTGCCTTGTG TTGTGGGTTC CTCGCTATCC TCTAGCTTTT CGCTCAGGCT 16681 CATTGTTGGC CCTCTAGATG TATTCACTTC TCTTGTTTGT TACCCTCTGT CATAGGACTG 16741 TGTTCGTACT TCCCAAGGAG TCGTCTTGGT TTGTGACTGT ACATTTTCCC ATGTGACATT 16801 TGCTTAATGC CTCTCCCACT CTGGGGCCTG TACAAGCCCC AGGAACAGGA CTTGGACCCT 16861 CCTGTTTAAC TCTACAATCT AGCATCCAGC AGGCGCGCAG GCCTTCGTTG ACTTTTATTT 16921 TATTCTTATT TTTTATTTTT GAGATGCAGT TTCGCTCTTG TCGCCCAGGC TGGAGTGCAG 16981 TGGCGTAATC TCGGCTCACT GCAGCCTCTG CCTCCCAGGT TCAGGTGATT CTCCTGTCTC 17041 AGCCTCCCAA GTAGCTGGGA TTACAGGTGT GCGCCACCAC GCCTGGCTAA TTTTTTGCAT 17101 TTTTAGTAGA GATGGGGTTT CACCATGTTG GCCAGGCTGG TCTCAAACTC CTGGCCTCAG 17161 GTGATCCACC CACCTCGGCC TCCCAAAGTG GCTGGATTAC AGGGGTGAGC CCCCATGCCC 17221 AGCCTTCATT GACTTTTAGT TGACAACTAT TTAGCATTTG CTATGTGCCA AGAACTCCCT 17281 GCCTACTAAT GCAGTTAACC CTCATGAAGC CTAGAAGGAA GGACTGCCAT TCTCCCCACT 17341 TAACAGATGA GGATGCCGAG GCACAGGAAG TGAAGTGACT TTCTCAGGGT CAAGCAGGGA 17401 GTGAGTGGAG GAGCCGAGAT TCCAGCTCTA ACCGCATGAT GCTCTATACA GTGTGACTCC 17461 GGCTCTCTGG CTGGGCCCTC TCCATAGCCC TGTGAGGGTT AAGGATAGAA AACAGAGGCT 17521 CAGAGAGTTG AGGTCCCTTG CCTGAGGTCA CACAGCTGGT TGGCCGTTCC CTGGGCTATA 17581 AGCTTCAGTA TTCCCAATGC TGAGCATATT TTGAGAACCC GAGAAACAGA CGTTTGGCTG 17641 GGTGGGAACT GAACTCATTT TGTCAGGGAA TTCAACAACT AAGTTGGCCC TGAGACTGGG 17701 TGTGAAGACC GCTCTGTCCC CTGCCAGCTG GATGACCTCA GGAGAGATCT GATGACTCTG 17761 AGGTCCTGCT GATAGGACCT CTGGTGTCTC TGTTCCCTGC TGGCCTCCCC TGGGCCTGGG 17821 TTGGGTTTCC TCTGCAGGAG GCAGCTCATG TATGTGCTCC TAGACGCCCT TGGGCCAGCA 17881 GCTCCTTGGC TGTTCCTCCC TGAGCCAGGG CAGCCAACTT TCTTATCCAG CTCTCCATGC 17941 TCCCCACCCC AGCATGAGAT GTCAGCTGAG AGTTTTCTGG ATCTCCCCTA GCTAGGGGGA 18001 AAGCTTCCAT CATTTGGAAC AGGAACAGCA GGAACAGCAA AGTCCCTTTC CCCACCATCT 18061 CCCACTGCCT GCTGTGCTTC TCCTAACAGC TCATGGTAAA CACCCTGACT GAGCGGCAGG 18121 GGCTGTTTCC TTTGGGCTAT CCATGTCCAC CTACACTGCC CTTTTTAATC CTTACAATTT 18181 TTCTTGGACA CGGGGGCATA ATATTCCATT GTTTTTCAGT TGAGGAAACT GAGGCTCAGA 18241 GAGGTCAAGT GTCTTGTCTG AGGTCACACA GCAGAACTGG GAGTCAAGCC AGATGGGCTG 18301 CCTCCAAGGA TCCTACTCTT AAACTCTAGA GTACTAGAAA GATCTTCCGT TGCCTAATAT 18361 TGATTCCTGA TAGGCTATGC TTGAGTAGCA TCTGCTTTTG AAAATGGAGC CTGGGTCGGT 18421 TGCGGTGGCA CATACCTGTA ATCCCAGCAC TTTGGGAGGC TGAGGTGGGT GGACACCTGA 18481 GGTCAGGAGT TCGAGACTAG CCTGAGCAAC ATGGTGAAAC CCTGTCTCTA CTAAAAATAC 18541 AAAAATTAAC TGGGTGTGGT GGCACCTGCC TATAGTCCCA GCTACTCCGG AGGCTGAGGC 18601 ACAAGAATTG CTTGAACCCA GGAGGTGGAG GTTGCAGTGA GAGGAGATCA CGTCACTGCA 18661 CTCCAGCCTG GGAGACAGAG CGAGACTCCA TCCGTCTCAA AAAAAAGAAA ACGAAAATGG 18721 ATCCTGAATT TTGAAATATG CTGTGACTCT TCCCTAGTTT GGGACATCTG GGTCAATCCC 18781 TTTTGTTAAA GTAGTTTATT TAGTTGGCTG AGAGCGGGAG CTGCCTACGT GACCTGGAGC 18841 ACAAGCTTTG GAATTGGGCT TGGGTTAGAA TTCCGCCTCT GCCACTCACC AGCTGCGATT 18901 AAGAACAAAG ATACTGGGTT GGGCTCCTGC CTCTATTACT TGCAATCTGT GTGGCCTTGG 18961 ATGAGATATT TAACACCTCC GAACCTCAGT GTCCTCAATT GTGAAAGAGA TCGAGATAAC 19021 AGCTGAACCC ACATCCCAGG AGCGGATTAA ATGAGATAGT GCAGTACAGA GTTTACCGAA 19081 GTATATGGGG TCAGCAGCCA GCCAGTAAAA TGGTGGCTAA TGGTTATCAT GATTAATGTT 19141 AACATTAAGC TCTGAAAGGT CCTTCGTGAA CTCATAGGTA TTTGTTCTCT CTCTCCCTTT 19201 CTCTCTCTCT TCCCCCTGCC CCCTTGCAGG TAGAACTGCA TGTCCACCTA GACGGATCCA 19261 TCAAGCCTGA AACCATCTTA TACTATGGCA GGTAAGTCCA TACAGAAGAG CCCTCTCTCC 19321 CTGGGATTTG AGTGGGGTCC CCAGCTCCAC CCAGAGGCCC CTGGGGAATT CCAGGGTCAC 19381 TGTTCCTTCC TGTCTCCCTG TGGGAATCAA GCCAGCTCCA GGCCAGAAGT GGGACTGTGA 19441 GGACATGGAG GCCTCGGCAC TGAGCTGCAG ACCCGCAGAC CAACTCCTGA GCTTTCTGGG 19501 CCTCTGAGTC TTGTCCTCCT GGTGTCAGGT GAGCCAGGCC TGAGCCTGCT CTCCCCACCC 19561 ACCCACATAC GTGCATGAAG GTAGTTCCCA GGGCTGAATC CGTCTTTTTT TTTTTCTTTT 19621 GAGATAGAGT CTTGCTCTGT CGCCCAGGCT GGAGTGCAGT GGCATGATCT CGGCTCACTG 19681 CAACCTCCAC CTCCTGGGTT CAAGTGATTC TCCTGCCTCA GCCTCCTGAG TAGCTGGGAT 19741 TACAAGCACA TGCCACCACA TCCAGCTAAT TTTTGTATTT TTAGCGGAGA TGGGGTTTCA 19801 CATGTTGGCC AGGCTGGTCT CGAACTCCTG ACCTCAAGTG ATCCACCCAG CTTGGCCTCC 19861 CACAGTGCTG GGATTACAGG CATGAGCCAC TGTGCCTGGC TCCTGTCTTT TGACTTAACT 19921 GAGAGCCTAT ATATAGCAGG TGATGTGCTC ACATGAGATG CCAGTACAAT TTCTTGAGCA 19981 TCTCCTAGAG CTGGGCTGGG CTTTATCAGC TCATTGAATT CCTCCACGCT TGGAAGAGGA 20041 GGATACGCTC TCTGCATTTT ACTGAGGAGG GAATGGGCTC AGCCAAGACA GTTGTCCACG 20101 GTCACACAAA TTAATAGCAG ATCAAGAGTT GAACCCAAGG CTGTCTGACC CCTAAGGCTT 20161 TACTACATCA TCAGGGTCAT AACCTGCTAG GAGTCACGGA AAAGTGGCTC CCCAACTCTG 20221 GGCCTAAATC TCTGCATCTT CCAAGTGAGA ACACACTTCC TGCCTCAGCT CTCAGAGATG 20281 CTAGGGGGCC AGAGGGTCCC CCTGTTCCCC AGCGAGGAAG GTTCTTCCCT TCCTACCCAG 20341 ACCTCAAGGG CTCACAGCAG CTCCTCTCTT AGGACCAGCT TTTAAGGGCA GGGACTTTAA 20401 AGGCCAGTGG ATCTGGATTC AAATTTGGAC ATATTATCTC CTGTCTGCGA ACTTGGTCTC 20461 TATCAACTGA GGCTAAGAAC AGGCCCTCCC TAGAGAGATG ACCTAGGAGC TAGGGGCTCC 20521 TTGTCCACCC AGCCCTGCCC CCGCAGACCT GTGTTCCTCG GATGTTTGCA CAACACTCAT 20581 TTTGTTTGGA GCTGAAAGAA CTCAGCCTCT CTGTCACAGT CTTGAAATTC AGCTCGGGAC 20641 CCAAATTTGA ACATTTCTGC TCCATAAGCC AGAATCCTGT TATTCAGAGG CCTGCCCTCA 20701 TGGAGAGAAT GAGGGATCCC GGGGGTTGCC CCCAACTCTC GGGAGCATCT CCACCAACTC 20761 CCTGAGAGAT TTCTGGTAAG TCCACTATTC TCCATCTTTT CACACTTCCA GGGACCTTCT 20821 TCTGCCCCAG GAAGCTGCCA TTGATTTAAT TCCTATTTAA CTGCAAGGCA TAAGCACAGT 20881 AGCACCTCCT GTGTGCCAAA CACTCCTTTA AGTGCGTTAC CCGGGTTAAG TTATTGAAGC 20941 CTCACAACAA TTTGTAAGAT AGGAACTCTA TTGCCGTCAT TTACAGATGA GGAGACTGAG 21001 CCGTGGTAGG TGGAGTAAGG TGCCCAGTAA GCACAGGGCG GAGGTTTGAA CCCAGATAGT 21061 CTGCCCCCGA GTCCATGGCC CTGGCCATTA CCCCCTGTCA GTTAGAGGTT TTGGTAAGTG 21121 ATGCCCGTAA AATGCTTAGT TCAGGGCCTA GCACACATTA ATGTGCTCCA TAAATGTCAC 21181 TTAATGATAA TATTCTTATT AATTGGAGCT TATATCTCTA AGTGGGGTGA AACCTCTTGG 21241 CTTATCTCTG CCTGGCCTTT GCCCATGTCA AGCCGCCAAC TTGCCACAAG GCCCCTAATG 21301 AGGTCGTTCA GTGGGGCACC AAGATGAGAT CGAACCCAGG CACTCATTAA GGGGTCACGG 21361 AGGGCTCATC AGCTGCAGCC AGGGGCTGGG AGCGCCGGGT GGGGCTAAGA GAAAGGGGAA 21421 AGGAGCCGCC GGGAGGGGCA CTGGTCTGAT CGTCCATTCC TCACACCACC TCTGGGCCTT 21481 GGAGATGGCG TGCGGCAGGT GCCAGCTGGA GCTTGGCCTG AAGTCAGCAG GCAGGGGACT 21541 GGGGAGTTTG TCACACTCAG ATATGGGTGT CTGTAAATGC ACACAAATAT GGGCTAAGAA 21601 TGGAAGGAGG AGGGGAGCCC CTGGCCTGAG CCCTGCTAGG CCCAATTCAG TGGCCCTTTT 21661 TCCAGCTCTG GGACTCAGGC CTGCCTCATT AACTGTCCTC ACCCATTTCT CCTTCCTCCA 21721 GTTCCCAGGA TTCTGGCCTT TTCAGGGGCC TCTCCAACCT CTTTCTCAGT CTTGTTTATA 21781 ACCCTGTCAA CTATTTCTAC AGAGATTCTG AAACTGGCTG CTCTTTCCTC CGATCACTGC 21841 CCTGGTCTGG GCCACCACTG CCCCTCCCTG GTGCTGTGGC CTCCTGATTG GTCTCAGCCA 21901 TCTACTCTGG CCTTCCTCTC TACGGGCCCT GCAGTGCTGT AGTTGGAGCA AGAGCCTTAA 21961 CCCATGGTCT TCCCAGCTCA TTCCCCAGCT TCCCCATCTC ACTCAGAGTC AAAGCCAAAG 22021 TCCACACATG GGCCTTAAAG TTCTGCAAAG CCTGCATTGC CTCTCTGACC TCTCTAAGGC 22081 TCCTTGCTTA GTCCACACTG GATGTTTTTC AAACATGCCA GACCTAGGAA ACAGAGAGTC 22141 TGGGTTACTT GCCCAAGGTC ACACAGCCTT TAAGTCACAG AGCTGGGATT CAAACCCAGA 22201 CCACTGGGCT TCAGAGTCTG CTCTTTCTCA TGACACACAA AGTTTCATTT CTTCCTCTGT 22261 GCACCCCTAC ATGGAAAATA TTATGTTTTA CTGACAAGGG CACCAAGGGC CTTAGAGGGG 22321 AGCGCTCCTG CCTGGGATGA TGTGGTAAAT AGGGGTGGGA GATGGACTTG ACCTGCAACC 22381 CCTGCGCTCA TCCTCCCTCC CTCCCTGGGC TCCTGATGGT GGGCTTCTTG TGACTGTGTT 22441 GCCCACCAAG GCCGGAAGAG GACCAGACAG TGCCCCAGCA CAGCAGCTGT GGCTGACCAG 22501 GGAGTAGGGA TCATCTAAGA ACAGAGCGTG CATGGTGCTC ACGCCTGTAA TCCCAGCACT 22561 TTGGGAGGCC AAGGCGGGTG GATCACCTGA GGTCAGGAGT TCAAGACCAG CGTGGCCAAC 22621 ATGGGAAACC CCGTGTCTAC TAAACATACA AAAAATTAGC CAGGCATGGT GGTGGGCATC 22681 TATAATCCCA GCTACTTGAG AGGCTGAGGC AGGAGAATCA CTTGAACCAG GGAGGTGAAG 22741 GTTGCAGTGA GTCGAGGTCG TGCCATTGCA CTCCAGCCTG GGCAACAAGA GCAAGACTCC 22801 GTCTCAAAAA AACAAAACAA AAGAAAAAAC AGAGGGTGGC CCTATGAGGA GCCTTCGCTT 22861 GTGTGGGTGG CCAGGGACAG CAAGAGGTGC CAGGGCCCTA GGAACAGCTC TTTCCTGCTT 22921 CAACTTTGGG CTCCAGATGG GCGCTTTCCA GCTCAGTCTG AGCAGCTTCG GGAAGCTGTG 22981 TCCCATGGGA GACACTGGGA GTCCCCTGTG CTCTTTGTCT CCTGTCGGGC CCCCACATTA 23041 GCTCTCTGGC CTCAGCTCTG GCTTCCCTCC AATTTGTTTC CCACGCAGCA GCCAGAGGAG 23101 CTTTCAAAAA GGTAAATTAT TTCATGCTAG TCCCCTGCTT GAAATCCTAC AGTGCCTTCC 23161 CAGTGCTTTC AGCCAAAGCC CCAGTCCCTT CCTAAGCCCA GCCTGGCCCT GCCTCCCTGG 23221 TGCATCATCT GCACAAATGC CTGCTCTCTG ACCTCCAGCC ACCCTGCACT TCCAATGCCC 23281 GCGGCTTCCT GCCTGCAGCT TTAGTACAGA CCCCTCCCCT GCCCAGAACT GCCCCCACCC 23341 CAAGGCTTCT GCTGAAATGT CACCTCCTCA GAGAGGCCTT CCCTGGCTGC TCTGTCTAAA 23401 CTCTGTGTTG AGAAGTTCCT TCTTGATGGT TGTTGAGGAG GGAGGCTGGA GAAGAAGAAT 23461 CAAAGAGGAG AAATAGAAAG CAAAATAATT TGTTCTTGGG GACGGGCTGG TGCTGGGCAC 23521 GGGGAGGCGC CCGTCTCTGG TGTGGGCAGC TGGGTAGATG GAGGAGCCGT ATTTGGAAAT 23581 GTGGAACCCA GGAAGGGAGT GATCTAGAGG GAGGGGAAAG GTGGCGCGAG ATGCCTGCCT 23641 CTCAACAGGT AGCCAGACAC ATGGGTCTGT CTTGGTCACT GCTATCTGCC CAGTGCCCAG 23701 CACATCACAG GCCCTCAGTG GTGGTGTGTG GGCATAGAGA ATTAGAAGCT GTGGACCTCT 23761 GGATCCGGAG CTGAAAACCA CCAAAGGAGA TGAGTTGGCC TGGCCAGGTG TGTAAAAGGC 23821 AGAGTCTGAG AGAGAACGAC CAGAGGGCAG AGCCCCGCAG GTGGAGTCCT GGGGGCTGGA 23881 GGGAGACCAT TAGGAGAATC GCACATGGCT GGCGCAGCAG GTCCCAGGCA AATGTGGCCA 23941 CTGGGTTTGG CAATATGGGA GCCAGAGCCC TAGTGTCATC TCCCTGCCTT CTACCCAGCA 24001 GTTCCCAGAG TGATATCCCC AACAGTGTTT GACAACTGGT ACAGGCTCTT CAGCGGCCAC 24061 AGTTACTGGG CAAGGCCTTG TGAGGGTGAC TTTGGGGCAG CTGGCCAGCA GTGGGAGGGG 24121 AAGCAGTCTC AGGGGTACCT GAGGCACTGA GCTCCGACCT CCAGGTGCCA ATGCCGCACC 24181 AGGGCACCGT TCCCCTGCAG GCTCTTACAG GGATTAGGGG CTGGTAAGGA GCAGTGATTA 24241 GGGGCTGACT AGCAGGCTGG TGGGCACCAG CATGACCCCT TGGTGGTACC CTCTGGGCAC 24301 TCATGGGGAC TTGGGCTAAC AGATGGGGAA GGGAGCACAT TCAGGGGGCT TAGGAAACAT 24361 ATTTATGTAG GGAAGCATTT TAATATTTTA GTAACAGAAG CTATTAAAGG ACTTACAAAC 24421 TTACTTACAT ACACTAAAAC ACTATTTGGT CAAACTTCTG TTTCTTTGGC ACTTTCCTCC 24481 TTTATTCTTT TTTATTTTTT TGAGACAGGG TCTTGCTCTG TCACCCAAGC TGGAGTGCAG 24541 TGGTGCAATC TTGGCCCGCA GTAGCCTTGA CTTCCAGGCT CAGGTGGTCC TCCCACCTTA 24601 GCCTCCCAAG TAGCTGGGAC TACAGGTGCA CGCCACCACG CCTGGTGAAT TTTTGTTTTG 24661 AAGGGGTTTC ACTGTGTTGC CCAGGCTGGT TTCAAACTCC TGGGCTTAAG TGATCCGCCA 24721 GCCTTGGCTT CCCAAAGTAC TGTGATTACA GGTATGAGCC ACTGCACCCG GCCTCCTATT 24781 TTTCTGCTTC TGCTTTGTGG ATAATTGGAT GCTTGGACCT CCTGATTTAA TCTTCTAATT 24841 TCCTTAACTG TTTACTCCTA TTTTTCATCA TCTTGTCTTT TTGTTCTACT TTGTGGAGGA 24901 TTTCTTCACT TTTAGCTTCC AGTTCTTTTC TTACATCGTG ACAGTTGCTG CCGCATTCTC 24961 TTGTAAATTT CCGAGGGCTC GTTCTTGGGT TCTGAATGTT CCCTCCTTTC AAGGATCTTC 25021 TCATCTCTTT GAGGATATTC ATGTCTTTTT TGTTTTGGTT CTTAGGTTTT CATCTGTTCT 25081 CTGTGCTGTT TCCTCGGAGT GCTTTTGTCT ATTCTGTTGT TTTGTCCCTC ATGTTAGAAG 25141 CATTTCTTTT TTTTTTCTTT TTTTTTTTGT GATACAGAGT CTTGCTCTGT CACCAGGCTG 25201 GAGTGCAGTA GCATGATCTC GGCTCACCAC AGCCTCTGAC TCCCTGGTTC AAGTGATTCT 25261 CCTGCCTCAG CCTCCTGAGT AGCTGGGATT ACAGGCACAC ACCACCACAC CCAACTAATT 25321 TTTGTATTTT TGGTAGAGAC GGGGTTTCAC CATGTTGGCC AGGATAGTCT CAATCTCCTG 25381 ACCTCATGAT CCTCCGACCT TGCCTGGGAG GCCAAAGTGC TGGGATTACA GGCGTGAGCC 25441 ACCATGCCCA GCCTAGAAGC ATTTCTTAAT GTCTGGTGTT CTCTGGCTGT TGTATCTTAA 25501 AAAAAAAAGG GGGGGGAAAC TGAGGCTCGA GGTGACCTTG TGAGCTGGAG CAGAGCCGGG 25561 ATGGGATGAG GAGGCAGGAG CGTGTGCAGA AGAGAGGGAG CCCCCCTGAG CTCGCACCCT 25621 GCTTCCCGTG GCTGGGAGGG GAGGCCGAGA TGCTTGGGGA GAAATGGAGG CTCCAAGCCA 25681 GAGGGGCTGT TTCCAGCACG CTCTTACTGA GCGCTGCTGT AGTCCAGCTT GGTGTGGCGG 25741 CTGTGGGCAG GGAGGGGAGA GAGGTCTGAG CTGGCTGGCG GCCCACTGGG CCCCTCCCCT 25801 GAGCCTCCAC CGGCCCTCTC CCAGTGCGCT GGGCTGGGCA AGCCTCTGAT GTGCCAGCCA 25861 GATGGAGGGT GAAGTCCTGA TGCCTGCCCC TACCCTGGGA ATTGTGATGC TGCAGTTACT 25921 GCCCCTGATA ACCCCTGACT GGGCATAGGA CCAGCTGGCT GAGCCAGCTC CTGGGGCTGA 25981 GGAGGAAGCC ATGAACTTGA CCTGGCACTT TCCTTGTCTC CAAGCATCAG TCAACCAAGG 26041 ATATGGAGGG GGTGTGTGCA TGTGTGCACA CATACACACA CACACACACA CACACTTCAA 26101 CCTGTTTATC CCCCTTGAGA TTTGCTGACT TGTGCATTGG GGGTAGAAGG TGCTGGAAAA 26161 ATTCCGGTCC TGGTTCTCAG TTTCCCCATC TGTCCAGTGG GAGCAGCTGG ACTGAGAGAC 26221 GCCCATGTCT CCTGCTGTGG TCCTGCAAGG AGGCTGGCGC TCCTGAGTCT GCTCCATCCT 26281 GGCCTGTCAG GCCTGCCTGG ATCCTGCCCC GGGTTGGTCC ACCACTCACT GTTTTGTTTC 26341 CAGGAGGAGA GGGATCGCCC TCCCAGCTAA CACAGCAGAG GGGCTGCTGA ACGTCATTGG 26401 CATGGACAAG CCGCTCACCC TTCCAGACTT CCTGGCCAAG TTTGACTACT ACATGCCTGC 26461 TATCGCGTGA GTTGCCCCCA ACCCACAGGT CCTAGGGCAG CATTGATCCC TATGACTAGG 26521 ACCAGGCCTG TCCCTCAGCC TGTGGGGGCC AGAGAAGTTG CTCTGAAACC ACAGCTGTCT 26581 TTCTCACCAT TGTGTACACT TAGTGAGTCT CTCCAGTGCC TTTAGGCCTC AGTTTTCCCT 26641 TCTGAGATGT GGGTGTGATG GACTGAAATT GCTTCAAGTT CTACAGAGAA ATGGCAGAAT 26701 ATGGGAGCTA AGAACACAGG GTCAGAGGCA GTGCAGGGCT TGAACCCGGG CCATCTATCT 26761 CCTAGTTCAG GGCTTCGTGT TGTGAGGGGA GGAGAGGCCT GAATATAGGG TGGGGGCGGG 26821 GAGATGTGGG GAAGATTCTC CAAAAGGCTT TTTCTTTTTC TTGTCTTGAG TCGCCAGGGA 26881 ACAGCACTAG GTACCGAAAA GGCCAGAAGG GGTATGGGCG AGTACTAGAG AGAAATTTCC 26941 ATGACTGCTT TATTTATTTA TTTATTTATT TATTTATTTA TTTATTGAGA CAGAGTCTCA 27001 CTCTGTTGCC CAGGCTGAAG TGCAGTGGTG CGATCTCAGC TCACTGCAAC CTCCACCTCC 27061 CAGTTTAAGG GATTCTCCTG CTTTAGCCTC CCAAGTAGCT GGGATCACAG GCACCCACCA 27121 TCACACCCAA CTAATGGTTT TGTATTTTTA GTAGAGATGG GGTTTTACTA TGTTTGCCAG 27181 GCTGGTCTCG AATTCCTGAC CTCAGGTGAT CTGCCCGCCT CGGCCTCCCA AAATGCTGGG 27241 ATTACAGGCG TGAGCCACTG CGCCTGGCCT CCATCCTCAT CCTGAAGATG CAAGAACTTC 27301 TGGTGACCCC TTCTCCTGAG AGTGGCCTGA TCTCCCCTGG GCAGGGCACT TTCTTCCCAC 27361 GCTGGGCTCT CCCACGACTT GTGTGCCTTC CCTCACACAT TCTAGTAACC ACTTCATTTT 27421 CACTCTTCAT GGTGGGAACT TCCAGCTAAG CACAGTCCAC CGTTACGTGA TCAACACAGT 27481 GGCCCTGGCA GGCCAATTTG TGCCTTGCTT CTGGAACAAA CATGCAGTAA TAACAACGAA 27541 AATGTTTTGA GCATTTGTCC GCTCTGCTCC AAGCACTGAC CCGGGTGGGG TTTATGAAGT 27601 TTGACTCATT TGTCCCCGCA ATAACTCCTT GACCTAGGTG TCAGAGGGTG ACTAACCAGG 27661 GGTCACACAG CAGATAAGTG TGGGCACAAG GATCCAAGTC CATGACTGTA TCCCACGTGT 27721 CTCCCACATC CAGGCATCCC TCTGGACTTG TCCAGCTGTG TCCTTTTCTC TCATTTCTCT 27781 TCCCTGCCAG CCTTAACTCC ATCACCAACA AATATTGGGC TACTCTGTCC TAGGCATGGT 27841 CCTCAGCTGA GAGGTCGCAG CCATCCCAAG ACAGAGGGGT CCTTGCCACA TGGAGACTGC 27901 ATTCTAGTAG GGAATACAGC AAACTGGCTG ATAAGCCATA TGACACACAA TGTTGAGTAG 27961 TGATAAGGAC CTGGGAGAAA AAGAAAGCCC AGGAGAATGG TGGAGGGGCC GTTTTAAGAT 28021 AAGGCGGTCT GGGCCAGGTA CAGTGGCTCA CGCCTGTATC CCCAGCACTT TGGGAGGCTG 28081 AGGTGGGCGG ATCATGAGGT CAGGAGATCG AGACCATCCT GGCTAACACA GCGAAACGCT 28141 GTCTCTACTA AAAATACAAA AAATTAGCCG GGCGTGGTGG CATGCGCCTG TAATCCCAGC 28201 TACTTGGGAG GCTGAGGCAG ACGAATCACT TGAACCCAGG AGGCAGAGGC TGCAGTGAGC 28261 TGAGATGGCG CCACTGCACT CCAGCCTGGG CGACAGAGCA AGATTCTGTC TCAAAAAAAA 28321 AAAAAAAAGA TAAGGTGGTC AGGGAAGGCC TCTCTGAGGA GGTGAAGCTT CAGCTGGCTC 28381 TAAACCAGGG GAGCGGGAGA GACGCAGTGT AGGACAGTAT CGGGGAAGAG CAGGCCTGTG 28441 TCTTCTCCGG TGGCCTCAGG GAATGAGGGA GAAGGAAGGT GCTGGGGAGG CTGGCAAGGC 28501 TGGAGGATGC AGGCTTGTGG GCAGGACCTG GGAGTTGCGA TGTCACTCTC CGTGGCAGGA 28561 AGCTACTGGG GCTTCGAGGG GAGAAGTGAT ATGCTTTGAT TTACCTTCTT AAAAGATTGC 28621 CCCAACTGCT GGGTGGAGAA CAGGATGACA GGGGCAAGCA TGGAGACAGG GAGGCCAGTT 28681 AGAGATGGCG TGATTCAGGC CAGGATGAGG GGTGAGAACT GGTATGCAGT TCCAAAGTAG 28741 AGCTGATAGG ACTTGCCCAG TGTCTGGATC TTATCCAGTG GATGCCCAGA GCTTGGGTCT 28801 GGGGATGAAG TGGGTTTAAT CTGCCAAGGG TTGGGGATGT CATTTGCTCC TGGAGCTCCC 28861 AAGGGACTTG GGGAAGGTTG TTCCCAACCC CTTTCTTCCC TTCCCAGGGG CTGCCGGGAG 28921 GCTATCAAAA GGATCGCCTA TGAGTTTGTA GAGATGAAGG CCAAAGAGGG CGTGGTGTAT 28981 GTGGAGGTGC GGTACAGTCC GCACCTGCTG GCCAACTCCA AAGTGGAGCC AATCCCCTGG 29041 AACCAGGCTG AGTGAGTGAT GGGCCTGGAA GGGGCCATGC TGAGGGTGTG GCTGGGAGGC 29101 TCAGCTCTGA GACTGGAAGG GCGAACTGCT GGGAATCCCT GACCCAAGCA AGACCTTGTT 29161 CTTGCCCCCA GTCTGGTCCA TGGCCTCAGA AAGATGGGTT TAACTCTGTC ACAAGAGACG 29221 TGGTTCCCAT CCTCCCTTTG CCGTTATGTT CTTACCTTGG GCACAAGTGT TTGGCTGTGT 29281 CTTGCTCTGG CCACAGGCCT GCTGTCCAGG AATGTTAACC TGCTTAGCCA CCCAGGATTT 29341 CTGAGGGGTC TCCCTTGTCA CTGATGCTGA TCAGATCTCT AAAGGCCCTA AAGGTCCTGC 29401 TCTAACTTCA TAACTGAAGT GAGTCTGGCC CATTTCTAGC CCCCTGCCTG GGCCCCCATG 29461 GATCTCTAAG TGGTATCACA AAACCACCCT GCCCCATTTT CTGAGCCATG ATTCTGATAC 29521 ATATAGAATG TGAACATCAT GGCAGGCCCA AGCTTAGCAA TGCTGTCCAT CTGGGGGTGG 29581 GGAGGGCCAT GTTGACACCC CACACCTCCC ACTAAGATCT AGGAGCACCC AGCTGCTTTA 29641 AGAGCTAGAG GGACATGCTA GGGCCTGGGG GCATCTCTGC CAGTCTTTCC TCTGAGGCAG 29701 TGGGTCAGTG GGGGAGGAGG GTCCTCCCCA AAGCCTCCTC TTCCTCCTCT GTCCCAGTCC 29761 CAGAGCTGCC CTTTAGGCCT TCCTTTTGCC TCAGGCCCAT CCCTACTCCT CTCCTCACAC 29821 AGAGGGGACC TCACCCCAGA CGAGGTGGTG GCCCTAGTGG GCCAGGGCCT GCAGGAGGGG 29881 GAGCGAGACT TCGGGGTCAA GGCCCGGTCC ATCCTGTGCT GCATGCGCCA CCAGCCCAGT 29941 GAGTAGGATC ACCGCCCTGC CCAGGGCCGC CCGTCTCACC CTGGCCCTGA CCTCCTGGCC 30001 TAGCAGTGGG GCTGTACCTG ATCTCCCCTG TGCCCCACAG CCCCATGGTG TCCCCTTGAG 30061 CCCACTGGCA TGAACTTGGG GCTTCATGAA ACAACTGGAG ACCTCCTAGG CAGGCTCAGA 30121 ACTTCTGGAG ATGTTCTCCC CAGGGACACC ATGCCTTTAT AGCCACCCTG CAGGAAGCTC 30181 AACACCAAAT AGGAACGTAA CTATTGAAAA AAAAATCTAG GCTAGATTCT GATCAGCCCA 30241 TAGTCCTCCC TCGAGACCCA GTGGACCAGG CCCCATCCTG TCTGGGCCTG AATAGGTCTG 30301 ATTTCCAAGA TTTCTGAGGG GTCTCCCTTG TCACTGACGC AGATCAGATC TCTAGAGTTT 30361 GTGCCTCATG GTGCACAGCC TCACTGTGTG ATATTGGGCA GGTCACACTG CTGCTCTGGT 30421 TATGCACCAA GACACCTCAG TTGTGCACTG TCACAAGGAG ATGATCACAC TTACTTCATT 30481 CCTCTACCCT CAGGATTAGT AAGAACCAAA GAGCTACCTG CACGCATTTC CTCTAATCCT 30541 CGCAGCAGCC TGCAAAGCAG AACTACCATT GCTTAGTCCC ATTTGACAGA TGAGGAAACT 30601 GAGGTGGAGT GAGGTGCAGC CTCTTGCAAG GCACAAACCC TGGATTTGTA TCCGGGGACA 30661 TCTAGTTCCA AAGCCTGTGT TCATTCATTC TTTCTTAAAC ACTTCAGAAT AACTTTATTG 30721 GTTAAGAGTA CCTAATACAT TAGCGAGATA CTTCCCAATA CTAGTGTGAG TTCTATTTTA 30781 GATGACGTGT TAAACGGTCC TCCGTTTCCT CATCTGCGCA TGGGAATAAG CCTACCATGA 30841 GTGTTGTTGG AAACACCAGG TGAGAGAAGG GTCCGTGTCA TTTACTGAGC TCAGGCCCCG 30901 TCCTTGGTGC TTTACACACA TGGCCTCGGC AAAGCCTGGC CGTGACCCTG TGCAATAGCT 30961 GGCAGGGTTC TTTCTGAAAA GGGCGGAAAC TGAGGCCATA AGCAGAGCAG TTTTCCGCAG 31021 CCATGTGGTT AGGACATAGC AGTTAGGATT TGAAGACACT GAGCCCTGTT TTGTGCTGGC 31081 CTCCCATGGG GGGTTTGGGT GGGACAGCAG GCAGGTAGGC TGGGAGGTCT CTCCATGGTG 31141 CTGGTGACAG AGCCTGGGTG GGCATCTGCC CACAGACTGG TCCCCCAAGG TGGTGGAGCT 31201 GTGTAAGAAG TACCAGCAGC AGACCGTGGT AGCCATTGAC CTGGCTGGAG ATGAGACCAT 31261 CCCAGGAAGC AGCCTCTTGC CTGGACATGT CCAGGCCTAC CAGGTGGGTC CTGTGAGAAG 31321 GAATGGAGAG GCTGGCCCTG GGTGAGCTTG TCTCCCACCC ATAGTTGGGA GAAATCACAA 31381 GAACCAGGGA CCATGGTGTC TCCTGAGTTC TGAAGTGTGT CTTTGTTGGG TCTTAAGGCT 31441 TGGAACTGGA ATCCCCCTGG GCCAGGCGTG GTGGTTCATG CCTGTGATCC CAGCACTTTG 31501 GGAGGCGAGG CAGGAGGATT GCTTGAGCCT AGGAGTTTGA GACCAGCCAG GGCAACATAG 31561 TGAGATCCAT CTCTGCAAAT ACAAAAAAAA GTAGTCAGGC ATGGTGGTGC ATGCCTGTAG 31621 TCCCAGCTAC TTGGGAGGCT GAGGTGGGAG AATTGCTTGA GTCCAGGAAG TCAAAGCTGC 31681 AGTGAGCTGT GATAATGCGA CTGCACTCCA GCCTGGGTGA CAGAGGGAGA CCCTGTCTCA 31741 AAAAAAAAAA AAAGGAAGAA AGAAGAAAGA GAAAAGAAAG AGAAAGAAAG AGAGGAAGGA 31801 AGGAAAAAGA GGAAGGGAGG GAGGGAGGAA GGAAGGAAAG AAGGAAGGAA GGGAGAGAGA 31861 AAGAAAAGCC TCCACTTGGT GTTGGGAGTC CTGTGCTGAG CCTGCTTCTG GCTGTGATTT 31921 GCTGTGTGAA CCTGGGCAAC ACTGTGTCTT CTCTGGGCCT CTGTTTCTTC TATTGGGATG 31981 ACTGAGTTGG AGCCGACATC TCAAAAGTCG CTTCCAGCGT GATGATGAAT GGGCCTCCTG 32041 TGGAGGGTGC AGCATGGTGG AGAAGTCAGG GCTCTGGAGT CCCACTGCCC GGGCTCAGAG 32101 CTTGGTTCCA CACTTCCTGT CTGACCTTGG TCACATTACT TGAATCTCCT GAGCTTCAGT 32161 CCTTCATCAT AAAATGGGTG GGATAATAGT TGTGAATATT AGATAATGTA TACAAGTCAC 32221 TTCATATACT ACCTGACACA TGGTAACTGG CTAATGAGTG ACAGCTACCA CTTAGATAAG 32281 GACTTGGAGG GTAAAAGACC AGGTTTCCCC ATGCTGTTGA AGCAGGCAGC ATGACTAGGA 32341 TGGTTCAATC TCCACAGCAT GGTCAAGGCA GGCTGCCGGG GCCCTCCCGC TAGGGCACCC 32401 ATGACCTGGC TCTCCCCCTT CCAGGAGGCT GTGAAGAGCG GCATTCACCG TACTGTCCAC 32461 GCCGGGGAGG TGGGCTCGGC CGAAGTAGTA AAAGAGGTGA GGGCCTGGGC TGGCCATGGG 32521 GTCCCTCCTC ACTGCCTCCT CCCATACTTG GCTCTATTCT GCTTCTCTAC AGGCTGTGGA 32581 CATACTCAAG ACAGAGCGGC TGGGACACGG CTACCACACC CTGGAAGACC AGGCCCTTTA 32641 TAACAGGCTG CGGCAGGAAA ACATGCACTT CGAGGTAAGC GGGCCAGGGA GTGGGGAGGA 32701 ACCATCCCCG GCTGTCCCAA CTTCCTGTAT AGAGAGGCAG AAAGCAGGGC GGGTCCCAGG 32761 AACTCGAGGG GTGGCCCCAG GCCCAGACAT GGGGGGAGGA ATCAGCATGG CCTGGGGCCA 32821 TCCCTGCCAG CCACACACCT GCTCTTCCAG ATCTGCCCCT GGTCCAGCTA CCTCACTGGT 32881 GCCTGGAAGC CGGACACGGA GCATGCAGTC ATTCGGTGAG CTCTGTTCCC CTGGGCCTGT 32941 TCAATTTTGT TCCAGGAAGG CCAAAGAGGG AAGAAACTTT AGGGATTGGG CATCAGCCCA 33001 TGCCGCGTCT TTTAGATATG AAATCTCTTC GACACCCTGG GAAGCAGGCA TTGCCGTCCT 33061 CATCTTACAA ATGAGGAATC CGAGGCCCAG ATGTGCTGTG GCTTGACTGG GATTACCCAG 33121 CTGCTAACCA GCAGAGCTGG GGCCCTACAG CTCATCAGCT GGAGCAGAAC GCTCCATTAC 33181 TCTGAGGGAA GCTTCCACAC TTCCAATTCT CCCAACTCTG CCCCCTGGGC ATCGCATAGG 33241 AAGCAGGAGT CCCTCTGGCC AGCATGTTCT CTCTTCCTGA CACCTGGCCC TTGGGACCCC 33301 TGGGCATTCC CCTGAGCGCC ATCTTGAAGC TTTCCACCGG AGGTCTGTTC CACCCTGCCT 33361 GGCTCCCATC CTGGAGTCTA ACCAGGGTCA AGGCCCTCCT TCCGTCCTGT CGCCAAGCCA 33421 CAGGAGCAGT ATCAGGCCTT AGGAAAAAGC CGCCTTCCCC AAGACAAGGA CAGCAAGAAC 33481 TCAGGGTGAC CATGGTCAGG CCAGCACTTA TCCATCTGCC AGGCATATGA GAAGGGGAGG 33541 GGCTTCGGCT CTGATGTTCT GATGACAAGG GGGTCTTGGG GCTTGCTTAG GGACACGTGG 33601 CACCTGTGGA GGTTCTTGGA GGCATGTGGG TATACCATGG GCTGGAAAAA GATCCAGGAG 33661 TCATCTGCAC AGATATGGTG GCTGAAGGAG AAGCAGTGGC CCCAGGAGGT GGTGGAGCAA 33721 GAAGGGCCTA GGATAGAACC CAGAAGGACA ATGGTATTTA AGGGACCAGC AAAAGAGACA 33781 AGTAGGAGGA AAGTCAAAAG TGTGGTGTCA CAGAAATCCA GGGAAAAGGT TTCAAGAAAC 33841 AGTCAACAGT GTGAAATTCT GCTATGCAAG TCGATTATGG TCAGAGCTAG GAAAGATCCA 33901 TTAGATACAA CAAGATGGTG GTCAGGGATC GTGCCAAGAA CAGCTTCCAT GGTATGTTGG 33961 AGTAGCCAGC TCCCAGTGGG ACTGAGGAAC AAGCAGGGTA GGGTGCAGAG GGGAAGGCTG 34021 GAGAGGGTGG CAGCCGGAGG GGGATGTTGC TTTCTTGGCT CCCACCCCCA CGCCCCCACC 34081 GGCTGCCATT CTGCCTGGTT CCCATGTCTG GCCCCTCTGC TGCCTTTGCC CAGCTCTGGT 34141 CTTCAGGATG GGCTGGATTC TGGACTTTCT GGTTACATAG ACTTGAACAA GTCACCTAAG 34201 TTCTGAATTT ATTTCCCCCT CTGCACAAGG ATCAGATCTT TCAGATCTGT TTGAGGCTGC 34261 TGTGAGGATC AAAGGCGGGT GAACGTCAAT GTGTTCTGAC TATTTATGTA AGAGTAAAAG 34321 GAGGCTGATT CTCTCCTCCT CCCTCTTCTG CAGGCTCAAA AATGACCAGG CTAACTACTC 34381 GCTCAACACA GATGACCCGC TCATCTTCAA GTCCACCCTG GACACTGATT ACCAGATGAC 34441 CAAACGGGAC ATGGGCTTTA CTGAAGAGGA GTTTAAAAGG CTGGTGAGTG GGTGTGAGCC 34501 ATACTGGCCT TGACTCGGGT TTGGGAGTAT GGTATCTACA GGTCCAGTCC GGGGCCTGGA 34561 ATCTTTGGAG AGAGGGAGTG AGTCTGCCTC AACAGTCCAA GACAAGCCCA ACCTAGACAC 34621 TTTCCACAGA GAAGACATCT TTGTGTTGAC GTCCTGACCT AGGACCAGGT TTTTGATCCT 34681 TTGCTTGGGT TGAGTGCCTT TAAAGAATCC AGTGAAAGCT GTCAACCCTC TCCCCAGAAA 34741 GGTGTGTGCA GCAGCTATGA AGTCTTGCAC ACTCTCTTCA GGTTGTTCTT AAATCCCAGG 34801 CTGAATAAGT CCATTCCTGC ACGTGTCTGC GAGGTGTCTC TGGCCCCCTA CATGCCACCC 34861 TGTCTCTCAA AGGTTTCTCC AACTTCCTTC TCACAGCCCT TTTTCATGTA ATGACAAATT 34921 AAGAACACGA CCTCATGGTC TCTACTCTGG CACTTGCTGC CGTGTGACAG TGGACAAATC 34981 CTTCCCCCTC TAAGCGTATC TGCCCATGTT GAGTGAAGAG GATGGACTAT CACTACATTG 35041 CTAAGAGCTG CCTTCTTTGT TCTCTGGTTC CATGTTGTCT GCCATTCTGG CCTTTCCAGA 35101 ACATCAATGC GGCCAAATCT AGTTTCCTCC CAGAAGATGA AAAGAGGGAG CTTCTCGACC 35161 TGCTCTATAA AGCCTATGGG ATGCCACCTT CAGCCTCTGC AGGTAGGTTC CTGTCTGGGC 35221 TTCTGGGCAG TTGCCTGTCC TGGCCCCAGT GTGGCTTTCT GTGGGACTTC TAGCAAGATG 35281 CCCTTCCATT CTTGGGCAGC GCATGAATGT GTGATGACTC CCTGGTTTCT GGGCCCTGGC 35341 TGGGAGCAGC GTCTCATTAG ATCGGTTTGT TTTCTATAAA AGTTCTTGAG AGGCTGTTCT 35401 AAGGGGAGAC TTTCTGAAGC CCAGTCCCAA AGGTCTGGGC AGTTGGGGAC ACCTCCATGG 35461 CTGCCCAAAG CCAAGGGCAG GGAGAGGGGC CCAGGCTGTT CTGCTCCTTT CTTCCTATGT 35521 GGTCTTGGCA AGGCATCTTC TTGCCATCAT AGGAAGGAGT TCCTTTCTGG TTCTGGTGTT 35581 CTATGATTTT TACAACATCC TGGGTACTAC AAGTTGCCTG ATCTTTTTGC TTCTCTGAAC 35641 CAACGAGCAG GGCAGAACCT CTGAAGACGC CACTCCTCCA AGCCTTCACC CTGTGGAGTC 35701 ACCCCAACTC TGTGGGGCTG AGCAACATTT TTACATTTAT TCCTTCCAAG AAGACCATGA 35761 TCTCAATAGT CAGTTACTGA TGCTCCTGAA CCCTATGTGT CCATTTCTGC ACACACGTAT 35821 ACCTCGGCAT GGCCGCGTCA CTTCTCTGAT TATGTGCCCT GGCCAGGGAC CAGCGCCCTT 35881 GCACATGGGC ATGGTTGAAT CTGAAACCCT CCTTCTGTGG CAACTTGTAC TGAAAATCTG 35941 GTGCTCAATA AAGAAGCCCA TGGCTGGTGG CATGCAGCAG GTGGCATGTA ATTTGGTGGT 36001 CTTGGGCGGG CCGATGTGGG CAGGATGAGC ATGGAGGGAG CTGGGTCAGC CTGCTCAGCA 36061 GCAGGGCCTG AGCCTAAGGG TGGCTGTGAA TGCCAGGCCA GAGATCCCAA TGCTGTGGGC 36121 CAAGAGGGGT CCAGAGGCTG TCCTCCTTCC AGAAGAAATA AGGCTTCTCT GGTTGTTGCT 36181 CAAACATTCC CTGAACTCTC AGCCCCTCCT AACTCTAGGT TTTAAGGAGT AAAGCTTCCT 36241 TTTGGGTTCC TGAAGCTGGC AGTTGGGGTG AGAGCAGATG AGATGGAAGA GGGCTCATCA 36301 GACACTGGCC TTGGAGGGTG CTGGCCTCTG CAGAACGCCA GCATCTTCTC AGAATCGTAT 36361 GTTCTAGAAG CCTGGGCGAA GTCCGGCTAA TTGTGGACTT GGGGAAAATA AGGCCCAACC 36421 CCTGTTTTTG CAAGGTTAAG GAGAAATAAT CTTAAACCAG TCACACAAAT CATCGGCATT 36481 TATTTCCTGG GTCCTAGGTG TCACTTATCC TGGTGGACAG GGCAGAGGTG GTCAGATCGT 36541 TTTGAGCCAA AATCCCTTCC CTAAAAATGG ATCTGTGGAG CTCCATGAGG GAACCTCAGA 36601 GATGCACAAT GACAGTTTAG CTAAAATGGC TTAAAAAATG TGAATTGATT GTCAGCTCTC 36661 TCCATATCTG CTGAAAAAAG GTTTAAAATT TTTAAAAAGT TTAAAAGTGT TTTCTAAAAA 36721 AGGGACAAGC AGGTCTGGAC C // LOCUS HUMADH2E1 375 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human class I alcohol dehydrogenase (ADH2) beta-1 subunit, allele 1 gene, exon 1. ACCESSION M24308 M11831 K01883 KEYWORDS alcohol dehydrogenase; dehydrogenase. SEGMENT 1 of 9 SOURCE Human leukocyte DNA (library of Lawn et al.), clone lambda-ADH15. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 375) AUTHORS Duester,G., Smith,M., Bilanchone,V. and Hatfield,G.W. TITLE Molecular analysis of the human class I alcohol dehydrogenase gene family and nucleotide sequence of the gene encoding the beta subunit JOURNAL J. Biol. Chem. 261, 2027-2033 (1986) STANDARD full staff_review COMMENT ADH is encoded by at least five genes that fall into three classes. Class I, which evolved as a multigene family, contains a large group of isoenzymes possessing various combinations of alpha, beta, and gamma subunits, coded for by three genes designated ADH1, ADH2, and ADH3, respectively. Class II contains the pi isoenzyme and class III contains the chi isoenzyme [1]. The majority of Orientals have the "atypical" 2-ADH-2 gene, producing an atypical beta-2 subunit instead of the beta-1 subunit, which is predominant in Caucasians [2]. The other human class I ADH1 beta subunit sequences are found under accession X03350 D00137 M24309-M24317 M21692 FEATURES from to/span description pept 343 + 360 alcohol dehydrogenase beta-1 subunit precursor (EC 1.1.1.1), exon 1 /nomgen="ADH2" /map="4q21-q25" /hgml_locus_uid="LR0050W" matp 346 + 360 alcohol dehydrogenase beta-1 subunit pre-msg 273 > 375 ADH2 mRNA IVS 361 > 375 ADH2 intron A BASE COUNT 137 A 55 C 87 G 96 T ORIGIN 1 bp upstream of EcoRI site; chromosome 4q21-q25. 1 GAATTCCAGA GGCCGGGGGG GGGTGGGAAG TGAGGAAAAG AGAAAGTGAT TACAATTTAT 61 CACTTTAACT TAATATTTAA ACTAATGAAA ACAAAATCTT ATCTAGAATT TGGAAGTCAA 121 TATTTTGATT GCTGGTTCAG TACCCTTTTA TCTGTTTTGA CAGTCTGGGA ATAATCCAGT 181 GGGTGTGGCT TAAAGACATA GATCACGTGT GGAATTGGAA TTGGATGTTA CACAAGCAAA 241 CAAAATAAAT ATCTGTGCAA TATATCTGCT TTATGCACTC AAGCAGAGAA GAAATCCACA 301 AAGACTCACA GTCTGCTGGT GGGCAGAGAA GACAGAAACG ACATGAGCAC AGCAGGAAAA 361 GTAAGCAAAA AATAT // LOCUS HUMADH2E2 132 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human class I alcohol dehydrogenase (ADH2) beta-1 subunit, allele 1 gene, exon 2. ACCESSION M24309 M11832 K01883 KEYWORDS alcohol dehydrogenase; dehydrogenase. SEGMENT 2 of 9 SOURCE Human leukocyte DNA (library of Lawn et al.), clone lambda-ADH15. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 132) AUTHORS Duester,G., Smith,M., Bilanchone,V. and Hatfield,G.W. TITLE Molecular analysis of the human class I alcohol dehydrogenase gene family and nucleotide sequence of the gene encoding the beta subunit JOURNAL J. Biol. Chem. 261, 2027-2033 (1986) STANDARD full staff_review COMMENT The cDNA sequence for this protein is found under accession number M24317. FEATURES from to/span description pept + 16 + 117 alcohol dehydrogenase beta-1 subunit precursor, exon 2 /nomgen="ADH2" /map="4q21-q25" /hgml_locus_uid="LR0050W" matp + 16 + 117 alcohol dehydrogenase beta-1 subunit pre-msg < 1 > 132 ADH2 mRNA IVS < 1 15 ADH2 intron A IVS 118 > 132 ADH2 intron B BASE COUNT 36 A 21 C 32 G 43 T ORIGIN About 2.8 kb after segment 1; chromosome 4q21-q25. 1 TGTTTGATTT TCTAGGTAAT CAAATGCAAA GCAGCTGTGC TATGGGAGGT AAAGAAACCC 61 TTTTCCATTG AGGATGTGGA GGTTGCACCT CCTAAGGCTT ATGAAGTTCG CATTAAGGTG 121 AAATATTTTT CC // LOCUS HUMADH2E3 169 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human class I alcohol dehydrogenase (ADH2) beta-1 subunit, allele 1 gene, exon 3. ACCESSION M24310 M11833 K01883 KEYWORDS alcohol dehydrogenase; dehydrogenase. SEGMENT 3 of 9 SOURCE Human leukocyte DNA (library of Lawn et al.), clone lambda-ADH15. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 169) AUTHORS Duester,G., Smith,M., Bilanchone,V. and Hatfield,G.W. TITLE Molecular analysis of the human class I alcohol dehydrogenase gene family and nucleotide sequence of the gene encoding the beta subunit JOURNAL J. Biol. Chem. 261, 2027-2033 (1986) STANDARD full staff_review REFERENCE 2 (bases 29 to 48) AUTHORS Xu,Y., Carr,L.G., Bosron,W.F., Li,T.-K. and Edenberg,H.J. TITLE Genotyping of human alcohol dehydrogenases at the ADH2 and ADH3 loci following DNA sequence amplification JOURNAL Genomics 2, 209-214 (1988) STANDARD simple staff_entry COMMENT The cDNA sequence for this protein is found under accession number M24317. FEATURES from to/span description pept + 16 + 154 alcohol dehydrogenase beta-1 subunit precursor, exon 3 /nomgen="ADH2" /map="4q21-q25" /hgml_locus_uid="LR0050W" matp + 16 + 154 alcohol dehydrogenase beta-1 subunit pre-msg < 1 > 169 ADH2 mRNA IVS < 1 15 ADH2 intron B IVS 155 > 169 ADH2 intron C allele 38 38 g in ADH2-1; a in ADH2-2 BASE COUNT 38 A 38 C 51 G 42 T ORIGIN About 0.6 kb after segment 2; chromosome 4q21-q25. 1 CTCTTTATTC TGTAGATGGT GGCTGTAGGA ATCTGTCGCA CAGATGACCA CGTGGTTAGT 61 GGCAACCTGG TGACCCCCCT TCCTGTGATT TTAGGCCATG AGGCAGCCGG CATCGTGGAG 121 AGTGTTGGAG AAGGGGTGAC TACAGTCAAA CCAGGTACAG GATTCACAC // LOCUS HUMADH2E4 118 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human class I alcohol dehydrogenase (ADH2) beta-1 subunit, allele 1 gene, exon 4. ACCESSION M24311 M11834 K01883 KEYWORDS alcohol dehydrogenase; dehydrogenase. SEGMENT 4 of 9 SOURCE Human leukocyte DNA (library of Lawn et al.), clone lambda-ADH15. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 118) AUTHORS Duester,G., Smith,M., Bilanchone,V. and Hatfield,G.W. TITLE Molecular analysis of the human class I alcohol dehydrogenase gene family and nucleotide sequence of the gene encoding the beta subunit JOURNAL J. Biol. Chem. 261, 2027-2033 (1986) STANDARD full staff_review COMMENT The cDNA sequence for this protein is found under accession number M24317. FEATURES from to/span description pept + 16 + 103 alcohol dehydrogenase beta-1 subunit precursor, exon 4 /nomgen="ADH2" /map="4q21-q25" /hgml_locus_uid="LR0050W" matp + 16 + 103 alcohol dehydrogenase beta-1 subunit pre-msg < 1 > 118 ADH2 mRNA IVS < 1 15 ADH2 intron C IVS 104 > 118 ADH2 intron D BASE COUNT 33 A 25 C 25 G 35 T ORIGIN About 1.7 kb after segment 3; chromosome 4q21-q25. 1 TTTATTCCTC TCCAGGTGAT AAAGTCATCC CGCTCTTTAC TCCTCAGTGT GGAAAATGCA 61 GAGTTTGTAA AAACCCGGAG AGCAACTACT GCTTGAAAAA TGAGTAGGTT TCTGATGC // LOCUS HUMADH2E5 250 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human class I alcohol dehydrogenase (ADH2) beta-1 subunit, allele 1 gene, exon 5. ACCESSION M24312 M11835 K01883 KEYWORDS alcohol dehydrogenase; dehydrogenase. SEGMENT 5 of 9 SOURCE Human leukocyte DNA (library of Lawn et al.), clone lambda-ADH15. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 250) AUTHORS Duester,G., Smith,M., Bilanchone,V. and Hatfield,G.W. TITLE Molecular analysis of the human class I alcohol dehydrogenase gene family and nucleotide sequence of the gene encoding the beta subunit JOURNAL J. Biol. Chem. 261, 2027-2033 (1986) STANDARD full staff_review COMMENT The cDNA sequence for this protein is found under accession number M24317. FEATURES from to/span description pept + 16 + 235 alcohol dehydrogenase beta-1 subunit precursor, exon 5 /nomgen="ADH2" /map="4q21-q25" /hgml_locus_uid="LR0050W" matp + 16 + 235 alcohol dehydrogenase beta-1 subunit pre-msg < 1 > 250 ADH2 mRNA IVS < 1 15 ADH2 intron D IVS 236 > 250 ADH2 intron E BASE COUNT 55 A 67 C 72 G 56 T ORIGIN About 67 bp after segment 4; chromosome 4q21-q25. 1 TATTGCACTG CCCAGTCTAG GCAATCCTCG GGGGACCCTG CAGGATGGCA CCAGGAGGTT 61 CACCTGCAGG GGGAAGCCCA TTCACCACTT CCTTGGCACC AGCACCTTCT CCCAGTACAC 121 GGTGGTGGAT GAGAATGCAG TGGCCAAAAT TGATGCAGCC TCGCCCCTGG AGAAAGTCTG 181 CCTCATTGGC TGTGGATTCT CGACTGGTTA TGGGTCTGCA GTTAACGTTG CCAAGGTGAG 241 AATGACAATG // LOCUS HUMADH2E6 291 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human class I alcohol dehydrogenase (ADH2) beta-1 subunit, allele 1 gene, exon 6. ACCESSION M24313 M11836 K01883 KEYWORDS alcohol dehydrogenase; dehydrogenase. SEGMENT 6 of 9 SOURCE Human leukocyte DNA (library of Lawn et al.), clone lambda-ADH15. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 291) AUTHORS Duester,G., Smith,M., Bilanchone,V. and Hatfield,G.W. TITLE Molecular analysis of the human class I alcohol dehydrogenase gene family and nucleotide sequence of the gene encoding the beta subunit JOURNAL J. Biol. Chem. 261, 2027-2033 (1986) STANDARD full staff_review COMMENT The cDNA sequence for this protein is found under accession number M24317. FEATURES from to/span description pept + 16 + 276 alcohol dehydrogenase beta-1 subunit precursor, exon 6 /nomgen="ADH2" /map="4q21-q25" /hgml_locus_uid="LR0050W" matp + 16 + 276 alcohol dehydrogenase beta-1 subunit pre-msg < 1 > 291 ADH2 mRNA IVS < 1 15 ADH2 intron E IVS 277 > 291 ADH2 intron F BASE COUNT 79 A 63 C 80 G 69 T ORIGIN About 2.0 kb after segment 5; chromosome 4q21-25. 1 TTTTTGAAAA CACAGGTCAC CCCAGGCTCT ACCTGTGCTG TGTTTGGCCT GGGAGGGGTC 61 GGCCTATCTG CTGTTATGGG CTGTAAAGCA GCTGGAGCAG CCAGAATCAT TGCGGTGGAC 121 ATCAACAAGG ACAAATTTGC AAAGGCCAAA GAGTTGGGTG CCACTGAATG CATCAACCCT 181 CAAGACTACA AGAAACCCAT CCAGGAAGTG CTAAAGGAAA TGACTGATGG AGGTGTGGAT 241 TTTTCGTTTG AAGTCATCGG TCGGCTTGAC ACCATGGTAT GTACCATGAC A // LOCUS HUMADH2E7 166 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human class I alcohol dehydrogenase (ADH2) beta-1 subunit, allele 1 gene, exon 7. ACCESSION M24314 M11837 K01883 KEYWORDS alcohol dehydrogenase; dehydrogenase. SEGMENT 7 of 9 SOURCE Human leukocyte DNA (library of Lawn et al.), clone lambda-ADH15 [1]; adult liver, cDNA to mRNA (library of Orkin), clone pADH12 [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 40 to 151) AUTHORS Duester,G., Hatfield,G.W., Buehler,R., Hempel,J., Joernvall,H. and Smith,M. TITLE Molecular cloning and characterization of a cDNA for the beta subunit of human alcohol dehydrogenase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81, 4055-4059 (1984) STANDARD full staff_review REFERENCE 2 (bases 1 to 166) AUTHORS Duester,G., Smith,M., Bilanchone,V. and Hatfield,G.W. TITLE Molecular analysis of the human class I alcohol dehydrogenase gene family and nucleotide sequence of the gene encoding the beta subunit JOURNAL J. Biol. Chem. 261, 2027-2033 (1986) STANDARD full staff_review COMMENT The cDNA sequence for this protein is found under accession number M24317. FEATURES from to/span description pept + 16 + 151 alcohol dehydrogenase beta-1 subunit precursor, exon 7 /nomgen="ADH2" /map="4q21-q25" /hgml_locus_uid="LR0050W" matp + 16 + 151 alcohol dehydrogenase beta-1 subunit pre-msg < 1 > 166 ADH2 mRNA IVS < 1 15 ADH2 intron F IVS 152 > 166 ADH2 intron G BASE COUNT 32 A 40 C 44 G 50 T ORIGIN About 2.2 kb after segment 6; chromosome 4q21-q25. 1 TTCACTTTAT TCCAGATGGC TTCCCTGTTA TGTTGTCATG AGGCATGTGG CACAAGCGTC 61 ATCGTAGGGG TACCTCCTGC TTCCCAGAAC CTCTCAATAA ACCCTATGCT GCTACTGACT 121 GGACGCACCT GGAAGGGGGC TGTTTATGGT GGTATGTAGT TGGGCT // LOCUS HUMADH2E8 169 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human class I alcohol dehydrogenase (ADH2) beta-1 subunit, allele 1 gene, exon 8. ACCESSION M24315 M11838 K01883 KEYWORDS alcohol dehydrogenase; dehydrogenase. SEGMENT 8 of 9 SOURCE Human leukocyte DNA (library of Lawn et al.), clone lambda-ADH15 [1]; adult liver, cDNA to mRNA (library of Orkin), clone pADH12 [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 16 to 154) AUTHORS Duester,G., Hatfield,G.W., Buehler,R., Hempel,J., Joernvall,H. and Smith,M. TITLE Molecular cloning and characterization of a cDNA for the beta subunit of human alcohol dehydrogenase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81, 4055-4059 (1984) STANDARD full staff_review REFERENCE 2 (bases 1 to 169) AUTHORS Duester,G., Smith,M., Bilanchone,V. and Hatfield,G.W. TITLE Molecular analysis of the human class I alcohol dehydrogenase gene family and nucleotide sequence of the gene encoding the beta subunit JOURNAL J. Biol. Chem. 261, 2027-2033 (1986) STANDARD full staff_review COMMENT The cDNA sequence for this protein is found under accession number M24317. FEATURES from to/span description pept + 16 + 154 alcohol dehydrogenase beta-1 subunit precursor, exon 8 /nomgen="ADH2" /map="4q21-q25" /hgml_locus_uid="LR0050W" matp + 16 + 154 alcohol dehydrogenase beta-1 subunit pre-msg < 1 > 169 ADH2 mRNA IVS < 1 15 ADH2 intron G IVS 155 > 169 ADH2 intron H BASE COUNT 51 A 25 C 34 G 59 T ORIGIN About 0.6 kb after segment 7; chromosome 4q21-q25. 1 CTGTCTTCTT TTCAGGCTTT AAGAGTAAAG AAGGTATCCC AAAACTTGTG GCTGATTTTA 61 TGGCTAAGAA GTTTTCACTG GATGCGTTAA TAACCCATGT TTTACCTTTT GAAAAAATAA 121 ATGAAGGATT TGACCTGCTT CACTCTGGGA AAAGGTAGAT TTTAAGTTA // LOCUS HUMADH2E9 700 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human class I alcohol dehydrogenase (ADH2) beta-1 subunit, allele 1 gene, exon 9. ACCESSION M24316 M11839 K01883 KEYWORDS alcohol dehydrogenase; dehydrogenase. SEGMENT 9 of 9 SOURCE Human leukocyte DNA (library of Lawn et al.), clone lambda-ADH15 [1]; adult liver, cDNA to mRNA (library of Orkin), clone pADH12 [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 16 to 631) AUTHORS Duester,G., Hatfield,G.W., Buehler,R., Hempel,J., Joernvall,H. and Smith,M. TITLE Molecular cloning and characterization of a cDNA for the beta subunit of human alcohol dehydrogenase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81, 4055-4059 (1984) STANDARD full staff_review REFERENCE 2 (bases 1 to 700) AUTHORS Duester,G., Smith,M., Bilanchone,V. and Hatfield,G.W. TITLE Molecular analysis of the human class I alcohol dehydrogenase gene family and nucleotide sequence of the gene encoding the beta subunit JOURNAL J. Biol. Chem. 261, 2027-2033 (1986) STANDARD full staff_review REFERENCE 3 (bases 11 to 30) AUTHORS Xu,Y., Carr,L.G., Bosron,W.F., Li,T.-K. and Edenberg,H.J. TITLE Genotyping of human alcohol dehydrogenases at the ADH2 and ADH3 loci following DNA sequence amplification JOURNAL Genomics 2, 209-214 (1988) STANDARD simple staff_entry COMMENT The cDNA sequence for this protein is found under accession number M24317. FEATURES from to/span description pept + 16 40 alcohol dehydrogenase beta-1 subunit precursor, exon 9 /nomgen="ADH2" /map="4q21-q25" /hgml_locus_uid="LR0050W" matp + 16 37 alcohol dehydrogenase beta-1 subunit pre-msg < 1 631 ADH2 mRNA IVS < 1 15 ADH2 intron H allele 20 20 c in ADH2-1; t in ADH2-3 [2] variant 92 92 a in DNA clone lambda-ADH15 [2]; g in cDNA clone pADH12 [1] variant 616 618 ccc in DNA clone lambda-ADH15 [2]; cc in cDNA clone pADH12 [1] BASE COUNT 211 A 137 C 125 G 227 T ORIGIN About 2.8 kb after segment 8; chromosome 4q21-q25. 1 TCTTTCCTAT TGCAGTATCC GTACCGTCCT GACGTTTTGA GGCAATAGAG ATGCCTTCCC 61 CTGTAGCAGT CTTCAGCCTC CTCTACCCTA CAAGATCTGG AGCAACAGCT AGGAAATATC 121 ATTAATTCAG CTCTTCAGAG ATGTTATCAA TAAATTACAC ATGGGGGCTT TCCAAAGAAA 181 TGGAAATTGA TGGGAAATTA TTTTTCAGGA AAATTTAAAA TTCAAGTGAG AAGTAAATAA 241 AGTGTTGAAC ATCAGCTGGG GAATTGAAGC CAACAAACCT TCCTTCTTAA CCATTCTACT 301 GTGTCACCTT TGCCATTGAG GAAAAATATT CCTGTGACTT CTTGCATTTT TGGTATCTTC 361 ATAATCTTTA GTCATCGAAT CCCAGTGGAG GGGACCCTTT TACTTGCCCT GAACATACAC 421 ATGCTGGGCC ATTGTGATTG AAGTCTTCTA ACTCTGTCTC AGTTTTCACT GTCGACATTT 481 TCCTTTTTCT AATAAAAATG TACCAAATCC CTGGGGTAAA AGCTAGGGTA AGGTAAAGGA 541 TAGACTCACA TTTACAAGTA GTGAAGGTCC AAGAGTTCTA AATACAGGAA ATTTCTTAGG 601 AACTCAAATA AAATGCCCCA CATTTTACTA CAGTAAATGG CAGTGTTTTT ATGACTTTTA 661 TACTCATTTC TTTATGGTCG ATATACAATT GATTTTTAAA // LOCUS HUMADPRT01 226 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human NAD+ ADP-ribosyltransferase (ADPRT) gene, exon 1. ACCESSION M29544 M22953 KEYWORDS ADP-D-ribosyltransferase; NAD+ ADP-ribosyltransferase. SEGMENT 1 of 23 SOURCE Human HeLa cell line, cDNA to mRNA, clones Hz[19,20,21,22,23,24]. and liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 207 to 226) AUTHORS Auer,B., Nagl,U., Herzog,H., Schneider,R. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase (polymerizing): Organization of the gene JOURNAL DNA 8, 575-580 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 216) AUTHORS Herzog,H. JOURNAL Unpublished (1989) Univ. Innsbruck, A-6020 Innsbruck, Austria STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [2] kindly submitted by H.Herzog, 07-MAR-1989. FEATURES from to/span description pept 97 + 216 NAD+ ADP-ribosyltransferase, exon 1 (EC 2.4.2.30; 5' end put.) /nomgen="PPOL" /map="1q41-q42" /hgml_locus_uid="LZ0154E" IVS 217 > 226 PPOL intron A BASE COUNT 38 A 58 C 91 G 39 T ORIGIN Chromosome 1q41-q42. 1 GTGCGGCTGG GTGAGCGCAC GCGAGGCGGC GAGGCGGCAG CGTGTTTCTA GGTCGTGGCG 61 TCGGGCTTCC GGAGCTTTGG CGGCAGCTAG GGGAGGATGG CGGAGTCTTC GGATAAGCTC 121 TATCGAGTCG AGTACGCCAA GAGCGGGCGC GCCTCTTGCA AGAAATGCAG CGAGAGCATC 181 CCCAAGGACT CGCTCCGGAT GGCCATCATG GTGCAGGTGC GGGCGC // LOCUS HUMADPRT02 186 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human NAD+ ADP-ribosyltransferase (ADPRT) gene, exon 2. ACCESSION M29545 M22953 KEYWORDS ADP-D-ribosyltransferase; NAD+ ADP-ribosyltransferase. SEGMENT 2 of 23 SOURCE Human HeLa cell line, cDNA to mRNA, clones Hz[19,20,21,22,23,24]. and liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 92 to 176) AUTHORS Herzog,H., Zabel,B.U., Schneider,R., Auer,B., Hirsch-Kauffmann,M. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase: Localization of the gene on chromosome 1q41-q42 and expression of an active human enzyme in Escherichia coli JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 3514-3518 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 20; 167 to 186) AUTHORS Auer,B., Nagl,U., Herzog,H., Schneider,R. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase (polymerizing): Organization of the gene JOURNAL DNA 8, 575-580 (1989) STANDARD full staff_entry REFERENCE 3 (bases 21 to 176) AUTHORS Herzog,H. JOURNAL Unpublished (1989) Univ. Innsbruck, A-6020 Innsbruck, Austria STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[3] kindly submitted by H.Herzog, 07-MAR-1989. FEATURES from to/span description pept + 11 + 176 NAD+ ADP-ribosyltransferase, exon 2 (EC 2.4.2.30; 5' end put.) /nomgen="PPOL" /map="1q41-q42" /hgml_locus_uid="LZ0154E" IVS < 1 10 PPOL intron A IVS 177 > 186 PPOL intron B BASE COUNT 41 A 45 C 56 G 44 T ORIGIN About 5 kb after segment 1; chromosome 1q41-q42. 1 TTCCTTCTAG TCGCCCATGT TTGATGGAAA AGTCCCACAC TGGTACCACT TCTCCTGCTT 61 CTGGAAGGTG GGCCACTCCA TCCGGCACCC TGACGTTGAG GTGGATGGGT TCTCTGAGCT 121 TCGGTGGGAT GACCAGCAGA AAGTCAAGAA GACAGCGGAA GCTGGAGGAG TGACAGGTGT 181 GTACAT // LOCUS HUMADPRT03 136 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human NAD+ ADP-ribosyltransferase (ADPRT) gene, exon 3. ACCESSION M29766 M22953 KEYWORDS ADP-D-ribosyltransferase; NAD+ ADP-ribosyltransferase. SEGMENT 3 of 23 SOURCE Human HeLa cell line, cDNA to mRNA, clones Hz[19,20,21,22,23,24]. and liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 11 to 126) AUTHORS Herzog,H., Zabel,B.U., Schneider,R., Auer,B., Hirsch-Kauffmann,M. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase: Localization of the gene on chromosome 1q41-q42 and expression of an active human enzyme in Escherichia coli JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 3514-3518 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 20; 117 to 136) AUTHORS Auer,B., Nagl,U., Herzog,H., Schneider,R. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase (polymerizing): Organization of the gene JOURNAL DNA 8, 575-580 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Herzog, 07-MAR-1989. FEATURES from to/span description pept + 11 + 126 NAD+ ADP-ribosyltransferase, exon 3 (EC 2.4.2.30; 5' end put.) /nomgen="PPOL" /map="1q41-q42" /hgml_locus_uid="LZ0154E" IVS < 1 10 PPOL intron B IVS 127 > 136 PPOL intron C BASE COUNT 45 A 21 C 44 G 26 T ORIGIN About 8 kb after segment 2; chromosome 1q41-q42. 1 GATTCTCCAG GCAAAGGCCA GGATGGAATT GGTAGCAAGG CAGAGAAGAC TCTGGGTGAC 61 TTTGCAGCAG AGTATGCCAA GTCCAACAGA AGTACGTGCA AGGGGTGTAT GGAGAAGATA 121 GAAAAGGTAA GATCTT // LOCUS HUMADPRT04 235 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human NAD+ ADP-ribosyltransferase (ADPRT) gene, exon 4. ACCESSION M29767 M22953 KEYWORDS ADP-D-ribosyltransferase; NAD+ ADP-ribosyltransferase. SEGMENT 4 of 23 SOURCE Human HeLa cell line, cDNA to mRNA, clones Hz[19,20,21,22,23,24]. and liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 11 to 225) AUTHORS Herzog,H., Zabel,B.U., Schneider,R., Auer,B., Hirsch-Kauffmann,M. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase: Localization of the gene on chromosome 1q41-q42 and expression of an active human enzyme in Escherichia coli JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 3514-3518 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 20; 216 to 235) AUTHORS Auer,B., Nagl,U., Herzog,H., Schneider,R. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase (polymerizing): Organization of the gene JOURNAL DNA 8, 575-580 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Herzog, 07-MAR-1989. FEATURES from to/span description pept + 11 + 225 NAD+ ADP-ribosyltransferase, exon 4 (EC 2.4.2.30; 5' end put.) /nomgen="PPOL" /map="1q41-q42" /hgml_locus_uid="LZ0154E" IVS < 1 10 PPOL intron C IVS 226 > 235 PPOL intron D BASE COUNT 57 A 57 C 79 G 42 T ORIGIN About 1.6 kb after segment 3; chromosome 1q41-q42. 1 TTTCCTGCAG GGCCAGGTGC GCCTGTCCAA GAAGATGGTG GACCCGGAGA AGCCACAGCT 61 AGGCATGATT GACCGCTGGT ACCATCCAGG CTGCTTTGTC AAGAACAGGG AGGAGCTGGG 121 TTTCCGGCCC GAGTACAGTG CGAGTCAGCT CAAGGGCTTC AGCCTCCTTG CTACAGAGGA 181 TAAAGAAGCC CTGAAGAAGC AGCTCCCAGG AGTCAAGAGT GAAGGGTGGG TGGAG // LOCUS HUMADPRT05 120 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human NAD+ ADP-ribosyltransferase (ADPRT) gene, exon 5. ACCESSION M29768 M22953 KEYWORDS ADP-D-ribosyltransferase; NAD+ ADP-ribosyltransferase. SEGMENT 5 of 23 SOURCE Human HeLa cell line, cDNA to mRNA, clones Hz[19,20,21,22,23,24]. and liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 11 to 110) AUTHORS Herzog,H., Zabel,B.U., Schneider,R., Auer,B., Hirsch-Kauffmann,M. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase: Localization of the gene on chromosome 1q41-q42 and expression of an active human enzyme in Escherichia coli JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 3514-3518 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 20; 101 to 120) AUTHORS Auer,B., Nagl,U., Herzog,H., Schneider,R. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase (polymerizing): Organization of the gene JOURNAL DNA 8, 575-580 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Herzog, 07-MAR-1989. FEATURES from to/span description pept + 11 + 110 NAD+ ADP-ribosyltransferase, exon 5 (EC 2.4.2.30; 5' end put.) /nomgen="PPOL" /map="1q41-q42" /hgml_locus_uid="LZ0154E" IVS < 1 10 PPOL intron D IVS 111 > 120 PPOL intron E BASE COUNT 50 A 14 C 35 G 21 T ORIGIN About 1.2 kb after segment 4; chromosome 1q41-q42. 1 TCTCCTTCAG AAAGAGAAAA GGCGATGAGG TGGATGGAGT GGATGAAGTG GCGAAGAAGA 61 AATCTAAAAA AGAAAAAGAC AAGGATAGTA AGCTTGAAAA AGCCCTAAAG GTGAGTTCTC // LOCUS HUMADPRT06 137 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human NAD+ ADP-ribosyltransferase (ADPRT) gene, exon 6. ACCESSION M29769 M22953 KEYWORDS ADP-D-ribosyltransferase; NAD+ ADP-ribosyltransferase. SEGMENT 6 of 23 SOURCE Human HeLa cell line, cDNA to mRNA, clones Hz[19,20,21,22,23,24]. and liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 11 to 127) AUTHORS Herzog,H., Zabel,B.U., Schneider,R., Auer,B., Hirsch-Kauffmann,M. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase: Localization of the gene on chromosome 1q41-q42 and expression of an active human enzyme in Escherichia coli JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 3514-3518 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 20; 118 to 137) AUTHORS Auer,B., Nagl,U., Herzog,H., Schneider,R. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase (polymerizing): Organization of the gene JOURNAL DNA 8, 575-580 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Herzog, 07-MAR-1989. FEATURES from to/span description pept + 11 + 127 NAD+ ADP-ribosyltransferase, exon 6 (EC 2.4.2.30; 5' end put.) /nomgen="PPOL" /map="1q41-q42" /hgml_locus_uid="LZ0154E" IVS < 1 10 PPOL intron E IVS 128 > 137 PPOL intron F BASE COUNT 39 A 32 C 36 G 30 T ORIGIN About 1.6 kb after segment 5; chromosome 1q41-q42. 1 TTGTCCATAG GCTCAGAACG ACCTGATCTG GAACATCAAG GACGAGCTAA AGAAAGTGTG 61 TTCAACTAAT GACCTGAAGG AGCTACTCAT CTTCAACAAG CAGCAAGTGC CTTCTGGGGA 121 GTCGGCGGTG ACTGCCT // LOCUS HUMADPRT07 193 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human NAD+ ADP-ribosyltransferase (ADPRT) gene, exon 7. ACCESSION M29770 M22953 KEYWORDS ADP-D-ribosyltransferase; NAD+ ADP-ribosyltransferase. SEGMENT 7 of 23 SOURCE Human HeLa cell line, cDNA to mRNA, clones Hz[19,20,21,22,23,24]. and liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 7 to 183) AUTHORS Herzog,H., Zabel,B.U., Schneider,R., Auer,B., Hirsch-Kauffmann,M. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase: Localization of the gene on chromosome 1q41-q42 and expression of an active human enzyme in Escherichia coli JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 3514-3518 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 16; 174 to 193) AUTHORS Auer,B., Nagl,U., Herzog,H., Schneider,R. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase (polymerizing): Organization of the gene JOURNAL DNA 8, 575-580 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Herzog, 07-MAR-1989. FEATURES from to/span description pept + 7 + 183 NAD+ ADP-ribosyltransferase, exon 7 (EC 2.4.2.30; 5' end put.) /nomgen="PPOL" /map="1q41-q42" /hgml_locus_uid="LZ0154E" IVS < 1 6 PPOL intron F IVS 184 > 193 PPOL intron G BASE COUNT 44 A 49 C 60 G 40 T ORIGIN About 0.2 kb after segment 6; chromosome 1q41-q42. 1 CTGCAGATCT TGGACCGAGT AGCTGATGGC ATGGTGTTCG GTGCCCTCCT TCCCTGCGAG 61 GAATGCTCGG GTCAGCTGGT CTTCAAGAGC GATGCCTATT ACTGCACTGG GGACGTCACT 121 GCCTGGACCA AGTGTATGGT CAAGACACAG ACACCCAACC GGAAGGAGTG GGTAACCCCA 181 AAGGTAAGGG GTA // LOCUS HUMADPRT08 168 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human NAD+ ADP-ribosyltransferase (ADPRT) gene, exon 8. ACCESSION M29771 M22953 KEYWORDS ADP-D-ribosyltransferase; NAD+ ADP-ribosyltransferase. SEGMENT 8 of 23 SOURCE Human HeLa cell line, cDNA to mRNA, clones Hz[19,20,21,22,23,24]. and liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 11 to 158) AUTHORS Herzog,H., Zabel,B.U., Schneider,R., Auer,B., Hirsch-Kauffmann,M. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase: Localization of the gene on chromosome 1q41-q42 and expression of an active human enzyme in Escherichia coli JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 3514-3518 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 20; 149 to 168) AUTHORS Auer,B., Nagl,U., Herzog,H., Schneider,R. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase (polymerizing): Organization of the gene JOURNAL DNA 8, 575-580 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Herzog, 07-MAR-1989. FEATURES from to/span description pept + 11 + 158 NAD+ ADP-ribosyltransferase, exon 8 (EC 2.4.2.30; 5' end put.) /nomgen="PPOL" /map="1q41-q42" /hgml_locus_uid="LZ0154E" IVS < 1 10 PPOL intron G IVS 159 > 168 PPOL intron H BASE COUNT 40 A 54 C 34 G 40 T ORIGIN About 3.4 kb after segment 7; chromosome 1q41-q42. 1 TATTTTTTAG GAATTCCGAG AAATCTCTTA CCTCAAGAAA TTGAAGGTTA AAAAGCAGGA 61 CCGTATATTC CCCCCAGAAA CCAGCGCCTC CGTGGCGGCC ACGCCTCCGC CCTCCACAGC 121 CTCGGCTCCT GCTGCTGTGA ACTCCTCTGC TTCAGCAGGT ACAGCATT // LOCUS HUMADPRT09 161 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human NAD+ ADP-ribosyltransferase (ADPRT) gene, exon 9. ACCESSION M29772 M22953 KEYWORDS ADP-D-ribosyltransferase; NAD+ ADP-ribosyltransferase. SEGMENT 9 of 23 SOURCE Human HeLa cell line, cDNA to mRNA, clones Hz[19,20,21,22,23,24]. and liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 11 to 151) AUTHORS Herzog,H., Zabel,B.U., Schneider,R., Auer,B., Hirsch-Kauffmann,M. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase: Localization of the gene on chromosome 1q41-q42 and expression of an active human enzyme in Escherichia coli JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 3514-3518 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 20; 142 to 161) AUTHORS Auer,B., Nagl,U., Herzog,H., Schneider,R. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase (polymerizing): Organization of the gene JOURNAL DNA 8, 575-580 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Herzog, 07-MAR-1989. FEATURES from to/span description pept + 11 + 151 NAD+ ADP-ribosyltransferase, exon 9 (EC 2.4.2.30; 5' end put.) /nomgen="PPOL" /map="1q41-q42" /hgml_locus_uid="LZ0154E" IVS < 1 10 PPOL intron H IVS 152 > 161 PPOL intron I BASE COUNT 49 A 37 C 45 G 30 T ORIGIN About 1.3 kb after segment 8; chromosome 1q41-q42. 1 TTCATCCCAG ATAAGCCATT ATCCAACATG AAGATCCTGA CTCTCGGGAA GCTGTCCCGG 61 AACAAGGATG AAGTGAAGGC CATGATTGAG AAACTCGGGG GGAAGTTGAC GGGGACGGCC 121 AACAAGGCTT CCCTGTGCAT CAGCACCAAA AGTGAGTTGA A // LOCUS HUMADPRT10 258 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human NAD+ ADP-ribosyltransferase (ADPRT) gene, exon 10. ACCESSION M29773 M22953 KEYWORDS ADP-D-ribosyltransferase; NAD+ ADP-ribosyltransferase. SEGMENT 10 of 23 SOURCE Human HeLa cell line, cDNA to mRNA, clones Hz[19,20,21,22,23,24]. and liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 11 to 253) AUTHORS Herzog,H., Zabel,B.U., Schneider,R., Auer,B., Hirsch-Kauffmann,M. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase: Localization of the gene on chromosome 1q41-q42 and expression of an active human enzyme in Escherichia coli JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 3514-3518 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 20; 244 to 258) AUTHORS Auer,B., Nagl,U., Herzog,H., Schneider,R. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase (polymerizing): Organization of the gene JOURNAL DNA 8, 575-580 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Herzog, 07-MAR-1989. FEATURES from to/span description pept + 11 + 253 NAD+ ADP-ribosyltransferase, exon 10 (EC 2.4.2.30; 5' end put.) /nomgen="PPOL" /map="1q41-q42" /hgml_locus_uid="LZ0154E" IVS < 1 10 PPOL intron I IVS 254 > 258 PPOL intron J BASE COUNT 73 A 53 C 85 G 47 T ORIGIN About 1.0 kb after segment 9; chromosome 1q41-q42. 1 ATTTGAACAG AGGAGGTGGA AAAGATGAAT AAGAAGATGG AGGAAGTAAA GGAAGCCAAC 61 ATCCGAGTTG TGTCTGAGGA CTTCCTCCAG GACGTCTCCG CCTCCACCAA GAGCCTTCAG 121 GAGTTGTTCT TAGCGCACAT CTTGTCCCCT TGGGGGGCAG AGGTGAAGGC AGAGCCTGTT 181 GAAGTTGTGG CCCCAAGAGG GAAGTCAGGG GCTGCGCTCT CCAAAAAAAG CAAGGGCCAG 241 GTCAAGGAGG AAGGTGAG // LOCUS HUMADPRT11 89 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human NAD+ ADP-ribosyltransferase (ADPRT) gene, exon 11. ACCESSION M29774 M22953 KEYWORDS ADP-D-ribosyltransferase; NAD+ ADP-ribosyltransferase. SEGMENT 11 of 23 SOURCE Human HeLa cell line, cDNA to mRNA, clones Hz[19,20,21,22,23,24]. and liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 11 to 79) AUTHORS Herzog,H., Zabel,B.U., Schneider,R., Auer,B., Hirsch-Kauffmann,M. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase: Localization of the gene on chromosome 1q41-q42 and expression of an active human enzyme in Escherichia coli JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 3514-3518 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 20; 70 to 89) AUTHORS Auer,B., Nagl,U., Herzog,H., Schneider,R. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase (polymerizing): Organization of the gene JOURNAL DNA 8, 575-580 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Herzog, 07-MAR-1989. FEATURES from to/span description pept + 11 + 79 NAD+ ADP-ribosyltransferase, exon 11 (EC 2.4.2.30; 5' end put.) /nomgen="PPOL" /map="1q41-q42" /hgml_locus_uid="LZ0154E" IVS < 1 10 PPOL intron J IVS 80 > 89 PPOL intron K BASE COUNT 33 A 17 C 19 G 20 T ORIGIN About 0.6 kb after segment 10; chromosome 1q41-q42. 1 CTCCTCCAAG GTATCAACAA ATCTGAAAAG AGAATGAAAT TAACTCTTAA AGGAGGAGCA 61 GCTGTGGATC CTGATTCTGG TAAGCAAAC // LOCUS HUMADPRT12 153 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human NAD+ ADP-ribosyltransferase (ADPRT) gene, exon 12. ACCESSION M29775 M22953 KEYWORDS ADP-D-ribosyltransferase; NAD+ ADP-ribosyltransferase. SEGMENT 12 of 23 SOURCE Human HeLa cell line, cDNA to mRNA, clones Hz[19,20,21,22,23,24]. and liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 11 to 143) AUTHORS Herzog,H., Zabel,B.U., Schneider,R., Auer,B., Hirsch-Kauffmann,M. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase: Localization of the gene on chromosome 1q41-q42 and expression of an active human enzyme in Escherichia coli JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 3514-3518 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 20; 134 to 153) AUTHORS Auer,B., Nagl,U., Herzog,H., Schneider,R. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase (polymerizing): Organization of the gene JOURNAL DNA 8, 575-580 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Herzog, 07-MAR-1989. FEATURES from to/span description pept + 11 + 143 NAD+ ADP-ribosyltransferase, exon 12 (EC 2.4.2.30; 5' end put.) /nomgen="PPOL" /map="1q41-q42" /hgml_locus_uid="LZ0154E" IVS < 1 10 PPOL intron K IVS 144 > 153 PPOL intron L BASE COUNT 40 A 36 C 46 G 31 T ORIGIN About 0.6 kb after segment 11; chromosome 1q41-q42. 1 TCTGGGACAG GACTGGAACA CTCTGCGCAT GTCCTGGAGA AAGGTGGGAA GGTCTTCAGT 61 GCCACCCTTG GCCTGGTGGA CATCGTTAAA GGAACCAACT CCTACTACAA GCTGCAGCTT 121 CTGGAGGACG ACAAGGAAAA CAGGTGAGTT CTG // LOCUS HUMADPRT13 216 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human NAD+ ADP-ribosyltransferase (ADPRT) gene, exon 13. ACCESSION M29776 M22953 KEYWORDS ADP-D-ribosyltransferase; NAD+ ADP-ribosyltransferase. SEGMENT 13 of 23 SOURCE Human HeLa cell line, cDNA to mRNA, clones Hz[19,20,21,22,23,24]. and liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 11 to 151) AUTHORS Herzog,H., Zabel,B.U., Schneider,R., Auer,B., Hirsch-Kauffmann,M. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase: Localization of the gene on chromosome 1q41-q42 and expression of an active human enzyme in Escherichia coli JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 3514-3518 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 20; 197 to 216) AUTHORS Auer,B., Nagl,U., Herzog,H., Schneider,R. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase (polymerizing): Organization of the gene JOURNAL DNA 8, 575-580 (1989) STANDARD full staff_entry REFERENCE 3 (bases 152 to 206) AUTHORS Herzog,H. JOURNAL Unpublished (1989) Univ. Innsbruck, A-6020 Innsbruck, Austria STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1],[3] kindly submitted by H.Herzog, 07-MAR-1989. FEATURES from to/span description pept + 11 + 206 NAD+ ADP-ribosyltransferase, exon 13 (EC 2.4.2.30; 5' end put.) /nomgen="PPOL" /map="1q41-q42" /hgml_locus_uid="LZ0154E" IVS < 1 10 PPOL intron L IVS 207 > 216 PPOL intron M BASE COUNT 66 A 50 C 54 G 46 T ORIGIN About 1.8 kb after segment 12; chromosome 1q41-q42. 1 CCCACTGCAG GTATTGGATA TTCAGGTCCT GGGGCCGTGT GGGTACGGTG ATCGGTAGCA 61 ACAAACTGGA ACAGATGCCG TCCAAGGAGG ATGCCATTGA GCACTTCATG AAATTATATG 121 AAGAAAAAAC CGGGAACGCT TGGCACTCCA AAAATTTCAC GAAGTATCCC AAAAAGTTCT 181 ACCCCCTGGA GATTGACTAT GGCCAGGTAA CCACAT // LOCUS HUMADPRT14 149 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human NAD+ ADP-ribosyltransferase (ADPRT) gene, exon 14. ACCESSION M29777 M22953 KEYWORDS ADP-D-ribosyltransferase; NAD+ ADP-ribosyltransferase. SEGMENT 14 of 23 SOURCE Human HeLa cell line, cDNA to mRNA, clones Hz[19,20,21,22,23,24]. and liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 11 to 139) AUTHORS Herzog,H. JOURNAL Unpublished (1989) Univ. Innsbruck, A-6020 Innsbruck, Austria STANDARD full staff_review REFERENCE 2 (bases 1 to 20; 130 to 149) AUTHORS Auer,B., Nagl,U., Herzog,H., Schneider,R. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase (polymerizing): Organization of the gene JOURNAL DNA 8, 575-580 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Herzog, 07-MAR-1989. FEATURES from to/span description pept + 11 + 139 NAD+ ADP-ribosyltransferase, exon 14 (EC 2.4.2.30; 5' end put.) /nomgen="PPOL" /map="1q41-q42" /hgml_locus_uid="LZ0154E" IVS < 1 10 PPOL intron M IVS 140 > 149 PPOL intron N BASE COUNT 46 A 29 C 41 G 33 T ORIGIN About 1.6 kb after segment 13; chromosome 1q41-q42. 1 TTTCCTGAAG GATGAAGAGG CAGTGAAGAA GCTGACAGTA AATCCTGGCA CCAAGTCCAA 61 GCTCCCCAAG CCAGTTCAGG ACCTCATCAA GATGATCTTT GATGTGGAAA GTATGAAGAA 121 AGCCATGGTG GAGTATGAGG TTATTGCGC // LOCUS HUMADPRT15 104 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human NAD+ ADP-ribosyltransferase (ADPRT) gene, exon 15. ACCESSION M29778 M22953 KEYWORDS ADP-D-ribosyltransferase; NAD+ ADP-ribosyltransferase. SEGMENT 15 of 23 SOURCE Human HeLa cell line, cDNA to mRNA, clones Hz[19,20,21,22,23,24]. and liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 11 to 94) AUTHORS Herzog,H. JOURNAL Unpublished (1989) Univ. Innsbruck, A-6020 Innsbruck, Austria STANDARD full staff_review REFERENCE 2 (bases 1 to 20; 85 to 104) AUTHORS Auer,B., Nagl,U., Herzog,H., Schneider,R. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase (polymerizing): Organization of the gene JOURNAL DNA 8, 575-580 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Herzog, 07-MAR-1989. FEATURES from to/span description pept + 11 + 94 NAD+ ADP-ribosyltransferase, exon 15 (EC 2.4.2.30; 5' end put.) /nomgen="PPOL" /map="1q41-q42" /hgml_locus_uid="LZ0154E" IVS < 1 10 PPOL intron N IVS 95 > 104 PPOL intron O BASE COUNT 28 A 30 C 28 G 18 T ORIGIN About 3.6 kb after segment 14; chromosome 1q41-q42. 1 GTCTCCTCAG ATCGACCTTC AGAAGATGCC CTTGGGGAAG CTGAGCAAAA GGCAGATCCA 61 GGCCGCATAC TCCATCCTCA GTGAGGTCCA GCAGGTAAGC ACAG // LOCUS HUMADPRT16 143 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human NAD+ ADP-ribosyltransferase (ADPRT) gene, exon 16. ACCESSION M29779 M22953 KEYWORDS ADP-D-ribosyltransferase; NAD+ ADP-ribosyltransferase. SEGMENT 16 of 23 SOURCE Human HeLa cell line, cDNA to mRNA, clones Hz[19,20,21,22,23,24]. and liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 11 to 133) AUTHORS Herzog,H. JOURNAL Unpublished (1989) Univ. Innsbruck, A-6020 Innsbruck, Austria STANDARD full staff_review REFERENCE 2 (bases 1 to 20; 124 to 143) AUTHORS Auer,B., Nagl,U., Herzog,H., Schneider,R. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase (polymerizing): Organization of the gene JOURNAL DNA 8, 575-580 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Herzog, 07-MAR-1989. FEATURES from to/span description pept + 11 + 133 NAD+ ADP-ribosyltransferase, exon 16 (EC 2.4.2.30; 5' end put.) /nomgen="PPOL" /map="1q41-q42" /hgml_locus_uid="LZ0154E" IVS < 1 10 PPOL intron O IVS 134 > 143 PPOL intron P BASE COUNT 31 A 47 C 34 G 31 T ORIGIN About 2.5 kb after segment 15; chromosome 1q41-q42. 1 TCTCCTCCAG GCGGTGTCTC AGGGCAGCAG CGACTCTCAG ATCCTGGATC TCTCAAATCG 61 CTTTTACACC CTGATCCCCC ACGACTTTGG GATGAAGAAG CCTCCGCTCC TGAACAATGC 121 AGACAGTGTG CAGGTAGCGC ACC // LOCUS HUMADPRT17 149 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human NAD+ ADP-ribosyltransferase (ADPRT) gene, exon 17. ACCESSION M29780 M22953 KEYWORDS ADP-D-ribosyltransferase; NAD+ ADP-ribosyltransferase. SEGMENT 17 of 23 SOURCE Human HeLa cell line, cDNA to mRNA, clones Hz[19,20,21,22,23,24]. and liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 11 to 139) AUTHORS Herzog,H. JOURNAL Unpublished (1989) Univ. Innsbruck, A-6020 Innsbruck, Austria STANDARD full staff_review REFERENCE 2 (bases 1 to 20; 130 to 149) AUTHORS Auer,B., Nagl,U., Herzog,H., Schneider,R. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase (polymerizing): Organization of the gene JOURNAL DNA 8, 575-580 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Herzog, 07-MAR-1989. FEATURES from to/span description pept + 11 + 139 NAD+ ADP-ribosyltransferase, exon 17 (EC 2.4.2.30; 5' end put.) /nomgen="PPOL" /map="1q41-q42" /hgml_locus_uid="LZ0154E" IVS < 1 10 PPOL intron P IVS 140 > 149 PPOL intron Q BASE COUNT 42 A 34 C 43 G 30 T ORIGIN About 0.5 kb after segment 16; chromosome 1q41-q42. 1 GCTCCTCCAG GCCAAGGTGG AAATGCTTGA CAACCTGCTG GACATCGAGG TGGCCTACAG 61 TCTGCTCAGG GGAGGGTCTG ATGATAGCAG CAAGGATCCC ATCGATGTCA ACTATGAGAA 121 GCTCAAAACT GACATTAAGG TAACAGGGT // LOCUS HUMADPRT18 119 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human NAD+ ADP-ribosyltransferase (ADPRT) gene, exon 18. ACCESSION M29781 M22953 KEYWORDS ADP-D-ribosyltransferase; NAD+ ADP-ribosyltransferase. SEGMENT 18 of 23 SOURCE Human HeLa cell line, cDNA to mRNA, clones Hz[19,20,21,22,23,24]. and liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 11 to 109) AUTHORS Herzog,H. JOURNAL Unpublished (1989) Univ. Innsbruck, A-6020 Innsbruck, Austria STANDARD full staff_review REFERENCE 2 (bases 1 to 20; 100 to 119) AUTHORS Auer,B., Nagl,U., Herzog,H., Schneider,R. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase (polymerizing): Organization of the gene JOURNAL DNA 8, 575-580 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Herzog, 07-MAR-1989. FEATURES from to/span description pept + 11 + 109 NAD+ ADP-ribosyltransferase, exon 18 (EC 2.4.2.30; 5' end put.) /nomgen="PPOL" /map="1q41-q42" /hgml_locus_uid="LZ0154E" IVS < 1 10 PPOL intron Q IVS 110 > 119 PPOL intron R BASE COUNT 40 A 23 C 28 G 28 T ORIGIN About 0.6 kb after segment 17; chromosome 1q41-q42. 1 GTTTATCCAG GTGGTTGACA GAGATTCTGA AGAAGCCGAG ATCATCAGGA AGTATGTTAA 61 GAACACTCAT GCAACCACAC ACAATGCGTA TGACTTGGAA GTCATCGATG TAAGAATCC // LOCUS HUMADPRT19 173 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human NAD+ ADP-ribosyltransferase (ADPRT) gene, exon 19. ACCESSION M29782 M22953 KEYWORDS ADP-D-ribosyltransferase; NAD+ ADP-ribosyltransferase. SEGMENT 19 of 23 SOURCE Human HeLa cell line, cDNA to mRNA, clones Hz[19,20,21,22,23,24]. and liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 11 to 163) AUTHORS Herzog,H. JOURNAL Unpublished (1989) Univ. Innsbruck, A-6020 Innsbruck, Austria STANDARD full staff_review REFERENCE 2 (bases 1 to 20; 154 to 173) AUTHORS Auer,B., Nagl,U., Herzog,H., Schneider,R. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase (polymerizing): Organization of the gene JOURNAL DNA 8, 575-580 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Herzog, 07-MAR-1989. FEATURES from to/span description pept + 11 + 163 NAD+ ADP-ribosyltransferase, exon 19 (EC 2.4.2.30; 5' end put.) /nomgen="PPOL" /map="1q41-q42" /hgml_locus_uid="LZ0154E" IVS < 1 10 PPOL intron R IVS 164 > 173 PPOL intron S BASE COUNT 39 A 50 C 46 G 38 T ORIGIN About 1.8 kb after segment 18; chromosome 1q41-q42. 1 CTCCAACCAG ATCTTTAAGA TAGAGCGTGA AGGCGAATGC CAGCGTTACA AGCCCTTTAA 61 GCAGCTTCAT AACCGAAGAT TGCTGTGGCA CGGGTCCAGG ACCACCAACT TTGCTGGGAT 121 CCTGTCCCAG GGTCTTCGGA TAGCCCCGCC TGAAGCGCCC GTGGTATGTG CCT // LOCUS HUMADPRT20 148 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human NAD+ ADP-ribosyltransferase (ADPRT) gene, exon 20. ACCESSION M29783 M22953 KEYWORDS ADP-D-ribosyltransferase; NAD+ ADP-ribosyltransferase. SEGMENT 20 of 23 SOURCE Human HeLa cell line, cDNA to mRNA, clones Hz[19,20,21,22,23,24]. and liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 11 to 138) AUTHORS Herzog,H. JOURNAL Unpublished (1989) Univ. Innsbruck, A-6020 Innsbruck, Austria STANDARD full staff_review REFERENCE 2 (bases 1 to 20; 129 to 148) AUTHORS Auer,B., Nagl,U., Herzog,H., Schneider,R. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase (polymerizing): Organization of the gene JOURNAL DNA 8, 575-580 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Herzog, 07-MAR-1989. FEATURES from to/span description pept + 11 + 138 NAD+ ADP-ribosyltransferase, exon 20 (EC 2.4.2.30; 5' end put.) /nomgen="PPOL" /map="1q41-q42" /hgml_locus_uid="LZ0154E" IVS < 1 10 PPOL intron S IVS 139 > 148 PPOL intron T BASE COUNT 40 A 34 C 37 G 37 T ORIGIN About 0.8 kb after segment 19; chromosome 1q41-q42. 1 GACCTTCCAG ACAGGCTACA TGTTTGGTAA AGGGATCTAT TTCGCTGACA TGGTCTCCAA 61 GAGTGCCAAC TACTGCCATA CGTCTCAGGG AGACCCAATA GGCTTAATCC TGTTGGGAGA 121 AGTTGCCCTT GGAAACATGT GAGTAAAC // LOCUS HUMADPRT21 82 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human NAD+ ADP-ribosyltransferase (ADPRT) gene, exon 21. ACCESSION M29784 M22953 KEYWORDS ADP-D-ribosyltransferase; NAD+ ADP-ribosyltransferase. SEGMENT 21 of 23 SOURCE Human HeLa cell line, cDNA to mRNA, clones Hz[19,20,21,22,23,24]. and liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 11 to 72) AUTHORS Herzog,H. JOURNAL Unpublished (1989) Univ. Innsbruck, A-6020 Innsbruck, Austria STANDARD full staff_review REFERENCE 2 (bases 1 to 20; 63 to 82) AUTHORS Auer,B., Nagl,U., Herzog,H., Schneider,R. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase (polymerizing): Organization of the gene JOURNAL DNA 8, 575-580 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Herzog, 07-MAR-1989. FEATURES from to/span description pept + 11 + 72 NAD+ ADP-ribosyltransferase, exon 21 (EC 2.4.2.30; 5' end put.) /nomgen="PPOL" /map="1q41-q42" /hgml_locus_uid="LZ0154E" IVS < 1 10 PPOL intron T IVS 73 > 82 PPOL intron U BASE COUNT 28 A 15 C 19 G 20 T ORIGIN About 1.0 kb after segment 20; chromosome 1q41-q42. 1 TTTTGAATAG GTATGAACTG AAGCACGCTT CACATATCAG CAAGTTACCC AAGGGCAAGC 61 ACAGTGTCAA AGGTAAAGTG TT // LOCUS HUMADPRT22 135 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human NAD+ ADP-ribosyltransferase (ADPRT) gene, exon 22. ACCESSION M29785 M22953 KEYWORDS ADP-D-ribosyltransferase; NAD+ ADP-ribosyltransferase. SEGMENT 22 of 23 SOURCE Human HeLa cell line, cDNA to mRNA, clones Hz[19,20,21,22,23,24]. and liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 11 to 125) AUTHORS Herzog,H. JOURNAL Unpublished (1989) Univ. Innsbruck, A-6020 Innsbruck, Austria STANDARD full staff_review REFERENCE 2 (bases 1 to 20; 116 to 135) AUTHORS Auer,B., Nagl,U., Herzog,H., Schneider,R. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase (polymerizing): Organization of the gene JOURNAL DNA 8, 575-580 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Herzog, 07-MAR-1989. FEATURES from to/span description pept + 11 + 125 NAD+ ADP-ribosyltransferase, exon 22 (EC 2.4.2.30; 5' end put.) /nomgen="PPOL" /map="1q41-q42" /hgml_locus_uid="LZ0154E" IVS < 1 10 PPOL intron U IVS 126 > 135 PPOL intron V BASE COUNT 29 A 31 C 29 G 46 T ORIGIN About 0.5 kb after segment 21; chromosome 1q41-q42. 1 TGTTTTTCAG GTTTGGGCAA AACTACCCCT GATCCTTCAG CTAACATTAG TCTGGATGGT 61 GTAGACGTTC CTCTTGGGAC CGGGATTTCA TCTGGTGTGA ATGACACCTC TCTACTATAT 121 AACGAGTATC CTTTC // LOCUS HUMADPRT23 857 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human NAD+ ADP-ribosyltransferase (ADPRT) gene, exon 23. ACCESSION M29786 M22953 KEYWORDS ADP-D-ribosyltransferase; NAD+ ADP-ribosyltransferase. SEGMENT 23 of 23 SOURCE Human HeLa cell line, cDNA to mRNA, clones Hz[19,20,21,22,23,24]. and liver DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 11 to 857) AUTHORS Herzog,H. JOURNAL Unpublished (1989) Univ. Innsbruck, A-6020 Innsbruck, Austria STANDARD full staff_review REFERENCE 2 (bases 1 to 20) AUTHORS Auer,B., Nagl,U., Herzog,H., Schneider,R. and Schweiger,M. TITLE Human nuclear NAD+ ADP-ribosyltransferase (polymerizing): Organization of the gene JOURNAL DNA 8, 575-580 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by H.Herzog, 07-MAR-1989. FEATURES from to/span description pept + 11 92 NAD+ ADP-ribosyltransferase, exon 23 (EC 2.4.2.30; 5' end put.) /nomgen="PPOL" /map="1q41-q42" /hgml_locus_uid="LZ0154E" IVS < 1 10 PPOL intron V BASE COUNT 234 A 157 C 173 G 293 T ORIGIN About 0.4 kb after segment 22; chromosome 1q41-q42. 1 TTCCCCTCAG GTACATTGTC TATGATATTG CTCAGGTAAA TCTGAAGTAT CTGCTGAAAC 61 TGAAATTCAA TTTTAAGACC TCCCTGTGGT AATTGGGAGA GGTAGCCGAG TCACACCCGG 121 TGGCTCTGGT ATGAATTCAC CCGAAGCGCT TCTGCACCAA CTCACCTGGC CGCTAAGTTG 181 CTGATGGGTA GTACCTGTAC TAAACCACCT CAGAAAGGAT TTTACAGAAA CGTGTTAAAG 241 GTTTTCTCTA ACTTCTCAAG TCCCTTGTTT TGTGTTGTGT CTGTGGGGAG GGGTTGTTTT 301 GGGGTTGTTT TTGTTTTTTC TTGCCAGGTA GATAAAACTG ACATAGAGAA AAGGCTGGAG 361 AGAGATTCTG TTGCATAGAC TAGTCCTATG GAAAAAACCA AGCTTCGTTA GAATGTCTGC 421 CTTACTGGTT TCCCCAGGGA AGGAAAAATA CACTTCCACC CTTTTTTCTA AGTGTTCGTC 481 TTTAGTTTTG ATTTTGGAAA GATGTTAAGC ATTTATTTTT AGTTAAAAAT AAAAACTAAT 541 TTCATACTAT TTAGATTTTC TTTTTTATCT TGCACTTATT GTCCCCTTTT TAGTTTTTTT 601 TGTTTGCCTC TTGTGGTGAG GGGTGTGGGA AGACCAAAGG AAGGAACGCT AACAATTTCT 661 CATACTTAGA AACAAAAAGA GCTTTCCTTC TCCAGGAATA CTGAACATGG GAGCTCTTGA 721 AATATGTAGT ATTAAAAGTT GCATTTGAAA TTCTTGACTT TCTTATGGGC ACTTTTGTCT 781 TCCAAATTAA AACTCTACCA CAAATATACT TACCCAAGGG CTAATAGTAA TACTCGATTA 841 AAAATGCAGA TGCCTTC // LOCUS HUMAFP 22166 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human alpha-fetoprotein gene, complete cds. ACCESSION M16110 KEYWORDS Alu repetitive sequence; KpnI repetitive sequence; alpha-fetoprotein; repetitive sequence. SOURCE Human DNA, library of T.Maniatis, clones lambda-HAFP[26,33]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 22166) AUTHORS Gibbs,P.E.M., Zielinski,R., Boyd,C. and Dugaiczyk,A. TITLE Structure, polymorphism, and novel repeated DNA elements revealed by a complete sequence of the human alpha-fetoprotein gene JOURNAL Biochemistry 26, 1332-1343 (1987) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by A.Dugaiczyk, 17-JUN-1987. FEATURES from to/span description pept 915 999 alpha-fetoprotein precursor, exon 1 /nomgen="AFP" /map="4q11-q13" /hgml_locus_uid="LB0045G" 1812 1863 alpha-fetoprotein precursor, exon 2 2826 2958 alpha-fetoprotein precursor, exon 3 5246 5457 alpha-fetoprotein precursor, exon 4 6944 7076 alpha-fetoprotein precursor, exon 5 7995 8092 alpha-fetoprotein precursor, exon 6 9641 9770 alpha-fetoprotein precursor, exon 7 12046 12260 alpha-fetoprotein precursor, exon 8 13918 14050 alpha-fetoprotein precursor, exon 9 14619 14716 alpha-fetoprotein precursor, exon 10 15199 15337 alpha-fetoprotein precursor, exon 11 16985 17208 alpha-fetoprotein precursor, exon 12 18349 18481 alpha-fetoprotein precursor, exon 13 19820 19864 alpha-fetoprotein precursor, exon 14 sigp 915 971 alpha-fetoprotein signal peptide matp 972 999 alpha-fetoprotein 1812 1863 alpha-fetoprotein 2826 2958 alpha-fetoprotein 5246 5457 alpha-fetoprotein 6944 7076 alpha-fetoprotein 7995 8092 alpha-fetoprotein 9641 9770 alpha-fetoprotein 12046 12260 alpha-fetoprotein 13918 14050 alpha-fetoprotein 14619 14716 alpha-fetoprotein 15199 15337 alpha-fetoprotein 16985 17208 alpha-fetoprotein 18349 18481 alpha-fetoprotein 19820 19864 alpha-fetoprotein pre-msg 871 20359 AFP mRNA IVS 1000 1811 AFP cds intron A IVS 1864 2825 AFP cds intron B IVS 2959 5245 AFP cds intron C IVS 5458 6943 AFP cds intron D IVS 7077 7994 AFP cds intron E IVS 8093 9640 AFP cds intron F IVS 9771 12045 AFP cds intron G IVS 12261 13917 AFP cds intron H IVS 14051 14618 AFP cds intron I IVS 14717 15198 AFP cds intron J IVS 15338 16984 AFP cds intron K IVS 17209 18348 AFP cds intron L IVS 18482 19819 AFP cds intron M IVS 19875 20214 AFP mRNA intron N rpt 339 435 repetitive element X-1 rpt 564 664 repetitive element X-2 rpt 4911 4686 (c) Kpn repeat rpt 5918 5928 5' Alu flanking repeat rpt 5929 6232 Alu repeat copy A rpt 6233 6243 3' Alu flanking repeat rpt 11548 11850 Xba repeat copy A rpt 12822 13124 Xba repeat copy B rpt 21501 21491 (c) 3' Alu flanking repeat rpt 21631 21502 (c) Alu repeat copy B rpt 21642 21632 (c) 5' Alu flanking repeat rpt > 22166 21955 (c) Alu repeat copy C variant 6132 6133 aa in [1]; aaa in allele variant 6421 6423 ttc in [1]; tc in allele variant 6452 6454 aat in [1]; at in allele variant 6652 6652 a in [1]; g in allele variant 7421 7425 agaac in [1]; ac in allele variant 8403 8403 c in [1]; g in allele variant 10364 10364 c in [1]; g in allele BASE COUNT 7164 A 3625 C 4062 G 7315 T ORIGIN EcoRI site; chromosome 4q11-q13. 1 AATTCAAAAA GTTCCCCCAG TGATTCTAAT GTGTAGCCAA GATCGGGAAC CCTTGTAGAC 61 AGGGATGATA GGAGGTGAGC CACTCTTAGC ATCCATCATT TAGTATTAAC ATCATCATCT 121 TGAGTTGCTA AGTGAATGAT GCACCTGACC CACTTTATAA AGACACATGT GCAAATAAAA 181 TTATTATAGG ACTTGGTTTA TTAGGGCTTG TGCTCTAAGT TTTCTATGTT AAGCCATACA 241 TCGCATACTA AATACTTTAA AATGTACCTT ATTGACATAC ATATTAAGTG AAAAGTGTTT 301 CTGAGCTAAA CAATGACAGC ATAATTATCA AGCAATGATA ATTTGAAATG AATTTATTAT 361 TCTGCAACTT AGGGACAAGT CATCTCTCTG AATTTTTTGT ACTTTGAGAG TATTTGTTAT 421 ATTTGCAAGA TGAAGAGTCT GAATTGGTCA GACAATGTCT TGTGTGCCTG GCATATGATA 481 GGCATTTAAT AGTTTTAAAG AATTAATGTA TTTAGATGAA TTGCATACCA AATCTGCTGT 541 CTTTTCTTTA TGGCTTCATT AACTTAATTT GAGAGAAATT AATTATTCTG CAACTTAGGG 601 ACAAGTCATG TCTTTGAATA TTCTGTAGTT TGAGGAGAAT ATTTGTTATA TTTGCAAAAT 661 AAAATAAGTT TGCAAGTTTT TTTTTTCTGC CCCAAAGAGC TCTGTGTCCT TGAACATAAA 721 ATACAAATAA CCGCTATGCT GTTAATTATT GGCAAATGTC CCATTTTCAA CCTAAGGAAA 781 TACCATAAAG TAACAGATAT ACCAACAAAA GGTTACTAGT TAACAGGCAT TGCCTGAAAA 841 GAGTATAAAA GAATTTCAGC ATGATTTTCC ATATTGTGCT TCCACCACTG CCAATAACAA 901 AATAACTAGC AACCATGAAG TGGGTGGAAT CAATTTTTTT AATTTTCCTA CTAAATTTTA 961 CTGAATCCAG AACACTGCAT AGAAATGAAT ATGGAATAGG TGAGATATTT TGTGTTTTTC 1021 TTGTCTTTTC TCTATATCAA AATTTTTTAA ATTATAAAAT TTGCATTAAT TTGTCTTGAT 1081 TTATTATTCA TATTTATTAT TCCACATGGA GAAAAAATAT TTAACTGATG GATATATTTA 1141 AATGAAAGAA AAACTTGTAA CTTTACAAGA GGTTTACAAA GTTATAGCAG TGTTTAATGG 1201 ATGAATGGTT TGTATGTTTC ATGTTGAATT AATTTTTACA CTTCAATGGT ATGCATATTA 1261 ACTTTGAAAA ATTATATATA TACACATATA TGTACATATA TATGAATATA AATAAAATTT 1321 TATATGTGAA GAAGCCAGAA TTATGCTCCT TCACATAACT CCCTCAGACT AGTAAAATAG 1381 ATAAAATCTT TGTTTTTAAT ACAGAAAAAT GGGTCATTAT TTGATGGTCT GAAGAAGAAA 1441 TATTGTGACT GGGATATGAA TGGCAAACCG TAGTACAACT ATGTTCAAAA GAATGCCTGA 1501 AATATATTTT TAACCATTTG ACTTTCAGGA CAGTTACAGC ACTACAGTAC AGGGAAAAAC 1561 CAAACAACTG GAAGACAAAA TCTGGATTTT AGTGATAGGT CTACTATAAA TTATGCTTGT 1621 TAACTTCATT CCTTAGTTTC CTAGTTTTCT TTTCCTCAAG TATAAAATTA AGATGCTTAG 1681 GTTATCCCTA ATGTTCTTTT AATTCTGAAA CTGTACAGTT CTAACTGAAA CACAAACATT 1741 CATATGTAAC AATGATTACT TTCTTGGTTG CAGTTGAAAA CACGTTTCAT GAAGTTTATT 1801 TTGCCTTCCA GCTTCCATAT TGGATTCTTA CCAATGTACT GCAGAGATAA GTTTAGCTGA 1861 CCTGTAAGTT TTGCTTATAT AAATGTACTT TAAATGTGTA AAGCAAGGAT AAGTAAATAC 1921 TTAAATAAAA TTGGGTACCC CTGTGAGCTC TTAAAAGCAC AAAAGCAATT TGGACAATTT 1981 CAAGAAAAGT TACTCATACT GAATATCAAC TTGATGTTGA AGAGGTTAAA CTGTTGACTA 2041 ATGTCTTCGA CATTGACCTT TTGATTCCTT GAAATCTCAT GAGTCAAACC AAATCAGATT 2101 TTAGAAACTG AAGATTAGTG TCTGATCAGT GACAACCATA TACTAATTCA GGAATTTTTC 2161 TCATCAGTAC CAACAGGGTG ATATTATAAT GTTTTCTTTT CTGTATACTA TTTAAATCTT 2221 AGCAGCAAAC CATAGGTGAT AAAATATTCT ATTTGCTGTT ATTTGTGGAG AGTATGTTAG 2281 TCTCTTGGAT GTCTTTCCAT TCCACATTTT AAAAATTTCT AACAAAGAAT TTAAAGTAGT 2341 GTGTTGCTGT TACTCCTTGC ACATCCAAAC CTGCATAAGG ATTGCTTTGA GTCAATCCAT 2401 GAGCACTGTA GTCTTGGGTT TTAGACCTTG ATCATACTGG GAATAGACAC TGTTAGAGGT 2461 CTGTCTAATT ACCAATTTTT TTTTGCTTAA ATTTAAAAGT AACCATAAAG AATATAGATA 2521 CCCTCAATTA TGGGTACATT ACAGTAGATG GATGGTCACA GAAGGAGAAA CCACTCTTAT 2581 GGGAAATCCA CTTATTTTAG CCTTTAACAT CTATATGTAT ATTTATGGCA AAAGAAAACA 2641 AGAAAAAGAC TAAAGTTTCT TCTCAGATGA CCTGGAAGCT AATTTTACAT AATTTTACAA 2701 ATCAAATGTC TAAACAGATT ACAACATAAA TAGAAAACAA AACAAACAAA TGAAAAACTA 2761 TACTTGAGAA AAATAAGCTT GCTGCAGGTC TGTTCCTTAA GGATTCACAC GTATTTTTGT 2821 TTCAGGGCTA CCATATTTTT TGCCCAGTTT GTTCAAGAAG CCACTTACAA GGAAGTAAGC 2881 AAAATGGTGA AAGATGCATT GACTGCAATT GAGAAACCCA CTGGAGATGA ACAGTCTTCA 2941 GGGTGTTTAG AAAACCAGGT GAGTGAATAA TTTTAAAAAA GCATTGTGAT ATTTGACAAA 3001 AATTTAGCAT GCTGAAGAGA AGATACAAAA ATAGCAGTGA AAAATGCATT TAAATATTTG 3061 AAGAGCTATT GTATGAAAGA GGGATTAGAT TCATTCTGAA TTGCTAAAGA GGGCAGAAGA 3121 GAACAATAGG TAGTTATTAT AAAGAGACCA TATAAATATG ATGAACTAAG GTTCTGAAAT 3181 AAGATTATCT TGATGACTAT GGGCATATTA ACTTTTTTGA GCTTCAGTTT TCTTATCTGT 3241 AAAATAAGGG ATGATAATAG CTCCCATTTC ATAGTTAGCA TGGAAATTGA TATAACAGCA 3301 ATAGTAGCTA ACTTTTATTA TACACACAAT GTGACTGGCA TTATTCTAGG GAGCATAATG 3361 TGTATATTGA TAATAAAAAT ATTTTATGAC ATAGGGGATA GATAGCACTG ATGAATCAGA 3421 ATGGTTGTCC AGTGAGTCAA GAGATGCTGG CTCGGGCTTC TGGGCAGGAT ATCAGCTTTG 3481 CTTACCTATA TTTATTTATT AAACATTTAA AATAATCCTT GAAGATAGAT GCTAATCTTC 3541 CAACTGAGGA AGCTGAGGCT CAGAGAATTT AAGTAACTTT CTTATGGGAA CCACCAAATG 3601 GCAGAGCCAG GATTTGAACT AGACCATCTG GCTTAAAATT GACAGTCTTA GTAGCTTCAT 3661 TACACTATAA CTATAGTGAA TGTAAGATGC ATAGCACATC GTTAGGGTTG CCAGGTTTAG 3721 CAAACAACAA CAAAACATAA TACCCAGTTC AATCTGAATT TTAGATAAAC ACTAAATACT 3781 TTTCTTAGTA TAAGGATATT TCATTGTAGA AGCTCAACAA ATAATATTTA TTATTTATTT 3841 TATCTCAACA TAGAAACAAA CTTGATAATG ATTAGAACTC TCCAATTATA AAACAACATG 3901 CCCAGAGAAT ACTCTGTTAT GGTGGGGTTA ATTAGGTGGC TGAAAGACAA TGTACCTGGA 3961 ATATCATAGA AGAGATGCTC CTTTAAGGAT ATAGTTTAAG TTCTTTCCAA CTTTGAAATT 4021 TATGAATTGA CAAAAATTTC TGTTTTGCAT CTCTATTTTT GTCTTGTTCT GATAATCTTT 4081 TCAAAATGTG TATAAAAAAA CAAGAATACA TTATCTATTG CAACTTTACA ACCAATTAGA 4141 GGTTCAAGGT AATGTTACAG ATCGCTGATT TATTCTTGTA AATTCAAAGG TATGTCTTTT 4201 AAATGAGGAT TGGGAATTAG AAATCTTACG TAAGCCTTCC AGGATTCTCT AAATATTACT 4261 GTAGCAGCTA TAAAAGCTAC ATAAAAGTTC CCTCAGATAC ATGAAACACA TGTATTCCTC 4321 AGATGCTTTC TGTGGAATAT TGATGCTGTC ATCTGAGTTT GGTAAGGGTA AGTCACAGAG 4381 GAGGAAACAC ATACATTTTA AAACATTTTA GCTAAATATG TAATTGTGGC CAAGAAAAGT 4441 GTTTTTTTAA AAAATAATTA TTTCATTTCA AAATCATTTT TATTTATAAT TGAAAATAAT 4501 ATGCAGTTTT TTATTGTCTT GTAAGGATGG CATGTAAAAT GAGCATTTAT GTCTGAAATG 4561 TGGTATGTCT GTGTGTGTGT GTGTATATGT ATATATGTAT GTATATACCA TAATATATAC 4621 ATATGTATTT GCAATTCCAA AAGTTACATC TTTAATGAGA ATCATGAAAA TATTATTTTG 4681 CTCAGTTTCT TTTTTTATTT AAATTTAAAT TTAAAGTTCT GGTGTACACG GGCAGGATGA 4741 GCAGGTCTGT TACATAGGTA AACACGTGGC ATGGTGGTTT GCTGCACCTA CCAACCTGTT 4801 GCCTGGGTAT TAAGCCCAGC GTGCATTACC TATTTTTCCT AATGCTCTCC CTCCCTGCAC 4861 TCCACTCCCT GACAGGCCCC AGTGTGTGCT GTTTCCCTCC CTGTGTCCAT GAAACATGGT 4921 TTATATATTT ATAAAAGTTT ATATATCCTA GTCCAAATTT TTGATAACAC AAAAAGGAAA 4981 AATAAAATAA TTCAAAATTG GGAAAGAGAA ACAAAAGATT GCATGGCTTT TTTCCTTTTA 5041 TTGTTTGGAC ATTAAAGTCT CATTTTCCAT AAGGCAGCAA AGAAATCTAT TTCATCAGGC 5101 TGAAACAAAA TACATTAGAA TTTGTATGGA AAATTTTTCA GAATCTATAG TTCTGATTTT 5161 AGACATTAGA AAATGTTTCA TGTGTCTTAT AGATTTTTAA AGCAAAGTTA GTTGTCTTTC 5221 TCCAAACATA AAATATTTCT TATAGCTACC TGCCTTTCTG GAAGAACTTT GCCATGAGAA 5281 AGAAATTTTG GAGAAGTACG GACATTCAGA CTGCTGCAGC CAAAGTGAAG AGGGAAGACA 5341 TAACTGTTTT CTTGCACACA AAAAGCCCAC TCCAGCATCG ATCCCACTTT TCCAAGTTCC 5401 AGAACCTGTC ACAAGCTGTG AAGCATATGA AGAAGACAGG GAGACATTCA TGAACAAGTA 5461 AGGATCCAGT TTAAAGGTAG ATGCAAACCT CAGAAACACA GCAATGGCAA GCCTAATTTA 5521 GTATTTTTGC AATGTACTCA TGTACTCCCA GTAAGAGGTA TAATGTTTCT TTGGTGTTGT 5581 GTCTGCTGAG GTCCCAGGCA AGGTAATTAG GAGAGCAACA TTCAATGTAA CTTGGTTTCC 5641 ATAGCACGCT AGATGTAGTA CAAACCACAG GACAAACCTA CCAGAGGTTC CATTAGTCCC 5701 TTGAGAGATA TACACCTTTT TTTTTTCTTT TTACCCATTC TACTCTCAAT TTTACCTTGT 5761 TCAAGAATAT ATTAGTATTG CACCAAATTG ATGCTTTCCA GAGCCATATA TTGGTGTTTT 5821 GTGTTCACCT AATTGCCTAG AAATAAAAGT TAGAGATCCA AGCATGGTCA AGCATGGTCA 5881 AGATACACAG ACATGGCAAA GGTTTACTGA ACACTTAGGA TGTTGTGGGG CCGGGCGCGG 5941 TGGCTCACGC TTGTAATCCC AGCACTTTGG GAGGCCGAGG CGGGCGGATC ACGAGGTCAG 6001 GAGATCGAGA CCATCCTGGC TAACATGGTG AAACCCCGTC TCTACTAAAA ATACAAAAAA 6061 AATTAGCCGG GCGTGATGGT GGGCGCCTGT AGTCCCAGCT ACTCGGGAGG CTGAGGCAGG 6121 AGAATGGCGT GAACCCTGGA GGCGGAGCTT GCAGTGAGCC GAGATTGCGC CACTGCACTC 6181 CCGCCTGGGC CACAGAGCGA GACTCCGTCT CAAAAAAAAA AAAAAAAAAA AAGGATGTTG 6241 TGGAAACATG TCTGCTTGCA CAGAGGATCA GATTAACACT CACAAACAAA CTTTGAAGCC 6301 TCCTTCTCCT CTTCCTCTTC ATTCTTCTTC TTTCCCCCAT TTTGTTGATA GGAAATTGAA 6361 GGTTGGAAGG CTAAAACAAC TGGCTAAGGG CACACAGCTA GTACATTTGA CCCCAGGTTC 6421 TTCTGTTGGA GAAGCCTGTA CGTAACTCTT CAATCACTTC TGCCTTCCAT GTTACTTCCT 6481 AAACCAGATA AATAGAGAGA TTGCCCTTAG AACATCTCTG CTATGGCGAC TATCCAGGTG 6541 CATAACCCCA TTCTGCACTG ACAAGGGATA AAATGTCCAT CTTTCTGTTG CACTTAGCAG 6601 AAGTCTGGCT TTGCTGATCC CTGAAACATA TCTGAGTAGG CTTATTGAAA AAGACCTTTT 6661 AATAAGAATC ATGGTTAGCA TGTCTGCCTA TTTTCTTCTC ATAATAGAAT CTGGTACCAT 6721 CTGTCAGATA TTTTTCCCCC CAGGATCTTT AGTGAGGAAA ATTTTGACAA CATGTGGAAA 6781 AAATATGATG ATTTTCCCCT TAGGAACACA GTAAAGACAA AGTCAAATGC ATTTTGTTGT 6841 TGTTGTTTTT GAATCTTTAA ATAAATCCCA GTGTCCAGTT CCAAGCAGTA GTAGTCCTAT 6901 TTTTAGGTGT TTATAAATCT TCTAGCTCTA TTTTATTTCA CAGATTCATT TATGAGATAG 6961 CAAGAAGGCA TCCCTTCCTG TATGCACCTA CAATTCTTCT TTGGGCTGCT CGCTATGACA 7021 AAATAATTCC ATCTTGCTGC AAAGCTGAAA ATGCAGTTGA ATGCTTCCAA ACAAAGGTAT 7081 CATATTTGCG TGGATATCTG AACCAGTACT GTAGTCTATG ACTCATTAAA ACAAAACAAA 7141 GTTAAAAATG AAAACGTGCT TAATTGTGGA GAGTATCGTT TTTGGAATAG AGAAATAGTT 7201 CAGCAGTCTG ATATTCTTCG AGTGAACAAA ACTGGATTTG CTGGTTTTTA TTATCTATTC 7261 ATTAAGTCAA CAATAATTTT ATTATTAAGG AAGTGAGTTT GATGGGATAA AGAAGGAAAG 7321 ACAGACAGAC AGAAAGAGAG AAAGAGAGGG TGCTATAAAT GGGAAGGAGT AAGTGAAAGA 7381 AAACGGAGAA AAGGAGGAGC AGAGGAGAGA GAAAAAATGA AGAACTATGA TAAATGCTTT 7441 ATACTTACTG TTTTTTTAAA CCTTAGAACA AATTTGAGGT GTAAGTATTT TTGTCCTTTA 7501 ACAAACTAGG AAAGAGGTTT AGAGAAGAAA ATTTTTTTTA GGTTTAGAGA GGTAGAATAA 7561 TATCCACCAT TAACTACTTG CTACCAGCAG TCCTATAACA AAAATTTCCT GAGAACTGAC 7621 TACTCTCAGT GTCAAGCCGT GATACAATCG TATTTTTTGT AACTGAGAAG ACACTTAAGA 7681 AATTAGCAGG CTTCAATTTG TCTTTAAGCT GTTTAAAGGT ACAGTTGTTC ATTTATGATC 7741 CCCAGTATAA AAGTTATGTT TTTGTTTCAA GGTACTTTGA GTAAATTTGT CTGGCACAGA 7801 TGCATATAAA CTAACCCAAA AGAATAAATG AGTCAATAAT ATTCTGCGAT AATGTATGAC 7861 ATTTATAATT TTTAGTAAAA AACATATTTT ATGGAATTTC ATTTTACCTA TACTTGTTGT 7921 TTTTCTAAAT ATTAGAGCTT GTAAAGAAAA TGTTAGTATA TGCTTTCATG ACATTTTGTT 7981 TCCTCTACAT CTAGGCAGCA ACAGTTACAA AAGAATTAAG AGAAAGCAGC TTGTTAAATC 8041 AACATGCATG TGCAGTAATG AAAAATTTTG GGACCCGAAC TTTCCAAGCC ATGTAAGTTC 8101 AAGTTCTATC TAGGGAAGAG GGTGAGAGCT ACAGAACTAC CATTTTGCAA TTTGGGTTCG 8161 TTTTTTTAAT TGTTGCTGTT TTAGAGAATG AAGACCCCTT TGTGACCCCT TTGATGAGGG 8221 CTAATGGGAT TAGAACCATG AACTCTTAGG ATCAGAAGGA AGCTAACGGA TAAGTCAGTT 8281 TAACACTTAC TAAAGCCTAG CTGAGATAAT ACATAGAAAA GACTTTTAAA GTTTATGTTA 8341 TTTACCTGTT CTTTAAGACA CTTAAGTTCT GGCCTGCCAT CAAATTATAC CTCATCACTA 8401 GACCATATTT TTCTAGCTCT TCTACAAAAT AAGTCAGCCT TCACTGAGTG TCATTTAAAA 8461 CTTTTGCCTT AACAAGAAAT TCTTTATATT AATTGTGTTT CTTAATCTTC TATAAGGCTC 8521 TTTATAGCAT TTATTGCTTC CCATAAAAAT ATTCCTTTGA GGCAATAATA TTAGAATCTA 8581 GTGTCAGGAG AAAAGAACAT TTTAAATTAT ATAACTTCTT TAAAGTACTC ATCAACTCTT 8641 TTATGATAAA ACATTTCTAG TATATGAAAA TATCTGAGCT GCTAATCGAA TAGTAGTAAG 8701 TATATATATT CAGAGTTTAT TTGATTGCTG TTTGGTTAAA TAACACGTTA AAGCATATTG 8761 TAGACAATGG AAATCTAGAA TGAAGTTTTT AGTAATAGAA TTAGTTCTAA AGACTGAAAT 8821 TTCTTCTTGT AAGAAGTCAG ATTTATGCCT TAACTACCCT CCAACTCAAT TAGAAATTGA 8881 AAGATTAATT TATGACCTAC TTTAAAAAAA TTTATCTTAG TAATAAAATA GAAATTGAGA 8941 TAGTATTATA ATACCCTTAT CATTTGCTAT CCATAAGTGA AAGCTAAGTG GTCTAAATTT 9001 ATAGAGAAGA TCCATCTTTA TCAGAAGCAG GAGTAACACT ATTTTCTCTA GGAGCAATGG 9061 TTTTACAAAG ACAACTCTTA AAATTTAATG AAATATGCAG AGGAAACCAG AATTTTTATT 9121 TCTTACTTCT TTTTTGGCCA CATTACTCTC TTGGCACACT GGTATGCCTG AAATGTTTAA 9181 TCTGCTCCAC TTTCTTGTCC CATTTCATTG ATTTGCACAT ACGCCTCCTT TCCTGGATTC 9241 TCAGAAGTAT TTTTTACCCC AGTAAAAAAT GGTTCTCATT TTCATGGAAT TTTCCATTTC 9301 TTAATTGTTT ATCCTTTTAA AGTTTTATGT CCACTCAATA CAAATGCTCA GCATAAGGGT 9361 TGAATGGCAA AAATAATTTA CCTTAATTCT TAAGACATGT TTTAGGAAAA GATAAATTAT 9421 TTCTAAACCA TTTGTGGGGC CAGCTCACAG TTTACAGAGT TACCACCTTG AAAGATATGG 9481 CTGGAATCAA GCCTTAAAAC ATGTTTCCTT TTCTTTTTAA ACAACAAGGG AATTTCAGTC 9541 ATTTCTCTTT GAAACTTCTA CTGTAGTAAT ACTCTATAAA TTCTATTTTA TTTTGACAGA 9601 TAACCAAGAA ATTAATTTCT AATTTCTTTT TTTTCCCTAG AACTGTTACT AAACTGAGTC 9661 AGAAGTTTAC CAAAGTTAAT TTTACTGAAA TCCAGAAACT AGTCCTGGAT GTGGCCCATG 9721 TACATGAGCA CTGTTGCAGA GGAGATGTGC TGGATTGTCT GCAGGATGGG GTGAGGAGTC 9781 TTGCTTCTTA AAATAGAAGA TTTTCACTCC CTTTTCTTTC TTTTTGTCTC ATTCTAAAAG 9841 GGAGAAGGTT GTTTGACTTG AATTGGTTAC AGAGTATGTA AACTAGGTGA CTCCTTAAAT 9901 TTGCAGAATT CTCGGTAGTA AAACTTAAAC CATCTTTTGT TGATCCTGGC TTTCACTTTA 9961 GCTATACCCC TTTTTGTGAA ACCAAGGCTC ATCTATTTCT TACTTCTAAA AAAACCGTGG 10021 GAACTTCTCA GAAGGCTTCT CCATAGTTAC TTGGAGGACG GGAGGAAACT AAGGTTTAAT 10081 GTATTTATTT TTTCATTCAT TTATTCTTTC ATTTGACAAA TAAATATATA TTAAATACTT 10141 TCTATCTGCT AGCCACTATG ACAGACACTT GTTTTAAAAG CACAGGCTGA CCTCAAGGAA 10201 TTCACAGTCT GATAGGAGAG ATAAGACAGT GACCTCCTTG GAGTTAGGGA CTGCCTTGGT 10261 TTACTGTTAT CTCCATAGCA CAATGCCTGG CACATGGAAG GCATTCTATA ATAGTTTGTT 10321 AAATGAACGA ATGCAATAAA AATTGCACAA GTAACTGTCC TACCAGGTAA AAAGCTAGCC 10381 TTGCCAAAGA CAAGTGTGAA TAAAGTGGTC TCGGAGAATT AGAAAACAAA ATTTAAAAAA 10441 CCCAGCAACA GTTTCTTGAG TGTGCTCTAG TCAGTGGTGG TTTAAGCAGT GTGGCATTGG 10501 CTTATTTTGG CTAACCCTAG ACTTCCTAGA TTTTACAAGA GAGGACTGTT GACCCTCAGA 10561 TTACTCTGTC TCTGTGGGCT CCATGACACA CCAAAGAGAT TAAAATCCAA GGAGCTTAAA 10621 AATTACTGCT CTTAGGTACT CAAATGGCTT GATAACCAGC ACTGGACTAT GGTTTCCAGA 10681 AGGCTGAAAG TGAAGATAAA ATGCTCATTT CAGCCCATCC GATGGCAATT CAGTGAGATG 10741 CTTGAAGTAA GCAAGAAGCC GAGGCTGCAG AGGAGGCCTG AGAGTGACAG TCCCTGAGGA 10801 GCTGGGGAAG AAGTGGGAGG AGGCAGCCTG GCAGGCGACT GTACTACTTT ACTTGTTTTT 10861 CTTCTAGAAA TATGGCATAC TAGGAAAACC AACAAGAAGT ATTTTGTTTT TCTTTGAGCT 10921 CAGTTTTCCC ATTTTGAACG GACAATTTTA CTGTTTCTCG TTGTCATTTT TAAAAAGTTA 10981 ATTTTTTCAA TTTTTGGAGC TCATAACCAC CCTTTTCCTT TAAAGTGGAA ACATTAATTT 11041 CAGCATGATA TGTAAGTTGG ATTTTGATAG CTGAATAATG GGTTCTAATT ATCTTTGCTG 11101 AGAATGTACA GAATTTTCAG TCCCATGACA GGTATATATG TAAGCTCTGC CTCTCTCTGG 11161 CCACTTAGGT GCATTGCCAT TTTATTATCT ATAGACTGCC CTCTGAAGGT CATAGTCAGT 11221 CACTGCAGTA TGTTCTGATA AAGATGATTA TCATTCTTAC GGAATTTCTC GTGGAGCAGA 11281 AAGTTTGCTC TCCATGTTAT GATACCAGTT GCAAGTGTTG TTTAGGGGCA AATTTGAATG 11341 CTAATAGAAA TACATATAGC AACATGCATC CTATTTTATT TGAGCACATT TCCCTCTTAT 11401 TTGTAAAGGT TTTCAATTGA AATAACATAG GATTTAGCTT ACAACTATGG AAAGAAGAAT 11461 TGAACAAACA GGTAAGTGGA AAGGAATGAG AAAAGGCAAA AGTGGGGAGA AAGCACTAAA 11521 ACGGGAGACA AGTTAAAATT TCTTTTTAAT TGATAGGTCA CGTTCTCACT CTATTTGCCT 11581 TTAAGGGAAG AAAGCAATCA AGTTAATATG TTTTCCTTCA TTGTATAGTA TGTAACTACG 11641 GACACTATTA GAGGAGGGAT TTGTGTAGCA CTTAGGACAT TATACTTGAT AATTTCCAAG 11701 GGTCTTTCTA GATTTAAAAG TCTGATTCTA ACGTAGTAAT AAAAATAAAG GCCCAATTTT 11761 CTCTTTAATA TTGCCTGAAG ATATTACTCT ATTATTGCAT TAAAATTAAA CATTCACACA 11821 TTGTTTGCAC TGCTAAATAA AATTATGTAA TTTCTTCTTC TTTCCTTCCT CCTTCCCCCA 11881 TCCCTCTCTA TTTCCCTTTC CCCTTCCTTC TTTCCTGGCC TTTTTTCCTT CTTTCTTTGT 11941 TCCCTTCTCC CTCCCTCCCC TTTCTTCCTT TTTCTAAAGC TGGCTTTGAG ATCCTTTATT 12001 AAAGAATAAA TCTTTAAAAC TTATACTTTA TTTTCCCTGT TGCAGGAAAA AATCATGTCC 12061 TACATATGTT CTCAACAAGA CACTCTGTCA AACAAAATAA CAGAATGCTG CAAACTGACC 12121 ACGCTGGAAC GTGGTCAATG TATAATTCAT GCAGAAAATG ATGAAAAACC TGAAGGTCTA 12181 TCTCCAAATC TAAACAGGTT TTTAGGAGAT AGAGATTTTA ACCAATTTTC TTCAGGGGAA 12241 AAAAATATCT TCTTGGCAAG GTAACACACT CTGTAAATGC ATGTTCATGC AAGTAAAAAT 12301 GATTATGTGG CTGACAGATT TGCGTTGTTG AAATGGAGAG TGATGATTAT GGTTTTTGAG 12361 TTCAATATGT GAGGATATTT GGCTAGAATG TTCTGAGCCA AAATAGATTT CAGTAGATAA 12421 CCAGGGAATA AGTAATGGGA TTTGGTGTTT AACGGTGAAG CGTTCACCAC TGTGACTCAT 12481 TAACTGCTTT GCTATGAAGC TGAATTTTAT TTCACATCAA TTTCTCTGGA ATCAGAAGCA 12541 TTGTCATCCT GTAAAGATTA CTCATATCAA GGCCACCATT GAACTCTCAA ATAGGATATG 12601 GATATTTTTG TAATAAGAAG AGTTCATGAT TAAGAATGAA CTCTTGCTAC GCATGTTAAA 12661 AAAAAAACTT TTCTCCAAAA GATAACACAA GAGATAATGC TAGGTAGAAG AACTTTTATA 12721 GGAACAGCTT ATTGGCTATG TATTAAATAC ATGTTTTGTA TTTTTTAAGA AAATCAAAAC 12781 ATGTTTAGAG ACATTTGCAG TACAGTAGTT TGTTTTAATA CAACTGATAG GTCACGTTCT 12841 CACTCTATTT GCCTTTAAGG GAAGAAAGCA ATCAAGTTAA TACGTTTTCC TTCATTGTAT 12901 AGTATGTAAC TATGGACACT ATTAGAGGAG GGATTTGTGT AGCACTTAGG ACATTATACT 12961 TGATAATTTC CAAGGGTCTT TCTAGATTTA AAAGTCTGAT TCTAACGTAG TAATAAAAAT 13021 AAAGGCCCAA TTTTCTCTTT AATATTGCCT GAAGATATTA CTCTATTATT GCATTAAAAT 13081 TAAACATTCA CACATTGTTT GCACTGCTAA ATAAAATTAT GTAAGCTAGA ATAAAGTTCA 13141 GATTTAGGAG ACACATAGTG ACAACTGATT GGTGACAGAA CTAATCCTAT AATCTGGGAA 13201 TACGGTTAGT AAAGTCAAGA ATTACCTTTA AGTTTACACA TCCATGCACA TCTAAATCTA 13261 ATTGTTTAAT AGAAGCAGTT CTTCAGTTGC AAAGGTTCTT TGCAGTAGAA TTTTCTCAGC 13321 CAGGAATGAT TTTCCCCCAG ATATTTGCAT GGCTTCTTTC ACTTAGCTGA TCTCTGTTCT 13381 GATATCAGCT GCCTAGAGAG AATTTTCTTG ACCACATTCA AAGTTAGTGG CCTCTCCACC 13441 TTGGGTATCA TCCTTTTTTC TCTTTTTCAT CTTTATTTAT TTTCATTGAT TTATCGCTAA 13501 CTGAAATGAG ATGGCCTATT TCTTGTTTAT TTGTTCTGCC TCCCTATAAT GTGTGCTTTT 13561 CAGAGGGCAG GTATTTATCT TAGACATCAT TGAGTCCGTT CTGCTTAAAG CAATGCTAGC 13621 AAAGAGTGGA CACTGGAAAA ATATTTGTTG AATAAATGAA TATAAAGTCC GTAATTGAAA 13681 AGTCAAATTG AGAGATGCAG GAGAAAACAA AAAGCCATTT TACAGGACAA TTTGAAGGAT 13741 CACAGTCTGT ATTAACAGTT TTGCCATTCA TATAATTCAA ATCATATTTG ATTTTCAGGT 13801 TTATTTATTT GAATTTAACT TCCACATGCC ATATTATATA GGAATAACTG GAGAAGTGAT 13861 GGCTCCTTTT GTCTCTTAGT TCCAATAACT TGAAATATTT TTCTCCACAT ATTTCAGTTT 13921 TGTTCATGAA TATTCAAGAA GACATCCTCA GCTTGCTGTC TCAGTAATTC TAAGAGTTGC 13981 TAAAGGATAC CAGGAGTTAT TGGAGAAGTG TTTCCAGACT GAAAACCCTC TTGAATGCCA 14041 AGATAAAGGA GTAAGTTGCT CTAGAATTTT AGGGGAGTAT GAAAAACTGG ATTGATATCA 14101 TCTGTTAAAA ATGCTGTTTG TTTGAAAGCC TCTAGTTTTC AACTAGTTGT TAGCCAGTTA 14161 TATCTATTTG TCTAGATATT AAGCTGTTAT TAACTAGCAG TCAGCAGCTA GTGGCTTGCT 14221 TTAGAAACAA AAATGTTAAT TGCTTCTCAG CCTTTTGGCT AAGATCAAGT GTAGAAATAA 14281 AAATGTTAAC CAAAAGTCCT TTGATCCACA AATAAAGGTA GTATTCATTA TTCATTTTTG 14341 GATAACTTCA GAAAGGCAAG AATTTGGTAC AGAAAGAACT GTAACCATTT ATCCAAAGAT 14401 TGAGTTTTGC CATTAAATGA TTTTGTGATT TATAAAATGT TAAACTTAAT CTCCCCAAAA 14461 TCCATTTTCT GTAATTATCA AAATTTACAC TTTACCATAT TTAATATTTA AACATCTCTG 14521 ATTGGTTTTA TAATAGTATA TAATATTGAT CAATTTTATA TACAAAGTTA TGCATCCAAG 14581 AAAAGAAAAA TGTATATGTA ATAATTCTTC ATTTTCAGGA AGAAGAATTA CAGAAATACA 14641 TCCAGGAGAG CCAAGCATTG GCAAAGCGAA GCTGCGGCCT CTTCCAGAAA CTAGGAGAAT 14701 ATTACTTACA AAATGCGTAT GTTTTTGTAA ACAGTATTTT TAGTGAATTA AAATTATTAA 14761 AGAGAATGTA GCCTTCCCCA ATTCTCCTCC TTTGGAAGCA ACAAGAATGA CCTGTGAGGT 14821 CTGATCTGTG GTATTGACTT TAAGTTCCCC ATACTGTGCA AATTTTTGCA GTAAAGATAT 14881 CCATTCTGTC ATAGTCTGTC CGAGTTAAAG CACCAAAAGA TCACAGTTAA AATCAATGAA 14941 GCTCCGAGGT TGAGAATACA AGTCAGGCTT CCTCCTGGAG TTTGCTTTTT CATTGTGATA 15001 TGCTTCTATA ATAGGAGTAC AGGGAGTGGT TGTTAGAGTA AATGCATCTC AAAAGTTGGT 15061 TCAAATACCA TGTAGATGAA ACAAACGAGC TGACCTCATG TCTTCTGGCA TGAGAGTAGA 15121 GAGTCTGTGA GAAGAAGCAA GGCGGCTAAA AACTCATGAA TGACTCAGCA GGACTTAGTT 15181 AAAAAATGCT TCTTTCAGGT TTCTCGTTGC TTACACAAAG AAAGCCCCCC AGCTGACCTC 15241 GTCGGAGCTG ATGGCCATCA CCAGAAAAAT GGCAGCCACA GCAGCCACTT GTTGCCAACT 15301 CAGTGAGGAC AAACTATTGG CCTGTGGCGA GGGAGCGGTG AGTGTCTGCT TGGTTTGGTC 15361 CCATCTCATT TCTGCCCTGT TTGACTTGAA ATAGCCTCAT AATTCCCCTC TAGGGAAGAC 15421 GGTAAAAACC AATGTAGAGA TGGCCTTAGG AGGCTTGTTT GATTAGTCAC GGTTGGAGGG 15481 GTGTGAGAAC CCAGCTCTGG ATGGCTGGCA TGTGGCCATG CTTCCTATTC CTCCAGGGTG 15541 GCTGGTGGAA GTTCAGCCAG TTTAGTCAAC AATATCTGAG CCAACTTTAT ATATCAGAAA 15601 GACAGAACGA CCAACATGTA ACTCATAATT CATAACAATT CATAATTCAT AACTTCAAAT 15661 CATAATTTCT GCTTTTTGTT CATATACTTA CTTTGATGTT TTAAAAAAGC TTTATCTTTG 15721 ATTGATTAAA ATTAGTCATG CTATTTTAGC CATATTTATA TTTTCACCTT TTGTAACATG 15781 ATATTATTAT TATGACATCA GGAATAATTG GTTCCCTTTC GCAGGGTATA GGGTACAGCA 15841 CAGGATAAGT ATTATGTTCT GAATAGTAAA ATGACTTTCG AGTCAGTAAT GCCAATATTC 15901 TTTACTTCCT AATGTCACTA GTATCATACA TAAGATTACA GGATGAATTA AAAATATTTT 15961 TCCTATAAAG TCATAATTGC AAACAAAATT GTCTATTTTA TCCTTTTTCC TCTTTTTCAT 16021 AATGGGGAGT TATTTGCTGT TAGTCTTCAT GTCATACATT TTTTCCCCAA AGGTTAAGAG 16081 TAAAAGGAGA GTTCTTGTGA TTAAATGTCA CCTCAATTGT TTGTTGAATT TCCCATGCTG 16141 GGAGGCTGCA GGGATGCAGG ATGGTGTAAT GGTTCAGGAG TGTATGTTCC GGAGGCCACC 16201 AGCCAACAAC CGCATTCTAC TTTCACTATT CTTTAGTTGT ATCACAGTGG GAAAGCAACT 16261 TATGTCATTA AGCTTGAGTT TTTTCATCTG ACATTTGAGA ATACAAATTA TACCACCCTC 16321 ATACGACAGC TGTTTTAAAC AAGATAATCT GCATAACTCA CACAGCACTA GTCTGACAGA 16381 TAAAGTGCAC ACAAAACATA TTATTTCTTA TTACAAGTTA TTACTAGGTG ATTAAGAAAT 16441 ATCTCCTAAG TAGGCAAGGT AGCAAGATTC TACATTAGGA AAGTCTTAAA AACCCACAAA 16501 ATTGCTCTTA CTTCTTTTCA ATTAGGATGA TATATTAGCT GCAAGTGTAT ACATGTGTAT 16561 ATGTATGTGA ATAAAAGGGG TAAGTTTGTG CTATTCTTAC CTTCAGATAG TGATTATCAA 16621 AAGAAAAATG GAAAGTTCAA CTAAATACAC ATGGGAAACA TAAAGGCAGA GACATTTTTG 16681 TCCTTTAGAA GTGTGTATGT AACTGGAAGC ATGTTTCAAA TAGCTGACAC AAATAGCTAA 16741 ATGACTATCC TCAACATCAC ATATGGACCA TCTGCTACTA CTTGCTAAGG CTTAGCCCAA 16801 ACAAATGGGT AAATCCTGGA ATTTACAATA TAATGTCACA TGATCCTACA TAGCAAATTT 16861 TCCTGTAATA TTAATTATAA ATTGCTGGGC ATTAGAAATT ATTGCAGCAG TTTTCTGAAA 16921 AACTGAACCA ACTTTGTGAC TAATGCCCAA TCTCCTTACT TTTTTTTCTC ATTCTCCTAA 16981 CCAGGCTGAC ATTATTATCG GACACTTATG TATCAGACAT GAAATGACTC CAGTAAACCC 17041 TGGTGTTGGC CAGTGCTGCA CTTCTTCATA TGCCAACAGG AGGCCATGCT TCAGCAGCTT 17101 GGTGGTGGAT GAAACATATG TCCCTCCTGC ATTCTCTGAT GACAAGTTCA TTTTCCATAA 17161 GGATCTGTGC CAAGCTCAGG GTGTAGCGCT GCAAACGATG AAGCAAGAGT AAGAAACTGT 17221 TACTTGCTAG CATGGAAAAG AATGACAACC CCAAAGAGTA ACTGAGACTT CTACCTCGCT 17281 CACCTAACAC TATTGGGCTC ACTAACAGAG CGTTACTCCC AAAACACTTA AAATGCCTTT 17341 GAAAATAGTT TTGTCTCAGT GTCTTCACAG TCTCATTGGG GAAGCAGGTC TAGAAAAATC 17401 GACGAGGGTG GACAATTTCC TGTTTGTAAA AATAATCTCT GTTGTAACTG TTATTGTGAT 17461 ATGTATTTGG GGGTTGAGGA AAAGTGGGCA ATCTATTCTG AGGAATTAGA GTGTATCTTT 17521 GCAGCAAATT TGGGTACTTC CATTCCAAGC ACAGGAAACA CATCATTGAA TCTTTTTTTA 17581 CACTATTTAC ACTTTGAAGA GAATAACCAT CTTATTTAAT TCAACCATGC AGTTTGGGTG 17641 TTAAGAAATG ACATGTACAT TTCAGTTCAT TGTGGGAGCT CTTTTGTAAT GGTGATGGTC 17701 ATGCAAGTCA ATGGAGCTTA TGTTCTTCAA ACTCCCATGC ATTTTAATCC TCACTTGTTT 17761 TGTAAATAGT CTTCCTTCAT TGGAAAACCC ATTCTTCTCT TTTTTCCTCT ATCACAGTCT 17821 GAGGTATGTT TCACAGTATG ATAAGAATGT TGCCTGTTCT GGCAAGCTTT TTCTATTGCT 17881 CTGGTCTACT TTCTATTGCT CTGGTCTAAG TCCAACATGA AAGGCTTGCT AAGTGAGCAG 17941 TGCAGGCAAT TAGTGCTGCC AGTGCCCAGA TAAGGGGTGT GATAACTGGA TGGGCAGGAT 18001 TCGGAGATCT GGGTCTTTGA GTGTAGATAA GACACAGTTA AGAAGAGCGG ACAGGAAAGG 18061 ATATTCCTGG GGGATGAGGG GAGATTGCCT TCCACTACAC ATAAGTATGG TCAAGTATGA 18121 AATAGTGTTT TATCCACAAC CTGCACAACT CCAGGCTGGT GGAACACTTG GCATGTTTTC 18181 AGCCTCAATC TTTCTACTGA AAGTACTAGA CAAGGTGTGT GTGGTCAGTC TGGTGATAGG 18241 TTGATGGAGT AAGGGTTTAG GCTCTGAAAA TTCTCTACTA GGAAGGCTGT AGAAAAATAG 18301 CATTGCATAA CAGACTTCTC TTGTATTTTG TTTTGTTTTA AATCACAGGT TTCTCATTAA 18361 CCTTGTGAAG CAAAAGCCAC AAATAACAGA GGAACAACTT GAGGCTGTCA TTGCAGATTT 18421 CTCAGGCCTG TTGGAGAAAT GCTGCCAAGG CCAGGAACAG GAAGTCTGCT TTGCTGAAGA 18481 GGTACATGCA GCTCATTTCA TACTCAAAAT ACTTGCTATG GAATTTTCTG TAGTGGATAA 18541 TGAAAGGAAG ACCCTACAAA TTTATAACTT TAAAATATTT TCAGAGAGAT TTAAATTTCA 18601 TTGAGAAGCA GATTGAGGGA TTCTATAAGA TTTAAAAAAT AATCACATTT TCTTGCTTAA 18661 TATTAGGAAA ATTTATAATA TTAAAATATA TTAATAGAAT TAGTAATTTT AATTTATTTC 18721 CTAGTAGAGA AACCCATAAA GTGAATGTGT AAATAATTGA TGGTAATTTA GATAGTTTCT 18781 GGCCTAAAAT TGATCAATTC AGCTAAATGG ATTAAAGGAT TTAATAGCAA ATTAATTGTG 18841 CAAACAGAGT ATTAGGAGTC TATTTGTAGA AAATGTTTTT GAACTCATTT AGAAGCTTGC 18901 TTTTGTACAT CAACAGAGTA GTATTTAGGA GTTATTTTAA TTACATAGTA ATTTTAGCTG 18961 GATAATTAGC CAGATTTTCT TTAACCAGGG GATTCTACCT AACATTTAAA AAAATTACCT 19021 TTTTTCAGCT TTATTGAGGC ATGATTGACA AATACAAATT ATATATGTTT AGGGTGAACA 19081 TGTGATGTTT CAATATATTT ATACATTGTG AAATGATTAT CACAATCAAG ATAATTGGCT 19141 AAATTTTACA AATCTTTAGT TTGTATTGCT ACATATATTT GAATATAGCA ACACTATACT 19201 TTAAAAAGAT ATTCTATAAC TTAGCGTTTT TGTCAATTTT ACCTTTCTCA CCATGTAAAA 19261 TCCAAAGACA GATATATTTA GAAATGTAGA GTTTTTCTAT AAATAATATA ATTAGATGCA 19321 TTTGAGTGTG TGCACTTACC AGTATATGTG TGTGTTTTTG GTGGGATCAG GTAGGGTGGG 19381 ACATAGATAA CCAAATTAGA TAAAACTGGT GAAACAGATT TGATGTGAAG CATTTCTGAA 19441 AAACATGACA CAAGAAGATT AATGTTCTCT AATCTGAGAA GACATTTATT TAGATATAGA 19501 GAACATGAAC AAATAGTAGC AGTGCTTTAT CTGCAAACCT TTTAATTTCT AATAACTTGT 19561 AATTTGTAGA GGAAGGGGAA AGATTGAGAA TACGCATTGA TTTGGAGATT GTTATAGAAG 19621 AAAACTGTTG ATGTGAAAGA ATATTGTTTT CTCCCTGGCT TTTACTATCC CAGGTTGTTG 19681 GCATCAGAGA TGTGTTTCTT CATTTTTAAC TTAGTTAATC TACAAACCTA TGAATTCACC 19741 CCGGATTGTA GAGTGTTAAC TGTATGATTG GTATAATAAT CCATTTCTTT ATCTGATTAT 19801 GTTTATTCTT AATTTTCAGG GACAAAAACT GATTTCAAAA ACTCGTGCTG CTTTGGGAGT 19861 TTAAATTACT TCAGGTAACA AAACATTCAG ACAAGCCTGA ATACAATGTT GTTTCTCCAG 19921 AAATATCAAT CCATAATGAG ATAGATCATG AGGAGTGCCA TTAATTCTCT TAAAAATACA 19981 TGGAATTCAA AAAAAAGTTT ATTTTAAAAA CACTTGAACA AAATTACGCA CACAATGTTA 20041 AATTAGTGGC TCAACTATGC AAAATCCTTT TTGGTTATTT AAAAGACTTC AACAAATGCT 20101 ATCAGAAGAC TTTCCTACGT ATCCAATATT TCTCTGATAT AAAATAATAG AACCAGTTAC 20161 TTACTGCACC TATTAGTTTA ATTAGTATTT AATATATTTT TGCTCATATT GCAGGGGAAG 20221 AGAAGACAAA ACGAGTCTTT CATTCGGTGT GAACTTTTCT CTTTAATTTT AACTGATTTA 20281 ACACTTTTTG TGAATTAATG AAATGATAAA GACTTTTATG TGAGATTTCC TTATCACAGA 20341 AATAAAATAT CTCCAAATGT TTCCTTTTCC AAGTTTGCTT ATTTATGAAA AGTTATCGAT 20401 AATTTCTTTA GTTTTGTATA CCATTGTCTG AAGCAGATTC TGTTAAAATA GCATTAAGTG 20461 TTGGTTGTTA TAGGAGATTA AAGCTATCCA AGGATGGATT TACAGCACTA GATCACTTGG 20521 TGAACTGAAA AATGTTCCCA AGTTAAACAC TATTTGATGC TACCAGGGCA TTTTGTTTAT 20581 TAAATGACCA TCACTGAAGT ATTCTAACAG ATAATCTGGA GATGAGAAAA GAAATTATTA 20641 TTCTTCTATG GGATCTAAGA AAATTTAACA TCTACTTTTT TTCTCAATCT TGTTCAGTTC 20701 TCTATTCACA GACATTTTAA AACATAAGAA TATTACAGTT TTGATTGAAT ATAATTAATG 20761 TTTTCATCAA TTAAATTTTT ATTCCACAAA TGTTTATTAA GCCTTCACTG AGTTCCAGGC 20821 CTTGGGCTGG GTACAGTCAC TGTTCTCCAG TCGCTTCAGT AGAGAAGGAA GCAACACAAT 20881 AATGATGCCT CAGATTATGA CCACAAGGAG TTGACAACAA ATTGTTGAGG GAACGAAAGG 20941 GTGGGTGACA TCAGTTCTGA CTGGAAAAGG TGCCATAAAT GTTTAAGAAG AGTCTTAGAA 21001 AATGAATGAG CATTCATCAG TTCAAGTAAA AGAGAAGGCC ATTTTATGAA AGAGGAAACA 21061 TGGTCAGAGG TACTGGGTAT GACAAAGTAT GATGAATAGT TTTAGAGGAG TACAGGGTAG 21121 GTAGGAAGGG TAGAGATGTA GAATGTTAGG CAGGATGTGG AAGCTATGAA GGGCTAATCC 21181 AGACTCTCAG GGCTCTATTC TAAAGAGGTT GGACAATATC TTATGAGTAA CAGAGGACCA 21241 CAGTGATAGC TAAGCAGAGG TGTGCTATTA ATAATTAGCT TTGAGGAAGA TAATCTATGA 21301 GATACGGAGC ATGGGCAATG GGGCAGGGAA AAGGGTAGAG GGGGCAAAGT CAGCAAGAGG 21361 TGACAAAAAG ACCATTCCCC CAGTAGGGAA GCTGTTTATC TTATTAGCTT TGAATAGAAA 21421 TCTGAAAATT ACATGCATAC TTTGGCCTGG GCTCTGGGAG AAGACAATGG CTTTAGAGCA 21481 ATAAAATGTG CTTTCGGTTC TTCTTCTCTT TTTTTTTGAA TTGGAATCTG CTATGTTGCC 21541 CAGCTGGTCT CAAAATCCTG GGCTCAAAAG ATCCTCCTAC CTTGGCCTCC TGAGTAGCTG 21601 GGACTAGAGC CATGAACCAC TACACCTAGC CCTTTTAGTT CTAACTAGAT TGCTTTCTAC 21661 TTGACATTTT TCTTTCCATT GTACTGCAAC ATCTATGAAC TTTGTCTGAT TTTGGTGTAA 21721 TTCTAAAAGG TATTAATTTA AAGATCACAA TAAAATAAAA TCACGTTCAA TATAAAAGCA 21781 TTAAGCTGGA AAATGCTTTA TTTTTTTAAG GGAAAAGCAC AATGAAGGTA AAACTGTTAG 21841 GGAAATTTGT GAGTCAAATA ATATTTTGTT GAAATATATA AAGAAAAAAT GATTGGAGGC 21901 AGAAGGATAA ATTGACGTAA GTACAATTGT AAATTGGGGA TTTTTAAAAA TTTATTTTAT 21961 TTATTTATTT ATTTTTTTGA GACAGAGTCT CGCTGTGTCG TCAGGCTGGA GTGCAGTGGT 22021 ACAATCTCGG CTCACTGCAA GCTCTGACTC CCTGGTTCAA GCGATTCTCC TACCTCAGCC 22081 TCCTGAGTAG CTGGGATTAC AGGCACGCGC CATGACACCC AGCTAATTTT TGTATTTTTA 22141 GTAGAGACGG GGTTTCACCA TGTTGG // LOCUS HUMAFP1 875 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human alpha-fetoprotein (AFP) gene, exon 1 with 5' flanking region. ACCESSION M10949 KEYWORDS . SEGMENT 1 of 2 SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 875) AUTHORS Sakai,M., Morinaga,T., Urano,Y., Watanabe,K., Wegmann,T.G. and Tamaoki,T. TITLE The human alpha-fetoprotein gene: Sequence organization and the 5' flanking region JOURNAL J. Biol. Chem. 260, 5055-5060 (1985) STANDARD simple staff_review FEATURES from to/span description pept 781 / 865 alpha-fetoprotein, exon 1 /nomgen="AFP" /map="4q11-q13" /hgml_locus_uid="LB0045G" pre-msg 737 > 875 AFP, mRNA and introns IVS 866 > 875 AFP, intron A BASE COUNT 302 A 132 C 140 G 301 T ORIGIN Chromosome 4q11-q13. 1 AATGATGCAC CTGACCCACT TTATAAAGAC ACATGTGCAA ATAAAATTAT TATAGGACTT 61 GGTTTATTAG GGCTTGTGCT CTAAGTTTTC TATGTTAAGC CATACATCGC ATATTAAATA 121 CTTTAAAATG TACCTTATTG ACATACATAT TAAGTGAAAA GTGTTTCTGA GCTAAACAAT 181 GACAACATAA TTATCAAGCA ATGATAATTT GAAATGAATT TATTATTCTG CAACTTAGGG 241 ACAAGTCATC TCTCTGAATT TTTTGTACTT TGAGAGTATT TGTTATATTT GCAAGATGAA 301 GAGTCTGAAT CGGTCAGACA ATCTCTTGTG TGCCTGGCAT ATGATAGGCA TTTAATAGTT 361 TTAAAGAATT AATGTATTTA GATGAATTGC ATACCAAATC TGCTGTCTTT TCTTTATGGC 421 TTCATTAACT TAATTTGAGA GAAATTAATT ATTCTGCAAC TTAGGGACAA GTCATCTCTT 481 TGAATATTCT GTAGTTTGAG GAGAATATTT GTTATATTTG CAAAATAAAA TAAGTTTGCA 541 AGTTTTTTTT TTCTGCCCCA AAGAGCTCTG TGTCCTTGAA CATAAAATAC AAATAACCGC 601 TATGCTGTTA ATTATTGGCA AATGTCCCAT TTTCAACCTA AGGAAATACC ATAAAGTAAC 661 AGATATACCA ACAAAAGGTT ACTAGTTAAC AGGCATTGCC TGAAAAGAGT ATAAAAGAAT 721 TTCAGCATGA TTTTCCATAT TGTGCTTCCA CCACTGCCAA TAACAAAATA ACTAGCAACC 781 ATGAAGTGGG TGGAATCAAT TTTTTTAATT TTCCTACTAA ATTTTACTGA ATCCAGAACA 841 CTGCATAGAA ATGAATATGG AATAGGTGAG AAATT // LOCUS HUMAFP2 620 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human alpha-fetoprotein gene, exons 14 and 15. ACCESSION M10950 KEYWORDS . SEGMENT 2 of 2 SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 620) AUTHORS Sakai,M., Morinaga,T., Urano,Y., Watanabe,K., Wegmann,T.G. and Tamaoki,T. TITLE The human alpha-fetoprotein gene: Sequence organization and the 5' flanking region JOURNAL J. Biol. Chem. 260, 5055-5060 (1985) STANDARD simple staff_review FEATURES from to/span description pept / 9 53 alpha-fetoprotein, exon 14 /nomgen="AFP" /map="4q11-q13" /hgml_locus_uid="LB0045G" pre-msg < 1 547 AFP mRNA and introns IVS 64 403 AFP intron BASE COUNT 216 A 95 C 87 G 222 T ORIGIN About 18 kb downstream of segment 1; chromosome 4q11-q13. 1 ATTTTCAGGG ACAAAAACTG ATTTCAAAAA CTCGTGCTGC TTTGGGAGTT TAAATTACTT 61 CAGGTAACAA AACATTCAGA CAAGCCTGAA TACAATGTTG TTTCTCCAGA AATATCAATC 121 CATAATGAGA TAGATCATGA GGAGTGCCAT TAATTCTCTT AAAAATACAT GGAATTCAAA 181 AAAAAGTTTA TTTTAAAACA CTTGAACAAA ATTACGCACA CAATTGTTAA ATTAGTGGCT 241 CAACTATGCA AAATCCTTTT TGGTTATTTA AAAGACTTCA ACAAATGCTA TCAGAAGACT 301 TTCCTACGTA TCCAATATTT CTCTGATATA AAATAATAGA ACCAGTTACT TACTGCACCT 361 ATTAGTTTAA TTAGTATTTA ATATATTTTT GCTCATATTG CAGGGGAAGA GAAGACAAAA 421 CGAGTCTTCA TTCGGTGTGA ACTTTTCTCT TTAATTTTAT CTGATTTAAC ACTTTTTGTG 481 AATTAATGAA ATGATAAAGA CTTTTATGTG AGATTTCCTT ATCACAGAAA TAAAATATCT 541 CCAAATGTTT CCTTTTCCAA GTTTGCTTAT TTATGAAAAG TTATCGATAT TTCTTTGGTT 601 TTGTATACCA TTGTCTGAAG // LOCUS HUMAK1 12229 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human cytosolic adenylate kinase (AK1) gene, complete cds. ACCESSION J04809 KEYWORDS adenylate kinase. SOURCE Human DNA, (library of T.Maniatis). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 12229) AUTHORS Matsuura,S., Igarashi,M., Tanizawa,Y., Yamada,M., Kishi,F., Kajii,T., Fujii,H., Miwa,S., Sakurai,M. and Nakazawa,A. TITLE Human adenylate kinase deficiency associated with hemolytic anemia: A single base substitution affecting solubility and catalytic activity of the cytosolic adenylate kinase JOURNAL J. Biol. Chem. (1989) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by A.Nakazawa, 10-JUL-1989. FEATURES from to/span description pept 3982 3988 adenylate kinase, exon 2 (first expressed exon) /nomgen="AK1" /map="9q34.1-q34.2" /hgml_locus_uid="LJ0150M" 5534 5569 adenylate kinase, exon 3 5742 5905 adenylate kinase, exon 4 6656 6772 adenylate kinase, exon 5 10075 10266 adenylate kinase, exon 6 10518 10571 adenylate kinase, exon 7 pre-msg 944 12188 AK1 mRNA and introns IVS 984 3947 AK1 intron A (no splice consensus) IVS 3989 5533 AK1 intron B IVS 5570 5741 AK1 intron C IVS 5906 6655 AK1 intron D IVS 6773 10074 AK1 intron E IVS 10267 10517 AK1 intron F (no splice consensus) signal 902 908 TATA box BASE COUNT 2417 A 3457 C 3877 G 2478 T ORIGIN Chromosome 9q34.1-q34.2. 1 TAGCCTATAA TACAAATTCC AACCCACCTC ATCTGGGGCT GCTGCCTGGG CTCCCATCCC 61 TGCCCGGCTA CATCACTGAG CACCTACTAC TATGTGCCAG TCTCCCTGCA AAACGCTGGA 121 TAAACACGTG GCTTTCTACC AGGGAACCTC CCGCAAGGTA TTTGACATGC TCGCCTCCCG 181 TTCTCTGCTG TGTCTAAGGA TTCACAGATG CGGCTGGAGT CTGCTGCTTA GCACAGTGAG 241 TCGTCATTAT GGGAGCTATT CTTCTTATCA AGACCGCAAA TCCCCTCTCT GGCTATCCAC 301 AGCCTTCTGA ATGTCCCGGG CAACTCCAGG GGAGCAGGGC TGGTTTTCTA TAATCTTCTA 361 CCTTACTGGG TGGTCTTTGG GGTCTGGCGG GTTCCAGTCC CAGAGGAGCT CGAAGGGTCC 421 CTCCAACAGG GGAAGAGGAG TCCCAGGTGG GTCCTGGCCA GGGCCTCTGG GCAGGCTCTG 481 AGGGCGGGCT TGGGGACTGT GGCCAGCAAA GCCCCTGAGT CGATGCCTCA GCCCTCTCGC 541 TCCCTGGTCT GGCCTCTCTC TGGCACCAAT GCGCTGTGGG ATTTTGCGGA AAGAGCCGCT 601 CTTCTCTGAG CCTCAGTTTC TCCAGCTATC ATAGGGAAAG CCTGGCCTTG TACCTTGAGC 661 ACAGTCGGGG TATCGCAATG GAAAACTCTT GGCAAACTGT AAAGTGTAGT TCGCGTGTGT 721 GTGGGCACAG CCACCTGGGG GTCCACGGTG CGGGGCACAC GGTGCGGGTG CGGTGTCGCC 781 GCGCACCCGG CTCGGGCTCG GTCCCGCCCC GCTTCCCGGT CCCTGGTCCG CTCTCCCTCC 841 CTCCTTCCCG CCCTCCCTGC CTTCGGGAAC GCCGGCTCCC GATGCCGCGC GCTGACAGCC 901 TTATAAATAG TCGCCTTTGC CGGCCGCCGC GAGGACGGGC AGGGCACGCA CTGGCCCCGG 961 CGCCCACCCG CACCCCTCCC CAGGTCAGTG CGTGCCCGCG CGTGTCTGGG GGGGCGCCTC 1021 TGCGGGGAGG GGCAGCGGCA CCGGGGGAGG GCGGGGGCGT CCTGGTGCGG GGCTCCCTGG 1081 GGGCTGTGCC GGCTGTGTAT CCGTGGTTGT GATCCGTGTG TGTTTGCGTG GCTGTGTTCC 1141 CCGGGCGCTG GGGATCTGTG CCTCGCTGCC TGTGCCTTTG TGGATCCGTG CGTGTGTGTG 1201 CGTGTGTGTT TGTGTGTGTT GCAGCCCCTT CCAGCCTCCG AGATAGTCAC CCCTTTGTTC 1261 TGGCTGGCAC TCCCGGGGCT CCAGGTCACT GCCCTTGGCA TGTCCCAGCT TGGGCCCAGA 1321 GAGGGAGAGC TGTGGCCTGG CTCCCTGCCC CAGAGGACCT GGACAGGGGC AGCCTCTGTG 1381 CTCCGTAGGC TATCGTGTCA CTCCACTTCT AGTGCTGGGT GCATTGGAGA ACACCCCCTG 1441 GATGCTCAGA GCTTGCCCTA GCCTCCTGAC CAGGGCCAGC GCTTGGGGCC TTCCCACAGC 1501 ACTGCTCAGT GGCCGCCTGG TACTCCTCAG CAGCAGCAGG ACAGGCCCAG AGAGGGGACA 1561 CCAACTGCCC CAAGGTCACA CAGCTGAGTG ATGGAGCTGG GATTCGAACC CTAGTCAGTT 1621 GCCACCCACT GGGAACCGAC TACGTTGGGT CAGAAGCTGG GAGACCTGGG CTTGCCTTGG 1681 CCTTGTCTTG TGATTGGGGT TAAGTCACCG TCCCTCTGGG CTTCGGCCCC ACAGGCCCCT 1741 TAGCAGACCA TCCAGTCATA TTAGGAGTCG CTGGGGCCTT GACCCAGGGG CCTTATAAGG 1801 CCTGGGTGCT TAGCCCGTAA CCCAACTCCA TCCTAGACCC CTGTCCTCTG TGAGGCCTCC 1861 TCACGACCCA AGTTATTTTG GATCAAGGAA GTGGCAGGTG CCATCAGGGC GGAGGGGGCT 1921 TAGATTACCA GGAGGGGGGC GGGGCAGGGC GGGGTCGAGG CCACTTCTCA GGGCTTCACT 1981 GTGGTTGTCC CTGGCCCAGG ACGCTCCCCC TCCCATGGGC AGCTGCTGAG GGGCACTTGG 2041 CAGCTCCTAC CCCCCTGAAG CACACTCAGC AGGAAAAGTG GCTGCCACTC CTGCCTGGGG 2101 CAGCAGGGGC ATCAGCAAAG CCACTTCAGC TGCTACTGGG TGCCAGGCCC AGCGTAGCAG 2161 AAACTCAGGG CCGAGTTGGA CCCTAGCTGG CCCTCGAACC AGTCTGTCCT CCTCCCCATG 2221 CCTCAGTTTC CTCCCAAGAT GGCAGTGAGG ATGGAACGAG GCTCCCCAGC CCTGTGCTCA 2281 GCCCATGCCA GGCCCAGTCA ATGGGATCCC AGGCCCGCAG CACAGAAGAG GGGCCCAGTA 2341 CAGCGTGGTT GTGCAGTGCA GGGCAGGTTT CCAGGAGGCA CCTGCCAGTG TGGGAGTGGA 2401 GGGCAGAGTC CAGGAAGACT TCCTGAAGGA GGTGGCACCT GAGCTGAGTC CCACACAATA 2461 GCAAGGTGGG GAAAGGCATC CAGGCATTGG GTACAACATC TGCCAGTGGC AGCTGATAGG 2521 GTTGCAGAGG GCAGCAGAGG TCAGCTCACC GGGACCTATC CACTATGTCA AGGGGCCCAG 2581 GCTTTGTCTG GGAGCCGAGA GGCCACACAT GCAAGGTTGT AAGCGGGTGT GTGATGTGGT 2641 CTGATACTTG CCAGAGAAAA TCCTGCAAAT TGCGTGGCAG GGTGTGGAGG CTTCAAGGCC 2701 CATGTGGAGG CGGGGGCAGT GGCCCAGGCC CCTTGCTTAG CACAGGACAT AGTGGTGAGG 2761 AGACAGACGG AGAGAGCCTG GGGAGGGGAG GGTTGGGCAG TGGGTGTGCA GGGCTGTGGA 2821 GAGGAGAGAG GAGCCTTCAC ACTGACTCAG CCTGTGGCTT GCCATAGGGG CTGGAGCTGG 2881 GACTTGGGGA GGGCTGTTCA GAGCCTGAGA AGCCTGTGGG GCTTCTGGGG CCTGTCTCGG 2941 GGGCATGGAT GTCTGGGGCT GGAGAGAAAG GCCTGGGCTG CCTGCACAGG GATGGGGGTC 3001 CTTGGAGCCT GAAGCCAAGT AGAGAGGGTG GTGTGGGTGC CTTGGGTGGA GGTCCCGGTG 3061 CGGTGGGGTC GCTGGCTGGA GGTCAAGGCT GGAGGGGAGT GTGGCTGTGG TGGTTGCGAG 3121 TGGCCTTGGG GAGGGCTGGT GGCTGGGAGT GAAGAAGTGG AGGCCAAAGG TGTCCAGAGG 3181 GCAGTTCTGA GATAGGGGGG CGAGGTAGGG GAGGTCCTGG GGGCTGTGGC AGCACAGGGT 3241 GGGGTCGCAG GTGTGATTGG GCTCATGGGC AGGCTGAGGG CAAGGTGGGG GCAGTAAAGC 3301 CAGGTCCTCA AAGGGTGGGG GTGCAGGGGA CATGCGTGGG GCTGGTTGTT GGGCCTGTAG 3361 AGGTGAAGAG CCTGGGACCG CCTCACTTTG ACAACTGTAA AGGGCTGTGT TCATGCGAGC 3421 GGCTCCTGTA GATTCCTTCA CCCCAGGCCT CTGTCCCTTA GAGGATCCCC ATCCAAACCT 3481 CATCCCAGAC ACACATAGTT CCTGGCCACC TCCCACCTCC TGGCAGCTTC CCTCTGGCCC 3541 TGGGGAGCGT TGTCAAGGTG TCTGGGCAGA CGCTGGCCCC GCGCCCTGTT TTCTGGGAGT 3601 TTCGAGGGGA GCAGCTCTGC AGTTGTGTGG CCGCTGTTTA TGTCCCTCGT GCCATCCTCA 3661 CTGTCCCAGC CCCTCCAGCG GGAACCCAGG TGGATGGGAG GGGGCTACTG CTTCTGTGAC 3721 AGGTGAGGCC ATGGAGGCCA GAGAGGAAAG GTGACTTTTC CAAGGTCACT TTCCCAAGGT 3781 GGCTTTCTCA GGGAAGGGGT CCTGGGCCTT CCCTGCCCCC AGCAGCCCCC TCTGCCCTCA 3841 GACTGCATGT CGGGCCTGGC ACACACCAGT TGCCTTGGGT GTGGCTTAGT GGAAGGTGAG 3901 TCAGGGTGAG GAGGAAAATC CCTCCCCAAC CTGTGCCCTT GTCTTACAGA GCACTGACAC 3961 GGCTCCCGGG ACCTCGGCAG GATGGAAGGT ATGAGCCTCC TCTCCTTTTG CCCATTGCCT 4021 GGGCCTCAGT GTGCCTGACT ATTAAGGGGG TGCGCTCTCA GGGAGACATT CTGGGAGGGG 4081 TTGGAGGAGG CTCTGGGGCT GCCGGAGGGG GCCTGGCTGG GAGCCTCCAG TATCCTCCTC 4141 TGGCCTCTCA CGGGATGTGC CAGCTCCAAG ATTAATTTTA GACGTGACGA GGCCTCCCCA 4201 AGGCTCCTGA GGGCCCCAGC GCACACCCCC TATGCTGGGT TTATAGAGCT GAGACCCTGC 4261 CCCATGGAGA GGCCACAGAT ACACAAAACA CCAGGTCTAA TGAGGACAAG GCAATTCCCC 4321 GTACCTCTCC GGGACTCAGT TTCCCTATCA TCCAGTGAGG GTGGGCCTTG AGGAAGTCTT 4381 GGACCCTCCT AGTTCCTATG GGCTGGGACT TAGAGGGAGA CACGGGCAGG CAGAGAGTCC 4441 AGAACGGGCC CGCCCTCAGC TGCTGGCAGC ACCATCCGTC CAGTTCCTGG CACTGTGCCT 4501 TGGCCTGGGC AGCTCTTAAA GACACCCAGC GGGCCTGCTG CTGCTGCTGC TGCTGCTCTG 4561 CCTGAGCTCC AAATATCAAC AGCCATAGGG CTGCGGCCAC TGCACCCCCG GGATAGGGGC 4621 CCACAGGAGG AGCCCTTCTT GACAGCTCCT GTAGGGCTCC AGCTTATGGT GAGGGCCCCT 4681 GGGAGCCGCC CAGCCACCTC CTCCCTGGGG AAGGGGCAGA GAACCAGAGA GGGGCAGGCC 4741 TTTCCTAAGG TCACACAGCA AGGTGGGATA CAGCCTGGTC CTCTATTTCC CTGGGAAGCG 4801 GGGGAGCCTG AAAGCCCAGT GGGGAGCTTG GGTTCTGGTT TTGGTTCTGA CATCCCTTGA 4861 GCAGGCCTTC CCTCTCTCGA GCTCAGTTTC CCTCTGCCCA GTGGGGCTGT GGCCTTGCTG 4921 GGATCTAACA GCAGGCAGGA CGAGACAGAG GTTGGAGCCA AGCTCACCGC CCTCTCTGGG 4981 CAAGAGTTCC ACATGCCCAC CACAGCCCTC CACCCCGGCA ACTGTCTCCA TGGCAACATA 5041 TGTTGTGGTT GCTAAGCCCC CACTGCCGGG GGAGGCGGGT GATGGGCCGG CCCAGGGCCT 5101 GAGAGGGGAA GGGCAGGGTG CACAGGAGAC ACGGCAGGAC GGGACCATGG GCTGCTGCTC 5161 CTCGAGTGAC CCCCGCAGAG AAGACGATCT GAGAGCCAGA GGTGACCGGC CCCGGGGTCC 5221 AGGCACCCAC AGGGAGGCGC CATGAGGGCT GGCGGCCCAG CCCGCTGGCT GTGTGGAGAG 5281 TACTTAGCCT TTAGGCTTTT GCCTCTGCCC GTCTGGCTGC CTGGAGCACC CTCCCTTCCC 5341 TGACCTCTCC CAGGCTCCGA GCTCACTCAC ATCCCTGCTG ACTCTGGGCT GTGACTTGTC 5401 TGCCTCCCTG ACAGGAGCGC CAGCGGTGTC TGTCCCTCCG CCTGGGGCCC AGCACAGGCC 5461 TGGGACTCCA GAGGTGCCTC TGGCTGTGGG TCCAGCGAGT GACTGAGCCC TGCTCCCCTT 5521 TTCTCTCCCG CAGAGAAGCT GAAGAAAACC AAGATCATCT TTGTGGTGGG TGAGTTGCGG 5581 GCAGGCGGGT GGTGCAGCAA GGGTCTTCAC TGGCACTGGA GGGAGCCCGG GCCTGGGGCT 5641 GCACCCTGGG CTCTGCCCCT CCCTCTCTGG GAGACCCTGG CAGCCCCTGT TCCTCCTGGG 5701 ACCCTGTGTC TCGGGGTTTA TGAATGGGTG GGCGTTTGCA GGTGGGCCTG GCTCAGGGAA 5761 GGGCACCCAG TGTGAGAAGA TCGTGCAGAA GTATGGCTAC ACCCACCTCT CCACCGGGGA 5821 CCTCCTGCGG TCCGAGGTCA GCTCAGGCTC GGCCAGGGGC AAGAAGCTGT CGGAAATCAT 5881 GGAGAAGGGG CAGCTGGTTC CACTGGTGAG TGGGCCCTGG TGGGGTGAGA GGCAGGGGAT 5941 GACAGTAGCC CTGAGTGGGT GTCCCACCAG CAGCAAAGCC CACGGCACAC AGGCAAAGCT 6001 GACCGTAACC GCCCCCGGCT CCAAACCCTT CTGTGGTTCC CCATTGCCCT GGACAGAAAG 6061 TTCCTTCCAG AAGCTCCCAA GACCTTGCAG GACTCTGCCC TGCCCCCAAC TCTCTCCCTT 6121 CCCTCCATCC TTCAGCCACC CTGGCCTCTG TCCCCTCAGA CTAGCCATAC TCCTTCCAGC 6181 CTGGAATATT CTTTCCTCTT CAGCTTTCAG ACTTCCCTAA CTCCCCAGAT TTGGCCTAAG 6241 ACCTCCCCCC CCAATTTTCT ACTCCAGAAG CCCCACTTCT CTGCCCGTAC AACCTGTGCC 6301 ATGCTTAGAA ATGGAGAAAT CATCTGTGGA ATGTGTCTGT CCCCTCACCA GCCCCTGAGC 6361 TCCCCAGGGC AGGGAAGGCC TAATTGGTCC ATGGCCGTGT CCTCGGGACC CAGGACAGCA 6421 CCTGGCTGGA AGGAGGGTCT CAGTGAATAC TGTTGAGTGA ATGCATCATT CATTCATTCA 6481 TTCACTCAGT CCAGGTCTCC ACAATTCCGA GGAAGCTTCC TGGAGGGGGA GGCTTGTTTG 6541 GGACATTTGT GTTGGGGAGA GGGCATTTCA GGTGGCAGGA GCAGCGTGAC GAGAGGCTGG 6601 GAGAAGGGAT GGGGAGGAGA GGCTCATGGC CCCTTCCTCC TTTGCTGTGC CCCAGGAGAC 6661 AGTGTTGGAC ATGCTCCGGG ATGCCATGGT GGCCAAAGTC AATACTTCCA AAGGCTTCCT 6721 GATTGATGGC TACCCGCGGG AGGTGCAGCA AGGAGAAGAG TTTGAGCGAC GGGTAAGGCA 6781 CTGACCCAAG TGGGGATCCT GGTGGGCTGG GGCAGGATAA GACTCCCATC CCCATGGGCG 6841 GGCCTGCGGG GGCTGTGATG CAGGTCAGAG TTCTGAGCAG AGGCTGATTC ATGGAGTGTG 6901 GGTGAGGAGG AAAACCAGGA AGACTTTCTG GAAGTGTAGA AGAAAATGTG TAGAAGGGGA 6961 AGCGCCAGGA GGGGCAGTTG CCAAACCTGG GCTCAGATTC CAGCTCAACG GCTTGAGATC 7021 TTGGGCCAGT CATTCTGCCT CCTGAACCTC AGTTTTCCCA TCTGCAAAAT GCAGGGAGTG 7081 GGAGGACCAG GTCACAGCTG CCTTAAGGGA CAGTGTGGGG ACTGGAGAGG ATGTGCCCTT 7141 CCAGTGTGGG CCTTGGCTCC CTATAAACAC ACAGAGGGGC AGGGAACAGC ACTGGCAGGG 7201 CTGGAAAGGC AAGGCCAGTG TGGGGAGAAC AGAATATGTG GGCAGGACAA CAAGGGAGGT 7261 TCCAGCCTGG CTGGAGGCTG CAAATGCCAT AAAGGGACAT CAGTCTCCTT CCAGGGGTCA 7321 TGAGGAGCCT GGCTGGAAAG AGCAGGCCTG GAAAATGATG GCCCTGAGCT TCCTAGCACG 7381 AGCATCCTGC AGATACACCT TCCCCAGGAT GGCACCCCAC ACACGCACAC ACGTGGACGC 7441 TTGCCACATA CACATGCACA AGCCCCACAA GTTCATACAC ACAAGGCCAG GGGCATGTGC 7501 ACACATCGAC ACACACGGCA GTGGATACTC ACTCCTGCAC ATGCATGTCA CGGACGCAGC 7561 ACACATATGC AGGAATGCAC ACGTGCTCGC AACATGGCCC TGTGCACACA CATACACTTT 7621 TACCTCGGTG TCACAAGGCT GTATGGGAGA AATAAACGGG ACTGCAAGGG CCTAGGAGGG 7681 AGAGGTCGGC AAGGGCTAGG GGGAGCCTAA CCAGCATCCT GGTGGGCTGG GGCAGGATAA 7741 GACTCCCATC CCCGTGCCCC ATCCCCATGC CCCATTCCCA TGCCCCGTCT CGTAGGGCCT 7801 TCTGGGCTTC CGTAGCTTCC AGCCTCATCT GGGGATAGGA GGCCATATTT TCTGACCTCC 7861 GAGTGGCTGC TGAGTGGCTG CAAGACCTCA GGAAAGTGTC TACCCCACTC TGCCCCTCAC 7921 CTTCCCTATC TGAAATGGGG GTTAGCTGGG ATTCCTTCAG ACCTCAGACC CCTGACCCTG 7981 TGTTAGGTGC CACAGTTCTG AAAGATGTCC TGGCCTGGCT CACACATACC AGCTGTGTGG 8041 TACCACCTGT GACCCTCAGG GCCACATCTA GAAAATAGAG TTAATGAAAG TCGGCCGGGT 8101 GCAGTGGCTC ACGCCTGAGA TCCTAGCACT TTGGGAGGCC AAAGAGGGCA GATCACTTGA 8161 GGTCAGGGTC AAAACCAGCC TGGCCAACAT GGTGAAAACT CATGTACTAA AAATACAAAA 8221 ATTAGCCAGG CATGGTGGCG GGCGCCTGTA ATCCCAGCTA CTCAGGAGGC TGAGACACGA 8281 GAATCATTTG AACCTGGCTG GACGGAGGTT GCAGTGAGCC AAGATCACGC CATTGCACTC 8341 AAGCCTGAAC AACAGAATGA GACTCCGTCT CAAAAAAAAA AAAAAAAAAA AGTCCCCACC 8401 CTTCCAGGGC TGTTGTAGGT GCCGCTGAGG GCCGCAGCTC AGCCCTGGTC CCATCCCTGC 8461 TGTGGCCTCC AGGTGTCCGG GCAGCTGGTC TTCTTGGCTC TGGACACCAG GGGGCGCCAG 8521 GGCTTCACCG AAGAGGGAGG GTCAGCCTGG AAGTGTGGCC GGCTCACACC CAAAGCTTTT 8581 CTGGAGACTG GCAGCTGCCG TTGTGGACTC CCCGGGCTAG ATGTCCATTG TCCTGACAGG 8641 GAGACTGAGG CCCCACAGCC AGAATCTCAA GCCCAGAGGC TGAGCTGGCT CTCCCAGATT 8701 TACCACCTCT GCTGGGTGTG TGTAAAGAGC CTACTGTGCT TGGTCCTGGA GACCCAGATG 8761 AACCAGAGAC TGTGAGTCTG TGCCCTGGAG GAGCTCAGCA GCTGATGGAT GATGATGGGG 8821 GGTGGGTGAC CTTGAAGTTG AGGTCAGACT GCTGCATCCA GGCCCAGCCT TTGCCCCCAT 8881 GGTAATCTGT AAGGTAGAAG CAGCCGCTGG GTGGTACGGA GTCATCTCCC TGAAGGCAGC 8941 TGACGCTGTC ACCCTGAATC CTTAACTACG GAGGAGCTAG GGATGAGCGC TAGGATGGGT 9001 ATCGCAGGCG AGGACAAAGG GCTTGCCCCA GGTCACACAG CTGAACAGAG GCAGGGGTGA 9061 GCCTGGGTCC AGGTCAGTCT GACGCCGGAG GTCAGGCTGC TTCTCGCCTC TGTCCAGCTG 9121 TGTGGCCTTG GACAAGCCTC TTGGTTACAC AGCTGGACAG AGGCACGAAA CAGCCTGACC 9181 TGAGAGGCAG TGCCCCAAGG CCTTGACAGG GGTCCAGGGC CAAACGTGCG CAGAAACCCT 9241 TGGGGGCTGG CCCAGGCAGT TGGAAGCAGG AGACAGGGTT TGAAGGTCCC AATCCCATTC 9301 CTAAAGCTGT GGCAGTCATG GGTGACTCTG GGGTGCTCTG CCCTCCCAAC CAGGCAGCTG 9361 TGGTAGCTGG ATTGAGCGAG GCAGGGGCTG CAGAACCTTA TTCCTCTTGC CTTTCCAAGG 9421 CGTCCCATGG ATGACATGGA TGACATGCTG TTCCAATGAC ACGGACATTG CTCAGAACCC 9481 CCCAGGACTC AGCAGCCCCT CACCCCTTCC CTCATCCCTT GTTCCTTTGT CAAAGGACCC 9541 CAGAATTGTG AGACAAGTGC CTCCCTGCCC TCAGGGCCCT TGGGAGACCT GCTTAAGGAA 9601 AGATAAAGGC GCTGGAAGAG CTGTTCCAGG AGGCGGGGAG TACTGGGGGG CGGGGGGTGG 9661 CATCAGAGAG ACTGCCCTGA GGAAGTGAGC TTCGCACTGA AACCTGATGC TTGGTATTGA 9721 GGTGAAGGCG AGTTAGGATC AGGCTCGGGG GAGAGGGCTC CTAGCAGAGG GAACAGCATG 9781 AGCACAGGTC TGGCATGGAA GGAGGTTGAT GAGAGCCTAG AGCAGAGAGA GTCAGGCTCT 9841 CTCAGTCTGG CAAGTAGTGT GGACTTTCTC CTGAGGGCAC TGGGGAGCCA TGGTGTAGGT 9901 TTGAGCAGGG GAGTGGCAGG AAAGATTGGT TTTAGGAAGG TCTTTCAGGC TGCCTTGGGG 9961 AAGGGATTGA AGGTCTGAGG GGGTATCTAC TGGGGCTTCA TAGATGGGAA GATGGGGAGG 10021 ACATGGCCCT AGGGGGCAGG AATGGGGTCC CTGTGACATG GTGCCCGCCT GCAGATTGGA 10081 CAGCCCACAC TGCTGCTGTA TGTGGACGCA GGCCCTGAGA CCATGACCCA GCGGCTCTTG 10141 AAACGTGGAG AGACCAGCGG GCGTGTGGAC GACAATGAGG AGACCATCAA AAAGCGGCTG 10201 GAGACCTATT ACAAGGCCAC AGAACCCGTC ATCGCCTTCT ATGAGAAACG TGGCATTGTG 10261 CGCAAGGTGG GCCCCGCGGG ACGGGGCAGC TCCGGGAAGA ACGGGTTCCT ATGTGGCTCC 10321 GCCTCAGCTC ACTGTGTGGC CTCAGGCCAG CCCCTCTCTC TGGGCCTTGG TTTCCCCACT 10381 GGTTCAATGA GGGGCTGCTG TCAATTAGGT TGGGCTCGGG GAGGCAGGCC TGGTCTCTGG 10441 GGCCTGCCAT GGTGAGGCCC ACCCCACTCC ACCTGGTGAC TCAGCCTCAC GGCCCATCTC 10501 CCCACAGGTC AACGCTGAGG GCTCCGTGGA CAGTGTCTTC TCCCAGGTCT GCACCCACCT 10561 GGACGCCCTA AAGTAGCAAC GCTGGAGCCG CTTCCCCAGC TCAGAGCCCC GCCCCACCCC 10621 GTCCTGATTA GAGGTCCTCC TGGCCTGAGC GCAGCGCCTC CACCCTGCCC TGCTGAGCAC 10681 AGACGGAGGA AGCCGCTTAT CCTGTTTTCA TGGACAGCTG AGCACTAAAG GAATTTCTAA 10741 GGACATTTGG TTTTACTGCT TTTTCTCTGC TTCCAGTTGG AGTTGATTCA TGTGCTTGTG 10801 CCTACCTGGC CGCAAGTCCC CAGCCCCTCA ACCCTCCGTT CCTCCTCAGC CTCCCTTTGC 10861 CAGCCACCCC TCCTCTAGCT CTGGTGGGAG GCCCGGGGCC CTTCCTCGCA CAGGGCATGC 10921 CTGGCCTGAG GACCCGGCGC TGAGTGGCGG GGCCCCTGCT CCGAGGGGCT CATGTTCAGG 10981 CAGAACCGGT CCCAGCCTGG GCTCCTCTGC ATCTTGCTCT GTGCCTTGGC CCTGACCCCC 11041 ATCGCTCTGA GCATATGTTC CATGCCTGCC TTGCCGGGGC CTGGACTGCA CAGGCAGCAA 11101 GGTCATGGTC TGAGTGGGGC TTCCTGGGCA GTTGGGGCGG CCCACGCCAG CTGGCCCAGT 11161 GGGTAGTGAA TTGGCTTCCT TGACGCGAGA GGCTCTGAGG GTCTGAAAAG GGCATCTCAA 11221 TGGCATGGGT GGGTGGGGAG TCAGTCATGT CACTGAAATT GAATGGGGGA GGCCCAATGA 11281 GGTGGCTCAT GCCTGTAATC CCAGCACTTT GGGAAGCTGA GGCAGGAGGA TCACCTGAGG 11341 TCAGGAGTTC GAGAGCAGCC TGGCCAACAT GGCAAAACCC CTTATTTACT AAAAATACAA 11401 AAACTTAGCC GGGCATGGTG GCATGTGCCT GTATTCCCAG CTACTCATGA GGCTGAGGCA 11461 GGAGAATGGC GTGAACCCGG GAGTGGAGCT TGCAATGAGC CAAGATTGCG CCACTGCACT 11521 CCAGCCTGGG TGACAGAGCA AGACTCCGTC TCAAAAAAAA AAAAAAAAAA AAAGAAATTA 11581 ATTGGGGGAG TCATGGTCAG GGGTGAGACC TGAAGGACCT CCCCCTGTGT GGCCCTGGAC 11641 ACAGCCCACC CTCTGTGAGC CGTTTCCAAT TCTAAAACAG ACTCAATGTC CCCCTCACCC 11701 CCACCTCAAG GTCAGGATGC GAACACACTG AGTGAGGAGT GGACGCTGTC CATTGCCACG 11761 GCCATGAGGG CTGGAGACCA GAACAGCATG GCCCGAAGCG TGCGGGGCCC CGGATGACTT 11821 GGGGACACCC CAGAATCCCC TGGGGAGAAC CCTTCCTGCG CGCTTTCATT TTTTGACCTC 11881 ATCACTGAGA AAGGCTCAAT TTGGTGCTCA CGTGTCCTTA ACACCTGATC TGGCCCAAGC 11941 TGCGTGCCCT TTAAGCCAAG AGAGCCTCTT GTGGACCCCG CCTGCCCGAA TGAAATCCGA 12001 ACAGTTGGGG CTGTTATGGC AAGTGGGGCT GGTTTTTCAT TTCCATTGGT TATTTAAAGT 12061 TTCCTTTAAA ATAAACGATT TTAAGTTATA AAAGGTGAAT CTATTGAAAG AAGAACATCA 12121 AAGAAATAAA CAGGAGTTCA GCGGAGTAGC AGAAGACAAG GCATGTAGGG GGAGCCATTC 12181 TGTCCCAGGG AAGTTGTGGA GGGTAGGGGC TGTGTGGAGG CCTCTGCAG // LOCUS HUMALB1 1286 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human albumin gene, exon 1 and 5' flank. ACCESSION M13075 KEYWORDS albumin; serum protein. SEGMENT 1 of 2 SOURCE Human fibroblast DNA, library of H.Hamada and T.Kakunaga, clones lambda-HAL-[3W,HA1]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1286) AUTHORS Urano,Y., Watanabe,K., Sakai,M. and Tamaoki,T. TITLE The human albumin gene: Characterization of the 5' and 3' flanking regions and the polymorphic gene transcripts JOURNAL J. Biol. Chem. 261, 3244-3251 (1986) STANDARD full staff_review FEATURES from to/span description pept 1092 / 1170 albumin exon 1 /nomgen="ALB" /map="4q11-q13" /hgml_locus_uid="LE0106K" pre-msg 1048 > 1286 albumin mRNA (alt.) pre-msg 1053 > 1286 albumin mRNA (alt.) IVS 1171 > 1286 albumin intron BASE COUNT 414 A 199 C 233 G 440 T ORIGIN 421 bp upstream of Dde1 site; Chromsome 4q11-q13. 1 GAGGTAGAGA TAAGAACCTT GGGTAGGGAA GGAAGATTTA TGAAATATTT AAAAAATTAT 61 TCTTCCTTCG CTTTGTTTTT AGACATAATG TTAAATTTAT TTTGAAATTT AAAGCAACAT 121 AAAAGAACAT GTGATTTTTC TACTTATTGA AAGAGAGAAA GGAAAAAAAT ATGAAACAGG 181 GATGGAAAGA ATCCTATGCC TGGTGAAGGT CAAGGGTTCT CATAACCTAC AGAGAATTTG 241 GGGTCAGCCT GTCCTATTGT ATATTATGGC AAAGATAATC ATCATCTCAT TTGGGTCCAT 301 TTTCCTCTCC ATCTCTGCTT AACTGAAGAT CCCATGAGAT ATACTCACAC TGAATCTAAA 361 TAGCCTATCT CAGGGCTTGA ATCACATGTG GGCCACAGCA GGAATGGGAA CATGGAATTT 421 CTAAGTCCTA TCTTACTTGT TATTGTTGCT ATGTCTTTTT CTTAGTTTGC ATCTGAGGCA 481 ACATCAGCTT TTTCAGACAG AATGGCTTTG GAATAGTAAA AAAGACACAG AAGCCCTAAA 541 ATATGTATGT ATGTATATGT GTGTGTGCAT GCGTGAGTAC TTGTGTGTAA ATTTTTCATT 601 ATCTATAGGT AAAAGCACAC TTGGAATTAG CAATAGATGC AATTTGGGAC TTAACTCTTT 661 CAGTATGTCT TATTTCTAAG CAAAGTATTT AGTTTGGTTA GTAATTACTA AACACTGAGA 721 ACTAAATTGC AAACTCCAAG AACTAAAATG TTCAAGTGGG AAATTACAGT TAAATACCAT 781 GGTAATGAAT AAAAGGTACA AATCGTTTAA ACTCTTATGT AAAATTTGAT AAGATGTTTT 841 ACACAACTTT AATACATTGA CAAGGTCTTG TGGAGAAAAC AGTTCCAGAT GGTAAATATA 901 CACAAGGGAT TTAGTCAAAC AATTTTTTGG CAAGAATATT ATGAATTTTG TAATCGGTTG 961 GCAGCCAATG AAATACAAAG ATGAGTCTAG TTAATAATCT ACAATTATTG GTTAAAGAAG 1021 TATATTAGTG CTAATTTCCC TCCGTTTGTC CTAGCTTTTC TCTTCTGTCA ACCCCACACG 1081 CCTTTGGCAC AATGAAGTGG GTAACCTTTA TTTCCCTTCT TTTTCTCTTT AGCTCGGCTT 1141 ATTCCAGGGG TGTGTTTCGT CGAGATGCAC GTAAGAAATC CATTTTTCTA TTGTTCAACT 1201 TTTATTCTAT TTTCCCAGTA AAATAAAGTT TTAGTAAACT CTGCATCTTT AAAGAATTAT 1261 TTTGGCATTT ATTTCTAAAA TGGCAT // LOCUS HUMALBAF2 336 bp ds-DNA PRI 02-MAY-1986 DEFINITION Human serum albumin gene, 3' flank. ACCESSION L00133 J00078 KEYWORDS albumin; serum albumin. SEGMENT 2 of 4 SOURCE Human liver, cDNA to mRNA, clone pHSA1 [1]; clones pHA36 [2]; hepatoma DNA, clone lambda-HAL-2W [3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 8 to 329) AUTHORS Lawn,R.M., Adelman,J., Bock,S.C., Franke,A.E., Houck,C.M., Najarian,R.C., Seeburg,P.H. and Wion,K.L. TITLE The sequence of human serum albumin cDNA and its expression in E. coli JOURNAL Nucleic Acids Res. 9, 6103-6114 (1981) STANDARD full staff_review REFERENCE 2 (bases 8 to 171) AUTHORS Dugaiczyk,A., Law,S.W. and Dennison,O.E. TITLE Nucleotide sequence and the encoded amino acids of human serum albumin mRNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 79, 71-75 (1982) STANDARD full staff_review REFERENCE 3 (bases 1 to 336) AUTHORS Urano,Y., Sakai,M., Watanabe,K. and Tamaoki,T. TITLE Tandem arrangement of the albumin and alpha-fetoprotein genes in the human genome JOURNAL Gene 32, 255-261 (1984) STANDARD full staff_review COMMENT Serum albumin, the major protein component in adult plasma, is produced in the liver; it helps maintain normal osmolarity in the bloodstream and functions as a carrier for numerous small molecules. There is extensive homology between the deduced structures of human serum albumin and mouse and human alpha-fetoprotein, indicating that these proteins are encoded in the same gene family. [2] examines the repeating homology in the triple-domain structure of the serum albumin molecule. Some of the conflicts between the two reported sequences may be assigned to human polymorphism, according to the authors of [2]. Two poly-A signals are found at positions 149-154 and 303-308. FEATURES from to/span description pre-msg < 1 169 alb mRNA (alt. poly-A site [2]) /nomgen="ALB" /map="4q11-q13" /hgml_locus_uid="LE0106K" pre-msg < 1 329 alb mRNA (alt. poly-A site [1]) IVS < 1 7 alb mRNA last intron 1896 in [2] variant 60 61 tt in [1],[2]; ttt in [3] variant 201 202 tg in [1]; t in [3] variant 312 312 c in [1]; t in [3] BASE COUNT 102 A 55 C 55 G 124 T ORIGIN About 7.5 kb after segment 1. 1 TTTTCAGCCT ACCATGAGAA TAAGAGAAAG AAAATGAAGA TCAAAAGCTT ATTCATCTGT 61 TTTCTTTTTC GTTGGTGTAA AGCCAACACC CTGTCTAAAA AACATAAATT TCTTTAATCA 121 TTTTGCCTCT TTTCTCTGTG CTTCAATTAA TAAAAAATGG AAAGAATCTA ATAGAGTGGT 181 ACAGCACTGT TATTTTTCAA AGATGTGTTG CTATCCTGAA AATTCTGTAG GTTCTGTGGA 241 AGTTCCAGTG TTCTCTCTTA TTCCACTTCG GTAGAGGATT TCTAGTTTCT GTGGGCTAAT 301 TAAATAAATC ACTAATACTC TTCTAAGTTA TGGATT // LOCUS HUMALBGC 19002 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human serum albumin gene, complete cds. ACCESSION M12523 J04457 KEYWORDS albumin. SOURCE Human DNA, (library T. Maniatis). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 19002) AUTHORS Minghetti,P.P., Ruffner,D.E., Kuang,W.-J., Dennison,O.E., Hawkins,J.W., Beattie,W.G. and Dugaiczyk,A. TITLE Molecular structure of the human albumin gene is revealed by nucleotide sequence within q11-22 of chromosome 4 JOURNAL J. Biol. Chem. 261, 6747-6757 (1986) STANDARD full staff_review REFERENCE 2 (bases 17688 to 17755; 18526 to 18555) AUTHORS Minchiotti,L., Galliano,M., Iadarola,P., Meloni,M.L., Ferri,G., Porta,F. and Castellani,A.A. TITLE The molecular defect in a COOH-terminal-modified and shortened mutant of serum albumin JOURNAL J. Biol. Chem. 264, 3385-3389 (1989) STANDARD full staff_entry COMMENT Computer-readable sequence in [1] was kindly provided by A.Dugaiczyk, 01-JUL-1986. Draft entry and printed copy of sequence for [2] kindly provided by L.Minchiotti, 09-DEC-1988. [2] describes a missplicing event in alooalbumin Venezia by which exon 14 is not translated. The protein translation goes from exon 13 to exon 15, which is normally in the 3' flanking region. The 3' end of this incorrectly translated and prematurely ended protein is hydrophilic instead of hydrophobic [2]. FEATURES from to/span description pept 1776 1854 albumin precursor, exon 1 /nomgen="PALB" /map="18q11.2-q12" /hgml_locus_uid="LC0122E" 2564 2621 albumin precursor, exon 2 4076 4208 albumin precursor, exon 3 6041 6252 albumin precursor, exon 4 6802 6934 albumin precursor, exon 5 7759 7856 albumin precursor, exon 6 9444 9573 albumin precursor, exon 7 10867 11081 albumin precursor, exon 8 12481 12613 albumin precursor, exon 9 13702 13799 albumin precursor, exon 10 14977 15115 albumin precursor, exon 11 15534 15757 albumin precursor, exon 12 16941 17073 albumin precursor, exon 13 17688 17732 albumin precursor, exon 14 sigp 1776 1829 albumin signal peptide matp 1848 1854 albumin, exon 1 2564 2621 albumin, exon 2 4076 4208 albumin, exon 3 6041 6252 albumin, exon 4 6802 6934 albumin, exon 5 7759 7856 albumin, exon 6 9444 9573 albumin, exon 7 10867 11081 albumin, exon 8 12481 12613 albumin, exon 9 13702 13799 albumin, exon 10 14977 15115 albumin, exon 11 15534 15757 albumin, exon 12 16941 17073 albumin, exon 13 17688 17729 albumin, exon 14 pept 1776 1854 alloalbumin Venezia precursor, exon 1 2564 2621 alloalbumin Venezia precursor, exon 2 4076 4208 alloalbumin Venezia precursor, exon 3 6041 6252 alloalbumin Venezia precursor, exon 4 6802 6934 alloalbumin Venezia precursor, exon 5 7759 7856 alloalbumin Venezia precursor, exon 6 9444 9573 alloalbumin Venezia precursor, exon 7 10867 11081 alloalbumin Venezia precursor, exon 8 12481 12613 alloalbumin Venezia precursor, exon 9 13702 13799 alloalbumin Venezia precursor, exon 10 14977 15115 alloalbumin Venezia precursor, exon 11 15534 15757 alloalbumin Venezia precursor, exon 12 16941 17073 alloalbumin Venezia precursor, exon 13 18526 18555 alloalbumin Venezia precursor, exon 14 sigp 1776 1829 alloalbumin Venezia signal peptide matp 1848 1854 alloalbumin Venezia, exon 1 2564 2621 alloalbumin Venezia, exon 2 4076 4208 alloalbumin Venezia, exon 3 6041 6252 alloalbumin Venezia, exon 4 6802 6934 alloalbumin Venezia, exon 5 7759 7856 alloalbumin Venezia, exon 6 9444 9573 alloalbumin Venezia, exon 7 10867 11081 alloalbumin Venezia, exon 8 12481 12613 alloalbumin Venezia, exon 9 13702 13799 alloalbumin Venezia, exon 10 14977 15115 alloalbumin Venezia, exon 11 15534 15757 alloalbumin Venezia, exon 12 16941 17073 alloalbumin Venezia, exon 13 18526 18552 alloalbumin Venezia, exon 14 pre-msg 1737 18688 alb mRNA (alt.) pre-msg 1737 18849 alb mRNA (alt.) IVS 1855 2563 alb intron A IVS 2622 4075 alb intron B IVS 4209 6040 alb intron C IVS 6253 6801 alb intron D IVS 6935 7758 alb intron E IVS 7857 9443 alb intron F IVS 9574 10866 alb intron G IVS 11082 12480 alb intron H IVS 12614 13701 alb intron I IVS 13800 14976 alb intron J IVS 15116 15533 alb intron K IVS 15758 16940 alb intron L IVS 17074 17687 alb intron M IVS 17756 18525 alb intron N IVS 17074 18525 alloalbumin Venezia intron M rpt 3041 3035 (c) direct repeat flanking Alu 1, 3' copy rpt 3334 3042 (c) Alu 1 repeat rpt 3341 3335 (c) direct repeat flanking Alu 1, 5' copy rpt 3421 3427 direct repeat flanking Alu 2, 5' copy rpt 3428 3722 Alu 2 repeat rpt 3725 3731 direct repeat flanking Alu 2, 3' copy rpt 9892 9885 (c) direct repeat flanking Alu 3, 3' copy rpt 10191 9893 (c) Alu 3 repeat rpt 10199 10192 (c) direct repeat flanking Alu 3, 5' copy rpt 11918 11906 (c) direct repeat flanking Alu 4, 3' copy rpt 12200 11921 (c) Alu 4 repeat rpt 12213 12201 (c) direct repeat flanking Alu 4, 5' copy rpt 15481 15297 (c) Alu 5 repeat BASE COUNT 5891 A 3215 C 3440 G 6456 T ORIGIN 629 bp upstream of SstI site; chromosome 4q11-22. 1 CCTTTCCCAG GGACTTCTAC AAGGAAAAAG CTAGAGTTGG TTACTGACTT CTAATAAATA 61 ATGCCTACAA TTTCTAGGAA GTTAAAAGTT GACATAATTT ATCCAAGAAA GAATTATTTT 121 CTTAACTTAG AATAGTTTCT TTTTTCTTTT CAGATGTAGG TTTTTCTGGC TTTAGAAAAA 181 ATGCTTGTTT TTCTTCAATG GAAAATAGGC ACACTTGTTT TATGTCTGTT CATCTGTAGT 241 CAGAAAGACA AGTCTGGTAT TTCCTTTCAG GACTCCCTTG AGTCATTAAA AAAAATCTTC 301 CTATCTATCT ATGTATCTAT CATCCATCTA GCTTTGATTT TTTCCTCTTC TGTGCTTTAT 361 TAGTTAATTA GTACCCATTT CTGAAGAAGA AATAACATAA GATTATAGAA AATAATTTCT 421 TTCATTGTAA GACTGAATAG AAAAAATTTT CTTTCATTAT AAGACTGAGT AGAAAAAATA 481 ATACTTTGTT AGTCTCTGTG CCTCTATGTG CCATGAGGAA ATTTGACTAC TGGTTTTGAC 541 TGACTGAGTT ATTTAATTAA GTAAAATAAC TGGCTTAGTA CTAATTATTG TTCTGTAGTA 601 TCAGAGAAAG TTGTTCTTCC TACTGGTTGA GCTCAGTAGT TCTTCATATT CTGAGCAAAA 661 GGGCAGAGGT AGGATAGCTT TTCTGAGGTA GAGATAAGAA CCTTGGGTAG GGAAGGAAGA 721 TTTATGAAAT ATTTAAAAAA TTATTCTTCC TTCGCTTTGT TTTTAGACAT AATGTTAAAT 781 TTATTTTGAA ATTTAAAGCA ACATAAAAGA ACATGTGATT TTTCTACTTA TTGAAAGAGA 841 GAAAGGAAAA AAATATGAAA CAGGGATGGA AAGAATCCTA TGCCTGGTGA AGGTCAAGGG 901 TTCTCATAAC CTACAGAGAA TTTGGGGTCA GCCTGTCCTA TTGTATATTA TGGCAAAGAT 961 AATCATCATC TCATTTGGGT CCATTTTCCT CTCCATCTCT GCTTAACTGA AGATCCCATG 1021 AGATATACTC ACACTGAATC TAAATAGCCT ATCTCAGGGC TTGAATCACA TGTGGGCCAC 1081 AGCAGGAATG GGAACATGGA ATTTCTAAGT CCTATCTTAC TTGTTATTGT TGCTATGTCT 1141 TTTTCTTAGT TTGCATCTGA GGCAACATCA GCTTTTTCAG ACAGAATGGC TTTGGAATAG 1201 TAAAAAAGAC ACAGAAGCCC TAAAATATGT ATGTATGTAT ATGTGTGTGT GCATGCGTGA 1261 GTACTTGTGT GTAAATTTTT CATTATCTAT AGGTAAAAGC ACACTTGGAA TTAGCAATAG 1321 ATGCAATTTG GGACTTAACT CTTTCAGTAT GTCTTATTTC TAAGCAAAGT ATTTAGTTTG 1381 GTTAGTAATT ACTAAACACT GAGAACTAAA TTGCAAACAC CAAGAACTAA AATGTTCAAG 1441 TGGGAAATTA CAGTTAAATA CCATGGTAAT GAATAAAAGG TACAAATCGT TTAAACTCTT 1501 ATGTAAAATT TGATAAGATG TTTTACACAA CTTTAATACA TTGACAAGGT CTTGTGGAGA 1561 AAACAGTTCC AGATGGTAAA TATACACAAG GGATTTAGTC AAACAATTTT TTGGCAAGAA 1621 TATTATGAAT TTTGTAATCG GTTGGCAGCC AATGAAATAC AAAGATGAGT CTAGTTAATA 1681 ATCTACAATT ATTGGTTAAA GAAGTATATT AGTGCTAATT TCCCTCCGTT TGTCCTAGCT 1741 TTTCTCTTCT GTCAACCCCA CACGCCTTTG GCACAATGAA GTGGGTAACC TTTATTTCCC 1801 TTCTTTTTCT CTTTAGCTCG GCTTATTCCA GGGGTGTGTT TCGTCGAGAT GCACGTAAGA 1861 AATCCATTTT TCTATTGTTC AACTTTTATT CTATTTTCCC AGTAAAATAA AGTTTTAGTA 1921 AACTCTGCAT CTTTAAAGAA TTATTTTGGC ATTTATTTCT AAAATGGCAT AGTATTTTGT 1981 ATTTGTGAAG TCTTACAAGG TTATCTTATT AATAAAATTC AAACATCCTA GGTAAAAAAA 2041 AAAAAAGGTC AGAATTGTTT AGTGACTGTA ATTTTCTTTT GCGCACTAAG GAAAGTGCAA 2101 AGTAACTTAG AGTGACTGAA ACTTCACAGA ATAGGGTTGA AGATTGAATT CATAACTATC 2161 CCAAAGACCT ATCCATTGCA CTATGCTTTA TTTAAAAACC ACAAAACCTG TGCTGTTGAT 2221 CTCATAAATA GAACTTGTAT TTATATTTAT TTTCATTTTA GTCTGTCTTC TTGGTTGCTG 2281 TTGATAGACA CTAAAAGAGT ATTAGATATT ATCTAAGTTT GAATATAAGG CTATAAATAT 2341 TTAATAATTT TTAAAATAGT ATTCTTGGTA ATTGAATTAT TCTTCTGTTT AAAGGCAGAA 2401 GAAATAATTG AACATCATCC TGAGTTTTTC TGTAGGAATC AGAGCCCAAT ATTTTGAAAC 2461 AAATGCATAA TCTAAGTCAA ATGGAAAGAA ATATAAAAAG TAACATTATT ACTTCTTGTT 2521 TTCTTCAGTA TTTAACAATC CTTTTTTTTC TTCCCTTGCC CAGACAAGAG TGAGGTTGCT 2581 CATCGGTTTA AAGATTTGGG AGAAGAAAAT TTCAAAGCCT TGTAAGTTAA AATATTGATG 2641 AATCAAATTT AATGTTTCTA ATAGTGTTGT TTATTATTCT AAAGTGCTTA TATTTCCTTG 2701 TCATCAGGGT TCAGATTCTA AAACAGTGCT GCCTCGTAGA GTTTTCTGCG TTGAGGAAGA 2761 TATTCTGTAT CTGGGCTATC CAATAAGGTA GTCACTGGTC ACATGGCTAT TGAGTACTTC 2821 AAATATGACA AGTGCAACTG AGAAACAAAA ACTTAAATTG TATTTAATTG TAGTTAATTT 2881 GAATGTATAT AGTCACATGT GGCTAATGGC TACTGTATTG GACAGTACAG CTCTGGAACT 2941 TGCTTGGTGG AAAGGACTTT AATATAGGTT TCCTTTGGTG GCTTACCCAC TAAATCTTCT 3001 TTACATAGCA AGCATTCCTG TGCTTAGTTG GGAATATTTA ATTTTTTTTT TTTTTTAAGA 3061 CAGGGTCTCG CTCTGTCGCC CAGGCTGGAG TGCAGTGGCG CAATCTCGGC TCACTGCAAA 3121 CTCCGCTCCC GGGTTCACGC CATTCTCCTG CCTCAGCCTC CCGAGTAGCT GGGACTACAG 3181 GCGCCCGCCA TCACGCCCGG CTAATCTTTT GTATTTTTAG TAGAGATGGG GTTTCACCGT 3241 GTGCCAGGAT GGTCTCAATC TCCTGACATC GTGATCTGCC CACCTCGGCC TCCCAAAGTG 3301 CTGGGATTAC AGGAGTGAGT CACCGCGCCC GGCCTATTTA AATGTTTTTT AATCTAGTAA 3361 AAAATGAGAA AATTGTTTTT TTAAAAGTCT ACCTAATCCT ACAGGCTAAT TAAAGACGTG 3421 TGTGGGGATC AGGTGCGGTG GTTCACACCT GTAATCCCAG CACTTTGGAA GGCTGATGCA 3481 GGAGGATTGC TTGAGCCCAG GAGTACAAGA CCAGCCTGGG CAAGTCTCTT TAAAAAAAAC 3541 AAAACAAACA AACAAAAAAA TTAGGCATGG TGGCACATGC CTGTAGTCCT AGCTACTTAG 3601 GAGGCTGACG TAGGAGGATC GTTTGGACCT GAGAGGTCAA GGCTACAGTG AGCCATGATT 3661 GTGCCACTGC ACTCCAGCCT GGGTGACAGA GTGAGACTCT GTCTCAAAAA AGAAAAAGGA 3721 AATCTGTGGG GTTTGTTTTA GTTTTAAGTA ATTCTAAGGA CTTTAAAAAT GCCTAGTCTT 3781 GACAATTAGA TCTATTTGGC ATACAATTTG CTTGCTTAAT CTATGTGTGT GCATAGATCT 3841 ACTGACACAC GCATACATAT AAACATTAGG GAACTACCAT TCTCTTTGCG TAGGAAGCCA 3901 CATATGCCTA TCTAGGCCTC AGATCATACC TGATATGAAT AGGCTTTCTG GATAATGGTG 3961 AAGAAGATGT ATAAAAGATA GAACCTATAC CCATACATGA TTTGTTCTCT AGCGTAGCAA 4021 CCTGTTACAT ATTAAAGTTT TATTATACTA CATTTTTCTA CATCCTTTGT TTCAGGGTGT 4081 TGATTGCCTT TGCTCAGTAT CTTCAGCAGT GTCCATTTGA AGATCATGTA AAATTAGTGA 4141 ATGAAGTAAC TGAATTTGCA AAAACATGTG TTGCTGATGA GTCAGCTGAA AATTGTGACA 4201 AATCACTTGT AAGTACATTC TAATTGTGGA GATTCTTTCT TCTGTTTGAA GTAATCCCAA 4261 GCATTTCAAA GGAATTTTTT TTAAGTTTTC TCAATTATTA TTAAGTGTCC TGATTTGTAA 4321 GAAACACTAA AAAGTTGCTC ATAGACTGAT AAGCCATTGT TTCTTTTGTG ATAGAGATGC 4381 TTTAGCTATG TCCACAGTTT TAAAATCATT TCTTTATTGA GACCAAACAC AACAGTCATG 4441 GTGTATTTAA ATGGCAATTT GTCATTTATA AACACCTCTT TTTAAAATTT GAGGTTTGGT 4501 TTCTTTTTGT AGAGGCTAAT AGGGATATGA TAGCATGTAT TTATTTATTT ATTTATCTTA 4561 TTTTATTATA GTAAGAACCC TTAACATGAG ATCTACCCTG TTATATTTTT AAGTGTACAA 4621 TCCATTATTG TTAACTACGG GTACACTGTT GTATAGCTTA CTCATCTTGC TGTATTAAAA 4681 CTTTGTGCCC ATTGATTAGT AACCCCTCGT TTCGTCCTCC CCCAGCCACT GGCAACCAGC 4741 ATTATACTCT TTGATTCTAT GAGTTTGACT ACTTTAGCTA CCTTATATAA GTGGTATTAT 4801 GTACTGTTTA TCTTTTTATG ACTGACTTAT TTCCCTTAGC ATAGTGCATT CAAAGTCCAA 4861 CCATGTTGTT GCCTATTGCA GAATTTCCTT CTTTTCAAGG CTGAATAATA TTCCAGTGCA 4921 TGTGTGTACC ACATTTTCTT TATCCATTAA TTTGTTGATT GATAGACATT TAGGTTGGTT 4981 TTCTACATCT TGACTATCAT GAATAGTGTT GCAATGAACA CAGGAGAGCT ACTATCTCTT 5041 AGAGATGATA TCATGGTTTT TATCATCAGA AAACACCCAC TGATTTCTAT GCTAATTTTG 5101 TTACCTGGGT GGAATAATAG TACAGCTATA TATTCCTCAT TTTAGATATC TTTGTATTTC 5161 TACATACAAT AAAAAAGCAG AGTACTTAGT CATGTTGAAG AACTTTAAAC TTTTAGTATT 5221 TCCAGATCAA TCTTCAAAAC AAGGACAGGT TTATCTTTCT CTCACCACTC AATCTATATA 5281 TACCTCTTGT GGGCAAGGCC AGTTTTTATC ACTGGAGCCT TTCCCCTTTT TATTATGTAC 5341 CTCTCCCTCA CAGCAGAGTC AGGACTTTAA CTTTACACAA TACTATGGCT CTACATATGA 5401 AATCTTAAAA ATACATAAAA ATTAATAAAT TCTGTCTAGA GTAGTATATT TTCCCTGGGG 5461 TTACGATTAC TTTCATAATA AAAATTAGAG ATAAGGAAAG GACTCATTTA TTGGAAAGTG 5521 ATTTTAGGTA ACATTTCTGG AAGAAAAATG TCTATATCTT AATAGTCACT TAATATATGA 5581 TGGATTGTGT TACTCCTCAG TTTTCAATGG CATATACTAA AACATGGCCC TCTAAAAAGG 5641 GGGCAAATGA AATGAGAAAC TCTCTGAATG TTTTTCTCCC CTAGGTGAAT TCACCTGCTG 5701 CTTAGAAGCT TATTTTCTCT TGATTTCTGT TATAATGATT GCTCTTACCC TTTAGTTTTA 5761 AGTTTCAAAA TAGGAGTCAT ATAACTTTCC TTAAAGCTAT TGACTGTCTT TTTGTCCTGT 5821 TTTATTCACC ATGAGTTATA GTGTGACAGT TAATTCTTAT GAAAATTATA TAGAGATGGT 5881 TAAATCATCA GAAACTGTAA ACCTCGATTG GGAGGGGAAG CGGATTTTTA AATGATTTCC 5941 TGACCAAGCT TAACCAGTAT ATTAAATCCT TTGTACTGTT CTTTGGCTAT AAAGAAAAAA 6001 GGTACTGTCC AGCAACTGAA ACCTGCTTTC TTCCATTTAG CATACCCTTT TTGGAGACAA 6061 ATTATGCACA GTTGCAACTC TTCGTGAAAC CTATGGTGAA ATGGCTGACT GCTGTGCAAA 6121 ACAAGAACCT GAGAGAAATG AATGCTTCTT GCAACACAAA GATGACAACC CAAACCTCCC 6181 CCGATTGGTG AGACCAGAGG TTGATGTGAT GTGCACTGCT TTTCATGACA ATGAAGAGAC 6241 ATTTTTGAAA AAGTAAGTAA TCAGATGTTT ATAGTTCAAA ATTAAAAAGC ATGGAGTAAC 6301 TCCATAGGCC AACACTCTAT AAAAATTACC ATAACAAAAA TATTTTCAAC ATTAAGACTT 6361 GGAAGTTTTG TTATGATGAT TTTTTAAAGA AGTAGTATTT GATACCACAA AATTCTACAC 6421 AGCAAAAAAT ATGATCAAAG ATATTTTGAA GTTTATTGAA ACAGGATACA ATCTTTCTGA 6481 AAAATTTAAG ATAGACAAAT TATTTAATGT ATTACGAAGA TATGTATATA TGGTTGTTAT 6541 AATTGATTTC GTTTTAGTCA GCAACATTAT ATTGCCAAAA TTTAACCATT TATGCACACA 6601 CACACACACA CACACACACT TAACCCTTTT TTCCACATAC TTAAAGAATG ACAGAGACAA 6661 GACCATCATG TGCAAATTGA GCTTAATTGG TTAATTAGAT ATCTTTGGAA TTTGGAGGTT 6721 CTGGGGAGAA TGTCGATTAC AATTATTTCT GTAATATTGT CTGCTATAGA AAAGTGACTG 6781 TTTTTCTTTT TCAAAATTTA GATACTTATA TGAAATTGCC AGAAGACATC CTTACTTTTA 6841 TGCCCCGGAA CTCCTTTTCT TTGCTAAAAG GTATAAAGCT GCTTTTACAG AATGTTGCCA 6901 AGCTGCTGAT AAAGCTGCCT GCCTGTTGCC AAAGGTATTA TGCAAAAGAA TAGAAAAAAA 6961 GAGTTCATTA TCCAACCTGA TTTTGTCCAT TTTGTGGCTA GATTTAGGGA ACCTGAGTGT 7021 CTGATACAAA CTTTCCGACA TGGTCAAAAA AGCCTTCCTT TTATCTGTCT TGAAAATCTT 7081 TCATCTTTGA AGGCCTACAC TCTCGTTTCT TCTTTTAAGA TTTGCCAATG ATGATCTGTC 7141 AGAGGTAATC ACTGTGCATG TGTTTAAAGA TTTCACCACT TTTTATGGTG GTGATCACTA 7201 TAGTGAAATA CTGAAACTTG TTTGTCAAAT TGCACAGCAA GGGGACACAG TTCTTGTTTA 7261 TCTTTTCATG ATAATTTTTA GTAGGGAGGG AATTCAAAGT AGAGAATTTT ACTGCATCTA 7321 GATGCCTGAG TTCATGCATT CATTCCATAA ATATATATTA TGGAATGCTT TATTTTCTTT 7381 TCTGAGGAGT TTACTGATGT TGGTGGAGGA GAGACTGAAA TGAATTATAC ACAAAATTTA 7441 AAAATTAGCA AAATTGCAGC CCCTGGGATA TTAGCGTACT CTTTCTCTGA CTTTTCTCCC 7501 ACTTTTAAGG CTCTTTTTCC TGGCAATGTT TCCAGTTGGT TTCTAACTAC ATAGGGAATT 7561 CCGCTGTGAC CAGAATGATC GAATGATCTT TCCTTTTCTT AGAGAGCAAA ATCATTATTC 7621 GCTAAAGGGA GTACTTGGGA ATTTAGGCAT AAATTATGCC TTCAAAATTT AATTTGGCAC 7681 AGTCTCATCT GAGCTTATGG AGGGGTGTTT CATGTAGAAT TTTTCTTCTA ATTTTCATCA 7741 AATTATTCCT TTTTGTAGCT CGATGAACTT CGGGATGAAG GGAAGGCTTC GTCTGCCAAA 7801 CAGAGACTCA AGTGTGCCAG TCTCCAAAAA TTTGGAGAAA GAGCTTTCAA AGCATGGTAA 7861 ATACTTTTAA ACATAGTTGG CATCTTTATA ACGATGTAAA TGATAATGCT TCAGTGACAA 7921 ATTGTACATT TTTATGTATT TTGCAAAGTG CTGTCAAATA CATTTCTTTG GTTGTCTAAC 7981 AGGTAGAACT CTAATAGAGG TAAAAATCAG AATATCAATG ACAATTTGAC ATTATTTTTA 8041 ATCTTTTCTT TTCTAAATAG TTGAATAATT TAGAGGACGC TGTCCTTTTT GTCCTAAAAA 8101 AAGGGACAGA TATTTAAGTT CTATTTATTT ATAAAATCTT GGACTCTTAT TCTAATGGTT 8161 CATTATTTTT ATAGAGCTGT AGGCATGGTT CTTTATTTAA TTTTTTAAAG TTATTTTTAA 8221 TTTTTGTGGA TACAGAGTAG GTATACATAT TTACGGGGTA TATGAGATAT TTTGATATAA 8281 GTATACAACA TATATAATCC CTTTATTTAA TTTTATCTTC CCCCCAATGA TCTAAAACTA 8341 TTTGCTTGTC CTTTTATGTC TTATAGTTAA ATTCAGTCAC CAACTAAGTT GAAGTTACTT 8401 CTTATTTTTG CATAGCTCCA GCTCTGATCT TCATCTCATG TTTTTGCCTG AGCCTCTGTT 8461 TTCATATTAC TTAGTTGGTT CTGGGAGCAT ACTTTAATAG CCGAGTCAAG AAAAATACTA 8521 GCTGCCCCGT CACCCACACT CCTCACCTGC TAGTCAACAG CAAATCAACA CAACAGGAAA 8581 TAAAATGAAA ATAATAGACA TTATGCATGC TCTCTAGAAA CTGTCAATTG AACTGTATTT 8641 GCTCATCATT CCTACCATCT ACACCACCAA AATCAACCAA ATTTATGAAA AAAAAACAGC 8701 CCCAACATAA AATTATACAC AGATAAACAG GCTATGATTG GTTTTGGGAA AGAAGTCACC 8761 TTTACCTGAT TTAGGCAACT GTGAAATGAC TAGAGAATGA AGAAAATTAG ACGTTTACAT 8821 CTTGTCATAG AGTTTGAAGA TAGTGCTGGA TCTTTCTTTT TATAAGTAAG ATCAATAAAA 8881 ACTCCCTCAT TCTGTAGAAG TTATGATTTC TTTTCTAAGA GACCTTTAGA AGTCAGAAAA 8941 AATGTGTTTC AATTGAGAAA AAAGATAACT GGAGTTTGTG TAGTACTTCC CAGATTATAA 9001 AATGCTTTTG TATGTATTAT CTAATTTAAT CCTCAAAACT TCTTCAATTT AGCATGTTGT 9061 CATGACACTG CAGAGGCTGA AGCTCAGAGA CGCTGAGCCC TCTGCTAACA AGTCCTACTG 9121 CTAACAAGTG ATAAAGCCAG AGCTGGAAGT CACATCTGGA CTCCAAACCT GATGCTTCTC 9181 AGCCTGTTGC CCCTTTTAGA GTTCCTTTTT AATTTCTGCT TTTATGACTT GCTAGATTTC 9241 TACCTACCAC ACACACTCTT AAATGGATAA TTCTGCCCTA AGGATAAGTG ATTACCATTT 9301 GGTTCAGAAC TAGAACTAAT GAATTTTAAA AATTATTTCT GTATGTCCAT TTTGAATTTT 9361 CTTATGAGAA ATAGTATTTG CCTAGTGTTT TCATATAAAA TATCGCATGA TAATACCATT 9421 TTGATTGGCG ATTTTCTTTT TAGGGCAGTA GCTCGCCTGA GCCAGAGATT TCCCAAAGCT 9481 GAGTTTGCAG AAGTTTCCAA GTTAGTGACA GATCTTACCA AAGTCCACAC GGAATGCTGC 9541 CATGGAGATC TGCTTGAATG TGCTGATGAC AGGGTAAAGA GTCGTCGATA TGCTTTTTGG 9601 TAGCTTGCAT GCTCAAGTTG GTAGAATGGA TGCGTTTGGT ATCATTGGTG ATAGCTGACA 9661 GTGGGTTGAG ATTGTCTTCT GTGCTTTCGT CTGTCCTATC TTCAATCTTT CCCTGCCTAT 9721 GGTGGTGGTA CCTTTCTGTT TTTAACCTGC TATAAATTAC CAGATAAACC CATTCACTGA 9781 TTTGTAACTC CTTTCAGTCA TGCTCTAACT GTAAATGAAG GCTTAAACTG AAGTAGAACA 9841 GTTACAAGGT TTTACTTGGC AGAACATCTT GCAAGGTAGA TGTCTAAGAA GATTTTTTTT 9901 TCTTTTTTTA AGACAGAGTT TCGCTCTTGT TTCCCAGGCT GGGGTGCAAT GGTGTGATCT 9961 TGGCTCAGCG CAACCTCTGC CTCCTGGGTT CAAGTGATTT TCATGCCTCA GCCTCCCAAG 10021 TAGCTGGGAT TACAGGCATG CGCCACCACA CCTGGCTAAT TTTGTATTTT TAGTAGAGGC 10081 GGGGTTTCAC CATATTGTCC AGACTGGTCT CGAACTCCTG ACCTCAGGTG ATCCACCCGC 10141 CTTGGCCTCC CAAAGTGCTG GGATTACAGG CATGAGCCAC CTTGCCCAGC CTAAGAAGAT 10201 TTTTTGAGGG AGGTAGGTGG ACTTGGAGAA GGTCACTACT TGAAGAGATT TTTGGAAATG 10261 ATGTATTTTT CTTCTCTATA TTCCTTCCCT TAATTAACTC TGTTTGTTAG ATGTGCAAAT 10321 ATTTGGAATG ATATCTCTTT TCTCAAAACT TATAATATTT TCTTTCTCCC TTTCTTCAAG 10381 ATTAAACTTA TGGGCAAATA CTAGAATCCT AATCTCTCAT GGCACTTTCT GGAAAATTTA 10441 AGGCGGTTAT TTTATATATG TAAGCAGGGC CTATGACTAT GATCTTGACT CATTTTTCAA 10501 AAATCTTCTA TATTTTATTT AGTTATTTGG TTTCAAAAGG CCTGCACTTA ATTTTGGGGG 10561 ATTATTTGGA AAAACAGCAT TGAGTTTTAA TGAAAAAAAC TTAAATGCCC TAACAGTAGA 10621 AACATAAAAT TAATAAATAA CTGAGCTGAG CACCTGCTAC TGATTAGTCT ATTTTAATTA 10681 AGTGGGAATG TTTTTGTAGT CCTATCTACA TCTCCAGGTT TAGGAGCAAA CAGAGTATGT 10741 TCATAGAAGG AATATGTGTA TGGTCTTAGA ATACAATGAA CATGTTCTGC CAACTTAATA 10801 AAGGTCTGAG GAGAAAGTGT AGCAATGTCA ATTCGTGTTG AACAATTTCC ACCAACTTAC 10861 TTATAGGCGG ACCTTGCCAA GTATATCTGT GAAAATCAAG ATTCGATCTC CAGTAAACTG 10921 AAGGAATGCT GTGAAAAACC TCTGTTGGAA AAATCCCACT GCATTGCCGA AGTGGAAAAT 10981 GATGAGATGC CTGCTGACTT GCCTTCATTA GCTGCTGATT TTGTTGAAAG TAAGGATGTT 11041 TGCAAAAACT ATGCTGAGGC AAAGGATGTC TTCCTGGGCA TGTAAGTAGA TAAGAAATTA 11101 TTCTTTTATA GCTTTGGCAT GACCTCACAA CTTAGGAGGA TAGCCTAGGC TTTTCTGTGG 11161 AGTTGCTACA ATTTCCCTGC TGCCCAGAAT GTTTCTTCAT CCTTCCCTTT CCCAGGCTTT 11221 AACAATTTTT GAAATAGTTA ATTAGTTGAA TACATTGTCA TAAAATAATA CATGTTCACG 11281 GCAAAGCTCA ACATTCCTTA CTCCTTAGGG GTATTTCTGA AAATACGTCT AGAAACATTT 11341 TGTGTATATA TAAATTATGT ATACTTCAGT CATTCATTCC AAGTGTATTT CTTGAACATC 11401 TATAATATAT GTGTGTGACT ATGTATTGCC TGTCTATCTA ACTAATCTAA TCTAATCTAG 11461 TCTATCTATC TAATCTATGC AATGATAGCA AAGAAGTATA AAAAGAAATA TAGAGTCTGA 11521 CACAGGTGCT TTATATTTGG TGAAAAGACC AGAAGTTCAG TATAATGGCA ATATGGTAGG 11581 CAACTCAATT ACAAAATAAA TGTTTACGTA TTGTCAGAAG TTGTGGTGAT AAACTGCATT 11641 TTTGTTGTTG GATTATGATA ATGCACTAAA TAATATTTCC TAAAATTATG TACCCTACAA 11701 GATTTCACTC ATACAGAGAA GAAAGAGAAT ATTTTAAGAA CATATCTCTG CCCATCTATT 11761 TATCAGAATC CTTTTGAGAT GTAGTTTAAA TCAAACAAAA TGTTAATAAA AATAACAAGT 11821 ATCATTCATC AAAGACTTCA TATGTGCCAA GCAGTGTGTG CTTTGTGTAG ATTATGTCAT 11881 ATAGTTCTCA TAATCCACCT TCCGAGACAG ATACTATTTA TTTTTTGAGA CAGAGTTTTA 11941 CTCTTGTTGC CCAGGCTGGA GTGCAATGGT GCCATCTCGG CTCACCACAA CCTTCGCCTC 12001 CCAGGTTCAA GCGATTCTCC TGCCTCAGCC TCCTGGGATT ACAGGCATGC ACCACCATGC 12061 CTGGCTAATT TTGTATTTTT AGTAGAGATG GGGTTTCACC ATGTTGGTCA GACTGGTCTC 12121 AAACTCCTGA CCTCTGGTGA TATGCCTGCC TCAGCCTCCT AAAGTGCTGG GATTACAGGC 12181 ATGAGCCACT GTGCCCAGCC GACAGATACT ATTATTATTT CCATTCTACC GAGAAGGAGA 12241 CTAAGGCTCT GATCATTTAA ATAAGTTGCC TAAGGTGATG CAGTGATATA AGTAGCAGAG 12301 CTAGGAATTG AGCCTTGGTA ACTTTAACTC TGGACCCCAA GTCCTTAGCT ACTAAGCTTT 12361 ACTGCATGGG GTTTAGTCAA ATTAAGACTT TTGGAATATG AGTTACTTTT GAGATTAGCT 12421 TTGTGATATT TTTTGTGCTC ATTTGTCCAA CAAAGTCTAT TTTATTTTCA TCTTAATTAG 12481 GTTTTTGTAT GAATATGCAA GAAGGCATCC TGATTACTCT GTCGTGCTGC TGCTGAGACT 12541 TGCCAAGACA TATGAAACCA CTCTAGAGAA GTGCTGTGCC GCTGCAGATC CTCATGAATG 12601 CTATGCCAAA GTGGTAGGTT TATTGTTGGA AAAAAATGTA GTTCTTTGAC TGATGATTCC 12661 AATAATGAGA AAGAAAAATA ATGCAAGAAT GTAAAATGAT ATACAGTGCA ATTTAGATCT 12721 TTTCTTGAGA TGGTTTCAAT TCTGGAATCT TAAACATGAA AGAAAAAGTA GCCTTAGAAT 12781 GATTAACAAA ATTTAGACTA GTTAGAATAG AAAGATCTGA ATAGAGCAAT CTCTAAAAAA 12841 TTTTGATCTT TTTTTCTCTT TTTCACAATC CTGAGAACAA AAAAAAATTA AATTTAAATG 12901 TTAATTAGAA GATATTTAAC TTAGATGTAA AGTGAGTTAA CCTGATTCCA GGATTAATCA 12961 AGTACTAGAA TTAGTATCTT ATGGCAAATT ATAGAACCTA TCCCTTTAGA ATATTTTCAA 13021 ATCTTTTTGA GGATGTTTAG GAATAGTTTT ACAAGAAATT AAGTTAGGAG AGGAAATCTG 13081 TTCTGGAGGA TTTTTAGGGT TCCCACTAGC ATATGTAATG GTTTCTGAAC TATTCAGAAT 13141 CAGAGAAAAC TCATTTTTCC TGCTTTCAAG AAGCTACTGT ATGCCAGGCA CCATGCACAA 13201 ACAATGACCA ACGTAAAATC TCTCATTTTG GAGAGCCTGG AATCTAACTG GAAAGGTGAA 13261 CTAATAATAA TAATATGTAC AATCATAGCC ATCATTTATT AAACTTTTAT TATATGCAAG 13321 GCACTGTTTA ATTTCATTAG CTTACCTGGT TTACAGAGCA GCTCTATGAG ATGAGTGCCA 13381 TCTTTGCCCC TATTTTAGGG ATAAGGATTC CGAAATGTGG AGATGGTAAG TAAAATTGCA 13441 CAACTGAAGA ATGAGTTACA TGACTTGGCT CAAATACTGG TCATTGAACT CCAGAGCCTG 13501 AATATTCTTA ACCACTTACA TGATGCAAGC TCACCAAATA AATAGTTCGA ATGTATTGTG 13561 ACAGAGCGGC ATTGATATTC ATCTATTCAT GTGGCTTTGA GTAGGAAGAA GAAAGGATAT 13621 CATTCTGACC AGAGGGGTGA AAAACAACCT GCATCTGATC CTGAGGCATA ATACTATTAA 13681 CACAATTCTT TTATGTTTCA GTTCGATGAA TTTAAACCTC TTGTGGAAGA GCCTCAGAAT 13741 TTAATCAAAC AAAATTGTGA GCTTTTTGAG CAGCTTGGAG AGTACAAATT CCAGAATGCG 13801 TAAGTAATTT TTATTGACTG ATTTTTTTTA TCAATTTGTA ATTATTTAAG ACTTAATATA 13861 TGAGCCACCT AGCATAGAAC TTTTAAGAAT GAAAATACAT TGCATATTTC TAATCACTCT 13921 TTGTCAAGAA AGATAGGAGA GGAGAGATAA AATAGTTGAT GGGGTGGAGA GGTCTATATT 13981 TGAATGTAGT CTAAAAATTG TTCTCTTAAG ATTGGAAGTA TGTAGGCTGG GAGGGTAAAT 14041 ACCAAATCTT GGTATATCAG AACTGAGCAT GTCCCTTGAA GGTTAAGAAA TAGTTAATGG 14101 GCAAATAGAG CATGGCAATA TTTTGTAGAG CAGCAAGTAG TAGGCCTTGA ATAGATGTCG 14161 CTCAAAAAGT AATATGTAAG CTGAACACAA AAATGTAACA AATGAATTTA GATACATATT 14221 TGAATATTAA ATTCAGGTTG TTTGGGAGAT GCACCTAGTC TTTGATGGTT AAACCTTTCC 14281 CTCCATAGAA GAGACAGAGA CAGAATGGCT TGCTGGACTA ATGTCCCAAT TCAATAGAGT 14341 CTTATCTACG AAGGTTAAAA ACAAGAAGAG ACATATTATA CAGTAGATAT TTATTGTGTG 14401 GCTCATACAC ATGGTGCTCT TCTGATTATG GATTTTAGAG ATAATAACAG TGAACAAGAC 14461 ATAGTTTCTT TCCTCGAGTA GATTAAAGTC ATACATTGAC TTTTAATGGT GACTGGCATT 14521 CTTAATACAT GATTATTATA TATTAGGTAC CATGTCAGAT TAATTATAAT ACTTTACTAT 14581 TTTTAATTTA ACCCTTGAAC TATCCCTATT GAGTCAGATA TATTTCCTTC CATTTTCTAC 14641 TTGTATCTTT CAAGTTTAGC ATATGCTGAT ACATATGAAG CTCTCTCCAG GTTTTATTGA 14701 AAGAAGAAAT TAATAAATTT ATTAATGTCA CTGAATTAGG CAACTCACTT TCCCAAGATT 14761 ATGCAAGTGG TACAGGTGGA ACTCAAAGCC AAGTTTAACT AGTTGTTCAG GAGAATGTTT 14821 TCTACCCTCC ACTAACCCAC TACTCTGCAG ATGGAGATAA TATGATGAAT GGAACATAGC 14881 AACATCTTAG TTGATTCCGG CCAAGTGTTC TCTGTTTTAT CTACTATGTT AGACAGTTTC 14941 TTGCCTTGCT GAAAACACAT GACTTCTTTT TTTCAGGCTA TTAGTTCGTT ACACCAAGAA 15001 AGTACCCCAA GTGTCAACTC CAACTCTTGT AGAGGTCTCA AGAAACCTAG GAAAAGTGGG 15061 CAGCAAATGT TGTAAACATC CTGAAGCAAA AAGAATGCCC TGTGCAGAAG ACTATGTGAG 15121 TCTTTAAAAA AATATAATAA ATTAATAATG AAAAAATTTT ACCTTTAGAT ATTGATAATG 15181 CTAGCTTTCA TAAGCAGAAG GAAGTAATGT GTGTGTGTGC ATGTTTGTGT GCATGTGTGT 15241 GTGCATGCAC GTGTGTGTAT GTGTGATATT GGCAGTCAAG GCCCCGAGGA TGATAATTTT 15301 TTTTTTTTTT TTGAGACGGA GTCTCGCTTT GTTGTCCAGG CTGGAGTGCA GTGGTGCCAT 15361 CTCGGCTCAC TGCAACCTCC GCCTCCCAAG TTCAAGCCAT TCTCCTGCCT CAGCCTCCCA 15421 AGTAGCTGGG ACTACAGGTG CATGCCACCA TGCCTGGCTA ATTTTTTGTA TTTTTAGTAG 15481 AAAATTTTCA GCTTCACCTC TTTTGAATTT CTGCTCTCCT GCCTGTTCTT TAGCTATCCG 15541 TGGTCCTGAA CCAGTTATGT GTGTTGCATG AGAAAACGCC AGTAAGTGAC AGAGTCACCA 15601 AATGCTGCAC AGAATCCTTG GTGAACAGGC GACCATGCTT TTCAGCTCTG GAAGTCGATG 15661 AAACATACGT TCCCAAAGAG TTTAATGCTG AAACATTCAC CTTCCATGCA GATATATGCA 15721 CACTTTCTGA GAAGGAGAGA CAAATCAAGA AACAAACGTG AGGAGTATTT CATTACTGCA 15781 TGTGTTTGTA GTCTTGATAG CAAGAACTGT CAATTCAAGC TAGCAACTTT TTCCTGAAGT 15841 AGTGATTATA TTTCTTAGAG GAAAGTATTG GAGTGTTGCC CTTATTATGC TGATAAGAGT 15901 ACCCAGAATA AAATGAATAA CTTTTTAAAG ACAAAATCCT CTGTTATAAT ATTGCTAAAA 15961 TTATTCAGAG TAATATTGTG GATTAAAGCC ACAATAGAAT AACATGTTAG ACCATATTCA 16021 GTAGAAAAAG ATGAACAATT AACTGATAAA TTTGTGCACA TGGCAAATTA GTTAATGGGA 16081 ACCATAGGAG AATTTATTTC TAGATGTAAA TAATTATTTT AAGTTTGCCC TATGGTGGCC 16141 CCACACATGA GACAAACCCC CAAGATGTGA CTTTTGAGAA TGAGACTTGG ATAAAAAACA 16201 TGTAGAAATG CAAGCCCTGA AGCTCAACTC CCTATTGCTA TCACAGGGGT TATAATTGCA 16261 TAAAATTTAG CTATAGAAAG TTGCTGTCAT CTCTTGTGGG CTGTAATCAT CGTCTAGGCT 16321 TAAGAGTAAT ATTGCAAAAC CTGTCATGCC CACACAAATC TCTCCCTGGC ATTGTTGTCT 16381 TTGCAGATGT CAGTGAAAGA GAACCAGCAG CTCCCATGAG TTTGGATAGC CTTATTTTCT 16441 ATAGCCTCCC CACTGAAGGG AGCAAAGTTT AAGAACCAAA TATAAAGTTT CTCATCTTTA 16501 TAGATGAGAA AAATTTTAAA TAAAGTCCAA GATAATTAAA TTTTTAAGGA TCATTTTTAG 16561 CTCTTTAATA GCAATAAAAC TCAATATGAC ATAATATGGC ACTTCCAAAA TCTGAATAAT 16621 ATATAATTGC AATGACATAC TTCTTTTCAG AGATTTACTG AAAAGAAATT TGTTGACACT 16681 ACATAACGTG ATGAGTGGTT TATACTGATT GTTTCAGTTG GTCTTCCCAC CAACTCCATG 16741 AAAGTGGATT TTATTATCCT CATCATGCAG ATGAGAATAT TGAGACTTAT AGCGGTATGC 16801 CTGGCCCAAG TACTCAGAGT TGCCTGGCTC CAAGATTTAT AATCTTAAAT GATGGGACTA 16861 CCATCCTTAC TCTCTCCATT TTTCTATACG TGAGTAATGT TTTTTCTGTT TTTTTTTTTT 16921 CTTTTTCCAT TCAAACTCAG TGCACTTGTT GAGCTCGTGA AACACAAGCC CAAGGCAACA 16981 AAAGAGCAAC TGAAAGCTGT TATGGATGAT TTCGCAGCTT TTGTAGAGAA GTGCTGCAAG 17041 GCTGACGATA AGGAGACCTG CTTTGCCGAG GAGGTACTAC AGTTCTCTTC ATTTTAATAT 17101 GTCCAGTATT CATTTTTGCA TGTTTGGTTA GGCTAGGGCT TAGGGATTTA TATATCAAAG 17161 GAGGCTTTGT ACATGTGGGA CAGGGATCTT ATTTTACAAA CAATTGTCTT ACAAAATGAA 17221 TAAAACAGCA CTTTGTTTTT ATCTCCTGCT CTATTGTGCC ATACTGTTGA ATGTTTATAA 17281 TGCATGTTCT GTTTCCAAAT TTGTGATGCT TATGAATATT AATAGGAATA TTTGTAAGGC 17341 CTGAAATATT TTGATCATGA AATCAAAACA TTAATTTATT TAAACATTTA CTTGAAATGT 17401 GGTGGTTTGT GATTTAGTTG ATTTTATAGG CTAGTGGGAG AATTTACATT CAAATGTCTA 17461 AATCACTTAA AATTTCCCTT TATGGCCTGA CAGTAACTTT TTTTTATTCA TTTGGGGACA 17521 ACTATGTCCG TGAGCTTCCA TCCAGAGATT ATAGTAGTAA ATTGTAATTA AAGGATATGA 17581 TGCACGTGAA ATCACTTTGC AATCATCAAT AGCTTCATAA ATGTTAATTT TGTATCCTAA 17641 TAGTAATGCT AATATTTTCC TAACATCTGT CATGTCTTTG TGTTCAGGGT AAAAAACTTG 17701 TTGCTGCAAG TCAAGCTGCC TTAGGCTTAT AACATCACAT TTAAAAGCAT CTCAGGTAAC 17761 TATATTTTGA ATTTTTTAAA AAAGTAACTA TAATAGTTAT TATTAAAATA GCAAAGATTG 17821 ACCATTTCCA AGAGCCATAT AGACCAGCAC CGACCACTAT TCTAAACTAT TTATGTATGT 17881 AAATATTAGC TTTTAAAATT CTCAAAATAG TTGCTGAGTT GGGAACCACT ATTATTTCTA 17941 TTTTGTAGAT GAGAAAATGA AGATAAACAT CAAAGCATAG ATTAAGTAAT TTTCCAAAGG 18001 GTCAAAATTC AAAATTGAAA CCAAGGTTTC AGTGTTGCCC ATTGTCCTGT TCTGACTTAT 18061 ATGATGCGGT ACACAGAGCC ATCCAAGTAA GTGATGGCTC AGCAGTGGAA TACTCTGGGA 18121 ATTAGGCTGA ACCACATGAA AGAGTGCTTT ATAGGGCAAA AACAGTTGAA TATCAGTGAT 18181 TTCACATGGT TCAACCTAAT AGTTCAACTC ATCCTTTCCA TTGGAGAATA TGATGGATCT 18241 ACCTTCTGTG AACTTTATAG TGAAGAATCT GCTATTACAT TTCCAATTTG TCAACATGCT 18301 GAGCTTTAAT AGGACTTATC TTCTTATGAC AACATTTATT GGTGTGTCCC CTTGCCTAGC 18361 CCAACAGAAG AATTCAGCAG CCGTAAGTCT AGGACAGGCT TAAATTGTTT TCACTGGTGT 18421 AAATTGCAGA AAGATGATCT AAGTAATTTG GCATTTATTT TAATAGGTTT GAAAAACACA 18481 TGCCATTTTA CAAATAAGAC TTATATTTGT CCTTTTGTTT TTCAGCCTAC CATGAGAATA 18541 AGAGAAAGAA AATGAAGATC AAAAGCTTAT TCATCTGTTT TTCTTTTTCG TTGGTGTAAA 18601 GCCAACACCC TGTCTAAAAA ACATAAATTT CTTTAATCAT TTTGCCTCTT TTCTCTGTGC 18661 TTCAATTAAT AAAAAATGGA AAGAATCTAA TAGAGTGGTA CAGCACTGTT ATTTTTCAAA 18721 GATGTGTTGC TATCCTGAAA ATTCTGTAGG TTCTGTGGAA GTTCCAGTGT TCTCTCTTAT 18781 TCCACTTCGG TAGAGGATTT CTAGTTTCTT GTGGGCTAAT TAAATAAATC ATTAATACTC 18841 TTCTAAGTTA TGGATTATAA ACATTCAAAA TAATATTTTG ACATTATGAT AATTCTGAAT 18901 AAAAGAACAA AAACCATGGT ATAGGTAAGG AATATAAAAC ATGGCTTTTA CCTTAGAAAA 18961 AACAATTCTA AAATTCATAT GGAATCAAAA AAGAGCCTGC AG // LOCUS HUMALDA1 3586 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human aldolase A gene 5' non-coding exons. ACCESSION X06351 KEYWORDS aldolase; aldolase A. SEGMENT 1 of 2 SOURCE human (Homo sapiens). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3586; enum. 1 to 3586) AUTHORS Maire,P., Gautron,S., Hakim,V., Gregori,C., Mennecier,F. and Kahn,A. TITLE Characterization of three optional promoters in the 5' region of the human aldolase A gene JOURNAL J. Mol. Biol. 197, 425-438 (1987) STANDARD simple automatic COMMENT *source: library=lambda EMBL4; clone=lambda6A; Three types of put. 5' non-coding exons are present. GC boxes and their inverted complement are observed. See also x06352 EMBL features not translated to GenBank features: 1527 1531 put. TATA-box 2076 2081 put. TATA-box 3148 3152 put. TATA-box 3089 3095 put. TATA-box FEATURES from to/span description pre-msg 1550 > 3586 aldolase A mRNA and introns /nomgen="ALDOA" /map="16q22-q24" /hgml_locus_uid="LA0129H" IVS 1614 1788 aldolase A intron A IVS 1871 2103 aldolase A intron B (no splice consensus) IVS 2149 3113 aldolase A intron C (no splice consensus) IVS 3286 > 3586 aldolase A intron D BASE COUNT 729 A 1109 C 1054 G 694 T ORIGIN 1 GAATTCAAGA CCAGCCTGGT CAACATAGCG AAACCCTGTC TCTACTAAAA ATACAAAATT 61 AGCCAAGTGT GGTGGCACAT GCCTGTAATC CCACCTACTT GGGAGGCTGA GGCAGCGACG 121 AACCGCTTGA ATCCAGAGGT GGAAGTTGCA GTGAGTGGAG ATCACGCCAT TGTGCTCCAG 181 CCTGGGCAAC AGGAGTGAAA CTCCATCTCG GAAAAAAAAA AAAAAAAATG GCCGGGCACA 241 GTTGCTCACT CCTTTAATCT TAGCACTTTG GGAGGCTAGG CAGGTGGATC ATCTGAGGCC 301 AGGAGTTCCG AGACCAGCCT GACCAACATG GTGAAACCCC CATCTCTACT ACAAATACAA 361 AAATTAGCTG GGTGTGGTGG CTCACGCCTG TAGTTCAGCT ACTCAGAAGG CTGAGGCAGA 421 ACTACTTGAA CCCAGGAGGT GGAGGTTGCA GTGAGCCAAG ATCGCACCAT TGCACTCCAC 481 CCTGGGCAAC AGAGTGAGAC CCTGTCTCAA AAAAAAAAAA AAAATTAAAA ATAGGTAAAT 541 TGCCATTTTC AAAGATACCT GTAAAGATTT TTAAAATCAT TCATGAATGC GGTCTGTCGT 601 GCACAGAGTA GAATGCTCAA AAATGGTGAA TGAGACCCTC TATTTTGGTC TCATGCTGAA 661 GAAGTCCATA AAGCCCACAA GTCATTTTCA TGATGGACAG AAAAATGTGT GTGCTTTCTC 721 TGTCTACTGC CTCAACTGCA CAGACCCCGG GACTGTAGCA GAACCATCTT TTGAGCTTGA 781 CACCGGGAGG CCCAAATTCT AGCACGGGAC CCAGGGCCAG TTGCTCTCTG GTCCTCAGTC 841 TCCTCACCCA TAAAATGGGA AGGAGAGAAC CCTGAATCAT TGCTTCTAGC TTCTGAACTC 901 AGTTGTTCAG AACAAGGACT CACTGCTGAT TTTTCAACAG CACAGGGAAT TGCACTGTTC 961 CTGGGAATGA TGGACAGTAC CCTCTGTTCC ACTGGGCAAG TGAGATTTCC CAGGCCTCTC 1021 TGTTCCCTTC TCCCCTTAGA GAGCAACAGA CGTGGCCCCA TCCCACCTCC CTCAACCCTC 1081 TCTCTCCTCC CTCAGGACTG GGCACCTCGC TGCCCCCGCT GCTGCCCACT CTGCGACTGT 1141 GCCTGTACGT GCCAGCTCCC CGACTGCCAG AGCCTCAACT GTCTCTGCTT CGAGATCAAG 1201 CTCCGATGAG GACCCAGGGC CCCTGCCTCT GGGGAGCGGC CAGCCCCCAG GGCCCATGTG 1261 CCCTCCTCCC TGAAGAGCCT TTCCCCACGC CACTGGAACC ACAGATGGCC TGCCGAGCAC 1321 CCAGGCCTGG GAACTGGAAG TGGCAGCGCA GGGCCTGGCT CCTGCAGGGC AGGACTCTTG 1381 GCCGGCTGGA CGGCAGCTCC TCTGGAGGGC CAGAAAAGAG AGGGGCTAGT GCTCGGGCAG 1441 GTGCCCTGGC TTCCCTTCCC CTCCACACGT CAACGATTCT ATTTGAAGTT GGGCAGGGGG 1501 GTGGCGCTGC TCACCACACA CAAGTGTTAT AGGAGGAGTC TGGCCCTTGA GTACCGGGTA 1561 CGCAGGGGTG CCTCAACCAC ACTCCGTCCA CGGACTCTCC GTTATTTTAG GAGGTGAGTG 1621 TAGTGCCAGT ATCTACTCTC CTTCTTAAAA AAAACCAGGG CTCCAGAGAA TCAGAACAGC 1681 CACCATCACC GCAGGGAGTC AAGGGAGGAG GGAGATTAGA GAAGGAGCCA GGGAGGGTGG 1741 CAGGGAGGCC ACGTGATCCG AGTCCCCTCA CCCCTTTCCT TCCCACAGGT CCCTGGCCAA 1801 AGATTTATTT CTCTTGACAA CCAAGGGCCT CCGTCTGGAT TTCCAAGGAA GAATTTCCTC 1861 TGAAGCACCG GTGAGTGGGC AGGGGCTCCC TCCCCATCAA TAGGGCCGAC CCAAGTCTTC 1921 CTCCCCCTTC CCCCATGCCG GGCCCCACGA TAGTGTGAAT GTCAGGGGCT TCAGGTTTCC 1981 CTAAATATAG GTCCCTGCCA GAGGATCCGT GGCGGGAAAA GGGCAGGGGT CATTAGAGAA 2041 GATCGGGGAC ACATGTGGGG CGGGCAGGAG CTGCCTTATA ACCAGCCCGG GAACCCCTAG 2101 CTCACTCGCT GCTGACCAGG CTCTGCGGCT CCTTCGGCCT CGCCGCAGGT GGGCCCCTTG 2161 CAGGACCGGG CCGGGGTGGG GATGGGGTTG GGGTTGGGCC AACAGGGTCC AGATGGGGTC 2221 CAGGTGAGGA GGGGAGATTT GGACGATAGG AGCAGGGGGC TCAGCATCTG GGAGGCAGAT 2281 CAGTTCGGGG ACGGATTTTC TTTTGGAGAA GGAAGTCAGG CTCAAGGAAG ACGTTTGGCA 2341 GGAACTGTGA CCCCGCATGC CCAGAGGCCG AGCAGCGCCG TGCATAGCCG CGCATTCTGG 2401 TTTTCTGTGG CGCAGAGGAC TACCAGCCTG GCTGCGGCGG CCCGGCGGAG AGCGCGCACG 2461 CATGCGCGAC CCAGCCCGGC CTGCCAGCCT GGAACTCGGA TGGGGAGGTC TCGCCTCCGC 2521 GCGCCGCTAG TTCCGCCGCC TTCTCGCGCT CCCTGCGTGT CCTCTCGATG CCCTTTCCTC 2581 CGCCTCCCTT ACGCCTGGGC CAGTGACAGG GTGTCGGCTC CGCGCCGATT CAGCCCGCGG 2641 GCGAGGCAGG CTAACGCACG ACTGCGCGAT GTGGCCCCTA TGGTGACACG CGCTGCAGCC 2701 GCGAAGACCG GAAGCTGGGG CCCCGGGCGC GCGCGCTGGG CCTGGGAGGC GAAACTCAGC 2761 TTCCTTCGTT TCCGACTTTT CCATCCGCGT CCTCCACTTC CCCGTTCCGC CCTCCCCCAT 2821 TGCCAACATT CTGGCTGAGT CACGGCGCCC CAGAGCGCGC CAGGCTGGGG GAAAGGAGCA 2881 GAAGGGAGGG CCCTAGCGAC CCGCGGGATG TGGTCCGAGT CACGTCCGAG GGGGGTGGGG 2941 AGGGATCGTG TTCTCGGCGC CCGCCCCTTC CTAGCGCGGC CTCTGGGCTG GCCTCTCGGG 3001 GGCGGCCCGT AGCCCAGTCC GTCGCCTGCC ATTGGACGCC GCCCGCTCCT CGTAAAGGAA 3061 AAAGCTCGGC GGAGGGCGGA GTGGTGCCTT TAAAAGGCCG GGCGCCGCCT TCCGCCTGCC 3121 CGCCTCCTGC GCCGCCCCTT CCGAGGCTAA ATCGGCTGCG TTCCTCTCGG AACGCGCCGC 3181 AGAAGGGGTC CTGGTGACGA GTCCCGCGTT CTCTCCTTGA ATCCACTCGC CAGCCCGCCG 3241 CCCTCTGCCG CCGCACCCTG CACACCCGCC CCTCTCCTGT GCCAGGTGAG CGCCCCTCTT 3301 CACGTGCGGG GACCAGGGAC CGTGGAGAGG GATCTTGGGG GCAGTGGCGG GTTGGGCGTC 3361 CGCGTGGAGG CCTCCCCCAT CCCATGCCAG CGTCTCCCCA CTACCAGGCA CACACAGGCT 3421 CCCCGGCCCC TCCAGCCTGA GGTCCTCTAA CTGCGCAATG CAGCGGCTGC GCGCGCTGAG 3481 TCATGGCGGG GGAGGAAGCC GGACGAGATG AAGGACCATT CTCCCCCTTT TCTTGCAGGG 3541 ACCCCTGTGG CAAAGGATTA GGGCCCCTTA GCCCTGGCGG GGATCC // LOCUS HUMALDA2 547 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human aldolase A gene, exons 5 and 6 (first two coding exons). ACCESSION X06352 KEYWORDS aldolase; aldolase A. SEGMENT 2 of 2 SOURCE human (Homo sapiens). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 547; enum. -99 to 435, no zero) AUTHORS Maire,P., Gautron,S., Hakim,V., Gregori,C., Mennecier,F. and Kahn,A. TITLE Characterization of three optional promoters in the 5' region of the human aldolase A gene JOURNAL J. Mol. Biol. 197, 425-438 (1987) STANDARD simple automatic COMMENT *source: library=lambda EMBL4; clone=lambda 6A; see also x06351 FEATURES from to/span description pept 134 245 aldolase A, exon 5 (first coding exon) /nomgen="ALDOA" /map="16q22-q24" /hgml_locus_uid="LA0129H" 329 / 540 aldolase A, exon 6 pre-msg < 1 > 547 ald-A mRNA and introns IVS < 1 112 intron D IVS 246 328 intron E IVS 541 > 547 intron F BASE COUNT 121 A 157 C 165 G 104 T ORIGIN 1 GGCAGGGGAG GTAGGGAACA TTTCCCTGAC CTCCAGGAGA GGGGCCCTGC TCATCGGGAG 61 ATGATGGGAA ACCCTAGCTA ACTAGTCCTT CCCCTCTGTT TCCTGTATCC AGGAACTTGC 121 TACTACCAGC ACCATGCCCT ACCAATATCC AGCACTGACC CCGGAGCAGA AGAAGGAGCT 181 GTCTGACATC GCTCACCGCA TCGTGGCACC TGGCAAGGGC ATCCTGGCTG CAGATGAGTC 241 CACTGGTGCG GGCAGGAGAC AGAATGGGTG GAGGGTGCAG GGTTGGGAGT GGCAGGCTGA 301 TCCCCTAATT CCCATGTGAC ACTCCCAGGG AGCATTGCCA AGCGGCTGCA GTCCATTGGC 361 ACCGAGAACA CCGAGGAGAA CCGGCGCTTC TACCGCCAGC TGCTGCTGAC AGCTGACGAC 421 CGCGTGAACC CCTGCATTGG GGGTGTCATC CTCTTCCATG AGACACTCTA CCAGAAGGCG 481 GATGATGGGC GTCCCTTCCC CCAAGTTATC AAATCCAAGG GCGGTGTTGT GGGCATCAAG 541 GTAAGGG // LOCUS HUMALDB1 10239 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human aldolase B (ALDOB) gene, exons 2 through 6. ACCESSION M15657 KEYWORDS aldolase; aldolase B; fructose bisphosphate. SEGMENT 1 of 2 SOURCE Human DNA, clones pHL[210,413], pKBg/413, PP201c, pTP201. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 10239) AUTHORS Tolan,D.R. and Penhoet,E.E. TITLE Characterization of the human aldolase B gene JOURNAL Mol. Biol. Med. 3, 245-264 (1986) STANDARD simple staff_review FEATURES from to/span description pept 5817 5928 aldolase B, exon 2 (EC 4.1.2.13) /nomgen="ALDOB" /map="9q21.3-q22.2" /hgml_locus_uid="LL0050R" 6738 6949 aldolase B, exon 3 8181 8235 aldolase B, exon 4 9072 9232 aldolase B, exon 5 10074 + 10157 aldolase B, exon 6 pre-msg 924 > 10239 aldoB mRNA + introns IVS 996 5806 aldoB, intron A IVS 5929 6737 aldoB, intron B IVS 6950 8180 aldoB, intron C IVS 8236 9071 aldoB, intron D IVS 9233 10073 aldoB, intron E IVS 10158 > 10239 aldoB, intron F BASE COUNT 3065 A 2130 C 2119 G 2925 T ORIGIN 5 bp upstream of PstI site; chromosome 9q21.3-q22.2. 1 CTGCAGAATG AACCCATACC CAGACAGTAC CATATTTAGG GGTGTTGATG GCTTAGTTCA 61 ATCACTTTAA TTCTAGTTTT TAACCAAATT TGAAGACAAA AGTACAAAGG TTAAAAGAGA 121 ATGGGAATAC AAAACAATGA TTTCTGGCAT TAAATACATC AGAAAGCCCA GCTAGGTTTT 181 GGAGGACTGG AGAAGATGGG AGAGACAATG ATTTTTAAAA ACAGAATGTT TTGAATTTTA 241 TTTCCAATTT GGACAAAATT CACACATGGG AAATTCAAGG ATTCAGAATC AATTTCTTTT 301 ATTGAAAGAT TATTTAAATT TAACTATAAA TTATAGAATA AATTGTTGGA ATTGACTACT 361 AATGATATAC TGAAGATAAA TAATGAAAAT ATAGCATTGA TTACCTAATT TGGAAATAAT 421 AATTCTTGAA TGCCTACTGC GTACAGACAC TATACAACTC ACTTAATGCT CCTGTGAAAT 481 TGAGATCCCA CTTTACAGAG AAGAAAATAA CGGCTCAGAC AAGTAAAAAA CCAAAACCAA 541 AAACCCGTCA CCAAGTTCAT GTGTTTTGTA AGTGTTACAG CTACACTAAC ATTCTTCCTA 601 GTATCATTTA ATGGACCCTA GAGCTCCATC ATTGACTAGC TGTATGACTT AAGGGAACCT 661 CCTTATCTTC CTTGCCTTAG TTTCCTCATC TGTGAAATGG AGGGTCTGGA TTAGGAACTC 721 CACGGTCTAA AGACATTCCT TGCAGCTTTA ACATTCTATG AGCATAAGGA GTATATGGGC 781 ATGATTTTAA GGACTGGTTG TTTATGAGCC AATCAGAGGT GTTGAATAAA CACCTCCCTA 841 CTAGGTCAAG GTAGAAAGGG GAGGGCAAAT ATTGGAAAAA AAAAACATGA TGAGAAGTCT 901 ATAAAAATTG TGTGCTACCA AAGATCTGTC TTATTTGGCA GCTGCTGCCT CACCCACAGC 961 TTTTGATATC TAGGAGGACT CTTCTCTCCC AAACTGTAAG TAAAAAAGCT TCCTTTGTAT 1021 ACCTGGACAG CACCAGTGAA TAAGGGGTTA ATAATACTAC TAAGCTTATA GAAGGGAAGA 1081 TTGGTTTGCC TTTGTTTTCG GGAAAAACTT GATGGTGGCT GAAAACATGA TGGCTCTGCT 1141 TGGAGGGAAA TGCTTGTTTT CCTCTCTCTA AGTATAAATG CTCATTCACA CCTTTCCTGA 1201 CTTTTTGAAA GGTGGCAAAA TGCCCAAGAT TGATTGCCAG TTACTTGTGT TACCTGAATG 1261 AGGCCCTTGT GAGTCTCTGA TCTAGGAAAA ACAACTTGAA CACTCCTATC GTTAGGTTCT 1321 AATCAGTTTT TGAGATTCCT GTTCATTTCA ACTCTCAACT CTCTTTTCAT AACAACTACT 1381 TATTTTAGTG ATTTATTTAC ATATCTACTG CCTTATGTCA TTTTCTACAT TCTCTTCCCA 1441 CACCTACTGG TACCGACACT GAAGTTTCTT TGTCACAGAA ACAAACCCTT GGAGTATTTT 1501 GCCCTTCGCC TTCACCCACT GCTCAAATGA GGAAATGCAA CAGGGTTGGA AACTTCACAG 1561 AGCATAGTCA TCATTTGATT ATGAATTTCC AAACCCTTCA TGTTTGACAC TTTCTAGAAC 1621 ATTTCCTGCT GGCCCCAGAC ACTGTGAAGA CCCTACAGCT TAAGATAGTA GGAATCAAAA 1681 GGTTGTGTAT TTTGATACAT TTGCTCTACT ATCTGAAAAA TAGGCATTGT GAGAATATGT 1741 TGCCTTTTTA AAAAATAACT TCTCTTTAAA TAGTTACCTA GGGCTCCATT AGACAATTTG 1801 GATGCACCAG AGCTTGAACT CTGCTACCCT GCTATTCATC ACTTTAGCTA AGGCACATCT 1861 GGGTAGCACC CAGGCCCAGG TTAGAGCCAG AGATACTGGG GCAGGGAATG GGGAAATGTT 1921 GAATATCTAA GAGGTCCTTT GGCCTTCATT CCAAGTTTAG AGGAAAGAGC ACCACTCTGG 1981 ACTCGTTCTG GGCTGCTCCG CGTGTGTTAA AGGGCACTGG AAATCAGGCT GTCCTAGGAC 2041 TGCCCTAGAT CATCCCACTG CACTTGTCAC TGGGACATTG GGGTGGATAA TGAAGCTATT 2101 GGCAGCTTTA AGCATCATGT ACAAAAGTGC TGCATGCAGT GCCTAGAAAT CATGGGCTAT 2161 CAATAAATAA GCCTCCTCAT ATTTGCCTTA AAAATCCATT TGTCTTGTAC ATAGAGAAGC 2221 AAATGAACTT TCTGCTGCCT TCTATTTCTC CAGGAATTAA ATATCCATGT TACAAATAAT 2281 TATGTTTTCA CAGAACATCT TCAAGGAAGG CAGAAAGAAC CATACCATAG GATTCTACCT 2341 GCTATTTTGC TGAAGGATTA GAAATAGAAT TCTAAGTCAT AAGCCAATAG GATGTAACTT 2401 GCAATCCTAG AACATCACAA CTTAGAAGGC ACATCAGACA TAGTCTCTCT GGCTTTAGTT 2461 TCCTCATCTC TAAAGTGACA GGGTTGACGC AGGTATCTCT GAGGTCATTT GTAGCTCTGT 2521 TTTTCTGTGG TTACCTTCTG TTGTTAATAC AACTTTCCTC CAGATTTATA TACATATTAC 2581 TTGTTACACA TATGTAATAA AGTGCGTGAT AAAGAATGTA TAAAATGAAA TGTGGCCGGG 2641 TGCAGTGGCT CATGCCTGTA ATCCCAGCAC TTTGCGAGGC CAAGGTGGGA GGATCGCGTG 2701 AACTCAGGAG TTCAAGACAA GCCTGGGCAA CATAGCAAGA CTTTGTTTAT ACCAAAAATA 2761 AAAAAAAAAA TAGCTGGGCA TGGTGGCAGG CGCCTGTAGT TTCAGCTGCT TGGTGTCTGA 2821 GGTGGGAAGA TCACTTGAGC CCAGGAGGAC GAGGCCTTCA GTGAGTCATG ATTGTGCCAG 2881 TGCATCCAGG CTAGGCAACA GAGCAAGACT CTGTCTCAGA AAATAAAAAA GAAAGAAAGA 2941 AAAAGAAAAG AAATGTGAGA ATCTCTCCAC CCTACCCCAT TCTTCAGAGG TAACTCCAGA 3001 GTTGTTACTC TTTGTTTATA TAGACACAAA AACAGTATGG GTATATATCA TTTTTTAATT 3061 AAAAAATAGG AGGACAGTAT ATATTGCATA CTTCTCTGAA ACTTGTTTTT CATTTAATAA 3121 TATAATGTGA ACAACTTCCA TCACACTGAA AAAAGTTGGC TTGAGATCTG AGAGAGCAGA 3181 GGCTTAGGAG TACGGAAATC GTTCTTTGTC ATTACCCATC CCAGGTTGTC CTCCTGTCTC 3241 CTTGTGGTGA ACATTGGCCT GTGACCCTGT TTTATGATTA ACAGAGAGGT GACATTTGTG 3301 CTAACGTGGT ACAGACCTTT AATCCCACTG TAGAAGTTTA ACTCCCTGTA AACAGTATTA 3361 GTTTGAATTA TTATTTTCAC TGTTATTCTG GTTGAGTCAG CATACCCAGA TTGAAGAAAA 3421 TAGTTTAGTA ATAGGCAGTA AACAGAAAAA TGGAGGTCAC TTAGGTGATT TTTTTCATTA 3481 AAAAAAGAGA ATTAGTAATA CATTGCATTT TATAGTGCTT TGTAAAGAAT CTTCATGTTT 3541 ATCATGCAAT TTAACAAATA TATTTTGAGC AACTACAATG AGCAGGGTGC TCATAGATGG 3601 CATTTGAGTT ACCTGGCAAA GAGACAACCA GAACAGTTAT TATCACCATT TAATCCATAG 3661 TAAACTGAGA CTCAGGAAGC TGCCTGACTT GCCTGAGGTT CCAGCTAGAT TAGGCAGAGC 3721 TGGTTCTTAA AACTAGGTCT GTCTGATCTG CTAGCAGCAA ACCTCTACAT AAATGTGTCA 3781 GATGTTTATG AATAAGTAAT GTCCATGGTG TAATCACCCA CAGCGTAAGG TACTCTCGGG 3841 GTGTCATCAA GGACATTAGG AGTCATCCAA AGGCCATTTT TTTCATCAAG CTCACTCTTT 3901 CCCTTGAATG GAAAGGAGTA CTGCCCCTTA TCTCAACTTG ATGCCTTATT AGCTATTTGT 3961 CCTTGGATAA ATAATTTACA CTCTCTTATA TTCTTTCCTA ATCTATTAAA TGGGAGGGGA 4021 AAAGAGCCCA ACCACCTTCT GGGATTATTT TCAAGAAAAA TTATATAGTG CATGTAAAGC 4081 CATTAGCAGA GTGCTTGACA ATACATGATC ACTTGTTGTT ATTATTATTA TTGGGCTGAT 4141 TTTCCCTTTT GTTTCATTAA GAAAATATCT TTCTTTCTTG GTAATTGTCA ACTCCTTTGT 4201 ACAAATGCCT AAACTGAGAG AGAAAGTTTG CACCTTGCTC TGAAGATCAC ACCCAGGTTG 4261 GCTTTTTTCA GCATGATGAT CTGGAGGCCA GGTCCAAAGC TAAAAGTCTT TGACTCCAAT 4321 ATTACAGCTA ATATATAAAC AGGGCAAACT TGCTCTGCTG GTGGCTGGGA CATAATGGGC 4381 CTTAGAAGGT CATCTTCTCT AGAAATTGTT GCCCACTAGC TTGTCCTATT ACCCCAGTAA 4441 CAGCATTGCA GTTTCGTTCT TTTGGGGTAA GACAAGAACC TCTGTTGAAA TGGAGAGGCA 4501 ACAATGAGAT CATCAGTTAT ACCTTCTCAT GAATTAGCTC AGTGGCTACT GACTTTTTAT 4561 ATCAAGGGAC AGAGTACTGA CAGCCTACCC TGAAGGATGC AGAAAATGTG GAGAAATATA 4621 TTTGTCATGA GGACAGGACT GGATAGGGAC TCTGCTTCAT CCTAGCTGGG ACCTTGAACA 4681 AGTCAGTTAG CTTCTCTACG TCTTAGCTTG CTCATTAATT AATGAAGATA ATACCTGCTA 4741 TAATTACCTC ATTGCCAATG GATCACATGA GACCTGAGGT GTCCCTGGGC TTCCGGGAAG 4801 TAATATGTAG ACATGGAGCA TCCACCCTGT GCTGAGTGTA TTGGAGGGAG AGAAGATTAG 4861 AGAAAGGAGG AGACCCTGGC ATTGGATAGG AAGCTACCCA AGATAATCTT CATTACCATT 4921 TTTTCTTTGG TAGCCCAACC TATGCAAAAG CACTCTTTGT AACACCCAAA GTGAGGGCTC 4981 TTTGAGCTTG CATTTTCAAG ATTAACCCAA AGTGGCATCC GAACAGGGGC AACGATCTCC 5041 TTCAGAGCAC AACCACTGTG GAGGCTGCTT GGTTAGGGGG TCTTCAAGCC AGAGTCTCCC 5101 CTTGCTAAAG CAGTGAAGCT GGAGCAGGGA CTGCCATACA ACTTTAGCAT CCTCATAGGC 5161 AGTGAGAAAC CCAAGCCCAG AGGCACTTAG GAAACATCCT CTCTGGTCTG CCCCAAGTGG 5221 TTTCATAGAG GGTTGAATAT AAGGAAAAGG AAACTTGCAC CCAACCCTCT GTAGCCCGAG 5281 GCCAAATGGA AGTTAGCTTT AAGACTATGT ATAGATTTCT CCTTTCAAGT TTGTTTCCTG 5341 GAGAATAATC TGTTGGAACT ACAGACTAAC AGAGCAAAGC AGAGGATAAC TGCAGAGGTC 5401 AAGAGCGACA TTGTCCTGGC CTCAAGCCAT TAAGGAAATC CAGTGGCACC AGGAGTCCAC 5461 AGAATAGAGA GACAGTCCAT TACTGCAGTC CCCTCTATTA GTCATATTTT AGTGGGTCAC 5521 TAATTAGCTG GTTCAGTGTT TTGATTAGTG ATTCCAAAGG CCTGCTGGAA AGCATTCTGT 5581 CTAGGATCAC CATGAACATT CAGTAACCCA CCTTCATTCA TTGCAGCTAA TTGCAACTAA 5641 TAAGTGATGG GGGAAAAACA AACCTCATTT GCTTATCCAC CTGTGGAAAT GGGTCAGAGG 5701 TGTCACAGAG GTCTGCACAG CATGTCTGCA CCCCCAACTG TCTTGGTCAT GCATCCATCT 5761 GAGCCACCCA TGGTTCTGTG CACAACTCTC GCACACTCTT TTCCAGACCT GTCACCATGG 5821 CCCACCGATT TCCAGCCCTC ACCCAGGAGC AGAAGAAGGA GCTCTCAGAA ATTGCCCAGA 5881 GCATTGTTGC CAATGGAAAG GGGATCCTGG CTGCAGATGA ATCTGTAGGT GAGTGTAAAA 5941 AGCAGTCTCA TCATATAACA ACATATTATT TTTTCTACAC TTAGCAAAAA CAACTTGATA 6001 ATCATATTCT TTATGGAAGT AAAGGTTTAT GTACACGGGA GTGGGAGCAG AAGACTCAGT 6061 AAGCGTAGAG CTTTTGAGTC CTGTAAGTAG TAGGCACATT CCTGAAAGCC AAGGAAAGAG 6121 GCTCAAGCCC TTCATTTTAA GCCAGGGGTC ATTTGAATCC AGCCTTGTAG TAAGGCCAGA 6181 AACTTAGTAA CTAGCATAAG GTCAGATACA CAGGCATCTG CTACTAGTGA GAAGTCAGTT 6241 TAAAGTCAGT TTAAATTCAG TTGAGTCAAG AAGGGCAGGA AAGTATCTGC ATCTGGAATA 6301 GGACAGGGCT TATGGATGGG AGAACCTCAG GATGGCACAA GGGAGAGCTG TGGCAGAGTT 6361 GCACAGCCTG CCATAAAATC ACAGAACCCT GAGAATCAAG AGCATGCCCC ATCCCTCCTT 6421 CTGCATGTAC ATCCAGCAAG ACTGAGGACC AGATCAGGGA GGCTAGAGAG GCCTGAGGGC 6481 CTTCCATATC CACCTGGACC TGAGGGTGGC TTCCACCCTG ACCTCTATAT GCCCAGCCCA 6541 CATACTCCCT AACTAGCCAC CTGAGAGCAA CCAGGGTCTT CTCCCTGGAA CACAAAAAAT 6601 TGAAGCAGGA GCTTTGACTA TCTGAAAAAA GAAAGTCTTG CTTTCCACTG TGGTGAGGGT 6661 CTCTGAGATG ATGTGGAGAA GGGTGACAGG AAAGCCCTGG CTTGCTCCTT ATGCTGCCCT 6721 TGGCCCTCCC ACCATAGGTA CCATGGGGAA CCGCCTGCAG AGGATCAAGG TGGAAAACAC 6781 TGAAGACAAC CGCCGGCAGT TCCGAGAAAT CCTCTTCTCT GTGGACAGTT CCATCAACCA 6841 GAGCATCGGG GGTGTGATCC TTTTCCACGA GACCCTCTAC CAGAAGGACA GCCAGGGAAA 6901 GCTGTTCAGA AACATCCTCA AGGAAAAGGG GATCGTGGTG GGAATCAAGG TGAACACCTC 6961 CATTCACACC TCACTTCTGC TTTGTCCTAG GCAAACTCCA AGCACACTGG CAAGGACAGG 7021 GCCAACACTC TGAACATTCT CCTCATCAAA TTTCTCTTCT CACCCTTTTC CATCGTCATC 7081 TTCCCACAGA GAACAAGATC CTCTTCTAGA GTTTTCCTCC AGTGCCCTCT TCTTTCACTC 7141 AAATTTTCTA CACGTTCTTA ATCTCTTTGC ATCTCAATCT CATTTCTTAA AATGGGAATC 7201 TACCTCATGA TGTTCTTACA GGGATGAAAT GACTAAAAGT TTCTAGGCTG GACGTGGTTG 7261 CTTACTGCTG TAATCCCAGC ACTTTGGAAG GCAGAGGCAA GAGGATCGCT TGAGCTCAGG 7321 AGTTTGAGAC CAGACTGAGC AACACAGTGG GACCTTGTCT CTACTAAAAA TCAAAAACAT 7381 TAGCCAGATG TGTTGGTGCA TGCCTGTAGT CCCAGCTACT CAGGAGGCTG ACGTGGAAAG 7441 ATTGATCACC TGAGCCCTGG AGGTCGAAGC TGGAGTGAGC TATGGTCATA CCACTGCACT 7501 CCAGCCTGGA TGACCCTGTC TCAAAAAAAA AAAGAGTAAG ATCCTGTCTA AAAAAAAAAA 7561 GAAAGAAAGA AAAAAGAAAA GAAAAAGTTT ATAGTGCGTA ATTATTGTTA ATGGTAACTA 7621 TCATAATTAT TGAAGCATTT ACTATTCTAT CTTATAACTA GGTCTTCCAA TAGTGCATAA 7681 ACTTCTCAAA AGTGAAAACT AGTATCCATT GAATAAGAGT TTTTACCATT CCAAGCATTC 7741 ATTTAATTCT CATAACCTCT GGTAGTAGAC TCAATTATTA CCCTTATTTA GAGATGAAGT 7801 GAGAAAACTG AACTACGAAG AGTAACATAA CTTGCCCACA CAACAAGGCA AGGTCAGAGC 7861 CAGTTTAGGC TAATTCTCAA GCTCATGACT TTTGCCTTTG CTATGTAGAG AGTAGAGGTT 7921 GTATCCTTAA TTCTCTTTAC CTCCTGTACT AACACAACAC CTGGTGCATG CAGGTGCTCA 7981 ATAAATGTTA GGGTAGACAC TCTCACAGTG TACCTTCTAG GTAACTTCCC TTCTGCTATG 8041 GACTGAACTG GGTCCCTCGC ACTAATACAC TTTACTAATT CAACTCCTTG ATCTGTAAAA 8101 GATGCAAACT GTTAGTTAGT TCAAGAGTTC TGCTTGTGTT TTCCTGACAT CTGACACCTG 8161 TTAATTATTT GTATCCACAG TTAGACCAAG GAGGTGCTCC TCTTGCAGGA ACAAACAAAG 8221 AAACCACCAT TCAAGGTAAG GATATTGCCC ACTGAGATGA ATCATGCCAG TGTAAGCTAG 8281 AAATGAAGGC AAGTAAAGAA GGAAGCAAGG AAAAACAAAA CACAACTATT ACACTGGTCT 8341 TAGAGCCACA CGTACCTGGT TTTCAATACC AGCTCCTCCA TTTAACAACT GTGACCTTGG 8401 GCCAGTCACT GAAATATTGT GAGCCTCAGT TCTCCTATCT CAAAATGGGG ATTTCTCAAA 8461 ATGTTGTGGG GATTAACTTA GACAATGTAC ATAAGGTGCT TACCACAGTA CCTTGGACAA 8521 AGAAGATTAA ACTCTGCACC CCGGAAGTAT TATTTTTAGA AAAAAATTAG CAGGCAGTAG 8581 AGTGGAAAGT CCCCACACTT TGAGGTCAGA CACACCTGGG TTTGAATCCT GGTTCTATCA 8641 CATGCTACCT AGATAATTTT GCAAGGTACT TCTGCCCAAG TAGACAAGCA TTAACTCACA 8701 ACATTGGGTT AGCTAAGCAG AATTTGAATC CCAGCTCTGT CACTCTTGAG CAATTCAAAC 8761 TTGGTAAAGT TGCTTAACCA TTAGGAGCTT CAGTTTCCAC CTTTGTAAAA TGAGTATCAT 8821 ATTAACTATT GGTTTCTTCT ATTCTATATA TGGAGTAGAT AGTATGTATG TCAAAACACT 8881 ATTTATTAAA TCTAAGTCTT CTGGTTCAGT GTTGTGATTA ATGCTGATAT GCTTGGCTGT 8941 TTTCAATCCT CAAGCACAGT GGATTGAAAG CTAAGCAAAG GGAGAACTCC TTCCCTTTAT 9001 TAGAAGCCCC ATGGATCAGG TACAAAGGTA CAAAGGTACA AAGAAGCCTT TCTCTCTTTT 9061 GTGACTTGCA GGGCTTGATG GCCTCTCAGA GCGCTGTGCT CAGTACAAGA AAGATGGTGT 9121 TGACTTTGGG AAGTGGCGTG CTGTGCTGAG GATTGCCGAC CAGTGTCCAT CCAGCCTCGC 9181 TATCCAGGAA AACGCCAACG CCCTGGCTCG CTACGCCAGC ATCTGTCAGC AGGTGCTCTG 9241 CCTTCCCCTT GGGCTGAAAA AGAGTAGGCT AGAGTTTTCT TCAGAGCTTT TCTTTTCAAT 9301 TATACTATAA CTACAAATGG ACCTCCTTTT CCCTCACCAG TATATCCTAG TGGCATTTTT 9361 CACAACTTTT GCTATAGCCA ACTGTGGTAG GGAAAGATTT GGTCCCACAA GCCAGTAGGA 9421 GTCCAACTAA GCTATGAGTT CAAAGGATAA TGCCATCAAC TGCTGCCCTC TGTCAAAGCT 9481 CCATAAGCTG TCTTATAAGT GATCGGGCTT TTGAGGGCAG GGAACTTGGT TGTCTATACT 9541 CCGTAACATC CAAGACATGA TCCTCTTAGA TTCCTAGCTG AAGAGTGACA GCTCTGACCG 9601 AGTTCCAATC CAAATAATTC TTTGGAGGAT GGTGACCAGA TGCTAAAGTT TCCAACACAT 9661 ACTAAAATGT CTCTGGGTCC GAGTTAAGTT TGATTTTCTT CTGTGAGTAC TATGTTGGGG 9721 TTACTAGAGT TGTGGACAGC TGGAGATCCC CAATTAGAAC TCTAAACACT GCTTGAATTT 9781 CTTCATTAAC CCAACTTAGA GACATCTTTC TTAATGATTT ATGCTTAGAT GCCCTAAAAA 9841 GGCAGTAGCT AGGTTCTGAG GCAGCTAGAA TAATCTCTAG AAAGAAATAC AGTAATACGG 9901 GTTTGTATTT ATTTTTTATC TCATCTGAAA AATTCTACTG TCTTTAAGTC ATCTATGTAG 9961 TATTTCTCCA TAATGGCAAC AATTTATACC AATTGCAACT GAAGAATCTC CTCCTAAGGA 10021 AAGGATAACA GTGAGCACAG CTTCACTTTT GCTTCCTTGA TTTTCCTTTT TAGAATGGAC 10081 TGGTACCTAT TGTTGAACCA GAGGTAATTC CTGATGGAGA CCATGACCTG GAACACTGCC 10141 AGTATGTTAC TGAGAAGGTA AGTTTTAAAT ATGAAGGCCC CAATTCTAGT TGAAAAATCT 10201 TAGTTTTAGG TAACAGCTGT TACTTAACAT ATAACAGCT // LOCUS HUMALDB2 4656 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human aldolase B (ALDOB) gene, exons 7 through 9. ACCESSION M15656 KEYWORDS aldolase; aldolase B; fructose bisphosphate. SEGMENT 2 of 2 SOURCE Human DNA, clones pHL[210,413], pKBg/413, PP201c, pTP201. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4656) AUTHORS Tolan,D.R. and Penhoet,E.E. TITLE Characterization of the human aldolase B gene JOURNAL Mol. Biol. Med. 3, 245-264 (1986) STANDARD simple staff_review FEATURES from to/span description pept + 197 371 aldolase B protein, exon 7 (EC 4.1.2.13) /nomgen="ALDOB" /map="9q21.3-q22.2" /hgml_locus_uid="LL0050R" 783 982 aldolase B protein, exon 8 3920 4015 aldolase B protein, exon 9 pre-msg < 1 4185 aldoB mRNA + introns (alt.) pre-msg < 1 4448 aldoB mRNA + introns (alt.) IVS < 1 196 aldoB, intron F IVS 372 782 aldoB, intron G IVS 983 3919 aldoB, intron H BASE COUNT 1435 A 935 C 1033 G 1253 T ORIGIN About 500 bp after segment 1; chromosome 9q21.3-q22.2. 1 GGCCAGTTTA GTTGTGCTCA GTGGCTATGG ATTGCAACAC ATTGGTCTAA TGAGGCCTGG 61 AGATTGGTTC AGAAGCTTTT GCCAAGCAAC AAATAAGAAT TCCTTCATCC TGCCTCCCTG 121 CAGTGTAAAT GTGCCAAGGT CAAGTGGCTC TATGACTAGC TTTGAGTTTC ACCTGGTGGG 181 ACCTCTTGTC CTTCAGGTCC TCGCTGCTGT CTACAAGGCC CTGAATGACC ATCATGTTTA 241 CCTGGAGGGC ACCCTGCTAA AGCCCAACAT GGTGACTGCT GGACATGCCT GCACCAAGAA 301 GTATACTCCA GAACAAGTAG CTATGGCCAC CGTAACAGCT CTCCACCGTA CTGTTCCTGC 361 AGCTGTTCCT GGTAAGGCCT TCTTTCTTCT CTAACTCAAG GTCTTAGCCC TCATTCTTTG 421 GAGAGCCACA AGCTTTCTGT TTGTCCAGAA ATCTGCTTCT ATTCATGAAA CAAACCTCTG 481 TTATCTCACA GGCAGCCAGC ACTTCTTCTC CCACTTCAGA ATACCAAGCA TCCCAAGTCT 541 GCCCAAATGC CCACAATCCA TGTATTTCTT CAACATTTCA CTGCCTTCCT TACGTATTTA 601 AAGCACTTAT CTCTAGCCCA ACTAGCTTCA GCAAAAGCAA ACTGATTTCT CAGTTTGCTT 661 TAAGGCAAAT TAAGCAGAGA AAAGACTGAG ACATTACTTT TCTGGTAGGT TCATTGCTTG 721 CTTTCTCAAG CAGGGTATAT AAGGTGGGAC TAATAGAGTG AAATGGCCTT CTCTCCTACC 781 AGGCATCTGC TTTTTGTCTG GTGGCATGAG TGAAGAGGAT GCCACTCTCA ACCTCAATGC 841 TATCAACCTT TGCCCTCTAC CAAAGCCCTG GAAACTAAGT TTCTCTTATG GACGGGCCCT 901 GCAGGCCAGT GCACTGGCTG CCTGGGGTGG CAAGGCTGCA AACAAGGAGG CAACCCAGGA 961 GGCTTTTATG AAGCGGGCCA TGGTAAGATG CTGCCACCTC TTATCTACTT GATGATGTTC 1021 ACATTTGGGG CTTGACTTTC CAACACGGAG AAGCATTGTT TTCTTCGGGC CAAGAAGGTA 1081 TCTACCAATA GTGTCTATTA GGCATTTGAA AATGTGGCAG TAGAGGTCAG TATGAGGATT 1141 GAGGCTAGAG AGAATTTTGA ATTATCCACA CTAGAGTATA GTTGATGACA GGAGACTGAA 1201 GGACAATTTC AACAGAAACA CTTCAGAGGG AAGGAGATAA GAGGACCAAG AACTGAACCT 1261 AGGATACTTT TTATTTTGGG AAGATGAAGA AGAGAAGCAA CCAAGAAAAG CAGCGATTAT 1321 ATTGTAGGGT GAACCAAGCA TTCTGGGAAT AAAGACAGAA GAATTTCTTT CTTTCATGTT 1381 TATTTGTTTG TTTGTTTGTT TGTTTTGAGA TGGAGTGTTA CTCTGTCACC CAGGCTGGAG 1441 TACAGTGGCA CAATCTCGGC TCACTGCAAC CTCCACCTCC TGGATTCAAG CAATTCTTCT 1501 GCCTCAGCCT CCCAAGTAGC TGGGACTAGA GGCATGCGCC ACCATACCTG GCTAATTTCT 1561 GTATATTTTA GTAAAGATGG GGTTTCACCA TGTTGGCCAG GCTGGTCTCA AACTCCTGAC 1621 CTCAGGTGAT CTGCCCACCT CAACCTCCCA AAGTGCTGAG ATTACAGACA TGAGCCACCA 1681 CACCAGCCGG GTAAACAATT TCAAAGGGCC AATAATAACA TGCTATAGAG AGATTCAAAA 1741 GAAAGGCCAA GAAAAGACCC TGAGATTGGC ATTTAGGAGC ATACCAGTGA CCCTCACTGA 1801 AACAGTTTCA ATAAAGAGGA AAGGAAAGGT GCCCACTCAA GAAGTTTTTC AGAAAAAGAC 1861 CCAGAAAGGT TGCCAGGAGA AATGAGGACC AACAAAGTTT TTATTCAAAT AATAACCAAA 1921 ATTTTTAAAA TACGCTTTAT GAGCCAAGAA CTGGGGTGAA CAAATGAGAC ATAGCCATGT 1981 CCTCCAGGAA GCTTGGAGTT TAGCCAAATA GGAGAGAGCT TGGCATGTTT AAATGTAAAG 2041 AAAGAGGAGC CAGAGGAGAC AGTAAAGATG AGGAGAGTGG GGATAATTGA GTGGAGAAGA 2101 ACTTAGAGGA GCAGGACTGC ATGAGACTCA ATATTAAGCA TAGGTGGCAC AGCTGGTGTA 2161 AGACAAAAAG GAGGGAAGAG GCAGAAAAGA GAATACAGTT GGAATCAGCT CTATATAAGT 2221 CAAGAGATGT AGCAGTTGAT GACTGATTAC AGTTGAGGGT ATCTGCTGAG AATGATGAAG 2281 AAATGGAAAT TTCAATACTG ACAGTCGGAA AGCGTAGAAA AGCAGTTAAA GGAACTTGCT 2341 AGGAAATTGA TCGGACGTTA TGGCCAGACA TAGCCAAGCA TCAGGTAATG TGCAGCTGAA 2401 ATAAGAGTGA GTGTGTGGTA GCTTCATTCC TGAGTTTTCC TAACAGCAGT CTAAAGCATG 2461 CAGGAAGATT CAGAACGCAA AATAGCGGGG TAGAAGAGTG AAAGTTTAAT AGGAAGATAG 2521 AAGAGTGACA AATTTTACCG ATAGCCTCTG TTAGCTTGCT GACCATAGAG GAAGCTAGGC 2581 AAGGAATCAA GAGTAGTCAG AAGTGTGGTG ATGAGGCCCT AAAGGGAAGG AAGGCCCCAG 2641 GGAAGCTAAA TCATCATCTC ACAGGCATAA GAGAGGTATC AGATTTTCAA AGTAGTGCTT 2701 TACCTTGACT GTGGAAAGCA GGGTGCAGCA GTGCTGCTGT GTGGCTGATG GACTGAGGAG 2761 ACCTAGCTTA CATGATGGAG GGATTGTGAA AGCGCAGGTC ATCAGGGACA TCCACACAGA 2821 AGAAGGGAAG CTGAGTTTAT CATCAAGAAT GGTGCTTCCC AAGCTCCAGT GTGGATCTGA 2881 ATCACCTGAA GGTGATTTTG GTTTTGTTGT TATGATCTTT TATATTTACA CACAATAATT 2941 GTATAGATCT GTGGGGTATC TGTGATTTTT TATATATGCA TATAATGTAT AATGATCAAA 3001 TCAGGATAAT TAGGGTATTC ACTCCAAATA TTTATTATTT CTTTGGGTTT GGAACATTCC 3061 AAATCTTCTC TTCCAGCTAT TTTAAAATAT GTAATAAATC ATTGTTAACT ATAGTCATCT 3121 TACTGTTGTA TCAAACACTA GAACTTATTC CTTCTGCCTA ACTGTATTTT TGTACCCATT 3181 ACCAACCTCT TTTCATTTCC CCTACCCCCA CTTGAAGCGC TTGTTAAAAA TGCAGATTAT 3241 GGGTCTCACC CACTGAGTTT CTAATTCAGT AGGTCAGGGG TAAGGCCAGG GAATTTCTAT 3301 TTTTAACATG TTCCTAAGGG ATGCTGGTAG ACCATGAAAC ACAGTTGGAG AACCATTGAT 3361 GTAAAATGTA GAAATAAGTC CTGGAGAGAA CAGACAATAG CAACACTGAC TGGAAGAAGT 3421 GGTACCACTG AGTGACATTC AAATATTGTC GTAAGTTTCA TAATTCTGAA TTTATCTTTA 3481 ACTGGAATGT ATACAGGAAA GCAAATGATA CTACAATAAC TAAATAAAAA TTACATATTC 3541 TTCTAGACCA ATATAGGTAG AAAATGGAAA TGTTTTTAAA ATAAACATGT TTTTCTTTTG 3601 GTATAGTGTA CTATACTAGT GAGAAATTTT TCAACTTTAG TTTCTAGTAC TGTTGCTGCA 3661 ATATTGTTGG TACTGAACAG AGATTTCCCC AGTAACTGAC ACAAACCTCA AGCTTACCAA 3721 AGAAATGCTC AGAAAACTGG GATAAAAAGG GGAGATACTT TATAGAAGGG GATGGTATCC 3781 CCAGCAATAT TCAGCAACAT TGCTGTAAAA AGAAGAAAAT CTGAGTGAAG GTTTGACTGG 3841 TTTCCCATGA GAGGCAGACA GGGTCAAGGT GGGGTCACAT TTACTCTAAC CAGTCTCCTC 3901 TCTCATATTT GTCTTCTAGG CTAACTGCCA GGCGGCCAAA GGACAGTATG TTCACACGGG 3961 TTCTTCTGGG GCTGCTTCCA CCCAGTCGCT CTTCACAGCC TGCTATACCT ACTAGGGTCC 4021 AATGCCCGCC AGCCTAGCTC CAGTGCTTCT AGTAGGAGGG CTGAAAGGGA GCAACTTTTC 4081 CTCTAATCCT GGAAATTCGA CACAATTAGA TTTGAACTGC TGGAAATACA ACACATGTTA 4141 AATCTTAAGT ACAAGGGGGA AAAAATAAAT CAGTTATTGA AACATAAAAA TGAATACCAA 4201 GGACCTGATC AAATTTCACA CAGCAGTTTC CTTGCAACAC TTTCAGCTCC CCATGCTCCA 4261 GAATACCCAC CCAAGAAAAT AATAGGCTTT AAAACAATAT CGGCTCCTCA TCCAAAGAAC 4321 AACTGCTGAT TGAAACACCT CATTAGCTGA GTGTAGAGAA GTGCATCTTA TGAAACAGTC 4381 TTAGCAGTGG TAGGTTGGGA AGGAGATAGC TGCAACCAAA AAAGAAATAA ATATTCTATA 4441 AACCTTCAGC TGCTATCGGG TTTCACTTTT CTGCTCTTGC TGTCCAAAGA CTCAGTGTAT 4501 TTCATTACTT TTGACTCTAC TAGACATGAC TGGGTTTCAA CAGTAAAGGT CTTCAACTCT 4561 TGCTAGTCAT TGGAATCAAG CCGCAAAATT TTAAAAACTG AGATGCTCAG GCCACACCCC 4621 AGCTCAATTA AATCAGAAAC CCTAGACTTG GGATCC // LOCUS HUMALDH01 721 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human mitochondrial aldehyde dehydrogenase 2 (ALDH2) gene, exon 1. ACCESSION M20444 J04043 KEYWORDS aldehyde dehydrogenase; aldehyde:NAD+ oxidoreductase. SEGMENT 1 of 13 SOURCE Human DNA (library of Y.-F.C.Lau), clones pCV[105,108], and cDNA to mRNA, clones ALDH2[105-1,105-2,105-20,108-12a]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 721) AUTHORS Hsu,L.C., Bendel,R.E. and Yoshida,A. TITLE Genomic structure of the human mitochondrial aldehyde dehydrogenase gene JOURNAL Genomics 2, 57-65 (1988) STANDARD simple staff_review COMMENT Draft entry and printed copy of sequence for [1] kindly provided by L.C.Hsu, 01-MAR-1988. FEATURES from to/span description pept 601 + 714 aldehyde dehydrogenase precursor, exon 1 (EC 1.2.1.3) /nomgen="ALDH2" /map="12q24.2" /hgml_locus_uid="LG0051K" sigp 601 651 aldehyde dehydrogenase signal peptide matp 652 + 714 aldehyde dehydrogenase pre-msg 162 > 721 aldehyde dehydrogenase mRNA and ALDH intron IVS 715 > 721 ALDH intron A BASE COUNT 93 A 260 C 217 G 151 T ORIGIN 3 bp upstream of HincII site; chromosome 12q24.2. 1 GTCAACTGGG CTCCATTCAT TCTTTCCATT TCTCTAACAC GTGCCAGGTG GTCTCATCTC 61 CGGGCCTTTG CCCTTGCTGT TCCCTGTCAT CATTCAGGTC TCACTTGTCA TTTCCTGACC 121 ATGGTACTTA TAAAAGCAGT GCCGTCTGCC CCATCCATGT CACCTCGTTC ATCTCCTTCA 181 CCTCCGAAAT GATCTCGCTT TTGGGTTTAC GGCCGGTCTC TTCACCTGGA GCATCAGCCG 241 GGGAGGTCAG GGTCCCCTGG CTCGGGCCTG TTCACATTGG GGTCAAAGGC ACACATTGGG 301 GGCTCAACCA AGGCGAGCTC GTTCGCGGGG CCGGGTCTTT CCGCACAGGC GGAGGGCGGT 361 GGCGGGCGCG GAGGCGTCGC GCGAGCCAGG GGGCACGCAC GGGCCGGGGG TACCTAGCGC 421 CACCCGCTTC GCTTGCATCA GCTGCGCGCC CCATCCCGAG GAATGGTAGA GGCAGCCCCG 481 CCCCCGGCCC GCCCCCGCCT TTCCATTGGC TGCCGCGCGG GGCGGGGAGC GGGGTCGGCT 541 CAGTGGCCCT GAGACCCTAG CTCTGCTCTC GGTCCGCTCG CTGTCCGCTA GCCCGCTGCG 601 ATGTTGCGCG CTGCCGCCCG CTTCGGGCCC CGCCTGGGCC GCCGCCTCTT GTCAGCCGCC 661 GCCACCCAGG CCGTGCCTGC CCCCAACCAG CAGCCCGAGG TCTTCTGCAA CCAGGTGCGT 721 A // LOCUS HUMALDH02 125 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human mitochondrial aldehyde dehydrogenase 2 (ALDH2) gene, exon 2. ACCESSION M20445 J03043 KEYWORDS aldehyde dehydrogenase; aldehyde:NAD+ oxidoreductase. SEGMENT 2 of 13 SOURCE Human DNA (library of Y.-F.C.Lau), clones pCV[105,108], and cDNA to mRNA, clones ALDH2[105-1,105-2,105-20,108-12a]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 125) AUTHORS Hsu,L.C., Bendel,R.E. and Yoshida,A. TITLE Genomic structure of the human mitochondrial aldehyde dehydrogenase gene JOURNAL Genomics 2, 57-65 (1988) STANDARD simple staff_review COMMENT Draft entry and printed copy of sequence for [1] kindly provided by L.C.Hsu, 01-MAR-1988. FEATURES from to/span description pept + 11 + 115 aldehyde dehydrongenase precursor, exon 2 (EC 1.2.1.3) /nomgen="ALDH2" /map="12q24.2" /hgml_locus_uid="LG0051K" matp + 11 + 115 aldehyde dehydrongenase IVS < 1 10 ALDH intron A IVS 116 > 125 ALDH intron B BASE COUNT 37 A 29 C 32 G 27 T ORIGIN 15.4 kb after segment 1; chromosome 12q24.2. 1 CTCATTGTAG ATTTTCATAA ACAATGAATG GCACGATGCC GTCAGCAGGA AAACATTCCC 61 CACCGTCAAT CCGTCCACTG GAGAGGTCAT CTGTCAGGTA GCTGAAGGGG ACAAGGTGAG 121 AACTG // LOCUS HUMALDH03 162 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human mitochondrial aldehyde dehydrogenase 2 (ALDH2) gene, exon 3. ACCESSION M20446 J03043 KEYWORDS aldehyde dehydrogenase; aldehyde:NAD+ oxidoreductase. SEGMENT 3 of 13 SOURCE Human DNA (library of Y.-F.C.Lau), clones pCV[105,108], and cDNA to mRNA, clones ALDH2[105-1,105-2,105-20,108-12a]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 162) AUTHORS Hsu,L.C., Bendel,R.E. and Yoshida,A. TITLE Genomic structure of the human mitochondrial aldehyde dehydrogenase gene JOURNAL Genomics 2, 57-65 (1988) STANDARD simple staff_review COMMENT Draft entry and printed copy of sequence for [1] kindly provided by L.C.Hsu, 01-MAR-1988. FEATURES from to/span description pept + 11 + 151 aldehyde dehydrongenase precursor, exon 3 (EC 1.2.1.3) /nomgen="ALDH2" /map="12q24.2" /hgml_locus_uid="LG0051K" matp + 11 + 151 aldehyde dehydrongenase IVS < 1 10 ALDH intron B IVS 152 > 162 ALDH intron C BASE COUNT 28 A 50 C 56 G 28 T ORIGIN 1.4 kb after segment 2; chromosome 12q24.2. 1 TGTTTTCTAG GAAGATGTGG ACAAGGCAGT GAAGGCCGCC CGGGCCGCCT TCCAGCTGGG 61 CTCACCTTGG CGCCGCATGG ACGCATCACA CAGGGGCCGG CTGCTGAACC GCCTGGCCGA 121 TCTGATCGAG CGGGACCGGA CCTACCTGGC GGTGAGTGCT CA // LOCUS HUMALDH04 100 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human mitochondrial aldehyde dehydrogenase 2 (ALDH2) gene, exon 4. ACCESSION M20447 J03043 KEYWORDS aldehyde dehydrogenase; aldehyde:NAD+ oxidoreductase. SEGMENT 4 of 13 SOURCE Human DNA (library of Y.-F.C.Lau), clones pCV[105,108], and cDNA to mRNA, clones ALDH2[105-1,105-2,105-20,108-12a]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 100) AUTHORS Hsu,L.C., Bendel,R.E. and Yoshida,A. TITLE Genomic structure of the human mitochondrial aldehyde dehydrogenase gene JOURNAL Genomics 2, 57-65 (1988) STANDARD simple staff_review COMMENT Draft entry and printed copy of sequence for [1] kindly provided by L.C.Hsu, 01-MAR-1988. FEATURES from to/span description pept + 11 + 90 aldehyde dehydrogenase precursor, exon 4 (EC 1.2.1.3) /nomgen="ALDH2" /map="12q24.2" /hgml_locus_uid="LG0051K" matp + 11 + 90 aldehyde dehydrogenase IVS < 1 10 ALDH intron C IVS 91 > 100 ALDH intron D BASE COUNT 18 A 28 C 25 G 29 T ORIGIN 2.3 Kb after segment 3; chromosome 12q24.2.. 1 TTCTTCTCAG GCCTTGGAGA CCCTGGACAA TGGCAAGCCC TATGTCATCT CCTACCTGGT 61 GGATTTGGAC ATGGTCCTCA AATGTCTCCG GTATGGGCTC // LOCUS HUMALDH05 132 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human mitochondrial aldehyde dehydrogenase 2 (ALDH2) gene, exon 5. ACCESSION M20448 J03043 KEYWORDS aldehyde dehydrogenase; aldehyde:NAD+ oxidoreductase. SEGMENT 5 of 13 SOURCE Human DNA (library of Y.-F.C.Lau), clones pCV[105,108], and cDNA to mRNA, clones ALDH2[105-1,105-2,105-20,108-12a]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 132) AUTHORS Hsu,L.C., Bendel,R.E. and Yoshida,A. TITLE Genomic structure of the human mitochondrial aldehyde dehydrogenase gene JOURNAL Genomics 2, 57-65 (1988) STANDARD simple staff_review COMMENT Draft entry and printed copy of sequence for [1] kindly provided by L.C.Hsu, 01-MAR-1988. FEATURES from to/span description pept + 11 + 122 aldehyde dehydrognease precursor, exon 5 (EC 1.2.1.3) /nomgen="ALDH2" /map="12q24.2" /hgml_locus_uid="LG0051K" matp + 11 + 122 aldehyde dehydrognease IVS < 1 10 ALDH intron D IVS 123 > 132 ALDH intron E BASE COUNT 31 A 34 C 37 G 30 T ORIGIN 4.5 kb after segment 4; chromosome 12q24.2. 1 TCTCTTAAAG GTATTATGCC GGCTGGGCTG ATAAGTACCA CGGGAAAACC ATCCCCATTG 61 ACGGAGACTT CTTCAGCTAC ACACGCCATG AACCTGTGGG GGTGTGCGGG CAGATCATTC 121 CGGTGAGTCC AG // LOCUS HUMALDH06 149 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human mitochondrial aldehyde dehydrogenase 2 (ALDH2) gene, exon 6. ACCESSION M20449 J03043 KEYWORDS aldehyde dehydrogenase; aldehyde:NAD+ oxidoreductase. SEGMENT 6 of 13 SOURCE Human DNA (library of Y.-F.C.Lau), clones pCV[105,108], and cDNA to mRNA, clones ALDH2[105-1,105-2,105-20,108-12a]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 149) AUTHORS Hsu,L.C., Bendel,R.E. and Yoshida,A. TITLE Genomic structure of the human mitochondrial aldehyde dehydrogenase gene JOURNAL Genomics 2, 57-65 (1988) STANDARD simple staff_review COMMENT Draft entry and printed copy of sequence for [1] kindly provided by L.C.Hsu, 01-MAR-1988. FEATURES from to/span description pept + 11 + 139 aldehyde dehydrogenase precursor, exon 6 (EC 1.2.1.3) /nomgen="ALDH2" /map="12q24.2" /hgml_locus_uid="LG0051K" matp + 11 + 139 aldehyde dehydrogenase IVS < 1 10 ALDH intron E IVS 140 > 149 ALDH intron F BASE COUNT 31 A 39 C 45 G 34 T ORIGIN 0.5 kb after segment 5; chromosome 12q24.2. 1 TTTGTTGCAG TGGAATTTCC CGCTCCTGAT GCAAGCATGG AAGCTGGGCC CAGCCTTGGC 61 AACTGGAAAC GTGGTTGTGA TGAAGGTAGC TGAGCAGACA CCCCTCACCG CCCTCTATGT 121 GGCCAACCTG ATCAAGGAGG TGCGTGGCT // LOCUS HUMALDH07 134 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human mitochondrial aldehyde dehydrogenase 2 (ALDH2) gene, exon 7. ACCESSION M20450 J03043 KEYWORDS aldehyde dehydrogenase; aldehyde:NAD+ oxidoreductase. SEGMENT 7 of 13 SOURCE Human DNA (library of Y.-F.C.Lau), clones pCV[105,108], and cDNA to mRNA, clones ALDH2[105-1,105-2,105-20,108-12a]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 134) AUTHORS Hsu,L.C., Bendel,R.E. and Yoshida,A. TITLE Genomic structure of the human mitochondrial aldehyde dehydrogenase gene JOURNAL Genomics 2, 57-65 (1988) STANDARD simple staff_review COMMENT Draft entry and printed copy of sequence for [1] kindly provided by L.C.Hsu, 01-MAR-1988. FEATURES from to/span description pept + 11 + 124 aldehyde dehydrognease precursor, exon 7 (EC 1.2.1.3) /nomgen="ALDH2" /map="12q24.2" /hgml_locus_uid="LG0051K" matp + 11 + 124 aldehyde dehydrognease IVS < 1 10 ALDH intron F IVS 125 > 134 ALDH intron G BASE COUNT 23 A 38 C 41 G 32 T ORIGIN 0.8 kb after segment 6; chromosome 12q24.2. 1 TTGCTCACAG GCTGGCTTTC CCCCTGGTGT GGTCAACATT GTGCCTGGAT TTGGCCCCAC 61 GGCTGGGGCC GCCATTGCCT CCCATGAGGA TGTGGACAAA GTGGCATTCA CAGGCTCCAC 121 TGAGGTAAGG TGAC // LOCUS HUMALDH08 123 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human mitochondrial aldehyde dehydrogenase 2 (ALDH2) gene, exon 8. ACCESSION M20451 J03043 KEYWORDS aldehyde dehydrogenase; aldehyde:NAD+ oxidoreductase. SEGMENT 8 of 13 SOURCE Human DNA (library of Y.-F.C.Lau), clones pCV[105,108], and cDNA to mRNA, clones ALDH2[105-1,105-2,105-20,108-12a]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 123) AUTHORS Hsu,L.C., Bendel,R.E. and Yoshida,A. TITLE Genomic structure of the human mitochondrial aldehyde dehydrogenase gene JOURNAL Genomics 2, 57-65 (1988) STANDARD simple staff_review COMMENT Draft entry and printed copy of sequence for [1] kindly provided by L.C.Hsu, 01-MAR-1988. FEATURES from to/span description pept + 11 + 113 aldehyde dehydrogenase precursor, exon 8 (EC 1.2.1.3) /nomgen="ALDH2" /map="12q24.2" /hgml_locus_uid="LG0051K" matp + 11 + 113 aldehyde dehydrogenase IVS < 1 10 ALDH intron G IVS 114 > 123 ALDH intron H BASE COUNT 29 A 31 C 38 G 25 T ORIGIN 0.6 kb after segment 7; chromosome 12q24.2. 1 TGTCCCACAG ATTGGCCGCG TAATCCAGGT TGCTGCTGGG AGCAGCAACC TCAAGAGAGT 61 GACCTTGGAG CTGGGGGGGA AGAGCCCCAA CATCATCATG TCAGATGCCG ATAGTGAGTT 121 TCC // LOCUS HUMALDH09 204 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human mitochondrial aldehyde dehydrogenase 2 (ALDH2) gene, exon 9. ACCESSION M20452 J03043 KEYWORDS aldehyde dehydrogenase; aldehyde:NAD+ oxidoreductase. SEGMENT 9 of 13 SOURCE Human DNA (library of Y.-F.C.Lau), clones pCV[105,108], and cDNA to mRNA, clones ALDH2[105-1,105-2,105-20,108-12a]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 204) AUTHORS Hsu,L.C., Bendel,R.E. and Yoshida,A. TITLE Genomic structure of the human mitochondrial aldehyde dehydrogenase gene JOURNAL Genomics 2, 57-65 (1988) STANDARD simple staff_review COMMENT Draft entry and printed copy of sequence for [1] kindly provided by L.C.Hsu, 01-MAR-1988. FEATURES from to/span description pept + 11 + 194 aldehyde dehydrognease precursor, exon 9 (EC 1.2.1.3) /nomgen="ALDH2" /map="12q24.2" /hgml_locus_uid="LG0051K" matp + 11 + 194 aldehyde dehydrognease IVS < 1 10 ALDH intron H IVS 195 > 204 ALDH intron I BASE COUNT 34 A 59 C 71 G 40 T ORIGIN 0.6 kb after segment 8; chromosome 12q24.2. 1 CTTCCCGCAG TGGATTGGGC CGTGGAACAG GCCCACTTCG CCCTGTTCTT CAACCAGGGC 61 CAGTGCTGCT GTGCCGGCTC CCGGACCTTC GTGCAGGAGG ACATCTATGA TGAGTTTGTG 121 GAGCGGAGCG TTGCCCGGGC CAAGTCTCGG GTGGTCGGGA ACCCCTTTGA TAGCAAGACC 181 GAGCAGGGGC CGCAGTGAGC CAGG // LOCUS HUMALDH10 186 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human mitochondrial aldehyde dehydrogenase 2 (ALDH2) gene, exon 10. ACCESSION M20453 J03043 KEYWORDS aldehyde dehydrogenase; aldehyde:NAD+ oxidoreductase. SEGMENT 10 of 13 SOURCE Human DNA (library of Y.-F.C.Lau), clones pCV[105,108], and cDNA to mRNA, clones ALDH2[105-1,105-2,105-20,108-12a]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 186) AUTHORS Hsu,L.C., Bendel,R.E. and Yoshida,A. TITLE Genomic structure of the human mitochondrial aldehyde dehydrogenase gene JOURNAL Genomics 2, 57-65 (1988) STANDARD simple staff_review COMMENT Draft entry and printed copy of sequence for [1] kindly provided by L.C.Hsu, 01-MAR-1988. FEATURES from to/span description pept + 11 + 176 aldehyde dehydrongenase precursor, exon 10 (EC 1.2.1.3) /nomgen="ALDH2" /map="12q24.2" /hgml_locus_uid="LG0051K" matp + 11 + 176 aldehyde dehydrongenase IVS < 1 10 ALDH intron I IVS 177 > 186 ALDH intron J BASE COUNT 43 A 41 C 62 G 40 T ORIGIN 5.0 kb after segment 9; chromosome 12q24.2. 1 GTCTCCACAG GGTGGATGAA ACTCAGTTTA AGAAGATCCT CGGCTACATC AACACGGGGA 61 AGCAACAGGG GGCGAAGCTG CTGTGTGGTG GGGGCATTGC TGCTGACCGT GGTTACTTCA 121 TCCAGCCCAC TGTGTTTGGA GATGTGCAGG ATGGCATGAC CATCGCCAAG GAGGAGGTGA 181 GCACTT // LOCUS HUMALDH11 178 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human mitochondrial aldehyde dehydrogenase 2 (ALDH2) gene, exon 11. ACCESSION M20454 J03043 KEYWORDS aldehyde dehydrogenase; aldehyde:NAD+ oxidoreductase. SEGMENT 11 of 13 SOURCE Human DNA (library of Y.-F.C.Lau), clones pCV[105,108], and cDNA to mRNA, clones ALDH2[105-1,105-2,105-20,108-12a]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 178) AUTHORS Hsu,L.C., Bendel,R.E. and Yoshida,A. TITLE Genomic structure of the human mitochondrial aldehyde dehydrogenase gene JOURNAL Genomics 2, 57-65 (1988) STANDARD simple staff_review COMMENT Draft entry and printed copy of sequence for [1] kindly provided by L.C.Hsu, 01-MAR-1988. FEATURES from to/span description pept + 11 + 168 aldehyde dehydrogenase precursor, exon 11 (EC 1.2.1.3) /nomgen="ALDH2" /map="12q24.2" /hgml_locus_uid="LG0051K" matp + 11 + 168 aldehyde dehydrogenase IVS < 1 10 ALDH intron J IVS 169 > 178 ALDH intron K BASE COUNT 42 A 47 C 51 G 38 T ORIGIN 1.6 kb after segment 10; chromosome 12q24.2. 1 TCTCACTCAG ATCTTCGGGC CAGTGATGCA GATCCTGAAG TTCAAGACCA TAGAGGAGGT 61 TGTTGGGAGA GCCAACAATT CCACGTACGG GCTGGCCGCA GCTGTCTTCA CAAAGGATTT 121 GGACAAGGCC AATTACCTGT CCCAGGCCCT CCAGGCGGGC ACTGTGTGGT AAGAGCCT // LOCUS HUMALDH12 135 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human mitochondrial aldehyde dehydrogenase 2 (ALDH2) gene, exon 12. ACCESSION M20455 J03043 KEYWORDS aldehyde dehydrogenase; aldehyde:NAD+ oxidoreductase. SEGMENT 12 of 13 SOURCE Human DNA (library of Y.-F.C.Lau), clones pCV[105,108], and cDNA to mRNA, clones ALDH2[105-1,105-2,105-20,108-12a]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 135) AUTHORS Hsu,L.C., Bendel,R.E. and Yoshida,A. TITLE Genomic structure of the human mitochondrial aldehyde dehydrogenase gene JOURNAL Genomics 2, 57-65 (1988) STANDARD simple staff_review REFERENCE 2 (sites; allele ALDH2-2 variation) AUTHORS Crabb,D.W., Edenberg,H.J., Bosron,W.F. and Li,T.-K. TITLE Genotypes for aldehyde dehydrogenase deficiency and alcohol sensitivity: The inactive ALDH2-2 allele is dominant JOURNAL J. Clin. Invest. 83, 314-316 (1989) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly provided by L.C.Hsu, 01-MAR-1988. Many orientals lack the mitochondrial aldehyde dehydrogenase (ALDH2) activity responsible for the oxidation of acetaldehyde produced during ethanol metabolism. A substitution of lysine for glutamate at amino acid position 487 produced by a point mutation at position 114 (g->a) in the sequence below is the cause. FEATURES from to/span description pept + 11 + 125 aldehyde dehydrogenase precursor, exon 12 (EC 1.2.1.3) /nomgen="ALDH2" /map="12q24.2" /hgml_locus_uid="LG0051K" matp + 11 + 125 aldehyde dehydrogenase IVS < 1 10 ALDH intron J IVS 126 > 135 ALDH intron K allele 114 114 g in ALDH2-1 allele; a in ALDH2-2 allele BASE COUNT 31 A 25 C 48 G 31 T ORIGIN 3.7 kb after segment 11; chromosome 12q24.2. 1 CAAATTACAG GGTCAACTGC TATGATGTGT TTGGAGCCCA GTCACCCTTT GGTGGCTACA 61 AGATGTCGGG GAGTGGCCGG GAGTTGGGCG AGTACGGGCT GCAGGCATAC ACTGAAGTGA 121 AAACTGTGAG TGTGG // LOCUS HUMALDH13 453 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human mitochondrial aldehyde dehydrogenase 2 (ALDH2) gene, exon 13. ACCESSION M20456 J03043 KEYWORDS aldehyde dehydrogenase; aldehyde:NAD+ oxidoreductase. SEGMENT 13 of 13 SOURCE Human DNA (library of Y.-F.C.Lau), clones pCV[105,108], and cDNA to mRNA, clones ALDH2[105-1,105-2,105-20,108-12a]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 453) AUTHORS Hsu,L.C., Bendel,R.E. and Yoshida,A. TITLE Genomic structure of the human mitochondrial aldehyde dehydrogenase gene JOURNAL Genomics 2, 57-65 (1988) STANDARD simple staff_review COMMENT Draft entry and printed copy of sequence for [1] kindly provided by L.C.Hsu, 01-MAR-1988. FEATURES from to/span description pept + 11 43 aldehyde dehydrogenase precursor, exon 13 (EC 1.2.1.3) /nomgen="ALDH2" /map="12q24.2" /hgml_locus_uid="LG0051K" matp + 11 40 aldehyde dehydrogenase IVS < 1 10 ALDH intron K BASE COUNT 154 A 88 C 82 G 129 T ORIGIN 5.7 kb after segment 12; chromosome 12q24.2. 1 CCCCTTACAG GTCACAGTCA AAGTGCCTCA GAAGAACTCA TAAGAATCAT GCAAGCTTCC 61 TCCCTCAGCC ATTGATGGAA AGTTCAGCAA GATCAGCAAC AAAACCAAGA AAAATGATCC 121 TTGCGTGCTG AATATCTGAA AAGAGAAATT TTTCCTACAA AATCTCTTGG GTCAAGAAAG 181 TTCTAGAATT TGAATTGATA AACATGGTGG GTTGGCTGAG GGTAAGAGTA TATGAGGAAC 241 CTTTTAAACG ACAACAATAC TGCTAGCTTT CAGGATGATT TTTAAAAAAT AGATTCAAAT 301 GTGTTATCCT CTCTCTGAAA CGCTTCCTAT AACTCGAGTT TATAGGGGAA GAAAAAGCTA 361 TTGTTTACAA TTATATCACC ATTAAGGCAA CTGCTACACC CTGCTTTGTA TTCTGGGCTA 421 AGATTCATTA AAAACTAGCT GCTCTTAACT TAC // LOCUS HUMALDOB1 1032 bp ds-DNA PRI 01-SEP-1988 DEFINITION Human aldolase B gene, exon 1 and 5' flank. ACCESSION D00175 KEYWORDS aldolase; aldolase B. SEGMENT 1 of 9 SOURCE Human genomic library in lambda-Charon 4A supplied by T.Maniatis, clone HAB5-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1032) AUTHORS Mukai,T., Yatsuki,H., Arai,Y., Joh,K., Matsuhashi,S. and Hori,K. TITLE Human aldolase B gene: characterization of the genomic aldolase B gene and analysis of sequences required for multiple polyadenylations JOURNAL J. Biochem. 102, 1043-1051 (1987) STANDARD simple automatic COMMENT The human aldolase B gene is composed of nine exons interrupted by eight introns and spans 15 kb. The length of a part of the intron f which they could not clone was determined by Southern blotting, and it was estimated to be 0.3 kb. To define the mRNA start site, S1 nuclease analysis was performed. Three start sites, one major and two minor sites, were identified. CAT and TATA sequences were found at appropriate distances upstream from presumed transcription initiation sites, and unusual poly(A) stretches were observed near the promoter region. corresponds to DDBJ (rel3.0) entry (D00175) dated 18-APR-1988 FEATURES from to/span description pre-msg 872 > 1032 aldolase B mRNA (alt.) /nomgen="ALDOB" /map="9q21.3-q22.2" /hgml_locus_uid="LL0050R" pre-msg 900 > 1032 aldolase B mRNA (alt.) pre-msg 921 > 1032 aldolase B mRNA (alt.) IVS 991 > 1032 aldolase B intron a BASE COUNT 357 A 182 C 192 G 301 T ORIGIN 920 bp upstream of the major start site of the aldolase B mRNA. 1 GCAGAATGAA CCCATACCCA GACAGTACCA TATTTAGGGG TGTTGATGGC TTAGTTCAAT 61 CACTTTAATT CTAGTTTTTA ACCAAATTTG AAGACAAAAG TACAAAGGTT AAAAGAGAAT 121 GGGAATACAA AACAGTGATT TCTGGCATTA AATACATCAG AAAGCCCAGC TAGGTTTTGG 181 AGGACTGGAG AAGATGGGAG AGACAATGAT TTTTAAAAAC AGAATGTTTT GAATTTTATT 241 TCCAATTTGG ACAAATTCAC ACATGGGAAT TCCAGGATTA CAGAATCAAT TTCTTTTATT 301 GAAAGATTAT TTAAATTTAA CTATAAATTA TAGAATAAAT TGTTGGAATT GACTACTAAT 361 GATATACTGA GATAAATAAT GAAAATATAG CATTGATTAC CTAATTTGGA AATAATAATT 421 CTTGAATGCC TACTGCGTAC AGACACTATA CAACTCACTT AATGCTCCTG TGAAATTCAG 481 ATCCCACTTT ACAGAGAAGA AAATAACGGC TCAGACAAGT AAAAAACAAA ACCAAAAACC 541 CGTCACCAAG TTCATGTGTT TTGTAAGTGT TACAGCTACA CTAACATTCT TCCTAGTATC 601 ATTTAATGGG ACCCTAGAGC CTCCATCATT GACTAGCTGT ATGACTTAAG GGAACCTCCT 661 TATCTTCCTT GCCTTAGTTT CCTCATCTGT GAAATGGAGG GCTGGATTAG GAACTCCACG 721 GTCTAAAGAC ATTCCTTGCA GCTTTAACAT TCTATGAGCA TAAGGAGTAT ATGGGCATGA 781 TTTTAAGGAC TGGTGTTTAT GAGCCAATCA GAGGTGTTGA ATAAACACCT CCCTACTAGG 841 TCAAGGTAGA AAGGGGAGGG CAAATATTGG AAAAAAAAAA CATGATGAGA AGTCTATAAA 901 AATTGTGTGC TACCAAAGAT CTGTCTTATT TGGCAGCTGC TGCCTCACCC ACAGCTTTTG 961 ATATCTAGGA GGACTCTTCT CTCCCAAACT GTAAGTAAAA AAGCTTCCTT TGTATACCTG 1021 GACAGCACCA GT // LOCUS HUMALDOB2 202 bp ds-DNA PRI 01-SEP-1988 DEFINITION Human aldolase B gene, exon 2. ACCESSION D00176 KEYWORDS aldolase; aldolase B. SEGMENT 2 of 9 SOURCE Human genomic library in lambda-Charon 4A supplied by T.Maniatis, clone HAB5-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 202) AUTHORS Mukai,T., Yatsuki,H., Arai,Y., Joh,K., Matsuhashi,S. and Hori,K. TITLE Human aldolase B gene: characterization of the genomic aldolase B gene and analysis of sequences required for multiple polyadenylations JOURNAL J. Biochem. 102, 1043-1051 (1987) STANDARD simple automatic COMMENT The human aldolase B gene is composed of nine exons interrupted by eight introns and spans 15 kb. The length of the part of intron f which they could not clone was determined by Southern blotting, and it was estimated to be 0.3 kb. To define the mRNA start site, S1 nuclease analysis was performed. corresponds to DDBJ (rel3.0) entry (D00176) dated 18-APR-1988 FEATURES from to/span description pept 51 + 162 aldolase B (exon 2) /nomgen="ALDOB" /map="9q21.3-q22.2" /hgml_locus_uid="LL0050R" pre-msg < 1 > 202 aldolase B mRNA intron a intron b IVS < 1 40 aldolase B intron a IVS 163 > 202 aldolase B intron b BASE COUNT 57 A 55 C 44 G 46 T ORIGIN About 4.9 kb after segment 1. 1 CCCATGGTTC TGTGCACAAC TCTCGCACAC TCTTTTCCAG ACCTGTCACC ATGGCCCACC 61 GATTTCCAGC CCTCACCCAG GAGCAGAAGA AGGAGCTCTC AGAAATTGCC CAGAGCATTG 121 TTGCCAATGG AAAGGGGATC CTGGCTGCAG ATGAATCTGT AGGTGAGTGT AAAAAGCAGT 181 CTCATCATAT AACAACATAT TA // LOCUS HUMALDOB3 292 bp ds-DNA PRI 01-SEP-1988 DEFINITION Human aldolase B gene, exon 3. ACCESSION D00177 KEYWORDS aldolase; aldolase B. SEGMENT 3 of 9 SOURCE Human genomic library in lambda-Charon 4A supplied by T. Maniatis, clone HAB5-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 292) AUTHORS Mukai,T., Yatsuki,H., Arai,Y., Joh,K., Matsuhashi,S. and Hori,K. TITLE Human aldolase B gene: characterization of the genomic aldolase B gene and analysis of sequences required for multiple polyadenylations JOURNAL J. Biochem. 102, 1043-1051 (1987) STANDARD simple automatic COMMENT The human aldolase B gene is composed of nine exons interrupted by eight introns and spans 15 kb. The length of the part of intron f which they could not clone was determined by Southern blotting, and it was estimated to be 0.3 kb. To define the mRNA start site, S1 nuclease analysis was performed. corresponds to DDBJ (rel3.0) entry (D00177) dated 18-APR-1988 FEATURES from to/span description pept + 41 + 252 aldolase B (exon 3) (AA at 43) /nomgen="ALDOB" /map="9q21.3-q22.2" /hgml_locus_uid="LL0050R" pre-msg < 1 > 292 aldolase B mRNA intron b intron c IVS < 1 40 aldolase B intron b IVS 253 > 292 aldolase B intron c BASE COUNT 74 A 83 C 73 G 62 T ORIGIN About 0.8 kb after segment 2. 1 TGGCTTGCTC CTTATGCTGC CCTTGGCCCT CCCACCATAG GTACCATGGG GAACCGCCTG 61 CAGAGGATCA AGGTGGAAAA CACTGAAGAG AACCGCCGGC AGTTCCGAGA AATCCTCTTC 121 TCTGTGGACA GTTCCATCAA CCAGAGCATC GGGGGTGTGA TCCTTTTCCA CGAGACCCTC 181 TACCAGAAGG ACAGCCAGGG AAAGCTGTTC AGAAACATCC TCAAGGAAAA GGGGATCGTG 241 GTGGGAATCA AGGTAACACC TCCATTCACA CCTCACTTCT GCTTTGTCCT AG // LOCUS HUMALDOB4 122 bp ds-DNA PRI 01-SEP-1988 DEFINITION Human aldolase B gene, exon 4. ACCESSION D00178 KEYWORDS aldolase; aldolase B. SEGMENT 4 of 9 SOURCE Human genomic library in lambda-Charon 4A supplied by T.Maniatis, clone HAB5-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 122) AUTHORS Mukai,T., Yatsuki,H., Arai,Y., Joh,K., Matsuhashi,S. and Hori,K. TITLE Human aldolase B gene: characterization of the genomic aldolase B gene and analysis of sequences required for multiple polyadenylations JOURNAL J. Biochem. 102, 1043-1051 (1987) STANDARD simple automatic COMMENT The human aldolase B gene is composed of nine exons interrupted by eight introns and spans 15 kb. The length of the part of intron f which they could not clone was determined by Southern blotting, and it was estimated to be 0.3 kb. To define the mRNA start sites, S1 nuclease analysis was performed. corresponds to DDBJ (rel3.0) entry (D00178) dated 18-APR-1988 FEATURES from to/span description pept + 41 + 95 aldolase B (exon 4) /nomgen="ALDOB" /map="9q21.3-q22.2" /hgml_locus_uid="LL0050R" pre-msg < 1 > 122 aldolase B intron c intron d IVS < 1 40 aldolase B intron c IVS 96 > 122 aldolase B intron d BASE COUNT 40 A 28 C 23 G 31 T ORIGIN About 1.3 kb after segment 3. 1 TTCCTGACAT CTGACACCTG TTAATTATTT GTATCCACAG TTAGACCAAG GAGGTGCTCC 61 TCTTGCAGGA ACAAACAAAG AAACCACCAT TCAAGGTAAG GATATTGCCC ACTGAGATGA 121 AT // LOCUS HUMALDOB5 241 bp ds-DNA PRI 01-SEP-1988 DEFINITION Human aldolase B gene, exon 5. ACCESSION D00179 KEYWORDS aldolase; aldolase B. SEGMENT 5 of 9 SOURCE Human genomic library in lambda-Charon 4A supplied by T.Maniatis, clone HAB5-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 241) AUTHORS Mukai,T., Yatsuki,H., Arai,Y., Joh,K., Matsuhashi,S. and Hori,K. TITLE Human aldolase B gene: characterization of the genomic aldolase B gene and analysis of sequences required for multiple polyadenylations JOURNAL J. Biochem. 102, 1043-1051 (1987) STANDARD simple automatic COMMENT The human aldolase B gene is composed of nine exons interrupted by eight introns and spans 15 kb. The length of the part of intron f which they could not clone was determined by Southern blotting, and it was estimated to be 0.3 kb. To define the mRNA start site, S1 nuclease analysis was performed. corresponds to DDBJ (rel3.0) entry (D00179) dated 18-APR-1988 FEATURES from to/span description pept + 41 + 201 aldolase B (exon 5) (AA at 43) /nomgen="ALDOB" /map="9q21.3-q22.2" /hgml_locus_uid="LL0050R" pre-msg < 1 > 241 aldolase B intron d intron e IVS < 1 40 aldolase B intron d IVS 202 > 241 aldolase B intron e BASE COUNT 52 A 64 C 67 G 58 T ORIGIN About 1.3 kb after segment 4. 1 AAAGGTACAA AGAAGCCTTT CTCTCTTTTG TGACTTGCAG GGCTTGATGG CCTCTCAGAG 61 CGCTGTGCTC AGTACAAGAA AGATGGTGTT GACTTTGGGA AGTGGCGTCC TGTGCTGAGG 121 ATTGCCGACC AGTGTCCATC CAGCCTCGCT ATCCAGGAAA ACGCCAACGC CCTGGCTCGC 181 TACGCCAGCA TCTGTCAGCA GGTGCTCTGC CTTCCCCTTG GGCTGAAAAA GAGTAGGCTA 241 G // LOCUS HUMALDOB6 155 bp ds-DNA PRI 01-SEP-1988 DEFINITION Human aldolase B gene, exon 6. ACCESSION D00180 KEYWORDS aldolase; aldolase B. SEGMENT 6 of 9 SOURCE Human genomic library in lambda-Charon 4A supplied by T. Maniatis, clone HAB5-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 155) AUTHORS Mukai,T., Yatsuki,H., Arai,Y., Joh,K., Matsuhashi,S. and Hori,K. TITLE Human aldolase B gene: characterization of the genomic aldolase B gene and analysis of sequences required for multiple polyadenylations JOURNAL J. Biochem. 102, 1043-1051 (1987) STANDARD simple automatic COMMENT The human aldolase B gene is composed of nine exons interrupted by eight introns and spans 15 kb. The length of the part of intron f which they could not clone was determined by southern blotting, and it was estimated to be 0.3 kb. To define the mRNA start site, S1 nuclease analysis was performed. corresponds to DDBJ (rel3.0) entry (D00180) dated 18-APR-1988 FEATURES from to/span description pept + 32 + 115 aldolase B (exon 6) /nomgen="ALDOB" /map="9q21.3-q22.2" /hgml_locus_uid="LL0050R" pre-msg < 1 > 155 aldolase B mRNA intron e intron f IVS < 1 31 aldolase B intron e IVS 116 > 155 aldolase B intron f BASE COUNT 43 A 29 C 32 G 51 T ORIGIN About 0.9 kb after segment 5. 1 TCACTTTTGC TTCCTGATTT TTCCTTTTTA GAATGGACTG GTACCTATTG TTGAACCAGA 61 GGTAATTCCT GATGGAGACC ATGACCTGGA ACACTGCCAG TATGTTACTG AGAAGGTAAG 121 TTTTAAATAT GAAGGCCCCA ATTCTAGTGA AAATC // LOCUS HUMALDOB7 255 bp ds-DNA PRI 01-SEP-1988 DEFINITION Human aldolase B gene, exon 7. ACCESSION D00181 KEYWORDS aldolase; aldolase B. SEGMENT 7 of 9 SOURCE Human genomic library in lambda-Charon 4A supplied by T. Maniatis, clone HAB4-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 255) AUTHORS Mukai,T., Yatsuki,H., Arai,Y., Joh,K., Matsuhashi,S. and Hori,K. TITLE Human aldolase B gene: characterization of the genomic aldolase B gene and analysis of sequences required for multiple polyadenylations JOURNAL J. Biochem. 102, 1043-1051 (1987) STANDARD simple automatic COMMENT The human aldolase B gene is composed of nine exons interrupted by eight introns and spans 15 kb. The length of the part of intron f which they could not clone was determined by Southern blotting, and it was estimated to be 0.3 kb. To define the mRNA start site, S1 nuclease analysis was performed. corresponds to DDBJ (rel3.0) entry (D00181) dated 18-APR-1988 FEATURES from to/span description pept + 41 + 215 aldolase B (exon 7) /nomgen="ALDOB" /map="9q21.3-q22.2" /hgml_locus_uid="LL0050R" pre-msg < 1 > 255 aldolase B mRNA intron f intron g IVS < 1 40 aldolase B intron f IVS 216 > 255 aldolase B intron g BASE COUNT 53 A 76 C 56 G 70 T ORIGIN About 0.5 kb after segment 6. 1 TAGCTTTGAG TTTCACCTGG TGGGACCTCT TGTCCTTCAG GTCCTGGCTG CTGTCTACAA 61 GGCCCTGAAT GACCATCATG TTTACCTGGA GGGCACCCTG CTAAAGCCCA ACATGGTGAC 121 TGCTGGACAT GCCTGCACCA AGAAGTATAC TCCAGAACAA GTAGCTATGG CCACCGTAAC 181 AGCTCTCCAC CGTACTGTTC CTGCAGCTGT TCCTGGTAAG GCCTTCTTTC TTCTCTAACT 241 CAAGGTCTTA GCCCT // LOCUS HUMALDOB8 280 bp ds-DNA PRI 01-SEP-1988 DEFINITION Human aldolase B gene, exon 8. ACCESSION D00182 KEYWORDS aldolase; aldolase B. SEGMENT 8 of 9 SOURCE Human genomic library in lambda-Charon 4A supplied by T.Maniatis, clone HAB-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 280) AUTHORS Mukai,T., Yatsuki,H., Arai,Y., Joh,K., Matsuhashi,S. and Hori,K. TITLE Human aldolase B gene: characterization of the genomic aldolase B gene and analysis of sequences required for multiple polyadenylations JOURNAL J. Biochem. 102, 1043-1051 (1987) STANDARD simple automatic COMMENT The human aldolase B gene is composed of nine exons interrupted by eight introns and spans 15 kb. The length of the part of intron f which they could not clone was determined by Southern blotting, and it was estimated to be 0.3 kb. To define the mRNA start site, S1 nuclease analysis was performed. corresponds to DDBJ (rel3.0) entry (D00182) dated 18-APR-1988 FEATURES from to/span description pept + 41 + 240 aldolase B (exon 8) (AA at 43) /nomgen="ALDOB" /map="9q21.3-q22.2" /hgml_locus_uid="LL0050R" pre-msg < 1 > 280 aldolase B mRNA intron g intron h IVS < 1 40 aldolase B intron g IVS 241 > 280 aldolase B intron h BASE COUNT 65 A 72 C 73 G 70 T ORIGIN About 0.4 kb after segment 7. 1 AGGTGGGACT AATAGAGTGA AATGGCTTCT CTCCTACCAG GCATCTGCTT TTTGTCTGGT 61 GGCATGAGTG AAGACGATGC CACTCTCAAC CTCAATGCTA TCAACCTTTG CCCTCTACCA 121 AAGCCCTGGA AACTAAGTTT CTCTTATGGA CGGGCCCTGC AGGCCAGTGC ACTGGCTGCC 181 TGGGGTGGCA AGGCTGCAAA CAAGGAGGCA ACCCAGGAGG CTTTTATGAA GCGGGCCATG 241 GTAAGATGCT GCCACCTCTT ATCTACTTGA TGATGTTCAC // LOCUS HUMALDOB9 777 bp ds-DNA PRI 01-SEP-1988 DEFINITION Human aldolase B gene, exon 9 and 3' flank. ACCESSION D00183 KEYWORDS aldolase; aldolase B. SEGMENT 9 of 9 SOURCE Human genomic library in lambda-Charon 4A supplied by T. Maniatis, clone HAB4-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 777) AUTHORS Mukai,T., Yatsuki,H., Arai,Y., Joh,K., Matsuhashi,S. and Hori,K. TITLE Human aldolase B gene: characterization of the genomic aldolase B gene and analysis of sequences required for multiple polyadenylations JOURNAL J. Biochem. 102, 1043-1051 (1987) STANDARD simple automatic COMMENT The human aldolase B gene is composed of nine exons interrupted by eight introns and spans 15 kb. The length of the part of intron f which they could not clone was determined by Southern blotting, and it was estimated to be 0.3 kb. To define the mRNA start site, S1 nuclease analysis was performed. The gene carries poly(A) addition signals at two different sites. The sequences required for mRNA 3'-end formation were examined in this gene. corresponds to DDBJ (rel3.0) entry (D00183) dated 18-APR-1988 FEATURES from to/span description pept + 41 136 aldolase B (exon 9) (AA at 41) /nomgen="ALDOB" /map="9q21.3-q22.2" /hgml_locus_uid="LL0050R" pre-msg < 1 366 aldolase B mRNA (alt.) pre-msg < 1 568 aldolase B mRNA (alt.) IVS < 1 40 aldolase B intron h BASE COUNT 240 A 187 C 143 G 207 T ORIGIN About 3.0 kb after segment 8. 1 TTTACTCTAA CCAGTCTCCT CTCTCATATT TGTCTTCTAG GCTAACTGCC AGGCCGCCAA 61 AGGACAGTAT GTTCACACGG GTTGTTCTGG GGCTGCTTCC ACCCAGTCGC TCTTCACAGC 121 CTGCTATACC TACTAGGGTC CAATGCCCGC CAGCCTAGCT CCAGTGCTTC TAGTAGGAGG 181 GCTGAAAGGG AGCAACTTTT CCTCTAATCC TGGAAATTCG ACACAATTAG ATTTGAACTG 241 CTGGAAATAC AACACATGTT AAATCTTAAG TACAAGGGGG AAAAAATAAA TCAGTTATTG 301 AAACATAAAA ATGAATACCA AGGACCTGAT CAAATTTCAC ACAGCAGTTT CCTTGCAACA 361 CTTTCAGCTC CCCATGCTCC AGAATACCCA CCCAAGAAAA TAATAGGCTT TAAAACAATA 421 TCGGCTCCTC ATCCAAAGAA CAACTGCTGA TTGAAACACC TCATTAGCTG AGTGTAGAGA 481 AGTGCATCTT ATGAAACAGT CTTAGCAGTG GTAGGTTGGG AAGGAGATAG CTGCAACCAA 541 AAAAGAAATA AATATTCTAT AAACCTTCAG CTGCTATCGG GTTTCACTTT TCTGCTCTTG 601 CTGTCCAAAG ACTCAGTGTA TTTCATTACT TTTGACTCTA CTAGACATGA CTGGGTTTCA 661 ACAGTAAAGG TCTTCAACTC TTGCTAGTCA TTGGAATCAA GCCGCAAAAT TTTAAAAACT 721 GAGATGCTCA GGCCACACCC CAGCTCAATT AAATCAGAAA CCCTAGACTT GGGATCC // LOCUS HUMALPHA 4556 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human alkaline phosphatase gene, complete cds. ACCESSION J03252 J03512 KEYWORDS Nagao isozyme; alkaline phosphatase. SOURCE Human placenta and spleen DNA, (library of K.Mayo), clone lambda-4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4487) AUTHORS Millan,J.L. and Manes,T. TITLE Seminoma-derived Nagao isozyme is encoded by a germ-cell alkaline phosphatase gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 3024-3028 (1988) STANDARD simple staff_review REFERENCE 2 (bases 1 to 4556) AUTHORS Millan,J.L. JOURNAL Unpublished (1988) La Jolla Cancer Res. Found., La Jolla CA 92037 STANDARD simple staff_review COMMENT Draft entry and computer-readable sequence for [1],[2] kindly provided by J.L.Millan, 26-APR-1988. FEATURES from to/span description pept 450 516 alkaline phosphatase precursor, exon 1 (EC 3.1.3.1) /nomgen="ALPP" /map="2q37" /hgml_locus_uid="LV0092C" 611 727 alkaline phosphatase precursor, exon 2 839 954 alkaline phosphatase precursor, exon 3 1147 1321 alkaline phosphatase precursor, exon 4 1399 1571 alkaline phosphatase precursor, exon 5 1819 1953 alkaline phosphatase precursor, exon 6 2053 2125 alkaline phosphatase precursor, exon 7 2254 2388 alkaline phosphatase precursor, exon 8 2470 2661 alkaline phosphatase precursor, exon 9 2881 2997 alkaline phosphatase precursor, exon 10 3120 3418 alkaline phosphatase precursor, exon 11 sigp 450 506 alkaline phosphatase signal peptide matp 507 516 alkaline phosphatase 611 727 alkaline phosphatase 839 954 alkaline phosphatase 1147 1321 alkaline phosphatase 1399 1571 alkaline phosphatase 1819 1953 alkaline phosphatase 2053 2125 alkaline phosphatase 2254 2388 alkaline phosphatase 2470 2661 alkaline phosphatase 2881 2997 alkaline phosphatase 3120 3415 alkaline phosphatase pre-msg 405 > 4234 alkaline phosphatase mRNA and introns IVS 517 610 ALPA intron A IVS 728 838 ALPA intron B IVS 955 1146 ALPA intron C IVS 1322 1398 ALPA intron D IVS 1572 1818 ALPA intron E IVS 1954 2052 ALPA intron F IVS 2126 2253 ALPA intron G IVS 2389 2469 ALPA intron H IVS 2662 2880 ALPA intron I IVS 2998 3119 ALPA intron J BASE COUNT 912 A 1417 C 1413 G 814 T ORIGIN 3 bp upstream of SmaI site; chromosome 2q37. 1 CCCGGGCTGT GTGCTTCCAG CCTCCCCTCC TCTCGACACC AGAACAGAGC CTGGCCCCCA 61 GCTCCCAGGA AATACAGAAA AAAAAAATGG TGGATGAACG AGTGACAGGG TGTCTTGTTC 121 CACACAAGAC ACAGTGAGCA GGGGTTGGGG GAGGGGCCCC TGGGGCAGGA TGCACACTGC 181 ACTATACCCA AAATCCCCAC CCTTCCCTGG GGACACCTGG TCCACCCTAA GCTGCCTTTC 241 TCAGGACCCA GCCCCAGCCC AGCCCAGCCA ACCCTGCCAC TCCCTTCAGC CAGTGTGGCT 301 TCAGGTCAAG AGGCTGGGCG GGGTCAAGGT GGTAACAAGG GGAGGGGCCA GGACACAGTT 361 TTCCCTGATT TAAACCCAGG CAGCCTGGAG TGCAGCTCAT ACTCCATACC TGGGATTTCC 421 GCCTCGCCGC TCTCCGACTG CTTCCAGACA TGCAGGGGCC CTGGGTGCTG CTCCTGCTGG 481 GCCTGAGGCT ACAGCTCTCC CTGGGCATCA TCCCAGGTAA TGAGGCTCCC CAGCCTGCCC 541 TACACACACA CACACACACA CACCACCCCC CAGCCCAGGC TGACCTGATC TTTGCTCTCC 601 CCCTGGCCAG TTGAGGAGGA GAACCCGGAC TTCTGGAACC GCCAGGCAGC CGAGGCCCTG 661 GGTGCCGCCA AGAAGCTGCA GCCTGCACAG ACAGCCGCCA AGAACCTCAT CATGTTCCTG 721 GGTGACGGTG AGTGAGCCAG CCTTCCAGCC CCGCAGCCCT CACAGCCCCG GCGCCCGGAC 781 CCTCAGTGGT TCCAGGACAG CCCTGGGGAG CAAGCCTCAC ACACTTCTGC TCCTTCAGGG 841 ATGGGGGTGT CTACGGTGAC AGCTGCCAGG ATCCTAAAAG GGCAGAAGAA GGACAAACTG 901 GGGCCTGAGA CCTTCCTGGC CATGGACCGC TTCCCGTACG TGGCTCTGTC CAAGGTAAGT 961 GCTGGGCTAC CTTAGAGTCC TCCAAGCAGA GAAGGGGAAT CCTGGCTATG GAGTGTGGTA 1021 GGAGGGAGGG ACCCTAAACA GCTGGGGCTC CAATAAGGAG CTGGAGGCAG TTGGAATCCC 1081 AGAGGACAGA GATCAGGGTC TTGTTTGTCT GCCCCAGAGA AGAGCTCAGA GTGTCTCTGT 1141 CCCCAGACAT ACAGTGTAGA CAAGCATGTG CCAGACAGTG GAGCCACAGC CACGGCCTAC 1201 CTGTGCGGGG TCAAGGGCAA CTTCCAGACC ATTGGCTTGA GTGCAGCCGC CCGCTTTAAC 1261 CAGTGCAACA CGACACGCGG CAACGAGGTC ATCTCCGTGG TGAATCGGGC CAAGAAAGCA 1321 GGTGGAGCTG GGGCCCGGCT GTGGGGTCAG GGCCAGTGAC AGACCTCTAT CGCATATCCT 1381 GACCTCTATC ACCCTCAGGA AAGTCAGTGG GAGTGGTAAC CACCACACGG GTGCAGCATG 1441 CCTCGCCAGC CGGCACCTAC GCCCACACGG TGAACCGCAA CTGGTACTCG GATGCCGACG 1501 TGCCTGCCTC GGCCCGCCAG GAGGGGTGCC AGGACATCGC CACGCAGCTC ATCTCCAACA 1561 TGGACATTGA TGTGCGACCC CCGGGCCAAG GGTGGGGCTG GGCAGAGAGT AGCAGGGAGG 1621 GGGCACCAGC TCAGACCAGG CAACCAAAAG CCTTATCTGG GCCAGCAGGG TCTGGAAGGT 1681 GGGGTTGGGG GCGTAGAAGG CGCACCAGGC TGGGCCATTC CCACAGCCTT GGGGAGGGGA 1741 GTCAGGGGCT CTGCATGAGG AGGGGACACG GGGCCTAGCC ATGGCCCAAA GTCCACCTGC 1801 CCCATCCTCT GTTCCCAGGT GATCCTAGGT GGAGGCCGAA AGTACATGTT TCCCATGGGG 1861 ACCCCAGACC CTGAGTACCC AGATGACTAC AGCCAAGGTG GGACCAGGCT GGACGGGAAG 1921 AATCTGGTGC AGGAATGGCT GGCGAAGCAC CAGGTGATGG GGGCTGGTGG GTGTGCTGGG 1981 CACAGCAGGG GGAGGGCAGA GGTGTGGGGC TCGGGGCTGT GGGCTGAGGC CTGGCTCTCT 2041 CCCTCCCCGC AGGGTGCCCG GTACGTGTGG AACCGCACTG AGCTCCTGCA GGCTTCCCTG 2101 GACCCGTCTG TGACCCATCT CATGGGTAAT GACCCCCTTC CTGCCCTGGC ATCCTCAGAT 2161 GGCCTCAGAT GGCACTTCTG AGCCTGTGTG CACATCCGCC AGCACCCTCC CACCCCCAGC 2221 CTGCCAGTCA CCACAGGACC CCTTGTCCCA CAGGTCTCTT TGAGCCTGGA GACATGAAAT 2281 ACGAGATCCA CCGAGACTCC ACACTGGACC CCTCCCTGAT GGAGATGACA GAGGCTGCCC 2341 TGCTCCTGCT GAGCAGGAAC CCCCGCGGCT TCTTCCTCTT CGTGGAGGGT GCGTGGTGGC 2401 CCTGGGAGTG GGGGGTTGGG GGTTGGAGCA GGGCAGGCTC AGCATCTCCC CCCTCTGGCC 2461 TTCCTGCAGG TGGTCGCATC GACCATGGTC ATCATGAAAG CAGGGCTTAC CGGGCACTGA 2521 CTGAGACGAT CATGTTCGAC GACGCCATTG AGAGGGCGGG CCAGCTCACC AGCGAGGAGG 2581 ACACGCTGAG CCTCGTCACT GCCGACCACT CCCACGTCTT CTCCTTCGGA GGCTACCCCC 2641 TGCGAGGGAG CTCCATCTTC GGTAGGCCTG GGGATGAGTG GCAGGTGCTG CTGCAGCAAT 2701 TAAGTGGGTG AAATCTGAGC CTCAGTCTCC TCCTCTGTCA AGTGGGAGTA ATGCTGGCAC 2761 CAGCCTAATA GGGTCCTCTG CGGACTAAGC CCCTGACCAG GCAAAACGTG CGGTGCCTAG 2821 CACGTGGGAG ACACTCCACA GCTGTGTTCA GCTCAACCAC AGGGACCCCT CTCTCTGCAG 2881 GGCTGGCCCC TGGCAAGGCC CGGGACAGGA AGGCCTACAC GGTCCTCCTA TACGGAAACG 2941 GTCCAGGCTA TGTGCTCAAG GACGGCGCCC GGCCGGATGT TACGGAGAGC GAGAGCGGTG 3001 AGTGCCGTGG GGTGGCCTGA GGGGGACCAG GGTGCCAAGG ATGGGGGGCT GGCGGGAAGG 3061 GGTCACCTCT TGTCTGCCTG GAACTGAAAC TTCCTACTGA AACTGAACCC TCCAACCAGG 3121 GAGCCCCGAG TATCGGCAGC AGTCAGCAGT GCCCCTGGAC GGAGAGACCC ACGCAGGCGA 3181 GGACGTGGCG GTGTTCGCGC GCGGCCCGCA GGCGCACCTG GTTCACGGCG TGCAGGAGCA 3241 GACCTTCATA GCGCACGTCA TGGCCTTCGC CGCCTGCCTG GAGCCCTACA CCGCCTGCGA 3301 CCTGGCGCCC CCCGCCGGCA CCACCGACGC CGCGCACCCG GGGCCGTCCG TGGTCCCCGC 3361 GTTGCTTCCT CTGCTGGCAG GGACCTTGCT GCTGCTGGGG ACGGCCACTG CTCCCTGAGT 3421 GTCCCGTCCC TGGGGGCTCC TGCTTCCCCA TCCCGGAGTT CCCCTGCTCC CCACCTCCAG 3481 TTCTGCCTGC CGGACCTCCA CCTGGAGCTG TCACCCCCGG AGTCGCCACA CAGACTGTCC 3541 TGCCATGGAA CCTTCCCTCC CGGTGCACCC TGGGGACCGA GCCCTTGACA CCACGCCCTT 3601 TGCTTTATCT TGCTCTTAAA TTTTGGCCCC AACTCCAGGG ACTGGGGATT TGTGCCTGGC 3661 AGCTGCCTGC ATTTCAGGAA AAGAGGAGGC TCAGACCATC CAGCCCCCCG CCCATATCCT 3721 GAGGTGGATC AGGCAGGCTC TCTCCCCGGG GACATGAGGC ACCCATACCT AGGACCCCCT 3781 GCGCCTTTTT TAGCTTCAGT CATGGCAGCA CCTGAGGGAC ACAAGGACTT GGGTGCATCA 3841 GGACGCCTTG GAGAAGCGTG GCTTCCTGCC ACCCTGCAAC CCACCCTCCC AGCCAAGGAG 3901 GCTGCTGTGG TGGGGATCCC CAGGGGCTTT GACACAGTCC TCTGCTGTCC CTCCACTGGG 3961 CTAATTCTAC ACCCCTGTGC CCCTCCTAGG CGCATGAGTC AGAGAGGCTT GCCCAAGTCA 4021 CAGCCACTCA GATGTTCGAC GCCCTAAGGT CCATTCCAGC ACCCACCTGA GTTCCGAGGA 4081 GCACCTGGGA AGCTCTGGGT GCAGGATAGC AGTCCAGAGT CCATGGCCCC GCTAGGCCAT 4141 CTGGGTGCTG GGCATGGATT TCTCAGCAAG GAAGACTCAT TACCTTCCCT CCCTGGGCCT 4201 CCATTCTTCT GGGAAACACA AAGCAATAAT AAAAGGAAGT GTTAGACAAT GTAATGCCAG 4261 TACTACTTCC TAGCATAAAA ATCATGACTG AATGTGGACA CAGTGGCTGG AGGGGTGGAT 4321 AACACAGGCC AGGAGGGGCT GCTGAGGAGC AGATGACTGA GCAGGAGACC TGAACAGAGT 4381 CGGGGCTTGA GCAAGGTGGC ACAGCAGCAC AAAGGCCCTG GGGAGTGTCA GCAGGCTGTC 4441 TGGGAGGCCA GGGGTGGATC AGAGGGTGGG TAGATGGGGT AAAGCTTGAG GGGTCAGGAG 4501 GGTGGGGGAC ATGGGGGACC GTGAAGTCTA GGTAGAGGGT GTGGTTGGAG GTCTTT // LOCUS HUMALPI 5291 bp ds-DNA PRI 30-SEP-1988 DEFINITION Human intestinal alkaline phosphatase gene, complete cds. ACCESSION J03930 KEYWORDS alkaline phosphatase. SOURCE Human fibroblast cell line 563 DNA, clones Ch40[Bam5,Bg5]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 5291) AUTHORS Henthorn,P.S., Raducha,M., Kadesch,T., Weiss,M.J. and Harris,H. TITLE Sequence and characterization of the human intestinal alkaline phosphatase gene JOURNAL J. Biol. Chem. 263, 12011-12019 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by P.S.Henthorn, 20-JUN-1988 FEATURES from to/span description pept 296 362 alkaline phosphatase precursor, exon 1 (EC 3.1.3.1) /nomgen="ALPI" /map="2q34-q37" /hgml_locus_uid="LN0047P" 445 561 alkaline phosphatase precursor, exon 2 676 791 alkaline phosphatase precursor, exon 3 995 1169 alkaline phosphatase precursor, exon 4 1246 1418 alkaline phosphatase precursor, exon 5 1661 1795 alkaline phosphatase precursor, exon 6 1891 1963 alkaline phosphatase precursor, exon 7 2094 2228 alkaline phosphatase precursor, exon 8 2313 2504 alkaline phosphatase precursor, exon 9 2728 2844 alkaline phosphatase precursor, exon 10 2956 3242 alkaline phosphatase precursor, exon 11 sigp 296 352 alkaline phosphatase signal peptide matp 353 362 alkaline phosphatase 445 561 alkaline phosphatase 676 791 alkaline phosphatase 995 1169 alkaline phosphatase 1246 1418 alkaline phosphatase 1661 1795 alkaline phosphatase 1891 1963 alkaline phosphatase 2094 2228 alkaline phosphatase 2313 2504 alkaline phosphatase 2728 2844 alkaline phosphatase 2956 3239 alkaline phosphatase pre-msg 246 4128 ALP mRNA and introns IVS 363 444 ALP intron A IVS 562 675 ALP intron B IVS 792 994 ALP intron C IVS 1170 1245 ALP intron D IVS 1419 1660 ALP intron E IVS 1796 1890 ALP intron F IVS 1964 2093 ALP intron G IVS 2229 2312 ALP intron H IVS 2505 2727 ALP intron I IVS 2845 2955 ALP intron J BASE COUNT 1177 A 1520 C 1552 G 1042 T ORIGIN 118 bp upstream of HindIII site; chromosome 2q34-q37. 1 CCTAGGCTGT GTGTTTCCAG TCTCACCTCT CTTCACACCT TGAATGAGGT GAATGAAGGA 61 GTGGCAACGC GTCTCCCACA AGACACTGTG AGCCACACCC AGTCCCTTCC CTTCAGCAAG 121 CTTGGCTTCA GGTCACAGGA CTGGGCGGGG TCAAGATGGA CACCAGGGGT GTGGGGAGGG 181 ACGTGGAGCA TTTACAGCCA GGGGCAAAGT CCTCCCCTGA TTTAAACCCA GGCAGCCTGC 241 GCTGCAGCCG GTTCCTGGTG TCCCCACTTC GCCTCCCTCC TGCTGCCCCC AAGACATGCA 301 GGGGCCCTGG GTGCTGCTGC TGCTGGGCCT GAGGCTACAG CTCTCCCTGG GCGTCATCCC 361 AGGTAATGAG GCTCCCCAAG CTGTTCCACA CACAGGGCAC CCCCTCAGCC AGGCTGACCT 421 GATCTCTACT CTCCCCCTGG CCAGCTGAGG AGGAGAACCC GGCCTTCTGG AACCGCCAGG 481 CAGCTGAGGC CCTGGATGCT GCCAAGAAGC TGCAGCCCAT CCAGAAGGTC GCCAAGAACC 541 TCATCCTCTT CCTGGGCGAT GGTGAGTGAG CAAGGCCTGT CCAGCCCCGT AGTCCTCACA 601 GCCCCGGCAC CCGGGACCTT CAGTGGTTCC AGGACAACCC TGGGGCCCAG GACTCACACA 661 TTTCTGCTCC TTCAGGGTTG GGGGTGCCCA CGGTGACAGC CACCAGGATC CTAAAGGGGC 721 AGAAGAATGG CAAACTGGGG CCTGAGACGC CCCTGGCCAT GGACCGCTTC CCATACCTGG 781 CTCTGTCCAA GGTAAGGGCT GGGCCACCTC AGAGTCCTCC AAGCAGAGGA GAGGGATCAA 841 GGATATGGAG TGTGGCAGGA GGGAGGGAGC CAGGACAGCT GGGGCCTAAG TTAGGAGCTG 901 GGAGCAGTTA GGATCCCAGA GGACCAGAAC CAGGTCCTTG GTTGGGGTCT GGGTGTCCGC 961 CCCGAAGTAG AGCTCAGGGT GTCTCCGTTC GCAGACATAC AATGTGGACA GACAGGTGCC 1021 AGACAGCGCA GCCACAGCCA CGGCCTACCT GTGCGGGGTC AAGGCCAACT TCCAGACCAT 1081 CGGCTTGAGT GCAGCCGCCC GCTTTAACCA GTGCAACACG ACACGCGGCA ATGAGGTCAT 1141 CTCCGTGATG AACCGGGCCA AGCAAGCAGG TGAGCTGGGG CCCGCTGTGG GGTCAGGACC 1201 AGGCCCAAGA TCTCGGTCAC CGATCCTGAC CTCTGTCACC CTCAGGAAAG TCAGTAGGAG 1261 TGGTGACCAC CACACGGGTG CAGCACGCCT CGCCAGCCGG CACCTACGCA CACACAGTGA 1321 ACCGCAACTG GTACTCAGAT GCTGACATGC CTGCCTCAGC CCGCCAGGAG GGGTGCCAGG 1381 ACATCGCCAC TCAGCTCATC TCCAACATGG ACATTGACGT GCGACCCCCG GGCCAAGGGC 1441 TGGGGCTGGG CAGAGGGGAA GGTGGCACAG GCTCAGATCC AGGCAACCAA AAGCCTGATC 1501 TGGGTCAGCA GGTTCTGGAG GTGGAGTTGG GGATGTAGAA TGTGCAATAC AGGCTGGGCC 1561 ATTCCCACAG CCCTGGGGAG GGGAGCCAGG GGCTATGCAT GAGGAGGGGG CACGGGGCCA 1621 GCCAGGCCCC CAAACCACCT GCCCCATCCA TTGTCCTCAG GTGATCCTTG GCGGAGGCCG 1681 CAAGTACATG TTTCCCATGG GGACCCCAGA CCCTGAGTAC CCAGCTGATG CCAGCCAGAA 1741 TGGAATCAGG CTGGACGGGA AGAACCTGGT GCAGGAATGG CTGGCAAAGC ACCAGGTGAT 1801 GGGGGCTGGT GGGTGTGGGA GGCACGGCAG GGGGAGGCCA AGTGTGTGGG TCTCAGGGCT 1861 GTGGGCTGAA GCCTGGCTCT GTCCCTGCAG GGTGCCTGGT ATGTGTGGAA CCGCACTGAG 1921 CTCATGCAGG CGTCCCTGGA CCAGTCTGTG ACCCATCTCA TGGGTAATGA CCCCCTTCCT 1981 GCCCTGGCAT TCCTCAGACA ACCTCAGAGG GTGCCATCCG AGCCTGTGTG CCCATTTGCC 2041 AGCACCCTCC CGCTCACAGC CTGCCAATCA CCACCAAGCT CCTTGTCCCA CAGGCCTCTT 2101 TGAGCCCGGA GACACGAAAT ATGAGATCCA CCGAGACCCC ACACTGGACC CCTCCCTGAT 2161 GGAGATGACA GAGGCTGCCC TGCGCCTGCT GAGCAGGAAC CCCCGCGGCT TCTACCTCTT 2221 TGTGGAGGGT GCGTGGTGGC CCCTGGGGAG TGGAGGAAGG CGGGGCGCGG CAGGGCAGGT 2281 TCAAGCATCA CCCCCCTCTG GCCTTCCTGC AGGCGGCCGC ATCGACCATG GTCATCATGA 2341 GGGTGTGGCT TACCAGGCAC TCACTGAGGC GGTCATGTTC GACGACGCCA TTGAGAGGGC 2401 GGGCCAGCTC ACCAGCGAGG AGGACACGCT GACCCTCGTC ACCGCTGACC ACTCCCATGT 2461 CTTCTCCTTT GGTGGCTACA CCTTGCGAGG GAGCTCCATC TTCGGTAGGC CTGGGGAGAG 2521 TGGCAGGTGC TGCTGCATCA ATTATGAGGG TGAAGTTTGA GCCTCAGTTT CCTCCTCTGT 2581 CAAAAGTGTG CAATGCTGGC ACCAGCCCTA TAGGGATCTT GTGAGGACCG AGCCCCCGAA 2641 CAGGCAAAAA GTGGCGGTGC CTGGCACATA GGAGGCACTC CCACAGCTGT GGTCAGCTCA 2701 ACTACAGGGA CCCGCATCTC CCTACAGGGT TGGCCCCCAG CAAGGCTCAG GACAGCAAAG 2761 CCTACACGTC CATCCTGTAC GGCAATGGCC CGGGCTACGT GTTCAACTCA GGCGTGCGAC 2821 CAGACGTGAA TGAGAGCGAG AGCGGTGAGT GAGGCTGAAT GGCCCGTGCA GGGGGACCAG 2881 GGTGCCAGGG ATGGGGGCAT TCGCGGGAGG GGGACGCCGC CTGCCTGCCC TGAAGTGCAC 2941 TCACCCTCCT ACCAGGGAGC CCCGATTACC AGCAGCAGGC GGCGGTGCCC CTGTCGTCCG 3001 AGACCCACGG AGGCGAAGAC GTGGCGGTGT TTGCGCGCGG CCCGCAGGCG CACCTGGTGC 3061 ATGGTGTGCA GGAGCAGAGC TTCGTAGCGC ATGTCATGGC CTTCGCTGCC TGTCTGGAGC 3121 CCTACACGGC CTGCGACCTG GCGCCTCCCG CCTGCACCAC CGACGCCGCG CACCCAGTTG 3181 CCGCGTCGCT GCCACTGCTG GCCGGGACCC TGCTGCTGCT GGGGGCGTCC GCTGCTCCCT 3241 GAGTGCCCCA CTCCGGAGTT ATCCTGCTCC CCACCTCCGG GCGTCCTGCC CTGTTCCCCG 3301 TCCTGAGCCG CCACTTCCAG CGAACACACA CAGGTGTCCT GCCGTTGGAC CTTCACCTCC 3361 TAGAGATAAA CCAGCCTCAG CTGGCGCAGC GGGGCCCTTC TTCCCTCCGC ATCCCCTTCA 3421 GGGAGCAGGA GCCCAGGGCG CCCTGGGAGC TGAGCCTGGG ACTTCCAGGA CCTCCCCTCA 3481 GGTTGTTCTC TGATTCTTCC TCCCAACCCC AGAGACTGCA GATTTGTGCC ATGCGGCTGC 3541 CTGCACCCCA GACAATAAAG GGACCAAAAC CACCCAACCC CCACCCTGCC TCTATCCTAA 3601 GGAAGACCAA GCAGGCCTGG ACCCAGAGAC GTCCCCCATC GTGGGACACG ACACACCCAG 3661 ACCGCGTGCC CCACCGTCTT AGCTTCAATC CTGGCAGCAC CTGGTAGACC CAAGGACTTG 3721 GGTGGATCAG GACACCTGAA GAAGAGAAGC TTCCGGCAAC CCTGCAACCC ACCCAAGGAG 3781 GCTACTGGAT CGGGGATTCC CAGGGGGGCT TTGACACAGT CCTCTGCTGT CTCCCCACTA 3841 GGATCATTCC ACACCCCTGC ACCTGACCAA GGGACCAATG AGGCAGAGGC TTGCCCCAAG 3901 TCACAGCCAC TCAGATGCTT CCTGCCCCCC AGTGCCCATT CCAGGTCACC AGATCCAAGG 3961 AGCGCTTGAG GAGCTCTGGG TACAGGGCAG CAACCCAGAG CCCATGGGCC CTCCCGGGAC 4021 ATCTGGATGC TGGGCATAGA TTTCTCAACA AGGAAGACTC CCCTGCCTCC TCAAGGTCTC 4081 CATTCTCCTA GGAGACAAAG CAATAATAAA AGGTGTTAGA CAATGTAATG CCAGTACTAC 4141 TTCCTAGGAG AAAAATCATG AGTGAGTGTG GGCACAGTAT CTGGAGAGGT GGATAACGCA 4201 GGCCAGGAGG TACTGCTGAG GGGCAGATGA TTGAGCAAGA GACTTGAACA GAGTGGGGGC 4261 TTGAGCAAGG CAGCACAGCA GTGCAAACGC CCTGGGGCAG TGTCAGCAGG TGCTCTGGGA 4321 GGCCAAGGGC TGGATCAGAG GGGTGGGGGT GGGTGGGCAG AGTGGGGAAA GCCTGAGGGG 4381 TCAGGAGAGT GGGGTGTGCA TGGGGGACTG TGAAGTCTGG TTAGAGGGGT GTGGTTGGAG 4441 GTCTTTGAGG AGGGCTGTGA CCTGCCCTGG TTGGGAAATA AGCACTCTGG CTGCTGCCAG 4501 GAGAAGGGTC TGGTCTTTTG GGCAGAGGGT GGGGGTGGTG GCAGGCTCAG GTGAAAGCTG 4561 GGGAAGGAGC TGACTCCAGG TGTTTCTGAC CTCCCTCTGA AAGTATTCTG GAGCGCCCAT 4621 CCCAATACAG CCATACTTAG TGAGTACACA CCTGCTCCAA GAGAACATTG AAAAGAATAA 4681 AGGTGAAATC AACCACATTT TCCAGCAAAT TTTGCAGTAT TACAAATTTA TTTGTACATT 4741 TACAAAGGTG CAAAAAAGCA TCTTGCTTTT GCAAGAAATA GTAACATCAT TCAATATGCT 4801 TTCTTATTTA CTAAAACCTT GAAATAAAAT TGTAAAACAT CAGTTTGAAG GCCTGACTCT 4861 CAGGGTAGTT CTTTTTTAAT TCTGGGTTTT AGTAGCTGTC ACAAAAATAT TGGAGGACCA 4921 TGATCCCACT TGTGAATAGC CATAGGACTC CAGCCTGGGA AGCATAGCGA AAATCTGTGT 4981 CTAAAAAATG AAATAAAAGG ATGAATTTTA TGGTATGTAA ATTATATCTA AATTTTAAAA 5041 AACAGATTCG AATATATAAT CTGCTTTCAA GTTTTTTTAA ATGTGTAGGG ATCAGGGTTT 5101 TATCAGTCAA ATACATTTTT TACCACAAAA TTCACATGTC AATGAAAACA TTCTCAAACT 5161 TTGGTTCTAA AAAATGTTTT CTTTGGCATG AGTTTTCATT CCAAGATGAT TACTTTCTCA 5221 TTTTTTCATT GAAAGGACAT CTTTACCTTG AAGGAGCAGA TGCAAGAAAA GTACAATTAT 5281 TTTTCAAGCT T // LOCUS HUMALPIA 416 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human heat-stable alkaline phosphatase gene, 5' end. ACCESSION M19161 KEYWORDS alkaline phosphatase; heat-stable alkaline phosphatase; phosphatase. SOURCE Human adult liver DNA, clone lambda-EMBL8. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 416) AUTHORS Knoll,B.J., Rothblum,K.N. and Longley,M. TITLE Two gene duplication events in the evolution of the human heat-stable alkaline phosphatases JOURNAL Gene 60, 267-276 (1987) STANDARD simple staff_review FEATURES from to/span description pept 1 67 intestinal alkaline phosphatase, exon 1 /nomgen="ALPI" /map="2q34-q37" /hgml_locus_uid="LN0047P" 150 266 intestinal alkaline phosphatase, exon 2 381 > 416 intestinal alkaline phosphatase, exon 3 pre-msg < 1 > 416 IAP mRNA and introns IVS 68 149 intron A IVS 267 380 intron B BASE COUNT 76 A 147 C 120 G 73 T ORIGIN 219 bp upstream of PstI site; chromosome 2q34-q37. 1 ATGCAGGGGC CCTGGGTGCT GCTGCTGCTG GGCCTGAGGC TACAGCTCTC CCTGGGCGTC 61 ATCCCAGGTA ATGAGGCTCC CCAAGCTGTT CCACACACAG GGCACCCCCT CAGCCAGGCT 121 GACCTGATCT CTACTCTCCC CCTGGCCAGC TGAGGAGGAG AACCCGGCCT TCTGGAACCG 181 CCAGGCAGCT GAGGCCCTGG ATGCTGCCAA GAAGCTGCAG CCCATCCAGA AGGTCGCCAA 241 GAACCTCATC CTCTTCCTGG GCGATGGTGA GTGAGCAAGG CCTGTCCAGC CCCGTAGTCC 301 TCACAGCCCC GGCACCCGGG ACCTTCAGTG GTTCCAGGAC AACCCTGGGG CCCAGGACTC 361 ACACATTTCT GCTCCTTCAG GGTTGGGGGT GCCCACGGTG ACAGCCACCA GGATCC // LOCUS HUMALPL01 806 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human liver/bone/kidney-type alkaline phosphatase (ALP) gene, 5' flank. ACCESSION M24428 M14168 M21959 J03929 KEYWORDS alkaline phosphatase; orthophosphoric-monoester phosphohydrolase; phosphatase. SEGMENT 1 of 12 SOURCE Human osteosarcoma-derived cell line Saos-2 DNA, and cDNA to mRNA, clones pLBK 14 [1] and pS3-1 [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 631 to 702) AUTHORS Weiss,M.J., Henthorn,P.S., Lafferty,M.A., Slaughter,C., Raducha,M. and Harris,H. TITLE Isolation and characterization of a cDNA encoding a human liver/bone/kidney-type alkaline phosphatase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 7182-7186 (1986) STANDARD full staff_review REFERENCE 2 (bases 1 to 806) AUTHORS Weiss,M.J., Ray,K., Lamb,B., Kadesch,T. and Harris,H. TITLE Structure of the human liver/bone/kidney alkaline phosphatase gene JOURNAL J. Biol. Chem. 263, 12002-12010 (1988) STANDARD simple staff_review COMMENT Draft entry and clean copy sequence for [1] kindly provided by M.J.Weiss, 27-JAN-1987. FEATURES from to/span description pre-msg 610 > 806 ALPL mRNA and intron (major alt.) /nomgen="ALPL" /map="1p36.1-p34" /hgml_locus_uid="LH0058R" (EC 3.1.3.1) pre-msg 611 > 806 ALPL mRNA and intron (major alt.) pre-msg 612 > 806 ALPL mRNA and intron (major alt.) pre-msg 613 > 806 ALPL mRNA and intron (major alt.) pre-msg 614 > 806 ALPL mRNA and intron (major alt.) pre-msg 615 > 806 ALPL mRNA and intron (major alt.) pre-msg 616 > 806 ALPL mRNA and intron (major alt.) pre-msg 560 > 806 ALPL mRNA and intron (minor alt.) pre-msg 561 > 806 ALPL mRNA and intron (minor alt.) pre-msg 562 > 806 ALPL mRNA and intron (minor alt.) pre-msg 563 > 806 ALPL mRNA and intron (minor alt.) IVS 702 > 806 ALPL intron A BASE COUNT 148 A 244 C 319 G 95 T ORIGIN Chromosome 1p36.1-p34. 1 GGTCCCCTTC TGCTTCTTCT TGCGGTAGCC AGGGAGGGCA GCCCACGGGC AGGAAGCGGG 61 GGTGGGGGTG CAGAGTCAGA GGTGCACGTG GACAGAGACA GAGAGACAGG GACACGTGGG 121 CAGAGACGGA TAAAGACAGA GACCCAGAGA AAGCCAGATA TGTTGACAGA CACAGAGACA 181 GACGCCAGAG AGGAAGGCAG ACAAAGAGAC GGGTGGAGAC AAAGACTCCC ACCAAGAGAC 241 GCAGAAGGAA GATGCCGACG GTAAAGACAA AACAGGAGAC GCGCGCAAGG AGCAGGTCAG 301 AGCCCAGGCT CGCTGAGAGA GGAAGGGCTG GGCTGGGGCA GCCCGGAGGC AGAGAGACCG 361 AGAGTGCGGG GCGGGCGAGG GACGCCAGGG CCGCGTCACC CCAGCCCGTT CCTAGCTCCG 421 CTCCCGGCAG GGGGCGCCCT GGCCTCGTGG CACGACCGGC CCGCGGGGCG CGGGGCTCGG 481 GCCGGGGGCG GGGCCGGGGC CGGGCTGGGG AGGGGTTGGG GCCGGGGGCG GGGGAGGGGG 541 CGGGCTGCCC GGGCCTCACT CGGGCCCCGC GGCCGCCTTT ATAAGGCGGC GGGGGTGGTG 601 GCCCGGGCCG CGTTGCGCTC CCGCCACTCC GCGCCCGCTA TCCTGGCTCC GTGCTCCCAC 661 GCGCTTGTGC CTGGACGGAC CCTCGCCAGT GCTCTGCGCA GGTAAGGATT CGACGCTGCC 721 CCGCGCCCTG GTTCCCCAGG GCCCCAGCGG ACGTGGTCCA TCCCCTTCTG CATCCTCCGC 781 TGGCCCCGTG GTTGAACTTT AATGGC // LOCUS HUMALPL02 183 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human liver/bone/kidney-type alkaline phosphatase (ALP) gene, exon 2. ACCESSION M24429 M14168 M21960 J03929 KEYWORDS alkaline phosphatase; orthophosphoric-monoester phosphohydrolase; phosphatase. SEGMENT 2 of 12 SOURCE Human osteosarcoma-derived cell line Saos-2 DNA, and cDNA to mRNA, clones pLBK 14 [1] and pS3-1 [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 13 to 177) AUTHORS Weiss,M.J., Henthorn,P.S., Lafferty,M.A., Slaughter,C., Raducha,M. and Harris,H. TITLE Isolation and characterization of a cDNA encoding a human liver/bone/kidney-type alkaline phosphatase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 7182-7186 (1986) STANDARD full staff_review REFERENCE 2 (bases 1 to 183) AUTHORS Weiss,M.J., Ray,K., Lamb,B., Kadesch,T. and Harris,H. TITLE Structure of the human liver/bone/kidney alkaline phosphatase gene JOURNAL J. Biol. Chem. 263, 12002-12010 (1988) STANDARD simple staff_review COMMENT Draft entry and clean copy sequence for [1] kindly provided by M.J.Weiss, 27-JAN-1987. FEATURES from to/span description pept 117 + 177 alkaline phosphatase precursor, exon 2 (first expressed exon) (EC 3.1.3.1) /nomgen="ALPL" /map="1p36.1-p34" /hgml_locus_uid="LH0058R" sigp 117 167 alkaline phosphatase signal peptide matp 168 + 177 alkaline phosphatase pre-msg < 1 > 183 ALPL mRNA and introns IVS < 1 12 ALPL intron A IVS 178 > 183 ALPL intron B BASE COUNT 40 A 56 C 37 G 50 T ORIGIN About 25 kb after segment 1; chromosome 1p36.1-p34. 1 TTTAATTTCT AGGATTGGAA CATCAGTTAA CATCTGACCA CTGCCAGCCC ACCCCCTCCC 61 ACCCACGTCG ATTGCATCTC TGGGCTCCAG GGATAAAGCA GGTCTTGGGG TGCACCATGA 121 TTTCACCATT CTTAGTACTG GCCATTGGCA CCTGCCTTAC TAACTCCTTA GTGCCAGGTA 181 TGC // LOCUS HUMALPL03 138 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human liver/bone/kidney-type alkaline phosphatase (ALP) gene, exon 3. ACCESSION M24430 M14168 KEYWORDS alkaline phosphatase; orthophosphoric-monoester phosphohydrolase; phosphatase. SEGMENT 3 of 12 SOURCE Human osteosarcoma-derived cell line Saos-2 DNA, and cDNA to mRNA, clones pLBK 14 [1] and pS3-1 [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 13 to 132) AUTHORS Weiss,M.J., Henthorn,P.S., Lafferty,M.A., Slaughter,C., Raducha,M. and Harris,H. TITLE Isolation and characterization of a cDNA encoding a human liver/bone/kidney-type alkaline phosphatase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 7182-7186 (1986) STANDARD full staff_review REFERENCE 2 (bases 1 to 138) AUTHORS Weiss,M.J., Ray,K., Lamb,B., Kadesch,T. and Harris,H. TITLE Structure of the human liver/bone/kidney alkaline phosphatase gene JOURNAL J. Biol. Chem. 263, 12002-12010 (1988) STANDARD simple staff_review COMMENT Draft entry and clean copy sequence for [1] kindly provided by M.J.Weiss, 27-JAN-1987. FEATURES from to/span description pept + 13 + 132 alkaline phosphatase precursor, exon 3 /nomgen="ALPL" /map="1p36.1-p34" /hgml_locus_uid="LH0058R" (EC 3.1.3.1) matp 13 + 132 alkaline phosphatase pre-msg < 1 > 132 ALPL mRNA and introns IVS < 1 12 ALPL intron B IVS 133 > 138 ALPL intron C BASE COUNT 44 A 30 C 38 G 26 T ORIGIN About 7.5 kb after segment 2; chromosome 1p36.1-p34. 1 CTCTGTGTTT AGAGAAAGAG AAAGACCCCA AGTACTGGCG AGACCAAGCG CAAGAGACAC 61 TGAAATATGC CCTGGAGCTT CAGAAGCTCA ACACCAACGT GGCTAAGAAT GTCATCATGT 121 TCCTGGGAGA TGGTGAGG // LOCUS HUMALPL04 128 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human liver/bone/kidney-type alkaline phosphatase (ALP) gene, exon 4. ACCESSION M24431 M14168 KEYWORDS alkaline phosphatase; orthophosphoric-monoester phosphohydrolase; phosphatase. SEGMENT 4 of 12 SOURCE Human osteosarcoma-derived cell line Saos-2 DNA, and cDNA to mRNA, clones pLBK 14 [1] and pS3-1 [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 7 to 122) AUTHORS Weiss,M.J., Henthorn,P.S., Lafferty,M.A., Slaughter,C., Raducha,M. and Harris,H. TITLE Isolation and characterization of a cDNA encoding a human liver/bone/kidney-type alkaline phosphatase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 7182-7186 (1986) STANDARD full staff_review REFERENCE 2 (bases 1 to 128) AUTHORS Weiss,M.J., Ray,K., Lamb,B., Kadesch,T. and Harris,H. TITLE Structure of the human liver/bone/kidney alkaline phosphatase gene JOURNAL J. Biol. Chem. 263, 12002-12010 (1988) STANDARD simple staff_review COMMENT Draft entry and clean copy sequence for [1] kindly provided by M.J.Weiss, 27-JAN-1987. FEATURES from to/span description pept + 7 + 122 alkaline phosphatase precursor, exon 4 /nomgen="ALPL" /map="1p36.1-p34" /hgml_locus_uid="LH0058R" (EC 3.1.3.1) matp + 7 + 122 alkaline phosphatase pre-msg < 1 > 128 ALPL mRNA and introns IVS < 1 6 ALPL intron C IVS 123 > 128 ALPL intorn D BASE COUNT 25 A 41 C 40 G 22 T ORIGIN 0.4 kb after segment 3; chromosome 1p36.1-p34. 1 CTGCAGGGAT GGGTGTCTCC ACAGTGACGG CTGCCCGCAT CCTCAAGGGT CAGCTCCACC 61 ACAACCCTGG GGAGGAGACC AGGCTGGAGA TGGACAAGTT CCCCTTCGTG GCCCTCTCCA 121 AGGTGAGC // LOCUS HUMALPL05 193 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human liver/bone/kidney-type alkaline phosphatase (ALP) gene, exon 5. ACCESSION M24432 M14168 KEYWORDS alkaline phosphatase; orthophosphoric-monoester phosphohydrolase; phosphatase. SEGMENT 5 of 12 SOURCE Human osteosarcoma-derived cell line Saos-2 DNA, and cDNA to mRNA, clones pLBK 14 [1] and pS3-1 [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 13 to 187) AUTHORS Weiss,M.J., Henthorn,P.S., Lafferty,M.A., Slaughter,C., Raducha,M. and Harris,H. TITLE Isolation and characterization of a cDNA encoding a human liver/bone/kidney-type alkaline phosphatase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 7182-7186 (1986) STANDARD full staff_review REFERENCE 2 (bases 1 to 193) AUTHORS Weiss,M.J., Ray,K., Lamb,B., Kadesch,T. and Harris,H. TITLE Structure of the human liver/bone/kidney alkaline phosphatase gene JOURNAL J. Biol. Chem. 263, 12002-12010 (1988) STANDARD simple staff_review COMMENT Draft entry and clean copy sequence for [1] kindly provided by M.J.Weiss, 27-JAN-1987. FEATURES from to/span description pept + 13 + 187 alkaline phosphatase precursor, exon 5 /nomgen="ALPL" /map="1p36.1-p34" /hgml_locus_uid="LH0058R" (EC 3.1.3.1) matp + 13 + 187 alkaline phosphatase pre-msg < 1 > 193 ALPL mRNA and introns IVS < 1 12 ALPL intron D IVS 188 > 193 ALPL intron E conflict 45 45 c in [2]; t in [1] BASE COUNT 40 A 68 C 60 G 25 T ORIGIN 1.8 kb after segment 4; chromosome 1p36.1-p34. 1 CCCCACCTGC AGACGTACAA CACCAATGCC CAGGTCCCTG ACAGCGCCGG CACCGCCACC 61 GCCTACCTGT GTGGGGTGAA GGCCAATGAG GGCACCGTGG GGGTAAGCGC AGCCACTGAG 121 CGTTCCCGGT GCAACACCAC CCAGGGGAAC GAGGTCACCT CCATCCTGCG CTGGGCCAAG 181 GACGCTGGTG AGT // LOCUS HUMALPL06 194 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human liver/bone/kidney-type alkaline phosphatase (ALP) gene, exon 6. ACCESSION M24433 M14168 KEYWORDS alkaline phosphatase; orthophosphoric-monoester phosphohydrolase; phosphatase. SEGMENT 6 of 12 SOURCE Human osteosarcoma-derived cell line Saos-2 DNA, and cDNA to mRNA, clones pLBK 14 [1] and pS3-1 [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 13 to 188) AUTHORS Weiss,M.J., Henthorn,P.S., Lafferty,M.A., Slaughter,C., Raducha,M. and Harris,H. TITLE Isolation and characterization of a cDNA encoding a human liver/bone/kidney-type alkaline phosphatase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 7182-7186 (1986) STANDARD full staff_review REFERENCE 2 (bases 1 to 194) AUTHORS Weiss,M.J., Ray,K., Lamb,B., Kadesch,T. and Harris,H. TITLE Structure of the human liver/bone/kidney alkaline phosphatase gene JOURNAL J. Biol. Chem. 263, 12002-12010 (1988) STANDARD simple staff_review COMMENT Draft entry and clean copy sequence for [1] kindly provided by M.J.Weiss, 27-JAN-1987. FEATURES from to/span description pept + 13 + 188 alkaline phosphatase precursor, exon 6 /nomgen="ALPL" /map="1p36.1-p34" /hgml_locus_uid="LH0058R" (EC 3.1.3.1) matp + 13 + 188 alkaline phosphatase pre-msg < 1 > 194 ALPL mRNA and introns IVS < 1 12 ALPL intron E IVS 189 > 194 ALPL intron F BASE COUNT 46 A 65 C 52 G 31 T ORIGIN 0.9 kb after segment 5; chromosome 1p36.1-p34. 1 CCTGCACCCC AGGGAAATCT GTGGGCATTG TGACCACCAC GAGAGTGAAC CATGCCACCC 61 CCAGCGCCGC CTACGCCCAC TCGGCTGACC GGGACTGGTA CTCAGACAAC GAGATGCCCC 121 CTGAGGCCTT GAGCCAGGGC TGTAAGGACA TCGCCTACCA GCTCATGCAT AACATCAGGG 181 ACATTGACGT GAGT // LOCUS HUMALPL07 162 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human liver/bone/kidney-type alkaline phosphatase (ALP) gene, exon 7. ACCESSION M24434 M14168 KEYWORDS alkaline phosphatase; orthophosphoric-monoester phosphohydrolase; phosphatase. SEGMENT 7 of 12 SOURCE Human osteosarcoma-derived cell line Saos-2 DNA, and cDNA to mRNA, clones pLBK 14 [1] and pS3-1 [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 13 to 156) AUTHORS Weiss,M.J., Henthorn,P.S., Lafferty,M.A., Slaughter,C., Raducha,M. and Harris,H. TITLE Isolation and characterization of a cDNA encoding a human liver/bone/kidney-type alkaline phosphatase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 7182-7186 (1986) STANDARD full staff_review REFERENCE 2 (bases 1 to 162) AUTHORS Weiss,M.J., Ray,K., Lamb,B., Kadesch,T. and Harris,H. TITLE Structure of the human liver/bone/kidney alkaline phosphatase gene JOURNAL J. Biol. Chem. 263, 12002-12010 (1988) STANDARD simple staff_review COMMENT Draft entry and clean copy sequence for [1] kindly provided by M.J.Weiss, 27-JAN-1987. FEATURES from to/span description pept + 13 + 156 alkaline phosphatase precursor, exon 7 /nomgen="ALPL" /map="1p36.1-p34" /hgml_locus_uid="LH0058R" (EC 3.1.3.1) matp + 13 + 156 alkaline phosphatase pre-msg < 1 > 162 ALPL mRNA and introns IVS < 1 12 ALPL intron F IVS 157 > 162 ALPL intron G conflict 68 68 a in [2]; g in [1] BASE COUNT 46 A 33 C 52 G 31 T ORIGIN 4.3 kb after segment 6; chromosome 1p36.1-p34. 1 TGTCTCTTTT AGGTGATCAT GGGGGGTGGC CGGAAATACA TGTACCCCAA GAATAAAACT 61 GATGTGGAGT ATGAGAGTGA CGAGAAAGCC AGGGGCACGA GGCTGGACGG CCTGGACCTC 121 GTTGACACCT GGAAGAGCTT CAAACCGAGA TACAAGGTAG CC // LOCUS HUMALPL08 88 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human liver/bone/kidney-type alkaline phosphatase (ALP) gene, exon 8. ACCESSION M24435 M14168 KEYWORDS alkaline phosphatase; orthophosphoric-monoester phosphohydrolase; phosphatase. SEGMENT 8 of 12 SOURCE Human osteosarcoma-derived cell line Saos-2 DNA, and cDNA to mRNA, clones pLBK 14 [1] and pS3-1 [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 13 to 82) AUTHORS Weiss,M.J., Henthorn,P.S., Lafferty,M.A., Slaughter,C., Raducha,M. and Harris,H. TITLE Isolation and characterization of a cDNA encoding a human liver/bone/kidney-type alkaline phosphatase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 7182-7186 (1986) STANDARD full staff_review REFERENCE 2 (bases 1 to 88) AUTHORS Weiss,M.J., Ray,K., Lamb,B., Kadesch,T. and Harris,H. TITLE Structure of the human liver/bone/kidney alkaline phosphatase gene JOURNAL J. Biol. Chem. 263, 12002-12010 (1988) STANDARD simple staff_review COMMENT Draft entry and clean copy sequence for [1] kindly provided by M.J.Weiss, 27-JAN-1987. FEATURES from to/span description pept + 13 + 82 alkaline phosphatase precursor, exon 8 /nomgen="ALPL" /map="1p36.1-p34" /hgml_locus_uid="LH0058R" (EC 3.1.3.1) matp + 13 + 82 alkaline phosphatase pre-msg < 1 > 88 ALPL mRNA and introns IVS < 1 12 ALPL intron G IVS 83 > 88 ALPL intron H BASE COUNT 19 A 33 C 15 G 21 T ORIGIN 1.9 kb after segment 7; chromosome 1p36.1-p34. 1 CCTTCCTCCT AGCACTCCCA CTTCATCTGG AACCGCACGG AACTCCTGAC CCTTGACCCC 61 CACAATGTGG ACTACCTATT GGGTAAGT // LOCUS HUMALPL09 153 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human liver/bone/kidney-type alkaline phosphatase (ALP) gene, exon 9. ACCESSION M24436 M14168 KEYWORDS alkaline phosphatase; orthophosphoric-monoester phosphohydrolase; phosphatase. SEGMENT 9 of 12 SOURCE Human osteosarcoma-derived cell line Saos-2 DNA, and cDNA to mRNA, clones pLBK 14 [1] and pS3-1 [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 13 to 148) AUTHORS Weiss,M.J., Henthorn,P.S., Lafferty,M.A., Slaughter,C., Raducha,M. and Harris,H. TITLE Isolation and characterization of a cDNA encoding a human liver/bone/kidney-type alkaline phosphatase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 7182-7186 (1986) STANDARD full staff_review REFERENCE 2 (bases 1 to 153) AUTHORS Weiss,M.J., Ray,K., Lamb,B., Kadesch,T. and Harris,H. TITLE Structure of the human liver/bone/kidney alkaline phosphatase gene JOURNAL J. Biol. Chem. 263, 12002-12010 (1988) STANDARD simple staff_review COMMENT Draft entry and clean copy sequence for [1] kindly provided by M.J.Weiss, 27-JAN-1987. FEATURES from to/span description pept + 13 + 147 alkaline phosphatase precursor, exon 9 /nomgen="ALPL" /map="1p36.1-p34" /hgml_locus_uid="LH0058R" (EC 3.1.3.1) matp + 13 + 147 alkaline phosphatase pre-msg < 1 > 153 ALPL mRNA and introns IVS < 1 12 ALPL intron H IVS 148 > 153 ALPL intron I BASE COUNT 33 A 42 C 48 G 30 T ORIGIN 3.4 kb after segment 8; chromosome 1p36.1-p34. 1 CGTCCTCCTC AGGTCTCTTC GAGCCAGGGG ACATGCAGTA CGAGCTGAAC AGGAACAACG 61 TGACGGACCC GTCACTCTCC GAGATGGTGG TGGTGGCCAT CCAGATCCTG CGGAAGAACC 121 CCAAAGGCTT CTTCTTGCTG GTGGAAGGTA GGG // LOCUS HUMALPL10 210 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human liver/bone/kidney-type alkaline phosphatase (ALP) gene, exon 10. ACCESSION M24437 M14168 KEYWORDS alkaline phosphatase; orthophosphoric-monoester phosphohydrolase; phosphatase. SEGMENT 10 of 12 SOURCE Human osteosarcoma-derived cell line Saos-2 DNA, and cDNA to mRNA, clones pLBK 14 [1] and pS3-1 [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 13 to 204) AUTHORS Weiss,M.J., Henthorn,P.S., Lafferty,M.A., Slaughter,C., Raducha,M. and Harris,H. TITLE Isolation and characterization of a cDNA encoding a human liver/bone/kidney-type alkaline phosphatase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 7182-7186 (1986) STANDARD full staff_review REFERENCE 2 (bases 1 to 210) AUTHORS Weiss,M.J., Ray,K., Lamb,B., Kadesch,T. and Harris,H. TITLE Structure of the human liver/bone/kidney alkaline phosphatase gene JOURNAL J. Biol. Chem. 263, 12002-12010 (1988) STANDARD simple staff_review COMMENT Draft entry and clean copy sequence for [1] kindly provided by M.J.Weiss, 27-JAN-1987. FEATURES from to/span description pept + 13 + 204 alkaline phosphatase precursor, exon 10 /nomgen="ALPL" /map="1p36.1-p34" /hgml_locus_uid="LH0058R" (EC 3.1.3.1) matp + 13 + 204 alkaline phosphatase pre-msg < 1 > 210 ALPL mRNA and introns IVS < 1 12 ALPL intron I IVS 205 > 210 ALPL intron J BASE COUNT 47 A 60 C 64 G 39 T ORIGIN 2.1 kb after segment 9; chromosome 1p36.1-p34. 1 TGGTGTCCCA AGGAGGCAGA ATTGACCACG GGCACCATGA AGGAAAAGCC AAGCAGGCCC 61 TGCATGAGGC GGTGGAGATG GACCGGGCCA TCGGGCAGGC AGGCAGCTTG ACCTCCTCGG 121 AAGACACTCT GACCGTGGTC ACTGCGGACC ATTCCCACGT CTTCACATTT GGTGGATACA 181 CCCCCCGTGG CAACTCTATC TTTGGTAGGT // LOCUS HUMALPL11 138 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human liver/bone/kidney-type alkaline phosphatase (ALP) gene, exon 11. ACCESSION M24438 M14168 KEYWORDS alkaline phosphatase; orthophosphoric-monoester phosphohydrolase; phosphatase. SEGMENT 11 of 12 SOURCE Human osteosarcoma-derived cell line Saos-2 DNA, and cDNA to mRNA, clones pLBK 14 [1] and pS3-1 [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 13 to 132) AUTHORS Weiss,M.J., Henthorn,P.S., Lafferty,M.A., Slaughter,C., Raducha,M. and Harris,H. TITLE Isolation and characterization of a cDNA encoding a human liver/bone/kidney-type alkaline phosphatase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 7182-7186 (1986) STANDARD full staff_review REFERENCE 2 (bases 1 to 138) AUTHORS Weiss,M.J., Ray,K., Lamb,B., Kadesch,T. and Harris,H. TITLE Structure of the human liver/bone/kidney alkaline phosphatase gene JOURNAL J. Biol. Chem. 263, 12002-12010 (1988) STANDARD simple staff_review COMMENT Draft entry and clean copy sequence for [1] kindly provided by M.J.Weiss, 27-JAN-1987. FEATURES from to/span description pept + 13 + 132 alkaline phosphatase precursor, exon 11 /nomgen="ALPL" /map="1p36.1-p34" /hgml_locus_uid="LH0058R" (EC 3.1.3.1) matp + 13 + 132 alkaline phosphatase pre-msg < 1 > 138 ALPL mRNA and introns IVS < 1 12 ALPL intron J IVS 133 > 138 ALPL intron K BASE COUNT 30 A 35 C 44 G 29 T ORIGIN 0.7 kb after segment 10; chromosome 1p36.1-p34. 1 CTCCCTGTGC AGGTCTGGCC CCCATGCTGA GTGACACAGA CAAGAAGCCC TTCACTGCCA 61 TCCTGTATGG CAATGGGCCT GGCTACAAGG TGGTGGGCGG TGAACGAGAG AATGTCTCCA 121 TGGTGGACTA TGGTGAGA // LOCUS HUMALPL12 1051 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human liver/bone/kidney-type alkaline phosphatase (ALP) gene, exon 12. ACCESSION M24439 M14168 KEYWORDS alkaline phosphatase; orthophosphoric-monoester phosphohydrolase; phosphatase. SEGMENT 12 of 12 SOURCE Human osteosarcoma-derived cell line Saos-2 DNA, and cDNA to mRNA, clones pLBK 14 [1] and pS3-1 [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 13 to 471; 473 to 1022) AUTHORS Weiss,M.J., Henthorn,P.S., Lafferty,M.A., Slaughter,C., Raducha,M. and Harris,H. TITLE Isolation and characterization of a cDNA encoding a human liver/bone/kidney-type alkaline phosphatase JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 7182-7186 (1986) STANDARD full staff_review REFERENCE 2 (bases 1 to 1051) AUTHORS Weiss,M.J., Ray,K., Lamb,B., Kadesch,T. and Harris,H. TITLE Structure of the human liver/bone/kidney alkaline phosphatase gene JOURNAL J. Biol. Chem. 263, 12002-12010 (1988) STANDARD simple staff_review COMMENT Draft entry and clean copy sequence for [1] kindly provided by M.J.Weiss, 27-JAN-1987. FEATURES from to/span description pept + 13 278 alkaline phosphatase precursor, exon 12 /nomgen="ALPL" /map="1p36.1-p34" /hgml_locus_uid="LH0058R" (EC 3.1.3.1) matp + 13 275 alkaline phosphatase pre-msg < 1 1027 ALPL mRNA and introns IVS < 1 12 ALPL intron K revision 239 240 gc in [2]; cg in [1] conflict 245 245 t in [2]; g in [1] conflict 344 344 t in [2]; c in [1] revision 471 471 gg in [2]; g in [1] conflict 510 510 a in [2]; c in [1] conflict 632 632 c in [2]; g in [1] conflict 732 735 ggcc in [2]; gccg in [1] revision 900 900 c in [2]; t in [1] revision 924 924 c in [2]; t in [1] conflict 1023 1027 cgagg in [2]; caaatatttccgagg in [1] BASE COUNT 208 A 378 C 256 G 209 T ORIGIN 0.5 kb after segment 11; chromosome 1p36.1-p34. 1 CCTGGCCCAC AGCTCACAAC AACTACCAGG CGCAGTCTGC TGTGCCCCTG CGCCACGAGA 61 CCCACGGCGG GGAGGACGTG GCCGTCTTCT CCAAGGGCCC CATGGCGCAC CTGCTGCACG 121 GCGTCCACGA GCAGAACTAC GTCCCCCACG TGATGGCGTA TGCAGCCTGC ATCGGGGCCA 181 ACCTCGGCCA CTGTGCTCCT GCCAGCTCGG CAGGCAGCCT TGCTGCAGGC CCCCTGCTGC 241 TCGCTCTGGC CCTCTACCCC CTGAGCGTCC TGTTCTGAGG GCCCAGGGCC CGGGCACCCA 301 CAAGCCCGTG ACAGATGCCA ACTTCCCACA CGGCAGCCCC CCCTTCAAGG GGCAGGGAGG 361 TGGGGGCCTC CTCAGCCTCT GCAACTGCAA GAAAGGGGAC CCAGGAAACC AAAGTCTGCC 421 GCCCACCTCG CTCCCCTCTG GAATCTTCCC CAAGGGCCAA ACCCACTTCT GGCCTCCAGC 481 CTTTGCTCCC TCCCCGCTGC CCTTTGGCCA ACAGGGTAGA TTTCTCTTGG GCAGGCAGAG 541 AGTACAGACT GCAGACATTC TCAAAGCCTC TTATTTTTCT AGCGAACGTA TTTCTCCAGA 601 CCCAGAGGCC CTGAAGCCTC CGTGGAACAT TCTGGATCTG ACCCTCCCAG TCTCATCTCC 661 TGACCCTCCC ACTCCCATCT CCTTACCTCT GGAACCCCCC AGGCCCTACA ATGCTCATGT 721 CCCTGTCCCC AGGCCAGCCC TCCTTCAGGG GAGTTGAGGT CTTTCTCCTC AGGACAAGGC 781 CTTGCTCACT CACTCACTCC AAGACCACCA GGGTCCCAGG AAGCCGGTGC CTGGGTGGCC 841 ATCCTACCCA GCGTGCCCAG GCCGGGAAGA GCCACCTGGC AGGGCTCACA CTCCTGGGCT 901 CTGAACACAC ACGCCAGCTC CTCTCTGAAG CGACTCTCCT GTTTGGAACG GCAAAAAAAA 961 ATTTTTTTTT CTCTTTTTGG TGGTGGTTAA AAGGGAACAC AAAACATTTA AATAAAACTT 1021 TCCGAGGACA GAGCTGAGTC TTTGTGGTCA G // LOCUS HUMALPP2A 428 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human placental heat-stable alkaline phosphatase (PLAP-1) gene, 5' end. ACCESSION M19160 KEYWORDS alkaline phosphatase; heat-stable alkaline phosphatase; phosphatase. SOURCE Human adult liver DNA, clone lambda-EMBL19. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 428) AUTHORS Knoll,B.J., Rothblum,K.N. and Longley,M. TITLE Two gene duplication events in the evolution of the human heat-stable alkaline phosphatases JOURNAL Gene 60, 267-276 (1987) STANDARD simple staff_review FEATURES from to/span description pept 1 67 placental alkaline phosphatase-2, exon 1 /nomgen="ALPP" /map="2q37" /hgml_locus_uid="LV0092C" 164 280 placental alkaline phosphatase-2, exon 2 393 > 428 placental alkaline phosphatase-2, exon 3 pre-msg < 1 > 428 PLAP-2 mRNA and introns IVS 68 163 intron A IVS 281 392 intron B BASE COUNT 80 A 157 C 121 G 70 T ORIGIN 235 bp upstream of PstI site; chromosome 2q37. 1 ATGCAGGGGC CCTGGGTGCT GCTTCTGCTG GGCCTGAGGC TACAGCTCTC CCTGGGCATC 61 ATCCCAGGTA ATGAGGCTCC CCCAGCTGCC CCTACACACA CACACACACA CAGGGCACCC 121 CCCAGCCCAG GCTGACCTGA TCTTTGCTCT CCCCCTTGGC CAGTTGAGGA GGAGAACCCG 181 GACTTCTGGA ACCGCCAGGC AGCCGAGGCC CTGGGTGCCG CCAAGAAGCT GCAGCCTGCA 241 CAGACAGCCG CCAAGAACCT CATCATCTTC CTGGGTGACG GTGAGTGAGC CAGGCCTTCC 301 AGCCCCGCAG CCCTCACAGC CCCGGCGCCC GGACCCTCAG TGGTTCCAGG ACAGCCCTGG 361 GGAGCAAGCC TCACACACTT CTGCTCCTTC AGGGATGGGG GTGTCTACGG TGACAGCTGC 421 CAGGATCC // LOCUS HUMALPPD 4268 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human placental heat-stable alkaline phosphatase (PLAP-1) gene, complete cds. ACCESSION M19159 M19890 M19356 J03931 KEYWORDS alkaline phosphatase; heat-stable alkaline phosphatase; phosphatase. SOURCE Human adult liver DNA, clone EMBL 54A. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3047; 3783 to 4268) AUTHORS Knoll,B.J., Rothblum,K.N. and Longley,M.A. TITLE Nucleotide sequence of the human placental alkaline phosphatase gene. Evolution of the 5' flanking region by deletion/substitution JOURNAL J. Biol. Chem. 263, 12020-12027 (1988) STANDARD full staff_review REFERENCE 2 (bases 54 to 486) AUTHORS Knoll,B.J., Rothblum,K.N. and Longley,M. TITLE Two gene duplication events in the evolution of the human heat-stable alkaline phosphatases JOURNAL Gene 60, 267-276 (1987) STANDARD full staff_review REFERENCE 3 (bases 3048 to 3782) AUTHORS Knoll,B.J. JOURNAL Unpublished (1988) see COMMENT for Author Address STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1],[3] kindly submitted by B.J.Knoll, 19-JUL-1988. [1] Author Address: Department of Pathology and Laboratory Medicine, University of Texas Medical School, Houston, TX, 77225. FEATURES from to/span description pept 54 129 placental alkaline phosphatase-1, exon 1 (EC 3.1.3.1) /nomgen="ALPP" /map="2q37" /hgml_locus_uid="LV0092C" 222 338 placental alkaline phosphatase-1, exon 2 451 566 placental alkaline phosphatase-1, exon 3 764 938 placental alkaline phosphatase-1, exon 4 1015 1187 placental alkaline phosphatase-1, exon 5 1438 1572 placental alkaline phosphatase-1, exon 6 1672 1744 placental alkaline phosphatase-1, exon 7 1875 2009 placental alkaline phosphatase-1, exon 8 2092 2283 placental alkaline phosphatase-1, exon 9 2503 2619 placental alkaline phosphatase-1, exon 10 2749 3047 placental alkaline phosphatase-1, exon 11 pre-msg < 1 > 4268 PLAP-1 mRNA and introns IVS 130 221 PLAP-1, intron A IVS 339 450 PLAP-1, intron B IVS 567 763 PLAP-1, intron C IVS 939 1014 PLAP-1, intron D IVS 1188 1437 PLAP-1, intron E IVS 1573 1671 PLAP-1, intron F IVS 1745 1874 PLAP-1, intron G IVS 2010 2091 PLAP-1, intron H IVS 2284 2502 PLAP-1, intron I IVS 2620 2748 PLAP-1, intron J BASE COUNT 872 A 1343 C 1304 G 749 T ORIGIN 16 bp upstream of EcoRI site. 1 CATACTCCAT GCCCAGAATT CCTGCCTCGC CACTGTCCTG CTGCCCTCCA GACATGCTGG 61 GGCCCTGCAT GCTGCTGCTG CTGCTGCTGC TGGGCCTGAG GCTACAGCTC TCCCTGGGCA 121 TCATCCCAGG TAATGAGGCT CCCCGAGCTG CCCCTACACA ACACACACAC AGGGCACCCC 181 CCAGCCCAGG CTGACCTGAT CTTTGCTCTC CCCCTGGCCA GTTGAGGAGG AGAACCCGGA 241 CTTCTGGAAC CGCGAGGCAG CCGAGGCCCT GGGTGCCGCC AAGAAGCTGC AGCCTGCACA 301 GACAGCCGCC AAGAACCTCA TCATCTTCCT GGGCGATGGT GAGTGAGCCA GGCCTTCCAG 361 CCCTGCAGCC CTCACAGCCC CGGCGCCCGG ACCCTCAGTG GTTCCAGGAG AGCCCTGGGG 421 CCCAAGCCTC ACACATTTCT GTTCCTTCAG GGATGGGGGT GTCTACGGTG ACAGCTGCCA 481 GGATCCTAAA AGGGCAGAAG AAGGACAAAC TGGGGCCTGA GATACCCCTG GCCATGGACC 541 GCTTCCCATA TGTGGCTCTG TCCAAGGTAA GTGCTGGGCT ACCTTAGAGT CCTCCAAGCA 601 CAGAAGGGGA ATCCTGGCTA TGGAGTGTGG TAGGAGGGAG GGACCCTAAA CAGCTGGGGC 661 TCCAGTAAGG AGTTAGAGGC AGTTGGAATC CCAGAGGACA GAGATCAGGG TCTGGGTCTC 721 CGTGTCTGCC CCAGAGAAGA GCTCAGAGTG TCTCTGTCCC CAGACATACA ATGTAGACAA 781 ACATGTGCCA GACAGTGGAG CCACAGCCAC GGCCTACCTG TGCGGGGTCA AGGGCAACTT 841 CCAGACCATT GGCTTGAGTG CAGCCGCCCG CTTTAACCAG TGCAACACGA CACGCGGCAA 901 CGAGGTCATC TCCGTGATGA ATCGGGCCAA GAAAGCAGGT GAGCTGGGGC CCGCTGCTGG 961 GTCACGGCCA GGTCACAGAC GTTGGTCACA TATACTGACC TCTGACACCC TTAGGGAAGT 1021 CAGTGGGAGT GGTAACCACC ACACGAGTGC AGCACGCCTC GCCAGCCGGC ACCTACGCCC 1081 ACACGGTGAA CCGCAACTGG TACTCGGACG CCGACGTGCC TGCCTCGGCC CGCCAGGAGG 1141 GGTGCCAGGA CATCGCTACG CAGCTCATCT CCAACATGGA CATTGACGTG CGACCCCCAG 1201 GACCAAGGGC TGGGGCTGGG CAGAGAGTAG CAGGGAGGGG GCACTAGCTC AGACCCAGGC 1261 AACCAAAAGC CTTATCTGGG CCAGCAGGGT CTGGAGGTGG GGTTGTGGGC GTAGAAGGTG 1321 CAGCCCAGGC TGGGCCATTC CCACAGCCTT GGGGAGGGGA GTCAGGGGCT GTGCATGAGG 1381 AGGGGGCACG GGGCCAGCCA GGGCCCCAAA TCCACCTGCC CCATCCTCTG TTCCCAGGTG 1441 ATCCTAGGTG GAGGCCGAAA GTACATGTTT CGCATGGGAA CCCCAGACCC TGAGTACCCA 1501 GATGACTACA GCCAAGGTGG GACCAGGCTG GACGGGAAGA ATCTGGTGCA GGAATGGCTG 1561 GCGAAGCGCC AGGTGATGGG GGCTGGCGGG TGCAGGGGGC ACAGCAGGGG GAGGGCAGAG 1621 GTGTGGGGCT CAGGGCTGTG GGCTGAGGCC TGGCTCTCTC CCTCCCCACA GGGTGCCCGG 1681 TACGTGTGGA ACCGCACTGA GCTCATGCAG GCTTCCCTGG ACCCGTCTGT GACCCATCTC 1741 ATGGGTAATG ACCCCCTTCC TGCCCTGGCA TCCCTCAGAT GGCCTCAGAT GGCACCTTCT 1801 GAGCCTGTGT GCACATCCGC CAGCACCCGC CCACCCCCAG CCTGCCAGTC ACCACAGGAC 1861 CCCTTGTCCC ACAGGTCTCT TTGAGCCTGG AGACATGAAA TACGAGATCC ACCGAGACTC 1921 CACACTGGAC CCCTCCCTGA TGGAGATGAC AGAGGCTGCC CTGCGCCTGC TGAGCAGGAA 1981 CCCCCGCGGC TTCTTCCTCT TCGTGGAGGG TGCGTGGTGG CCCCTGGGGA GTGGGGGTTG 2041 GGGGTTGGAG CAGGGCAGGC TCAGCATCTC CCCCCTCTGG CCTTCCTGCA GGTGGTCGCA 2101 TCGACCATGG TCATCATGAA AGCAGGGCTT ACCGGGCACT GACTGAGACG ATCATGTTCG 2161 ACGACGCCAT TGAGAGGGCG GGCCAGCTCA CCAGCGAGGA GGACACGCTG AGCCTCGTCA 2221 CTGCCGACCA CTCCCACGTC TTCTCCTTCG GAGGCTACCC CCTGCGAGGG AGCTCCATCT 2281 TCGGTAGGCC TGGGGAGAGT GGCAGGTGCT GCTGCAGCAA TTAAGTGGGT GAAATCTGAG 2341 CCTCAGTCTC CTCCTCTGTC AAATGGGAGT AATGCTGGCA CCAGCCCTGT AGGGTCTCCT 2401 GAGGACTAAG CCCCTGACCA GGCAAAACGT GGCGGTGCCT AGCACGTGGG AGACACTCCA 2461 CAGCTGTGTT CAGCTCAACC ACAGGGACCC CTCTCTCTGC AGGGCTGGCC CCTGGCAAGG 2521 CCCGGGACAG GAAGGCCTAC ACGGTCCTCC TATACGGAAA CGGTCCAGGC TATGTGCTCA 2581 AGGACGGCGC CCGGCCGGAT GTTACCGAGA GCGAGAGCGG TGAGTGCCGC GGGGTGGCCC 2641 CCTGAGGGGG GACCAGGGTG CCAAGGATGG GGGGCTGGCG GGAAGGGGTC ACCTCCTGTC 2701 TGCCTGGAAC TGAATGAACC CTCCTACCGG AACTGAACCC TCCAACAGGG AGCCCCGAGT 2761 ATCGGCAGCA GTCAGCAGTG CCCCTGGACG AAGAGACCCA CGCAGGCGAG GACGTGGCGG 2821 TGTTCGCGCG CGGCCCGCAG GCGCACCTGG TTCACGGCGT GCAGGAGCAG ACCTTCATAG 2881 CGCACGTCAT GGCCTTCGCC GCCTGCCTGG AGCCCTACAC CGCCTGCGAC CTGGCGCCCC 2941 CCGCCGGCAC CACCGACGCC GCGCACCCGG GGCGGTCCGT GGTCCCCGCG TTGCTTCCTC 3001 TGCTGGCCGG GACCCTGCTG CTGCTGGAGA CGGCCACTGC TCCCTGAGTG TCCCGTCCCT 3061 GGGGCTCCTG CTTCCCCATC CCGGAGTTCT CCTGCTCCCC ACCTCCTGTC GTCCTGCCTG 3121 CCTCCAGCCC GAGTCGTCAT CCCCGGAGTC CCTATACAGA GGTCCTGCCA TGGAACCTTC 3181 CCCTCCCCGT GCGCTCTGGG GACTGAGCCC ATGACACCAA ACCTGCCCCT TGGCTGCTCT 3241 CGGACTCCCT ACCCCAACCC CAGGGACTGC AGGTTGTGCC CTGTGGCTGC CTGCACCCCA 3301 GGAAAGGAGG GGGCTCAGGC CATCCAGCCA CCACCTACAG CCCAGTGGGG TACCAGGCAG 3361 GCTCCTTTCC TGGGGAAAAG AAGCACCCAG ACCCCGCGCC CCGCTGATCT TTGCTTCAGT 3421 CCTTGAATCA CCTGTGGGAC TTGAGGACTC GGGATCTTCA GGACGCCTGG AGAAGGGTGG 3481 TTTCCTGCCA CCCTGCTGGC CAAGGAGGCT CCTGGGGTGG GGATCACCAG GGGGATTTTG 3541 ACAACGTTCG GCTGCCCCCC ACTAAGCTAA TTCCACACCC CTGTACCCCC CCAGGGGGCC 3601 CTCTGCCTCA TGGCAAAGGC TTGCCCCAAA TCTCAACTTC TCAGACGTTC CATACCCCCA 3661 CATGCCAATT TCAGCACCCA ACTGAGATCC GAGGAGCTCC TGGGAAGCCC TGGGTGCAGG 3721 ACACTGGTCG AGAGCCAAAG GTCCCTCCCC AGACATCTGG ACACTGGGCA TAGATTTCTC 3781 AAGAAGGAAG ACTCCCCTGC CTCCCCAGGG CCTCTGCTCT CCTGGGAGAC AAAGCAATAA 3841 TAAAAGGAAG TGTTTGTAAT CCCAGCACTT TGGGAGGCCG AGGTGGGCGG ATCACGAGGT 3901 CAGGAGATGG AGACCATCCT GGCTAACACG GTGAAACCCC TTATCTATGC GCCTGTAGTC 3961 CCAGCTACCC AGGAGGCTGA AGCAGGATAA TCGCTTGAAC CCGGGCGGCG GAGATTGCAG 4021 TGAGCCGAGG TCATGCCACT GCACTGCAGC CTGGGCGACA GAGCGAGATT CTGCCTCAAA 4081 AATAAACAAA TAAATTTTAA AAATAAATAA ATAATAAAAG GAAGTGTTAG ACAATGTAAT 4141 GCCAGTACTA CTTCCTAGGA GGAAAATCAT GAGTGCCTGT GGGCACAGTG TCTGGAGGGG 4201 TGGATAATGC AGGCCAGGAG GGGCTGCTGA GGAGCAGATG ATTGAGCAGG AGACCTAAAC 4261 AGAGTGGG // LOCUS HUMAMY1 292 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human salivary alpha-amylase (AMY1) gene, exon 9. ACCESSION M18674 KEYWORDS alpha-amylase; amylase. SOURCE Human salivary gland DNA, clone N1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 292) AUTHORS Gumucio,D.L., Wiebauer,K., Caldwell,R.M., Samuelson,L.C. and Meisler,M.H. TITLE Concerted Evolution of Human Amylase Genes JOURNAL Mol. Cell. Biol. 8, 1197-1205 (1988) STANDARD simple staff_entry COMMENT Draft entry and computer_readable copy of sequence kindly submitted by M.Meisler 15_MAR-1988 Base 91 removed at request of author. FEATURES from to/span description pept / 23 / 148 alpha-amylase, exon 9 (AA at 24) /map="1p21" /hgml-locus-uid="LJ0263N" IVS < 1 22 alpha-amylase, intron H IVS 149 > 292 alpha-amylase, intron I BASE COUNT 87 A 41 C 63 G 101 T ORIGIN 1 TTATATGTAT TGTCCTTTTT AGGAACATGG TTAATTTCCG CAATGTAGTG GATGGCCAGC 61 CTTTTACAAA CTGGTATGAT AATGGGAGCA ACCAAGTGGC TTTTGGGAGA GGAAACAGAG 121 GATTCATTGT TTTCAACAAT GATGACTGGT AAGTAAATAT CAATTAAAAA TAATATTTTG 181 TACCAGTATG TTCTTGGTTT ATTCTTTTTT TTTTCTGTTC ATTGACATTT ATCATATCTG 241 AAAAATCATG TAGTCAGTGG AGCGAGAAGA CAATAGAGAT CCCCGGGCGA GC // LOCUS HUMAMY101 941 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human salivary alpha-amylase (amy1) gene, exon 1. ACCESSION M18715 K02214 KEYWORDS alpha-amylase; amylase. SEGMENT 1 of 10 SOURCE Human salivary gland, cDNA to mRNA, clones pHSA7 and pHSA15 and [1],[2],[3] and Human cosmid library DNA, clone cosP2 [4]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 543 to 931) AUTHORS Nakamura,Y., Ogawa,M., Nishide,T., Emi,M., Kosaki,G., Himeno,S. and Matsubara,K. TITLE Sequences of cDNAs for human salivary and pancreatic alpha- amylases JOURNAL Gene 28, 263-270 (1984) STANDARD full staff_review REFERENCE 2 (bases 1 to 941) AUTHORS Nishide,T., Nakamura,Y., Emi,M., Yamamoto,T., Ogawa,M., Takesada,M. and Matsubara,K. TITLE Primary structure of human salivary alpha-amylase gene JOURNAL Gene 41, 299-304 (1986) STANDARD full staff_review REFERENCE 3 (bases 543 to 931; revises [1]) AUTHORS Nishide,T., Emi,M., Nakamura,Y. and Matsubara,K. TITLE Corrected sequences of cDNAs for human salivary and pancreatic alpha-amylases JOURNAL Gene 50, 371-372 (1986) STANDARD full staff_review REFERENCE 4 (bases 1 to 941; revises [1],[2],[3]) AUTHORS Horii,A., Emi,M., Tomita,N., Nishide,T., Ogawa,M., Mori,T. and Matsubara,K. TITLE Primary structure of human pancreatic alpha-amylase gene: Its comparison with human salivary alpha-amylase gene JOURNAL Gene 60, 57-64 (1987) STANDARD full staff_entry COMMENT Draft entry and computer readable copy of sequence [1] kindly provided by A.Horii (14-MAR-1988). [1] also sequenced the pancreatic alpha-amylase mRNA. Comparison of the two alpha-amylase nucleotide sequences reveals 96% homology in the coding region. Comparison of the sequences of human alpha-amylase cDNAs with those for mouse suggests the possibility of gene conversion between the two genes of alpha-amylase. FEATURES from to/span description pept 764 + 931 alpha-amylase, exon 1 /map="1p21" /hgml_locus_uid="LJ0263N" IVS 932 > 941 alpha-amylase, intron A BASE COUNT 258 A 160 C 211 G 312 T ORIGIN 193 bp upstream of XbaI site; chromosome 1p21. 1 GCATTCAAGT TAACTCTTCC CCTTGGTATC TGTACATACC TTTGATGTCA GTGTTTAGTA 61 CACGTGGCTT GGTCACTTCA TGGCTAAAAA CGTGCTTGTG GAAGACAAGT CTGGCTTGGT 121 GAGTCTGTGT GGTCAGCAGT CTCTGATCCG TGCAGGGTAT TAATGTGTCA GGGCTGAGTG 181 TTCTGAGATT TATCTAGAGG CTGGGAAGGG CTCCTGAACC AGTTGTTTCC GTCTTGTCGG 241 TCTGTCAGGG TTGGAAAGTC CAAGCCATAG GACCCAGTTT CCTTTCTTAG CTTACGTTAT 301 CTACCAGAGC ACCGTGGGCT GTTACTTGCC TTGAGTTGGA AGCGGTTCGC ATTTATACCG 361 GTAAATGTAT TCATCCTTTT AATTTATGTA AAGTTTTTTA GTATGCAATT