LOCUS HUMDBPG 1805 bp ds-DNA PRI 15-MAR-1989 DEFINITION Human vitamin D-binding protein (DBP) gene, exon 1. ACCESSION M17156 KEYWORDS plasma protein; vitamin D-binding protein. SOURCE Human chromosome 4 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1805) AUTHORS Yang,F., Naberhaus,K.H., Adrian,G.S., Gardella,J.M., Brissenden,J.E. and Bowman,B.H. TITLE The vitamin D-binding protein gene contains conserved nucleotide sequences that respond to heavy metal, adipocyte and mitotic signals JOURNAL Gene 54, 285-290 (1987) STANDARD simple staff_review FEATURES from to/span description pept 1711 / 1768 vitamin D-binding protein, exon 1 /nomgen="GC" /map="4q12-q13" /hgml_locus_uid="LZ0044H" pre-msg 1650 > 1805 vitamin D-binding protein mRNA and intron IVS 1769 > 1805 vitamin D-binding protien intron A rpt 119 112 (c) direct/inverted repeat homologous to enhancer element of anti-thrombin III site 177 183 metal regulatory element site 278 290 adipocyte cell-specific element rpt 433 440 direct/inverted repeat homologous to enhancer element of anti-thrombin III site 521 526 metal regulatory element site 527 534 metal regulatory element site 570 583 mitosis related sequence site 749 743 (c) metal regulatory element site 821 827 metal regulatory element site 874 880 metal regulatory element site 957 951 (c) metal regulatory element site 984 996 adipocyte cell-specific element site 1071 1065 (c) metal regulatory element rpt 1291 1284 (c) direct/inverted repeat homologous to enhancer element of anti-thrombin III rpt 1397 1404 direct/inverted repeat homologous to enhancer element of anti-thrombin III rpt 1408 1420 direct repeat rpt 1472 1465 (c) direct/inverted repeat homologous to enhancer element of anti-thrombin III rpt 1491 1503 direct repeat rpt 1576 1595 direct repeat BASE COUNT 563 A 325 C 349 G 568 T ORIGIN 1 bp upstream of EcoRI site. 1 GAATTCAGCC AAAGGGAACT GAAAATTTAG ATTTATAGTA ATCAGGAAGT TGATCCTTGA 61 ACAAGATAGA GTTGGCTCTT GGACAGGAAA GCTATCTCCT TGACCATATG GATTTCCACT 121 GGCCTTACAG AAATACTTGT GCTTTGATCT GGCATAGGTA TCTTTTTCCA ACTAGCTGGA 181 CTCTATTTTC TGAACCTAAC ACCCAAGGAT TTTCAGTATA GATAGAAAAT CTGACTAAAC 241 ATGTAACTAC CTAAATTGGT TTACTGAAAT TTTGATAGAC TCTGCCCATG TAGATAATCC 301 TTGGCTACTG AGTTGATAAT GAAGTTTTAA ATCACTTATC TGCGGGACCC TGGACACAGG 361 GCTGTCTTTA AACTCTTCCA CTGAGTTCCA ATGGCCTCAT TAGTTTCATT GACCTTCACA 421 GATATTATAA ATTTGGAAAG CCAGTGGGAG AACGTGGCAA TAACTAGAAT AAGAGTAATA 481 GGTGACATTA ATCCATGTTA CTATGTGATG GATGTTACCA TGAGCCCTGC ACCCAGATTA 541 CCTTAATTAA CCTCATAGCA ATGTTGTGTC AAATGTACTG AGTTATCACC ACATTGCATA 601 TGAGGAAAAT GAAACTTAGA GGGGTAAAAT AAATTTCCCA AGGTTACACA GCTATGAAAG 661 CTGAGTAGGG ATTTGATTTT AGAGCATGTG TAACAACCTG TACCTGACCA TTTCCTGAAA 721 ATAGGGCTAC TGCAAAAACC AGGAGTGGAA CTCATCTAAT TCCAATGAAT GATCTACCTA 781 TGACTCTCTA GACAAGTCAC AGAATTTACC CAGTGCTCAG TTCACTCTTC TGTAAAATGA 841 AGATGTTGGA TTTCATGTCC TACAGGGCCT CTCTGAGCTC TCAAATTACA TGCCCATAAA 901 ATATACTAGT TGTAAACTAT TAGTAGAAGG AGCTTTAAAA TCACTGTAGG GAGTCCAAGA 961 TTTGCCACTA ACTTTTTATG ATTGGCAGTG GTGAGCCATT TAACCTCTCC AGTTATTTGC 1021 CTCACTTCAA AGGTTATTTT TCATATATAT GTCAGTGCTT CACAAAGTGC AATACAGCAT 1081 AGAAATACAA AGTACTATGC CATCATCTAA AATAAGATTA CTTCATAGCA TCAGAATTAT 1141 GGATTTAAAA TATTATGTTG AATTGGCTTC TCACTCTTTT TCCCCTTTTA CTTAGAACAT 1201 CTGGTCTGCA ATATTTTAAG GTAATGCTTA TTTGTAGTAG ATTTAAACAA AGAGAGGAAG 1261 AGAGGTAAAG ACAGAGTTTC CGATTTTCCA CTTACATATG AGAAAGGTGG GGTGTCCAAA 1321 GAAGACACAC AGCCCTATAC AGGGGGAGAA GGTGTGCGTT ACTAACATAT TAAGTAAACT 1381 TTAGTGAGGA ACAGCAGTGG AAAATAATCT ATATACCTTG GCTCTTTTGC AGTTTGACAA 1441 AGTTAATGAT TAAAATCTCC TAGATTTTCC ACTACAGTAT CCCCAGGGTG TCTATTTACC 1501 TTGATTGATA TTATTTTATC TCTTTTGGGC CAAAGATAAC AGCCCCTTGC TTCTGTGTTT 1561 AATAATAATT CTGTGTTGCT TCTGAGATTA ATAATTGATT AATTCATAGT CAGGAATCTT 1621 TGTAAAAAGG AAACCAATTA CTTTTGGCTA CCACTTTTAC ATGGTCACCT ACAGGAGAGA 1681 GGAGGTGCTG CAAGACTCTC TGGTAGAAAA ATGAAGAGGG TCCTGGTACT ACTGCTTGCT 1741 GTGGCATTTG GACATGCTTT AGAGAGAGGT AAGATTTCTT TTGTTGTGAC CATTTACAGG 1801 AATTC // LOCUS HUMDES 8878 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human desmin gene, complete cds. ACCESSION M26935 KEYWORDS Alu repetitive sequence; desmin. SOURCE Human DNA, clone pHuDes14, pHuDes1, pHuDes3, pHuDes10. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 8878) AUTHORS Li,Z., Lilienbaum,A., Butler-Browne,G. and Paulin,D. TITLE Human desmin-coding gene: Complete nucleotide sequence, characterization and regulation of expression during myogenesis and development JOURNAL Gene 78, 243-254 (1989) STANDARD full staff_review COMMENT Draft entry and computer readable copy of sequence [1] kindly submitted by D.Paulin, 05-SEP-1989. FEATURES from to/span description pept 166 740 desmin, exon 1 /nomgen="DES" /map="2" /hgml_locus_uid="LS0146T" 1793 1853 desmin, exon 2 1950 2045 desmin, exon 3 2194 2355 desmin, exon 4 2534 2659 desmin, exon 5 3045 3265 desmin, exon 6 5443 5486 desmin, exon 7 7313 7395 desmin, exon 8 7580 7621 desmin, exon 9 pre-msg 86 > 8351 desmin mRNA and introns IVS 741 1792 desmin, intron A IVS 1854 1949 desmin, intron B IVS 2046 2193 desmin, intron C IVS 2356 2533 desmin, intron D IVS 2660 3044 desmin, intron E IVS 3266 5442 desmin, intron F IVS 5487 7312 desmin, intron G IVS 7396 7579 desmin, intron H signal 51 56 TATA promoter (put.) rpt 3374 3668 AluI repetitive sequence 1 rpt 4146 4439 AluI repetitive sequence 2 rpt 5113 5403 AluI repetitive sequence 3 rpt 6571 6856 AluI repetitive sequence 4 signal 8346 8351 poly-A signal BASE COUNT 1968 A 2401 C 2493 G 2016 T ORIGIN 1 GATCTTGCAG CTGTCAGGGG AGGGGAGGCG GGGGCTGATG TCAGGAGGGA TACAAATAGT 61 GCCGACGGCT GGGGGCCCTG TCTCCCCTCG CCGCATCCAC TCTCCGGCCG GCCGCCTGCC 121 CGCCGCCTCC TCCGTGCGCC CGCCAGCCTC GCCCGCGCCG TCACCATGAG CCAGGCCTAC 181 TCGTCCAGCC AGCGCGTGTC CTCCTACCGC CGCACCTTCG GCGGCGCCCC GGTCTTCTCG 241 CTCGGCTCCC CGCTGAGCTC GCCCGTGTTC CCGCGGGCGC CTTTCGGCTC TAAGGGCTCC 301 TCCAGCTCGG TGACGTCCCG CGTGTACCAG GTGTCGCGCA CGTCGGGAGG GGCCGGGGGC 361 CTGGGGTCGC TGCGGGCCAG CCGGCTGGGG ACCACCCGCA CGCCCTCCTC CTACGGCGCA 421 GGCGAGCTGC TGGACTTCTC ACTGGCCGAC GCGGTGAACC AGGAGTTTCT GACCACGCGC 481 ACCAACGAGA AGGTGGAGCT GCAGGAGCTC AATGACCGTT CGCCAATCTA CATGGAGAAG 541 GTGCGCTTCC TGGAGCAGCA GAACGCGCTC GCCGCCGAAG TGAACCGGCT CAAGGGCCGC 601 GAGCCGACGC GAGTGGCCGA GCTCTACGAG GAGGAGCTGC GGGAGCTGCG GCGCCAGGTG 661 GAGGTGCTCA CTAACCAGCG CGCGCGCGTC GACGTCGAGC GCGACAACCT GCTCGACGAC 721 CTGCAGCGGC TCAAGGCCAA GTGAGGGCCC GGCACCCCAG ACTCCTCTTT CTGCGGGCAG 781 GGCACAGGAG GCTAGGCCTG GGGGCTGGGG TCCCGCTGTC ACCACCTGCC TTCTCCCGGG 841 GCCCGACCGC TCCTCCCCCA TGTGGAGAAA GGGTCCTCCA CCTGTGTGTT TCAAGGGGCC 901 GTGACCTCCA GGTCTCTCCC CCTGCGATCC CATCTTGCCA CAGGAGTTTT CTTGGGGACA 961 TAGATCAGGG GGTGGATATG GGAGAATTTA GGGGACCCGG TGCCCTGTGG ACAGCCCCGT 1021 TAAAAAGCAT TTTAAGATGC TGGGGCGATA TTTATGGGGT CAGGTAGTTG ATGGGCAGAG 1081 GAAGGGCTGC AGGAGGCCCA GAGCGAGTGT AGCCAGAGGG AGAAGGGAGG CTGATAGGAG 1141 ACAGGGAAAG CAGGGCAAGG GCCAGAGTCC AAGCAACAGC TCTCAGTCTC AGCGTTGATG 1201 AGGCCCTGGG GGAGGTGGGG GGAGGGGGGA GCTTGGCCCT GGGGCTTGCC GAGACTGTGT 1261 CTTTTTACAA GGTGAATGGA CAGGCTGGAG AAAAAGGGAG TAGGTGGGGG TCACAGCTCT 1321 CAGAGAGCTT GGGAGGACCT GACTGTAGAC TTCACCAGGC TCCAAGAACG AAAAGGGCAG 1381 CAAGTGTAGC ATATTTCGCT CTGGTCCCAC TTCTGACAGG CCAAGTGAGC ACAGTCACCC 1441 TCCTGCCACC AAAGTCATAA ATATTAATTG AGCAGCTATA TGGCCAGGCT GGAGCTGGGA 1501 ACCAGAAACA CAGAGGTGGA TAAAATAGAC ACAGTTTCTA GAGACCCAGG GAGGTCACAC 1561 AGTCTGGTGG GGACATAGAC TTCAAGGGTG TGGCTCCTGG GCAGAGATTG GGCCACTTCC 1621 TCTGCCCTCT CTGGGTGGGT GGGGCCTCTC CACTCCCTGT CTCTCCTGCC TCTACCCAGC 1681 AGCCAGGCCC TCCCCCTCTG TCCTGGACCC ACCCCCTGGT CAGCCCCGCC AGTCGTTTCC 1741 ACTGCCAGCT TTATCACCCG CAACTGTCTG TCTTTCTGTC TGTCCCACCC AGGCTGCAGG 1801 AGGAGATTCA GTTGAAGGAA GAAGCAGAGA ACAATTTGGC TGCCTTCCGA GCGGTGAGTG 1861 CCCTTCTTTT CCCCTTGCAT GGCCTCTCGC GTTTGCTCTG CCCCACCTGG GTCGCGGAGA 1921 CCATGTCCTT CTCGCTTGGC CTCTCCCAGG ACGTGGATGC AGCTACTCTA GCTCGCATTG 1981 ACCTGGAGCG CAGAATTGAA TCTCTCAACG AGGAGATCGC GTTCCTTAAG AAAGTGCATG 2041 AAGAGGTATA CTTGGCCCCT CTTCCTGGGG TCACTGGGCC ATTGGGGAAA GCAGCCGGAA 2101 AGTGGGGTTT GGGGTGAGGC TCTGGCTGGG AATAGGGGTG TGAGGGTCGT GTGTGGGCCC 2161 TGAGAGGGAC TGAAGCCCAG TCATGCCCTA CAGGAGATCC GTGAGTTGCA GGCTCAGCTT 2221 CAGGAACAGC AGGTCCAGGT GGAGATGGAC ATGTCTAAGC CAGACCTCAC TGCCGCCCTC 2281 AGGGATATCC GGGCTCAGTA TGAGACCATC GCGGCTAAGA ACATTTCTGA AGCTGAGGAG 2341 TGGTACAAGT CGAAGGTGGG TGGCCTCGCC CGGGGACTGG GATCTCCGTC CCCCTGGAAT 2401 CCCAAGGCCT TTGGATTGTT GCTTGCCTGT GGTACCATCC ATGGGAGGAG AGCCCAGAGG 2461 TTCATGCTCC CAAGCTCATC CCTACCCGTG CCCTGCATCC TTCTCATTTT TGGGCCCCTT 2521 TCTCTGCCCT TAGGTGTCAG ACCTGACCCA GGCAGCCAAC AAGAACAACG ACGCCCTGCG 2581 CCAGGCCAAG CAGGAGATGA TGGAATACCG ACACCAGATC CAGTCCTACA CCTGCGAGAT 2641 TGACGCCCTC AAGGGCACTG TGAGTCCCTG CCCACCTGGG CCAGGCCCTG CCCCTTCTCT 2701 GTCTGCAGTT CACACCCTCA CTTTGTGACC TTGGCCATCA TAGATCCTCT CTGGCCTTCA 2761 TCTACTTAAA TCTACAATAG GGTAAAACCA GACAAGTGGA TTCCAGTTGG ATGCTAAGGA 2821 ATCAGGGGTT CCTGGGCATC TACCTATGTG GGGACTTGTG AGGCTGATGC AATGTTCCTT 2881 TGTATCTATT TTATTCTGAG TGTTCCACAT ATAGCTTAAT TTGAGTTCAG GGTTCAACAT 2941 GGCCAGGACC TGACCATTCT GGGGTTGCCT GCCAGCCCCA AAGCTTTCTT TGGGCTGCTA 3001 GTGTCCTCTT CCCTTCCTTG ACCTGGGTTC CCCCTCTCCT GCAGAACGAT TCCCTGATGA 3061 GGCAGATGCG GGAATTGGAG GACCGATTTG CCAGTGAGGC CAGTGGCTAC CAGGACAACA 3121 TTGCACGCCT GGAGGAGGAA ATCCGGCACC TCAAGGATGA GATGGCCCGC CATCTGCGCG 3181 AGTACCAGGA CCTGCTCAAC GTGAAGATGG CCCTGGATGT GGAGATTGCC ACCTACCGGA 3241 AGCTGCTGGA GGGAGAGGAG AGCCGGTGAG GGGCCAGGCA GGAGCCGAGT GGGAGGTGCG 3301 GGGTGCTGGG TGGTCCATTT CTGTCCCCAG GAGGCTCGAG ATTACTGATT ACCTCAACAA 3361 GACCTGGAAA CAATTTTTTT TTTTTTTGAG ATGGAGTTTC GCTCTTGTCG CCCAGTCTGG 3421 AGTGCAATGG CACCATCTTG GCTCACTGCA ACCTCCGCCT CCTGGGTTCA AGCAATTCTC 3481 CTGCCTCAGC TTCCCGAGCA GCTGGGATTA CAGGCGCCTA CCACCACACC TGGCTAATTT 3541 TTGTATTTTT AGTAGAGAGG GGTTTCACTA TGTTGGGCAG CTGGTCTCGA ACTCCTGACC 3601 TCAGGTGATC TGCCTACCTC AGCCTCCAAA GTACTGGGAT TACAGGCGTG CGCCGCGCCC 3661 GGCGTGGAAA CAATTTTATA CAAGAGGCCA CTGCTCTATA ACCGCAGAGA ATTAACCAAG 3721 GCCACCTGGG TAAAAAGCAA TTTAATTAAC AGCATTTATA AAAAGAGAAA TATAAGTATT 3781 CCCTTATCCA CCTTCAGAAT TAAGAACCAG CAGCAAAAAA AACCATATTC AGCAACCAAT 3841 AATAAAATAT TACTGCCAGT CAAGGGAGAG ATGAAAGCAG AATGGGGAGT ATGAGGAGGG 3901 AAACGGAGGG AGCAGATAGA GGCAAATAGA GAGAGGGAGG GAGAGACCTT CACAGGGATA 3961 GGCTAAGAGA AGGTAAAGAG ATGCATACAG AAGGCATGAG TTTGCACACA CATGCACACA 4021 TGTCAGCATA TCACATGTCG GAGGAAAAGA CACAGACGAT ATGCTCAGAG CAACCCACTC 4081 TAGACACAGA AACACATACA CTGGAAGGGA CACCAGGGAA CTGTCGGATA CAGAATGTCT 4141 ATATCTTTTT TTTTTTTTTT TGAGCAGAGT CTGCTCTGTA CAGCCGCTAG GTAGCGTGTG 4201 TATCTCCCTA CACTGCAACC TGCTCCTCCC AGGCTCAAGC GCTCTCCTGT CTCAGCCTCC 4261 CTGTAGCTGG ATACAGGATC GACAGTGCCA ACACACTCAG ACTCGATTTT TTGTATTTTT 4321 AGTAGAGATG AGGTTTCACC ATGTTGGTCA GGCTGGACTC TAACTCCTGA CCTCAAGTAA 4381 TTCGCCTGCC TCAGCCTCCC AAAGTGCTGG GATTATAGGC ATGAGCCACT GTGCCTGGCC 4441 TAGAAATGTT CTATATCTTG ATTTGGACTG GTAGTTACAC AGACGTGGAC ATTGATCAAA 4501 ACTCATTGAA TTGTACATTA AAGATCTGTG CATTTCACTG TATGTAAATT TAACCTCAAT 4561 TTAAAACAGA TCAGTTACTC CAAAAACTGG TACCCCTTGA TACTCATAAA AATAACTATC 4621 TGAAATTTTT AAAAAGGAGC AGACATGTTT AGATACAGGT ACAAACATAA AGAGAAACTT 4681 CAGATTTTGT AAAATCACGT CTACATATGA TGGCCTTTGA AGGTAGCTAA AGGCATGTTG 4741 CATTTTCCCA GCATTAGGCC TAAATAGTAA GCTCTCAGAA CTGGCTGGCT GGAGGGTGGG 4801 CATGAGCAAT GGATATCACC CACAGCTGCT ACTAGTACCA ACTTCAGTGC CAGGAGCAAG 4861 AAGGAAATCC TGGTGCTCTT GAGGGTGCGG TGGGGTTGCA CATCCTTAGT GTCGCGCAAC 4921 GCAGTTCACA GTTGAAATAA TGATCCTTCC AGAAGAGACT ATCCCTGGGG ACAGCTCTAG 4981 CTGGCTTCGA CCCCATTCCT TAGCAGTATG GGTGGTGGGA GTGCTGGGGT CGTGGGACTG 5041 GGACAGCTGA GGATGGTGTT TAGAAAGGAC CAGGTGAGCC TTGATGGGCG TGCAGGGAAA 5101 GATCGGGAGG TCTTTTTTTT TTTTTTTAGA CAGGGCTTGT TCTGTACCCC AGGCTGGAGT 5161 GCAGTGGTGC AACTCCGCTC ACTGCAAGCT CCACCTCCTG GGTTCGCACC ATTCTCCTGC 5221 CTCACCCTCC TGAGTATCTG GAACTACAGG TGCCCGCCAC CATGCCCGGC TAATTTTTGT 5281 ATTTTTAGTA GAGACGGGGT TTCACTGTGT TAGCCAGGAT GGTCTCGATC TCCTGACCTG 5341 GTGATCCGCC CGCCTCGGCC TTCCAAAGTG CTGGGATTAC AGCTGGGCCC GGCCGATGGG 5401 AGGGTTCTTA ACTCTTAGGA GGTTTTGTCT CTTCCCTTTT AGGATCAATC TCCCCATCCA 5461 GACCTACTCT GCCCTCAACT TCCGAGGTGA GTGTCTGCTG GAGGCGGAGG CTGGAGTTGC 5521 AGGGGCCAGG AGTCCAGCAT GGGCACTGCC CAAGGCCAGC CAGGAGGGAG GATGGGACCC 5581 TGGGGCTAGG GACAGACCTG AGCTAGGGAA GAAAAGGGAC CACTGCGGGT AGGTCGGGGA 5641 GTTTAGGTAG AGGTGGATGA GGCAACTTAT GACAGAGAGA GCTTTTTATG TGATTATATT 5701 GTATTGTATT TACCATGTCC CCTGCACCTT CTTGGAGTAT TGGGCCTCAG AAGCAGCCTC 5761 TGTTCTGAAT CCCAAACTCA TCCCAAACTC TGAAAGCAGG AGTGTCCTTG GGGCCCTGGC 5821 TTCAGGAGCA TCTCATACCA CCTGCCCTCT AAGACACCCA CAGCCCATTG TCTGTTTGGC 5881 ACATTGCCAA GCCCCACCTT GGCTGCTTGT CGAACACCTC CAGTTTGTTG GAGCAAGATG 5941 TTGGATGCAC GTTTGGAAAA AAATGCCAAA TCTTAGCAAA GCCAAACAAT GAAAAGAAAG 6001 AGCTCTTAGG GCACTCTGCC ATTTTGACAT TAGTAATACG CAGATCGGGT GGCTTTCCAA 6061 CCCCAGAGAG CAGAGAAACA CAGGATAATG ATGCATTAGG AAAGCAAGTC GGAGTCCTTG 6121 GCAGCTTCTC CTTTCCTAGA TCAAACCTTT TCTATCTTCG ATGTACATTT CCATCAGGAA 6181 AAAAAACGCG CTGCGTCCAG CGAGTCACAA CATTCATCAC AGGCTGGTTG CTAGCTAAGG 6241 AAATTCGTAA GCACCAAACT CATAATACCA AGGACCTCCA TTTGACCTTT CCTACACAAC 6301 TCAGAGCCTC TCTTTTCAGT TAGTTGTTTA TACAGACACC AAGTATGAGT CTTGCATTTA 6361 AAACTTCATA GTACAAAAAA CTCCACCCAC ATTGCACAGT GCTTTTCCAA ATCAATAGCT 6421 TTGTGGCCAT GATAAGGTTG GTACCACTTC ACTCCGTGTC CTCAGTGAGG AAACTGAGGC 6481 ACAGAATGGC AAGTGACTGA GCATCAGCAC AATGAGTCAG TGCCTTATTC CTGCTCTGTC 6541 TCACTTAAGA CATCAAGTTG CTCATCTGAA TTTTTTTTTT CTTTTGAAAC GGAGTCTCGC 6601 TCTGTCGCCC AGGCTGGAGT GCAGTGGTGC AATCTCGGCT CACCGCAACC TCCGTCCTCC 6661 CAGGTTCAAG CAATTCTCCT GCCTCAGCCT CCTGAGTAGC TGGGATTACA GGCATGTGCA 6721 CCAGTCCCCC TAATTTTGTA TTTTTAGTAG AGACGGGTTC ACATGTTGGT CAGGGTGGTC 6781 TGGAACTCCT GGTACTCATG ATCCACTGCC TTGGTCCAAA CTACTGGCAT TACAGATGTG 6841 ACCAACCGCA CCTGGCCCTG AAATCTTAAA GGGAGATAGG TACTGTAAGG TCCTCTAAAG 6901 AGTGTCTTGA GACTGGGCTT TGGGGTTCTA TCTTGAAGAA GGGGAGTCCC AGGAGACAAA 6961 TGTGTGCAGG GCTCTCCTGG GGCCTGGGGG TGGAGAGGAA CTAGGAGGGA TGGGGAATGT 7021 CAGTGCTGTG CAGCCTGGGC CTCAGGTGTC CCCTACCCTC CTGCACCATC CTGCACATGG 7081 AGCAAATCTG TTGGCTCCTG AGACCATCTA AACTATGGGA CAGGCGCTGG GGGCTGCAGC 7141 TGGGTGCTGG GCTGAAGGAA AGGTGTTAAA GTCTTGAAGA AGTAACAAGC CTGTCTTGAG 7201 GGGGTTGGGG TCTGCTAGGG CTCTGCCAAT GTGGCCCCAG ATGGACTCCC AGCCCCTGGT 7261 ATAGCCCAGC CTGGACTTGG TCAGGCTGAG TGTGCGATGG ACCCTGTTAC AGAAACCAGC 7321 CCTGAGCAAA GGGGTTCTGA GGTCCATACC AAGAAGACGG TGATGATCAA GACCATCGAG 7381 ACACGGGATG GGGAGGTAAG TGGTCTGTCT GGGCTCCTTA CCCTTGGTTG GGGCTATGGA 7441 TGTGTCTGGG GGGACTGTCT TCCACCAGCT GTGCAGGTCT AGGTCCTGGC TAGTGGGCAA 7501 GAGAGATCCT GCGATCCTGG GTGGGATGCC TAGGCTGACG TCATCCTGCT AGCACATGTG 7561 ACTGGACTCC TCTCCCCAGG TCGTCAGTGA GGCCACACAG CAGCAGCATG AAGTGCTCTA 7621 AAGACGAGAG ACCCTCTGCC ACCAGAGACC GTCCTCACCC CTGTCCTCAC TGCTCCCTGA 7681 AGCCCAGCCT TCTTCCGATC CCAGGACACC ACACCCAGCC TCAGTCCTCC AAATCACAGC 7741 CTCTGACCCC TCCTCACTGG CCATCCCTCG TGGTCCCCAA CAGCGACATA GCCCATCCCT 7801 GCCTGGTCAC AGGCATGCCC CGGCCACCTC TGCGGACCCC AGCTGTGAGC CTTGGCTGTT 7861 GGCAGTGAGT GAGCCTGGCT CTTGTGCTGG ATGGAGCCCA GGCGGGAGCG GTGGCCCTGT 7921 CCCTCCCACC TCTGTGACCT CAGGCCTACG CTTTGGCTCT GGAGATAGCC CCAGAGCAGG 7981 GTGTTGGGAT ACTGCAGGGC CAGGACTGAG CCCCGCAGAC CTCCCCAGCC CCTAGCCCAG 8041 GAGAGAGAAA GCCAGGCAGG TAGCCTGGGG GACTAGCCCT GTGGAGACTG GGGGGCTTGA 8101 AATTGTCCCC GTGGTCTCTT ACTTTCCTTT CCCCAGCCCA GGGTGGACTT AGAAAGCAGG 8161 GGCTACAAGA GGGAATCCCC GAAGGTGCTG GAGGTGGGAG CAGGAGATTG AGAAGGAGAG 8221 AAAGTGGGTG AGATGCTGGA GAAGAGAGGA GAGGAGAGAG GCAGAGAGCG GTCTCAGGCT 8281 GGTGGGAGGG GCGCCCACCT CCCCACGCCC TCCCCCCCCC TGCTGCAGGG GCTCTGGAGA 8341 GAAACAATAA AGAGATTCAC ACACAAGCCA AGCTGGCCCC TGTTGTCTTG ACTGCCAGGT 8401 CTCCTGTGGG TCCCCTGGCC GGTGTGATAC AACCTGTCAC CCTGAGATGG ACTATACTGA 8461 CCTTCTACCC AGCCAGACTT TTCCCAGGGC CTCGCGCCAC TTCTGTTCCC AGAGCCACCT 8521 GGCCTTGATG TCTGTATGCA GTGTATGGGG GTGGAGGGGC TGGACAGGGC TTTTGTTGGG 8581 GAACCACAAA TACCTCTGCT TAGGCCCCAG TTCCAGGACT GCACCCCTTG TGCTCTCTAC 8641 TACACCTTCA GGTCAGCCAG GCTGGGGGAC ACCACCTGCT ATTCCTTTTG AAGGATGTAC 8701 CTGCTCCCCA CTGACTGGCT CTGAAGTAGG GCTCCCCCAG TCCCTTCTTC AAAGATGCCA 8761 AGAGCCCCTT TTTTTCTCTC TCTCTTTTTT TCTTTTTGAG ATGGAGTCTT GCTGTGTCAC 8821 CCAGGCTGGA GTGCAGCAGT GTGATCACCG CTTACTGCAG CCTGGACTTC CTGAGCTC // LOCUS HUMDMD17 1041 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human Duchenne muscular dystrophy gene, exon 17. ACCESSION X13045 KEYWORDS Duchenne muscular dystrophy; dystrophin. SOURCE Human DNA, (library of J.Weiss). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1041) AUTHORS Chamberlain,J.S. JOURNAL Unpublished (1988) Baylor Med. Sch. 1 Baylor Plaza, Houston TX STANDARD simple staff_entry REFERENCE 2 (bases 6 to 421) AUTHORS Chamberlain,J.S., Gibbs,R.A., Ranier,J.E., Nguyen,P.N. and Caskey,C.T. TITLE Deletion screening of the Duchenne muscular dystrophy locus via multiplex DNA amplification JOURNAL Nucleic Acids Res. 16, 11141-11156 (1988) STANDARD simple staff_entry COMMENT Submitted (23-SEP-1988) on tape to the EMBL data library by J.S.Chamberlain. FEATURES from to/span description ORF / 227 / 402 dystrophin (AA at 227) /nomgen="DMD" /map="Xp21.3-p21.1" /hgml_locus_uid="LD0116H" pre-msg < 1 > 1041 DMD mRNA and introns IVS < 1 226 DMD intron P IVS 403 > 1041 DMD intron Q BASE COUNT 325 A 186 C 186 G 344 T ORIGIN Chromosome Xp21.3-p21.1. 1 TAAATTGACT TTCGATGTTG AGATTACTTT CCCTTGCTAT TTCAGTGAAC CAAACTTAAG 61 TCAGATAAAA CAATTTTATT TGGCTTCAAT ATGGTGCTAT TTTGATCTGA AGGTCAATCT 121 ACCAACAAGC AAGAACAGTT TCTCATTATT TTCCTTTGCC ACTCCAAGCA GTCTTTACTG 181 AAGTCTTTCG AGCAATGTCT GACCTCTGTT TCAATACTTC TCACAGATTT CACAGGCTGT 241 CACCACCACT CAGCCATCAC TAACACAGAC AACTGTAATG GAAACAGTAA CTACGGTGAC 301 CACAAGGGAA CAGATCCTGG TAAAGCATGC TCAAGAGGAA CTTCCACCAC CACCTCCCCA 361 AAAGAAGAGG CAGATTACTG TGGATTCTGA AATTAGGAAA AGGTGAGAGC ATCTCAAGCT 421 TTTATCTGCA AATGAAGTGG AGAAAACTCA TTTACAGCAG TTTTGTTGGT GGTGTTTTCA 481 CTTCAGCAAT ATTTCCAGAA TCCTCGGGTA CCTGTAATGT CAGTTAATGT AGTGAGAAAA 541 ATTATGAAGT ACATTTTAAA ACTTTCACAA GAAATCACTA TCGCAACAGA AACTAAATGC 601 TTAATGGAAA TGGTGTTTTC TGGGGTGAAA GAAGAAACTA TAGAAACTAT AGGTGATAAA 661 CTACTGTGGT AGCATTTTAA TCCTAAAAGT TTCTTTCTTT CTTTTTTTTT TTTCTTCCTT 721 ATAAAGGGCC TGCTTGTTGA GTCCCTAGTT TTGCATTAAA TGTCTTTTTT TTCCAGTAAC 781 GGAAAGTGCA TTTTCATGAA GAAGTACACC TATAATAGAT GGGATCCATC CTGGTAGTTT 841 ACGAGAACAT GATGTCTCAG TCTGCGCATC CTAAATCAGG AGTAATTACA GAACACATTT 901 CCTGTTCTTT GATATTTATA AAGTCTTATC TTGAAGGTGT TAGAATTTTT AACTGATCTT 961 TTTGTGACTA TTCAGAATTA TGCATTTTAG ATAAGATTAG GTATTATGTA AATCAGTGGA 1021 TATATTAAAT GATGGCAATA A // LOCUS HUMDMDHI41 583 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human 4.1 kb Hind III fragment of Duchenne muscular dystrophy gene with exon X. ACCESSION X13046 KEYWORDS Duchenne muscular dystrophy; dystrophin. SOURCE Human DNA, (library of J.Weiss), clone TN-4.1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 583) AUTHORS Chamberlain,J.S., Gibbs,R.A., Ranier,J.E., Nguyen,P.N. and Caskey,C.T. TITLE Deletion screening of the Duchenne muscular dystrophy locus via multiplex DNA amplification JOURNAL Nucleic Acids Res. 16, 11141-11156 (1988) STANDARD simple staff_entry COMMENT Submitted (23-SEP-1988) on tape to the EMBL data library by J.S.Chamberlain. FEATURES from to/span description pept / 295 / 442 dystrophin, exon X (AA at 296) /nomgen="DMD" /map="Xp21.3-p21.1" /hgml_locus_uid="LD0116H" pre-msg < 1 > 583 DMD mRNA and intron IVS < 1 294 DMD intron X IVS 443 > 583 DMD intron X+1 BASE COUNT 188 A 83 C 93 G 219 T ORIGIN Chromosome Xp21.3-p21.1. 1 TGTCCAAAAT AGTTGACTTT CTTTCTTTAA TCAATAAATA TATTACTTTA AAGGGAAAAA 61 TTGCAACCTT CCATTTAAAA TCAGCTTTAT ATTGAGTATT TTTTTAAAAT GTTGTGTGTA 121 CATGCTAGGT GTGTATATTA ATTTTTATTT GTTACTTGAA ACTAAACTCT GCAAATGCAG 181 GAAACTATCA GAGTGATATC TTTGTCAGTA TAACCAAAAA ATATACGCTA TATCTCTATA 241 ATCTGTTTTA CATAATCCAT CTATTTTTCT TGATCCATAT GCTTTTACCT GCAGGCGATT 301 TGACAGATCT GTTGAGAAAT GGCGGCGTTT TCATTATGAT ATAAAGATAT TTAATCAGTG 361 GCTAACAGAA GCTGAACAGT TTCTCAGAAA GACACAAATT CCTGAGAATT GGGAACATGC 421 TAAATACAAA TGGTATCTTA AGGTAAGTCT TTGATTTGTT TTTTCGAAAT TGTATTTATC 481 TTCAGCACAT CTGGACTCTT TAACTTCTTA AAGATCAGGT TCTGAAGGGT GATGGAAATT 541 ACTTTTGACT GTTGTTGTCA TCATTATATT ACTAGAAAGA AAA // LOCUS HUMDMDHI5 625 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human Duchenne muscular dystrophy DNA fragment (overlaps 0.5 kb Hind III fragment) with exon Z. ACCESSION X13048 KEYWORDS Duchenne muscular dystrophy; dystrophin. SOURCE Human DNA, (library of J.Weiss), clone FV-0.5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 625) AUTHORS Chamberlain,J.S. JOURNAL Unpublished (1988) Baylor Med. Sch. 1 Baylor Plaza, Houston TX STANDARD simple staff_entry REFERENCE 2 (bases 30 to 625) AUTHORS Chamberlain,J.S., Gibbs,R.A., Ranier,J.E., Nguyen,P.N. and Caskey,C.T. TITLE Deletion screening of the Duchenne muscular dystrophy locus via multiplex DNA amplification JOURNAL Nucleic Acids Res. 16, 11141-11156 (1988) STANDARD simple staff_entry COMMENT Submitted (23-SEP-1988) on tape to the EMBL data library by J.S.Chamberlain. FEATURES from to/span description pept / 396 / 571 dystrophin, exon Z (AA at 398) /nomgen="DMD" /map="Xp21.3-p21.1" /hgml_locus_uid="LD0116H" pre-msg < 1 > 625 DMD mRNA and introns IVS < 1 395 DMD intron Z IVS 572 > 625 DMD intron Z+1 BASE COUNT 195 A 110 C 130 G 190 T ORIGIN Chromosome Xp21.3-p21.1. 1 ACCCAAATAC TTTGTTCATG TTTAAATTTT ACAACATTTC ATAGACTATT AAACATGGAA 61 CATCCTTGTG GGGACAAGAA ATCGAATTTG CTCTTGAAAA GGTTTCCAAC TAATTGATTT 121 GTAGGACATT ATAACATCCT CTAGCTGACA AGCTTACAAA AATAAAAACT GGAGCTAACC 181 GAGAGGGTGC TTTTTTCCCT GACACATAAA AGGTGTCTTT CTGTCTTGTA TCCTTTGGAT 241 ATGGGCATGT CAGTTTCATA GGGAAATTTT CACATGGAGC TTTTGTATTT CTTTCTTTGC 301 CAGTACAACT GCATGTGGTA GCACACTGTT TAATCTTTTC TCAAATAAAA AGACATGGGG 361 CTTCATTTTT GTTTTGCCTT TTTGGTATCT TACAGGAACT CCAGGATGGC ATTGGGCAGC 421 GGCAAACTGT TGTCAGAACA TTGAATGCAA CTGGGGAAGA AATAATTCAG CAATCCTCAA 481 AAACAGATGC CAGTATTCTA CAGGAAAAAT TGGGAAGCCT GAATCTGCGG TGGCAGGAGG 541 TCTGCAAACA GCTGTCAGAC AGAAAAAAGA GGTAGGGCGA CAGATCTAAT AGGAATGAAA 601 ACATTTTAGC AGACTTTTTA AGCTT // LOCUS HUMDMDHI7 641 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human Duchenne muscular dystrophy DNA fragment (overlaps 1.2 and 3.8 kb Hind III fragments) with exon Y. ACCESSION X13047 KEYWORDS Duchenne muscular dystrophy; dystrophin. SOURCE Human DNA, (library of J.Weiss), clone DMDHIB. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 641) AUTHORS Chamberlain,J.S., Gibbs,R.A., Ranier,J.E., Nguyen,P.N. and Caskey,C.T. TITLE Deletion screening of the Duchenne muscular dystrophy locus via multiplex DNA amplification JOURNAL Nucleic Acids Res. 16, 11141-11156 (1988) STANDARD simple staff_entry COMMENT Submitted (23-SEP-1988) on tape to the EMBL data library by J.S.Chamberlain. FEATURES from to/span description pept / 211 / 396 dystrophin /nomgen="DMD" (AA at 211) /map="Xp21.3-p21.1" /hgml_locus_uid="LD0116H" pre-msg < 1 > 641 DMD mRNA and introns IVS < 1 210 DMD intron Y IVS 397 > 641 DMD intron Y+1 BASE COUNT 229 A 88 C 115 G 209 T ORIGIN Chromosome Xp21.3-p21.1. 1 TTTTGTAGAC GGTTAATGAA TAATTTTGAA TACATTGGTT AAATCCCAAC ATGTAATATA 61 TGTAAATAAT CAATATTATG CTGCTAAAAT AACACAAATC AGTAAGATTC TGTAATATTT 121 CATGATAAAT AACTTTTGAA AATATATTTT TAAACATTTT GCTTATGCCT TGAGAATTAT 181 TTACCTTTTT AAAATGTATT TTCCTTTCAG GTTTCCAGAG CTTTACCTGA GAAACAAGGA 241 GAAATTGAAG CTCAAATAAA AGACCTTGGG CAGCTTGAAA AAAAGCTTGA AGACCTTGAA 301 GAGCAGTTAA ATCATCTGCT GCTGTGGTTA TCTCCTATTA GGAATCAGTT GGAAATTTAT 361 AACCAACCAA ACCAAGAAGG ACCATTTGAC GTTAAGGTAG GGAACTTTTT GCTTTAAATA 421 TTTTTGTCTT TTTTAAGAAA AATGGCAATA TCACTGAATT TTCTCATTTG GTATCATTAT 481 TAAAGACAAA ATATTACTTG TTAAAGTGTG GTAAGGAAGA CTTTATTCAG GATAACCACA 541 ATAGGCACAG GGACCACTGC AATGGAGTAT TACAGGAGGT TGGATAGAGA GAGATTGGGC 601 TCAACTCTAA ATACAGCACA GTGGAAGTAG GAATTTATAG C // LOCUS HUMDONT01 922 bp ss-mRNA PRI 15-JUN-1989 DEFINITION Human terminal deoxynucleotidyltransferase gene, 5' end and exon 1. ACCESSION M22968 J03243 KEYWORDS DNA polymerase. SEGMENT 1 of 11 SOURCE Human maniatis fetal liver library, cDNA to mRNA, clones G21, C1, G1, G3, G5, G9, G10. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 922) AUTHORS Riley,L., Morrow,J., Danton,M. and Coleman,M. TITLE Human terminal deoxyribonucleotidyltransferase: molecular cloning and structural analysis of the gene and 5' flanking region JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 2489-2493 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by M.S.Coleman 15-JUN-1988 FEATURES from to/span description pept 710 + 912 terminal DONT (EC 2.7.7.31), exon 1 /nomgen="DNTT" /map="10q23-q24" /hgml_locus_uid="LA0024A" IVS 913 > 922 terminal DONT, intron A BASE COUNT 294 A 202 C 217 G 209 T ORIGIN 713 bp upstream of BamHI site; chromosome 10q23-q24. 1 ATGGCTTGGC CCCTTATTCT GTGATCTTGA ATAACTGAGC CTTGGTTCTC TTGCTTATAA 61 AATGTACCTA TGTAAGTGAA CAGTGGTAAT ACCCACCATG AAATACTATG GAGCAGTTAG 121 AAGCAACAGA GCAATTAGAA TACGGATCTG GAAAACATAG TTCCAAGTGA AATAAAAAGG 181 TGATTGGCAA AATGAGAAAT GTAACAATGC CATTTATGTA AAGTCAAAAC AACCACAAGC 241 AGAATATAAC ACCCATGCAA ATAACAGACA TCCAGTACAG TGTAATGATT GTCTACAAAA 301 GGAGGAGAAT GCAAGTGAGT TGGGAGGTTT AAAGGGGATC AATAAATAAA GACCACAGAA 361 GGGCCTCAGT ACATTTAGAA ATAAATAACA TGCAAACAAT GATGCTTCCC TACCTTCCTC 421 ACGAGGTTAC TCTGAGGAGC TAATGAGATT GTGCGTATGA AAACAAAAGT TAATTGACTG 481 TCTTCATTAT GATCTCCATA TTGGACAAAT GAACTGAACA GAGATAAAAA TTCCCCATCA 541 CACTTTGGCA GGAAGCTGTT GCCAGGGCAG CACCTGTGAA GCCCTGGCCT GGCTTCAGAG 601 TCTGCTGGTG AGATGACATC AAAACCCTTC GTGTAGGAGG GTGGCAGTCT CCCTCCCTTC 661 TGGAGACACC ACCAGATGGG CCAGCCAGAG GCAGCAGCAG CCTCTTCCCA TGGATCCACC 721 ACGAGCGTCC CACTTGAGCC CTCGGAAGAA GAGACCCCGG CAGACGGGTG CCTTGATGGC 781 CTCCTCTCCT CAAGACATCA AATTTCAAGA TTTGGTCGTC TTCATTTTGG AGAAGAAAAT 841 GGGAACCACC CGCAGAGCGT TCCTCATGGA GCTGGCCCGC AGGAAAGGGT TCAGGGTTGA 901 AAATGAGCTC AGGTAGGACA GC // LOCUS HUMDONT02 195 bp ss-mRNA PRI 15-JUN-1989 DEFINITION Human terminal deoxynucleotidyltransferase gene, exon 2. ACCESSION M20694 J03243 KEYWORDS DNA polymerase. SEGMENT 2 of 11 SOURCE Human maniatis fetal liver library, cDNA to mRNA, clones C4, C5, G1, G3, G5, G9, G10, G11, G12. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 195) AUTHORS Riley,L., Morrow,J., Danton,M. and Coleman,M. TITLE Human terminal deoxyribonucleotidyltransferase: molecular cloning and structural analysis of the gene and 5' flanking region JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 2489-2493 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by M.S.Coleman 15-JUN-1988 FEATURES from to/span description pept + 11 + 185 terminal DONT, exon 2 /nomgen="DNTT" /map="10q23-q24" /hgml_locus_uid="LA0024A" (no splice consensus) IVS < 1 10 terminal DONT, intron A IVS 186 > 195 terminal DONT, intron B BASE COUNT 55 A 44 C 51 G 45 T ORIGIN Chromosome 10q23-q24. 1 CATTTTGCAG TGATTCTGTC ACCCACATTG TAGCAGAGAA CAACTCGGGT TCGGATGTTC 61 TGGAGTGGCT TCAAGCACAG AAAGTACAAG TCAGCTCACA ACCAGAGCTC CTCGATGTCT 121 CCTGGCTGAT CGAATGCATA GGAGCAGGGA AACCGGTGGA AATGACAGGA AAACACCAGC 181 TTGTTGTAGT GTCAT // LOCUS HUMDONT03 149 bp ss-mRNA PRI 15-JUN-1989 DEFINITION Human terminal deoxynucleotidyltransferase gene, exon 3. ACCESSION M20695 J03243 KEYWORDS DNA polymerase. SEGMENT 3 of 11 SOURCE Human maniatis fetal liver library, cDNA to mRNA, clones G1, G3, G5, G9, G10, G11, G12, C4, C5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 149) AUTHORS Riley,L., Morrow,J., Danton,M. and Coleman,M. TITLE Human terminal deoxyribonucleotidyltransferase: molecular cloning and structural analysis of the gene and 5' flanking region JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 2489-2493 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by M.S.Coleman 15-JUN-1988 FEATURES from to/span description pept + 11 + 139 terminal DONT, exon 3 /nomgen="DNTT" /map="10q23-q24" /hgml_locus_uid="LA0024A" IVS < 1 10 terminal DONT, intron B IVS 140 > 149 terminal DONT, intron C BASE COUNT 52 A 38 C 29 G 30 T ORIGIN Chromosome 10q23-q24. 1 AATTGTTAAG GTGAGAAGAG ACTATTCAGA TAGCACCAAC CCAGGCCCCC CGAAGACTCC 61 ACCAATTGCT GTACAAAAGA TCTCCCAGTA TGCGTGTCAG AGAAGAACCA CTTTAAACAA 121 CTGTAACCAG ATATTCACGG TAACGGACT // LOCUS HUMDONT04 191 bp ss-mRNA PRI 15-JUN-1989 DEFINITION Human terminal deoxynucleotidyltransferase gene, exon 4. ACCESSION M20696 J03243 KEYWORDS DNA polymerase. SEGMENT 4 of 11 SOURCE Human maniatis fetal liver library, cDNA to mRNA, clones G1, G3, G4, G5, G7-12. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 191) AUTHORS Riley,L., Morrow,J., Danton,M. and Coleman,M. TITLE Human terminal deoxyribonucleotidyltransferase: molecular cloning and structural analysis of the gene and 5' flanking region JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 2489-2493 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by M.S.Coleman 15-JUN-1988 FEATURES from to/span description pept + 11 + 181 terminal DONT, exon 4 /nomgen="DNTT" /map="10q23-q24" /hgml_locus_uid="LA0024A" IVS < 1 10 terminal DONT, intron C IVS 182 > 191 terminal DONT, intron D BASE COUNT 57 A 33 C 50 G 51 T ORIGIN 141 bp upstream of EcoRI site; chromosome 10q23-q24. 1 TATCCTGCAG GATGCCTTTG ATATACTGGC TGAAAACTGT GAGTTTAGAG AAAATGAAGA 61 CTCCTGTGTG ACATTTATGA GAGCAGCTTC TGTATTGAAA TCTCTGCCAT TCACAATCAT 121 CAGTATGAAG GACACAGAAG GAATTCCCTG CCTGGGGTCC AAGGTGAAGG GTATCATAGA 181 GGTAAGGGTG A // LOCUS HUMDONT05 92 bp ss-mRNA PRI 15-JUN-1989 DEFINITION Human terminal deoxynucleotidyltransferase gene, exon 5. ACCESSION M20697 J03243 KEYWORDS DNA polymerase. SEGMENT 5 of 11 SOURCE Human maniatis fetal liver library, cDNA to mRNA, clones G1, G3, G4-5, G7-12. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 92) AUTHORS Riley,L., Morrow,J., Danton,M. and Coleman,M. TITLE Human terminal deoxyribonucleotidyltransferase: molecular cloning and structural analysis of the gene and 5' flanking region JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 2489-2493 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by M.S.Coleman 15-JUN-1988 FEATURES from to/span description pept + 11 + 82 terminal DONT, exon 5 /nomgen="DNTT" /map="10q23-q24" /hgml_locus_uid="LA0024A" IVS < 1 10 terminal DONT, intron D IVS 83 > 92 terminal DONT, intron E BASE COUNT 33 A 9 C 21 G 29 T ORIGIN Chromosome 10q23-q24. 1 TGTCCATTAG GAGATTATTG AAGATGGAGA AAGTTCTGAA GTTAAAGCTG TGTTAAATGA 61 TGAACGATAT CAATCCTTCA AAGTAAGTGA TT // LOCUS HUMDONT06 144 bp ss-mRNA PRI 15-JUN-1989 DEFINITION Human terminal deoxynucleotidyltransferase gene, exon 6. ACCESSION M20698 J03243 KEYWORDS DNA polymerase. SEGMENT 6 of 11 SOURCE Human maniatis fetal liver library, cDNA to mRNA, clones G1, G3, G4-5, G7-12. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 144) AUTHORS Riley,L., Morrow,J., Danton,M. and Coleman,M. TITLE Human terminal deoxyribonucleotidyltransferase: molecular cloning and structural analysis of the gene and 5' flanking region JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 2489-2493 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by M.S.Coleman 15-JUN-1988 FEATURES from to/span description pept + 11 + 134 terminal DONT, exon 6 /nomgen="DNTT" /map="10q23-q24" /hgml_locus_uid="LA0024A" IVS < 1 10 terminal DONT, intron E IVS 135 > 144 terminal DONT, intron F BASE COUNT 41 A 22 C 38 G 43 T ORIGIN Chromosome 10q23-q24. 1 TCCATTTTAG CTCTTTACTT CTGTATTTGG AGTGGGGCTG AAGACTTCTG AGAAGTGGTT 61 CAGGATGGGT TTCAGAACTC TGAGTAAAGT AAGGTCGGAC AAAAGCCTGA AATTTACACG 121 AATGCAGAAA GCAGGTAATT GTCT // LOCUS HUMDONT07 153 bp ss-mRNA PRI 15-JUN-1989 DEFINITION Human terminal deoxynucleotidyltransferase gene, exon 7. ACCESSION M20699 J03243 KEYWORDS DNA polymerase. SEGMENT 7 of 11 SOURCE Human maniatis fetal liver library, cDNA to mRNA, clones G1, G3, G4-5, G7-12. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 153) AUTHORS Riley,L., Morrow,J., Danton,M. and Coleman,M. TITLE Human terminal deoxyribonucleotidyltransferase: molecular cloning and structural analysis of the gene and 5' flanking region JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 2489-2493 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by M.S.Coleman 15-JUN-1988 FEATURES from to/span description pept + 11 + 143 terminal DONT, exon 7 /nomgen="DNTT" /map="10q23-q24" /hgml_locus_uid="LA0024A" IVS < 1 10 terminal DONT, intron F IVS 144 > 153 terminal DONT, intron G BASE COUNT 32 A 31 C 47 G 43 T ORIGIN Chromosome 10q23-q24. 1 GGCCTCTTAG GATTTCTGTA TTATGAAGAC CTTGTCAGCT GTGTGACCAG GGCAGAAGCA 61 GAGGCCGTCA GTGTGCTGGT TAAAGAGGCT GTCTGGGCAT TTCTTCCGGA TGCTTTCGTC 121 ACCATGACAG GAGGGTTCCG GAGGTAATTA ACT // LOCUS HUMDONT08 126 bp ss-mRNA PRI 15-JUN-1989 DEFINITION Human terminal deoxynucleotidyltransferase gene, exon 8. ACCESSION M20700 J03243 KEYWORDS DNA polymerase. SEGMENT 8 of 11 SOURCE Human maniatis fetal liver library, cDNA to mRNA, clones G3, G4-5, G7-8, G13-14. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 126) AUTHORS Riley,L., Morrow,J., Danton,M. and Coleman,M. TITLE Human terminal deoxyribonucleotidyltransferase: molecular cloning and structural analysis of the gene and 5' flanking region JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 2489-2493 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by M.S.Coleman 15-JUN-1988 FEATURES from to/span description pept + 11 + 116 terminal DONT, exon 8 /nomgen="DNTT" /map="10q23-q24" /hgml_locus_uid="LA0024A" IVS < 1 10 terminal DONT, intron G IVS 117 > 126 terminal DONT, intron H BASE COUNT 50 A 12 C 37 G 27 T ORIGIN Chromosome 10q23-q24. 1 GAAAATTGAG GGGTAAGAAG ATGGGGCATG ATGTAGATTT TTTAATTACC AGCCCAGGAT 61 CAACAGAGGA TGAAGAGCAA CTTTTACAGA AAGTGATGAA CTTATGGGAA AAGAAGGTGA 121 GAAGAA // LOCUS HUMDONT09 269 bp ss-mRNA PRI 15-JUN-1989 DEFINITION Human terminal deoxynucleotidyltransferase gene, exon 9. ACCESSION M20701 J03243 KEYWORDS DNA polymerase. SEGMENT 9 of 11 SOURCE Human maniatis fetal liver library, cDNA to mRNA, clones G2, G4, G6-8, G11-14. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 269) AUTHORS Riley,L., Morrow,J., Danton,M. and Coleman,M. TITLE Human terminal deoxyribonucleotidyltransferase: molecular cloning and structural analysis of the gene and 5' flanking region JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 2489-2493 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by M.S.Coleman 15-JUN-1988 FEATURES from to/span description pept + 11 + 259 terminal DONT, exon 9 /nomgen="DNTT" /map="10q23-q24" /hgml_locus_uid="LA0024A" IVS < 1 10 terminal DONT, intron H IVS 260 > 269 terminal DONT, intron I (no splice consensus 260) BASE COUNT 56 A 59 C 73 G 81 T ORIGIN Chromosome 10q23-q24. 1 TTCTTTTCAG GGATTACTTT TATATTATGA CCTTGTGGAG TCAACATTTG AAAAGCTCAG 61 GTTGCCTAGC AGGAAGGTTG ATGCTTTGGA TCATTTTCAA AAGTGCTTTC TGATTTTCAA 121 ATTGCCTCGT CAAAGAGTGG ACAGTGACCA GTCCAGCTGG CAGGAAGGAA AGACCTGGAA 181 GGCCATCCGT GTGGATTTAG TTCTGTGCCC CTACGAGCGT CGTGCCTTTG CCCTGTTGGG 241 ATGGACTGGC TCCCCTCGGT TGCCGCCGG // LOCUS HUMDONT10 104 bp ss-mRNA PRI 15-JUN-1989 DEFINITION Human terminal deoxynucleotidyltransferase gene, exon 10. ACCESSION M20702 J03243 KEYWORDS DNA polymerase. SEGMENT 10 of 11 SOURCE Human maniatis fetal liver library, cDNA to mRNA, clones G2, G4, G6-8, G11-14. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 104) AUTHORS Riley,L., Morrow,J., Danton,M. and Coleman,M. TITLE Human terminal deoxyribonucleotidyltransferase: molecular cloning and structural analysis of the gene and 5' flanking region JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 2489-2493 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by M.S.Coleman 15-JUN-1988 FEATURES from to/span description pept + 11 + 94 terminal DONT, exon 10 /nomgen="DNTT" /map="10q23-q24" /hgml_locus_uid="LA0024A" IVS < 1 10 terminal DONT, intron I IVS 95 103 terminal DONT, intron J (no splice consensus 103) BASE COUNT 30 A 26 C 24 G 24 T ORIGIN Chromosome 10q23-q24. 1 TCCTATCCAG CAGTTTGAGA GAGACCTCCG GCGCTATGCC ACACATGAGC GGAAGATGAT 61 TCTGGATAAC CATGCTTTAT ATGACAAGAC CAAGGTACAG TTCC // LOCUS HUMDONT11 310 bp ss-mRNA PRI 15-JUN-1989 DEFINITION Human terminal deoxynucleotidyltransferase gene, exon 11. ACCESSION M20703 J03243 KEYWORDS DNA polymerase. SEGMENT 11 of 11 SOURCE Human maniatis fetal liver library, cDNA to mRNA, clones G2, G6, G13-14. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 310) AUTHORS Riley,L., Morrow,J., Danton,M. and Coleman,M. TITLE Human terminal deoxyribonucleotidyltransferase: molecular cloning and structural analysis of the gene and 5' flanking region JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 2489-2493 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by M.S.Coleman 15-JUN-1988 FEATURES from to/span description pept + 11 97 terminal DONT, exon 11 /nomgen="DNTT" /map="10q23-q24" /hgml_locus_uid="LA0024A" IVS < 1 10 terminal DONT, intron J BASE COUNT 99 A 37 C 72 G 102 T ORIGIN 303 bp upstream of EcoRI site; chromosome 10q23-q24. 1 TGTTTTCGAG AGGATATTCC TCAAAGCAGA AAGTGAAGAA GAAATTTTTG CGCATCTGGG 61 ATTGGATTAT ATTGAACCGT GGGAAAGAAA TGCCTAGGAA AGTGTTGTCA ACATTTTTTC 121 CTATTCTTTT CAAGTTAAAT AAATTATGCT TCATATTAGT AAAAGATGCC ATAGGAGAGT 181 TTGGGGTTAT TTAGGTCTTA TTGAAATGCA GATTGCTACT AGAAATAAAT AACTTTGGAA 241 ACATGGGAAG GTGCCACTGG TAATGGGTAA GGTTCTAATA GGCCATGTTT ATGACTGTTG 301 CATAGAATTC // LOCUS HUMEDHB17 4845 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human estradiol 17 beta-dehydrogenase gene, complete cds. ACCESSION M27138 M27136 KEYWORDS Alu repetitive element; estradiol 17 beta-dehydrogenase. SOURCE Human, cDNA to mRNA, and leukocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1056 to 4149; exons only) AUTHORS Luu,T.V., Labrie,C., Zhao,H.F., Couet,J., Lachance,Y., Simard,J., Leblanc,G. and Labrie,F. TITLE Characterization of cDNAs for human estradiol 17 beta- dehydrogenase and assignment of the gene to chromosome 17: Evidence of two mRNA species with distinct 5'- termini in human placenta JOURNAL Mol. Endocrinol. 3, 1301-1309 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 4845) AUTHORS Luu,T.V., Lachance,Y., Labrie,C., Zhao,H.F., Simard,J., Couet,J., Leblanc,G. and Labrie,F. TITLE Structure of human 17 beta-hydroxysteroid dehydrogenase gene JOURNAL Mol. Endocrinol. (1989) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by F.Labrie, 15-AUG-1989. FEATURES from to/span description pept 1871 1967 estradiol 17 beta-dehydrogenase, exon 1 /nomgen="EDHB17" /map="17q11-q12" /hgml_locus_uid="LN0217N" 2061 2228 estradiol 17 beta-dehydrogenase, exon 2 2376 2555 estradiol 17 beta-dehydrogenase, exon 3 2731 2824 estradiol 17 beta-dehydrogenase, exon 4 3341 3518 estradiol 17 beta-dehydrogenase, exon 5 3605 3874 estradiol 17 beta-dehydrogenase, exon 6 pre-msg 910 4149 EDHB17 mRNA (alt.) pre-msg 1862 4149 EDHB17 mRNA (alt.) IVS 1968 2060 EDHB17 intron A IVS 2229 2375 EDHB17 intron B IVS 2556 2730 EDHB17 intron C IVS 2825 3340 EDHB17 intron D IVS 3519 3604 EDHB17 intron E rpt 113 392 Alu repeat copy A rpt 416 696 Alu repeat copy B rpt 711 950 Alu repeat copy C rpt 4008 4257 Alu repeat copy D rpt 4516 4780 Alu repeat copy E BASE COUNT 1032 A 1341 C 1449 G 1023 T ORIGIN Chromosome 17q11-q12. 1 ACACTGTCAT CTCTCCCTTC AATTACAGCA GTAGCCTCCT AACTAGTTTT CTTGATTCCA 61 CTCTTGCCCA TCAGCAGTGA AATTACCCCA GAGCAGTTAA AATGATCTTT TAGGATGGGC 121 ACAGTGTCAC GCCTGTAATA CCAGCACTTT GGGAGGTGGA GGCTGGCTGA TCCCCTTAGG 181 TTGGGAGTTC AAGACCATCC TCACCAACAG GGAGAAACCC CATCTCTACT AAAAATACAA 241 AAATTAGCCG CATGTGGTGG AACGTGTCTG TAATCCCAAC TACTTGGGAG GCTGAGACAG 301 GAGAACTGCT TGAACCCGGG AGGTGGAGGT TGCAGTGAGC CAGAACGTGC CACAGCACTC 361 CAGCCTGGGC AAGAAGCGCG AAACTCCGTC TCAAAATAAA ATAATCTTCT AAAAATGACA 421 GGGCCAGGTG GGGTGGCACT TTTTTATAAT CCGAGCACTT TGGGAGGCTG AGGTGGGCAG 481 ATCGCTTGAC ATCAGGGGTT TGAGACCAGC CTGGCCAACA TGGTGAAACT CCGTCTCTAC 541 TAAAAATACT AAAAATTAGC TGGGCGTGGT GGCGGGTGCC TGTAATCCCA GCTACTCGGG 601 AGGCTGATGC AGGAGAATCG CCTGAACCAG GCAGATGCAG GATGCAGTGA GCCAAGATTA 661 AGACACTGCA CTCCAACCTA AGCAATACTT TGTCTCAAAA ATAAAAAAAA GCCTGGGAAA 721 CAAAGTGAGA CCCCGTCTCT ACAAAAAAGT CAAAAAATTA GCTGGGTATG GTGGCAGTGA 781 TGGCACACAC CTGTAGTCCC GGCTACTTGG GAAGCTTTTT AATATTTTTT GCAGAGACCG 841 GGTCTCACTC TGTTACCTGG CCTGGTCTTG AACTCCTGGG CTCCAACAAT CCCCTTTCCT 901 GGGGCCTCCC AAAGTGCTGG GATTAGAGGC ATGAGCCACC GTGCCCAGCC TCAAAGCATA 961 TTTTAAAGGA TAGAAATAAA CAGCCATATG AAGAGATACA GACAGGGCGG TCTGGAAGGG 1021 TCCAGAGCAG GAGCTTCTAT CTCCATAGAG TTTGGGGTTA CGTCACCCTC TGGGCACATT 1081 CTGTCAGCCT CCACACGTTC AGCTCTCAGA AGCTCCCGAA CCCTGTCCTT TGGGCCTTTT 1141 ATGGAGAACT CCATTGGCTG TCCATGACTG AAGCATGGAC AACTGTGATA ATGTGATTGG 1201 GCAAAAAGGG TCTGATCTAA GCCCAGCAAG GCCAGTCCAG ATTCTTTGGG CCTTTGTGCA 1261 GCATTCCTTT CTCCAGGGTA TGGGGCAAGG ACCCACTCTG GAATGAGGAT CCTACAACCC 1321 ACAATCAGAT TAGAGTCCTG CCTTGGGCAG CTGAAAAGAG GACAGGAGAA GGTCAGAGAG 1381 ACGAAAGGCT GTTTTTTGAG GCCTGAGGCA CCCCAACATG ACAACGTAAG ACTGTAACCA 1441 TGGTCATGTG AGTTATGAGC TAGGAACCCT GGACGAAACC AACACATATA CAATCATCTC 1501 CCACCTCCCA ACGCCTTTAC TTTCACAGCC TCTGCAGCAA ACTGCGGTCA CTATAATCGC 1561 TCCTGTGGCA CAGAGGCATA CCCAGGGGAA TCTGCCCAGG GGGCCACTCT GTGCCCACGT 1621 GGGAACCCAC ACCTGCTTGT AAAGCCTCCC CTCCCTCTGA CCAGCAACCA GGACAGTTTG 1681 TTGTTCCAAG CAGTGGGCTC ATGTCTGTTT TGGCTCAGAA CAGGGTGGGG AGAGCGGGCC 1741 AGGGACCCGC AGGAAGGCTT ATCCTTGAGA TTGCGTGGGA GACACAACAA GGGGTGGGGG 1801 CCCGCAGGCG GGGCGGGGCG AAGCAGGTGA TATCAAGCCC AGAGCCCCAG CCTCTCCCCA 1861 CAGTCTCACC ATGGCCCGCA CCGTGGTGCT CATCACCGGC TGTTCCTCGG GCATCGGCCT 1921 GCACTTGGCC GTACGTCTGG CTTCAGATCC ATCCCAGAGC TTCAAAGGTA TAGATAGGCA 1981 GGGACAGGGA GGGAGAGAAG GGAGGAGCCC TTGGAGGCTA GAAGGGAAGT CAGATCTTCC 2041 TCCTCTCCCA AAACCTCCAG TGTATGCCAC GTTGAGGGAC CTGAAAACAC AGGGCCGGCT 2101 GTGGGAGGCG GCCCGGGCCC TGGCATGCCC TCCGGGATCC CTGGAGACGT TGCAGCTGGA 2161 CGTAAGGGAC TCAAAATCCG TGGCCGCTGC CCGGGAACGC GTGACTGAGG GCCGCGTGGA 2221 CGTGCTGGGT GAGCCTCCTG GAAGCATATG GGCTCCTAGG AGCCTTCTCC GCCCTGCGTT 2281 GAAACCAACA TGTTCCCAGG CCCAGGGAGC ACGAGGGGAC AGGCCGTGCT GAGGGTGATG 2341 CTGAGGCGGG CTGGTCGGGC CTCTTGTCTC CGCAGTGTGT AACGCAGGCC TGGGCCTGCT 2401 GGGGCCGCTG GAGGCGCTGG GGGAGGACGC CGTGGCCTCT GTGCTGGACG TGAATGTAGT 2461 AGGGACTGTG CGGATGCTGC AGGCCTTCCT GCCAGACATG AAGAGGCGCG GTTCGGGACG 2521 CGTGTTGGTG ACCGGGAGCG TGGGAGGATT GATGGGTGAG TGGTAGGGAG TGGCCTCGGC 2581 AGCTCCAGAT TCTTTGTGTG CGGAGCTGAG CCTTGAAGGC AGGTTCCGCG GGGAGGGTGG 2641 AGGGGGGTGC CGTCAGCTTG GAGGGGCACC GTCTGCCCGG GGATGACCCC CTGGCCGCTG 2701 CGCCTCAGGA ACCTCGTCTC CCCACCTAAG GGCTGCCTTT CAATGACGTT TATTGCGCCA 2761 GCAAGTTCGC GCTCGAAGGC TTATGCGAGA GTCTGGCGGT TCTGCTGCTG CCCTTTGGGG 2821 TCCAGTGAGT CAACACCCCC GTTCCCCGAA CCCTCTTAAC TCTGACCTAG AGATGCCGAG 2881 CACCCTGTCC TGCGGGAGCC GCTCTGGGGC GATCTCCCTG GCCCTCTCTG CCCGGCTCAC 2941 ATTAGCTGTG TGCCAGGCAC TTGGGCTGGG CGCATGGCAC GCATTGTGCC ACCTGCTGAC 3001 CTAGCTGCTG AAGTGTTGGT ATTGTTATGG GGAAGCTCCA GCCCAGAGAG GTTAGGTGAC 3061 TGGCCCAAGG TCACACAGCG GCCAGGGACC CCGCTAGATT CGAATCCTGA CACCAGGGCT 3121 CCCTTGTAGC CTCAGATGGA TTTGGGAGGG CTGCTCCGGC AGGAACCCGC GTTTCAAATG 3181 TTCTGGTTAT CCCCAGCGCC CTTTCCGCCT CACTTCCCAG CGGACGGTGC TGCTCGCGGT 3241 CGGGGGCCGG GACGTGGTTG GGGCTGGGAC TGGGGTTGGG GCTGGGACTG GGGCCTGGCT 3301 GGCGTCCGCC CCCTCCCACT CGTTGCTCTC CGGCCAGCAG CTTGAGCCTG ATCGAGTGCG 3361 GCCCAGTGCA CACCGCCTTC ATGGAGAAGG TGTTGGGCAG CCCAGAGGAG GTGCTGGACC 3421 GCACGGACAT CCACACCTTC CACCGCTTCT ACCAATACCT CGCCCACAGC AAGCAAGTCT 3481 TTCGCGAGGC GGCGCAGAAC CCTGAGGAGG TGGCGGAGGT GAGCGCCGGG CTGGACTCCA 3541 GGAGTGGGGG CGGTGCGTCC TCCGGCCGGC AGCGGTGGCC ACAGCTCTCC TCCCGCCGCC 3601 GCAGGTCTTC CTCACCGCTT TGCGCGCCCC GAAGCCGACC CTGCGCTACT TCACCACCGA 3661 GCGCTTCCTG CCCCTGCTGC GGATGCGCCT GGACGACCCC AGCGGCTCCA ACTACGTCAC 3721 CGCCATGCAC CGGGAAGTGT TCGGCGACGT TCCGGCAAAG GCCGAGGCTG GGGCCGAGGC 3781 TGGGGGCGGG GCCGGGCCTG GGGCAGAGGA CGAGGCCGGG CGCAGTGCGG TGGGGGACCC 3841 TGAGCTCGGC GATCCTCCGG CCGCCCCGCA GTAAAGGCTT CCTCAGCCGC TGTCTCCCGC 3901 GCCCTTCTTT GTCCCCTGGG TCTGTGTGGT CCCTGGGGAT GGGGCGGCGG TAGCAGCTGT 3961 GGGTGGCTAA TTAAGATAGA TCGCGTTAGC CAGTTTTACC AGCGCAGCTA GGCGCGATGG 4021 CTGTCGCCTG TAATGCCAGC GCTTTGGGAG GCGGAGGCAG GAGGATCGCT CAAGCCCCGG 4081 AGTTGGAGAC CAGCCAGAGC AACACAGTGA GACCCCCATC TCTACAAAAA TAAAGAAAAT 4141 TTAAAAATCA GCACAGTGGC ACCATTCCTT GAGCCCAGGA GTTGGAGGGT GCAGTGAGCA 4201 TGATGGGGCC ACTGCACTCC AGCCTGGGTG ACAGAGTGAG ACCCTGTCAA TTAATCAAAT 4261 GAACCAACCA ACCGAAAAAC TCAGAAACCA AGGTCCAGAA AGAAGCCAGC CCAGGATCAC 4321 ACCTCAAGTC CATATTAAAG GCCAGACACA GTCTCTGGAT ACCCAAGGGG CATCTGCAGG 4381 AGGAGTTAGG TGGGAATTGG CTTGGGGTAG AGTCAGGTTG TGACTTGGAC CCCATTAGCC 4441 ATGAGACCTC AGGCAAGTTC CTTGCTTTCT CTCAGTCTTT CTTTTCTTTC TTTCTTTCTT 4501 TTTTTTTTAT TTTCTGAGAC AGAGTTTCGC TCTTGTTGCC TAGGCTGGAG TGCAATGGGG 4561 CAATCTCGGC TCACCACAAT CTCCGCTTCC CAGATTCAAG TGATTCTCCT GCCTCAGCCT 4621 CCCAAGTAGC TGGGATACGG CTAAGATTTG TATGTTCGGT AGAGACGGGG TTTCTCCATG 4681 TTGGTCAGGC TGGTCTCAAA CTCTCGACCT TGGTGATCCG ACCGCCTCGG CCTCTCAGAT 4741 TGCTGGGCAT CGGCATGAGC ACCGCGACCG GCCCTCTGAG TCTTTCTTAT CTGTAAAATG 4801 GGTATATAAT ACCTATCTAA ACGGTTTTAG TAATGGTGGA GATAT // LOCUS HUMEF1A 4695 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human elongation factor EF-1-alpha gene, complete cds. ACCESSION J04617 J04616 KEYWORDS elongation factor. SOURCE Human placenta DNA, clone pEFG1, and fibroblast cell line GM 637, cDNA to mRNA, (library of H.Okayama), clone pAN7. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4695) AUTHORS Uetsuki,T., Naito,A., Nagata,S. and Kaziro,Y. TITLE Isolation and characterization of the human chromosomal gene for polypeptide chain elongation factor 1-alpha JOURNAL J. Biol. Chem. 264, 5791-5798 (1989) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by S.Nagata, 20-JAN-1989. FEATURES from to/span description pept 1582 1725 elongation factor EF-1-alpha, exon 2 /nomgen="EF1A" /map="LK0088Q" 2092 2271 elongation factor EF-1-alpha, exon 3 2377 2673 elongation factor EF-1-alpha, exon 4 2757 2907 elongation factor EF-1-alpha, exon 5 2995 3251 elongation factor EF-1-alpha, exon 6 3341 3575 elongation factor EF-1-alpha, exon 7 3671 3795 elongation factor EF-1-alpha, exon 8 pre-msg 576 4087 EF-1-alpha mRNA and introns IVS 609 1551 EF-1-alpha intron A IVS 1726 2091 EF-1-alpha intron B IVS 2272 2376 EF-1-alpha intron C IVS 2674 2756 EF-1-alpha intron D IVS 2908 2994 EF-1-alpha intron E IVS 3252 3340 EF-1-alpha intron F IVS 3576 3670 EF-1-alpha intron G binding 205 214 Sp1 binding site binding 320 328 Sp1 binding site binding 332 340 Sp1 binding site signal 546 552 TATA box binding 983 992 Sp1 binding site binding 1026 1034 Sp1 binding site binding 1122 1131 Sp1 binding site binding 1132 1141 Sp1 binding site binding 1240 1249 Sp1 binding site binding 1302 1308 Ap1 binding site BASE COUNT 1200 A 989 C 1235 G 1271 T ORIGIN 3 bp upstream of SmaI site. 1 CCCGGGCTGG GCTGAGACCC GCAGAGGAAG ACGCTCTAGG GATTTGTCCC GGACTAGCGA 61 GATGGCAAGG CTGAGGACGG GAGGCTGATT GAGAGGCGAA GGTACACCCT AATCTCAATA 121 CAACCTTTGG AGCTAAGCCA GCAATGGTAG AGGGAAGATT CTGCACGTCC CTTCCAGGCG 181 GCCTCCCCGT CACCACCCCC CCCAACCCGC CCCGACCGGA GCTGAGAGTA ATTCATACAA 241 AAGGACTCGC CCCTGCCTTG GGGAATCCCA GGGACCGTCG TTAAACTCCC ACTAACGTAG 301 AACCCAGAGA TCGCTGCGTT CCCGCCCCCT CACCCGCCCG CTCTCGTCAT CACTGAGGTG 361 GAGAAGAGCA TGCGTGAGGC TCCGGTGCCC GTCAGTGGGC AGAGCGCACA TCGCCCACAG 421 TCCCCGAGAA GTTGGGGGGA GGGGTCGGCA ATTGAACCGG TGCCTAGAGA AGGTGGCGCG 481 GGGTAAACTG GGAAAGTGAT GTCGTGTACT GGCTCCGCCT TTTTCCCGAG GGTGGGGGAG 541 AACCGTATAT AAGTGCAGTA GTCGCCGTGA ACGTTCTTTT TCGCAACGGG TTTGCCGCCA 601 GAACACAGGT AAGTGCCGTG TGTGGTTCCC GCGGGCCTGG CCTCTTTACG GGTTATGGCC 661 CTTGCGTGCC TTGAATTACT TCCACGCCCC TGGCTGCAGT ACGTGATTCT TGATCCCGAG 721 CTTCGGGTTG GAAGTGGGTG GGAGAGTTCG AGGCCTTGCG CTTAAGGAGC CCCTTCGCCT 781 CGTGCTTGAG TTGAGGCCTG GCCTGGGCGC TGGGGCCGCC GCGTGCGAAT CTGGTGGCAC 841 CTTCGCGCCT GTCTCGCTGC TTTCGATAAG TCTCTAGCCA TTTAAAATTT TTGATGACCT 901 GCTGCGACGC TTTTTTTCTG GCAAGATAGT CTTGTAAATG CGGGCCAAGA TCTGCACACT 961 GGTATTTCGG TTTTTGGGGC CGCGGGCGGC GACGGGGCCC GTGCGTCCCA GCGCACATGT 1021 TCGGCGAGGC GGGGCCTGCG AGCGCGGCCA CCGAGAATCG GACGGGGGTA GTCTCAAGCT 1081 GGCCGGCCTG CTCTGGTGCC TGGCCTCGCG CCGCCGTGTA TCGCCCCGCC CTGGGCGGCA 1141 AGGCTGGCCC GGTCGGCACC AGTTGCGTGA GCGGAAAGAT GGCCGCTTCC CGGCCCTGCT 1201 GCAGGGAGCT CAAAATGGAG GACGCGGCGC TCGGGAGAGC GGGCGGGTGA GTCACCCACA 1261 CAAAGGAAAA GGGCCTTTCC GTCCTCAGCC GTCGCTTCAT GTGACTCCAC GGAGTACCGG 1321 GCGCCGTCCA GGCACCTCGA TTAGTTCTCG AGCTTTTGGA GTACGTCGTC TTTAGGTTGG 1381 GGGGAGGGGT TTTATGCGAT GGAGTTTCCC CACACTGAGT GGGTGGAGAC TGAAGTTAGG 1441 CCAGCTTGGC ACTTGATGTA ATTCTCCTTG GAATTTGCCC TTTTTGAGTT TGGATCTTGG 1501 TTCATTCTCA AGCCTCAGAC AGTGGTTCAA AGTTTTTTTC TTCCATTTCA GGTGTCGTGA 1561 AAACTACCCC TAAAAGCCAA AATGGGAAAG GAAAAGACTC ATATCAACAT TGTCGTCATT 1621 GGACACGTAG ATTCGGGCAA GTCCACCACT ACTGGCCATC TGATCTATAA ATGCGGTGGC 1681 ATCGACAAAA GAACCATTGA AAAATTTGAG AAGGAGGCTG CTGAGGTATG TTTAATACCA 1741 GAAAGGGAAA GATCAACTAA AATGAGTTTT ACCAGCAGAA TCATTAGGTG ATTTCCCCAG 1801 AACTAGTGAG TGGTTTAGAT CTGAATGCTA ATAGTTAAGA CCTTACTTAT GAAATAATTT 1861 TGCTTTTGGT GACTTCTGTA ATCGTATTGC TAGTGAGTAG ATTTGGATGT TAATAGTTAA 1921 GATCCTACTT ATAAAAGTTT GATTTTTGGT TGCTTCTGTA ACCCAAAGTG ACCAAAATCA 1981 CTTTGGACTT GGAGTTGTAA AGTGGAAACT GCCAATTAAG GGCTGGGGAC AAGGAAATTG 2041 AAGCTGGAGT TTGTGTTTTA GTAACCAAGT AACGACTCTT AATCCTTACA GATGGGAAAG 2101 GGCTCCTTCA AGTATGCCTG GGTCTTGGAT AAACTGAAAG CTGAGCGTGA ACGTGGTATC 2161 ACCATTGATA TCTCCTTGTG GAAATTTGAG ACCAGCAAGT ACTATGTGAC TATCATTGAT 2221 GCCCCAGGAC ACAGAGACTT TATCAAAAAC ATGATTACAG GGACATCTCA GGTTGGGATT 2281 AATAATTCTA GGTTTCTTTA TCCCAAAAGG CTTGCTTTGT ACACTGGTTT TGTCATTTGG 2341 AGAGTTGACA GGGATATGTC TTTGCTTTCT TTAAAGGCTG ACTGTGCTGT CCTGATTGTT 2401 GCTGCTGGTG TTGGTGAATT TGAAGCTGGT ATCTCCAAGA ATGGGCAGAC CCGAGAGCAT 2461 GCCCTTCTGG CTTACACACT GGGTGTGAAA CAACTAATTG TCGGTGTTAA CAAAATGGAT 2521 TCCACTGAGC CACCCTACAG CCAGAAGAGA TATGAGGAAA TTGTTAAGGA AGTCAGCACT 2581 TACATTAAGA AAATTGGCTA CAACCCCGAC ACAGTAGCAT TTGTGCCAAT TTCTGGTTGG 2641 AATGGTGACA ACATGCTGGA GCCAAGTGCT AACGTAAGTG GCTTTCAAGA CCATTGTTAA 2701 AAAGCTCTGG GAATGGCGAT TTCATGCTTA CACAAATTGG CATGCTTGTG TTTCAGATGC 2761 CTTGGTTCAA GGGATGGAAA GTCACCCGTA AGGATGGCAA TGCCAGTGGA ACCACGCTGC 2821 TTGAGGCTCT GGACTGCATC CTACCACCAA CTCGTCCAAC TGACAAGCCC TTGCGCCTGC 2881 CTCTCCAGGA TGTCTACAAA ATTGGTGGTA AGTTGGCTGT AAACAAAGTT GAATTTGAGT 2941 TGATAGAGTA CTGTCTGCCT TCATAGGTAT TTAGTATGCT GTAAATATTT TTAGGTATTG 3001 GTACTGTTCC TGTTGGCCGA GTGGAGACTG GTGTTCTCAA ACCCGGTATG GTGGTCACCT 3061 TTGCTCCAGT CAACGTTACA ACGGAAGTAA AATCTGTCGA AATGCACCAT GAAGCTTTGA 3121 GTGAAGCTCT TCCTGGGGAC AATGTGGGCT TCAATGTCAA GAATGTGTCT GTCAAGGATG 3181 TTCGTCGTGG CAACGTTGCT GGTGACAGCA AAAATGACCC ACCAATGGAA GCAGCTGGCT 3241 TCACTGCTCA GGTAACAATT TAAAGTAACA TTAACTTATT GCAGAGGCTA AAGTCATTTG 3301 AGACTTTGGA TTTGCACTGA ATGCAAATCT TTTTTCCAAG GTGATTATCC TGAACCATCC 3361 AGGCCAAATA AGCGCCGGCT ATGCCCCTGT ATTGGATTGC CACACGGCTC ACATTGCATG 3421 CAAGTTTGCT GAGCTGAAGG AAAAGATTGA TCGCCGTTCT GGTAAAAAGC TGGAAGATGG 3481 CCCTAAATTC TTGAAGTCTG GTGATGCTGC CATTGTTGAT ATGGTTCCTG GCAAGCCCAT 3541 GTGTGTTGAG AGCTTCTCAG ACTATCCACC TTTGGGTAAG GATGACTACT TAAATGTAAA 3601 AAAGTTGTGT TAAAGATGAA AAATACAACT GAACAGTACT TTGGGTAATA ATTAACTTTT 3661 TTTTTAATAG GTCGCTTTGC TGTTCGTGAT ATGAGACAGA CAGTTGCGGT GGGTGTCATC 3721 AAAGCAGTGG ACAAGAAGGC TGCTGGAGCT GGCAAGGTCA CCAAGTCTGC CCAGAAAGCT 3781 CAGAAGGCTA AATGAATATT ATCCCTAATA CCTGCCACCC CACTCTTAAT CAGTGGTGGA 3841 AGAACGGTCT CAGAACTGTT TGTTTCAATT GGCCATTTAA GTTTAGTAGT AAAAGACTGG 3901 TTAATGATAA CAATGCATCG TAAAACCTTC AGAAGGAAAG GAGAATGTTT TGTGGACCAC 3961 TTTGGTTTTC TTTTTTGCGT GTGGCAGTTT TAAGTTATTA GTTTTTAAAA TCAGTACTTT 4021 TTAATGGAAA CAACTTGACC AAAAATTTGT CACAGAATTT TGAGACCCAT TAAAAAAGTT 4081 AAATGAGAAA CCTGTGTGTT CCTTTGGTCA ACACCGAGAC ATTTAGGTGA AAGACATCTA 4141 ATTCTGGTTT TACGAATCTG GAAACTTCTT GAAAATGTAA TTCTTGAGTT AACACTTCTG 4201 GGTGGAGAAT AGGGTTGTTT TCCCCCCACA TAATTGGAAG GGGAAGGAAT ATCATTTAAA 4261 GCTATGGGAG GGTTTCTTTG ATTACAACAC TGGAGAGAAA TGCAGCATGT TGCTGATTGC 4321 CTGTCACTAA AACAGGCCAA AAACTGAGTC CTTGGGTTGC ATAGAAAGCT TCATGTTGCT 4381 AAACCAATGT TAAGTGAATC TTTGGAAACA AAATGTTTCC AAATTACTGG GATGTGCATG 4441 TTGAAACGTG GGTTAAAATG ACTGGGCAGT GAAAGTTGAC TATTTGCCAT GACATAAGAA 4501 ATAAGTGTAG TGGCTAGTGT ACACCCTATG AGTGGAAGGG TCCATTTTGA AGTCAGTGGA 4561 GTAAGCTTTA TGCCATTTTG ATGGTTTCAC AAGTTCTATT GAGTGCTATT CAGAATAGGA 4621 ACAAGGTTCT AATAGAAAAA GATGGCAATT TGAAGTAGCT ATAAAATTAG ACTAATTACA 4681 TTGCTTTTCT CCGAC // LOCUS HUMEGFA1 757 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human epidermal growth factor (EGF) with HER2 promoter, exon 1. ACCESSION M16789 KEYWORDS epidermal growth factor; promoter. SEGMENT 1 of 4 SOURCE Human DNA, clone lambda-n2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 757) AUTHORS Tal,M., King,C.R., Kraus,M.H., Ullrich,A., Schlessinger,J. and Givol,D. TITLE Human HER2 (neu) promoter: Evidence for multiple mechanisms for transcriptional initiation JOURNAL Mol. Cell. Biol. 7, 2597-2601 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 678 + 750 epidermal growth factor, exon 1 /nomgen="EGF" /map="4q25" /hgml_locus_uid="LQ0069V" pre-msg 500 > 757 EGF mRNA and introns IVS 751 > 757 EGF intron A BASE COUNT 169 A 216 C 250 G 122 T ORIGIN 3 bp upstream of SmaI site; chromosome 4q25. 1 CCCGGGGGTC CTGGAAGCCA CAAGGTAAAC ACAACACATC CCCCTCCTTG ACTATGCAAT 61 TTTACTAGAG GATGTGGTGG GAAAACCATT ATTTGATATT AAAACAAATA GGCTTGGGAT 121 GGAGTAGGAT GCAAGCTCCC CAGGAAAGTT TAAGATAAAA CCTGAGACTT AAAAGGGTGT 181 TAAGAGTGGC AGCCTAGGGA ATTTATCCCG GACTCCGGGG GAGGGGGCAG AGTCACCAGC 241 CTCTGCATTT AGGGATTCTC CGAGGAAAAG TGTGAGAACG GCTGCAGGCA ACCCAGGCGT 301 CCCGGCGCTA GGAGGGACGA CCCAGGCCTG CGCGAAGAGA GGGAGAAAGT GAAGCTGGGA 361 GTTGCCGACT CCCAGACTTC GTTGGAATGC AGTTGGAGGG GGCGAGCTGG GAGCGCGCTT 421 GCTCCCAATC ACAGGAGAAG GAGGAGGTGG AGGAGGAGGG CTGCTTGAGG AAGTATAAGA 481 ATGAAGTTGT GAAGCTGAGA TTCCCCTCCA TTGGGACCGG AGAAACCAGG GGAGCCCCCC 541 GGGCAGCCGC GCGCCCCTTC CCACGGGGCC CTTTACTGCG CCGCGCGCCC GGCCCCCACC 601 CCTCGCAGCA CCCCGCGCCC CGCGCCCTCC CAGCCGGGTC CAGCCGGAGC CATGGGGCCG 661 GAGCCGCAGT GAGCACCATG GAGCTGGCGG CCTTGTGCCG CTGGGGGCTC CTCCTCGCCC 721 TCTTGCCCCC CGGAGCCGCG AGCACCCAAG GTGGGTC // LOCUS HUMEGFA2 174 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human epidermal growth factor (EGF) gene, exon 2. ACCESSION M16790 KEYWORDS epidermal growth factor. SEGMENT 2 of 4 SOURCE Human DNA, clone lambda-n2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 174) AUTHORS Tal,M., King,C.R., Kraus,M.H., Ullrich,A., Schlessinger,J. and Givol,D. TITLE Human HER2 (neu) promoter: Evidence for multiple mechanisms for transcriptional initiation JOURNAL Mol. Cell. Biol. 7, 2597-2601 (1987) STANDARD simple staff_entry FEATURES from to/span description pept + 15 + 166 epidermal growth factor, exon 2 /nomgen="EGF" /map="4q25" /hgml_locus_uid="LQ0069V" pre-msg < 1 > 174 EGF mRNA and introns IVS < 1 14 EGF intron A IVS 167 > 174 EGF intron B BASE COUNT 32 A 69 C 43 G 30 T ORIGIN About 7 kb after segment 1; chromosome 4q25. 1 CTCTCTCCTG CCAGTGTGCA CCGGCACAGA CATGAAGCTG CGGCTCCCTG CCAGTCCCGA 61 GACCCACCTG GACATGCTCC GCCACCTCTA CCAGGGCTGC CAGGTGGTGC AGGGAAACCT 121 GGAACTCACC TACCTGCCCA CCAATGCCAG CCTGTCCTTC CTGCAGGTAG GCCC // LOCUS HUMEGFA3 233 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human epidermal growth factor (EGF) gene, exon 3. ACCESSION M16791 KEYWORDS epidermal growth factor. SEGMENT 3 of 4 SOURCE Human DNA, clone lambda-n2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 233) AUTHORS Tal,M., King,C.R., Kraus,M.H., Ullrich,A., Schlessinger,J. and Givol,D. TITLE Human HER2 (neu) promoter: Evidence for multiple mechanisms for transcriptional initiation JOURNAL Mol. Cell. Biol. 7, 2597-2601 (1987) STANDARD simple staff_entry FEATURES from to/span description pept + 15 + 228 epidermal growth factor, exon 3 /nomgen="EGF" /map="4q25" /hgml_locus_uid="LQ0069V" pre-msg < 1 > 233 EGF mRNA and introns IVS < 1 14 EGF intron B IVS 229 > 233 EGF intron C BASE COUNT 49 A 77 C 68 G 39 T ORIGIN About 1.2 kb after segment 2; chromosome 4q25. 1 CTTCCCCCTC CCAGGATATC CAGGAGGTGC AGGGCTACGT GCTCATCGCT CACAACCAAG 61 TGAGGCAGGT CCCACTGCAG AGGCTGCGGA TTGTGCGAGG CACCCAGCTC TTTGAGGACA 121 ACTATGCCCT GGCCGTGCTA GACAATGGAG ACCCGCTGAA CAATACCACC CCTGTCACAG 181 GGGCCTCCCC AGGAGGCCTG CGGGAGCTGC AGCTTCGAAG CCTCACAGGT GGC // LOCUS HUMEGFA4 147 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human epidermal growth factor (EGF) gene, exon 4. ACCESSION M16792 KEYWORDS epidermal growth factor. SEGMENT 4 of 4 SOURCE Human DNA, clone lambda-n2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 147) AUTHORS Tal,M., King,C.R., Kraus,M.H., Ullrich,A., Schlessinger,J. and Givol,D. TITLE Human HER2 (neu) promoter: Evidence for multiple mechanisms for transcriptional initiation JOURNAL Mol. Cell. Biol. 7, 2597-2601 (1987) STANDARD simple staff_entry FEATURES from to/span description pept + 9 / 143 epidermal growth factor, exon 4 /nomgen="EGF" /map="4q25" /hgml_locus_uid="LQ0069V" pre-msg < 1 > 147 EGF mRNA and introns IVS < 1 8 EGF intron C IVS 144 > 147 EGF intron D BASE COUNT 40 A 46 C 34 G 27 T ORIGIN About 0.8 kb after segment 3; chromosome 4q25. 1 CCCCACAGAG ATCTTGAAAG GAGGGGTCTT GATCCAGCGG AACCCCCAGC TCTGCTACCA 61 GGACACGATT TTGTGGAAGG ACATCTTCCA CAAGAACAAC CAGCTGGCTC TCACACTGAT 121 AGACACCAAC CGCTCTCGGG CCTGTAA // LOCUS HUMEGFRG 560 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human epidermal growth factor receptor gene, promoter region and exon 1. ACCESSION M11234 KEYWORDS epidermal growth factor receptor. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 560) AUTHORS Ishii,S., Xu,Y.-H., Stratton,R.H., Roe,B.A., Merlino,G.T. and Pastan,I. TITLE Characterization and sequence of the promoter region of the human epidermal growth factor receptor gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 4920-4924 (1985) STANDARD simple staff_review FEATURES from to/span description pept 466 / 553 EGF receptor, exon 1 /nomgen="EGFR" /map="7p13-p12" /hgml_locus_uid="LJ0075L" IVS 554 > 560 intron A BASE COUNT 60 A 253 C 171 G 76 T ORIGIN Chromosome 7p13-p12. 1 CCTCCCATGC GCCGCCCCAC TCGCCGGAGA CTAGGCCCGC GGGGGCCACC GCTGTCCACC 61 GCCTCCGGCC GCTGGCCTTG GGTCCCCGCT GCTGGTTCTC CTCCTCCTCC TCGCATTCTC 121 CTCCTCCTCT GCTGCTCCCG ATCCCTCCTC CGCCGCCTGG TCCCTCCTCC TCCCGCCCTG 181 CCTCCCGCGC CTCGGCCCGC GCGAGCTAGA CGTTCGGGCA GCCCCCGGCG CAGCGCGGCC 241 GCAGCGCCTC CGCCCCCCGC ACGGTGTGAG CGCCCGCCGC GCCGAGGCGG CCGGAGTCCC 301 GAGCTAGCCC CGGCGCCGCC GCCGCCCAGA CCGGACACAG GCCACCTCGT CGGCGTCCGC 361 CCGAGTCCCC GCCTCGCCGC CAACGCCACA ACCACCGCGC ACGGCCCCCC TCACTCCGTC 421 CAGTATTGAT CGGGAGAGCC GGAGCGAGCT CTTCGGGGAG CAGCGATGCG ACCCTCCGGG 481 ACGGCCGGGG CAGCGCTCCT GGCGCTGCTG GCTGCGCTCT GCCCGGCGAG TCGGGCTCTG 541 GAGGAAAAGA AAGGTAAGGG // LOCUS HUMEL02 211 bp ds-DNA PRI 15-MAR-1988 DEFINITION Human elastin gene, exon 18. ACCESSION M17265 J02948 KEYWORDS alternative splicing; elastin. SEGMENT 2 of 20 SOURCE Human fetal aorta, cDNA to mRNA, clones cHEL[2,3,4]; DNA, clones HEL[1,2,3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 211) AUTHORS Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N., Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J. TITLE Alternative splicing of human elastin mRNA indicated by sequence analysis of cloned genomic and complementary DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 5680-5684 (1987) STANDARD full staff_review FEATURES from to/span description pept + 55 + 201 elastin A precursor, exon 2 /nomgen="ELN" /map="2q31-qter" /hgml_locus_uid="LC0146D" pep$ + 55 + 201 elastin B precursor, exon 2 pep$ + 55 + 201 elastin C precursor, exon 2 matp + 55 + 201 elastin A matp + 55 + 201 elastin B matp + 55 + 201 elastin C pre-msg < 1 > 211 elastin mRNA and intron IVS < 1 54 elastin A intron A IVS < 202 > 211 elastin A intron B IVS < 1 54 elastin B intron A IVS 202 > 211 elastin B intron B IVS < 1 54 elastin C intron A IVS 202 > 211 elastin C intron B BASE COUNT 29 A 52 C 76 G 54 T ORIGIN Undetermined number of bp after segment 1. 1 CTAGCCCCTC TGAGGTTCCC ATAGGTTAGG GGAACAATGC TTTTTCTTCC ACAGGAGCTG 61 CTGCAGGCTT AGTGCCTGGT GGGCCAGGCT TTGGCCCGGG AGTAGTTGGT GTCCCAGGAG 121 CTGGCGTTCC AGGTGTTGGT GTCCCAGGAG CTGGGATTCC AGTTGTCCCA GGTGCTGGGA 181 TCCCAGGTGC TGCGGTTCCA GGTGAGCTGG G // LOCUS HUMEL03 118 bp ds-DNA PRI 15-MAR-1988 DEFINITION Human elastin gene, exon 17. ACCESSION M17266 J02948 KEYWORDS alternative splicing; elastin. SEGMENT 3 of 20 SOURCE Human fetal aorta, cDNA to mRNA, clones cHEL[2,3,4]; DNA, clones HEL[1,2,3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 118) AUTHORS Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N., Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J. TITLE Alternative splicing of human elastin mRNA indicated by sequence analysis of cloned genomic and complementary DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 5680-5684 (1987) STANDARD full staff_review FEATURES from to/span description pept + 55 + 108 elastin A precursor, exon 3 /nomgen="ELN" /map="2q31-qter" /hgml_locus_uid="LC0146D" pep$ + 55 + 108 elastin B precursor, exon 3 pep$ + 55 + 108 elastin C precursor, exon 3 matp + 55 + 108 elastin A matp + 55 + 108 elastin B matp + 55 + 108 elastin C pre-msg < 1 > 118 elastin mRNA and intron IVS < 1 54 elastin A intron B IVS 109 > 118 elastin A intron C IVS < 1 54 elastin B intron B IVS 109 > 118 elastin B intron C IVS < 1 54 elastin C intron B IVS 109 > 118 elastin C intron C BASE COUNT 23 A 31 C 34 G 30 T ORIGIN About 1.1 kb after segment 2. 1 TGCTGCCTCC AATGCTGCTG CCTGAGCATG TTGTGTCCCT TTTGGTCTCT CCAGGGGTTG 61 TGTCACCAGA AGCAGCTGCT AAGGCAGCTG CAAAGGCAGC CAAATACGGT GAGTGCTA // LOCUS HUMEL04 229 bp ds-DNA PRI 15-MAR-1988 DEFINITION Human elastin gene, exon 16. ACCESSION M17267 J02948 KEYWORDS alternative splicing; elastin. SEGMENT 4 of 20 SOURCE Human fetal aorta, cDNA to mRNA, clones cHEL[2,3,4]; DNA, clones HEL[1,2,3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 229) AUTHORS Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N., Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J. TITLE Alternative splicing of human elastin mRNA indicated by sequence analysis of cloned genomic and complementary DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 5680-5684 (1987) STANDARD full staff_review FEATURES from to/span description pept + 55 + 219 elastin A precursor, exon 4 /nomgen="ELN" /map="2q31-qter" /hgml_locus_uid="LC0146D" pep$ + 55 + 219 elastin B precursor, exon 4 pep$ + 55 + 219 elastin C precursor, exon 4 matp + 55 + 219 elastin A matp + 55 + 219 elastin B matp + 55 + 219 elastin C pre-msg < 1 > 229 elastin mRNA and intron IVS < 1 54 elastin A intron C IVS 220 > 229 elastin A intron D IVS < 1 54 elastin B intron C IVS 220 > 229 elastin B intron D IVS < 1 54 elastin C intron C IVS 220 > 229 elastin C intron D BASE COUNT 26 A 63 C 79 G 61 T ORIGIN About 1.2 kb after segment 3. 1 GCCCAGCCTC TCTCACTGAG GCTTCTTTTC TACTTGGCTC CCTTCCCTCT GCAGGGGCCA 61 GGCCCGGAGT CGGAGTTGGA GGCATTCCTA CTTACGGGGT TGGAGCTGGG GGCTTTCCCG 121 GCTTTGGTGT CGGAGTCGGA GGTATCCCTG GAGTCGCAGG TGTCCCTAGT GTCGGAGGTG 181 TTCCCGGAGT CGGAGGTGTC CCGGGAGTTG GCATTTCCCG TGAGCCTTA // LOCUS HUMEL05 106 bp ds-DNA PRI 15-MAR-1988 DEFINITION Human elastin gene, exon 15. ACCESSION M17268 J02948 KEYWORDS alternative splicing; elastin. SEGMENT 5 of 20 SOURCE Human fetal aorta, cDNA to mRNA, clones cHEL[2,3,4]; DNA, clones HEL[1,2,3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 106) AUTHORS Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N., Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J. TITLE Alternative splicing of human elastin mRNA indicated by sequence analysis of cloned genomic and complementary DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 5680-5684 (1987) STANDARD full staff_review FEATURES from to/span description pept + 55 + 96 elastin A precursor, exon 5 /nomgen="ELN" /map="2q31-qter" /hgml_locus_uid="LC0146D" pep$ + 55 + 96 elastin B precursor, exon 5 pep$ + 55 + 96 elastin C precursor, exon 5 matp + 55 + 96 elastin A matp + 55 + 96 elastin B matp + 55 + 96 elastin C pre-msg < 1 > 106 elastin mRNA and intron IVS < 1 54 elastin A intron D IVS 97 > 106 elastin A intron E IVS < 1 54 elastin B intron D IVS 97 > 106 elastin B intron E IVS < 1 54 elastin C intron D IVS 97 > 106 elastin C intron E BASE COUNT 20 A 32 C 31 G 23 T ORIGIN About 0.1 kb after segment 4. 1 GAGGAGACCC AGGCACGGCT TCTGAGGGTC TCTATCTTTC TCGTTTCCTT GTAGCCGAAG 61 CTCAGGCAGC AGCTGCCGCC AAGGCTGCCA AGTACGGTAA GTGCCC // LOCUS HUMEL07 121 bp ds-DNA PRI 15-MAR-1988 DEFINITION Human elastin gene, exon 13. ACCESSION M17270 J02948 KEYWORDS alternative splicing; elastin. SEGMENT 7 of 20 SOURCE Human fetal aorta, cDNA to mRNA, clones cHEL[2,3,4]; DNA, clones HEL[1,2,3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 121) AUTHORS Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N., Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J. TITLE Alternative splicing of human elastin mRNA indicated by sequence analysis of cloned genomic and complementary DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 5680-5684 (1987) STANDARD full staff_review FEATURES from to/span description pept + 55 + 111 elastin B precursor, exon 6 /nomgen="ELN" /map="2q31-qter" /hgml_locus_uid="LC0146D" pep$ + 55 + 111 elastin C precursor, exon 6 matp + 55 + 111 elastin B matp + 55 + 111 elastin C pre-msg < 1 > 121 elastin mRNA and intron IVS < 1 > 121 elastin A intron E IVS < 1 54 elastin B intron E IVS 112 > 121 elastin B intron F IVS < 1 54 elastin C intron E IVS 112 > 121 elastin C intron F BASE COUNT 26 A 40 C 33 G 22 T ORIGIN About 0.3 kb after segment 6. 1 AGCAGGGAGG GGTGTGAGAG ATTACTCTCT CACCCCTTCT CTTCACACCT CCAGGAGTGG 61 GGACCCCAGC AGCTGCAGCT GCTAAAGCAG CCGCCAAAGC CGCCCAGTTT GGTAAGTCCC 121 C // LOCUS HUMEL08 226 bp ds-DNA PRI 15-MAR-1988 DEFINITION Human elastin gene, exon 12. ACCESSION M17271 J02948 KEYWORDS alternative splicing; elastin. SEGMENT 8 of 20 SOURCE Human fetal aorta, cDNA to mRNA, clones cHEL[2,3,4]; DNA, clones HEL[1,2,3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 226) AUTHORS Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N., Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J. TITLE Alternative splicing of human elastin mRNA indicated by sequence analysis of cloned genomic and complementary DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 5680-5684 (1987) STANDARD full staff_review FEATURES from to/span description pept + 55 + 216 elastin A precursor, exon 6 /nomgen="ELN" /map="2q31-qter" /hgml_locus_uid="LC0146D" pep$ + 55 + 216 elastin B precursor, exon 7 pep$ + 55 + 216 elastin C precursor, exon 7 matp + 55 + 216 elastin A matp + 55 + 216 elastin B matp + 55 + 216 elastin C pre-msg < 1 > 216 elastin mRNA and intron IVS < 1 54 elastin A intron E IVS 217 > 226 elastin A intron F IVS < 1 54 elastin B intron F IVS 112 > 216 elastin B intron G IVS < 1 54 elastin C intron F IVS 112 > 216 elastin C intron G BASE COUNT 19 A 51 C 82 G 74 T ORIGIN About 2.1 kb after segment 7. 1 TCTGTCCTCT TTGATCAGGT CTTGGTTAAT GATCAGCTCT TCTCAATCTT GCAGGGTTAG 61 TTCCTGGTGT CGGCGTGGCT CCTGGAGTTG GCGTGGCTCC TGGTGTCGGT GTGGCTCCTG 121 GAGTTGGCTT GGCTCCTGGA GTTGGCGTGG CTCCTGGAGT TGGTGTGGCT CCTGGCGTTG 181 GCGTGGCTCC CGGCATTGGC CCTGGTGGAG TTGCAGGTGA GTTTCA // LOCUS HUMEL09 109 bp ds-DNA PRI 15-MAR-1988 DEFINITION Human elastin gene, exon 11. ACCESSION M17272 J02948 KEYWORDS alternative splicing; elastin. SEGMENT 9 of 20 SOURCE Human fetal aorta, cDNA to mRNA, clones cHEL[2,3,4]; DNA, clones HEL[1,2,3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 109) AUTHORS Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N., Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J. TITLE Alternative splicing of human elastin mRNA indicated by sequence analysis of cloned genomic and complementary DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 5680-5684 (1987) STANDARD full staff_review FEATURES from to/span description pept + 55 + 99 elastin A precursor, exon 7 /nomgen="ELN" /map="2q31-qter" /hgml_locus_uid="LC0146D" pep$ + 55 + 99 elastin B precursor, exon 8 pep$ + 55 + 99 elastin C precursor, exon 8 matp + 55 + 99 elastin A matp + 55 + 99 elastin B matp + 55 + 99 elastin C pre-msg < 1 > 109 elastin mRNA and intron IVS < 1 54 elastin A intron F IVS 100 > 109 elastin A intron G IVS < 1 54 elastin B intron G IVS 100 > 109 elastin B intron H IVS < 1 54 elastin C intron G IVS 100 > 109 elastin C intron H BASE COUNT 18 A 48 C 21 G 22 T ORIGIN About 0.2 kb after segment 8. 1 AGCCTCCATG GGCCCCGCCT CCATCTCTAA TCCCCCTCTC TCTCCCTCCC TCAGCTGCAG 61 CAAAATCCGC TGCCAAGGTG GCTGCCAAAG CCCAGCTCCG TGAGTGCCT // LOCUS HUMEL10 190 bp ds-DNA PRI 15-MAR-1988 DEFINITION Human elastin gene, exon 10. ACCESSION M17273 J02948 KEYWORDS alternative splicing; elastin. SEGMENT 10 of 20 SOURCE Human fetal aorta, cDNA to mRNA, clones cHEL[2,3,4]; DNA, clones HEL[1,2,3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 190) AUTHORS Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N., Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J. TITLE Alternative splicing of human elastin mRNA indicated by sequence analysis of cloned genomic and complementary DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 5680-5684 (1987) STANDARD full staff_review FEATURES from to/span description pept + 55 + 180 elastin A precursor, exon 8 /nomgen="ELN" /map="2q31-qter" /hgml_locus_uid="LC0146D" pep$ + 55 + 180 elastin B precursor, exon 9 pep$ + 55 + 180 elastin C precursor, exon 9 matp + 55 + 180 elastin A matp + 55 + 180 elastin B matp + 55 + 180 elastin C pre-msg < 1 > 190 elastin mRNA and intron IVS < 1 54 elastin A intron G IVS 181 > 190 elastin A intron H IVS < 1 54 elastin B intron H IVS 181 > 190 elastin B intron I IVS < 1 54 elastin C intron H IVS 181 > 190 elastin C intron I BASE COUNT 18 A 49 C 69 G 54 T ORIGIN About 0.1 kb after segment 9. 1 TCCTTAGGGG CATGCTCCCT GCCTGCTGTC GCCACCACTG CCCTCTGTCT GCAGGAGCTG 61 CAGCTGGGCT TGGTGCTGGC ATCCCTGGAC TTGGAGTTGG TGTCGGCGTC CCTGGACTTG 121 GAGTTGGTGC TGGTGTTCCT GGACTTGGAG TTGGTGCTGG TGTTCCTGGC TTCGGGGCAG 181 GTGCAGATGA // LOCUS HUMEL11 109 bp ds-DNA PRI 15-MAR-1988 DEFINITION Human elastin gene, exon 10A. ACCESSION M17274 J02948 KEYWORDS alternative splicing; elastin. SEGMENT 11 of 20 SOURCE Human fetal aorta, cDNA to mRNA, clones cHEL[2,3,4]; DNA, clones HEL[1,2,3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 109) AUTHORS Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N., Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J. TITLE Alternative splicing of human elastin mRNA indicated by sequence analysis of cloned genomic and complementary DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 5680-5684 (1987) STANDARD full staff_review FEATURES from to/span description pept + 1 + 99 elastin C precursor, exon 10 /nomgen="ELN" /map="2q31-qter" /hgml_locus_uid="LC0146D" matp + 1 + 99 elastin C pre-msg < 1 > 109 elastin mRNA and intron IVS < 1 > 109 elastin A intron H IVS < 1 > 109 elastin B intron I IVS 100 > 109 elastin C intron J BASE COUNT 22 A 38 C 30 G 19 T ORIGIN About 0.1 kb after segment 10. 1 GTGCAGATGA GGGAGTTAGG CGGAGCCTGT CCCCTGAGCT CAGGGAAGGA GATCCCTCCT 61 CCTCTCAGCA CCTCCCCAGC ACCCCCTCAT CACCCAGGGG TGCATAGTA // LOCUS HUMEL12 103 bp ds-DNA PRI 15-MAR-1988 DEFINITION Human elastin gene, exon 9. ACCESSION M17275 J02948 KEYWORDS alternative splicing; elastin. SEGMENT 12 of 20 SOURCE Human fetal aorta, cDNA to mRNA, clones cHEL[2,3,4]; DNA, clones HEL[1,2,3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 103) AUTHORS Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N., Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J. TITLE Alternative splicing of human elastin mRNA indicated by sequence analysis of cloned genomic and complementary DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 5680-5684 (1987) STANDARD full staff_review FEATURES from to/span description pept + 55 + 93 elastin A precursor, exon 9 /nomgen="ELN" /map="2q31-qter" /hgml_locus_uid="LC0146D" pep$ + 55 + 93 elastin B precursor, exon 10 pep$ + 55 + 93 elastin C precursor, exon 11 matp + 55 + 93 elastin A matp + 55 + 93 elastin B matp + 55 + 93 elastin C pre-msg < 1 > 103 elastin mRNA and intron IVS < 1 54 elastin A intron H IVS 94 > 103 elastin A intron I IVS < 1 54 elastin B intron I IVS 94 > 103 elastin B intron J IVS < 1 54 elastin C intron J IVS 94 > 103 elastin C intron K BASE COUNT 21 A 38 C 22 G 22 T ORIGIN About 0.8 kb after segment 11. 1 TCCCAGGCAC AGAGCTCGGC TCCTGACCAC TCCCCAACTT TTCTTTCTCC CCAGTACCTG 61 GAGCCCTGGC TGCCGCTAAA GCAGCCAAAT ATGGTGAGTG CAC // LOCUS HUMEL13 136 bp ds-DNA PRI 15-MAR-1988 DEFINITION Human elastin gene, exon 8. ACCESSION M17276 J02948 KEYWORDS alternative splicing; elastin. SEGMENT 13 of 20 SOURCE Human fetal aorta, cDNA to mRNA, clones cHEL[2,3,4]; DNA, clones HEL[1,2,3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 136) AUTHORS Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N., Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J. TITLE Alternative splicing of human elastin mRNA indicated by sequence analysis of cloned genomic and complementary DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 5680-5684 (1987) STANDARD full staff_review FEATURES from to/span description pept + 55 + 126 elastin A precursor, exon 10 /nomgen="ELN" /map="2q31-qter" /hgml_locus_uid="LC0146D" pep$ + 55 + 126 elastin B precursor, exon 11 pep$ + 55 + 126 elastin C precursor, exon 12 matp + 55 + 126 elastin A matp + 55 + 126 elastin B matp + 55 + 126 elastin C pre-msg < 1 > 136 elastin mRNA and intron IVS < 1 54 elastin A intron I IVS 127 > 136 elastin A intron J IVS < 1 54 elastin B intron J IVS 127 > 136 elastin B intron K IVS < 1 54 elastin C intron K IVS 127 > 136 elastin C intron L BASE COUNT 23 A 35 C 48 G 30 T ORIGIN About 1.5 kb after segment 12. 1 AGGGAGACCC ATCGTTCAGA AATGGAACAC TCATTTTCCC TCCTCTCCCC GCAGGAGCAG 61 CAGTGCCTGG GGTCCTTGGA GGGCTCGGGG CTCTCGGTGG AGTAGGCATC CCAGGCGGTG 121 TGGTGGGTGA GTTGAT // LOCUS HUMEL14 124 bp ds-DNA PRI 15-MAR-1988 DEFINITION Human elastin gene, exon 7. ACCESSION M17277 J02948 KEYWORDS alternative splicing; elastin. SEGMENT 14 of 20 SOURCE Human fetal aorta, cDNA to mRNA, clones cHEL[2,3,4]; DNA, clones HEL[1,2,3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 124) AUTHORS Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N., Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J. TITLE Alternative splicing of human elastin mRNA indicated by sequence analysis of cloned genomic and complementary DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 5680-5684 (1987) STANDARD full staff_review FEATURES from to/span description pept + 55 + 114 elastin A precursor, exon 11 /nomgen="ELN" /map="2q31-qter" /hgml_locus_uid="LC0146D" pep$ + 55 + 114 elastin B precursor, exon 12 pep$ + 55 + 114 elastin C precursor, exon 13 matp + 55 + 114 elastin A matp + 55 + 114 elastin B matp + 55 + 114 elastin C pre-msg < 1 > 124 elastin mRNA and intron IVS < 1 54 elastin A intron J IVS 115 > 124 elastin A intron K IVS < 1 54 elastin B intron K IVS 115 > 124 elastin B intron L IVS < 1 54 elastin C intron L IVS 115 > 124 elastin C intron M BASE COUNT 24 A 43 C 37 G 20 T ORIGIN About 0.1 kb after segment 13. 1 GGAGGGAATC TAACCAGTAC AGAGTGCCTC CCTGAACTCG GTCTGTGTTC CCAGGAGCCG 61 GACCCGCCGC CGCCGCTGCC GCAGCCAAAG CTGCTGCCAA AGCCGCCCAG TTTGGTGAGC 121 ACTG // LOCUS HUMEL15 139 bp ds-DNA PRI 15-MAR-1988 DEFINITION Human elastin gene, exon 6. ACCESSION M17278 J02948 KEYWORDS alternative splicing; elastin. SEGMENT 15 of 20 SOURCE Human fetal aorta, cDNA to mRNA, clones cHEL[2,3,4]; DNA, clones HEL[1,2,3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 139) AUTHORS Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N., Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J. TITLE Alternative splicing of human elastin mRNA indicated by sequence analysis of cloned genomic and complementary DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 5680-5684 (1987) STANDARD full staff_review FEATURES from to/span description pept + 55 + 129 elastin A precursor, exon 12 /nomgen="ELN" /map="2q31-qter" /hgml_locus_uid="LC0146D" pep$ + 55 + 129 elastin B precursor, exon 13 pep$ + 55 + 129 elastin C precursor, exon 14 matp + 55 + 129 elastin A matp + 55 + 129 elastin B matp + 55 + 129 elastin C pre-msg < 1 > 139 elastin mRNA and intron IVS < 1 54 elastin A intron K IVS 130 > 139 elastin A intron L IVS < 1 54 elastin B intron L IVS 130 > 139 elastin B intron M IVS < 1 54 elastin C intron M IVS 130 > 139 elastin C intron N BASE COUNT 21 A 34 C 52 G 32 T ORIGIN About 0.3 kb after segment 14. 1 GCTTCAGTCC CACCTTTCTG ACCAGCGGAG TCTAATGCTC AGCTGTCTCC ACAGGCCTAG 61 TGGGAGCCGC TGGGCTCGGA GGACTCGGAG TCGGAGGGCT TGGAGTTCCA GGTGTTGGGG 121 GCCTTGGAGG TGAGAGTTG // LOCUS HUMEL16 103 bp ds-DNA PRI 15-MAR-1988 DEFINITION Human elastin gene, exon 5. ACCESSION M17279 J02948 KEYWORDS alternative splicing; elastin. SEGMENT 16 of 20 SOURCE Human fetal aorta, cDNA to mRNA, clones cHEL[2,3,4]; DNA, clones HEL[1,2,3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 103) AUTHORS Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N., Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J. TITLE Alternative splicing of human elastin mRNA indicated by sequence analysis of cloned genomic and complementary DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 5680-5684 (1987) STANDARD full staff_review FEATURES from to/span description pept + 55 + 93 elastin A precursor, exon 13 /nomgen="ELN" /map="2q31-qter" /hgml_locus_uid="LC0146D" pep$ + 55 + 93 elastin B precursor, exon 14 pep$ + 55 + 93 elastin C precursor, exon 15 matp + 55 + 93 elastin A matp + 55 + 93 elastin B matp + 55 + 93 elastin C pre-msg < 1 > 103 elastin mRNA and intron IVS < 1 54 elastin A intron L IVS 94 > 103 elastin A intron M IVS < 1 54 elastin B intron M IVS 94 > 103 elastin B intron N IVS < 1 54 elastin C intron N IVS 94 > 103 elastin C intron O BASE COUNT 21 A 34 C 25 G 22 T 1 others ORIGIN About 2.0 kb after segment 15. 1 GCCTGACCAG GTGGCATTGG CATTCCTGAG CCGTCATGTG CCTCATCTCC CCAGGTATAC 61 CTCCAGCTGC AGCCGCTAAA GCAGCTAAAT ACGGTGAGTN CCC // LOCUS HUMEL17 118 bp ds-DNA PRI 15-MAR-1988 DEFINITION Human elastin gene, exon 4. ACCESSION M17280 J02948 KEYWORDS alternative splicing; elastin. SEGMENT 17 of 20 SOURCE Human fetal aorta, cDNA to mRNA, clones cHEL[2,3,4]; DNA, clones HEL[1,2,3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 118) AUTHORS Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N., Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J. TITLE Alternative splicing of human elastin mRNA indicated by sequence analysis of cloned genomic and complementary DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 5680-5684 (1987) STANDARD full staff_review FEATURES from to/span description pep$ + 55 + 108 elastin B precursor, exon 15 /nomgen="ELN" /map="2q31-qter" /hgml_locus_uid="LC0146D" pep$ + 55 + 108 elastin C precursor, exon 16 matp + 55 + 108 elastin B matp + 55 + 108 elastin C pre-msg < 1 > 118 elastin mRNA and intron IVS < 1 > 118 elastin A intron M IVS < 1 54 elastin B intron N IVS 109 > 118 elastin B intron O IVS < 1 54 elastin C intron O IVS 109 > 118 elastin C intron P BASE COUNT 12 A 33 C 43 G 30 T ORIGIN About 0.1 kb after segment 16. 1 AGGGCCTCTT CCCGATGGGG GTGTCTTATC CTGACCCCAC CTGCCTCTTC TCAGGTGCTG 61 CTGGCCTTGG AGGTGTCCTA GGGGGTGCCG GGCAGTTCCC ACTTGGAGGT AGGGGTGG // LOCUS HUMEL18 109 bp ds-DNA PRI 15-MAR-1988 DEFINITION Human elastin gene, exon 4A. ACCESSION M17281 J02948 KEYWORDS alternative splicing; elastin. SEGMENT 18 of 20 SOURCE Human fetal aorta, cDNA to mRNA, clones cHEL[2,3,4]; DNA, clones HEL[1,2,3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 109) AUTHORS Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N., Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J. TITLE Alternative splicing of human elastin mRNA indicated by sequence analysis of cloned genomic and complementary DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 5680-5684 (1987) STANDARD full staff_review FEATURES from to/span description pept + 55 + 99 elastin A precursor, exon 14 /nomgen="ELN" /map="2q31-qter" /hgml_locus_uid="LC0146D" pep$ + 55 + 99 elastin B precursor, exon 16 pep$ + 55 + 99 elastin C precursor, exon 17 matp + 55 + 99 elastin A matp + 55 + 99 elastin B matp + 55 + 99 elastin C pre-msg < 1 > 109 elastin mRNA and intron IVS < 1 54 elastin A intron M IVS 100 > 109 elastin A intron N IVS < 1 54 elastin B intron O IVS 100 > 109 elastin B intron P IVS < 1 54 elastin C intron P IVS 100 > 109 elastin C intron Q BASE COUNT 23 A 37 C 24 G 25 T ORIGIN About 0.5 kb after segment 17. 1 GCTGGAGTCA GTTTCCACCC CTACCAACCC ACCAACCTGA AATCTCTCCT GCAGGAGTGG 61 CAGCAAGACC TGGCTTCGGA TTGTCTCCCA TTTTCCCAGG TATGCCAGG // LOCUS HUMEL20 1448 bp ds-DNA PRI 30-SEP-1988 DEFINITION Human elastin gene, exon 1. ACCESSION M17282 J02948 KEYWORDS alternative splicing; elastin. SEGMENT 20 of 20 SOURCE Human fetal aorta, cDNA to mRNA, clones cHEL[2,3,4]; DNA, clones HEL[1,2,3] [1]; clone HEL1 [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 98) AUTHORS Indik,Z., Yeh,H., Ornstein-Goldstein,N., Sheppard,P., Anderson,N., Rosenbloom,J.C., Peltonen,L. and Rosenbloom,J. TITLE Alternative splicing of human elastin mRNA indicated by sequence analysis of cloned genomic and complementary DNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 5680-5684 (1987) STANDARD full staff_review REFERENCE 2 (bases 99 to 1448) AUTHORS Indik,Z., Yoon,K., Morrow,S.D., Cicila,G., Rosenbloom,J., Rosenbloom,J. and Ornstein-Goldstein,N. TITLE Structure of the 3' region of the human elastin gene: Great abundance of Alu repetitive sequences and few coding sequences JOURNAL Connect. Tissue Res. 16, 197-211 (1987) STANDARD full staff_review COMMENT Polyadenylation signals are located at positions 950-955 and 1182-1187. FEATURES from to/span description pept + 55 98 elastin A precursor, exon 15 /nomgen="ELN" /map="2q31-qter" /hgml_locus_uid="LC0146D" pep$ + 55 98 elastin B precursor, exon 17 pep$ + 55 98 elastin C precursor, exon 18 matp + 55 98 elastin A matp + 55 98 elastin B matp + 55 98 elastin C pre-msg < 1 > 98 elastin mRNA and intron IVS < 1 54 elastin A intron N IVS < 1 54 elastin B intron P IVS < 1 54 elastin C intron Q BASE COUNT 298 A 442 C 338 G 350 T 20 others ORIGIN Undetermined number of bases after segment 19. 1 AGCCGAAACT GAGAGGGGCC GGACTCACAG TGATGTGCAC CTCCTCCCGT CCAGGTGGGG 61 CCTGCCTGGG GAAAGCTTGT GGCCGGAAGA GAAAATGAGC TTCCTAGGAC CCCTGACTCA 121 CGACCTCATC AACGTTGGTG CTACTGCTTG GTGGAGAATG TAAACCCTTT GTATCCCCAT 181 CCCATGCCCC TCCGATTCCC CACCCCAGGA GGGAACGGGC AGGCCGGGCG GCTTGCAGAT 241 CCACAGGGCA AGGAAACAAG AGGGGAGCGG CCAAGTGCCC CGACCAGGAG GCCCCCTACT 301 TCAGAGGCAA GGGCCATGTG GTCCTGGCCC CCCAACCCCA TCCCTTCCCA CCTAGGAGCT 361 CCCCCTCCAC ACAGCCTCCA TCTCCAGGGG AACTTGGTGC TACACGCTGG TGCTCTTATC 421 TTCCTGGGGG GAGGGAGGAG GGAAGGGTGG CCCCTCGGGG AACCCCCTAC CTGGGGCTCC 481 TCTAAAGATG GTGCAGACAC TTCCTGGGCA GTCCCAGCTC CCCCTGCCCA CCAGGACCCA 541 CCGTTGGCTG CCATCCAGTT GGTACCCAAG CACCTGAAGC CTCAAAGCTG GATTCGCTCT 601 AGCATCCCTC CTCTCCTGGG TCCACTTGGC CGTCTCCTCC CCACCGATCG CTGTTCCCCA 661 CATCTGGGGC GCTTTTGGGT TGGAAAACCA CCCCACACTG GGAATAGCCA CCTTGCCCCT 721 TGTAAGAATC CATCCGCCCA TCCGTCCATT CATCCATCGG TCCGTCCATC CATGTCCCAG 781 TTGACCGCCG GCACCATTAG CTGGCTGGGT GCACCCACCA TCAACCTGGT TGACCTGTCA 841 TGGCCGCCTG TGCCCTNCCT NANCCCCATC CTACANTCCC CCAGGGCGTG CGGGGCTGTG 901 CAGACTGGGG TGCCAGGCAT CTCNNNCCCA CCCGGGGTNT CCCCANATGC AGTACTGTAT 961 ANNCCCCATC CCTCCCTCGG TCCACTGAAC TTCAGAGCAG TTCCCATTCC TGCCCCGNCC 1021 ATCTTTTTGT GTCTCGCTGT GATAGATCAA TAAATATTTT ATTTTTTGTC CTGGATATTT 1081 GGGGATTATT TTTGATTGTT GATATTCTCT TTTGGTTTTA TTGTTGTGGT TCATTGAAAA 1141 AAAAAAGATA ATTTTTTTTT CTGATCCGGG GAGCTGTATC CCCAGTAGAA AAAAAATTTT 1201 AATCACTCTA ATATACCTCT GGATGANNCA NACCTTTTTT TTTATTAAGA AAAGAGATTT 1261 AACTGCTTCA GAAATGACTA ATAAATGAAA ACCCTTTAAA GGAAACTGTG TCTTNGCTTC 1321 CTTGGTATGA TTTAATCTGC CTTCAACTGT TGGCCTGGNT GGGGNNANGG GCTCTGCTTC 1381 AGGGAACCTC CACCACCCAA ATTGTATTTG AGAGGTTGCC CAACCAAAAG CCCCTGCTGC 1441 CTGGCTTC // LOCUS HUMELA301 701 bp ds-DNA PRI 30-SEP-1988 DEFINITION Human elastase III A gene, exon 1. ACCESSION M18693 M18691 J03516 KEYWORDS elastase. SEGMENT 1 of 8 SOURCE Human pancreas, cDNA to mRNA, clone pCL2; DNA, clone ELIII-39. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 701) AUTHORS Tani,T., Ohsumi,J., Mita,K. and Takiguchi,Y. TITLE Identification of a Novel Class of Elastase Isozyme, Human Pancreatic Elastase III, by cDNA and Genomic Gene Cloning JOURNAL J. Biol. Chem. 263, 1231-1239 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by T.Tani, 11-DEC-1987. FEATURES from to/span description pept 527 + 569 elastase III A precursor, exon 1 sigp 527 + 569 elastase III A signal peptide pre-msg 509 > 701 EL III mRNA and intron IVS 570 > 701 EL III intron A BASE COUNT 149 A 199 C 173 G 180 T ORIGIN 1 bp upstream of XbaI site. 1 TCTAGAAGGG CTGAGACCAT GCATTTTTCA CAGCTGTATC CCCTAGAACA GTGCCTGATA 61 CATAGAGAAA AAGTGCTCAA CAAGAATTTT TTTGTGGAGG TGGGTACTAA TATAAATCTC 121 ATGTTAGAGA GCAGAAAACT GAGACTCAGA GAGGTTAAGC CATGCGCCCA AAGTCGCAGA 181 GCTTGAAGTG GCAGAACCAG GATTGAACCG TGGTCTGCCC GGTTCCAGTG CCCCTGTTCT 241 TTCCGAAAGA GGAGGTCTTG TCCTGACATG AGCTTTTCTT CCTGAACTCT TGTTATCAGT 301 TGGCCACTCC ACACCACCTG TCCAGCCACC TGTTGGAGGG AACGCTAGGT CCTCCCTTTG 361 GCTGTCACTT GTCCATCTGC AGCAGGCGAG GACTGTGGCC TTGGACCATG GGAGACCTGG 421 AGTTCCACCA CCTGTTCTGG AGTGGGGCCC AATCAGCCCC CCTGCTTACC TTCTGCCCTA 481 TATTAGAGCC CCAGGCTCTG TGCCCTTTCC TATCATCACA AAACTCATGA TGCTCCGGCT 541 GCTCAGTTCC CTCCTCCTTG TGGCCGTTGG TAAGACCCCA ACCTGTCTGT GTGCTCCCTG 601 GGCTGCCCTG GACTAGGAAT CCTTGAAATC TACCACTCGC TCTGAGTCCC ATGACATGCT 661 ATGCCTGGTT CCACAGGAGG GGGTCTCAGC TTGTACCCGG G // LOCUS HUMELA302 179 bp ds-DNA PRI 30-SEP-1988 DEFINITION Human elastase III A gene, exon 2. ACCESSION M18694 M18691 J03516 KEYWORDS elastase. SEGMENT 2 of 8 SOURCE Human pancreas, cDNA to mRNA, clone pCL2; DNA, clone ELIII-39. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 179) AUTHORS Tani,T., Ohsumi,J., Mita,K. and Takiguchi,Y. TITLE Identification of a Novel Class of Elastase Isozyme, Human Pancreatic Elastase III, by cDNA and Genomic Gene Cloning JOURNAL J. Biol. Chem. 263, 1231-1239 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by T.Tani, 11-DEC-1987. FEATURES from to/span description pept + 79 + 164 elastase III A precursor, exon 2 sigp + 79 83 elastase III A signal peptide matp 120 + 164 elastase III A pre-msg < 1 > 179 EL III mRNA and introns IVS < 1 78 EL III intron A IVS 165 > 179 EL III intron B BASE COUNT 26 A 63 C 45 G 45 T ORIGIN About 2 kb after segment 1. 1 GGCTTGGACT GGGACCCTGG CCTCCTCTTT GCTTTTGGGG ACCCTCCAGC TGATTGACAG 61 CTCTCCTCTC CCCTCTAGCC TCAGGCTATG GCCCACCTTC CTCTCACTCT TCCAGCCGCG 121 TTGTCCATGG TGAGGATGCG GTCCCCTACA GCTGGCCCTG GCAGGTAAGA GCAATAGCA // LOCUS HUMELA303 268 bp ds-DNA PRI 30-SEP-1988 DEFINITION Human elastase III A gene, exon 3. ACCESSION M18695 M18691 J03516 KEYWORDS elastase. SEGMENT 3 of 8 SOURCE Human pancreas, cDNA to mRNA, clone pCL2; DNA, clone ELIII-39. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 268) AUTHORS Tani,T., Ohsumi,J., Mita,K. and Takiguchi,Y. TITLE Identification of a Novel Class of Elastase Isozyme, Human Pancreatic Elastase III, by cDNA and Genomic Gene Cloning JOURNAL J. Biol. Chem. 263, 1231-1239 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by T.Tani, 11-DEC-1987. FEATURES from to/span description pept + 67 + 164 elastase III A precursor, exon 3 matp + 67 + 164 elastase III A pre-msg < 1 > 268 EL III mRNA and introns IVS < 1 66 EL III intron B IVS 165 > 268 EL III intron C BASE COUNT 45 A 81 C 86 G 56 T ORIGIN About 2 kb after segment 2. 1 ACAGCCATTG GTGGCAACTC TCATGGTGGC CCAGCCCACT GAGGCCCTTT CCTCCTGGGC 61 CACCAGGTTT CCCTGCAGTA TGAGAAAAGT GGAAGCTTCT ACCACACGTG TGGCGGTAGC 121 CTCATCGCCC CCGATTGGGT TGTGACTGCC GGCCACTGCA TCTCGTGAGT TCTCTACCCT 181 GTCCCTGCCT GTGGCCCTGG GCAGCGGGGG AGAGTGGGTG ATGATGGGGC AAGGAGGGAG 241 GTGAGCCAGT CAGGCCCCGA CTGACCTC // LOCUS HUMELA304 229 bp ds-DNA PRI 30-SEP-1988 DEFINITION Human elastase III A gene, exon 4. ACCESSION M18696 M18691 J03516 KEYWORDS elastase. SEGMENT 4 of 8 SOURCE Human pancreas, cDNA to mRNA, clone pCL2; DNA, clone ELIII-39. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 229) AUTHORS Tani,T., Ohsumi,J., Mita,K. and Takiguchi,Y. TITLE Identification of a Novel Class of Elastase Isozyme, Human Pancreatic Elastase III, by cDNA and Genomic Gene Cloning JOURNAL J. Biol. Chem. 263, 1231-1239 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by T.Tani, 11-DEC-1987. FEATURES from to/span description pept + 45 + 176 elastase III A precursor, exon 4 matp + 45 + 176 elastase III A pre-msg < 1 > 229 EL III mRNA and introns IVS < 1 44 EL III intron C IVS 177 > 229 EL III intron D BASE COUNT 40 A 67 C 70 G 52 T ORIGIN About 1 kb after segment 3. 1 AGGTGAGCCA GTCAGGCCCC GACTGACCTC ACCTCTGCCT GCAGGAGGGA TCTGACCTAC 61 CAGGTGGTGT TGGGTGAGTA CAACCTTGCT GTGAAGGAGG GCCCCGAGCA GGTGATCCCC 121 ATCAACTCTG AGCTGTTTGT GCATCCACTC TGGAACCGCT CGTGTGTGGC CTGTGGGTGA 181 GTGAATGCTC CGGTCTGGAA CCCAGGGGCT CCTCTACTTG TCCCTCCAA // LOCUS HUMELA305 184 bp ds-DNA PRI 30-SEP-1988 DEFINITION Human elastase III A gene, exon 5. ACCESSION M18697 M18691 J03516 KEYWORDS elastase. SEGMENT 5 of 8 SOURCE Human pancreas, cDNA to mRNA, clone pCL2; DNA, clone ELIII-39. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 184) AUTHORS Tani,T., Ohsumi,J., Mita,K. and Takiguchi,Y. TITLE Identification of a Novel Class of Elastase Isozyme, Human Pancreatic Elastase III, by cDNA and Genomic Gene Cloning JOURNAL J. Biol. Chem. 263, 1231-1239 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by T.Tani, 11-DEC-1987. FEATURES from to/span description pept + 6 + 142 elastase III A precursor, exon 5 matp + 45 + 176 elastase III A pre-msg < 1 > 184 EL III mRNA and introns IVS < 1 5 EL III intron D IVS 143 > 184 EL III intron E BASE COUNT 37 A 70 C 42 G 35 T ORIGIN About 1 kb after segment 4. 1 CCCAGCAATG ACATCGCCCT CATCAAGCTC TCACGCAGCG CCCAGCTGGG AGATGCCGTC 61 CAGCTCGCCT CACTCCCTCC CGCTGGTGAC ATCCTTCCCA ACAAGACACC CTGCTACATC 121 ACCGGCTGGG GCCGTCTCTA TAGTACGTGC TGACTTCTCT AGCTGGCCAC AGAGACAGTG 181 GCAG // LOCUS HUMELA306 169 bp ds-DNA PRI 30-SEP-1988 DEFINITION Human elastase III A gene, exon 6. ACCESSION M18698 M18691 J03516 KEYWORDS elastase. SEGMENT 6 of 8 SOURCE Human pancreas, cDNA to mRNA, clone pCL2; DNA, clone ELIII-39. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 169) AUTHORS Tani,T., Ohsumi,J., Mita,K. and Takiguchi,Y. TITLE Identification of a Novel Class of Elastase Isozyme, Human Pancreatic Elastase III, by cDNA and Genomic Gene Cloning JOURNAL J. Biol. Chem. 263, 1231-1239 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by T.Tani, 11-DEC-1987. FEATURES from to/span description pept + 21 + 163 elastase III A precursor, exon 6 matp + 21 + 163 elastase III A pre-msg < 1 > 169 EL III mRNA and introns IVS < 1 20 EL III intron E IVS 164 > 169 EL III intron F BASE COUNT 33 A 48 C 52 G 36 T ORIGIN About 0.5 kb after segment 5. 1 GGTGCTTTTT ATCCTTGCAG CCAATGGGCC ACTCCCAGAC AAGCTGCAGC AGGCCCGGCT 61 GCCCGTGGTG GACTATAAGC ACTGCTCCAG GTGGAACTGG TGGGGTTCCA CCGTGAAGAA 121 AACCATGGTG TGTGCTGGAG GGTACATCCG CTCCGGCTGC AACGTCTCA // LOCUS HUMELA307 275 bp ds-DNA PRI 30-SEP-1988 DEFINITION Human elastase III A gene, exon 7. ACCESSION M18699 M18691 J03516 KEYWORDS elastase. SEGMENT 7 of 8 SOURCE Human pancreas, cDNA to mRNA, clone pCL2; DNA, clone ELIII-39. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 275) AUTHORS Tani,T., Ohsumi,J., Mita,K. and Takiguchi,Y. TITLE Identification of a Novel Class of Elastase Isozyme, Human Pancreatic Elastase III, by cDNA and Genomic Gene Cloning JOURNAL J. Biol. Chem. 263, 1231-1239 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by T.Tani, 11-DEC-1987. FEATURES from to/span description pept + 46 + 198 elastase III A precursor, exon 7 matp + 46 + 198 elastase III A pre-msg < 1 > 275 EL III mRNA and introns IVS < 1 45 EL III intron F IVS 199 > 275 EL III intron G BASE COUNT 49 A 76 C 88 G 62 T ORIGIN About 2 kb after segment 6. 1 CCCTCAGACA TGGCTCAGCC ACCCACTCCT CTCTGACGGT TCCAGGGTGA CTCTGGAGGA 61 CCCCTCAACT GCCCCACAGA GGATGGTGGC TGGCAGGTCC ACGGTGTGAC CAGCTTTGTT 121 TCTGGCTTTG GCTGCAACTT CATCTGGAAG CCTACGGTGT TCACTCGAGT CTCCGCCTTC 181 ATCGACTGGA TTGAGGAGGT GAGGAGGGCA GGGCGCCCGG AGGGCTTTAG GGTGGTGGCT 241 CTTCTGAGAG GTGACAGGTG AGAAACATCG GATCC // LOCUS HUMELA308 306 bp ds-DNA PRI 30-SEP-1988 DEFINITION Human elastase III A gene, exon 8. ACCESSION M18700 M18691 J03516 KEYWORDS elastase. SEGMENT 8 of 8 SOURCE Human pancreas, cDNA to mRNA, clone pCL2; DNA, clone ELIII-39. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 306) AUTHORS Tani,T., Ohsumi,J., Mita,K. and Takiguchi,Y. TITLE Identification of a Novel Class of Elastase Isozyme, Human Pancreatic Elastase III, by cDNA and Genomic Gene Cloning JOURNAL J. Biol. Chem. 263, 1231-1239 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by T.Tani, 11-DEC-1987. FEATURES from to/span description pept + 9 26 elastase III A precursor, exon 8 matp + 9 23 elastase III A pre-msg < 1 119 EL III mRNA and introns IVS < 1 8 EL III intron G BASE COUNT 74 A 74 C 70 G 88 T ORIGIN About 2 kb after segment 7. 1 TTTTCCAGAC CATAGCAAGC CACTAGAACC AAGGCCCAGC TGGCAGTGCT GATCGATCCC 61 AGCTGGCAGT GCTGATCGAT CCCACATCCT GAATAAGAAT AAAGATCCGG GGCCTTCCCA 121 GATCCACTCG TTATTCTCGG ACGAGTGTTC AGTAATGAAC CTCTGGAGAG AACCATGTAT 181 ATGATCGTTA TCTGGGTTGG ACTTCTGCTT TTAAGCCCAG ATAACTGGCC TGAATATGTT 241 AATGAGAGAA TCGGTATTCC TCATGTGTGG CATGTTTTCG TCTTTGCTCT TGCATTTTCG 301 CTAGCA // LOCUS HUMELN1 43 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human elastin gene, exon 5, 3' end. ACCESSION M20425 KEYWORDS elastin. SEGMENT 1 of 4 SOURCE Human DNA, clone HEL1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 43) AUTHORS Indik,Z., Yoon,K., Morrow,S.D., Cicila,G., Rosenbloom,J., Rosenbloom,J. and Ornstein-Goldstein,N. TITLE Structure of the 3' region of the human elastin gene: Great abundance of alu repetitive sequences and few coding sequences JOURNAL Connect. Tissue Res. 16, 197-221 (1987) STANDARD simple staff_review COMMENT The exons have been numbered starting with the 3' end. FEATURES from to/span description pept < 3 / 41 elastin, exon 5 (AA at 5) /nomgen="ELN" /map="2q31-qter" /hgml_locus_uid="LC0146D" IVS < 1 2 ELN, intron E IVS 42 > 43 ELN, intron D BASE COUNT 13 A 12 C 10 G 8 T ORIGIN Chromosome 2q31-qter. 1 AGGTATACCT CCAGCTGCAG CCGCTAAAGC AGCTAAATAC GGT // LOCUS HUMELN2 58 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human elastin gene, exon 4, 3' end. ACCESSION M20426 KEYWORDS elastin. SEGMENT 2 of 4 SOURCE Human DNA, clone HEL1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 58) AUTHORS Indik,Z., Yoon,K., Morrow,S.D., Cicila,G., Rosenbloom,J., Rosenbloon,J. and Ornstein-Goldstein,N. TITLE Structure of the 3' region of the human elastin gene: Great abundance of alu repetitive sequences and few coding sequences JOURNAL Connect. Tissue Res. 16, 197-221 (1987) STANDARD simple staff_review COMMENT The exons have been numbered starting with the 3' end. FEATURES from to/span description pept < 3 > 52 elastin, exon 4 (AA at 5) /nomgen="ELN" /map="2q31-qter" /hgml_locus_uid="LC0146D" IVS < 1 2 ELN, intron D IVS 57 > 58 ELN, intron C BASE COUNT 6 A 13 C 25 G 14 T ORIGIN About 100 bp after segment 1; chromosome 2q31-qter. 1 AGGTGCTGCT GGCCTTGGAG GTGTCCTAGG GGGTGCCGGG CAGTTCCCAC TTGGAGGT // LOCUS HUMELN3 46 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human elastin gene, exon 1, 3' end. ACCESSION M20427 KEYWORDS elastin. SEGMENT 3 of 4 SOURCE Human DNA, clone HEL1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 46) AUTHORS Indik,Z., Yoon,K., Morrow,S.D., Cicila,G., Rosenbloom,J. and Rosenbloom,K. TITLE Structure of the 3' region of the human elastin gene: Great abundance of alu repetitive sequences and few coding sequences JOURNAL Connect. Tissue Res. 16, 197-221 (1987) STANDARD simple staff_review COMMENT The exons have been numbered starting with the 3' end. FEATURES from to/span description pept < 3 46 elastin, exon 1 /nomgen="ELN" /map="2q31-qter" /hgml_locus_uid="LC0146D" IVS < 1 2 ELN, intron B BASE COUNT 12 A 7 C 20 G 7 T ORIGIN About 2.5 kb after segment 2; chromosome 2q31-qter. 1 AGGTGGGGCC TGCCTGGGGA AAGCTTGTGG CCGGAAGAGA AAATGA // LOCUS HUMENK1 1014 bp ds-DNA PRI 07-JAN-1985 DEFINITION human enkephalin gene: exons 1 and 2. ACCESSION J00122 KEYWORDS enkephalin; neuropeptide; polyprotein. SEGMENT 1 of 2 SOURCE human cDNA([1],[3]) and DNA ([2]). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 158 to 210; 298 to 353; 823 to 963) AUTHORS Comb,M., Seeburg,P.H., Adelman,J., Eiden,L. and Herbert,E. TITLE Primary structure of the human met- and leu-enkephalin precursor and its mRNA JOURNAL Nature 295, 663-666 (1982) STANDARD full staff_review REFERENCE 2 (bases 1 to 1014) AUTHORS Noda,M., Teranishi,Y., Takahashi,H., Toyosato,M., Notake,M., Nakanishi,S. and Numa,S. TITLE Isolation and structural organization of the human preproenkephalin gene JOURNAL Nature 297, 431-434 (1982) STANDARD full staff_review REFERENCE 3 (bases 323 to 353; 823 to 963) AUTHORS Legon,S., Glover,D.M., Hughes,J., Lowry,P.J., Rigby,P.W.J. and Watson,C.J. TITLE The structure and expression of the preproenkephalin gene JOURNAL Nucleic Acids Res. 10, 7905-7918 (1982) STANDARD full staff_review COMMENT Preproenkephalin is a polyprotein (multivalent precursor) which like proopiomelanocortin, to which it is strongly related, yields several small peptide hormones through processing. the preproenkephalin gene region encodes four copies of met-enkephalin, two copies of met-enkephalin extended, and one copy of leu-enkephalin. the gene is about 5.2 kb in length and consists of three exons and two introns, with this exception: [1] shows additional splicing in the 5' untranslated mRNA which [2] (note added in proof) identifies as an alternative mode of mRNA processing. hence the mrna which includes bases 211 to 297 (as shown below) is an alternate mRNA species. furthermore, [2] believes that the first 113 nucleotides reported by [1] (not shown below) are an artifact. the structural organization of the enkephalin gene so closely resembles the proopiomelanocortin gene that they are thought to have evolved by a similar mechanism. see other loci with enk as the 4th-6th characters of the locus name and other loci with pomc as the 4th-7th characters of the locus name. EMBL features not translated to GenBank features: key from to description TRANSCR 141 >1014 primary transcript MSG 141 353 part of messenger RNA, 1st exon MSG 823 963 part of messenger RNA, 2nd exon IVS 354 822 intervening sequence IVS 964 >1014 intervening sequence CDS 826 963 reading frame, amino acids 1 to 46 FEATURES from to/span description pept 826 + 963 preproenkephalin (exon 2) /nomgen="PENK" /map="8q23-q24" /hgml_locus_uid="LA0049H" pre-msg 141 > 1014 enk mRNA IVS 211 297 enk intron [2] (alt.) IVS 353 822 enk intron (alt.) IVS 211 822 enk intron A (alt.) IVS 964 > 1014 enk intron B BASE COUNT 136 A 364 C 312 G 202 T ORIGIN near a SstI site ([2]). 1 CCGACCCCTC CCGCGAAGGC GTCGGCGCGG GGCTGGCGTA GGGCCTGCGT CAGCTGCAGC 61 CCGCCGGCGA TTGGGGCGCG CGCGCCTCCT TCGGTTTGGG GCTAATTATA AAGTGGCTCC 121 AGCAGCCGTT AAGCCCCGGG ACGGCGAGGC AGGCGCTCAG AGCCCCGCAG CCTGGCCCGT 181 GACCCCGCAG AGACGCTGAG GACCGCGACG GTGAGGCCCT ACGTCCGCCA GCACACCCGG 241 GCCCGCTTCT CCCCGACGCC CGCCCTCCTC ACACTTGCCT TCTTCTCTTC CCTCTAGAGT 301 CGTGTCTGAA CCCGGCTTTT CCAATTGGCC TGCTCCATCC GAACAGCGTC AACGTGAGTG 361 AATTTGCCCG AAGCTTGTCT TTGCTGAGCG GGTTTGGGGA CGTCTGCCCG CCCTCTTTCC 421 CTTCACATTT CATTGCATGG GTTCCCCAAC AGCGTTCCCT GGTTCTTCTT TGTGACCCCA 481 GTCAATGTCC TGCCTCCCCC GGCTCCCGCT CTCTCGCCCC TGGTCTGCGG CGTTCTCTCC 541 GGAATCTTGC CCTGGGCCGC GGACGCCCAG GAAAAGAGCC GGGTGCCCCA GGCAGCCTCG 601 CGTTGGGGGC GACCGCGCCA TCCCGGGAAC CGCGAGGCGA TCTGAGTCGC CTCCACGTCT 661 ACCTAAAAGC TGTCGGCCGG GAGGGCGGGG CCCCAGAAAG GAGCATTCCT GCGGGCTTTT 721 GCTCGACGAT CCCCTGCTGA GGCTGTCGCG GCGAGGGTCC TGCCGAGGGA CCCCGTTCTG 781 CGCCCAGGCA GGCTCGAAGC ACGCGTCCCT CTCTCCTCGC AGTCCATGGC GCGGTTCCTG 841 ACACTTTGCA CTTGGCTGCT GTTGCTCGGC CCCGGGCTCC TGGCGACCGT GCGGGCCGAA 901 TGCAGCCAGG ATTGCGCGAC GTGCAGCTAC CGCCTAGTGC GCCCGGCCGA CATCAACTTC 961 CTGGTGAGTG TTGCGCGCGG CGAGTGTTGC GCACCTTGTG AGACAGAGTT TCCG // LOCUS HUMENK2 1056 bp ds-DNA PRI 07-JAN-1985 DEFINITION Human enkephalin gene: exon 3 and 3'flank. ACCESSION J00123 KEYWORDS enkephalin; neuropeptide; polyprotein. SEGMENT 2 of 2 SOURCE Human cDNA ([1],[3]) and DNA ([2]). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 35 to 1022) AUTHORS Comb,M., Seeburg,P.H., Adelman,J., Eiden,L. and Herbert,E. TITLE Primary structure of the human met- and leu-enkephalin precursor and its mRNA JOURNAL Nature 295, 663-666 (1982) STANDARD full staff_review REFERENCE 2 (bases 1 to 1056) AUTHORS Noda,M., Teranishi,Y., Takahashi,H., Toyosato,M., Notake,M., Nakanishi,S. and Numa,S. TITLE Isolation and structural organization of the human preproenkephalin gene JOURNAL Nature 297, 431-434 (1982) STANDARD full staff_review REFERENCE 3 (bases 35 to 892) AUTHORS Legon,S., Glover,D.M., Hughes,J., Lowry,P.J., Rigby,P.W.J. and Watson,C.J. TITLE The structure and expression of the preproenkephalin gene JOURNAL Nucleic Acids Res. 10, 7905-7918 (1982) STANDARD full staff_review COMMENT Preproenkephalin is a polyprotein (multivalent precursor) which like proopiomelanocortin, to which it is strongly related, yields several small peptide hormones through processing. The preproenkephalin gene region encodes four copies of met-enkephalin, two copies of met-enkephalin extended, and one copy of leu-enkephalin. The gene is about 5.2 kb in length and consists of three exons and two introns, with the exception about the 5' region noted in the comment to segment 1 above. The structural organization of the enkephalin gene so closely resembles the proopiomelanocortin gene that they are thought to have evolved by a similar mechanism. See other loci with enk as the 4th-6th characters of the locus name and other loci with pomc as the 4th-7th characters of the locus name. FEATURES from to/span description pept + 35 700 preproenkephalin (exon 3) /nomgen="PENK" /map="8q23-q24" /hgml_locus_uid="LA0049H" matp 194 208 met-enkephalin matp 215 229 met-enkephalin matp 302 316 met-enkephalin matp 452 475 met-enkephalin-arg-gly-leu matp 524 538 met-enkephalin matp 584 598 met-enkephalin matp 677 697 met-enkephalin-arg-phe pre-msg < 1 1020 enk mRNA IVS < 1 34 enk intron b conflict 553 553 c in [2],[3]; t in [1] conflict 614 615 gc in [2],[3]; ct in [1] conflict 703 704 tc in [1],[2]; t in [3] conflict 723 724 cc in [1],[2]; c in [3] conflict 916 918 ttc in [1]; tc in [2]. conflict 927 929 aat in [1]; at in [2]. conflict 936 938 ttg in [1]; tg in [2]. conflict 958 960 ttg in [1]; tg in [2]. conflict 969 971 cca in [1]; ca in [2]. conflict 1007 1007 a in [2]; g in [1] BASE COUNT 304 A 218 C 258 G 276 T ORIGIN HindIII site, about 3.4 kb 3' to humenkeph1. 1 TTCTTTTTAT CATTACATCA AATTGTTTTC CCAGGCTTGC GTAATGGAAT GTGAAGGTAA 61 ACTGCCTTCT CTGAAAATTT GGGAAACCTG CAAGGAGCTC CTGCAGCTGT CCAAACCAGA 121 GCTTCCTCAA GATGGCACCA GCACCCTCAG AGAAAATAGC AAACCGGAAG AAAGCCATTT 181 GCTAGCCAAA AGGTATGGGG GCTTCATGAA AAGGTATGGA GGCTTCATGA AGAAAATGGA 241 TGAGCTTTAT CCCATGGAGC CAGAAGAAGA GGCCAATGGA AGTGAGATCC TCGCCAAGCG 301 GTATGGGGGC TTCATGAAGA AGGATGCAGA GGAGGACGAC TCGCTGGCCA ATTCCTCAGA 361 CCTGCTAAAA GAGCTTCTGG AAACAGGGGA CAACCGAGAG CGTAGCCACC ACCAGGATGG 421 CAGTGATAAT GAGGAAGAAG TGAGCAAGAG ATATGGGGGC TTCATGAGAG GCTTAAAGAG 481 AAGCCCCCAA CTGGAAGATG AAGCCAAAGA GCTGCAGAAG CGATATGGGG GCTTCATGAG 541 AAGAGTAGGT CGCCCAGAGT GGTGGATGGA CTACCAGAAA CGGTATGGAG GTTTCCTGAA 601 GCGCTTTGCC GAGGCTCTGC CCTCCGACGA AGAAGGCGAA AGTTACTCCA AAGAAGTTCC 661 TGAAATGGAA AAAAGATACG GAGGATTTAT GAGATTTTAA TATCTTTTCC CACTAGTGGC 721 CCCCAGGCCC CAGCAAGCCT CCCTCCATCC TCCAGTGGGA AACTGTTGAT GGTGTTTTAT 781 TGTCATGTGT TGCTTGCCTT GTATAGTTGA CTTCATTGTC TGGATAACTA TACAACCTGA 841 AAACTGTCAT TTCAGGTTCT GTGCTCTTTT TGGAGTCTTT AAGCTCAGTA TTAGTCTATT 901 GCAGCTATCT CGTTTTTCAT GCTAAAAATA GTTTTTTGTT ATCTTGTCTC TTATTTTTTG 961 ACAAACATCC AATAAATGCT TACTTGTATA TAGAGATAAT AAACCTATTA CCCCAAGTGC 1021 ATAATATCCT TGTAAGTCTC TTTTTCTCCA AGGCTC // LOCUS HUMENKB1 1831 bp ds-DNA PRI 03-SEP-1985 DEFINITION Human enkephalin B gene, exon 1. ACCESSION X00174 KEYWORDS dynorphin; endorphin; enkephalin; leumorphin; neuropeptide; polyprotein; rimorphin. SEGMENT 1 of 4 SOURCE Human fetal liver DNA (genomic library of T.Maniatis) and cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1831) AUTHORS Horikawa,S., Takai,T., Toyosato,M., Takahashi,H., Noda,M., Kakidani,H., Kubo,T., Hirose,T., Inayama,S., Hayashida,H., Miyata,T. and Numa,S. TITLE Isolation and structural organization of the human preproenkephalin B gene JOURNAL Nature 306, 611-614 (1983) STANDARD full staff_review COMMENT EMBL features not translated to GenBank features: key from to description PRM 422 430 put. CAAT-box PRM 457 466 put. TATA-box Two diferences were found between the cDNA and the genomic clones: both "g" in the genomic DNA and "a" in the cDNA. FEATURES from to/span description pre-msg 491 > 1831 enkB mRNA IVS 1807 > 1831 enkB intron A rpt 207 274 tandem repeat copy A rpt 275 342 tandem repeat copy B rpt 343 410 tandem repeat copy C BASE COUNT 389 A 504 C 454 G 484 T ORIGIN 124 bp upstream of DdeI site. 1 ACAGATGAGC AATCAGAGGT TGAAGTTGGC AGCTTATCCA AGGTCTCTCC GATGGTGAGT 61 GGCAGACCTG AGAGTCAAAC TCACATCTTA AATGTCATTA AGCACAGCCT GTGTATGGAG 121 AGGCTGAGTC CCAGAGAAGA AAACTGACCC CACCCCTCAG TGTGGGCAGA TTCAAGCCTG 181 GGGAGAGGCA AGGGAGGGGG CAGAAACTTG GAGGGTAGAT GGACCTGACT TTAGGTTCTA 241 GCTGGATGAC TTACTTGCTG TGTGTCTCTA GGAACTTGGA GGGTAGATGG ACCTGACTTT 301 AGGTTCTAGC TGGATGACTT ACTTGCTGTG TGTCTCTAGG AACTTGGAGG ATAGATGGAC 361 CTGACTTTAG GTTCTAGCTG GATGACTTAC TTGCTGTGTG TCTCTAGGAA AGTTTCTCAG 421 CTCTCAAACC TCTGTTTTCT CATCTGCAAG ATGGGGATAA TATTAACCAA CTGGCTAGGT 481 CATGAGGATT AAATCTGACA ACTCTACCTA ACCGCCTGGT GCAGCCAGGT GCCCACAAAA 541 TGGGCGGCCC TGCCAGACTT CTGAAATAGT TGTGCCTCCC ACCACAAACT GTCTGCTATT 601 GACTGAGAAG AGAAGCATGT TCTCTTCCAA ACTCAAAGAC CCTTATGTTC TGGAAGATCT 661 CTGTAGCAAT GAAGAGTGCC ACCTTCAAGC TGCTGCTGGT CCAGGCCAAG GGTATATTGA 721 GGTGGGTCCA TCCCTCACCC AGAGTGTGTG CTGAGGCATA TTGCAGCCCA TATTTTGAGG 781 GGTTTTTCCT GCTATTCATT TTGTGTTGCC GCCAGAGAAA ACTAAACTGC AGCTCCTGGC 841 TGTCGGGGAA GAGCTGTGCT TCTGCCAGGG CTAGTGTGCC TGGCACCAGC TCAGGCATGT 901 ACCAGAGCCG AATGCCGTTT GCAGGCAAAT GTACACACAA AAAGATTCCA GCACACGCGT 961 GCACCAGCCA ACACAGAGTT GCCACGGCAC GCAACAGCGG GCCATGCACG TGCCTGCTGA 1021 CTCGGCTGTC ATCTTCCGAC TCTCTTCCGT GATCAGAAGA CTGTCTCAGC CTCTGTCTGT 1081 CTCCACGTCT GTCTGTCTGT GGATCAGTGT GTTTTGAGTC TGACAGTCTA CCTGGCTGGT 1141 TTCCTTTTTT CTTGCTCTGT CCCTCTCTCC TTGTGTTTCT ACCCATCTCC CTATCTCTGC 1201 CACTCTCTGT CTCTGTTTTT CTTTCTCTTT CTCCCCTCCT TTGTATCACT CCCTAGGTCC 1261 GACATTGCCT CTTTCTCGAT TTCCGATTCT CTCTGAGTCT CTCTCTCTCT CTCTCTTTCT 1321 CTAACTCTTC TGTCTCTGTC CTTCCCTGTC TTCTTTTTCC TGTGACACTC TCCTGGAAAC 1381 GCATCAAAAA CTGAAGTGGC CGGATTTCAA GTGACAAACA GCACTACACC CAGAATCCGC 1441 CCCCTCCCCA ACACGGCTGC CTTCCTCCTC CACATCCCTG TCACGAAGAG AAGCCTATTG 1501 TGTCAGGCCC AGGGAGTTCG AGTTGAAGGG CCTGGGGGTC TGTGCTCTGA CTGCTCTCAG 1561 CCACTTCCCC ATTGGCTCCA AGCAGCCTGT GCTCAGCAAG GGCTGAGCGA CAGGGGAGGC 1621 TCTCGTCCAT AAAAGGGGGG AAGAGGCACC AGAACTGCCA TTTGAAGGGG CTTTGGTGGT 1681 GTTCACAGCT GCCTCTTTGG CACCTCCTCC CAAGCCGGAG TCAAGGAGGC CCCTGAGCCT 1741 TGGACCAGCC ACTGCCACCT CCGACCTGCT CGGCCAGAAG CTGCCCAGGG ACAAAGCAGA 1801 GTGCAGGTAA CATCCAGAGG GGGCACTGGA A // LOCUS HUMENKB2 221 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human enkephalin B gene, exon 2. ACCESSION K02267 KEYWORDS dynorphin; endorphin; enkephalin; leumorphin; neuropeptide; polyprotein; rimorphin. SEGMENT 2 of 4 SOURCE Human fetal liver DNA and, cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 221) AUTHORS Horikawa,S., Takai,T., Toyosato,M., Takahashi,H., Noda,M., Kakidani,H., Kubo,T., Hirose,T., Inayama,S., Hayashida,H., Miyata,T. and Numa,S. TITLE Isolation and structural organization of the human preproenkephalin B gene JOURNAL Nature 306, 611-614 (1983) STANDARD full staff_review FEATURES from to/span description pre-msg < 1 > 221 enkB mRNA IVS < 1 95 enkB mRNA intron A IVS 156 > 221 enkB mRNA intron B BASE COUNT 27 A 77 C 49 G 68 T ORIGIN About 1.2 kb after segment 1. 1 CCTCACTCCC ACCCCTCTGT GGCACCTTCC TTCTCTTTTC CCCTCCTCTT CTCCCCCCAG 61 TCCCACCACA CCTTGCCTTT GTTCTATTTT TGCAGGTCAT TTATCTTCAG GCTTTGAGAT 121 CTGCGTGGGG GGAGCTGTTG CAGCAGCCCA AGCCGGTGAG TTATGCTGCC TGGGGGTGGT 181 TTGCGTTCAC TGACGGGATC TCAAGCCCAC TGCTTTGCCC C // LOCUS HUMENKB3 349 bp ss-DNA PRI 03-SEP-1985 DEFINITION Human enkephalin B, exon 3. ACCESSION X00176 KEYWORDS dynorphin; endorphin; enkephalin; leumorphin; neuropeptide; neuroprotein; rimorphin. SEGMENT 3 of 4 SOURCE Human fetal liver DNA (genomic library of T.Maniatis) and cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 349) AUTHORS Horikawa,S., Takai,T., Toyosato,M., Takahashi,H., Noda,M., Kakidani,H., Kubo,T., Hirose,T., Inayama,S., Hayashida,H., Miyata,T. and Numa,S. TITLE Isolation and structural organization of the human preproenkephalin B gene JOURNAL Nature 306, 611-614 (1983) STANDARD full staff_review FEATURES from to/span description pept 134 + 262 preproenkaphalin B, exon 3 (first expressed exon) sigp 134 193 enkaphalin B signal peptide (putative) pre-msg < 1 > 349 enkB mRNA IVS < 1 117 enkB intron B IVS 263 > 349 enkB propept intron C BASE COUNT 66 A 95 C 102 G 86 T ORIGIN About 9.9 kb after segment 2. 1 TCAGAATGCT TTGCAGTGCT GTGGGCAGGA GTTAGAGATG ACTTTGTGGG CTCAGAACAC 61 AGTCTACAGA GTGGTGATCA CAGATTTCAT TATCTTCTCT TTCTTCCTCC CCAGCAGGAA 121 TTGCTGAGAC AGGATGGCCT GGCAGGGGCT GGTCCTGGCT GCCTGCCTCC TCATGTTCCC 181 CTCCACCACA GCGGACTGCC TGTCGCGGTG CTCCTTGTGT GCTGTAAAGA CCCAGGATGG 241 TCCCAAACCT ATCAATCCCC TGGTAGGTTT CAGGCAAGGT TCTTCAATGC CCAGGTCCTT 301 GGACCTGTGC GAGGCAGCCC AGAGAGGGGA GGTTGCAGGC CTGGGCAGC // LOCUS HUMENKB4 2275 bp ss-DNA PRI 03-SEP-1985 DEFINITION Human preproenkephalin B, exon 4. ACCESSION K02268 KEYWORDS dynorphin; endorphin; enkephalin; leumorphin; neuropeptide; neuroprotein; polyprotein; rimorphin. SEGMENT 4 of 4 SOURCE Human fetal liver DNA (genomic library of T.Maniatis) and cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2275) AUTHORS Horikawa,S., Takai,T., Toyosato,M., Takahashi,H., Noda,M., Kakidani,H., Kubo,T., Hirose,T., Inayama,S., Hayashida,H., Miyata,T. and Numa,S. TITLE Isolation and structural organization of the human preproenkephalin B gene JOURNAL Nature 306, 611-614 (1983) STANDARD full staff_review FEATURES from to/span description pept + 34 669 preproenkephalin B, exon 4 (second expressed exon) matp 427 453 beta-neo-endorphin matp 523 573 dynorphin matp 580 618 rimorphin matp 580 666 leumorphin pre-msg < 1 2232 enkB mRNA IVS < 1 33 enkB intron C BASE COUNT 596 A 518 C 558 G 603 T ORIGIN About 1.7 kb after segment 3. 1 TTTGCCACTG TGGTCTTTTT GGGTCTTTTG CAGATTTGCT CCCTGCAATG CCAGGCTGCC 61 CTGCTGCCCT CTGAGGAATG GGAGAGATGC CAGAGCTTTC TGTCTTTTTT CACCCCCTCC 121 ACCCTTGGGC TCAATGACAA GGAGGACTTG GGGAGCAAGT CGGTTGGGGA AGGGCCCTAC 181 AGTGAGCTGG CCAAGCTCTC TGGGTCATTC CTGAAGGAGC TGGAGAAAAG CAAGTTTCTC 241 CCAAGTATCT CAACAAAGGA GAACACTCTG AGCAAGAGCC TGGAGGAGAA GCTCAGGGGT 301 CTCTCTGACG GGTTTAGGGA GGGAGCAGAG TCTGAGCTGA TGAGGGATGC CCAGCTGAAC 361 GATGGTGCCA TGGAGACTGG CACACTCTAT CTCGCTGAGG AGGACCCCAA GGAGCAGGTC 421 AAACGCTATG GGGGCTTTTT GCGCAAATAC CCCAAGAGGA GCTCAGAGGT GGCTGGGGAG 481 GGGGACGGGG ATAGCATGGG CCATGAGGAC CTGTACAAAC GCTATGGGGG CTTCTTGCGG 541 CGCATTCGTC CCAAGCTCAA GTGGGACAAC CAGAAGCGCT ATGGCGGTTT TCTCCGGCGC 601 CAGTTCAAGG TGGTGACTCG GTCTCAGGAA GATCCGAATG CTTACTCTGG AGAGCTTTTT 661 GATGCATAAG CACCTCTTTT CATGGAGTAG AGTCAGGAGA AACCCCTGAC ACCTTTTCAG 721 GTTGGAGTGC ATTCATTCAT CCTCTTATAT GTGCCCCTTC CCCATGCTCA GCTCAGCATT 781 GTGTACAAAA TATCCAAGCC CAGCCTATCT CTCTTCTGCG TGGGAGTATG TTATTTCTCT 841 GGGGTCTGTG ATGGGGAAGG GTGGATGTCC CTTCCCCACA ATAGGCTTAG TGCTTGGCTC 901 AGACACCTAG ACTCTAAAAC TATCAGCAGC GGCAGCAGCA GCAGCAGCAG CAGTTTGTGA 961 TCTGTCCTTC CAACCTGTTC ACGTGACTCC TCAATTCCAG GGAACCAGAG CGATGTGTTC 1021 TTTGTACCTG TAGGTCTATG ATGTCCAAAC TTAACAGATC ACATGCCCCT CTTAGAAGAA 1081 ATATGAGCAT GCTCCCTCAT GCAGATAGTA TACACATCAT AAACAAAGAG TAGAACTTTA 1141 AAAGAAGGTA AAATAATCAT ACACAGAAAT CCTAACATTA TATTCCCAAA TCTCAAAAGA 1201 TCTCCTGTGC ACCTGACTTT GGAGACGATG CTTTAGGTAA AAGCTTAAAC ATTGCCTTAT 1261 ATTGGATCAG GAACCCTTAC AGTAGAGGGT CCAGTCTTCT AGTGGGTTTA ATGTTTAGTC 1321 AGTGTACTCT GAGTCCTCAT TGTTCAGAAA AGCACCTCTT GAAGAACTGA CTTCCTGAAC 1381 TCCCAGTCAT GTTGGTACCC TGGACAGTGC CTAACTCCTT ACAGAAGGGA GTGAAAACCT 1441 CTTTCGGAAA TGATTGAGAG CAGCCTCTTG AATGCTTAAA TGATCAAGGA GGGAGAAAGG 1501 CAAACCAATT TGTTCTGTGC AACAAACTCA AAATGTGGAC CAGTTCCCTC AGCCCTCATT 1561 AAACTAATTA AACTGATGGG TATCATGCTT CTACTCCATG GTGAACTGAA GCAGAGTCAA 1621 GCTGATGAAG TTAAGCACAA CCATGTTCTT GAGCAGCTGA ATTGGCTGCC AAGAGTCCAA 1681 GCCATCTGGC CCAACATACG CACTGGGCAT TGGGTAAGGG ACTCCAGAAG CAGCAGCTAG 1741 AAAGAGAAAA AGCCCTCTTC AATCCCCATA ATGCTTCTTT CCTCTTAATG TCTCAAAATA 1801 AAACCAGAAA GAGGAATAAA ATGATTAAGT GCTTGAGGCC AAATGAGTTC CCTTGATTCA 1861 AATAACCCTG AATCAGAGGC AGAGACCTCC TGATGTCTTG GTTTCCATCA AAGCCCTCCC 1921 TGTCTGTCTG TCTCTCTTTT GCTCTCTCAT TCCCAGGCAC TCTCTTTTGG TTTGTGGGTC 1981 CAGGAGATGA GGCTGGATAG GAGAGGAAAA GGCTTGAGTC TGGATAATTT GTATAAGATG 2041 CTGCTGAGCA CATCTCTTCA TGCGCAGTCC CCAGGTATCT GATGATGTTC TGAAATGGAT 2101 AGATTGTTTT AGAGTTATTT TGTGTCCTTT AAAAAAATCC CATTTATGCA ATTTACTTGG 2161 AATTTGCTTA GCCTTTAATA GGCTTGTGTA ATTTCCTGCT CCTCCAGTAC AATAAATAAA 2221 AGAAAGATGC TGATGACTTG GTGGGTGTGT GTGTACGTGC ACGTGCACAT GTGTG // LOCUS HUMERBBR 424 bp ds-DNA PRI 10-NOV-1986 DEFINITION Human mammary carcinoma v-erbB-related gene. ACCESSION N00027 KEYWORDS erbB-related protein; proto-oncogene. SOURCE Human mammary carcinoma MAC117 DNA, clone pMAC 117. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 424) AUTHORS King,C.R., Kraus,M.H. and Aaronson,S.A. TITLE Amplification of a novel v-erbB-related gene in a human mammary carcinoma JOURNAL Science 229, 974-976 (1985) STANDARD full staff_review COMMENT Draft entry for [1] kindly provided by C.R.King, 18-OCT-1985. The gene reported by [1] is closely related to, but different from the corresponding sequence of the erbB/EGF receptor. [1] identified a five kilobase transcript distinct from the EGF receptor gene. On the basis of its amplification in a human mammary carcinoma, this sequence must be a new member of the tyrosine kinase proto-oncogene family. FEATURES from to/span description pept / 27 182 v-erbB-related protein, exon X (AA at 27) 305 / 382 v-erbB-related protein, exon Y IVS < 1 26 MAC117 cds intron W IVS 183 304 MAC117 cds intron X IVS 383 > 424 MAC117 cds intron Y BASE COUNT 86 A 100 C 150 G 88 T ORIGIN 2 bp upstream of AccI site. 1 GTCTACATGG GTGCTTCCCA TTCCAGGGGA TGAGCTACCT GGAGGATGTG CGGCTCGTAC 61 ACAGGGACTT GGCCGCTCGG AACGTGCTGG TCAAGAGTCC CAACCATGTC AAAATTACAG 121 ACTTCGGGCT GGCTCGGCTG CTGGACATTG ACGAGACAGA GTACCATGCA GATGGGGGCA 181 AGGTTAGGTG AAGGACCAAG GAGCAGAGGA GGCTGGGTGG AGTGGTGTCT AGCCCATGGG 241 AGAACTCTGA GTGGCCACCT CCCCACAACA CACAGTTGGA GGACTTCCTC TTCTGCCCTC 301 CCAGGTGCCC ATCAAGTGGA TGGCGCTGGA GTCCATTCTC CGCCGGCGGT TCACCCACCA 361 GAGTGATGTG TGGAGTTATG GTGTGTGATG GGGGGTGTTG GGAGGGGTGG GTGAGGAGCC 421 ATGG // LOCUS HUMERP 3398 bp ds-DNA PRI 16-JUN-1986 DEFINITION Human erythropoietin gene, complete cds. ACCESSION X02158 KEYWORDS erythropoietin; glycoprotein. SOURCE Human DNA, clones lambda-HEPO[1,2,3,6] and fetal liver, cDNA to mRNA, clone lambda-HEPOFL13. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3398) AUTHORS Jacobs,K., Shoemaker,C., Rudersdorf,R., Neill,S.D., Kaufman,R.J., Mufson,A., Seehra,J., Jones,S.S., Hewick,R., Fritsch,E.F., Kawakita,M., Shimizu,T. and Miyake,T. TITLE Isolation and characterization of genomic and cDNA clones of human erythropoietin JOURNAL Nature 313, 806-810 (1985) STANDARD simple staff_review COMMENT The cDNA in clone lambda-HEPOFL8 starts at position 327. FEATURES from to/span description pept 615 627 erythropoietin precursor, exon 1 /nomgen="EPO" /map="7q21" /hgml_locus_uid="LD0046G" 1194 1339 erythropoietin precursor, exon 2 1596 1682 erythropoietin precursor, exon 3 2294 2473 erythropoietin precursor, exon 4 2608 2763 erythropoietin precursor, exon 5 sigp 615 627 erythropoietin signal peptide 1194 1261 erythropoietin signal peptide matp 1262 1339 erythropoietin 1596 1682 erythropoietin 2294 2473 erythropoietin 2608 2760 erythropoietin pre-msg < 615 3327 erp mRNA IVS 628 1193 erp intron A IVS 1340 1595 erp intron B IVS 1683 2293 erp intron C IVS 2474 2607 erp intron D BASE COUNT 698 A 1034 C 991 G 675 T ORIGIN 183 bp upstream of SmaI site. 1 AGCTTCTGGG CTTCCAGACC CAGCTACTTT GCGGAACTCA GCAACCCAGG CATCTCTGAG 61 TCTCCGCCCA AGACCGGGAT GCCCCCCAGG AGGTGTCCGG GAGCCCAGCC TTTCCCAGAT 121 AGCAGCTCCG CCAGTCCCAA GGGTGCGCAA CCGGCTGCAC TCCCCTCCCG CGACCCAGGG 181 CCCGGGAGCA GCCCCCATGA CCCACACGCA CGTCTGCAGC AGCCCCGTCA GCCCCGGAGC 241 CTCAACCCAG GCGTCCTGCC CCTGCTCTGA CCCCGGGTGG CCCCTACCCC TGGCGACCCC 301 TCACGCACAC AGCCTCTCCC CCACCCCCAC CCGCGCACGC ACACATGCAG ATAACAGCCC 361 CGACCCCCGG CCAGAGCCGC AGAGTCCCTG GGCCACCCCG GCCGCTCGCT GCGCTGCGCC 421 GCACCGCGCT GTCCTCCCGG AGCCGGACCG GGGCCACCGC GCCCGCTCTG CTCCGACACC 481 GCGCCCCCTG GACAGCCGCC CTCTCCTCCA GGCCCGTGGG GCTGGCCCTG CACCGCCGAG 541 CTTCCCGGGA TGAGGGCCCC CGGTGTGGTC ACCCGGCGCC CCAGGTCGCT GAGGGACCCC 601 GGCCAGGCGC GGAGATGGGG GTGCACGGTG AGTACTCGCG GGCTGGGCGC TCCCGCCCGC 661 CCGGGTCCCT GTTTGAGCGG GGATTTAGCG CCCCGGCTAT TGGCCAGGAG GTGGCTGGGT 721 TCAAGGACCG GCGACTTGTC AAGGACCCCG GAAGGGGGAG GGGGGTGGGG CAGCCTCCAC 781 GTGCCAGCGG GGACTTGGGG GAGTCCTTGG GGATGGCAAA AACCTGACCT GTGAAGGGGA 841 CACAGTTTGG GGGTTGAGGG GAAGAAGGTT TGGGGGGTTC TGCTGTGCCA GTGGAGAGGA 901 AGCTGATAAG CTGATAACCT GGGCGCTGGA GCCACCACTT ATCTGCCAGA GGGGAAGCCT 961 CTGTCACACC AGGATTGAAG TTTGGCCGGA GAAGTGGATG CTGGTAGCCT GGGGGTGGGG 1021 TGTGCACACG GCAGCAGGAT TGAATGAAGG CCAGGGAGGC AGCACCTGAG TGCTTGCATG 1081 GTTGGGGACA GGAAGGACGA GCTGGGGCAG AGACGTGGGG ATGAAGGAAG CTGTCCTTCC 1141 ACAGCCACCC TTCTCCCTCC CCGCCTGACT CTCAGCCTGG CTATCTGTTC TAGAATGTCC 1201 TGCCTGGCTG TGGCTTCTCC TGTCCCTGCT GTCGCTCCCT CTGGGCCTCC CAGTCCTGGG 1261 CGCCCCACCA CGCCTCATCT GTGACAGCCG AGTCCTGCAG AGGTACCTCT TGGAGGCCAA 1321 GGAGGCCGAG AATATCACGG TGAGACCCCT TCCCCAGCAC ATTCCACAGA ACTCACGCTC 1381 AGGGCTTCAG GGAACTCCTC CCAGATCCAG GAACCTGGCA CTTGGTTTGG GGTGGAGTTG 1441 GGAAGCTAGA CACTGCCCCC CTACATAAGA ATAAGTCTGG TGGCCCCAAA CCATACCTGG 1501 AAACTAGGCA AGGAGCAAAG CCAGCAGATC CTACGCCTGT GGCCAGGGCC AGAGCCTTCA 1561 GGGACCCTTG ACTCCCCGGG CTGTGTGCAT TTCAGACGGG CTGTGCTGAA CACTGCAGCT 1621 TGAATGAGAA TATCACTGTC CCAGACACCA AAGTTAATTT CTATGCCTGG AAGAGGATGG 1681 AGGTGAGTTC CTTTTTTTTT TTTTTTCCTT TCTTTTGGAG AATCTCATTT GCGAGCCTGA 1741 TTTTGGATGA AAGGGAGAAT GATCGAGGGA AAGGTAAAAT GGAGCAGCAG AGATGAGGCT 1801 GCCTGGGCGC AGAGGCTCAC GTCTATAATC CCAGGCTGAG ATGGCCGAGA TGGGAGAATT 1861 GCTTGAGCCC TGGAGTTTCA GACCAACCTA GGCAGCATAG TGAGATCCCC CATCTCTACA 1921 AACATTTAAA AAAATTAGTC AGGTGAAGTG GTGCATGGTG GTAGTCCCAG ATATTTGGAA 1981 GGCTGAGGCG GGAGGATCGC TTGAGCCCAG GAATTTGAGG CTGCAGTGAG CTGTGATCAC 2041 ACCACTGCAC TCCAGCCTCA GTGACAGAGT GAGGCCCTGT CTCAAAAAAG AAAAGAAAAA 2101 AGAAAAATAA TGAGGGCTGT ATGGAATACG TTCATTATTC ATTCACTCAC TCACTCACTC 2161 ATTCATTCAT TCATTCATTC AACAAGTCTT ATTGCATACC TTCTGTTTGC TCAGCTTGGT 2221 GCTTGGGGCT GCTGAGGGGC AGGAGGGAGA GGGTGACATC CCTCAGCTGA CTCCCAGAGT 2281 CCACTCCCTG TAGGTCGGGC AGCAGGCCGT AGAAGTCTGG CAGGGCCTGG CCCTGCTGTC 2341 GGAAGCTGTC CTGCGGGGCC AGGCCCTGTT GGTCAACTCT TCCCAGCCGT GGGAGCCCCT 2401 GCAGCTGCAT GTGGATAAAG CCGTCAGTGG CCTTCGCAGC CTCACCACTC TGCTTCGGGC 2461 TCTGGGAGCC CAGGTGAGTA GGAGCGGACA CTTCTGCTTG CCCTTTCTGT AAGAAGGGGA 2521 GAAGGGTCTT GCTAAGGAGT ACAGGAACTG TCCGTATTCC TTCCCTTTCT GTGGCACTGC 2581 AGCGACCTCC TGTTTTCTCC TTGGCAGAAG GAAGCCATCT CCCCTCCAGA TGCGGCCTCA 2641 GCTGCTCCAC TCCGAACAAT CACTGCTGAC ACTTTCCGCA AACTCTTCCG AGTCTACTCC 2701 AATTTCCTCC GGGGAAAGCT GAAGCTGTAC ACAGGGGAGG CCTGCAGGAC AGGGGACAGA 2761 TGACCAGGTG TGTCCACCTG GGCATATCCA CCACCTCCCT CACCAACATT GCTTGTGCCA 2821 CACCCTCCCC CGCCACTCCT GAACCCCGTC GAGGGGCTCT CAGCTCAGCG CCAGCCTGTC 2881 CCATGGACAC TCCAGTGCCA CCAATGACAT CTCAGGGGCC AGAGGAACTG TCCAGAGAGC 2941 AACTCTGAGA TCTAAGGATG TCACAGGGCC AACTTGAGGG CCCAGAGCAG GAAGCATTCA 3001 GAGAGCAGCT TTAAACTCAG GGACAGACCC ATGCTGGGAA GACGCCTGAG CTCACTCGGC 3061 ACCCTGCAAA ATTGATGCCA GGACACGCTT TGGAGGCGAT TTACCTGTTT TCGCACCTAC 3121 CATCAGGGAC AGGATGACCT GGAGAACTTA GGTGGCAAGC TGTGACTTCT CCAGGTCTCA 3181 CGGGCATGGG CACTCCCTTG GTGGCAAGAG CCCCCTTGAC ACCGGGGTGG TGGGAACCAT 3241 GAAGACAGGA TGGGGGCTGG CCTCTGGCTC TCATGGGGTC CAACTTTTGT GTATTCTTCA 3301 ACCTCATTGA CAAGAACTGA AACCACCAAT ATGACTCTTG GCTTTTCTGT TTTCTGGGAA 3361 CCTCCAAATC CCCTGGCTCT GTCCCACTCC TGGCAGCA // LOCUS HUMERPA 3602 bp ds-DNA PRI 04-AUG-1986 DEFINITION Human erythropoietin gene, complete cds. ACCESSION M11319 KEYWORDS erythropoietin. SOURCE Human fetal liver DNA (T. Maniatis library), clone lambda HE1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3602) AUTHORS Lin,F.-K., Suggs,S., Lin,C.-H., Browne,J.K., Smalling,R., Egrie,J.C., Chen,K.K., Fox,G.M., Martin,F., Stabinsky,Z., Badrawi,S.M., Lai,P.-H. and Goldwasser,E. TITLE Cloning and expression of the human erythropoietin gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 7580-7584 (1985) STANDARD full staff_review COMMENT Draft entry and sequence for [1] were kindly provided in computer-readable form by F.-K.Lin, 18-FEB-1986. The erythropoietin gene, when introduced into Chinese hamster ovary cells, produces the biologically active protein. FEATURES from to/span description pept 625 637 erythropoietin prepeptide, exon 1 /nomgen="EPO" /map="7q21" /hgml_locus_uid="LD0046G" 1201 1346 erythropoietin prepeptide, exon 2 1605 1691 erythropoietin prepeptide, exon 3 2303 2482 erythropoietin prepeptide, exon 4 2617 2772 erythropoietin prepeptide, exon 5 sigp 625 637 erythropoietin signal peptide 1201 1268 erythropoietin signal peptide matp 1269 1346 erythropoietin 1605 1691 erythropoietin 2303 2482 erythropoietin 2617 2769 erythropoietin pre-msg 625 3337 erp mRNA IVS 638 1200 erp intron A IVS 1347 1604 erp intron B IVS 1692 2302 erp intron C IVS 2483 2616 erp intron D rpt 1802 1811 direct repeat flanking Alu repeat 5' copy rpt 1813 2116 Alu repeat rpt 2119 2128 direct repeat flanking Alu repeat 3'copy BASE COUNT 737 A 1091 C 1063 G 711 T ORIGIN 1 bp upstream of HindIII site. 1 AAGCTTCTGG GCTTCCAGAC CCAGCTACTT TGCGGAACTC AGCAACCCAG GCATCTCTGA 61 GTCTCCGCCC AAGACCGGGA TGCCCCCCAG GGGAGGTGTC CGGGAGCCCA GCCTTTCCCA 121 GATAGCACGC TCCGCCAGTC CCAAGGGTGC GCAACCGGCT GCACTCCCCT CCCGCGACCC 181 AGGGCCCGGG AGCAGCCCCC ATGACCCACA CGCACGTCTG CAGCAGCCCC GCTCACGCCC 241 CGGCGAGCCT CAACCCAGGC GTCCTGCCCC TGCTCTGACC CCGGGTGGCC CCTACCCCTG 301 GCGACCCCTC ACGCACACAG CCTCTCCCCC ACCCCCACCC GCGCACGCAC ACATGCAGAT 361 AACAGCCCCG ACCCCCGGCC AGAGCCGCAG AGTCCCTGGG CCACCCCGGC CGCTCGCTGC 421 GCTGCGCCGC ACCGCGCTGT CCTCCCGGAG CCGGACCGGG GCCACCGCGC CCGCTCTGCT 481 CCGACACCGC GCCCCCTGGA CAGCCGCCCT CTCCTCTAGG CCCGTGGGGC TGGCCCTGCA 541 CCGCCGAGCT TCCCGGGATG AGGGCCCCCG GTGTGGTCAC CCGGCGCGCC CCAGGTCGCT 601 GAGGGACCCC GGCCAGGCGC GGAGATGGGG GTGCACGGTG AGTACTCGCG GGCTGGGCGC 661 TCCCGCCGCC CGGGTCCCTG TTTGAGCGGG GATTTAGCGC CCCGGCTATT GGCCAGGAGG 721 TGGCTGGGTT CAAGGACCGG CGACTTGTCA AGGACCCCGG AAGGGGGAGG GGGGTGGGGC 781 AGCCTCCACG TGCCAGCGGG GACTTGGGGG AGTCCTTGGG GATGGCAAAA ACCTGACCTG 841 TGAAGGGGAC ACAGTTTGGG GGTTGAGGGG AAGAAGGTTT GGGGGTTCTG CTGTGCCAGT 901 GGAGAGGAAG CTGATAAGCT GATAACCTGG GCGCTGGAGC CACCACTTAT CTGCCAGAGG 961 GGAAGCCTCT GTCACACCAG GATTGAAGTT TGGCCGGAGA AGTGGATGCT GGTAGCTGGG 1021 GGTGGGGTGT GCACACGGCA GCAGGATTGA ATGAAGGCCA GGGAGGCAGC ACCTGAGTGC 1081 TTGCATGGTT GGGGACAGGA AGGACGAGCT GGGGCAGAGA CGTGGGGATG AAGGAAGCTG 1141 TCCTTCCACA GCCACCCTTC TCCCTCCCCG CCTGACTCTC AGCCTGGCTA TCTGTTCTAG 1201 AATGTCCTGC CTGGCTGTGG CTTCTCCTGT CCCTGCTGTC GCTCCCTCTG GGCCTCCCAG 1261 TCCTGGGCGC CCCACCACGC CTCATCTGTG ACAGCCGAGT CCTGGAGAGG TACCTCTTGG 1321 AGGCCAAGGA GGCCGAGAAT ATCACGGTGA GACCCCTTCC CCAGCACATT CCACAGAACT 1381 CACGCTCAGG GCTTCAGGGA ACTCCTCCCA GATCCAGGAA CCTGGCACTT GGTTTGGGGT 1441 GGAGTTGGGA AGCTAGACAC TGCCCCCCTA CATAAGAATA AGTCTGGTGG CCCCAAACCA 1501 TACCTGGAAA CTAGGCAAGG AGCAAAGCCA GCAGATCCTA CGGCCTGTGG GCCAGGGCCA 1561 GAGCCTTCAG GGACCCTTGA CTCCCCGGGC TGTGTGCATT TCAGACGGGC TGTGCTGAAC 1621 ACTGCAGCTT GAATGAGAAT ATCACTGTCC CAGACACCAA AGTTAATTTC TATGCCTGGA 1681 AGAGGATGGA GGTGAGTTCC TTTTTTTTTT TTTTTCCTTT CTTTTGGAGA ATCTCATTTG 1741 CGAGCCTGAT TTTGGATGAA AGGGAGAATG ATCGGGGGAA AGGTAAAATG GAGCAGCAGA 1801 GATGAGGCTG CCTGGGCGCA GAGGCTCACG TCTATAATCC CAGGCTGAGA TGGCCGAGAT 1861 GGGAGAATTG CTTGAGCCCT GGAGTTTCAG ACCAACCTAG GCAGCATAGT GAGATCCCCC 1921 ATCTCTACAA ACATTTAAAA AAATTAGTCA GGTGAAGTGG TGCATGGTGG TAGTCCCAGA 1981 TATTTGGAAG GCTGAGGCGG GAGGATCGCT TGAGCCCAGG AATTTGAGGC TGCAGTGAGC 2041 TGTGATCACA CCACTGCACT CCAGCCTCAG TGACAGAGTG AGGCCCTGTC TCAAAAAAGA 2101 AAAGAAAAAA GAAAAATAAT GAGGGCTGTA TGGAATACAT TCATTATTCA TTCACTCACT 2161 CACTCACTCA TTCATTCATT CATTCATTCA ACAAGTCTTA TTGCATACCT TCTGTTTGCT 2221 CAGCTTGGTG CTTGGGGCTG CTGAGGGGCA GGAGGGAGAG GGTGACATGG GTCAGCTGAC 2281 TCCCAGAGTC CACTCCCTGT AGGTCGGGCA GCAGGCCGTA GAAGTCTGGC AGGGCCTGGC 2341 CCTGCTGTCG GAAGCTGTCC TGCGGGGCCA GGCCCTGTTG GTCAACTCTT CCCAGCCGTG 2401 GGAGCCCCTG CAGCTGCATG TGGATAAAGC CGTCAGTGGC CTTCGCAGCC TCACCACTCT 2461 GCTTCGGGCT CTGGGAGCCC AGGTGAGTAG GAGCGGACAC TTCTGCTTGC CCTTTCTGTA 2521 AGAAGGGGAG AAGGGTCTTG CTAAGGAGTA CAGGAACTGT CCGTATTCCT TCCCTTTCTG 2581 TGGCACTGCA GCGACCTCCT GTTTTCTCCT TGGCAGAAGG AAGCCATCTC CCCTCCAGAT 2641 GCGGCCTCAG CTGCTCCACT CCGAACAATC ACTGCTGACA CTTTCCGCAA ACTCTTCCGA 2701 GTCTACTCCA ATTTCCTCCG GGGAAAGCTG AAGCTGTACA CAGGGGAGGC CTGCAGGACA 2761 GGGGACAGAT GACCAGGTGT GTCCACCTGG GCATATCCAC CACCTCCCTC ACCAACATTG 2821 CTTGTGCCAC ACCCTCCCCC GCCACTCCTG AACCCCGTCG AGGGGCTCTC AGCTCAGCGC 2881 CAGCCTGTCC CATGGACACT CCAGTGCCAG CAATGACATC TCAGGGGCCA GAGGAACTGT 2941 CCAGAGAGCA ACTCTGAGAT CTAAGGATGT CACAGGGCCA ACTTGAGGGC CCAGAGCAGG 3001 AAGCATTCAG AGAGCAGCTT TAAACTCAGG GACAGAGCCA TGCTGGGAAG ACGCCTGAGC 3061 TCACTCGGCA CCCTGCAAAA TTTGATGCCA GGACACGCTT TGGAGGCGAT TTACCTGTTT 3121 TCGCACCTAC CATCAGGGAC AGGATGACCT GGAGAACTTA GGTGGCAAGC TGTGACTTCT 3181 CCAGGTCTCA CGGGCATGGG CACTCCCTTG GTGGCAAGAG CCCCCTTGAC ACCGGGGTGG 3241 TGGGAACCAT GAAGACAGGA TGGGGGCTGG CCTCTGGCTC TCATGGGGTC CAAGTTTTGT 3301 GTATTCTTCA ACCTCATTGA CAAGAACTGA AACCACCAAT ATGACTCTTG GCTTTTCTGT 3361 TTTCTGGGAA CCTCCAAATC CCCTGGCTCT GTCCCACTCC TGGCAGCAGT GCAGCAGGTC 3421 CAGGTCCGGG AAATGAGGGG TGGAGGGGGC TGGGCCCTAC GTGCTGTCTC ACACAGCCTG 3481 TCTGACCTCT CGACCTACCG GCCTAGGCCA CAAGCTCTGC CTACGCTGGT CAATAAGGTG 3541 TCTCCATTCA AGGCCTCACC GCAGTAAGGC AGCTGCCAAC CCTGCCCAGG GCAAGGCTGC 3601 AG // LOCUS HUMERYA 3074 bp ds-DNA PRI 30-SEP-1988 DEFINITION Human erythrocyte alpha-spectrin gene, 3' end. ACCESSION M13233 KEYWORDS . SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3074) AUTHORS Linnenbach,A.J., Speicher,D.W., Marchesi,V.T. and Forget,B.G. TITLE Cloning of a portion of the chromosomal gene for human erythrocyte alpha-spectrin by using a synthetic gene fragment JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 2397-2401 (1986) STANDARD simple staff_entry FEATURES from to/span description pept / 44 198 erythrocyte alpha-spectrin, exon x (AA at 44) (no splice consensus) /nomgen="SPTA1" /map="1q21" /hgml_locus_uid="LS0139Y" 1664 1799 erythrocyte alpha-spectrin, exon x+1 (no splice consensus) 1909 / 2010 erythrocyte alpha-spectrin, exon x+2 (no splice consensus) IVS 199 1665 intron y (no splice consensus) IVS 1800 1908 intron y+1 IVS 2011 > 3074 intron y+2 BASE COUNT 920 A 626 C 576 G 952 T ORIGIN 138 bp upstream of PvuII site. 1 AGAGTACTTG GATTTCCTAC TAGGTTTCTT TTTCATCCCA AAGGTGAAGG AGTTATGTGC 61 TAAAGCAGAG AAGCTGACAC TTTCCCATCC TTCAGATGCA CCTCAGATCC AGGAGATGAA 121 AGAAGATCTG GTCTCCAGCT GGGAGCATAT TCGTGCCCTG GCCACCAGCA GATATGAAAA 181 ACTGCAGGCT ACTTATTGGT GGGAAATCCC TCCCCTTTAT TGCTCTACCT ATCTCTACAG 241 GGCCCAGATA TAATAGCGAG AGAAGAAATA AGAGTAAAGG GAGTCAAATA GCGAGAGAAT 301 CAGCTCTCCT GCCTGCTCAG TAGAGACAAA ATATTTAAGT TGCCAAAGGA AGTCCTTTGT 361 GACTGATATA ATATAGTAGC ACCCCCTTAT CTGTGGTTTC AGTTACCAGC AGTCAACCAC 421 AGTTTGAAAA TGTTAAATGG AAAATTCCAG TAATAAATAA CTTATAAGTT TTAAATTGCT 481 TATTATTCTA AGAGGCATGA TAAAATTTCT GGCTGGTCCA CTTTGTCCCA CTGAAAGATG 541 AATCATTCCT TTGTCCAGCA TATATGTGCT GTATATCTAC TCACTCGTTA GTCACTTAGT 601 AGCAGTCTAA GTTATGGATT GACTGTCTTA GTATCACAGT GCTTATGTTT AAATAATTCT 661 TATTTTATTT AATTGTTCTA TTTTATCATT AGTTATTGTT TGTTAATCTC TTGCTGTACC 721 TAATTTATAA ATCAAACTCT ATCATAGGCA TATAGGTATA TGAAAAAATG TAGTGTATAT 781 AGTGTTTGGT ACTATCTGCA GTTTCTGGCA CCTGCTGGGC ATACTGAAAT GTATCTCCCA 841 CAAATAAAGG GAGACTCCTG TGATCAGCAT CACACCTCCA CTCATCTGCA AATTTCCATC 901 TTCCCCTGCT TGTATTATTG TGGGATGCTA TGAAATGGCT CAAAAAGCTG GAAAAGCAAC 961 TGAAGAAACA TTTCAAAATT CTGACTCTAG ATAAGGAACT GAAAGACATG CAATACAAGC 1021 TGAAGAGCAT CAAAATAACT TCAGTATTAT TTCCTGAATG TGGATTACAA GCTTACAAAA 1081 ACCCCTAATA ATCAGGCATC ATTTCAGTAT TAATTCTTAT GGTTAGTCCT TAAATAACCA 1141 GAGGACAGTG CTTGTCATAT GATTGTGATT TCGAAACAAA AGATGAGCAT TTAACTTTTT 1201 AAAATAGGAA ACTGTTTTGC AGCTCTTGTT TCTTATTAAC AGGGTTAAAG ATGAGCTAGA 1261 TAGATGAGGA ATCCTAACGC TAAGCCTAGT TCTTCCTTCC CATAAGGTGA TTCTTATTTA 1321 TATAACAACA TGTATTGTTT ATTGGTTGCA TTATGAGCAC CTTACTCCAT TTTAAGTACC 1381 TCTAGGGTAG GTGTGACCTT GAGGTGGTGG AGGTGTAATG CTCATGGGTC ACATCCCACA 1441 GCAACACACG TACCCTAGAA ATATAAAACC TGGGGTGAAG GACTTACAAC CTGCACCACT 1501 AATCTCTGAC TGTAAATAAC AGGAAATAAT CAGGAAAAAG GGAACCTAGG ATTAGAGTAA 1561 GGAGCATCTC ACACATCTTA CCTCACCCCA AACCTAAATC TTTGTTCAAT GTCCTTTCAT 1621 ATCTGGTTTC TCCTTCCTGT CTTTTACCTC ACTTTTCCCA CAGGTACCAT CGATTTTCAT 1681 CTGACTTTGA TGAACTCTCA GGCTGGATGA ACGAGAAGAC TGCTGCGATC AATGCTGATG 1741 AGCTGCCAAC AGATGTGGCT GGTGGAGAAG TTCTGCTGGA CAGGCATCAG CAGCATAAGG 1801 TAGAGAAGAA AGGCTGCCCA GTAGGAGGAG GGGGCAGGTT ATTTGGCTGA ATGCCAGAAA 1861 TGCCAGGTTT CTTGCTGCAT TTTGAGACAC TTTGTTTTGT GGCCACAGCA TGAGATTGAC 1921 TCTTACGATG ACCGATTTCA ATCTGCTGAT GAGACTGGTC AAGACCTCGT GAATGCCAAT 1981 CATGAAGCCT CTGATGAAGT TCGGGAAAAG GTAATCTAGT TTAACAGAGT TTGTCAAATT 2041 CCAATTATAT ATGTAGTCAT TACATAATCT CACAATAATA TGATAATCTC TAAATGGGAA 2101 AAGAATAGAA GTCATTTAAA CGTGTTAATA ATATGAATCA AACTTTTACA CTGCTTCCTT 2161 TCATATACAG TTATCACTTA TTATCCATGG GGCATTGGTT CCAGGACCTC CCTCAGGTGC 2221 CAAAATCTGC AGATGCTCAA GAAACGGCAT AATATTTGCA TACGATGCAT GTACATTCTC 2281 CTGTACACTT TAAATTATCT TCAGATTACT TATAATACTT CATACAATGT AAATGCTATG 2341 TAAATAATTG TTATACCGTA TTATTTAGGG AATAAGCAAA AAGTCTGTAC ATGCTCAGTA 2401 CAGATGGAAT TTTTAAAAAA TATTTTATAC CCATGGTTGG TTAAATCTAC AGGTGTGGAA 2461 CCTACAGATA TGGGGAGTTA ACTCTACTGT TTTGCTAAAT TGAGTCACTT GAAATTTTCA 2521 AGACATGATG TGTGCTTCCT CATCTCTGAG CATTTATTTC ATCAACCCTC TGACTTCTAC 2581 CTAATTTGAA ACCTCCATTC CATTTTGCCT CCTGACGTAT GCCCAGTTTC GTGACCTTCC 2641 CATGCTCATC TCTAGACTGT AAACTCCAGG AGGACTGCAC CTTATTCATT TCTGGGTTCC 2701 CCACAGTGCC TAGTACAGCA TCTGACATGG TTGACATCTG TGCATGTTTT TCTCTCCTCT 2761 ATACAATCTC AAATTAAGCT GACATTCCTG CATCTGTGAT TTTCAAATTT GCATTAAATA 2821 ACAGGACAAT CTTAAACACC CAGCATTCCA CTGAATGAAG AAATCAGTTT TACCATCATC 2881 CTAACCTCCT ACATCATAGG GCTTCACCTC TGCACCTTGA TCCAGAGAGA TTTTGCACCA 2941 GTGGCAGAGT CTAGGCACAC CTTGGAATGA TAGTTACAGT AAGATCCGCA TGCCTCTTTG 3001 CATCTCACTA GATGCCAAGC ACAGCCTCTA GCACTGTGCA GTGGATGAGG TTGAGATAAC 3061 CGAGTTTTGA ATTC // LOCUS HUMETN1 682 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human endothelin (ETN) gene, complete cds. ACCESSION M25377 J04819 KEYWORDS endothelin. SEGMENT 1 of 4 SOURCE Human leukocyte DNA and human umbilical vein endothelial cell cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 682) AUTHORS Bloch,K.D., Friedrich,S.P., Lee,M.-E., Eddy,R.L., Shows,T.B. and Quertermous,T. TITLE Structural organization and chromosomal assignment of the gene encoding endothelin JOURNAL J. Biol. Chem. 264, 10851-10857 (1989) STANDARD simple staff_review COMMENT Draft entry and computer readable copy of sequence [1] kindly submitted by K.D.Bloch (06-AUG-89). FEATURES from to/span description pept 519 + 582 endothelin prepropeptide, exon 1 /nomgen="EDN1" /map="unassigned" /hgml_locus_uid="LU0066V" pre-msg 252 > 682 ETN mRNA and introns (5' end put.) IVS 583 > 682 ETN intron A BASE COUNT 121 A 178 C 169 G 213 T 1 others ORIGIN Chromosome 6. 1 GGCAGGTTTA GCAAAGGTCT CTAATGGGNA TTTTATTTTT CTTAGCCCTG CCCCCGAATT 61 GTCAGACGGC GGCGTCTGCC TCTGAAGTTA GCAGTGATTT CTTTCGGGCC TGGCCTTATC 121 TCCGGCTGCA CGTTGCCTGT TGGTGACTAA TAACACAATA ACATTGTCTG GGGCTGGAAT 181 AAAGTCGGAG CTGTTTACCC CCACTCTAAT AGGGGTTCAA TATAAAAAGC CGGCAGAGAG 241 CTGTCCAAGT CAGACGCGCC TCTGCATCTG CGCCAGGCGA ACGGGTCCTG CGCCTCCTGC 301 AGTCCCAGCT CTCCACCGCC GCGTGCGCCT GCAGACGCTC CGCTCGCTGC CTTCTCTCCT 361 GGCAGGCGCT GCTTTTCTCC CCGTTAAAGG GCACTTGGGC TGAAGGATCG CTTTGAGATC 421 TGAGGAACCC GCAGCGCTTT GAGGGACCTG AAGCTGTTTT TCTTCGTTTT CCTTTGGGTT 481 CAGTTTGAAC GGGAGGTTTT TGATCCCTTT TTTTCAGAAT GGATTATTTG CTCATGATTT 541 TCTCTCTGCT GTTTGTGGCT TGCCAAGGAG CTCCAGAAAC AGGTAGGCAC GCTCGTTGAC 601 TTGTAAGTCT CGGAATTACA AGTTAGTGTG TTCTTATCCA CCTTCATGCT TTTCTTGCTT 661 CTATTTTTCC CCGTTCTTTT TA // LOCUS HUMETN2 282 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human endothelin (ETN) gene, complete cds. ACCESSION M25378 J04819 KEYWORDS endothelin. SEGMENT 2 of 4 SOURCE Human leukocyte DNA and human umbilical vein endothelial cell cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 282) AUTHORS Bloch,K.D., Friedrich,S.P., Lee,M.-E., Eddy,R.L., Shows,T.B. and Quertermous,T. TITLE Structural organization and chromosomal assignment of the gene encoding endothelin JOURNAL J. Biol. Chem. 264, 10851-10857 (1989) STANDARD simple staff_review COMMENT Draft entry and computer readable copy of sequence [1] kindly submitted by K.D.Bloch (06-AUG-89). FEATURES from to/span description pept + 101 + 269 endothelin prepropetide, exon 2 /nomgen="EDN1" /map="unassigned" /hgml_locus_uid="LU0066V" matp 193 255 endothelin mature peptide pre-msg < 1 > 282 ENT mRNA and introns IVS < 1 100 ENT intron A IVS 270 > 282 ENT intron B BASE COUNT 49 A 91 C 71 G 71 T ORIGIN About 1.5 kb after segment 1. 1 TGCTTTTATC GTCTCTGCTA GCTCTGACTC TACTGTGATC CAGCATGTCT CTCGGCGTTT 61 GAGGAGACAT CCCCCACTGA CCTGCTCTTT CTCTCCCCAG CAGTCTTAGG CGCTGAGCTC 121 AGCGCGGTGG GTGAGAACGG CGGGGAGAAA CCCACTCCCA GTCCACCCTG GCGGCTCCGC 181 CGGTCCAAGC GCTGCTCCTG CTCGTCCCTG ATGGATAAAG AGTGTGTCTA CTTCTGCCAC 241 CTGGACATCA TTTGGGTCAA CACTCCCGAG TAAGTCTCTA GA // LOCUS HUMETN3 664 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human endothelin (ETN) gene, complete cds. ACCESSION M25379 J04819 KEYWORDS endothelin. SEGMENT 3 of 4 SOURCE Human leukocyte DNA and human umbilical vein endothelial cell cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 664) AUTHORS Bloch,K.D., Friedrich,S.P., Lee,M.-E., Eddy,R.L., Shows,T.B. and Quertermous,T. TITLE Structural organization and chromosomal assignment of the gene encoding endothelin JOURNAL J. Biol. Chem. 264, 10851-10857 (1989) STANDARD simple staff_review COMMENT Draft entry and computer readable copy of sequence [1] kindly submitted by K.D.Bloch (06-AUG-89). FEATURES from to/span description pept + 101 256 endothelin prepropeptide, exon 3 /nomgen="EDN1" /map="unassigned" /hgml_locus_uid="LU0066V" 421 + 564 endothelin prepropeptide, exon 4 pre-msg < 1 > 664 ETN mRNA and introns IVS < 1 100 ETN intron B IVS 257 420 ETN intron C IVS 565 > 664 ETN intron D BASE COUNT 218 A 125 C 153 G 168 T ORIGIN About 1.3 kb after segment 2. 1 GATGAGCTCC TTGTGTGCCC AGTGGAATAG GTGTGTCCAT GTGTCATTTT AAAGACTATT 61 AATTACACTA ATATAGTTTC TTTCTCTCTT TGGATAATAG GCACGTTGTT CCGTATGGAC 121 TTGGAAGCCC TAGGTCCAAG AGAGCCTTGG AGAATTTACT TCCCACAAAG GCAACAGACC 181 GTGAGAATAG ATGCCAATGT GCTAGCCAAA AAGACAAGAA GTGCTGGAAT TTTTGCCAAG 241 CAGGAAAAGA ACTCAGGTGA GCAGAAACAC CTTTGCTTTT CAATCAGTTT AACAGCCTCC 301 TGAACTCCTT CCTATCATGG TACTGCCTTC CTGTTTTAGA GAGACTAACA GAGACATTGA 361 AAGTCAGGGT AAAGCTGAAT ATAACATTGC TGAAATGTTT TTCCTTGTGT ATTTTAACAG 421 GGCTGAAGAC ATTATGGAGA AAGACTGGAA TAATCATAAG AAAGGAAAAG ACTGTTCCAA 481 GCTTGGGAAA AAGTGTATTT ATCAGCAGTT AGTGAGAGGA AGAAAAATCA GAAGAAGTTC 541 AGAGGAACAC CTAAGACAAA CCAGGTAAGA GGGAAGGAAG AAAAATTAGG TAAGAGGTTC 601 ACAAGAACAA CTAGCCCCAG TCAGTGATGC CAGCAGCCTG TTCCTCCAGC CCTTCTTACC 661 CGGG // LOCUS HUMETN4 1330 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human endothelin (ETN) gene, complete cds. ACCESSION M25380 J04819 KEYWORDS endothelin. SEGMENT 4 of 4 SOURCE Human leukocyte DNA and human umbilical vein endothelial cell cDNA to mRNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1330) AUTHORS Bloch,K.D., Friedrich,S.P., Lee,M.-E., Eddy,R.L., Shows,T.B. and Quertermous,T. TITLE Structural organization and chromosomal assignment of the gene encoding endothelin JOURNAL J. Biol. Chem. 264, 10851-10857 (1989) STANDARD simple staff_review COMMENT Draft entry and computer readable copy of sequence [1] kindly submitted by K.D.Bloch (06-AUG-89). FEATURES from to/span description pept + 101 206 endothelin prepropeptide, exon 5 /nomgen="EDN1" /map="unassigned" /hgml_locus_uid="LU0066V" pre-msg < 1 1330 ENT mRNA and introns IVS < 1 100 ENT intron D BASE COUNT 392 A 271 C 257 G 410 T ORIGIN About 1.3 kb after segment 3. 1 TCAGGTTTTG TTTGTGCCAG ATTCTAATTT TACATGTTTC TTTTGCCAAA GGGTGATTTT 61 TTTAAAATAA CATTTGTTTT CTCTTATCTT GCTTTATTAG GTCGGAGACC ATGAGAAACA 121 GCGTCAAATC ATCTTTTCAT GATCCCAAGC TGAAAGGCAA GCCCTCCAGA GAGCGTTATG 181 TGACCCACAA CCGAGCACAT TGGTGACAGA CTTCGGGGCC TGTCTGAAGC CATAGCCTCC 241 ACGGAGAGCC CTGTGGCCGA CTCTGCACTC TCCACCCTGG CTGGGATCAG AGCAGGAGCA 301 TCCTCTGCTG GTTCCTGACT GGCAAAGGAC CAGCGTCCTC GTTCAAAACA TTCCAAGAAA 361 GGTTAAGGAG TTCCCCCAAC CATCTTCACT GGCTTCCATC AGTGGTAACT GCTTTGGTCT 421 CTTCTTTCAT CTGGGGATGA CAATGGACCT CTCAGCAGAA ACACACAGTC ACATTCGAAT 481 TCGGGTGGCA TCCTCCGGAG AGAGAGAGAG GAAGGAGATT CCACACAGGG GTGGAGTTTC 541 TGACGAAGGT CCTAAGGGAG TGTTTGTGTC TGACTCAGGC GCCTGGCACA TTTCAGGGAG 601 AAACTCCAAA GTCCACACAA AGATTTTCTA AGGAATGCAC AAATTGAAAA CACACTCAAA 661 AGACAAACAT GCAAGTAAAG AAAAAAAAAA GAAAGACTTT TGTTTAAATT TGTAAAATGC 721 AAAACTGAAT GAAACTGTTA CTACCATAAA TCAGGATATG TTTCATGAAT ATGAGTCTAC 781 CTCACCTATA TTGCACTCTG GCAGAAGTAT TTCCCACATT TAATTATTGC CTCCCCAAAC 841 TCTTCCCACC CCTGCTGCCC CTTCCTCCAT CCCCATACTA AATCCTAGCC TCGTAGAAGT 901 CTGGTCTAAT GTGTCAGCAG TAGATATAAT ATTTTCATGG TAATCTACTA GCTCTGATCC 961 ATAAGAAAAA AAAGATCATT AAATCAGGAG ATTCCCTGTC CTTGATTTTT GGAGACACAA 1021 TGGTATAGGG TTGTTTATGA AATATATTGA AAAGTAAGTG TTTGTTACGC TTTAAAGCAG 1081 TAAAATTATT TTCCTTTATA TAACCGGCTA ATGAAAGAGG TTGGATTGAA TTTTGATGTA 1141 CTTATTTTTT TATAGATATT TATATTCAAA CAATTTATTC CTTATATTTA CCATGTTAAA 1201 TATCTGTTTG GGCAGGCCAT ATTGGTCTAT GTATTTTTAA AATATGTATT TCTAAATGAA 1261 ATTGAGAACA TGCTTTGTTT TGCCTGTCAA GGTAGTGACT TTAGAAAATA AATATTTTTT 1321 TCCTTACTGT // LOCUS HUMFABP 5204 bp ds-DNA PRI 15-JUN-1988 DEFINITION Human, intestinal fatty acid binding protein gene, complete cds, and an Alu repetitive element. ACCESSION M18079 J03465 KEYWORDS Alu repetitive sequence; fatty acid binding protein. SOURCE Human DNA (library of T.Maniatis), clone lambda-HIFABP. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 5204) AUTHORS Sweetser,D., Birkenmeier,E., Klisak,I.J., Zollman,S., Sparkes,R.S., Mohandas,T., Lusis,A.J. and Gordon,J.I. TITLE The human and rodent intestinal fatty acid binding protein genes: A comparative analysis of their structure, expression, and linkage relationships JOURNAL J. Biol. Chem. 262, 16060-16071 (1987) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by D.Sweetser, 19-JAN-1988 FEATURES from to/span description pept 1089 1155 fatty acid binding protein, exon 1 /nomgen="FABP2" /map="4q28-q31" /hgml_locus_uid="LQ0114Z" 2350 2522 fatty acid binding protein, exon 2 3546 3653 fatty acid binding protein, exon 3 4098 4148 fatty acid binding protein, exon 4 pre-msg 1028 > 4393 FABPI mRNA (alt.) and introns pre-msg 1053 > 4393 FABPI mRNA (alt.) and introns IVS 1156 2349 FABPI intron A IVS 2523 3545 FABPI intron B IVS 3654 4097 FABPI intron C rpt 4466 4742 Alu repeat rpt 4466 4472 5' direct repeat rpt 4736 4742 3' direct repeat BASE COUNT 1770 A 867 C 836 G 1731 T ORIGIN 156 bp upstream of HindIII site; chromosome 4q28-q31. 1 GTAATATCTT GGGCAAGCCC TAGAGCTTCT TTCCTGACCC TTAGTTAATA AGATGTTATC 61 TGGTCACATT CAGTCACAAT AATAGACTCA TTTTAGTAAT AAACATCTTA AGACTAGTAA 121 TTAAAACTCT TTACTTCACA CCAAGTTTCC TCCCCAAGCT TGGCCTGTTC CTGGCTGGCA 181 GCCTGAAGTA GGGAAAGGAG AGATATGGTG ACCTTTTCTT TGTACCTTTC TAGCTACCCT 241 CTATACCCTG ACCCCACATA CATAATTGAG CTGTGGCTTC TGACTCTACT GGGTTTGGGG 301 ATGAGAGGCA GTGAGAGTAA AATGAAGGAG TGGTTTTAAT TAATGGCACA GCTAAAACTG 361 GATTTTGTTC TCTCTGCACA TGGCAGATGT TTAAAGCTCA TTCTTTCTTT TATGCAAGTT 421 TTTACACCAT CCAGCCTCAT TTGTACCTCT TGAATTTTTG CTCAGTGGCC TATCACCATT 481 CAGGATCAAG ACAAAAATCA ATGAGCACTT ATTGTGTGTC ATGCACCCTA CAAAGTGCCA 541 GGATATTTAT CCAAACTCCT GGCAATGCTA AACACAATGC AAAAAGACAT ATTAGAAAAC 601 GAATCTTATT AACTTTAGCT TTTCAACTGT ATTTCATCAT AAAGTCTTAC TTTACAAGAT 661 AATTGCTGTT GTGAAAAAGG GAAAGGTCAT GGTCTCATTT CCCAGATGTT ATTTGATATA 721 TGCTATAAAT TATATTACCT CCAACATAGT CTGCACTTTG AACTTAGAAA AACAATCTTC 781 AGACGGCATG CATTCTAATT CTTGAAATAA GTATGCCCAC AAACTGTAGT TTAAGACAGA 841 ATAGGTATGC TTCTCATGTT TTAATTCAGT TGAATTTCAG AAGATCTCAG GAATGTACAG 901 AACGAGAATT AAGAATTAAT AAGAATAAGA ATTAATTAAT TGCTTGACAT AGAGTAGTTA 961 GGTGATTTCC TGAACTTTAA GCTTCCACAT CACAGTATGA AGTTGGTTCA AGATAAGAAA 1021 TATAATAAAT TCTCGCCCAA GGACAGACCT GAATCTCTAG CTGCCTAGAG GCTGACTCAA 1081 CTGAAATCAT GGCGTTTGAC AGCACTTGGA AGGTAGACCG GAGTGAAAAC TATGACAAGT 1141 TCATGGAAAA AATGGGTAAA GACTTTATTT CTTTGTGGCT CATTCTTTGC TTTCTTACAA 1201 ACATTTTTCT TTCTAACTCC TAAATCTCTA GGAGATTACA GATAGCTTAC AGATAGCTCC 1261 TGATGTGGTA GAGAGGGATC CAGAAGATGT TCAGAGGAGG GAAACCATAT TTTCCCTTCT 1321 TACATTAGGA AGAATCCACT ATCTCACTAA TGGAAGAAAA GATTCTTTGA GTGCTGTTCT 1381 CTGAAACACA CCAAAAAGAT CCAGAAATGT TTCCTTCACT CTTTAACTGA AAAATGACTT 1441 TTTTTGTTGT TTACAGTAAG AAAATGGCAG CGTGTAATGA TAACTTCCAG ATCTGAAAAT 1501 GTTAAATTCT AGGAGATGGA AAAACAAAGA CCATATAAGA AAGTAATGGA AAAAGTTCTC 1561 TTAAAATTTA TAGCTCTGAA TAAGTTAGAT TTAATTCTGA TTTCTTCTAA CTTAAAAAAG 1621 TTTTGGAATA ATCTTGAGAA GCTGTGTAGT TTTCTCCAGG GCGTTTAATT TAACTGATTT 1681 ATAATTTGAT ACCAATACTC TGGCAGCCCA TATACTATAC AAGATAGGCA AACAAATTTG 1741 TGTCATTCCC CTAAAAGAAA AATCTGCATC AATTATAGCT TACAGTTTAG GAACTCTAAG 1801 TTTAAATTTA TAAAAGTTGT AGATTCTTAT AGTGATTTTG GCTTAATATT TGCTAATTTT 1861 CTCATTTTTG TGTCAGAAAG AAATGCCACA AGAAGCAAAT AGAACTATAA AGTTCAAAAT 1921 GTTAAAGCCA CTAAGAAAAA CAAAGGGGCA TTTAAGAAAA AAGAATACTG TATATGTGGA 1981 ATTAAAGATG TGCTTCCTTA TAAATATATG AATATACATT TTAATCCTTC ATTTAATATT 2041 TCTAGAATTT GATTTACTTA ACACTGAAAT GAACAGTTTG TTAATCTTAT TAAGGTTGCT 2101 CAGCTCTAAG ATTCTATAAT TCTGTACTCT ACTTAATTTT TCTCAAGTTA TGGAAAAACA 2161 ACTTTAATCA GTTCTCTTGA TCGGATTGAA CCTGAACTTC TATAGAAGCA ATCTGAATGT 2221 TCTTGTGCAA AGGCAATGCT ACCGAGTTTT CTTCCCACCC TCAAAATAAA CAAACAAAAC 2281 ATAACTTGGA AAAATAAACA CTTCCTATGG GATTTGACTT TATTTTCTCC ATTGTCTTAC 2341 CTTTTACAGG TGTTAATATA GTGAAAAGGA AGCTTGCAGC TCATGACAAT TTGAAGCTGA 2401 CAATTACACA AGAAGGAAAT AAATTCACAG TCAAAGAATC AAGCGCTTTT CGAAACATTG 2461 AAGTTGTTTT TGAACTTGGT GTCACCTTTA ATTACAACCT AGCAGACGGA ACTGAACTCA 2521 GGGTAAGAAT TTTTTTTTTT ATGAGCAATG CATTCTTGAT TTTTCTACCC AATATTAAAA 2581 TGATTTCTGC TCTATTTCAT TGGATGGTTT AATTAATGCA GGTCTCCTTC ACTAACTGAA 2641 GAAGCCAATG AAGTTTGTCT ACATTATATA TTACACAAAT TGGCAGGGTA TTTAAATATG 2701 CTTTTATTTT TATACGCATC TGTGAAGAAT CTGAATTGAA CAGTAAGAAT TAGAAAACTA 2761 TCTTTTGAAT GACTGAATAT AGACCTATTC ATAAAGAAAT TTAAAACTGT GTTTTTAAAC 2821 AGTACAGCAA AAGAAGCCTT TAGAGTTAAT ATGTAACTTA ACTGTAACAT GTTGAAATAA 2881 TAAAAGAAAT GAATAGATGA ACAAATGAGT GAGTTACCAA ATGGAAAGAT TTGATGTATT 2941 GTAGGTCATT GGGAGTGTAC CTTTTCATGT TTAAGATAAC ACATTTTAGG AAGTCATCAT 3001 TTTCAACAAA TTTTTTAAAA ACTTTTTTTA GCCTCAACAT TTTTCTATTT AAATTACATG 3061 TTTGTAATGA CAATTTAACT ACTGAATGTT TTATCGTAAG TTATGTCTTT CCTTAATTAG 3121 TACCACAATC ACACAAATTA AAACAAGCAC AGGTTATTAA CATCTCCGTG AAACTAATTT 3181 TAACCATGAC TATATTTCTG GACACGTAAC ATGAAAGATT CAGAAAGAAG TGCTGCTCAT 3241 CTGCCTTAAA ATTCAGCGTA TGGAAATTAT TGAAGAGAAC AAGCATAATG GTTATCAACA 3301 CATACTCTGT AGCCCAATGG CCTAGGTTCA ATCCTCACTC TGTGACTTTA GGTGAATCAC 3361 TGTGCCATTT TACAGTCTCC TCTTCTGCAA AGTAGAGATA GTAGTATCAG TTTCATAGGG 3421 TCACCATGAA GATTAAATGA AAAAGTGTGT CTACAGAACT CAGAACAGTG CCTGACATGT 3481 GTAAGACCCT AATAAATGCC ATTATTATTA TTATTATTAT TATTATTATT ATTATTATTA 3541 TGTAGGGGAC CTGGAGCCTT GAGGGAAATA AACTTATTGG AAAATTCAAA CGGACAGACA 3601 ATGGAAACGA ACTGAATACT GTCCGAGAAA TTATAGGTGA TGAACTAGTC CAGGTGAGTT 3661 GTCAAATTTA TAGCTATTTT CAAAAGGCAA AAATTACTAC AAAACAATAA TTTTTGTCAC 3721 TGCTGAGCCA GATCTTCAGT AAACTGACTA CTTCTTTTCT CATAAATCTT ACTGATTTTA 3781 AAAATATTGT ATAGCTATTT TCTGATGCCT ATTTACTAAA GACAACTTAT ATATGTCAAA 3841 TAATCAATGC CTATTTTAAC TGAAAATATA AATGACTACA AACCAACATG TGTTTTAAAA 3901 TGGCTGTATC CCATATCTGT ATAAATCTTG CTATCAAGTA CAAGAAAAAA TTGTATAAAC 3961 TCATACTCAT ATAATATATA TGAATATATA ATATAAAAAT AGTATAAACT CATATAGTAT 4021 AAAACTATAA TACTACTTTT TCTTAACTTA GATGTAAACC TTAAAGATAA ATTCTTCTGT 4081 TTGTTAACAC CTTTCAGACT TATGTGTATG AAGGAGTAGA AGCCAAAAGG ATCTTTAAAA 4141 AGGATTGAGC ATTATTCTTG GCGCACAGTC CAAAATACAA ATTGGACAGA AGATCTATAT 4201 TGTACCAGAA CTGTTTATTT CACCCCATCA AGTATAAGGT TACTGATTGA TTGGTCCTTT 4261 TATAAACATT GGTATATTTC CATTCATGCC AAAGCAAAAG AAGTAAAAGC TAATTAGGAT 4321 TTAATTTGTT TTATATTCTC TAAGATATAT ATTTACTAAA AGAATTTGTG ACATTTTAAA 4381 AAACAAAAAT AAATATTGCA TCCATGTTGC TTTATATGTA GCCTTGCCTT TTAAAAGAAA 4441 AAGTATGTGA ATATGAATTG ACAGATTGTT TTCGTAGAGA GAGGGTCTTA CTCTTTCACT 4501 CAGGCTGGAA TGCAGTGGAG AGATCATAGC TCACTGTAAC CTCAAACTCC TGGACTCATG 4561 CAATCTTCCT GCCTCAGGCT TCTGAGTAGC TAGGACTATG GGTACATTCC ACAGTGCCCA 4621 GCTAATTTTT GTTTTGTTTT CTTTTTATTT TTTTTAGAGA TGGGGTCTTG CTATATTGCC 4681 CAGGCTGGTC TTGAACCCCT GGCCTCAAGC AATCCTCCTG CCTCAGCCTC TCAAGTTGTT 4741 TTTTTCTTTA CATTTGATAA ACTAAAAGCA TAGGCTGCAT ATGAGTCTTT AACATCTTGA 4801 ACTGGTTGTG AATAATTTTC TGGCACTGGT TGTAAGTAAT ATCTATTATT ATAAAAATAA 4861 TATATGCTCA ACCAGAAAAC TTAGAAATAA GAAACACAAA TGTAAAATAA GTATTTCCAT 4921 AACTCATAAT CCAGAGATAA TTGCCATTCT GATTTTGATA GATATCCTCT CAGCTCTCTT 4981 CCCTGGGGGC AGATATTTCC CAATACATAC CACTTTGAAT AGGATGATAG GAAATAAATG 5041 ATGTACTACA TTAAATTAAA TTATTGTATT ACATTTTTGT ACACATCAGT CATTCCCAGG 5101 CTTGGCTGAA AATCAGGATC ATCTGAGAAA CTTAAACAAT TTCTGCATTC TTAATCTCCA 5161 CTGTTATTCT ATTATATCAG AATCGCTAAT AGAACCAAGA ATTC // LOCUS HUMFBRB1 144 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human fibrinogen beta-chain gene, exon 1. ACCESSION J00130 KEYWORDS beta-fibrinogen; fibrin; fibrinogen; glycoprotein. SEGMENT 1 of 3 SOURCE Human DNA (library of T.Maniatis), clone lambda-HI-beta-18K. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 144) AUTHORS Chung,D.W., Que,B.G., Rixon,M.W., Mace,M.Jr. and Davie,E.W. TITLE Characterization of complementary deoxyribonucleic acid and genomic deoxyribonucleic acid for the beta-chain of human fibrinogen JOURNAL Biochemistry 22, 3244-3250 (1983) STANDARD full staff_review COMMENT Three potential translation initiation codons at positions 17-19, 26-29, and 59-61 were identified. The actual start codon was not confirmed. FEATURES from to/span description pept 17 + 130 beta-fibrinogen precursor, exon 1 /nomgen="FGB" /map="4q31" /hgml_locus_uid="LZ0118Z" sigp 17 106 beta-fibrinogen signal peptide matp 107 + 130 beta-fibrinogen pre-msg < 17 > 144 beta-figrinogen mRNA IVS 131 > 144 b-fibrinogen intron A BASE COUNT 46 A 23 C 27 G 48 T ORIGIN Chromosome 4q31. 1 CTCAAGTTAA GTCTACATGA AAAGGATGGT TTCTTGGAGC TTCCACAAAC TTAAAACCAT 61 GAAACATCTA TTATTGCTAC TATTGTGTGT TTTTCTAGTT AAGTCCCAAG GTGTCAACGA 121 CAATGAGGAG GTGAATTTTT TAAA // LOCUS HUMFBRB2 222 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human fibrinogen beta-chain gene, exon 2. ACCESSION J00131 KEYWORDS beta-fibrinogen; fibrin; fibrinogen; glycoprotein. SEGMENT 2 of 3 SOURCE Human DNA (library of T.Maniatis), clone lambda-HI-beta-18K. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 222) AUTHORS Chung,D.W., Que,B.G., Rixon,M.W., Mace,M.Jr. and Davie,E.W. TITLE Characterization of complementary deoxyribonucleic acid and genomic deoxyribonucleic acid for the beta-chain of human fibrinogen JOURNAL Biochemistry 22, 3244-3250 (1983) STANDARD full staff_review FEATURES from to/span description pept + 17 / 208 beta-fibrinogen precursor, exon 2 /nomgen="FGB" /map="4q31" /hgml_locus_uid="LZ0118Z" matp + 17 / 208 beta-fibrinogen pre-msg < 1 > 222 beta-figrinogen mRNA IVS < 1 16 b-fibrinogen intron A IVS 209 > 222 b-fibrinogen intron B BASE COUNT 52 A 63 C 64 G 43 T ORIGIN About 1.3 kb after segment 1; chromosome 4q31. 1 GCCTCTAAGG TTGTAGGGTT TCTTCAGTGC CCGTGGTCAT CGACCCCTTG ACAAGAAGAG 61 AGAAGAGGCT CCCAGCCTGA GGCCTGCCCC ACCGCCCATC AGTGGAGGTG GCTATCGGGC 121 TCGTCCAGCC AAAGCAGCTG CCACTCAAAA GAAAGTAGAA AGAAAAGCCC CTGATGCTGG 181 AGGCTGTCTT CACGCTGACC CAGACTTGGT GGGTGCACTG AT // LOCUS HUMFBRG 10564 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human fibrinogen gamma chain and gamma-prime chain genes. ACCESSION M10014 J00134 J00135 X00086 KEYWORDS alternate splicing; fibrinogen; gamma-fibrinogen. SOURCE Human liver: cDNA to mRNA, library of S.L.C.Woo and T.Chandra, clone pHI-gamma-2 [1], clones pHI-gamma-prime-[1,2,3] [4]; libraries of K.Matteson and D.Stafford [2]; library of D.Woods, clone pGF1 [3]; DNA, library of T.Maniatis [4], clones p-gamma-[2G7,12A9,12F3,12B8,13C8] [5]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1728 to 10271) AUTHORS Chung,D.W., Chan,W.-Y. and Davie,E.W. TITLE Characterization of a complementary deoxyribonucleic acid coding for the gamma chain of human fibrinogen JOURNAL Biochemistry 22, 3250-3256 (1983) STANDARD full staff_review REFERENCE 2 (bases 5914 to 10271) AUTHORS Kant,J.A., Lord,S.T. and Crabtree,G.R. TITLE Partial mRNA sequences for human A-alpha, B-beta, and gamma fibrinogen chains: Evolutionary and functional implications JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 3953-3957 (1983) STANDARD full staff_review REFERENCE 3 (bases 4737 to 4778; 5758 to 5901) AUTHORS Imam,A.M.A., Eaton,M.A.W., Williamson,R. and Humphries,S. TITLE Isolation and characterisation of cDNA clones for the A-alpha- and gamma-chains of human fibrinogen JOURNAL Nucleic Acids Res. 11, 7427-7434 (1983) STANDARD full staff_review REFERENCE 4 (bases 9324 to 10271) AUTHORS Chung,D.W. and Davie,E.W. TITLE Gamma and gamma-prime chains of human fibrinogen are produced by alternative mRNA processing JOURNAL Biochemistry 23, 4232-4236 (1984) STANDARD full staff_review REFERENCE 5 (bases 1 to 10564) AUTHORS Rixon,M.W., Chung,D.W. and Davie,E.W. TITLE Nucleotide sequence of the gene for the gamma chain of human fibrinogen JOURNAL Biochemistry 24, 2077-2086 (1985) STANDARD full staff_review COMMENT The gamma-prime chain in human fibrinogen results from an alternative processing and polyadenylation reaction that occurs in an intervening sequence of the gamma chain. In the mechanism, the alternative site is occasionally selected in place of the predominant site in the tenth exon, leading to termination of the precursor mRNA within the ninth intron, thus giving rise to the gamma-prime chain. Comparisons of the amino acid sequences of the alpha, beta and gamma chains of fibrinogen showed that the three chains are related and probably evolved from a common ancestor. The beta and gamma chains share a significantly higher degree of homology with each other than with the alpha chain. A comparison of the nucleotide sequences of the human and rat gamma chains shows that regions of high homology are localized [2]. Two tandem direct repeats were found at bases 1280-1309 and 1310-1340 in the 5'-end of the sequence, and two single-copy repeats were found in intron H at bases 8324-8468, and in exon 9 at bases 9367-9524. Promoter sequences were found at bases 1691 (TATA box) and 1724 (CAAT box) [4]. [2],[3] also reported the amino acid and nucleotide sequences of the human fibrinogen alpha-A chain and beta-B chains. Draft entry and sequence on diskette kindly provided by D.W.Chung, May 1985 [5]. FEATURES from to/span description pept 1799 1876 fibrinogen gamma chain, exon 1 /nomgen="FGG" /map="4q31" /hgml_locus_uid="LX0035D" 1973 2017 fibrinogen gamma chain, exon 2 2207 2390 fibrinogen gamma chain, exon 3 2510 2603 fibrinogen gamma chain, exon 4 4211 4341 fibrinogen gamma chain, exon 5 4645 4778 fibrinogen gamma chain, exon 6 5758 5942 fibrinogen gamma chain, exon 7 7426 7703 fibrinogen gamma chain, exon 8 9342 9511 fibrinogen gamma chain, exon 9 10054 10068 fibrinogen gamma chain, exon 10 sigp 1799 1876 fibrinogen gamma chain signal peptide matp 1973 2017 fibrinogen gamma chain mature peptide 2207 2390 fibrinogen gamma chain mature peptide 2510 2603 fibrinogen gamma chain mature peptide 4211 4341 fibrinogen gamma chain mature peptide 4645 4778 fibrinogen gamma chain mature peptide 5758 5942 fibrinogen gamma chain mature peptide 7426 7703 fibrinogen gamma chain mature peptide 9342 9511 fibrinogen gamma chain mature peptide 10054 10065 fibrinogen gamma chain mature peptide pept 1799 1876 fibrinogen gamma-prime chain, exon 1 /nomgen="FGG" /map="4q31" /hgml_locus_uid="LX0035D" 1973 2017 fibrinogen gamma-prime chain, exon 2 2207 2390 fibrinogen gamma-prime chain, exon 3 2510 2603 fibrinogen gamma-prime chain, exon 4 4211 4341 fibrinogen gamma-prime chain, exon 5 4645 4778 fibrinogen gamma-prime chain, exon 6 5758 5942 fibrinogen gamma-prime chain, exon 7 7426 7703 fibrinogen gamma-prime chain, exon 8 9342 9574 fibrinogen gamma-prime chain, exon 9 sigp 1799 1876 fibrinogen gamma-prime chain signal peptide matp 1973 2017 fibrinogen gamma-prime chain mature peptide 2207 2390 fibrinogen gamma-prime chain mature peptide 2510 2603 fibrinogen gamma-prime chain mature peptide 4211 4341 fibrinogen gamma-prime chain mature peptide 4645 4778 fibrinogen gamma-prime chain mature peptide 5758 5942 fibrinogen gamma-prime chain mature peptide 7426 7703 fibrinogen gamma-prime chain mature peptide 9342 9571 fibrinogen gamma-prime chain mature peptide pre-msg 1748 9825 g-p mRNA pre-msg 1748 10271 g mRNA IVS 1877 1972 g, g-p signal pept intron A IVS 2018 2206 g, g-p intron B IVS 2391 2509 g, g-p intron C IVS 2604 4210 g, g-p intron D IVS 4342 4644 g, g-p intron E IVS 4779 5757 g, g-p intron F IVS 5943 7425 g, g-p intron G IVS 7704 9341 g, g-p intron H IVS 9512 10053 g intron I variant 2543 2543 t in DNA [5]; a in mRNA [1] variant 4718 4718 g in DNA [5]; a in mRNA [1] conflict 5817 5817 a in [1],[5]; c in [3] conflict 5823 5823 g in [1],[5]; c in [3] variant 9346 9346 c in DNA [5], mRNA [2]; t in mRNA [1],[4] variant 9424 9424 c in [1],[2],[5]; a in [4] variant 9436 9436 c in [1],[2],[5]; a in [4] conflict 9494 9494 c in [1],[4],[5]; t in [2] variant 9496 9496 g in DNA [4],[5], mRNA [2]; a in mRNA [1] variant 9691 9691 c in [5]; g in [4] variant 10138 10138 t in [1],[2],[5]; g in [4] revision 10155 10156 gc in [4],[5]; gctggc in [1] conflict 10239 10239 t in [1],[4],[5]; c in [2] conflict 10242 10243 at in [1],[4],[5]; att in [2] conflict 10271 10271 t in [1],[4],[5]; c in [2] BASE COUNT 3337 A 1936 C 1991 G 3300 T ORIGIN 299 bp upstream of HpaI site; chromosome 4q31. 1 CTACACACTT CTTGAAGGCA AAGGCAATGC TGAAGTCACC TTTCATGTTC AAATCATATT 61 AAAAAGTTAG CAAGATGTAA TTATCAGTGT ACTATGTAAA TCTTTGTGAA TGATCAATAA 121 TTACATATTT TCATTATATA TATTTTAGTA GATAATATTT ATATACATTC AACATTCTAA 181 ATATAGAAAG TTTACAGAGA AAAATAAAGC CTTTTTTTCC AATCCTGTCC TCCACCTCTG 241 CATCCCATTC TTCTTCACAG AGGCAACTGA TTCAAGTCAT TACATAGTTA TTGAGTGTTA 301 ACTACAACTA TGTTAAGTAC AGCTATATAT GTTAGATGCC GTAGCCACAG AAATCAGTTT 361 ACAATCTAAT GCAGTGGATA CAGCATGTAT ACATATAATA TAAGGTTGCT ACAAATGCTA 421 TCTGAGGTAG AGCTGTTTGA AAGAATACTA ATACTTAAAT GTTTAATTCA ACTGACTTGA 481 TTGACAACTG ATTAGCTGAG TGGAAAAGAT GGATGAGAAA GATTGTGAGA CTTAATTGGC 541 TGGTGGTATG GTGATATGAT TGACAATAAC TGCTAAGTCA GAGAGGGATA TATTAAGGAG 601 GAGAAGAAAA GCAACAAATC TGGTTTTGAT GTGTTCACTT TGTTATAATT ATTGATTATT 661 TACTGAATAT GAATATTTAT CTTTGTTTTT GAGTCAATAA ATATACCTTT GTAAAGACAG 721 AATTAAAGTA TTAGTATTTC TTTCAAACTG GAGGCATTTC TCCCACTAAC ATATTTCATC 781 AAAACTTATA ATAAGCTTGG TTCCAGAGGA AGAAATGAGG GATAACCAAA AATAGAGACA 841 TTAATAATAG TGTAACGCCC AGTGATAAAT CTCAATAGGC AGTGATGACA GACATGTTTT 901 CCCAAACACA AGGATGCTGT AAGGGCCAAA CAGAAATGAT GGCCCCTCCC CAGCACCTCA 961 TTTTGCCCCT TCCTTCAGCT ATGCCTCTAC TCTCCTTTAG ATACAAGGGA GGTGGATTTT 1021 TCTCTTCTCT GAGATAGCTT GATGGAACCA CAGGAACAAT GAAGTGGGCT CCTGGCTCTT 1081 TTCTCTGTGG CAGATGGGGT GCCATGCCCA CCTTCAGACA AAGGGAAGAT TGAGCTCAAA 1141 AGCTCCCTGA GAAGTGAGAG CCTATGAACA TGGTTGACAC AGAGGGACAG GAATGTATTT 1201 CCAGGGTCAT TCATTCCTGG GAATAGTGAA CTGGGACATG GGGGAAGTCA GTCTCCTCCT 1261 GCCACAGCCA CAGATTAAAA ATAATAATGT TAACTGATCC CTAGGCTAAA ATAATAGTGT 1321 TAACTGATCC CTAAGCTAAG AAAGTTCTTT TGGTAATTCA GGTGATGGCA GCAGGACCCA 1381 TCTTAAGGAT AGACTAGGTT TGCTTAGTTC GAGGTCATAT CTGTTTGCTC TCAGCCATGT 1441 ACTGGAAGAA GTTGCATCAC ACAGCCTCCA GGACTGCCCT CCTCCTCACA GCAATGGATA 1501 ATGCTTCACT AGCCTTTGCA GATAATTTTG GATCAGAGAA AAAACCTTGA GCTGGGCCAA 1561 AAAGGAGGAG CTTCAACCTG TGTGCAAAAT CTGGGAACCT GACAGTATAG GTTGGGGGCC 1621 AGGATGAGGA AAAAGGAACG GGAAAGACCT GCCCACCCTT CTGGTAAGGA GGCCCCGTGA 1681 TCAGCTCCAG CCATTTGCAG TCCTGGCTAT CCCAGGAGCT TACATAAAGG GACAATTGGA 1741 GCCTGAGAGG TGACAGTGCT GACACTACAA GGCTCGGAGC TCCGGGCACT CAGACATCAT 1801 GAGTTGGTCC TTGCACCCCC GGAATTTAAT TCTCTACTTC TATGCTCTTT TATTTCTCTC 1861 TTCAACATGT GTAGCAGTAA GTGTGCTCTT CACAAAACGT TGTTTAAAAT GGAAAGCTGG 1921 AAAATAAAAC AGATAATAAA CTAGTGAAAT TTTCGTATTT TTTCTCTTTT AGTATGTTGC 1981 TACCAGAGAC AACTGCTGCA TCTTAGATGA AAGATTCGTA AGTAGTTTTT ATGTTTCTCC 2041 CTTTGTGTGT GAACTGGAGA GGGGCAGAGG AATAGAAATA ATTCCCTCAT AAATATCATC 2101 TGGCACTTGT AACTTTTTAA AAACATAGTC TAGGTTTTAC CTATTTTTCT TAATAGATTT 2161 TAAGAGTAGC ATCTGTCTAC ATTTTTAATC ACTGTTATAT TTTCAGGGTA GTTATTGTCC 2221 AACTACCTGT GGCATTGCAG ATTTCCTGTC TACTTATCAA ACCAAAGTAG ACAAGGATCT 2281 ACAGTCTTTG GAAGACATCT TACATCAAGT TGAAAACAAA ACATCAGAAG TCAAACAGCT 2341 GATAAAAGCA ATCCAACTCA CTTATAATCC TGATGAATCA TCAAAACCAA GTGAGAAAAT 2401 AAAGACTACT GACCAAAAAA TAATAATAAT AATCTGTGAA GTTCTTTTGC TGTTGTTTTA 2461 GTTGTTCTAT TTGCTTAAGG ATTTTTATGT CTCTGATCCT ATATTACAGA TATGATAGAC 2521 GCTGCTACTT TGAAGTCCAG GATAATGTTA GAAGAAATTA TGAAATATGA AGCATCGATT 2581 TTAACACATG ACTCAAGTAT TCGGTAAGGA TTTTTGTTTT AATTTGCTCT GCAAGACTGA 2641 TTTAGTTTTT ATTTAATATT CTATACTTGA GTGAAAGTAA TTTTTAATGT GTTTTCCCCA 2701 TTTATAATAT CCCAGTGACA TTATGCCTGA TTATGTTGAG CATAGTAGAG ATAGAAGTTT 2761 TTAGTGCAAT ATAAATTATA CTGGGTTATA ATTGCTTATT AATAATCACA TTGAAGAAAG 2821 ATGTTCTAGA TGTCTTCAAA TGCTAGTTTG ACCATATTTA TCAAAAATTT TTTCCCCATC 2881 CCCCATTTAT CTTACAACAT AAAATCAATC TCATAGGAAT TTGGGTGTTG AAAATAAAAT 2941 CCTCTTTATA AAAATGCTGA CAAATTGGTG GTTAAAAAAA TTAGCAAGCA GAGGCATAGT 3001 AAGGATTTTG GCTCCTAAAG TAAATTATAT TGAATGTGGA GCAGGAAGAA ACATGTCTTG 3061 AGAGACTAAG TGTGGCAAAT ATTGCAAAGC TCATATTGAT CATTGCAGAA TGAACCTGCA 3121 TAGTCTCTTC CCTTCATTTG GAAGTGAATG TCTCTGTTAA AGCTTCTCAG GGACTCATAA 3181 ACTTTCTGAA CATAAGGTCT CAGATACAGT TTTAATATTT TTCCCCAATT TTTTTTTCTG 3241 AATTTTTCTC AAAGCAGCTT GAGAAATTGA GATAAATAGT AGCTAGGGAG AAGTGGCCCA 3301 GGAAAGATTT CTCCTCTTTT TGCTATCAGA GGGCCCTTGT TATTATTGTT ATTATTATTA 3361 CTTGCATTAT TATTGTCCAT CATTGAAGTT GAAGGAGGTT ATTGTACAGA AATTGCCTAA 3421 GACAAGGTAG AGGGAAAACG TGGACAAATA GTTTGTCTAC CCTTTTTTAC TTCAAAGAAA 3481 GAACGGTTTA TGCATTGTAG ACAGTTTTCT ATCATTTTTG GATATTTGCA AGCCACCCTG 3541 TAAGTAACTA CAAAAGGAGG GTTTTTACTT CCCCCAGTCC ATTCCCAAAG CTATGTAACC 3601 AGAAGCATTA AAGAAGAAAG GGGAAGTATC TGTTGTTTTA TTTTACATAC AATAACGTTC 3661 CAGATCATGT CCCTGTGTAA GTTATATTTT AGATTGAAGC TTATATGTAT AGCCTCAGTA 3721 GATCCACAAG TGAAAGGTAT ACTCCTTCAG CACATGTGAA TTACTGAACT GAGCTTTTCC 3781 TGCTTCTAAA GCATCAGGGG GTGTTCCTAT TAACCAGTCT CGCCACTCTT GCAGGTTGCT 3841 ATCTGCTGTC CCTTATGCAT AAAGTAAAAA GCAAAATGTC AATGACATTT GCTTATTGAC 3901 AAGGACTTTG TTATTTGTGT TGGGAGTTGA GACAATATGC CCCATTCTAA GTAAAAAGAT 3961 TCAGGTCCAC ATTGTATTCC TGTTTTAATT GATTTTTTGA TTTGTTTTTC TTTTTCAAAA 4021 AGTTTATAAT TTTAATTCAT GTTAATTTAG TAATATAATT TTACATTTTC CTCAAGAATG 4081 GAATAATTTA TCAGAAAGCA CTTCTTAAGA AAATACTTAG CAGTTTCCAA AGAAAATATA 4141 AAATTACTCT TCTGAAAGGA ATACTTATTT TTGTCTTCTT ATTTTTGTTA TCTTATGTTT 4201 CTGTTTGTAG ATATTTGCAG GAAATATATA ATTCAAATAA TCAAAAGATT GTTAACCTGA 4261 AAGAGAAGGT AGCCCAGCTT GAAGCACAGT GCCAGGAACC TTGCAAAGAC ACGGTGCAAA 4321 TCCATGATAT CACTGGGAAA GGTAACTGAT GAAGGTTATA TTGGGATTAG GTTCATCAAA 4381 GTAAGTAATG TAAAGGAGAA AGTATGTACT GGAAAGTATA GGAATAGTTT AGAAAGTGGC 4441 TACCCATTAA GTCTAAGAAT TTCAGTTGTC TAGACCTTTC TTGAATAGCT AAAAAAAACA 4501 GTTTAAAAGG AATGCTGATG TGAAAAGTAA GAAAATTATT CTTGGAAAAT GAATAGTTTA 4561 CTACATGTTA AAAGCTATTT TTCAAGGCTG GCACAGTCTT ACCTGCATTT CAAACCACAG 4621 TAAAAGTCGA TTCTCCTTCT CTAGATTGTC AAGACATTGC CAATAAGGGA GCTAAACAGA 4681 GCGGGCTTTA CTTTATTAAA CCTCTGAAAG CTAACCAGCA ATTCTTAGTC TACTGTGAAA 4741 TCGATGGGTC TGGAAATGGA TGGACTGTGT TTCAGAAGGT AATTTTTTCC CCACCATGTG 4801 TATTTAATAA ATTCCTACAT TGTTTCTGCC ATATGGCAGA TACTTTTCTA AGCACCTTGT 4861 GAACCGTAGC TCATTTAATC CTTGCAATAG CCCTAAGAGG AAGGTACTTC TGTTACTCCT 4921 ATTTACAGAA AAGGAAACTG AGGCACACAA GGTTAAATAA CTTGCCCAAG ACCACATAAC 4981 TAATAAGCAA CAGAGTCAGC ATTTGAACCT AGGCAGTATA GTTTCAGAGT TTGTGACTTG 5041 ACTCTATATT GTACTGGCAC TGACTTTGTA GATTCATGGT GGCACATAAT CATAGTACCA 5101 CAGTGACAAA TAAAAAGAAG GAAACTCTTT TGTCAGGTAG GTCAAGACCT GAGGTTTCCC 5161 ATCACAAGAT GAGGAAGCCC AACACCACCC CCCACCACCC CACCACCATC ACCACCCTTT 5221 CACACACCAG AGGATACACT TGGGCTGCTC CAAGACAAGG AACCTGTGTT GCATCTGCCA 5281 CTTGCTGATA CCCACTAGGA ATCTTGGCTC CTTTACTTTC TGTTTACCTC CCACCACTGT 5341 TATAACTGTT TCTACAGGGG GCGCTCAGAG GGAATGAATG GTGGAAGCAT TAGTTGCCAG 5401 ACACCGATTG AGCAATGGGT TCCATCATAA GTGTAAGAAT CAGTAATATC CAGCTAGAGT 5461 TCTGAAGTCG TCTAGGTGTC TTTTTAATAT TACCACTCAT TTAGAATTTA TGATGTGCCA 5521 GAAACCCTCT TAAGTATTTC TCTTATATTC TCTCTCATGA TCCTTGCAGC AACCCTAAGA 5581 AGTAACCATC ATTTTTCCTA TTTGATACAT GAGGAAACTG AGGTAGCTTG GCCAAGATCA 5641 CTTAGTTGGG AGTTGATAGA ACCAGTGCTC TGTATTTTTG ACAAAATGTT GACAGCATTC 5701 TCTTTACATG CATTGATAGT CTATTTTCTC CTTTTGCTCT TGCAAATGTG TAATTAGAGA 5761 CTTGATGGCA GTGTAGATTT CAAGAAAAAC TGGATTCAAT ATAAAGAAGG ATTTGGACAT 5821 CTGTCTCCTA CTGGCACAAC AGAATTTTGG CTGGGAAATG AGAAGATTCA TTTGATAAGC 5881 ACACAGTCTG CCATCCCATA TGCATTAAGA GTGGAACTGG AAGACTGGAA TGGCAGAACC 5941 AGGTACTGTT TTGAAATGAC TTCCAACTTT TTATTGTAAA GATTGCCTGG AATGTGCACT 6001 TTCCAACTAT CAATAGACAA TGGCAAATGC AGCCTGACAA ATGCAAACAG CACATCCAGC 6061 CACCATTTTC TCCAGGAGTC TGTTTGGTTC TTGGGCAATC CAAAAAGGTA AATTCTATTC 6121 AGGATGAATC TAAGTGTATT GGTACAATCT AATTACCCTG GAACCATTCA GAGTAATAGC 6181 TAATTACTGA ACTTTTAATC AGTCCCAGGA ATTGAGCATA AAATTATAAT TTTATCTAGT 6241 CTAAATTACT ATTTCATGAA GCAGGTATTA TTATTAATCC CATTTTATAG ATTAACTTGC 6301 TCAAAGTCAC ATTGCTGATA AGTGGTAGAG GTAGAATTCA GACTCAAGTA GTTTAACTTT 6361 AGAGCCTGTC CTCTTAACAA CTATCCTGGT TGAAAAGCAA ATACAGCCTC TTCAGACTTC 6421 TCAGTGCCTT GATGGCCATT TATTCTGTCA AATCATGAGC TACCCTAAAA GTAAACCAGC 6481 TAGCTCTTTT GATGATCTAG AGGCTTCTTT TTGCTTGAGA TATTTGAAGG TTTTAAGCAT 6541 TGTTACCTAA TTAAAATGCA GAAAAATATC CAACCCTCTT GTTATGTTTA AGGAATAGTG 6601 AAATATATTG TCTTCAAACA CATGGACTTT TTTTTATTGC TTGGTTGGTT TTTAATCCAG 6661 AAAGTGCTAT AGTCAGTAGA CCTTCTTCTA GGAAAGGACC TTCCATTTCC CAGCCACTGG 6721 AGATTAGAAA ATAAGCTAAA TATTTTCTGG AAATTTCTGT TCATTCATTA AGGCCCATCC 6781 TTTCCCCCAC TCTATAGAAG TGTTGTCCAC TTGCACAATT TTTTCCAGGA AAGAATCTCT 6841 CTAACTCCTT CAGCTCACAT GCTTTGGACC ACACAGGGAA GACTTTGATT GTGTAATGCC 6901 CTCAGAAGCT CTCCTTCTTG CCACTACCAC ACTGATTTGA GGAAGAAAAT CCCTTTAGCA 6961 CCTAACCCTT CAGGTGCTAT GAGTGGCTAA TGGAACTGTA CCTCCTTCAA GTTTTGTGCA 7021 ATAATTAAGG GTCACTCACT GTCAGATACT TTCTGTGATC TATGATAATG TGTGTGCAAC 7081 ACATAACATT TCAATAAAAG TAGAAAATAT GAAATTAGAG TCATCTACAC ATCTGGATTT 7141 GATCTTAGAA TGAAACAAGC AAAAAAGCAT CCAAGTGAGT GCAATTATTA GTTTTCAGAG 7201 ATGCTTCAAA GGCTTCTAGG CCCATCCCGG GAAGTGTTAA TGAGCTGTGG ACTGGTTCAC 7261 ATATCTATTG CCTCTTGCCA GATTTGCAAA AAACTTCACT CAATGAGCAA ATTTCAGCCT 7321 TAAGAAACAA AGTCAAAAAT TCCAAGGAAG CATCCTACGA AAGAGGGAAC TTCTGAGATC 7381 CCTGAGGAGG GTCAGCATGT GATGGTTGTA TTTCCTTCTT CTCAGTACTG CAGACTATGC 7441 CATGTTCAAG GTGGGACCTG AAGCTGACAA GTACCGCCTA ACATATGCCT ACTTCGCTGG 7501 TGGGGATGCT GGAGATGCCT TTGATGGCTT TGATTTTGGC GATGATCCTA GTGACAAGTT 7561 TTTCACATCC CATAATGGCA TGCAGTTCAG TACCTGGGAC AATGACAATG ATAAGTTTGA 7621 AGGCAACTGT GCTGAACAGG ATGGATCTGG TTGGTGGATG AACAAGTGTC ACGCTGGCCA 7681 TCTCAATGGA GTTTATTACC AAGGTATGTT TTCCTTTCTT AGATTCCAAG TTAATGTATA 7741 GTGTATACTA TTTTCATAAA AAATAATAAA TAGATATGAA GAAATGAAGA ATAATTTATA 7801 AAGATAGTAG GGATTTTATC ATGTTCTTTA TTTCAACTAA GTTCTTTGAA ACTGGAAGTG 7861 GATAATACCA AGTTCATGCC TAAAATTAGC CCTTCTAAAG AAATCCACCT GCTGCAAAAT 7921 ATCCAGTAGT TTGGCATTAT ATGTGAAACT ATCACCATCA TAGCTGGCAC TGTGGGTTGT 7981 GGGATCTCCT TTAGACATAC AACATAAATG ATCTGGATGG ATTAACATTA CTACATGGAT 8041 GCTTGTTGAC ACATTAACCT GGCTTCCCAT GAGCTTTGTG TCAGATACAC GCAGTGAACA 8101 GGTGTTTGGA GGAACAGAAT AAAGAGAAGG CAAGCACTGG TAAGGGCAGG GGTTTGTGAA 8161 AGCTTGAGAG AAGAGACCAG TCTGAGGACA GTAGACACTT ATTTTAGGAT GGGGGTTGGA 8221 TGAGGAGGCT ATAGTTTGCT ATAAGCTTGG AATGGTTTGG AACACTGGTT TCACTCACCT 8281 ACCCAGCAGT TATGTGTGGG GAAGCCTTAC CGATGCTAAA GGATCCATGT TACAATAATG 8341 GCATTATTTG GAAATCCCAG TGGTATTCCA TGAATAAAAC CACTATGAAG ATAATCCCAC 8401 TCAACAGACT CTCCGTTGGA GAAGGACAGC AACACCACCC TGGGAAAGCC AAACAGTCAG 8461 ACCAGACCTG TTTAGCATCA GTAGGACTTC CCTACCATAT CTGCTGGGTA GATGAGTGAA 8521 ACCAGTGTTC CAAACCACTC CGGGCTTGTA GCAAACCATA GTCTCCTCAT CTACCAAGAT 8581 GAGCAACCTT ACCTCCTGAT GTCCTAGCCA ATCACCAACT AGGAAACTTT GCACAGTTTA 8641 TTTAAAGTAA CAGTTTGATT TTCACAATAT TTTTAAATTG GAGAAACATA ACTTATCTTT 8701 GCACTCACAA ACCACATAAT GAGAAGAAAC TCTAAGGGAA AATGCTTGAT CTGTGTGACC 8761 CGGGGCGCCA TGCCAGAGCT GTAGTTCATG CCAGTGTTGT GCTCTGACAA GCCTTTTACA 8821 GAATTACATG AGATCTGCTT CCCTAGGACA AGGAGAAGGC AAATCAACAG AGGCTGCACT 8881 TTAAAATGGA GACATAAAAT AACATGCCAG AACCATTTCC TAAAGCTCCT CAATCAACCA 8941 ACAAAATTGT GCTTTCAAAT AACCTGAGTT GACCTCATCA GGAATTTTGT GGCTCCTTCT 9001 CTTCTAACCT GCCTGAAGAA AGATGGTCCA CAGCAGCTGA GTCCGGGATG GATAAGCTTA 9061 GGGACAGAGG CCAATTAGGG AACTTTGGGT TTCTAGCCCT ACTAGTAGTG AATAAATTTA 9121 AAGTGTGGAT GTGACTATGA GTCACAGCAC AGATGTTGTT TAATAATATG TTTATTTTAT 9181 AAATTGATAT TTTAGGAATC TTTGGAGATA TTTTCAGTTA GCAGATAATA CTATAAATTT 9241 TATGTAACTG GCAATGCACT TCGTAATAGA CAGCTCTTCA TAGACTTGCA GAGGTAAAAA 9301 GATTCCAGAA TAATGATATG TACATCTACG ACTTGTTTTA GGTGGCACTT ACTCAAAAGC 9361 ATCTACTCCT AATGGTTATG ATAATGGCAT TATTTGGGCC ACTTGGAAAA CCCGGTGGTA 9421 TTCCATGAAG AAAACCACTA TGAAGATAAT CCCATTCAAC AGACTCACAA TTGGAGAAGG 9481 ACAGCAACAC CACCTGGGGG GAGCCAAACA GGTCAGACCA GAGCACCCTG CGGAAACAGA 9541 ATATGACTCA CTTTACCCTG AGGATGATTT GTAGAAAATT AACTGCTAAC TTCTATTGAC 9601 CCACAAAGTT TCAGAAATTC TCTGAAAGTT TCTTCCTTTT TTCTCTTACT ATATTTATTG 9661 ATTTCAAGTC TTCTATTAAG GACATTTAGC CTTCAATGGA AATTAAAACT CATTTAGGAC 9721 TGTATTTCCA AATTACTGAT ATCAGAGTTA TTTAAAAATT GTTTATTTGA GGAGATAACA 9781 TTTCAACTTT GTTCCTAAAT ATATAATAAT AAAATGATTG ACTTTATTTG CATTTTTATG 9841 ACCACTTGTC ATTTATTTTG TCTTCGTAAA TTATTTTCAT TATATCAAAT ATTTTAGTAT 9901 GTACTTAATA AAATAGGAGA ACATTTTAGA GTTTCAAATT CCCAGGTATT TTCCTTGTTT 9961 ATTACCCCTA AATCATTCCT ATTTAATTCT TCTTTTTAAA TGGAGAAAAT TATGTCTTTT 10021 TAATATGGTT TTTGTTTTGT TATATATTCA CAGGCTGGAG ACGTTTAAAA GACCGTTTCA 10081 AAAGAGATTT ACTTTTTTAA AGGACTTTAT CTGAACAGAG AGATATAATA TTTTTCCTAT 10141 TGGACAATGG ACTTGCAAAG CTTCACTTCA TTTTAAGAGC AAAAGACCCC ATGTTGAAAA 10201 CTCCATAACA GTTTTATGCT GATGATAATT TATCTACATG CATTTCAATA AACCTTTTGT 10261 TTCCTAAGAC TAGATACATG GTACCTTTAT TGACCATTAA AAAACCACCA CTTTTTGCCA 10321 ATTTACCAAT TACAATTGGG CAACCATCAG TAGTAATTGA GTCCTCATTT TATGCTAAAT 10381 GTTATGCCTA ACTCTTTGGG AGTTACAAAG GAAATAGCAA TTATGGCTTT TGCCCTCTAG 10441 GAGATACAGG ACAAATACAG GAAAATACAG CAACCCAAAC TGACAATACT CTATACAAGA 10501 ACATAATCAC TAAGCAGGAG TCACAGCCAC ACAACCAAGA TGCATAGTAT CCAAAGTGCA 10561 GCTG // LOCUS HUMFBRGAB 3383 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human gamma fibrinogen gene, exons 6, 7 and 8, and flanks. ACCESSION K02569 KEYWORDS fibrin; fibrinogen. SOURCE Human DNA (genomic library of S.Orbin and T.Maniatis), clones pK68, pEB1, pA11F, pA15, pK14, and pK20. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 985 to 1350; 2128 to 2303) AUTHORS Fornace,A.J.Jr., Cummings,D.E., Comeau,C.M., Kant,J.A. and Crabtree,G.R. TITLE Single-copy inverted repeats associated with regional genetic duplications in gamma fibrinogen and immunoglobulin genes JOURNAL Science 224, 161-164 (1984) STANDARD full staff_review REFERENCE 2 (bases 1 to 3383) AUTHORS Fornace,A.J.Jr., Cummings,D.E., Comeau,C.M., Kant,J.A. and Crabtree,G.R. TITLE Structure of the human gamma-fibrinogen gene: Alternate mRNA splicing near the 3' end of the gene produces gamma-A and gamma-B forms of gamma-fibrinogen JOURNAL J. Biol. Chem. 259, 12826-12830 (1984) STANDARD full staff_review COMMENT The human gamma fibrinogen gene is comprised of eight exons and codes for two forms of gamma fibrinogen, type A and type B, by differential splicing. Type A is the more common form; it is expressed by utilizing a splice donor site within exon 7 to splice on exon 8. Type B is expressed when the splice site within exon 7 is passed over so that the coding region continues on to a stop codon that is within intron G of the type A message. A region within intron F (positions 1097-1239) shows striking homology to part of exon 7. This region is flanked by 100 bp inverted terminal repeats. This duplication probably occurred about 20 million years ago [1]. FEATURES from to/span description pept / 201 478 gamma fibrinogen type A, exon 6 (AA at 202) /nomgen="FGG" /map="4q31" /hgml_locus_uid="LX0035D" 2112 2281 gamma fibrinogen type A, exon 7 2822 2836 gamma fibrinogen type A, exon 8 pept / 201 478 gamma fibrinogen type B, exon 6 (AA at 202) /nomgen="FGG" /map="4q31" /hgml_locus_uid="LX0035D" 2112 2344 gamma fibrinogen type B, exon 7 pre-msg < 1 2595 fbrGb mRNA and intron pre-msg < 1 > 2836 fbrGa mRNA and intron IVS < 1 200 fbrGa, fbrGb intron E IVS 479 2111 fbrGa, fbrGb intron F IVS 2282 2821 fbrGa intron G revision 2137 2138 tc in [2]; ttc in [1] BASE COUNT 1075 A 640 C 649 G 1019 T ORIGIN 3 bp upstream of SmaI site; chromosome 4q31. 1 CCCGGGAAGT GTTAATGAGC TGTGGACTGG TTCACATATC TATTGCCTCT TGCCAGATTT 61 GCAAAAAACT TCACTCAATG AGCAAATTTC ATCCTTAAGA AACAAAGTCA AAAATTCCAA 121 GGAAGCATCC TACGAAAGAG GGAACTTCTG AGATCCCTGA GGAGGGTCAG CATGTGATGG 181 TTGTATTTCC TTCTTCTCAG TACTGCAGAC TATGCCATGT TCAAGGTGGG ACCTGAAGCT 241 GACAAGTACC GCCTAACATA TGCCTACTTC GCTGGTGGGG ATGCTGGAGA TGCCTTTGAT 301 GGCTTTGATT TTGGCGATGA TCCTAGTGAC AAGTTTTTCA CATCCCATAA TGGCATGCAG 361 TTCAGTACCT GGGACAATGA CAATGATAAG TTTGAAGGCA ACTGTGCTGA ACAGGATGGA 421 TCTGGTTGGT GGATGAACAA GTGTCACGCT GGCCATCTCA ATGGAGTTTA TTACCAAGGT 481 ATGTTTTCCT TTCTTAGATT CCAAGTTAAT GTATAGTGTA TACTATTTTC ATAAAAATAA 541 TAAATAGATA TGAAGAAATG AAGAATAATT TATAAAGATA GTAGGATTTT ATCATGTTCT 601 TTATTTCAAC TAAGTTCTTT GAAACTGGAA GTGGATAATA CCAAGTTCAT GCCTAAAATT 661 AGCCCTTCTA AAGAAATCCA CCTGCTGCAA AATATCCAGT AGTTTGGCAT TATATGTGAA 721 ACTATCACCA TCATAGCTGG CACTGTGGGT TGTGGGATCT CCTTTAGACA TACAACATAA 781 ATGATCTGGA TGGATTAACA TTACTACATG GATGCTTGTT GACACATTAA CCTGGCTTCC 841 CATGAGCTTT GTGTCAGATA CACGCAGTGA ACAGGTGTTT GGAGGAACAG AATAAAGAGA 901 AGGCAAGCAC TGGTAAGGGC AGGGGTTTGT GAAAGCTTGA GAGAAGAGAC CAGTCTGAGG 961 ACAGTAGACA CTTATTTTAG GATGGGGGTT GGATGAGGAG GCTATAGTTT GCTATAAGCT 1021 TGGAATGGTT TGGAACACTG GTTTCACTCA CCTACCCAGC AGTTATGTGT GGGGAAGCCT 1081 TACCGATGCT AAAGGATCCA TGTTACAATA ATGGCATTAT TTGGAAATCC CAGTGGTATT 1141 CCATGAATAA AACCACTATG AAGATAATCC CACTCAACAG ACTCTCCGTT GGAGAAGGAC 1201 AGCAACACCA CCCTGGGAAA GCCAAACAGT CAGACCAGAC CTGTTTAGCA TCAGTAGGAC 1261 TTCCCTACCA TATCTGCTGG GTAGATGAGT GAAACCATGT TCCAAACCAC TCCGGGCTTG 1321 TAGCAAACCA TAGTCTCCTC ATCTACCAAG ATGAGCAACC TTACCTCCTG ATGTCCTAGC 1381 CAATCACCAA CTAGGAAACT TTGCACAGTT TATTTAAAGT AACAGTTTGA TTTTCACAAT 1441 ATTTTTAAAT TGGAGAAACA TAACTTATCT TTGCACTCAC AAACCACATA ATGAGAAGAA 1501 ACTCTAAGGG AAAATGCTTG ATCTGTGTGA CCCGGGGCGC CATGCCAGAG CTGTAGTTCA 1561 TGCCAGTGTT GTGCTCTGAC AAGCCTTTTA CAGAATTACA TGAGATCTGC TTCCCTAGGA 1621 CAAGGAGAAG GCAAATCAAC AGAGGCTGCA CTTTAAAATG GAGACATAAA ATAACATGCC 1681 AGAACCATTT CCTAAAGCTC CTCAATCAAC CAACAAAATT GTGCTTTCAA ATAACCTGAG 1741 TTGACCTCAT CAGGAATTTT GTGGCTCCTT CTCTTCTAAC CTGCCTGAAG AAAGATGGTC 1801 CACAGCAGCT GAGTCCGGGA TGGACAAGCT TAGGGACAAG GCCAATTAGG GAACTTTGGG 1861 TTTCTAGCCC TACTAGTAGT GAATAAATTT AAAGTGTGGA TGTGACTATG AGTCACAGCA 1921 CAGATGTTGT TTAATAATAT GTTTATTTTA TAAATTGATA TTTTAGGAAT CTTTGGAGAT 1981 ATTTTCAGTT AGCAGATAAT ACTATAAATT TTATGTAACT GGCAATGCAC TTCTAATAGA 2041 CAGCTCTTCA TAGACTTGCA GAGGTAAAAA GATTCCAGAA TAATGATATG TACATCTACG 2101 ACTTGTTTTA GGTGGCACTT ACTCAAAAGC ATCTACTCCT AATGGTTATG ATAATGGCAT 2161 TATTTGGGCC ACTTGGAAAA CCCGGTGGTA TTCCATGAAG AAAACCACTA TGAAGATAAT 2221 CCCATTCAAC AGACTCACAA TTGGAGAAGG ACAGCAACAC CACCTGGGGG GAGCCAAACA 2281 GGTCAGACCA GAGCACCCTG CGGAAACAGA ATATGACTCA CTTTACCCTG AGGATGATTT 2341 GTAGAAAATT AACTGCTAAC TTCTATTGAC CCACAAAGTT TCAGAAATTC TCTGAAAGTT 2401 TCTTCCCTTT TTCTCTTACT ATATTTATTG ATTTCAAGTC TTCTATTAAG GACATTTAGC 2461 CTTCAATGGA AATTAAAACT CATTTAGGAC TGTATTTCCA AATTACTGAT ATCAGAGTTA 2521 TTTAAAAATT GTTTATTTGA GGAGATAACA TTTCAACTTT GTTCCTAAAT ATATAATAAT 2581 AAAATGATTG ACTTTATTTG CATTTTTATG ACCACTTGTC ATTTATTTTG TCTTCGTAAA 2641 TTATTTTCAT TATATCAAAT ATTTTAGTAT GTACTTAATA AAATAGGAAA ACTTTTAGAG 2701 TTTCAAATTC CCAGGTATTT TCCTGTTTAT TACCCCTAAA TCATTCCTAT TTAATTCTTC 2761 TTTTTAAATG GAGAAAATTA TGTCTTTTTA ATATGGTTTT TGTTTTGTTA TATATTCACA 2821 GGCTGGAGAC GTTTAAAAGA CCGTTTCAAA AGAGATTTAC TTTTTTAAAG GACTTTATCT 2881 GAACAGAGAG ATATAATATT TTTCCTATTG GACAATGGAC TTGCAAAGCT TCACTTCATT 2941 TTAAGAGCAA AAGACCCCAT GTTGAAAACT CCATAACAGT TTTATGCTGA TGATAATTTA 3001 TCTACATGCA TTTCAATAAA CCTTTTGTTT CCTAAGACTA GATACATGGT ACCTTTATTG 3061 ACCATTAAAA ACCACCACTT TTTGCCAATT TACCAATTAC AATTGGGCAA CCATCAGTAG 3121 TAATTGAGTC CTCATTTTAT GCTAAATGTT ATGCCTAACT CTTTGGGAGT TACAAAGGAA 3181 ATAGCAATTA TGGCTTTTGC CCTCTAGGAG ATACAGGACA AATACAGGAA AATACAGCAA 3241 CCCAAACTGA CAATACTCTA TACAAGAACA TAATCACTAA GCAGGAGTCA CAGCCACACA 3301 ACCAAGATGC ATAGTATCCA AAGTGCAGCT GGAGACAGGA AGCTGCTTGG GAGTCTCCAG 3361 CAGAACCTTT AGCTCCAACA CTG // LOCUS HUMFERHC1 515 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human ferritin heavy-chain gene, exon 1. ACCESSION M14211 M18522 KEYWORDS ferritin; iron-storage protein. SEGMENT 1 of 2 SOURCE Human liver DNA, clone pUCM11. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 515) AUTHORS Hentze,M.W., Keim,S., Papadopoulos,P., O'Brien,S., Modi,W., Drysdale,J., Leonard,W.J., Harford,J.B. and Klausner,R.D. TITLE Cloning, characterization, expression, and chromosomal localization of a human ferritin heavy-chain gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 7226-7230 (1986) STANDARD full staff_review REFERENCE 2 (sites; revisions 134 to 387) AUTHORS Hentze,M.W., Caughman,S.W., Rouault,T.A., Barriocanal,J.G., Dancis,A., Harford,J.B. and Klausner,R.D. TITLE Identification of the Iron-Responsive Element for the Translational Regulation of Human Ferritin mRNA JOURNAL Science 238, 1570-1573 (1987) STANDARD full staff_review COMMENT Human ferritin heavy-chain gene sequences represent a large multigene family dispersed over at least eight different chromosomes. FEATURES from to/span description pept 374 + 487 ferritin heavy-chain, exon 1 /nomgen="FTHL6" /map="11q13" /hgml_locus_uid="LJ0008M" pre-msg 162 > 515 fer mRNA (alt.) pre-msg 170 > 515 fer mRNA (alt.) IVS 488 > 515 fer intron A unsure 502 502 g may be absent BASE COUNT 78 A 212 C 160 G 65 T ORIGIN 5 bp upstream of SacI site; chromosome 11q13. 1 GAGCTCCGCC AGAGCGCGCG AGGGCCTCCA GCGGCCGCCC CTCCCCCACA GCAGGGGCGG 61 GGTCCCGGCC CACCGGAAGG AGCGGGCTCG GGGCGGGCGG CGCTGATTGG CCGGGGCGGG 121 CCTGACGCCG ACGCGGCTAT AAGAGACCAC AAGCGACCCG CAGGGCCAGA CGTTCTTCGC 181 CGAGAGTCGT CGGGGTTTCC TGCTTCAACA GTGCTTGGAC GGAACCCGGC GCTCGTTCCC 241 CACCCCGGCC GGCCGCCCAT AGCCAGCCCT CCGTCACCTC TTCACCGCAC CCTCGGACTG 301 CCCCAAGGCC CCCGCCGCCG CTCCAGCGCC GCGCAGCCAC CGCCGCCGCC GCCGCCTCTC 361 CTTAGTCGCC GCCATGACGA CCGCGTCCAC CTCGCAGGTG CGCCAGAACT ACCACCAGGA 421 CTCAGAGGCC GCCATCAACC GCCAGATCAA CCTGGAGCTC TACGCCTCCT ACGTTTACCT 481 GTCCATGGTG AGCGCGGGCG GGCCTAAGCG GTGGC // LOCUS HUMFERHC2 996 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human ferritin heavy-chain gene, exons 2, 3 and 4. ACCESSION M14212 KEYWORDS ferritin; iron-storage protein. SEGMENT 2 of 2 SOURCE Human liver DNA, clone pUCM11. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 996) AUTHORS Hentze,M.W., Keim,S., Papadopoulos,P., O'Brien,S., Modi,W., Drysdale,J., Leonard,W.J., Harford,J.B. and Klausner,R.D. TITLE Cloning, characterization, expression, and chromosomal localization of a human ferritin heavy-chain gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 7226-7230 (1986) STANDARD full staff_review COMMENT A polyadenylation signal is located at positions 978-983. FEATURES from to/span description pept + 47 193 ferritin heavy-chain, exon 2 /nomgen="FTHL6" /map="11q13" /hgml_locus_uid="LJ0008M" 450 575 ferritin heavy-chain, exon 3 672 836 ferritin heavy-chain, exon 4 pre-msg < 1 > 836 fer mRNA (alt.) IVS < 1 46 fer intron A IVS 194 449 fer intron B IVS 576 671 fer intron C BASE COUNT 279 A 211 C 227 G 279 T ORIGIN About 1.5 kb after segment 1; chromosome 11q13. 1 AGCTTCTCTG ATCCCTAGTA TAAACACTTC AGTGTTCCCC TTTCAGTCTT ACTACTTTGA 61 CCGCGATGAT GTGGCTTTGA AGAACTTTGC CAAATACTTT CTTCACCAAT CTCATGAGGA 121 GAGGGAACAT GCTGAGAAAC TGATGAAGCT GCAGAACCAA CGAGGTGGCC GAATCTTCCT 181 TCAGGATATC AAGGTGAACA AAAGATCCTA GGGGTGTCAT ACTTCATCAT CTGGCAGTGT 241 TCGGGTATCA GAAATCACTT AAACTAGCAA TTGCCCTTAT AAAGTGATGA TACACTGGGC 301 TTTTGCCTTT TGTGCTTTTT TAGGCTTACC ATCTAAACTA AATTAGGCAA ATAGTAATGT 361 CCCTTTTGCC AAAACGTGGT GGTTAGAGAT GATGGGCTTG CTGACTTCTA GGTTAGTTGG 421 TAGAGATGCA TTAACCTATT CTCATTCAGA AACCAGACTG TGATGACTGG GAGAGCGGGC 481 TGAATGCAAT GGAGTGTGCA TTACATTTGG AAAAAAATGT GAATCAGTCA CTACTGGAAC 541 TGCACAAACT GGCCACTGAC AAAAATGACC CCCATGTGAG TATTGGAACC CCAGGAAATA 601 AATGGAGGAA ATCATTTGCC TTAGGGATTG GGAAAGCTGC CCACTAACTG TCTTGCCCCA 661 TTGTTTTGCA GTTGTGTGAC TTCATTGAGA CACATTACCT GAATGAGCAG GTGAAAGCCA 721 TCAAAGAATT GGGTGACCAC GTGACCAACT TGCGCAAGAT GGGAGCGCCC GAATCTGGCT 781 TGGCGGAATA TCTCTTTGAC AAGCACACCC TGGGAGACAG TGATAATGAA AGCTAAGCCT 841 CGGGCTAATT TCCCCATAGC CGTGGGGTGA CTTCCCTGGT CACCAAGGCA GTGCATGCAT 901 GTTGGGGTTT CCTTTACCTT TTCTATAAGT TGTACCAAAA CATCCACTTA AGTTCTTTGA 961 TTTGTACCAT TCCTTCAAAT AAAGAAATTT GGTACC // LOCUS HUMFESFUR 4160 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human fur gene exon Z and v-fes/fps oncogene exons 1 and 2. ACCESSION M14589 KEYWORDS fes oncogene; fps oncogene. SOURCE Human bone marrow KG-1 cell line from a patient with acute myelogenous leukemia DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4160) AUTHORS Roebroek,A.J.M., Schalken,J.A., Bussemakers,M.J.G., Van Heerikhuizen,H., Omnekink,C., Debruyne,F.M.J., Bloemers,H.P.J. and Van de Ven,W.J.M. TITLE Characterization of human c-fes/fps reveals a new transcription unit (fur) in the immediately upstream region of the proto- oncogene JOURNAL Mol. Biol. Rep. 11, 117-125 (1986) STANDARD simple staff_review FEATURES from to/span description pept / 3345 3357 c-fes/fps protein exon 1 (AA at 3346) /nomgen="FES" /map="15q25-qter" /hgml_locus_uid="LV0075X" 3850 / 4071 c-fes/fps protein exon 2 mRNA 296 2271 fur protein exon Z pre-msg 281 > 4160 c-fes/fps mRNA and introns IVS < 1 280 fur intron A (no splice consensus) IVS 2272 3344 fur intron B (no splice consensus) IVS 3358 3849 fes/fps intron C IVS 4072 > 4160 fes/fps intron D BASE COUNT 669 A 1323 C 1324 G 844 T ORIGIN Chromosome 15q25-qter. 1 GGATCCTGGG GATGTGGTGA CTTGGCTTGG GGCTGCTGTG GTCCTGGGGC TACAGTCTGT 61 TTAGCTGACA CACACTTGCC CTCTCTCCCA CGCCGGCAGT GTGCGAGGAA GGCTTCTCCC 121 TGCACCAGAA GAGCTGTGTC CAGCACTGCC CTCCAGGCTT CGCCCCCCAA GTCCTCGATA 181 CGCACTATAG CACCGAGAAT GACGTGGAGA CCATCCGGGC CAGCGTCTGC GCCCCCTGCC 241 ACGCCTCATG TGCCACATGC CAGGGGCCGG CCCTGACAGA CTGCCTCAGC TGCCCCAGCC 301 ACGCCTCCTT GGACCCTGTG GAGCAGACTT GCTCCCGGCA AAGCCAGAGC AGCCGAGAGT 361 CCCCGCCACA GCAGCAGCCA CCTCGGCTGC CCCCGGAGGT GGAGGCGGGG CAACGGCTGC 421 GGGCAGGGCT GCTGCCCTCA CACCTGCCTG AGGTGGTGGC CGGCCTCAGC TGCGCCTTCA 481 TCGTGCTGGT CTTCGTCACT GTCTTCCTGG TCCTGCAGCT GCGCTCTGGC TTTAGTTTTC 541 GGGGGGTGAA GGTGTACACC ATGGACCGTG GCCTCATCTC CTACAAGGGG CTGCCCCCTG 601 AAGCCTGGCA GGAGGAGTGC CCGTCTGACT CAGAAGAGGA CGAGGGCCGG GGCGAGAGGA 661 CCGCCTTTAT CAAAGACCAG AGCGCCCTCT GATGAGCCCA CTGCCCACCC CCTCAAGCCA 721 ATCCCCTCCT TGGGCACTTT TTAATTCACC AAAGTATTTT TTTATCTTGG GACTGGGTTT 781 GGACCCCAGC TGGGAGGCAA GAGGGGTGGA GACTGTTTCC CATCCTACCC TCGGGCCCAC 841 CTGGCCACCT GAGGTGGGCC CAGGACCAGC TGGGGCGTGG GGAGGGCCGT ACCCCACCCT 901 CAGCACCCCT TCCATGTGGA GAAAGGAGTG AAACCTTTAG GGCAGCTTGC CCCGGCCCCG 961 GCCCCAGCCA GAGTTCCTGC GGAGTGAAGA GGGGCAGCCC TTGCTTGTTG GGATTCCTGA 1021 CCCAGGCCGC AGCTCTTGCC CTTCCCTGTC CCTCTAAAGC AATAATGGTC CCATCCAGGC 1081 AGTCGGGGGC TGGCCTAGGA GATATCTGAG GGAGGAGGCC ACCTCTCCAA GGGCTTCTGC 1141 ACCCTCCACC CTGTCCCCCA GCTCTGGTGA GTCTTGGCGG CAGCAGCCAT CATAGGAAGG 1201 GACCAAGGCA AGGCAGGTGC CTCCAGGTGT GCACGTGGCA TGTGGCCTGT GGCCTGTGTC 1261 CCATGACCCA CCCCTGTGCT CCGTGCCTCC ACCACCACTG GCCACCAGGC TGGCGCAGCC 1321 AAGGCCGAAG CTCTGGCTGA ACCCTGTGCT GGTGTCCTGA CCACCCTCCC CTCTCTTGCA 1381 CCCGCCTCTC CCGTCAGGGC CCAAGTCCCT GTTTTCTGAG CCCGGGCTGC CTGGGCTGTT 1441 GGCACTCACA GACCTGGAGC CCCTGGGTGG GTGGTGGGGA GGGGCGCTGG CCCAGCCGGC 1501 CTCTCTGGCC TCCCACCCGA TGCTGCTTTC CCCTGTGGGG ATCTCAGGGG CTGTTTGAGG 1561 ATATATTTTC ACTTTGTGAT TATTTCACTT TAGATGCTGA TGATTTGTTT TTGTATTTTT 1621 AATGGGGGTA GCAGCTGGAC TACCCACGTT CTCACACCCA CCGTCCGCCC TGCTCCTCCC 1681 TGGCTGCCCT GGCCCTGAGG TGTGGGGGCT GCAGCATGTT GCTGAGGAGT GAGGAATAGT 1741 TGAGCCCCAA GTCCTGAAGA GGCGGGCCAG CCAGGCGGGC TCAAGGAAAG GGGGTCCCAG 1801 TGGGAGGGGC AGGCTGACAT CTGTGTTTCA AGTGGGGCTC GCCATGCCGG GGGTTCATAG 1861 GTCACTGGCT CTCCAAGTGC CAGAGGTGGG CAGGTGGTGG CACTGAGCCC CCCCAACACT 1921 GTGCCCTGGT GGAGAAAGCA CTGACCTGTC ATGCCCCCCT CAAACCTCCT CTTCTGACGT 1981 GCCTTTTGCA CCCCTCCCAT TAGGACAATC AGTCCCCTCC CATCTGGGAG TCCCCTTTTC 2041 TTTTCTACCC TAGCCATTCC TGGTACCCAG CCATCTGCCC AGGGGTGCCC CCTCCTCTCC 2101 CATCCCCCTG CCCTCGTGGC CAGCCCGGCT GGTTTTGTAA GATACTGGGT TGGTGCACAG 2161 TGATTTTTTT CTTGTAATTT AAACAGGCCC AGCATTGCTG GTTCTATTTA ATGGACATGA 2221 GATAATGTTA GAGGTTTTAA AGTGATTAAA CGTGCAGACT ATGCAAACCA GGCCCAGTCT 2281 CCAGTGTGGT ACCGTTGCTC CTGCATCGCA GCTGAGGATA GGGGGCCAGT TAGGCCTACA 2341 CAGTGGCCTG CCTGCCTGGA TGTGGGCCCA AGTCAGAAGG CCAAAGTCCT CCAAGGGGCG 2401 GGAGGATGCG CCAGCCCCTA GTGGAGGAGC TGGTGCCCCT GGGGTGGGGC TGGTGACCCC 2461 TGGTCCTCAG GAGCTGAGCA CTAAACTCCC AAAGTCCTGG TTTCCAGCAG TGTGAAGAAC 2521 TGGGCCTATT GTGTCTTCCT GGGCTGAAGT GATCTGGTCG CCACAGGCTA TAGGGCTGAG 2581 GCCTAAGGTG GAGGGAGGCC TGACTGAATC AAGATGACTT CTTGTGGGGA GCCTGAGTCC 2641 CAAATGGAAA ACTCCACGCC TGTCCGCTCC CCAACCCCTG CCCCTTGATT TCCCCAGGTC 2701 TCCCTTGGGA CAGGAAGCCC CTGCCTGGGG GTAGGAGGAT GGGGACAAAA CCACTAGGAT 2761 CTGTATCCGA GAAGCAGTCT CTGTTCGGGA TATTTACTTG GAAATTTTAT TCAAATGGAA 2821 GCTGGCGCCT GAGCCTCTCC TTAGGGAATT CCGTGAGGTG GGGAGGGCTG GGACCAGGGT 2881 TCCCTCTTTC TCTTCTGCGG TGGCCCTGGC CTGGTGCTAG GACTGCGCGC CTCCCCTCAG 2941 TACCCGCGGA CACCCTGGGC TTCCCTGGGC CCAGCATCTG CCTGGGGCCT CGCCTGGGCT 3001 CCCCCTCCTG ACCCCCACCT TGCGCCCCTT CCCGGTGTTC CCGGGGCGCT GCCGGGCCCT 3061 GGGGCCTGCG GGGCGCGGGC GGCTCTTGGC TGGGCCATTC TTTCCCGGCC CCCTCCTCCC 3121 TTCCGTTTCC GTGGCCGTGC GGCCGGCTAG AGGCTGCGGC CCAGCGCGGA GCAGGGGGGC 3181 TGGCAGGCGT CGGGACGGTC GGGCCGGTCC CGCCCGCCCC TTCCCCTCCA CAGGCCCGCC 3241 CCGGGGCCTG GGCCAACTGA AACCGCGGGA GGAGGAAGCG CGGAATCAGG AACTGGCCGG 3301 GGTCCGCACC GGGCCTGAGT CGGTCCGAGG CCGTCCCAGG AGCAGCTGCC CGTGCGGGTA 3361 CCTCTAGCCC CGGGGCCTGG AGGAGCGGTG GGAGCTGGGG GCGCGGCAGG CAGGGGCAGA 3421 GCAGGCGTTC CGAGGGCCAG AGACCCACCC AGGTCGGGGT AGGGGCCGCG GAAGGGCGGG 3481 GATGGCCGCA GGGGCAGGGC TCAGGCTGTG GGCGCCTGAG GCTTCAGCTG GGGCAGGCTT 3541 GGCCTGTCGA GGACCTGGGC AAGGGTGTCC CTGTAAGGGG TGGTGGGTGG AAGGGCCTGG 3601 GGAGGGAGGC TCCAGGTTGG CTCCTGTTCC CGAACGTGCG GAGGAGACCC TGACGCTAAG 3661 GAAGCAATGA GGGCCAGTCC CCAGGCCAGG CTGCTGCTGG GTACCCATGG CTGCGTGTGA 3721 GCGAGGCAGG ACCCCACCTC CTCCCCGTCT GCAGTCCATC CTGACCCTAC AGTCCCCAGT 3781 CTCCTCGTCC CATGCCTCCG TCTCCAGCTG CTGCCTTGCC TCCAGGGATG GCCCCTTTTC 3841 TGTCCCCAGA ACAGCACTAT GGGCTTCTCT TCCGAGCTGT GCAGCCCCCA GGGCCACGGG 3901 GTCCTGCAGC AAATGCAGGA GGCCGAGCTT CGTCTACTGG AGGGCATGAG AAAGTGGATG 3961 GCCCAGCGGG TCAAGAGTGA CAGGGAGTAT GCAGGACTGC TTCACCACAT GTCCCTGCAG 4021 GACAGTGGGG GCCAGAGCCG GGCCATCAGC CCTGACAGCC CCATCAGTCA GGTGGGTCTC 4081 TATGGGACTC TGGTGGGTGC TGGCGTATCT GCCTTCTCCT TCCTCTCCTG GGGGCCCTCT 4141 GGGGCAGTGG CTGGAGATCT // LOCUS HUMFGF51 883 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human FGF-5 oncogene encoding fibroblast growth factor-related proteins, complete cds and exon 1. ACCESSION M23534 M21617 KEYWORDS FGF-5 oncogene; fibroblast growth factor-related protein. SEGMENT 1 of 3 SOURCE Human cell line VMCUB2-1 DNA, and cDNA to mRNA, clone 1-2-2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 883) AUTHORS Zhan,X., Bates,B., Hu,X. and Goldfarb,M. TITLE The human FGF-5 oncogene encodes a novel protein related to fibroblast growth factors JOURNAL Mol. Cell. Biol. 8, 3487-3495 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 409 525 FGF-5 protein 1 /nomgen="FGF5" /map="4q21" pept 522 + 873 FGF-5 protein 2, exon 1 /nomgen="FGF5" /map="4q21" mRNA 361 > 883 FGF-5 mRNA IVS 874 > 883 FGF-5 intron A BASE COUNT 172 A 281 C 236 G 194 T ORIGIN 289 bp upstream of HindIII site. 1 TGGGGACCAT CTGTTCTTGG CCCTGAGCCG GGGCAGGAAC TGCTTACCAC AGATATCCTG 61 TTTGGCCCAT ATTCAGCTGT TCCATCTGTT CTTGGCCCTG AGCCGGGGCA GGAACTGCTT 121 ACCACAGATA TCCTGTTTGG CCCATATTCA GCTGTTCCAT CTGTTCCTGA CCTTGATCTG 181 AACTTTTCTA TTCTCAGTTA TGTATTTTTC CTAGCCTTGC AAAATGGCGT TACCGCGAGG 241 CTCCCTCCCC GCACCGGCCA GTGAGTACAC AAAGCCGCGG GTGAGGGGAA GCTTCGCAGG 301 CGTGCACGGA GCAGTGAGAT CACTGGCGTT ATAAATATCC CGGTGCCAGC GCGGAGATCC 361 GCTCGGGTGG CCTCTCTCTT CCCCTCTCCC CTTCTCTTCC CCGAGGCTAT GTCCACCCGG 421 TGCGGCGAGG CGGGCCAGAG CAGAGGCACG CAGCCGCACA GGGGCTACAG AGCCCAGAAT 481 CAGCCCTACA AGATGCACTT AGGACCCCCG CGGCTGGAAG AATGAGCTTG TCCTTCCTCC 541 TCCTCCTCTT CTTCAGCCAC CTGATCCTCA GCGCCTGGGC TCACGGGGAG AAGCGTCTCG 601 CCCCCAAAGG GCAACCCGGA CCCGCTGCCA CTGATAGGAA CCCTAGAGGC TCCAGCAGCA 661 GACAGAGCAG CAGTAGCGCT ATGTCTTCCT CTTCTGCCTC CTCCTCCCCC GCAGCTTCTC 721 TGGGCAGCCA AGGAAGTGGC TTGGAGCAGA GCAGTTTCCA GTGGAGCCTC GGGGCGCGGA 781 CCGGCAGCCT CTACTGCAGA GTGGGCATCG GTTTCCATCT GCAGATCTAC CCGGATGGCA 841 AAGTCAATGG ATCCCACGAA GCCAATATGT TAAGTAAGTT ACA // LOCUS HUMFGF52 114 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human FGF-5 oncogene encoding fibroblast growth factor-related protein, exon 2. ACCESSION M23535 M21617 KEYWORDS FGF-5 oncogene; fibroblast growth factor-related protein. SEGMENT 2 of 3 SOURCE Human cell line VMCUB2-1 DNA, and cDNA to mRNA, clone 1-2-2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 114) AUTHORS Zhan,X., Bates,B., Hu,X. and Goldfarb,M. TITLE The human FGF-5 oncogene encodes a novel protein related to fibroblast growth factors JOURNAL Mol. Cell. Biol. 8, 3487-3495 (1988) STANDARD simple staff_entry FEATURES from to/span description pept + 11 + 114 FGF-5 protein 2, exon 2 (AA at 13) /nomgen="FGF5" /map="4q21" mRNA < 1 > 114 FGF-5 mRNA IVS < 1 10 FGF-5 intron A BASE COUNT 36 A 16 C 28 G 34 T ORIGIN Undetermined number of bp after segment 1. 1 GTCATCCTAG GTGTTTTGGA AATATTTGCT GTGTCTCAGG GGATTGTAGG AATACGAGGA 61 GTTTTCAGCA ACAAATTTTT AGCGATGTCA AAAAAAGGAA AACTCCATGC AAGT // LOCUS HUMFGF53 535 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human FGF-5 oncogene encoding fibroblast growth factor-related protein, exon 3. ACCESSION M23536 M21617 KEYWORDS FGF-5 oncogene; fibroblast growth factor-related protein. SEGMENT 3 of 3 SOURCE Human cell line VMCUB2-1 DNA, and cDNA to mRNA, clone 1-2-2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 535) AUTHORS Zhan,X., Bates,B., Hu,X. and Goldfarb,M. TITLE The human FGF-5 oncogene encodes a novel protein related to fibroblast growth factors JOURNAL Mol. Cell. Biol. 8, 3487-3495 (1988) STANDARD simple staff_entry FEATURES from to/span description pept + 11 358 FGF-5 protein 2, exon 3 (AA at 11) /nomgen="FGF5" /map="4q21" mRNA < 1 > 535 FGF-5 mRNA IVS < 1 10 FGF-5 intron B BASE COUNT 167 A 120 C 105 G 143 T ORIGIN Undetermined number of bp after segment 2. 1 TCCTCCTTAG GCCAAGTTCA CAGATGACTG CAAGTTCAGG GAGCGTTTTC AAGAAAATAG 61 CTATAATACC TATGCCTCAG CAATACATAG AACTGAAAAA ACAGGGCGGG AGTGGTATGT 121 TGCCCTGAAT AAAAGAGGAA AAGCCAAACG AGGGTGCAGC CCCCGGGTTA AACCCCAGCA 181 TATCTCTACC CATTTTCTTC CAAGATTCAA GCAGTCGGAG CAGCCAGAAC TTTCTTTCAC 241 GGTTACTGTT CCTGAAAAGA AAAATCCACC TAGCCCTATC AAGTCAAAGA TTCCCCTTTC 301 TGCACCTCGG AAAAATACCA ACTCAGTGAA ATACAGACTC AAGTTTCGCT TTGGATAATA 361 TTAATCTTGG CCTTGTGAGA AACCATTCTT TCCCCTCAGG AGTTTCTATA GGTGTCTTCA 421 GAGTTCTGAA GAAAAATTAC TGGACACAGC TTCAGCTATA CTTACACTGT ATTGAAGTCA 481 CGTCATTTGT TTCAGTGTGA CTGAAACAAA ATGTTTTTTG ATAGGAAGGA AACTG // LOCUS HUMFGR1 139 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human c-fgr proto-oncogene, exon 4. ACCESSION M12502 KEYWORDS oncogene; proto-oncogene. SEGMENT 1 of 7 SOURCE Human placenta DNA, clones lambda-[015,010]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 139) AUTHORS Nishizawa,M., Semba,K., Yoshida,M.C., Yamamoto,T., Sasaki,M. and Toyoshima,K. TITLE Structure, expression, and chromosomal location of the human c-fgr gene JOURNAL Mol. Cell. Biol. 6, 511-517 (1986) STANDARD full staff_review REFERENCE 2 (bases 4 to 138) AUTHORS Nishizawa,M., Semba,K., Yamamoto,T. and Toyoshima,K. TITLE Human c-fgr gene does not contain coding sequence for actin-like protein JOURNAL Jpn. J. Cancer Res. 76, 155-159 (1985) STANDARD full staff_review COMMENT Draft entry and clean copy of the sequence in [1] were kindly provided by K.Toyoshima, 02-JUN-1986. FEATURES from to/span description pept / 21 + 119 c-fgr peptide, exon 4 (AA at 22) /nomgen="FGR" /map="1p36.2-p36.1" /hgml_locus_uid="LN0028P" IVS < 1 20 cfgr intron 3 IVS 120 > 139 cfgr intron 4 BASE COUNT 31 A 33 C 42 G 33 T ORIGIN PstI site; chromosome 1p36.2-p36.1. 1 GTTCTGTTCT GTGCCTACAG TGAAGGTGAC TGGTGGGAGG CTCGGTCTCT CAGCTCCGGA 61 AAAACTGGCT GCATTCCCAG CAACTACGTG GCCCCTGTTG ACTCAATCCA AGCTGAAGAG 121 TAAGTAGGGA TTGGGGCAA // LOCUS HUMFGR2 144 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human c-fgr proto-oncogene, exon 5. ACCESSION M12719 KEYWORDS oncogene; proto-oncogene. SEGMENT 2 of 7 SOURCE Human placenta DNA, clones lambda-[015,010]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 144) AUTHORS Nishizawa,M., Semba,K., Yoshida,M.C., Yamamoto,T., Sasaki,M. and Toyoshima,K. TITLE Structure, expression, and chromosomal location of the human c-fgr gene JOURNAL Mol. Cell. Biol. 6, 511-517 (1986) STANDARD full staff_review COMMENT Draft entry and clean copy of the sequence in [1] were kindly provided by K.Toyoshima, 02-JUN-1986. FEATURES from to/span description pept + 21 + 124 c-fgr peptide, exon 5 /nomgen="FGR" /map="1p36.2-p36.1" /hgml_locus_uid="LN0028P" IVS < 1 20 cfgr intron 4 IVS 125 > 144 cfgr intron 5 BASE COUNT 34 A 39 C 44 G 27 T ORIGIN About 4.1 kb after segment 1; chromosome 1p36.2-p36.1. 1 TTGCCTGCCT TTCCCAACAG GTGGTACTTT GGAAAGATTG GGAGAAAGGA TGCAGAGAGG 61 CAGCTGCTTT CACCAGGCAA CCCCCAGGGG GCCTTTCTCA TTCGGGAAAG CGAGACCACC 121 AAAGGTAGGG GTGGTGCCAC CCCC // LOCUS HUMFGR3 190 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human c-fgr proto-oncogene, exon 6. ACCESSION M12720 KEYWORDS oncogene; proto-oncogene. SEGMENT 3 of 7 SOURCE Human placenta DNA, clones lambda-[015,010]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 190) AUTHORS Nishizawa,M., Semba,K., Yoshida,M.C., Yamamoto,T., Sasaki,M. and Toyoshima,K. TITLE Structure, expression, and chromosomal location of the human c-fgr gene JOURNAL Mol. Cell. Biol. 6, 511-517 (1986) STANDARD full staff_review COMMENT Draft entry and clean copy of the sequence in [1] were kindly provided by K.Toyoshima, 02-JUN-1986. FEATURES from to/span description pept + 21 + 170 c-fgr peptide, exon 6 /nomgen="FGR" /map="1p36.2-p36.1" /hgml_locus_uid="LN0028P" IVS < 1 20 cfgr intron 5 IVS 171 > 190 cfgr intron 6 BASE COUNT 48 A 51 C 55 G 36 T ORIGIN About 180 bp after segment 2; chromosome 1p36.2-p26.1. 1 AAAAGTGATC CTCTCCACAG GTGCCTACTC CCTGTCCATC CGGGACTGGG ATCAGACCAG 61 AGGCGATCAT GTGAAGCATT ACAAGATCCG CAAACTGGAC ATGGGCGGTT ACTACATCAC 121 CACACGGGTT CAGTTCAACT CGGTGCAGGA GCTGGTGCAG CACTACATGG GTGAGGGCAG 181 GGGCCTCAGA // LOCUS HUMFGR4 196 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human c-fgr proto-oncogene, exon 7. ACCESSION M12721 KEYWORDS oncogene; proto-oncogene. SEGMENT 4 of 7 SOURCE Human placenta DNA, clones lambda-[015,010]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 196) AUTHORS Nishizawa,M., Semba,K., Yoshida,M.C., Yamamoto,T., Sasaki,M. and Toyoshima,K. TITLE Structure, expression, and chromosomal location of the human c-fgr gene JOURNAL Mol. Cell. Biol. 6, 511-517 (1986) STANDARD full staff_review COMMENT Draft entry and clean copy of the sequences in [1] were kindly provided by K.Toyoshima, 02-JUN-1986. FEATURES from to/span description pept + 21 + 176 c-fgr peptide, exon 7 /nomgen="FGR" /map="1p36.2-p36.1" /hgml_locus_uid="LN0028P" IVS < 1 20 cfgr intron 6 IVS 177 > 196 cfgr intron 7 BASE COUNT 31 A 66 C 67 G 32 T ORIGIN About 1 kb after segment 3; chromosome 1p36.2-p36.1. 1 CTTCATGACC CCTCCCCTAG AGGTGAATGA CGGGCTGTGC AACCTGCTCA TCGCGCCCTG 61 CACCATCATG AAGCCGCAGA CGCTGGGCCT GGCCAAGGAC GCCTGGGAGA TCAGCCGCAG 121 CTCCATCACG CTGGAGCGCC GGCTGGGCAC CGGCTGCTTC GGGGATGTGT GGCTGGGTAC 181 GGAGCTCCCG GGGGCC // LOCUS HUMFGR5 212 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human c-fgr proto-oncogene, exon 8. ACCESSION M12722 KEYWORDS oncogene; proto-oncogene. SEGMENT 5 of 7 SOURCE Human placenta DNA, clones lambda-[015,010]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 212) AUTHORS Nishizawa,M., Semba,K., Yoshida,M.C., Yamamoto,T., Sasaki,M. and Toyoshima,K. TITLE Structure, expression, and chromosomal location of the human c-fgr gene JOURNAL Mol. Cell. Biol. 6, 511-517 (1986) STANDARD full staff_review COMMENT Draft entry and clean copy of the sequence in [1] were kindly provided by K.Toyoshima, 02-JUN-1986. FEATURES from to/span description pept + 21 + 200 c-fgr peptide, exon 8 /nomgen="FGR" /map="1p36.2-p36.1" /hgml_locus_uid="LN0028P" IVS < 1 20 cfgr intron 7 IVS 201 > 212 cfgr intron 8 BASE COUNT 46 A 58 C 76 G 32 T ORIGIN About 75 bp after segment 4; chromosome 1p36.2-p36.1. 1 ACAAGACAGC CTCCGAGCAG GCACGTGGAA CGGCAGCACT AAGGTGGCGG TGAAGACGCT 61 GAAGCCGGGC ACCATGTCCC CGAAGGCCTT CCTGGAGGAG GCGCAGGTCA TGAAGCTGCT 121 GCGGCACGAC AAGCTGGTGC AGCTGTACGC CGTGGTGTCG GAGGAGCCCA TCTACATCGT 181 GACCGAGTTC ATGTGTCACG GTCAGGAGGC GG // LOCUS HUMFGR6 117 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human c-fgr proto-oncogene, exon 9. ACCESSION M12723 KEYWORDS oncogene; proto-oncogene. SEGMENT 6 of 7 SOURCE Human placenta DNA, clones lambda-[015,010]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 117) AUTHORS Nishizawa,M., Semba,K., Yoshida,M.C., Yamamoto,T., Sasaki,M. and Toyoshima,K. TITLE Structure, expression, and chromosomal location of the human c-fgr gene JOURNAL Mol. Cell. Biol. 6, 511-517 (1986) STANDARD full staff_review COMMENT Draft entry and clean copy of the sequence in [1] were kindly provided by K.Toyoshima, 02-JUN-1986. FEATURES from to/span description pept + 21 + 97 c-fgr peptide, exon 9 /nomgen="FGR" /map="1p36.2-p36.1" /hgml_locus_uid="LN0028P" IVS < 1 20 cfgr intron 8 IVS 98 > 117 cfgr intron 9 BASE COUNT 23 A 33 C 33 G 28 T ORIGIN About 480 bp after segment 5; chromosome 1p36.2-p36.1. 1 ACTTTCTGGC TTCTTCCCAG GCAGCTTGCT GGATTTTCTC AAGAACCCAG AGGGCCAGGA 61 TTTGAGGCTG CCCCAATTGG TGGACATGGC AGCCCAGGTA ACTGGGCCAG CAGCCTT // LOCUS HUMFGR7 219 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human c-fgr proto-oncogene, exon 10. ACCESSION M12724 KEYWORDS oncogene; proto-oncogene. SEGMENT 7 of 7 SOURCE Human placenta DNA, clones lambda-[015,010]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 14 to 207) AUTHORS Nishizawa,M., Semba,K., Yoshida,M.C., Yamamoto,T., Sasaki,M. and Toyoshima,K. TITLE Structure, expression, and chromosomal location of the human c-fgr gene JOURNAL Mol. Cell. Biol. 6, 511-517 (1986) STANDARD full staff_review REFERENCE 2 (bases 1 to 219) AUTHORS Nishizawa,M., Semba,K., Yamamoto,T. and Toyoshima,K. TITLE Human c-fgr gene does not contain coding sequence for actin-like protein JOURNAL Jpn. J. Cancer Res. 76, 155-159 (1985) STANDARD full staff_review COMMENT Draft entry and clean copies of the sequences in [1] and [2] were kindly provided by K.Toyoshima, 02-JUN-1986. FEATURES from to/span description pept + 34 / 187 c-fgr peptide, exon 10 /nomgen="FGR" /map="1p36.2-p36.1" /hgml_locus_uid="LN0028P" IVS < 1 33 cfgr intron 9 IVS 188 > 219 cfgr intron 10 BASE COUNT 47 A 72 C 58 G 42 T ORIGIN About 265 bp after segment 6; chromosome 1p36.2-p36.1. 1 GAGCTCCCAT CTCTCCACAC TATGGTCCCC CAGGTAGCTG AGGGCATGGC CTACATGGAA 61 CGCATGAACT ACATTCACCG CGACCTGAGG GCAGCCAACA TCCTGGTTGG GGAGCGGCTG 121 GCGTGCAAGA TCGCAGACTT TGGCTTGGCG CGTCTCATCA AGGACGATGA GTACAACCCC 181 TGCCAAGGTG CCCTGCTTCA CCCCACCTTC CAAGAGCTC // LOCUS HUMFGRINT 669 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human fgr proto-oncogene encoded p55-c-fgr protein, partial exons 2 and 3. ACCESSION M19721 KEYWORDS p55-c-fgr protein; proto-oncogene. SOURCE Human peripheral blood mononuclear cell DNA, clones pc22 and pc41. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 669) AUTHORS Katamine,S., Notario,V., Rao,C.D., Miki,T., Cheah,M.S.C., Tronick,S.R. and Robbins,K.C. TITLE Primary structure of the human fgr proto-oncogene product p55-c-fgr JOURNAL Mol. Cell. Biol. 8, 259-266 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by S.Tronick, 10-NOV-1988. FEATURES from to/span description pept < 1 90 p55-c-fgr protein, exon 2 (AA at 1) /nomgen="FGR" /map="1p36.2-p36.1" /hgml_locus_uid="LN0028P" 640 > 669 p55-c-fgr protein, exon 2 pre-msg < 1 > 669 p55-c-fgr mRNA and intron IVS 91 639 p55-c-fgr intron B BASE COUNT 156 A 167 C 208 G 138 T ORIGIN 425 bp upstream of BamHI site; chromosome 1p36.2-p36.1. 1 TCATTTGCCC ACATCCCCAA CTACAGCAAC TTCTCCTCTC AGGCCATCAA CCCTGGCTTC 61 CTTGATAGTG GCACCATCAG GGGTGTGTCA GTGAGTCCAA GGGGTCGGGA GGCAGGAGCT 121 GCCCGTGGAT CCTGGGAGAA ACTGAGGGAA GAAGAAGAGA TGCGAACTTG CCCTTAGGAG 181 CCTCCAGGAG GATGTGGCAG ATACAACCCC GCTTTCAAGA TCACATGGGC TGAGGGAGTC 241 TGTACAGCCC TCCCATCAGA AACCACAGTC TGCAGGGGAG GGTCAAGAAG CTCTACTCCC 301 AGTCTGAGAG GGGCAGGATC CTGTGACAGA TGCAAGTGAC AAAGAGAAAC TTCTTGCCCT 361 TTTAGGTGCC ACTTCCCAGA TGGGAAGTCT TCTTGGTGGG GAAGAGGAGG AGTGGGCAAC 421 AAGGGGATCC TCCATGGTGG GAGGAATGGG CTTGAAGTTG TGTGTCCTAA GCTGTGGAGA 481 CCAAATCAGA AATTCCTTGG ACCCCAAAGG CCTTTGGGAA CCAGAGCACT AAAGGAGTGG 541 GGAGGTGCAG CACCTGGCTG GGGAACAGGA ATTTGGGGTG CAGCCCCCTT GGTGCTTCTG 601 CCCCATGCCC TACCCTGCTG AGTAGCCCTG ACTCTGCAGG GGATTGGGGT GACCCTGTTC 661 ATTGCCCTG // LOCUS HUMFIXG 38059 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human factor IX gene, complete cds. ACCESSION K02402 KEYWORDS Alu repetitive sequence; Christmas factor; KpnI repetitive sequence; antihemophilic factor B; factor IX; repetitive sequence; simple repetitive sequence. SOURCE Human: genomic DNA library of Maniatis et al., clones FIX-lambda-[6,36,53,61]; fibroblast cell (49,XXXXX) DNA, clone FIX-lambda-4243. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 38059) AUTHORS Yoshitake,S., Schach,B.G., Foster,D.C., Davie,E.W. and Kurachi,K. TITLE Nucleotide sequence of the gene for human factor IX (antihemophilic factor B) JOURNAL Biochemistry 24, 3736-3750 (1985) STANDARD full staff_review REFERENCE 2 (bases 23487 to 23556) AUTHORS Rees,D.J.G., Rizza,C.R. and Brownlee,G.G. TITLE Haemophilia B caused by a point mutation in a donor splice junction of the human factor IX gene JOURNAL Nature 316, 643-645 (1985) STANDARD full staff_entry REFERENCE 3 (bases 23378 to 23387) AUTHORS Graham,J.B., Lubahn,D.B., Lord,S.T., Kirshtein,J., Nilsson,I.M., Wallmark,A., Ljung,R., Frazier,L.D., Ware,J.L., Lin,S.W., Stafford,D.W. and Bosco,J. TITLE The Malmoe polymorphism of coagulation factor IX, an immunologic polymorphism due to dimorphism of residue 148 that is in linkage disequilibrium with two other F.IX polymorphisms JOURNAL Am. J. Hum. Genet. 42, 573-580 (1988) STANDARD full staff_entry COMMENT Sequence for [1] kindly submitted on floppy by K.Kurachi, 05-AUG-1985. [1] notes a potential TATA box (2939-2942) and polyadenylation signal (35701-35706); and notes two start codons (downstream of the start codon annotated below) that may be alternative and/or preferred starts for the factor IX prepropeptide. Several tracts of simple repetitive sequence are present [1], including regions with the potential for hairpin and/or Z-DNA formation. [1] describes six long open reading frames in the intron and on the complementary strand. FEATURES from to/span description pept 2995 3082 factor IX prepropeptide, exon 1 /nomgen="F9" /map="Xq26.3-q27.2" /hgml_locus_uid="LX0023Z" 9291 9454 factor IX prepropeptide, exon 2 9643 9667 factor IX prepropeptide, exon 3 13357 13470 factor IX prepropeptide, exon 4 20634 20762 factor IX prepropeptide, exon 5 23328 23530 factor IX prepropeptide, exon 6 33004 33118 factor IX prepropeptide, exon 7 33787 34334 factor IX prepropeptide, exon 8 sigp 2995 3082 factor IX signal peptide 9291 9340 factor IX signal peptide matp 9341 9454 factor IX 9643 9667 factor IX 13357 13470 factor IX 20634 20762 factor IX 23328 23530 factor IX 33004 33118 factor IX 33787 34331 factor IX 9341 9454 factor IX-a light chain matp 9643 9667 factor IX-a light chain 13357 13470 factor IX-a light chain 20634 20762 factor IX-a light chain 23328 23380 factor IX-a light chain 23487 23530 factor IX-a heavy chain 33004 33118 factor IX-a heavy chain 33787 34331 factor IX-a heavy chain pre-msg 2966 35722 FIX mRNA IVS 3083 9290 FIX cds intron A IVS 9455 9642 FIX cds intron B IVS 9668 13356 FIX cds intron C IVS 13471 20633 FIX cds intron D IVS 20763 23327 FIX cds intron E IVS 23531 33003 FIX cds intron F IVS 33119 33786 FIX cds intron G rpt 1 2596 KpnI repeat copy 1 rpt 7298 7593 Alu repeat copy 1 rpt 18165 20265 KpnI repeat copy 2 rpt 24172 24475 Alu repeat copy 2 rpt 25863 26091 Alu repeat copy 3 rpt 31537 31809 Alu repeat copy 4 rpt 35948 36262 Alu repeat copy 5 allele 8469 8520 t...50 bp...a in FIX-lambda-4243; ta in FIX-lambda-[36,61] [1] allele 10041 10041 g in one allele; c in another allele (loss of XmnI recognition pattern) allele 14076 14079 tcga in one allele; nnnn in another allele (loss of TaqI site) allele 23387 23387 g in one allele; a in another allele mut 23387 23387 g in [1]; a in [3] Ala->Thr mut 23531 23531 g in [1]; t in [2] (haemophilia patient) BASE COUNT 12326 A 7397 C 7441 G 10895 T ORIGIN 333 bp upstream of BalI site; chromosome Xq26.3-27.2. 1 GTATATCTAG AAAACCCCAT TGTCTCATTC CAAAATCACC TTAAGATGGA TAGGCAACTT 61 CAGCAAAGTC TCAGGATAAC AAAATCAATG TGCAAAAATC ACAGGCATTC TTATACACCA 121 ATAGCAGACA AACAGACAGC CAAATCATGA GTGAACTCCC ATTCACAATT GCTTCAAAGA 181 GAATAAAATA CCTAGGAATC CTACTTACAA GGGATGTGAA GGACCTCTTC AAGGAGAACT 241 ACAAACCACT GCTCAATGAA ATAAAAGAGG ATACAAACAA ATGGAAGAAC ATTACATGCT 301 CATGGGTAGG AAGAATCAAT ATCATGAAAA TGGCCATAAT GCCCAAGGTA ATTTATAGAT 361 TCAATGCCAT CCCCATCAAG CTACCAATGA CTTTCTTCAC AGAATTGGAA AAAACTACTT 421 TAAAGTTCAT ATGGAACCAA AAAAGAGCCC GCATCGCCAA GTCAATCCTA AGCCAAAAGA 481 ACAAAGCTGG AGGCATCATG CTACCTGACT TCAAACTATA CTACAAGGCT ACAGTAACCA 541 AAACAGCATG GTACTGGTAC CAAAACAGAG ATACAGACCA ATGGAACAGA ACAGAGCCCT 601 CAGAAATAAT GCCACATATC TACAACTATC TGATCTTTGA AAAACCTGAC AAAAACAAGA 661 AATGGGGAAA GGAATCCCTA ATTAATAAAT GGTGCTGGGA AAACTGGCTA GCCATATGTA 721 GAAAGCTGAA ACTGGATCCC TTCCTTATAC CTTATACAAA AATTAATTCA AGATGGATTA 781 AAGACTTCAT TGTTAGACCT AAAACCATAA AAACCCTAGA AGAAAACCTA GGCAATACCA 841 TTCAGGACAT AGGCATGGGC TTGGACTTCA TGTCTAAAAC ACCAAAAGCA ATGGCAACAA 901 AAGCCAAAAT TGACAAATGG GATCAAATGA AACTAAAGAG CTTCTGCACA GCAAAAGAAA 961 CTACCATCAG AGTGAACAGG CAACCTAAAG AATGGGAGAA AATTTTTGCA ATCTACTCAT 1021 CTGTCCAAGG GCTAATATCT AGAATCTAAA ATGAACTCAA ACAAATTTAC AGAAAAAAAC 1081 AAACAACCCC ATCAACAAGT GGGTGAAGGA TATGAACAGA CACTTCTCAA AAGAAGACAT 1141 TTATGCAGCC AACAGACACA TGAAAAAATG CTCAGCATCA CCGGCCATCA GAGAAATGCA 1201 AATCAAAACC ACAATGAGAT ACCATCTCAC ACAAGTTAGA ATGGCGATCA TCAAAAACTC 1261 AGGAAGCAAC AGGTGCTGGA GAGGATGTGG AGAAATAGGA ACACTTTGAC ACTGTTGGTG 1321 GGACTGTAAA CTAGTTCAAC CATTGTGGAA GTCAGTGTGG CGATTCCTCA GGGATCTAGA 1381 CCTAGAAATA CCATCTGACC CAGCCATCCC ATTATTGGGT ATATACCAAA GTATTATAAA 1441 TCATGCTGCT ATAAAGACAC ATGCACACGT ATGTTTATTG CGGCACTTTT CACAATAGCA 1501 ATGACTTGGA ACCAACCCAA ATGTCCAACA ATGATAGACT GGATTAAGAA AATGTGGCAC 1561 ATATACACCT AGGAATACTA GGCAGCCATA AAAAGAAAAT GAGTTCATGT CCTTTGTAGG 1621 GCATGGATGA AGCTAGAAAC CATCATTCTC AGCAAACTAT CGCAAGGACA AAAAACCAAA 1681 CACCGCATGT TCTCACTCAT AGGTGGGAAC TGAACAATGA GAACACTTGG ACACAGGAAG 1741 GGGAACATCA CACACCGGGG CCTGTTGTGG GGTGGGGGGC GAGGGGAGGG ATAGCATTAG 1801 GGGATATACC TAATGCTAAA TGACGAGTTA ATGGGTACAG CACACCAACA TGGCACATGT 1861 ATACATATGT AACAAACCTG CTCGTTGTGC ACATGTACCC TAAAACTTAA AGTATAATAA 1921 TAAAAAAAAG ATCATTCTAA AATTTATACA AGCCCTTAGA ACAGTTAAAA ATATCTTACC 1981 AAAAGAAGAA TAAAGTTGGA GGAATCACTC TACCTAATAT AAAGTCTTAC TACATAGCTA 2041 CAGTAATTAT GACAGTGTTA TATTGGCAGA GGGATAAATA CATCAATGGC ACAAAGAATA 2101 GATAGAGAAA CTGGAAGTAG ACCCAAAACA ATATGGTTAA CTGACTTACG AAAAAATTTC 2161 AGAAGCCATT CAGTCGAGGA AGGATAGGGT GGTATTGTTG TTTTTTGTTT TAACAAATTG 2221 TGCTGGATAA ATTGGACATA CCTATGGAAA AAAAAATGAA GTTTGACCTA AACATCATAC 2281 TTTACACAAA TATTAACTCA AAATGGAGCA TGGGCATAAA TCTAAAACTT CAAACTGTAA 2341 AACATTTAGA AAAAAATAGG AAAAAAACTA TCAGGATCTA GTGTTAGTGG AAGAGTTCTA 2401 AATGTGATCC ATAAAACAAA AACAAATAAA CTGGACTACA TCAAAACTAA AAAATTCTAC 2461 TCTGTGAAAG ACCTAATTAA GAGGACAAAA GACAAGCTAC AGGCTGGAGA CAATATATTT 2521 AATCCACGTA TCTATGAAAG GATTCATATC TAGAATATAT AAACAACCTT AAGAATCTGA 2581 CAGTAAAAAA AAAAAATCAG ACTAACTGGA CCACTCATAC ATTGCTGATG GAAATGTAAA 2641 GTGGTACAGC CATTTTGGTA AACATCATTG CTCTCTGACA AAGATACGGT GGGTCCCACT 2701 GATGAACTGT GCTGCCACAG TAAATGTAGC CACTATGCCT ATCTCCATTC TGAAGATGTG 2761 TCACTTCCTG TTTCAGACTC AAATCAGCCA CAGTGGCAGA AGCCCACGAA ATCAGAGGTG 2821 AAATTTAATA ATGACCACTG CCCATTCTCT TCACTTGTCC CAAGAGGCCA TTGGAAATAG 2881 TCCAAAGACC CATTGAGGGA GATGGACATT ATTTCCCAGA AGTAAATACA GCTCAGCTTG 2941 TACTTTGGTA CAACTAATCG ACCTTACCAC TTTCACAATC TGCTAGCAAA GGTTATGCAG 3001 CGCGTGAACA TGATCATGGC AGAATCACCA GGCCTCATCA CCATCTGCCT TTTAGGATAT 3061 CTACTCAGTG CTGAATGTAC AGGTTTGTTT CCTTTTTTAA AATACATTGA GTATGCTTGC 3121 CTTTTAGATA TAGAAATATC TGATGCTGTC TTCTTCACTA AATTTTGATT ACATGATTTG 3181 ACAGCAATAT TGAAGAGTCT AACAGCCAGC ACGCAGGTTG GTAAGTACTG GTTCTTTGTT 3241 AGCTAGGTTT TCTTCTTCTT CATTTTTAAA ACTAAATAGA TCGACAATGC TTATGATGCA 3301 TTTATGTTTA ATAAACACTG TTCAGTTCAT GATTTGGTCA TGTAATTCCT GTTAGAAAAC 3361 ATTCATCTCC TTGGTTTAAA AAAATTAAAA GTGGGAAAAC AAAGAAATAG CAGAATATAG 3421 TGAAAAAAAA TAACCACATT ATTTTTGTTT GGACTTACCA CTTTGAAATC AAAATGGGAA 3481 ACAAAAGCAC AAACAATGGC CTTATTTACA CAAAAAGTCT GATTTTAAGA TATATGACAT 3541 TTCAAGGTTT CAGAAGTATG TAATGAGGTG TGTCTCTAAT TTTTTAAATT ATATATCTTC 3601 AATTTAAAGT TTTAGTTAAA ACATAAAGAT TAACCTTTCA TTAGCAAGCT GTTAGTTATC 3661 ACCAAAGCTT TTCATGGATT AGGAAAAAAT CATTTTGTCT CTATGTCAAA CATCTTGGAG 3721 TTTGATTATT TGGGGAAACA CAATACTCAG TTGAGTTCCC TAGGGGAGAA AAGCAAGCTT 3781 AAGAATTGAC ATAAAGAGTA GGAAGTTAGC TAATGCAACA TATATCACTT TGTTTTTTCA 3841 CAACTACAGT GACTTTATGT ATTTCCCAGA GGAAGGCATA CAGGGAAGAA ATTATCCCAT 3901 TTGGACAAAC AGCATGTTCT CACAGGAAGC ATTTATCACA CTTACTTGTC AACTTTCTAG 3961 AATCAAATCT AGTAGCTGAC AGTACCAGGA TCAGGGGTGC CAACCCTAAG CACCCCCAGA 4021 AAGCTGACTG GCCCTGTGGT TCCCACTCCA GACATGATGT CAGCTGTGAA ATCAGACTGA 4081 AATGCTGAAA TAACGATAAA AAAAAATACA GAGGTTAAAC TAGCAAAGTG AGTAAAGTCA 4141 AGGGATAAAG AAAATTTGTT GGAAAACTCA CAAAGCAGGA CATAAAGCAA GGCCATTAGA 4201 TATATCTCAT TAGTGTGACA TCTGGGAGGA CAAAGCATCC AAACCCTTTC TTCTATATAA 4261 GTGGTGAGAT GATGAAGGTT GTAAGAGGCT TCTGCCCCCT TGAAGACTTC AGATGCTGGG 4321 GAAAGGATAG ATAAGAATAA GGATGAACCT GGCTTTTGGA GCCTGGGAAA TAATGACTAG 4381 CGATAAACCT GAAGGGAAGT TAAGTATACG ATCCCCAGAT AATACTAAGG AGAAAGGCAA 4441 TGTGATTCTG CAGCCATTGT AGCCAGAGAT AATAAGCCCT TGAGGAAGGG GCCAGGGGAA 4501 TTTTTCTAAG GATAGACAGT ATTAATGCAG CACTCTCTTC TGCTATTAAA CTCTCATTGG 4561 CTTCTAAAAG GAGTTTCGGT GAGTGATTTG CTGAGATGTT TGCATTTTCA TGCTGCTGCC 4621 TTTAGGTTAT TATTGCAACA GTTTGGAATT TTGAAATTAA AACAGTTCTG TAAAACCAGT 4681 TTAGTTTTGT AAAGTGTATG CATCAAAGAT GTCCTTCATT CAGACATTAC TGAGTTACAA 4741 CTACGGTGCC AGGTACTGTG TCAGGGTACT AGGGGTATGG GGATAAACCA GACTCCCTCT 4801 TTGATCTAAA GCAGCATGAG GCCAGGTGAG AGGTTTCAAT ATATGTGATA AAATGTGCAC 4861 TAGGTACTAA GGGATCATAG AGAAAGGAAC ACATTAAATG GGGAAACAAT TGATAGAGAG 4921 AGAATATTTT CATCTGGGTC TTAAAAGATG AGTAGGCGTT CTCTCTCTTT AAATGTCTGA 4981 TATAAGGGCA TTTTATGCAA AGAAGGATCA CTCGTGCAAA GACTCAGCTT TGCAAGAACG 5041 TGAGGTATTT CAGGAGTTTT GTATGGTTCC ATATGGACTA TGACAAGTGA GACAGGTAAA 5101 CTAGGCAGAG CTGGTCATCA GATAATGAAG TCATTAACCT AAGGAGATTG GACAATAAAA 5161 TGCAATATGG AGGTATCGAA GTATAAACAT AAGGAGTACC ACTGATGGCT GATTTAGGAT 5221 GCCCAGTCTG GCAACACGCT AATGAAATGA TAGTGGGGGA GGGGGCCGTA CCAAGACTAG 5281 GAGAGAGCAG TCCTGAGACT ATTGCAATTA TCTGCGGGAG ACATAAAGGC TAGAACCTGA 5341 ACAGTAGCAG TACAAAAAAA GAGGGGAGTT CAAATGATAT TAAGGAAGTA GAAGTGGTAT 5401 GACTTAACCA TCTGGGTATG GAAGGGGAAA TGGCTAGAGT CTTGGGGACT TTGTGTTTGA 5461 TGTGATTATG GACCACAGAA TAATGTCTAA GAGAACTGGC TCTTTAGTCT GACTGCCAGA 5521 GTCTGAATCC TGAATGTTTT AGTATGTTAC CTTGCAAAGC CCTTAGCCTC TATGAATCTA 5581 TCTTCCTCAT TTATAAAAAT AAGATGACAG TGCCTATCTC GTGGGACTTT TGTGAGGATG 5641 AAGTGAGATA ATGGATGCAA AGTTACTGAG CACAGTGTCC AACACAGCAG AAGCATTACA 5701 TATACATTAG CTATTACTGG CTACATTATG ATATACAGTT AGGGAGTTGG AAAGATAATC 5761 TGAAATTCAG GAGACGTATC TGACTATAGG TGAGTATTTG GAACTCATTG TTCTGTAAAC 5821 AGTAGTTACA GCACGTGTGT GGGCATCTGG AGAGTGAGCA TGGATATTGT GATACCTAGT 5881 ACAGTGCCTG GCAGTAGTGG TTGTATGCTC AGTAAATTTT GTTGACAGGG TCAGGGCCGG 5941 ACTAGACTGT GGTAAGCAAG GCCTGTAGGG CATAAATATA CTTGTATGCC CCGAGAAGTG 6001 AGGACCTCTT AAATATTGTG CCCTACATGC CTTGTTTGGT TCACTCTTGT CCCAGCCCTA 6061 GCAAGTATAT ATAAGGTGAA AAAGGAAAAA GCTGAGGCTG GAGCCTGGGA GAACCCTGGA 6121 CATTTAAGGG CCATGGAGAG GAACAGGAGT TAATCAATTC AAGTGCTGGA TGGATAACAG 6181 GAGTTAGAGC AAAGCGGGGA ACCAGAATAG AGTGATTATT ATAAAAAGAG TTTCCTAAAA 6241 AGGGAGAGAT CAACAATTAG AAATTATTTA GAGCAGCCAG TAAATACATA AACTCAAAAT 6301 TATTCTTTAG GTCATTCCTG ATTGTGACAA TAGTCATTTC ATTATATAAA TGTGATTAAG 6361 GAAGGAAAAG AGCTACACAG AAGTTATTAA AGAGCTAAAG AGAATTGAGA AATTTAAAAC 6421 AGAAGAAAGT AGGGCCAACA TGAAAGGAGT AGGGAGAAAA AGAGATAACC AGCATATTGT 6481 CTCACTGATC CTGCCAACAC CTGTGAGATA GATATTCTTA TTATACTACT GTTAAACCTA 6541 ATTTACATTT GACAAAGTTA AGGTTCAGAG CTTGTGTGAC TTGTCCAAGG TCACAGGTCT 6601 AGAGGAGGCA GATACTTGAT TCAAACCTAT TTCTGTCTGA TCTGATTCTA AAGTCTGTTT 6661 TTTCACTCAA CCACACTGTA CAGTCAGCTC TCCTTGTGAG TTCCACAGCC ACAGATTCAA 6721 TTAACTGCAG ATCAAAAATA TTCAAGAAAA AAATGGATGG TTCGATCTCT ACTGAACATG 6781 TACAGACTCT TTTATCTTTC ATTATTCCCT AAACAATACA GCATAACAAC TATTTACATA 6841 GCATTTACAT TGTATTAGCT ATTAAGAGAA ACCTAGAGAT GATTTAAAGT ACAAAGGAGG 6901 ATGTGTTTAG GTTATATGCA AATAGTAAGC CATTTTTATA TCGGAGACTT GAGCATCCAC 6961 AGATCTTGAT ATTTGCAGGG GGTCTTGCCA CCAATTTTCC ATGGATACTG AGGAACGACT 7021 GTAAATGGAT GCAGGCATGG ATGCTATTTA GGAGTGTCCA GGGCCAAGTA AATGAGTTGC 7081 TGAGCAGAGA GGTGGGTGGA GGCTGTGAGG CATCAATATG TGGTGGCATC CATCTGCATT 7141 TTGGTGATTT TTTTCCTTCA CAGTCCTCGG CTGTCTGGGA AGAGAAGGAT GAAGGCAGAT 7201 GGCTGCTCCA ATTTAGGGGC TAGGATTGCA GGGTGGGCAC AGCATTGCAA ACGAGTGAAG 7261 GAAATTGAGA AATATGGCCA ATGAAGAGTT GAAGAGAGGC CTGGCATGGT GGCTCACACC 7321 TATAATCCCA GCACTTTCAG AGGCCCAGGC AGGCAGATCA CTTGAGGTCA GGAGTTCGAC 7381 ACCAGCCTGG CCAACAAGGT GAAATGGTGA AACCCCGGCT TTACTAAAAA TACAAAAATT 7441 AGCTGGGCAT GGTGGCGGGT GCCTGTAATC CCAGCTACTT GGGAGGCTGA GGCAGGAGAA 7501 TAGCTTGAAC CTGGGAGATG GAGGTTGCAG TGAGCTGAGA TCGCACCACT GCACTCCAGC 7561 CTGGGCGACA GAGCAAGACT CTGTCAAAAA AAAAAGAGTT GAAGAGAAAA AGTCTAGGCT 7621 AAATTCAAAG AAAAAAAGTG AGCCCAAAAG GAACTTGCAG AGCAAGGGAA AAGCAGGGAT 7681 GTCAAGGGAC TAGAACACTC CATAAAGTGA ACAGCTGCAA TGAAAATAAG GGAAGAAAGT 7741 TTAGTTCATC TCCGTTTCTT TCCTTTCCTT TTTACTTTCC TTTCTCTTCC TTTTTGGAGT 7801 TAGTCAGGAA GTAGTCCCAA ATACCCCAGA AAGTTCATCT TATAAGCCCT TGGTCCTCTT 7861 GAGATGGTAT CAGATATATT GCTAGACCCT TGAAGAAAGG AACAACTCCA GGCAACTTCT 7921 TGAGTCCCTG TTATTAATTT TATACATACA CACACATATA TGTATATACA TGAAAACACA 7981 CAAACACATG TGTGTGTATA CAGCCATGCA TTCCTTAACA ATGGGGATAT ATTCTGAGAA 8041 ATGTGTCATT AAGCAATTTC ATCATTGTGC GAACATAATA GAGTGTACTT ACCTAAACCT 8101 AAATGGTATA GCTTACTACA TACCTAGGTT GTATTGATGT GGCCTATTGC TCCTAGGCTC 8161 CTGGGCTGCA AACCTGTACA GCATGTGACT GTACTGAACA CTGTAGGCAA TGGTAACAGT 8221 GGTATTTGTG TATCTAAACA TAGAAAAGGT ACAGTGAAAA TACAGTATTA TAACCTTATG 8281 GGACCACTGT CGTATAATGT GGTCCATCAT TGACCAAAAT GTCATTGTGC AGCAAATGAT 8341 TATCTCATAT ATATATATAT ATGATATGAT ATATATGATA TATATGTGTG TGTATATATG 8401 TATACATATA TATGTGTACA TATATGTATA CATATATACA CACACATATA TATGTACACA 8461 CATATATGTA TATATATGTA CACACATATA TGTATATATA TGTACACACA TATATGTATA 8521 TATATATGTA CACACATATA TGTATATATA TATGTACACA CACACACATA GAGAGAGAGA 8581 GAGAGGAGAG GAGAGGAAGG AGGGAGGGAA GGAGAAATAT GATTCAGATA GAGACATCTA 8641 TCCTCCAGAG TTCAGGAGTG TCTCTTCAGA CTAGGTAGAT GTAGCTTAAA AAAAACATAT 8701 CCTGGAATTC TAGAGAGATG CTTAAATCAC TGCAATTCCT ATAACACTTG CCAACCAAAG 8761 GTGCTGTTGA TCTGAAATTG CTTTTTTAAA TTAATGCAGT GATTTTTCTT TAACATCTAG 8821 TGACAGACAC TGGGGTCACA TTTGCAGCTG GACCATAATT AGGCTTCTGT TCTTCAGGAG 8881 ACATTTGTTC AAAGTCATTT GGGCAACCAT ATTCTGAAAA CAGCCCAGCC AGGGTGATGG 8941 ATCACTTTGC AAAGATCCTC AATGAGCTAT TTTCAAGTGA TGACAAAGTG TGAAGTTAAC 9001 CGCTCAATTT GAGTTCTTTC TTTTTCATCC AAAGTAAATT CAAATATGAT TAGAAATCTG 9061 ACCTTTTATT ACTGGAATTC TCTTGACTAA AAGTAAAATT GAATTTTAAT TCCTAAATCT 9121 CCATGTGTAT ACAGTACTGT GGGAACATCA CAGATTTTGG CTCCATGCCC TAAAGAGAAA 9181 TTGGCTTTCA GATTATTTGG ATTAAAAACA AAGACTTTCT TAAGAGATGT AAAATTTTCA 9241 TGATGTTTTC TTTTTTGCTA AAACTAAAGA ATTATTCTTT TACATTTCAG TTTTTCTTGA 9301 TCATGAAAAC GCCAACAAAA TTCTGAATCG GCCAAAGAGG TATAATTCAG GTAAATTGGA 9361 AGAGTTTGTT CAAGGGAACC TTGAGAGAGA ATGTATGGAA GAAAAGTGTA GTTTTGAAGA 9421 AGCACGAGAA GTTTTTGAAA ACACTGAAAG AACAGTGAGT ATTTCCACAT AATACCCTTC 9481 AGATGCAGAG CATAGAATAG AAAATCTTTA AAAAGACACT TCTCTTTAAA ATTTTAAAGC 9541 ATCCATATAT ATTTATGTAT GTTAAATGTT ATAAAAGATA GGAAATCAAT ACCAAAACAC 9601 TTTAGATATT ACCGTTAATT TGTCTTCTTT TATTCTTTAT AGACTGAATT TTGGAAGCAG 9661 TATGTTGGTA AGCAATTCAT TTTATCCTCT AGCTAATATA TGAAACATAT GAGAATTATG 9721 TGGGTTTTTT CTCTGCATAA ATAGATAATA TATTAAACTT TGTCAAAAGG ACTCAGAAAG 9781 ATCAGTCCAA CCCTCTAACC CATATTGGAT GGTGATATAC TACAGGGTTA TGCCAGTGTG 9841 GGAACTATCG CTGGTAAATA AGTTTAATCC TCCCTAGGGC TTCACAAAGA ACATTGTTCC 9901 ACCCCAGGAG GGTGGAAGGA AGAAACTGAA ATGATTGTGT CTTAGAACCT AATGAAAGTT 9961 TGCATTCCTC AGTAAAATCA GAGACTGCTG ATTGACTTAA ATGTTTATAG CTTCAAAGTC 10021 CTCCTCATTA TCATGGCCCA GAAGCCCTTC CATGATTGTC CTTCCCCACC CTCCCCATTA 10081 CCCTTCTTGC CTCCTCTGCT ACTTCTCTCC TCGCACACTG GGCTCCAGCC ACCCTGGCCT 10141 TCCTGTCACT TCTTGCACAC TCTAGGAATG CTCCCACTTT GGAGGCTTTA TCTGGCTGTT 10201 TCTCTTATTT GGCTGTTCCC AACTTCCTGT GGGCTGACTC CCTCACCTCC TTCGGGTCTT 10261 TGCCCAAATG TTACCATCTT AATGAGGCCT ACCTTCACCA TCTATTAATA CTTCAACCTG 10321 CCCCAGTAGC CTTACCACTC TAGACACCTG TACAGAACTC CACTCTACTT TTTAACAGAG 10381 CTTTTCACCA TCTAATGTAT CATATAATTT CTTACTCATA CTATTTATCA TTTATTTTCT 10441 CCTACTCCAC TAAAATGCAA GTTTCATGTT GGCAGGGATA TTCAATTGTT TTGTTTATTG 10501 ATATATTCCT AGCACCTAGA ACAGTATCTG GAAAAGAGGT ACTCAGTAAA TATTTATCAA 10561 ATGAATTGAC CAAAAGAAGG AAAACTCAAA ACTTTAATGA CAACTAACTT TAAAGCTACA 10621 ATAACTTAAA ATTCAGAGTA GGATTTTGAG GGAGGGTAAG TTTCAAAGAT TGACTTACCT 10681 AAGACTATCT GCATAAATAA AAAGAAATTA ATCCAGACAA CAAATTCACC AACTTCCATC 10741 AATTGGAAAT CCAATTCTAT TTTCTACAGT TTATGTTCTG GAGACACTAC TGGACACTCT 10801 TTTACTCTCA TAACTCATAA CTCCTCCACT TTTGTTTTTA AATCATGAGA GAAAAAGAGT 10861 TGACTCTGTT ATATTGTTTT ATCTACCTTT CCTTGATCTT AGAAACGAAT ACTACCATAC 10921 CAGCTTCTAC TGAGGTGCCC CCTAAAGTTA GTCCAAATAG GTCTTTGCAA TCTCCATTCC 10981 CGCAGAATTT AGAACTTTGA ATCACATGAT TTATTTCTAA AAGTAAATCC ATGCCGATTT 11041 TCCCCACCAA AAAATTCCTG ACTATTAAAC TCCTACAATC CCTTCATTGC TCACTCCCCA 11101 CCCCCAGGAT CATATTTTAA AGTTGGGCCC TTGCCTTTTG GGTCACATAG GTACACTGTT 11161 TGCTATACCA CAGGTATAGC TATCTGGAAA ACATGGAGGG TATTATTCTG TTACTACTGC 11221 TTCGTCAACC AAAAAATAAA ACAAAACAAG AACAAAAAAG AAACAAACTC CCTGCCTCTT 11281 TTCACTTGCA GTCAAGGTTC CTAACCACTA CAAAATTAGC CTATGTTTCT TCTTGCACAT 11341 AGTAGAAACC CAAGCTTCTC ACTGCTGTGC TATTCTGTAC CATCAACTCA TCACATAAAG 11401 AGCCTGGTTG AAGAATGATT GTCCAACCAC ATTACTAGCA TCTGTCAAGA CTTTCCAGTT 11461 TACAAAAGGC CTATCACATT TAACCCTCAC ACCATCCTTG TGACCAAAGC ATTATTAACT 11521 CCATTTTACA GGAGAGTAAA CTGAAGCTTA GGGAAGTTAA AAGAACTGCC AAAGGTCTCC 11581 CAGTTGGGGA GTCATGAAGC CCAGAAGAGA AGCCAAATTC TCTGCTGCTC AACCCCTTGC 11641 TTTCACTATT ACACCTCAGG GCCTTCAAAT CTAAATGCAG TTATTCATTA AACAGGAACC 11701 TGGTAGTCTT AAACAGGAAT CTCTCACTTG GTAAGATCTT GTCTCTTGTT GTATTTGACC 11761 CCAACTGTCT ATGGCTTTGC CTGAACCCAA AGTACACACA GCCTAGAAAC CAAAGGAGAA 11821 CCAAATGTGG GATAAAATGA CACTCATTTT AACGACATGT CTCAGCAAAT GAGTTCCTGT 11881 GTAGCTGGCT GAAAGCCCAG ACCCTTTCAG TAAAACATCC TGAATAATTC ACATTTGTTG 11941 GTCTATAATA TAAAGGGCAA ATGTAGCTCA TTTTTAGACC AGTTCTGAAC ATCAATAGTA 12001 ACAAACCAGA GATAACCGAT TTTGTTTTCA TAGAATTGGA ACAAATTAGA GTATCTGTGC 12061 AAAAGCATAT CAGATCTAGG AGCAGAGGGG ACAAGGTCTA ATTTTTAAAT AAGCAAATTT 12121 TCCAGGGAGG GACTACTTAT GATAAAGGGA TATTAGTCTC TTAGTCAACG GAACCTGGAT 12181 ACACGCTTCT GACAGAGAAG AGGGAGAATA GGCAGGAATC TACACACCAG ATGTCAAGGA 12241 GATTTGCTTT AAAATACGAC TGATAATTAG AAATTTCTCA GTTTCCCCCT TTTCCCTCAT 12301 TCTTTGATTC TTATTGTTAT CTTTATCTCT TACTCCTTTG TTTCTCATAT ATTGAGTCTT 12361 ACAGATCAAG CTCCCATTTT TTTCTTCAGG GGTATTTTTC TAGTTCAAAG TGCCTACCAT 12421 CTCCCTTCTG GTTCTATTCA TCCTTCTCTC CCAAAGCTCC TTTAGAAGTG TGGATTAAGG 12481 CAGAGCACTA AGAAACCAGA CTTAAAGATT CCCTTCTCAT TCTGACTTTT CTCCTTTCAC 12541 CTATTCCTTC CTCCTGTTTT CTTACCATCA GTGTCTTCAA AGGCTTTCAA GTACACGGTA 12601 AATGCAGAAA CTTCAAGAAA GGCAGAATGG AAACATAACC AATGCATACA TAAATAAAGC 12661 ACACTGTAGA ATCTTTTTAA ATTCTGTATG ATATATCGAA TGCTGTCTCT CACATTACCT 12721 AGACCATTTG AAACCGAATT TGTAAAACAT AGACTATCTT TAAGTAGTAA CAGATGCTTC 12781 TGACATGTTT TCTATTGTCT TGAACCATTA CTGCATATGA TACATCAAAG TTAAGTGACA 12841 ATACAAGAAA GCAGATTCAT TTGCTCCCTG CCTAGGCCGT CAGTTCCTAA AGTGGAAACG 12901 CCATATATTA TCTAGCTCAG TTTGCTCTAC AAGACCTGCA ATAGAGCCTT GTGTGACATA 12961 GAGATAATAT TTGTTGAAGC AATTAAATTT GACTTGGAAT TAACTCTGCC ATCATTCTAT 13021 AAGGAAGGAT TGAAAATCCT TCTCACCCTG TGCTGATATA GTACCTTTCT ATACAAAAAC 13081 GTCCTTCTCC CTCTTCCCTT GGATTGCATA AACTATGTAC ATGCCTTCCT CAGGGGCACT 13141 TTTCTAGGAC AGTGTCAGCC TAAGGATCTT TGTTTGGGTG GCTTTTAGAA ACTCAGGAAG 13201 ACAGGAGCAT CATATGCCTA TAGGCAGCTG GCTTCCAGGT CAGTAGTTTT GCTCTGACCC 13261 TAAAATCAGA CTCCCATCCC AATGAGTATC TACAGGGGAG GACCGGGCAT TCTAAGCAGT 13321 TTACGTGCCA ATTCAATTTC TTAACCTATC TCAAAGATGG AGATCAGTGT GAGTCCAATC 13381 CATGTTTAAA TGGCGGCAGT TGCAAGGATG ACATTAATTC CTATGAATGT TGGTGTCCCT 13441 TTGGATTTGA AGGAAAGAAC TGTGAATTAG GTAAGTAACT ATTTTTTGAA TACTCATGGT 13501 TCAAAGTTTC CCTCTGAAAC AAGTTGAAAC TGGAAAATGC AATATTGGTG TATCATAATT 13561 TTTCTTAAAA ACATACCTTT GATGCTTATA AACATTTCAT TTGTAGTGAT AGTTTTCAGG 13621 ATATGAGTTC AAGAAGCTAC ATTAAAATCA ATAACAATAT TTGGTAACTA ATATTAAGTA 13681 ATAATGATGT TCCGACTCAC CTTATTAATC TTTAATACAA CCGTATGTGG TTAGTACTAT 13741 CATTATGCGC ATTCTATGCA GATGAGAAAA CCGCAACTCC AACGGCCAAA AATTACAGAG 13801 GCATAAATGG TTTAGACAGG ACTTAAACTT CAGTGTGACC AAAACCCATG CTTCTAACTA 13861 CTATATTCAA AACTCAGAGA AAACTGAACC CAGAAAATTG AAATCATGAC TAAATTGCTA 13921 TCAACATAGG TGAAAGTCAA TTAAGTACAG AACTGGAGTA TGACTGGCCA ATTATCCCAT 13981 ATAATGGGAA TTCTCCACAT GTACAAACCA CTTCATATGC TAAACTTGTT GACAACATTC 14041 AAAGCTCATC CCTGAATTTG ACTATATTGA TTACATCGAA AATGTTACAT AGCAACCTTA 14101 GAATCCTTGT GTACCTTTTC TTCTCAAAGC CTAGATTATT TCTTTTTCCG ACGTTTTCAG 14161 TAATTGGAGC AGTAAACCCC AGTGTCCCTT ACCTACTTGT TTATTACCTC CAGATGCAAT 14221 ATTACTGGTA CTGTGATTGA GAAACGCACA CAGTGCTAAT GAGGAATTCA CTTTCTACTC 14281 TGACACTCTG GAAGAATAGA GATGCAATCC TAAGGAAGAA TTTAACACCA CAGGCTACAT 14341 GACTAAGGAT AAAGAGTAGA AAATTAGCAG GACTCTATTA ACCGATTACA GCAATCCACC 14401 TGACAGATGA AAAAGGCATG AAATGAAATG AAATGTAGCA GCTACACTCG TCCTATTGAG 14461 AAAGGAAAAA AGTCACCTGT AATGTTGTTC AGAAATCCTT TCAGTACTAA AAAATTCATT 14521 GACCATCTTC CTTTAGTCTC GAAAATTTCT TAGAAGGTAA AAAAAGGAAA AGGTGACAGG 14581 GCAAAGACAT TTGAAAAGAA AGAAAAGAGT GAATGAACTT GCACACCTGG CTTGGACTCC 14641 CCATTCCCCT TAGGTTTCCA TTGTGGGGGA CAAACTAATG CCTGGGTTAC CTTTCTTGAG 14701 AGTGTGTTAA TTGATTCAAT ATCTCTGAAG TGCTACTTTC ATCTGAAAGG TTATAATTTG 14761 AAATTCAGAT TTACCTGGAT AAATTTGATC TTGCTATTAT GGAAACCTCT AGAAATCCTT 14821 GGAGTAGTTA CTCATTATCA GCTTAAATAA TATAGCCGGT GGAGCTGAGG GAATGAGTAA 14881 CTCAATTAGT CTCAGTTACA ACTGAAGGGC ACATTGTTGT AAACTATAAT TGAAAACATA 14941 AATATCTTTA CCTAGTTTAA AAAATAAAGA TGCTTTAAAA GGAGGAAGGG AATAGCCCTG 15001 AGGAATGTAA ATATAAGCAC AAAACTTCTA CAACAGAGTT TGCTACGTGT GTGGCTGTGT 15061 TCCACCCAGC AAAAATGCTA AGTCTACAAC TGACACAACT TGGATACTCT CATGTTCCCA 15121 CATTTTGGTT TGGTCAAGGC TGTGCAGTTG TACTGCAGGC CACCACCACT CCTGGCCTCT 15181 ACAGTATATT GATCTGACCC ACCAATCTGA TCAAGGTTTA GAAAAATATT TTCAGCCCAG 15241 TTAGCTCACA AACAAAATGA GAATTCCCAC AAATTGCTCT TTATCTCAGA CAACAGAGGA 15301 AAGCTACAGC AAAAGCATAA ACAAATTACC ATTTAAGTTT GTTGCTTCAA ATTAAAGACT 15361 AATTGCAACA GCTACTAGAT AGCACAGTTT ATGGGGCATC TCGGCCCCAA GTCTTTTGTC 15421 TTATAAGGTC TTGAAAAAAA GAAAGGAGAT TTTCATCAAT AAGAGTTTTT TGTTATCTTT 15481 TTCCCTTGTT CATCAGGCCC TTCACTGCGA GAGAGAGGTG TAAACGTTCA GGGCATGCAT 15541 TCTAGTTAAA GAATATTAAT TGGCTATTGG GTCCCTTTGG TTAGAATAAA GACCTCTGTA 15601 TGATGTCCCT AGCTGTACAT CAAACCCAAA TATCTCTCAG ATAAATGAAG GTCTGTAAGA 15661 ATTTGGTCAT TCCTGTCTCT TCTAAAGAGT AACAGAGGCA TTTTCCCGCA GTAAAGTAGA 15721 ATGGAAAGAA AACAAAAATC ACAAGCCTAT AAACACCTTC TTCAATTTTC CCAGCATGTC 15781 ACAGACACTA CTGTCTTATT TACTACGTAT TTCTGAGGAG TAAAAAAAGG AAATATGTTG 15841 AGTTTAGCTG AAGCACAGCA TATTTTGTGG TAAACTTGTT AAATAAAACA TCTTTTGTCC 15901 AAGCTTTGGT TGTCACACAA GTGGATATAT CAGGAAATAT AAAGGCAGAA TAAACTAAAG 15961 CAGAACATAC TAACATTTGT AGTAGGCATG AAGGGAATTA GAAAGTGTTT GTGTTAACAT 16021 GGAGGGAGGG AGAACAGATG CTTTGAGATG TTCTTCAACA GATATTCTAG GCACTGAGAC 16081 CCCCTTCGGG ACCAGAGAGA GCCCATATCC ACCACAGTAC CTGACACATA AATGCTCAGT 16141 AATTGATAAA TGAGTCCCAT TCTAACTGTT CCTTAGCCCT GCTCTATGGA ACTCTCCCCT 16201 GAATTCCTTG TGCCATTATT TTATTTCTGG AATCTTCAGC CTTTTAGCTG AGGGCAAAAG 16261 ATTGCTGATT AGGAAGCAAT ATTTCCCACC TCCTGCGCAA AACAAGCCAA AGATCAACAG 16321 CAGCAGCAAC ATACTGAGCC CTAAAGGGCA ATGACAAATG TGGAGAATGA TACAGAGGTC 16381 TGGTTACTTC TTAGCCAATG ACACAGAATC ACAATTGAGA AAACACAGAG TTTATTCATT 16441 CCCATTGTGC ATGCCCTGGA CAAACCAAGC TGCACCTTTC GTAACTTATC ACAATCTCAT 16501 ATTGACGGAA CACTTTCTAC AGGTAATGTT TGATTTGGCT GAACACTTTA GCATTGCTTC 16561 GTAGCAACAA AATGATAGCT AGTAACAGAA AAAGATCCAG GGATATTACC ACTGTTAGTG 16621 AGGAGAAAGG CCTTTTAATT AATTAATTAA TTAATTAATA GGACCAAGTG CCATCTTTTT 16681 GGATCATGCC CTTAGTGGAT TATTGGTAGC AAAGGTTAAA GCTCAAGCTG GTTCCTTTGT 16741 CCCCCTGGCA ACAGTTGATT TGCCTCCCTT ATCTCCTGAA GTACCGTAAG GACTAAGAGC 16801 CAATTATTAC ATTTGGCTAT GCTAGCATAT GTAAAATAGA GTTTAAAAGT TTAGATTCAT 16861 CACTCAAAAA TTCATATTCT CCAAAACCAT ACAGTCACTC TGTTAGCCTG TGTTCCCCCA 16921 GAAAAAAAGT CACAAGCTTA TTATTAACAT GTGCAATCCA GGGGCAAGAG AAAGGAACTG 16981 AAGATGAGGC AGAAAGGAAA AGAAAGCCAA TAAGAGGATG AGTTATCAAA CTACTCGTTT 17041 CTTAACAGCA ACTGATTGCT TAACTTCCTG GGACTGTCTC CAATAAGTCA AATTGGCCTC 17101 AGGTTAGTCC ACCTGAGTGG GAAGAAGCGG TGAAAGAATT TGTCTGTCAG TATCTGTCTC 17161 TCATTGGTTA GAAGTTCGAC TTATGGGGAA TTAACTCCCT CACATTTCCT AGTTGGATAG 17221 CTTGGGTACC AGAGGCATAT GGCATCCATG CTCAGCATGA ACAGGGAAGC TTCAAGGCAA 17281 AAGACACATA GTGCAGCTAT GAGCCAAGGC AATTCAAGGA TACACCCATA GGAGGCTGGT 17341 TGACATCCAC CCAGAGCTAA TCACCACCAT GCTGGAAAAA GACACAGGTG AAGCTGAGAA 17401 GAATGAAGGT GGTGCATAGG AGGTATCTAA TACAGTCACT CATTTTCAAA CTTTCCATGT 17461 TATGATTGCA CTGACCACTG AGGATTTCTA TTGAAAGTTT TACTGTTGTC AAACACGTAC 17521 ACAAGGGGAA AGGTGTCTTA CATTGTTTAT GTTCCTGTGC TGCTCTAGAA ACAGAAATAG 17581 GCTCAAGAGC AGAGCCTGTT TTTCTTAATT CAGCAGGTCT AAGCTAACAA GTCCTGAAAC 17641 ATGGTACTTC CTGTTATTGG TATTGCATAG GAGAAACAAA GGGAAAGCAC AGTAATTAGA 17701 AAATACAAAC AAGATGGCAG GAATAAGCCA AAAATATCAG GAAACACAAT TATTGTGAAT 17761 TGGGATTAAA CTAATCTATT AATAATGACA ACTTTCAGCT TGGAGTTAAA AATTTAATTG 17821 TATACTGTTA ACGAAAGTGA TACCTAAAAT AAAATTACAC TGGGAGGCCA AAATGAAGGG 17881 ATGTGAAAAG AACTATCAGG TAAAAACTAA CAAAAAGAAA CTAGCAAAGC AATCTTAATA 17941 TCAGACAAAA TAGAATCCAA GAGGAAAATC ATTTCAAAAG ACAAGAGATT TTTTTTATTA 18001 ATAAGGGGAA TTGCATAGGA GAGTAAAGAA AATGTGGGCC ACTGGAATGC TTAGCACTAA 18061 TGACATATTG GTCTTTGGTC TTCAGTTACC TTACAGGACC CTATTTCATT CTCTTATGTT 18121 TGATATGTAA CCACCTCAGC CAGCTTCAAG TTGCTTTTTG GCCCTAATGG ACTTCCTAGC 18181 ACTATAATTT CTTTTTTTTT AAATGTTTTA TTTTAGGTTT AGGGGTACAT GTGAAGGTTT 18241 GTTACATAGA TAAACATGTG TCACAGGGGT TTGTTGTACA TATTATTACA TGACGCAGAT 18301 ATTCAGCTCA GTACCAAATA GTGATCTTTT CTGCTCCTCT GCCTCATCCC ACCCTCCTCC 18361 CTCAAGTAGA CTCCAGTATC TGTTGTTTCC TTCTTTGTGT TTATAAGTTC TTAACACTTA 18421 GCTCCCGCTT ACAAGTGAGA ACCTGCAGTA TTTGATTTTT GTTCCTACGC TAGTTTCCTA 18481 AGGATGATAG CCTCCAGCTC CATTCATATT CCCACAAAAG ACATAATCTC CTTCTTTTCT 18541 ATGGCTGCAT AATATTCCAT GGTATATATG AACCACATTT TCTTTATCCA GTCTGTCATT 18601 GATGGGCATT TAGGTTGATT CCATGTCTGC TATTCTAACA CTGTAATTTC TAAAGACTTC 18661 CAGATTCTAC TTTTATAGGT AACCTGTTAA ACAGTCTAGC TCTGGAAGCC AAGCAATTTC 18721 TAGAATAACT AAGCAATAGA AATTACACTT CAATGCAGAA AGGCAGTATC TACATGAGAT 18781 TATGAAATTG CGGTTGCTTT TTGTGTTCAC TGAAAAAAAT AAGTAAAACT GTAACTTTCA 18841 GAAAAAATGA TTGTACATAT AGAAAACCCA AAGCATCTAA ACAATTAAAA TAAATAAGTA 18901 TAGAAAGATT ACTGGATACA GAGTCAACAT ACAAATATCA ATTGTATGTC TATATACCAG 18961 CAACGATTCA AAAATGATTT TTATAATAGC ATTAAAAATT AGACGCTTAG TAATAAATGT 19021 GAGAAAGATG TGCAAGAACT CTACATAAAA AATTATGAGA CGTTATTGAG AAAAATTAAG 19081 GAAAACCTAA ATAAATGAAT GAATAGGCAA TGTTTATCAT TAAAGGATAC AATATAGTAA 19141 ATATATCAAA TGTTTACTAA TGGATTCAAT GCAATACCAA AGTGCCAGCA GGCTTTTTTG 19201 GTGGTGGGAG GTCGGGCAGG ATTCATAAGC TAATTATAAA ATGCATATGG AAATGCAAAG 19261 AGCCAAGGAT AGCCAAGACA GTTTTGAGGA AGAATAAACT TGTACTACTT ACACTACCAG 19321 ATGTCAAGAC TTATTATCGA GTTACATTTA TTAAGACAGT GTGGTACTGA CACAAGGATA 19381 GACAAATAGA TCAGTGAAAC ACACTAGAGT GCTCAGAAGC ACACCTGTAC ATATATAAAG 19441 GCTTGATTTA TGATAGAGGT GCCAGTGCAG TAGAGAAGGA AATTATTGGT GTTTTCAATA 19501 AAAAGTGATA GGTCAATTAG ATATTCATAT GGCATGAAGT ATGAAACAAT AACAATTTAT 19561 ATTCATAACT TGCAGAAAGC AAAAATTTCT TAAAATACAA AAAGTGATCA CCATAAAGGA 19621 AAAGATTGAT AAACTGGACT ATATTAAAAC TAAGGACTCC TGTTCAGCAA AAGACACTAC 19681 TTCGACTGAA AAGACAAGTC ACAGAGTGAG ACAAGATATC TGCAATACAG ATACCTAATA 19741 ACTGAACCCC ATACAGTGAT GGTGGGAATT TAAGTTCGTA CAATCATTTT AGAAAATTGC 19801 TTGGCAGTAT CTACTAGATC TGAACATGTG ATCCAGTAAT TACACTCATA ATTATAAGCC 19861 AGTAAAAAGG CATGTTTATG TCACCAAAAG ATATATACAA GAATGTTCAT TACACTATTA 19921 TACATAAGAG CCAAAAACTG GAAACAAACC AAATATCCAT TAACAGTAGA ATGAATAAAT 19981 AAAAGCTGTA ATAGTAATAC AGTGGAATAC TACACAGCAA TGTAAATGAA CTACTGCTGT 20041 ACAAAACAAC ATGGTTTAAT CTCACAGACA AAATGTTAAA TGAAAGACAC AGACGAGTAC 20101 ATATTGCGAA CTTCTGTTTA TAATTCAAGA ACTGGCAAGA ACTGTTTACT GTGTTAGAAG 20161 TCCAGGTAAT GGTAACCTAT AAAAAGGAAA AAGGGTGGAA TGATTGGGAG GGGGCATCTT 20221 CTGGGGTCTT GATAATGTGC TATGTATTGG TCAGTTTAGT GTTTAAACAG GCTCATTTAC 20281 TTTGTGAAAA CTTACACTAA AATTGTGTGT ATTTTTTGAA TATATGTTAT ACATTAATAA 20341 ATAGGGTTTT TAAACCTGTA GTTCATAATT TAGTGAAAGT AGAATATCCA AACATTTAGT 20401 TTTAAACCAA TCAATTATAG TGCTACCATC ATTTTTATGC ATTATTGAGA AGTTTATTTT 20461 ACCTTTCTTT CCACTCTTAT TTCAAGGCTC CAAAATTTCT CTCCCCAACG TATATTGGGG 20521 GCAACATGAA TGCCCCCAAT GTATATTTGA CCCATACATG AGTCAGTAGT TCCATGTACT 20581 TTTTAGAAAT GCATGTTAAA TGATGCTGTT ACTGTCTATT TTGCTTCTTT TAGATGTAAC 20641 ATGTAACATT AAGAATGGCA GATGCGAGCA GTTTTGTAAA AATAGTGCTG ATAACAAGGT 20701 GGTTTGCTCC TGTACTGAGG GATATCGACT TGCAGAAAAC CAGAAGTCCT GTGAACCAGC 20761 AGGTCATAAT CTGAATAAGA TTTTTTAAAG AAAATCTGTA TCTGAAACTT CAGCATTTTA 20821 ACAAACCTAC ATAATTTTAA TTCCTACTTG AATCTGCTTC CTTTTGAAAT CATAGAAAAT 20881 ATCAGTAGCT TGAATTAGAC CAATTAATTT TCTAGATTGC ATCATATTTT AAATATAACT 20941 ATGTAATCAT CTACAACCTG AATTCTTTCT GTGTCCAATT TGTCCAATTT TTTTCTCTAA 21001 CATTTATATC ACAAAGCAAT TAATTTGTGT GATTTCTGCA TATGTATTTG TAATTCATCA 21061 AGTCAAATCA ATGTAGTAAT ACTATATCAT AAAATATACA CAAATAATTG AGTGATAGGC 21121 TTCTAGTATA AGGACGGTAA GTTTGAAGCA TGATTCTATC TGGGCTGGCT AGTTTACTCT 21181 GAGAAAGTTA TTTTTTATTG TTGGGTCTTA AGCTGAGTTT ACACACTTGG TGTCAGAATG 21241 ATTCCGGCAA TGAACTGTTT TATGTTCTGC TAGGCTGATC AGCACAATCT ATATGGCTGT 21301 GAACAAAACA ATGTTTCCCA GTCATACCAA CCATGCCACC ATTTTAACAG CTGATTAGTG 21361 TATTCAGAAC ATCTCCACTC CATGTTCGTA TGGCTGTTAT CTAAAGATGA AAGCAGTAGA 21421 CACTTTTATT TTTTGAAAAA TTTAGGCTCT GCAGGGTCAA TTATATTTGA TAAATGAGGG 21481 GCTTTTTTGA AGCAAACTAG ATATAATTTC TTTTGCATTT CTAAAGCCTG ATATCTTATT 21541 AATTGGTACA TTAAATTGTG CACCATTTCT CTGTAACTGT TTCAGTACCT GTCTCAGCAC 21601 TATACCAGGC AGAAGAAATA AAGAAAGAAC CAGTGCCAGA TCAGCTTGGT CAGGAGACCC 21661 TAATCCTGCG GCACTAGAGG AATTAAAGAC ACACACACAG AAATATAGAG TATGGAGTGG 21721 GAAATCAGGG GTCTCACAGC CTTCAGAGCT GAGAGCCCCG AACAGAGATT TACCCACATA 21781 TTTATTGACA GCAAGCCAGT CATAAGATTT ACTGAAAGTA TTCCTTATGG GAAATAAAGG 21841 GATGAGTCTG GCTAGTTATC TGCAGCAGGA ACATGTCCTT AAGGCACAAA TCACTTATGC 21901 AATTGTCTGT GGTTTAAGAA CACCTTTAAG CAGTTTTCCG CCCTGGGTGG GCCAGGTGTT 21961 CCTTGCCCTC ATTCTGGTAA ACCCACAACC TTCCAGTGTG GATATCAAGG CCATCACGAG 22021 CATATCACAG TGCTGCAGAG ATTTTGTTTA TGGCCAGTTT TGGGGCCAGT TTATGGCCAG 22081 ATTTGGAGGC CTGTTCCCAA CAAACCAGAA GCTAGGAATA TATATCCTGC AAATAAAATG 22141 AAGAATCTCT AAGGCTTCAG GGCCTGCCCA CTTGTTCTTC TGCCTGGTTC TTCACATACA 22201 CTGTCTCAAA GCTAGTCTAC CTTGAGAGGA GCATGAATAT GTGTGTGGGT GTGTGTCTGT 22261 GTATTTTAAC CTTAAAAACC TAACTTCCAG TATAGACAGA TGGCATACTA GCTAAACCCT 22321 TACAAGTTCT TCTATGCTAT AAAAGAGAAA CAGAATTGAG AACCACCTCC AACTATTAAG 22381 TGTTATATTT GAATATAGCC TTAGCTTTAG CAGAATAAGT AGGCCAAACT TAAAATAAGC 22441 TTTTCTGCCT TTTCAATGAT AAAGGTCCCT TTTCTGTAGC CATTGTTGAT TGTGTACACT 22501 TATACATAAG TATTTTGAAC TAATTTCCTG TTTTCTCAAC CACTTGCTGT CTTCATGATA 22561 CTTTGTCGCA GCTGGTTGCT ATAGAAATGT CTGTTACAAG GAATGTGGCT TGAAGGAAAG 22621 TGATAAATGA AAATGAAATG TGAAGTGACT TTGTTTGACT ACAAATTCCC ATTCTGGTAG 22681 TCCCCAGTGT ATCAATACAT TATTTTTCTT TAGAAAATAA ACCAACCCAA GGAAAAATGG 22741 TGGGCAGGTC CTGGTGAATA TGGCTGTGAT AATTATATTA GCAATCTCTT TGGCTAATAT 22801 TTGAAGCCCA AATAATTGAA TCACAATGAT CTCTCCCCAG AAAATATATA AAATGCACCT 22861 TGGAATCTAG AAGGCCTTTT AGTCTGCAAA AGAAACCTTC TTAATCATAA GCAGCAGAAG 22921 TCCCATTTAC CAAATTGGAA AGTTAAAGTT ACAAAGCATC AATCATCAGA CTTCCATTCA 22981 GGGATGGCAA TTGGGAGTAA GACTTTTTAG TAAAGAAACT AAACACAAAG TCATTAGACT 23041 CTGTAAAAGT CTTACCAAAT TTGATTCTGG AACACCTATT CTATTTCCGT AAAGATGATG 23101 AATTCGGAGC CAAATGTTCT TTTCATGAAG GATTTGAAAA CTGTCCATGA AAATAACGCA 23161 ATCAACCTTT TAGCTTGAGA CTCTATTCAC TGATTAGATT TTTTTAAATA CTGATGGGCC 23221 TGCTTCTCAG AAGTGACAAG GATGGGCCTC AATCTCAATT TTTGTAATAC ATGTTCCATT 23281 TGCCAATGAG AAATATCAGG TTACTAATTT TTCTTCTATT TTTCTAGTGC CATTTCCATG 23341 TGGAAGAGTT TCTGTTTCAC AAACTTCTAA GCTCACCCGT GCTGAGGCTG TTTTTCCTGA 23401 TGTGGACTAT GTAAATTCTA CTGAAGCTGA AACCATTTTG GATAACATCA CTCAAAGCAC 23461 CCAATCATTT AATGACTTCA CTCGGGTTGT TGGTGGAGAA GATGCCAAAC CAGGTCAATT 23521 CCCTTGGCAG GTACTTTATA CTGATGGTGT GTCAAAACTG GAGCTCAGCT GGCAAGACAC 23581 AGGCCAGGTG GGAGACTGAG GCTATTTTAC TAGACAGACC TATTGGGATG TGAGAAGTAT 23641 TTAGGCAAGT TTCAGCACTA ACCAATGTGA GAAGGCCTCC AGAGATGAGC AGTTGGTGAA 23701 AGAGAGGCTC AAAACCAGCT ACCATACAGG TCAAGAAGAA TTTGGCATTA AGGAAACAGC 23761 ATAGCAGGAT TCCAGACAGG CAACTGGTCA ACAACATGAA GGTCTGGAAG AAAGGTCGCA 23821 GTACTCAGGT TCAGGGCACT ACTTCAGCTT CAGCCCTTGC AAAAACTGGT GAGAGTTGGA 23881 AAGTCTTTAG GCTAAGAAAA ATTGGATTAT TTAAAAGGGG TAAAGAAAGG GACTCAAGGA 23941 GGAAGGATTA AGGCAAGAAC TAGGTTCCAA GAAACAGGGC ATGAGAGAGA GTCTTGATCT 24001 ACCACTATAG TTCTCGTGGT AGCATCAGAA TCACCTGGGA ACGTAGAAAT GCAAATTCTC 24061 CTGCTCTACA CTAGACCTAC CAAATCAGAA TATCTAGGGG GTGGGGCCCA GCAGTCTGTG 24121 CGCAAACAAG CACTGCAGGT GATTTTGATG CACATTATAG TTTGAAAACT AGGCCAGGTG 24181 CAGTGGCTCA TGCCAATAAT CCCAGCACTT TGGGAGACTG AGACGGGAGG ATTGCTTAAA 24241 CCCAGGAGTT TGAGACCAGC CTGGGCAACA CGGCGAAACC CCACCTCTAA TTAAAAAAAA 24301 TACAAAAATT AGCTAGGTGT GATGGCTCCC ACCTGTGCTC CCAGCTATTC AGGAGGCTGA 24361 GGTGGGAGAA TCACCTGAGC CTGGAAAGTC GAGGCTGCAG TGAATTGTGA TCACACCACT 24421 GCACTTCAGC CTGAGTGACA GAGTAAGACC CTATCTCAAA AAACAGAAAA AGAAAAACAC 24481 TGGCCCAAAG GAAATGAACT TGTTACAGAA GCCGGGGTTC AAAACACCAA ATAATGCACT 24541 TGTACCTAGT CCTTCCCGGG TGCTCTGCAG ACATTTCTCC AAGCGTAGTC TGCAAACAAC 24601 CTACATATGT AGAATTACCT ATGCACATTT TTCATTTAAC AACCAAGAGC TACATTTGTA 24661 GCAAAATCTG GGTTGTAACT TAGCCTACAG CTGAAGCCTA AGAGATTCCG TCTGTGAGAA 24721 GAAATAACCC ACCTCTTTGG CCCCCCTCCC CAGGCAGGAA GCCAGGATGG TCCTTATATA 24781 AAGTTGTGCT GTCCAATAGG TAACCACTAG CCACATATGG CTATTTAAAT TTAAATTAAC 24841 TACAATTAAG AGAAATTAAA AATTCAATTC CTCAATTGCA CCTGCCAAAT TTTAAGCACA 24901 TAACAACCAC ATGTGGCTAG TAACTACTGT ATTGGAGAGT GCAAGCGGAG ATAGAACACT 24961 CTATTACTGC AGAAATTTCT ATTGGATAGC ACTTATAATA GTTTAGTGTA ACTTAAAACT 25021 CCCTAGTTGC CACAGTCATG ATTTAGTAGT AATTTCATGG ATTTCTCTAC TGAGGTTAGA 25081 ATCTCTGCCA TTAGAGACTG ATAAATTTAA AGTTTGCAAT TATCAAACTG GTGACAATTT 25141 AAGCCAGAAT CAGGTAATGT CCTCAGTTTT AACAGCATTG GAATTTTCTG GGACTAGCTG 25201 TGTATCTATC CAGGATTCTT GAGAATGCCT GCCATTTTTC AACATAATGG ATGTAAGGTA 25261 TTACACATAT ACCTGGGGAT GGGGAGGTAG GTATAATTGC ACAAGCATTG TGGAGAATGG 25321 TATCAAAGAG TGGCAGAACA TCACAATCAA GGTTTTCCCT TTCTTTTACC TTTGCTTTTT 25381 AAAAAGACAA TATTTGCTGG ACCTGATCTT ATAACTCATA AATGGGACAC TGTATGTTCC 25441 TTTTTACCTC CTCTGTTTCT ACTTAATTGC ACCCTATGAG GACTGCTTCC CTTACCTACC 25501 ATAACCCCTT CCTTCACTCA TCCATATCTT TACTCTTCTT CACAACTCTG TAATATTGAC 25561 CTTCTTTATG AACCTTTCCT GGAACAATCC CTCTTAATGC AAGCACTGTT ATTATGCCTT 25621 CAATGTATTT AATATCCATG TATCTATTCT CTCTAATTTT GTCATTTTGT GTTCTCATGT 25681 ATTTTCATTC ATTATGTGTC CAACTTCCAT GGATAACATG GTTACAACAA AAGATCCTAC 25741 TTTATGACAA TTATCTTCCT TGGGTTTGTG GGACATAGAA CAGTGCACAG AGTAGGGGAT 25801 CCAAGAACCC AGGAGAATAT ATTAGCTAAG AAGATAACTT CCGTTTTTAA AAGTCCAAGA 25861 TTCAGGAGAT CAAAACCATC CTGGCTAACA TAGTGAAACC CCGTCTCTTC CAAAAATACA 25921 AAAAATTAGC CCGGCGTGGT GGCAGGCGCC TATAGTCCCA GCTACACGGG AGGCTGAGGC 25981 AGGAGAATGG CGTGAACCGG GGAGGCGGAG CTGGCAGTGA GCCGAGATCC CGCCACTGCA 26041 CTCCAGCCTG GGCGACAGAG CGAGACTCCG TCTCAAAAAA AAAAAAAAAA AAAAAGTCCA 26101 AGATTTTAAA AAAAAAAAAA AAAAGGATGT CTGCTTTGTG AGTTTAGCAT TGTCTCCTTG 26161 TCATTCCAGA AATGAAATGG CAAATACATT TAAATCAGAA CTAAAAAGGG GAACAGGGTA 26221 TAAAGGCTCA ATTTAGTCAC ATCATTTCCG TTTCTCACCC ACCCCCTTTA AACCAGATGT 26281 TTGCCAATGC ATTAACAATG CAGATGTTTC CTGAAAGAAA GTTTAGTAAC TCAAGCAGAC 26341 ACCTTATTTT CTTTTCAAGC AGAAAAGACT ATGAGATGGT GGTTGTGGTT GTTCCGGGAG 26401 GGAGAAGATA TAAATGATAC ACATTATTTC AAATCATTTC ATGACCTCAC TGCACACTTA 26461 TAGTTATTGT ACCTGTTGTC TTTTTGCTGT CAAGCCTAGC TAAGATCATT TGGAATGTTC 26521 AAGATCACTC ATACATGCAT GTGCACACAT ACACATGCAC ATATGTTCAC TCCCTATTTC 26581 ATCCACATGA ACTAAGATTA CTGATGTGTA CAGATTCAAA GCACTTTTAT TCTTTTCCAA 26641 AGGCAAGAAG CTGAGCTACT TTCCAGAATA GTTGTGAAAG ACCCTGTCAT ACTTCTGCAT 26701 TGTTTCCTCC ACACCACCTC CATCCAGTTC CTTATGAATG GTTACTGGTT TTCAAAAATA 26761 TGAGATAAAT TGAGTGTATA AAAGTCATTT TTAGACAAAA TGAAACAGGA AATGAAAGAA 26821 ACCAGAATCT CTCCTCATTT GTGGATGGGC CAGCTCCACC ATGTCATGGT TAATCTGCAG 26881 GGAGGAAATA CTAGATTTGA TTGCAGATCA GACTGCAGCA AACCTGCTGT GACTAAGGCA 26941 TCAAGAGAAA GCAAGCAACA GACTGGGGCT TCAGTGGTGA AAACATTATA TATCTAGCTT 27001 TGAAATATGA AATACTGTTT AGCAGTGTCA CCTAGAAAAG AGTGTTTCAA AATGCTGATG 27061 CTTCATAAGA ACCTTTCTCT TCAGAGTTGG TTTCTTTTAT CTTTCAAATT AGCCAGGGTG 27121 GGAAATAAAG TGATCACTTG GTGAAGAAAT CTCACAAAGA AGAACATAGA GAGTTCACTT 27181 TCATCTGGAG TAATGAACAG ATTGAACAAA CTAGAAATGG TTAGTCTGTT AAAGAAAAGG 27241 TGTAGGTGAG CTGTTTGCAA GAGCCACAAG GGAAAGGGGA AGACAACTTC TTTGTGGACT 27301 TAAGGGTGAA AGTTGCAAGC AGGCAAGACG ATTCTGACCT CCATTAAGAA AGCCCAAACC 27361 AACCAACAAC CACTGGGTTG GTTACGCAGG TTGGGCAGCA TTGGGAGCAA ATGTTGATTG 27421 AACAAATGTT TGTCGGAATT GTTGACTTAA AGAGCTGTTC TGTCACTGGG GACAGCAGCG 27481 GCTAGATAGC CCCATTCAGG GAGAGGGCAT TTGTTCACCT GGCCAGAGAT CAGAGCAGGC 27541 TAAGGGACTG CTGGGATCCT GTCCAGCTTT GAGACCCTAC AGAGCCATGT TCACCTAGCA 27601 CGTATCCCGT CTGCGGTCAC GGTCATTTCT TACCTTATTC CAGGGCTTTC ACCTCAGCTT 27661 GCCAGGCTGG AGCCAAGGGC CAACGCAGCC GCGCCTTGTT CGCGATGGTA GCTTCCCAGG 27721 AGCCCCCTAT GGTTCCGGAA CGCGCTGCCG GCCCCATCCT GTTTGCTACC TCCTAAAGCC 27781 AAAGGCACTG GCGGGCCGGG CCAGCTTCTA AAGTCGCGCA AGGTTAGAAG GTTCCGGACA 27841 GGAACGGCGT GAGGCCAATG GAAGGAGGTA CTTCAGTTTC CCTCCAGATG CCCAGCGATG 27901 GGCTCAGAGC TCCTTGAGAA CTCGGGAAAG GAAGCAGGGT CTCTGAAGAA ATACTTCAGG 27961 AGTAGAAAGA GGAAGCTAGA GGGTTAAATG CACTACACAG GAACAGAAAT GAGTTTTTCT 28021 TAGAGTTAGT ATATGTCTAG AGGTGTAGTA AACTAAAACA AGTCTTGAAT TGCATACCGC 28081 CACGTAGGGA AGAAATGAAA ACCTTTGAAT ATTAGTGAAA AAAGGGAAAC TGCAACGCCT 28141 GTATTACTAG ATAGCTTTCA TCAACAGCTC AAAACCGACA GATTTAAAGA AGCAACACCG 28201 CATTTTGGCT TTCTAAAGCT TTAATTTGGT TTGGATCCCA TGCCCATGAC CCTGCCAGCT 28261 GACAATTCTA AGCATGCGCA AACTGGCCCC AAAAATTCCT CCCACATTTC CGAAGAACTA 28321 TTTGGCCCTT TATGTGAAGT ACCTGGTTTT TCCATTTTCT GTTTTACCAT AGGCCTCAGT 28381 TCGGTGTGTG GCGTATTTAT TCTACATTTA ACAATTTGAA GATCATTCTA TTAGATTAAA 28441 AAAAAAGAAT ACAATGGAAG CCAAGTGATT AAGCTTTCCT TATGCTTATA TTAAGTTGTA 28501 GCATATGCAT TTACCGATAG TTAACCGTAT TAACCTACAG AAAATGTCCA GGGAAATGGT 28561 CTATTTCTTA TTCTATTTTT GACCTAAAGA AAATCTTTAA AATGTCTTAG CATTTTCCCC 28621 AGTCTCCATC CACTTCCCTC AGCTTTGGCC TGAAGCTATC TTTAAAGGTA CCCTGTACAA 28681 GCTCTTGCCC TGTACAGCTA GCTACAGAGA TTCAATCCTT TCTGTTCGAT TAGGACACAT 28741 CTCAGTGGCA GATAACATGC AAAGTTATTA TATGTATGAA CCAGAACTTG TTTTTCCTTA 28801 GGGGCCAGGA TGTTACACTA AGGTCTTAAG ACTATAGTAA TATCTTCACT TGAAAAAGCC 28861 CTCTATTATT CCTATCTCAG ATGATAAAAA TTCAATTAAG AGAAATAAGA ACGTGACATG 28921 TGTAATCGCA CCTGGCTCTA CAAAGCTAGT CTGGACAGAC ATTTAAACAA TTATCCTCTA 28981 AGATTATTTG ATGAAATGCA TTTCAATGAC TAGTTAACCA TTAAAAACCA AAGTGAGCAT 29041 CCCATCTGTT CCCAGTCAAA TGACCTAGAG CAAAGGACTA GGCAAACCAC ATCTGTGGGC 29101 ATAGCAAGCT GTACATCACA AACAAATGAA TTTGCTTTGT ATATGAGTGA GAGCAAACAC 29161 TCTTTATTGT ACAACTTGGG TGGGTAAGTA GGGAGAATAA TGGTTTTACT GAAATCGCAG 29221 GTAACGGTTA CGTTGGAGTT AAAGGTTAGG AAGAAAACCA AAGGGTAAGA GCTGTTGTTC 29281 TGGGCTGGCA TTGTCAATGA AGAGCATAAA TTCAGATGTG AATGTATATT TTGTAGAAGC 29341 ATGTGTGTTG TTGGTTTTTG TGTATGTGTG AGTCTGAAAG AGGGAAAACA GGCTCCCATT 29401 AGACTATGAC TAACAAAAAT GTTTGACAGA TTATAACTCA GATGTCTTAC TCAGAGCATA 29461 TGCCTTCCCA TTTTCCCCAT TATTCCCCAA CATGATGTCT TTAAGAACTT GTCCTTGACC 29521 GAGCAGACAT CTCATACCCC AAATAGCTAA TATTTTGATA GCTATGATCC TGAACGGCCA 29581 AACATTCCAA AACCAAGTAG TTTGTAATAT CTTTAATGCA AATATATTTT AGGCCTTTTC 29641 CTTGGCAAGG ATGTTTGGTC AGGGGTTGGC AAAAATAATG CTCTTCAGAC TTAAAAGAAC 29701 ACAACCATAT TTCTTAGCCA TCCACCAGAA AGTAGTAGAA CGCTCCAGGA AGCAAGTCTT 29761 TGTCAGGAGT CAGACTAGCT ACATCATAAT CTCTCTGCCC AGGGGCTGTG GATGTCATCC 29821 ATCCTGGCCT AACTAGCCTA CTGAGCTGAG AGATGTCCAA TTTCCCCCCA ATACACTAAC 29881 CAGAGGAGAA GGACCGTGAT ATCATTGCAT GTGAATTCTT AATTCCAATT GCTTAAACAA 29941 ATATGTTCAG TTGTAACTAT CAATACCAGT ATATAACAGT GTTGGCCAAG TTTTATTGAT 30001 GCTGACAATC AATTGGAGTT ACAGCCAGAC ACATGGTCTT ATGACCGGCG TACTTACGCA 30061 GGGCTTTGCA CTGAGACAGG TCGTGCATCT GAGGTTTACT GCTTTGCATT TTTGTTTTGT 30121 AACTGAAGTC TGATGAGACA GCCAGAGCAT GTGCTACCTA GGGACTTGAA TCCCTGCAGC 30181 CCCATTTCAC TTCTCACCAC CTTCCGGGGT GGTTTCTCGA CCTCCCACTC CCCTACCACC 30241 TGGTGCCTTA GCCAGCCCTG GCTCTCCCTC CAAACACCTG CCCAATGAGC ACTGCCACCC 30301 CATGGTGCCC AGACATGCTC TCCCTCCTCA TCCCTACCTA GCTACCATTG CCACTCCCCT 30361 CCCCCAGCGG GGACATGGGC ATAGGAGCAG GGAGAGTTAA GGTTGGTCAG GTGCACGTGC 30421 CCTATGCTAT CTTGGAAGGG GGCTTGGCCA TGTGGCATCT CTGGACCAAG AATGCGCCAC 30481 AGCACATTTG GAGGGTGAAT GGTGGGGGCA CACCCCTTGT CCACCTCTAT TTCAGGCATG 30541 GAACACATCC TGGCATGAAA GTTGCAGTCC CTTGGGAATC ACCTCTCCAC CTTGATTGCC 30601 ACAGTAGGCC AGTGACAAGG GAAGATTGAC ACATCATCCC TGCTGGGGCC CAGTGTCCTG 30661 TGGCTGGCAG GCAGGGGATC CTAAGGACAT GTGGGTCTTA AATTGTAGGG TGCACTTCCT 30721 GGGCACCTTT GAGGGTCTGC ACTGCCCCAG CAAATATCCC CATGCTAGAA GGAGCAAAAT 30781 ATTAAATGGC AAATTTTAAA AATGTAACAA GATGGGTTGC AAAAGAGACT ACAGAGGAAA 30841 GCAAAAGTTT TGTATTTTAG TATCTTCCAT GGCACTTTTC TTCCTAGCTT TTGAACAAGG 30901 GGCCCCACAT TTTTATTTCT CACTGAGCCC CACAAAGTAT GTAGCCATTC CTGCCCGGAG 30961 TGAGGACTTT TAAAACATAA AGATTATCAA GTCTTGGAAA TTCTGATTCA GTAGATATAT 31021 AACAGGTCTC AAACTTAATT ATGTAAAGAA TATTCTGGAG AGCTTCCTTT TACCCAGTCC 31081 CACCCACCAA ATATTCTGAT AAATTAAGCT TCGATTAGCC CCCAGATCTG CATTTTATAA 31141 GGATCCCCAG ATGATTCTAC TGCAATTGGT CCACAGACCA TGCCTGGACC GAATTTGGGT 31201 GCTTAGGAGC ACAAATTCTG GAGCCGGGCA GACTTGAGTT TGCTTCCTAG CTTTACCAAC 31261 TGATCTCAGG GGAGTTAATG TTTACCTCTA AACTTTAGCT CATGCATCTA TAAATAAATA 31321 TATTAATATC ATGTCATAAG GATATTATGT TGTATTAAAT GTCTTTAAAA CACCACAATG 31381 ATTAGCCCAA AGTAAACACT CAATAAATGT TCAAAAATTT AGGAAAATTG TTAAGACTGG 31441 GTTGTATGCA CACTGGTGTT TATTATATTA TGTAGTTTTT TCTGTATTTT TACAACATTT 31501 CAGAATTAAA AGCAACAGCT AGAAAAAGAG GGAAATGGCC GGGTGCAGTC GTCACGCCTG 31561 TAATCCCAGC ACTTTGGGAG GCCAAGGCGG GCGGATCACG AGGTCGAGAG ATCGAGACCA 31621 TCCTGGCCAA CATGGTGAAA CCCCATCTCT ACTAAAAATA CAAAAATTAA CTGGGCATGG 31681 TGGCATGCGC CTGTAGTCCC AGGAGAATTG CTTGAACCTG GGAGGCGGAG GTTGCAGTGA 31741 GCCAAGATCT CACCACTGCT CTCCAGCCTG GTGACAGGGC AAGACTCCGT CAAAAAAAAA 31801 AAAGAGAGGG AGAGCCAGAG TATGAAAAAG GAAGTCAGAG CCCTTTAATG AGTCAGCTTT 31861 GTAGGTCTCC AGGTAGGAGG CTAGTGCTTC AGTGTCTAGG ACATAGTAGG TGTTCAGTAA 31921 ATTAAATTCA GGACAAAAAG AACATGCCCC AAGGACCATC TGATATCCAC TTAAAGTGAT 31981 GGACTACCTC GTTTCCCTTG TTTATGAATG GGTTCATGCC TAAGACTGTG TGCACTTTAA 32041 TACAAGGGCA GTCGTTCAGA ACTAGTCAGG TCCTGAAAAG GATTTACCAA ATGTTGAGTG 32101 TGCCCTCTAG TGTTCACACT TCCCAGCTTT CTTCCTATAA AGGTGGATCA AGGCACTTGC 32161 TTACAACTGG AACTGAAATC CTCCAAGTCG ATCTAGACAT TGAGATGGAG AAAATATTCA 32221 TTGTCGACTG TAATTATGCA ACGAATATCC AGTTGAGATA ATGGACTTGC CTCTTATCTA 32281 ATAATACCCA GGCTCAATGC GTCACTGCTT TGTCCACTTT GCCCAAAATT CAAGCACAGC 32341 TAAGTTGATA TTTTAGGACA AAGGCAGCTT ACTATCCAGC CAGAGGGGAG TAGAATATGG 32401 TTAAGAGAGA GTGGAAAGAA TGAATGAGCC CTGCTATTCC TCACTGCCTG GATGGCTATA 32461 AGCACAGCCC TTATGGAGGC CTTAGGTCTT GCTTCACAAT ATTCCAGTTT GAAAAGGGTT 32521 TGAAAAGACC TCCTAGAAAA ATCAGTAGTT TTTCTCTTTT GAGTAACATG TAGCAAAAAA 32581 AATTTCATCA TGTAGGTACA GGGAACACCC TAGTAACTAT TAATCTCAAG GAGTCAAGCC 32641 AGTGTGTTTC CTAATGTATC TGCTGTATCC CCATGAAGCA AATTTTGCCA TCAGAGAAAC 32701 TGACTCATGG GGAAAAAATC CAAGGACCTC AAATCACCAA AAGAAGCCAT TCCTCAGATT 32761 TGCCTAAGCT TAAGCTTCCC TGTCTCTCAT TGTGTGTTGC TTTCAATGCA GTTACATAAA 32821 TGGCTTTTTT GTTTATGCAC CAAAAACACT AATTCATCTG CAAAGCTCAC ATTTCCAGAA 32881 ACATTCCATT TCTGCCAGCA CCTAGAAGCC AATATTTTGC CTATTCCTGT AACCAGCACA 32941 CATATTTATT TTTTTCTAGA TCAAATGTAT TATGCAGTAA GAGTCTTAAT TTTGTTTTCA 33001 CAGGTTGTTT TGAATGGTAA AGTTGATGCA TTCTGTGGAG GCTCTATCGT TAATGAAAAA 33061 TGGATTGTAA CTGCTGCCCA CTGTGTTGAA ACTGGTGTTA AAATTACAGT TGTCGCAGGT 33121 AAATACACAG AAAGAATAAT AATCTGCAGC ACCACTAGCT CTTTAATATG ATTGGTACAC 33181 CATATTTTAC TAAGGTCTAA TAAAATTGTT GTTGAATAAA TTGGGCTAAA GGCAGAAGGG 33241 TCATAATTTC AGAACCCACG TCGCACCGTC CTCCAAGCAT CCATAGTTCT TTTGATATAC 33301 CCCTATTATC ACTCATTTCA GTGAGGTACA ATTAGTTCTT GATGTAGCCA TTTCCATACC 33361 AGAAGGCCTT CCCAAAAATC AGTGTCATGT CACCGATCCT TTTATCTCTG GTGCTTGGCA 33421 CAACCTGTAG CAGGTCCTCA GAAAACAAAC ATTTGAATTA ATGGCCAAAT GAGTTTGTGC 33481 TCAAAAAAGG GGTGAGGATA CTTGAAATTT GGAAAATCTA GGATAATTCA TGACTAGTGG 33541 ATTCATTATC ACCAATGAAA GGCTTATAAC AGCATGAGTG AACAGAACCA TCTCTATGAT 33601 AGTCCTGAAT GGCTTTTTGG TCTGAAAAAT ATGCATTGGC TCTCATTACA TTTAACCAAA 33661 ATTATCACAA TATAAGAATG AGATCTTTAA CATTGCCAAT TAGGTCAGTG GTCCCAAGTA 33721 GTCACTTAGA AAATCTGTGT ATGTGAAATA CTGTTTGTGA CTTAAAATGA AATTTATTTT 33781 TAATAGGTGA ACATAATATT GAGGAGACAG AACATACAGA GCAAAAGCGA AATGTGATTC 33841 GAATTATTCC TCACCACAAC TACAATGCAG CTATTAATAA GTACAACCAT GACATTGCCC 33901 TTCTGGAACT GGACGAACCC TTAGTGCTAA ACAGCTACGT TACACCTATT TGCATTGCTG 33961 ACAAGGAATA CACGAACATC TTCCTCAAAT TTGGATCTGG CTATGTAAGT GGCTGGGGAA 34021 GAGTCTTCCA CAAAGGGAGA TCAGCTTTAG TTCTTCAGTA CCTTAGAGTT CCACTTGTTG 34081 ACCGAGCCAC ATGTCTTCGA TCTACAAAGT TCACCATCTA TAACAACATG TTCTGTGCTG 34141 GCTTCCATGA AGGAGGTAGA GATTCATGTC AAGGAGATAG TGGGGGACCC CATGTTACTG 34201 AAGTGGAAGG GACCAGTTTC TTAACTGGAA TTATTAGCTG GGGTGAAGAG TGTGCAATGA 34261 AAGGCAAATA TGGAATATAT ACCAAGGTAT CCCGGTATGT CAACTGGATT AAGGAAAAAA 34321 CAAAGCTCAC TTAATGAAAG ATGGATTTCC AAGGTTAATT CATTGGAATT GAAAATTAAC 34381 AGGGCCTCTC ACTAACTAAT CACTTTCCCA TCTTTTGTTA GATTTGAATA TATACATTCT 34441 ATGATCATTG CTTTTTCTCT TTACAGGGGA GAATTTCATA TTTTACCTGA GCAAATTGAT 34501 TAGAAAATGG AACCACTAGA GGAATATAAT GTGTTAGGAA ATTACAGTCA TTTCTAAGGG 34561 CCCAGCCCTT GACAAAATTG TGAAGTTAAA TTCTCCACTC TGTCCATCAG ATACTATGGT 34621 TCTCCACTAT GGCAACTAAC TCACTCAATT TTCCCTCCTT AGCAGCATTC CATCTTCCCG 34681 ATCTTCTTTG CTTCTCCAAC CAAAACATCA ATGTTTATTA GTTCTGTATA CAGTACAGGA 34741 TCTTTGGTCT ACTCTATCAC AAGGCCAGTA CCACACTCAT GAAGAAAGAA CACAGGAGTA 34801 GCTGAGAGGC TAAAACTCAT CAAAAACACT ACTCCTTTTC CTCTACCCTA TTCCTCAATC 34861 TTTTACCTTT TCCAAATCCC AATCCCCAAA TCAGTTTTTC TCTTTCTTAC TCCCTCTCTC 34921 CCTTTTACCC TCCATGGTCG TTAAAGGAGA GATGGGGAGC ATCATTCTGT TATACTTCTG 34981 TACACAGTTA TACATGTCTA TCAAACCCAG ACTTGCTTCC ATAGTGGAGA CTTGCTTTTC 35041 AGAACATAGG GATGAAGTAA GGTGCCTGAA AAGTTTGGGG GAAAAGTTTC TTTCAGAGAG 35101 TTAAGTTATT TTATATATAT AATATATATA TAAAATATAT AATATACAAT ATAAATATAT 35161 AGTGTGTGTG TGTATGCGTG TGTGTAGACA CACACGCATA CACACATATA ATGGAAGCAA 35221 TAAGCCATTC TAAGAGCTTG TATGGTTATG GAGGTCTGAC TAGGCATGAT TTCACGAAGG 35281 CAAGATTGGC ATATCATTGT AACTAAAAAA GCTGACATTG ACCCAGACAT ATTGTACTCT 35341 TTCTAAAAAT AATAATAATA ATGCTAACAG AAAGAAGAGA ACCGTTCGTT TGCAATCTAC 35401 AGCTAGTAGA GACTTTGAGG AAGAATTCAA CAGTGTGTCT TCAGCAGTGT TCAGAGCCAA 35461 GCAAGAAGTT GAAGTTGCCT AGACCAGAGG ACATAAGTAT CATGTCTCCT TTAACTAGCA 35521 TACCCCGAAG TGGAGAAGGG TGCAGCAGGC TCAAAGGCAT AAGTCATTCC AATCAGCCAA 35581 CTAAGTTGTC CTTTTCTGGT TTCGTGTTCA CCATGGAACA TTTTGATTAT AGTTAATCCT 35641 TCTATCTTGA ATCTTCTAGA GAGTTGCTGA CCAACTGACG TATGTTTCCC TTTGTGAATT 35701 AATAAACTGG TGTTCTGGTT CATACCTTGG CTTTTTGTGG ATTCCATTGA TGTGAATCAG 35761 TCACCCTGTA TTTGATGATG CATGGGACTA CTGACAAAAT CACTCTGACC CTGCCAAGCT 35821 GCTGCCTTCT CCTGCCCCAA CCTCACCCCC AGCCAGGCCT CACTCTTGCT AGTTCCTTTA 35881 GTTCTTTTAG TCAATATATT TTTGTCTTCG CATATAAGTA TAAATAAACA TATTTTTAAA 35941 TTTCTTGGCT GGGCCCAGTG GCTCACGCCT ATAATCCCAG CACTTCTGGA GGCCAAGGTG 36001 GGCGGATCAC CTGAGGTTAG GAGTTTCAGG CCAGCCTGGC CAACATGGTG AAACCCTGTC 36061 TCTACTAAAA ATAGAACAAT TAGCTGGGCT TGGTAATGTG CACCTATAAT CCCAGCTACT 36121 GGGGAGGCTG AGGCAGGAGA ATCACTTGAG CCTGGGGAGC AGGGGGTGCG GGAGGTTGCA 36181 GTGAGACAAG ATCGCACCAG TGCACTCCCC ATCCTGGGTG ACAGAGTGAG ACTCTGTCTC 36241 AAAGAAAATA AATAAATAAA TACATTTCTT GAGGCGTTTC TTGTTAAATC ATTCATGGAG 36301 AGGCATCCCA AACACCACAT TCAACAAAAC ACTCTGAAAA ATGTTTTCAA ATGCAATATA 36361 ACACAGCAGA GATTTGATGC TCTGTTATCC AGTTTTCATA TAAGGCTGTG TGAGCTGTGT 36421 CCCAGAGAGG ACAGTGGTCT GAATCCACCT GAGACAGAAT TGGGTCTAAC TAACTGTGAG 36481 TATGGCCTTC AATAAGTCAC TCTCCATTTG GGAATTTGAT TTCTCCACTT GTATAATGAG 36541 AGTATTTGAC AGGATGCTCT CCCAAATCCC TTGCAATTTT GTTAGTCTGT GATTTCATGT 36601 TTTTATTTTT ATTCCTTCAT CCAACAAATA GTCAAGGAGT AATTGCTGTG TGCCAAATAC 36661 CAACAGTATT CATTAAATTG TAATTCAGAT TTTATATATA TATAAATAAT GTATAATGTG 36721 TATAAATTGC TTTGTGAGTG CCTACTACAC TGCTAGACAG TAGTTGCTCA ATAACTTGTT 36781 AGCTGAATCA GAATCCATGT TTATCCCAGA GTAGCAATTA GTCTTGCATC GAGTATCGTG 36841 AAAGAAGGCC ACACTTAAAT AAGAATAATG CCTGGGGTTT AGGTTTTATG AAAAAATGAA 36901 AGGAAATTAG TTCTGCTTTT GTTGACTAAA GGAAGGGAAG AGAGAAGAGA CACTATAATT 36961 GTCTGCCTCA GATTTAAGGA GGAGGCTAAT TCATGCATTA AACACGTTAC TTCAAATTTG 37021 AATGACCAAA GGTCTGTAGC CTCAGCACTT CAAAATTGGT AAAAGTAAGA CACTCTGGCC 37081 TTGTTTCCAT AGAGACCACC CCTTACAAAG GCACCAATGG GAAACTGGCC TCAGGACTCC 37141 TGTTATTGGT CTTCTCTGTG GCAGAGAAAG GAGCTCTTGG ACCCATAAAT CTCTGAGCCA 37201 CAGTTCTTTT TGCCATGGGC TCAAAAATGA TTGAATTCAT CATGAGCCAC CTGTGGCATA 37261 TTGCCACACT AAACATGTGG GGCCTTTAAG CTCACTAAGA GCCAATGTCT TCAGAGCCAG 37321 CCCTGGCTTG ATTCTACCTA GGGCATTTGC AGTTGCCATA TAAGAATCAT TAGTGCTTTC 37381 AAAATTACTG TAGATACTTT GCCTAAATAG ACTAAAACAT GCTGCCGTCA TATTGGAAGT 37441 GACAGATTAA AATAGAACTC TTGCCAAGTG AAGGAAAGTG TGCTAATATA ATGCAGTCAT 37501 TTTAACTTGC TGTTTAAGTG TGATTGTTTT TAGTTCTTTT GAATATTATT TGTTTTATAC 37561 TGACAGGAAC GAAGTACTGT CCAATTTTCT CTGCCAAGGA AAAAAGAAAA GGTGTTCTTC 37621 CTTACTTACC TGAACCAAAA CAGACCAGTT TACAAAATTG CCTAATTATA ATTGCTAAAC 37681 AAGTTCCGAA TGCTTACAGT CTAATCCAAG AATGTCAGAG CTGCAAGGGC CCTTAAACAC 37741 CATCCAATCC ACTCCACTCA TTTAGCAGAT GAAGAGATTG AGGGCAACAT AAGGCCAGGC 37801 CCAAGATAAC ACAATGACAG CCAGGACTAG AGCTCAAGTC TCCCACCCTG CACTTTGAAA 37861 GAATAATGCT TTCAACTGGA GTACATTAAC TCTACTGTCT ATATTTTTAG GGCAGCTGGG 37921 GCATTCTGCA TTGGTGGCAA TCCTCTCAAC AACCCTGGGA CTGAAAACTG CCTGGAATTC 37981 TTACTAACAA TTCTCTAATT GACCAAAAGG TGACGAAATC AAGGAGACCA ATAAGGTAGC 38041 CTTGGAAAGC AAGAGTGGC // LOCUS HUMFIXG1 600 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human factor IX gene, exon 1. ACCESSION K02048 KEYWORDS Christmas factor; factor IX. SEGMENT 1 of 6 SOURCE Human: cDNA to liver mRNA, clones cVII, cVI, 108.1, and DB.1; 4X lymphoblastoid cell line (GM1416B) DNA, clone lambda-HIX-4; genomic DNA library of Lawn et al., clones lambda-HIX-1,2,3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 600) AUTHORS Anson,D.S., Choo,K.H., Rees,D.J.G., Giannelli,F., Gould,K., Huddleston,J.A. and Brownlee,G.G. TITLE The gene structure of human anti-haemophilic factor IX JOURNAL EMBO J. 3, 1053-1060 (1984) STANDARD full staff_review REFERENCE 2 (bases 138 to 139; 309 to 310; 319 to 319; revision of [1]) AUTHORS Anson,D.S., Choo,K.H., Rees,D.J.G., Giannelli,F., Gould,K., Huddleston,J.A. and Brownlee,G.G. JOURNAL Unpublished (1985) School of Path., Oxford U., Oxford, UK STANDARD full staff_review COMMENT The factor IX gene is about 34 kb long, and divided into 8 exons. The introns range in size from .2 kb to 10 kb. [1] sequenced both the cDNA for the complete mRNA and all the genomic exons, and found no evidence for polymorphism between the mRNA and the gene. FEATURES from to/span description pept 325 + 412 factor IX, exon 1 /nomgen="F9" /map="Xq26.3-q27.2" /hgml_locus_uid="LX0023Z" pre-msg 296 > 600 fix mRNA IVS 413 > 600 fix intron 1 revision 138 139 ga in [2]; ag in [1] revision 309 310 tc in [2]; ct in [1] revision 319 319 a in [2]; g in [1] BASE COUNT 168 A 134 C 113 G 185 T ORIGIN 123 bp upstream of BglI site; chromosome Xq26.3-27.2. 1 CTCTCTGACA AAGATACGGT GGGTCCCACT GATGAACTGT GCTGCCACAG TAAATGTAGC 61 CACTATGCCT ATCTCCATTC TGAAGATGTG TCACTTCCTG TTTCAGACTC AAATCAGCCA 121 CAGTGGCAGA AGCCCACGAA ATCAGAGGTG AAATTTAATA ATGACCACTG CCCATTCTCT 181 TCACTTGTCC CAAGAGGCCA TTGGAAATAG TCCAAAGACC CATTGAGGGA GATGGACATT 241 ATTTCCCAGA AGTAAATACA GCTCAGCTTG TACTTTGGTA CAACTAATCG ACCTTACCAC 301 TTTCACAATC TGCTAGCAAA GGTTATGCAG CGCGTGAACA TGATCATGGC AGAATCACCA 361 GGCCTCATCA CCATCTGCCT TTTAGGATAT CTACTCAGTG CTGAATGTAC AGGTTTGTTT 421 CCTTTTTTAA AATACATTGA GTATGCTTGC CTTTTAGATA TAGAAATATC TGATGCTGTC 481 TTCTTCACTA AATTTTGATT ACATGATTTG ACAGCAATAT TGAAGAGTCT AACAGCCAGC 541 ACGCAGGTTG GTAAGTACTG GTTCTTTGTT AGCTAGGTTT TCTTCTTCTT CATTTTTAAA // LOCUS HUMFIXG2 599 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human factor IX gene, exons 2 and 3. ACCESSION K02049 KEYWORDS Christmas factor; factor IX. SEGMENT 2 of 6 SOURCE Human: cDNA to liver mRNA, clones cVII, cVI, 108.1, and DB.1; 4X lymphoblastoid cell line (GM1416B) DNA, clone lambda-HIX-4; genomic DNA library of Lawn et al., clones lambda-HIX-1,2,3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 599) AUTHORS Anson,D.S., Choo,K.H., Rees,D.J.G., Giannelli,F., Gould,K., Huddleston,J.A. and Brownlee,G.G. TITLE The gene structure of human anti-haemophilic factor IX JOURNAL EMBO J. 3, 1053-1060 (1984) STANDARD full staff_review REFERENCE 2 (bases 49 to 50; revision of [1]) AUTHORS Anson,D.S., Choo,K.H., Rees,D.J.G., Giannelli,F., Gould,K., Huddleston,J.A. and Brownlee,G.G. JOURNAL Unpublished (1985) School of Path., Oxford U., Oxford, UK STANDARD full staff_review COMMENT See segment 1. FEATURES from to/span description pept + 176 339 factor IX, exon 2 /nomgen="F9" /map="Xq26.3-q27.2" /hgml_locus_uid="LX0023Z" 528 + 552 factor IX, exon 3 pre-msg < 1 > 599 fix mRNA IVS < 1 175 fix intron 1 IVS 340 527 fix intron 2 IVS 553 > 599 fix intron 3 revision 49 50 ca in [2]; cta in [1] BASE COUNT 217 A 81 C 100 G 201 T ORIGIN About 5.4 kb after segment 1; chromosome Xq26.3-q27.2. 1 AATCTCCATG TGTATACAGT ACTGTGGGAA CATCACAGAT TTTGGCTCCA TGCCCTAAAG 61 AGAAATTGGC TTTCAGATTA TTTGGATTAA AAACAAAGAC TTTCTTAAGA GATGTAAAAT 121 TTTCATGATG TTTTCTTTTT TGCTAAAACT AAAGAATTAT TCTTTTACAT TTCAGTTTTT 181 CTTGATCATG AAAACGCCAA CAAAATTCTG AATCGGCCAA AGAGGTATAA TTCAGGTAAA 241 TTGGAAGAGT TTGTTCAAGG GAACCTTGAG AGAGAATGTA TGGAAGAAAA GTGTAGTTTT 301 GAAGAAGCAC GAGAAGTTTT TGAAAACACT GAAAGAACAG TGAGTATTTC CACATAATAC 361 CCTTCAGATG CAGAGCATAG AATAGAAAAT CTTTAAAAAG ACACTTCTCT TTAAAATTTT 421 AAAGCATCCA TATATATTTA TGTATGTTAA ATGTTATAAA AGATAGGAAA TCAATACCAA 481 AACACTTTAG ATATTACCGT TAATTTGTCT TCTTTTATTC TTTATAGACT GAATTTTGGA 541 AGCAGTATGT TGGTAAGCAA TTCATTTTAT CCTCTAGCTA ATATATGAAA CATATGAGA // LOCUS HUMFIXG3 240 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human factor IX gene, exon 4. ACCESSION K02050 KEYWORDS Christmas factor; factor IX. SEGMENT 3 of 6 SOURCE Human: cDNA to liver mRNA, clones cVII, cVI, 108.1, and DB.1; 4X lymphoblastoid cell line (GM1416B) DNA, clone lambda-HIX-4; genomic DNA library of Lawn et al., clones lambda-HIX-1,2,3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 240) AUTHORS Anson,D.S., Choo,K.H., Rees,D.J.G., Giannelli,F., Gould,K., Huddleston,J.A. and Brownlee,G.G. TITLE The gene structure of human anti-haemophilic factor IX JOURNAL EMBO J. 3, 1053-1060 (1984) STANDARD full staff_review COMMENT See segment 1. FEATURES from to/span description pept + 65 + 178 factor IX, exon 4 /nomgen="F9" /map="Xq26.3-q27.2" /hgml_locus_uid="LX0023Z" pre-msg < 1 > 240 fix mRNA IVS < 1 64 fix intron 3 IVS 179 > 240 fix intron 4 BASE COUNT 71 A 40 C 55 G 74 T ORIGIN About 3.96 kb after segment 2; chromosome Xq26.3-q27.2. 1 CAGGGGAGGA CCGGGCATTC TAAGCAGTTT ACGTGCCAAT TCAATTTCTT AACCTATCTC 61 AAAGATGGAG ATCAGTGTGA GTCCAATCCA TGTTTAAATG GCGGCAGTTG CAAGGATGAC 121 ATTAATTCCT ATGAATGTTG GTGTCCCTTT GGATTTGAAG GAAAGAACTG TGAATTAGGT 181 AAGTAACTAT TTTTTGAATA CTCATGGTTC AAAGTTTCCC TCTGAAACAA GTTGAAACTG // LOCUS HUMFIXG4 360 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human factor IX gene, exon 5. ACCESSION K02051 KEYWORDS Christmas factor; factor IX. SEGMENT 4 of 6 SOURCE Human: cDNA to liver mRNA, clones cVII, cVI, 108.1, and DB.1; 4X lymphoblastoid cell line (GM1416B) DNA, clone lambda-HIX-4; genomic DNA library of Lawn et al., clones lambda-HIX-1,2,3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 360) AUTHORS Anson,D.S., Choo,K.H., Rees,D.J.G., Giannelli,F., Gould,K., Huddleston,J.A. and Brownlee,G.G. TITLE The gene structure of human anti-haemophilic factor IX JOURNAL EMBO J. 3, 1053-1060 (1984) STANDARD full staff_review COMMENT See segment 1. FEATURES from to/span description pept + 143 + 271 factor IX, exon 4 /nomgen="F9" /map="Xq26.3-q27.2" /hgml_locus_uid="LX0023Z" pre-msg < 1 > 360 fix mRNA IVS < 1 142 fix intron 4 IVS 272 > 360 fix intron 5 BASE COUNT 115 A 63 C 65 G 117 T ORIGIN About 7.32 kb after segment 3; chromosome Xq26.3-q27.2. 1 AAAATTTCTC TCCCCAACGT ATATTGGGGG CAACATGAAT GCCCCCAATG TATATTTGAC 61 CCATACATGA GTCAGTAGTT CCATGTACTT TTTAGAAATG CATGTTAAAT GATGCTGTTA 121 CTGTCTATTT TGCTTCTTTT AGATGTAACA TGTAACATTA AGAATGGCAG ATGCGAGCAG 181 TTTTGTAAAA ATAGTGCTGA TAACAAGGTG GTTTGCTCCT GTACTGAGGG ATATCGACTT 241 GCAGAAAACC AGAAGTCCTG TGAACCAGCA GGTCATAATC TGAATAAGAT TTTTTAAAGA 301 AAATCTGTAT CTGAAACTTC AGCATTTTAA CAAACCTACA TAATTTTAAT TCCTACTTGA // LOCUS HUMFIXG5 360 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human factor IX gene, exon 6. ACCESSION K02052 KEYWORDS Christmas factor; factor IX. SEGMENT 5 of 6 SOURCE Human: cDNA to liver mRNA, clones cVII, cVI, 108.1, and DB.1; 4X lymphoblastoid cell line (GM1416B) DNA, clone lambda-HIX-4; genomic DNA library of Lawn et al., clones lambda-HIX-1,2,3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 360) AUTHORS Anson,D.S., Choo,K.H., Rees,D.J.G., Giannelli,F., Gould,K., Huddleston,J.A. and Brownlee,G.G. TITLE The gene structure of human anti-haemophilic factor IX JOURNAL EMBO J. 3, 1053-1060 (1984) STANDARD full staff_review COMMENT See segment 1. FEATURES from to/span description pept + 82 + 284 factor IX, exon 6 /nomgen="F9" /map="Xq26.3-q27.2" /hgml_locus_uid="LX0023Z" pre-msg < 1 > 360 fix mRNA IVS < 1 81 fix intron 5 IVS 285 > 360 fix intron 6 BASE COUNT 95 A 75 C 74 G 116 T ORIGIN About 2.4 kb after segment 4; chromosome Xq26.3-q27.2. 1 CCTCAATCTC AATTTTTGTA ATACATGTTC CATTTGCCAA TGAGAAATAT CAGGTTACTA 61 ATTTTTCTTC TATTTTTCTA GTGCCATTTC CATGTGGAAG AGTTTCTGTT TCACAAACTT 121 CTAAGCTCAC CCGTGCTGAG GCTGTTTTTC CTGATGTGGA CTATGTAAAT TCTACTGAAG 181 CTGAAACCAT TTTGGATAAC ATCACTCAAA GCACCCAATC ATTTAATGAC TTCACTCGGG 241 TTGTTGGTGG AGAAGATGCC AAACCAGGTC AATTCCCTTG GCAGGTACTT TATACTGATG 301 GTGTGTCAAA ACTGGAGCTC AGCTGGCAAG ACACAGGCCA GGTGGGAGAC TGAGGCTATT // LOCUS HUMFIXG6 3121 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human factor IX gene, exons 7 and 8. ACCESSION K02053 KEYWORDS Christmas factor; factor IX. SEGMENT 6 of 6 SOURCE Human: cDNA to liver mRNA, clones cVII, cVI, 108.1, and DB.1; 4X lymphoblastoid cell line (GM1416B) DNA, clone lambda-HIX-4; genomic DNA library of Lawn et al., clones lambda-HIX-1,2,3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3121) AUTHORS Anson,D.S., Choo,K.H., Rees,D.J.G., Giannelli,F., Gould,K., Huddleston,J.A. and Brownlee,G.G. TITLE The gene structure of human anti-haemophilic factor IX JOURNAL EMBO J. 3, 1053-1060 (1984) STANDARD full staff_review REFERENCE 2 (bases 3011 to 3012; 3026 to 3028; 3090 to 3092; revises [1]) AUTHORS Anson,D.S., Choo,K.H., Rees,D.J.G., Giannelli,F., Gould,K., Huddleston,J.A. and Brownlee,G.G. JOURNAL Unpublished (1985) School of Path., Oxford U., Oxford, UK STANDARD full staff_review COMMENT See segment 1. FEATURES from to/span description pept + 143 257 factor IX, exon 7 /nomgen="F9" /map="Xq26.3-q27.2" /hgml_locus_uid="LX0023Z" 926 1473 factor IX, exon 8 pre-msg < 1 2860 fix mRNA IVS < 1 142 fix intron 6 IVS 258 925 fix intron 7 revision 3011 3012 tt in [2]; ttt in [1] revision 3026 3028 gtt in [2]; gt in [1] revision 3090 3092 ttg in [2]; tg in [1] BASE COUNT 956 A 624 C 574 G 967 T ORIGIN About 9.96 kb after segment 5; chromosome Xq26.3-q27.2. 1 AAAGCTCACA TTTCCAGAAA CATTCCATTT CTGCCAGCAC CTAGAAGCCA ATATTTTGCC 61 TATTCCTGTA ACCAGCACAC ATATTTATTT TTTTCTAGAT CAAATGTATT ATGCAGTAAG 121 AGTCTTAATT TTGTTTTCAC AGGTTGTTTT GAATGGTAAA GTTGATGCAT TCTGTGGAGG 181 CTCTATCGTT AATGAAAAAT GGATTGTAAC TGCTGCCCAC TGTGTTGAAA CTGGTGTTAA 241 AATTACAGTT GTCGCAGGTA AATACACAGA AAGAATAATA ATCTGCAGCA CCACTAGCTC 301 TTTAATATGA TTGGTACACC ATATTTTACT AAGGTCTAAT AAAATTGTTG TTGAATAAAT 361 TGGGCTAAAG GCAGAAGGGT CATAATTTCA GAACCCACGT CGCACCGTCC TCCAAGCATC 421 CATAGTTCTT TTGATATACC CCTATTATCA CTCATTTCAG TGAGGTACAA TTAGTTCTTG 481 ATGTAGCCAT TTCCATACCA GAAGGCCTTC CCAAAAATCA GTGTCATGTC ACCGATCCTT 541 TTATCTCTGG TGCTTGGCAC AACCTGTAGC AGGTCCTCAG AAAACAAACA TTTGAATTAA 601 TGGCCAAATG AGTTTGTGCT CAAAAAAGGG GTGAGGATAC TTGAAATTTG GAAAATCTAG 661 GATAATTCAT GACTAGTGGA TTCATTATCA CCAATGAAAG GCTTATAACA GCATGAGTGA 721 ACAGAACCAT CTCTATGATA GTCCTGAATG GCTTTTTGGT CTGAAAAATA TGCATTGGCT 781 CTCATTACAT TTAACCAAAA TTATCACAAT ATAAGAATGA GATCTTTAAC ATTGCCAATT 841 AGGTCAGTGG TCCCAAGTAG TCACTTAGAA AATCTGTGTA TGTGAAATAC TGTTTGTGAC 901 TTAAAATGAA ATTTATTTTT AATAGGTGAA CATAATATTG AGGAGACAGA ACATACAGAG 961 CAAAAGCGAA ATGTGATTCG AATTATTCCT CACCACAACT ACAATGCAGC TATTAATAAG 1021 TACAACCATG ACATTGCCCT TCTGGAACTG GACGAACCCT TAGTGCTAAA CAGCTACGTT 1081 ACACCTATTT GCATTGCTGA CAAGGAATAC ACGAACATCT TCCTCAAATT TGGATCTGGC 1141 TATGTAAGTG GCTGGGGAAG AGTCTTCCAC AAAGGGAGAT CAGCTTTAGT TCTTCAGTAC 1201 CTTAGAGTTC CACTTGTTGA CCGAGCCACA TGTCTTCGAT CTACAAAGTT CACCATCTAT 1261 AACAACATGT TCTGTGCTGG CTTCCATGAA GGAGGTAGAG ATTCATGTCA AGGAGATAGT 1321 GGGGGACCCC ATGTTACTGA AGTGGAAGGG ACCAGTTTCT TAACTGGAAT TATTAGCTGG 1381 GGTGAAGAGT GTGCAATGAA AGGCAAATAT GGAATATATA CCAAGGTATC CCGGTATGTC 1441 AACTGGATTA AGGAAAAAAC AAAGCTCACT TAATGAAAGA TGGATTTCCA AGGTTAATTC 1501 ATTGGAATTG AAAATTAACA GGGCCTCTCA CTAACTAATC ACTTTCCCAT CTTTTGTTAG 1561 ATTTGAATAT ATACATTCTA TGATCATTGC TTTTTCTCTT TACAGGGGAG AATTTCATAT 1621 TTTACCTGAG CAAATTGATT AGAAAATGGA ACCACTAGAG GAATATAATG TGTTAGGAAA 1681 TTACAGTCAT TTCTAAGGGC CCAGCCCTTG ACAAAATTGT GAAGTTAAAT TCTCCACTCT 1741 GTCCATCAGA TACTATGGTT CTCCACTATG GCAACTAACT CACTCAATTT TCCCTCCTTA 1801 GCAGCATTCC ATCTTCCCGA TCTTCTTTGC TTCTCCAACC AAAACATCAA TGTTTATTAG 1861 TTCTGTATAC AGTACAGGAT CTTTGGTCTA CTCTATCACA AGGCCAGTAC CACACTCATG 1921 AAGAAAGAAC ACAGGAGTAG CTGAGAGGCT AAAACTCATC AAAAACACTA CTCCTTTTCC 1981 TCTACCCTAT TCCTCAATCT TTTACCTTTT CCAAATCCCA ATCCCCAAAT CAGTTTTTCT 2041 CTTTCTTACT CCCTCTCTCC CTTTTACCCT CCATGGTCGT TAAAGGAGAG ATGGGGAGCA 2101 TCATTCTGTT ATACTTCTGT ACACAGTTAT ACATGTCTAT CAAACCCAGA CTTGCTTCCA 2161 TAGTGGGGAC TTGCTTTTCA GAACATAGGG ATGAAGTAAG GTGCCTGAAA AGTTTGGGGG 2221 AAAAGTTTCT TTCAGAGAGT TAAGTTATTT TATATATATA ATATATATAT AAAATATATA 2281 ATATACAATA TAAATATATA GTGTGTGTGT GTATGCGTGT GTGTAGACAC ACACGCATAC 2341 ACACATATAA TGGAAGCAAT AAGCCATTCT AAGAGCTTGT ATGGTTATGG AGGTCTGACT 2401 AGGCATGATT TGACGAAGGC AAGATTGGCA TATCATTGTA ACTAAAAAAG CTGACATTGA 2461 CCCAGACATA TTGTACTCTT TCTAAAAATA ATAATAATAA TGCTAACAGA AAGAAGAGAA 2521 CCGTTCGTTT GCAATCTACA GCTAGTAGAG ACTTTGAGGA AGAATTCAAC AGTGTGTCTT 2581 CAGCAGTGTT CAGAGCCAAG CAAGAAGTTG AAGTTGCCTA GACCAGAGGA CATAAGTATC 2641 ATGTCTCCTT TAACTAGCAT ACCCCGAAGT GGAGAAGGGT GCAGCAGGCT CAAAGGCATA 2701 AGTCATTCCA ATCAGCCAAC TAAGTTGTCC TTTTCTGGTT TCGTGTTCAC CATGGAACAT 2761 TTTGATTATA GTTAATCCTT CTATCTTGAA TCTTCTAGAG AGTTGCTGAC CAACTGACGT 2821 ATGTTTCCCT TTGTGAATTA ATAAACTGGT GTTCTGGTTC ATACCTTGGC TTTTTGTGGA 2881 TTCCATTGAT GTGAATCAGT CACCCTGTAT TTGATGATGC ATGGGACTAC TGACAAAATC 2941 ACTCTGACTC TGACCCTGCC AAGCTGCTGC CTTCTCCTGC CCCAACCTCA CCCCCAGCCA 3001 GGCCTCACTC TTGCTAGTTC CTTTAGTTCT TTTAGTCAAT ATATTTTTGT CTTCGCATAT 3061 AAGTATAAAT AAACATATTT TTAAATTTCT TGGCTGGGCC CAGTGGCTCA CGCCTATAAT 3121 C // LOCUS HUMFMSB 256 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human c-fms proto-oncogene, exon 2, partial cds. ACCESSION M11067 KEYWORDS fms oncogene; oncogene; proto-oncogene. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 256) AUTHORS Nienhuis,A.W., Bunn,H.F., Turner,P.H., Gopal,T.V., Nash,W.G., O'Brien,S.J. and Sherr,C.J. TITLE Expression of the human c-fms proto-oncogene in hematopoietic cells and its deletion in the 5q- syndrome JOURNAL Cell 42, 421-428 (1985) STANDARD full staff_review COMMENT A printed copy of the sequence in [1] was kindly provided by A.W.Nienhuis, 17-JAN-1986. Although this sequence is from the second of five regions spanning about 27 kb that hybridizes with the fms probe, it may not actually be exon 2 of the human gene. The first codon in this sequence is equivalent to codon 277 of the SM-FeSV fms coding sequence (Hampe,A. et al. PNASU 81, 85-89 (1984)). The fms protein may be related to or identical to the receptor for macrophage colony stimulating factor (CSF-1) [1]. FEATURES from to/span description pept / 58 / 213 fms protein, exon 2 (AA 277 at 58) IVS < 1 57 fms cds intron A IVS 214 > 256 fms cds intron B BASE COUNT 63 A 76 C 53 G 64 T ORIGIN At BglII site; chromosome 5 long arm. 1 GATCTTTCTA ATCCCTAAAC TACCTTGGTA TAACATCTCC ATTCCTTCTG GCTGCAGCTC 61 GCAATCCCTC AACAATCTGA CTTTCATAAT AACCGTTACC AAAAAGTCCT GACCCTCAAC 121 CTCGATCAAG TAGATTTCCA ACATGCCGGC AACTACTCCT GCGTGGCCAG CAACGTGCAG 181 GGCAAGCACT CCACCTCCAT GTTCTTCCGG GTGGTAGGTA AGCATCAGGG TGGTGGTGGA 241 CAGTCGGTAG GGATCC // LOCUS HUMFMSD 824 bp ds-DNA PRI 02-MAY-1986 DEFINITION Human cellular fms proto-oncogene with 426 bp deletion, partial. ACCESSION K03011 KEYWORDS fms oncogene; oncogene; proto-oncogene. SOURCE Human (acute lymphocytic leukemia patient 1283) leukocyte DNA, clone ALLW-1283-C122. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 824) AUTHORS Verbeek,J.S., Roebroek,A.J.M., van den Ouweland,A.M.W., Bloemers,H.P.J. and Van de Ven,W.J.M. TITLE Human c-fms proto-oncogene: Comparative analysis with an abnormal allele JOURNAL Mol. Cell. Biol. 5, 422-426 (1985) STANDARD full staff_review COMMENT Comparison of restriction maps showed this to be an abnormal allele. FEATURES from to/span description IVS < 1 634 c-fms cds intron D IVS 762 > 824 c-fms cds intron E mut 305 306 aa in abnormal; a...426 bp...a in normal allele BASE COUNT 199 A 229 C 192 G 204 T ORIGIN Sau3A site. 1 GATCATTCCC ATGTCAAGCA TACAGCATTT CCATGCACTG TTCCTGGAAA ATTCTTCCTC 61 TGATGGTCAC ATGGTGGGCT CTTTAGGGGC CTTCCCTGAC TTATCTTACT TTATTTTCTT 121 CATAGCACCA CTTGAGAATC TCCTAGATAC ATGTTTATTT GCGTTTAATG CCTCTCTCAG 181 CCACTAGAAT GCAAACTCCA TGGAGGGGCA GGGACTTTGT CCTGTTCAAC TCTGAATCAG 241 CGGTGCCTGA CACAAATAGA TGTTCAAGAA AGTATGTGGA TGGGCTACTA TTATTCAGCC 301 TTAAAAAGTT CCACCCTCCC TCCTGGCTCA GGAATTACCA GATTATCAGA GATATCAGGG 361 CCCTCAGAGG TTGTCTTGTC CAAGGTCTTC AATACACAAA TAGTGAAACA GGCTTGGAGA 421 AGGGAAGGTC ACACAACAAG GCAGAGTCAA GCAGGAACAT GCTCTCAGTG CTATGTTCAT 481 GAGACGACCT CTCTCAGCCC AGAGCAGGCC TTGCCCTGCC TTCTCCCACT GGGCGCCTTG 541 GGACTGCCCA CACCCCTGCT CTTGGGGGTC AGAAACAAGG TCCAGGAACT GCCTGCCAGC 601 CCCGACTGCC ACGTGCTCCC TTCCTCTTCT GCAGAAGCCC AAGTACCAGG TCCGCTGGAA 661 GATCATCGAG AGCTATGAGG GCAACAGTTA TACTTTCATC GACCCCACGC AGCTGCCTTA 721 CAACGAGAAG TGGGAGTTCC CCCGGAACAA CCTGCAGTTT GGTGAGATGG CAGCTCATCA 781 CTCCACAGCT TCCTATCACA GGGCCCTGTG GGGGTTGCAG GGAG // LOCUS HUMFMSN 1250 bp ds-DNA PRI 02-MAY-1986 DEFINITION Human cellular fms proto-oncogene, partial. ACCESSION K03012 KEYWORDS fms oncogene; oncogene; proto-oncogene. SOURCE Human (chronic myelocytic leukemia patient 25383) leukocyte DNA, clone CMLW-25383-C17-3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1250) AUTHORS Verbeek,J.S., Roebroek,A.J.M., van den Ouweland,A.M.W., Bloemers,H.P.J. and Van de Ven,W.J.M. TITLE Human c-fms proto-oncogene: Comparative analysis with an abnormal allele JOURNAL Mol. Cell. Biol. 5, 422-426 (1985) STANDARD full staff_review COMMENT Restriction map analysis has shown this to be a normal allele. The abnormal allele in clone ALLW-1283-C122 has a 426 bp deletion from position 306 to 731 (see separate entry). FEATURES from to/span description ORF / 1061 / 1187 c-fmc open reading frame IVS < 1 1060 c-fms intron D IVS 1188 > 1250 c-fms intron E BASE COUNT 337 A 286 C 311 G 316 T ORIGIN Sau3A site. 1 GATCATTCCC ATGTCAAGCA TACAGCATTT CCATGCACTG TTCCTGGAAA ATTCTTCCTC 61 TGATGGTCAC ATGGTGGGCT CTTTAGGGGC CTTCCCTGAC TTATCTTACT TTATTTTCTT 121 CATAGCACCA CTTGAGAATC TCCTAGATAC ATGTTTATTT GCGTTTAATG CCTCTCTCAG 181 CCACTAGAAT GCAAACTCCA TGGAGGGGCA GGGACTTTGT CCTGTTCAAC TCTGAATCAG 241 CGGTGCCTGA CACAAATAGA TGTTCAAGAA AGTATGTGGA TGGGCTACTA TTATTCAGCC 301 TTAAAAAGGA AGGGAATTCT GACCTGTGCT GCAGCATGAA TGAACCTTGA AGACATTATG 361 CTGGGTGAAA TAAGGCAATC TCAATAGACA CATGCTGTGT GAGTCCACTG AGGTGCAGTG 421 CCTAGAGCAG TGCAATTCAC AGAGACAGCA GAATCATGGT TCGCCAGGGG CTGGAGGAGG 481 GAAAGGGGAG TTGCTTTTTA ACAGGAACAG AATTTCAGTT TTGCAAGATG AAAAGAGCTC 541 TGGAAACTGG TTGCACAAGG TAGAATGTAA TTTACTTAAT ACTACTGAAC CATACACTTA 601 AAAATGGTTG AAATGGTAAA TTTCATGTAT GTTTTATCAC AATTAAAATA TATATATATA 661 TTTGGATGGG AGGTTGGGTG GGTGGATGGA TGGGTAGATG GATGGACAGA TGAACGGATG 721 GATAAGATCT CAAGTTCCAC CCTCCCTCCT GGCTCAGGAA TTACCAGATT ATCAGAGATA 781 TCAGGGCCCT CAGAGGTTGT CTTGTCCAAG GTCTTCAATA CACAAATAGT GAAACAGGCT 841 TGGAGAAGGG AAGGTCACAC AACAAGGCAG AGTCAAGCAG GAACATGCTC TCAGTGCTAT 901 GTTCATGAGA CGACCTCTCT CAGCCCAGAG CAGGCCTTGC CCTGCCTTCT CCCACTGGGC 961 GCCTTGGGAC TGCCCACACC CCTGCTCTTG GGGGTCAGAA ACAAGGTCCA GGAACTGCCT 1021 GCCAGCCCCG ACTGCCACGT GCTCCCTTCC TCTTCTGCAG AAGCCCAAGT ACCAGGTCCG 1081 CTGGAAGATC ATCGAGAGCT ATGAGGGCAA CAGTTATACT TTCATCGACC CCACGCAGCT 1141 GCCTTACAAC GAGAAGTGGG AGTTCCCCCG GAACAACCTG CAGTTTGGTG AGATGGCAGC 1201 TCATCACTCC ACAGCTTCCT ATCACAGGGC CCTGTGGGGG TTGCAGGGAG // LOCUS HUMFN3 742 bp ds-DNA PRI 31-AUG-1987 DEFINITION Human fibronectin (FN) gene, 5' region. ACCESSION M15801 KEYWORDS fibronectin. SOURCE Human fibrosacoma cell line HT1080, DNA, clone pgHF3.7. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 742) AUTHORS Dean,D.C., Bowlus,C.L. and Bourgeois,S. TITLE Cloning and analysis of the promoter region of the human fibronectin gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 1876-1880 (1987) STANDARD full staff_review COMMENT Draft entry and clean copy of sequence [1] kindly provided by, S.Bourgeois , 01-JUN-1987. FEATURES from to/span description pept 577 / 724 fibronectin, exon 1 /nomgen="FN1" /map="2q34-q36" /hgml_locus_uid="LC0065F" sigp 577 669 fibronectin signal peptide matp 670 / 724 fibronectin mature peptide pre-msg 310 > 742 FN mRNA IVS 725 > 742 FN intron A BASE COUNT 118 A 264 C 240 G 120 T ORIGIN 147 bp upstream of SmaI site; map position 8. 1 CCAGCCGCTT CCCATCCCTT CCCCCATCCC CTAAAAAGTT TGATGACCGC AAAGGAAACC 61 GAAAAAAAGT TGTCTTGCCC CAGTCCTGGC GGGCCATCAG CATCTCTTTT GTTCGCTGCG 121 AACCCACAGT CCCCCGTGAC GTCACCCGGG AGCCCGGGCC AATCGGGCGC GGTCGGCTGC 181 GGCGGCCGGC GGGCGGGCGG GTGGGGTGGG GCGGGGCGGG GACAGCCCGG CGGGTCTCTC 241 CTCCCCCGCG CCCCGGGCCT CCAGAGGGGC GGGAGGGCCG TCCCATATAA GCCCGGCTCC 301 CGCGCTCCGA CGCCCGCGCC GGCTGTGCTG CACAGGGGGA GGAGAGGGAA CCCCAGGCGC 361 GAGCGGGAAG AGGGGACCTG CAGCCACAAC TTCTCTGGTC CTCTGCATCC CTTCTGTCCC 421 TCCACCCGTC CCCTTCCCCA CCCTCTGGCC CCCACCTTCT TGGAGGCGAC AACCCCCGGG 481 AGGCATTAGA AGGGATTTTT CCCGCAGTTG CGAAGGGAAG CAAACTTGGT GGCAACTTGC 541 CTCCCGGTGC GGGCGTCTCT CCCCCACCGT CTCAACATGC TTAGGGGTCC GGGGCCCGGG 601 CTGCTGCTGC TGGCCGTCCT GTGCCTGGGG ACAGCGGTGC CCTCCACGGG AGCCTCGAAG 661 AGCAAGAGGC AGGCTCAGCA AATGGTTCAG CCCCAGTCCC CGGTGGCTGT CAGTCAAAGC 721 AAGCGTGAGT ACTGACCGCG GG // LOCUS HUMFN3A 1236 bp ds-DNA PRI 15-JUN-1988 DEFINITION Human fibronectin gene type III homology unit corresponding to the cell-binding domain, exons 6 and 7. ACCESSION M12549 KEYWORDS fibronectin. SOURCE Human DNA (library of T.Maniatis), clone lambda-HFN4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1236) AUTHORS Oldberg,A. and Ruoslahti,E. TITLE Evolution of the fibronectin gene: Exon structure of cell attachment domain JOURNAL J. Biol. Chem. 261, 2113-2116 (1986) STANDARD full staff_entry FEATURES from to/span description pept / 155 271 fibronectin type III, exon 6 (AA at 157) 948 / 1112 fibronectin type III, exon 7 IVS < 1 154 FN intron E IVS 272 947 FN intron F IVS 1113 > 1236 FN intron G BASE COUNT 393 A 238 C 219 G 386 T ORIGIN EcoRI site. 1 AATTCGTCAT AATATAAAAT TCTGCTAATT ACACATAGTA TTTACATGAT TGTATGGAAG 61 AATCATTAAG ACAATTATCT GGAAAATGAA CAAACAGTAA ATCTGAATAT TGTTTGAAAA 121 TTACGGATGT GAAAAGTTTC CCTTTTTTTT CTAGTTTCTG ATGTTCCGAG GGACCTGGAA 181 GTTGTTGCTG CGACCCCCAC CAGCCTACTG ATCAGCTGGG ATGCTCCTGC TGTCACAGTG 241 AGATATTACA GGATCACTTA CGGAGAAACA GGTACAGCAG TAAAATGCTA TTTTACACTC 301 TGATTAAATC AGATTCTGTT GTGGATAACC TGAAAGCCCA ACAGTGAACA AGAATTAAAG 361 AAACTTTGGC AAGTCCATTC AACGGAGCCC TTGTTTTTTC CAAGAAAATA GCTAAGATAT 421 AGATGATATA ATTTGTTCTA AAACCCAAAT AAAAAGTTGT TTATATACTA CAACTAGAGG 481 GGGAACGGCA GAGCTGAGGA AATAAAAGGA TTGTAAATTC TCAAACATAT TATCAGTGGT 541 GGAAATAAAG TGATTTTTAT TTTTTCTTCT CTTTACTTTT CTGTATTTTC CAAATTTTAT 601 TTAAAAGGAA TGTATTCTGT TAAAAGTTTT AAAAAGGACA CAATGCATGC AATCCTGGGT 661 TGAGGGCTTA CCTTCTCCCA CTTCTAATGC TACTCTACTA CTCAGTGACA TTTTAAAGCT 721 GAAATGTTAA AACAGCGCTA ACTGTAATTT TCTCTCAATG TTTATACACT TTACCAAGGT 781 TTGCTACATG CATAAATACC CCTTTCTGTT CAAGATAGCG CTCTTTAAAA GGGAATAAGC 841 AAGAAGATGT GATTTACATG CTGCTATAAA TGTGGTAATT CAATTAATCA GTAATACCCA 901 AGTAGCTCTA AACCCCTCAC ACTCTGAACT AACCCTTTTT CATACAGGAG GAAATAGCCC 961 TGTCCAGGAG TTCACTGTGC CTGGGAGCAA GTCTACAGCT ACCATCAGCG GCCTTAAACC 1021 TGGAGTTGAT TATACCATCA CTGTGTATGC TGTCACTGGC CGTGGAGACA GCCCCGCAAG 1081 CAGCAAGCCA ATTTCCATTA ATTACCGAAC AGGTACAAAC TTCTACTCTG GGGTGACACC 1141 AGCTTTTACT TATTCAGATA CTGTTTTGCA ATGTTCTCCC AAGGTATTTT TCTAATTGTA 1201 GAATAGATTT TCCTTTTTAA TGAGCAACAA CCTGCA // LOCUS HUMFNA1 40 bp ds-DNA PRI 15-MAR-1989 DEFINITION Human fibronectin gene extra type III repeat (EDII), exon x. ACCESSION M18177 KEYWORDS fibronectin. SEGMENT 1 of 3 SOURCE Human liver, cDNA to mRNA, clone lambda-FN4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 40) AUTHORS Gutman,A. and Kornblihtt,A.R. TITLE Identification of a third region of cell-specific alternative splicing in human fibronectin mRNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 7179-7182 (1987) STANDARD simple staff_entry FEATURES from to/span description pept < 1 + 28 fibronectin, exon x (AA at 1) /nomgen="FN1" /map="2q34-q36" /hgml_locus_uid="LC0065F" IVS 29 > 40 intron y BASE COUNT 13 A 11 C 5 G 11 T ORIGIN 1 GTCCCTATCT CTGATACCAT CATCCCAGGT AATAGAAAAT // LOCUS HUMFNA2 354 bp ds-DNA PRI 15-MAR-1989 DEFINITION Human fibronectin gene extra type III repeat (EDII), exon x+1. ACCESSION M18178 KEYWORDS fibronectin. SEGMENT 2 of 3 SOURCE Human liver, cDNA to mRNA, clone lambda-FN4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 354) AUTHORS Gutman,A. and Kornblihtt,A.R. TITLE Identification of a third region of cell-specific alternative splicing in human fibronectin mRNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 7179-7182 (1987) STANDARD simple staff_entry FEATURES from to/span description pept + 72 + 344 fibronectin, exon x+1 /nomgen="FN1" /map="2q34-q36" /hgml_locus_uid="LC0065F" IVS < 1 71 intron y IVS 345 > 354 intron y+1 BASE COUNT 80 A 94 C 72 G 108 T ORIGIN 94 bp upstream of HindIII site. 1 GCTTTATTTT TCCTCTTTTT TCCCTCTATT TTCCTTTTGC CTCCCCCTCC CTTTGCTTTG 61 TAACTCAATA GAGGTGCCCC AACTCACTGA CCTAAGCTTT GTTGATATAA CCGATTCAAG 121 CATCGGCCTG AGGTGGACCC CGCTAAACTC TTCCACCATT ATTGGGTACC GCATCACAGT 181 AGTTGCGGCA GGAGAAGGTA TCCCTATTTT TGAAGATTTT GTGGACTCCT CAGTAGGATA 241 CTACACAGTC ACAGGGCTGG AGCCGGGCAT TGACTATGAT ATCAGCGTTA TCACTCTCAT 301 TAATGGCGGC GAGAGTGCCC CTACTACACT GACACAACAA ACGGGTGAAT TTTG // LOCUS HUMFNA3 71 bp ds-DNA PRI 15-MAR-1989 DEFINITION Human fibronectin gene extra type III repeat (EDII), exon x+2. ACCESSION M18179 KEYWORDS fibronectin. SEGMENT 3 of 3 SOURCE Human liver, cDNA to mRNA, clone lambda-FN4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 71) AUTHORS Gutman,A. and Kornblihtt,A.R. TITLE Identification of a third region of cell-specific alternative splicing in human fibronectin mRNA JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 7179-7182 (1987) STANDARD simple staff_entry FEATURES from to/span description pept + 46 > 71 fibronectin, exon x+2 /nomgen="FN1" /map="2q34-q36" /hgml_locus_uid="LC0065F" IVS < 1 45 intron y+1 BASE COUNT 17 A 21 C 11 G 22 T ORIGIN 1 TAGGAAAATT ACATTAACTG AGTCTTCCTT CTTGTTCGAA ACCAGCTGTT CCTCCTCCCA 61 CTGACCTGCG A // LOCUS HUMFOL1 1908 bp ds-DNA PRI 15-MAR-1988 DEFINITION Human dihydrofolate reductase gene, exons 1 and 2. ACCESSION K01612 M10235 KEYWORDS dihydrofolate reductase; reductase. SEGMENT 1 of 5 SOURCE Human HeLa cells DNA, clone cos hDHFR-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1908) AUTHORS Chen,M.-J., Shimada,T., Moulton,A.D., Cline,A., Humphries,R.K., Maizel,J. and Nienhuis,A.W. TITLE The functional human dihydrofolate reductase gene JOURNAL J. Biol. Chem. 259, 3933-3943 (1984) STANDARD full staff_review REFERENCE 2 (bases 800 to 1325; mRNA start sites) AUTHORS Masters,J.N. and Attardi,G. TITLE Discrete human dihydrofolate reductase gene transcripts present in polysomal RNA map with their 5' ends several hundred nucleotides upstream of the main mRNA site JOURNAL Mol. Cell. Biol. 5, 493-500 (1985) STANDARD full staff_review FEATURES from to/span description pept 1323 1408 dihydrofolate reductase (EC 1.5.1.3) exon 1 /nomgen="DHFR" /map="5q11.1-q13.2" /hgml_locus_uid="LD0137D" 1772 + 1821 dihydrofolate reductase (EC 1.5.1.3) exon 2 pre-msg 844 > 1908 DHFR mRNA (minor alt.) pre-msg 854 > 1908 DHFR mRNA (minor alt.) pre-msg 858 > 1908 DHFR mRNA (minor alt.) pre-msg 864 > 1908 DHFR mRNA (minor alt.) pre-msg 868 > 1908 DHFR mRNA (minor alt.) pre-msg 875 > 1908 DHFR mRNA (minor alt.) pre-msg 1252 > 1908 DHFR mRNA (major alt.) IVS 1409 1771 DHFR intron A IVS 1822 > 1908 DHFR intron B BASE COUNT 441 A 551 C 531 G 385 T ORIGIN 1 bp upstream of first EcoRI site 5' to exon 1. 1 GAATTCATGA AAACGTAGCT CGTCCTCAAA AAAAACAGAA GAGGAGTAAT CATTTTAAGG 61 GAGAAATATA TACGAAAGGA ACAAGATTTT GAAGCACCCA AGCTGCCACC TACATTAAAA 121 CACGGTAGGT GGCTAAACAC CAGTCTTCAA TGCCCTTCCA CAGCCTCAGT CTGAAAAATA 181 CTGTGCAGGT GACCCAAGTG AGGGGTCACC CTTGGGCTTT TCCTGTGGCA GTATCTCTGG 241 TTTAAAAACA AACAAACGTA CTTATTGCGT TGAAGGACGG CAACAGGAAG GACTCCATGA 301 TTAGTCACAT CTATACCATC CTAAGAAACT TTATCCACCC AAACTGTATT TCAGACTTTA 361 TAATCTAAAC TACAAAAAGT GTTCACTGGG GAACTGCACA ATATGACTGC TTTTAACCGT 421 AGTGATTTCA AATATTGAGC CATGCTGTTG CAGTCTTAAA AACTGGAGAC CTAAGGGCAG 481 CTTTCTTCTA GTCACCCAAT CCAGCACTTT TTTAAAAAAT CAGTAAAACT CTTCGACCAC 541 CAAGGAAAAA AAAAAAGGAT GGAGGTTAAA AGACGCACCC CTTGCCCACA AGCCCCCTCA 601 TCAGAATGGG AGTCAGGAGA CCTGAGTTCC TGTCTCAGGC CTGCCATTAA AAACCTGCAT 661 AACCTTTGCC TATCTCCTCA AACGGAAGTA CTAAAACCTC AGCGCTTCAC CCAATTTGTA 721 GCCCCGGCTG GGCTCTTCCC ACCTTCCCCT TCTTCAGCCC GCCCCTTCCT CCTCCAGCCC 781 TATCATCGGG CGGAGGGTCC CCGCCTCCGC CCGCCTTACC CACAAGCCCC GCCCCCCCAG 841 CCCCGATGGC CCTGCCCAGT CCCAGACAGA ACCTACTACG TGCGGCGGCA GCTGGGGCGG 901 GAAGGCGGGC GCTGGGGGCG CTGCGGCCGC TGCAGCGCAG GGTCCACCTG GTCGGCTGCA 961 CCTGTGGAGG AGGAGGTGGA TTTCAGGCTT CCCGTAGACT GGAAGAATCG GCTCAAAACC 1021 GCTTGCCTCG CAGGGGCTGA GCTGGAGGCA GCGAGGCCGC CCGACGCAGG CTTCCGGCGA 1081 GACATGGCAG GGCAAGGATG GCAGCCCGGC GGCAGGGCCC GGCGAGGAGC GCGAACCCGC 1141 GGCCGCAGTT CCCAGGCGTC TGCGGGCGCG AGCACGCCGC GACCCTGCGT GCGCCGGGGC 1201 GGGGGGGCGG GGCCTCGCCT GCACAAATAG GGACGAGGGG GCGGGGCGGC CACAATTTCG 1261 CGCCAAACTT GACCGCGCGT TCTGCTGTAA CGAGCGGGCT CGGAGGTCCT CCCGCTGCTG 1321 TCATGGTTGG TTCGCTAAAC TGCATCGTCG CTGTGTCCCA GAACATGGGC ATCGGCAAGA 1381 ACGGGGACCT GCCCTGGCCA CCGCTCAGGT ATCTGCCGGG CCGGGGCGAT GGGACCCAAA 1441 CGGGCGCAGG CTGCCCACGG TCGGGGTACC TGGGCGGGAC GCGCCGGCCG ACTCCCGGCG 1501 AGAGGATGGG GCCAGACTTG CGGTCTGCGC TGGCAGGAAG GGTGGGCCCG ACTGGATTCC 1561 CCTTTTCTGC TGCGCGGGAG GCCCAGTTGC TGATTTCTGC CCGGATTCTG CTGCCCGGTG 1621 AGGTCTTGCC CTGCGGCGCC CTCGCCCAGG GCAAAGTCCC AGCCCTGGAG AAAACACCTC 1681 ACCCCTACCC ACAGCGCTCC GTTTGTCAGG TGCCTTAGAG CTCGAGCCCA AGGGATAATG 1741 TTTCGAGTAA CGCTGTTTCT CTAACTTGTA GGAATGAATT CAGATATTTC CAGAGAATGA 1801 CCACAACCTC TTCAGTAGAA GGTAATGTGG GATTAAGTAG GGTCTTGCTT GATGAAGTTT 1861 ACCAGTGCAA ATGTTAGTTA AATGGAAAGT TTTCCGTGTT AATCTGGG // LOCUS HUMFOL3 177 bp ds-DNA PRI 15-MAR-1988 DEFINITION Human dihydrofolate reductase gene, exon 4. ACCESSION J00138 KEYWORDS dihydrofolate reductase; reductase. SEGMENT 3 of 5 SOURCE Human fetal liver DNA library of T.Maniatis; clone lambda-hDHFR-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 169) AUTHORS Chen,M.-J., Shimada,T., Moulton,A.D., Harrison,M. and Neinhuis,A.W. TITLE Intronless human dihydrofolate reductase genes are derived from processed RNA molecules JOURNAL Proc. Natl. Acad. Sci. U.S.A. 79, 7435-7439 (1982) STANDARD full staff_entry REFERENCE 2 (bases 1 to 177) AUTHORS Chen,M.-J., Shimada,T., Moulton,A.D., Cline,A., Humphries,R.K., Maizel,J. and Nienhuis,A.W. TITLE The functional human dihydrofolate reductase gene JOURNAL J. Biol. Chem. 259, 3933-3943 (1984) STANDARD full staff_entry COMMENT The location of exon 3 is uncertain [2]. This segment occurs about 19 kb after . [1] includes two bases (ga) before base 1 of the sequence shown, which are left out without comment in [2]. FEATURES from to/span description pept + 20 + 146 dihydrofolate reductase (EC 1.5.1.3) exon 4 /nomgen="DHFR" /map="5q11.1-q13.2" /hgml_locus_uid="LD0137D" pre-msg < 1 > 177 DHFR mRNA IVS < 1 19 DHFR intron C IVS 147 > 177 DHFR intron D BASE COUNT 56 A 28 C 35 G 58 T ORIGIN 10-14 kb after , see comment. 1 ATTCTTTTTT TAACCTTAGG GAACCTCCAC AAGGAGCTCA TTTTCTTTCC AGAAGTCTAG 61 ATGATGCCTT AAAACTTACT GAACAACCAG AATTAGCAAA TAAAGTAGAC ATGGTCTGGA 121 TAGTTGGTGG CAGTTCTGTT TATAAGGTAT GTGTTGTTGT TGGAAATAAA AACCAAT // LOCUS HUMFOL4 211 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human dihydrofolate reductase gene, exon 5. ACCESSION K01614 KEYWORDS dihydrofolate reductase; reductase. SEGMENT 4 of 5 SOURCE Human fetal liver DNA library of T.Maniatis; clone lambda-hDHFR-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 211) AUTHORS Chen,M.-J., Shimada,T., Moulton,A.D., Cline,A., Humphries,R.K., Maizel,J. and Nienhuis,A.W. TITLE The functional human dihydrofolate reductase gene JOURNAL J. Biol. Chem. 259, 3933-3943 (1984) STANDARD full staff_entry FEATURES from to/span description pept + 27 + 142 dihydrofolate reductase (EC 1.5.1.3) exon 5 /nomgen="DHFR" /map="5q11.1-q13.2" /hgml_locus_uid="LD0137D" pre-msg < 1 > 211 DHFR mRNA IVS < 1 26 DHFR intron D IVS 143 > 211 DHFR intron E BASE COUNT 74 A 32 C 39 G 66 T ORIGIN about 3.5 kb after . 1 AAAACTGATG ATGTTTTATT TCAAAGGAAG CCATGAATCA CCCAGGCCAT CTTAAACTAT 61 TTGTGACAAG GATCATGCAA GACTTTGAAA GTGACACGTT TTTTCCAGAA ATTGATTTGG 121 AGAAATATAA ACTTCTGCCA GAGTAAGTAT AAGGTTATTA ATTAGTCTGA AGCACTTTGG 181 ATTTCCTGCT TAAGACTATA GAAATAACAT G // LOCUS HUMFOL5 3464 bp ds-DNA PRI 15-MAR-1988 DEFINITION Human dihydrofolate reductase gene, exon 6 and 3' flank. ACCESSION J00139 KEYWORDS dihydrofolate reductase; reductase. SEGMENT 5 of 5 SOURCE Human fetal liver DNA library of T.Maniatis; clone lambda-hDHFR-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 2878 to 3047) AUTHORS Chen,M.-J., Shimada,T., Moulton,A.D., Harrison,M. and Neinhuis,A.W. TITLE Intronless human dihydrofolate reductase genes are derived from processed RNA molecules JOURNAL Proc. Natl. Acad. Sci. U.S.A. 79, 7435-7439 (1982) STANDARD full staff_entry REFERENCE 2 (bases 1 to 3464) AUTHORS Chen,M.-J., Shimada,T., Moulton,A.D., Cline,A., Humphries,R.K., Maizel,J. and Nienhuis,A.W. TITLE The functional human dihydrofolate reductase gene JOURNAL J. Biol. Chem. 259, 3933-3943 (1984) STANDARD full staff_entry COMMENT There are 3 DHFR mRNA species, 800 bp, 1000 bp, and 3800 bp. Only the 3800 bp species contains the putative poly-A signal ('aataaa'). FEATURES from to/span description pept + 42 120 dihydrofolate reductase (EC 1.5.1.3) exon 6 /nomgen="DHFR" /map="5q11.1-q13.2" /hgml_locus_uid="LD0137D" pre-msg < 1 207 DHFR mRNA (alt.) pre-msg < 1 456 DHFR mRNA (alt.) pre-msg < 1 2977 DHFR mRNA (alt.) IVS < 1 41 DHFR intron E BASE COUNT 1130 A 646 C 729 G 959 T ORIGIN about 4 kb after . 1 TTTAATGGAC ACATAATTTA ATTATATATT TTTTCTTACA GATACCCAGG TGTTCTCTCT 61 GATGTCCAGG AGGAGAAAGG CATTAAGTAC AAATTTGAAG TATATGAGAA GAATGATTAA 121 TATGAAGGTG TTTTCTAGTT TAAGTTGTTC CCCCTCCCTC TGAAAAAAGT ATGTATTTTT 181 ACATTAGAAA AGGTTTTTTG TTGACTTTAG ATCTATAATT ATTTCTAAGC AACTAGTTTT 241 TATTCCCCAC TACTCTTGTC TCTATCAGAT ACCATTTATG AGACATTCTT GCTATAACTA 301 AGTGCTTCTC CAAGACCCCA ACTGAGTCCC CAGCACCTGC TACAGTGAGC TGCCATTCCA 361 CACCCATCAC ATGTGGCACT CTTGCCAGTC CTTGACATTG TCGGGCTTTT CACATGTTGG 421 TAATATTTAT TAAAGATGAA GATCCACATA CCCTTCAACT GAGCAGTTTC ACTAGTGGAA 481 ATACCAAAAG CTTCCTACGT GTATATCCAG AGGTTTGTAG ATAAATGTTG CCACCTTGTT 541 TGTAACAGTG AAAAATTGAA AACAACCTGG AAGTCCAGTG ATGGGAAAAT GAGTATGTTT 601 CTGTCTTAGA TTGGGGAACC CAAAGCAGAT TGCAAGACTG AAATTTCAGT GAAAGCAGTG 661 TATTTGCTAG GTCATACCAG AAATCATCAA TTGAGGTACG GAGAAACTGA ACTGAGAAGG 721 TAAGAAAAGC AATTTAAAGT CAGCGAGCAG GTTCTCATTG ATAACAAGCT CCATACTGCT 781 GAGATACAGG GAAATGGAGG GGGGAAAGCT GGAGTATTGA TCCCGCCCCC CTCCTTGGTT 841 GTCAGCTCCC TGTCCTGTGT GTGGGCGGAA CATAGTCCAG CTGCTCTATA GCAAGTCTCA 901 GGTGTTTGCA GTAAGAAGCT GCTGGCATGC ACGGGAACAG TGAATGCCAA ACACTTAAAG 961 CAATTCGATG TTTAAGTATG TAAGTTCTTT TTTTTTTAGA CAGCGTTTCG CTCTTGTTGC 1021 CCAGGCTAGC ATGCAATGGT GTGACCTCGG CTTACTGCAA CCTCCGCCTT CCCAGATTCA 1081 AGCGATTCTC CTGCCTCAGG CTCCCAAGTA GCTAGGACCA GGTGCGCGCC ACCACGCCCG 1141 GCTAATTTTT GTATTTTGTA TTTTTAGTAG AGATGGGGTT TCACCATGTT GGTCAGGCTA 1201 GTCTCGAACT CGTGACCGCA AGCGATTCAC CCACCTCAGC CTCCCAAAGT GCTGGGATTA 1261 CCGGCTTGAG CCACCACACC CGGCACATCT TCATTCTTTT TATGTAGTAA AAAGTATAAG 1321 GCCACACATG GTTTATTTGA AGTATTTTAT AATTTAAAAA AATACAGAAG CAGGAAAACC 1381 AATTATAAGT TCAAGTGAGG GATGATGGTT GCTTGAACCA AAGGGTTGCA TGTAGTAAGA 1441 AATTGTGATT TAAGATATAT TTTAAAGTTA TAAGTAGCAG GATATTCTGA TGGAGTTTGA 1501 CTTTGGTTTT GGGCCCAGGG AGTTTCAGAT GCCTTTGAGA AATGAATGAA GTAGAGAGAA 1561 AATAAAAGAA AAACCAGCCA GGCACAGTGG CTCACACCTG TAATCCCAGC GCTTTGGGAG 1621 GCTAAGGCAG GCAGATCACT TGAGACCAGC TTGGGCAACA TGGCAAAGCC CCATCTCTAC 1681 AAAAAACACA AAAATTAGCT GGGCATTGTG GCGCACACCT GTATTCCCAT CTAGTCAGGA 1741 AGCTGAGATG GAAGAATTAA TTGAGCCCAC GAGTTCAAGG CTGCAGTGAG TCGTGATTGT 1801 GCCACTGCAC TCCAGCCGGG GTGACAGAAG AGACCTTGTC TCGAAAACGA ATCTGAAAAC 1861 AATGGAACCA TGCCTTCATA ATTCTAGAAA GTTATTTTCA ACTGATAAAT CTATATTCAC 1921 CCAAATAATC AAGGGTGAAG GTAAAATAAT ACATTTTTAG ACAAGCAAAG ACTCAGGGGT 1981 TACCTCCATG TGCCCTTTTT AGGGAAGCTG TTGGAGAAAA TACTCCAGCA AAATGAAGGA 2041 GTACACAAAC CAGAGAATGA CATGAATCCA GCAAATAGGA TCCAACACAG GCAATATTCC 2101 AGCTATGGAG CTAGCTTTAA AAAGGAACAG TAAAAATATT AATCGGTTAG CTGGGTGGAA 2161 TGGCCCATGC CTGTAGTCCC AGCTACTCAG GAGGCTCAGC AGCAGGACGA CTTGAGCCCA 2221 AGAGTTCCAG ACCAGCCTGG CCACCTTAGT GAGATCCCTT CTCTTAAAAA TAATAACTTA 2281 TTGCCAGATT TGGGGCATTT GGAAAGAAGT TCATTGAAGA TAAAGCAAAA GTAAAAAAAA 2341 AAAAAAAAAA AACAAGGGGA AAGGGTTGGT TAGGCAATCA TTCTAGGGCA GAAAGAAGTA 2401 CAGGATAGGA AGAGCATAAT ACACTGTTTT TCTCAACAAG GAGCAGTATG TACACAGTCA 2461 TAATGATGTG ACTGCTTAGC CCCTAAATAT GGTAACTACT CTGGGACAAT ATGGGAGGAA 2521 AAGTGAAGAT TGTGATGGTG TAAGAGCTAA TCCTCATCTG TCATATCCAG AAATCACTAT 2581 ATAATATATA ATAATGAAAT GACTAAGTTA TGTGAGGAAA AAAACAGAAG ACATTGCTAA 2641 AAGAGTTAAA AGTCATTGCT CTGGAGAATT AGGAGGGATG GGGCAGGGGA CTGTTAGGAT 2701 GCATTATAAA CTGAAAAGCC TTTTTAAAAT TTTATGTATT AATATATGCA TTCACTTGAA 2761 AAACTAAAAA AAAACAATAA TTTGGAAAAA CCCATGAAGG TAACTAACGG AAGGAAAAAC 2821 TAAGAGAATG AAAAGTATTT GCCTCTGGAA AGAACAACTG GCAGGACTGT TGTTTTCATT 2881 GTAAGACTTT TGGAGCCATT TAATTGTACT TAACCATTTT CATCTATTTC TTTAATAAGA 2941 ACAATTCCAT CTTAATAAAG AGTTACACTT GTTAATAAGT GCTGGCCTCC TGTTGTTCTT 3001 TGTACACCCC ACACAAAATT TCAAAGAAAC TTTGATGGCA ATATATCTCC ATGGTCAGCT 3061 TAAAAATAGA GAAAGGAAAA CATAGAATTA GCCAAGAGTC ACACAAAACA AAGATCAGTT 3121 GTTTGTTAGG AAACAATCAA AATCAAGTCT CACTTTTTCC AGATTGGCTT ATGGAACAGC 3181 ACTGTAAGGT GATAACTTGG GGCAAACATG TAAATAATAA AACATATGTT TTAAATATTC 3241 AGGTTAGCAC ATTTTATGTT TCTGTGAGAT TAAAATTGTG TGTGACATAC CCGCTTCCTT 3301 AAAGGCAATG TTTCTGAAAA TGTTGTACCT GCTATTCCTG AATCAGGGAT GGGTCCCAGA 3361 ATCTGCCTTT TAAACATCTC AGATAATCTG AAGCCTGCTT AAGTTTGTAA GGCACTGCTT 3421 TTGCACTCTA AGGAAGAAAA AAACAAGTTT TAATTCCCGT CTCT // LOCUS HUMFOLA1 1275 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human dihydrofolate reductase gene (DHFR), exons 1 & 2 and 5' flank. ACCESSION X00855 KEYWORDS dihydrofolate reductase; reductase. SEGMENT 1 of 5 SOURCE Human MTX-resistant cell line 6A3 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1275) AUTHORS Yang,J.K., Masters,J.N. and Attardi,G. TITLE Human dihydrofolate reductase gene organization: Extensive conservation of the G+C-rich 5' non-coding sequence and strong intron size divergence from homologous mammalian genes JOURNAL J. Mol. Biol. 176, 169-187 (1984) STANDARD simple staff_review COMMENT EMBL features not translated to GenBank features: key from to description SITE 1 394 5' flanking region FEATURES from to/span description pept 395 480 dihydrofolate reductase, exon 1 /nomgen="DHFR" /map="5q11.1-q13.2" /hgml_locus_uid="LD0137D" 846 + 895 dihydrofolate reductase, exon 2 IVS 481 845 DHFR intron A IVS 896 > 1275 DHFR intron B BASE COUNT 265 A 329 C 415 G 266 T ORIGIN Chromosome 5q11.1-q13.2. 1 CTGCAGCGCC AGGGTCCACC TGGTCGGCTG CACCTGTGGA GGAGGAGGTG GATTTCAGGC 61 TTCCCGTAGA CTGGAAGAAT CGGCTCAAAA CCGCTTGCCT CGCAGGGGCT GAGCTGGAGG 121 CAGCGAGGCC GCCCGACGCA GGCTTCCGGC GAGACATGGC AGGGCAAGGA TGGCAGCCCG 181 GCGGCAGGGC CCGGCGAGGA GCGCGAACCC GCGGCCGCAG TTCCCAGGCG TCTGCGGGCG 241 CGAGCACGCC GCGACCCTGC GTGCGCCGGG GCGGGGGGGC GGGGCCTCGC CTGCACAAAT 301 AGGGACGAGG GGGCGGGGCG GCCACAATTT CGCGCCAAAC TTGACCGCGC GTTCTGCTGT 361 AACGAGCGGG CTCGGAGGTC CTCCCGCTGC TGTCATGGTT GGTTCGCTAA ACTGCATCGT 421 CGCTGTGTCC CAGAACATGG GCATCGGCAA GAACGGGGAC CTGCCCTGGC CACCGCTCAG 481 GTATCTGCCG GGCCGGGGCG ATGGGACCCA AACGGGCGCA GGCTGCCCAC GGTCGGGGTA 541 CCTGGGCGGG ACGCGCCAGG CCGACTCCCG GCGAGAGGAT GGGGCCAGAC TTGCGGTCTG 601 CGCTGGCAGG AAGGGTGGGC CCGACTGGAT TCCCCTTTTC TGCTGCGCGG GAGGCCCAGT 661 TGCTGATTTC TGCCCGGATT CTGCTGCCCG GTGAGGTCTT TGCCCTGCGG CGCCCTCGCC 721 CAGGGCAAAG TCCCAGCCCT GGAGAAAACA CCTCACCCCT ACCCACAGCG CTCCGTTTGT 781 CAGGTGCCTT AGAGCTCGAG CCCAAGGGAT AATGTTTCGA GTAACGCTGT TTCTCTAACT 841 TGTAGGAATG AATTCAGATA TTTCCAGAGA ATGACCACAA CCTCTTCAGT AGAAGGTAAT 901 GTGGGATTAA GTAGGGTCTT GCTTGATGAA GTTTACCAGT GCAAATGTTA GTTAAATGGA 961 AAGTTTTCCG TGTTAATCTG GGACCTTTTC TCTTATTATG GATCTGTATG ATCTGTATGC 1021 AGTTCCCAAG GTTCATTTAC CATTATTAAA AAATTTTTGT CTTAGAAATT TTATGTATGT 1081 CAACGCACGA GCAAATTATC AGGCATGGGG CAGAATTGGC AACTGGGTGG AGGCTTCGGT 1141 GGAGGTTAGC ACTCCGAAAG GAAAACAGAG TAGGCCTTTG GAACAGCTGC TGGAAGAGAT 1201 AAGGCCTGAA CAAGGGCAGT GGAGAAGAGA GGGTAAAAAT TTTTTAAGGT TACATGACCC 1261 TGGATTTTGG AGATC // LOCUS HUMFOLA2 595 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human dihydrofolate reductase gene (DHFR), exon 3. ACCESSION X00856 KEYWORDS dihydrofolate reductase; reductase. SEGMENT 2 of 5 SOURCE Human MTX-resistant cell line 6A3 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 595) AUTHORS Yang,J.K., Masters,J.N. and Attardi,G. TITLE Human dihydrofolate reductase gene organization: Extensive conservation of the G+C-rich 5' non-coding sequence and strong intron size divergence from homologous mammalian genes JOURNAL J. Mol. Biol. 176, 169-187 (1984) STANDARD simple staff_review FEATURES from to/span description pept + 435 + 540 dihydrofolate reductase, exon 3 /nomgen="DHFR" /map="5q11.1-q13.2" /hgml_locus_uid="LD0137D" IVS < 1 434 DHFR intron B IVS 541 > 595 DHFR intron C BASE COUNT 217 A 101 C 115 G 162 T ORIGIN Chromosome 5q11.1-q13.2. 1 GTTCTTGGAA ACTGTGACTT TAAGTGAAAC AATGTATAGC AAAACCAATT TTACCATAGG 61 CTACTGGATG TACACAAGAG TTAAGTTCCT AGGGCATATT GCTGGTCACA AAAACATCAC 121 CAAACTTCTA AATGAAGGCC AAAAACACTT CTAATAGAAA ACACTGAAGT AAATATGAGC 181 TATACATACA TTTAAGAAGA ATAATGTGCA TGTCTTTGGG ATGTGGAAGG AAACTGGAAT 241 ACCTGGAGAA ACCCCATGCA GACATGGGAA AAGCATGCAG ACTCCACACA GACGGTGGCC 301 CTGGCCAGGA ATTGATTATT TTTCTCATCA ACGTTAGAAG AAAACGATGT TATTCAAGGA 361 CATGTTGTAT ATAAATACAT GAAAATATAT GTATATTCCC AAAATATACT TAGGCTTTCT 421 TTGTGATTTT ATAGGTAAAC AGAATCTGGT GATTATGGGT AAGAAGACCT GGTTCTCCAT 481 TCCTGAGAAG AATCGACCTT TAAAGGGTAG AATTAATTTA GTTCTCAGCA GAGAACTCAA 541 GTAAGTACCT TAACATAAAT TCACCACAAG AAAATCATGT CTCATAGTGG AGATC // LOCUS HUMFOLA3 358 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human dihydrofolate reductase gene (DHFR), exon 4. ACCESSION X00857 KEYWORDS dihydrofolate reductase; reductase. SEGMENT 3 of 5 SOURCE Human MTX-resistant cell line 6A3 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 358) AUTHORS Yang,J.K., Masters,J.N. and Attardi,G. TITLE Human dihydrofolate reductase gene organization: Extensive conservation of the G+C-rich 5' non-coding sequence and strong intron size divergence from homologous mammalian genes JOURNAL J. Mol. Biol. 176, 169-187 (1984) STANDARD simple staff_review FEATURES from to/span description pept + 21 + 147 dihydrofolate reductase, exon 4 /nomgen="DHFR" /map="5q11.1-13.2" /hgml_locus_uid="LD0137D" IVS < 1 20 DHFR intron C IVS 148 > 358 DHFR intron D BASE COUNT 116 A 54 C 64 G 124 T ORIGIN Chromosome 5q11.1-q13.2. 1 AATTCTTTTT TTAACCTTAG GGAACCTCCA CAAGGAGCTC ATTTTCTTTC CAGAAGTCTA 61 GATGATGCCT TAAAACTTAC TGAACAACCA GAATTAGCAA ATAAAGTAGA CATGGTCTGG 121 ATAGTTGGTG GCAGTTCTGT TTATAAGGTA AGTGTAGTTG TTGGTAAATA AAAACAAATG 181 TTCTGTTCTA CTTTCTTTCA GCTTCCTTGC CATTTAAAGG AAGTGGAAGA AACAGGTATA 241 ACTCATTGTC TGAACCTTTA AAAAATTAAG CCATGCATAT AGGAAGCAGC ACATTATCTG 301 TACTGTAAAT TATTTGTTGT TAGTTCGAAA ATTATTTGTT TGTTTGATAC TATACAAA // LOCUS HUMFOLA4 605 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human dihydrofolate reductase gene (DHFR), exon 5. ACCESSION X00858 KEYWORDS dihydrofolate reductase; reductase. SEGMENT 4 of 5 SOURCE Human MTX-resistant cell line 6A3 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 605) AUTHORS Yang,J.K., Masters,J.N. and Attardi,G. TITLE Human dihydrofolate reductase gene organization: Extensive conservation of the G+C-rich 5' non-coding sequence and strong intron size divergence from homologous mammalian genes JOURNAL J. Mol. Biol. 176, 169-187 (1984) STANDARD simple staff_review FEATURES from to/span description pept + 350 + 465 dihydrofolate reductase, exon 5 /nomgen="DHFR" /map="5q11.1-q13.2" /hgml_locus_uid="LD0137D" IVS < 1 349 DHFR intron D IVS 466 > 605 DHFR intron E BASE COUNT 212 A 99 C 103 G 191 T ORIGIN Chromosome 5q11.1-q13.2. 1 TTTTTCTATG GAGAGCAACT AGATAATTTG GGGCTCTTAA CTCTAAACAG CCAAAGGGAA 61 AACAATAAAA GAGAATGCTT AAATTAAAAA GTTCTTGTAA GCAAACTGGA GGCCAGACTA 121 TATTTAAACT CAGTCTCTCA TAATTATTGA TAATGGTTAG TGTCATTCTC AATATAACTA 181 AGGCATACTT TGCAGCACCC CAGATGAAAG AATAGTATAT TTTAATAATA TGCTTTGGCA 241 GCACCAAGCA TATTTTTAAT ACCTAGTATA AACTTAACTA AACATAAATC TTAGAGTTTC 301 TTTGTCTTTT GACACACATT AAGAAAACTG ATGTTGTTTT ATTTCAAAGG AAGCCATGAA 361 TCACCCAGGC CATCTTAAAC TATTTGTGAC AAGGATCATG CAAGACTTTG AAAGTGACAC 421 GTTTTTTCCA GAAATTGATT TGGAGAAATA TAAACTTCTG CCAGAGTAAG TATAAGGTTA 481 TTAATTAGTC TGAAGCACTT TGGATTTCCT GCTTAAGACT ATAGAAAATA ACATGTCTTT 541 TCAGACTACA AATTGGGTTC ACTGTGTACT GCTAGGATGA TGGGTGCACC AAAATCTCAC 601 AGATC // LOCUS HUMFOLA5 372 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human dihydrofolate reductase gene (DHFR), exon 6. ACCESSION X00859 KEYWORDS dihydrofolate reductase; reductase. SEGMENT 5 of 5 SOURCE Human MTX-resistant cell line 6A3 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 372) AUTHORS Yang,J.K., Masters,J.N. and Attardi,G. TITLE Human dihydrofolate reductase gene organization: Extensive conservation of the G+C-rich 5' non-coding sequence and strong intron size divergence from homologous mammalian genes JOURNAL J. Mol. Biol. 176, 169-187 (1984) STANDARD simple staff_review FEATURES from to/span description pept + 201 279 dihydrofolate reductase, exon 6 /nomgen="DHFR" /map="5q11.1-q13.2" /hgml_locus_uid="LD0137D" IVS < 1 200 DHFR intron E BASE COUNT 114 A 50 C 67 G 141 T ORIGIN Chromosome 5q11.1-q13.2. 1 TAGTTTACAG TTTTTTTCTT ACCATTTGCA TATGTAGTAT ACTAGCATGA TATTTATCCA 61 ACTTGACAGT GGCTTACCAT TGTGAAGAAA AAAGGGAAAC CTTTTATTTG CTTTATGCAC 121 TATTGATGCA TTTAAACTGA GAATCAGGGA AGCTGTGTTT TTAATGGACA CATAATTTAA 181 TTATATATTT TTTCTTACAG ATACCCAGGT GTTCTCTCTG ATGTCCAGGA GGAGAAAGGC 241 ATTAAGTACA AATTTGAAGT ATATGAGAAG AATGATTAAT ATGAAGGTGT TTTCTAGTTT 301 AAGTTGTTCC CCCTCCCTCT GAAAAAAGTA TGTATTTTTA CATTAGAAAA GGTTTTTTGT 361 TGACTTTAGA TC // LOCUS HUMFOLLI1 4665 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human follistatin gene, exons 1-5. ACCESSION M19480 J03771 KEYWORDS follistatin. SEGMENT 1 of 2 SOURCE Human testicle, cDNA to mRNA, clones HTF1[02,06-09,10-12]; and human lymphocyte DNA, clone HFG 102. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4665) AUTHORS Shimasaki,S., Koga,M., Esch,F., Cooksey,K., Mercado,M., Koba,A., Ueno,N., Ying,S.-Y., Ling,N. and Guillemin,R. TITLE Primary structure of the human follistatin precursor and its genomic organization JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 4218-4222 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by N.Ling, 28-APR-1988. FEATURES from to/span description pept 28 112 follistatin precursor, exon 1 2122 2313 follistatin precursor, exon 2 2744 2962 follistatin precursor, exon 3 3309 3533 follistatin precursor, exon 4 4236 + 4466 follistatin precursor, exon 5 sigp 28 112 follistatin signal peptide, exon 1 sigp 2122 2124 follistatin signal peptide, exon 2 matp 2125 2313 follistatin, exon 2 2744 2962 follistatin, exon 3 3309 3533 follistatin, exon 4 4236 + 4466 follistatin, exon 5 IVS 113 2121 follistatin, intron A IVS 2314 2743 follistatin, intron B IVS 2963 3308 follistatin, intron C IVS 3534 4235 follistatin, intron D IVS 4467 > 4665 follistatin, intron E BASE COUNT 1129 A 1155 C 1069 G 1312 T ORIGIN 1 GCTCCTCGCC CCGCGCCTGC CCCCAGGATG GTCCGCGCGA GGCACCAGCC GGGTGGGCTT 61 TGCCTCCTGC TGCTGCTGCT CTGCCAGTTC ATGGAGGACC GCAGTGCCCA GGGTAAGCGA 121 GTGGGGATGC GCTGGGGAGG CTTGGCTGAG GACAGGGGGG TCGACTTTTC TGTGGTCTCC 181 ACTTGGTCTG TTCTGAGCAA CGCTGCTCTC GGAAGATGTT GAACCAACTT GGGGACTCCG 241 GGATGGAGAG GAGTGGGAAA ATTGTAGGCA ATGACACTTT AGGACTGGGA ACTGAGTTTG 301 GAGCTCGGTT ATCTTGAAAA AGGAAAGGGA GCGAGCTGGG GTCTTAGTAG TTAGGGTTAA 361 ATGCTCCAAG TTCTGGCCAT GATAACTTTA GATGTGATCT CCTCTGTTTG GCTCTGAAGA 421 AGGGAACAAG TATCAGTAGT AGCCTGGAGA TTTCAGTCGA CCCATTTGCA GTCTCATCTC 481 TCTCCCTCCC ACATCTTTTG GATACTTAAA AAAAAAAAAG GAAAGAAAGA AAAGAAAACC 541 TGGGGGTTTG GGGGAGAGAT ACTGAATTTT ATTTGGTGGC TGGGGGACGT GGTTTCAGAA 601 ACTTTGCCCA AACTTGGAAA TGTGGGTGCT GAAGATTTTC AAGGGAAGTA GCTTTCTGCC 661 TAGTGGATGC TTCTTCCAAC CCCATCCCCC ATTCTCGCAA AAAAAAAAAA AAAAAAAAAA 721 AAAAAAAAAA GCCAAATTAT TACCCCAAAG TGCTTTGAGC ACTTTACAGT TTTTTTCCCC 781 TCTGCCTCAT TCTTCTTTCT GACCCTTAGC TTTCTGTAAA ACAGTGAGCA GCCCATAACT 841 GACTTAAATT CTGAAATGGG TTAAGAGCAA ATTTCTGAAG CTTGCCTTTT AAAAATAGAT 901 TTGTTTATTT CAATTAGTCT TTGGGCAGAA TGTTTAAAAT AATAATAGCA GCTTGCGATC 961 CAATGATTGT TAAAGTCATA TTGGTGATTC ATGAGAAGTC TTCTCGACCC AAATCCGAAC 1021 ATTAAAAAGG GGGAAGAGGG AGAAATCACT TGCAAATATT TCTTGCTAGG TATGAGATAG 1081 GTACTTATAA CCGTGAAATG GTAGGAAGAC TCCCCCCTCC CCCCACCTTA AGGCATTACT 1141 TTCCCTGCTT CAGCGTTAAC ATTTTAAGTA TGTTGTAAAC TGCCCTCCTG GCACTGCCCC 1201 TCGCCGTCTG CATGGCACTG CGCTGGGTTG CCCGCCTTTA GAGGAGAGGC TCTGCCTGGT 1261 GCCCGGACGT GGGCGCGGAG GAAAGCAGCC CTCGGAGATT TCGAAAGATC GCGGTGGCCG 1321 CGGGCGTGCT CCGGTCTGGC GGATTGCAAA ACGGGCCCGT CGGGCTGGCT GCGCTGGAGG 1381 CCGCGCGGGG TAGCTCCGGG CTGGGCCGGG TTGCTTTTTT GCTTTCATTT TTCAGAAGTG 1441 CCTTTTGCTT GATCTGTTTC TCTCTCTCTC TCTTTCTTTT CTTTCTTCCT CTCTCTCTTT 1501 CTTTCTTCCT TTTTTTCTTT CTTTCTCTTT CTTTTCTTTC TTTCTCTTCT TTCTTTTCTT 1561 TCTTTCTTTC TCTTTCCTTC TCCCTCTCTC TCCCCTCCAT CTCTCCTCTC TCCTCTCTCT 1621 CGTTCTCTTA CTTTCTTAAC CCCTGTCACA AATAATTCCT GTACCCGCCT ACATGAAACC 1681 AGCATGTAAA AACATCCGTC GCATCCTGTT TTTGTCAGGT TCTTTAATCT GCTCTTCGAG 1741 TCTCTGCAGG TTATGAAATG GGACGAATAA AAGTAAACAG TCTAGTAAAA GTCAATGCAA 1801 GCTGCACGTG TTGTGTCTGG GTCACTGGTA ACTGACATTG ATATGGCTGG GGCGCCCTGT 1861 CTTCTCCCTC TCTCCCTCCC TCCCTCTCGC CCACCTCCCA TCTCTGTGAT CAGGGCTTCC 1921 CCCTCCACTG CCTTCTTTTT CCACCCCTCC ACCCCTTTCG ATTTATTTCC TACTTTTCTC 1981 CCGCGTCTCT CTCACTTCCC CTCCTCCACG CTCACCCCCT CCCCATCCCC GCCGGGTCTC 2041 CTTCGCTAGC CACCTCGCTC TCCCTGCCCT GCCACCGCTC ACTGCTCACT CACCCACCTC 2101 CCCACCCTTG TCTCTTCACA GCTGGGAACT GCTGGCTCCG TCAAGCGAAG AACGGCCGCT 2161 GCCAGGTCCT GTACAAGACC GAACTGAGCA AGGAGGAGTG CTGCAGCACC GGCCGGCTGA 2221 GCACCTCGTG GACCGAGGAG GACGTGAATG ACAACACACT CTTCAAGTGG ATGATTTTCA 2281 ACGGGGGCGC CCCCAACTGC ATCCCCTGTA AAGGTAGGAC TCCTTCTTCC CAACTTGCAG 2341 GCCCTCAGTA GAGGGCGTCT TACCCTTAGC TTCCCCACTA CCTGACTGGG GTTTGGGAGT 2401 AGGAGAGCTT TGTTCCTGGG CTTCCCCTTC CTGTCCCTTG CCCTGGTAAG CCGTGCAGAC 2461 TCTAATTCTG CCTGTTACAG GCTGTAGGGA ATACACGCCA GACTTCTTAG CCAAGTGTGG 2521 TGGTGAAACC CACCAACCTG TGCTCCTTAA TGAGAGAGAT CTGGGTGTGG GCACAGCCCA 2581 AGGTCCACAC TCTTTCACCA ACTCCCAATA TTCCAGGAGA GAGCCTGGGC CCCTCCAGCG 2641 CAAACTCAGG GCTGCATGAT TGCGCAAGGC ACCCGAAGCC CTCCTGGCTG ACCTGCAGAC 2701 TGCCTGGCTC TGGTTTTAAT CCATGCCTGT TTCTAACTCA CAGAAACGTG TGAGAACGTG 2761 GACTGTGGAC CTGGGAAAAA ATGCCGAATG AACAAGAAGA ACAAACCCCG CTGCGTCTGC 2821 GCCCCGGATT GTTCCAACAT CACCTGGAAG GGTCCAGTCT GCGGGCTGGA TGGGAAAACC 2881 TACCGCAATG AATGTGCACT CCTAAAGGCA AGATGTAAAG AGCAGCCAGA ACTGGAAGTC 2941 CAGTACCAAG GCAGATGTAA AAGTAGGTCC TACCCTGTTG AGCAAGACTG GATCTGTCCC 3001 CTCCTCCAGC TTTGTACCTA AAGTAGACCC TCTAGAAGAC CCTTGGGGGA TGGTGTAGTC 3061 CGCAGTAAGA GCCTGATAAT AGTAATACTG AAACCAAATA AAGGAGTCCT TTTCTAACCT 3121 CTAGAGATTC ATTAAGAACA CTGAGGGGAC CAACCTAGTC ATAGATTCTC TCTTGAAAAC 3181 TACAGGGCTC CCTAAGTGCC TTTTGAAAGC TGGATGCTTC AGTGTCATGA TTTCCTTGGT 3241 AACTTCAAGT GCTCACTCCC TAAGGACTAG AAGGTACCTA TTCATGTGTG TTTCCTTCTT 3301 TGTTCCAGAG ACTTGTCGGG ATGTTTTCTG TCCAGGCAGC TCCACATGTG TGGTGGACCA 3361 GACCAATAAT GCCTACTGTG TGACCTGTAA TCGGATTTGC CCAGAGCCTG CTTCCTCTGA 3421 GCAATATCTC TGTGGGAATG ATGGAGTCAC CTACTCCAGT GCCTGCCACC TGAGAAAGGC 3481 TACCTGCCTG CTGGGCAGAT CTATTGGATT AGCCTATGAG GGAAAGTGTA TCAGTAGGTA 3541 TTCTGGATTG AGGAAGGAAA AAGAGAAAAC AGGCTAGTTC TATTATTAAA CTGTGGGGTT 3601 AACTAATAAG TAAAGCCCAA GGCGTCCCCA AACACCATAG GGAGAAATAC GCTGCAATTT 3661 GGGGAAAGTG TTGTGACCAC AGTATTCCTC ATGGAAACCA TTGTCTTCTG GAGGCATTGA 3721 CACATATATT CAAATGCCAG CAGGAAGCAA GGAACAGTAT TCCGTCTTAG AAAACTTAGA 3781 ACTTACTCAA TTTTACACAT TTTTTTAAGT GCCAGACTTG CTGGAAGCGA AAAATAATTA 3841 CTTAGCAGTT CCAGAAATCT GTTGTCAGAT TCTAGTAATT AATGGAATTC TTTCTTTTGT 3901 AAAAGATCCC ACTTGTGGAA ATAATAAGAT ACATATTTAA CTTGAGAATA TTGTAAAATC 3961 CTGTTCTTAA AAAAATACCC TTTTAATGCA CTAATGTAGT ATGTAAGAAA CTGCAGGGGT 4021 TTTGTGCGTG TGTATGTGTG TGTGCATTTG AGTTTCAGTT TTATTATCCA GCATTTTTGC 4081 ATAATATCTC CATTACCCCC ATTAGTAATA GGCTATTACT ATTATGTTTA TATAAAATAA 4141 ATTATGTTTA TATTTATTGA TAGAGGACTA GAGAAAGGGA GAAAAGGGGG ATATGGGGAA 4201 ATCAGTTTAC TCATCACAGA TGTATTATAT CCTAGAAGCA AAGTCCTGTG AAGATATCCA 4261 GTGCACTGGT GGGAAAAAAT GTTTATGGGA TTTCAAGGTT GGGAGAGGCC GGTGTTCCCT 4321 CTGTGATGAG CTGTGCCCTG ACAGTAAGTC GGATGAGCCT GTCTGTGCCA GTGACAATGC 4381 CACTTATGCC AGCGAGTGTG CCATGAAGGA AGCTGCCTGC TCCTCAGGTG TGCTACTGGA 4441 AGTAAAGCAC TCCGGATCTT GCAACTGTAA GTGCGATTTT TAACCTTGCT GCCATTTAAG 4501 GCTTTCCCAG GCAATCCCTA GGGAATGGAC ACTTACAAAG CACGCAGATC TCCCATAAAT 4561 CCATTTCTGT TCAAATTAGG TAGCTGCTAA GTATCACCAG CAATTCAATA ATCCACAGAA 4621 AATTCTCTGC GATGTTTCTT GGCTTTTAGG ACTTATCTGG TGATC // LOCUS HUMFOLLI2 1303 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human follistatin gene, exon 6. ACCESSION M19481 J03771 KEYWORDS follistatin. SEGMENT 2 of 2 SOURCE Human testicle, cDNA to mRNA, clones HTF1[02,06-09,10-12]; and human lymphocyte DNA, clones HFG 10[1,3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1303) AUTHORS Shimasaki,S., Koga,M., Esch,F., Cooksey,K., Mercado,M., Koba,A., Ueno,N., Ying,S.-Y., Ling,N. and Guillemin,R. TITLE Primary structure of the human follistatin precursor and its genomic organization JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 4218-4222 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by N.Ling, 28-APR-1988. FEATURES from to/span description pept + 263 264 follistatin, exon 6 pep$ + 527 609 follistatin, exon 6 (alt.) pre-msg < 1 669 follistatin mRNA and introns IVS < 1 262 follistatin, intron D IVS < 1 526 follistatin, intron D (alt.) variant 125 125 t in genomic clone; a in cDNA BASE COUNT 380 A 251 C 224 G 448 T ORIGIN 1 GATCATCAAT GGGGTTGCCT CTAGAAATCT ATTTCCAGTT GTTTTCCCCT ATTTCTTGTG 61 TTTTGGTAGT GTGCTTTGCT GTTTGCTTTA TTAACATTTG AGTCTAAACT AACTGCTTCA 121 AAAGTTTTTT TTTTTTTTCC AACGACAGCT TCATATTATC ACACATGGGC TGCTGCTTTT 181 TGCAGTTGCC TCTACTAACT CTGAATTAAG GACCCAAAGC AGTTATACCT AGAACACAAG 241 AGCGCTTTTT ATCTAATTTC AGGAATCTGC CCGTAAAACC TGAGCCATTG ATTCTTCAGA 301 ACTTTCTGCA GTTTTTGACT TCATAGATTA TGCTTTAAAA AATTTTTTTT AACTTATTGC 361 ATAACAGCAG ATGCCAAAAA CAAAAAAAGC ATCTCACTGC AAGTCACATA AAAATGCAAC 421 GCTGTAATAT GGCTGTATCA GAGGGCTTTG AAAACATACA CTGAGCTGCT TCTGCGCTGT 481 TGTTGTCCGT ATTTAAACAA CAGCTCCCCT GTATTCCCCC ATCTAGCCAT TTCGGAAGAC 541 ACCGAGGAAG AGGAGGAAGA TGAAGACCAG GACTACAGCT TTCCTATATC TTCTATTCTA 601 GAGTGGTAAA CTCTCTATAA GTGTTCAGTG TTCACATAGC CTTTGTGCAA AAAAAAAAAA 661 AAAAAAAAAG AAAAAGAAAA AAAGAAAAAT ATATTGTCCA TACTGTAAAT AAGTGTATGC 721 TTATTTATTT GGGGGGAAAA CTATACATTA AAGGACCTTT GTCCTAAAGC TCTCTCCCAG 781 GCCACCTTGT TACTCATTGG ACACGGAGAG GCATTCATTG TGAGGTCTAC TGGATGAGGC 841 CCATAGTTGA GACTTGTAGA CATTTATTTA TACTGTGTCA TGTTTTATAA TTTATACATA 901 AAATGTCTGG TTGACTGTAT ACCTTGTTTT TGAAGAAATT TATTCGTGAA AGGAAGAGCA 961 GTTGTTATTT ATTGTGAGGT CTCTTGCTTG TAAAGTAAAA GCTTTTTTTC CTTGTAAACC 1021 ATTTAAGTCC ATTCCTTACT ATTCACTCAC TCATCTGTCT CCCTTCATTT CACTGTTAGA 1081 CTCTTTTCCA CTTTCAACAA ACTTGCATGT CAGTTTCTGT CATGTTTATT TATTGGATTC 1141 TCTGCTGCCT GATCTGTACA TACATGATCC CTCGGGTTTT GTTTACAAGG AACCTTGACT 1201 GACCAAAAGG CATTATAACT CTGACTCAAA TACAAGGTAC AGAAGATAAG CATCTTTGAG 1261 GAAACTCCTA CTTCAGTTCT TTTGTTATGA TGAAGACATT TGT // LOCUS HUMFOS 6210 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human fos proto-oncogene (c-fos), complete cds. ACCESSION K00650 M16287 KEYWORDS fos oncogene; oncogene; proto-oncogene. SOURCE Human lymphoblastic cell line (CCRF-CM) DNA, clone pc-fos(human)-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4165) AUTHORS van Straaten,F., Mueller,R., Curran,T., Van Beveren,C. and Verma,I.M. TITLE Complete nucleotide sequence of a human c-onc gene: Deduced amino acid sequence of the human c-fos protein JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 3183-3187 (1983) STANDARD simple staff_review REFERENCE 2 (sites; promoter region) AUTHORS Treisman,R. TITLE Transient accumulation of c-fos RNA following serum stimulation requires a conserved 5' element and c-fos 3' sequences JOURNAL Cell 42, 889-902 (1985) STANDARD simple staff_review REFERENCE 3 (bases 4166 to 6210) AUTHORS Verma,I.M., Deschamps,J., Van Beveren,C. and Sassone-Corsi,P. TITLE Human fos gene JOURNAL Cold Spring Harb. Symp. Quant. Biol. 51, 949-958 (1986) STANDARD simple staff_review COMMENT C-fos is the human cellular homolog of the v-fos oncogene of Finkel-Biskis-Jinkins murine osteosarcoma virus (FBJ-MuSV). [2] It was found that both human and murine c-fos genes contained an enhancer-like element in their 5' noncoding regions that was necessary for increased transcription following serum activation. The FBJ-MuSV v-fos oncogene contains a deletion relative to murine and human c-fos proto-oncogenes that causes complete divergence of the COOH terminal protein sequences encoded. That deletion corresponds to positions 3182-3285 inclusive of this sequence. The FBJ-MuSV v-fos sequence is more closely related to murine than human c-fos sequences. The FBJ-MuSV v-fos coding sequence ends at a "tag" stop codon coresponding to positions 3434-2436 of this sequence [1]. [1] notes two alu repeats beginning aproximately 500 and 1700 nucleotides downstream of the last base in this sequence. A TATA box is located at positions 701-707. Two potential polyadenylation signals are present in the 3' untranslated region. FEATURES from to/span description pept 889 1029 c-fos protein, exon 1 /nomgen="FOS" /map="14q24.3-q31" /hgml_locus_uid="LS0061U" 1783 2034 c-fos protein, exon 2 2466 2573 c-fos protein, exon 3 2688 3329 c-fos protein, exon 4 pre-msg 734 > 3329 c-fos mRNA [1] IVS 1030 1782 c-fos intron A IVS 2035 2465 c-fos intron B IVS 2574 2687 c-fos intron C site 402 453 transcriptional activator region [2] BASE COUNT 1497 A 1571 C 1619 G 1523 T ORIGIN Chromosome 14q24.3-q31. 1 GCAGGAACAG TGCTAGTATT GCTCGAGCCC GAGGGCTGGA GGTTAGGGGA TGAAGGTCTG 61 CTTCCACGCT TTGCACTGAA TTAGGGCTAG AATTGGGGAT GGGGGTAGGG GCGCATTCCT 121 TCGGGAGCCG AGGCTTAAGT CCTCGGGGTC CTGTACTCGA TGCCGTTTCT CCTATCTCTG 181 AGCCTCAGAA CTGTCTTCAG TTTCCGTACA AGGGTAAAAA GGCGCTCTCT GCCCCATCCC 241 CCCCGACCTC GGGAACAAGG GTCCGCATTG AACCAGGTGC GAATGTTCTC TCTCATTCTG 301 CGCCGTTCCC GCCTCCCCTC CCCCAGCCGC GGCCCCCGCC TCCCCCCGCA CTGCACCCTC 361 GGTGTTGGCT GCAGCCCGCG AGCAGTTCCC GTCAATCCCT CCCCCCTTAC ACAGGATGTC 421 CATATTAGGA CATCTGCGTC AGCAGGTTTC CACGGCCTTT CCCTGTAGCC CTGGGGGGAG 481 CCATCCCCGA AACCCCTCAT CTTGGGGGGC CCACGAGACC TCTGAGACAG GAACTGCGAA 541 ATGCTCACGA GATTAGGACA CGCGCCAAGG CGGGGGCAGG GAGCTGCGAG CGCTGGGGAC 601 GCAGCCGGGC GGCCGCAGAA GCGCCCAGGC CCGCGCGCCA CCCCTCTGGC GCCACCGTGG 661 TTGAGCCCGT GACGTTTACA CTCATTCATA AAACGCTTGT TATAAAAGCA GTGGCTGCGG 721 CGCCTCGTAC TCCAACCGCA TCTGCAGCGA GCAACTGAGA AGCCAAGACT GAGCCGGCGG 781 CCGCGGCGCA GCGAACGAGC AGTGACCGTG CTCCTACCCA GCTCTGCTTC ACAGCGCCCA 841 CCTGTCTCCG CCCCTCGGCC CCTCGCCCGG CTTTGCCTAA CCGCCACGAT GATGTTCTCG 901 GGCTTCAACG CAGACTACGA GGCGTCATCC TCCCGCTGCA GCAGCGCGTC CCCGGCCGGG 961 GATAGCCTCT CTTACTACCA CTCACCCGCA GACTCCTTCT CCAGCATGGG CTCGCCTGTC 1021 AACGCGCAGG TAAGGCTGGC TTCCCGTCGC CGCGGGGCCG GGGGCTTGGG GTCGCGGAGG 1081 AGGAGACACC GGGCGGGACG CTCCAGTAGA TGAGTAGGGG GCTCCCTTGT GCCTGGAGGG 1141 AGGCTGCCGT GGCCGGAGCG GTGCCGGCTC GGGGGCTCGG GACTTGCTCT GAGCGCACGC 1201 ACGCTTGCCA TAGTAAGAAT TGGTTCCCCC TTCGGGAGGC AGGTTCGTTC TGAGCAACCT 1261 CTGGTCTGCA CTCCAGGACG GATCTCTGAC ATTAGCTGGA GCAGACGTGT CCCAAGCACA 1321 AACTCGCTAA CTAGAGCCTG GCTTCTTCGG GGAGGTGGCA GAAAGCGGCA ATCCCCCCTC 1381 CCCCGGCAGC CTGGAGCACG GAGGAGGGAT GAGGGAGGAG GGTGCAGCGG GCGGGTGTGT 1441 AAGGCAGTTT CATTGATAAA AAGCGAGTTC ATTCTGGAGA CTCCGGAGCG GCGCCTGCGT 1501 CAGCGCAGAC GTCAGGGATA TTTATAACAA ACCCCCTTTC AAGCAAGTGA TGCTGAAGGG 1561 ATAACGGGAA CGCAGCGGCA GGATGGAAGA GACAGGCACT GCGCTGCGGA ATGCCTGGGA 1621 GGAAAAGGGG GAGACCTTTC ATCCAGGATG AGGGACATTT AAGATGAAAT GTCCGTGGCA 1681 GGATCGTTTC TCTTCACTGC TGCATGCGGC ACTGGGAACT CGCCCCACCT GTGTCCGGAA 1741 CCTGCTCGCT CACGTCGGCT TTCCCCTTCT GTTTTGTTCT AGGACTTCTG CACGGACCTG 1801 GCCGTCTCCA GTGCCAACTT CATTCCCACG GTCACTGCCA TCTCGACCAG TCCGGACCTG 1861 CAGTGGCTGG TGCAGCCCGC CCTCGTCTCC TCTGTGGCCC CATCGCAGAC CAGAGCCCCT 1921 CACCCTTTCG GAGTCCCCGC CCCCTCCGCT GGGGCTTACT CCAGGGCTGG CGTTGTGAAG 1981 ACCATGACAG GAGGCCGAGC GCAGAGCATT GGCAGGAGGG GCAAGGTGGA ACAGGTGAGG 2041 AACTCTAGCG TACTCTTCCT GGGAATGTGG GGGCTGGGTG GGAAGCAGCC CCGGAGATGC 2101 AGGAGCCCAG TACAGAGGAT GAAGCCACTG ATGGGGCTGG CTGCACATCC GTAACTGGGA 2161 GCCCTGGCTC CAAGCCCATT CCATCCCAAC TCAGACTCTG AGTCTCACCC TAAGAAGTAC 2221 TCTCATAGTT TCTTCCCTAA GTTTCTTACC GCATGCTTTC AGACTGGGCT CTTCTTTGTT 2281 CTCTTGCTGA GGATCTTATT TTAAATGCAA GTCACACCTA TTCTGCAACT GCAGGTCAGA 2341 AATGGTTTCA CAGTGGGGTG CCAGGAAGCA GGGAAGCTGC AGGAGCCAGT TCTACTGGGG 2401 TGGGTGAATG GAGGTGATGG CAGACACTTT TACTGAATGT CGGTCTTTTT TTGTGATTAT 2461 TCTAGTTATC TCCAGAAGAA GAAGAGAAAA GGAGAATCCG AAGGGAAAGG AATAAGATGG 2521 CTGCAGCCAA ATGCCGCAAC CGGAGGAGGG AGCTGACTGA TACACTCCAA GCGGTAGGTA 2581 CTCTGTGGGT TGCTCCTTTT TAAAACTTAA GGGAAAGTTG GAGATTGAGC ATAAGGGCCC 2641 TTGAGTAAGA CTGTGTCTTA TGCTTTCCTT TATCCCTCTG TATACAGGAG ACAGACCAAC 2701 TAGAAGATGA GAAGTCTGCT TTGCAGACCG AGATTGCCAA CCTGCTGAAG GAGAAGGAAA 2761 AACTAGAGTT CATCCTGGCA GCTCACCGAC CTGCCTGCAA GATCCCTGAT GACCTGGGCT 2821 TCCCAGAAGA GATGTCTGTG GCTTCCCTTG ATCTGACTGG GGGCCTGCCA GAGGTTGCCA 2881 CCCCGGAGTC TGAGGAGGCC TTCACCCTGC CTCTCCTCAA TGACCCTGAG CCCAAGCCCT 2941 CAGTGGAACC TGTCAAGAGC ATCAGCAGCA TGGAGCTGAA GACCGAGCCC TTTGATGACT 3001 TCCTGTTCCC AGCATCATCC AGGCCCAGTG GCTCTGAGAC AGCCCGCTCC GTGCCAGACA 3061 TGGACCTATC TGGGTCCTTC TATGCAGCAG ACTGGGAGCC TCTGCACAGT GGCTCCCTGG 3121 GGATGGGGCC CATGGCCACA GAGCTGGAGC CCCTGTGCAC TCCGGTGGTC ACCTGTACTC 3181 CCAGCTGCAC TGCTTACACG TCTTCCTTCG TCTTCACCTA CCCCGAGGCT GACTCCTTCC 3241 CCAGCTGTGC AGCTGCCCAC CGCAAGGGCA GCAGCAGCAA TGAGCCTTCC TCTGACTCGC 3301 TCAGCTCACC CACGCTGCTG GCCCTGTGAG GGGGCAGGGA AGGGGAGGCA GCCGGCACCC 3361 ACAAGTGCCA CTGCCCGAGC TGGTGCATTA CAGAGAGGAG AAACACATCT TCCCTAGAGG 3421 GTTCCTGTAG ACCTAGGGAG GACCTTATCT GTGCGTGAAA CACACCAGGC TGTGGGCCTC 3481 AAGGACTTGA AAGCATCCAT GTGTGGACTC AAGTCCTTAC CTCTTCCGGA GATGTAGCAA 3541 AACGCATGGA GTGTGTATTG TTCCCAGTGA CACTTCAGAG AGCTGGTAGT TAGTAGCATG 3601 TTGAGCCAGG CCTGGGTCTG TGTCTCTTTT CTCTTTCTCC TTAGTCTTCT CATAGCATTA 3661 ACTAATCTAT TGGGTTCATT ATTGGAATTA ACCTGGTGCT GGATATTTTC AAATTGTATC 3721 TAGTGCAGCT GATTTTAACA ATAACTACTG TGTTCCTGGC AATAGTGTGT TCTGATTAGA 3781 AATGACCAAT ATTATACTAA GAAAAGATAC GACTTTATTT TCTGGTAGAT AGAAATAAAT 3841 AGCTATATCC ATGTACTGTA GTTTTTCTTC AACATCAATG TTCATTGTAA TGTTACTGAT 3901 CATGCATTGT TGAGGTGGTC TGAATGTTCT GACATTAACA GTTTTCCATG AAAACGTTTT 3961 ATTGTGTTTT TAATTTATTT ATTAAGATGG ATTCTCAGAT ATTTATATTT TTATTTTATT 4021 TTTTTCTACC TTGAGGTCTT TTGACATGTG GAAAGTGAAT TTGAATGAAA AATTTAAGCA 4081 TTGTTTGCTT ATTGTTCCAA GACATTGTCA ATAAAAGCAT TTAAGTTGAA TGCGACCAAC 4141 CTTGTGCTCT TTTCATTCTG GAAGTCTTGT AAGTTTCTGA AAGGTATTAT TGGAGACCAG 4201 TTTGTCAAGA AGGGTAGCTG CTGGAGGGGG ACACACCCTC TGTCTGATCC CTTATCAAAG 4261 AGGACAAGGA AACTATAGAG CTGATTTTAG AATATTTTAC AAATACATGC CTTCCATTGG 4321 AATGCTAAGA TTTTCTACTG CTTCTGGGGA CGGGAAACCG CTGTGTAACA GCTTTTGTGG 4381 GAATACATTT TTTCTGTTTC AGTACTCGCA GGGGGAAATA TTTAAATTTT GTTGTGCTAA 4441 TATTAAATTC AGATGTTTTG ATCTTAAAGG AACCCTTTAA GCAAACAGAA CCTAGCTTTG 4501 TACAGACTAT TTTAACTTTT TATTCTCACA AAATCACGTG GAGGGTTATT CTACTTCAAA 4561 GATGAGCAAA TTGAAGAATG GTTAGAATAA ACAACTTTCT TGATATTCCG TTATCGGCAT 4621 TAGAATCTTC CTGCTCGTTA TCGTATCCAG CAGGCTGAAC TGCCTCTTGA TACTTGGTTA 4681 AAAAAAATTT TCAGGCCGGG CGCGGTGGCC CATGCCTGTA ATCCTAGCAC TTTGGGAGGC 4741 CGAGGCAGGC GGATCACCTG AGGTCGGGAG TTCGAGACCA GCCTGACCAA CATGGAGAAA 4801 CCCCGTCTTT ACTAAAAATA CAAAATTAGC CTGGTGTGGT GGTGCATGCC TGTAATCCTA 4861 GCTACTTGAG AGGCTGAGAC AGGAAAATCA CTTGAACTCG GGAGGCGGAT GTTGCAGCGA 4921 ACTGAGATTG CGCCATTGCA CTCCAGCCTG GGCAACAAGA TTGAAACTCT GTTTAAAAAA 4981 AAAAGTTTTC ACTAATGTGT ACATTTTTTT GTACTCTTTT ATTCTCGAAA GGGAAGGAGG 5041 GCTATTGCCC TATCCCTTAT TAATAAATGC ATTGTGGTTT CTGGTTTCTC TAATACCATA 5101 TGCCCTTCAT TCAGTTTATA GTGGGCGGAA GTGGGGGAGA AAAAGTTGCT CAGAAATCAA 5161 AAGATATCTC AAACAGCACA AATAATGGCT GATCGTTCTG CAAACAAAAA GTTACATAAT 5221 AGCTCAAGAA GGAGAAGTCA ACATGACTCT GAACAAGCTT TAACTTAGAA ACTTTATCAT 5281 CTTAAGGAAG AACGTGACCT TTGTCCAGGA CGTCTCTGGT AATGGGGCAC TTACACACAC 5341 ATGCACACGT ACAAACCACA GGGAAAGGAG ACCGCCCTTC TGCCTCTGCT CGCGAGTATC 5401 ACGCAGGCAC CATGCACTAT GTTTTCACAC ACACTGGGTG GAAGAAGAGC TTCAGCGCCA 5461 GTCTTCTAAT GCTTTGGTGA TAATGAAAAT CACTGGGTGC TTATGGGGTG TCATATTCAA 5521 TCGAGTTAAA AGTTTTAATT CAAAATGACA GTTTTACTGA GGTTGATGTT CTCGTCTATG 5581 ATATCTCTGC CCCTCCCATA AAAATGGACA TTTAAAAGCA ACTTACCGCT CTTTAGATCA 5641 CTCCTATATC ACACACCACT TGGGGTGCTG TTTCTGCTAG ACTTGTGATG ACAGTGGCCT 5701 TAGGATCCCT GTTTGCTGTT CAAAGGGCAA ATATTTTATA GCCTTTAAAT ATACCTAAAC 5761 TAAATACAGA ATTAATATAA CTAACAAACA CCTGGTCTGA AATAACAAGG TGATCTACCC 5821 TGGAAGGAAC CCAGCTGGTG GGCCAGGAGC GGTGGCTCAC ACCTGTAATT CCAGCACTTT 5881 GGGAGGCTGA GACAGGAGGA TCACTGGAGT CCAGGAGTTT GAGACCAGCC TGGGCAACAT 5941 GGCAAAACCC AGTGTGCTTC TGTTGTCCCA GCTACACTAC TCAGGAGGCT GAGGCAGGAG 6001 TATGACTTGA GCCTGGGAGG GGGAGGTTGC AGAGAACTGA TATTGCACCA CCACTGCACT 6061 CCAGCCTGGG TGACACAGCA AAACCCTATC TCAAAAAAAA AAAAAAAAAA AAGGAACCCA 6121 GCTGGTTCCT GTAGGTGTGC AATAATAACA ACCAGAGGAA GAAAAGGAAG ACGATTTCCC 6181 AGATGAAGAA GGGCAGCTGG ACCTTCGGAC // LOCUS HUMFSH1 720 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human follicle-stimulating hormone beta-subunit gene, exon 1. ACCESSION M21219 M18536 KEYWORDS follicle stimulating hormone. SEGMENT 1 of 3 SOURCE Human placenta DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 720) AUTHORS Jameson,L. TITLE Human follicle-stimulating beta-subunit gene encodes multiple mRNAs JOURNAL Mol. Endocrinol. (1988) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by L.Jameson, 16-MAY-1988. FEATURES from to/span description pre-msg 531 > 720 FSHB mRNA exon 1 and intron /nomgen="FSHB" /map="11p13" /hgml_locus_uid="LW0166F" IVS 564 > 720 FSHB intron A (alt.) IVS 594 > 720 FSHB intron A (alt.) BASE COUNT 226 A 125 C 139 G 230 T ORIGIN 385 bp upstream of HindIII site; chromosome 11p13. 1 TTGCTAAAAC ATCAAAAAAT AAAGATTTGT GATTAGGGGG CCAGAATGTG GGAGGGAAAG 61 AGAGATACAG TTCACACTTT TAGACAGGAG CCAGATCATG AAATGTTTTC TCTTTGTTTG 121 TTTCTTCCTT CACAGCTTTT GATATGCTCT TGGAGCAATT TATTAACCAT ATTTTTTAAT 181 GCATCTCCTG AACAGAGTCA AAGCAATACT TGGAAAGGAC TCTGAATTTC CTGATTTAAA 241 GATACAAAAG AAAAATCTGG AGTCACAATT AATTTGAGAA GGTAAAGGAG TGGGTGTGCT 301 ACTGTATCAA ATTTAATTTG TACAAAATCA TCATCTCTAG TAACATTATT TTTTCTAATC 361 TACTGCGTTT AGACTACTTT AGTAAAGCTT GATCTCCCTG TCTATCTAAA CACTGATTCA 421 CTTACAGCAA GCTTCAGGCT AGCATTGGTC ATATTAATAC CCAACAAATC CACAAGGTGT 481 TAGTTGCACA TGATTTTGTA TAAAAGGTGA ACTGAGATTT CATTCAGTCT ACAGCTCTTG 541 CCAGGCAAGG CAGCCGACCA CAGGTGAGTC TTGGCATCTA CCGTTTTCAA GTGGTGACAG 601 CTACTTTTGA AATTACAGAT TTGTCAGGCA TGGAGGACAA AACTAGAGCT TCTCACTACT 661 GTTGTGTAGG AAATTTATGC TTGTCAACCT GGCTTGTAAA ATATGGTTAA TATAACGTAA // LOCUS HUMFSH2 519 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human follicle-stimulating hormone beta-subunit gene, exon 2. ACCESSION M21220 M18536 KEYWORDS follicle stimulating hormone. SEGMENT 2 of 3 SOURCE Human placenta DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 519) AUTHORS Jameson,L. TITLE Human follicle-stimulating beta-subunit gene encodes multiple mRNAs JOURNAL Mol. Endocrinol. (1988) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by L.Jameson, 16-MAY-1988. FEATURES from to/span description pept 153 + 311 follicle stimulating hormone, exon 2 (first expressed exon) /nomgen="FSHB" /map="11p13" /hgml_locus_uid="LW0166F" sigp 153 200 follicle stimulating hormone signal peptide matp 201 + 311 follicle stimulating hormone pre-msg < 1 > 519 FSHB mRNA exon 1 and introns IVS < 1 146 FSHB intron A IVS 312 > 519 FSHB intron B BASE COUNT 138 A 88 C 121 G 172 T ORIGIN About 0.85 kb after segment 1; chromosome 11p13. 1 AAGGCATAAG GAAGGAAAAA AAATTTTGAA GCAAAATGTA TTGAGGAGGA TGAGCAGACC 61 AATTATTTTT GGTTTGGTCA GCTTACATAA TGATTATCGT TCTTTGGTTT CTCAGTTTCT 121 AGTGGGCTTC ATTGTTTGCT TCCCAGACCA GGATGAAGAC ACTCCAGTTT TTCTTCCTTT 181 TCTGTTGCTG GAAAGCAATC TGCTGCAATA GCTGTGAGCT GACCAACATC ACCATTGCAA 241 TAGAGAAAGA AGAATGTCGT TTCTGCATAA GCATCAACAC CACTTGGTGT GCTGGCTACT 301 GCTACACCAG GGTAGGTACC ATGTTTTGCT GGAAGCAAGG GTTGCTCGAA GGTCTGTATT 361 ACCGGGGTTT CATTAGTTTC TACTTTATCA ATATTTTATG TATTCTAAGT AACAGCCATG 421 AGTCCTTTAG CCAAGACTGT CTGTGTTGTG ATTGGGGTTA ATGACCACGA TATCACTTAG 481 ATGTTTGGGC TTGGATTTGA TTTGGGTAAA TTTAGGAAA // LOCUS HUMFSH3 1813 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human follicle-stimulating hormone beta-subunit gene, exon 3. ACCESSION M21221 M18536 KEYWORDS follicle stimulating hormone. SEGMENT 3 of 3 SOURCE Human placenta DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1813) AUTHORS Jameson,L. TITLE Human follicle-stimulating beta-subunit gene encodes multiple mRNAs JOURNAL Mol. Endocrinol. (1988) In press STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by L.Jameson, 16-MAY-1988. FEATURES from to/span description pept + 103 333 follicle stimulating hormone, exon 3 /nomgen="FSHB" /map="11p13" /hgml_locus_uid="LW0166F" matp + 103 333 follicle stimulating hormone pre-msg < 1 > 1813 FSHB mRNA exon 1 and introns IVS < 1 102 FSHB intron B BASE COUNT 537 A 354 C 368 G 554 T ORIGIN About 1.1 kb after segment 2; chromosome 11p13. 1 TAAATGTTAG AGCAAGCAGT ATTCAATTTC TGTCTCATTT TGACTAAGCT AAATAGGAAC 61 TTCCACAATA CCATAACCTA ACTCTCTTCT TAAACTCCTC AGGATCTGGT GTATAAGGAC 121 CCAGCCAGGC CCAAAATCCA GAAAACATGT ACCTTCAAGG AACTGGTATA TGAAACAGTG 181 AGAGTGCCCG GCTGTGCTCA CCATGCAGAT TCCTTGTATA CATACCCAGT GGCCACCCAG 241 TGTCACTGTG GCAAGTGTGA CAGCGACAGC ACTGATTGTA CTGTGCGAGG CCTGGGGCCC 301 AGCTACTGCT CCTTTGGTGA AATGAAAGAA TAAAGATCAG TGGACATTTC AGGCCACATA 361 CCCTTGTCCT GAAGGACCAA GATATTCAAA AAGTCTGTGT GTGTGCAATG TGCCCAGGGG 421 ACAAACCACT GGATCAGGGG ATTCAGACTC TACTGATCCC TGGTCTACTG GCAGAGGGAA 481 CTCTGGGAAT TGAGAGTGCT GGGGGCCAGG ACTCCATCAT GATTCAGCTC TATATTCCTA 541 GGTCTGATTT CATAAGGTTT ATTCAGTCTT AACTCACAGA CTTGTGCCTG GTTTCTTCTT 601 TAAAAATCTT AGAAATCTTC TCAGGCAATG CCTCTCTCTT AGGGGGAAAC ATAAGCCTAG 661 AAGGAGGAAG CAGTAATGGG AGTGAGTGAA AGAACTAACT GCAGCAGTCT TCTGGTAGAC 721 TCTTGGGCCC TCTAGAGCAA GGTCAGCATC TTCAGCATTG TAGCGTCAAT GCCTAGCACT 781 CTGCCTGGAA CTTAGAAACA CAACAATGGC TTCTTTAGAT CAGAAAGGTC AAGGGTAGAA 841 AATACTGGAA GAGGATGTTT GAGGTAAGCT GATGAGGCTG CCCGCAGCAC ACCAGTCCCA 901 TGAAAGTTAG TGGCATCAGT TTCACCTCGC CTTTTCTCCA GCACATGAGT ATTGAGACAT 961 GATGTGTCTT TCTGAATTGT TTGGTACAGA TGGGGAGTAA CAGAGCTCGA AGATTTCCAA 1021 GCTATTACTA CCAAGCCTGT TAGTTAAGGG CAAAGGCAAG AAATTGTAAT TTGGGGCTGT 1081 GGAAATTAGC CTGCCTCTAT TCATTACTTA AACAAATTGA TCACATGCTA CTAGGCTCCT 1141 GCAAACTCCT TTTTGAGATA AAGGGAAAAA ACCAAACTAT CTCACCCTAC CCTCCCTAGG 1201 ATCCACTTCT TTGGAATGAC AAAGGATTTG AAAGTAGGTT TGAAAGCAGT TTCAGCAATT 1261 TAATAAATAT AATTAATTTG TCTACCAAAT ATATTTGTAT AAATAATAGC TCCTTTAGAA 1321 AGAATTAGCC ATGGGGGGAT CGAGGGGAAA CTGCTGTTTT CTAGGATCCT GTCTACATCA 1381 ATCTTCTATT TTATCCATCC ATGTTCTCCC AAATCTGTGC TTTCTTTCAA CAGGTTATAT 1441 ATTAAAACTA TTTCATGAGT TGATTTCTTT TAAACGTGTT AACTGTCTTA GTTATGCACT 1501 CAGTTTCACA CTCATATTGT TTAACTAATT TATTTAAAGT CTTATTTTTT TAATAAAGAT 1561 GCTAGCCACC AGAGTCACGG CTTGGATTGT TTTATGTACA AACAGATGAC TTAGAAATTC 1621 TGTATTTTAT AATAATATTA GTGGAATGAA ATCTTAAAAT ATAATTCCCA GTGTTTCTAT 1681 AAATATTACC TTTCCTTATC TTTGGAGATA TTAAAAATAA TTTTGTTGGA TTTCTGAAGT 1741 GTTTTGTCAC TTAAATTTCC TGTCATTTTT TGAAGACATT TTCTGATGTA ATTTGGGAGA 1801 AAAAAAGCAT AGA // LOCUS HUMFX2 185 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human factor X (blood coagulation factor) gene, exon 2. ACCESSION L00391 M14327 KEYWORDS Stuart factor; blood coagulation factor; factor X; glycoprotein; serine protease. SEGMENT 2 of 8 SOURCE Human DNA (libraries of Lawn et al., and Yoshitake et al.). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 185) AUTHORS Leytus,S.P., Foster,D.C., Kurachi,K. and Davie,E.W. TITLE Gene for human factor X: A blood coagulation factor whose gene organization is essentially identical with that of factor IX and protein C JOURNAL Biochemistry 25, 5098-5102 (1986) STANDARD full staff_review FEATURES from to/span description pept + 13 + 173 preprofactor X, exon 2 /nomgen="F10" /map="13q34" /hgml_locus_uid="LW0143E" sigp + 13 62 preprofactor X signal peptide matp 63 + 173 factor X light chain pre-msg < 1 > 185 FX mRNA IVS < 1 12 FX intron A IVS 174 > 185 FX intron B BASE COUNT 51 A 45 C 59 G 30 T ORIGIN Undetermined number of bp after segment 1; chromosome 13q34. 1 CCTGCCTTCC AGTGTTCATC CGCAGGGAGC AGGCCAACAA CATCCTGGCG AGGGTCACGA 61 GGGCCAATTC CTTTCTTGAA GAGATGAAGA AAGGACACCT CGAAAGAGAG TGCATGGAAG 121 AGACCTGCTC ATACGAAGAG GCCCGCGAGG TCTTTGAGGA CAGCGACAAG ACGGTAAGGG 181 CTGGG // LOCUS HUMFX3 49 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human factor X (blood coagulation factor) gene, exon 3. ACCESSION L00392 M14327 KEYWORDS Stuart factor; blood coagulation factor; factor X; glycoprotein; serine protease. SEGMENT 3 of 8 SOURCE Human DNA (libraries of Lawn et al., and Yoshitake et al.). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 49) AUTHORS Leytus,S.P., Foster,D.C., Kurachi,K. and Davie,E.W. TITLE Gene for human factor X: A blood coagulation factor whose gene organization is essentially identical with that of factor IX and protein C JOURNAL Biochemistry 25, 5098-5102 (1986) STANDARD full staff_review FEATURES from to/span description pept + 13 + 37 preprofactor X, exon 3 /nomgen="F10" /map="13q34" /hgml_locus_uid="LW0143E" matp + 13 + 37 factor X light chain pre-msg < 1 > 49 FX mRNA IVS < 1 12 FX intron B IVS 38 > 49 FX intron C BASE COUNT 16 A 5 C 7 G 21 T ORIGIN About 7.4 kb after segment 2; chromosome 13q34. 1 TTTTCCTTTT AGAATGAATT CTGGAATAAA TACAAAGGTC AGTATTTTT // LOCUS HUMFX4 138 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human factor X (blood coagulation factor) gene, exon 4. ACCESSION L00393 M14327 KEYWORDS Stuart factor; blood coagulation factor; factor X; glycoprotein; serine protease. SEGMENT 4 of 8 SOURCE Human DNA (libraries of Lawn et al., and Yoshitake et al.). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 138) AUTHORS Leytus,S.P., Foster,D.C., Kurachi,K. and Davie,E.W. TITLE Gene for human factor X: A blood coagulation factor whose gene organization is essentially identical with that of factor IX and protein C JOURNAL Biochemistry 25, 5098-5102 (1986) STANDARD full staff_review FEATURES from to/span description pept + 13 + 126 preprofactor X, exon 4 /nomgen="F10" /map="13q34" /hgml_locus_uid="LW0143E" matp + 13 + 126 factor X light chain pre-msg < 1 > 138 FX mRNA IVS < 1 12 FX intron C IVS 127 > 138 FX intron D BASE COUNT 38 A 32 C 36 G 32 T ORIGIN About 950 bp after segment 3; chromosome 13q34. 1 CTCTCTTTGC AGATGGCGAC CAGTGTGAGA CCAGTCCTTG CCAGAACCAG GGCAAATGTA 61 AAGACGGCCT CGGGGAATAC ACCTGCACCT GTTTAGAAGG ATTCGAAGGC AAAAACTGTG 121 AATTATGTAG GTTCCTCT // LOCUS HUMFX5 156 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human factor X (blood coagulation factor) gene, exon 5. ACCESSION L00394 M14327 KEYWORDS Stuart factor; blood coagulation factor; factor X; glycoprotein; serine protease. SEGMENT 5 of 8 SOURCE Human DNA (libraries of Lawn et al., and Yoshitake et al.). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 156) AUTHORS Leytus,S.P., Foster,D.C., Kurachi,K. and Davie,E.W. TITLE Gene for human factor X: A blood coagulation factor whose gene organization is essentially identical with that of factor IX and protein C JOURNAL Biochemistry 25, 5098-5102 (1986) STANDARD full staff_review FEATURES from to/span description pept + 13 + 144 preprofactor X, exon 5 /nomgen="F10" /map="13q34" /hgml_locus_uid="LW0143E" matp + 13 + 144 factor X light chain pre-msg < 1 > 156 FX mRNA IVS < 1 12 FX intron D IVS 145 > 156 FX intron E BASE COUNT 32 A 49 C 48 G 27 T ORIGIN About 1.8 kb after segment 4; chromosome 13q34. 1 CTCCTTTGGC AGTCACACGG AAGCTCTGCA GCCTGGACAA CGGGGACTGT GACCAGTTCT 61 GCCACGAGGA ACAGAACTCT GTGGTGTGCT CCTGCGCCCG CGGGTACACC CTGGCTGACA 121 ACGGCAAGGC CTGCATTCCC ACAGGTAGGA GGCACG // LOCUS HUMFX6 269 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human factor X (blood coagulation factor) gene, exon 6. ACCESSION L00395 M14327 KEYWORDS Stuart factor; blood coagulation factor; factor X; glycoprotein; serine protease. SEGMENT 6 of 8 SOURCE Human DNA (libraries of Lawn et al., and Yoshitake et al.). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 269) AUTHORS Leytus,S.P., Foster,D.C., Kurachi,K. and Davie,E.W. TITLE Gene for human factor X: A blood coagulation factor whose gene organization is essentially identical with that of factor IX and protein C JOURNAL Biochemistry 25, 5098-5102 (1986) STANDARD full staff_review FEATURES from to/span description pept + 13 + 257 preprofactor X, exon 6 /nomgen="F10" /map="13q34" /hgml_locus_uid="LW0143E" matp + 13 47 factor X light chain matp 57 + 257 factor X heavy chain matp 213 + 257 factor Xa heavy chain pre-msg < 1 > 269 FX mRNA IVS < 1 12 FX intron E IVS 258 > 269 FX intron F BASE COUNT 68 A 82 C 82 G 37 T ORIGIN About 2.9 kb after segment 5; chromosome 13q34. 1 TCTTTCTTTC AGGGCCCTAC CCCTGTGGGA AACAGACCCT GGAACGCAGG AAGAGGTCAG 61 TGGCCCAGGC CACCAGCAGC AGCGGGGAGG CCCCTGACAG CATCACATGG AAGCCATATG 121 ATGCAGCCGA CCTGGACCCC ACCGAGAACC CCTTCGACCT GCTTGACTTC AACCAGACGC 181 AGCCTGAGAG GGGCGACAAC AACCTCACCA GGATCGTGGG AGGCCAGGAA TGCAAGGACG 241 GGGAGTGTCC CTGGCAGGTA ACAGTAGGA // LOCUS HUMFX7 142 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human factor X (blood coagulation factor) gene, exon 7. ACCESSION L00396 M14327 KEYWORDS Stuart factor; blood coagulation factor; factor X; glycoprotein; serine protease. SEGMENT 7 of 8 SOURCE Human DNA (libraries of Lawn et al., and Yoshitake et al.). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 142) AUTHORS Leytus,S.P., Foster,D.C., Kurachi,K. and Davie,E.W. TITLE Gene for human factor X: A blood coagulation factor whose gene organization is essentially identical with that of factor IX and protein C JOURNAL Biochemistry 25, 5098-5102 (1986) STANDARD full staff_review FEATURES from to/span description pept + 13 + 130 preprofactor X, exon 7 /nomgen="F10" /map="13q34" /hgml_locus_uid="LW0143E" matp + 13 + 130 factor X heavy chain matp + 13 + 130 factor X Xa heavy chain pre-msg < 1 > 142 FX mRNA IVS < 1 12 FX intron F IVS 131 > 142 FX intron G BASE COUNT 38 A 35 C 38 G 31 T ORIGIN About 3.4 kb after segment 6; chromosome 13q34. 1 CGTCTGTCAC AGGCCCTGCT CATCAATGAG GAAAACGAGG GTTTCTGTGG TGGAACCATT 61 CTGAGCGAGT TCTACATCCT AACGGCAGCC CACTGTCTCT ACCAAGCCAA GAGATTCAAG 121 GTGAGGGTAG GTAAGTGACC AA // LOCUS HUMFX8 624 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human factor X (blood coagulation factor) gene, exon 8. ACCESSION N00045 M14327 KEYWORDS Stuart factor; blood coagulation factor; factor X; glycoprotein; serine protease. SEGMENT 8 of 8 SOURCE Human DNA (libraries of Lawn et al., and Yoshitake et al.). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 624) AUTHORS Leytus,S.P., Foster,D.C., Kurachi,K. and Davie,E.W. TITLE Gene for human factor X: A blood coagulation factor whose gene organization is essentially identical with that of factor IX and protein C JOURNAL Biochemistry 25, 5098-5102 (1986) STANDARD full staff_review FEATURES from to/span description pept + 13 614 preprofactor X, exon 8 /nomgen="F10" /map="13q34" /hgml_locus_uid="LW0143E" matp + 13 611 factor X heavy chain matp + 13 + 458 factor Xa heavy chain matp 459 611 activation peptide pre-msg < 1 624 FX mRNA IVS < 1 12 FX intron G BASE COUNT 140 A 193 C 195 G 96 T ORIGIN About 1.7 kb after segment 7; chromosome 13q34. 1 CGTCTGTCCC AGGGGACCGG AACACGGAGC AGGAGGAGGG CGGTGAGGCG GTGCACGAGG 61 TGGAGGTGGT CATCAAGCAC AACCGGTTCA CAAAGGAGAC CTATGACTTC GACATCGCCG 121 TGCTCCGGCT CAAGACCCCC ATCACCTTCC GCATGAACGT GGCGCCTGCC TGCCTCCCCG 181 AGCGTGACTG GGCCGAGTCC ACGCTGATGA CGCAGAAGAC GGGGATTGTG AGCGGCTTCG 241 GGCGCACCCA CGAGAAGGGC CGGCAGTCCA CCAGGCTCAA GATGCTGGAG GTGCCCTACG 301 TGGACCGCAA CAGCTGCAAG CTGTCCAGCA GCTTCATCAT CACCCAGAAC ATGTTCTGTG 361 CCGGCTACGA CACCAAGCAG GAGGATGCCT GCCAGGGGGA CAGCGGGGGC CCGCACGTCA 421 CCCGCTTCAA GGACACCTAC TTCGTGACAG GCATCGTCAG CTGGGGAGAG GGCTGTGCCC 481 GTAAGGGGAA GTACGGGATC TACACCAAGG TCACCGCCTT CCTCAAGTGG ATCGACAGGT 541 CCATGAAAAC CAGGGGCTTG CCCAAGGCCA AGAGCCATGC CCCGGAGGTC ATAACGTCCT 601 CTCCATTAAA GTGAGATCCC ACTC // LOCUS HUMIAIG1 922 bp ds-DNA PRI 01-SEP-1988 DEFINITION Human Ia-associated invariant gamma-chain gene, exon 1. ACCESSION M13555 KEYWORDS Ia-associated gamma chain; cell surface glycoprotein. SEGMENT 1 of 6 SOURCE Human liver DNA, clones lambda-y[1,2,3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 922) AUTHORS O'Sullivan,D.M., Larhammar,D., Wilson,M.C., Peterson,P.A. and Quaranta,V. TITLE Structure of the human Ia-associated invariant (gamma)-chain gene: Identification of 5' sequences shared with major histocompatibility complex class II genes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 4484-4488 (1986) STANDARD simple staff_review FEATURES from to/span description pept 796 + 920 Ia-associated gamma chain gene, exon 1 pre-msg 1 795 Ia-a mRNA and introns IVS 921 922 Ia-a intron A (no splice consensus at 922) BASE COUNT 219 A 246 C 231 G 226 T ORIGIN 1 CGGATCAGAC TTGACTGTTC CCATACATTC ACCTTCCCAC ATTTCCAGCC CTTTGGAACC 61 CTGAAGATAT CCTCTTCTCT GTCCTGTCAC TCTATGGGAC TTACGGCTCT TTGTTAAAAT 121 ACTATTTAAG CAAGGCCCCT AAGCCACTGC CTTGAGAGAG AAATACTTCT CAACTGAGGC 181 CTCTACTGCA TAATGGGTAC AGCATGGGTT AATCAACTTT GGCTGATTTT TCTTTTGTTA 241 AGTTGACTTT TGTTTCTAAG GGAGCGTGTC AACTAAGAAC CTAAACAGGG AAGGAAAGGA 301 AATGTTTTCA CCCCTGCACG TCGTCCAAAC TTCCTCCTGC GTGTTTGAGA CGACATAACG 361 GAGGCCAGAG AAGAGGACTT GTCTGAGCTC ACCAAGGGTG CACAGCCCCT TTCTGCACCC 421 ACTCATCTGT GGATGAGAAA GGGGAGACAA ACAAAGGTGT CTTCTGTTTC AAAGTGCTTT 481 CCTGTCTTGG GAGTGGACAT TTGCCTTTTT CTTGAAACAT TCAAAGAGCC TTATGAATCC 541 AAAGGCCTGC CCAGAAACAA GTGATGAGGG CCTTGGGCCG CCAATGGGGT CGTGCTGGCC 601 TTTCCACCTT GCTTGGGGAG CCCCCCCGCC CCACACCTTG CCCCGCAAAA GGCAGTTCCA 661 CCAAAGTGGG GTATTTCCAG CCTTTGTAGC TTTCACTTCC ACATCTACCA AGTGGGCGGA 721 GTGGCCTTCT GTGGACGAAT CAGATTCCTC TCCAGCACCG ACTTTAAGAG GCGAGCCGGG 781 GGGTCAGGGT CCCAGATGCA CAGGAGGAGA AGCAGGAGCT GTCGGGAAGA TCAGAAGCCA 841 GTCATGGATG ACCAGCGCGA CCTTATCTCC AACAATGAGC AACTGCCCAT GCTGGGCCGG 901 CGCCCTGGGG CCCCGGAGAG GT // LOCUS HUMIAIG2 1139 bp ds-DNA PRI 01-SEP-1988 DEFINITION Human Ia-associated invariant gamma-chain gene, exons 2-4. ACCESSION M13556 KEYWORDS Ia-associated gamma chain; cell surface glycoprotein. SEGMENT 2 of 6 SOURCE Human liver DNA, clones lambda-y[1,2,3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1139) AUTHORS O'Sullivan,D.M., Larhammar,D., Wilson,M.C., Peterson,P.A. and Quaranta,V. TITLE Structure of the human Ia-associated invariant (gamma)-chain gene: Identification of 5' sequences shared with major histocompatibility complex class II genes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 4484-4488 (1986) STANDARD simple staff_review FEATURES from to/span description pept + 38 210 Ia-associated gamma chain, exon 2 402 481 Ia-associated gamma chain, exon 3 1037 + 1099 Ia-associated gamma chain, exon 4 pre-msg < 1 > 1139 IVS < 1 37 Ia-a intron A 211 401 Ia-a intron B 482 1036 Ia-a intron C 1100 > 1139 Ia-a intron D BASE COUNT 234 A 377 C 298 G 230 T ORIGIN About 5.4 kb after segment 1. 1 GGATCCTCCT CTGACCTATC CTCCCCACCT CCCACAGCAA GTGCAGCCGC GGAGCCCTGT 61 ACACAGGCTT TTCCATCCTG GTGACTCTGC TCCTCGCTGG CCAGGCCACC ACCGCCTACT 121 TCCTGTACCA GCAGCAGGGC CGGCTGGACA AACTGACAGT CACCTCCCAG AACCTGCAGC 181 TGGAGAACCT GCGCATGAAG CTTCCCAAGC GTGCGTGCAC CCCTACATCC TTGATACCCC 241 CCACCTCCCA CCATCCCTCA ACTCAGAGAC CCGCATCCCT GCACCCAGCT GGGCCCATGT 301 CCTCTCCCTC CGGTTTGGAA TTCCAGCCCT TCCTCATCTG GGTCTGATAC CCTCCTCCCT 361 GGGCACCGGG GCCACACTTA CCCTCGTTCC TGTCCCCACA GCTCCCAAGC CTGTGAGCAA 421 GATGCGCATG GCCACCCCGC TGCTGATGCA GGCGCTGCCC ATGGGAGCCC TGCCCCAGGG 481 GGTAAGGACA GCCCCAGGGT GGTGGGAGGG GCAAGGTTAT CCCGCCTGGA TGGAGGACAG 541 TGCCAAGGGG AGGGGGAGGG AAGAGAGCCC ACCTCGGGAG GGGTCCTGAC TGCTGCGGGA 601 GGGACAGTGC CTGCCTCAGG AAGAATCGGG CTCCCCAGGT GTGGAGGGCA CAGGTGAAGA 661 GTCTCTTGGT GCCATCCCTG GAGGAAGGCT CAGCCCTCTA CAGTTTACAA AGTGCTTCTC 721 ATTTCCTATA GCATCTCACT GTCCTCTCCC ATTCTCAAAG ACCTTGCTAT CATGCATTGA 781 CAATATTTAT ATTCACAATA CTGTGCTGTG GACAAAACCC TGGGCGGGAA AGCTTATGCC 841 AGTTTGACCA ATGAGGACAT TGAGGCAGGA AGCTAAAGTG ACTTGCTCGA GCTCTATGTT 901 TGGAGGTGGC AGAGGTGGAA CCATTGACCA AGTGCCCGCG ATTCCAGTCT CTATCCAGAT 961 CCCAGCAACC GGCTCCTGCT CCATACCCCC TGCTCAGGGA CCAGCTCTGG TAACCTTCTG 1021 TTAGCTCCTC CCACAGCCCA TGCAGAATGC CACCAAGTAT GGCAACATGA CAGAGGACCA 1081 TGTGATGCAC CTGCTCCAGG TGAGTGCAGG GAGCTAGCTG GGTGGTCCTG CCTGCCCCA // LOCUS HUMIAIG4 1242 bp ds-DNA PRI 01-SEP-1988 DEFINITION Human Ia-associated invarient gamma-chain gene, exons 5 and 6. ACCESSION M13558 KEYWORDS Ia-associated gamma chain; cell surface glycoprotein. SEGMENT 4 of 6 SOURCE Human liver DNA, clone lambda-y[1,2,3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1242) AUTHORS O'Sullivan,D.M., Larhammar,D., Wilson,M.C., Peterson,P.A. and Quaranta,V. TITLE Structure of the human Ia-associated invariant (gamma)-chain gene: Identification of 5' sequences shared with major histocompatibility complex class II genes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 4484-4488 (1986) STANDARD simple staff_review FEATURES from to/span description pept + 26 121 Ia-associated gamma chain, exon 5 437 + 524 Ia-associated gamma chain, exon 6 pre-msg < 1 > 1242 Ia-a mRNA and introns IVS < 1 25 Ia-a intron D 122 436 Ia-a intron E 525 > 1242 Ia-a intron F BASE COUNT 264 A 359 C 330 G 289 T ORIGIN About 170 bp after segment 3. 1 CCTTCCTTGC CCTGCTACCC TGCAGAATGC TGACCCCCTG AAGGTGTACC CGCCACTGAA 61 GGGGAGCTTC CCGGAGAACC TGAGACACCT TAAGAACACC ATGGAGACCA TAGACTGGAA 121 GGTCAGCAGG TTTCCCTGCA TGGGAACTCT CTCTTTCCTC TGGTGTCTAG GGCAGGGCTA 181 GGAGAGGTGG GGTGAGGGTG GGCTGGGGAA GCCATTCTCA GGAAGCTGAA GGGGTTTACC 241 AGGACTTCCA AAACTTGGAT GCTGCAGAGT ATGTGGGGTT CAGCCCTAGG GGTCTTTAAG 301 AGGAACTAGG CTGCAGCTGG ACCCGGGTGT TGGGGCCCTA TTTGGCCCGC ATGTTTCTGC 361 CCTAGGGGAC AAAGCTAGGC AGTGTAGGGG CTTGGTGGTG GCCATCGAAC CTGACCTCCA 421 CCTCTATCCG TATTAGGTCT TTGAGAGCTG GATGCACCAT TGGCTCCTGT TTGAAATGAG 481 CAGGCACTCC TTGGAGCAAA AGCCCACTGA CGCTCCACCG AAAGGTACAG GGAGTGGGAG 541 CTTTAGCGTG CCAGGGCTTC TGGACCCTCG GGGCTCTCCG GAAGCTGCGG AGGCCGGGGC 601 CTCCAGCACT CCCTAGTCCC AGCACCCGCG GCTCCTATCC TCTCAGCTCT CACTTCTTTC 661 TCACTTCTCT CTCTCTCCTG TCTGTTCCTT CTTTGGTTTG GCTGTCCCCC TCCCCTGACC 721 CACCCCATCT TGTCTAAGGG TTCCTAAGGG CCCACAGAGG CCTGTCACCA CAGGGTACAG 781 GTGACCTCTC TCATAGAGGA TGGTACAGCA CAGGGTACTG GGGTAGAACC TGCCCGAAAC 841 ACTGCAGAAA GGAATCCTTG TAATGACCTT GCCCCCAGTG CTGCCCGCAA TCCAGTAGGG 901 CGCCAAGGCA CAGCTGCTGC CAAGAGAGCC TTGGGCGTTT CCCACCTCAT GGACAATGCA 961 GACCAGGATG TTTTTAGACC CAAAGACAGA GTGCTGTTTC TATCCCGGTG CTGTCTCTAA 1021 TTTGCTATGT AACTTTGGAC AAGTCCCCTT CCCTCTAGGA TTCAGGGTCC TGAAGTAGAA 1081 GGTCAAAGGG CCACCCTGCC TGGGGCCTCA GTTTCTGCAT CAGATTCATA GAAGGCACCT 1141 TACATGCTAT CTCCAACTCC TAGCTGATGC TTAAACCCCT CTAAGACATC TCCAACAAAC 1201 GGTAAACCCC ATTTCTACTT GAGAACTCCC AGTAACAGGA AG // LOCUS HUMIAIG5 157 bp ds-DNA PRI 01-SEP-1988 DEFINITION Human Ia-associated invarient gamma-chain gene, exon 7. ACCESSION M13559 KEYWORDS Ia-associated gamma chain; cell surface glycoprotein. SEGMENT 5 of 6 SOURCE Human liver DNA, clone lambda-y[1,2,3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 157) AUTHORS O'Sullivan,D.M., Larhammar,D., Wilson,M.C., Peterson,P.A. and Quaranta,V. TITLE Structure of the human Ia-associated invariant (gamma)-chain gene: Identification of 5' sequences shared with major histocompatibility complex class II genes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 4484-4488 (1986) STANDARD simple staff_review FEATURES from to/span description pept + 86 + 148 Ia-associated gamma chain, exon 7 pre-msg < 1 > 157 Ia-a mRNA and introns IVS < 1 85 Ia-a intron F 149 > 157 Ia-a intron G BASE COUNT 31 A 41 C 51 G 34 T ORIGIN About 1.5 Kbp after segment 4. 1 CTAGGTGGGA AGCACTGCCC TCAGGCACTC TTGGGTCAGG TAGGCTGCAC TTCAAGTGAC 61 AAACGGACTT GCTGCTCCTT TGCAGAGTCA CTGGAACTGG AGGACCCGTC TTCTGGGCTG 121 GGTGTGACCA AGCAGGATCT GGGCCCAGGT CACAGTG // LOCUS HUMIAIG6 676 bp ds-DNA PRI 01-SEP-1988 DEFINITION Human Ia-associated invarient gamma-chain gene, exon 8. ACCESSION M13560 KEYWORDS Ia-associated gamma chain; cell surface glycoprotein. SEGMENT 6 of 6 SOURCE Human liver DNA, clone lambda-y[1,2,3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 676) AUTHORS O'Sullivan,D.M., Larhammar,D., Wilson,M.C., Peterson,P.A. and Quaranta,V. TITLE Structure of the human Ia-associated invariant (gamma)-chain gene: Identification of 5' sequences shared with major histocompatibility complex class II genes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 4484-4488 (1986) STANDARD simple staff_review FEATURES from to/span description pept + 34 44 Ia-associated gamma chain, exon 8 pre-msg < 1 > 630 Ia-a mRNA and intron IVS < 1 33 Ia-a intron G (no splice consensus at 33) BASE COUNT 155 A 239 C 135 G 147 T ORIGIN About 200 bp after segment 5. 1 ACCCCAACCT CAACCCGCCG CTTCTCTCTC CAGTCCCCAT GTGAGAGCAG CAGAGGCGGT 61 CTTCAACATC CTGCCAGCCC CACACAGCTA CAGCTTTCTT GCTCCCTTCA GCCCCCAGCC 121 CCTCCCCCAT CTCCCACCCT GTACCTCATC CCATGAGACC CTGGTGCCTG GCTCTTTCGT 181 CACCCTTGGA CAAGACAAAC CAAGTCGGAA CAGCAGATAA CAATGCAGCA AGGCCCTGCT 241 GCCCAATCTC CATCTGTCAA CAGGGGCGTG AGGTCCCAGG AAGTGGCCAA AAGCTAGACA 301 GATCCCCGTT CCTGACATCA CAGCAGCCTC CAACACAAGG CTCCAAGACC TAGGCTCATG 361 GACGAGATGG GAAGGCACAG GGAGAAGGGA TAACCCTACA CCCAGACCCC AGGCTGGACA 421 TGCTGACTGT CCTCTCCCCT CCAGCCTTTG GCCTTGGCTT TTCTAGCCTA TTTACCTGCA 481 GGCTGAGCCA CTCTCTTCCC TTTCCCCAGC ATCACTCCCC AAGGAAGAGC CAATGTTTTC 541 CACCCATAAT CCTTTCTGCC GACCCCTAGT TCCCTCTGCT CAGCCAAGCT TGTTATCAGC 601 TTTCAGGGCC ATGGTTCACA TTAGAATAAA AGGTAGTAAT TAGAACACTC TGGTTCCTGC 661 CCTTTCTGTT GAGAGA // LOCUS HUMIFNG 5961 bp ds-DNA PRI 16-FEB-1987 DEFINITION Human immune interferon (ifn-gamma) gene and flanks. ACCESSION J00219 KEYWORDS gamma-interferon; interferon. SOURCE human cdna of mrna ([1],[2],[4],[6]) and dna ([3],[4],[5]). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 366 to 588; 1828 to 1896; 1992 to 2174; 4600 to 5318) AUTHORS Gray,P.W., Leung,D.W., Pennica,D., Yelverton,E., Najarian,R., Simonsen,C.C., Derynck,R., Sherwood,P.J., Wallace,D.M., Berger,S.L., Levinson,A.D. and Goeddel,D.V. TITLE expression of human immune interferon cdna in e.coli and monkey cells JOURNAL Nature 295, 503-508 (1982) STANDARD full staff_review REFERENCE 2 (bases 384 to 588; 1828 to 1897; 4598 to 5312) AUTHORS Devos,R., Cheroutre,H., Taya,Y., Degrave,W., van Heuverswyn,H. and Fiers,W. TITLE molecular cloning of human immune interferon cdna and its expression in eukaryotic cells JOURNAL Nucleic Acids Res. 10, 2487-2501 (1982) STANDARD full staff_review REFERENCE 3 (bases 167 to 708; 1779 to 2259; 4592 to 5396) AUTHORS Taya,Y., Devos,R., Tavernier,J., Cheroutre,H., Engler,G. and Fiers,W. TITLE cloning and structure of the human immune interferon-gamma chromosomal gene JOURNAL EMBO J. 1, 953-958 (1982) STANDARD full staff_review REFERENCE 4 (bases 347 to 483) AUTHORS Derynck,R., Leung,D.W., Gray,P.W. and Goeddel,D.V. TITLE human interferon gamma is encoded by a single class of mrna JOURNAL Nucleic Acids Res. 10, 3605-3615 (1982) STANDARD full staff_review REFERENCE 5 (bases 1 to 5961) AUTHORS Gray,P.W. and Goeddel,D.V. TITLE structure of the human immune interferon gene JOURNAL Nature 298, 859-863 (1982) STANDARD full staff_review REFERENCE 6 (bases 4720 to 5319) AUTHORS Derynck,R., Singh,A. and Goeddel,D.V. TITLE expression of the human interferon-gamma cdna in yeast JOURNAL Nucleic Acids Res. 11, 1819-1837 (1983) STANDARD full staff_review COMMENT Immune interferon (ifn-gamma) shows little homology with leukocyte (ifn-alpha) and fibroblast (ifn-beta) interferons. See other loci beginning . ifn-gamma protein (35,000 to 70,000 mw as presently estimated) has both antiviral and antitumorigenic capabilities. Unlike the other interferons, its gene has three intervening sequences, the first of which contains a repetitive element. [4],[5] argue that there is only one ifn-gamma gene per haploid genome. [3] analyzes the promoter region and reports two sequences homologous to sequences in the promoter regions of chicken albumin and the human ifn-beta genes(bases 234-241; 276-289). [6] studies the ifn-gamma sequence recombined to the pgk promoter segment of yeast (see ). FEATURES from to/span description pept 475 588 immune interferon gamma, exon 1 /nomgen="IFNG" /map="12q24" /hgml_locus_uid="LV0085W" 1828 1896 immune interferon gamma, exon 2 1992 2174 immune interferon gamma, exon 3 4600 4734 immune interferon gamma, exon 4 sigp 478 534 signal peptide pre-msg 347 4919 ifn-gamma mRNA (alt.) [5],[6] pre-msg 347 5319 ifn-gamma mRNA (alt.) [1],[2],[4],[5] pre-msg 350 4919 ifn-gamma mRNA (alt.) [3],[6] pre-msg 350 5319 ifn-gamma mRNA (alt.) [1],[2],[3],[4] IVS 589 1827 ifn-gamma intron A IVS 1897 1991 ifn-gamma intron B IVS 2175 4599 ifn-gamma intron C variant 4712 4712 a in [1]; g in [2],[3],[5] BASE COUNT 1829 A 1012 C 1170 G 1950 T ORIGIN 300bp upstream from mst-ii site, on 8.6kb bamh-i fragment. 1 AGCAAATGAT CAATGTGCTT TGTGAATGAA GAGTCAACAT TTTACCAGGG CGAAGTGGGG 61 AGGTACAAAA AAATTTCCAG TCCTTGAATG GTGTGAAGTA AAAGTGCCTC AAAGAATCCC 121 ACCAGAATGG CACAGGTGGG CATAATGGGT CTGTCTCATC GTCAAAGGAC CCAAGGAGTC 181 TAAAGGAAAC TCTAACTACA ACACCCAAAT GCCACAAAAC CTTAGTTATT AATACAAACT 241 ATCATCCCTG CCTATCTGTC ACCATCTCAT CTTAAAAAAC TTGTGAAAAT ACGTAATCCT 301 CAGGAGACTT CAATTAGGTA TAAATACCAG CAGCCAGAGG AGGTGCAGCA CATTGTTCTG 361 ATCATCTGAA GATCAGCTAT TAGAAGAGAA AGATCAGTTA AGTCCTTTGG ACCTGATCAG 421 CTTGATACAA GAACTACTGA TTTCAACTTC TTTGGCTTAA TTCTCTCGGA AACGATGAAA 481 TATACAAGTT ATATCTTGGC TTTTCAGCTC TGCATCGTTT TGGGTTCTCT TGGCTGTTAC 541 TGCCAGGACC CATATGTAAA AGAAGCAGAA AACCTTAAGA AATATTTTGT AAGTATGACT 601 TTTTAATAGT ACTTGTTTGT GGTTGAAAAT GACTGAATAT CGACTTGCTG TAGCATCTCT 661 GATAGGCTGT CATCTCTTGT AGGCAGTCAT TTTGAGATTT GGTGTTATTT TGTTAATTAT 721 TGACTAGATG AGTTCCTTGA CTAAATAATC TAGATATTGT TTTAACCTTC TGCTCAGTTT 781 GTATAGAGAC TTAAAAGGGA TTTATGAATT TTCCAAAAGA TGGGCATAAT ATGGGTATGA 841 AGCATAATGA TGTTAATAAT TTTGTGGTGG GAACTCATTC AGTTGTGATA GTCAAGGAGT 901 ATGCAGATTG AAAAAAATGA TTGGTTATTA GTTTTTGACT TCTCAGACTC TAAGGTCAAG 961 ATTAGCATTA AAAAGGTAAT AGGAAATGTT TACAAATTAA AGTCAAAAAG GTCCTTAAAG 1021 CTTTGGCTTA AAAAAATAAC TGATAGGTGA TTTTCTCCAA AAAGTGATTT CAACATTCTG 1081 CTTCTCTATC TATATTACTT GTGAAGTATT CCGGAACTTC GTTGCTCACT GGGATTTTGG 1141 AAGAATTATG ATTCTGGCTA AGGAATGTTT AAAAATTTTA AGTGAATTTT TTGAGTTTCT 1201 TTTAAAATTT TATTGATGGT TAATGAAAAG TTTTTACATT TTAAATATTT CATTATTTGT 1261 TTAAAACTTA GCTGTTATAA TTATAGCTGT CATAATAATA TTCAGACATT CACAATTGAT 1321 TTTATTCTTA CAACACAAAA TCAAATCTCA CACACACACA CACACACACA CACTCGCACA 1381 TGTTTGGAAC TATCTTTTAA AGCTCGTATA ATAATACCCT ACAGGAAGGC ACAGTAGATG 1441 TAATAGAAAC CTGTACCATT GGGGGGCAGT ATTTTATAGT GGGGTGGCTT TGCTGTTTTT 1501 TGTTTTTGTA TTTTTTAGCC TAGCTTGAAA ATACTTTCTT TAGCTTACTA TAGTTTTTGG 1561 GACCTTTGGA GTATCAGCTT TGTTGAGCTC ATTTGTGACA TTGCAATTTA ATGGTTATAT 1621 TGGGAAATAA AAAAGCTAAA AGAACATAAT AGTCTTTGTC TATATCTCAC ATAAGCCTTT 1681 TGGGAATACT TATTGTTAGA ACTAAGCAGA AGAGTTGAAA AGGAAATCAG TGAATATTGT 1741 CACATCTGAG TTCAATGAAA CTTGAAATAT ATTTTTAAGG CAATTTATGG GCTAATTGTA 1801 AACCAATTTT TTCTTTTTTT TTTTTAGAAT GCAGGTCATT CAGATGTAGC GGATAATGGA 1861 ACTCTTTTCT TAGGCATTTT GAAGAATTGG AAAGAGGTAA GCTGAATATT CCCATTTGGC 1921 TAATTTTCCT GTTGCTTGCT TTCTGATGGA TAAATTCACA TCATCCTCTG TTTGTGCTCT 1981 TTCCTTCCAA GGAGAGTGAC AGAAAAATAA TGCAGAGCCA AATTGTCTCC TTTTACTTCA 2041 AACTTTTTAA AAACTTTAAA GATGACCAGA GCATCCAAAA GAGTGTGGAG ACCATCAAGG 2101 AAGACATGAA TGTCAAGTTT TTCAATAGCA ACAAAAAGAA ACGAGATGAC TTCGAAAAGC 2161 TGACTAATTA TTCGGTGAGG CTATTTAAAT TCTTTCTTTG GTTTCATTGC CGAGGGTCTT 2221 GCAAAGCATT TATTCTCCAG AAAGTAGACA TTAGCTATTT AACAGTTGCT AAAGCTATGA 2281 ACTCAACTCA TGGCTGAAAC TCTACCTTAC TATTTCCATT CGTGTTTGGG TGACTTTGCA 2341 AAGCCAGTAA GAGAATCGCT GAAGTATGTA ATGTAGAGAA ATGCTGGCAT TGTAACTATT 2401 GCGTAAAGAC AGGTGAGTTG ACAAATTCCA GTGAAGAGGA AGTAGGTGAG GAAGAAGCAG 2461 GGAGTACTGA GAAGCAGTTC TCTCATTGTC CCTTGCTCAT ATGATGGAAA TTCTCTTACT 2521 TTGAATGAGA GGCTGTCTGT CTTAATGGAA AGAGCAGTGG GAGGAGCTGA GAAGATGTGT 2581 GTTCTCCTCC CAACTCAGCC ACCAAGGAAC TGTGATGAAT CACATGGCTG GCTGGGCTCA 2641 GTTTCCTCAT CTTAAAAGGA AACTGTTAGG TTCACTGTAT AAGTTTGATG ACCTTCTTTG 2701 CTCCAAAACT CTACAATGCA AAGAATAGAA AATGAGAATG AGATAGAAGA AAGCTACAGT 2761 CTTTGAATAG GTACCAGGGA CACCCCACTG CAAGTCTCTA GCCAACCTAT CAGATTGTAC 2821 TGCCCAATTA GAAGCAAGAA TGGTTGCTGT TTGTTTGTTT TTAGGGAAAA ATAGATAGAA 2881 TTTATACCTT ATGAAAAGAT TGTTCTATCA ACTCTCTATC AACTTTCAGA ATATCTCAGC 2941 TGGAGAACTC CTTAGACTCC TAAGTCTTAC CTCATGAACT TGTATCTTTA AGTTATGGCT 3001 TCTATAAACA GAAAGATAAC GTTGAGGCAT AAAGACAAAT CATGTTTTTC AGAATGTTTT 3061 CTAGAAGACA AAGGCCTCTA GATTCCTTTG GGGTTGACTT TGATATAAAT GGGCTCAAAT 3121 GAGAGGGACC AGGGTCTTCA AGCTAGCATT TGTGTTCTTA GGATATGTGC TCAGCTTTCA 3181 CTATTGCTGG GCCTGCCTCT CACTCCTCTC ATGTAAGCCC CCAGAAACAG AAAGGAGAGA 3241 CATGGCAACA GGTCTCCTTT GGTTATAAAC TAGACACTCA GCACTTGTTT CTAATCCAGT 3301 GGTGCCCCTG GCTTACTGTT CAGTCCTGGA TAAGTCTCTT AGTTTCTTGG TGATGATTTG 3361 AACATTGGAA AGTAAAATCT GTCACTTGCA AACACACAGC TTGTCGAAAA TTTTTTCTAC 3421 TCTGCAGGAA CTGGGCCTTA AAAAAATGAA AAAAAATCTG TGGTTTCTTC CTTCTGGAAG 3481 CTACAAACCT CCTGTTTCTT GATGGGCAAT CTTGAGTGAG CTCTATTAAT TATTATTCTC 3541 TTTGGCTCAG TTGCTAAGCT ATTTTATGCA TGTTATGCCC TTTGACAATT AGTCTTTAGC 3601 TGTAATCCCC CAGCCATCCT CAGAAATGTG GTGAGGTAGC CATAGTGTTC CCAAGATTAG 3661 AAAAATGTAA TGGCAGAGCC AAGAGGAAGG TAAATGGTCC ACATCTTATG AAGCATCATC 3721 TAAATGGCCC TATTGGTTAG AGTGAGGAGA TGCAAGTAGT TCAATTTGCT TGCCTAGAAG 3781 GCAGGGTACT GGAAAAGTTG TTGCAATTCT TAATTTTAAA CTTTATATAT CAGTAAGCCA 3841 TATATAAATA TGATTGGGGG TGTTTATTTT AAAATCTATT ATGGAAATTG AGAGACTGAC 3901 CTAATCTGGG AGAAATTAAA AATTACAGTT TTCACTCGTT TTGGATTTGG TGTTTTCTAG 3961 GGTACCTAAC CTAGATCAGT GGTTCTCAAA CTTAGGTGGA TGTCAGAATC ACCTGGGGAG 4021 CTTAGTGAAT GCACAGGGCA CAGTCCTTCC ACTTCATGCA CCTGGATCTC TGAGGTCTTT 4081 GACAGGTTTC CGGATTAATC TGCTATGCAC AACAGTGAGA ATCATTGACC TATAGTTACT 4141 CATTTGATGC ATACAGGAAA GACTGAAGTA TAAAGTGATA TAATTGGTAG ATTGATGATA 4201 GAGAGGTCAT AGAAACAGTC TCATCCTCCT TTAGATGAGA AAATAGAAGT TCAGAGAGGT 4261 TAAGTAGCTG GCTCAAGGTC AGAATTATTG CATGCATGAG ATTCAAACCC ACCTTTTTAT 4321 GCTGACTCCA CAACCAGGAG TCTTTTCACT ATATAATTTC AAGAATTCTA TAGAAGTAGA 4381 TTTAAAGATA TGTGATGGAC TCCACCACAT TATAGCACAA CTAGAAATGT AATTGTAATT 4441 TTTAGCTTCA ACTGCTGAAG AAGTAAATAT TGTATATTAA GGTAATACGG TCCATTTTTT 4501 AAAGGAATAC TTTTATTTTC ACTGACCATC ATGACATTAG CAGAATATCC TGATGGCTTA 4561 TATGCCTGAA ATTAATTTTG CTCTTTTCTT TCCCGATAGG TAACTGACTT GAATGTCCAA 4621 CGCAAAGCAA TACATGAACT CATCCAAGTG ATGGCTGAAC TGTCGCCAGC AGCTAAAACA 4681 GGGAAGCGAA AAAGGAGTCA GATGCTGTTT CGAGGTCGAA GAGCATCCCA GTAATGGTTG 4741 TCCTGCCTGC AATATTTGAA TTTTAAATCT AAATCTATTT ATTAATATTT AACATTATTT 4801 ATATGGGGAA TATATTTTTA GACTCATCAA TCAAATAAGT ATTTATAATA GCAACTTTTG 4861 TGTAATGAAA ATGAATATCT ATTAATATAT GTATTATTTA TAATTCCTAT ATCCTGTGAC 4921 TGTCTCACTT AATCCTTTGT TTTCTGACTA ATTAGGCAAG GCTATGTGAT TACAAGGCTT 4981 TATCTCAGGG GCCAACTAGG CAGCCAACCT AAGCAAGATC CCATGGGTTG TGTGTTTATT 5041 TCACTTGATG ATACAATGAA CACTTATAAG TGAAGTGATA CTATCCAGTT ACTGCCGGTT 5101 TGAAAATATG CCTGCAATCT GAGCCAGTGC TTTAATGGCA TGTCAGACAG AACTTGAATG 5161 TGTCAGGTGA CCCTGATGAA AACATAGCAT CTCAGGAGAT TTCATGCCTG GTGCTTCCAA 5221 ATATTGTTGA CAACTGTGAC TGTACCCAAA TGGAAAGTAA CTCATTTGTT AAAATTATCA 5281 ATATCTAATA TATATGAATA AAGTGTAAGT TCACAACTAC TTATGCTGTG TTGGACTTTT 5341 TCTAAGTGAG ACCTGGAGTG AAAGAACTAC CTATTAATGA ATTAGTAGGG AGGGGAGTCT 5401 TCTTAGCTGT GAAAATTTTA GAGTTGCATT TGGTTCCATT AAATGTGGTA TTTCTTTCCA 5461 CTAGCATTTT GTTGGCTTTC GCTTTTCCAG TTAGCAGCTC TTTGAATTAT CTTTCTAAGA 5521 TACAGATTTA ATTATGTCAC TATTCAATTC AGAGGTTCTG CTATGGAATG TAGTTTAAAC 5581 TGCTTAGCTT GGCACACAGA GATTTATTTC TAGCCCCTTC TCCACCTTCC TATTTCCTCC 5641 TTCGTTTCAG AATCTTCCTC TCCCTCATCC AATGCTGGCA AACACCAGTG GGGGTGGAGT 5701 AGTGGGTGTA AGCTCTAGGG AGAAGGCTTG GATTGGAATC CAAGTTATTC CATTACAAGT 5761 AGTGTGACCT TTAATACATT ATGTATATTG TCTAAGTTTC AGCTTTATTG TCTGAAAAAG 5821 AAAAATAATT GTGTGTTCCT CATAATATTG TGGTACGAAT TGATTCTTTC ACTCAAGAAA 5881 TATTTACTGG AGTACCTACT ACATGCCTGG TGCTGTTGTA GACCTTGAGA TACCTTACTC 5941 AAGCAAAACA GCCAAGGATC C // LOCUS HUMIFNINI 5209 bp ss-mRNA PRI 15-JUN-1989 DEFINITION Human gamma-interferon-inducible early-response gene IP-10, complete cds. ACCESSION X02530 M17752 KEYWORDS interferon response; interferon-induced protein. SOURCE Homo sapiens (U 937 histiocytic lymphoma cell line), cDNA to mRNA [1],[2] and spleen and placenta DNA [3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 2135 to 2259; 2813 to 2941; 3177 to 3266; 3684 to 4511) AUTHORS Luster,A.D., Unkeless,J.C. and Ravetch,J.V. TITLE Gamma-interferon transcriptionally regulates an early-response gene containing homology to platelet proteins JOURNAL Nature 315, 672-676 (1985) STANDARD simple automatic REFERENCE 2 (bases 2135 to 2259; 2813 to 2941; 3177 to 3266; 3684 to 4511) AUTHORS Luster,A.D. JOURNAL Unpublished (1986) Rockefeller Univ, New York, NY 10021 STANDARD simple automatic REFERENCE 3 (bases 1 to 5209) AUTHORS Luster,A.D. and Ravetch,J.V. TITLE Genomic characterization of a gamma-interferon-inducible gene (IP-10) and identification of an interferon-inducible hypersensitive site JOURNAL Mol. Cell. Biol. 7, 3723-3731 (1987) STANDARD full staff_entry COMMENT Data kindly reviewed (29-JUL-1986) by Luster A.D. A gamma-interferon-inducible DNaseI hypersensitive site exists from approximately 260 to 60 bases upstream of the transcription start (bp 1874 to 2074). This site is implicated in transcriptional regulation of IP-10 by gamma-interferon. FEATURES from to/span description pept 2201 2261 IP-10 precursor, exon 1 2815 2941 IP-10 precursor, exon 2 3177 3266 IP-10 precursor, exon 3 3684 3702 IP-10 precursor, exon 4 sigp 2201 2261 IP-10 signal peptide (put.) 2815 2816 IP-10 signal peptide (put.) matp 2817 2941 IP-10 mature peptide 3177 3266 IP-10 mature peptide 3684 3699 IP-10 mature peptide pre-msg 2134 4511 IP-10 mRNA + introns pre-msg 2135 4511 IP-10 mRNA + introns (alt.) IVS 2262 2814 IP-10 intron A IVS 2942 3176 IP-10 intron B IVS 3267 3683 IP-10 intron C revision 4477 4480 tgaa in [2]; tga in [1] revision 4485 4487 caa in [2]; ca in [1] variant 2214 2216 cca in [3]; cga in [2] variant 2833 2835 ctg in [3]; ccg in [2] variant 3265 3266 ag in [3]; at in [2] variant 3994 3995 tc in [3]; tcc in [2] variant 4266 4268 aga in [3]; ata in [2] variant 4423 4425 caa in [3]; ca in [2] BASE COUNT 1631 A 1075 C 888 G 1615 T ORIGIN 7 bp upstream of XbaI site. 1 GGATCCTCTA GAACAGGCAA AAATAATCTA TGGTGAAAAA AATAGAAGAG TGGCTGGGTC 61 AGAGGTAGAT GTTATTATAA GAATAAAATG AGTCAATATG GTCAATAGTC ACAAAGCAGT 121 TCAAACAATG CCTGGCACAC AGTTAATGTA ATACAATGTT TAGTAAAAAT CAAATATTTT 181 ATCCCTAATA ACAGATTTTT AGTCCATTTA TCAAGTATCT CCTTTTTACA TTGTCTTCTA 241 TATAGAGACA TTTTAGCTGA CAACTTAGAT ACCAACTAAA ATAAAACTGT CACCATCTCT 301 CATTTTGATT GTTTAATGTT GAACTTTTCT AGGCCTCTTC CATTCATGAT GTCAGACATT 361 TCTCTGAAAT TTGTTGGCAT TCTATTCACA GTAAAATGTT TGTTCTGCAC AAATAGGTAT 421 GGAAGGATCC CTCCATTGTC ACTATTTCTC ATAAGACTTT CCCTCATCAG AACGAACGCC 481 TCTTTGCTAG GTGAAAGTCT TTTGGGTTCT GTCAGTCTCT ACCTCACCTC CCTCTCTTTT 541 GCCTTTCCGG TTTCCCACAG CTAATTCCAC TGACTTATTC ACTTGAAAGC TTAGCTCTAT 601 TCCAGTTCCT AGAACAGATT AACATGGGAT TCAGCTCTCT AGCAAATTCC ATTTACCCCA 661 AGCAACTCAC TCTCTGCTTT AATGATGGAG TTCCTACATT TGAGGATCCA AGTTTTATGT 721 GCACCCAGTT TGCAGATACA GGCTTTATTC TTGGCCCCTG ATACCAGGTT CTGAGACAGT 781 ATCTGCCTAG TACAGTAGCC TGATCCTAGG ATGGAAGCTT TTCCATTCCC TGACCAATCC 841 CTGCATTGTT GCAGCCCAGC TGACTATGAA AAGAACTCCC ATATATTTTC CTAGGTTCTA 901 AATATCACTT GCCTCAGATC TCTCATTGCA GGGTGGTGCC TGCCTTTGAA ACAGTATTTC 961 ATCTTGGATT CCCTGTTACT AATCATTTGC AAAGAACACA ACCAAGGACC ACTATATCCA 1021 TGGATACTGC TAAATTTCCA CTCTCAGGTT CAACATGTGT TCCATTATAT GAGTATCATG 1081 TGCTTTCTAC TGCTAGCTAC TATGAATAAT CAGTCAAGTA CTATTGAATG CTGTGAAATT 1141 AGGTTTTGCC ACGATTCATC ATCCAGTTAA AATTGTGCCA ATTTCAGTTA TCACTGTTAC 1201 TAGCTCAATG CCAACACTTT TTTTTTCTTT TTCTTTGCAT TTAACATTGT CTTGGAGCTG 1261 AACCCCATCG TAAATCAACC TGTTTCCCTT CTGTCTCAAG CTACTTTCCC AAGGTCTGTC 1321 TCTATCTGCC CCCCCGCTTT TAAATTCATT TCCTCAAAAA GCACCCTTTG TTTTTTACAA 1381 TAACCCTAGG ATAGCTATGA ATCCTTTTCT CACACCATTC TAGATACCAT CTTATGATGT 1441 TTTCATTCAG GGACTGCTAT AAGACGTGAA ACTTGTTTTA ACACACCACA AATCAGATAC 1501 CCAAATGAGC AATGTTTTCC CTCAAAATAG TTATGTTGGA GGCTATTTAC TTAATCTAAT 1561 GGTGTTATGA ATGATCAAAG CATTTATTGA ATTATTTAAA ACTTGCCAGT TCCAGATCTT 1621 TGACCCTGTC AAAGCAGGCC AGTCCTATTA CTTTATAGTA TTTGATCAAG GAGGACTGTC 1681 CAGGTAAATC ACTGTTCTAA TAATCAGGCA CAACTTGCTG TTACCAAAAA ATTAGGTTTA 1741 CCTATAAAGG ATGAAAAATT TCTATTACTG GGGATATTTT GAAAAATAAC CTCTGAAAGT 1801 AACTGTAAAA GTTTAAGTGT TGAGAAAAAG CATTATAGTT AGAATGGATT GCAACCTTTG 1861 TTTTTTTCTA TATGCAATGA AGTTCTTTTT TTCAAGAAAC AGTTCATGTT TTGGAAAGTG 1921 AAACCTAATT CACTATTACC AAAAAAAGAG GAGCAGAGGG AAATTCCGTA ACTTGGAGGC 1981 TACAATAAAT AATACCTTCG AGTCTGCAAC ATGGGACTTC CCCAGGAACA GCCAGCAGGT 2041 TTTGCTAAGT CGACTGTAAT GCCCTTATCC AATCAGAATT AGGGAGGGAA AATGGCTTTG 2101 CAGATAAATA TGGCACACTA GCCCCACGTT TTCTGAGACA TTCCTCAATT GCTTAGACAT 2161 ATTCTGAGCC TACAGCAGAG GAACCTCCAG TCTCAGCACC ATGAATCAAA CTGCCATTCT 2221 GATTTGCTGC CTTATCTTTC TGACTCTAAG TGGCATTCAA GGTAAGGAAC ATCAAAGGAT 2281 ACTTAATTTG TAAAATGAGA AATAGGAATA GGTATAAATT CTAAAAATAC AGAAATAATG 2341 TATTTGTAAA AGTTTCACTG CATGCTTATA AATAAGAGGG AAATAAATAG AGATTCCCTC 2401 AGATCATAAA ACTTATATGA ATTGAAGTGA GAGAAACAAA TAGAATAAGA GAAAGAGAAG 2461 GAAAAAGGGA AGGAGGACAG AAGAGATGGG GAAGAGGGAG GATAGAGAGA GAAAATGTGA 2521 GGGAATGCGG ACAGAGATGA GATACAGATA CTTCCTTACC TAACTAAGCT CAATGAACCA 2581 CATGAACTGT GCTTAAGGGT TTGACTTTAT AATCAACAAG CTGCAATTCT TTTCTTCCAG 2641 ATAATCAACT CTTTAATCAT TTACAGTTGT GTTATGATGT GATCCATTCC TCCTCAGATT 2701 AAGTGACTAT TTGCTGATAT GGGGATATAG GTTCTGCTAA ATACCACCAG TCTACATTAA 2761 ATGCCTAAAA TGAACACTGT GCTAACCTTC TCTGCTGTTC CTCTTTTCCT ACAGGAGTAC 2821 CTCTCTCTAG AACTGTACGC TGTACCTGCA TCAGCATTAG TAATCAACCT GTTAATCCAA 2881 GGTCTTTAGA AAAACTTGAA ATTATTCCTG CAAGCCAATT TTGTCCACGT GTTGAGATCA 2941 TGTGAGTGAA ATCCCATCTG ATTATCACTT CCCTGGTTGT AATTATATAC TGTATTAAAT 3001 ATGTAATGAT AATAAAAAAA GATCAGTAAA GGGTTTGTGA TGATTCTAAA ACTAATGTAC 3061 AGCAAACAAA AACATGCAGA GTGAAACTTA AATGTCTGAC TTCAGAACTG CGTATGCCAT 3121 CTGTTTTATT GACCCAACAC AGTTTTAAAT ATTTTCATCC CTATTTATTT CTACAGTGCT 3181 ACAATGAAAA AGAAGGGTGA GAAGAGATGT CTGAATCCAG AATCGAAGGC CATCAAGAAT 3241 TTACTGAAAG CAGTTAGCAA GGAAAGGTAG GTTTGCTGTT GCCTGCAGCC GAATTGCTCT 3301 TTAGGAAACG GCAATCTTGG GAGTCAGAAA TACTTGCATT GTGGTTTGCT GTGCAATCGC 3361 TGGTTTAAAA GTATGTTACC ACCACGCCCT CCCCTACCTC CATTTATTTA AATGCTGAGG 3421 CACCATCTTG TGTGATAAGT ATCAGAAGTT ACCCTGATTA CCAGTCAACC TTGAAGTACA 3481 GCTATAACTA TCTAAGCAAA ACTGACAACA TTTTCCCCAA GTCTTTCATG GTTGAAAAAA 3541 GCAACCCCTA TAATCCATAA TGAATGCATA GCAGCAGGAA AGCTCAGTTA TCTATTCTAT 3601 GAACTCGGTA CTTTCCAAAC ACAACCCAAT CTGAAGCCAG AGTCAGACTA TCACACTTTT 3661 ATATCCCCTT TCTCTTCTTA CAGGTCTAAA AGATCTCCTT AAAACCAGAG GGGAGCAAAA 3721 TCGATGCAGT GCTTCCAAGG ATGGACCACA CAGAGGCTGC CTCTCCCATC ACTTCCCTAC 3781 ATGGAGTATA TGTCAAGCCA TAATTGTTCT TAGTTTGCAG TTACACTAAA AGGTGACCAA 3841 TGATGGTCAC CAAATCAGCT GCTACTACTC CTGTAGGAAG GTTAATGTTC ATCATCCTAA 3901 GCTATTCAGT AATAACTCTA CCCTGGCACT ATAATGTAAG CTCTACTGAG GTGCTATGTT 3961 CTTAGTGGAT GTTCTGACCC TGCTTCAAAT ATTTCCTCAC CTTTCCCATC TTCCAAGGGT 4021 ACTAAGGAAT CTTTCTGCTT TGGGGTTTAT CAGAATTCTC AGAATCTCAA ATAACTAAAA 4081 GGTATGCAAT CAAATCTGCT TTTTAAAGAA TGCTCTTTAC TTCATGGACT TCCACTGCCA 4141 TCCTCCCAAG GGGCCCAAAT TCTTTCAGTG GCTACCTACA TACAATTCCA AACACATACA 4201 GGAAGGTAGA AATATCTGAA AATGTATGTG TAAGTATTCT TATTTAATGA AAGACTGTAC 4261 AAAGTAGAAG TCTTAGATGT ATATATTTCC TATATTGTTT TCAGTGTACA TGGAATAACA 4321 TGTAATTAAG TACTATGTAT CAATGAGTAA CAGGAAAATT TTAAAAATAC AGATAGATAT 4381 ATGCTCTGCA TGTTACATAA GATAAATGTG CTGAATGGTT TTCAAAATAA AAATGAGGTA 4441 CTCTCCTGGA AATATTAAGA AAGACTATCT AAATGTTGAA AGATCAAAAG GTTAATAAAG 4501 TAATTATAAC TAAGATGAAG TGCCTTGTGT GTCACTTGGT TGGGATATTT ATTTCATAAG 4561 AAACTTAGAT TGGAGAAATC TATGGCCTGG AGAAGAAGGT GGCATGGGCA GAAGACAAAT 4621 GAGTTAGAGA ATTACCACTC TCTCTCCTTC CAACTGTAAT CTCCTGAAAG GCCCATCATA 4681 TAAGATCCTA CAGAACTAAT GTAGCATAAC AGAGCTTGGC CTAATCTCCA AATCCCTATA 4741 GCCAAATTTC TTTTTTTTTT AATCTCTTAT TTTTTTAGAC ACAGGGTCTT CCTCTGTCAC 4801 CCAAGCCGGA GTGCAATGGT GCAATCATAG CTCACTGCAG CCTCGAAACT CCTGGGCTCA 4861 AGTAATCCTC TGACCTCAGC CTCCAGAGAA GCTGGGATTA CAGGTGCAAG CCACCACATC 4921 CAGCCCTATG GCCAAATTTC TCTAAGACAT GTCCACTTGA ATATCCCATA GGAACTTTAG 4981 TTTAAACTCC AAAATAAACT AATTCTCCTC TCCCTGAGAT CTGTTTCTCC TCCTCTATTC 5041 TATGTCCTAG TTTAATTGCA CCATCATCCA ATCAGTTTCC CAAACCAGAA ACCTGGGAGT 5101 AATTCTATAC TACTTCTTCC TCTCCTCCCA CATCCAATTA CTTACTAAGA AGATCTTCAC 5161 AATACATTTA GTTGGCAAAT TATTAGCATC TAGAATATGT AAAGAATTC // LOCUS HUMIGCB1 3182 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human Ig germline J-mu-delta region: DHQ52, J, pseudo-J genes, J exon. ACCESSION J00256 KEYWORDS diversity exon; germline; immunoglobulin; immunoglobulin heavy chain; joining exon; pseudogene; variable region. SEGMENT 1 of 9 SOURCE Human placental lymphocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3182) AUTHORS Ravetch,J.V., Siebenlist,U., Korsmeyer,S., Waldmann,T. and Leder,P. TITLE Structure of human immunoglobulin mu locus: Characterization of embryonic and rearranged J and D genes JOURNAL Cell 27, 583-591 (1981) STANDARD full staff_review REFERENCE 2 (bases 1498 to 3011; fragments) AUTHORS Flanagan,J.G. and Rabbitts,T.H. TITLE The sequence of a human immunoglobulin epsilon heavy chain constant region gene, and evidence for three non-allelic genes JOURNAL EMBO J. 1, 655-660 (1982) STANDARD full staff_review COMMENT The J-genes presented here account for all but two (EU and DI) of the known human heavy chain amino acid sequences [1]. By sequence homology comparison the authors [1] postulate that J1 duplicated after human/mouse divergence creating the human J1 and J2 genes, and that the J3 gene duplicated to produce J4 and J5. Although the three pseudogenes are substantially different from the active genes, they have each preserved the 5' putative recombination recognition sequence, suggesting that they may undergo V-D-J recombination. In addition to the J- and D-regions homologous to the mouse locus, there are two other regions of homology 5' to DHQ52 in human and DQ52 in mouse [1]. The first is 'at'-rich and the second is 'gc'-rich (bases 333-357 and 424-451). They are separated by a 60 bp spacer in both mouse and human. [1] also reports two recombinant genes involving this region. [2] presents active V and epsilon-1 genes. The J-region in [2] was used as a probe for locating the epsilon-1 gene. The authors [1] have determined a physical map of this multigene region which contains (in this order) the J-region genes, the mu/alpha switch region, the four mu constant region exons, the mu membrane anchor peptide, and the delta constant region. Flanagan and Rabbitts (Nature 300, 709-713 (1982)) have determined the order of this region relative to two gamma-epsilon-alpha regions. FEATURES from to/span description pept / 621 / 631 Ig h-chain diversity region DHQ52 (AA at 623) /nomgen="IGH@" /map="14q32.33" /hgml_locus_uid="LR0220V" pept / 723 / 774 Ig h-chain joining region J1 (AA at 723) pept / 933 / 984 Ig h-chain joining region J2 (AA at 933) pept / 1541 / 1586 Ig h-chain joining region J3 (AA at 1541) pept / 1914 / 1959 Ig h-chain joining region J4 (AA at 1914) pept / 2356 / 2404 Ig h-chain joining region J5 (AA at 2356) pept / 2949 / 3009 Ig h-chain joining region J6 (AA at 2949) pept.ps / 504 / 557 pseudogene J-psi-1 pept.ps / 1320 / 1380 pseudogene J-psi-2 pept.ps / 2746 / 2797 pseudogene J-psi-3 IVS 558 > 3182 intron J-psi-1-C (no splice consensus at 558) IVS 775 > 3182 intron J1-C IVS 985 > 3182 intron J2-C IVS 1381 > 3182 intron J-psi-2-C (no splice consensus at 1381) IVS 1587 > 3182 intron J3-C IVS 1960 > 3182 intron J4-C IVS 2405 > 3182 intron J5-C IVS 2798 > 3182 intron J-psi-3-C (no splice consensus at 2798) IVS 3010 > 3182 intron J6-C iDNA < 1 503 D-J-psi-1 intervening DNA (3' end +/- 1 bp) iDNA < 1 620 V-D intervening DNA (5' end +/- 1 bp; 3' end +/- 1 bp) iDNA 632 722 D-J1 intervening DNA (3' end +/- 1 bp) iDNA < 1 932 D-J2 intervening DNA (3' end +/- 1 bp) iDNA < 1 1319 D-J-psi-2 intervening DNA (3' end +/- 1 bp) iDNA < 1 1540 D-J3 intervening DNA (3' end +/- 1 bp) iDNA < 1 1913 D-J4 intervening DNA (3' end +/- 1 bp) iDNA < 1 2355 D-J5 intervening DNA (3' end +/- 1 bp) iDNA < 1 2745 D-J-psi-3 intervening DNA (3' end +/- 1 bp) iDNA < 1 2948 D-J6 intervening DNA (3' end +/- 1 bp) signal 467 503 putative recombination recognition sequence signal 592 620 putative recombination recognition sequence signal 632 659 putative recombination recognition sequence signal 685 722 putative recombination recognition sequence signal 893 931 putative recombination recognition sequence signal 1284 1319 putative recombinatin recognition sequence signal 1497 1537 putative recombination recognition sequence signal 1873 1911 putative recombination recognition sequence conflict 1912 1912 a in [1]; g in [2] conflict 1937 1937 a in [1]; g in [2] signal 2317 2353 putative recombination recognition sequence signal 2709 2745 putative recombination recognition sequence signal 2908 2946 putative recombination recognition sequence conflict 2918 2918 g in [1]; t in [2] conflict 2981 2981 g in [1]; c in [2] BASE COUNT 551 A 923 C 1065 G 640 T 3 others ORIGIN about 1500 bp downstream of BamHI site on chromosome 14. 1 CAGCTGAGGC TCAGAGAAGG GACCCAGCAC TGGTGGACAC GCAGGGAGCC CACGCCAGGG 61 CGCCGTGGTG AGTGAGGCCC AGTGCCACCC ACTGAGGCCT CCCGTTCAGT GGGACGACGG 121 TGAACAGGTG GAACCAACCA GGCAACCCCG CCGGGCCCCA CAGACGGATC AGAGCAGGAA 181 AGGCTTCCTG CCCTGCAGGC CAGCGAGGAG CCTGCGGGCC GTGCCCTCCA GCGAGGAGGC 241 TCCCCTGCCA CCGCCACCCG GGCCTCTCTG CTGCTGGGAA AACAAGTCAG AAAGCAAGTG 301 GATGAGAGGT GGCGTGACAG ACCCAGCTTC AGATCTGCTC TAATTTACAA AAGAAAAGGA 361 AAAACACACT TGGCAGCCTT CAGCACTCTA ATGATTCTTA ACAGCAGCAA ATTATTGGCA 421 CAAGACTCCA GAGTGACTGG CAGGGTTGAG GGCTGGGGTC TCCCACGTGT TTTGGGGCTA 481 ACAGCGGAAG GGAGAGCACT GGCAAAGGTG CTGGGGGTCC CCTGAACCCG ACCCGCCCTG 541 AGACCGCAGC CACATCAGCC CCCAGCCCCA CAGGCCCCCT ACCAGCCGCA GGGTTTTTGG 601 CTGAGCTGAG AACCACTGTG CTAACTGGGG ACACAGTGAT TGGCAGCTCT ACAAAAACCA 661 TGCTCCCCCG GGACCCCGGG CTGTGGGTTT CTGTGCCCCT GGCTCAGGGC TGACTCACCG 721 TGGCTGAATA CTTCCAGCAC TGGGGCCAGG GCACCCTGGT CACCGTCTCC TCAGGTGAGT 781 CTGCTGTACT GGGGATAGCG GGGAGCCATG TGTACTGGGC CAAGCAAGGG CTTTGGCTTC 841 AGACTTGGGG ACAGGTGCTC AGCAAAGGAG GTCGGCAGGA GGGCGGAGGG TGTGTTTTTG 901 TATGGGAGAA GCAGGAGGGC AGAGGACTGT GCTACTGGTA CTTCGATCTC TGGGGCCGTG 961 GCACCCTGGT CACTGTCTCC TCAGGTGAGT CCCACTGCAG CCCCCTCCCA GTCTTCTCTG 1021 TCCAGCACCA GCCAGTATCT GGGGTCTGCA GCCGGCCTGG GTCGCCTGAG GCCACACCCA 1081 GCTGCCATCC CTGGGGTCTC CGCCATGGGC TGCATGCCAG AGCCCTGCTG TCACTTGCCC 1141 TGGGCCAGCT GGAGCCCCCA AGGACAGGCA GGGACCCCGC TGGGCTTCAG CCCGTCAGGT 1201 CCCTCCACAG GTAGCAAGCA GGCCGAGGGC AGGGACGGGA AGGAGAAGTT GTGGGCAGAG 1261 CCTGGGCTGG GGCTGGCGCC GGCTGTTCAT GTGCCGGGGA CAGGCTGCGC TTTAGTGTGG 1321 CTACAAGTGC TTGGAGCACT GGGGCAGGGC AGCCCGGACA CCGTCTCCCT GGGAACGTCA 1381 CCCCTCCCTG CCTGGGTCTC AGCCCGGGGG TCTGTGTGGC TGGGGACAGG GACGCCGGCT 1441 GCCTCTGTCT CTGTGCTTGG GCCATGTGAC CCATTTCGAG CGTCCTGCAC GGGCACAGGT 1501 TTGTGTCTGG GTCTAGGAAC GGACTGTGTC CCTGTGTGAT GCTTTTGATG TCTGGGGCCA 1561 AGGGACAATG GTCACCGTCT CTTCAGGTAA GATGGGCTTT CCTTCTGCCT CCTTTCTCTG 1621 GGCCCAGCGT CCTCTGTCCT GGAGCTGGGA GATAATGTCC GGGGGCCTCC TTGGTCTGCG 1681 CTGGGCCATG TGGGGCCTCC GGGGCTCCTT CTCCGGCTGT TTGGACCACG TTCAGCAGAA 1741 GGCCTTTCTT TGGGAACTGG GACTCTGCTG CTGGGGCAAA GGGTGGGCAG AGTCATGCTT 1801 GTGCTGGGGA CAAAATGACC TTGGGACACG GGGCTTGGCT GCCACGGCCG GCCCGGGACA 1861 GTCGGAGAGT CAGGTTTTTG TGCACCCCTT AATGGGGCCT CCCACAATGT GACTACTTTG 1921 ACTACTGGGG CCAAGGAACC CTGGTCACCG TCTCCTCAGG TGAGTCCTCA CAACCNNNTC 1981 TCCTCCGTTA ACTCGAGGTT TGTGACTTTT GGGGAATAAG GGTGCTGGGG GCCTGCCAAG 2041 AGAGCCCCGG AGCAGCCCTG GGGGCTGCAG GAGGCCTGAG GCAACAGCGG CACACACAGA 2101 CGAGGGGCAA GGGTCTCCAG ATGCTCCTTC CTCCTGAGCC CAGCAGCACG GGTTCGTCTC 2161 GGCGCCAGGG CCACCCTAGG CGGAGGTTCG TGTCTTCTCT GAGCCAGGAG CACGGGTTCT 2221 CTCGCAGGCA CCCTGTGCCT CTGGGGTCCA ATGCCCAACA ACCCCGGCCC TCCCCGGGCT 2281 CAGTCTGAGA GGGTCCCAGG GACGTGCGGG GCGCCGGTTC TTTGTCGGGG TCTGGCATTG 2341 TTGTCACAAT GTGACAACTG GTTCGACTCC TGGGGCCAAG GAACCCTGGT CACCGTCTCC 2401 TCAGGTGAGT CCTCACCACC CCCTCTCTGA GTCCACTTAG GGAGACTCAG CTTGCAGGGT 2461 CTAGGGTCAG AGTCTTGGAG GCATTTTGGA GGTCAGGAAA GAAAGCCGGG GAGAGGGACC 2521 CTTCGAATGG GAACCCCGCC TGTCCTCCCC AAGTCCGGCC ACAGATGTCG GCAGCTGGGG 2581 GGCTCCTTCG GCTGGTCTGG GGTGACCTCT CTCCGCTTCA CTGGAGCATT CTCAGGGGCT 2641 GTCGTGATGA TTGCGTGGTG GGACTCTGTC CGCTCCAAGG CACCCGCTCT CTGGGCGGGT 2701 GCCCCCGGGG TTTTTGGACT CCTGGGGGTA ACTTGCGGCC GTCTGCTTGC GGTTGGACTT 2761 CCCAGCCGAC AGTGGTGGTC TGGCTTCTGA GGGGTCAGGC CAGAATGTAC TGGGAGGCCA 2821 GCAGAGGGTT CCATGAGAAG GGCAGGACAG GGCCACGGAC AGTCAGCTTC CATGTGACGC 2881 CCGGAGACAG AAGGTCTCGG GTGGACTGGG TTTTTGTGGG GTGAGGATGG ACATTCTGCC 2941 ATTGTGATTA CTACTACTAC TACGGTATGG ACGTCTGGGG GCAAGGGACC ACGGTCACCG 3001 TCTCCTCAGG TAAGAATGGC CACTCTAGGG CCTTTGTTTT CTGCTACTGC CTGTGGGGTT 3061 TCCTGCGCCA TTGCGGTTGG TGCCTCGCGG CATGTCCATG GGGGACCTGG CGGACTGGAC 3121 CATGTTCCGA GGGGACCTGG CGGACTGGCC AGAGGGGACG GCACTGGGGT GCCTTGAGGA 3181 TC // LOCUS HUMIGCB3 54 bp ds-DNA PRI 01-OCT-1984 DEFINITION Human Ig germline H-chain J-mu-delta region: mu CH1, 5' end. ACCESSION K01307 KEYWORDS constant region; germline; immunoglobulin; immunoglobulin heavy chain; mu-immunoglobulin. SEGMENT 3 of 9 SOURCE Human fetal liver DNA, library of Lawn et al, clone lambda-C75. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 54) AUTHORS Rabbitts,T.H., Forster,A. and Milstein,C.P. TITLE Human immunoglobulin heavy chain genes: Evolutionary comparisons of C-mu, C-delta and C-gamma genes and associated switch sequences JOURNAL Nucleic Acids Res. 9, 4509-4524 (1981) STANDARD full staff_review COMMENT This is part of a multigene region containing the J-region, switch region, C-mu-secreted, C-mu-membrane, and C-delta genes. FEATURES from to/span description pept / 5 > 54 Ig mu C-region CH1 domain secreted (aa at 7) /hgml_locus_uid="LW0029F" /nomgen="IGHM" /map="14q32.33" pept / 5 > 54 Ig mu C-region CH1 domain membrane bound (aa at 7) IVS < 1 4 secreted mu intron J-C IVS < 1 4 membrane bound mu intron J-C BASE COUNT 8 A 22 C 10 G 14 T ORIGIN about 3 kb after ; 48 bp 5' to EcoRI site on chrm 14. 1 TCAGGGAGTG CATCCGCCCC AACCCTTTTC CCCCTCGTCT CCTGTGAGAA TTCC // LOCUS HUMIGCB4 248 bp ds-DNA PRI 01-OCT-1984 DEFINITION Human Ig germline H-chain J-mu-delta region: mu CH2, 5' end. ACCESSION K01308 KEYWORDS constant region; germline; immunoglobulin; immunoglobulin heavy chain; mu-immunoglobulin. SEGMENT 4 of 9 SOURCE Human fetal liver DNA, library of Lawn et al, clone lambda-C75. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 248) AUTHORS Rabbitts,T.H., Forster,A. and Milstein,C.P. TITLE Human immunoglobulin heavy chain genes: Evolutionary comparisons of C-mu, C-delta and C-gamma genes and associated switch sequences JOURNAL Nucleic Acids Res. 9, 4509-4524 (1981) STANDARD full staff_review COMMENT This is part of a multigene region containing the J-region, switch region, C-mu-secreted, C-mu-membrane, and C-delta genes. FEATURES from to/span description pept / 4 > 248 Ig mu C-region CH2 domain secreted (aa at 6) /hgml_locus_uid="LW0029F" /nomgen="IGHM" /map="14q32.33" pept / 4 > 248 Ig mu C-region CH2 domain membrane bound (aa at 6) IVS < 1 3 mu secreted intron A IVS < 1 3 mu membrane bound intron A BASE COUNT 51 A 81 C 75 G 41 T ORIGIN about 500 bp after on chromosome 14. 1 CAGTGATTGC TGAGCTGCCT CCCAAAGTGA GCGTCTTCGT CCCACCCCGC GACGGCTTCT 61 TCGGCAACCC CCGCAAGTCC AAGCTCATCT GCCAGGCCAC GGGTTTCAGT CCCCGGCAGA 121 TTCAGGTGTC CTGGCTGCGC GAGGGGAAGC AGGTGGGGTC TGGCGTCACC ACGGACCAGG 181 TGCAGGCTGA GGCCAAAGAG TCTGGGCCCA CGACCTACAA GGTGACCAGC ACACTGACCA 241 TCAAAGAG // LOCUS HUMIGCB5 421 bp ds-DNA PRI 01-OCT-1984 DEFINITION Human Ig germline H-chain J-mu-delta region: mu CH2(3')-CH3(5'). ACCESSION K01309 KEYWORDS constant region; germline; immunoglobulin; immunoglobulin heavy chain; mu-immunoglobulin. SEGMENT 5 of 9 SOURCE Human fetal liver DNA, library of Lawn et al, clone lambda-C75. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 421) AUTHORS Rabbitts,T.H., Forster,A. and Milstein,C.P. TITLE Human immunoglobulin heavy chain genes: Evolutionary comparisons of C-mu, C-delta and C-gamma genes and associated switch sequences JOURNAL Nucleic Acids Res. 9, 4509-4524 (1981) STANDARD full staff_review COMMENT This is part of a multigene region containing the J-region, switch region, C-mu-secreted, C-mu-membrane, and C-delta genes. FEATURES from to/span description pept < 1 52 Ig mu C-region CH2 domain secreted (aa at 1) /hgml_locus_uid="LW0029F" /nomgen="IGHM" /map="14q32.33" 294 > 421 Ig mu C-region CH3 domain secreted pept < 1 52 Ig mu C-region CH2 domain membrane bound (aa at 1) 294 > 421 Ig mu C-region CH3 domain membrane bound IVS 53 293 mu intron B secreted IVS 53 293 mu intron B membrane bound BASE COUNT 77 A 157 C 106 G 81 T ORIGIN about 100 bp after on chromosome 14. 1 GATCACAGGG GCCTGACCTT CCAGCAGAAT GCGTCCTCCA TGTGTGTCCC CGGTGAGTGA 61 CCTGTCCCCA GGGGCAGCAC CCACCGACAC ACAGGGGTCC ACTCGGGTCT GGCATTGCCA 121 CCCCGGATGC AGCCATCTAC TCCCTGAGCC TTGGTTCCCA GAGCGGCCAA GGGAGGGGCT 181 CGGGCGGCAG GACCCCTGGG CTCGGCAGAG GCAGTTGCTA CTCTTTGGGT GGGAACCATG 241 CCTCCGCCCA CATCCACACC TGCCCCACCT CTGACTCCCT TCTCTTGATC CAGATCAAGA 301 CACAGCCATC CGGGTCTTCG CCATCCCCCC ATCCTTTGCC AGCATCTTCC TGACCAAGTC 361 CACCAAGTTG ACCTGCCTGG TCACAGACCT GACCACCTAT GACAGCGTGA CCATCTCCTG 421 G // LOCUS HUMIGCB6 108 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human Ig germline H-chain J-mu-delta region: mu CH3, 3' end. ACCESSION J00259 KEYWORDS constant region; germline; immunoglobulin; immunoglobulin heavy chain; mu-immunoglobulin. SEGMENT 6 of 9 SOURCE Human fetal liver DNA, library of T. Maniatis, clone H24 [1]; library of Lawn et al [2], clone lambda-C75. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 8 to 108) AUTHORS Takahashi,N., Nakai,S. and Honjo,T. TITLE Cloning of human immunoglobulin mu gene and comparison with mouse mu gene JOURNAL Nucleic Acids Res. 8, 5983-5991 (1980) STANDARD full staff_review REFERENCE 2 (bases 1 to 96) AUTHORS Rabbitts,T.H., Forster,A. and Milstein,C.P. TITLE Human immunoglobulin heavy chain genes: Evolutionary comparisons of C-mu, C-delta and C-gamma genes and associated switch sequences JOURNAL Nucleic Acids Res. 9, 4509-4524 (1981) STANDARD full staff_review COMMENT Compared in [1] with the mouse gene. The two are 68% homologous overall, and are 80% homologous in the CH4 domain. This is part of a multigene region containing the J-region, switch region, C-mu-secreted, C-mu-membrane, and C-delta genes. FEATURES from to/span description pept < 1 / 83 Ig mu C-region CH3 domain secreted (aa at 2) /nomgen="IGHM" /map="14q32.3" /hgml_locus_uid="LW0029F" pept < 1 / 83 Ig mu C-region CH3 domain membrane bound (aa at 2) pre-msg < 1 > 108 Ig mu C-region CH3 mRNA and introns IVS 84 > 108 mu intron C secreted IVS 84 > 108 mu intron C membrane bound BASE COUNT 21 A 42 C 27 G 18 T ORIGIN About 100 bp after ; chromosome 14q32.3. 1 GAATTCGGGG GAGAGGTTCA CGTGCACCGT GACCCACACA GACCTGCCCT CGCCACTGAA 61 GCAGACCATC TCCCGGCCCA AGGGTAGGCC CCACTCTTGC CCTCTTCC // LOCUS HUMIGCB7 625 bp ds-DNA PRI 01-OCT-1984 DEFINITION Human Ig germline H-chain J-mu-delta region: mu CH4, secreted tail. ACCESSION J00260 J00261 KEYWORDS alternate splicing; constant region; germline; immunoglobulin; immunoglobulin heavy chain; mu-immunoglobulin. SEGMENT 7 of 9 SOURCE Human fetal liver DNA, library of T. Maniatis [1]; library of Lawn et al [2], clone lambda-C75. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 24 to 96; 281 to 440) AUTHORS Takahashi,N., Nakai,S. and Honjo,T. TITLE Cloning of human immunoglobulin mu gene and comparison with mouse mu gene JOURNAL Nucleic Acids Res. 8, 5983-5991 (1980) STANDARD full staff_review REFERENCE 2 (bases 1 to 625) AUTHORS Rabbitts,T.H., Forster,A. and Milstein,C.P. TITLE Human immunoglobulin heavy chain genes: Evolutionary comparisons of C-mu, C-delta and C-gamma genes and associated switch sequences JOURNAL Nucleic Acids Res. 9, 4509-4524 (1981) STANDARD full staff_review COMMENT The splice point for the mu-membrane segment CH1 is at base 284. The authors in [1] report that the deduced amino acid sequence differs from the human OU mu chain protein at codons 487 and 493 (at bases 76 and 94, respectively). The tail protein differs from that of the mouse by 8% in the amino acid sequence and by 9.6% in the nucleotide sequence [2]. The mouse nucleotide sequence is 80% homologous with this sequence [1]. (They are 68% homologous in the CH4 domain.) The authors in [1] and [2] remark that the S-mu (switch) region contains tandem repeats of the pentanucleotide 'gagct' (also found in the mouse switch regions), which may be involved in class switch recombination. This is part of a multigene region containing the J-region, switch region, C-mu-secreted, C-mu-membrane, and C-delta genes. The last 15 bases in [1] show very little homology with the sequence presented in [2] and are noted in the sites table as a conflict. FEATURES from to/span description pept < 1 345 Ig mu C-region CH4 domain secreted (aa at 1) /hgml_locus_uid="LW0029F" /nomgen="IGHM" /map="14q32.33" pept < 1 + 283 Ig mu C-region CH4 domain membrane bound (aa at 1) IVS 284 > 625 mu-m intron membrane bound conflict 358 358 g in [2]; a in [1] conflict 381 386 ccgctc in [2]; cgcc in [1] conflict 411 412 gt in [2]; g in [1] conflict 419 423 ggtcg in [2]; gtg in [1] conflict 430 444 tgcatcttataaaat in [2]; gactcataatagata in [1] BASE COUNT 126 A 215 C 171 G 113 T ORIGIN about 200 bp after on chromosome 14. 1 GAGCAGCTGA ACCTGCGGGA GTCGGCCACC ATCACGTGCC TGGTGACGGG CTTCTCTCCC 61 GCGGACGTCT TCGTGCAGTG GATGCAGAGG GGGCAGCCCT TGTCCCCGGA GAAGTATGTG 121 ACCAGCGCCC CAATGCCTGA GCCCCAGGCC CCAGGCCGGT ACTTCGCCCA CAGCATCCTG 181 ACCGTGTCCG AGGAGGAATG GAACACGGGG GAGACCTACA CCTGCGTGGC CCATGACGCC 241 CTGCCCAACA GGGTCACCGA GAGGACCGTG GACAAGTCCA CCGGTAAACC CACCCTGTAC 301 AACGTGTCCC TGGTCATGTC CGACACAGCT GGCACCTGCT ACTGACCCTG CTGGCCTGCC 361 CACAGGCTCG GGGCGGCTGG CCGCTCTGTG TGTGCATGCA AACTAACCGT GTCAACGGGG 421 TCGAGATGTT GCATCTTATA AAATTAGAAA TAAAAAGATC CATTCAAAAG ATACTGGTCC 481 TGAGTGCACG ATGCTCTGGC CTACTGGGGC GGCGGCTGTG CTGCACCCAC CCTGCGCCTC 541 CCCTGCAGAA CACCTTCCTC CACAGCCCCC ACCCCTGCCT CACCCACCTG CGTGCCTCAG 601 TGGCTTCTAG AAACCCCTGA ATTCC // LOCUS HUMIGCB8 141 bp ds-DNA PRI 01-OCT-1984 DEFINITION Human Ig germline H-chain J-mu-delta region: mu-membrane exon M1. ACCESSION K01310 KEYWORDS alternate splicing; constant region; germline; immunoglobulin; immunoglobulin heavy chain; membrane bound immunoglobulin; mu-immunoglobulin. SEGMENT 8 of 9 SOURCE Human fetal liver DNA, library of Lawn et al, clone lambda-C75. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 141) AUTHORS Rabbitts,T.H., Forster,A. and Milstein,C.P. TITLE Human immunoglobulin heavy chain genes: Evolutionary comparisons of C-mu, C-delta and C-gamma genes and associated switch sequences JOURNAL Nucleic Acids Res. 9, 4509-4524 (1981) STANDARD full staff_review COMMENT Membrane IgM is made by splicing the mu CH4 domain with the mu-membrane M1 domain, deleting the secreted tail and intervening DNA. The amino acid sequence of M1 is identical to the mouse sequence, and the two nucleotide sequences differ by only 7%. This is part of a multigene region containing the J-region, switch region, C-mu-secreted, C-mu-membrane, and C-delta genes. FEATURES from to/span description pept + 14 / 129 Ig mu C-region membrane bound, exon 5 /hgml_locus_uid="LW0029F" /nomgen="IGHM" /map="14q32.33" IVS < 1 13 mu-m intron membrane bound IVS 130 > 141 mu-m intron membrane bound BASE COUNT 27 A 45 C 37 G 32 T ORIGIN about 1.9 kb after on chromosome 14. 1 GTGTCTCCTG CAGAGGGGGA GGTGAACGCC GAAGAGGAGG GCTTTGAAAA CCTGTGGACC 61 ACCGCCTCCA CCTTCATCGT CCTCTTCCTC CTGAGCCTCT TCTACAGTAC CACCGTCACC 121 TTGTTCAAGG TAGCACGGCT G // LOCUS HUMIGCB9 106 bp ds-DNA PRI 01-OCT-1984 DEFINITION Human Ig germline H-chain J-mu-delta region: C-delta CH1, 5' end. ACCESSION K01311 KEYWORDS constant region; delta-immunoglobulin; germline; immunoglobulin; immunoglobulin heavy chain. SEGMENT 9 of 9 SOURCE Human fetal liver DNA, library of Lawn et al. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 106) AUTHORS Rabbitts,T.H., Forster,A. and Milstein,C.P. TITLE Human immunoglobulin heavy chain genes: Evolutionary comparisons of C-mu, C-delta and C-gamma genes and associated switch sequences JOURNAL Nucleic Acids Res. 9, 4509-4524 (1981) STANDARD full staff_review COMMENT The deduced amino acid sequence is compared in [1] to the J/C-delta-1 junction of human ER1 protein. The delta gene occurs only 5 kb from the mu region. The authors [1] could not detect any switch-related sequences adjacent to the delta gene and state that this implies that the mu/delta switch cannot occur by the class switch recombination method. They speculate that the entire VH-(C-mu)-(C-delta) region is transcribed into one nuclear precursor molecule which is spliced later. This is part of a multigene region containing the J-region, switch region, C-mu-secreted, C-mu-membrane, and C-delta genes. FEATURES from to/span description pept / 27 > 106 Ig delta C-region CH1 domain (aa at 29) /hgml_locus_uid="LX0020F" /nomgen="IGHD" /map="14q32.33" IVS < 1 26 delta intron J-C BASE COUNT 24 A 38 C 24 G 20 T ORIGIN about 3.5 kb after on chromosome 14. 1 TGCCACCCCA GGACTCTGTC TTCCAGCACC CACCAAGGCT CCGGATGTGT TCCCCATCAT 61 ATCAGGGTGC AGACACCCAA AGGATAACAG CCCTGTGGTC CTGGCA // LOCUS HUMIGCC2 1208 bp ds-DNA PRI 20-MAY-1987 DEFINITION Human Ig germline G-E-A region A: gamma-3 CH1, Hinge, CH2. ACCESSION K01313 KEYWORDS constant region; gamma-immunoglobulin; germline; hinge exon; immunoglobulin; immunoglobulin heavy chain. SEGMENT 2 of 8 SOURCE Human fetal liver DNA, library of Lawn et al, clones p-gamma-2RPA3 [1] and Ig-gamma3-122. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 393 to 683; 802 to 874; 990 to 1062) AUTHORS Krawinkel,U. and Rabbitts,T.H. TITLE Comparison of the hinge-coding segments in human immunoglobulin gamma heavy chain genes and the linkage of the gamma 2 and gamma 4 subclass genes JOURNAL EMBO J. 1, 403-407 (1982) STANDARD full staff_review REFERENCE 2 (bases 1 to 1208) AUTHORS Takahashi,N., Ueda,S., Obata,M., Nikaido,T., Nakai,S. and Honjo,T. TITLE Structure of human immunoglobulin gamma genes: Implications for evolution of a gene family JOURNAL Cell 29, 671-679 (1982) STANDARD full staff_review COMMENT [1] also presents the hinge region sequences of gamma-2 and gamma-4. [2] shows sequences for gamma-1, gamma-3, gamma-4 and a gamma pseudogene. This sequence is part of a multigene region containing the gamma-3, gamma-1, pseudo-epsilon, and alpha-1 genes. See segment 1 for more comments. FEATURES from to/span description pept < 1 35 Ig gamma-3 constant region (aa at 2) CH1 domain /nomgen="IGHG3" /map="14q32.3" /hgml_locus_uid="LE0021L" 426 476 Ig gamma-3 constant region hinge exon H1 620 664 Ig gamma-3 constant region hinge exon H2 808 852 Ig gamma-3 constant region hinge exon H3 996 1040 Ig gamma-3 constant region hinge exon H4 1159 > 1208 Ig gamma-3 constant region CH2 domain IVS 36 425 gamma-3 intron A IVS 477 619 gamma-3 intron B IVS 665 807 gamma-3 intron C IVS 853 995 gamma-3 intron D IVS 1041 1158 gamma-3 intron E BASE COUNT 249 A 463 C 295 G 201 T ORIGIN BglII site on chromosome 14q32.3. 1 CAAGCCCAGC AACACCAAGG TGGACAAGAG AGTTGGTGAG AGGCCAGCGC AGGGAGGGAG 61 GGTGTCTACT GGAAGCCAGC TCAGCCCTCC TGCCTGGACG CATCCCGGCT GTGCAGTCCC 121 AGCCCAGGGC ACCAAGGCAG GCCCCGTCTG ACTCCTCACC CGGAGGCCTC TGCCCGCCCC 181 ACTCATGCTC AGGGAGAGGG TCTTCTGGCT TTTTCCACCA GGCTCCAGGC AGGCACAGGC 241 TGGGTGCCCC TACCCCAGGC CCTTCACACA CAGGGGCAGG TGCTGCGCTC AGACGTGCCA 301 GAGCCATATC CAGGAGGACC CTGCCCCTGA CCTAAGCCCA CCCCAAAGGC CAAACTCTCT 361 ACTCACTCAG CTCAGACACC TTCTCTCTTC CCAGATCTGA GTAACTCCCA ATCTTCTCTC 421 TGCAGAGCTC AAAACCCCAC TTGGTGACAC AACTCACACA TGCCCACGGT GCCCAGGTAA 481 GCCAGCCCAG GCCTCGCCCT CCAGCTCAAG GCGGGACAAG AGCCCTAGAG TGGCCTGAGT 541 CCAGGGACAG GCCCCAGCAG GGTGCTGACG CATCCACCTC CATCCCAGAT CCCCGTAACT 601 CCCAATCTTC TCTCTGCAGA GCCTAAATCT TGTGACACAC CTCCCCCGTG CCCACGGTGC 661 CCAGGTAAGC CAGCCCAGGC CTCGCCCTCC AGCTCAAGGC AGGACAGGTG CCCTAGAGTG 721 GCCTGAGTCC AGGGACAGGC CCCAGCAGGG TGCTGACGCA TCCACCTCCA TCCCAGATCC 781 CCGTAACTCC CAATCTTCTC TCTGCAGAGC CTAAATCTTG TGACACACCT CCCCCGTGCC 841 CACGGTGCCC AGGTAAGCCA GCCCAGGCCT CGCCCTCCAG CTCAAGGCAG GACAGGTGCC 901 CTAGAGTGGC CTGAGTCCAG GGACAGGCCC CAGCAGGGTG CTGACGCATC CACCTCCATC 961 CCAGATCCCC GTAACTCCCA ATCTTCTCTC TGCAGAGCCC AAATCTTGTG ACACACCTCC 1021 CCCGTGCCCA AGGTGCCCAG GTAAGCCAGC CCAGGCCTCG CCCTCCAGCT CAAGGCAGGA 1081 CAGGTGCCCT AGAGTGGCCT GCATCCAGGG ACAGGTCCCA GTCGGGTGCT GACACATCTG 1141 CCTCCATCTC TTCCTCAGCA CCTGAACTCC TGGGAGGACC GTCAGTCTTC CTCTTCCCCC 1201 CTAAACCC // LOCUS HUMIGCC4 2009 bp ds-DNA PRI 20-MAY-1987 DEFINITION Human Ig germline G-E-A region A: gamma-1 constant region. ACCESSION J00228 KEYWORDS constant region; gamma-immunoglobulin; germline; hinge exon; immunoglobulin; immunoglobulin heavy chain. SEGMENT 4 of 8 SOURCE Human fetal liver DNA, library of T. Maniatis [1]; library of Lawn et al [2], clone Ig-gamma3-122. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2009) AUTHORS Ellison,J.W., Berson,B.J. and Hood,L.E. TITLE The nucleotide sequence of a human immunoglobulin C-gamma-1 gene JOURNAL Nucleic Acids Res. 10, 4071-4079 (1982) STANDARD full staff_review REFERENCE 2 (bases 469 to 1070; 1465 to 1821) AUTHORS Takahashi,N., Ueda,S., Obata,M., Nikaido,T., Nakai,S. and Honjo,T. TITLE Structure of human immunoglobulin gamma genes: Implications for evolution of a gene family JOURNAL Cell 29, 671-679 (1982) STANDARD full staff_review COMMENT [1] and [2] report that nucleotide divergence among the four gamma genes is much greater in the hinge regions than anywhere else. [2] also reports the hinge regions of gamma-2, gamma-3, gamma-4, a gamma pseudogene, and the 5' flanking, CH2, and CH3 domains of the gamma genes. This entry is part of a multigene region (region A) containing the gamma-3, gamma-1, pseudo-epsilon, and alpha-1 genes. See segment 1 for more comments. FEATURES from to/span description pept / 210 503 Ig gamma-1 heavy chain C-region CH1 domain (aa at 212) /nomgen="IGHG1" /map="14q32.3" /hgml_locus_uid="LS0110Z" 892 936 Ig gamma-1 heavy chain C-region hinge 1055 1384 Ig gamma-1 heavy chain C-region CH2 domain 1481 1803 Ig gamma-1 heavy chain C-region CH3 domain IVS < 1 209 gamma-1 intron J-C IVS 504 891 gamma-1 intron A IVS 937 1054 gamma-1 intron B IVS 1385 1480 gamma-1 intron C conflict 563 563 t in [1]; c in [2] conflict 593 593 c in [1]; t in [2] conflict 614 614 g in [1]; a in [2] conflict 633 633 g in [1]; gg in [2] conflict 643 643 g in [1]; a in [2] conflict 654 654 g in [1]; a in [2] conflict 684 684 c in [1]; cc in [2] conflict 692 692 g in [1]; a in [2] conflict 765 766 aa in [1]; a in [2] conflict 1475 1475 c in [1]; cc in [2] conflict 1578 1578 t in [1]; c in [2] BASE COUNT 418 A 698 C 566 G 327 T ORIGIN about 27 kb after ; HindIII site on chromosome 14q32.3. 1 AGCTTTCTGG GGCAGGCCAG GCCTGACCTT GGCTTTGGGG CAGGGAGGGG GCTAAGGTGA 61 GGCAGGTGGC GCCAGCAGGT GCACACCCAA TGCCCATGAG CCCAGACACT GGACGCTGAA 121 CCTCGCGGAC AGTTAAGAAC CCAGGGGCCT CTGCGCCTGG GCCCAGCTCT GTCCCACACC 181 GCGGTCACAT GGCACCACCT CTCTTGCAGC CTCCACCAAG GGCCCATCGG TCTTCCCCCT 241 GGCACCCTCC TCCAAGAGCA CCTCTGGGGG CACAGCGGCC CTGGGCTGCC TGGTCAAGGA 301 CTACTTCCCC GAACCGGTGA CGGTGTCGTG GAACTCAGGC GCCCTGACCA GCGGCGTGCA 361 CACCTTCCCG GCTGTCCTAC AGTCCTCAGG ACTCTACTCC CTCAGCAGCG TGGTGACCGT 421 GCCCTCCAGC AGCTTGGGCA CCCAGACCTA CATCTGCAAC GTGAATCACA AGCCCAGCAA 481 CACCAAGGTG GACAAGAAAG TTGGTGAGAG GCCAGCACAG GGAGGGAGGG TGTCTGCTGG 541 AAGCAGGCTC AGCGCTCCTG CCTGGACGCA TCCCGGCTAT GCAGCCCCAG TCCAGGGCAG 601 CAAGGCAGGC CCCGTCTGCC TCTTCACCCG GAGCCTCTGC CCGCCCCACT CATGCTCAGG 661 GAGAGGGTCT TCTGGCTTTT TCCCAGGCTC TGGGCAGGCA CAGGCTAGGT GCCCCTAACC 721 CAGGCCCTGC ACACAAAGGG GCAGGTGCTG GGCTCAGACC TGCCAAGAGC CATATCCGGG 781 AGGACCCTGC CCCTGACCTA AGCCCACCCC AAAGGCCAAA CTCTCCACTC CCTCAGCTCG 841 GACACCTTCT CTCCTCCCAG ATTCCAGTAA CTCCCAATCT TCTCTCTGCA GAGCCCAAAT 901 CTTGTGACAA AACTCACACA TGCCCACCGT GCCCAGGTAA GCCAGCCCAG GCCTCGCCCT 961 CCAGCTCAAG GCGGGACAGG TGCCCTAGAG TAGCCTGCAT CCAGGGACAG GCCCCAGCCG 1021 GGTGCTGACA CGTCCACCTC CATCTCTTCC TCAGCACCTG AACTCCTGGG GGGACCGTCA 1081 GTCTTCCTCT TCCCCCCAAA ACCCAAGGAC ACCCTCATGA TCTCCCGGAC CCCTGAGGTC 1141 ACATGCGTGG TGGTGGACGT GAGCCACGAA GACCCTGAGG TCAAGTTCAA CTGGTACGTG 1201 GACGGCGTGG AGGTGCATAA TGCCAAGACA AAGCCGCGGG AGGAGCAGTA CAACAGCACG 1261 TACCGGGTGG TCAGCGTCCT CACCGTCCTG CACCAGGACT GGCTGAATGG CAAGGAGTAC 1321 AAGTGCAAGG TCTCCAACAA AGCCCTCCCA GCCCCCATCG AGAAAACCAT CTCCAAAGCC 1381 AAAGGTGGGA CCCGTGGGGT GCGAGGGCCA CATGGACAGA GGCCGGCTCG GCCCACCCTC 1441 TGCCCTGAGA GTGACCGCTG TACCAACCTC TGTCCTACAG GGCAGCCCCG AGAACCACAG 1501 GTGTACACCC TGCCCCCATC CCGGGATGAG CTGACCAAGA ACCAGGTCAG CCTGACCTGC 1561 CTGGTCAAAG GCTTCTATCC CAGCGACATC GCCGTGGAGT GGGAGAGCAA TGGGCAGCCG 1621 GAGAACAACT ACAAGACCAC GCCTCCCGTG CTGGACTCCG ACGGCTCCTT CTTCCTCTAC 1681 AGCAAGCTCA CCGTGGACAA GAGCAGGTGG CAGCAGGGGA ACGTCTTCTC ATGCTCCGTG 1741 ATGCATGAGG CTCTGCACAA CCACTACACG CAGAAGAGCC TCTCCCTGTC TCCGGGTAAA 1801 TGAGTGCGAC GGCCGGCAAG CCCCGCTCCC CGGGCTCTCG CGGTCGCACG AGGATGCTTG 1861 GCACGTACCC CCTGTACATA CTTCCCGGGC GCCCAGCATG GAAATAAAGC ACCCAGCGCT 1921 GCCCTGGGCC CCTGCGAGAC TGTGATGGTT CTTTCCACGG GTCAGGCCGA GTCTGAGGCC 1981 TGAGTGGCAT GAGGGAGGCA GAGCGGGTC // LOCUS HUMIGCC8 2533 bp ds-DNA PRI 20-MAY-1987 DEFINITION Human Ig germline G-E-A region A: alpha-1 constant region. ACCESSION J00220 KEYWORDS alpha-immunoglobulin; constant region; germline; immunoglobulin; immunoglobulin heavy chain. SEGMENT 8 of 8 SOURCE Human genomic DNA, cosmid Ig13 [1],[2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 754 to 1033) AUTHORS Flanagan,J.G. and Rabbitts,T.H. TITLE Arrangement of human immunoglobulin heavy chain constant region genes implies evolutionary duplication of a segment containing gamma, epsilon and alpha genes JOURNAL Nature 300, 709-713 (1982) STANDARD full staff_review REFERENCE 2 (bases 1 to 2533) AUTHORS Flanagan,J.G., Lefranc,M.-P. and Rabbitts,T.H. TITLE Mechanisms of divergence and convergence of the human immunoglobulin alpha1 and alpha2 constant region gene sequences JOURNAL Cell 36, 681-688 (1984) STANDARD full staff_review COMMENT [2] also reports the complete alpha-2 (A2m(1) allele) and part of the A2m(2) allele. Comparison of the three sequences suggests that the A2m(1) alpha-2 gene may be a hybrid of the alpha-1 and A2m(2) alpha-2 gene. The hinge region in the alpha genes occurs at the beginning of the CH2 domain. The alpha-1 hinge region is 13 amino acids longer than that in alpha-2. Both hinge regions consist of approximate tandem repeats of a 15 bp sequence. The first repeat occurs 5' to the mRNA splice site and is non-coding. The authors [2] suggest that this repetitive structure provides a possible mechanism for the large number of variations observed in hinge regions. There is a coupled 30 bp deletion, 9 bp insertion in alpha-1 relative to alpha-2 (starting at base 97), which the authors believe was caused by slipped mispairing. [1] also reports the alpha-2 CH2 domain and a pseudo-epsilon gene. This entry is part of a multigene region (region A) containing the gamma-3, gamma-1, pseudo-epsilon, and alpha-1 genes. See segment 1 for more comments. FEATURES from to/span description pept / 142 447 Ig alpha-1 heavy chain CH1 domain /nomgen="IGHA1" /map="14q32.3" /hgml_locus_uid="LR0097X" 662 1021 Ig alpha-1 heavy chain CH2 domain 1244 1638 Ig alpha-1 heavy chain CH3 domain IVS < 1 141 alpha-1 intron J-C IVS 448 661 alpha-1 intron A IVS 1022 1243 alpha-1 intron B BASE COUNT 490 A 866 C 753 G 424 T ORIGIN about 11 kb after ; XhoI site on chromosome 14q32.3. 1 GGTCCAACTG CAGGCCTGTG GTGCAGGAGC TGTGTGACCA TGGGGCTGTC ACCAGGCCTC 61 TCTGTGCTGG GTTCCTCCAG TATAGAGGAG AGGCAGTATA GAGGAGAGGG CCGCGTCCTC 121 ACAGTGCATT CTGTGTTCCA GCATCCCCGA CCAGCCCCAA GGTCTTCCCG CTGAGCCTCT 181 GCAGCACCCA GCCAGATGGG AACGTGGTCA TCGCCTGCCT GGTCCAGGGC TTCTTCCCCC 241 AGGAGCCACT CAGTGTGACC TGGAGCGAAA GCGGACAGGG CGTGACCGCC AGAAACTTCC 301 CACCCAGCCA GGATGCCTCC GGGGACCTGT ACACCACGAG CAGCCAGCTG ACCCTGCCGG 361 CCACACAGTG CCTAGCCGGC AAGTCCGTGA CATGCCACGT GAAGCACTAC ACGAATCCCA 421 GCCAGGATGT GACTGTGCCC TGCCCAGGTC AGAGGGCAGG CTGGGGAGTG GGGCGGGGCC 481 ACCCCGTCGT GCCCTGACAC TGCGCCTGCA CCCGTGTTCC CCACAGGGAG CCGCCCCTTC 541 ACTCACACCA GAGTGGACCC CGGGCCGAGC CCCAGGAGGT GGTGGTGGAC AGGCCAGGAG 601 GGGCGAGGCG GGGGCATGGG GAAGTATGTG CTGACCAGCT CAGGCCATCT CTCCACTCCA 661 GTTCCCTCAA CTCCACCTAC CCCATCTCCC TCAACTCCAC CTACCCCATC TCCCTCATGC 721 TGCCACCCCC GACTGTCACT GCACCGACCG GCCCTCGAGG ACCTGCTCTT AGGTTCAGAA 781 GCGAACCTCA CGTGCACACT GACCGGCCTG AGAGATGCCT CAGGTGTCAC CTTCACCTGG 841 ACGCCCTCAA GTGGGAAGAG CGCTGTTCAA GGACCACCTG AGCGTGACCT CTGTGGCTGC 901 TACAGCGTGT CCAGTGTCCT GCCGGGCTGT GCCGAGCCAT GGAACCATGG GAAGACCTTC 961 ACTTGCACTG CTGCCTACCC CGAGTCCAAG ACCCCGCTAA CCGCCACCCT CTCAAAATCC 1021 GGTGGGTCCA GACCCTGCTC GGGGCCCTGC TCAGTGCTCT GGTTTGCAAA GCATATTCCT 1081 GGCCTGCCTC CTCCCTCCCA ATCCTGGGCT CCAGTGCTCA TGCCAAGTAC ACAGGGAAAC 1141 TGAGGCAGGC TGAGGGGCCA GGACACAGCC CGGGGTGCCC ACCAGAGCAG AGGGGCTCTC 1201 TCATCCCCTG CCCAGCCCCC TGACCTGGCT CTCTACCCTC CAGGAAACAC ATTCCGGCCC 1261 GAGGTCCACC TGCTGCCGCC GCCGTCGGAG GAGCTGGCCC TGAACGAGCT GGTGACGCTG 1321 ACGTGCCTGG CACGCGGCTT CAGCCCCAAG GACGTGCTGG TTCGCTGGCT GCAGGGGTCA 1381 CAGGAGCTGC CCCGCGAGAA GTACCTGACT TGGGCATCCC GGCAGGAGCC CAGCCAGGGC 1441 ACCACCACCT TCGCTGTGAC CAGCATACTG CGCGTGGCAG CCGAGGACTG GAAGAAGGGG 1501 GACACCTTCT CCTGCATGGT GGGCCACGAG GCCCTGCCGC TGGCCTTCAC ACAGAAGACC 1561 ATCGACCGCT TGGCGGGTAA ACCCACCCAT GTCAATGTGT CTGTTGTCAT GGCGGAGGTG 1621 GACGGCACCT GCTACTGAGC CGCCCGCCTG TCCCCACCCC TGAATAAACT CCATGCTCCC 1681 CCAAGCAGCC CCACGCTTCC ATCCGGCGCC TGTCTGTCCA TCCTCAGGGT CTCAGCACTT 1741 GGGAAAGGGC CAGGGCATGG ACAGGGAAGA ATACCCCCTG CCCTGAGCCT CGGGGGGCCC 1801 CTGGCACCCC CATGAGACTT TCCACCCTGG TGTGAGTGTG AGTTGTGAGT GTGAGAGTGT 1861 GTGGTGCAGG AGGCCTCGCT GGTGTGAGAT CTTAGGTCTG CCAAGGCAGG CACAGCCCAG 1921 GATGGGTTCT GAGAGACGCA CATGCCCCGG ACAGTTCTGA GTGAGCAGTG GCATGGCCGT 1981 TTGTCCCTGA GAGAGCCGCC TCTGGCTGTA GCTGGGAGGG AATAGGGAGG GTAAAAGGAG 2041 CAGGCTAGCC AAGAAAGGCG CAGGTAGTGG CAGGAGCGGC GAGGGAGTGA GGGGCTGGAC 2101 TCCAGGGCCC CACTGGGAGG ACAAGCTCCA GGAGGGCCCC ACCACCCTAG TGGGTGGGCC 2161 TCAGGACGTC CCACTGACGC ATGCAGGAAG GGGCACCTCC CCTTAACCAC ACTGCTCTGT 2221 ACGGGGCACG TGGGCACAGG TGCACACTCA CACTCACATA TATGCCTGAG CCCTGCAGGA 2281 GCGGAACGTT CACAGCCCAG ACCCAGTTCC AGAAAAGCCA GGGGAGTCCC CTCCCAAGCC 2341 CCCAAGCTCA GCCTGCTCCC CTAGGCCCCT CTGGCTTCCC TGTGTTTCCA CTGTGCACAG 2401 ATCAGGCACC AACTCCACAG ACCCCTCCCA GGCAGCCCCT GCTCCCTGCC TGGCCAAGTC 2461 TCCCATCCCT TCCTAAGCCC AACTAGGACC CAAAGCATAG ACAGGGAGGG GCCACGTGGG 2521 GTGGCATCAG AAG // LOCUS HUMIGCD1 2009 bp ds-DNA PRI 20-MAY-1987 DEFINITION Human Ig germline H-chain G-E-A region B: gamma-2 constant region. ACCESSION J00230 V00554 KEYWORDS constant region; gamma-immunoglobulin; germline; hinge exon; immunoglobulin; immunoglobulin heavy chain. SEGMENT 1 of 7 SOURCE Human fetal liver DNA, library of T. Maniatis [2] and Lawn et al [1],[3]; clones p-gamma-2RPA3 [1], 5A [2], and Ig-gamma-2-15 [3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 896 to 1256; 1749 to 1937) AUTHORS Krawinkel,U. and Rabbitts,T.H. TITLE Comparison of the hinge-coding segments in human immunoglobulin gamma heavy chain genes and the linkage of the gamma 2 and gamma 4 subclass genes JOURNAL EMBO J. 1, 403-407 (1982) STANDARD full staff_review REFERENCE 2 (bases 1 to 2009) AUTHORS Ellison,J. and Hood,L. TITLE Linkage and sequence homology of two human immunoglobulin gamma heavy chain constant region genes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 79, 1984-1988 (1982) STANDARD full staff_review REFERENCE 3 (bases 475 to 1071; 1179 to 1330; 1461 to 1524) AUTHORS Takahashi,N., Ueda,S., Obata,M., Nikaido,T., Nakai,S. and Honjo,T. TITLE Structure of human immunoglobulin gamma genes: Implications for evolution of a gene family JOURNAL Cell 29, 671-679 (1982) STANDARD full staff_review COMMENT [1] also reports sequences for gamma-3, gamma-4, and a gamma pseudogene. Most of this sequence is 95% homologous with gamma-4. The hinge exons are only 70% homologous. The authors estimate that gamma-2 and gamma-4 diverged 6.6 million years ago. The authors in [3] speculate that intron-mediated domain transfer played an important role in the evolution of human gamma genes. They also report the hinge regions of gamma-1, gamma-3, gamma-4, and a pseudo-gamma gene. [3] estimates the divergence of the human gamma genes to be between 7.7 and 4.4 million years ago. This entry is part of a multigene region containing the gamma-2, gamma-4, epsilon-1, and alpha-2 genes. The relative locations of the four genes were determined by Flanagan and Rabbitts (Nature 300, 709-713 (1982)). They refer to this gene group as region B. The region A genes are gamma-3, gamma-1, pseudo-epsilon, alpha-1. Flanagan and Rabbits also determined the general locations of the two regions. They place region A between the JH/mu/delta region and region B. FEATURES from to/span description pept / 216 509 Ig gamma-2 constant region CH1 domain (aa at 218) /nomgen="IGHG2" /map="14q32.3" /hgml_locus_uid="LB0153J" 902 937 Ig gamma-2 constant region hinge exon 1056 1382 Ig gamma-2 constant region CH2 domain 1480 1802 Ig gamma-2 constant region CH3 domain IVS < 1 215 gamma-2 intron J-C IVS 510 901 gamma-2 intron A IVS 938 1055 gamma-2 intron B IVS 1383 1479 gamma-2 intron C conflict 537 537 t in [2]; c in [3] conflict 550 551 cc in [2]; c in [3] conflict 570 570 t in [2]; c in [3] conflict 777 778 aa in [2]; a in [3] conflict 791 791 a in [2]; aa in [3] conflict 864 864 c in [2]; t in [3] conflict 1433 1435 acc in [2]; gc in [3] conflict 1493 1493 a in [2]; g in [3] conflict 1802 1806 agtgc in [2]; aatc in [1] conflict 1814 1815 gg in [2]; g in [1] conflict 1825 1825 g in [2]; a in [1] conflict 1844 1853 tcgcgtgagg in [2]; gcacgtgagc in [1] conflict 1890 1890 c in [2]; g in [1] conflict 1909 1918 gcacccagcg in [2]; gacccagg in [1] conflict 1929 1932 cccc in [2]; ccc in [1] BASE COUNT 410 A 700 C 568 G 331 T ORIGIN HindIII site on chromosome 14q32.3. 1 AGCTTTCTGG GGCGAGCCGG GCCTGACTTT GGCTTTGGGG CAGGGAGTGG GCTAAGGTGA 61 GGCAGGTGGC GCCAGCCAGG TGCACACCCA ATGCCCGTGA GCCCAGACAC TGGACCCTGC 121 CTGGACCCTC GTGGATAGAC AAGAACCGAG GGGCCTCTGC GCCTGGGCCC AGCTCTGTCC 181 CACACCGCGG TCACATGGCA CCACCTCTCT TGCAGCCTCC ACCAAGGGCC CATCGGTCTT 241 CCCCCTGGCG CCCTGCTCCA GGAGCACCTC CGAGAGCACA GCCGCCCTGG GCTGCCTGGT 301 CAAGGACTAC TTCCCCGAAC CGGTGACGGT GTCGTGGAAC TCAGGCGCTC TGACCAGCGG 361 CGTGCACACC TTCCCAGCTG TCCTACAGTC CTCAGGACTC TACTCCCTCA GCAGCGTGGT 421 GACCGTGCCC TCCAGCAACT TCGGCACCCA GACCTACACC TGCAACGTAG ATCACAAGCC 481 CAGCAACACC AAGGTGGACA AGACAGTTGG TGAGAGGCCA GCTCAGGGAG GGAGGGTGTC 541 TGCTGGAAGC CAGGCTCAGC CCTCCTGCCT GGACGCACCC CGGCTGTGCA GCCCCAGCCC 601 AGGGCAGCAA GGCAGGCCCC ATCTGTCTCC TCACCCGGAG GCCTCTGCCC GCCCCACTCA 661 TGCTCAGGGA GAGGGTCTTC TGGCTTTTTC CACCAGGCTC CAGGCAGGCA CAGGCTGGGT 721 GCCCCTACCC CAGGCCCTTC ACACACAGGG GCAGGTGCTT GGCTCAGACC TGCCAAAAGC 781 CATATCCGGG AGGACCCTGC CCCTGACCTA AGCCGACCCC AAAGGCCAAA CTGTCCACTC 841 CCTCAGCTCG GACACCTTCT CTCCTCCCAG ATCCGAGTAA CTCCCAATCT TCTCTCTGCA 901 GAGCGCAAAT GTTGTGTCGA GTGCCCACCG TGCCCAGGTA AGCCAGCCCA GGCCTCGCCC 961 TCCAGCTCAA GGCGGGACAG GTGCCCTAGA GTAGCCTGCA TCCAGGGACA GGCCCCAGCT 1021 GGGTGCTGAC ACGTCCACCT CCATCTCTTC CTCAGCACCA CCTGTGGCAG GACCGTCAGT 1081 CTTCCTCTTC CCCCCAAAAC CCAAGGACAC CCTCATGATC TCCCGGACCC CTGAGGTCAC 1141 GTGCGTGGTG GTGGACGTGA GCCACGAAGA CCCCGAGGTC CAGTTCAACT GGTACGTGGA 1201 CGGCGTGGAG GTGCATAATG CCAAGACAAA GCCACGGGAG GAGCAGTTCA ACAGCACGTT 1261 CCGTGTGGTC AGCGTCCTCA CCGTTGTGCA CCAGGACTGG CTGAACGGCA AGGAGTACAA 1321 GTGCAAGGTC TCCAACAAAG GCCTCCCAGC CCCCATCGAG AAAACCATCT CCAAAACCAA 1381 AGGTGGGACC CGCGGGGTAT GAGGGCCACA TGGACAGAGG CCGGCTCGGC CCACCCTCTG 1441 CCCTGGGAGT GACCGCTGTG CCAACCTCTG TCCCTACAGG GCAGCCCCGA GAACCACAGG 1501 TGTACACCCT GCCCCCATCC CGGGAGGAGA TGACCAAGAA CCAGGTCAGC CTGACCTGCC 1561 TGGTCAAAGG CTTCTACCCC AGCGACATCG CCGTGGAGTG GGAGAGCAAT GGGCAGCCGG 1621 AGAACAACTA CAAGACCACA CCTCCCATGC TGGACTCCGA CGGCTCCTTC TTCCTCTACA 1681 GCAAGCTCAC CGTGGACAAG AGCAGGTGGC AGCAGGGGAA CGTCTTCTCA TGCTCCGTGA 1741 TGCATGAGGC TCTGCACAAC CACTACACGC AGAAGAGCCT CTCCCTGTCT CCGGGTAAAT 1801 GAGTGCCACG GCCGGCAAGC CCCCGCTCCC CAGGCTCTCG GGGTCGCGTG AGGATGCTTG 1861 GCACGTACCC CGTGTACATA CTTCCCAGGC ACCCAGCATG GAAATAAAGC ACCCAGCGCT 1921 GCCCTGGGCC CCTGCGAGAC TGTGATGGTT CTTTCCGTGG GTCAGGCCGA GTCTGAGGCC 1981 TGAGTGGCAT GAGGGAGGCA GAGTGGGTC // LOCUS HUMIGCD2 2028 bp ds-DNA PRI 20-MAY-1987 DEFINITION Human Ig germline H-chain G-E-A region B: gamma-4 constant region. ACCESSION K01316 KEYWORDS constant region; gamma-immunoglobulin; germline; hinge exon; immunoglobulin; immunoglobulin heavy chain. SEGMENT 2 of 7 SOURCE Human fetal liver DNA, library of T. Maniatis [2] and Lawn et al [1],[3]; clones 24B [1], lambda-HG4.1 [2], and Ig-gamma-4-2 [3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2028) AUTHORS Ellison,J., Buxbaum,J. and Hood,L. TITLE Nucleotide sequence of a human immunoglobulin C-gamma-4 gene JOURNAL DNA 1, 11-18 (1981) STANDARD full staff_review REFERENCE 2 (bases 894 to 1106) AUTHORS Krawinkel,U. and Rabbitts,T.H. TITLE Comparison of the hinge-coding segments in human immunoglobulin gamma heavy chain genes and the linkage of the gamma 2 and gamma 4 subclass genes JOURNAL EMBO J. 1, 403-407 (1982) STANDARD full staff_review REFERENCE 3 (bases 475 to 1069; 1180 to 1331; 1432 to 1655) AUTHORS Takahashi,N., Ueda,S., Obata,M., Nikaido,T., Nakai,S. and Honjo,T. TITLE Structure of human immunoglobulin gamma genes: Implications for evolution of a gene family JOURNAL Cell 29, 671-679 (1982) STANDARD full staff_review COMMENT [1] reports that the human C-gamma-4 gene is equally homologous to the mouse gamma-1, gamma-2a, and gamma-2b genes (about 75%). [2] also reports partial sequences for human gamma-2, gamma-3, and a gamma pseudogene. [3] presents the gamma-1, gamma-2, gamma-3, and pseudo-gamma hinge regions. This entry is part of a multigene region (region B), which includes the gamma-2, gamma-4, epsilon-1, and alpha-2 genes. See segment 1 for more comments. FEATURES from to/span description pept / 216 509 Ig gamma-4 constant region CH1 domain (aa at 218) /nomgen="IGHG4" /map="14q32.3" /hgml_locus_uid="LL0164P" 900 935 Ig gamma-4 constant region hinge region 1054 1383 Ig gamma-4 constant region CH2 domain 1481 1803 Ig gamma-4 constant region CH3 domain IVS < 1 215 gamma-4 intron J-C IVS 510 899 gamma-4 intron A IVS 936 1053 gamma-4 intron B IVS 1384 1480 gamma-4 intron C conflict 481 481 c in [1]; g in [3] conflict 551 552 ca in [1]; c in [2] conflict 702 707 cggcac in [1]; ggcggc in [3] conflict 747 749 ggg in [1]; gggg in [3] conflict 754 755 tg in [1]; gt in [3] conflict 773 774 aa in [1]; a in [3] conflict 862 862 c in [1]; t in [3] conflict 1022 1025 tgct in [1]; tcctt in [2] conflict 1046 1046 t in [1]; tc in [2] conflict 1434 1435 ac in [1]; a in [3] BASE COUNT 421 A 709 C 567 G 331 T ORIGIN about 19 kb after ; HindIII site on chromosome 14q32.3. 1 AGCTTTCTGG GGCAGGCCGG GCCTGACTTT GGCTGGGGGC AGGGAGGGGG CTAAGGTGAC 61 GCAGGTGGCG CCAGCCAGGT GCACACCCAA TGCCCATGAG CCCAGACACT GGACCCTGCA 121 TGGACCATCG CGGATAGACA AGAACCGAGG GGCCTCTGCG CCCTGGGCCC AGCTCTGTCC 181 CACACCGCGG TCACATGGCA CCACCTCTCT TGCAGCTTCC ACCAAGGGCC CATCCGTCTT 241 CCCCCTGGCG CCCTGCTCCA GGAGCACCTC CGAGAGCACA GCCGCCCTGG GCTGCCTGGT 301 CAAGGACTAC TTCCCCGAAC CGGTGACGGT GTCGTGGAAC TCAGGCGCCC TGACCAGCGG 361 CGTGCACACC TTCCCGGCTG TCCTACAGTC CTCAGGACTC TACTCCCTCA GCAGCGTGGT 421 GACCGTGCCC TCCAGCAGCT TGGGCACGAA GACCTACACC TGCAACGTAG ATCACAAGCC 481 CAGCAACACC AAGGTGGACA AGAGAGTTGG TGAGAGGCCA GCACAGGGAG GGAGGGTGTC 541 TGCTGGAAGC CAGGCTCAGC CCTCCTGCCT GGACGCACCC CGGCTGTGCA GCCCCAGCCC 601 AGGGCAGCAA GGCATGCCCC ATCTGTCTCC TCACCCGGAG GCCTCTGACC ACCCCACTCA 661 TGCTCAGGGA GAGGGTCTTC TGGATTTTTC CACCAGGCTC CCGGCACCAC AGGCTGGATG 721 CCCCTACCCC AGGCCCTGCG CATACAGGGC AGGTGCTGCG CTCAGACCTG CCAAGAGCCA 781 TATCCGGGAG GACCCTGCCC CTGACCTAAG CCCACCCCAA AGGCCAAACT CTCCACTCCC 841 TCAGCTCAGA CACCTTCTCT CCTCCCAGAT CTGAGTAACT CCCAATCTTC TCTCTGCAGA 901 GTCCAAATAT GGTCCCCCAT GCCCATCATG CCCAGGTAAG CCAACCCAGG CCTCGCCCTC 961 CAGCTCAAGG CGGGACAGGT GCCCTAGAGT AGCCTGCATC CAGGGACAGG CCCCAGCCGG 1021 GTGCTGACGC ATCCACCTCC ATCTCTTCCT CAGCACCTGA GTTCCTGGGG GGACCATCAG 1081 TCTTCCTGTT CCCCCCAAAA CCCAAGGACA CTCTCATGAT CTCCCGGACC CCTGAGGTCA 1141 CGTGCGTGGT GGTGGACGTG AGCCAGGAAG ACCCCGAGGT CCAGTTCAAC TGGTACGTGG 1201 ATGGCGTGGA GGTGCATAAT GCCAAGACAA AGCCGCGGGA GGAGCAGTTC AACAGCACGT 1261 ACCGTGTGGT CAGCGTCCTC ACCGTCCTGC ACCAGGACTG GCTGAACGGC AAGGAGTACA 1321 AGTGCAAGGT CTCCAACAAA GGCCTCCCGT CCTCCATCGA GAAAACCATC TCCAAAGCCA 1381 AAGGTGGGAC CCACGGGGTG CGAGGGCCAC ACGGACAGAG GCCAGCTCGG CCCACCCTCT 1441 GCCCTGGGAG TGACCGCTGT GCCAACCTCT GTCCCTACAG GGCAGCCCCG AGAGCCACAG 1501 GTGTACACCC TGCCCCCATC CCAGGAGGAG ATGACCAAGA ACCAGGTCAG CCTGACCTGC 1561 CTGGTCAAAG GCTTCTACCC CAGCGACATC GCCGTGGAGT GGGAGAGCAA TGGGCAGCCG 1621 GAGAACAACT ACAAGACCAC GCCTCCCGTG CTGGACTCCG ACGGCTCCTT CTTCCTCTAC 1681 AGCAGGCTAA CCGTGGACAA GAGCAGGTGG CAGGAGGGGA ATGTCTTCTC ATGCTCCGTG 1741 ATGCATGAGG CTCTGCACAA CCACTACACA CAGAAGAGCC TCTCCCTGTC TCTGGGTAAA 1801 TGAGTGCCAG GGCCGGCAAG CCCCCGCTCC CCGGGCTCTC GGGGTCGCGC GAGGATGCTT 1861 GGCACGTACC CCGTCTACAT ACTTCCCAGG CACCCAGCAT GGAAATAAAG CACCCACCAC 1921 TGCCCTGGGC CCCTGTGAGA CTGTGATGGT TCTTTCCACG GGTCAGGCCG AGTCTGAGGC 1981 CTGAGTGACA TGAGGGAGGC AGAGCGGGTC CCACTGTCCC CACACTGG // LOCUS HUMIGCD6 3389 bp ds-DNA PRI 20-MAY-1987 DEFINITION Human Ig germline H-chain G-E-A region B: epsilon-1 C-region. ACCESSION J00222 KEYWORDS constant region; epsilon-immunoglobulin; germline; immunoglobulin; immunoglobulin heavy chain. SEGMENT 6 of 7 SOURCE Human placenta DNA [1], clone 38A; 5' flank -- fetal liver DNA library (clone CH4A-H-Ig-epsilon-12) [2], C-region -- myeloma cell line 266b1 (clone CH4A-H-Ig-epsilon-11) [2]; fetal liver DNA, library of T. Maniatis [3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1112 to 3108) AUTHORS Max,E.E., Battey,J., Ney,R., Kirsch,I.R. and Leder,P. TITLE Duplication and deletion in the human immunoglobulin epsilon genes JOURNAL Cell 29, 691-699 (1982) STANDARD full staff_review REFERENCE 2 (bases 1 to 3389) AUTHORS Ueda,S., Nakai,S., Nishida,Y., Hisajima,H. and Honjo,T. TITLE Long terminal repeat-like elements flank a human immunoglobulin epsilon pseudogene that lacks introns JOURNAL EMBO J. 1, 1539-1544 (1982) STANDARD full staff_review REFERENCE 3 (bases 2032 to 3388) AUTHORS Hisajima,H., Hishida,Y., Nakai,S., Takahashi,N., Ueda,S. and Honjo,T. TITLE Structure of the human immunoglobulin C-epsilon-2 gene, a truncated pseudogene: Implications for its evolutionary origin JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 2995-2999 (1983) STANDARD full staff_review COMMENT The germline sequence in [2] is derived by combining the germline 5' flanking sequence with the rearranged C-region sequence. The sequence in [2] and [3] differs from the germline sequence [1] at 13 positions, all of which are in the flanking or intron regions except one (base 2774), which changes the deduced amino acid at that position from Trp to Leu. The authors state that this is consistent with the amino acid sequence of the myeloma protein. Also reported [2] is a rearranged pseudogene epsilon-3. [1] reports a germline pseudogene epsilon-2 and the alpha-1 and alpha-2 genes. [3] reports the epsilon-2 gene. This entry is part of a multigene region (region B), which includes the gamma-2, gamma-4, epsilon-1, and alpha-2 genes. See segment 1 for more comments. FEATURES from to/span description pept / 1210 1518 Ig epsilon-1 constant region CH1 domain (aa at 1212) /nomgen="IGHE" /map="14q32.3" /hgml_locus_uid="LD0142D" 1726 2046 Ig epsilon-1 constant region CH2 domain 2133 2456 Ig epsilon-1 constant region CH3 domain 2540 2871 Ig epsilon-1 constant region CH4 domain IVS < 1 1209 epsilon-1 intron J-C IVS 1519 1725 epsilon-1 intron A IVS 2047 2132 epsilon-1 intron B IVS 2457 2539 epsilon-1 intron C conflict 1146 1146 c in [2]; t in [1] conflict 1155 1155 c in [2]; t in [1] conflict 1337 1337 c in [2]; g in [1] conflict 1628 1629 ag in [2]; a in [1] conflict 1718 1718 t in [2]; a in [1] conflict 2351 2351 c in [1],[2]; g in [3] conflict 2416 2416 c in [1],[2]; a in [3] conflict 2660 2660 g in [2],[3]; t in [1] conflict 2682 2682 c in [2],[3]; t in [1] conflict 2754 2754 g in [2],[3]; t in [1] conflict 3002 3006 caggt in [2]; ccagggt in [3] (unknown in [1]) conflict 3007 3008 ac in [2],[3]; gt in [1] conflict 3066 3071 gccccc in [2],[3]; ggcccccc in [1] conflict 3107 3107 c in [2],[3]; t in [1] BASE COUNT 663 A 1034 C 1108 G 584 T ORIGIN about 3500 bp after on chromosome 14q32.3. 1 GCTGGACTGG GCTGAGCTAG ACTGGGCAAC TGTATTCAGC TGGGCTGGCC TGGCCTCGGC 61 TGGGCTGGGT TCAGCTGGGC TGGGCTGAGC AGGCCTGAGC AGGATTAGTT GAGCTGGTCG 121 TAACTGGATT TAACTAGCTA GGCTAGGCTT AACTGACTAA GCTGATCTGG ACTGTATTCA 181 TCTGGATGAA GCTGGGGTGA GGTGGCTACT TTAGGTCAGC TTTGCTGAGC TAAACTGGAC 241 CGGGCTAAAT TGATCTGGAC TGACCATTCT CACCTGGCTA AGAGGAGCTG AGTCAGAAGC 301 AAGCTGGTTG AGCTGGCTGG ACTGAAATAA GAGTTTGCTG CCTGCAAGGG GAGGTCCTGG 361 GCTGACCTGG GCCAGGCTGA ACCAGGCTGG CTTAGAGTGA ACTTCAGAGG GCGACTCCCC 421 CGGTAGGCCA GTCTCAGCTG AACTTGGCTG TCCCGGTGGG CAGAGCGGGG CTGGATACTG 481 TGATTTTGGG GGTACCTAGA GCAGACTTCA AGACCAAGCT AAACTGGGCT CCAGGGGCAG 541 GATGGGCTGG GGACTTGGGA CTCCAGGCCA GGGGCGAAGG GCCACGCTGT ACAGACCGCA 601 CTATCTGGGC CAGGGTTCTG TGGTGGGAGG GACTGACTGC CTGGGGCATC AGGGCAAGTC 661 TTCCCGCCCT CCCCTAGAGG TCAGGGGTGG GCAGAGCACC ATGGGGGTCT GGCAGGTCAG 721 GTGAGGGCTG CTGTGATGGG GAGATCCAGG CTTGGCACTC AAGAGCCCGA GGAGCTGAGA 781 CCACAGCCTT GGGGGGTTGG GGTCAGGGTT GGAGGGCAGG CAGACCATCC ACCATGAGCC 841 CAGAGAGAGT TTGAAGGGGG AGGGCTCTGG GGTCCGGCCC ATGGGGTCCC TGGGTTTCAG 901 CCTAGGGGCA TGGCCCAGTG TCTCTGCTCC TGAGTGCCCA CCGTGCAGCA CTTGCAGGGG 961 GAGGCTGGGG TCATCCTGGA GGCACCCCCC TTCCTGAGCC CAGCCTGATG ATAGTGGCTG 1021 AGCAACAGCT TCTGGTGGGG GAATGGGCCC TGGGAGCCGC CCTGGGCCTG GGGATTGTGG 1081 GGAAAAAGGC CCAGAATGAG CCTAGCCATC TGGATCCCTG CCACGGGGTC CCCAGCTCCC 1141 CCATCCAGGC CCCCCAGGCC TGATGGGCGC TGGCCTGAGG CTGGCACTGA CTAGGTTCTG 1201 TCCTCACAGC CTCCACACAG AGCCCATCCG TCTTCCCCTT GACCCGCTGC TGCAAAAACA 1261 TTCCCTCCAA TGCCACCTCC GTGACTCTGG GCTGCCTGGC CACGGGCTAC TTCCCGGAGC 1321 CGGTGATGGT GACCTGCGAC ACAGGCTCCC TCAACGGGAC AACTATGACC TTACCAGCCA 1381 CCACCCTCAC GCTCTCTGGT CACTATGCCA CCATCAGCTT GCTGACCGTC TCGGGTGCGT 1441 GGGCCAAGCA GATGTTCACC TGCCGTGTGG CACACACTCC ATCGTCCACA GACTGGGTCG 1501 ACAACAAAAC CTTCAGCGGT AAGAGAGGGC CAAGCTCAGA GACCACAGTT CCCAGGAGTG 1561 CCAGGCTGAG GGCTGGCAGA GTGGGCAGGG GTTGAGGGGG TGGGTGGGCT CAAACGTGGG 1621 AACACCCAGC ATGCCTGGGG ACCCGGGCCA GGACGTGGGG GCAAGAGGAG GGCACACAGA 1681 GCTCAGAGAG GCCAACAACC CTCATGACCA CCAGCTCTCC CCCAGTCTGC TCCAGGGACT 1741 TCACCCCGCC CACCGTGAAG ATCTTACAGT CGTCCTGCGA CGGCGGCGGG CACTTCCCCC 1801 CGACCATCCA GCTCCTGTGC CTCGTCTCTG GGTACACCCC AGGGACTATC AACATCACCT 1861 GGCTGGAGGA CGGGCAGGTC ATGGACGTGG ACTTGTCCAC CGCCTCTACC ACGCAGGAGG 1921 GTGAGCTGGC CTCCACACAA AGCGAGCTCA CCCTCAGCCA GAAGCACTGG CTGTCAGACC 1981 GCACCTACAC CTGCCAGGTC ACCTATCAAG GTCACACCTT TGAGGACAGC ACCAAGAAGT 2041 GTGCAGGTAC GTTCCCACCT GCCCTGGTGG CCGCCACGGA GGCCAGAGAA GAGGGGCGGG 2101 TGGGCCTCAC ACAGCCCTCC GGTGTACCAC AGATTCCAAC CCGAGAGGGG TGAGCGCCTA 2161 CCTAAGCCGG CCCAGCCCGT TCGACCTGTT CATCCGCAAG TCGCCCACGA TCACCTGTCT 2221 GGTGGTGGAC CTGGCACCCA GCAAGGGGAC CGTGAACCTG ACCTGGTCCC GGGCCAGTGG 2281 GAAGCCTGTG AACCACTCCA CCAGAAAGGA GGAGAAGCAG CGCAATGGCA CGTTAACCGT 2341 CACGTCCACC CTGCCGGTGG GCACCCGAGA CTGGATCGAG GGGGAGACCT ACCAGTGCAG 2401 GGTGACCCAC CCCCACCTGC CCAGGGCCCT CATGCGGTCC ACGACCAAGA CCAGCGGTGA 2461 GCCATGGGCA GGCCGGGGTC GTGGGGGAAG GGAGGGAGCG AGTGAGCGGG GCCCGGGCTG 2521 ACCCCACGTC TGGCCACAGG CCCGCGTGCT GCCCCGGAAG TCTATGCGTT TGCGACGCCG 2581 GAGTGGCCGG GGAGCCGGGA CAAGCGCACC CTCGCCTGCC TGATCCAGAA CTTCATGCCT 2641 GAGGACATCT CGGTGCAGTG GCTGCACAAC GAGGTGCAGC TCCCGGACGC CCGGCACAGC 2701 ACGACGCAGC CCCGCAAGAC CAAGGGCTCC GGCTTCTTCG TCTTCAGCCG CCTGGAGGTG 2761 ACCAGGGCCG AATGGGAGCA GAAAGATGAG TTCATCTGCC GTGCAGTCCA TGAGGCAGCG 2821 AGCCCCTCAC AGACCGTCCA GCGAGCGGTG TCTGTAAATC CCGGTAAATG ACGTACTCCT 2881 GCCTCCCTCC CTCCCAGGGC TCCATCCAGC TGTGCAGTGG GGAGGACTGG CCAGACCTTC 2941 TGTCCACTGT TGCAATGACC CCAGGAAGCT ACCCCCAATA AACTGTGCCT GCTCAGAGCC 3001 CCAGGTACAC CCATTCTTGG GAGCGGGCAG GGCTGTGGGC AGGTGCATCT TGGCACAGAG 3061 GAATGGCCCC CAGGAGGGGC AGTGGGAGGA GGTGGGCAGG GCTGAGCCCC CCCTGGAGAG 3121 GCGGTGGGAG GAGGTGGGCA GGGCTGAGGT GCCACTCATC CATCTGCCTT CGTGTCAGGG 3181 TTATTTGTCA AACAGCATAT CTGCAGGGAC TCATCACAGC TACCCCGGGC CCTCTCTGCC 3241 CCCACTCTGG GTCTACCCCC TCCAAGGAGT CCAAAGACCC AGGGGAGGTC CTCAGGGAAG 3301 GGGCAAGGGA GCCCACAGCC CTCTCTCTTG GGGGCTTGGC TTCTACCCCC CTGGACAGGA 3361 GCCCCTGCAC CCCCAGGTAT AGATGGGCA // LOCUS HUMIGCD7 2516 bp ds-DNA PRI 20-MAY-1987 DEFINITION Human Ig germline H-chain G-E-A region B: alpha-2 A2m(1) allele, constant region. ACCESSION J00221 KEYWORDS alpha-immunoglobulin; constant region; germline; immunoglobulin; immunoglobulin heavy chain. SEGMENT 7 of 7 SOURCE Human genomic DNA, cosmid Ig10 [1],[3]; placenta DNA [2] clone H-Ig-alpha-25; genomic DNA from TOU II-5 library clone lambda-TOU-alpha2 (for A2m(2) allele) [3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 737 to 1016) AUTHORS Flanagan,J.G. and Rabbitts,T.H. TITLE Arrangement of human immunoglobulin heavy chain constant region genes implies evolutionary duplication of a segment containing gamma, epsilon and alpha genes JOURNAL Nature 300, 709-713 (1982) STANDARD full staff_review REFERENCE 2 (bases 49 to 229; 425 to 514) AUTHORS Hisajima,H., Nishida,Y., Nakai,S., Takahashi,N., Ueda,S. and Honjo,T. TITLE Structure of the human immunoglobulin C-epsilon-2 gene, a truncated pseudogene: Implications for its evolutionary origin JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80, 2995-2999 (1983) STANDARD full staff_review REFERENCE 3 (bases 1 to 2516) AUTHORS Flanagan,J.G., Lefranc,M.-P. and Rabbitts,T.H. TITLE Mechanisms of divergence and convergence of the human immunoglobulin alpha-1 and alpha-2 constant region gene sequences JOURNAL Cell 36, 681-688 (1984) STANDARD full staff_review COMMENT [3] also reports the complete alpha-1 gene and part of the A2m(2) alpha-2 allele (bases 737-2516; see Sites table). Comparison of the three sequences suggests that the A2m(1) alpha-2 allele might be a hybrid of the alpha-1 gene and A2m(2) alpha-2 allele. The hinge region in the alpha genes occurs at beginning of the CH2 domain. The alpha-1 hinge region is 13 amino acids longer than that in alpha-2. Both hinge regions consist of approximate tandem repeats of a 15 bp sequence. The first repeat occurs 5' to the mRNA splice site and is non-coding. The authors [3] suggest that this repetitive structure provides a possible mechanism for the large number of variations observed in hinge regions. There is a coupled 30 bp insertion, 9 bp deletion in alpha-2 relative to alpha-1 (starting at base 97). [1] also reports sequences for the epsilon-1 and epsilon-2 (pseudogene) C-region genes. The authors [1] determined the physical linkage between epsilon-1 and alpha-2 and that between epsilon-2 and alpha-1. [2] also reports the alpha-1 CH2 domain and epsilon-2. This entry is part of a multigene region (region B), which includes the gamma-2, gamma-4, epsilon-1 and alpha-2 genes. See segment 1 for more comments. FEATURES from to/span description pept / 164 469 Ig alpha-2 H-chain constant region CH1 domain (aa at 166) /nomgen="IGHA2" /map="14q32.3" /hgml_locus_uid="LK0135P" 684 1004 Ig alpha-2 H-chain constant region CH2 domain 1227 1621 Ig alpha-2 H-chain constant region CH3 domain IVS < 1 163 alpha-2 intron J-C IVS 470 683 alpha-2 intron A IVS 1005 1226 alpha-2 intron B conflict 151 154 ttct in [3]; cccc in [2] conflict 513 514 gc in [3]; ng in [2] allele 1434 1434 t in A2m(1); a in A2m(2) allele 1441 1441 g in A2m(1); a in A2m(2) allele 1465 1465 c in A2m(1); t in A2m(2) allele 1486 1486 c in A2m(1); g in A2m(2) allele 1553 1553 t in A2m(1); a in A2m(2) allele 1573 1574 tg in A2m(1); ca in A2m(2) allele 1602 1606 tggac in A2m(1); cggat in A2m(2) allele 2060 2060 c in A2m(1); t in A2m(2) allele 2384 2384 a in A2m(1); c in A2m(2) allele 2390 2390 c in A2m(1); g in A2m(2) BASE COUNT 488 A 861 C 754 G 413 T ORIGIN about 13 kb after on chromosome 14q32.3 1 GGTCCAACCG CAGGCCCATG GTGCAGGAGC TGTGTAACCT ATGGGGCTGT CACCAGGCCT 61 CTCTGTGCTG GGTTCCTCCA GTGTAGAGGA GAGGCAGGTA CAGCCTGTCC TCCTGGGGAC 121 ATGGCATGAG GGCCGCGTCC TCACAGCGCA TTCTGTGTTC CAGCATCCCC GACCAGCCCC 181 AAGGTCTTCC CGCTGAGCCT CGACAGCACC CCCCAAGATG GGAACGTGGT CGTCGCATGC 241 CTGGTCCAGG GCTTCTTCCC CCAGGAGCCA CTCAGTGTGA CCTGGAGCGA AAGCGGACAG 301 AACGTGACCG CCAGAAACTT CCCACCTAGC CAGGATGCCT CCGGGGACCT GTACACCACG 361 AGCAGCCAGC TGACCCTGCC GGCCACACAG TGCCCAGACG GCAAGTCCGT GACATGCCAC 421 GTGAAGCACT ACACGAATCC CAGCCAGGAT GTGACTGTGC CCTGCCCAGG TCAGAGGGCA 481 GGCTGGGGAG TGGGGCGGGG CCACCCCGTC CTGCCCTGAC ACTGCGCCTG CACCCGTGTT 541 CCCCACAGGG AGCCGCCCCT TCACTCACAC CAGAGTGGAC CCCGGGCCGA GCCCCAGGAG 601 GTGGTGGTGG ACAGGCCAGG AGGGGCGAGG CGGGGGCACG GGGAAGGGCG TTCTGACCAG 661 CTCAGGCCAT CTCTCCACTC CAGTTCCCCC ACCTCCCCCA TGCTGCCACC CCCGACTGTC 721 GCTGCACCGA CCGGCCCTCG AGGACCTGCT CTTAGGTTCA GAAGCGAACC TCACGTGCAC 781 ACTGACCGGC CTGAGAGATG CCTCTGGTGC CACCTTCACC TGGACGCCCT CAAGTGGGAA 841 GAGCGCTGTT CAAGGACCAC CTGAGCGTGA CCTCTGTGGC TGCTACAGCG TGTCCAGTGT 901 CCTGCCTGGC TGTGCCCAGC CATGGAACCA TGGGGAGACC TTCACCTGCA CTGCTGCCCA 961 CCCCGAGTTG AAGACCCCAC TAACCGCCAA CATCACAAAA TCCGGTGGGT CCAGACCCTG 1021 CTCGGGGCCC TGCTCAGTGC TCTGGTTTGC AAAGCATATT CCCGGCCTGC CTCCTCCCTC 1081 CCAATCCTGG GCTCCAGTGC TCATGCCAAG TACACAGGGA AACTGAGGCA GGCTGAGGGG 1141 CCAGGACACA GCCCAGGGTG CCCACCAGAG CAGAGGGGCT CTCTCATCCC CTGCCCAGCC 1201 CCCTGACCTG GCTCTCTACC CTCCAGGAAA CACATTCCGG CCCGAGGTCC ACCTGCTGCC 1261 GCCGCCGTCG GAGGAGCTGG CCCTGAACGA GCTGGTGACG CTGACGTGCC TGGCACGTGG 1321 CTTCAGCCCC AAGGATGTGC TGGTTCGCTG GCTGCAGGGG TCACAGGAGC TGCCCCGCGA 1381 GAAGTACCTG ACTTGGGCAT CCCGGCAGGA GCCCAGCCAG GGCACCACCA CCTTCGCTGT 1441 GACCAGCATA CTGCGCGTGG CAGCCGAGGA CTGGAAGAAG GGGGACACCT TCTCCTGCAT 1501 GGTGGGCCAC GAGGCCCTGC CGCTGGCCTT CACACAGAAG ACCATCGACC GCTTGGCGGG 1561 TAAACCCACC CATGTCAATG TGTCTGTTGT CATGGCGGAG GTGGACGGCA CCTGCTACTG 1621 AGCCGCCCGC CTGTCCCCAC CCCTGAATAA ACTCCATGCT CCCCCAAGCA GCCCCACGCT 1681 TCCATCCGGC GCCTGTCTGT CCATCCTCAG GGTCTCAGCA CTTGGGAAAG GGCCAGGGCA 1741 TGGACAGGGA AGAATACCCC CTGCCCTGAG CCTCGGGGGG CCCCTGGCAC CCCCATGAGA 1801 CTTTCCACCC TGGTGTGAGT GTGAGTTGTG AGTGTGAGAG TGTGTGGTGC AGGAGGCCTC 1861 GCTGGTGTGA GATCTTAGGT CTGCCAAGGC AGGCACAGCC CAGGATGGGT TCTGAGAGAC 1921 GCACATGCCC CGGACAGTTC TGAGTGAGCA GTGGCATGGC CGTTTGTCCC TGAGAGAGCC 1981 GCCTCTGGCT GTAGCTGGGA GGGAATAGGG AGGGTAAAAG GAGCAGGCTA GCCAAGAAAG 2041 GCGCAGGTAG TGGCAGGAGC GGCGAGGGAG TGAGGGGCTG GACTCCAGGG CCCCACTGGG 2101 AGGACAAGCT CCAGGAGGGC CCCACCACCC TAGTGGGTGG GCCTCAGGAC GTCCCACTGA 2161 CGCATGCAGG AAGGGGCACC TCCCCTTAAC CACACTGCTC TGTACGGGGC ACGTGGGCAC 2221 ACATGCACAC TCACACTCAC ATATACGCCT GAGCCCTGCA GGAGTGGAAC GTTCACAGCC 2281 CAGACCCAGT TCCAGAAAAG CCAGGGGAGT CCCCTCCCAA GCCCCCAAGC TCAGCCTGCT 2341 CCCCCAGGCC CCTCTGGCTT CCCTGTGTTT CCACTGTGCA CAGATCAGGC ACCAACTCCA 2401 CAGACCCCTC CCAGGCAGCC CCTGCTCCCT GCCTGGCCAA GTCTCCCATC CCTTCCTAAG 2461 CCCAACTAGG ACCCAAAGCA TAGACAGGGA GGGGCCGCGT GGGGTGGCAT CAGAAG // LOCUS HUMIGCE 641 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human Ig germline H-chain pseudo-gamma CH1, hinge region, CH2. ACCESSION K01320 KEYWORDS constant region; gamma-immunoglobulin; germline; immunoglobulin; immunoglobulin heavy chain; pseudogene. SOURCE Human fetal liver DNA, library of T. Maniatis, clone lambda-HG12.2 [1] and library of Lawn et al, clone Ig-gamma-11 [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 417 to 641) AUTHORS Krawinkel,U. and Rabbitts,T.H. TITLE Comparison of the hinge-coding segments in human immunoglobulin gammma heavy chain genes and the linkage of the gamma 2 and gamma 4 subclass genes JOURNAL EMBO J. 1, 403-407 (1982) STANDARD full staff_review REFERENCE 2 (bases 1 to 607) AUTHORS Takahashi,N., Ueda,S., Obata,M., Nikaido,T., Nakai,S. and Honjo,T. TITLE Structure of human immunoglobulin gamma genes: Implications for evolution of a gene family JOURNAL Cell 29, 671-679 (1982) STANDARD full staff_review COMMENT This is a presumptive gamma pseudogene. The hinge region (bases 423-473) is related to the first hinge region of gamma-3. Also reported [1] are the hinge regions of gamma-2, gamma-3, and gamma-4. FEATURES from to/span description pept.ps < 1 35 pseudo-Ig gamma, exon 1 /hgml_locus_uid="LM0118M" /nomgen="IGHGP" /map="14q32.33" 423 473 pseudo-Ig gamma, exon 2 592 > 641 pseudo-Ig gamma, exon 3 pre-msg < 1 > 641 pseudo-Ig gamma mRNA and introns IVS 36 422 pseudo-gamma intron A IVS 474 591 pseudo-gamma intron B conflict 495 496 cc in [2]; c in [1] conflict 535 536 ga in [2]; cg in [1] BASE COUNT 139 A 239 C 160 G 103 T ORIGIN Unreported. 1 CAAGCCCAGC AACACCAAGG TGGACAAGAC AGTTGGTGAG AGGCCAGCAC AGGGAGGGAG 61 GGTGTCTGCT GGAAGCCAGC TCAGCCCTCT TGCCTGGACG TACCCCGGCT GTGCAGCCCC 121 AGTCCAGGGC AGCAAGGCAG GCCCCATCTG TCTCCTCACC CGGAGGCCTC TGCCCGCCCC 181 ACTCATGCTC AGGGAGAGGG TCTTCTGGCT TTTTCCACCA GGCTCCAGGC AGGCACAGGC 241 TGGAAGCCCC TACCCCAGGC CCTGCGCACA AAGGGGCAGG TGCTGCACTT AGACTGGCCA 301 GAGCCATATC CGGGAAGACC CTGCCCCTGA CCTAAGCCCA CCCCAAAGGC CAAGATCTCC 361 ACTCCCTCAG CTCAGACACC TCTCCTCCCA GATCTGAGTA ACTCCCAATC TTCTCTCTGC 421 AGAGCCCAAA ACCCCATGTT GTGACACAAC TCACACATGC CCACCATGTG CAAGTAAGCC 481 AGCCCAGGCC TCGCCCTCCA GCTCAAGGCG GGACAGGTGC CCTAGAGTAG CCTGGATCCA 541 GGGACAGGCC CCAACCGGGT GCTGACACGT CCGCCTCCAT CTCTTCCTCA GCAACTGAAC 601 CCCTGGGGGG ACCGTCAGTC TTCCTCTTCC CCCCAAAACC A // LOCUS HUMIGCH1 495 bp ds-DNA PRI 20-MAY-1987 DEFINITION Human Ig germline delta H-chain C-region gene, C-delta-1 domain (CLL lymphocyte). ACCESSION K02875 KEYWORDS constant region; delta-immunoglobulin; germline; immunoglobulin; immunoglobulin heavy chain. SEGMENT 1 of 8 SOURCE Human (chronic lymphocytic leukemia (CLL) patient) lymphocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 495) AUTHORS White,M.B., Shen,A.L., Word,C.J., Tucker,P.W. and Blattner,F.R. TITLE Human immunoglobulin D: Genomic sequence of the delta heavy chain JOURNAL Science 228, 733-737 (1985) STANDARD full staff_review COMMENT Sequence in computer readable form and draft entry for [1] were kindly provided by M.B.White, 06-AUG-1985. The C-delta and delta-s exon boundaries were located by comparing the translated sequences with known AA sequences [1]. FEATURES from to/span description pept / 101 + 403 Ig delta-chain secreted form, exon 1 of C-region (C-delta-1 domain; AA at 103) /nomgen="IGHD" /map="14q32.3" /hgml_locus_uid="LX0020F" pep$ / 101 + 403 Ig delta-chain membrane-bound form, exon 1 of C-region (C-delta-1 domain; AA at 103) /nomgen="IGHD" /map="14q32.3" /hgml_locus_uid="LX0020F" IVS < 1 100 IgD-s J-C intron IVS < 1 100 IgD-Mb J-C intron IVS 404 > 495 IgD-s intron A IVS 404 > 495 IgD-Mb intron A BASE COUNT 114 A 179 C 120 G 82 T ORIGIN 182 bp upstream of SphI site; chromosome 14q32.3. 1 TTTCCCTGCC TCCCGTCACC CTGCCGCCAG GGCCTCTGCC CTGCCCTGCC CCTTGTCCTC 61 AGGTTTCCAG CCTCAGACTC CCACTGTGTC TGTCTTCCAG CACCCACCAA GGCTCCGGAT 121 GTGTTCCCCA TCATATCAGG GTGCAGACAC CCAAAGGATA ACAGCCCTGT GGTCCTGGCA 181 TGCTTGATAA CTGGGTACCA CCCAACGTCC GTGACTGTCA CCTGGTACAT GGGGACACAG 241 AGCCAGCCCC AGAGAACCTT CCCTGAGATA CAAAGACGGG ACAGCTACTA CATGACAAGC 301 AGCCAGCTCT CCACCCCCCT CCAGCAGTGG CGCCAAGGCG AGTACAAATG CGTGGTCCAG 361 CACACCGCCA GCAAGAGTAA GAAGGAGATC TTCCGCTGGC CAGGTAGGTC GCACCGGAGA 421 TCACCCAGAA GGGCCCCCCA GGACCCCCAG CACCTTCCAC TCAGGGCCTG ACCACAAAGA 481 CAGAAGCAAG GGCTG // LOCUS HUMIGCH2 300 bp ds-DNA PRI 20-MAY-1987 DEFINITION Human Ig germline delta H-chain C-region gene, first hinge domain (CLL lymphocyte). ACCESSION K02876 KEYWORDS constant region; delta-immunoglobulin; germline; hinge exon; immunoglobulin; immunoglobulin heavy chain. SEGMENT 2 of 8 SOURCE Human (chronic lymphocytic leukemia (CLL) patient) lymphocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 300) AUTHORS White,M.B., Shen,A.L., Word,C.J., Tucker,P.W. and Blattner,F.R. TITLE Human immunoglobulin D: Genomic sequence of the delta heavy chain JOURNAL Science 228, 733-737 (1985) STANDARD full staff_review COMMENT See segment 1. FEATURES from to/span description pept + 152 + 202 Ig delta-chain secreted form, exon 2 of C-region (C-delta-hinge-1 domain) /nomgen="IGHD" /map="14q32.3" /hgml_locus_uid="LX0020F" pep$ + 152 + 202 Ig delta-chain membrane-bound form, exon 2 of C-region (C-delta-hinge-1 domain) /nomgen="IGHD" /map="14q32.3" /hgml_locus_uid="LX0020F" IVS < 1 151 IgD-Mb intron A IVS < 1 151 IgD-s intron A IVS 203 > 300 IgD-Mb intron B IVS 203 > 300 IgD-s intron B BASE COUNT 59 A 133 C 52 G 56 T ORIGIN About 300 bp after segment 1; 118 bp upstream of StuI site. 1 TAGGCTGCCT GTGCCCCCCA CCTGCCTGTC CACAACCCAG CCTCTGGTAC ATCCATGCCC 61 TCTGCCCTAA GCCTCACCTG CACTTTTCCT TGGATTTCAG AGTCTCCAAA GGCACAGGCC 121 TCCTCCGTGC CCACTGCACA ACCCCAAGCA GAGGGCAGCC TCGCCAAGGC AACCACAGCC 181 CCAGCCACCA CCCGTAACAC AGGTGAGAAG CCCCTTCCCT GCACACTCCA CCCCCACCCA 241 CCTGCTCATT CCTCAGCCGC CTCCTCCAGG CAGCCCTTCA TAACTCCTTG TCTGAGTCTC // LOCUS HUMIGCH3 300 bp ds-DNA PRI 20-MAY-1987 DEFINITION Human Ig germline delta H-chain C-region gene, second hinge domain (CLL lymphocyte). ACCESSION K02877 KEYWORDS constant region; delta-immunoglobulin; germline; hinge exon; immunoglobulin; immunoglobulin heavy chain. SEGMENT 3 of 8 SOURCE Human (chronic lymphocytic leukemia (CLL) patient) lymphocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 300) AUTHORS White,M.B., Shen,A.L., Word,C.J., Tucker,P.W. and Blattner,F.R. TITLE Human immunoglobulin D: Genomic sequence of the delta heavy chain JOURNAL Science 228, 733-737 (1985) STANDARD full staff_review COMMENT See segment 1. FEATURES from to/span description pept + 101 + 172 Ig delta-chain secreted form, exon 3 of C-region (C-delta-hinge-2 domain) /nomgen="IGHD" /map="14q32.3" /hgml_locus_uid="LX0020F" pep$ + 101 + 172 Ig delta-chain membrane-bound form, exon 3 of C-region (C-delta-hinge-2 domain) /nomgen="IGHD" /map="14q32.3" /hgml_locus_uid="LX0020F" IVS < 1 100 IgD-Mb intron B IVS < 1 100 IgD-s intron B IVS 173 > 300 IgD-Mb intron C IVS 173 > 300 IgD-s intron C BASE COUNT 102 A 52 C 70 G 76 T ORIGIN About 1.85 kb after segment 2. 1 GTCATTAGCT GGATTTAGCC ATTCCACAAT GTACACATAT TTCAAACATT GTGTTGTATA 61 TGATAAACAT GTATAATTTT TGTCAATTAA AAATTTTTAG GAAGAGGAGG AGAAGAGAAG 121 AAGAAGGAGA AGGAGAAAGA GGAACAAGAA GAGAGAGAGA CAAAGACACC AGGTTTTTTC 181 TGACCCCTGG GCTATCAAAA CACCTATTGC CCAATAACTA GTTGGCCGTT GGTGCCCTAA 241 ACTATTGAAG CGATTGCTGT TATGTGGATG GGCCCCGGAC ACTTAGAAAC TCGTGACCCC // LOCUS HUMIGCH4 500 bp ds-DNA PRI 20-MAY-1987 DEFINITION Human Ig germline delta H-chain C-region gene, C-delta-2 domain (CLL lymphocyte). ACCESSION K02878 KEYWORDS constant region; delta-immunoglobulin; germline; immunoglobulin; immunoglobulin heavy chain. SEGMENT 4 of 8 SOURCE Human (chronic lymphocytic leukemia (CLL) patient) lymphocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 500) AUTHORS White,M.B., Shen,A.L., Word,C.J., Tucker,P.W. and Blattner,F.R. TITLE Human immunoglobulin D: Genomic sequence of the delta heavy chain JOURNAL Science 228, 733-737 (1985) STANDARD full staff_review COMMENT See segment 1. FEATURES from to/span description pept + 101 + 424 Ig delta-chain secreted form, exon 4 of C-region (C-delta-2 domain) /nomgen="IGHD" /map="14q32.3" /hgml_locus_uid="LX0020F" pep$ + 101 + 424 Ig delta-chain membrane-bound form, exon 4 of C-region (C-delta-2 domain) /nomgen="IGHD" /map="14q32.3" /hgml_locus_uid="LX0020F" IVS < 1 100 IgD-Mb intron C IVS < 1 100 IgD-s intron C IVS 425 > 500 IgD-Mb intron D IVS 425 > 500 IgD-s intron D BASE COUNT 93 A 171 C 157 G 79 T ORIGIN About 450 bp after segment 3; 131 bp upstream of AccI site. 1 GAAGCTGGGG AGAGGAGAGC ACAGTGGTTA AGTCAGTCCC TGCAGCCCAA CTGCTCCCGA 61 AGGTCCGGCC ACAGCTGCTC TCGTTTGCTC TCCCCTGCAG AGTGTCCGAG CCACACCCAG 121 CCTCTTGGCG TCTACCTGCT AACCCCTGCA GTGCAGGACC TGTGGCTCCG GGACAAAGCC 181 ACCTTCACCT GCTTCGTGGT GGGCAGTGAC CTGAAGGATG CTCACCTGAC CTGGGAGGTG 241 GCTGGGAAGG TCCCCACAGG GGGCGTGGAG GAAGGGCTGC TGGAGCGGCA CAGCAACGGC 301 TCCCAGAGCC AGCACAGCCG TCTGACCCTG CCCAGGTCCT TGTGGAACGC GGGGACCTCC 361 GTCACCTGCA CACTGAACCA TCCCAGCCTC CCACCCCAGA GGTTGATGGC GCTGAGAGAA 421 CCCGGTGAGC CTGGCTCCCA GGTGGGGAGA CGAGGGTGCC CACAGCCTGC TGACCCCTAC 481 GCCCGCCCCA GGGCCATGAC // LOCUS HUMIGCH5 500 bp ds-DNA PRI 20-MAY-1987 DEFINITION Human Ig germline delta H-chain C-region gene, C-delta-3 domain (CLL lymphocyte). ACCESSION K02879 KEYWORDS constant region; delta-immunoglobulin; germline; immunoglobulin; immunoglobulin heavy chain. SEGMENT 5 of 8 SOURCE Human (chronic lymphocytic leukemia (CLL) patient) lymphocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 500) AUTHORS White,M.B., Shen,A.L., Word,C.J., Tucker,P.W. and Blattner,F.R. TITLE Human immunoglobulin D: Genomic sequence of the delta heavy chain JOURNAL Science 228, 733-737 (1985) STANDARD full staff_review COMMENT See segment 1. FEATURES from to/span description pept + 101 + 424 Ig delta-chain secreted form, exon 5 of C-region (C-delta-3 domain) /nomgen="IGHD" /map="14q32.3" /hgml_locus_uid="LX0020F" pep$ + 101 + 424 Ig delta-chain membrane-bound form, exon 5 of C-region (C-delta-3 domain) /nomgen="IGHD" /map="14q32.3" /hgml_locus_uid="LX0020F" IVS < 1 100 IgD-Mb intron D IVS < 1 100 IgD-s intron D IVS 425 > 500 IgD-Mb intron E IVS 425 > 500 IgD-s intron E BASE COUNT 85 A 188 C 145 G 82 T ORIGIN About 150 bp after segment 4; 118 bp upstream of HindIII site. 1 CCACAGGAAA GGAGAAGGGA GGCACCACAC CCTGGCCGGC CCCACTTCTC TCCCAGTGCC 61 CCCGTGGCCA GAGCCTGACA GCCCCCCCAC CTCCCCGCAG CTGCGCAGGC ACCCGTCAAG 121 CTTTCTCTGA ACCTGCTGGC CTCGTCTGAC CCTCCCGAGG CGGCCTCGTG GCTCCTGTGT 181 GAGGTGTCTG GCTTCTCGCC CCCCAACATC CTCCTGATGT GGCTGGAGGA CCAGCGTGAG 241 GTGAACACTT CTGGGTTTGC CCCCGCACGC CCCCCTCCAC AGCCCAGGAG CACCACGTTC 301 TGGGCCTGGA GTGTGCTGCG TGTCCCAGCC CCGCCCAGCC CTCAGCCAGC CACCTACACG 361 TGTGTGGTCA GCCACGAGGA CTCCCGGACT CTGCTCAACG CCAGCCGGAG CCTAGAAGTC 421 AGCTGTGAGT CACCCCCAGG CCAGGGTTGG GACGGGGACT CTGAGGGGGG CCATAAGGAG 481 CTGGAATCCA TACTAGGCAG // LOCUS HUMIGCH6 100 bp ds-DNA PRI 20-MAY-1987 DEFINITION Human Ig germline delta H-chain C-region gene, secreted terminus (CLL lymphocyte). ACCESSION K02880 KEYWORDS constant region; delta-immunoglobulin; germline; immunoglobulin; immunoglobulin heavy chain. SEGMENT 6 of 8 SOURCE Human (chronic lymphocytic leukemia (CLL) patient) lymphocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 100) AUTHORS White,M.B., Shen,A.L., Word,C.J., Tucker,P.W. and Blattner,F.R. TITLE Human immunoglobulin D: Genomic sequence of the delta heavy chain JOURNAL Science 228, 733-737 (1985) STANDARD full staff_review COMMENT See segment 1. FEATURES from to/span description pept + 25 53 Ig delta-chain secreted form, exon 6 of C-region (C-delta secreted terminus domain) /nomgen="IGHD" /map="14q32.3" /hgml_locus_uid="LX0020F" IVS < 1 24 IgD-s intron E IVS < 1 > 100 IgD-Mb intron E BASE COUNT 24 A 33 C 22 G 21 T ORIGIN About 1.8 kb after segment 5. 1 GACACGCCGA TTTTTTGTTA TTAGATGTAA CAGACCATGG CCCCATGAAA TGATCCCGGA 61 CCAGATCCGT CCGCACCCGC CACTCAGCAG CTCTGGCCGA // LOCUS HUMIGCH7 200 bp ds-DNA PRI 20-MAY-1987 DEFINITION Human Ig germline H-chain delta C-region gene, first domain of membrane terminus (CLL lymphocyte). ACCESSION K02881 KEYWORDS constant region; delta-immunoglobulin; germline; immunoglobulin; immunoglobulin heavy chain. SEGMENT 7 of 8 SOURCE Human (chronic lymphocytic leukemia (CLL) patient) lymphocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 200) AUTHORS White,M.B., Shen,A.L., Word,C.J., Tucker,P.W. and Blattner,F.R. TITLE Human immunoglobulin D: Genomic sequence of the delta heavy chain JOURNAL Science 228, 733-737 (1985) STANDARD full staff_review COMMENT See segment 1. FEATURES from to/span description pep$ + 25 + 182 Ig delta-chain membrane-bound form, exon 6 of C-region (first domain of membrane terminus) /nomgen="IGHD" /map="14q32.3" /hgml_locus_uid="LX0020F" IVS < 1 24 IgD-Mb intron E IVS 183 > 200 IgD-Mb intron F BASE COUNT 37 A 72 C 49 G 42 T ORIGIN About 800 bp after segment 6. 1 CGCTCGGCCC CCGTTCCTCC CCAGACCTGG CCATGACCCC CCTGATCCCT CAGAGCAAGG 61 ATGAGAACAG CGATGACTAC ACGACCTTTG ATGATGTGGG CAGCCTGTGG ACCACCCTGT 121 CCACGTTTGT GGCCCTCTTC ATCCTCACCC TCCTCTACAG CGGCATTGTC ACTTTCATCA 181 AGGTCAGGGG AGCGGCCAGG // LOCUS HUMIGCH8 100 bp ds-DNA PRI 20-MAY-1987 DEFINITION Human Ig germline H-chain delta C-region gene, second domain of membrane terminus (CLL lymphocyte). ACCESSION K02882 KEYWORDS constant region; delta-immunoglobulin; germline; immunoglobulin; immunoglobulin heavy chain. SEGMENT 8 of 8 SOURCE Human (chronic lymphocytic leukemia (CLL) patient) lymphocyte DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 100) AUTHORS White,M.B., Shen,A.L., Word,C.J., Tucker,P.W. and Blattner,F.R. TITLE Human immunoglobulin D: Genomic sequence of the delta heavy chain JOURNAL Science 228, 733-737 (1985) STANDARD full staff_review COMMENT See segment 1. FEATURES from to/span description pep$ + 44 52 Ig delta-chain membrane-bound form, exon 7 of C-region (second domain of membrane terminus) /nomgen=