LOCUS HUM14INV5 195 bp ds-DNA PRI 15-MAR-1989 DEFINITION Human T-cell receptor V-xs-alpha-J-7-alpha gene centromeric breakpoint, partial cds. ACCESSION M22225 KEYWORDS T-cell receptor; centromeric breakpoint. SEGMENT 5 of 5 SOURCE Human T cell lymphoma (cell library SUP-T1) DNA, clone lambda-XS9. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 195) AUTHORS Baer,R., Forster,A. and Rabbitts,T.H. TITLE The mechanism of chromosome 14 inversion in a human T cell lymphoma JOURNAL Cell 50, 97-105 (1987) STANDARD simple staff_entry FEATURES from to/span description pept < 1 46 Ig V-R-H region protein, exon x /nomgen="TCRA" /map="14q11.2" /hgml_locus_uid="LX0123X" 141 151 Ig V-R-H region protein, exon x+1 pre-msg < 1 > 195 Ig V-R-H region mRNA and intron IVS 47 140 Ig H-chain V-region intron A BASE COUNT 42 A 29 C 70 G 54 T ORIGIN About 3.2 kb downstream of segment 1; chromosome 14q32. 1 ATGGAGCTTG GGCTGAGCTG GGTTTTCACT GTTGCTGTTT TAAAAGGTGA ACTAGAGAGA 61 TTGAGCGTGA GTGGATACCC TTGAGAGAAA TGGTGGATTA TGTCTGGGAG TTTCTGACCG 121 GGATGTCTAT GAGTTTGCAG GTGTCCAGTG AGAGGTACAG CTCGTGGAGT CCGGAGAGGA 181 ACTCCTGGGG GATCC // LOCUS HUM4F2HG1 2304 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human 4F2 glycosylated heavy chain (4F2HC) antigen gene, exon 1 and 2. ACCESSION M21898 KEYWORDS 4F2 glycosylated heavy chain; 80- to 90-kilodalton glycosylated heavy chain antigen; antigen; cell surface protein; surface antigen; type II membrane glycoprotein. SEGMENT 1 of 7 SOURCE Human HPB-MLT T-cell tumor line DNA, clone 4F2G1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2304) AUTHORS Gottesdiener,K.M., Karpinski,B.A., Lindsten,T., Strominger,J.L., Jones,N.H., Thompson,C.B. and Leiden,J.M. TITLE Isolation and structural characterization of the human 4F2 heavy-chain gene, an inducible gene involved in T-lymphocyte activation JOURNAL Mol. Cell. Biol. 8, 3809-3819 (1988) STANDARD simple staff_review FEATURES from to/span description pept 1222 1645 4F2 heavy chain antigen, exon 1 /nomgen="H4F2" /map="1q21" /hgml_locus_uid="LZ0035F" 2091 + 2264 4F2 heavy chain antigen, exon 2 pre-msg 1111 > 2304 4F2HC mRNA and introns pre-msg 1121 > 2304 4F2HC mRNA and introns (alt.) pre-msg 1122 > 2304 4F2HC mRNA and introns (alt.) IVS 1646 2090 4F2HC intron A IVS 2265 > 2304 4F2HC intron B site 793 799 SP1 binding site (put.) site 996 1001 SP1 binding site (put.) site 1035 1041 SP1 binding site (put.) site 1045 1054 SP1 binding site (put.) rpt 1705 1725 inverted repeat 1 rpt 2030 2047 inverted repeat 1 BASE COUNT 489 A 618 C 724 G 473 T ORIGIN 76 bp upstream of EcoRI site. 1 ACGGTGTTGA TGTCCGGTAG TTCCCTGCTG TGAACCCTGT CTCTGCCCCA CTTCAATCCA 61 AGGACCCTGT GGAGGGAATT CTAGGCTTCT TCAAAGCCTC TGCAGTACCA GCTGCCTCCA 121 AGGCCTAACA TAGTGAAGTG ATAGGGAGAC AGGACCTTGG GGGAGTTTGG GGATAACTGT 181 GGGAGTTCCT GCAAGGTGGG TAAGACTCTG GCCATGGCAT CTTCTGGTCT ATTCCACAAA 241 AAAGGTGGAT GCCTCTGAGA CCTAGCTCTG TTATGTTCAT TACAGCCCTA TCAGTCTCTT 301 CTGGATGTCT GATTCCTAGC TGCCAAACAT CCAGGTGCTA GCAGCTGGGG GTGGGGTAGG 361 ATAAGAAGGG GGGTGTCTAC CTCAACCTTC CTGCCCAAAC CATTCATTTT AAAGTACTGT 421 GAAGAAGGCT GGATTAGTGT TAAATTCAAT AAAGTTAACA AAGCAAAGTA TTGTAGTGAG 481 TACACAATAA ATAGTTGCTG CATGAGGAAT TTCTTGAAAT GAATTCAAGA TGAGTCTTGA 541 AAAACGAGTG GGAGTAAGCC GTGCACGGAA GCAGAGGTCT AGCAGAGGTC TAGCGCCAAG 601 TCCCAGAGGC CTGAGAGGGC AAGCCTTGGT GGGGGAAAAG GCAGAAGTTG TGACTGGCTG 661 CCTCTGAGAG TGCAAGTAGC CATGCAAAGT TGGGGAAAAC CAGACGGCGG CCGGAGCCGT 721 GAACCAGGGG CAAATGGAGG ATGGTGTTGG GGTAGTCACT AGCCTGCCGG AATGGAAGAA 781 GAGGGGCGTT TGGGGGCGGT AAGGCCTAAA GGACACTGGA GGGGCCTCGG GATTTGGGGC 841 AGGATCACAG GCCTAGTCCA CTTAAAGGTG GCTCCGTGCC AGGCCTTCCC AGAATCAGGT 901 TGAAAAGAAC TTTAGGGTTA GACAGGTTGG AGATGAGAAA TTGGACCATG CAGGAAGTAC 961 TGTCGGAGGC GTGTTCCTGA CTTTACTACC CTGAGCCGCC CACGGCTGGG GCAGTCCCCG 1021 AGGTTCCACC TTAAGGGGCG GGCCGGGGCG GGGCTCCGCT GCCCCTTCCC AGAGGCCGCG 1081 CCTGCTGCTG AGCAGATGCA GTAGCCGAAA CTGCGCGGAG GCACAGAGGC CGGGGAGAGC 1141 GTTCTGGGTC CGAGGGTCCA GGTAGGGGTT GAGCCACCAT CTGACCGCAA GCTGCGTCGT 1201 GTCGCCGGTT CTGCAGGCAC CATGAGCCAG GACACCGAGG TGGATATGAA GGAGGTGGAG 1261 CTGAATGAGT TAGAGCCCGA GAAGCAGCCG ATGAACGCGG CGTCTGGGGC GGCCATGTCC 1321 CTGGCGGGAG CCGAGAAGAA TGGTCTGGTG AAGATCAAGG TGGCGGAAGA CGAGGCGGAG 1381 GCGGCAGCCG CGGCTAAGTT CACGGGCCTG TCCAAGGAGG AGCTGCTGAA GGTGGCAGGC 1441 AGCCCCGGCT GGGTACGCAC CCGCTGGGCA CTGCTGCTGC TCTTCTGGCT CGGCTGGCTC 1501 GGCATGCTCG CTGGTGCCGT GGTCATAATC GTGCGAGCGC CGCGTTGTCG CGAGCTACCG 1561 GCGCAGAAGT GGTGGCACAC GGGCGCCCTC TACCGCATCG GCGACCTCCA GGCCTTCCAG 1621 GGCCACGGCG CGGGCAACCT GGCGGGTGAG TGCAGCGCGC CCCCGTCCCG GGTACCTCCG 1681 GTTGAATCTG GTGGCTTGCA CCGACCCCCT CCCCTGTCCC CAGACGGATC TAGATGGTTC 1741 TTCCCTCCAT CCCGTACCGA CGACTGTCCC CCCTTCCCCC ACCCCCTCCC CGGCACATTG 1801 TCCTTCCCTC CTTTCTTTGA AGAAAGCCGA CCCGCCCCTC ACTCCGTCAC GAGGGTGGGT 1861 GACTCAGCGT CCTCCTTCCC CGCGGCGCCA GAAGCCAGTT GCAACCGGTT TCTGAAGTAA 1921 TGTGCAGGAC TCCTTACATC AGCTCCTCTG AGTCTCGTGA TTCAGCCTTG CCTCCCTCTC 1981 TCCCCCTTTG CCCCCTCCCC GTCCCACCCT TAGGCGCTGG GAGAAGGGAG GGTGGGGAGG 2041 TCAGGGGCCT CTCAGAGGGG CCTCACTTGT TAACCCAGCC CCCATTTCAG GTCTGAAGGG 2101 GCGTCTCGAT TACCTGAGCT CTCTGAAGGT GAAGGGCCTT GTGCTGGGTC CAATTCACAA 2161 GAACCAGAAG GATGATGTCG CTCAGACTGA CTTGCTGCAG ATCGACCCCA ATTTTGGCTC 2221 CAAGGAAGAT TTTGACAGTC TCTTGCAATC GGCTAAAAAA AAGAGTGGGT ATCCTGGGGT 2281 TCCAAGGAAA CAGCTAGAAA GGAC // LOCUS HUM4F2HG2 142 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human 4F2 glycosylated heavy chain (4F2HC) antigen gene, exon 3. ACCESSION M21899 KEYWORDS 4F2 glycosylated heavy chain antigen; 80- to 90-kilodalton glycosylated heavy chain antigen; antigen; cell surface protein; surface antigen; type II membrane glycoprotein. SEGMENT 2 of 7 SOURCE Human HPB-MLT T-cell tumor line DNA, clone 4F2G1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 142) AUTHORS Gottesdiener,K.M., Karpinski,B.A., Lindsten,T., Strominger,J.L., Jones,N.H., Thompson,C.B. and Leiden,J.M. TITLE Isolation and structural characterization of the human 4F2 heavy-chain gene, an inducible gene involved in T-lymphocyte activation JOURNAL Mol. Cell. Biol. 8, 3809-3819 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 11 + 102 4F2 heavy chain antigen, exon 3 /nomgen="H4F2" /map="1q21" /hgml_locus_uid="LZ0035F" pre-msg < 1 > 142 4F2HC mRNA and introns IVS < 1 10 4F2HC intron B IVS 103 > 142 4F2HC intron C BASE COUNT 27 A 30 C 45 G 40 T ORIGIN About 750 bp after segment 1. 1 GTTGTTTTAG GCATCCGTGT CATTCTGGAC CTTACTCCCA ACTACCGGGG TGAGAACTCG 61 TGGTTCTTCA CTCAGGTTGA CACTGTGGCC ACCAAGGTGA AGGTGAGTGT TGGAGCTGAT 121 GGCTGGTGGA AGTCAGATGC TG // LOCUS HUM4F2HG3 307 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human 4F2 glycosylated heavy chain (4F2HC) antigen gene, exon 4 and 5. ACCESSION M21900 KEYWORDS 4F2 glycosylated heavy chain antigen; 80- to 90-kilodalton glycosylated heavy chain antigen; antigen; cell surface protein; surface antigen; type II membrane glycoprotein. SEGMENT 3 of 7 SOURCE Human HPB-MLT T-cell tumor line DNA, clone 4F2G1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 307) AUTHORS Gottesdiener,K.M., Karpinski,B.A., Lindsten,T., Strominger,J.L., Jones,N.H., Thompson,C.B. and Leiden,J.M. TITLE Isolation and structural characterization of the human 4F2 heavy-chain gene, an inducible gene involved in T-lymphocyte activation JOURNAL Mol. Cell. Biol. 8, 3809-3819 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 31 99 4F2 heavy chain antigen, exon 4 /nomgen="H4F2" /map="1q21" /hgml_locus_uid="LZ0035F" 209 + 267 4F2 heavy chain antigen, exon 5 pre-msg < 1 > 307 4F2HC mRNA and introns IVS < 1 30 4F2HC intron B IVS 100 208 4F2HC intron C IVS 268 > 307 4F2HC intron D BASE COUNT 64 A 69 C 83 G 91 T ORIGIN About 1.3 kb after segment 2. 1 GTCTGTCCTT TATTCTTCTG CCCCCTATAG GATGCTCTGG AGTTTTGGCT GCAAGCTGGC 61 GTGGATGGGT TCCAGGTTCG GGACATAGAG AATCTGAAGG TGAGTTCCCT TTCCACATTA 121 GGGACAAAGC CTTGGGCGAG AACAGAAGGG ACTTCAGCTA GAGCCTCGTA ATATTCTCTG 181 GTCCTTGATT CTGCTTTTTT CTTTCTAGGA TGCATCCTCA TTCTTGGCTG AGTGGCAAAA 241 TATCACCAAG GGCTTCAGTG AAGACAGGTG GGTGCAGGAG CCATTCTGCT GACTCAGCTC 301 CAATGTG // LOCUS HUM4F2HG4 207 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human 4F2 glycosylated heavy chain (4F2HC) antigen gene, exon 6. ACCESSION M21901 KEYWORDS 4F2 glycosylated heavy chain antigen; 80- to 90-kilodalton glycosylated heavy chain antigen; antigen; cell surface protein; surface antigen; type II membrane glycoprotein. SEGMENT 4 of 7 SOURCE Human HPB-MLT T-cell tumor line DNA, clone 4F2G1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 207) AUTHORS Gottesdiener,K.M., Karpinski,B.A., Lindsten,T., Strominger,J.L., Jones,N.H., Thompson,C.B. and Leiden,J.M. TITLE Isolation and structural characterization of the human 4F2 heavy-chain gene, an inducible gene involved in T-lymphocyte activation JOURNAL Mol. Cell. Biol. 8, 3809-3819 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 7 + 187 4F2 heavy chain antigen, exon 6 /nomgen="H4F2" /map="1q21" /hgml_locus_uid="LZ0035F" pre-msg < 1 > 207 4F2HC mRNA and introns IVS < 1 6 4F2HC intron D IVS 188 > 207 4F2HC intron E BASE COUNT 47 A 55 C 51 G 54 T ORIGIN About 400 bp after segment 3. 1 CTGCAGGCTC TTGATTGCCG GGACTAACTC CTCCGACCTT CAGCAGATCC TGAGCCTACT 61 CGAATCCAAC AAAGACTTGC TGTTGACTAG CTCATACCTG TCTGATTCTG GTTCTACTGG 121 GGAGCATACA AAATCCCTAG TCACACAGTA TTTGAATGCC ACTGGCAATC GCTGGTGCAG 181 CTGGAGTGTG AGTACCATGC TGGTGGG // LOCUS HUM4F2HG5 204 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human 4F2 glycosylated heavy chain (4F2HC) antigen gene, exon 7. ACCESSION M21902 KEYWORDS 4F2 glycosylated heavy chain antigen; 80- to 90-kilodalton glycosylated heavy chain antigen; antigen; cell surface protein; surface antigen; type II membrane glycoprotein. SEGMENT 5 of 7 SOURCE Human HPB-MLT T-cell tumor line DNA, clone 4F2G1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 204) AUTHORS Gottesdiener,K.M., Karpinski,B.A., Lindsten,T., Strominger,J.L., Jones,N.H., Thompson,C.B. and Leiden,J.M. TITLE Isolation and structural characterization of the human 4F2 heavy-chain gene, an inducible gene involved in T-lymphocyte activation JOURNAL Mol. Cell. Biol. 8, 3809-3819 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 31 + 174 4F2 heavy chain antigen, exon 7 /nomgen="H4F2" /map="1q21" /hgml_locus_uid="LZ0035F" pre-msg < 1 > 204 4F2HC mRNA and introns IVS < 1 30 4F2HC intron E IVS 175 > 204 4F2HC intron F BASE COUNT 29 A 64 C 52 G 59 T ORIGIN About 50 bp after segment 4. 1 TGGGGCTCAC TGGAGTGTCT CTCCCTGTAG TTGTCTCAGG CAAGGCTCCT GACTTCCTTC 61 TTGCCGGCTC AACTTCTCCG ACTCTACCAG CTGATGCTCT TCACCCTCCC AGGGACCCCT 121 GTTTTCAGCT ACGGGGATGA GATTGGCCTG GATGCAGCTG CCCTTCCTGG ACAGGTACTG 181 CTTGCTGTCT TTCTGTCACA GGGA // LOCUS HUM4F2HG6 111 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human 4F2 glycosylated heavy chain (4F2HC) antigen gene, exon 8. ACCESSION M21903 KEYWORDS 4F2 glycosylated heavy chain antigen; 80- to 90-kilodalton glycosylated heavy chain antigen; antigen; cell surface protein; surface antigen; type II membrane glycoprotein. SEGMENT 6 of 7 SOURCE Human HPB-MLT T-cell tumor line DNA, clone 4F2G1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 111) AUTHORS Gottesdiener,K.M., Karpinski,B.A., Lindsten,T., Strominger,J.L., Jones,N.H., Thompson,C.B. and Leiden,J.M. TITLE Isolation and structural characterization of the human 4F2 heavy-chain gene, an inducible gene involved in T-lymphocyte activation JOURNAL Mol. Cell. Biol. 8, 3809-3819 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 8 + 91 4F2 heavy chain antigen, exon 8 /nomgen="H4F2" /map="1q21" /hgml_locus_uid="LZ0035F" pre-msg < 1 > 111 4F2HC mRNA and introns IVS < 1 7 4F2HC intron F IVS 92 > 111 4F2HC intron G BASE COUNT 25 A 24 C 33 G 29 T ORIGIN About 2.3 kp after segment 5. 1 TTTTCAGCCT ATGGAGGCTC CAGTCATGCT GTGGGATGAG TCCAGCTTCC CTGACATCCC 61 AGGGGCTGTA AGTGCCAACA TGACTGTGAA GGTAAGAGTT CTAGATGGGT A // LOCUS HUM4F2HG7 577 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human 4F2 glycosylated heavy chain (4F2HC) antigen gene, exon 9. ACCESSION M21904 KEYWORDS 4F2 glycosylated heavy chain antigen; 80- to 90-kilodalton glycosylated heavy chain; antigen; cell surface protein; surface antigen; type II membrane glycoprotein. SEGMENT 7 of 7 SOURCE Human HPB-MLT T-cell tumor line DNA, clone 4F2G1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 577) AUTHORS Gottesdiener,K.M., Karpinski,B.A., Lindsten,T., Strominger,J.L., Jones,N.H., Thompson,C.B. and Leiden,J.M. TITLE Isolation and structural characterization of the human 4F2 heavy-chain gene, an inducible gene involved in T-lymphocyte activation JOURNAL Mol. Cell. Biol. 8, 3809-3819 (1988) STANDARD simple staff_review FEATURES from to/span description pept + 21 383 4F2 heavy chain antigen, exon 9 /nomgen="H4F2" /map="1q21" /hgml_locus_uid="LZ0035F" pre-msg < 1 547 4F2HC mRNA and introns IVS < 1 20 4F2HC intron G BASE COUNT 106 A 184 C 138 G 149 T ORIGIN About 60 bp after segment 6. 1 TATGTTTTTA TCATTCTTAG GGCCAGAGTG AAGACCCTGG CTCCCTCCTT TCCTTGTTCC 61 GGCGGCTGAG TGACCAGCGG AGTAAGGAGC GCTCCCTACT GCATGGGGAC TTCCACGCGT 121 TCTCCGCTGG GCCTGAGCTC TTCTCCTATA TCCGCCACTG GGACCAGAAT GAGCGTTTTC 181 TGGTAGTGCT TAACTTTGGG GATGTGGGCC TCTCGGCTGG ACTGCAGGCC TCCGACCTGC 241 CTGCCAGCGC CAGCCTCCCA GCCAAGGCTG ACCTCCTGCT CAGCACCCAG CCAGGCCGTG 301 AGGAGGGCTC CCCTCTTGAG CTGGAACGCC TGAAACTGGA GCCTCACGAA GGGCTGCTGC 361 TCCGCTTCCC CTACGCGGCC TGACTCCAGC CTGACATGGA CCCACTACCC TTCTCCTTTC 421 CTTCCCAGGC CCTTTGGTTC TGATTTTTCT CTTTTTTAAA AACAAACAAA CAAACTGTTG 481 CAGATTATGA GTGAACCCCC AAATAGGGTG TTTTCTGCCT TCAAATAAAA GTCACCCCTG 541 CATGGTGAAG TCTTCCCTCT GCTTCTCTCA TAGCAGG // LOCUS HUMC1A1 7616 bp ds-DNA PRI 15-MAR-1989 DEFINITION Human, alpha 1 collagen type I gene, exons 1-25. ACCESSION M20789 KEYWORDS collagen. SOURCE Human DNA, clone RMS-8. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 7616) AUTHORS D'Alessio,M., Bernard,M., Pretorius,P.J., de Wet,W. and Ramirez,F. TITLE Complete nucleotide sequence of the region encompassing the first twenty-five exons of the human pro-alpha1(I) collagen gene (COL1A1) JOURNAL Gene 67, 105-115 (1988) STANDARD full staff_entry REFERENCE 2 (bases 1 to 7616; revises [1]) AUTHORS D'Alessio,M. JOURNAL Unpublished (1988) SUNY, Brooklyn, N.Y. 11203 STANDARD full staff_entry COMMENT Draft entry and computer readable sequence [1] kindly submitted by M. D'Alessio 28-SEPT-1988 FEATURES from to/span description pept 120 222 alpha-1 type I collagen precursor, exon 1 (AA at 120) /nomgen="COL1A1" /map="17q21.3-q22" /hgml_locus_uid="LG0047H" 1678 1872 alpha-1 type I collagen precursor, exon 2 2011 2045 alpha-1 type I collagen precursor, exon 3 2148 2183 alpha-1 type I collagen precursor, exon 4 2274 2375 alpha-1 type I collagen precursor, exon 5 3099 3170 alpha-1 type I collagen precursor, exon 6 3405 3449 alpha-1 type I collagen precursor, exon 7 3598 3651 alpha-1 type I collagen precursor, exon 8 3813 3866 alpha-1 type I collagen precursor, exon 9 4359 4412 alpha-1 type I collagen precursor, exon 10 4529 4582 alpha-1 type I collagen precursor, exon 11 4856 4909 alpha-1 type I collagen precursor, exon 12 4999 5043 alpha-1 type I collagen precursor, exon 13 5160 5213 alpha-1 type I collagen precursor, exon 14 5328 5372 alpha-1 type I collagen precursor, exon 15 5551 5604 alpha-1 type I collagen precursor, exon 16 5864 5962 alpha-1 type I collagen precursor, exon 17 6023 6067 alpha-1 type I collagen precursor, exon 18 6170 6268 alpha-1 type I collagen precursor, exon 19 6396 6449 alpha-1 type I collagen precursor, exon 20 6666 6773 alpha-1 type I collagen precursor, exon 21 6870 6923 alpha-1 type I collagen precursor, exon 22 7048 7146 alpha-1 type I collagen precursor, exon 23 7308 7361 alpha-1 type I collagen precursor, exon 24 7450 / 7548 alpha-1 type I collagen precursor, exon 25 sigp 120 185 alpha-1 type I collagen signal peptide matp 3162 3169 alpha-1 type I collagen 3404 3448 alpha-1 type I collagen 3597 3650 alpha-1 type I collagen 3812 3865 alpha-1 type I collagen 4358 4411 alpha-1 type I collagen 4528 4582 alpha-1 type I collagen 4856 4909 alpha-1 type I collagen 4999 5043 alpha-1 type I collagen 5160 5213 alpha-1 type I collagen 5328 5372 alpha-1 type I collagen 5551 5604 alpha-1 type I collagen 5864 5962 alpha-1 type I collagen 6023 6067 alpha-1 type I collagen 6170 6268 alpha-1 type I collagen 6396 6449 alpha-1 type I collagen 6666 6773 alpha-1 type I collagen 6870 6923 alpha-1 type I collagen 7048 7146 alpha-1 type I collagen 7308 7361 alpha-1 type I collagen 7450 / 7548 alpha-1 type I collagen pre-msg 120 > 7616 COL1A1 mRNA and introns IVS 223 1677 COL1A1 intron A IVS 1873 2010 COL1A1 intron B IVS 2046 2147 COL1A1 intron C IVS 2184 2273 COL1A1 intron D IVS 2376 3098 COL1A1 intron E IVS 3171 3404 COL1A1 intron F IVS 3450 3597 COL1A1 intron G IVS 3652 3812 COL1A1 intron H IVS 3867 4358 COL1A1 intron I IVS 4413 4528 COL1A1 intron J IVS 4583 4855 COL1A1 intron K IVS 4910 4998 COL1A1 intron L IVS 5044 5159 COL1A1 intron M IVS 5214 5327 COL1A1 intron N IVS 5373 5550 COL1A1 intron O IVS 5605 5863 COL1A1 intron P IVS 5963 6022 COL1A1 intron Q IVS 6068 6169 COL1A1 intron R IVS 6269 6395 COL1A1 intron S IVS 6450 6665 COL1A1 intron T IVS 6774 6869 COL1A1 intron U IVS 6924 7047 COL1A1 intron V IVS 7147 7307 COL1A1 intron W IVS 7362 7449 COL1A1 intron X IVS 7549 > 7616 COL1A1 intron Y BASE COUNT 1424 A 2273 C 2139 G 1771 T 9 others ORIGIN 113 bp upstream of XbaI site; chromosome 17q21.3-q22. 1 AGCAGACGGG AGTTTCTCCT CGGGGTCGGA GCAGGAGGCA CGCGGAGTGT GAGGCCACGC 61 ATGAGCGGAC GCTAACCCCC TCCCCAGCCA CAAAGAGTCT ACATGTCTAG GGTCTAGACA 121 TGTTCAGCTT TGTGGACCTC CGGCTCCTGC TCCTCTTAGC GGCCACCGCC CTCCTGACGC 181 ACGGCCAAGA GGAAGGCCAA GTCGAGGGCC AAGACGAAGA CAGTAAGTCC CAAACTTTTG 241 GGAGTGCAAG GATACTCTAT ATCGCGCCTT GCGCTTGGTC CCGGGGGCCG CGGCTTAAAA 301 CGAGACGTGG ATGATCCGGA GACTCGGGAA TGGAAGGGAG ATGATGAGGG CTCTTCCTCG 361 GCGCCCTGAG ACAGGAGGGA GCTCACCCTG GGGCGAGGTT GGGGTTGAAC GCGCCCCGGG 421 AGCGGGAGGT GAGGGTGGAG CGCCCCGTGA GTTGGTGCAA GAGAGAATCC CGAGAGCGCA 481 ACCGGGGAAG TGGGGATCAG GGTGCAGAGT GAGGAAAGTA CGTCGAAGAT GGGATGGGGG 541 CGCCGAGCGG GGCATTTGAA GCCCAAGATG TAGAAGCAAT CAGGAAGGCC GTGGGATGAT 601 TCATAAGGAA AGATTGCCCT CTCTGCGGGC TAGAGTGTTG CTGGGCCGTG GGGGTGCTGG 661 GCAGCCGCGG GAAGGGGGTG CGGAGCGTGG GCGGGTGGAG GATGAGAAAC TTTGGCGCGG 721 ACTCGGCGGG GCGGGGTCCT TGCGCCCCCT GCTGACCGAT GCTGAGCACT GCGTCTCCCG 781 GTCCAACGCT TACTGGGGCA GGAGCCGGAG CGGGAAGACC CGGGTTATTG CTGGGTGCGG 841 ACCCCCACCT CTAGATCTGG AAAGTAAAGC CAGGGATGGG GCAGCCCAAG CCTCTTAAAG 901 AGGTAGTCGG GCCGGTGAGG TCGGCCCCGC CCCGGCCCCA TTGCTTAGCG TTGCCCGACA 961 CCTAGTGGCC GTCTGGGGAG CCGCTAGCGC GGTGGGAGTG GTTAGCTAAC TTCTGGACTA 1021 TTTGCGGACT TTTTGGTTCT TTGGCTAAAA GTGATCTGGA GGCATTGGCT GGCTTTGGGG 1081 ACGGGGACGG CCCCGAGAGC GGGCTTTTAA GATGTCTAGG TGCTGGAGGT TAGGGTGTCT 1141 CCTAATTTTG AGGTACATTT CAAGGGGGGG CCTCCCTTCC CAATCAGCCG CTCCCATTCT 1201 CCTAGCCCCG CCCCCGCCAC CCCACCTGCC CAGGGAATGG GGGCGGGATG AGGGCCTGGA 1261 CCTTCCCTTC TCTCCTCCCT CGCCCTCCTC CTGTCTCTAC CACGCAGCCA CTCCCCACGA 1321 GCCTGCCCTC CCGATGGGGC CCCTCCTATT CTCCCCCCGC CCTCCCCCTC TCACCCTGTG 1381 GTTTTATTTC ACTTGGCTTC AGCGCCAATG GGCTGAGGTT GGAGTTGGAA GCCACCGCAG 1441 ACTAAAGCTT TGTTTAAATT CCTGAGAACT GGAAAGAGTT ACAGCCTCCC TGGCCAGGCG 1501 CCTCGGCNGG TCACCCGCGC TGATGAGGAG CAGGCGAGCT TTTAAGGATT TGAGGAAAGA 1561 AGAACGGGGG GAGGGGCGGG AAGTGAAAAA TCCAAGTGCC TCTTAGACCC GGGGGAAAGG 1621 TGGTTAAGCT GGGGGTTGCA GTCACTACTG ACAACGCCCC TCTTCCGCCT GTCCCAGTCC 1681 CACCAATCAC CTGCGTACAG AACGGCCTCA GGTACCATGA CCGAGACGTG TGGAAACCCG 1741 AGCCCTGCCG GATCTGCGTC TGCGACAACG GCAAGGTGTT GTGCGATGAC GTGATCTGTG 1801 ACGAGACCAA GAACTGCCCC GGCGCCGAAG TCCCCGAGGG CGAGTGCTGT CCCGTCTGCC 1861 CCGACGGCTC AGGTGCGCTG CGCTCGGCCT GGGGCCTGGG GCTGGGGCTG GGGGTGGTCG 1921 GCGCTCGCTG GCCCTCCGTG CTGGAGGCCT CTGCCGACGG GAGCAGCATT AGCAAACCTT 1981 GGCTCTAACG GGCGTCTCTT CGTCCCCTAG AGTCACCCAC CGACCAAGAA ACCACCGGCG 2041 TCGAAGTAAT CTCCTGCCCT CGAATTTTGC CCCTGCGCGG CCCGTGACTC CTCACAGTCC 2101 TCCCTTCTCT AACCTGGCCT CTTGTTTCTT CTCCCCCAAT CCCACAGGGA CCCAAGGGAG 2161 ACACTGGCCC CCGAGGCCCA AGGGTAAGCG TTGCACTCTG GGCTGTGGGG GGCTGCAAGG 2221 TGGGCATGGC TCTCGGCCCC ACGCTCACCC CGGCCCCGCC CTCTCCCCTG CAGGGACCCG 2281 CAGGCCCCCC TGGCCGAGAT GGCATCCCTG GACAGCCTGG ACTTCCCGGA CCCCCCGGAC 2341 CCCCCGGACC TCCCGGACCC CCTGGCCTCG GAGGAGTAAG TGGAGAGGCC TTGTGTGTCC 2401 ACTCTCCCCT GTTTTGTTTT TGTTTTTTGG CAGATGACAT AATTTTAATA CTTTGAAATA 2461 ATTTCAAACT TACAGAAAAG TTGCAAGAAT CCTACAGGAA ACTCTCACAT ACCCTTCACA 2521 GTTTGTGACA TGTGCTTTAT TAGTCTCTGT TTATGTATAT GTATCTTTTT TTTTCTGAAC 2581 TGTTTGAGCA AGTTGCTAAC ATCAGGCTCT TTTGCGCCTA AATACTTAGG TGTGTTTTTC 2641 CTAAAAACAA GAGCATTCTC TTAACTGACC TACACAATGA TTAAATTCAC TCTCTAATGT 2701 GCAGTCCGTA CTCAAAGTTC ACCGATGTCC CGATAATGTC CTTTATAGAT TCCAACCCCC 2761 ACCACCCCAA TCTGGGATCC AGTCCAGGAT TATGTATTGC ATTTAATCAT CATGTCTCTA 2821 GTTTCCACAA ATGTAGAACG TTCCTCAGAC TTTCTTTGTC TTTAGTGGCA CTGGGAGTTT 2881 TGATGAGTCC AGTTGTTTTG CAGACTGTCC CTCAATTTGG GATTGTCTCA TTAGATTAGA 2941 TGCAGGGATG CATCTTTGGC AGGAATGTCT TAAAAGCAAT GTTATTCTTC TCAGCACATC 3001 ACACCAGGAA GTGCATGATG TCAGTTTCTT CCATCCTCAG TGCCGTCTTC TGCCTTTCAA 3061 TTCACTGTCC TCACTCTGAC TTCTCTTGTT TGTTCTAGAA CTTTGCTCCC CAGCTGTCTT 3121 ATGGCTATGA TGAGAAATCA ACCGGAGGAA TTTCCGTGCC TGGCCCCATG GTGAGCCAGC 3181 AGGGGGAGCA TGGATGACAG AAGAGAGAAT GGGTATCCAG AGGAGGTGGG CATAGGCGGC 3241 TGGTATAGAC AGCTTGGGAG GTCCAGTTCA CCTTTGGGAC CTCAGAGTCC AGAAAGGATG 3301 CAGGACTGAA CTTGGGTTGG TCCCAACAGG CATGAATTGA CTTACATCCA CATGACTTTC 3361 CTACAGAGGG ATCACCATGA CCCCCCTTTC TTCTCCCTCT ATAGGGTCCC TCTGGTCCTC 3421 GTGGTCTCCC TGGCCCCCCT GGTGCACCTG TGAGTATCCA GGACGTCTTC ATATGCCTCC 3481 TTGGGCTTTG GTCTTTTGGA GGGAAGACTG GGATGAGGGC AGGAGAGATG CTCAGAGATC 3541 TCACTTGGTN GGTTGACGGG CTTCGTCCTC CAACCCATCT TTTTCCTTCC TCTCAAGGGT 3601 CCCCAAGGCT TCCAAGGTCC CCCTGGTGAG CCTGGCGAGC CTGGAGCTTC AGTAAGCACT 3661 CTCTATACAG ATTCATACTC CTTCTACAAA CACACAGACT CTCCTATAGA AGAACTCCCA 3721 GGCCTGGGTC TTCCTTACCT CTTCCCTTCA ATCCCAGCCT TCCCCTTCTT TTTTTCTTAT 3781 CCATATTCTA ACCACCTTTC TATCTTTTCT AGGGTCCCAT GGGTCCCCGA GGTCCCCCAG 3841 GTCCCCCTGG AAAGAATGGA GATGATGTAA GTATCCCAGC AATGCGGTCA TCTCAGACCC 3901 ATGGCCTACA TGGGTGTGGG TGCTGCAATT TCCGCTNCGG CAGACAGTTG GGAACGATAC 3961 TCANAGGAAG GAAGGGCAGG TCCTCTCATG ATGCACGGAC TGCCCTCGAA CATGATCTTT 4021 TCGCTTAGTG AGATGATTCC ATGTCCCCAA CAACAGTGAC TGTCTCCTCA CCCCAGCCAC 4081 CGTTGAGAGG CAATCCCCAA CCCCATCCTT GGGCAAATGT GCGGAGATGT GAAATTAAAA 4141 TGCTGTGACA GAAGTAGACA GAAATTCCTT TAGAGGCACT CAGAATTTCA CCAAACGAAG 4201 GTTTCACTGT AGATTTAAAC TGAGCTCTAG ATCAAGATAT ATCTGGCCCC AAACCTGACC 4261 TGCAACAATC CAAGAGACTG AAGACCTTCT CCACTTTTCC AGCCCCCTAG GCNATGGTGG 4321 GAGGCAAGAG GCATTGATGN TCTTTTCTCT CCCTCTAGGG GGAAGCTGGA AAACCTGGTC 4381 GTCCTGGTGA GCGTGGGCCT CCTGGGCCTC AGGTGAGCAG GGGGCTGTGG CTGAACCTGG 4441 GCTTCACTGC ACTTGGGCTT CATTTAGAAG CTGGGTCCAC AGTGATGTGT TCTAATGGCC 4501 CTTCCTTGTC TTCTTCATCT CTCTCCAGGG TGCTCGAGGA TTGCCCGGAA CAGCTGGCCT 4561 CCCTGGAATG AAGGGACACA GAGTGAGTCA CCTTTGAGTC ATTTAAGTGC CCCAAGTCCC 4621 TAGCATACCC CCATCCAGTC CCAGCCTCTC CCCAAAAGAT CCTGACGTTG CATCATGGTG 4681 GGTGGCAGCT ACAGAAGTCC CAAGGGCCAG AGAGTGGACA TCCAAAAGCA CTCCTCATGG 4741 AATCCCGATT ACCGATTGGG TGAGATCTTA GAGCCATTTG GGGTTTAGTC TAGTGGATGT 4801 CAAGGCTGTT CCACCCCCTC CACAGGTTCT TACCTTCTAC CTCTTTCCTG CTTAGGGTTT 4861 CAGTGGTTTG GATGGTGCCA AGGGAGATGC TGGTCCTGCT GGTCCTAAGG TAAGAGGCTG 4921 TCTGAACATC ATGGTCCTCC ACATCCCCGT GTTCCCACCA TGAATGAATT TCTCACTCGT 4981 TATTCTCTGT TTCTACAGGG TGAGCCTGGC AGCCCTGGTG AAAATGGAGC TCCTGGTCAG 5041 ATGGTGAGTG TGCCCAGTTC CAGAGGGCAG GGATGGGGCA GGAGGCAGGG GCAAGATGGA 5101 GGCCTGGGGG AACAAGGCTG TCTCCCATCT CATCTGACTT CTCTTGGTTT GGTTGTCAGG 5161 GCCCCCGTGG CCTGCCTGGT GAGAGAGGTC GCCCTGGAGC CCCTGGCCCT GCTGTAAGTA 5221 CTCCTGTGGC CTTGGGGGAT CCCTGAGCTC TGGAAGGGGC TCCCCAGGAA CTCTAGGGAC 5281 TGGCCAGTGC TCAGTGGACT TAACGGGGCT TCCCCTCTCT CCTGCAGGGT GCTCGTGGAA 5341 ATGATGGTGC TACTGGTGCT GCCGGGCCCC CTGTGAGTGT GGCCTGTAGG CCGTCAGGGC 5401 CTGGGAGTGG GGAGGGGTCT CAGTGTCTGC TCTTGGGGCT GACAATGGGG GCGAGGTTAT 5461 GTTGGTCTGA ACCCCAGGAC TTCCTCTGTC CCAGGTGTGA CTTGCAGCTG CCATCTCTTC 5521 CTTCTCGCTG ACATCTCCAT TCATTCACAG GGTCCCACCG GCCCCGCTGG TCCTCCTGGC 5581 TTCCCTGGTG CTGTTGGTGC TAAGGTGAGA CCCCCCCACT CTCCTCTAAG CATGACCCTG 5641 CATGGGCCAA GGGGTTCATG TCTCCCTGTT CCCCAAACCA AAGGGACCCA GAGTGGCAAG 5701 AGAGCAGCCC GTTCACTAAC ACCTTTGTCC TGGGGTCTCC GTCTCTGATC TTAGAGTCCT 5761 GATCATTGCT CTCCTGTCCC TGTCTCCCCT TCCTCCTGCC ATCCCGAGCG GCAAGGTTGG 5821 TTTTCCCAGG TGGTGTTGAT ATGTCTCTTT CTTTGTGATT GAGGGTGAAG CTGGTCCCCA 5881 AGGGCCCCGA GGATCTGAAG GTCCCCAGGG TGTCCTGGGT GAGCCTGGTC CCCCTGGCCC 5941 TGCTGGTGCT GCTGGCCCTG CTGTAAGTGT CCCCGACTCA GTGTCCCCTT TGCCACTTTC 6001 TANCCAGGTC CATGTGCCAA AGGGAAACCC TGGTGCTGAT GGACAGCCTG GTGCTAAAGG 6061 TGCCAATGTA AGTGCCTGCC AGGCTTCAGT CCCACTCCTA CCGCCTGCAG CCTGCCTGCC 6121 CTTTCCCTCT GCTNCTAGGC TCACGCAGTG GNTGTCTGCG TACCGATAGG GTGCTCCTGG 6181 TATTGCTGGT GCTCCAGGTT TCCCTGGTGC CCGAGGCCCC TCTGGACCCC AAGGCCCAGG 6241 CGGCCCTCCT GGTCCCAAGG GTAACAGCGT GAGTACCAAA CTCTCCTTCT ACCCATGCAC 6301 TGGCTCCAGC TGGGCTCTCA TCTGGGGAGC AGGCAAGACG CCAGAGCCAA CTGAGCGCCC 6361 CGACTCTCAG CCTATCCTCT TCTCCCCAGT TGCAGGGTGA ACCTGGTGCT CCTGGCAGCA 6421 AAGGAGACAC TGGTGCTAAG GGAGAGCCTG TAAGTCTCCC CGGCCATCCT TCTTGCAGCC 6481 CAGCCCACCC TGCCCTAGGA GCCCCTGAGG AAATCCAGAA AGGAAGAGGA GTCCCCTAGT 6541 CTTCTGGGGG AGTCCCTGCC ACACCCCCAG AACCCCTGAC ACTGGAGGCC CAGCCTCAGC 6601 CGGCTCTGAG GCTGGCACAG GATGGCCCCT CACCACAGGC CGCCTCCTCC TCTCGCCCTC 6661 TCCAGGGCCC TGTTGGTGTT CAAGGACCCC CTGGCCCTGC TGGAGAGGAA GGAAAGCGAG 6721 GAGCTCGAGG TGAACCCGGA CCCACTGGCC TGCCCGGACC CCCTGGCGAG CGTGTAAGTG 6781 TCCCTGCCCG CCCCCTCCCA GCCTCCACCC TCATTGCCTG GCTGGTGCCT GTGTGTCGCG 6841 GAGTTCACTG GCCTCCTCTC CTCCTGCAGG GTGGACCTGG TAGCCGTGGT TTCCCTGGCG 6901 CAGATGGTGT TGCTGGTCCC AAGGTAACCT CTCCTTGCGG CCGGGGGGCG ACCCTGCCGC 6961 TCCCTGGGCA TCTTCTTCCT CTTTTGGCCC GTGGCAAAGA GCCACAAACT TGAGACCCTA 7021 ACTGTTCCTG TGACTTCCCC CAACCAGGGT CCCGCTGGTG AACGTGGTTC TCCTGGCCCC 7081 GCTGGCCCCA AAGGATCTCC TGGTGAAGCT GGTCGTCCCG GTGAAGCTGG TCTGCCTGGT 7141 GCCAAGGTGA GGCCCCAGGC TTCAGCCTGC TTGGCCAGCC TGACCATCCC GTGTAGGTCT 7201 TGGGATGAGG CGTTCCGGAT CAGGCCCAAG GGGCTGCCCT CTGAAGTCCT CCCCCACCTC 7261 CATCATGCTT CTCCCCAAGT TCCACTCATA CCTCTCTGCC TCCCTAGGGT CTGACTGGAA 7321 GCCCTGGCAG CCCTGGTCCT GATGGCAAAA CTGGCCCCCC TGTAAGTATC ACTCCCCCTG 7381 AACCCCCTGC CATTGTCCTG TCTGCCTCCC TGCTGTCCTC ACTGCTGCTT TCGTGCCTCC 7441 CATCCTTAGG GTCCCGCCGG TCAAGATGGT CGCCCCGGAC CCCCAGGCCC ACCTGGTGCC 7501 CGTGGTCAGG CTGGTGTGAT GGGATTCCCT GGACCTAAAG GTGCTGCTGT GAGTATTAGA 7561 GTGAGGATGC CATGAAGGAG CCGAGGGACA AACGACAGCC TAGACGTGAA GGATCC // LOCUS HUMC1AIM1 721 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human (CRL 1262 variant allele) pro-alpha-1 type-I collagen gene, exons 31-27. ACCESSION K03176 KEYWORDS Alu repetitive sequence; alpha-1 collagen; alpha-1 type 1 collagen; alpha-collagen; collagen; repetitive sequence. SEGMENT 1 of 2 SOURCE Human fibroblast (cell line ATCC CRL 1262) DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 721) AUTHORS Chu,M.-L., Gargiulo,V., Williams,C.J. and Ramirez,F. TITLE Multiexon deletion in an osteogenesis imperfecta variant with increased type III collagen mRNA JOURNAL J. Biol. Chem. 260, 691-694 (1985) STANDARD simple staff_review COMMENT [1] sequenced the normal and defective alleles from the human fibroblast cell line CRL 1262, isolated from a fetus with osteogenisis imperfecta type II (lethal perinatal OI). The defective allele was found to have a deletion of 643 bp spanning exons 29-27 (for normal allele sequence see separate entry). A draft entry and printed copy of this sequence were kindly provided by M.-L.Chu (22-AUG-1985). FEATURES from to/span description pept < 1 48 variant pro-alpha-1 type I collagen, exon 31 (AA at 1) /nomgen="COL1A1" /map="17q21.31-q22.05" /hgml_locus_uid="LG0047H" 145 + 198 variant pro-alpha-1 type-I collagen, exon 30 IVS 49 144 C1AIV intron XXX IVS 199 > 721 C1AIV intron XXIX rpt 322 676 AluI repeat recomb 278 279 C1AIV intron XXIX end/intron XXVI start BASE COUNT 161 A 211 C 200 G 149 T ORIGIN 175 bp upstream of HhaI site; chromosome 17q21.31-q22.05. 1 CGAGGTGAAC CCGGACCCAC TGGCCTGCCC GGACCCCCTG GCGAGCGTGT AAGTGTCCCT 61 GCCCGCCCCC TCCCAGCCTC CACCCTCATT GCCTGGCTGG TGCCTGTGTG TCGCGGAGTT 121 CACTGGCCTC CTCTCCTCCT GCAGGGTGGA CCTGGTAGCC GTGGTTTCCC TGGCGCAGAT 181 GGTGTTGCTG GTCCCAAGGT AACCTCTCCT TGCGGCCGGG GGGCGACCCT GCCGCTCCCT 241 GGGCATCTTC TTCCTCTTTT GGCCCGTGGC AAAGAGCCTG GCCACTCACT CTCACTTTCT 301 GGACTCAGCC TCCCTATCTG TAAAATGAAA GACTTCTCGG CGGGGCACGG TGGCTCATGC 361 CTGTAATCCC AGCACTTTGG GAGGCCAAGG CGGGCGGATC ACCATGAGGT CAGGAGTTTG 421 AGACCAGTCG GGCCAACATA GTGAAACCAC GTCTCTACTA AAAATACAAA AGATTAGCCT 481 GGGTGTGGTG GTGTGCACCC TGTAACCCCA GCTAGTCAGG AGGCTGAGGC AGGAGAATTG 541 CATGAACCCA GGAGGTGGAG GTTGCAGTGA GCTGAGATCG CGCCACTGCA CTCCAGCCTG 601 GGCAACAGTG CGAGACTCCA TCTCAAAAAA AAAAAAAAAA AAAAGAAAGA AAGAAAGAAA 661 AAATGAAACA CTTCTCCAGG CTCCATGACC ACTGCTCTGT CCTGGAAATA AGTGTTGTTG 721 G // LOCUS HUMC1AIM2 56 bp ds-DNA PRI 08-APR-1987 DEFINITION Human (CRL 1262 variant allele) pro-alpha-1 type-1 collagen gene, exon 26, partial. ACCESSION K03177 KEYWORDS Alu repetitive sequence; alpha-1 collagen; alpha-1 type 1 collagen; alpha-collagen; collagen; repetitive sequence. SEGMENT 2 of 2 SOURCE Human fibroblast (cell line ATCC CRL 1262) DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 56) AUTHORS Chu,M.-L., Gargiulo,V., Williams,C.J. and Ramirez,F. TITLE Multiexon deletion in an osteogenesis imperfecta variant with increased type III collagen mRNA JOURNAL J. Biol. Chem. 260, 691-694 (1985) STANDARD simple staff_review COMMENT See segment 1. FEATURES from to/span description pept + 3 > 56 variant pro-alpha-1 type-I collagen, exon 26 /nomgen="COL1A1" /map="17q21.31-q22.05" /hgml_locus_uid="LG0047H" IVS < 1 2 C1AIV intron XXVI BASE COUNT 9 A 17 C 23 G 7 T ORIGIN About 450 bp downstream of segment 1. 1 AGGGAGAGCC CGGCAAGCGT GGAGAGCGAG GTGTTCCCGG ACCCCCTGGC GCTGTC // LOCUS HUMC1AIN1 1374 bp ds-DNA PRI 08-APR-1987 DEFINITION Human (CRL 1262; normal allele) pro-alpha-1 type-I collagen gene, exons 31-27. ACCESSION K03178 KEYWORDS Alu repetitive sequence; alpha-1 collagen; alpha-1 type 1 collagen; alpha-collagen; collagen; repetitive sequence. SEGMENT 1 of 2 SOURCE Human fibroblast (cell line ATCC CRL 1262) DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1374) AUTHORS Chu,M.-L., Gargiulo,V., Williams,C.J. and Ramirez,F. TITLE Multiexon deletion in an osteogenesis imperfecta variant with increased type III collagen mRNA JOURNAL J. Biol. Chem. 260, 691-694 (1985) STANDARD simple staff_review COMMENT [1] sequenced the normal and defective alleles from the human fibroblast cell line CRL 1262, isolated from a fetus with osteogenisis imperfecta type II (lethal perinatal OI). The defective allele was found to have a deletion of 643 bp spanning exons 29-27 (see separate entry). The limits of the deletion correspond to positions 279 and 931 of this sequence. A draft entry and printed copy of this sequence were kindly provided by M.-L.Chu (22-AUG-1985). FEATURES from to/span description pept < 1 48 pro-alpha-1 type-I collagen, exon 31 (AA at 1) /nomgen="COL1A1" /map="17q21.31-q22.05" /hgml_locus_uid="LG0047H" 145 198 pro-alpha-1 type-I collagen, exon 30 323 421 pro-alpha-1 type-I collagen, exon 29 583 636 pro-alpha-1 type-I collagen, exon 28 725 + 823 pro-alpha-1 type-I collagen, exon 27 IVS 49 144 C1AI intron XXX IVS 199 322 C1AI intron XXIX IVS 422 582 C1AI intron XXVIII IVS 637 724 C1AI intron XXVII IVS 824 > 1374 C1AI intron XXVI rpt 975 1329 AluI repeat BASE COUNT 265 A 438 C 375 G 294 T 2 others ORIGIN 175 bp upstream of HhaI site; chromosome 17q21.31-q22.05. 1 CGAGGTGAAC CCGGACCCAC TGGCCTGCCC GGACCCCCTG GCGAGCGTGT AAGTGTCCCT 61 GCCCGCCCCC TCCCAGCCTC CACCCTCATT GCCTGGCTGG TGCCTGTGTG TCGCGGAGTT 121 CACTGGCCTC CTCTCCTCCT GCAGGGTGGA CCTGGTAGCC GTGGTTTCCC TGGCGCAGAT 181 GGTGTTGCTG GTCCCAAGGT AACCTCTCCT TGCGGCCGGG GGGCGACCCT GCCGCTCCCT 241 GGGCATCTTC TTCCTCTTTT GGCCCGTGGC AAAGAGCCAC AAACTTGAGA CCCTAACTGT 301 TCCTGTGACT TCCCCCAACC AGGGTCCCGC TGGTGAACGT GGTTCTCCTG GCCCCGCTGG 361 CCCCAAAGGA TCTCCTGGTG AAGCTGGTCG TCCCGGTGAA GCTGGTCTGC CTGGTGCCAA 421 GGTGAGGCCC CAGGCTTCAG CCTGCTTGGC CAGCCTGACC ATCCCGTGTA GGTCTTGGGA 481 TGAGGCGTTC CGGATCAGGC CCAAGGGGCT GCCCTCTGAA GTCCTCCCCC ACCTCCATCA 541 TGCTTCTCCC CAAGTTCCAC TCATACCTCT CTGCCTCCCT AGGGTCTGAC TGGAAGCCCT 601 GGCAGCCCTG GTCCTGATGG CAAAACTGGC CCCCCTGTAA GTATCACTCC CCCTGAACCC 661 CCTGCCATTG TCCTGTCTGC CTCCCTGCTG TCCCCACTGC TGCTTTCGTG CCTCCCATCC 721 TTAGGGTCCC GCCGGTCAAG ATGGTCGCCC CGGACCCCCA GGCCCACCTG GTGCCCGTGG 781 TCAGGCTGGT GTGATGGGAT TCCCTGGACC TAAAGGTGCT GCTGTGAGTA TTAAGTGTAT 841 NGGATCCNCC CACGAAGAGC TAGGGACAAA CACACCCGAG ACTCGAAGGA GTCTTGGGCT 901 CTGGGCTCAG CTGTGCCGCT GACCTGCCGT GTGGCCACTC ACTCTCACTT TCTGGACTCA 961 GCCTCCCTAT CTGTAAAATG AAAGACTTCT CGGCGGGGCA CGGTGGCTCA TGCCTGTAAT 1021 CCCAGCACTT TGGGAGGCCA AGGCGGGCGG ATCACCATGA GGTCAGGAGT TTGAGACCAG 1081 TCGGGCCAAC ATAGTGAAAC CACGTCTCTA CTAAAAATAC AAAAGATTAG CCTGGGTGTG 1141 GTGGTGTGCA CCCTGTAACC CCAGCTAGTC AGGAGGCTGA GGCAGGAGAA TTGCATGAAC 1201 CCAGGAGGTG GAGGTTGCAG TGAGCTGAGA TCGCGCCACT GCACTCCAGC CTGGGCAACA 1261 GTGCGAGACT CCATCTCAAA AAAAAAAAAA AAAAAAAGAA AGAAAGAAAG AAAAAATGAA 1321 ACACTTCTCC AGGCTCCATG ACCACTGCTC TGTCCTGGAA ATAAGTGTTG TTGG // LOCUS HUMC1AIN2 56 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human (CRL 1262 normal allele) alpha-1 type-I collagen gene, exon 26. ACCESSION K03179 KEYWORDS Alu repetitive sequence; alpha-1 collagen; alpha-1 type 1 collagen; alpha-collagen; collagen; repetitive sequence. SEGMENT 2 of 2 SOURCE Human fibroblast (cell line ATCC CRL 1262) DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 56) AUTHORS Chu,M.-L., Gargiulo,V., Williams,C.J. and Ramirez,F. TITLE Multiexon deletion in an osteogenesis imperfecta variant with increased type III collagen mRNA JOURNAL J. Biol. Chem. 260, 691-694 (1985) STANDARD simple staff_review COMMENT See comment in segment 1. FEATURES from to/span description pept + 3 > 56 prepro-alpha-1 type-I collagen, exon 26 /nomgen="COL1A1" /map="17q21.31-q22" /hgml_locus_uid="LG0047H" IVS < 1 2 C1AI intron XXVI BASE COUNT 9 A 17 C 23 G 7 T ORIGIN About 450 bp after segment 1. 1 AGGGAGAGCC CGGCAAGCGT GGAGAGCGAG GTGTTCCCGG ACCCCCTGGC GCTGTC // LOCUS HUMC1PA2 251 bp ds-DNA PRI 11-NOV-1985 DEFINITION Human collagen type I: pro-alpha-2 gene, exon 1. ACCESSION K02568 KEYWORDS collagen. SOURCE Human cultured skin fibroblast genomic DNA from patient with osteogenesis imperfecta and from control. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 251) AUTHORS Pihlajaniemi,T., Dickson,L.A., Pope,F.M., Korhonen,V.R., Nicholls,A., Prockop,D.J. and Myers,J.C. TITLE Osteogenesis imperfecta: Cloning of a pro-alpha2(I) collagen gene with a frameshift mutation JOURNAL J. Biol. Chem. 259, 12941-12944 (1984) STANDARD full staff_review COMMENT In osteogenesis imperfecta a four bp deletion occurs, causing a frameshift, which moves the stop codon for the collagen gene. The mutation prevents incorporation of pro-alpha-2(I) chains into the normal type I pro-collagen heterotrimer resulting in secretion of only pro-alpha-1(I) homotrimers. The sequence presented here is the normal gene. The features contain the mutation causing OI. FEATURES from to/span description pept / 101 247 pro-alpha-2(I) collagen, exon 1 (AA 200 at 101) /nomgen="COL1A2" /map="7q21.3-q22.1" /hgml_locus_uid="LP0002V" IVS < 1 100 collagen cds intron mut 145 150 aaataa in normal collagen; aa in OI collagen BASE COUNT 79 A 48 C 46 G 78 T ORIGIN 51 bp upstream of HinfI site. 1 CGTGCCAAAC AGTTGGTTTC TTATTAAATC AAAGGTTTCA GATATCATCA GATTCAGAAA 61 TCGTGATGCT TTGTGTCTAT ATTTTCTTCT CTTTAAACAG AAAAAGACAA ATGAATGGGG 121 AAAGACAATC ATTGAATACA AAACAAATAA GCCATCACGC CTGCCCTTCC TTGATATTGC 181 ACCTTTGGAC ATCGGTGGTG CTGACCATGA ATTCTTTGTG GACATTGGCC CAGTCTGTTT 241 CAAATAAATG A // LOCUS HUMC1QB1 277 bp ds-DNA PRI 10-NOV-1986 DEFINITION Human complement C1q B-chain gene, exon A. ACCESSION X03084 KEYWORDS complement protein; complement protein C1q B-chain. SEGMENT 1 of 2 SOURCE Human liver, cDNA to mRNA; white blood cell DNA (library of A.Palsdottir). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 277) AUTHORS Reid,K.B.M. TITLE Molecular cloning and characterization of the complementary DNA and gene coding for the B-chain of subcomponent C1q of the human complement system JOURNAL Biochem. J. 231, 729-735 (1985) STANDARD simple staff_review COMMENT Data kindly reviewed by K.B. Reid, 12-DEC-1985. FEATURES from to/span description pept / 58 + 250 complement C1q B-chain precursor, exon A (AA -29 at 58) /nomgen="C1QB" /map="1p" /hgml_locus_uid="LR0054Y" pre-msg < 1 > 277 C1q B mRNA IVS 251 > 277 C1q B intron BASE COUNT 63 A 86 C 75 G 53 T ORIGIN 18 bp upstream of BstEII site. 1 GATAGGATCA CCACGGTGGT AACCTCTCAC ATTGTCTTCT CCACAGGAGG CGTCTGACAC 61 AGTATGATGA TGAAGATCCC ATGGGGCAGC ATCCCAGTAC TGATGTTGCT CCTGCTCCTG 121 GGCCTAATCG ATATCTCCCA GGCCCAGCTC AGCTGCACCG GGCCCCCAGC CATCCCTGGC 181 ATCCCGGGTA TCCCTGGGAC ACCTGGCCCC GATGGCCAAC CTGGGACCCC AGGGATAAAA 241 GGAGAGAAAG GTACCATGGG ATTTAGCAGG CACAGGT // LOCUS HUMC1QB2 977 bp ds-DNA PRI 10-NOV-1986 DEFINITION Human complement C1q B-chain gene, exon A+1. ACCESSION K03430 KEYWORDS complement protein; complement protein C1q B-chain. SEGMENT 2 of 2 SOURCE Human liver, cDNA to mRNA; white blood cell DNA (library of A.Palsdottir). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 977) AUTHORS Reid,K.B.M. TITLE Molecular cloning and characterization of the complementary DNA and gene coding for the B-chain of subcomponent C1q of the human complement system JOURNAL Biochem. J. 231, 729-735 (1985) STANDARD simple staff_review COMMENT Data kindly reviewed by K.B. Reid, 12-DEC-1985. FEATURES from to/span description pept + 56 630 complement C1q B-chain precursor, exon A+1 /nomgen="C1QB" /map="1p" /hgml_locus_uid="LR0054Y" pre-msg < 1 877 C1q B mRNA IVS < 1 55 C1q B intron BASE COUNT 225 A 302 C 249 G 201 T ORIGIN About 1 kb after segment 1. 1 AGGCCTCCTT CTTTTGGTCT CAGTGATCTC ACTTCTTTGG TCTCTGATTT TCCAGGGCTT 61 CCAGGGCTGG CTGGAGACCA TGGTGAGTTC GGAGAGAAGG GAGACCCAGG GATTCCTGGG 121 AATCCAGGAA AAGTCGGCCC CAAGGGCCCC ATGGGCCCTA AAGGTGGCCC AGGGGCCCCT 181 GGAGCCCCAG GCCCCAAAGG TGAATCGGGA GACTACAAGG CCACCCAGAA AATCGCCTTC 241 TCTGCCACAA GAACCATCAA CGTCCCCCTG CGCCGGGACC AGACCATCCG CTTCGACCAC 301 GTGATCACCA ACATGAACAA CAATTATGAG CCCCGCAGTG GCAAGTTCAC CTGCAAGGTG 361 CCCGGTCTCT ACTACTTCAC CTACCACGCC AGCTCTCGAG GGAACCTGTG CGTGAACCTC 421 ATGCGTGGCC GGGAGCGTGC ACAGAAGGTG GTCACCTTCT GTGACTATGC CTACAACACC 481 TTCCAGGTCA CCACCGGTGG CATGGTCCTC AAGCTGGAGC AGGGGGAGAA CGTCTTCCTG 541 CAGGCCACCG ACAAGAACTC ACTACTGGGC ATGGAGGGTG CCAACAGCAT CTTTTCCGGG 601 TTCCTGCTCT TTCCAGATAT GGAGGCCTGA CCTGTGGGCT GCTTCACATC CACCCCGGCT 661 CCCCCTGCCA GCAACGCTCA CTCTACCCCC AACACCACCC CTTGCCCAGC CAATGGACAC 721 AGTAGGGCTT GGTGAATGCT GCTGAGTGAA TGAGTAAATA AACTCTTCAA GGCCAAGGAA 781 CAGTGGTCTA ATTCAACTCT GTGTCCCAGC ACTGGCACAC CAGAAGTGCC ATGCTCAGAA 841 ATGTTGGTTA CATGAATGAA TGAACCATGA ATGAATGAAA TCTCTGTCCG TGTGCTGCTC 901 TGTGCCAACC ACACAGTGGG AGGTGGGGGT CCAGCACCAA CTATGCTCCC AGTCCTAGCC 961 TACACCTTTT CCTGATT // LOCUS HUMC2A11 102 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human collagen type II; pro-alpha-1, exon 14 from 3' end. ACCESSION K01785 KEYWORDS alpha-1 collagen; alpha-1 type 2 collagen; collagen; type II procollagen. SEGMENT 1 of 2 SOURCE Human DNA, library of T.Maniatis, clone LGHCo1(II)B. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 102) AUTHORS Strom,C.M. and Upholt,W.B. TITLE Isolation and characterization of genomic clones corresponding to the human type II procollagen gene JOURNAL Nucleic Acids Res. 12, 1025-1038 (1984) STANDARD full staff_review REFERENCE 2 (sites; exon number and relationship of to ) AUTHORS Upholt,W.B. JOURNAL Unpublished (1984) Pritzker School of Medicine, Chicago, IL. STANDARD full staff_review FEATURES from to/span description pept / 42 / 95 alpha-II collagen exon 14 (aa at 42) /nomgen="COL2A1" /map="12q14.3" /hgml_locus_uid="LX0121B" IVS < 1 41 pro-a1 intron 14 IVS 96 > 102 pro-a1 intron 13 BASE COUNT 14 A 38 C 30 G 20 T ORIGIN About 102 bp upstream of BamHI site. 1 GAGGGCTTGA GGTTCTCACC CCGTCTCCTC TCCCCACACA GGGAGCCACT GGATTCCCTG 61 GAGCTGCTGG TCGCGTTGGG CCCCCAGGTC CAGAGGTCAC CC // LOCUS HUMC2A12 165 bp ds-DNA PRI 08-APR-1987 DEFINITION Human collagen type II pro-alpha-1, exon 4 from 3' end. ACCESSION X00339 KEYWORDS alpha-1 collagen; alpha-1 type 2 collagen; collagen; type II procollagen. SEGMENT 2 of 2 SOURCE Human DNA, library of T.Maniatis, clone LGHCo1(II)A. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 165) AUTHORS Strom,C.M. and Upholt,W.B. TITLE Isolation and characterization of genomic clones corresponding to the human type II procollagen gene JOURNAL Nucleic Acids Res. 12, 1025-1038 (1984) STANDARD full staff_review REFERENCE 2 (sites; exon number and relationship of & ) AUTHORS Upholt,W.B. JOURNAL Unpublished (1984) Pritzker School of Medicine, Chicago, IL. STANDARD full staff_review FEATURES from to/span description pept < 1 / 154 alpha-1 type II procollagen exon 4 (aa at 1) /nomgen="COL2A1" /map="12q13.1-q14.3" /hgml_locus_uid="LX0121B" IVS 155 > 165 pro-a1 intron 4 BASE COUNT 41 A 53 C 47 G 24 T ORIGIN About 4 kb after . 1 GCCGGTGGCC TGAGACAGCA TGACGCCGAG GTGGATGCCA CACTCAAGTC CCTCAACAAC 61 CAGATTGAGA GCATCCGCAG CCCCGAGGGC TCCCGCAAGA ACCCTGCTCG CACCTGCAGA 121 GACCTGAAAC TCTGCCACCC TGAGTGGAAG AGTGGTAAGC TTGGA // LOCUS HUMC5A2A 269 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human fibrillar collagen (proa2(V)) gene, last exon. ACCESSION J03051 KEYWORDS collagen. SOURCE Human DNA, clones Hf-511 and DMC-2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 269) AUTHORS Tsipouras,P., Schwartz,R.C., Liddell,A.C., Salkeld,C.S., Weil,D. and Ramirez,F. TITLE Genetic distance of two fibrillar collagen loci, COL3A1 and COL5A2, located on the long arm of human chromosome 2 JOURNAL Genomics 3, 275-277 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by S.Halloran-Blanton, 09-AUG-1988. FEATURES from to/span description pept / 64 210 alpha-2 collagen type V, last exon (AA at 64) /nomgen="COL5A2" /map="2q31-q32.3" /hgml_locus_uid="LN0103Q" pre-msg < 1 > 269 COL5A2 mRNA and intron IVS < 1 63 COL5A2 last intron BASE COUNT 78 A 58 C 60 G 68 T 5 others ORIGIN 172 bp upstream of EcoRI site; chromosome 2q31-q32.3. 1 CCTGTCTAAC ATACATGGNN NAAANTGTAT GTACTTTTAA CCCATTGCAT NATTTAATTT 61 CAGAAGCGGA ATGGAAATGT GGGCAAGACT GTCTTTGAAT ATAGAACAGA GAATGTGGCA 121 CGCTTGCCCA TCATAGATCT TGCTCCTGTG GATGTTGGCG GCACAGACCA GGAATTCGGC 181 GTTGAAATTG GGCCAGTTTG TTTTGTGTAA AGTTAAGCCA AGCCAAGACA CATCGACAAT 241 GAGCACCACC ATCAATGACC ACCACCGCC // LOCUS HUMC9A 721 bp ds-DNA PRI 30-SEP-1988 DEFINITION Human complement component C9, exons 3 and 4. ACCESSION J02833 KEYWORDS complement component. SOURCE Human DNA, (libraries of Maniatis et al. and Frischauf et al.), clones lambda-GC9-[1-5]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 721) AUTHORS Marazziti,D., Eggertsen,G., Fey,G.H. and Stanley,K.K. TITLE Relationships between the gene and protein structure in human complement component C9 JOURNAL Biochemistry 27, 6529-6534 (1988) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly provided by K.K.Stanley, 27-MAY-1988. FEATURES from to/span description pept / 62 206 complement component C9, exon 3 (AA at 62) /nomgen="C9" /map="5p14-p12" /hgml_locus_uid="LX00296" 467 / 614 complement component C9, exon 4 pre-msg < 1 > 721 C9 mRNA and introns IVS < 1 61 C9 intron B IVS 207 466 C9 intron C (no splice consensus at 466) IVS 615 > 721 C9 intron D (no splice consensus at 615) BASE COUNT 211 A 129 C 174 G 207 T ORIGIN Unreported. 1 TTTAGACCAT ACCCTTTTTT TGGCATTAGA AATTCTAATC ATTCCAACTG ATATTGTGCA 61 GTTTCGTTCA AGAAGCATTG AGGTCTTTGG ACAATTTAAT GGGAAAAGAT GCACCGACGC 121 TGTGGGAGAC AGACGACAGT GTGTGCCCAC AGAGCCCTGT GAGGATGCTG AGGATGACTG 181 CGGAAATGAC TTTCAATGCA GTACAGGTAA TCTTTGTGCT TGAATTGCTC ATTGTGGCTT 241 TCTGTGTGCC TTCCTGAATG AAAAGAAAAA AAAAAGCATA TGCTTCTGGA CACTTGGATT 301 TCCAAGCGTG AAAGGCCCCA AAAGAGGCCA TCTATATATT CTGTACAAAG AACTCAAATG 361 TGTTTTTCTG ATACCTCACC TCCAGGGTTA AAATGCATTT GATAACTTCA GAAAGAAAGA 421 GAAAATTTGA CATTAGATAC ATTGAGTCTC TCCTGATTTT GATTGCGCAG ATGCATAAAG 481 ATGCGACTTC GGTGTAATGG TGACAATGAC TGCGGAGACT TTTCAGATGA GGATGATTGT 541 GAAAGTGAGC CCCGTCCCCC CTGCAGAGAC AGAGTGGTAG AAGAGTCTGA GCTGGCACGA 601 ACAGCAGGCT ATGGTGTGTA TTTTACTTGT ACTTTTTCAG ATGAAAATGA GTGAAAATGA 661 TCTCCATCTC TCTCATTGTG ATAGACTGGT AGAAGCATTT GTGCGAGGGA CATAGGTGAT 721 A // LOCUS HUMCAATP1 558 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human cardiac Ca2+ ATPase gene, (HK1) 3' end, and (HK2) exons x, x+1. ACCESSION M23116 J04025 KEYWORDS ATPase; Ca2+ ATPase; alternative splicing; calcium-ATPase. SEGMENT 1 of 4 SOURCE Human cardiac muscle DNA, clone RB2-5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 558) AUTHORS Lytton,J. and MacLennan,D.H. TITLE Molecular Cloning of cDNAs from Human Kidney Coding for Two Alternatively Spliced Products of the Cardiac Ca2+-ATPase Gene JOURNAL J. Biol. Chem. 263, 15024-15031 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Lytton, 14-MAR-1989. Two alternative splicing products, HK1 and HK2, are realized in the Ca2+ ATPase gene. HK2 codes for a protein identical to rabbit cardiac muscle Ca2+ ATPase, with the exception of 6 scattered amino acid replacements, whereas HK1 codes for a protein identical to that encoded by HK2, but with the carboxyl-terminal 4 amino acids replaced by an extended sequence of 49 amino acids. See accession M23114, M23115 and J04703. FEATURES from to/span description pept / 188 305 HK1 calcium-ATPase, exon x (AA at 189) /nomgen="ATP2B" /map="12" /hgml_locus_uid="LP0123P" 388 > 558 HK1 calcium-ATPase, exon x+1 IVS < 1 187 HK1 Ca2+ ATPase intron y IVS 306 387 HK1 Ca2+ ATPase intron y+1 pre-msg < 1 > 558 HK1 Ca2+ ATPase mRNA and introns pre-msg < 1 > 558 HK2 Ca2+ ATPase mRNA and introns pep$ / 188 305 HK2 calcium-ATPase, exon x (AA at 189) /nomgen="ATP2B" /map="12" /hgml_locus_uid="LP0123P" pep$ 388 + 508 HK2 calcium-ATPase, exon x+1 IVS < 1 187 HK2 HK2 Ca2+ ATPase intron y IVS 509 > 558 HK2 Ca2+ ATPase intron y+1 BASE COUNT 104 A 164 C 145 G 145 T ORIGIN 186 bp upstream of PstI site. 1 GAATTCACAG TTTGTCCTGC ATTAGGACAT TCTCTTCAAC TTTGCCACTG TAGAAAGTGG 61 AGGTAGGTCA GCGGATGGTG CCACATTAAC AGCCGCCTTA CTGAAGTGTA GTCCAACAGG 121 GTCTTACTGC CACTGTGACA CGTGCCTTGC CTTGGGGGTG CGTTTCCCAC CTCTCCTTGC 181 TCTGCAGCTT GTCCGAAAAC CAGTCCTTGC TGAGGATGCC CCCCTGGGAG AACATCTGGC 241 TCGTGGGCTC CATCTGCCTG TCCATGTCAC TCCACTTCCT GATCCTCTAT GTCGAACCCT 301 TGCCAGTAAC TGGTTGGGTG GGGCTTGGGA CCAGCCACCT CCTTCCAGGG GAGGCTGGAG 361 GCGTGACACG TCTTCCCTGT GTGTCAGCTC ATCTTCCAGA TCACACCGCT GAACGTGACC 421 CAGTGGCTGA TGGTGCTGAA AATCTCCTTG CCCGTGATTC TCATGGATGA GACGCTCAAG 481 TTTGTGGCCC GCAACTACCT GGAACCTGGT AAAGAGTGTG TGCAGCCTGC CACCAAATCC 541 TGCTCGTTCT CGGCATGC // LOCUS HUMCAATP3 296 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human cardiac Ca2+ ATPase gene, (HK2) exon x+2. ACCESSION M23278 J04025 KEYWORDS ATPase; Ca2+ ATPase; alternative splicing; calcium-ATPase. SEGMENT 3 of 4 SOURCE Human cardiac muscle DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 296) AUTHORS Lytton,J. and MacLennan,D.H. TITLE Molecular Cloning of cDNAs from Human Kidney Coding for Two Alternatively Spliced Products of the Cardiac Ca2+-ATPase Gene JOURNAL J. Biol. Chem. 263, 15024-15031 (1988) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.Lytton, 14-MAR-1989. Two alternative splicing products, HK1 and HK2, are realized in the Ca2+ ATPase gene. HK2 codes for a protein identical to rabbit cardiac muscle Ca2+ ATPase, with the exception of 6 scattered amino acid replacements, whereas HK1 codes for a protein identical to that encoded by HK2, but with the carboxyl-terminal 4 amino acids replaced by an extended sequence of 49 amino acids. See accession M23114, M23115 and J04703. FEATURES from to/span description pep$ + 192 205 HK2 calcium-ATPase, exon x+2 /nomgen="ATP2B" /map="12" /hgml_locus_uid="LP0123P" IVS < 1 191 HK2 Ca2+ ATPase intron y+2 pre-msg < 1 > 296 HK2 Ca2+ ATPase mRNA and introns BASE COUNT 70 A 68 C 58 G 100 T ORIGIN 2.2 kb after segment 2. 1 TCTAGATGCT ACCCTGTGTG GGCGGCACCT CAGGGACAGT AAATCAGAAA TGCTGGTCTT 61 GAAACCTTGA AAAGATCAAG CTGAATGTTC CTTTTCATCT GTCGCTGTTG ATCTTCATCT 121 ATTTAAATAG GTATTCTAAC GTTTCCTCTC TGTATTTCAT GAAGCTGATT TCCTCTCTCT 181 TTCCTTTTCA GCAATACTGG AGTAACCGCT TCCTAAACCA TTTTGCAGAA ATGTAAGGGT 241 GTTCGGTTGC GTGCATGTGC GTTTTTAGCA ACACATCTAC CAACCCTGTG CATGAC // LOCUS HUMCABLA1 234 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human c-abl gene, exon a1. ACCESSION M13098 KEYWORDS . SEGMENT 1 of 2 SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 234) AUTHORS Grosveld,G., Verwoerd,T., van Agthoven,T., deKlein,A., Ramachandran,K.L., Heisterkamp,N., Stam,K. and Groffen,J. TITLE The chronic myelocytic cell line K562 contains a breakpoint in bcr and produces a chimeric bcr/c-abl transcript JOURNAL Mol. Cell. Biol. 6, 607-616 (1986) STANDARD simple staff_entry FEATURES from to/span description pept / 31 + 204 c-abl exon a1 (AA at 3) /nomgen="ABL" /map="9q34" /hgml_locus_uid="LY0005H" pre-msg < 1 30 c-abl DNA IVS 205 > 234 c-abl intron A BASE COUNT 53 A 59 C 57 G 65 T ORIGIN Chromosome 9q34. 1 CTTTTTTCTG TTCCCCCCTT TCTCTTCCAG AAGCCCTTCA GCGGCCAGTA GCATCTGACT 61 TTGAGCCTCA GGGTCTGAGT GAAGCCGCTC GTTGGAACTC CAAGGAAAAC CTTCTCGCTG 121 GACCCAGTGA AAATGACCCC AACCTTTTCG TTGCACTGTA TGATTTTGTG GCCAGTGGAG 181 ATAACACTCT AAGCATAACT AAAGGTAAAA GGGTTGTGGG CAGCTAGTGG TGGT // LOCUS HUMCABLA2 215 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human c-abl gene, exon a2. ACCESSION M13099 KEYWORDS . SEGMENT 2 of 2 SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 215) AUTHORS Grosveld,G., Verwoerd,T., van Agthoven,T., deKlein,A., Ramachandran,K.L., Heisterkamp,N., Stam,K. and Groffen,J. TITLE The chronic myelocytic cell line K562 contains a breakpoint in bcr and produces a chimeric bcr/c-abl transcript JOURNAL Mol. Cell. Biol. 6, 607-616 (1986) STANDARD simple staff_entry FEATURES from to/span description pept + 31 > 215 c-abl exon a2 (AA at 3) /nomgen="ABL" /map="9q34" /hgml_locus_uid="LY0005H" IVS < 1 30 c-able intron A BASE COUNT 56 A 54 C 54 G 51 T ORIGIN About 560 bp downstream of exon a1; chromosome 9q34. 1 ATATGTCTGA TTTGGTTCCT TTCTTCTCAG GTGAAAAGCT CCGGGTCTTA GGCTATAATC 61 ACAATGGGGA ATGGTTTGAA GCCCAAACCA AAAATGGCCA AGGCTGGGTC CCAAGCAACT 121 ACATCACGCC AGTCAACAGT CTGGAGAAAC ACTCCTGGTA CCATGGGCCT GTGTCCCGCA 181 ATGCCGCTGA GTATCTGCTG AGCAGCGGGA TCAAT // LOCUS HUMCACY 3671 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human calcyclin gene, complete cds. ACCESSION J02763 KEYWORDS calcyclin. SOURCE Human placenta DNA, (library of P.Leder), clones pG2A9B[1.7,3.0,6.0]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3671) AUTHORS Ferrari,S., Calabretta,B., deRiel,J.K., Battini,R., Ghezzo,F., Lauret,E., Griffin,C., Emanuel,B.S., Gurrieri,F. and Baserga,R. TITLE Structural and functional analysis of a growth-regulated gene, the human calcyclin JOURNAL J. Biol. Chem. 262, 8325-8332 (1987) STANDARD simple staff_entry REFERENCE 2 (bases 1321 to 1380) AUTHORS Ghezzo,F., Valpreda,S., de Riel,J.K. and Baserga,R. TITLE Identification of serum-responsive elements in the promoter of human calcyclin, a growth-regulated gene JOURNAL DNA 8, 171-177 (1989) STANDARD simple staff_entry FEATURES from to/span description pept 2024 2161 calcyclin, exon 2 (first expressed exon) /nomgen="CACY" /map="1q21-q25" /hgml_locus_uid="LC0046F" 2534 2668 calcyclin, exon 3 pre-msg 1372 > 2746 CCY mRNA and introns IVS 1416 2002 CCY intron A IVS 2162 2533 CCY intron B BASE COUNT 732 A 987 C 1110 G 842 T ORIGIN 215 bp upstream of HinfI site; chromosome 1q21-q25. 1 AGTACTCGGT GTTCCTGAGG ATGCTGTGCA TGGCCTACAA CGACTTCTTT CTAGAGGACA 61 ACAAGTGACC AGGGCTGCCC TCCACCCTCA CCCTCCACCC TTTGCTGCTG ACCTCGGCTG 121 CTCCTCTCAC AGACCCTCTT TGGCCCCTGC CCTCCTCTCC CTCCCAGATG GACCCTTCCA 181 TGGGAGGAAA TAAAGTTTCC ATCGCAGGTG CTGGGAGTCT GGTTTTGAAG CTGTCTTGTC 241 TACCTTGGCC TGGGGAGAGG GGAGCACAGG AAGGGTCTCT CCTTGAGTGG GTTGAGACAG 301 CTTCTGCCTC TGGGGGTTAG GGTCCTGGGC TCCCACTGCA TTCCTCTCCT TCTTTGGTGT 361 GGACGTCATT GGTTTTGTCA TGGCTTAGTT TTGCCTGCCT GGAAAATGGG GAAGTTAGGC 421 CAGGCGGGAA CTCTGCAAGG ATGCAGAGGA AGTTAAGAGG GAAAGTTGCT TTGAGAGGAG 481 GACACTGGGA GGGGTTGGGA GTGGCTCCTG AGGGCGGTGA TAGGCAGGCA GGCCTGACTT 541 GTCCACAGCT CACCCGGAGG CCACCTTGGC AGCACCTGTA GGAAGGGCAT GTCTGGCCTC 601 CACACCAGCC CCCTCCCTCT TCACCATTTC CCCTTCAATA GCACCACTCT CATCATCTAT 661 GGGGGACAGT GCTTTCTTCT CTCCCTGCCT CCTCCATCAA AATCTTTTCT CAGGGGAGGG 721 TCTGAAAAGG CCTTCACTCC CCCGTAAATA ACGAATGGTG CTTACAGGGC TGGGCTCCCA 781 CGTGCATGCA CATTAACACC AAAGGTGCTG TAGTGAATGG AATTTGGGGC ACTGAGGGGA 841 AGGCGTGGAG GTGTTGGTAG GAACTTGTTG CTGGTGGGGG ATGGGCGCCG TAGATATCCT 901 TTACACCACT GGCTACTCCC CCTATCTCCT CTGGGGTGAC CCTGAGTATC CTCTGTGGGA 961 CACCGGCATC CTGTGAGGCG CCCTCCTTGC CCACATTGAC GCTGCGCTGG CTCGAGGGTC 1021 ACATTCACGG TCTGGCAGAG GAAGCAGGGG TGACCGCCGC AGTCCTCCTC CTGCTCCCCT 1081 TGCCGAGTCA CGTGTCACGA AGAGCAAACT GAGCAAACTG AGCTGCGCAG ATGAGGGGAG 1141 ACTCGTCACC AGGCGTGCAG TGGGCACTGC TGGGCTCCCC CATCCCGTCC TAACCCGGAA 1201 CAGCCCCGGG CAGGAGGCGT GGAAAGTCGA GGGGGTAAAC CGCGAATGTG CGTTGTGTAA 1261 GCCACGGCGC AGGGTGGGGC GCGGGCGGGA CTTGGGCGGG CGGGGTGGGC TTGGCCGAGC 1321 TGGCCTCCGG GGCACCGACC GCTATAAGGC CAGTCGGACT GCGACACAGC CCATCCCCTC 1381 GACCGCTCGC GTCGCATTTG GCCGCCTCCC TACCGGTGAG TTCTCTCCAG GAGCCCTGGG 1441 TACTTTCCAG GGCCAGCTGC CCTCACGCTG GGGGTCCAGC CATCCCCTGC CCAGTTCAGC 1501 CGCTGGATCC AGACTGGGGC CATCTGTGGC GCTCCCCCGC TGGAGGGATA GTCAGGAGCA 1561 GCAGTGCTGT GCCAGGCAGG CCTTGGGCTA AGGGATCGCA ATGGGGTGTG CTCTTTTGGG 1621 GTGCGGAAGG GAGTGCCCTG GGTGTGTCAT TGCCACCATG TGTGGCCCTG TGAAGCTGTG 1681 TTTAAGCTGC CTTTGCAGCC TCCATTCCCC TCCCCTGCCC AGCCATACTC CTCAACTTCT 1741 GGATCCCCTG AAGGACAGTT CTCAGCTGTG CCCAAAGCTA CTGTTCCTAT ATGCTTCTTA 1801 GAATCCTTAA GCCACCTCTC TTGCCTTGGC CCTAGTGTGC TCTCTCCTTC CCCTTCAGCC 1861 CTGGGCTGTC TCCTGATGCC ATTGTGTGTG GCCTGAGACT GGGTGGTTCC AAAGGAGGCG 1921 GGGCTAGTGC AGGCAGCATT ATTGGGGTGT GTGGGTGAGA AGTCCTTGCT CCCATGGCAC 1981 TGACTAGGCC CTCTGCTGCC AGCTCCAAGC CCAGCCCTCA GCCATGGCAT GCCCCCTGGA 2041 TCAGGCCATT GGCCTCCTCG TGGCCATCTT CCACAAGTAC TCCGGCAGGG AGGGTGACAA 2101 GCACACCCTG AGCAAGAAGG AGCTGAAGGA GCTGATCCAG AAGGAGCTCA CCATTGGCTC 2161 GGTGAGTGGC CTCCTCCCCA GGACCCCTTT TCCCACCCTT GTCCTTTGGA AGCAAGGATT 2221 AGGGGAGAGA GAGGTGCCAG GTGCATCTGA CTCACATTTA CCCACATTCT GAGGCCCTGG 2281 TCCACATGTA GACCCTGAGC TGTAGACCCA CTCTCCCAGC GGGTAGGGGA TGCTTCCAGC 2341 CGGATATCCA TCTCTCCAAA TGAGGACCAG TAACTGAGAA GTATCTGAGG AGAAGCAATG 2401 CCAAAGTGAC ATGGGTCCTT GGTGATGAGG GAGCACAGAG CCACTTGCAG AGAGGATTGC 2461 CTAGGAGGGG GAAGGGGAAG AATCCAGGGT TGTCATCACC ACTGAGTATG GATTTCACAT 2521 TCTAACACAT TAGAAGCTGC AGGATGCTGA AATTGCAAGG CTGATGGAAG ACTTGGACCG 2581 GAACAAGGAC CAGGAGGTGA ACTTCCAGGA GTATGTCACC TTCCTGGGGG CCTTGGCTTT 2641 GATCTACAAT GAAGCCCTCA AGGGCTGAAA ATAAATAGGG AAGATGGAGA CACCCTCTGG 2701 GGGTCCTCTC TGAGTCAAAT CCAGTGGTGG GTAATTGTAC AATAAATTTT TTTTGGTCAA 2761 ATTTACCCTT GCGTCTTGGC TTCCGAATGA TTTCTGTTCC TCCTTGGCTT AGTGGGACAC 2821 CAGCCATTGG AAGATTTGCT CACGGTCAAC CTCTGAAAAT GACTCATTGA CTCGCCAGGC 2881 CAGAGGACCC ACCCTGACAA GGCTGCCTCT AGCGCGTAAG GTGCCTTTAT GTGAATGAGG 2941 AGAGATGCCC CTCTTGGCAA CGCCATCCTA AGGAAAGGCT CAAGTGGTTT CCAGTAGAGA 3001 GAGTCCTGGG ATGAGCTTGG AGATGGAAAT GGTCCTTTGG GCCGGGATGT GATGGGGTTT 3061 GGGGGCCTGG AAGTGAGGCA GAGATAGTTC CAGAGGCTCC CAGATGTGTT TTGCTCTGGG 3121 TGTGGCAAGA GGGGCCTTGG GGTGGGGCAA GTCCCTTTCT CATCACAGCG CAGGGGTTAG 3181 ATAGGGCACA TCTGAGATGC CTGAGGCTTG GCTCAGGGAG TTTCCTACAC CAGTGAGGAC 3241 GCTGTGTGAC TGAGTCTACT GCGGCTGCCC AGGTCCCAGG TGGAGTGGGG GAGGCACACT 3301 CTTGGAGTGT GTCCCGTCAT TCAGGGTGAG GGCTTTTTGT TGGAACGGTG GTCTGAGGAG 3361 CTGGCAGCTG CACCAACACG TGAACCACGG GGTGTTCAGT AATGGGGCGG GGTATCCCTG 3421 CAGCCTCAGC GTAATGACTC ACCCGGCACT TCCACGGGAT CCAGCCTGGA TCTCAGCCCC 3481 CATCAGAGAA GATGACTAAT TGAATCATTG TCCATCATCT GGATTAGTGT TTTAAGGCAG 3541 AAGGGAAGAG GATAAGGAGG GTAAACGCTG TTTCCGGGTG ATGCCACATC ATTAAGCCTC 3601 TCTAGGCCTA GTCCGAGCTG GGCAAGTTTA CCTCTAGCTT CTGGGGAAGA GATCTTGACT 3661 TTAGATGGAG A // LOCUS HUMCAII 2152 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human gene fragment for carbonic anhydrase II (exons 1 and 2). ACCESSION X03251 M18100 KEYWORDS anhydrase; carbonic anhydrase; tandem repeat. SOURCE Human. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2152) AUTHORS Venta,P.J., Montgomery,J.C., Hewett-Emmett,D. and Tashian,R.E. TITLE Comparison of the 5'regions of human and mouse carbonic anhydrase II genes and identification of possible regulatory elements JOURNAL Biochim. Biophys. Acta 826, 195-201 (1985) STANDARD simple automatic REFERENCE 2 (bases 1 to 647; corrects [1]) AUTHORS Shapiro,L.H., Venta,P.J. and Tashian,R.E. TITLE Molecular analysis of g+c-rich upstream sequences regulating transcription of the human carbonic anhydrase II gene JOURNAL Mol. Cell. Biol. 7, 4589-4593 (1987) STANDARD simple staff_review COMMENT Direct repeat 1 (CCGCCC) and direct repeat 2 (GGGCGG) are putative elements controlling transcription efficiency. The conserved tandem-repeat element is similar to a tandem repeat sequence located at about the same position in mammalian B-globin genes. Data kindly reviewed (14-MAY-1986) by R. Tashian EMBL features not translated to GenBank features: key from to description RPT 10 15 direct repeat 1 RPT 71 76 direct repeat 2 RPT 321 326 direct repeat 2 RPT 395 400 direct repeat 1 RPT 414 419 direct repeat 1 RPT 422 427 direct repeat 1 RPT 427 432 direct repeat 1 SITE 443 456 tandem repeat sequence RPT 453 458 direct repeat 1 SITE 456 469 tandem repeat sequence PRM 486 490 pot. CCAAT box RPT 509 514 direct repeat 1 PRM 539 544 TATA box FEATURES from to/span description pept 646 679 carbonic anhydrase II, exon 1 /nomgen="CA2" /map="8q22" /hgml_locus_uid="LN0043N" 1830 / 2027 carbonic anhydrase II, exon 2 IVS 680 1829 CA II, intron A IVS 2028 > 2152 CA II, intron B BASE COUNT 411 A 695 C 647 G 399 T ORIGIN 1 ACCCGCTTGC CGCCCCAGGC CGGGGACACC AGAGACTGAG CCCTTGCGCG CTGGAGACCC 61 GGCGCGGGTG GGGCGGGAGG GCACCGGGGC CAAGAGACAG GTCACAATGG GGACGGGGAC 121 GAAGGCGCCG GGGGTCCGGG TCCCGAGCAG TCCCCGCCGC CGCCAGACTC CGCAGCCGGG 181 AGGGGGATGG GGTCGCGGAG AGGCTCCGCG GCCCGGGTTG GACGGAGGAG CCCAGGAGCC 241 ACCGCTGCCG CCGCACGCCC CGAGCTCCGA GCCCGCGCCA GCCCTGGCCC CCGGGGCCCT 301 GCCGTGGGCC GAGGGGAGCC GGGCGGCGGG AGAGGTGCCC CGGTGCCCCG GTGCCCCGGT 361 GCCCCGCCGG CGCCCAGCCG AGAGGGCGTT TTCCCCGCCC CCAGGAGACT CGCCCGCCCG 421 GCCGCCCCGC CCCGAGCGCG GGGAGTTCAC CTCCGCCCGT CACCTCCTCC CCTTGTCGCC 481 TAGGTCCACC CGAGCCCCCT CCCCCGGGCC GCCCCCGAGC ACGAAGTTGG CGGGAGCCTA 541 TAAAAGCTGG TGCCGGCGCG ACCCGCGGAC ACACAGTGCA GGCGCCCAAG CCGCCGCCGC 601 CAGATCGGTG CCGATTCCTG CCCTGCCCCG ACCGCCAGCG CGACCATGTC CCATCACTGG 661 GGGTACGGCA AACACAACGG TGAGTGCCGG CGACGGCCAG CGCGGGGGCG CCCCGATCCC 721 CGATCCCCGA TCCCCGATCC CCGATCCCCG ATCCCCGATC CCGGTCGCCG GCCCGGGGCG 781 CAGCGCCCGC ACATGCTGTT TACCGCGGCC GCGGTGGTGC TGGAGGCTCA GGTGCGCCCC 841 GGGCGCGCTC CGCTCGCGGC TCCGCGCGCC GGGGATGTCC CCCTTGCCCC AGCTGCGAGG 901 CCACTGTGGA GGAATCCCCG CGCCGCCGGA GGAGGGCCCG AGGGAGGGGA AGGCGCGGCC 961 GACCGCGGGA CCCGAGGACA GTCCCTCCCG GGTCCCGACC TGGGGATCAT TTTAACCGGA 1021 CCTAGGAGGA GGAGGCGGGA AAGGGTTGTA ACGGAAAATT CTAGTTGTTG ATCGCAGAGA 1081 AATTAAGAGA CTCCCCTTCC CCCCTTCCCC CACCTTCCAC CCCCACCCCA CCCCTCCAGC 1141 TTCAGCACCA CCTGTGGACT AAGGCGCTCA GCACGAACTG TCCCGGGGCA TTTTCCAGTG 1201 CTGGTTTGAA TCCATGGCTC TGATTTCCGA GTTTCCCCTT CATCTCTCGA CTTCTAATGT 1261 TAGGGGGTCG GACATCAGGA ATCGGCTTCT TGCCAGATCT GGTTCGGAGC CAGCGGAGCG 1321 AGGAGCATGC GTCTGGCGCA CCTAGCGCTC TTTGGAGGGT GTGGGGCTTC CCAGGTAGTG 1381 GGGAACCCTG ACGGTTAAAG GTGGGGTGGG CCCGGGCCGG GCAGTGAGGA AAGGATCCAG 1441 ACCTCCTTGA ATGTCTTAAG TGAGCTTGCA TATCCCAAAA TCGCAACCAC AAGCCCTGAC 1501 ATTAGTGTCT GCCCGATTTC AGTTGCTGAA TTTCAGTAAA ACGACCTTAA AATAGCTAAT 1561 ATTTATATAG CACTCAGTGA TCTAAGAGCT TTACATATAT CGATTCGAAT TCTTACAGCG 1621 ACATCTATGA GGTAGATTTC TAATTATCCC ATATTACAAA TGTGGAAACT GAGGCACAGA 1681 TTACGTGTTT TCCCAAAATT TAGCCCATTG TTAAGTGATG CTTCTAAAAT TGGAACTGAG 1741 CAGATTGGCT CCGGAATGAT TGCTCTTCTC TAGGGGTCTG GGTGTACCTT TCCCCACAAT 1801 GGGGGATTCA CATGTCTTCT TTCCCCCAGG ACCTGAGCAC TGGCATAAGG ACTTCCCCAT 1861 TGCCAAGGGA GAGCGCCAGT CCCCTGTTGA CATCGACACT CATACAGCCA AGTATGACCC 1921 TTCCCTGAAG CCCCTGTCTG TTTCCTATGA TCAAGCAACT TCCCTGAGGA TCCTCAACAA 1981 TGGTCATGCT TTCAACGTGG AGTTTGATGA CTCTCAGGAC AAAGCAGGTC AGTGTTTAGA 2041 AAATAACTTG TGTCTTTTAG CCAGTAGCTG TTTTCCGAGC TTAATGGAAG GAGCTAGGAA 2101 CAGTGGCAAG GAACCCTCTT AATAATACAG TTTGTCTCAG GACTCAAGGA TG // LOCUS HUMCAIII1 1074 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human carbonic anhydrase III gene, exon 1. ACCESSION M29452 M27974 M14995 M22658 KEYWORDS carbonic anhydrase. SEGMENT 1 of 7 SOURCE Human leukocyte DNA, and adult skeletal and heart muscle, cDNA to mRNA, clone pHMCAIII. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 630 to 715) AUTHORS Wade,R., Gunning,P., Eddy,R., Shows,T. and Kedes,L. TITLE Nucleotide sequence, tissue-specific expression, and chromosome location of human carbonic anhydrase III: The human CAIII gene is located on the same chromosome as the closely linked CAI and CAII genes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 9571-9575 (1986) STANDARD simple staff_review REFERENCE 2 (bases 639 to 715) AUTHORS Lloyd,J., McMillan,S., Hopkinson,D. and Edwards,Y.H. TITLE Nucleotide sequence and derived amino acid sequence of a cDNA encoding human muscle carbonic anhydrase JOURNAL Gene 41, 233-239 (1986) STANDARD simple staff_entry REFERENCE 3 (bases 1 to 1074) AUTHORS Lloyd,J., Brownson,C., Tweedie,S., Charlton,J. and Edwards,Y.H. TITLE Human muscle carbonic anhydrase: Gene structure and DNA methylation patterns in fetal and adult tissues JOURNAL Genes Dev. 1, 594-602 (1987) STANDARD simple staff_entry FEATURES from to/span description pept 682 + 715 carbonic anhydrase III, exon 1 (EC 4.2.1.1) /nomgen="CA3" /map="8q13-q22" /hgml_locus_uid="LP0089S" pre-msg 623 > 1074 CAIII mRNA and intron IVS 716 > 1074 CAIII intron A BASE COUNT 261 A 296 C 297 G 220 T ORIGIN Chromosome 8q13-q22. 1 TCCGGACCCT TTCTTCCCAT CCTTTGCTCC TAGGATTTTA CATGTTGCCT GCAAAGGGAG 61 ACAAACTTAG GGGGCAGGCA AACAAACGAG TTCTTTCCAG CCTCTGTAAC CGGATCGCTA 121 GAGCGAAATA AACTCGCACA AGTGTCCAGA GATCGTAGCC AGACAGCCAG CCTGCGCTTG 181 AAGCAACTTT TAAGTGAGGC TGCAAGAGCC GCCGGGATGT AGATTTTAGT TCGTGGCCAA 241 GCACAACTAC GACGATCCTG TCCCTGCCCC ACCCCATCCC CAAGAATGCA TGGAGGAAGG 301 AGAGAGGAGC AGGTGAGGGC CGCCTGCATT TCTGCACGTC GGCGCCGGTT AGAAACCCTG 361 CAGTTTTGAG AGAGAAGAAG AGGAGATGGA GGGGCCAGGA GCCACGACTC CCGGGAGAGC 421 GCAGGGAGGG GCGTGGGTGC CCCTTCGCCC ACCTCCGCCC CCGTCACCTC GACAGCTGTC 481 CCGCTCTTGG AATTCATTGG CTTCCTCTAC CCGGCCTGGG AAACACCACC CCAATCTAGT 541 TTAGCCCCCC GCCCCACCCT CGCTGACCTA ATAAGGCCAT GCAAGTGTGC GGGGGAGCTA 601 CATAAAAGCG CGGGCTCGCG CGACTCTGCA CCACGCAGGG GAAGAGAAAG CAGGAGCCGT 661 CCAGCACGGA GGAAGGCGAC CATGGCCAAG GAGTGGGGCT ACGCCAGTCA CAACGGTGAG 721 TGCAGGCAGC CGCGACCCGG CCAGGAAGGG ATGCCAGTCC AGGAGAGCCC TGCCATTGCA 781 CAGAAATGGG CAACTTTAGA GACTGCAGTG GAAAATGTAG GAGTAGAATA ACACCTAACA 841 TTTACTGAGG CTTTTCAACT GCCAAATGCT GCTGCTTCTT TTTTTCCTTC ATCTCATTTG 901 GTTCTCCCTA GTATGGAATT TTTATTTCCC TTGGAGAAAA CTGAAGTGCA GAGAAGTTGG 961 ATCACTTGCC TAAGATTCCC ATTGCCTCTG AGAAGTCAGA CAGAAAGAGG TCAGGTGTGA 1021 CTGGCCCTTA TTCTGTTTTA CTGGACAAGC ACCTAACCTG AGCTTGGTGC CGGT // LOCUS HUMCAIII2 238 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human carbonic anhydrase III gene, exon 2. ACCESSION M29453 M27974 M14995 M22658 KEYWORDS carbonic anhydrase. SEGMENT 2 of 7 SOURCE Human leukocyte DNA, and adult skeletal and heart muscle, cDNA to mRNA, clone pHMCAIII. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 21 to 218) AUTHORS Wade,R., Gunning,P., Eddy,R., Shows,T. and Kedes,L. TITLE Nucleotide sequence, tissue-specific expression, and chromosome location of human carbonic anhydrase III: The human CAIII gene is located on the same chromosome as the closely linked CAI and CAII genes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 9571-9575 (1986) STANDARD simple staff_review REFERENCE 2 (bases 21 to 218) AUTHORS Lloyd,J., McMillan,S., Hopkinson,D. and Edwards,Y.H. TITLE Nucleotide sequence and derived amino acid sequence of a cDNA encoding human muscle carbonic anhydrase JOURNAL Gene 41, 233-239 (1986) STANDARD simple staff_entry REFERENCE 3 (bases 1 to 31; 206 to 238) AUTHORS Lloyd,J., Brownson,C., Tweedie,S., Charlton,J. and Edwards,Y.H. TITLE Human muscle carbonic anhydrase: Gene structure and DNA methylation patterns in fetal and adult tissues JOURNAL Genes Dev. 1, 594-602 (1987) STANDARD simple staff_entry FEATURES from to/span description pept + 21 + 218 carbonic anhydrase III, exon 2 (EC 4.2.1.1) /nomgen="CA3" /map="8q13-q22" /hgml_locus_uid="LP0089S" pre-msg < 1 > 238 CAIII mRNA and intron IVS < 1 20 CAIII intron A (no splice consensus) IVS 219 > 238 CAIII intron B BASE COUNT 62 A 56 C 55 G 65 T ORIGIN About 768 bp after segment 1; chromosome 8q13-q22. 1 TCTCCTCGAC TTATTTCCTA GTCCTGACCA CTGGCATGAA CTTTTCCCAA ATGCCAAGGG 61 GGAAAACCAG TCGCCCGTTG AGCTGCATAC TAAAGACATC AGGCATGACC CTTCTCTGCA 121 GCCATGGTCT GTGTCTTATG ATGGTGGCTC TGCCAAGACC ATCCTGAATA ATGGGAAGAC 181 CTGCCGAGTT GTATTTGATG ATACTTATGA TAGGTCAAGT AAGTATGACA ATGAGGTA // LOCUS HUMCAIII3 159 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human carbonic anhydrase III gene, exon 3. ACCESSION M29454 M27974 M14995 M22658 KEYWORDS carbonic anhydrase. SEGMENT 3 of 7 SOURCE Human leukocyte DNA, and adult skeletal and heart muscle, cDNA to mRNA, clone pHMCAIII. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 21 to 139) AUTHORS Wade,R., Gunning,P., Eddy,R., Shows,T. and Kedes,L. TITLE Nucleotide sequence, tissue-specific expression, and chromosome location of human carbonic anhydrase III: The human CAIII gene is located on the same chromosome as the closely linked CAI and CAII genes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 9571-9575 (1986) STANDARD simple staff_review REFERENCE 2 (bases 21 to 139) AUTHORS Lloyd,J., McMillan,S., Hopkinson,D. and Edwards,Y.H. TITLE Nucleotide sequence and derived amino acid sequence of a cDNA encoding human muscle carbonic anhydrase JOURNAL Gene 41, 233-239 (1986) STANDARD simple staff_entry REFERENCE 3 (bases 1 to 31; 125 to 159) AUTHORS Lloyd,J., Brownson,C., Tweedie,S., Charlton,J. and Edwards,Y.H. TITLE Human muscle carbonic anhydrase: Gene structure and DNA methylation patterns in fetal and adult tissues JOURNAL Genes Dev. 1, 594-602 (1987) STANDARD simple staff_entry FEATURES from to/span description pept + 21 + 139 carbonic anhydrase III, exon 3 (EC 4.2.1.1) /nomgen="CA3" /map="8q13-q22" /hgml_locus_uid="LP0089S" pre-msg < 1 > 159 CAIII mRNA and intron IVS < 1 20 CAIII intron B IVS 140 > 159 CAIII intron C BASE COUNT 36 A 41 C 44 G 38 T ORIGIN About 2.36 kb after segment 2; chromosome 8q13-q22. 1 CAAGCTTATC TGAATCACAG TGCTGAGAGG GGGTCCTCTC CCTGGACCCT ACCGACTTCG 61 CCAGTTTCAT CTTCACTGGG GCTCTTCGGA TGATCATGGC TCTGAGCACA CCGTGGATGG 121 AGTCAAGTAT GCAGCGGAGG TAAGAGGAAC TGCCATAAT // LOCUS HUMCAIII4 133 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human carbonic anhydrase III gene, exon 4. ACCESSION M29455 M27974 M14995 M22658 KEYWORDS carbonic anhydrase. SEGMENT 4 of 7 SOURCE Human leukocyte DNA, and adult skeletal and heart muscle, cDNA to mRNA, clone pHMCAIII. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 21 to 113) AUTHORS Wade,R., Gunning,P., Eddy,R., Shows,T. and Kedes,L. TITLE Nucleotide sequence, tissue-specific expression, and chromosome location of human carbonic anhydrase III: The human CAIII gene is located on the same chromosome as the closely linked CAI and CAII genes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 9571-9575 (1986) STANDARD simple staff_review REFERENCE 2 (bases 21 to 113) AUTHORS Lloyd,J., McMillan,S., Hopkinson,D. and Edwards,Y.H. TITLE Nucleotide sequence and derived amino acid sequence of a cDNA encoding human muscle carbonic anhydrase JOURNAL Gene 41, 233-239 (1986) STANDARD simple staff_entry REFERENCE 3 (bases 1 to 32; 99 to 133) AUTHORS Lloyd,J., Brownson,C., Tweedie,S., Charlton,J. and Edwards,Y.H. TITLE Human muscle carbonic anhydrase: Gene structure and DNA methylation patterns in fetal and adult tissues JOURNAL Genes Dev. 1, 594-602 (1987) STANDARD simple staff_entry FEATURES from to/span description pept + 21 + 113 carbonic anhydrase III, exon 4 (EC 4.2.1.1) /nomgen="CA3" /map="8q13-q22" /hgml_locus_uid="LP0089S" pre-msg < 1 > 133 CAIII mRNA and intron IVS < 1 20 CAIII intron C IVS 114 > 133 CAIII intron D BASE COUNT 36 A 24 C 30 G 43 T ORIGIN About 1.9 kb after segment 3; chromosome 8q13-q22. 1 TCTGTCTTCT ATGGTTTCAG CTTCATTTGG TTCACTGGAA CCCGAAGTAT AACACTTTTA 61 AAGAAGCCCT GAAGCAGCGC GATGGGATCG CTGTGATTGG CATTTTTCTG AAGGTAAAGT 121 AAAAATTGAC TAT // LOCUS HUMCAIII5 103 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human carbonic anhydrase III gene, exon 5. ACCESSION M29456 M27974 M14995 M22658 KEYWORDS carbonic anhydrase. SEGMENT 5 of 7 SOURCE Human leukocyte DNA, and adult skeletal and heart muscle, cDNA to mRNA, clone pHMCAIII. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 21 to 83) AUTHORS Wade,R., Gunning,P., Eddy,R., Shows,T. and Kedes,L. TITLE Nucleotide sequence, tissue-specific expression, and chromosome location of human carbonic anhydrase III: The human CAIII gene is located on the same chromosome as the closely linked CAI and CAII genes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 9571-9575 (1986) STANDARD simple staff_review REFERENCE 2 (bases 21 to 83) AUTHORS Lloyd,J., McMillan,S., Hopkinson,D. and Edwards,Y.H. TITLE Nucleotide sequence and derived amino acid sequence of a cDNA encoding human muscle carbonic anhydrase JOURNAL Gene 41, 233-239 (1986) STANDARD simple staff_entry REFERENCE 3 (bases 1 to 32; 69 to 103) AUTHORS Lloyd,J., Brownson,C., Tweedie,S., Charlton,J. and Edwards,Y.H. TITLE Human muscle carbonic anhydrase: Gene structure and DNA methylation patterns in fetal and adult tissues JOURNAL Genes Dev. 1, 594-602 (1987) STANDARD simple staff_entry FEATURES from to/span description pept + 21 + 83 carbonic anhydrase III, exon 5 (EC 4.2.1.1) /nomgen="CA3" /map="8q13-q22" /hgml_locus_uid="LP0089S" pre-msg < 1 > 103 CAIII mRNA and intron IVS < 1 20 CAIII intron D IVS 84 > 103 CAIII intron E BASE COUNT 34 A 16 C 21 G 32 T ORIGIN About 0.75 kb after segment 4; chromosome 8q13-q22. 1 GGATTTCTGT TTTCTTACAG ATAGGACATG AGAATGGCGA GTTCCAGATT TTCCTTGATG 61 CATTGGACAA GATTAAGACA AAGGTAAACA AAAATCATTT TCC // LOCUS HUMCAIII6 196 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human carbonic anhydrase III gene, exon 6. ACCESSION M29457 M27974 M14995 M22658 KEYWORDS carbonic anhydrase. SEGMENT 6 of 7 SOURCE Human leukocyte DNA, and adult skeletal and heart muscle, cDNA to mRNA, clone pHMCAIII. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 21 to 176) AUTHORS Wade,R., Gunning,P., Eddy,R., Shows,T. and Kedes,L. TITLE Nucleotide sequence, tissue-specific expression, and chromosome location of human carbonic anhydrase III: The human CAIII gene is located on the same chromosome as the closely linked CAI and CAII genes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 9571-9575 (1986) STANDARD simple staff_review REFERENCE 2 (bases 21 to 176) AUTHORS Lloyd,J., McMillan,S., Hopkinson,D. and Edwards,Y.H. TITLE Nucleotide sequence and derived amino acid sequence of a cDNA encoding human muscle carbonic anhydrase JOURNAL Gene 41, 233-239 (1986) STANDARD simple staff_entry REFERENCE 3 (bases 1 to 32; 162 to 196) AUTHORS Lloyd,J., Brownson,C., Tweedie,S., Charlton,J. and Edwards,Y.H. TITLE Human muscle carbonic anhydrase: Gene structure and DNA methylation patterns in fetal and adult tissues JOURNAL Genes Dev. 1, 594-602 (1987) STANDARD simple staff_entry FEATURES from to/span description pept + 21 + 176 carbonic anhydrase III, exon 6 (EC 4.2.1.1) /nomgen="CA3" /map="8q13-q22" /hgml_locus_uid="LP0089S" pre-msg < 1 > 196 CAIII mRNA and intron IVS < 1 20 CAIII intron E IVS 177 > 196 CAIII intron F BASE COUNT 39 A 63 C 55 G 39 T ORIGIN About 1.14 kb after segment 5; chromosome 8q13-q22. 1 TGCAACCTGC TCTTACCCAG GGCAAGGAGG CGCCCTTCAC AAAGTTTGAC CCATCCTGCC 61 TGTTCCCGGC ATGCCGGGAC TACTGGACCT ACCAGGGCTC ATTCACCACG CCGCCCTGCG 121 AGGAATGCAT TGTGTGGCTG CTGCTGAAGG AGCCCATGAC CGTGAGCTCT GACCAGGTGA 181 GCAGCCTTGT GAACAG // LOCUS HUMCAIII7 1031 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human carbonic anhydrase III gene, exon 7. ACCESSION M29458 M27974 M14995 M22658 KEYWORDS carbonic anhydrase. SEGMENT 7 of 7 SOURCE Human leukocyte DNA, and adult skeletal and heart muscle, cDNA to mRNA, clone pHMCAIII. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 78 to 140) AUTHORS Lloyd,J.C., Isenberg,H., Hopkinson,D.A. and Edwards,Y.H. TITLE Isolation of a cDNA clone for the human muscle specific carbonic anhydrase, CAIII JOURNAL Ann. Hum. Genet. 49, 241-251 (1985) STANDARD full staff_entry REFERENCE 2 (bases 21 to 1031) AUTHORS Wade,R., Gunning,P., Eddy,R., Shows,T. and Kedes,L. TITLE Nucleotide sequence, tissue-specific expression, and chromosome location of human carbonic anhydrase III: The human CAIII gene is located on the same chromosome as the closely linked CAI and CAII genes JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 9571-9575 (1986) STANDARD simple staff_review REFERENCE 3 (bases 21 to 1023) AUTHORS Lloyd,J., McMillan,S., Hopkinson,D. and Edwards,Y.H. TITLE Nucleotide sequence and derived amino acid sequence of a cDNA encoding human muscle carbonic anhydrase JOURNAL Gene 41, 233-239 (1986) STANDARD simple staff_entry REFERENCE 4 (bases 1 to 32) AUTHORS Lloyd,J., Brownson,C., Tweedie,S., Charlton,J. and Edwards,Y.H. TITLE Human muscle carbonic anhydrase: Gene structure and DNA methylation patterns in fetal and adult tissues JOURNAL Genes Dev. 1, 594-602 (1987) STANDARD simple staff_entry FEATURES from to/span description pept + 21 140 carbonic anhydrase III, exon 7 (EC 4.2.1.1) /nomgen="CA3" /map="8q13-q22" /hgml_locus_uid="LP0089S" pre-msg < 1 1023 CAIII mRNA and intron (alt.) pre-msg < 1 1031 CAIII mRNA and intron (alt.) IVS < 1 20 CAIII intron F BASE COUNT 329 A 183 C 190 G 329 T ORIGIN About 1.7 kb after segment 6; chromosome 8q13-q22. 1 GTCTGCCCCT GCCCTTGCAG ATGGCCAAGC TGCGGAGCCT CCTCTCCAGT GCTGAGAACG 61 AGCCCCCAGT GCCTCTTGTG AGCAACTGGC GACCTCCACA GCCTATCAAT AACAGGGTGG 121 TGAGAGCTTC CTTCAAATGA GGCTGCTGGA TCTTGCCCTC TTCAGGAAAG GAAACCTACC 181 ATTGGAGAGC TTGGTTCCTT GCCTCCTTCT GGTGCTCTTA CTCCAAGTCT ATTTCATTTT 241 TCCACACTGA GCAATGAATG TGAGAGATGT GGTCACCAAG ATCTAAGTTA CTTGTTGAAA 301 GAAAGTTACT TTCGACAAGA TCTAATATGA AAGCATAGAT TTCACATTTG ATCTCTGTAA 361 TAATCATCTT TCCTATAAAA GTAGCATTTT TGGTAAAGTT TCAAAGAAGA AGAAACAGAG 421 ATGGAAGAGT AAAGATATTT TTAAAATGGC TAGCTATTGG GCACCAGTTT TTCTGTTATC 481 TAAAATTTCA CACAACTTCA TGTTTTTATT TTTATATTAT GAGTTGTCCA TCTTAAAGAA 541 ATATGAGTAA TTCTACATGT AGTAGAGGTG TATGAAGATC ATATAACAAT TAAACATAAG 601 CCAGAAATTA AAATGACTAT AGACAGCAAG AATTGAGCTA ATAATATGTT TTAACTCTTA 661 ACACCAGCAA GAAGTCAGTC ATTTATTGAA GTTTTAGCTA CTAAGATTAC TTGGTTTTGA 721 TTACCAGTGA AAAGAAAACA CAATACAATC AGGAGTTTTC AAATTTTTGA TTCAGTATTT 781 GAATTTCTTC TTCATAAATG TAGTTGGAAT TTATCCTAGT ATTTTTCTTT ACCTGAAGGA 841 GGGCCATTTA TTTTTAATTT CACTACATTT TTCTTTGCAT GATTATTAAA ATAAAAACTG 901 CCTCTGTTGT GTTTCTCACT GGAGGCTGGA ATGAATGATC ACTAGAACAC AAAAGAGTGA 961 ATGATGACAC TTGAAGTCAA AGCAGTTGTA CTGATCACCA GAACCAATAA AGACATAAAT 1021 GGAAAACGTT G // LOCUS HUMCALCR1 261 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human calcitonin/calcitonin gene-related peptide gene, exon 1. ACCESSION M11303 KEYWORDS alternate splicing; calcitonin; calcitonin gene-related peptide; neuropeptide. SEGMENT 1 of 5 SOURCE Human DNA (library of T.Maniatis), clone lambda-hCal-1, and medullary thyroid carcinoma, cDNA to mRNA, clone pCal-H1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 261) AUTHORS Jonas,V., Lin,C.R., Kawashima,E., Semon,D., Swanson,L.W., Mermod,J.-J., Evans,R.M. and Rosenfeld,M.G. TITLE Alternative RNA processing events in human calcitonin/calcitonin gene-related peptide gene expression JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 1994-1998 (1985) STANDARD full staff_review FEATURES from to/span description pre-msg 133 > 261 CAL mRNA /nomgen="CALCA" /map="11p15.4" /hgml_locus_uid="LN0076N" pre-msg 133 > 261 CGRP mRNA IVS 242 > 261 CAL mRNA intron A IVS 242 > 261 CGRP mRNA intron A BASE COUNT 48 A 91 C 82 G 40 T ORIGIN BamHI site. 1 GATCCTCCAG GTTCTGGAAG CATGAGGGTG ACGCAACCCA GGGGCAAAGG ACCCTCCGCC 61 CATTGGTTGC TGTGCATGGC GGAACTTTCC GACCACAGCG GCGGGAATAA GAGCAGTCGC 121 TGGCGCTGGG AGGCATCAGA GACATGCCCA GCCCAAGTGT CGCCGCCGCT TCCACAGGGG 181 CTCTGGCTGG ACGCCGCCGC CGCCGCTGCC ACCGCCTCTG ATCCAAGCCA CCTCCCGCCA 241 GGTGAGCCCC GAGATTCTGG C // LOCUS HUMCALCR2 135 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human calcitonin/calcitonin gene-related peptide gene, exon 2. ACCESSION M12664 KEYWORDS alternate splicing; calcitonin; calcitonin gene-related peptide; neuropeptide. SEGMENT 2 of 5 SOURCE Human DNA (library of T.Maniatis), clone lambda-hCal-1, and medullary thyroid carcinoma, cDNA to mRNA, clones pCal-H and pCGRP-H. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 135) AUTHORS Jonas,V., Lin,C.R., Kawashima,E., Semon,D., Swanson,L.W., Mermod,J.-J., Evans,R.M. and Rosenfeld,M.G. TITLE Alternative RNA processing events in human calcitonin/calcitonin gene-related peptide gene expression JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 1994-1998 (1985) STANDARD full staff_review FEATURES from to/span description pept 30 + 115 preprocalcitonin, exon 2 (first expressed exon) /nomgen="CALCA" /map="11p15.4" /hgml_locus_uid="LN0076N" pep$ 30 + 115 preprocalcitonin gene-related protein, exon 2 (first expressed exon) IVS < 1 20 CAL mRNA intron A IVS < 1 20 CGRP mRNA intron A IVS 116 > 135 CAL intron B IVS 116 > 135 CGRP intron B BASE COUNT 27 A 43 C 31 G 34 T ORIGIN About 0.87 kb after segment 1; chromosome 11p15.4. 1 ATCTCATTCT TCCCTTGCAG AGAGGTGTCA TGGGCTTCCA AAAGTTCTCC CCCTTCCTGG 61 CTCTCAGCAT CTTGGTCCTG TTGCAGGCAG GCAGCCTCCA TGCAGCACCA TTCAGGTAAG 121 ACAGCCTGAA GCCAG // LOCUS HUMCALCR3 181 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human calcitonin/calcitonin gene-related peptide gene, exon 3. ACCESSION M12665 KEYWORDS alternate splicing; calcitonin; calcitonin gene-related peptide; neuropeptide. SEGMENT 3 of 5 SOURCE Human DNA (library of T.Maniatis), clone lambda-hCal-1, and medullary thyroid carcinoma, cDNA to mRNA, clones pCal-H and pCGRP-H. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 181) AUTHORS Jonas,V., Lin,C.R., Kawashima,E., Semon,D., Swanson,L.W., Mermod,J.-J., Evans,R.M. and Rosenfeld,M.G. TITLE Alternative RNA processing events in human calcitonin/calcitonin gene-related peptide gene expression JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 1994-1998 (1985) STANDARD full staff_review FEATURES from to/span description pept + 21 + 161 preprocalcitonin, exon 3 /nomgen="CALCA" /map="11q15.4" /hgml_locus_uid="LN0076N" pep$ + 21 + 161 preprocalcitonin gene-related protein, exon 3 IVS < 1 20 CAL intron B IVS < 1 20 CGRP intron B IVS 162 > 181 CAL intron C IVS 162 > 181 CGRP intron C BASE COUNT 38 A 55 C 61 G 27 T ORIGIN About 0.65 kb after segment 2; chromosome 11q15.4. 1 AGTTTGCTTC CCCTCCACAG GTCTGCCCTG GAGAGCAGCC CAGCAGACCC GGCCACGCTC 61 AGTGAGGACG AAGCGCGCCT CCTGCTGGCT GCACTGGTGC AGGACTATGT GCAGATGAAG 121 GCCAGTGAGC TGGAGCAGGA GCAAGAGAGA GAGGGCTCCA GGTGAGGCTC CCCAAGCGCT 181 C // LOCUS HUMCALCR4 536 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human calcitonin/calcitonin gene-related peptide gene, exon 4. ACCESSION M12666 KEYWORDS alternate splicing; calcitonin; neuropeptide. SEGMENT 4 of 5 SOURCE Human DNA (library of T.Maniatis), clone lambda-hCal-1, and medullary thyroid carcinoma, cDNA to mRNA, clone pCal-H. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 536) AUTHORS Jonas,V., Lin,C.R., Kawashima,E., Semon,D., Swanson,L.W., Mermod,J.-J., Evans,R.M. and Rosenfeld,M.G. TITLE Alternative RNA processing events in human calcitonin/calcitonin gene-related peptide gene expression JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 1994-1998 (1985) STANDARD full staff_review FEATURES from to/span description pept + 21 219 preprocalcitonin, exon 4 /nomgen="CALCA" /map="11q15.4" /hgml_locus_uid="LN0076N" matp 46 138 calcitonin pre-msg < 1 522 CAL mRNA IVS < 1 > 536 CGRP intron C IVS < 1 20 CAL intron C BASE COUNT 129 A 136 C 128 G 143 T ORIGIN About 1.04 kb after segment 3; chromosome 11q15.4. 1 GGTATGTGTT TTCCCTCCAG CCTGGACAGC CCCAGATCTA AGCGGTGCGG TAATCTGAGT 61 ACTTGCATCC TGGGCACATA CACGCAGGAC TTCAACAAGT TTCACACGTT CCCCCAAACT 121 GCAATTGGGG TTGGAGCACC TGGAAAGAAA AGGGATATGT CCAGCGACTT GGAGAGAGAC 181 CATCGCCCTC ATGTTAGCAT GCCCCAGAAT GCCAACTAAA CTCCTCCCTT TCCTTCCTAA 241 TTTCCCTTCT TGCATCCTTC CTATAACTTG ATGCATGTGG TTTGGTTCCT CTCTGGTGGC 301 TCTTTGGGCT GGTATTGGTG GCTTTCCTTG TGGCAGAGGA TGTCTCAAAC TTCAAGATGG 361 GAGGAAAGAG AGCAGGACTC ACAGGTTGGA AGAGAATCAC CTGGGAAAAT ACCAGAAAAT 421 GAGGGCCGCT TTGAGTCCCC CAGAGATGTC ATCAGCGCTC CTCTGTCCTG CTTCTGAATG 481 TGCTGATCAT TTGAGGAATA AAATTATTTT TCCCCAAAGA TCCTCTAGAG TCGACC // LOCUS HUMCALCR5 221 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human calcitonin/calcitonin gene-related peptide gene, exon 5. ACCESSION M12667 KEYWORDS alternate splicing; calcitonin gene-related peptide. SEGMENT 5 of 5 SOURCE Human DNA (library of T.Maniatis), clone lambda-hCal-1, and medullary thyroid carcinoma, cDNA to mRNA, clone pCGRP-H. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 221) AUTHORS Jonas,V., Lin,C.R., Kawashima,E., Semon,D., Swanson,L.W., Mermod,J.-J., Evans,R.M. and Rosenfeld,M.G. TITLE Alternative RNA processing events in human calcitonin/calcitonin gene-related peptide gene expression JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 1994-1998 (1985) STANDARD full staff_review FEATURES from to/span description pep$ + 21 180 preprocalcitonin gene-related peptide, exon 4 /nomgen="CALCA" /map="11p15.4" /hgml_locus_uid="LN0076N" matp 40 150 calcitonin gene-related peptide IVS < 1 20 CGRP intron C IVS 202 > 221 CGRP mRNA intron D BASE COUNT 52 A 57 C 63 G 49 T ORIGIN About 0.59 kb after segment 4. 1 TCTCCATCCT TGCAAATCAG AATCATTGCC CAGAAGAGAG CCTGTGACAC TGCCACCTGT 61 GTGACTCATC GGCTGGCAGG CTTGCTGAGC AGATCAGGGG GTGTGGTGAA GAACAACTTT 121 GTGCCCACCA ATGTGGGTTC CAAAGCCTTT GGCAGGCGCC GCAGGGACCT TCAAGCCTGA 181 GCAGCTGAAT GACTCAAGAA GGTGACTGCC CTTGTATGAT G // LOCUS HUMCALLA01 92 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human CALLA/NEP gene encoding neutral endopeptidase, exon 1. ACCESSION M26605 KEYWORDS common acute lymphoblastic leukemia antigen; integral membrane protein; neutral endopeptidase. SEGMENT 1 of 24 SOURCE Human cell line NALM 6 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 92) AUTHORS D'Adamio,L., Shipp,M.A., Masteller,E.L. and Reinherz,E.L. TITLE Organization of gene encoding common acute lymphoblastic leukemia antigen (neutral endopeptidase 24.11): Multiple miniexons and three separate 5' untranslated regions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 7103-7107 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by L.D'Adamio 01-AUG-1989. FEATURES from to/span description pre-msg < 1 > 92 neutral endopeptidase mRNA and introns /nomgen="MME" /map="3q21-q27" /hgml_locus_uid="LG0222G" IVS 88 > 92 CALLA/NEP Intron A BASE COUNT 18 A 23 C 38 G 13 T ORIGIN 1 GCGGAGATGT GCAAGTGGCG AAGCTTGACC GAGAGCAGGC TGGAGCAGCC GCCCAACTCC 61 TGGCGCGGGA TCTGCTGAGG GGTCACGGTG AG // LOCUS HUMCALLA02 222 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human CALLA/NEP gene encoding neutral endopeptidase, exons 2A and 2B. ACCESSION M26606 KEYWORDS common acute lymphoblastic leukemia antigen; integral membrane protein; neutral endopeptidase. SEGMENT 2 of 24 SOURCE Human cell line NALM 6 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 222) AUTHORS D'Adamio,L., Shipp,M.A., Masteller,E.L. and Reinherz,E.L. TITLE Organization of gene encoding common acute lymphoblastic leukemia antigen (neutral endopeptidase 24.11): Multiple miniexons and three separate 5' untranslated regions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 7103-7107 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by L.D'Adamio 01-AUG-1989. FEATURES from to/span description pre-msg < 1 > 222 neutral endopeptidase mRNA and introns /nomgen="MME" /map="3q21-q27" /hgml_locus_uid="LG0222G" IVS 31 > 222 CALLA/NEP Intron BA IVS 218 > 222 CALLA/NEP Intron BB BASE COUNT 36 A 47 C 59 G 80 T ORIGIN Unknown number of bp after segment 1. 1 CCTGGAGGAG GGCTCTGGAA GTCACGTCAG GTTGGCTCTT CAGGTTCATT TCCATAGTTC 61 CCTGCGGCCT CTGCCTTGGG GAGTTATGTT TTGTTACCGA GATCCGCGCT ACCAGATTGC 121 ACCGGGGCTG ATTTGGGGGC TGGGAATTTG CCATTCTGCT GTACAGACAC TGATTTTTTT 181 TTCTTCTTTT TAAAAAGCAA GGTTTGTTTT CATTTTGGTT TC // LOCUS HUMCALLA03 182 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human CALLA/NEP gene encoding neutral endopeptidase, exon 3. ACCESSION M26607 KEYWORDS common acute lymphoblastic leukemia antigen; integral membrane protein; neutral endopeptidase. SEGMENT 3 of 24 SOURCE Human cell line NALM 6 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 182) AUTHORS D'Adamio,L., Shipp,M.A., Masteller,E.L. and Reinherz,E.L. TITLE Organization of gene encoding common acute lymphoblastic leukemia antigen (neutral endopeptidase 24.11): Multiple miniexons and three separate 5' untranslated regions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 7103-7107 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by L.D'Adamio 01-AUG-1989. FEATURES from to/span description pept 18 + 177 common acute lymphoblastic antigen, exon 3 (first expressed exon) (EC 3.4.24.11) /nomgen="MME" /map="3q21-q27" /hgml_locus_uid="LG0222G" pre-msg < 1 > 182 neutral endopeptidase mRNA and introns IVS < 1 7 CALLA/NEP Intron B IVS 178 > 182 CALLA/NEP Intron C BASE COUNT 54 A 44 C 41 G 43 T ORIGIN Unknown number of bp after segment 2. 1 TTTGCAGATT TTAGGTGATG GGCAAGTCAG AAAGTCAGAT GGATATAACT GATATCAACA 61 CTCCAAAGCC AAAGAAGAAA CAGCGATGGA CTCCACTGGA GATCAGCCTC TCGGTCCTTG 121 TCCTGCTCCT CACCATCATA GCTGTGACAA TGATCGCACT CTATGCAACC TACGATGGTG 181 AG // LOCUS HUMCALLA04 48 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human CALLA/NEP gene encoding neutral endopeptidase, exon 4. ACCESSION M26608 KEYWORDS common acute lymphoblastic leukemia antigen; integral membrane protein; neutral endopeptidase. SEGMENT 4 of 24 SOURCE Human cell line NALM 6 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 48) AUTHORS D'Adamio,L., Shipp,M.A., Masteller,E.L. and Reinherz,E.L. TITLE Organization of gene encoding common acute lymphoblastic leukemia antigen (neutral endopeptidase 24.11): Multiple miniexons and three separate 5' untranslated regions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 7103-7107 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by L.D'Adamio 01-AUG-1989. FEATURES from to/span description pept + 8 + 43 common acute lymphoblastic antigen, exon 4 (EC 3.4.24.11) /nomgen="MME" /map="3q21-q27" /hgml_locus_uid="LG0222G" pre-msg < 1 > 48 neutral endopeptidase mRNA and introns IVS < 1 7 CALLA/NEP Intron C IVS 44 > 48 CALLA/NEP Intron D BASE COUNT 16 A 7 C 10 G 15 T ORIGIN Unknown number of bp after segment 3. 1 TTTCTAGATG GTATTTGCAA GTCATCAGAC TGCATAAAAT CAGGTAAG // LOCUS HUMCALLA05 174 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human CALLA/NEP gene encoding neutral endopeptidase, exon 5. ACCESSION M26609 KEYWORDS common acute lymphoblastic leukemia antigen; integral membrane protein; neutral endopeptidase. SEGMENT 5 of 24 SOURCE Human cell line NALM 6 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 174) AUTHORS D'Adamio,L., Shipp,M.A., Masteller,E.L. and Reinherz,E.L. TITLE Organization of gene encoding common acute lymphoblastic leukemia antigen (neutral endopeptidase 24.11): Multiple miniexons and three separate 5' untranslated regions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 7103-7107 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by L.D'Adamio 01-AUG-1989. FEATURES from to/span description pept + 8 + 169 common acute lymphoblastic antigen, exon 5 (EC 3.4.24.11) /nomgen="MME" /map="3q21-q27" /hgml_locus_uid="LCT0104T" pre-msg < 1 > 174 neutral endopeptidase mRNA and introns IVS < 1 7 CALLA/NEP Intron D IVS 170 174 CALLA/NEP Intron E BASE COUNT 47 A 39 C 40 G 48 T ORIGIN Unknown number of bp after segment 4. 1 CTTGCAGCTG CTCGACTGAT CCAAAACATG GATGCCACCA CTGAGCCTTG TACAGACTTT 61 TTCAAATATG CTTGCGGAGG CTGGTTGAAA CGTAATGTCA TTCCCGAGAC CAGCTCCCGT 121 TACGGCAACT TTGACATTTT AAGAGATGAA CTAGAAGTCG TTTTGAAAGG TTAG // LOCUS HUMCALLA06 93 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human CALLA/NEP gene encoding neutral endopeptidase, exon 6. ACCESSION M26610 KEYWORDS common acute lymphoblastic leukemia antigen; integral membrane protein; neutral endopeptidase. SEGMENT 6 of 24 SOURCE Human cell line NALM 6 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 93) AUTHORS D'Adamio,L., Shipp,M.A., Masteller,E.L. and Reinherz,E.L. TITLE Organization of gene encoding common acute lymphoblastic leukemia antigen (neutral endopeptidase 24.11): Multiple miniexons and three separate 5' untranslated regions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 7103-7107 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by L.D'Adamio 01-AUG-1989. FEATURES from to/span description pept + 8 + 88 common acute lymphoblastic antigen, exon 6 (EC 3.4.24.11) /nomgen="MME" /map="3q21-q27" /hgml_locus_uid="LG0222G" pre-msg < 1 > 93 neutral endopeptidase mRNA and introns IVS < 1 7 CALLA/NEP Intron E IVS 89 > 93 CALLA/NEP Intron F BASE COUNT 37 A 14 C 19 G 23 T ORIGIN Unknown number of bp after segment 4. 1 ATTTCAGATG TCCTTCAAGA ACCCAAAACT GAAGATATAG TAGCAGTGCA GAAAGCAAAA 61 GCATTGTACA GGTCTTGTAT AAATGAATGT AAG // LOCUS HUMCALLA07 108 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human CALLA/NEP gene encoding neutral endopeptidase, exon 7. ACCESSION M26611 KEYWORDS common acute lymphoblastic leukemia antigen; integral membrane protein; neutral endopeptidase. SEGMENT 7 of 24 SOURCE Human cell line NALM 6 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 108) AUTHORS D'Adamio,L., Shipp,M.A., Masteller,E.L. and Reinherz,E.L. TITLE Organization of gene encoding common acute lymphoblastic leukemia antigen (neutral endopeptidase 24.11): Multiple miniexons and three separate 5' untranslated regions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 7103-7107 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by L.D'Adamio 01-AUG-1989. FEATURES from to/span description pept + 8 + 103 common acute lymphoblastic antigen, exon 7 (EC 3.4.24.11) /nomgen="MME" /map="3q21-q27" /hgml_locus_uid="LG0222G" pre-msg < 1 > 108 neutral endopeptidase mRNA and introns IVS < 1 7 CALLA/NEP Intron F IVS 104 > 108 CALLA/NEP Intron G BASE COUNT 40 A 20 C 27 G 21 T ORIGIN Unknown number of bp after segment 6. 1 CCAAAAGCTG CTATTGATAG CAGAGGTGGA GAACCTCTAC TCAAACTGTT ACCAGACATA 61 TATGGGTGGC CAGTAGCAAC AGAAAACTGG GAGCAAAAAT ATGGTAAG // LOCUS HUMCALLA08 131 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human CALLA/NEP gene encoding neutral endopeptidase, exon 8. ACCESSION M26612 KEYWORDS common acute lymphoblastic leukemia antigen; integral membrane protein; neutral endopeptidase. SEGMENT 8 of 24 SOURCE Human cell line NALM 6 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 131) AUTHORS D'Adamio,L., Shipp,M.A., Masteller,E.L. and Reinherz,E.L. TITLE Organization of gene encoding common acute lymphoblastic leukemia antigen (neutral endopeptidase 24.11): Multiple miniexons and three separate 5' untranslated regions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 7103-7107 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by L.D'Adamio 01-AUG-1989. FEATURES from to/span description pept + 8 + 126 common acute lymphoblastic antigen, exon 8 (EC 3.4.24.11) /nomgen="MME" /map="3q21-q27" /hgml_locus_uid="LG0222G" pre-msg < 1 > 131 neutral endopeptidase mRNA and introns IVS < 1 7 CALLA/NEP Intron G IVS 127 > 131 CALLA/NEP Intron H BASE COUNT 42 A 17 C 27 G 45 T ORIGIN Unknown number of bp after segment 7. 1 GCTTTAGGTG CTTCTTGGAC AGCTGAAAAA GCTATTGCAC AACTGAATTC TAAATATGGG 61 AAAAAAGTCC TTATTAATTT GTTTGTTGGC ACTGATGATA AGAATTCTGT GAATCATGTA 121 ATTCATGTAA G // LOCUS HUMCALLA09 78 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human CALLA/NEP gene encoding neutral endopeptidase, exon 9. ACCESSION M26613 KEYWORDS common acute lymphoblastic leukemia antigen; integral membrane protein; neutral endopeptidase. SEGMENT 9 of 24 SOURCE Human cell line NALM 6 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 78) AUTHORS D'Adamio,L., Shipp,M.A., Masteller,E.L. and Reinherz,E.L. TITLE Organization of gene encoding common acute lymphoblastic leukemia antigen (neutral endopeptidase 24.11): Multiple miniexons and three separate 5' untranslated regions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 7103-7107 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by L.D'Adamio 01-AUG-1989. FEATURES from to/span description pept + 8 + 73 common acute lymphoblastic antigen, exon 9 (EC 3.4.24.11) /nomgen="MME" /map="3q21-q27" /hgml_locus_uid="LG0222G" pre-msg < 1 > 78 neutral endopeptidase mRNA and introns IVS < 1 7 CALLA/NEP Intron H IVS 74 > 78 CALLA/NEP Intron I BASE COUNT 25 A 16 C 14 G 23 T ORIGIN Unknown number of bp after segment 8. 1 TTTATAGATT GACCAACCTC GACTTGGCCT CCCTTCTAGA GATTACTATG AATGCACTGG 61 AATCTATAAA GAGGTAAA // LOCUS HUMCALLA10 147 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human CALLA/NEP gene encoding neutral endopeptidase, exon 10. ACCESSION M26614 KEYWORDS common acute lymphoblastic leukemia antigen; integral membrane protein; neutral endopeptidase. SEGMENT 10 of 24 SOURCE Human cell line NALM 6 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 147) AUTHORS D'Adamio,L., Shipp,M.A., Masteller,E.L. and Reinherz,E.L. TITLE Organization of gene encoding common acute lymphoblastic leukemia antigen (neutral endopeptidase 24.11): Multiple miniexons and three separate 5' untranslated regions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 7103-7107 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by L.D'Adamio 01-AUG-1989. FEATURES from to/span description pept + 8 + 142 common acute lymphoblastic antigen, exon 10 (EC 3.4.24.11) /nomgen="MME" /map="3q21-q27" /hgml_locus_uid="LG0222G" pre-msg < 1 > 142 neutral endopeptidase mRNA and introns IVS < 1 7 CALLA/NEP Intron I IVS 143 > 147 CALLA/NEP Intron J BASE COUNT 49 A 20 C 35 G 43 T ORIGIN Unknown number of bp after segment 9. 1 CTTGCAGGCT TGTACAGCAT ATGTGGATTT TATGATTTCT GTGGCCAGAT TGATTCGTCA 61 GGAAGAAAGA TTGCCCATCG ATGAAAACCA GCTTGCTTTG GAAATGAATA AAGTTATGGA 121 ATTGGAAAAA GAAATTGCCA ATGTAAA // LOCUS HUMCALLA11 114 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human CALLA/NEP gene encoding neutral endopeptidase, exon 11. ACCESSION M26615 KEYWORDS common acute lymphoblastic leukemia antigen; integral membrane protein; neutral endopeptidase. SEGMENT 11 of 24 SOURCE Human cell line NALM 6 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 114) AUTHORS D'Adamio,L., Shipp,M.A., Masteller,E.L. and Reinherz,E.L. TITLE Organization of gene encoding common acute lymphoblastic leukemia antigen (neutral endopeptidase 24.11): Multiple miniexons and three separate 5' untranslated regions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 7103-7107 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by L.D'Adamio 01-AUG-1989. FEATURES from to/span description pept + 8 + 109 common acute lymphoblastic antigen, exon 11 (EC 3.4.24.11) /nomgen="MME" /map="3q21-q27" /hgml_locus_uid="LG0222G" pre-msg < 1 > 114 neutral endopeptidase mRNA and introns IVS < 1 7 CALLA/NEP Intron J IVS 110 > 114 CALLA/NEP Intron K BASE COUNT 41 A 23 C 23 G 27 T ORIGIN Unknown number of bp after segment 10. 1 TCCATAGGCT ACGGCTAAAC CTGAAGATCG AAATGATCCA ATGCTTCTGT ATAACAAGAT 61 GACATTGGCC CAGATCCAAA ATAACTTTTC ACTAGAGATC AATGGGAAGG TAAG // LOCUS HUMCALLA12 149 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human CALLA/NEP gene encoding neutral endopeptidase, exon 12. ACCESSION M26616 KEYWORDS common acute lymphoblastic leukemia antigen; integral membrane protein; neutral endopeptidase. SEGMENT 12 of 24 SOURCE Human cell line NALM 6 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 149) AUTHORS D'Adamio,L., Shipp,M.A., Masteller,E.L. and Reinherz,E.L. TITLE Organization of gene encoding common acute lymphoblastic leukemia antigen (neutral endopeptidase 24.11): Multiple miniexons and three separate 5' untranslated regions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 7103-7107 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by L.D'Adamio 01-AUG-1989. FEATURES from to/span description pept + 8 + 144 common acute lymphoblastic antigen, exon 12 (EC 3.4.24.11) /nomgen="MME" /map="3q21-q27" /hgml_locus_uid="LG0222G" pre-msg < 1 > 149 neutral endopeptidase mRNA and introns IVS < 1 7 CALLA/NEP Intron K IVS 145 149 CALLA/NEP Intron L BASE COUNT 48 A 27 C 26 G 48 T ORIGIN Unknown number of bp after segment 11. 1 TTTCCAGCCA TTCAGCTGGT TGAATTTCAC AAATGAAATC ATGTCAACTG TGAATATTAG 61 TATTACAAAT GAGGAAGATG TGGTTGTTTA TGCTCCAGAA TATTTAACCA AACTTAAGCC 121 CATTCTTACC AAATATTCTG CCAGGTAGG // LOCUS HUMCALLA13 106 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human CALLA/NEP gene encoding neutral endopeptidase, exon 13. ACCESSION M26617 KEYWORDS common acute lymphoblastic leukemia antigen; integral membrane protein; neutral endopeptidase. SEGMENT 13 of 24 SOURCE Human cell line NALM 6 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 106) AUTHORS D'Adamio,L., Shipp,M.A., Masteller,E.L. and Reinherz,E.L. TITLE Organization of gene encoding common acute lymphoblastic leukemia antigen (neutral endopeptidase 24.11): Multiple miniexons and three separate 5' untranslated regions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 7103-7107 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by L.D'Adamio 01-AUG-1989. FEATURES from to/span description pept + 8 + 101 common acute lymphoblastic antigen, exon 13 (EC 3.4.24.11) /nomgen="MME" /map="3q21-q27" /hgml_locus_uid="LG0222G" pre-msg < 1 > 106 neutral endopeptidase mRNA and introns IVS < 1 7 CALLA/NEP Intron L IVS 102 > 106 CALLA/NEP Intron M BASE COUNT 35 A 22 C 22 G 27 T ORIGIN Unknown number of bp after segment 12. 1 TATACAGAGA TCTTCAAAAT TTAATGTCCT GGAGATTCAT AATGGATCTT GTAAGCAGCC 61 TCAGCCGAAC CTACAAGGAG TCCAGAAATG CTTTCCGCAA GGTGAA // LOCUS HUMCALLA14 141 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human CALLA/NEP gene encoding neutral endopeptidase, exon 14. ACCESSION M26618 KEYWORDS common acute lymphoblastic leukemia antigen; integral membrane protein; neutral endopeptidase. SEGMENT 14 of 24 SOURCE Human cell line NALM 6 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 141) AUTHORS D'Adamio,L., Shipp,M.A., Masteller,E.L. and Reinherz,E.L. TITLE Organization of gene encoding common acute lymphoblastic leukemia antigen (neutral endopeptidase 24.11): Multiple miniexons and three separate 5' untranslated regions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 7103-7107 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by L.D'Adamio 01-AUG-1989. FEATURES from to/span description pept + 8 + 136 common acute lymphoblastic antigen, exon 14 (EC 3.4.24.11) /nomgen="MME" /map="3q21-q27" /hgml_locus_uid="LG0222G" pre-msg < 1 > 141 neutral endopeptidase mRNA and introns IVS < 1 7 CALLA/NEP Intron M IVS 137 > 141 CALLA/NEP Intron N BASE COUNT 42 A 22 C 41 G 36 T ORIGIN Unknown number of bp after segment 13. 1 TCCGTAGGCC CTTTATGGTA CAACCTCAGA AACAGCAACT TGGAGACGTT GTGCAAACTA 61 TGTCAATGGG AATATGGAAA ATGCTGTGGG GAGGCTTTAT GTGGAAGCAG CATTTGCTGG 121 AGAGAGTAAA CATGTGGTAA T // LOCUS HUMCALLA15 111 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human CALLA/NEP gene encoding neutral endopeptidase, exon 15. ACCESSION M26619 KEYWORDS common acute lymphoblastic leukemia antigen; integral membrane protein; neutral endopeptidase. SEGMENT 15 of 24 SOURCE Human cell line NALM 6 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 111) AUTHORS D'Adamio,L., Shipp,M.A., Masteller,E.L. and Reinherz,E.L. TITLE Organization of gene encoding common acute lymphoblastic leukemia antigen (neutral endopeptidase 24.11): Multiple miniexons and three separate 5' untranslated regions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 7103-7107 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by L.D'Adamio 01-AUG-1989. FEATURES from to/span description pept + 8 + 106 common acute lymphoblastic antigen, exon 15 (EC 3.4.24.11) /nomgen="MME" /map="3q21-q27" /hgml_locus_uid="LG0222G" pre-msg < 1 > 111 neutral endopeptidase mRNA and introns IVS < 1 7 CALLA/NEP Intron N IVS 107 > 111 CALLA/NEP Intron O BASE COUNT 39 A 15 C 29 G 28 T ORIGIN Unknown number of bp after segment 14. 1 ATTTGAGGTC GAGGATTTGA TTGCACAGAT CCGAGAAGTT TTTATTCAGA CTTTAGATGA 61 CCTCACTTGG ATGGATGCCG AGACAAAAAA GAGAGCTGAA GAAAAGGTAA A // LOCUS HUMCALLA16 93 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human CALLA/NEP gene encoding neutral endopeptidase, exon 16. ACCESSION M26620 KEYWORDS common acute lymphoblastic leukemia antigen; integral membrane protein; neutral endopeptidase. SEGMENT 16 of 24 SOURCE Human cell line NALM 6 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 93) AUTHORS D'Adamio,L., Shipp,M.A., Masteller,E.L. and Reinherz,E.L. TITLE Organization of gene encoding common acute lymphoblastic leukemia antigen (neutral endopeptidase 24.11): Multiple miniexons and three separate 5' untranslated regions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 7103-7107 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by L.D'Adamio 01-AUG-1989. FEATURES from to/span description pept + 8 + 88 common acute lymphoblastic antigen, exon 16 (EC 3.4.24.11) /nomgen="MME" /map="3q21-q27" /hgml_locus_uid="LG0222G" pre-msg < 1 > 93 neutral endopeptidase mRNA and introns IVS < 1 7 CALLA/NEP Intron O IVS 89 > 93 CALLA/NEP Intron P BASE COUNT 34 A 15 C 19 G 25 T ORIGIN Unknown number of bp after segment 15. 1 TTCATAGGCC TTAGCAATTA AAGAAAGGAT CGGCTATCCT GATGACATTG TTTCAAATGA 61 TAACAAACTG AATAATGAGT ACCTCGAGGT AAG // LOCUS HUMCALLA17 116 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human CALLA/NEP gene encoding neutral endopeptidase, exon 17. ACCESSION M26621 KEYWORDS common acute lymphoblastic leukemia antigen; integral membrane protein; neutral endopeptidase. SEGMENT 17 of 24 SOURCE Human cell line NALM 6 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 116) AUTHORS D'Adamio,L., Shipp,M.A., Masteller,E.L. and Reinherz,E.L. TITLE Organization of gene encoding common acute lymphoblastic leukemia antigen (neutral endopeptidase 24.11): Multiple miniexons and three separate 5' untranslated regions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 7103-7107 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by L.D'Adamio 01-AUG-1989. FEATURES from to/span description pept + 8 + 111 common acute lymphoblastic antigen, exon 17 (EC 3.4.24.11) /nomgen="MME" /map="3q21-q27" /hgml_locus_uid="LG0222G" pre-msg < 1 > 116 neutral endopeptidase mRNA and introns IVS < 1 7 CALLA/NEP Intron P IVS 112 > 116 CALLA/NEP Intron Q BASE COUNT 53 A 17 C 24 G 22 T ORIGIN Unknown number of bp after segment 16. 1 AATACAGTTG AACTACAAAG AAGATGAATA CTTCGAGAAC ATAATTCAAA ATTTGAAATT 61 CAGCCAAAGT AAACAACTGA AGAAGCTCCG AGAAAAGGTG GACAAAGATG AGTGCG // LOCUS HUMCALLA18 71 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human CALLA/NEP gene encoding neutral endopeptidase, exon 18. ACCESSION M26622 KEYWORDS common acute lymphoblastic leukemia antigen; integral membrane protein; neutral endopeptidase. SEGMENT 18 of 24 SOURCE Human cell line NALM 6 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 71) AUTHORS D'Adamio,L., Shipp,M.A., Masteller,E.L. and Reinherz,E.L. TITLE Organization of gene encoding common acute lymphoblastic leukemia antigen (neutral endopeptidase 24.11): Multiple miniexons and three separate 5' untranslated regions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 7103-7107 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by L.D'Adamio 01-AUG-1989. FEATURES from to/span description pept + 8 + 66 common acute lymphoblastic antigen, exon 18 (EC 3.4.24.11) /nomgen="MME" /map="3q21-q27" /hgml_locus_uid="LG0222G" pre-msg < 1 > 71 neutral endopeptidase mRNA and introns IVS < 1 7 CALLA/NEP Intron Q IVS 67 > 71 CALLA/NEP Intron R BASE COUNT 23 A 10 C 19 G 19 T ORIGIN Unknown number of bp after segment 17. 1 TCTACAGGTG GATAAGTGGA GCAGCTGTAG TCAATGCATT TTACTCTTCA GGAAGAAATC 61 AGATAGGTAA G // LOCUS HUMCALLA19 132 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human CALLA/NEP gene encoding neutral endopeptidase, exon 19. ACCESSION M26623 KEYWORDS common acute lymphoblastic leukemia antigen; integral membrane protein; neutral endopeptidase. SEGMENT 19 of 24 SOURCE Human cell line NALM 6 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 132) AUTHORS D'Adamio,L., Shipp,M.A., Masteller,E.L. and Reinherz,E.L. TITLE Organization of gene encoding common acute lymphoblastic leukemia antigen (neutral endopeptidase 24.11): Multiple miniexons and three separate 5' untranslated regions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 7103-7107 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by L.D'Adamio 01-AUG-1989. FEATURES from to/span description pept + 8 + 127 common acute lymphoblastic antigen, exon 19 (EC 3.4.24.11) /nomgen="MME" /map="3q21-q27" /hgml_locus_uid="LG0222G" pre-msg < 1 > 132 neutral endopeptidase mRNA and introns IVS < 1 7 CALLA/NEP Intron R 128 > 132 CALLA/NEP Intron S BASE COUNT 31 A 39 C 30 G 32 T ORIGIN Unknown number of bp after segment 18. 1 CTTGTAGTCT TCCCAGCCGG CATTCTGCAG CCCCCCTTCT TTAGTGCCCA GCAGTCCAAC 61 TCATTGAACT ATGGGGGCAT CGGCATGGTC ATAGGACACG AAATCACCCA TGGCTTCGAT 121 GACAATGGTA AA // LOCUS HUMCALLA20 146 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human CALLA/NEP gene encoding neutral endopeptidase, exon 20. ACCESSION M26624 KEYWORDS common acute lymphoblastic leukemia antigen; integral membrane protein; neutral endopeptidase. SEGMENT 20 of 24 SOURCE Human cell line NALM 6 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 146) AUTHORS D'Adamio,L., Shipp,M.A., Masteller,E.L. and Reinherz,E.L. TITLE Organization of gene encoding common acute lymphoblastic leukemia antigen (neutral endopeptidase 24.11): Multiple miniexons and three separate 5' untranslated regions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 7103-7107 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by L.D'Adamio 01-AUG-1989. FEATURES from to/span description pept + 8 + 141 common acute lymphoblastic antigen, exon 20 (EC 3.4.24.11) /nomgen="MME" /map="3q21-q27" /hgml_locus_uid="LG0222G" pre-msg < 1 > 146 neutral endopeptidase mRNA and introns IVS < 1 7 CALLA/NEP Intron S IVS 142 > 146 CALLA/NEP Intron T BASE COUNT 43 A 28 C 39 G 36 T ORIGIN Unknown number of bp after segment 19. 1 TTATAAGGCA GAAACTTTAA CAAAGATGGA GACCTCGTTG ACTGGTGGAC TCAACAGTCT 61 GCAAGTAACT TTAAGGAGCA ATCCCAGTGC ATGGTGTATC AGTATGGAAA CTTTTCCTGG 121 GACCTGGCAG GTGGACAGCA CGTATG // LOCUS HUMCALLA21 78 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human CALLA/NEP gene encoding neutral endopeptidase, exon 21. ACCESSION M26625 KEYWORDS common acute lymphoblastic leukemia antigen; integral membrane protein; neutral endopeptidase. SEGMENT 21 of 24 SOURCE Human cell line NALM 6 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 78) AUTHORS D'Adamio,L., Shipp,M.A., Masteller,E.L. and Reinherz,E.L. TITLE Organization of gene encoding common acute lymphoblastic leukemia antigen (neutral endopeptidase 24.11): Multiple miniexons and three separate 5' untranslated regions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 7103-7107 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by L.D'Adamio 01-AUG-1989. FEATURES from to/span description pept + 8 + 73 common acute lymphoblastic antigen, exon 21 (EC 3.4.24.11) /nomgen="MME" /map="3q21-q27" /hgml_locus_uid="LG0222G" pre-msg < 1 > 78 neutral endopeptidase mRNA and introns IVS < 1 7 CALLA/NEP Intron T IVS 74 > 78 CALLA/NEP Intron U BASE COUNT 29 A 10 C 19 G 20 T ORIGIN Unknown number of bp after segment 20. 1 TTAACAGCTT AATGGAATTA ATACACTGGG AGAAAACATT GCTGATAATG GAGGTCTTGG 61 TCAAGCATAC AGAGTAAG // LOCUS HUMCALLA22 108 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human CALLA/NEP gene encoding neutral endopeptidase, exon 22. ACCESSION M26626 KEYWORDS common acute lymphoblastic leukemia antigen; integral membrane protein; neutral endopeptidase. SEGMENT 22 of 24 SOURCE Human cell line NALM 6 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 108) AUTHORS D'Adamio,L., Shipp,M.A., Masteller,E.L. and Reinherz,E.L. TITLE Organization of gene encoding common acute lymphoblastic leukemia antigen (neutral endopeptidase 24.11): Multiple miniexons and three separate 5' untranslated regions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 7103-7107 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by L.D'Adamio 01-AUG-1989. FEATURES from to/span description pept + 8 + 103 common acute lymphoblastic antigen, exon 22 (EC 3.4.24.11) /nomgen="MME" /map="3q21-q27" /hgml_locus_uid="LG0222G" pre-msg < 1 > 108 neutral endopeptidase mRNA and introns IVS < 1 7 CALLA/NEP Intron U IVS 104 > 108 CALLA/NEP Intron V BASE COUNT 39 A 20 C 16 G 33 T ORIGIN Unknown number of bp after segment 21. 1 TGCCTAGGCC TATCAGAATT ATATTAAAAA GAATGGCGAA GAAAAATTAC TTCCTGGACT 61 TGACCTAAAT CACAAACAAC TATTTTTCTT GAACTTTGCA CAGGTATT // LOCUS HUMCALLA23 89 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human CALLA/NEP gene encoding neutral endopeptidase, exon 23. ACCESSION M26627 KEYWORDS common acute lymphoblastic leukemia antigen; integral membrane protein; neutral endopeptidase. SEGMENT 23 of 24 SOURCE Human cell line NALM 6 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 89) AUTHORS D'Adamio,L., Shipp,M.A., Masteller,E.L. and Reinherz,E.L. TITLE Organization of gene encoding common acute lymphoblastic leukemia antigen (neutral endopeptidase 24.11): Multiple miniexons and three separate 5' untranslated regions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 7103-7107 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by L.D'Adamio 01-AUG-1989. FEATURES from to/span description pept + 8 + 84 common acute lymphoblastic antigen, exon 23 (EC 3.4.24.11) /nomgen="MME" /map="3q21-q27" /hgml_locus_uid="LG0222G" pre-msg < 1 > 89 neutral endopeptidase mRNA and introns IVS < 1 7 CALLA/NEP Intron V IVS 85 89 CALLA/NEP Intron W BASE COUNT 23 A 18 C 24 G 24 T ORIGIN Unknown number of bp after segment 22. 1 TCTCTAGGTG TGGTGTGGAA CCTATAGGCC AGAGTATGCG GTTAACTCCA TTAAAACAGA 61 TGTGCACAGT CCAGGCAATT TCAGGTGCT // LOCUS HUMCALLA24 121 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human CALLA/NEP gene encoding neutral endopeptidase, exon 24. ACCESSION M26628 KEYWORDS common acute lymphoblastic leukemia antigen; integral membrane protein; neutral endopeptidase. SEGMENT 24 of 24 SOURCE Human cell line NALM 6 DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 121) AUTHORS D'Adamio,L., Shipp,M.A., Masteller,E.L. and Reinherz,E.L. TITLE Organization of gene encoding common acute lymphoblastic leukemia antigen (neutral endopeptidase 24.11): Multiple miniexons and three separate 5' untranslated regions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 86, 7103-7107 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by L.D'Adamio 01-AUG-1989. FEATURES from to/span description pept + 8 107 common acute lymphoblastic antigen, exon 24 (EC 3.4.24.11) /nomgen="MME" /map="3q21-q27" /hgml_locus_uid="LG0222G" pre-msg < 1 107 neutral endopeptidase mRNA and introns IVS < 1 7 CALLA/NEP Intron W BASE COUNT 38 A 23 C 28 G 32 T ORIGIN Unknown number of bp after segment 23. 1 TTCAAAGGAT TATTGGGACT TTGCAGAACT CTGCAGAGTT TTCAGAAGCC TTTCACTGCC 61 GCAAGAATTC ATACATGAAT CCAGAAAAGA AGTGCCGGGT TTGGTGATCT TCAAAAGAAG 121 C // LOCUS HUMCANPRA 1154 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human calcium-dependent protease large subunit (CANPmL) gene, promoter region and exon 1. ACCESSION J04700 KEYWORDS calcium-dependent protease; calpain; protease. SOURCE Human lymph node DNA, clone M1-3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1154) AUTHORS Hata,A., Ohno,S., Akita,Y. and Suzuki,K. TITLE Tandemly reiterated negative enhancer-like elements regulate transcription of a human gene for the large subunit of calcium-dependent protease JOURNAL J. Biol. Chem. 264, 6404-6411 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by A.Hata, 03-FEB-1989. FEATURES from to/span description pept 780 / 1016 calcium-dependent protease large subunit, exon 1 (put.), (EC 3.4.22.17) IVS 1017 > 1154 CANPmL intron A pre-msg 638 > 1154 CANPmL mRNA and intron pre-msg 639 > 1154 CANPmL mRNA and intron pre-msg 643 > 1154 CANPmL mRNA and intron pre-msg 644 > 1154 CANPmL mRNA and intron pre-msg 660 > 1154 CANPmL mRNA and intron pre-msg 661 > 1154 CANPmL mRNA and intron pre-msg 662 > 1154 CANPmL mRNA and intron pre-msg 664 > 1154 CANPmL mRNA and intron pre-msg 666 > 1154 CANPmL mRNA and intron pre-msg 677 > 1154 CANPmL mRNA and intron rpt 656 670 CANPmL degenerate repeat A copy A rpt 671 686 CANPmL degenerate repeat A copy B BASE COUNT 198 A 371 C 391 G 194 T ORIGIN 759 bp upstream of RsaI site. 1 CGGCCTCCCA AAGTGTTGGG ATTACAGGTG TGAACTACCG CACCCAGCCT TATTGGCCAA 61 ATCTGACACC CTATCTATAA TCAATAACAG CTGAGCAGTA GTAATTGCAG CATTATGCAA 121 TAGCTCGGTG CTTAGAGTCC CTGTTCTGCC TGTGACTGGT GTGCCACCAT TGGGGCCGTC 181 ATTTAGCGTC TCCGAGGCTC AGTTTTCTCA TCTGTAAAAT GGGGACAATA TCAGCGCCTT 241 CTTCAGAGTC GCTGGGAGGA TTAAATGAGA TGATGTATGC AGAGCCGTTA AGACGATGTT 301 TGGCACAAAG TTCAGGGCAG CTGGTTTAGT TTCCTCGACT TCACTACCTG ACCCTCTGCT 361 AACTCCCCGG GTGTTTTCCG GACGGCCACA ACTATCCTAG CCTTCTTCCC TATGGGCTGC 421 AAAGGTGGCC TCGGGTTCCG GTGGGAGCCC CAACTCTGGA CCGCGATTCG CGAGCCTCCC 481 CGGCGCCGGG CCGTGCATCC CGGGAGCTGT CCGCAGATGG CAGCACCGGC CCGTGTCGCG 541 GCGTTCCCGG CGCTCGGCAG GCCGAGATGG CTGGTCCCGG GCCGGGAGCC CAGCAGGCCG 601 GGAGCGGCTG AGGCCACACC CCGCGGGGCC GGGCCGCTTC CCTCCGGTGA ATCATCGCTC 661 GCAGCGGCGG CGCCCGCAGT GGCCGCAGCA GGCCGCCGGG CCCTGGCCGC GCCCCAGCCG 721 AGCGCAGCGC GGAGTCGCCC CGACCTTTCT CTGCGCAGTA CGGCCGCCGG GACCGCAGCA 781 TGGCGGGCAT CGCGGCCAAG CTGGCGAAGG ACCGGGAGGC GGCCGAGGGG CTGGGCTCCC 841 ACGAGAGGGC CATCAAGTAC CTCAACCAGG ACTACGAGGC GCTGCGGAAC GAGTGCCTCG 901 AGGCCGGGAC GCTCTTCCAG GACCCGTCCT TCCCGGCCAT CCCCTCGGCC CTGGGCTTCA 961 AGGAGTTGGG GCCCTACTCC GGCAAAACCC GGGGCATCGA GTGGAAGCGC CCCACGGTAG 1021 GAAGCGCGCG GCAGGACGCG GGCAGGGCGG AGAGCCGGGC AGGGCGGGGT GCAGGCCGGC 1081 CCGCGCGCGC TGGGGGGGGG CACCGGGTGC TGCAGTGGGG AAGCCGAAGC AGGCCAGATC 1141 TGGACACTCG GCGC // LOCUS HUMCAPG 3734 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human cathepsin G gene, complete cds. ACCESSION J04990 KEYWORDS cathepsin G; serine protease. SOURCE Human lung fibroblast DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 3734) AUTHORS Hohn,P.A., Popescu,N.C., Hanson,R.D., Salvesen,G. and Ley,T.J. TITLE Genomic organization and chromosomal localization of the human cathepsin G gene JOURNAL J. Biol. Chem. 264, 13412-13419 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable copy of sequence [1] kindly submitted by T.J.Ley, 13-JUN-1989. FEATURES from to/span description pept 350 404 cathepsin G, exon 1 1161 1308 cathepsin G, exon 2 1763 1898 cathepsin G, exon 3 2074 2328 cathepsin G, exon 4 2763 2936 cathepsin G, exon 5 pre-msg 322 3020 cathepsin G mRNA and introns IVS 405 1160 cathepsin G Intron A IVS 1309 1762 cathepsin G Intron B IVS 1899 2073 cathepsin G Intron C IVS 2329 2762 cathepsin G Intron D signal 253 256 CAAT box signal 293 297 TATA box signal 2990 2995 polyA signal BASE COUNT 990 A 960 C 959 G 825 T ORIGIN Chromosome 14q11.2. 1 CTTGCTTTGC TGGAGTATTC TGGTAATTTG ATGGGTTGAG GGTTCTGGAC ACAATGCCCC 61 AAGCCCCTTC CTTGTTGTGC TGGGTTCCTA TTTCTGCTCT CGGCACTGAC TTAGCAGCTG 121 CTCAAGAGCT CACTATGTTG GCTTGGATTA CACGGTCTCA CCCACATCTC CGGCAGTTTG 181 TGGGCAAACC TCCTGAGCAG CCTTGGGTGA TGAAACCTTT CATGGTAGCA GGAGAATGGG 241 ACTGTGAATT CTCAATCCCC TGTCCCCACC CCTTCCTTCC TCTCTCAGGG CCTTAAAGTC 301 TAGGAGGAGG AAGCACAGCA GCAACTGACT GGGCAGCCTT TCAGGAAAGA TGCAGCCACT 361 CCTGCTTCTG CTGGCCTTTC TCCTACCCAC TGGGGCTGAG GCAGGTGAGT GACCATCCCC 421 ACCCTCAGAG GCCTGACCTC ATCCCATAGA TTCTTGAGCC AAATTGCCTT GGTATATCCT 481 AATTCTGTAC TGTTGAGCAA GTTATTTGAA TTTGTGTTTC CTCATCTATA AAATGAGAAT 541 AATATTAATA CCGATCTTGC AGAGTTGCCA TGAGAGTTAA ATAAGTTAGA GTATTTAAAT 601 GTCTTGGAAT TGCCCGCACA CTATAAGTGC TATAAAAACA TGCTTTGTGT AAATAATTTG 661 GCAGCATGTG TCAGACCCTA CCTAGGAGGT AAGAATACAG CAATAACAGT ACCATCAGCT 721 CATGTCTAGA TTTTTAAACA CCAGTCCCAC GTGGTCTTGA ATTGGACTCA GAGGGCTCTG 781 GGAAGCTCCA TGAGGATAAA AGTATAAGGG AACTTCAGGA ACAATCCTGT ACTTACAGCA 841 AAGCATTCTC CTCAATACCT GAGGCTGAAG CTGGCCTTGC CTGGAACAAG GGTTGTTCTC 901 CCTCTTTTGG AGAGGAGGAG GGAGGTGAGG CCTAGGATGG GGAAAAGGGC TCCTTTCAAG 961 ACAGCAGTGT TTCCTGTAGA ACCCTGGAGC CCCCTCCCAA TCTGCTGCCC CATAGACTCC 1021 AAGCCTCAGC ACCATCTCCT CCCTCTCCTG CACCCTCTCT CCTGCCGTCC CCATCTTCCA 1081 GCCTTTCTGG AGCCACCAAT CTGGTACCCA CATTGCAGGT TCAGCAAGCA TAGAGCTAAG 1141 TGCCAAATGC TTCCTTCCAG GGGAGATCAT CGGAGGCCGG GAGAGCAGGC CCCACTCCCG 1201 CCCCTACATG GCGTATCTTC AGATCCAGAG TCCAGCAGGT CAGAGCAGAT GTGGAGGGTT 1261 CCTGGTGCGA GAAGACTTTG TGCTGACAGC AGCTCATTGC TGGGGAAGGT GAGGAGCTAA 1321 GGAACTTCCT GGCCAGCCAG GAACACAGCC CTGCGGAGCT CTTCGGTGGA AGAGCCATCT 1381 GAAAGAAGAG TTGTAGCAAT GAAAGGGTGA AAGAAAGACC AAGTGAGTCT TTGCGGGAGG 1441 GAACAGGCCA GTGTAAATGA GGAGGAAAGG AGGATAAGAT CAAAAAGAGC AAGAGGAAGA 1501 GATGGAAGAC ACATATTGGG GCTCAAAATA TAAACTCAGG CTATTTATCA ACTTAATCTG 1561 GGGAAGTAAA CCTGAAGGCA AGTACCACCC TGTCATCCCT AGCTCAGAGC TGCTGAGAAA 1621 GAGGATACAG CTGAGCCCCA GGGCCCTCCC ATCCCCTCGA TTCTGGTTAG CTGCAGTCTT 1681 GCCCTCCCCG TGCTGTCTGC CTACCCTGCA GAGCTGGTGG ACCATAGCTC CTGCAGCCCA 1741 GACCTACCTC TTGCTTTTGC AGCAATATAA ATGTCACCCT GGGCGCCCAC AATATCCAGA 1801 GACGGGAAAA CACCCAGCAA CACATCACTG CGCGCAGAGC CATCCGCCAC CCTCAATATA 1861 ATCAGCGGAC CATCCAGAAT GACATCATGT TATTGCAGGT ACCACCTACC TGGCCCTCTG 1921 GCTCCTTCCT AGTGTGTCCG GGGACAATGG AGGAGGAAGT GAGGGCAAGG CTCCGGGGTG 1981 GCGGGGAGGG CATGGGATGT GTACTGCACC AGCGACCCCC GAGCCTTGGC TGGAGGCCCC 2041 AGCTGAGCGG GAACGCCTAC ATTCTTCCTC CAGCTGAGCA GAAGAGTCAG ACGGAATCGA 2101 AACGTGAACC CAGTGGCTCT GCCTAGAGCC CAGGAGGGAC TGAGACCCGG GACGCTGTGC 2161 ACTGTGGCCG GCTGGGGCAG GGTCAGCATG AGGAGGGGAA CAGATACACT CCGAGAGGTG 2221 CAGCTGAGAG TGCAGAGGGA TAGGCAGTGC CTCCGCATCT TCGGTTCCTA CGACCCCCGA 2281 AGGCAGATTT GTGTGGGGGA CCGGCGGGAA CGGAAGGCTG CCTTCAAGGT AAGGCATGGG 2341 CATTGGCCAA CACACCCCGG GAGAGAGGGG CCCGTGCAGA GCCAGGCAGT GCGAACAGAT 2401 TCCATCCCCA CAGCCTCAGC CTGGCAGCCA GACCAGGGTG GGCTGGGGAT TGTTTTCCCC 2461 ATCAACCTGG TCTCTGGGGG AATAGGAGGA AGACCCACAA CACATACATA GGCAACATTC 2521 TCCTGGAGAA GGGAGAGGTA CCTTGACTCA GATTGGGCTG GAGACAGTAA TTAAGGCAGA 2581 GCTGAAGTCC AGCGACCGAA AAGATCCAGA GGCTTGGCTC CTGTACCCCA CCGATCTTCC 2641 ATCTCACACA CACCCAGCAA TTGAAGGGGC CCACCCACCC CTGCCTTCCC TGAGAGCCCG 2701 GAGCTCAGGG AAGCAGGAGC AGGGAGGCCT GTCTCAGTCT CCCTTCTCCT CTCTACCTAC 2761 AGGGGGATTC CGGAGGCCCC CTGCTGTGTA ACAATGTGGC CCACGGCATC GTCTCCTATG 2821 GAAAGTCGTC AGGGGTTCCT CCAGAAGTCT TCACCAGGGT CTCAAGTTTC CTGCCCTGGA 2881 TAAGGACAAC AATGAGAAGC TTCAAACTGC TGGATCAGAT GGAGACCCCC CTGTGACTGA 2941 CTCTTCTTCT CGGGGACACA GGCCAGCTCC ACAGTGTTGC CAGAGCCTTA ATAAACGTCC 3001 ACAGAGTATA AATAACCAAT TCCTCATTTG TTCATTAAAC GTCATTCAGT ACTTAGTTTG 3061 TTTGGATTGC TACAACAAAA TAGCACAAAT TGGGTGGCTT ATAAATAACA AATTTATTTC 3121 TCACAGGTCT AGAGGCTAAG AAGTCTAAGA TCAAGTCACT AGCAGATTCA GTGTCTAATT 3181 AGGGCCCATT TTCTGGTTCA CAGACAACCA TCCTCTCCCT GTGTCCACAT ATGGCAAAAG 3241 GGGCAAGGGA ATTCTCTGAT GTCTCTTTTA CAAGGGACCT AGTCTCATTC AAAGAGCTCA 3301 GCTTTTACGA CCTAATCACA TCCCAAAGGC CCCACCTAAT GCCATCACGA CATTGGGGAT 3361 TAGGTCTGGG AAACATAGGG AAAGAGTGTC TCTACACAAA AATTTTAAAA TTAGCCAGGC 3421 ATGGTGGCAT GTGTCTATAG TCCCAGCTAC TTGGGAGGCT AAAGTGGAAG GATTAGTTGA 3481 ACCCACGAGG TTGAGGCTTC AGTGAACCAT GCACTCCAGC CTGAGCGACA GAGCAAGACA 3541 CCATTCCAAG AAAGAAAAAA AAAAAGACTG GCAGGCCAAA AAGACAGAAC TGAAATTCCA 3601 AAAAAAAAGA CCTACTTTAG TGTATGAAAA AGGTGGCATC TCAAATCACT GGGAAACAAT 3661 GGAATTTTTG AATAAATAGC ATTAGAACCA ACCTAGATAG ATATTTGGAG GGGATGGAAG 3721 GTATAATTGG ATCC // LOCUS HUMCATF 1848 bp ss-mRNA PRI 15-JUN-1989 DEFINITION Human fibroblast catalase gene, partial exon 1, complete exon 2. ACCESSION K02400 KEYWORDS catalase. SOURCE Human fibroblast: cDNA (library of Okuyama and Berg) to mRNA, clones pCAT1 and pCAT41. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1848) AUTHORS Korneluk,R.G., Quan,F., Lewis,W.H., Guise,K.S., Willard,H.F., Holmes,M.T. and Gravel,R.A. TITLE Isolation of human fibroblast catalase cDNA clones: Sequence of clones derived from spliced and unspliced mRNA JOURNAL J. Biol. Chem. 259, 13819-13823 (1984) STANDARD full staff_review FEATURES from to/span description pept < 1 675 catalase, exon A (AA at 1) /nomgen="CAT" /map="11p13" /hgml_locus_uid="LL0091V" 1138 1818 catalase, exon B pre-msg < 1 > 1848 catalase mRNA IVS 676 1137 catalase cds intron A variant 674 1137 g...t in pCAT1; gt in pCAT41 BASE COUNT 500 A 418 C 407 G 523 T ORIGIN 139 bp upstream of PvuII site; chromosome 11p13. 1 AAAGGAGCAG GGGCCTTTGG CTACTTTGAG GTCACACATG ACATTACCAA ATACTCCAAG 61 GCAAAGGTAT TTGAGCATAT TGGAAAGAAG ACTCCCATCG CAGTTCGGTT CTCCACTGTT 121 GCTGGAGAAT CGGGTTCAGC TGACACAGTT CGGGACCCTC GTGGGTTTGC AGTGAAATTT 181 TACACAGAAG ATGGTAACTG GGATCTCGTT GGAAATAACA CCCCCATTTT CTTCATCAGG 241 GATCCCATAT TGTTTCCATC TTTTATCCAC AGCCAAAAGA GAAATCCTCA GACACATCTG 301 AAGGATCCGG ACATGGTCTG GGACTTCTGG AGCCTACGTC CTGAGTCTCT GCATCAGGTT 361 TCTTTCTTGT TCAGTGATCG GGGGATTCCA GATGGACATC GCCACATGAA TGGATATGGA 421 TCACATACTT TCAAGCTGGT TAATGCAAAT GGGGAGGCAG TTTATTGCAA ATTCCATTAT 481 AAGACTGACC AGGGCATCAA AAACCTTTCT GTTGAAGATG CGGCGAGACT TTCCCAGGAA 541 GATCCTGACT ATGGCATCCG GGATCTTTTT AACGCCATTG CCACAGGAAA GTACCCCTCC 601 TGGACTTTTT ACATCCAGGT CATGACATTT AATCAGGCAG AAACTTTTCC ATTTAATCCA 661 TTCGATCTCA CCAAGGTGAG TCAGTAAACA ACTATATTGT TTTCTTTTTT AAGTCTCTTC 721 TTACCTAATT AGAAAAAAAA TCTAGTCAAA CAATTATAAT AATGGGGAAG TCATATACAA 781 AATACAGAGG GTACCACTTC AGAGTGTCCT AAGCTGTGAA TGAGTGCTTA CCAGCATCTT 841 ACTTCCACGT TCCTGTTTGT CATTTCATTG AGTATGTGTA TGTGGCTTCA TATATTGTTA 901 TTAACAGGGA ACAGATTATG AAAAGCTGAT GTACTTTTTC CTGGGGAAAC TGTCAGTATT 961 TACCACTTAC TATTGTGAAA GATTTAACTA AGGCACTCAT CTTAAATTCT TATGTTTTAT 1021 TGGATTTAAA AATTATTTTC ATTGGCTTGA TTGTATTTGA AATCTGGTAT TTTTGTGGGT 1081 AGCTTTGATT TCCTTCAGTT GATTGCCTGG TAATTGTGAA TATGACATCA TTTTCAGGTT 1141 TGGCCTCACA AGGACTACCC TCTCATCCCA GTTGGTAAAC TGGTCTTAAA CCGGAATCCA 1201 GTTAATTACT TTGCTGAGGT TGAACAGATA GCCTTCGACC CAAGCAACAT GCCACCTGGC 1261 ATTGAGGCCA GTCCTGACAA AATGCTTCAG GGCCGCCTTT TTGCCTATCC TGACACTCAC 1321 CGCCATCGCC TGGGACCCAA TTATCTTCAT ATACCTGTGA ACTGTCCCTA CCGTGCTCGA 1381 GTGGCCAACT ACCAACGTGA CGGCCCGATG TGCATGCAGG ACAATCAGGG TGGTGCTCCA 1441 AATTACTACC CCAACAGCTT TGGTGCTCCG GAACAACAGC CTTCTGCCTT GGAGCACAGC 1501 ATCCAATATT CTGGAGAAGT GCGGAGATTC AACACTGCCA ATGATGATAA CGTTACTCAG 1561 GTGCGGGCAT TCTATGTGAA CGTGCTGAAT GAGGAACAGA GGAAACGTCT GTGTGAGAAC 1621 ATTGCCGGCC ACCTGAAGGA TGCACAAATT TTCATCCAGA AGAAAGCGGT CAAGAACTTC 1681 ACTGAGGTCC ACCCTGACTA CGGGAGCCAC ATCCAGGCTC TTCTGGACAA GTACAATGCT 1741 GAGAAGCCTA AGAATGCGAT TCACACCTTT GTGCAGTCCG GATCTCACTT GGCGGCAAGG 1801 GAGAAGGCAA ATCTGTGAGG CCGGGGCCCT GCACCTGTGC ATGAAGCT // LOCUS HUMCCK1 529 bp ds-DNA PRI 31-AUG-1987 DEFINITION Human cholecystokinin (CCK) gene, exon 1. ACCESSION N00050 M15843 KEYWORDS cholecystokinin. SEGMENT 1 of 3 SOURCE Human duodenum DNA, clone lambda-ck58. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 529) AUTHORS Takahashi,Y., Fukushige,S., Murotsu,T. and Matsubara,K. TITLE Structure of human cholecystokinin gene and its chromosomal location JOURNAL Gene 50, 353-360 (1986) STANDARD full staff_review COMMENT Draft entry and clean copy sequence for [1] kindly provided by K.Matsubara, 01-JUN-1987. The CCK gene is located on chromosome 3 pter-p21. FEATURES from to/span description pre-msg 349 > 529 CCK mRNA (alt.) /nomgen="CCK" /map="3pter-p21" /hgml_locus_uid="LS0019B" pre-msg 387 > 529 CCK mRNA (alt.) IVS 449 > 529 CCK mRNA intron A BASE COUNT 97 A 167 C 156 G 109 T ORIGIN 145 bp upstream of HindIII site. 1 TTTCTAGCCC TCCGGTCCCC AAAGCGAACT AAGCTAGCGG CAGCTTCTCC ATCCCGGAGG 61 AGACTCCAAA CTCCTAGGTT TCCCACCTTG GAGAGTACTG CTCTGGAATG CGCAGGGGAG 121 CGGCACCTGG AAAGGGTTGG GCGGAAGCTT CTCGGACCCA GAGGGGCGGA CACCCGGCAC 181 CGCGCGTGGG AGGGGATTAA CTGGACCCCA CTAGACCACC TCCCCCTCTC TCCCAGGAGC 241 CACTTCAACC TGGTTGTCGC CCCAGTGGCC GCCCTCTGAG CACGTGTTAC TGCCAGTCTG 301 CGTCAGCGTT GGGTAAATAC ATGACTGGCC GACGCGCCGG GCGGGGCTAT TTAAGAGACA 361 GCCGCCCGCT GGTCCTCCCT GAACTTGGCT CAGCTGCCGG GCTGCTCCGG TTGGAAACGC 421 CAAGCCAGCT GCCGTCCTAA TCCAAAAGGT AGGTCCCTCG GTTCGTCAGC GGGGGATCCA 481 AGTGTCGTGT GTCTTTGAAA GGATGCTGTC TTCAGCTTTG TGCCCATTT // LOCUS HUMCCK2 471 bp ds-DNA PRI 31-AUG-1987 DEFINITION Human cholecystokinin (CCK) gene, exon 2. ACCESSION N00031 M11383 M15843 KEYWORDS cholecystokinin. SEGMENT 2 of 3 SOURCE Human fetal liver DNA, library of R.M.Lawn, clone lambda-CK58 [1]; duodenum DNA, clone lambda-ck58 [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 471) AUTHORS Takahashi,Y., Kato,K., Hayashizaki,Y., Wakabayashi,T., Ohtsuka,E., Matsuki,S., Ikehara,M. and Matsubara,K. TITLE Molecular cloning of the human cholecystokinin gene by use of a synthetic probe containing deoxyinosine JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 1931-1935 (1985) STANDARD full staff_review REFERENCE 2 (bases 82 to 471) AUTHORS Takahashi,Y., Fukushige,S., Murotsu,T. and Matsubara,K. TITLE Structure of human cholecystokinin gene and its chromosomal location JOURNAL Gene 50, 353-360 (1986) STANDARD full staff_review COMMENT Various forms of cholecystokinin differ in amino acid chain length, but are all biologically active. These peptides are apparently formed by differential processing from a single precursor polypeptide. They all have the same carboxyl terminus. The CCK gene is located on chromosome 3 pter-p21. FEATURES from to/span description pept 195 + 408 preprocholecystokinin, exon 2, first expressed exon /nomgen="CCK" /map="3pter-p21" /hgml_locus_uid="LS0019B" sigp 195 254 cholecystokinin signal peptide matp 330 + 408 cholecystokinin 58 matp 387 + 408 cholecystokinin 39 matp 405 + 408 cholecystokinin 33 pre-msg < 1 > 471 CCK mRNA IVS < 1 194 CCK mRNA intron A IVS 409 > 471 CCK intron B BASE COUNT 63 A 166 C 157 G 85 T ORIGIN About 1.0 kb after segment 1; 295 upstream of PstI site. 1 AGCCAGGTGC CCGGGGCGCG CGCCATGGCA CTCGGCTGGG TCAGCGCTTG GCGAGCCCTC 61 GTGCTTGCGG CGGCCTGGCG GGTCTGGAGC ACCTGTGTTT GTCCTAAGCC CGCTTCTGGG 121 TGCCCGGCCC GCCGCAGGAG CTCTGTTGCC CAGCCTTTCA GTTAACGCGC CCTCCTCCTT 181 TGCTCCCTAC AGCCATGAAC AGCGGCGTGT GCCTGTGCGT GCTGATGGCG GTACTGGCGG 241 CTGGCGCCCT GACGCAGCCG GTGCCTCCCG CAGATCCCGC GGGCTCCGGG CTGCAGCGGG 301 CAGAGGAGGC GCCCCGTAGG CAGCTGAGGG TATCGCAGAG AACGGATGGC GAGTCCCGAG 361 CGCACCTGGG CGCCCTGCTG GCAAGATACA TCCAGCAGGC CCGGAAAGGT AAGAATGCTG 421 CCTCCCCATC CCTCACTTCT GCCCTTGTTC CCAGGCTCCC GATGCTGACC C // LOCUS HUMCCK3 534 bp ds-DNA PRI 31-AUG-1987 DEFINITION Human cholecystokinin (CCK) gene, exon 3. ACCESSION L00354 M11383 M15843 KEYWORDS cholecystokinin. SEGMENT 3 of 3 SOURCE Human fetal liver DNA, library of R.M.Lawn, clone lambda-CK58 [[1]; duodenum, clone lambda-ck58 [2]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 386) AUTHORS Takahashi,Y., Kato,K., Hayashizaki,Y., Wakabayashi,T., Ohtsuka,E., Matsuki,S., Ikehara,M. and Matsubara,K. TITLE Molecular cloning of the human cholecystokinin gene by use of a synthetic probe containing deoxyinosine JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 1931-1935 (1985) STANDARD full staff_review REFERENCE 2 (bases 1 to 534) AUTHORS Takahashi,Y., Fukushige,S., Murotsu,T. and Matsubara,K. TITLE Structure of human cholecystokinin gene and its chromosomal location JOURNAL Gene 50, 353-360 (1986) STANDARD full staff_review COMMENT The CCK gene is located on chromosome 3 pter-p21. FEATURES from to/span description pept + 89 222 preprocholecystokinin, exon 3 /nomgen="CCK" /map="3pter-p21" /hgml_locus_uid="LS0019B" matp + 89 183 cholecystokinin 58 matp + 89 183 cholecystokinin 39 matp + 89 183 cholecystokinin 33 matp 148 183 cholecystokinin 12 matp 160 183 cholecystokinin 8 pre-msg < 1 498 CCK mRNA IVS < 1 88 CCK intron B BASE COUNT 136 A 137 C 113 G 148 T ORIGIN About 6 kb after segment 1. 1 TTTTTCCATT GCTCTCCTTC CCAACAAATT GAACTAGAGA TGCTTTTAGA TGCAATGTCC 61 CTGTTTCCTT CCTCCCTGCT CCTTGTAGCT CCTTCTGGAC GAATGTCCAT CGTTAAGAAC 121 CTGCAGAACC TGGACCCCAG CCACAGGATA AGTGACCGGG ACTACATGGG CTGGATGGAT 181 TTTGGCCGTC GCAGTGCCGA GGAGTATGAG TACCCCTCCT AGAGGACCCA GCCGCCATCA 241 GCCCAACGGA AGCAACCTCC CAACCCAGAG GAGGCAGAAT AAGACAACAA TCACACTCAT 301 AACTCATTGT CTGTGGAGTT TGACATTGAA TGTATCTATT TATTAAGTTC TCAATGTGAA 361 AATTGTGTCT GTAAGATTGT CCAGTGCAAC CACACACGCT CACCAGAAGT TGTGCAAACT 421 GAAGACAAAA CTGTTTTCTT CATCTGTGAC TCCTGTTCTG AAAATGTTGT TATGCTATTA 481 AAGTGATTTC ATTCTGCCGT GTCGTTCCCT GCGTCTACTG AGGGCAAAGG GCTC // LOCUS HUMCD14G 1570 bp ds-DNA PRI 15-MAR-1989 DEFINITION Human gene for CD14 differentiation antigen. ACCESSION X06882 KEYWORDS CD14 antigen; antigen; monocyte differentiation antigen; surface antigen. SOURCE human (Homo sapiens). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1570) AUTHORS Goyert,S.M. JOURNAL Unpublished (1988) Hospital-Joint Disease,301 E 17th St,NY,NY 10003 STANDARD simple automatic REFERENCE 2 (bases 1 to 1570) AUTHORS Ferrero,E. and Goyert,S.M. TITLE Nucleotide sequence of the gene encoding the monocyte differentiation antigen, CD14 JOURNAL Nucleic Acids Res. 16, 4173-4173 (1988) STANDARD simple automatic COMMENT *source: cell type=lymphocyte; library=(lambda)gtWes; **map: long arm of chromosome 5, 5q23-q31; the authors also sequenced corresponding cDNA Data kindly reviewed (28-MAR-1988) by Goyert S.M. EMBL features not translated to GenBank features: key from to description FEATURES from to/span description pept 156 158 CD14 protein precursor, exon1 /nomgen="CD14" /map="5q22-q32" /hgml_locus_uid="LE0121J" 247 1371 CD14 protein precursor, exon2 sigp 156 159 CD14 protein signal peptide 247 300 CD14 protein signal peptide matp 301 1368 CD14 protein pre-msg 51 > 1482 CD14 mRNA and intron IVS 159 246 CD14 intron BASE COUNT 314 A 486 C 453 G 317 T ORIGIN 1 CAGAATGACA TCCCAGGATT ACATAAACTG TCAGAGGCAG CCGAAGAGTT CACAAGTGTG 61 AAGCCTGGAA GCCGGCGGGT GCCGCTGTGT AGGAAAGAAG CTAAAGCACT TCCAGAGCCT 121 GTCCGGAGCT CAGAGGTTCG GAAGACTTAT CGACCATGGT GAGTGTAGGG TCTTGGGGTC 181 GAACGCGTGC CACTCGGGAG CCACAGGGGT TGGATGGGGC CTCCTAGACC TCTGCTCTCT 241 CCCCAGGAGC GCGCGTCCTG CTTGTTGCTG CTGCTGCTGC CGCTGGTGCA CGTCTCTGCG 301 ACCACGCCAG AACCTTGTGA GCTGGACGAT GAAGATTTCC GCTGCGTCTG CAACTTCTCC 361 GAACCTCAGC CCGACTGGTC CGAAGCCTTC CAGTGTGTGT CTGCAGTAGA GGTGGAGATC 421 CATGCCGGCG GTCTCAACCT AGAGCCGTTT CTAAAGCGCG TCGATGCGGA CGCCGACCCG 481 CGGCAGTATG CTGACACGGT CAAGGCTCTC CGCGTGCGGC GGCTCACAGT GGGAGCCGCA 541 CAGGTTCCTG CTCAGCTACT GGTAGGCGCC CTGCGTGTGC TAGCGTACTC CCGCCTCAAG 601 GAACTGACGC TCGAGGACCT AAAGATAACC GGCACCATGC CTCCGCTGCC TCTGGAAGCC 661 ACAGGACTTG CACTTTCCAG CTTGCGCCTA CGCAACGTGT CGTGGGCGAC AGGGCGTTCT 721 TGGCTCGCCG AGCTGCAGCA GTGGCTCAAG CCAGGCCTCA AGGTACTGAG CATTGCCCAA 781 GCACACTCGC CTGCCTTTTC CTACGAACAG GTTCGCGCCT TCCCGGCCCT TACCAGCCTA 841 GACCTGTCTG ACAATCCTGG ACTGGGCGAA CGCGGACTGA TGGCGGCTCT CTGTCCCCAC 901 AAGTTCCCGG CCATCCAGAA TCTAGCGCTG CGCAACACAG GAATGGAGAC GCCCACAGGC 961 GTGTGCGCCG CACTGGCGGC GGCAGGTGTG CAGCCCCACA GCCTAGACCT CAGCCACAAC 1021 TCGCTGCGCG CCACCGTAAA CCCTAGCGCT CCGAGATGCA TGTGGTCCAG CGCCCTGAAC 1081 TCCCTCAATC TGTCGTTCGC TGGGCTGGAA CAGGTGCCTA AAGGACTGCC AGCCAAGCTC 1141 AGAGTGCTCG ATCTCAGCTG CAACAGACTG AACAGGGCGC CGCAGCCTGA CGAGCTGCCC 1201 GAGGTGGATA ACCTGACACT GGACGGGAAT CCCTTCCTGG TCCCTGGAAC TGCCCTCCCC 1261 CACGAGGGCT CAATGAACTC CGGCGTGGTC CCAGCCTGTG CACGTTCGAC CCTGTCGGTG 1321 GGGGTGTCGG GAACCCTGGT GCTGCTCCAA GGGGCCCGGG GCTTTGCCTA AGATCCAAGA 1381 CAGAATAATG AATGGACTCA AACTGCCTTG GCTTCAGGGG AGTCCCGTCA GGACGTTGAG 1441 GACTTTTCGA CCAATTCAAC CCTTTGCCCC ACCTTTATTA AAATCTTAAA CAACGGTTCC 1501 GTGTCATTCA TTTAACAGAC CTTTATTGGA TGTCTGCTAT GTGCTGGGCA CAGTACTGGA 1561 TGGGGAATTC // LOCUS HUMCD1A1 328 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human cortical thymocyte antigen CD1a, exon 1. ACCESSION M22080 M18229 J03584 KEYWORDS thymocyte antigen. SEGMENT 1 of 6 SOURCE Human DNA, clone lambda-R4B3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 328) AUTHORS Martin,L.H., Calabi,F., Lefebvre,F.-A., Bilsland,C.A.G. and Milstein,C. TITLE Structure and expression of the human thymocyte antigens CD1a, CD1b, and CD1c JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9189-9193 (1987) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly provided by C.A.G.Bilsland, 08-NOV-1988. FEATURES from to/span description pept 266 + 323 thymocyte antigen CD1a precursor, exon 1 /nomgen="CD1" /map="1" /hgml_locus_uid="LU0071V" sigp 266 313 thymocyte antigen CD1a signal peptide matp 314 + 323 thymocyte antigen CD1a pre-msg 8 > 328 CD1a mRNA and intron IVS 324 > 328 CD1a intron A BASE COUNT 98 A 39 C 105 G 86 T ORIGIN Chromosome 1. 1 GAATTAGGGG AAGGTGAATA AGTTGGAGGT TGGTGACAAG GAGAGAAGCT GGAACAGAGA 61 GGAGAGTCAG AACCAGAGGG AAATGAGAGA CTGAGTAGGC ATCTCAGGGT TTTTGAAGGA 121 GTGGATTTTC TTTGTTGCAG TCAGGGGAGG TTTGTCTGTT GGCTGCAGAA AGAAGTCAGA 181 ATAGAGATAT CGTGGGGTAG GTTTGTTTGG AACAGAAATC AAAGACCAAT TTTTCTGAGA 241 GAAGGAAATA ACATCTGCAA ATGATATGCT GTTTTTGCTA CTTCCATTGT TAGCTGTTCT 301 CCCAGGTGAT GGCAATGCAG ACGGTAAG // LOCUS HUMCD1A2 286 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human cortical thymocyte antigen CD1a, exon 2. ACCESSION M22163 M18229 J03584 KEYWORDS thymocyte antigen. SEGMENT 2 of 6 SOURCE Human DNA, clone lambda-R4B3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 286) AUTHORS Martin,L.H., Calabi,F., Lefebvre,F.-A., Bilsland,C.A.G. and Milstein,C. TITLE Structure and expression of the human thymocyte antigens CD1a, CD1b, and CD1c JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9189-9193 (1987) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly provided by C.A.G.Bilsland, 08-NOV-1988. FEATURES from to/span description pept + 10 + 276 thymocyte antigen CD1a precursor, exon 2 /nomgen="CD1" /map="1" /hgml_locus_uid="LU0071V" matp + 10 + 276 thymocyte antigen CD1a pre-msg < 1 > 286 CD1a mRNA and intron IVS < 1 9 CD1a intron A IVS 277 > 286 CD1a intron B BASE COUNT 70 A 70 C 69 G 77 T ORIGIN About 331 bp after segment 1; chromosome 1. 1 TTGTCGCAGG GCTCAAGGAG CCTCTCTCCT TCCATGTCAC CTGGATCGCA TCCTTTTACA 61 ACCATTCCTG GAAACAAAAT CTGGTCTCAG GTTGGCTGAG TGATTTGCAG ACTCATACCT 121 GGGACAGCAA TTCCAGCACC ATCGTTTTCC TGTGCCCCTG GTCCAGGGGA AACTTCAGCA 181 ATGAGGAGTG GAAGGAACTG GAAACATTAT TCCGTATACG CACCATTCGG TCATTTGAGG 241 GAATTCGTAG ATACGCCCAT GAATTGCAGT TTGAATGTGA GTTCAG // LOCUS HUMCD1A3 299 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human cortical thymocyte antigen CD1a, exon 3. ACCESSION M22164 M18229 J03584 KEYWORDS thymocyte antigen. SEGMENT 3 of 6 SOURCE Human DNA, clone lambda-R4B3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 299) AUTHORS Martin,L.H., Calabi,F., Lefebvre,F.-A., Bilsland,C.A.G. and Milstein,C. TITLE Structure and expression of the human thymocyte antigens CD1a, CD1b, and CD1c JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9189-9193 (1987) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly provided by C.A.G.Bilsland, 08-NOV-1988. FEATURES from to/span description pept + 11 + 289 thymocyte antigen CD1a precursor, exon 3 /nomgen="CD1" /map="1" /hgml_locus_uid="LU0071V" matp + 11 + 289 thymocyte antigen CD1a pre-msg < 1 > 299 CD1a mRNA and intron IVS < 1 10 CD1a intron B IVS 290 > 299 CD1a intron C BASE COUNT 82 A 71 C 70 G 76 T ORIGIN About 613 bp after segment 2; chromosome 1. 1 ATAACCCCAG ATCCTTTTGA GATACAGGTG ACAGGAGGCT GTGAGCTGCA CTCTGGAAAG 61 GTCTCAGGAA GCTTCTTGCA GTTAGCTTAT CAAGGATCAG ACTTTGTGAG CTTCCAGAAC 121 AATTCATGGT TGCCATATCC AGTGGCTGGG AATATGGCCA AGCATTTCTG CAAAGTGCTC 181 AATCAGAATC AGCATGAAAA TGACATAACA CACAATCTTC TCAGTGACAC CTGCCCACGT 241 TTCATCTTGG GTCTTCTTGA TGCAGGAAAG GCACATCTCC AGCGGCAAGG TCAGTCCTG // LOCUS HUMCD1A4 299 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human cortical thymocyte antigen CD1a, exon 4. ACCESSION M22165 M18229 J03584 KEYWORDS thymocyte antigen. SEGMENT 4 of 6 SOURCE Human DNA, clone lambda-R4B3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 299) AUTHORS Martin,L.H., Calabi,F., Lefebvre,F.-A., Bilsland,C.A.G. and Milstein,C. TITLE Structure and expression of the human thymocyte antigens CD1a, CD1b, and CD1c JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9189-9193 (1987) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly provided by C.A.G.Bilsland, 08-NOV-1988. FEATURES from to/span description pept + 11 + 289 thymocyte antigen CD1a precursor, exon 4 /nomgen="CD1" /map="1" /hgml_locus_uid="LU0071V" matp + 11 + 289 thymocyte antigen CD1a pre-msg < 1 > 299 CD1a mRNA and intron IVS < 1 10 CD1a intron C IVS 290 > 299 CD1a intron D BASE COUNT 55 A 82 C 103 G 59 T ORIGIN About 483 bp after segment 3; chromosome 1. 1 TCCTTTGCAG TGAAGCCCGA GGCCTGGCTG TCCCATGGCC CCAGTCCTGG CCCTGGCCAT 61 CTGCAGCTTG TGTGCCATGT CTCAGGATTC TACCCAAAGC CCGTGTGGGT GATGTGGATG 121 CGGGGTGAGC AGGAGCAGCA GGGCACTCAG CGAGGGGACA TCTTGCCCAG TGCTGATGGG 181 ACATGGTATC TCCGCGCAAC CCTGGAGGTG GCCGCTGGGG AGGCAGCTGA CCTGTCCTGT 241 CGGGTGAAGC ACAGCAGTCT AGAGGGCCAG GACATCGTCC TCTACTGGGG TGAGAAAAA // LOCUS HUMCD1A5 111 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human cortical thymocyte antigen CD1a, exon 5. ACCESSION M22166 M18229 J03584 KEYWORDS thymocyte antigen. SEGMENT 5 of 6 SOURCE Human DNA, clone lambda-R4B3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 111) AUTHORS Martin,L.H., Calabi,F., Lefebvre,F.-A., Bilsland,C.A.G. and Milstein,C. TITLE Structure and expression of the human thymocyte antigens CD1a, CD1b, and CD1c JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9189-9193 (1987) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly provided by C.A.G.Bilsland, 08-NOV-1988. FEATURES from to/span description pept + 11 + 101 thymocyte antigen CD1a precursor, exon 5 /nomgen="CD1" /map="1" /hgml_locus_uid="LU0071V" matp + 11 + 101 thymocyte antigen CD1a pre-msg < 1 > 111 CD1a mRNA and intron IVS < 1 10 CD1a intron D IVS 102 > 111 CD1a intron E BASE COUNT 21 A 24 C 28 G 38 T ORIGIN About 333 bp after segment 4; chromosome 1. 1 AAATTCACAG AGCATCACAG TTCCGTGGGC TTCATCATCT TGGCGGTGAT AGTGCCTTTA 61 CTTCTTCTGA TAGGTCTTGC GCTTTGGTTC AGGAAACGCT GGTGAGTTCT T // LOCUS HUMCD1A6 20 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human cortical thymocyte antigen CD1a, exon 6. ACCESSION M22167 M18229 J03584 KEYWORDS thymocyte antigen. SEGMENT 6 of 6 SOURCE Human DNA, clone lambda-R4B3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 20) AUTHORS Martin,L.H., Calabi,F., Lefebvre,F.-A., Bilsland,C.A.G. and Milstein,C. TITLE Structure and expression of the human thymocyte antigens CD1a, CD1b, and CD1c JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9189-9193 (1987) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly provided by C.A.G.Bilsland, 08-NOV-1988. FEATURES from to/span description pept + 11 20 thymocyte antigen CD1a precursor, exon 6 /nomgen="CD1" /map="1" /hgml_locus_uid="LU0071V" matp + 11 17 thymocyte antigen CD1a pre-msg < 1 > 20 CD1a mRNA and intron IVS < 1 10 CD1a intron E BASE COUNT 4 A 5 C 2 G 9 T ORIGIN About 169 bp after segment 5; chromosome 1. 1 TCTCATCCAG TTTCTGTTAA // LOCUS HUMCD1B1 341 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human cortical thymocyte antigen CD1b, exon 1. ACCESSION M22168 M18230 J03584 KEYWORDS thymocyte antigen. SEGMENT 1 of 6 SOURCE Human DNA, clone lambda-R1L5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 341) AUTHORS Martin,L.H., Calabi,F., Lefebvre,F.-A., Bilsland,C.A.G. and Milstein,C. TITLE Structure and expression of the human thymocyte antigens CD1a, CD1b, and CD1c JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9189-9193 (1987) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly provided by C.A.G.Bilsland, 08-NOV-1988. FEATURES from to/span description pept 276 + 336 thymocyte antigen CD1b precursor, exon 1 /nomgen="CD1" /map="1" /hgml_locus_uid="LU0071V" sigp 276 326 thymocyte antigen CD1b signal peptide matp 327 + 336 thymocyte antigen CD1b pre-msg 8 > 341 CD1b mRNA and intron IVS 337 > 341 CD1b intron A BASE COUNT 105 A 50 C 110 G 76 T ORIGIN 211 bp upstream of HindIII site; chromosome 1. 1 CAAGGAGGTA TGAAGGAAGG TGAGGACAGG GAGAGCGGCT GGAAGTCAGG GGGTAAGAGA 61 AACTCTAAAA ATCAGGGCTT GAGGGAAATA AAAGGTGAGG TAAGAGGCTC AGGGCTGTGG 121 GGAGGCACAT TTTTCTCTGA AAAGCAGTTT GGATGAGGAA GAGATTTGGC AGTTGGAAGA 181 GAGAAGAAGT CACTACAGGG TACTGAGGAA AAGCTTTGCT GAAATTGGAG ATCAAATACC 241 AGCTCTGCCA GTAAGAAGTT GCATCTCCCA GTGAAATGCT GCTGCTGCCA TTTCAACTGT 301 TAGCTGTTCT CTTTCCTGGT GGTAACAGTG AACATGGTAA G // LOCUS HUMCD1B2 286 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human cortical thymocyte antigen CD1b, exon 2. ACCESSION M22169 M18230 J03584 KEYWORDS thymocyte antigen. SEGMENT 2 of 6 SOURCE Human DNA, clone lambda-R1L5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 286) AUTHORS Martin,L.H., Calabi,F., Lefebvre,F.-A., Bilsland,C.A.G. and Milstein,C. TITLE Structure and expression of the human thymocyte antigens CD1a, CD1b, and CD1c JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9189-9193 (1987) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly provided by C.A.G.Bilsland, 08-NOV-1988. FEATURES from to/span description pept + 10 + 276 thymocyte antigen CD1b precursor, exon 2 /nomgen="CD1" /map="1" /hgml_locus_uid="LU0071V" matp + 10 + 276 thymocyte antigen CD1b pre-msg < 1 > 286 CD1b mRNA and intron IVS < 1 9 CD1b intron A IVS 277 > 286 CD1b intron B BASE COUNT 69 A 64 C 72 G 81 T ORIGIN About 296 bp after segment 1; chromosome 1. 1 TCTTCACAGC CTTCCAGGGG CCGACCTCCT TTCATGTTAT CCAGACCTCG TCCTTTACCA 61 ATAGTACCTG GGCACAAACT CAAGGCTCAG GCTGGTTGGA TGATTTGCAG ATTCATGGCT 121 GGGATAGCGA CTCAGGCACT GCCATATTCC TGAAGCCTTG GTCTAAAGGT AACTTTAGTG 181 ATAAGGAGGT TGCTGAGTTA GAGGAGATAT TCCGAGTCTA CATCTTTGGA TTCGCTCGAG 241 AAGTACAAGA CTTTGCCGGT GATTTCCAGA TGAAATGTGA GTCTAG // LOCUS HUMCD1B3 299 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human cortical thymocyte antigen CD1b, exon 3. ACCESSION M22170 M18230 J03584 KEYWORDS thymocyte antigen. SEGMENT 3 of 6 SOURCE Human DNA, clone lambda-R1L5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 299) AUTHORS Martin,L.H., Calabi,F., Lefebvre,F.-A., Bilsland,C.A.G. and Milstein,C. TITLE Structure and expression of the human thymocyte antigens CD1a, CD1b, and CD1c JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9189-9193 (1987) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly provided by C.A.G.Bilsland, 08-NOV-1988. FEATURES from to/span description pept + 11 + 289 thymocyte antigen CD1b precursor, exon 3 /nomgen="CD1" /map="1" /hgml_locus_uid="LU0071V" matp + 11 + 289 thymocyte antigen CD1b pre-msg < 1 > 286 CD1b mRNA and intron IVS < 1 10 CD1b intron B IVS 290 > 299 CD1b intron C BASE COUNT 80 A 69 C 76 G 74 T ORIGIN About 543 bp after segment 2; chromosome 1. 1 CTCTACATAG ACCCCTTTGA GATCCAGGGC ATAGCAGGCT GTGAGCTACA TTCTGGAGGT 61 GCCATAGTAA GCTTCCTGAG GGGAGCTCTA GGAGGATTGG ATTTCCTGAG TGTCAAGAAT 121 GCTTCATGTG TGCCTTCCCC AGAAGGTGGC AGCAGGGCAC AGAAATTCTG TGCACTAATC 181 ATACAATATC AAGGTATCAT GGAAACTGTG AGAATTCTCC TCTATGAAAC CTGCCCCCGA 241 TATCTCTTGG GCGTCCTCAA TGCAGGAAAA GCAGATCTGC AAAGACAAGG TTAGTCCTG // LOCUS HUMCD1B4 299 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human cortical thymocyte antigen CD1b, exon 4. ACCESSION M22171 M18230 J03584 KEYWORDS thymocyte antigen. SEGMENT 4 of 6 SOURCE Human DNA, clone lambda-R1L5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 299) AUTHORS Martin,L.H., Calabi,F., Lefebvre,F.-A., Bilsland,C.A.G. and Milstein,C. TITLE Structure and expression of the human thymocyte antigens CD1a, CD1b, and CD1c JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9189-9193 (1987) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly provided by C.A.G.Bilsland, 08-NOV-1988. FEATURES from to/span description pept + 11 + 289 thymocyte antigen CD1b precursor, exon 4 /nomgen="CD1" /map="1" /hgml_locus_uid="LU0071V" matp + 11 + 289 thymocyte antigen CD1b pre-msg < 1 > 286 CD1b mRNA and intron IVS < 1 10 CD1b intron C IVS 290 > 299 CD1b intron D BASE COUNT 60 A 78 C 99 G 62 T ORIGIN About 184 bp after segment 3; chromosome 1. 1 CCTGCCTTAG TGAAGCCTGA GGCCTGGCTG TCCAGTGGCC CCAGTCCTGG ACCTGGCCGT 61 CTGCAGCTTG TGTGCCATGT CTCAGGATTC TACCCAAAGC CCGTGTGGGT GATGTGGATG 121 CGGGGTGAGC AGGAGCAGCA GGGCACTCAG CTAGGGGACA TCCTGCCCAA TGCTAACTGG 181 ACATGGTATC TCCGAGCAAC CCTGGATGTG GCAGATGGGG AGGCGGCTGG CCTGTCCTGT 241 CGGGTGAAGC ACAGCAGTTT AGAGGGCCAG GACATCATCC TCTACTGGAG TAAGAAATA // LOCUS HUMCD1B5 114 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human cortical thymocyte antigen CD1b, exon 5. ACCESSION M22172 M18230 J03584 KEYWORDS thymocyte antigen. SEGMENT 5 of 6 SOURCE Human DNA, clone lambda-R1L5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 114) AUTHORS Martin,L.H., Calabi,F., Lefebvre,F.-A., Bilsland,C.A.G. and Milstein,C. TITLE Structure and expression of the human thymocyte antigens CD1a, CD1b, and CD1c JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9189-9193 (1987) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly provided by C.A.G.Bilsland, 08-NOV-1988. FEATURES from to/span description pept + 11 + 104 thymocyte antigen CD1b precursor, exon 5 /nomgen="CD1" /map="1" /hgml_locus_uid="LU0071V" matp + 11 + 104 thymocyte antigen CD1b pre-msg < 1 > 114 CD1b mRNA and intron IVS < 1 10 CD1b intron D IVS 105 > 114 CD1b intron E BASE COUNT 22 A 27 C 26 G 39 T ORIGIN About 239 bp after segment 4; chromosome 1. 1 CAATTGCCAG GAAACCCCAC CTCCATTGGC TCAATTGTTT TGGCAATAAT AGTGCCTTCC 61 TTGCTTCTTT TGCTATGCCT TGCATTATGG TATATGAGGC GCCGGTGAGT TGGT // LOCUS HUMCD1B6 32 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human cortical thymocyte antigen CD1b, exon 6. ACCESSION M22173 M18230 J03584 KEYWORDS thymocyte antigen. SEGMENT 6 of 6 SOURCE Human DNA, clone lambda-R1L5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 32) AUTHORS Martin,L.H., Calabi,F., Lefebvre,F.-A., Bilsland,C.A.G. and Milstein,C. TITLE Structure and expression of the human thymocyte antigens CD1a, CD1b, and CD1c JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9189-9193 (1987) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly provided by C.A.G.Bilsland, 08-NOV-1988. FEATURES from to/span description pept + 11 32 thymocyte antigen CD1b precursor, exon 6 /nomgen="CD1" /map="1" /hgml_locus_uid="LU0071V" matp + 11 29 thymocyte antigen CD1b pre-msg < 1 > 32 CD1b mRNA and intron IVS < 1 10 CD1b intron E BASE COUNT 11 A 6 C 4 G 11 T ORIGIN About 553 bp after segment 5; chromosome 1. 1 TTTTTAACAG GTCATATCAG AATATCCCAT GA // LOCUS HUMCD1C1 337 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human cortical thymocyte antigen CD1c, exon 1. ACCESSION M22174 M18231 J03584 KEYWORDS thymocyte antigen. SEGMENT 1 of 6 SOURCE Human DNA, clone lambda-R7L4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 337) AUTHORS Martin,L.H., Calabi,F., Lefebvre,F.-A., Bilsland,C.A.G. and Milstein,C. TITLE Structure and expression of the human thymocyte antigens CD1a, CD1b, and CD1c JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9189-9193 (1987) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly provided by C.A.G.Bilsland, 08-NOV-1988. FEATURES from to/span description pept 272 + 332 thymocyte antigen CD1c precursor, exon 1 /nomgen="CD1" /map="1" /hgml_locus_uid="LU0071V" sigp 272 322 thymocyte antigen CD1c signal peptide matp 323 + 332 thymocyte antigen CD1c pre-msg 8 > 337 CD1c mRNA and intron IVS 333 > 337 CD1c intron A BASE COUNT 105 A 48 C 105 G 79 T ORIGIN Chromosome 1. 1 CAGGAAGGGA AGTAGATATA ATGGAGCTTA GTGGCAGAGC AGCTGGAATC CTGAGAGAAG 61 AGAATAACTT TAGTTCAGAG CAGGTGGGGA AATGAGAGAT TGAGTAGGAG GACCAAGGTT 121 GAGGGAAGCA GATCTTCTTA GTTGCTGTCA GCGGCTGATG GGGAAGATTG TTGGTAGAAG 181 GAAGTCAGAA TATAGGTACA GAGGGATAAG TTTGCTAAGA ACAGAGATCA GCAAACAGCT 241 TTTCTGAGAG AAAGAAACAT CTGCAAATGA CATGCTGTTT CTGCAGTTTC TGCTGCTAGC 301 TCTTCTTCTC CCAGGTGGTG ACAATGCAGA CGGTAAG // LOCUS HUMCD1C3 277 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human cortical thymocyte antigen CD1c gene, exon 2. ACCESSION M22175 M18231 J03584 KEYWORDS thymocyte antigen. SEGMENT 3 of 6 SOURCE Human DNA, clone lambda-R7L4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 277) AUTHORS Martin,L.H., Calabi,F., Lefebvre,F.-A., Bilsland,C.A.G. and Milstein,C. TITLE Structure and expression of the human thymocyte antigens CD1a, CD1b, and CD1c JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9189-9193 (1987) STANDARD full staff_entry COMMENT Draft entry and printed copy for [1] kindly provided by C.A.G.Bilsland, 08-NOV-1988. FEATURES from to/span description pept + 1 + 267 thymocyte antigen CD1c precursor, exon 2 /nomgen="CD1" /map="1" /hgml_locus_uid="LU0071V" matp + 1 + 267 thymocyte antigen CD1c pre-msg < 1 > 277 CD1c mRNA and intron IVS 268 > 277 CD1c intron B BASE COUNT 75 A 65 C 63 G 74 T ORIGIN About 393 bp after segment 2; chromosome 1. 1 CATCCCAGGA ACACGTCTCC TTCCATGTCA TCCAGATCTT CTCATTTGTC AACCAATCCT 61 GGGCACGAGG TCAGGGCTCA GGATGGCTGG ACGAGTTGCA GACTCATGGC TGGGACAGTG 121 AATCAGGCAC AATAATTTTC CTGCATAACT GGTCCAAGGG CAACTTCAGC AATGAAGAGT 181 TGTCAGACCT AGAGTTGTTA TTTCGTTTCT ACCTCTTTGG ATTAACTCGG GAGATTCAAG 241 ACCATGCAAG TCAAGATTAC TCGAAATGTA AGTTCAA // LOCUS HUMCD1C4 302 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human cortical thymocyte antigen CD1c gene, exon 3. ACCESSION M22176 M18231 J03584 KEYWORDS thymocyte antigen. SEGMENT 4 of 6 SOURCE Human DNA, clone lambda-R7L4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 302) AUTHORS Martin,L.H., Calabi,F., Lefebvre,F.-A., Bilsland,C.A.G. and Milstein,C. TITLE Structure and expression of the human thymocyte antigens CD1a, CD1b, and CD1c JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9189-9193 (1987) STANDARD full staff_entry COMMENT Draft entry and printed copy for [1] kindly provided by C.A.G.Bilsland, 08-NOV-1988. FEATURES from to/span description pept + 11 + 292 thymocyte antigen CD1c precursor, exon 3 /nomgen="CD1" /map="1" /hgml_locus_uid="LU0071V" matp + 1 + 268 thymocyte antigen CD1c pre-msg < 1 > 302 CD1c mRNA and intron IVS < 1 10 CD1c intron B IVS 293 > 302 CD1c intron C BASE COUNT 79 A 68 C 74 G 81 T ORIGIN About 663 bp after segment 3; chromosome 1. 1 CTTCCTCCAG ATCCCTTTGA AGTACAGGTG AAAGCGGGCT GTGAGCTGCA TTCTGGAAAG 61 AGCCCAGAAG GCTTCTTTCA GGTAGCTTTC AACGGATTAG ACTTACTGAG TTTCCAGAAT 121 ACAACATGGG TGCCATCTCC AGGCTGTGGA AGTTTGGCCC AAAGTGTCTG TCATCTACTC 181 AATCATCAGT ATGAAGGCGT CACAGAAACA GTGTATAATC TCATAAGAAG CACTTGCCCC 241 CGATTTCTCT TGGGTCTCCT TGATGCAGGG AAGATGTATG TACACAGGCA AGGTCAGTAG 301 TT // LOCUS HUMCD1C5 299 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human cortical thymocyte antigen CD1c gene, exon 4. ACCESSION M22177 M18231 J03584 KEYWORDS thymocyte antigen. SEGMENT 5 of 6 SOURCE Human DNA, clone lambda-R7L4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 299) AUTHORS Martin,L.H., Calabi,F., Lefebvre,F.-A., Bilsland,C.A.G. and Milstein,C. TITLE Structure and expression of the human thymocyte antigens CD1a, CD1b, and CD1c JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9189-9193 (1987) STANDARD full staff_entry COMMENT Draft entry and printed copy for [1] kindly provided by C.A.G.Bilsland, 08-NOV-1988. FEATURES from to/span description pept + 11 + 289 thymocyte antigen CD1c precursor, exon 4 /nomgen="CD1" /map="1" /hgml_locus_uid="LU0071V" matp + 1 + 289 thymocyte antigen CD1c pre-msg < 1 > 299 CD1c mRNA and intron IVS < 1 10 CD1c intron C IVS 290 > 299 CD1c intron D BASE COUNT 59 A 74 C 92 G 74 T ORIGIN About 203 bp after segment 4; chromosome 1. 1 CTCTCTGCAG TGAGGCCAGA AGCCTGGCTG TCCAGTCGCC CCAGCCTTGG GTCTGGCCAG 61 CTGTTGCTGG TTTGTCATGC CTCCGGCTTC TACCCAAAGC CTGTTTGGGT GACATGGATG 121 CGGAATGAAC AGGAGCAACT GGGCACTAAA CATGGTGATA TTCTTCCTAA TGCTGATGGG 181 ACATGGTATC TTCAGGTGAT CCTGGAGGTG GCATCTGAGG AGCCTGCTGG CCTGTCTTGT 241 CGAGTGAGAC ACAGCAGTCT AGGAGGCCAG GACATCATCC TCTACTGGGG TAAGACTGG // LOCUS HUMCD1C6 105 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human cortical thymocyte antigen CD1c gene, exon 5. ACCESSION M22178 M18231 J03584 KEYWORDS thymocyte antigen. SEGMENT 6 of 6 SOURCE Human DNA, clone lambda-R7L4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 105) AUTHORS Martin,L.H., Calabi,F., Lefebvre,F.-A., Bilsland,C.A.G. and Milstein,C. TITLE Structure and expression of the human thymocyte antigens CD1a, CD1b, and CD1c JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 9189-9193 (1987) STANDARD full staff_entry COMMENT Draft entry and printed copy for [1] kindly provided by C.A.G.Bilsland, 08-NOV-1988. FEATURES from to/span description pept + 11 105 thymocyte antigen CD1c precursor, exon 5 /nomgen="CD1" /map="1" /hgml_locus_uid="LU0071V" matp + 1 105 thymocyte antigen CD1c pre-msg < 1 > 105 CD1c mRNA and intron IVS < 1 10 CD1c intron D BASE COUNT 24 A 17 C 27 G 37 T ORIGIN About 306 bp after segment 4; chromosome 1. 1 ATATGTGCAG GACACCACTT TTCCATGAAT TGGATTGCCT TGGTAGTGAT AGTGCCCTTG 61 GTGATTCTAA TAGTCCTTGT GTTATGGTTT AAGAAGCACT GGTGA // LOCUS HUMCD2R1 704 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human CD2 antigen gene, exons 1 and 2. ACCESSION X07871 KEYWORDS CD2 antigen; glycoprotein; surface antigen; surface glycoprotein. SEGMENT 1 of 4 SOURCE Human T cell line HPB-ALL DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 704) AUTHORS Lang,G., Wotton,D., Owen,M.J., Sewell,W.A., Brown,M.H., Mason,D.Y., Crumpton,M.J. and Kioussis,D. TITLE The structure of the human CD2 gene and its expression in transgenic mice JOURNAL EMBO J. 7, 1675-1682 (1988) STANDARD simple automatic COMMENT *source: library=cos202; clone=CD2-cos1, *source: CD2-cos4, CD2-cos7 and CD2-cos10; see x07871 - x07884 for CD2 antigen gene, exon seq. FEATURES from to/span description pept 225 285 CD2 antigen precursor, exon 1 /nomgen="CD2" /map="1p13" /hgml_locus_uid="LD0093J" 363 + 683 CD2 antigen precursor, exon 2 sigp 225 285 CD2 antigen signal peptide 363 373 CD2 antigen signal peptide matp 374 + 683 CD2 antigen pre-msg 162 > 704 CD2 mRNA and introns IVS 286 362 CD2 intron A IVS 684 > 704 CD2 intron B BASE COUNT 239 A 126 C 144 G 195 T ORIGIN Chromosome 1p13. 1 AAAACACACA CTCATAAACA CATCTGCTTT GGCAAAGGAG CACATCAGAA GGGCTGGCTT 61 GTGCGCGCTC TTGCTCTCTG TGTATGTGTA TTATGTTTTA TGTTACTGTA AAAGATGTAA 121 AGAGAGGCAC GTGGTTAAGC TCTCGGGGTG TGGACTCCAC CAGTCTCACT TCAGTTCCTT 181 TTGCATGAAG AGCTCAGAAT CAAAAGAGGA AACCAACCCC TAAGATGAGC TTTCCATGTA 241 AATTTGTAGC CAGCTTCCTT CTGATTTTCA ATGTTTCTTC CAAAGGTAAG CATAAGAGTC 301 AAAGAAGTCC CAACCCAGCT TTCCCTGAAA GTGACTCTCA GTAACTCTTT TGCTTTTTAT 361 AGGTGCAGTC TCCAAAGAGA TTACGAATGC CTTGGAAACC TGGGGTGCCT TGGGTCAGGA 421 CATCAACTTG GACATTCCTA GTTTTCAAAT GAGTGATGAT ATTGACGATA TAAAATGGGA 481 AAAAACTTCA GACAAGAAAA AGATTGCACA ATTCAGAAAA GAGAAAGAGA CTTTCAAGGA 541 AAAAGATACA TATAAGCTAT TTAAAAATGG AACTCTGAAA ATTAAGCATC TGAAGACCGA 601 TGATCAGGAT ATCTACAAGG TATCAATATA TGATACAAAA GGAAAAAATG TGTTGGAAAA 661 AATATTTGAT TTGAAGATTC AAGGTAAGTG TTCATTCCCT TAAT // LOCUS HUMCD2R2 273 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human CD2 antigen gene, exon 3. ACCESSION X07872 KEYWORDS CD2 antigen; glycoprotein; surface antigen; surface glycoprotein. SEGMENT 2 of 4 SOURCE Human T cell line HPB-ALL DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 273) AUTHORS Lang,G., Wotton,D., Owen,M.J., Sewell,W.A., Brown,M.H., Mason,D.Y., Crumpton,M.J. and Kioussis,D. TITLE The structure of the human CD2 gene and its expression in transgenic mice JOURNAL EMBO J. 7, 1675-1682 (1988) STANDARD simple automatic COMMENT *soure: library=cos202; *source: clone=CD2-cos1, CD2-cos4, CD2-cos7 and CD2-cos10; see x07871 - x07874 FEATURES from to/span description pept + 22 + 252 CD2 antigen precursor, exon 3 /nomgen="CD2" /map="1p13" /hgml_locus_uid="LD0093J" matp + 22 + 252 CD2 antigen pre-msg < 1 > 273 CD2 mRNA and introns IVS < 1 21 CD2 intron B IVS 253 > 273 CD2 intron C BASE COUNT 84 A 66 C 63 G 60 T ORIGIN About 5.7 kb after segment 1; chromosome 1p13. 1 GAATCTTTAC TTTCTGTTTA GAGAGGGTCT CAAAACCAAA GATCTCCTGG ACTTGTATCA 61 ACACAACCCT GACCTGTGAG GTAATGAATG GAACTGACCC CGAATTAAAC CTGTATCAAG 121 ATGGGAAACA TCTAAAACTT TCTCAGAGGG TCATCACACA CAAGTGGACC ACCAGCCTGA 181 GTGCAAAATT CAAGTGCACA GCAGGGAACA AAGTCAGCAA GGAATCCAGT GTCGAGCCTG 241 TCAGCTGTCC AGGTGCGTGG CGGGCATCAC TTC // LOCUS HUMCD2R3 165 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human CD2 antigen gene, exon 4. ACCESSION X07873 KEYWORDS CD2 antigen; glycoprotein; surface antigen; surface glycoprotein. SEGMENT 3 of 4 SOURCE Human T cell line HPB-ALL DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 165) AUTHORS Lang,G., Wotton,D., Owen,M.J., Sewell,W.A., Brown,M.H., Mason,D.Y., Crumpton,M.J. and Kioussis,D. TITLE The structure of the human CD2 gene and its expression in transgenic mice JOURNAL EMBO J. 7, 1675-1682 (1988) STANDARD simple automatic COMMENT *source: library=cos202; *source: clone=CD2-cos1, CD2-cos4, CD2-cos7 and CD2-cos10; see x07871 - x07874 FEATURES from to/span description pept + 22 + 144 CD2 antigen precursor, exon 4 /nomgen="CD2" /map="1p13" /hgml_locus_uid="LD0093J" matp + 22 + 144 CD2 antigen pre-msg < 1 > 165 CD2 mRNA and introns IVS < 1 21 CD2 intron C IVS 145 > 165 CD2 intron D BASE COUNT 40 A 38 C 38 G 49 T ORIGIN About 3.4 kb after segment 2; chromosome 1p13. 1 CCACTTCTCT TCCTTTTGCA GAGAAAGGTC TGGACATCTA TCTCATCATT GGCATATGTG 61 GAGGAGGCAG CCTCTTGATG GTCTTTGTGG CACTGCTCGT TTTCTATATC ACCAAAAGGA 121 AAAAACAGAG GAGTCGGAGA AATGGTAAGC TCCCCCTCTT TTGTC // LOCUS HUMCD2R4 807 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human CD2 antigen gene, exon 5. ACCESSION X07874 KEYWORDS CD2 antigen; glycoprotein; surface antigen; surface glycoprotein. SEGMENT 4 of 4 SOURCE Human T cell line HPB-ALL DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 807) AUTHORS Lang,G., Wotton,D., Owen,M.J., Sewell,W.A., Brown,M.H., Mason,D.Y., Crumpton,M.J. and Kioussis,D. TITLE The structure of the human CD2 gene and its expression in transgenic mice JOURNAL EMBO J. 7, 1675-1682 (1988) STANDARD simple automatic COMMENT *source: library=cos202; *source: clone=CD2-cos1, CD2-cos4, CD2-cos7 and CD2-cos10; see x07871 - x07874 FEATURES from to/span description pept + 23 342 CD2 antigen precursor, exon 5 /nomgen="CD2" /map="1p13" /hgml_locus_uid="LD0093J" matp + 23 339 CD2 antigen pre-msg < 1 > 775 CD2 mRNA and introns IVS < 1 22 CD2 intron D BASE COUNT 204 A 228 C 179 G 196 T ORIGIN About 3.9 kb after segment 3; chromosome 1p13. 1 TATTGAGGTT TTGTTGTTGC AGATGAGGAG CTGGAGACAA GAGCCCACAG AGTAGCTACT 61 GAAGAAAGGG GCCGGAAGCC CCACCAAATT CCAGCTTCAA CCCCTCAGAA TCCAGCAACT 121 TCCCAACATC CTCCTCCACC ACCTGGTCAT CGTTCCCAGG CACCTAGTCA TCGTCCCCCG 181 CCTCCTGGAC ACCGTGTTCA GCACCAGCCT CAGAAGAGGC CTCCTGCTCC GTCGGGCACA 241 CAAGTTCACC AGCAGAAAGG CCCGCCCCTC CCCAGACCTC GAGTTCAGCC AAAACCTCCC 301 CATGGGGCAG CAGAAAACTC ATTGTCCCCT TCCTCTAATT AAAAAAGATA GAAACTGTCT 361 TTTTCAATAA AAAGCACTGT GGATTTCTGC CCTCCTGATG TGCATATCCG TACTTCCATG 421 AGGTGTTTTC TGTGTGCAGA ACATTGTCAC CTCCTGAGGC TGTGGGCCAC AGCCACCTCT 481 GCATCTTCGA ACTCAGCCAT GTGGTCAACA TCTGGAGTTT TTGGTCTCCT CAGAGAGCTC 541 CATCACACCA GTAAGGAGAA GCAATATAAG TGTGATTGCA AGAATGGTAG AGGACCGAGC 601 ACAGAAATCT TAGAGATTTC TTGTCCCCTC TCAGGTCATG TGTAGATGCG ATAAATCAAG 661 TGATTGGTGT GCCTGGGTCT CACTACAAGC AGCCTATCTG CTTAAGAGAC TCTGGAGTTT 721 CTTATGTGCC CTGGTGGACA CTTGCCCACC ATCCTGTGAG TAAAAGTGAA ATAAAAGCTT 781 TGACTAGACC CGTGTCTGCT CATTGTG // LOCUS HUMCEAA 1467 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human carcinoembryonic antigen gene, exon X. ACCESSION M16337 KEYWORDS carcinoembryonic antigen; glycoprotein. SOURCE Human (normal or tumor) colon DNA, clone lambda39.2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1467) AUTHORS Thompson,J.A., Pande,H., Paxton,R.J., Shively,L., Padma,A., Simmer,R.L., Todd,C.W., Riggs,A.D. and Shively,J.E. TITLE Molecular cloning of a gene belonging to the carcinoembryonic antigen gene family and discussion of a domain model JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 2965-2969 (1987) STANDARD simple staff_review FEATURES from to/span description pept < 935 / 1294 carcinoembryonic antigen, exon X (AA at 937) /nomgen="CEA" /map="19q13.1-q13.2" /hgml_locus_uid="LN0164S" sigp < 935 972 carcinoembryonic antigen signal peptide matp 973 1294 carcinoembryonic antigen IVS < 1 934 CA, intron A IVS 1295 > 1467 CA, intron B BASE COUNT 458 A 348 C 365 G 296 T ORIGIN Chromosome 19q13.1-q13.3. 1 CTGCAGATTG CATGTCCCCT GGAAGGAGGT CCTGCTCACA GGTGAGGGGA GGACTCCCTC 61 GGAGTGGATG GGAGGAGGGA GCACAGAGAC TGGCTAGGGT CTCCTGGGGA GGACAAGGCT 121 CTGAGAGGAG ACAGAGGGCT TTTGTTGAAG CCTGAGGAAA CAGAACACCA GAGAGGGACA 181 GGGGTCACAA CAGGAAAGTC ACACTAAACT GGGATTGATA AAAAGGGAGG AAAATCAATT 241 GATCATGTTT TCCAAGTTAA TCATCATTTG TCATTACCAT TTGAAAAAAA AGAAAAATGA 301 TAGAAATCAG AACTGCATTA GGATGACACT CCAAATAAAA ATATAACAAG GAAACTAAAT 361 GCTGCCCTTA CTCACCAATC AGAAGTTGAA AAATAACCAC CAGATACACT CATTAACTCA 421 TCCACAAGCA TTTGCAATCA ATTTTAGTCA ATGGCATACA ACAAGCATCA GACAAGTCTC 481 AGTCATCACA GAGCTTATGC TGTCATGAAG AGGAAAACAC ACACACAAAG AGATATAGAA 541 TGTGAGGTCA GGTGTTGACA AGAGCCCTGG AAGGAACAGA GCAGGGAAAG GTCAGAAAGA 601 AAAGACCCAG GGTCTGTAGA GGGGGTGTCA GGGAAGGGAT CTCCCAAGAA TGCCCTGATG 661 TGAGCAGGAC CTGAGGCCAG TGGGGAGGGA GCCATGCAGA CCCCTGGGGA AGAGCATTCC 721 ACACAGGGAA ATGCCAAGGT CAAAGGTGCT GAAGGAATGG GGGTGTCACA CTGCTGACTT 781 TGACTCAGTA GGACACACAC ACACACACAC ACACACACAC ACACACACGC TCCAACGTGG 841 AGGGGTGAAG AGACCTGCTC AGGACCCAGG GCCCTGTTTT TCCACCCTAA TGCATAGGTC 901 CCAATATTGA CCGATGCTCT CTCCTCTCTC CTAGCCTCAC TTCTAACCTT CTGGAACCCA 961 CCCACCACTG CCAAGCTCAC TATTGAATCC ACGCCATTCA ATGTCGCAGA GGGGAAGGAG 1021 GTTCTTCTAC TCGCCCACAA CCTGCCCCAG AATCGTATTG GTTACAGCTG GTACAAAGGC 1081 GAAAGAGTGG ATGGCAACAG TCTAATTGTA GGATATGTAA TAGGAACTCA ACAAGCTACC 1141 CCAGGGCCCG CATACAGTGG TCGAGAGACA ATATACCCCA ATGCATCCCT GCTGATCCAG 1201 AACGTCACCC AGAATGACAC AGGATTCTAT ACCCTACAAG TCATAAAGTC AGATCTTGTG 1261 AATGAAGAAG CAACCGGACA GTTCCATGTA TACCGTGAGT ATTTCCACAT GACCTCTGGG 1321 TGTTGGGGGT CAGTTCTACT TCCCACATAC GGGATTGTCA GGCCTGGGTT GTGCCTGTGG 1381 CCCTCTCTGC ATTACATCCT GTATCAGGGT TTGGACATTT AGTGCAGGAC ACACACGGGG 1441 AAGACAAACT TCCACAGATC AGAATTC // LOCUS HUMCEAB 2690 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human carcinoembryonic antigen (CEA) hsCGM1 gene, exons 1 and 2. ACCESSION M22433 D51537 KEYWORDS carcinoembryonic antigen. SOURCE Human fetal liver (lambda-hsCHM1-1 library) DNA, clone hsCHM1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2690) AUTHORS Thompson,J.A., Mauch,E.-M., Chen,F.-S., Hinoda,Y., Schrewe,H., Berling,B., Barnert,S., von Kleist,S. and Shively,J.E. TITLE Analysis of the size of the carcinoembryonic antigen (CEA) gene family: Isolation and sequencing of N-terminal domain exons JOURNAL Biochem. Biophys. Res. Commun. 158, 996-1004 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by J.A. Thompson 07-FEB-1989. FEATURES from to/span description pept 1250 1313 carcinoembryonic antigen hsCHM1, exon 1 /nomgen="CEA" /map="19q13.1-q13.3" /hgml_locus_uid="LN0164S" 2158 / 2517 carcinoembryonic antigen hsCHM1, exon 2 IVS 1314 2157 carcinoembryonic antigen hsCHM1, intron A IVS 2518 > 2690 carcinoembryonic antigen hsCHM1, intron B BASE COUNT 823 A 637 C 653 G 577 T ORIGIN Chromosome 19q31.1-q31.3. 1 GAATTCCACA GCAATAACCA CGATGACAAC CACCATGTAC TCAACACCCG CCTGGGCACG 61 GGGCTCCCAC AGCAGCTCAC TTATTCCCAA CAACTCTGCA AGGAGGATTT TACCATCCTC 121 CTTTTACAAA TCAGGGAATC AAGGATCATA GAAGCCACGT GCACTTGTCC AAGTCAACAT 181 AGTTAAGTGA CAGAACCATT AGCTGTCCCC AGGTACATCT GGACATAAAG TTCATGTTTA 241 TGCCACTGTG TCAGCATTTC CAAAAACTGA TTTTAGGCGA AACGTAAGTA AGCTTTTTAA 301 AAACTTTAAT ACTTATGCGT TTATTTTAAT ATACATTGAG AAAACATTTA AGCACACATC 361 AAATCTGTAA TTTCATGGAC AATATTGCAT AAGACAAGGA TGTTTTGTCT CCAACTCCTG 421 GCCTCAAGCC ATCCTCCCAC CATAGCTCTC GAGTAGCTGG GATCACAGAC TGAGCCACCG 481 ATCCCTGCTA GGACAGGATG TTTTGTAAAC TAAATTTATT TAGAAAAAAG GATGAAATAT 541 ATAATAATAA AGGTGGTACA AGCTAGAGAG AAAATCATAA AGTCAGCCTA GAAATGTCTG 601 GTGTCTGGAT GACATAAAGC TACAGCACTG TGAAGCCTCA TTCTCAGTTA CTCCCAGGAA 661 ATTAGAGTCA CATAATGCTG CAGAAAGAAC AGCTCAGAAT CTTAGATCCG GCTTTAGCCC 721 TAGATATATC CATTTGTAGG ACCCCAGACA TCTCTGTGAC CTCCTTGCTG GGAGTAAATC 781 CAACCTTCCC AGACATGTGA GAACAGTAAG AAGACCCTGC ACACACAAAG GAGTTTCTCC 841 GTCACAGAGA AAATAACACC AGGTTCAGGG ACCCCAGGGA CTCTGCATGG TGCTGACAGA 901 CCCAAGGCCA AGGCAGAGCA GAGGTCCACG CTGGGGAGGG AGGGTCATCC TGTTATGAAA 961 CAGGGATCCA AGTAAGCCTT GCTTCTCAGA GCCTGGTCTG GGCAACTCAA ATGTAGACAG 1021 AAGGCCCCAA GGAAGAAGAG AAAATGAGGC AAAACTGAGA GGGGAGGGGA CAGAGAGGTG 1081 ACCTGGGCAG AGCTTCACCC ATGACCCTGG AAAGTGCTCC TGCCCTGGGA GGAGGCTCAG 1141 CATGGAAAGA GGAAGGACAG CAGAGCCTAA GTCACAGTAG CCCTGACTAC AGCATTCCTG 1201 GAGCCCAGGC TCTTTTCCAC AGAGGAGGAA AGAGCAGGCA GCAGAGACCA TGGGGCCCCC 1261 CTCAGCCTCT CCCCACAGAG AATGCATCCC CTGGCAGGGG CTTCTGCTCA CAGGTGAGTG 1321 GAGGATTCCT GGGAGTGGGC AAGAGGAGGG ATCACAGAGA ATGGCTGGGG TCTCCTGGGG 1381 AGGATGGGGC TCTGATAGGG GACAGAAGGC TTCTGCTGAA GCCTCAGGGG AGAGAACATC 1441 AGAGAGGGAC ACGGGTCACA ACAAGACAAT CACATTGAAC TGGGATTGAT AAGAGGGAGG 1501 AAAATCCATT GATCATGTTT TCCAAGTTAA TCATTACTGG CCACTACAAT TAGAAAATGA 1561 TAAGAATAAG AATTACATCA GGGTGATACT TTAAATAAAA ATATAACCAG GGCACTAAAA 1621 CCTGTCTTTG CCCCAACCAC AAGTTGCAAA ATAACCACCA CTCCTTAACT CATCCACCAG 1681 TATTTGCAAT CAAATTTTAG GCACTGGCGT ACAACAAATA TCAGACAAGT CTCTGTGTTC 1741 AAAGAGCTTA CACTCTTGCA GAGATGAAGA TAGACACCCA AAGAGATCTA GAATGTGAGT 1801 TCAGGTGTTG ACAAGAGCCC TGGAGGGAAC AGAGCAGAAA AAGGTCAGAA AGGGACGCCC 1861 CAGGGTCTCT AGAGGAGGTG TCAGGGGAGG GATCTCCCAA GGATGCCCTG ATGTGAGCAG 1921 GATCTGAGGG CAGTGGGGAG GGAGCCATGC AGACCCCTGG GGAAGGGGAT TCCACACAGG 1981 AAAATGCCAA GGTTAGAGGT GCTGAAGAAA GAAAGGTCAC GTTACTGACC TTAACCAAGT 2041 GGGACACACC TACACTCTCA AGGCTGAAGG GAGAAGAGAC TCTCTCAGGA CCCAGGGCCC 2101 CATCTTTCCA TCCCAATACA TGGGTACCAA TATTGACTGA TGCTTTCTCC CTCCTAGCCT 2161 CACTTCTAAA CTTCTGGAAC CCGCCCACCA CTGCCAAGCT CACTATTGAA TCCACGCCGT 2221 TCAATGTCGC AGAGGGGAAG GAGGTGCTTC TACTTGTCCA CAATCTGCCC CAACATCTTT 2281 TTGGCTACAG CTGGTACAAA GGGGAAAGAG TGGATGGCAA CAGTCTAATT GTAGGATATG 2341 TAATAGGAAC TCAACAAGCT ACCCCAGGGG CCGCATACAG CGGTCGAGAG ACAATATACA 2401 CCAATGCATC CCTGCTGATC CACAATGTCA CCCAGAATGA CATAGGATTC TACACCCTAC 2461 AAGTCATAAA GTCAGATCTT GTGAATGAAG AAGCAACTGG ACAGTTCCAT GTATACCGTG 2521 AGTATTTCCA CATGACCTCT GGAGGTTGGG GGTCAGTTCT ACTTCCCACA TATGGGATTG 2581 TACGGCCTGG GCTGTGCCTC TGGCCCTCTC TGCATTACAT TCTGTATCAG GGTTTGGACA 2641 TTTAGTGCAG GACACACACG GGGGAGACAA ACTTCCACAG ACTAGAATTC // LOCUS HUMCEAC 781 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human carcinoembryonic antigen (CEA) hsCGM2 gene, exon 2. ACCESSION M22434 D51537 KEYWORDS carcinoembryonic antigen. SOURCE Human fetal liver (lambda-hsCGM2-1 library) DNA, clone hsCGM2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 781) AUTHORS Thompson,J.A., Mauch,E.-M., Chen,F.-S., Hinoda,Y., Schrewe,H., Berling,B., Barnert,S., von Kleist,S. and Shively,J.E. TITLE Analysis of the size of the carcinoembryonic antigen (CEA) gene family: Isolation and sequencing of N-terminal domain exons JOURNAL Biochem. Biophys. Res. Commun. 158, 996-1004 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by J.A. Thompson 07-FEB-1989. FEATURES from to/span description pept / 221 / 583 carcinoembryonic antigen hsCGM2, exon 2 (AA at 223) /nomgen="CEA" /map="19q13.1-q13.3" /hgml_locus_uid="LN0164S" IVS < 1 220 carcinoembryonic antigen, intron A IVS 584 > 781 carcinoembryonic antigen, intron B BASE COUNT 220 A 195 C 184 G 182 T ORIGIN Chromosome 19q31.1-q31.3. 1 CTGCAGATCC CTGGGGAAGA GGATTCCGAA CAGGGAAATG TAAGGTCAGA GGTGCTGATA 61 GGGGACATGC TGCTGTCATT GATCCAGTAG GACACACACA CACACACACA CTTACTTCAA 121 GATGGGGGTG GGTGAAGAGA CCTGCTCAGG ATCCAGGGCC CCATCTTTCC ACCCCAATAC 181 ATAGGTCCCA ATATTGACTG ATGTTCTCTC CCCCTCCTAG CCTCGCTTTT AACCTTCTGG 241 AACCTGCCAA ACAGTGCCCA GACCAATATT GATGTCGTGC CGTTCAATGT CGCAGAAGGG 301 AAGGAGGTCC TTCTAGTAGT CCATAATGAG TCCCAGAATC TTTATGGCTA CAACTGGTAC 361 AAAGGGGAAA GGGTGCATGC CAACTATCGA ATTATAGGAT ATGTAAAAAA TATAAGTCAA 421 GAAAATGCCC CAGGGCCCGC ACACAACGGT CGAGAGACAA TATACCCCAA TGGAACCCTG 481 CTGATCCAGA ACGTCACCCA CAATGACGCA GGATTCTATA CCCTACACGT TATAAAAGAA 541 AATCTTGTGA ATGAAGAAGT AACCAGACAA TTCTACGTAT TCTGTGAGTG ATACCTCCAT 601 GACTTCTGGG TGCTGGGGGC CAGTTCTACT TCATACACAC GGGGTTGTCA GGCCTGGGTT 661 GTGCCTGTGT CCCCATCTAC ATTTTATCCA GTGTTGGAGT TTGGGCATTT AGTGAAGGAC 721 ACACATGGGG GAGACAAACT TCTACAGACC AGAATCCCTT TCCTGCATCC AGACCCTGCA 781 G // LOCUS HUMCEAD 1026 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human carcinoembryonic antigen (CEA) hsCGM3 gene, exon 2. ACCESSION M22435 D51537 KEYWORDS carcinoembryonic antigen. SOURCE Human fetal liver (lambda-hsCGM3-1 library) DNA, clone hsCGM3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1026) AUTHORS Thompson,J.A., Mauch,E.-M., Chen,F.-S., Hinoda,Y., Schrewe,H., Berling,B., Barnert,S., von Kleist,S. and Shively,J.E. TITLE Analysis of the size of the carcinoembryonic antigen (CEA) gene family: Isolation and sequencing of N-terminal domain exons JOURNAL Biochem. Biophys. Res. Commun. 158, 996-1004 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by J.A. Thompson 07-FEB-1989. FEATURES from to/span description pept / 484 / 846 carcinoembryonic antigen hsCGM3, exon 2 (AA at 486) /nomgen="CEA" /map="19q13.1-q13.3" /hgml_locus_uid="LN0164S" IVS < 1 483 carcinoembryonic antigen hsCGM3, intron A IVS 847 > 1026 carcinoembryonic antigen hsCGM3, intron B BASE COUNT 291 A 235 C 264 G 236 T ORIGIN Chromosome 19q31.1-q31.3. 1 GAGCTCACAC TCTCATGGGG AGGAAGACAG ACATGCAAAG AGATATAGAA TGTGAGGTCA 61 GGTGTTGACA AGAACCCTAG AGGGAGCAGA GCAGGGAAAG GTCAGAAAGG GAAGACCCAG 121 GGTCTCTGAA GCAGGCATCA GGAAAGAAGT CTAAGGATGC CCTGATGTGA GCAGGACCTG 181 AGGGCAGTGT GGAGGGGGCC GTGCGGACCC CTGGGGAAGA GGATTGCAAA CAGAAAAATG 241 CCAAGGTCAG GAGTGTTGAA GGAATGGGGG TCATGCTGCT GACCTTGACC TAGTAGGACA 301 GTAGGACACA CACACATACA CACACACAAA CACACATGCC CTTTTGTGTG TGTGTGTTTG 361 TATGTGTGTG TGTGCATATC TTCAAGGCTG ATGATTGAAG AGACCTTCTC AGGACACAGG 421 GCCCCATCTT TTCACCCCAA TACATAGGTC CAAATATTAA CTGATGCTGT CTCTACCTCC 481 TAGCATCACT TTTAAACTTC TGGAACCTGC CCACCACTGC CCAAGTAATA ATTGAAGCCA 541 AGCCACCCAA AGTTTCCGAG GGGAAGGATG TTCTTCTACT TGTCCACAAT TTGCCCCAGA 601 ATCTTACTGG CTACATCTGG TACAAAGGGC AAATGACGGA CCTCTACCAT TACATTACAT 661 CATATGTAGT ACACGGTCAA ATTATATATG GGCCTGCCTA CAGTGGACGA GAAACAGTAT 721 ATTCCAATGC ATCCCTGCTG ATCCAGAATG TCACACAGGA GGATGCAGGA TCCTACACCT 781 TACACATCAT AAAGCGAGGC GATGGGACTG GAGGAGTAAC TGGATATTTC ACTGTCACCT 841 TATACTGTGA GTGATTCCGC ATGATCCCTG GGTGTTGGGG GGCAGGGGTC ATTTCTACTT 901 CACACACACA GAATTGTCAG GCCTGGACTC TGCCTGTGTC ACTCTCTGCA TTATGTCCCA 961 TGCTGGGGTT TGGGCATTTA GTGCAGGACA CACACAGAGG AGACACATTT CAACAGATCA 1021 GAATTC // LOCUS HUMCEAE 1010 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human carcinoembryonic antigen (CEA) hsCGM4 gene, exon 2. ACCESSION M22436 D51537 KEYWORDS carcinoembryonic antigen. SOURCE Human fetal liver (lambda-hsCGM4-1 library) DNA, clone hsCGM4. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1010) AUTHORS Thompson,J.A., Mauch,E.-M., Chen,F.-S., Hinoda,Y., Schrewe,H., Berling,B., Barnert,S., von Kleist,S. and Shively,J.E. TITLE Analysis of the size of the carcinoembryonic antigen (CEA) gene family: Isolation and sequencing of N-terminal domain exons JOURNAL Biochem. Biophys. Res. Commun. 158, 996-1004 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by J.A. Thompson 07-FEB-1989. FEATURES from to/span description pept / 468 / 833 carcinoembryonic antigen hsCGM4, exon 2 (AA at 470) /nomgen="CEA" /map="19q13.1-q13.3" /hgml_locus_uid="LN0164S" IVS < 1 467 carcinoembryonic antigen hsCGM4, intron A IVS 834 > 1010 carcinoembryonic antigen hsCGM4, intron B BASE COUNT 281 A 225 C 267 G 237 T ORIGIN Chromosome 19q31.1-q31.3. 1 GAGCTCACAC AGTCATCGGG GGGGGAAGAC AGACATGCTA AGTGATCTAG AATGTGAGAT 61 CAGGTGTTGA CAAGAACCCT GGAGGGAGGA GAGCAGGGAA AGGTCAGAAA GGGAAGACCC 121 AGGGTCTCTG AAGGAGGTAT CAGGAAAGAA GTCTAAGGAT GCCCTGATGT GAGCAGGACC 181 TGAGGGCAGT GTGGAGGGGG CCGTGCGGAC CCTGGGGAAG AGGAATCCAA AAAGAAAAAT 241 GCCAAGGTCA GAAGTGTTGA AGGAATGGGG GTCATGCTGC TGATCTTGAC CTAGTGGGAC 301 AGTAGGACAC ACACACATAC ACTCACGCCC CTTTAGTGTG TGTATGTGTT TGTATGTGTG 361 TGTTTGTGTG TCTTCAAGGC TGAGGATTGA AGAGACCTTC TCAGGACCCA TCTTTTCACC 421 CCAATACATA GGTCTCAATA TTAACTGATG CTCTCTGTAC CTCCTAGCAT CACTTTTAAA 481 CTTCTGGAAT CCGCCCACAA CTGCCCAAGT CACGATTGAA GCCCAGCCAC CCAAAGTTTC 541 TGAGGGGAAG GATGTTCTTC TACTTGTCCA CAATTTGCCC CAGAATCTTG CTGGCTACAT 601 TTGGTACAAA GGGCAAATGA CATACCTCTA CCATTACATT ACATCATATG TAGTAGACGG 661 TCAAAGAATT ATATATGGGC CTGCATACAG TGGAAGAGAA AGAGTATATT CCAATGCATC 721 CCTGCTGATC CAGAATGTCA CGCAGGAGGA TGCAGGATCC TACACCTTAC ACATCATAAA 781 GCGACGCGAT GGGACTGGAG GAGTAACTGG ACATTTCACC TTCACCTTAC ACCGTGAGTG 841 ATTCCACATG ATCCCTGGGT GTTGGGGGAC AGGGGTCACT TCTACTTCAC ACACACAGGA 901 TTCTCAGGCC TGGACTCTGC CTGTGTCCCT CTCTGCATTA AGTCCATGCT GGGGTTTGGG 961 CATTTAGTGC AGGACACACA GAGGAGACAA ATTTCAACAG ATCAGAATTC // LOCUS HUMCFVII 12850 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human blood coagulation factor VII gene, complete cds. ACCESSION J02933 KEYWORDS coagulation factor; coagulation factor VII. SOURCE Human DNA, clones 7M1 and 7DC1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 12850) AUTHORS O'Hara,P.J., Grant,F.J., Haldeman,B.A., Gray,C.L., Insley,M.Y., Hagen,F.S. and Murray,M.J. TITLE Nucleotide sequence of the gene coding for human factor VII, a vitamin K-dependent protein participating in blood coagulation JOURNAL Proc. Natl. Acad. Sci. U.S.A. 84, 5158-5162 (1987) STANDARD full staff_review REFERENCE 2 (bases 856 to 12446; minisatellite imperfect repeats only) AUTHORS O'Hara,P.J. and Grant,F.J. TITLE The human factor VII gene is polymorphic due to variation in repeat copy number in a minisatellite JOURNAL Gene 66, 147-158 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable copy of sequence in [1] kindly provided by P.J.O'Hara, 26-JUN-1987. FEATURES from to/span description pept 522 585 factor VII, exon 1a /nomgen="F7" /map="13q34" /hgml_locus_uid="LK0077T" 1654 1719 factor VII, exon 1b (optional) 4294 4454 factor VII, exon 2 6383 6407 factor VII, exon 3 6478 6591 factor VII, exon 4 8307 8447 factor VII, exon 5 9419 9528 factor VII, exon 6 10124 10247 factor VII, exon 7 11064 11659 factor VII, exon 8 pre-msg 487 12660 factor VII pre-mRNA (alt.) pre-msg 487 12664 factor VII pre-mRNA (alt.) pre-msg 487 12686 factor VII pre-mRNA (alt.) IVS 586 1653 intron A1 IVS 1720 4293 intron A IVS 4455 6382 intron B IVS 6408 6477 intron C IVS 6592 8306 Intron D IVS 8448 9418 Intron E IVS 9529 10123 Intron F IVS 10248 11063 Intron G BASE COUNT 2532 A 3888 C 3902 G 2528 T ORIGIN 212 bp upstream of XbaI site. 1 CCCGGCACTT CTCAGTGAGG CTCTGTGGCT CACCTAAGAA ACCAGCCTCC CTTGCAGGCA 61 ACGCCTAGCT GGCCTGGTCT GGAGGCTCTC TTCAAATATT TACATCCACA CCCAAGATAC 121 GGTCTTGAGA TTTGACTCGC ATGATTGCTA TGGGACAAGT TTTCATCTGC AGTTTAAATC 181 TGTTTCCCAA CTTACATTAG GGGTTTGGAA TTCTAGATCG TATTTGAAGT GTTGGTGCCA 241 CACACACCTT AACACCTGCA CGCTGGCAAC AAAACCGTCC GCTCTGCAGC ACAGCTGGGG 301 TCACCTGACC TTTCTCCTGT CCCCCCCACT TGAGCTCAGT GGCTGGGCAG CAGGGGATGC 361 ATGGCCACTG GCGGCCAGGT GCAGCTCTCA GCTGGGGTGT TCAGAGGACG CCTGTGTCCT 421 CCCCTCCCCC ATCCCTCTGT CACCCTTGGA GGCAGAGAAC TTTGCCCGTC AGTCCCATGG 481 GGAATGTCAA CAGGCAGGGG CAGCACTGCA GAGATTTCAT CATGGTCTCC CAGGCCCTCA 541 GGCTCCTCTG CCTTCTGCTT GGGCTTCAGG GCTGCCTGGC TGCAGGTGCG TCCGGGGAGG 601 TTTTCTCCAT AAACTTGGTG GAAGGGCAGT GGGCAAATCC AGGAGCCAGC CCGGGCTTCC 661 CAAACCCCGC CCTTGCTCCG GACACCCCCA TCCACCAGGA GGGTTTTCTG GCGGCTCCTG 721 TTCAATTTCT TTCCTTCTAG AAACCAGCAT CCAGGCACAG GAGGGGAGGC CCTTCTTGGT 781 AGCCCAGGCT TTGGCGGGAT TATTTTTCAA AGAACTTTAG GAGTGGGTGG TGCTTTCCTG 841 GCCCCCATGG CCCTGCCTGT GAGGTCGGAC AAGCGCAGGG AGTCTGGGGC CTCTCAGAGT 901 GCAGGAAGTG CGCACAGGGT GCTCCCAGGC TGGGGAGCAC AGGTAGGGGA CGGTGCGTGG 961 GGGATGGCGC CTGGGGCATG GGGGATGGGG TGTGGGAAAC GGCATGTGGG GCGTAGGGGA 1021 TGGGGTGTGG AGGATCGGGG GTGGGGATGG CGTGTGGGGT GTGGGGGATG GGCCGTGGGG 1081 GGGTGGGGCC TGGGAAACAG CATGTGGGGC ATGGGGTGTG GGGGTGAGGT GTGGGAAAGT 1141 GTGTGGGGTG TGGGGGATGG GGCATGGAAA GGGCGTGTGG GGTGCAGGGG ATGGGGCATG 1201 GAGGTGTGGG GGATGGGGTG TGTGGGGTGT CGGGGATGGG GCATGTGGGG TGTGGGGGAT 1261 GGGGCATGGA AAGGCGTGTG GGGTGCAGAG GATGGGGCAT GGAGGTCTGG GGCATGGGGT 1321 GTGTGGGGTG TCGGGGATGG GGCATGGAAA GGGTGTGTGG GGTGTGGGGA TAGGGTCAGG 1381 GGATGGCGTG GGGGGTGTGG CATGGGGATG GCACGTGTGG CATGGGGATG GGGATGGGGG 1441 GTGGGGCATG GCCGAGTGGG GCTGGGGCTG GGAATGGTGA GTGGGGCATG GGGATGGCGA 1501 GTAGGGGGTG TGGCGTGAGG ATGGCTAGTG GGGCGTGGGG ATGGCGTGTG GGGATGGCGA 1561 GTGGGGGGTG GGCTGTGAGG GACAGTGCCT GGGATGTGGC TGCAGCCCTA GCTCACAGCA 1621 TGGCCTTATG ACCCCGGCCA CCTTCCTGCC CAGGCGGGGT CGCTAAGGCC TCAGGAGGAG 1681 AAACACGGGA CATGCCGTGG AAGCCGGGGC CTCACAGAGG TGAGCAGGGA CTGCCACTGG 1741 TTTTGTCCTG GGGCCCAGTG GGGGCAACAT CACCTCCTTC CCCTCCCATG GCAAAGAGCC 1801 AGCCCGCGGG GTGGCTACTG CAGTGCCCCC CAAGGAGGGT GTTCCCTGCT CGAGAGGAAG 1861 TGACCGCTCC AGCTTGGCCT TCCCTGGGAC TGGGGTGCAG GCGATTTTAT CTTCTTTGCT 1921 CCATTCTGTT CCTTCCAGAT AATCGTGTGT TCTTCATCAG GTTTTCCTCA GTTCTTGAGA 1981 GCTTTTCTGA TGCAAATCTG CTTTCACCCC AGGGCGGTCA CCGGCTCTGC TCACACCAGC 2041 CTCCAAGGGT GTGGGTGTCC CGGGAGTGTG GGTGTCCCGG GGGCGTGGGT GTCCCGGGAG 2101 TGTGGGTGTC CCGGGGGCGT GGGTGTCCCG GGAGTGTGGG TGTCCCGGGG GCGTGGGTGT 2161 CCCGGGAGTG TGGGTGTCCC GGGGGAGTGG GTGTCCCGGG AGTGTGGGTG TCCCAGGGGC 2221 GTGGGTGTCC CGGGAGTGTG GGTGTCCCGG GGGCGTGGGT GTCCCGGGAG TGTGGGTGTC 2281 CCGGAGGCGA GGGTGTCCCG GGAGTGTGGG TGTCCCGGGG GCGTGGGTGT CCCGGGAGTG 2341 TGGGTGTCCC GGGGGAGTGG GTGTCCCGGG AGTGTGGGTG TCCCAGGGGC GTGGGTGTCC 2401 CGGGAGTGTG GGTGTCCCGG GGGCGTGGGT GTCCCGGGAG TGTGGGTGTC CCGGAGCGAG 2461 GGTGTCCCGG GAGTGTGGGT GTCCCGGGGG CGTGGGTGTC CCGGAGGCGA GGGTGTCCCA 2521 GGAGTGTGGG TGTCCCGGGG GCGTGGGTGT CCCGGGAGTG TGGGTGTCCC GGAGGCGAGG 2581 GTGTCCCGGG AGTGTGGGTG TCCCGGGGGC GTGGGTGTCC CGGAGGCGAG GGTGTCCCAG 2641 GAGTGTGGGT GTCCCGGGGG CGTGGGTGTC CCGGGAGTGT GGGTGTTCCA GAGGCGAGGG 2701 TATCCCAGAA GTGTGAGTGT CCCGGGGGTG TGGGTGTCCC GGGGGCGTGG GTGTCCCGGG 2761 AGTGTGGGTG TCCCGGGGGC GTGGGTATCC CAGAAGTGTG AGTGTCCCAG GGGCGTGGGT 2821 GTCCGGGGGC GTGGGTGTCC CGGGGGTGTG GGTGTCCCGG GGGTCGTGGG TGTCCCGGGA 2881 GCGTGGGTGT CGGGGACTGC AGGGACATGG GCCTCCCCTC CCACTCCTGC CGCCCAGGGC 2941 ACCTCCTGTG AGGACTCGGA GTCCGTGAGT TCCCACCTCC TTGAGCCCGA TTCTTTGGTG 3001 TCCCCGCCTG CATCCTCAGC CTCCTTCCAA ACCAGACCAG TTCTCTAGGG GCGTCGACGT 3061 GTGAAACTGA TTTTAAAGAA AACAGGCGGT GGCCTTTCTC TCGGCCCCAC GTGGCCCAGT 3121 AGCGCTCACC TTCCGTCCCT TCTTCCGCGC TCAGTAACCA ATTTAGGCCG CTCCTGCAGA 3181 ACTCGGGCTC CTGCCCACCG GCCCACAGCG TCCACCTGAG GCCTCTTCCT CCCAGCAAAG 3241 GTCGTCCCTC CGGAACGCGC CTCCTGCGGC CTCTCCAGAG CCCCTCCCGC GCGTCCTCTC 3301 AGCCCCGCTC GCCTCCTCCC GGGGCCTCCC TCTCCCGCCT GCCCCCAGGC CCGTCTCCCT 3361 CGCGGGCTGA GGCAGGTTCG GCAGCACGGC GCCCGGGGCG GGGGTCACTC TCCACCACCG 3421 CGTGGTGCCC ACAGCTCACG GCGCTCCCGG GTGACGGTCC CCTCGGCTGT AGGGCGTCCT 3481 GAAGAGCGGC CTGCTCGGAG CTGAGCGCAC GGGGTTGCCT GCCCCTGGGC GTCTCTGGCC 3541 CTCACCAGCC CCGTCTTCCC ATGGGCAAAA CGGCGGTCCT GTTTGTCCAC AAGTAACCGT 3601 CGGGGTTACG GAGGGGCCAG GAGCTGCGGC GGGGGGCTGT GCTCTCAGGA CCGGCCCCAG 3661 GAGGATCCGC GCGAGGTCTG GAGCTCTCAG GGGTCGCGGG GGACAGAGGG GCCCCAAGCG 3721 GAGGCGGGAA GGCGGCAGAA GCCCAGGACC GCCAAGAGCT GGCGAGGAAG CCCGGGGCTC 3781 GCTGTCGGGG GAGCCGGGCA GGGGCCGCGC CTCGGCACCA GGACGCGAGG CCTGGGAAGG 3841 CGGATCTGGC CGCGAGCACG CGGTGCGGGT GGAGACGCAG GGATTTGGAT TTCCGCGGGC 3901 GCTGCACGGA TTTCCACGCG CGGTTCACGT GGGCCCCAGG GGGTGCCCGG CACCCGGGGC 3961 CGCGCCGCCT TCTCCTGCCC GGCATCGACC CGCAGCCTCA CGTTTACCGC GGCGCCCGCA 4021 GCCCCCTTCG CCCGCTTCCG CGCGTGCCCC CGAGCGCGCC CTCGGGATCA GCCCCCGGAA 4081 GCAGAGAGGC CAGGCCGGGA AGGATGGGCG AACGGGGTGG CTGACCCGGG AGCACGGCAG 4141 GGAGGACACC CAGCCAGGCC CGCGAGCAGC GCCGCTCCCC TCCTCCAGGA CGGGCGGGAA 4201 CCTGCGATGC CCCCGCCGCG TGGGCCGTGG GGCGGTCTCC GAGGCACTGG GCGGGGCACG 4261 CGGTGGGCGC TTCACGGAAC TCGCATTTCC CAGTCTTCGT AACCCAGGAG GAAGCCCACG 4321 GCGTCCTGCA CCGGCGCCGG CGCGCCAACG CGTTCCTGGA GGAGCTGCGG CCGGGCTCCC 4381 TGGAGAGGGA GTGCAAGGAG GAGCAGTGCT CCTTCGAGGA GGCCCGGGAG ATCTTCAAGG 4441 ACGCGGAGAG GACGGTGAGC CCAGCCTCGG GGCGCCCCGC GCGGACACTG CACGGCGGCG 4501 GTGAACCAGG CCGCGTGGGG CCGCCTGCGT CTCTTTGGCT GCGGCCTGTG GGCGGCGAAC 4561 ACGCAGCGGC GCCCGCGCGC GCGCTCTCTC TGCGGGGGTC GCTTTCCGCC CGGGGTGACT 4621 CCGCTTTCCT GGGCGATGCC CCCACCCCCA GGCACGCGCT CTCCCCGTGC GGCCGCACCG 4681 CGCATGCCGG TTTTCACATC AGAAAATACG ATTTGCACAA GCACACTTAG GGTGTCCCCC 4741 TTAACTTCCC AAGGGAGTCC CCCCAGTCCC CGAAGTCCAG GGCAGCCTGC GCATCGCAGA 4801 CGCGCGCGGC TCGCAGAAGG GACGTGGTGA GAAGCTGGCC CACAGCATGC CACCAGCGGC 4861 ACCTCCTCAG GGCACGTGTC GGGGAGAAAC AACACTTAGG GACCCTGGGA CTTTCTCCAG 4921 CTCACGCTCA CGGGTCCACC TCACACTACC AAGATCACCT CAATAGACGG ACACTCACAC 4981 AGGGCACACT TCACACTCAC AGGTCACCTC ACACTCACAG GACACCTCAC ACTCACAGGG 5041 CACACTTCAC ACTCACGGGT CACCTCACAC TCCAAGATCA CCTAAAGAGG ACACCTCACA 5101 CAGGGCACAC TTCACACTCA CAGGTCACAC CTCACACAGA TCATCTCATT CTCACAGGAC 5161 ACCTCCCTCT CACAGGTCAC CTCACACTCA CAGGACACCT CACAGAGGTC ACCTCACACC 5221 CACAGGACAC CTCACAGAGG TCACCTCACA CGGGGCACAC TTCACACTCA GGTCACCTCA 5281 CACCCACAGG ACACCTCACA GAGGTCACCT CACACCCACA GGACAACTCA CAGAGGTCAC 5341 CTCACACAGG ACACCTCACA AAGGTCACCT CACACCCACA GGACACCTCA CACTCATAGG 5401 CACCTCAGTC TTACAGGACA ACTCACACTC ACAGGTCACC TATCTCACAG GACACCTCAC 5461 ACTCACAGGT CACCTTACTC TCACAGGACA CCTCACACAG GGCACACTTC ACTCCACAGG 5521 TCACCATACC TCACACAGAT CACCTCATAC TCACAGATCA CTTCATTCAT TCTCACAGGA 5581 TACCTCACAC TCAGGGCACA CTTCACACTC ACAGGTCACA CCTCACACAG ATCATCTCAT 5641 TCTCACAGGA CACCTCCCTC TCACAGGTCA CCTTACACTC ATCTCACACT CACAGGTCGC 5701 CACACCTCAC ACTCACAGGA TGCCTCACAC TCACAGAACC ACATCTCATA TGCACAAGAC 5761 ACCTCACACT CAGGACACCT CATGCTCAAA GAAGCCTCAC ACTCACAGGA GGTCCAGCTG 5821 TCTGAGGCAA AGGCTAACAT GACCCTTTCC AGACAAATTG AGGATGGTCA TGCCTAGCAT 5881 TTTTATACAC CTAGTTTTGA AAGCATTTCT CATCTGTTGT ATTCTCACAG CACCCCGTGA 5941 GTTTAAGTTC AGGTGGCCAA CAGTTTCTTC AGCAATCACT TTTTTCTGTG GAGTGCTTTT 6001 GCTGTTTGTG GAATATTTTG CATCTGCTAC TGCACCCTCT CCCCGTATGT GTGGCCACCC 6061 TGTCAGAGGT GGAGCTGTGG CTCAGAGCCT GTGTACCTCG TCCCAGGTCC ACAGCTCAGC 6121 GACAGAAGAG TCAGGGTTGA ACCTCGGGTG TTCTGACTTG GGAGCAGGAA ATGTGTGGTC 6181 ACCCATAGTT CCAGATGTCC TGGGGAGGGG CCAAGATTAG AAGAAACCTA CCTCAGCTCC 6241 AGAGGAAAGT CTGGCTTCCT GAGCCCACCC CGCCAGACCC AGGTCCAAGT CCCCCAACCC 6301 CAGTTCATGG TGTGTCCAGT GCTTACCGTT GGGTGCTCTG GTGAAGGTGC ATCTCACGAG 6361 GCTTGCTCTC TTGTTCCTTC AGAAGCTGTT CTGGATTTCT TACAGTGGTG AGTGGATGAT 6421 CACCACCAGT CCTGCCTGCA ACCCTTCTCA GCTTACTGAC ACCAGCCCAC TCCACAGATG 6481 GGGACCAGTG TGCCTCAAGT CCATGCCAGA ATGGGGGCTC CTGCAAGGAC CAGCTCCAGT 6541 CCTATATCTG CTTCTGCCTC CCTGCCTTCG AGGGCCGGAA CTGTGAGACG CGTAAGGCCC 6601 CACTTTGGGT CCCATATTTG CAGAGGGCCC TGGGGAGCTG GTGGAGGTGG CCTGGCCAAC 6661 CGGGCTGCAG GGTGCAACAA CCTGGTGGGG TGTGTAGGCC GGGCATTCAG GGCTCAGCCC 6721 AGTTGGAAAT TGGTCTAGGT GACCTTTAAA TCCCTTCCAG TCTGAGGTCT TTGACAGGGA 6781 CCCAAGGTTC TGATTATCAG ACTCAGTGGC CCCCTTCGCG GTCCCGGCCC TGGGCAACTT 6841 CTCAGCCCTG GAGACTGGCC CAGTTGAGAG TCCCTGTGTC CCGTGTGCCC ATTCCAGATC 6901 CCACCTAGCT AGGTACCCGT TTGGTAAACT TCCCCTTCTC CTACTTTCCA TTACAAAGGT 6961 TTGAGGGGTT TGTTTTTTTT TTTAACCATC TGAATATTAA ATTAATCACA AAGTTTAGGG 7021 CCCCCAACCT CCCTTGGGTT CAGTAATTCA CTAGAAGGAC ACATAGAAAT CCAAATATCC 7081 ACTGAGTGGA TACACTCACA GGTACCGTTT ATTACAGCAA AGGATGCAGG CTTAAGTCTG 7141 CAGAGGGACC AGGGACAAGC TTCCCCTTGT CCTCTCCTGT GGGGTCATGT GGACATCCTT 7201 AATTCTCCCA GAATGACGTG TGACGAGAAC GTGGGAAGTA CTGCCAAACT TGGGGAACGC 7261 TACGAGCCCC GTGTCCAGAG GTTTGATCAG GGCTCAATGA CATAGACCCA GCTGACCAGG 7321 CACGCATGGC TGACCTCAGT CTCAGCCCCT CCAGAGCTAC GCCGATAATG CGGCCAAGGC 7381 CCCACCATAC ATCACATTGT CAGCTAGACC ATCCAGCATG GCTCAAGGCC CAGGTAAACA 7441 CCAACATTCC CTCAGGCAAG ACCTTCCAAG GGCTTAGCGG TCATTTCCCA GGAGCCAAGG 7501 CAAAGGCTAC CCTTTCTCTG GCACAGCAGT TCATCCTTGA CCACCCAAGA CCACATTCTT 7561 ACACTGAATG AGCTCTCCTG TGCAGCAGCC ATTTTCTTCT CTAAGCAGAA GAGAGCCCAG 7621 CAAGCTGGAG GAGGCTGAAG AGAGAGGCTT CCTGCTGGTC ATCTGGGTCC AGAATGCCTG 7681 GAGATCTCTG CTCAGCCCTG GTGCCCAGCA GCCCTGGTGT GCATCCTGCA GGGCAGCCTT 7741 CCCGCCGGAG TCCTGGACTT GCTCAGGGCC ACTCCCCTTG CCCATGTCAA CCAAAGTCAG 7801 GCTGCCGGTT CTGCTTCTTC TGTCTGAGCC CATGACCAGT GCTGGGACTA ACTGTCCCCC 7861 AGGCGGGCTC ACGGTGGTAC GAGGCCAGCT TGGAGAACTG TCTCAGCTCT CTGGTCCTCT 7921 CGTCAGTTGG GTCTCTGATT GGAAAGTCCC TTGGACACTT TACCATCCCC ATTGGACTTT 7981 CACTTTCCCC CAGGCTCCCA TCAGCTGCTC GGAAGAGTGG TCACCCTGGA GGCCACTGCC 8041 CACCAGCCAG GCACCCCCCA AATGCAACCG CAGCCAGCAC TGCCAGCCAC TGGCAAGGCT 8101 GTTCAGACAT GTGGCTCCTC TGATCCACGC CTTGTCCTTT GGATCAGTCC ACGGAGCAGT 8161 GTGCCAAGCT CAGGCTCTGT CACCCACAGC TCATGCCACC TTCCAGGCAG AACACCACTG 8221 CTGACCCAGG GGCATGGCCA CCCCGGGGGC TGGCGTCTCG CTGACCCCCA GAAGCCCCTC 8281 TCAGGGTGTC CCCTTCCTGT CCCCAGACAA GGATGACCAG CTGATCTGTG TGAACGAGAA 8341 CGGCGGCTGT GAGCAGTACT GCAGTGACCA CACGGGCACC AAGCGCTCCT GTCGGTGCCA 8401 CGAGGGGTAC TCTCTGCTGG CAGACGGGGT GTCCTGCACA CCCACAGGTG ACCAGGCTTC 8461 ATGTCCCAGT CCCAGATGAC ACCAGTCCCT GTCCCACTAG GATTATCTTA CTGGACAAAA 8521 GACGGGTGGG ACTGGCCTTC ACATCTACTG AGCACTAACT ATGCACTGAC CAATTGTGAG 8581 GTGGGATCTG GGCACCAAGG GTGGCACAGG CCAGCAGCGA CCAGTGACTA GGATGGGCAC 8641 CCTGGGGGCA ATCCCTGAAT GGCCTCAGGC CCCCTGCCAA CTTCTAGGCA GACCAGGGGA 8701 GCCAAGCAAG GCACTATCTC ACGTCCAACT GCCCACTCGC AGGAATCCTC CGCCAGGGTT 8761 CATGAATCTA CTTCGGCACA GCCAATGTCT GTACTGACTG CTGCCCACTC TGCATTCCAA 8821 AACTCGTAAA GGCTCCTGGG AAAATGGGAT GTTTCTCCAA ACCAGCCTGG AACGAATGGG 8881 CTGCACTTCC AAAAGCAGGG ACACCCCACA CCCACTGTCT CTAAAGAGGC GGAACGTGCC 8941 CACCCTGGCC ACACAGCCTG GGACTCAGCC TGCCACCTCC TCGGGCTTCC TTTCTGGCCC 9001 AAGACCTTGA TTGAAGCAGA TCAAAACTAA GCATGGGATC AAAACAACAC AGTTTGATTC 9061 ATCTTTAGGT AGAATTTCAT TCACCTTCTA CTAAAGTCAA ACAACACATC TTCTCCCTGA 9121 AAAGTGAGCA GAGGGCGGTT TTAAGACGTA AGCCCTCTGT TTCCTCCAAA ACCAGCCCTG 9181 ACCATTGTCT CCTCAGCCAG CCACTTCTTC AAGGGCCTCT CATGGCCGGG CCCCACCAGT 9241 CAGGCCCAGC CGAGGCCCTG CCTTCCACCA CCCCTGGGCC CTGGGAGCTC CTGCTCCTGG 9301 GGGCCTCCCA TAGCCTCGGC CTCAAGGCCT CTCAGAGGAT GGGTGTTTCT GAATCTTTCC 9361 TAGTGGCACG TTCATCCCTC ACAAATCTCT GCATCTTTCT GACTTTTGTT TTACACAGTT 9421 GAATATCCAT GTGGAAAAAT ACCTATTCTA GAAAAAAGAA ATGCCAGCAA ACCCCAAGGC 9481 CGAATTGTGG GGGGCAAGGT GTGCCCCAAA GGGGAGTGTC CATGGCAGGT AAGGCTTCCC 9541 CTGGCTTCAG GATTCCAAGC CCTGAGGGTC TTGAAGCCTT TTGAATGTGA ACAACAGCTC 9601 TGGAAGGGAA AATGGGCAGG TCAGCCCAAG CCCACAGGCT CCAAGTCAGC ACACCTAGCA 9661 CCTCCAGCTC GCGGCACCCC CATGCTTTTA GTGGGGCAAG GAAGGAGAAA AGAAAACGAC 9721 ACTCACTGAG GGTCTACCCT GTGCAGAGAA CCCTGCGAGA TGCCCCATCC GAGTTGTCAC 9781 GTCGTCCTCA CGGTTACTCT TTGAGGTGGG ATCTTTGCCT GATCTTTGCA AAATCAGGAG 9841 CATTGGATCA AAGCTATGTG AAGATCCTGT GAGGTGAACA GTGAAATCTC ACAGCGACAT 9901 TTGTATTCTT GGGCCGTGCC CAAGAGCACG TCTCGGCTAG AGAGGGGCAC AGCCTCCCAG 9961 AGCCAGGTCT GAGCAGCTTT GCCTGGGAGG GATCTGCAAA GACCCCAGGA TTTCAGAAAG 10021 AAATTGTGCA ATGCCAGAGG TTCCTTGGCA TGCCCGGGAG GGCGAGTCAT CAGAGAAACA 10081 ATGACAGCAA TGTGACTTCC ACACCTCCTG TCCCCCCGCC CAGGTCCTGT TGTTGGTGAA 10141 TGGAGCTCAG TTGTGTGGGG GGACCCTGAT CAACACCATC TGGGTGGTCT CCGCGGCCCA 10201 CTGTTTCGAC AAAATCAAGA ACTGGAGGAA CCTGATCGCG GTGCTGGGTG GGTACCACTC 10261 TCCCCTGTCC GACCGCGGTG CTGGGTGGGT GCCACTCTTC CCTGTCCGAC CGCGGTGCTG 10321 GGTGGGTGCC ACTCTCCCCT GTCCGACCGC GGTGCTGGGT GGGTGCCACT CTCCCCTGTC 10381 CGACCGCGGT GCTGGGTGGG TGCCACTCTC CGCTGTCCGA CCGCGGTGCT GGGTGGGTAC 10441 CACTCTCCCC TGTCTGACCG CAGCTCTCAA GTGTCTCAGG GGCTGTGGCT CTGGGCTTCG 10501 TGCTGTCACT TCCACAGACA GACAGACATC CCCAAAAGGG GAGCAACCAT GCTGGGCACG 10561 ACTGCCTGTG GCACCGTGCT CTCAGCCACT TTCCCATGCC CAAATAAAAC GATAAAAGAC 10621 TGGGGGCTTC TGCCCATCCT GCCTCACTTG ACCAAGAGCC CAGAAGAGGA TGCGACACCC 10681 AGGGCCTCAT GGGACCACCG GCTGGCAGGG GTTCTGCTCA CTGGGTTTAT GGGTGAGACG 10741 AGCACTCCCA GGAGGGCCAC TGGGCCGGGA AGAACTGTGG AGAATCGGGG CACGCCCTGT 10801 CCTCCCAGCT GCCAGGGCAC AGCATCCCTT CCCCACCTGC AACACCCAGA CCCCAGATTC 10861 ACCCCAGTTC ACTTGTCCCC ACACGAGCCA CAGGCTGCCA CCTGGGGCAG GCTGGCCCAC 10921 CTTGGGGTTA GATGCAGGTC CCCTTGCCCC AGAAGGAGAC TGCAGCCCCT GCAGACCTAG 10981 AAATGGCCAC AGCCCATCCC CATGCACCAG GGGGTGAGGT GGCAGGTGGT GGAAAGGGCC 11041 TGAGGGGGGC TTCTTCCTTC CAGGCGAGCA CGACCTCAGC GAGCACGACG GGGATGAGCA 11101 GAGCCGGCGG GTGGCGCAGG TCATCATCCC CAGCACGTAC GTCCCGGGCA CCACCAACCA 11161 CGACATCGCG CTGCTCCGCC TGCACCAGCC CGTGGTCCTC ACTGACCATG TGGTGCCCCT 11221 CTGCCTGCCC GAACGGACGT TCTCTGAGAG GACGCTGGCC TTCGTGCGCT TCTCATTGGT 11281 CAGCGGCTGG GGCCAGCTGC TGGACCGTGG CGCCACGGCC CTGGAGCTCA TGGTCCTCAA 11341 CGTGCCCCGG CTGATGACCC AGGACTGCCT GCAGCAGTCA CGGAAGGTGG GAGACTCCCC 11401 AAATATCACG GAGTACATGT TCTGTGCCGG CTACTCGGAT GGCAGCAAGG ACTCCTGCAA 11461 GGGGGACAGT GGAGGCCCAC ATGCCACCCA CTACCGGGGC ACGTGGTACC TGACGGGCAT 11521 CGTCAGCTGG GGCCAGGGCT GCGCAACCGT GGGCCACTTT GGGGTGTACA CCAGGGTCTC 11581 CCAGTACATC GAGTGGCTGC AAAAGCTCAT GCGCTCAGAG CCACGCCCAG GAGTCCTCCT 11641 GCGAGCCCCA TTTCCCTAGC CCAGCAGCCC TGGCCTGTGG AGAGAAAGCC AAGGCTGCGT 11701 CGAACTGTCC TGGCACCAAA TCCCATATAT TCTTCTGCAG TTAATGGGGT AGAGGAGGGC 11761 ATGGGAGGGA GGGAGAGGTG GGGAGGGAGA CAGAGACAGA AACAGAGAGA GACAGAGACA 11821 GAGAGAGACT GAGGGAGAGA CTCTGAGGAC ATGGAGAGAG ACTCAAAGAG ACTCCAAGAT 11881 TCAAAGAGAC TAATAGAGAC ACAGAGATGG AATAGAAAAG ATGAGAGGCA GAGGCAGACA 11941 GGCGCTGGAC AGAGGGGCAG GGGAGTGCCA AGGTTGTCCT GGAGGCAGAC AGCCCAGCTG 12001 AGCCTCCTTA CCTCCCTTCA GCCAAGCCCC ACCTGCACGT GATCTGCTGG CCCTCAGGCT 12061 GCTGCTCTGC CTTCATTGCT GGAGACAGTA GAGGCATGAA CACACATGGA TGCACACACA 12121 CACACGCCAA TGCACACACA CAGAGATATG CACACACACG GATGCACACA CAGATGGTCA 12181 CACAGAGATA CGCAAACACA CCGATGCACA CGCACATAGA GATATGCACA CACAGATGCA 12241 CACACAGATA TACACATGGA TGCACGCACA TGCCAATGCA CGCACACATC AGTGCACACG 12301 GATGCACAGA GATATGCACA CACCGATGTG CGCACACACA GATATGCACA CACATGGATG 12361 AGCACACACA CACCAAGTGC GCACACACAC CGATGTACAC ACACAGATGC ACACACAGAT 12421 GCACACACAC CGATGCTGAC TCCATGTGTG CTGTCCTCTG AAGGCGGTTG TTTAGCTCTC 12481 ACTTTTCTGG TTCTTATCCA TTATCATCTT CACTTCAGAC AATTCAGAAG CATCACCATG 12541 CATGGTGGCG AATGCCCCCA AACTCTCCCC CAAATGTATT TCTCCCTTCG CTGGGTGCCG 12601 GGCTGCACAG ACTATTCCCC ACCTGCTTCC CAGCTTCACA ATAAACGGCT GCGTCTCCTC 12661 CGCACACCTG TGGTGCCTGC CACCCACTGG GTTGCCCATG ATTCATTTTT GGAGCCCCCG 12721 GTGCTCATCC TCTGAGATGC TCTTTTCTTT CACAATTTTC AACATCACTG AAATGAACCC 12781 TCACATGGAA GCTATTTTTT AAAAACAAAA GCTGTTTGAT AGATGTTTGA GGCTGTAGCT 12841 CCCAGGATCC // LOCUS HUMCFXI01 509 bp ds-DNA PRI 15-DEC-1988 DEFINITION Human coagulation factor XI gene, exon 1. ACCESSION M18295 KEYWORDS coagulation factor XI. SEGMENT 1 of 14 SOURCE Human DNA, (libraries of Lawn et al., and Yoshitake et al.), clone pTZ18R. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 509) AUTHORS Asakai,R., Davie,E.W. and Chung,D.W. TITLE Organization of the gene for human factor XI JOURNAL Biochemistry 26, 7221-7228 (1987) STANDARD full staff_entry FEATURES from to/span description pre-msg 103 > 509 factor XI mRNA and intron /nomgen="F11" /map="4q35" /hgml_locus_uid="LW0045B" IVS 276 > 509 factor XI intron A BASE COUNT 193 A 78 C 88 G 150 T ORIGIN Unreported. 1 TACGAAATAA AATTAAAAAA ATAAATTCAG TGTATTGAGA AAGCAAGCAA TTCTCTCAAG 61 GTATATTTCT GACATACTAA GATTTTAACG ACTTTCACAA ATATGCTGTA CTGAGAGAGA 121 ATGTTACATA ACATTGAGAA CTAGTACAAG TAAATATTAA AGTGAAGTGA CCATTTCCTA 181 CACAAGCTCA TTCAGAGGAG GATGAAGACC ATTTTGGAGG AAGAAAAGCA CCCTTATTAA 241 GAATTGCAGC AAGTAAGCCA ACAAGGTCTT TTCAGGTACA GTTTCAGAAC TTACTATTTA 301 ACATTCCTCT CAAGCAAATA CGCCTTGAAA TGCTTTTTTT AAATCATAGG AATTTAAAAA 361 CACTTTACAA TAGAGAATGA TTGATTTTTA AAATGTGTCT GATTTAGCTT TGTAGAGATG 421 TTCCGCTAAT ATCCATAACT AATCTGAGAG GAAATGTGGA ACAACAGAAG AGTAACAGTG 481 TCTACTCAGT AACAAGCGTT TTACGAGTT // LOCUS HUMCFXI02 630 bp ds-DNA PRI 15-DEC-1988 DEFINITION Human coagulation factor XI gene, exon 2. ACCESSION M18296 KEYWORDS coagulation factor XI. SEGMENT 2 of 14 SOURCE Human DNA, (libraries of Lawn et al., and Yoshitake et al.), clone pTZ18R. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 630) AUTHORS Asakai,R., Davie,E.W. and Chung,D.W. TITLE Organization of the gene for human factor XI JOURNAL Biochemistry 26, 7221-7228 (1987) STANDARD full staff_entry FEATURES from to/span description pept 391 + 445 factor XI precursor, exon 2 (first expressed exon) /nomgen="F11" /map="4q35" /hgml_locus_uid="LW0045B" sigp 391 444 factor XI signal peptide matp 445 + 445 factor XI pre-msg < 1 > 630 factor XI mRNA and introns IVS < 1 389 factor XI intron A IVS 446 > 630 factor XI intron B BASE COUNT 194 A 106 C 161 G 169 T ORIGIN About 1 kb after segment 1. 1 CCTCAAGGAA AGAAAGAAAG GAAAAAAATT GGGAAAGGAA ACAAAGATGA AAAATTGGGG 61 TGGGGAGAGC GGTCAGATGG TGGCCATGAG AAGGATCTGA ACACAGAGAG CGGCGGGGCC 121 GGCGGGGAAG GAGGGAGGAG GGGAGAGCGC TGCTTCCCCG TGGGTTCCGG CTTCTGCAGA 181 GCTGTAAGAG TTGAATGCCA CACACAGTCA CACTAAGGAA TGCTCCAGGA TTGGGAAAGA 241 TAAAATTCAA CATTATAATG AGAACACTGT GAATGCTATT GAATTAACTA CTCCCCTCTC 301 TCCCTATTTC TTGTAAGTCT TAGTGTCAGT AAACTAATTA TAAATTTACA TTTTATGTTC 361 TAAAAGCATG CACCTTTTTC TCATTGTAGG ATGATTTTCT TATATCAAGT GGTACATTTC 421 ATTTTATTTA CTTCAGTTTC TGGTGGTAAG TAGAGTGTTA TCTTAACTAT GGGCTGGGAG 481 AGGGAAATCA CACTGCAATC TCCACACATG TGGGAGAATC CCACACCATT TATGCCGGGA 541 AGGAAATAAA ATGTTTTTAT TAACTTCCTG CCTGAGGCTC CAGAGGTTTT CAAAGCAGGG 601 TAGGAATTGA GGTGAAAAAA TTGTTTGTAC // LOCUS HUMCFXI04 284 bp ds-DNA PRI 15-DEC-1988 DEFINITION Human coagulation factor XI gene, exon 3. ACCESSION M21184 M18297 KEYWORDS coagulation factor XI. SEGMENT 4 of 14 SOURCE Human DNA, (libraries of Lawn et al., and Yoshitake et al.), clone pTZ18R. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 284) AUTHORS Asakai,R., Davie,E.W. and Chung,D.W. TITLE Organization of the gene for human factor XI JOURNAL Biochemistry 26, 7221-7228 (1987) STANDARD full staff_entry FEATURES from to/span description pept + 78 + 240 factor XI precursor, exon 3 /nomgen="F11" /map="4q35" /hgml_locus_uid="LW0045B" matp + 78 + 240 factor XI pre-msg < 1 > 284 factor XI mRNA and introns IVS < 1 77 factor XI intron B IVS 241 > 284 factor XI intron C BASE COUNT 82 A 71 C 57 G 74 T ORIGIN About 5 kb after segment 3. 1 CCTTTATGAG ATTACCACCT AACTAGATGT ATGCCCAGTA AAATCCAACA TAACGCATGC 61 CATGTACTAC ATCACAGAAT GTGTGACTCA GTTGTTGAAG GACACCTGCT TTGAAGGAGG 121 GGACATTACT ACGGTCTTCA CACCAAGCGC CAAGTACTGC CAGGTAGTCT GCACTTACCA 181 CCCAAGATGT TTACTCTTCA CTTTCACGGC GGAATCACCA TCTGAGGATC CCACCCGATG 241 GTAAATGCTT ATGTTTCTAC ATCGAGGAGA CAGATTTTTA AAGG // LOCUS HUMCFXI05 261 bp ds-DNA PRI 15-DEC-1988 DEFINITION Human coagulation factor XI gene, exon 4. ACCESSION M18298 KEYWORDS coagulation factor XI. SEGMENT 5 of 14 SOURCE Human DNA, (libraries of Lawn et al., and Yoshitake et al.), clone pTZ18R. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 261) AUTHORS Asakai,R., Davie,E.W. and Chung,D.W. TITLE Organization of the gene for human factor XI JOURNAL Biochemistry 26, 7221-7228 (1987) STANDARD full staff_entry FEATURES from to/span description pept + 119 + 225 factor XI precursor, exon 4 /nomgen="F11" /map="4q35" /hgml_locus_uid="LW0045B" matp + 119 + 225 factor XI pre-msg < 1 > 261 factor XI mRNA and introns IVS < 1 118 factor XI intron C IVS 226 > 261 factor XI intron D BASE COUNT 84 A 44 C 46 G 87 T ORIGIN About 1.5 kb after segment 4. 1 GGCATGAGAT AAAGTAGTTT GTTTCCTTCT TTTTGGCTTT CTGTGTGCTG ACTTTTAAGA 61 TCCATTATTT TAAAAACATA AATTCCTATT CATTAATATG TATTTTTTAA AAAAACAGGT 121 TTACTTGTGT CCTGAAAGAC AGTGTTACAG AAACACTGCC AAGAGTGAAT AGGACAGCAG 181 CGATTTCTGG GTATTCTTTC AAGCAATGCT CACACCAAAT AAGCGGTAAG ATATGTTCTC 241 AGAATCAACA AATACCAGCT G // LOCUS HUMCFXI06 515 bp ds-DNA PRI 15-DEC-1988 DEFINITION Human coagulation factor XI gene, exon 5. ACCESSION M18299 KEYWORDS coagulation factor XI. SEGMENT 6 of 14 SOURCE Human DNA, (libraries of Lawn et al., and Yoshitake et al.), clone pTZ18R. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 515) AUTHORS Asakai,R., Davie,E.W. and Chung,D.W. TITLE Organization of the gene for human factor XI JOURNAL Biochemistry 26, 7221-7228 (1987) STANDARD full staff_entry FEATURES from to/span description pept + 48 + 207 factor XI precursor, exon 5 /nomgen="F11" /map="4q35" /hgml_locus_uid="LW0045B" matp + 48 + 207 factor XI pre-msg < 1 > 515 factor XI mRNA and introns IVS < 1 47 factor XI intron D IVS 208 > 515 factor XI intron E BASE COUNT 161 A 106 C 96 G 152 T ORIGIN About 1 kb after segment 5. 1 GCCCCTAGAA TCTGGAAGGT ACTCATGTCT TCTGCTTTTA TTTCCAGCTT GCAACAAAGA 61 CATTTATGTG GACCTAGACA TGAAGGGCAT AAACTATAAC AGCTCAGTTG CCAAGAGTGC 121 TCAAGAATGC CAAGAAAGAT GCACGGATGA CGTCCACTGC CACTTTTTCA CGTACGCCAC 181 AAGGCAGTTT CCCAGCCTGG AGCATCGGTG AGTGAGTCCC AGGACATTCG AGTGGTCGAT 241 GAAAAACAGA ATCGTGATTT ACTAAAAAGC TTTTGCCATC AACTTTATGC CAGAATTTAT 301 TTTGAACCCC TAAAAGACAT TTCTATAAAA GTACTCCTAG TTTTCTTCAT GAAAAATACA 361 CTTAAAGCCT AATTTGGATG CATTTCATTT ATGGTAAGGA GTCTATCTTT TAATAACACT 421 GTCAGAAAAA TATATATACT TGGCTAATTT CAAAAGCGCT ACACTTTTAA ATTGGCACTT 481 TTGAAACAGC TGCAATTGGT ATGATTGTCA GTGCC // LOCUS HUMCFXI07 444 bp ds-DNA PRI 15-DEC-1988 DEFINITION Human coagulation factor XI gene, exon 6. ACCESSION M18300 KEYWORDS coagulation factor XI. SEGMENT 7 of 14 SOURCE Human DNA, (libraries of Lawn et al., and Yoshitake et al.), clone pTZ18R. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 444) AUTHORS Asakai,R., Davie,E.W. and Chung,D.W. TITLE Organization of the gene for human factor XI JOURNAL Biochemistry 26, 7221-7228 (1987) STANDARD full staff_entry FEATURES from to/span description pept + 165 + 274 factor XI precursor, exon 6 /nomgen="F11" /map="4q35" /hgml_locus_uid="LW0045B" matp + 165 + 274 factor XI pre-msg < 1 > 444 factor XI mRNA and introns IVS < 1 164 factor XI intron E IVS 275 > 444 factor XI intron F BASE COUNT 119 A 105 C 78 G 142 T ORIGIN About 1.5 kb after segment 6. 1 TGCTTAGCAA CACTGCTGGG ACCATGCCCA GCCATTCAGC CTCCCAGATG GATGCTTCGG 61 GGTCTCGCAG GTCCTCTCTC CAAAGGGGAC TTTCTTAATA TCTCATGTTT TTTCCTCCTT 121 GCAGTTGGAA GAATAAGACA CTTTTCCTTT TTCTTTTTAT TCAGTAACAT TTGTCTACTG 181 AAGCACACCC AAACAGGGAC ACCAACCAGA ATAACGAAGC TCGATAAAGT GGTGTCTGGA 241 TTTTCACTGA AATCCTGTGC ACTTTCTAAT CTGGGTAATT ATCGACTTCT TGATGATGTA 301 ATTCAACCAT TAAATATGCT GATGATTACA GTAGATCTCA CTCAGGATAC CAGCTTATGC 361 TCACGATGAA ACGGACCCAA AGATCTTTAC CTTCTTCATG TGATAGATTT CATCATGTCC 421 TATACAGTTA GATCCTCTAT TTAA // LOCUS HUMCFXI08 312 bp ds-DNA PRI 15-DEC-1988 DEFINITION Human coagulation factor XI gene, exon 7. ACCESSION M18301 KEYWORDS coagulation factor XI. SEGMENT 8 of 14 SOURCE Human DNA, (libraries of Lawn et al., and Yoshitake et al.), clone pTZ18R. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 312) AUTHORS Asakai,R., Davie,E.W. and Chung,D.W. TITLE Organization of the gene for human factor XI JOURNAL Biochemistry 26, 7221-7228 (1987) STANDARD full staff_entry FEATURES from to/span description pept + 110 + 269 factor XI precursor, exon 7 /nomgen="F11" /map="4q35" /hgml_locus_uid="LW0045B" matp + 110 + 269 factor XI pre-msg < 1 > 312 factor XI mRNA and introns IVS < 1 109 factor XI intron F IVS 270 > 312 factor XI intron G BASE COUNT 80 A 65 C 63 G 104 T ORIGIN About 1 kb after segment 7. 1 GATCTTGGGA TACACTTAAA TTTTTTAATA TGGAATTTAC ACATATGTGA CCGGAATTTT 61 CCTGATAGCT GGTGAATTGA GTCCCTGACA TAGTTCTTCC GTCGCGCAGC TTGTATTAGG 121 GACATTTTCC CTAATACGGT GTTTGCAGAC AGCAACATCG ACAGTGTCAT GGCTCCCGAT 181 GCTTTTGTCT CTGGCCGAAT CTGCACTCAT CATCCCGGTT GCTTGTTTTT TACCTTCTTT 241 TCCCAGGAAT GGCCCAAAGA ATCTCAAAGG TAAGGAGTTA ACAAGTAAGG ATAATTTGTT 301 ATCTTCTAAA AA // LOCUS HUMCFXI09 773 bp ds-DNA PRI 15-DEC-1988 DEFINITION Human coagulation factor XI gene, exons 8, 9 and 10. ACCESSION M18302 KEYWORDS coagulation factor XI. SEGMENT 9 of 14 SOURCE Human DNA, (libraries of Lawn et al., and Yoshitake et al.), clone pTZ18R. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 773) AUTHORS Asakai,R., Davie,E.W. and Chung,D.W. TITLE Organization of the gene for human factor XI JOURNAL Biochemistry 26, 7221-7228 (1987) STANDARD full staff_entry FEATURES from to/span description pept + 85 194 factor XI precursor, exon 8 /nomgen="F11" /map="4q35" /hgml_locus_uid="LW0045B" 296 458 factor XI precursor, exon 9 547 + 653 factor XI precursor, exon 10 matp + 85 194 factor XI 296 458 factor XI 547 + 653 factor XI pre-msg < 1 > 773 factor XI mRNA and introns IVS < 1 84 factor XI intron G IVS 195 295 factor XI intron H IVS 459 546 factor XI intron I IVS 654 > 773 factor XI intron J BASE COUNT 214 A 161 C 168 G 230 T ORIGIN About 3 kb after segment 8. 1 CTGACTTTAC TTTCTCTAGG TGCTGTAAAA ATGTTTTTAT GTGTTTGATA TGATATATTT 61 CTACTTCCCT TTTGTTTTTG TTAGAAATCT TTGTCTCCTT AAAACATCTG AGAGTGGATT 121 GCCCAGTACA CGCATTAAAA AGAGCAAAGC TCTTTCTGGT TTCAGTCTAC AAAGCTGCAG 181 GCACAGCATC CCAGGTAAAC TGAGAGTTCT GCATTCTGGC TGAGAGTGAC CAGCCCCGAG 241 GAGGCTGATA CATGCTGAGG GAGGGTCTCA CTCTGACATG TGGTCTGCTG TCTAGTGTTC 301 TGCCATTCTT CATTTTACCA TGACACTGAT TTCTTGGGAG AAGAACTGGA TATTGTTGCT 361 GCAAAAAGTC ACGAGGCCTG CCAGAAACTG TGCACCAATG CCGTCCGCTG CCAGTTTTTT 421 ACCTATACCC CAGCCCAAGC ATCCTGCAAC GAAGGGAAGT AAGCCATATG AAGGGTTATG 481 CAGACACCCT TGTCCCGTCT GCCTGTGAGG TGCATTATGT TTATACCGTT TTGTTTCCAA 541 CTGCAGGGGC AAGTGTTACT TAAAGCTTTC TTCAAACGGA TCTCCAACTA AAATACTTCA 601 CGGGAGAGGA GGCATCTCTG GATACACATT AAGGTTGTGT AAAATGGATA ATGGTGAGTA 661 TAATGTCACT TGAAAAAATA TAGCTGAAGG AATTATTCCA TGCTTCATAC ATCACAATCA 721 AGACTGTCAG TTATAGCCAC AGAAGGGAGA ACATTCAGGA AATAACAAAT TTT // LOCUS HUMCFXI10 271 bp ds-DNA PRI 15-DEC-1988 DEFINITION Human coagulation factor XI gene, exon 11. ACCESSION M18303 KEYWORDS coagulation factor XI. SEGMENT 10 of 14 SOURCE Human DNA, (libraries of Lawn et al., and Yoshitake et al.), clone pTZ18R. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 271) AUTHORS Asakai,R., Davie,E.W. and Chung,D.W. TITLE Organization of the gene for human factor XI JOURNAL Biochemistry 26, 7221-7228 (1987) STANDARD full staff_entry FEATURES from to/span description pept + 17 + 185 factor XI precursor, exon 11 /nomgen="F11" /map="4q35" /hgml_locus_uid="LW0045B" matp + 17 + 185 factor XI pre-msg < 1 > 271 factor XI mRNA and introns IVS < 1 16 factor XI intron J IVS 186 > 271 factor XI intron K BASE COUNT 73 A 63 C 59 G 76 T ORIGIN About 3 kb after segment 9. 1 AATGCTTCTG TTGCAGAGTG TACCACCAAA ATCAAGCCCA GGATCGTTGG AGGAACTGCG 61 TCTGTTCGTG GTGAGTGGCC GTGGCAGGTG ACCCTGCACA CAACCTCACC CACTCAGAGA 121 CACCTGTGTG GAGGCTCCAT CATTGGAAAC CAGTGGATAT TAACAGCCGC TCACTGTTTC 181 TATGGGTCAG TACCACGGCT GTTTTTATTA GTTCATCTTC TTCACACATT TATAAAAAAT 241 ATTACTAGCA TGTTAGGAAA TAAATACTTT A // LOCUS HUMCFXI11 568 bp ds-DNA PRI 15-DEC-1988 DEFINITION Human coagulation factor XI gene, exon 12. ACCESSION M18304 KEYWORDS coagulation factor XI. SEGMENT 11 of 14 SOURCE Human DNA, (libraries of Lawn et al., and Yoshitake et al.), clone pTZ18R. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 568) AUTHORS Asakai,R., Davie,E.W. and Chung,D.W. TITLE Organization of the gene for human factor XI JOURNAL Biochemistry 26, 7221-7228 (1987) STANDARD full staff_entry FEATURES from to/span description pept + 213 + 388 factor XI precursor, exon 12 /nomgen="F11" /map="4q35" /hgml_locus_uid="LW0045B" matp + 213 + 388 factor XI pre-msg < 1 > 568 factor XI mRNA and introns IVS < 1 212 factor XI intron K IVS 389 > 568 factor XI intron L BASE COUNT 173 A 102 C 108 G 185 T ORIGIN About 1 kb after segment 10. 1 TGCTCATTCA TTTTTTGTGT ATAATGGATT TTCTTTATAG GGTGAATATG TTTTTTATCC 61 CGAAAAATCT TAGGATAAAA TCACTTTTTT CTACCTAAAT GTCCATCATT GGCAGAAAAT 121 ATTAGTAATA ATTAAACAGC CACACACTTC ACAATGTCTG GGAATTATTT TTAGTAAAGG 181 AAATTTCTTT CCCTCTGTTG TTTGCTCCTT AGGGTAGAGT CACCTAAGAT TTTGCGTGTC 241 TACAGTGGCA TTTTAAATCA ATCTGAAATA AAAGAGGACA CATCTTTCTT TGGGGTTCAA 301 GAAATAATAA TCCATGATCA GTATAAAATG GCAGAAAGCG GGTATGATAT TGCCTTGTTG 361 AAACTGGAAA CCACAGTGAA TTACACAGGT ACGGAGAATT TTATCCGGAA AGTTGTCTCC 421 AATGGTGAAC TGGATAAAAT GTTTAACACT ACTAGACTTA CGGCCTGACC CTGCCAATCT 481 CTCCATGCGT TATCATCATG AAAGGGAGAG GGCCTGGAAT GCTAGTCATT CACTCTGCTA 541 AGGCTGACAC ACTTTCCTGG CTATTGAA // LOCUS HUMCFXI12 312 bp ds-DNA PRI 15-DEC-1988 DEFINITION Human coagulation factor XI gene, exon 13. ACCESSION M19417 KEYWORDS coagulation factor XI. SEGMENT 12 of 14 SOURCE Human DNA, (libraries of Lawn et al., and Yoshitake et al.), clone pTZ18R. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 312) AUTHORS Asakai,R., Davie,E.W. and Chung,D.W. TITLE Organization of the gene for human factor XI JOURNAL Biochemistry 26, 7221-7228 (1987) STANDARD full staff_entry FEATURES from to/span description pept + 161 + 256 factor XI precursor, exon 13 /nomgen="F11" /map="4q35" /hgml_locus_uid="LW0045B" matp + 161 + 256 factor XI pre-msg < 1 > 312 factor XI mRNA and introns IVS < 1 160 factor XI intron L IVS 257 > 312 factor XI intron M BASE COUNT 117 A 48 C 68 G 79 T ORIGIN About 0.8 kb after segment 11. 1 ATCGTGCTGA ACCTGAGGGA GGAAAATACA CGACAACAAG GCAAAAAATG AATATAGTAA 61 ACAAAGAAAA CACAGATAAT GTACAGTGGA AGAAGAGTCT CTTCTGGAAA AGAGGATATA 121 TTTTGCGTCT CATATTTAAA CCACGATTTT TTAAATTTAG ATTCTCAACG ACCCATATGC 181 CTGCCTTCCA AAGGAGATAG AAATGTAATA TACACTGATT GCTGGGTGAC TGGATGGGGG 241 TACAGAAAAC TAAGAGGTAA AAATGATGTT GTTATATGTG CTCCATCCTA GAAATGAAGA 301 GCGGAACCTT TT // LOCUS HUMCFXI13 444 bp ds-DNA PRI 15-DEC-1988 DEFINITION Human coagulation factor XI gene, exon 14. ACCESSION M20217 M18306 KEYWORDS coagulation factor XI. SEGMENT 13 of 14 SOURCE Human DNA, (libraries of Lawn et al., and Yoshitake et al.), clone pTZ18R. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 444) AUTHORS Asakai,R., Davie,E.W. and Chung,D.W. TITLE Organization of the gene for human factor XI JOURNAL Biochemistry 26, 7221-7228 (1987) STANDARD full staff_entry FEATURES from to/span description pept + 156 + 295 factor XI precursor, exon 14 /nomgen="F11" /map="4q35" /hgml_locus_uid="LW0045B" matp + 156 + 295 factor XI pre-msg < 1 > 444 factor XI mRNA and introns IVS < 1 155 factor XI intron M IVS 296 > 444 factor XI intron N BASE COUNT 136 A 74 C 97 G 137 T ORIGIN About 1 kb after segment 12. 1 CTGAGATTGC ACCACTGCAC TCCAGCCTGG GCGACAGAAA GAGACTCCGT CTCAATTAAA 61 AATATATATA TATATATATA TTTATATGTA TGCATATATG TTTATGTGTA TTGTGTATGG 121 TTATTCTACA AACGAACCAA AAAAATTTTT TTCAGACAAA ATACAAAATA CTCTCCAGAA 181 AGCCAAGATA CCCTTAGTGA CCAACGAAGA GTGCCAGAAG AGATACAGAG GACATAAAAT 241 AACCCATAAG ATGATCTGTG CCGGCTACAG GGAAGGAGGG AAGGACGCTT GCAAGGTAAC 301 AGAGTGTTCT TAGCCAATGG AATATATGCA AATTGGAATG CTTAATGCGT TGGGGTTTTT 361 TTGTTTGTTT TGTTTTTTTT GTTTGTTTTT TTTTGAGACA GAGTCTCGCT CTGTTGCCCA 421 GGCTGGAGTG CAGTGGCTCG ATCT // LOCUS HUMCFXI14 748 bp ds-DNA PRI 15-DEC-1988 DEFINITION Human coagulation factor XI gene, exon 15. ACCESSION M20218 M18307 KEYWORDS coagulation factor XI. SEGMENT 14 of 14 SOURCE Human DNA, (libraries of Lawn et al., and Yoshitake et al.), clone pTZ18R. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 748) AUTHORS Asakai,R., Davie,E.W. and Chung,D.W. TITLE Organization of the gene for human factor XI JOURNAL Biochemistry 26, 7221-7228 (1987) STANDARD full staff_entry FEATURES from to/span description pept + 102 263 factor XI precursor, exon 15 /nomgen="F11" /map="4q35" /hgml_locus_uid="LW0045B" matp + 102 260 factor XI pre-msg < 1 429 factor XI mRNA and introns IVS < 1 101 factor XI intron N BASE COUNT 215 A 144 C 195 G 194 T ORIGIN About 1 kb after segment 13. 1 CAAGACAACA TTTTAGGCAA AATCAGCCTG AGCAAGATGT GCTGAAGATG GGAAGCGTCT 61 GAGTTGATCT GTGCACCTTT TCTTGTCTCC CCTCGTTCTA GGGAGATTCG GGAGGCCCTC 121 TGTCCTGCAA ACACAATGAG GTCTGGCATC TGGTAGGCAT CACGAGCTGG GGCGAAGGCT 181 GTGCTCAAAG GGAGCGGCCA GGTGTTTACA CCAACGTGGT CGAGTACGTG GACTGGATTC 241 TGGAGAAAAC TCAAGCAGTG TGAATGGGTT CCCAGGGGCC ATTGGAGTCC CTGAAGGACC 301 CAGGATTTGC TGGGAGAGGG TGTTGAGTTC ACTGTGCCAG CATGCTTCCT CCACAGTAAC 361 ACGCTGAAGG GGCTTGGTGT TTGTAAGAAA ATGCTAGAAG AAAACAAACT GTCACAAGTT 421 GTTATGTCCA AAACTCCCGT TCTATGATCG TTGTAGTTTG TTTGAGCATT CAGTCTCTTT 481 GTTTTTGATC ACGCTTCTAT GGAGTCCAAG AATTACCATA AGGCAATGTT TCTGAAGATT 541 ACTATATAGG CAGATATACC AGAAAATAAC CAAGTAGTGG CAGTGGGGAT CAGGCAGAAG 601 AACTGGTAAA AGAAGCCACC ATAAATAGAT TTGTTCGATG AAAGATGAAA ACTGGAAGAA 661 AGGAGAACAA AGACAGTCTT CACCATTTTG CAGGAATCTA CACTCTGCCT ATGTGAACAC 721 ATTTCTTTTG TAAAGAAAGA ATTTGATT // LOCUS HUMCFXII1 596 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human blood coagulation factor XII gene, exon 1. ACCESSION M17464 J02807 KEYWORDS Hageman factor; coagulation factor XII. SEGMENT 1 of 3 SOURCE Human DNA (library of P.Leder), clones lambda-HFXII-[27,76]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 596) AUTHORS Cool,D.E. and MacGillivray,R.T.A. TITLE Characterization of the human blood coagulation factor XII gene: Intron/exon gene organization and analysis of the 5'-flanking region JOURNAL J. Biol. Chem. 262, 13662-13673 (1987) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by R.T.A.McGillivray, 20-NOV-1987. FEATURES from to/span description pept 405 + 461 coagulation factor XII precursor, exon 1 /nomgen="F12" /map="5q33-qter" /hgml_locus_uid="LS0075U" sigp 405 461 coagulation factor XII signal peptide pre-msg 356 > 596 F12 mRNA and intron (minor alt.) pre-msg 369 > 596 F12 mRNA and intron (major alt.) pre-msg 377 > 596 F12 mRNA and intron (major alt.) pre-msg 382 > 596 F12 mRNA and intron (minor alt.) pre-msg 388 > 596 F12 mRNA and intron (minor alt.) IVS 462 > 596 F12 intron A BASE COUNT 134 A 165 C 152 G 145 T ORIGIN 408 bp upstream of BglI site; chromosome 5q33-qter. 1 GTGGGTATTG TTGTAAGATG CTGAGTTTAT GGTAGTTTGT TACATGACAA TAGAAAATGA 61 ACACACTTCA CAGTGGACTC CAAGATCCCC ATGATCTTTG ATCTCCTTAA CCTCCTGATC 121 TCCACAGGAC CCAGAGCATA AGAATGTCCC TTCTTCTGCT TCCAGTCCCA CTATCTAGAA 181 AAGAGAGGAG GAGCCCAGCT CTTCATTTCA CCCCCACCCA CAAACTCCCA ACTTTCCGGC 241 CCTCAAGGGG TGACCAAGGA AGTTGCTCCA CTTGGCTTTC CACAAACAGC CTGTGCCCCA 301 CCAGGCTCAG GAGGGCAGCT TGACCAATCT CTATTTCCAA GACCTTTGGC CAGTCCTATT 361 GATCTGGACT CCTGGATAGG CAGCTGGACC AACGGACGGA CGCCATGAGG GCTCTGCTGC 421 TCCTGGGGTT CCTGCTGGTG AGCTTGGAGT CAACACTTTC GGTGAGTGCT GTGGGAACCA 481 GGATTGTCCC AGGATTGTTC TGGGGGGTCG CTATCACAGC CATGAGCCAT GGCCTCTGCT 541 CATGACCTGT GGGTCCAGGT GACTAGGAGG CCTATGTGGA AAGGTGAGGC CAGCCC // LOCUS HUMCFXII2 765 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human blood coagulation factor XII gene, exon 2. ACCESSION M17465 J02807 KEYWORDS Hageman factor; coagulation factor XII. SEGMENT 2 of 3 SOURCE Human DNA (library of P.Leder), clones lambda-HFXII-[27,76]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 765) AUTHORS Cool,D.E. and MacGillivray,R.T.A. TITLE Characterization of the human blood coagulation factor XII gene: Intron/exon gene organization and analysis of the 5'-flanking region JOURNAL J. Biol. Chem. 262, 13662-13673 (1987) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by R.T.A.McGillivray, 20-NOV-1987. FEATURES from to/span description pept + 243 + 300 coagulation factor XII precursor, exon 2 /nomgen="F12" /map="5q33-qter" /hgml_locus_uid="LS0075U" matp 243 + 300 coagulation factor XII heavy chain pre-msg < 1 > 765 F12 mRNA and intron (minor alt.) IVS < 1 242 F12 intron A IVS 301 > 765 F12 intron B BASE COUNT 162 A 191 C 232 G 180 T ORIGIN About 4.3 kb after segment 1; chromosome 5q33-qter. 1 AGGCCAGCCC GGAAGGCCCA GGCAGAGGAG ACAGACAACC AGACTGGGTG GATACAAGGG 61 CACAGCCTGC ATTTCTGGGG GAGATGGGCC TTAAGAAGAC AACGGGGGGA GGTAGAAAGG 121 GAAAGGGTCT TGGGAAGAAA TCTCTGCATT TCTGGGCTGT GAGAGGAAGC TGCAGACTAG 181 CAACAGATCG GTGGCAGGCT ATGACTTATA GTCAGTTCCC TGCCTTCTTC TCTCCCTTGT 241 AGATTCCACC TTGGGAAGCC CCCAAGGAGC ATAAGTACAA AGCTGAAGAG CACACAGTCG 301 GTAAGTGGCC TGGCTCCTCC TCCCGGGAAC CCTTGGGTGG GGATGTGTAT GGTGCAGTGT 361 GTGCAGTCTC AGGGCAGTCT AGTCTAGTGC CTACCTGGTG CTAGGTCTTA TGCCCATGGG 421 CACTAGAGTG ATCGTGAGCT GTGTGATCCT TGAGGGCAGG GTATGGGCTG TGTCTAAGTG 481 CCCACGAGCC TGGCTCGGAG CAGGTGCTTG AGATATGTGC TGCTGGCGCC ATCACACCTG 541 GGCTCCTGCC AGCCTTCCTC AGTTTCCCCA GCTTCTCCCC TTCTTTTCCT TTCCCCAGTA 601 CGTCTCATGG GCATCATTCA TGCCACACAG AGGCCAGGGC CTTCAATGGG CAAGGAAGGA 661 TCAAGAGCTT GTCTCTGGCA TCTGAATGCC TCTGAAGCCC AGCTTTATGA GTTATGAGCT 721 GGGTGACTCT GGGCGAGGGA TTTGAGTTCT CCAAGCTTCA ATTTC // LOCUS HUMCFXII3 4139 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human blood coagulation factor XII gene, exons 3-14. ACCESSION M17466 J02807 KEYWORDS Hageman factor; coagulation factor XII. SEGMENT 3 of 3 SOURCE Human DNA (library of P.Leder), clones lambda-HFXII-[27,76]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 4139) AUTHORS Cool,D.E. and MacGillivray,R.T.A. TITLE Characterization of the human blood coagulation factor XII gene: Intron/exon gene organization and analysis of the 5'-flanking region JOURNAL J. Biol. Chem. 262, 13662-13673 (1987) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [1] kindly provided by R.T.A.McGillivray, 20-NOV-1987. FEATURES from to/span description pept + 25 124 coagulation factor XII precursor, exon 3 /nomgen="F12" /map="5q33-qter" /hgml_locus_uid="LS0075U" 281 351 coagulation factor XII precursor, exon 4 652 762 coagulation factor XII precursor, exon 5 900 1031 coagulation factor XII precursor, exon 6 1171 1275 coagulation factor XII precursor, exon 7 1421 1586 coagulation factor XII precursor, exon 8 1672 1889 coagulation factor XII precursor, exon 9 1994 2225 coagulation factor XII precursor, exon 10 2467 2603 coagulation factor XII precursor, exon 11 2685 2828 coagulation factor XII precursor, exon 12 3384 3532 coagulation factor XII precursor, exon 13 3623 3790 coagulation factor XII precursor, exon 14 matp + 25 124 coagulation factor XII heavy chain 281 351 coagulation factor XII heavy chain 652 762 coagulation factor XII heavy chain 900 1031 coagulation factor XII heavy chain 1171 1275 coagulation factor XII heavy chain 1421 1586 coagulation factor XII heavy chain 1672 1889 coagulation factor XII heavy chain 1994 2091 coagulation factor XII heavy chain matp 2092 2225 coagulation factor XII light chain 2467 2603 coagulation factor XII light chain 2685 2828 coagulation factor XII light chain 3384 3532 coagulation factor XII light chain 3623 3787 coagulation factor XII light chain matp 2035 2061 coagulation factor XII beta-FXIIa pre-msg < 1 > 3923 F12 mRNA and intron (minor alt.) IVS < 1 24 F12 intron B IVS 125 280 F12 intron C IVS 352 651 F12 intron D IVS 763 899 F12 intron E IVS 1032 1170 F12 intron F IVS 1276 1420 F12 intron G IVS 1587 1671 F12 intron H IVS 1890 1993 F12 intron I IVS 2226 2466 F12 intron J (no splice consensus at 2226) IVS 2604 2684 F12 intron K IVS 2829 3383 F12 intron L IVS 3533 3622 F12 intron M BASE COUNT 746 A 1309 C 1327 G 757 T ORIGIN About 2.5 kb after segment 2; chromosome 5q33-qter. 1 TCAGAGAGTG TGTTGTCCCT GCAGTTCTCA CTGTCACCGG GGAGCCCTGC CACTTCCCCT 61 TCCAGTACCA CCGGCAGCTG TACCACAAAT GTACCCACAA GGGCCGGCCA GGCCCTCAGC 121 CCTGGTAAGA CTACGCAGAG GAGTTGGAGC AGGGGCCTGG GAGACATGTA CCCTGCCTGT 181 CCTTCTGTCC AAGGAACTCT GCTTGGAGAG AGGGGACTGT GATAGGGCAG GGTGGGCCAG 241 GCCCCTGGGT AGAGCAGGGA AGCCTTGTCT CTTTCTACAG GTGTGCTACC ACCCCCAACT 301 TTGATCAGGA CCAGCGATGG GGATACTGTT TGGAGCCCAA GAAAGTGAAA GGTGCTACAC 361 ACAGCCTCTG GGGTGGCCTG GGGCTCTCTC CTCCCGCCTC ATTACTCTCC TGGTATCACC 421 AGACCCCACA CACCTGGGAT TCTGGACCCA GCCCCTTCTC TCCCTCCACA ATACCCTTTG 481 GAAGTCCAGA GGGAGAGTTC TGGGAAGGAG TGGTCCCATT TTGCAGGTGG GTAAACCAAG 541 CTTGGAAACT TGGAGTAGCA AGGTCACAAG GCAAGTAGGT TCAAGAAGGG CCTTGGCCCC 601 AGCTGTGTGA CTCAGCTCCC TGCTCTTCCT TCCACCATGT CCATCTCTCA GACCACTGCA 661 GCAAACACAG CCCCTGCCAG AAAGGAGGGA CCTGTGTGAA CATGCCAAGC GGCCCCCACT 721 GTCTCTGTCC ACAACACCTC ACTGGAAACC ACTGCCAGAA AGGTGAGGAG ATGTGGAGGA 781 CCTGGGCGGG GTGCTGGGGG ACAGGGGCAA CCCTGGGCCT ACAGAATAGG TTGCTGGATA 841 CTCGGAGACT TGGCATGGTC CTAGACTCTC CTGAGACCAC TATCCCTCTT TGTCCCCAGA 901 GAAGTGCTTT GAGCCTCAGC TTCTCCGGTT TTTCCACAAG AATGAGATAT GGTATAGAAC 961 TGAGCAAGCA GCTGTGGCCA GATGCCAGTG CAAGGGTCCT GATGCCCACT GCCAGCGGCT 1021 GGCCAGCCAG GGTGAGCAGA TGGTTGGGAA CGGGCCAGGG AGGAGCGTCA GGAAGACAGG 1081 CTGGCAGGAG GCCGGGTGGT GTGCCGGGAA GGAGAGCTCT CTGGGGGGGT CTTTAGGCCC 1141 AGGGGTGGCT CACTGCGTTC CCTCCCCAAG CCTGCCGCAC CAACCCGTGC CTCCATGGGG 1201 GTCGCTGCCT AGAGGTGGAG GGCCACCGCC TGTGCCACTG CCCGGTGGGC TACACCGGAC 1261 CCTTCTGCGA CGTGGGTGAG TGAGGGTCTG GGGCAAGCAG AAGGCCAGCC CCCAGGTGGG 1321 ACGGGCTTGC CAGGAAGGAG GAGGGAGAGT GCGGAAAGCA GATGAGAGGG AGGCAGGAGA 1381 GCCCAGCCTT GGCTGCCCAG GGAGCCCCCT TTCTCCTCAG ACACCAAGGC AAGCTGCTAT 1441 GATGGCCGCG GGCTCAGCTA CCGCGGCCTG GCCAGGACCA CGCTCTCGGG TGCGCCCTGT 1501 CAGCCGTGGG CCTCGGAGGC CACCTACCGG AACGTGACTG CCGAGCAAGC GCGGAACTGG 1561 GGACTGGGCG GCCACGCCTT CTGCCGGTGC GCCGCGTGGG GCTGGGTGAC CCCTCCGCCC 1621 CAGGGCTCCG GGCTCCCGGC GCTCTAACGG CGCCCCGTCG TGTGGCTACA GGAACCCGGA 1681 CAACGACATC CGCCCGTGGT GCTTCGTGCT GAACCGCGAC CGGCTGAGCT GGGAGTACTG 1741 CGACCTGGCA CAGTGCCAGA CCCCAACCCA GGCGGCGCCT CCGACCCCGG TGTCCCCTAG 1801 GCTTCATGTC CCACTCATGC CCGCGCAGCC GGCACCGCCG AAGCCTCAGC CCACGACCCG 1861 GACCCCGCCT CAGTCCCAGA CCCCGGGAGG TTAGGAAGTG GGGGGGGAAG GAGGAGCCGA 1921 GAGGGCGCCG GGCGAGCTAG ATTCCGGCCA GCCGGCCGCG GGCTCCCCGT CCTCAGCCCC 1981 TGCTCCTCCA CAGCCTTGCC GGCGAAGCGG GAGCAGCCGC CTTCCCTGAC CAGGAACGGC 2041 CCACTGAGCT GCGGGCAGCG GCTCCGCAAG AGTCTGTCTT CGATGACCCG CGTCGTTGGC 2101 GGGCTGGTGG CGCTACGCGG GGCGCACCCC TACATCGCCG CGCTGTACTG GGGCCACAGT 2161 TTCTGCGCCG GCAGCCTCAT CGCCCCCTGC TGGGTGCTGA CGGCCGCTCA CTGCCTGCAG 2221 GACCGGCGAG TACCCGCCCG CCCAGAGCCG CCCCAGGGGC CGCGGCTCCT CCGTCTCCCA 2281 GCGCAGCTTC CACGCTGCAC CCGAACCCGT GCCCTACCTT CTCCCGCCCC ACCCTTCTTT 2341 CCACGCCCCT CCGGAGCTCC CGGGGAGGAA GCTGGAACAC GGGATTGGGG TTCGGGAGCA 2401 GGGGGCTTCC CCAGAACGCT TGTGGCCAGG TCTGAGAGCG CTGCCTCTCC CCTACCCTCC 2461 CCGCAGGCCC GCACCCGAGG ATCTGACGGT GGTGCTCGGC CAGGAACGCC GTAACCACAG 2521 CTGTGAGCCG TGCCAGACGT TGGCCGTGCG CTCCTACCGC TTGCACGAGG CCTTCTCGCC 2581 CGTCAGCTAC CAGCACGACC TGGGTGCGTG GGGGCGCCCC GCGGGGACGG GAAGAGAGCT 2641 TGGGCACGGC GTCCCCGCCT CACGCTCCTC TCCGCCCGGG TTAGCTCTGT TGCGCCTTCA 2701 GGAGGATGCG GACGGCAGCT GCGCGCTCCT GTCGCCTTAC GTTCAGCCGG TGTGCCTGCC 2761 AAGCGGCGCC GCGCGACCCT CCGAGACCAC GCTCTGCCAG GTGGCCGGCT GGGGCCACCA 2821 GTTCGAGGGT AGGCACAACT GCTAGGGGCA GGGGTAGGGG AGGAGACCTT TGATCACTGG 2881 GTTAGGCGGA AGAAGCCCGC GACTTTGGTA TCGTTCCGGG TGCCTACAGA ATGGGTGGCG 2941 CTGACCTGAT GGGTTGTGAG AATGTGTAGG TGAATCCCAG GTAGAATCCC AGGGCCTGGG 3001 ATTCACTGCT GGGATCCCCA AATCTCCTGG GGATACAGGG AGAATCGAAC TTGCTCTTGG 3061 TTCCCTCTGG GCGCCGGGCT GCAAAGGCCA ACTAGGACGC TGGCCCCGCG CTCCGGGCTA 3121 GTGTGGGAGC CAGGTTCTGC GACTCTGGAT GGGTGGTGGG GGAGGGGTTT CTGTTTCCGC 3181 TCCGCCCATT CAAATCCTGG CTTTTCTCTG GACCTCAGCC TCCTTGCCTA TGAAATTGAA 3241 TTAATGGCAC CTCCTCCCCT TCGGGCTTGC TGCGAGAGAG GAAGGGCATG AGTGGGTTTA 3301 CAAGCGCCTG GAGCAGCTTT GTCCATCGTC CGGGCGGCAA GCGTTGTCAG ATGGGGTGTG 3361 AAGAAGGCGC TCTGTGTTCG CAGGGGCGGA GGAATATGCC AGCTTCCTGC AGGAGGCGCA 3421 GGTACCGTTC CTCTCCCTGG AGCGCTGCTC AGCCCCGGAC GTGCACGGAT CCTCCATCCT 3481 CCCCGGCATG CTCTGCGCAG GGTTCCTCGA GGGCGGCACC GATGCGTGCC AGGTGAGCTC 3541 TTAGCCCGGT TGGCGCCCTT CCCCGAGGCC GTCAGGCACA AATCTCAGGT CCACAGCGCT 3601 GAGCTGCGTG TTTCCGACCC AGGGTGATTC CGGAGGCCCG CTGGTGTGTG AGGACCAAGC 3661 TGCAGAGCGC CGGCTCACCC TGCAAGGCAT CATCAGCTGG GGATCGGGCT GTGGTGACCG 3721 CAACAAGCCA GGCGTCTACA CCGATGTGGC CTACTACCTG GCCTGGATCC GGGAGCACAC 3781 CGTTTCCTGA TTGCTCAGGG ACTCATCTTT CCCTCCTTGG TGATTCCGCA GTGAGAGAGT 3841 GGCTGGGGCA TGGAAGGCAA GATTGTGTCC CATTCCCCCA GTGCGGCCAG CTCCGCGCCA 3901 GGATGGCGCA GGAACTCAAT AAAGTGCTTT GAAAATGCTG AGAAGGAAAG CTCTTTTCTT 3961 CATGGGTCCG CCGGGAAATG CCAAGACAGA AAAGCGATTC ACAGCTTCTC CACAGCTCTC 4021 AGAGAACAAG GTCTATGAGA TCTTAACGTG CAAAATCTAG ATGCCAGCCC AGCTAATGTT 4081 TACTGAGCCT AGGATACTGT ATACCAAGCC CTGTGCAAGG AGAAGCTGCA TGTTATTCC // LOCUS HUMCG1301 70 bp ds-DNA PRI 15-MAR-1989 DEFINITION Human collagen alpha-1 chain (type XIII) gene, exon 11. ACCESSION M20795 J04085 KEYWORDS alternative splicing; collagen. SEGMENT 1 of 11 SOURCE Human DNA, clones cos[D2,D3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 70) AUTHORS Tikka,L., Pihlajaniemi,T., Henttu,P., Prockop,D.J. and Tryggvason,K. TITLE Unique gene structure for the alpha-1 chain of a human short-chain collagen (type XIII) with alternatively spliced transcripts and translation termination codon at the 5' end of the last exon JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 7491-7495 (1988) STANDARD full staff_entry COMMENT Draft entry and computer readable copy of sequence [1] kindly submitted by L.Tikka, 28-SEP-1988. FEATURES from to/span description pept / 16 + 60 collagen (type XIII) alpha-1 chain exon 11 (AA at 16) /nomgen="COL13A1" /hgml_locus_uid="LF00325" pep$ / 16 + 60 C collagen (type XIII) alpha-1 chain exon 11 (AA at 16) pep$ / 16 + 60 D collagen (type XIII) alpha-1 chain exon 11 (AA at 16) IVS < 1 15 Intron A IVS 61 > 70 Intron B BASE COUNT 23 A 11 C 28 G 8 T ORIGIN Chromosome 10q21.3-q22.2. 1 GTTTTGTGCC CACAGGGAGA GAAAGGAGAA GCCGGGGAGA AGGGCAATCC AGGAGCAGAG 61 GTACATGAGA // LOCUS HUMCG1302 67 bp ds-DNA PRI 15-MAR-1989 DEFINITION Human collagen alpha-1 chain (type XIII) gene, exon 10. ACCESSION M20805 J04085 KEYWORDS alternative splicing; collagen. SEGMENT 2 of 11 SOURCE Human DNA, clones cos[D2,D3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 67) AUTHORS Tikka,L., Pihlajaniemi,T., Henttu,P., Prockop,D.J. and Tryggvason,K. TITLE Unique gene structure for the alpha-1 chain of a human short-chain collagen (type XIII) with alternatively spliced transcripts and translation termination codon at the 5' end of the last exon JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 7491-7495 (1988) STANDARD full staff_entry COMMENT Draft entry and computer readable copy of sequence [1] kindly submitted by L.Tikka, 28-SEP-1988. FEATURES from to/span description pept + 16 + 57 collagen (type XIII) alpha-1 chain exon 10 /nomgen="COL13A1" /hgml_locus_uid="LF00325" pep$ / 16 + 57 A collagen (type XIII) alpha-1 chain exon 10 pep$ / 16 + 57 B collagen (type XIII) alpha-1 chain exon 10 IVS < 1 15 Intron B IVS 58 > 67 Intron C BASE COUNT 7 A 20 C 22 G 18 T ORIGIN Chromosome 10q21.3-q22.2; about 4.7 kb downstream of segment 1. 1 CCCCTTTGCT TTTAGGTTCC TGGGCTGCTA GGGCCAGAGG GGCCTCCCGG ACCTCCGGTA 61 AGTTTGG // LOCUS HUMCG1303 52 bp ds-DNA PRI 15-MAR-1989 DEFINITION Human collagen alpha-1 chain (type XIII) gene, exon 9. ACCESSION M20802 J04085 KEYWORDS alternative splicing; collagen. SEGMENT 3 of 11 SOURCE Human DNA, clones cos[D2,D3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 52) AUTHORS Tikka,L., Pihlajaniemi,T., Henttu,P., Prockop,D.J. and Tryggvason,K. TITLE Unique gene structure for the alpha-1 chain of a human short-chain collagen (type XIII) with alternatively spliced transcripts and translation termination codon at the 5' end of the last exon JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 7491-7495 (1988) STANDARD full staff_entry COMMENT Draft entry and computer readable copy of sequence [1] kindly submitted by L.Tikka, 28-SEP-1988. FEATURES from to/span description pept + 16 + 42 collagen (type XIII) alpha-1 chain exon 9 /nomgen="COL13A1" /hgml_locus_uid="LF00325" pep$ + 16 + 42 A collagen (type XIII) alpha-1 chain exon 9 pep$ + 16 + 42 B collagen (type XIII) alpha-1 chain exon 9 pep$ + 16 + 42 C collagen (type XIII) alpha-1 chain exon 9 pep$ + 16 + 42 D collagen (type XIII) alpha-1 chain exon 9 IVS < 1 15 Intron C IVS 43 > 52 Intron C BASE COUNT 16 A 11 C 15 G 10 T ORIGIN Chromosome 10q21.3-q22.2; about 1 kb downstream of segment 2. 1 TATTCCAAAT GCCAGGGGCT CCAAGGTGTT CCTGGACCAA AGGTAAGAGA AG // LOCUS HUMCG1304 106 bp ds-DNA PRI 15-MAR-1989 DEFINITION Human collagen alpha-1 chain (type XIII) gene, exon 8. ACCESSION M20801 J04085 KEYWORDS alternative splicing; collagen. SEGMENT 4 of 11 SOURCE Human DNA, clones cos[D2,D3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 106) AUTHORS Tikka,L., Pihlajaniemi,T., Henttu,P., Prockop,D.J. and Tryggvason,K. TITLE Unique gene structure for the alpha-1 chain of a human short-chain collagen (type XIII) with alternatively spliced transcripts and translation termination codon at the 5' end of the last exon JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 7491-7495 (1988) STANDARD full staff_entry COMMENT Draft entry and computer readable copy of sequence [1] kindly submitted by L.Tikka, 28-SEP-1988. FEATURES from to/span description pept + 16 + 96 collagen (type XIII) alpha-1 chain exon 8 /nomgen="COL13A1" /hgml_locus_uid="LF00325" pep$ + 16 + 96 A collagen (type XIII) alpha-1 chain exon 8 pep$ + 16 + 96 B collagen (type XIII) alpha-1 chain exon 8 pep$ + 16 + 96 C collagen (type XIII) alpha-1 chain exon 8 pep$ + 16 + 96 D collagen (type XIII) alpha-1 chain exon 8 IVS < 1 15 Intron C IVS 97 > 106 Intron D BASE COUNT 31 A 28 C 32 G 15 T ORIGIN Chromosome 10q21.3-q22.2; about 2 kb downstream of segment 3. 1 CTCCCTTTCC TCCAGGGGGA AGCAGGACTA GACGGAGCAA AAGGAGAGAA AGGCTTCCAG 61 GGAGAAAAAG GAGACCGTGG TCCCCTGGGA CTACCCGTAA CTACCT // LOCUS HUMCG1305 60 bp ds-DNA PRI 15-MAR-1989 DEFINITION Human collagen alpha-1 chain (type XIII) gene, exon 7. ACCESSION M20800 J04085 KEYWORDS alternative splicing; collagen. SEGMENT 5 of 11 SOURCE Human DNA, clones cos[D2,D3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 60) AUTHORS Tikka,L., Pihlajaniemi,T., Henttu,P., Prockop,D.J. and Tryggvason,K. TITLE Unique gene structure for the alpha-1 chain of a human short-chain collagen (type XIII) with alternatively spliced transcripts and translation termination codon at the 5' end of the last exon JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 7491-7495 (1988) STANDARD full staff_entry COMMENT Draft entry and computer readable copy of sequence [1] kindly submitted by L.Tikka, 28-SEP-1988. FEATURES from to/span description pept + 16 + 51 collagen (type XIII) alpha-1 chain exon 7 /nomgen="COL13A1" /hgml_locus_uid="LF00325" pep$ + 16 + 51 A collagen (type XIII) alpha-1 chain exon 7 pep$ + 16 + 51 C collagen (type XIII) alpha-1 chain exon 7 IVS < 1 15 Intron D IVS 52 > 60 Intron E BASE COUNT 6 A 18 C 24 G 12 T ORIGIN Chromosome 10q21.3-q22.2; about 0.8 kb downstream of segment 4. 1 GCGCCCTTCG TCCAGGGAGC TTCAGGTTTG GACGGCAGGC CTGGGCCTCC GGTGAGTGCG // LOCUS HUMCG1306 69 bp ds-DNA PRI 15-MAR-1989 DEFINITION Human collagen alpha-1 chain (type XIII) gene, exon 6. ACCESSION M20799 J04085 KEYWORDS alternative splicing; collagen. SEGMENT 6 of 11 SOURCE Human DNA, clones cos[D2,D3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 69) AUTHORS Tikka,L., Pihlajaniemi,T., Henttu,P., Prockop,D.J. and Tryggvason,K. TITLE Unique gene structure for the alpha-1 chain of a human short-chain collagen (type XIII) with alternatively spliced transcripts and translation termination codon at the 5' end of the last exon JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 7491-7495 (1988) STANDARD full staff_entry COMMENT Draft entry and computer readable copy of sequence [1] kindly submitted by L.Tikka, 28-SEP-1988. FEATURES from to/span description pept + 16 + 69 collagen (type XIII) alpha-1 chain exon 6 /nomgen="COL13A1" /hgml_locus_uid="LF00325" pep$ + 16 + 69 A collagen (type XIII) alpha-1 chain exon 6 pep$ + 16 + 69 B collagen (type XIII) alpha-1 chain exon 6 pep$ + 16 + 69 C collagen (type XIII) alpha-1 chain exon 6 pep$ + 16 + 69 D collagen (type XIII) alpha-1 chain exon 6 IVS < 1 15 Intron E BASE COUNT 15 A 20 C 21 G 13 T ORIGIN Chromosome 10q21.3-q22.2; about 4.8 kb downstream of segment 5. 1 TTCTTTCCCT TCCAGGGTAC TCCAGGACCA ATTGGAGTTC CAGGCCCAGC GGGACCAAAG 61 GGCGAGAGG // LOCUS HUMCG1307 64 bp ds-DNA PRI 15-MAR-1989 DEFINITION Human collagen alpha-1 chain (type XIII) gene, exon 5. ACCESSION M20798 J04085 KEYWORDS alternative splicing; collagen. SEGMENT 7 of 11 SOURCE Human DNA, clones cos[D2,D3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 64) AUTHORS Tikka,L., Pihlajaniemi,T., Henttu,P., Prockop,D.J. and Tryggvason,K. TITLE Unique gene structure for the alpha-1 chain of a human short-chain collagen (type XIII) with alternatively spliced transcripts and translation termination codon at the 5' end of the last exon JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 7491-7495 (1988) STANDARD full staff_entry COMMENT Draft entry and computer readable copy of sequence [1] kindly submitted by L.Tikka, 28-SEP-1988. FEATURES from to/span description pept + 1 + 54 collagen (type XIII) alpha-1 chain exon 5 /nomgen="COL13A1" /hgml_locus_uid="LF00325" pep$ + 1 + 54 A collagen (type XIII) alpha-1 chain exon 5 pep$ + 1 + 54 B collagen (type XIII) alpha-1 chain exon 5 pep$ + 1 + 54 C collagen (type XIII) alpha-1 chain exon 5 pep$ + 1 + 54 D collagen (type XIII) alpha-1 chain exon 5 IVS 55 > 64 Intron G BASE COUNT 15 A 16 C 23 G 10 T ORIGIN Chromosome 10q21.3-q22.2; about 5 kb downstream of segment 6. 1 GGCAGCAAAG GAGACCCTGG GATGACAGGA CCAACGGGAG CAGCTGGGCT TCCTGTGAGT 61 CTCT // LOCUS HUMCG1308 51 bp ds-DNA PRI 15-MAR-1989 DEFINITION Human collagen alpha-1 chain (type XIII) gene, exon 4. ACCESSION M20797 J04085 KEYWORDS alternative splicing; collagen. SEGMENT 8 of 11 SOURCE Human DNA, clones cos[D2,D3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 51) AUTHORS Tikka,L., Pihlajaniemi,T., Henttu,P., Prockop,D.J. and Tryggvason,K. TITLE Unique gene structure for the alpha-1 chain of a human short-chain collagen (type XIII) with alternatively spliced transcripts and translation termination codon at the 5' end of the last exon JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 7491-7495 (1988) STANDARD full staff_entry COMMENT Draft entry and computer readable copy of sequence [1] kindly submitted by L.Tikka, 28-SEP-1988. FEATURES from to/span description pept + 16 + 51 collagen (type XIII) alpha-1 chain exon 4 /nomgen="COL13A1" /hgml_locus_uid="LF00325" pep$ + 16 + 51 A collagen (type XIII) alpha-1 chain exon 4 pep$ + 16 + 51 B collagen (type XIII) alpha-1 chain exon 4 pep$ + 16 + 51 C collagen (type XIII) alpha-1 chain exon 4 pep$ + 16 + 51 D collagen (type XIII) alpha-1 chain exon 4 IVS < 1 15 Intron G BASE COUNT 12 A 13 C 16 G 10 T ORIGIN Chromosome 10q21.3-q22.2; about 1 kb downstream of segment 7. 1 TCTGATCTCT TGCAGGGTTT ACATGGACCA CCCGGGGACA AGGGAAACCG G // LOCUS HUMCG1309 112 bp ds-DNA PRI 15-MAR-1989 DEFINITION Human collagen alpha-1 chain (type XIII) gene, exon 3. ACCESSION M20796 J04085 KEYWORDS alternative splicing; collagen. SEGMENT 9 of 11 SOURCE Human DNA, clones cos[D2,D3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 112) AUTHORS Tikka,L., Pihlajaniemi,T., Henttu,P., Prockop,D.J. and Tryggvason,K. TITLE Unique gene structure for the alpha-1 chain of a human short-chain collagen (type XIII) with alternatively spliced transcripts and translation termination codon at the 5' end of the last exon JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 7491-7495 (1988) STANDARD full staff_entry COMMENT Draft entry and computer readable copy of sequence [1] kindly submitted by L.Tikka, 28-SEP-1988. FEATURES from to/span description pept + 16 + 102 collagen (type XIII) alpha-1 chain exon 3 /nomgen="COL13A1" /hgml_locus_uid="LF00325" pep$ + 16 + 102 A collagen (type XIII) alpha-1 chain exon 3 pep$ + 16 + 102 B collagen (type XIII) alpha-1 chain exon 3 pep$ + 16 + 102 C collagen (type XIII) alpha-1 chain exon 3 pep$ + 16 + 102 D collagen (type XIII) alpha-1 chain exon 3 IVS < 1 15 Intron H IVS 103 > 112 Intron I BASE COUNT 32 A 23 C 39 G 18 T ORIGIN Chromosome 10q21.3-q22.2; about 2.5 kb downstream of segment 8. 1 AACCATTATT TACAGGGGGA GAGGGGGAAG AAAGGCTCTA GAGGGCCTAA AGGGGACAAG 61 GGAGACCAAG GAGCGCCTGG ATTAGATGCC CCCTGCCCAT TGGTAAGGCT TC // LOCUS HUMCG1310 64 bp ds-DNA PRI 15-MAR-1989 DEFINITION Human collagen alpha-1 chain (type XIII) gene, exon 2. ACCESSION M20804 J04085 KEYWORDS alternative splicing; collagen. SEGMENT 10 of 11 SOURCE Human DNA, clones cos[D2,D3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 64) AUTHORS Tikka,L., Pihlajaniemi,T., Henttu,P., Prockop,D.J. and Tryggvason,K. TITLE Unique gene structure for the alpha-1 chain of a human short-chain collagen (type XIII) with alternatively spliced transcripts and translation termination codon at the 5' end of the last exon JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 7491-7495 (1988) STANDARD full staff_entry COMMENT Draft entry and computer readable copy of sequence [1] kindly submitted by L.Tikka, 28-SEP-1988. FEATURES from to/span description pept + 16 + 54 collagen (type XIII) alpha-1 chain exon 2 /nomgen="COL13A1" /hgml_locus_uid="LF00325" pep$ + 16 + 54 A collagen (type XIII) alpha-1 chain exon 2 pep$ + 16 + 54 B collagen (type XIII) alpha-1 chain exon 2 pep$ + 16 + 54 C collagen (type XIII) alpha-1 chain exon 2 pep$ + 16 + 54 D collagen (type XIII) alpha-1 chain exon 2 IVS < 1 15 Intron I IVS 55 > 64 Intron J BASE COUNT 14 A 18 C 17 G 15 T ORIGIN Chromosome 10q21.3-q22.2; about 5 kb downstream of segment 9. 1 TCCTTTCTCT TGCAGGGCCA AGATGGCTAC CCAGTCCAAG GCTGCTGGAA CAAGGTAAGG 61 CTTC // LOCUS HUMCG1311 244 bp ds-DNA PRI 15-MAR-1989 DEFINITION Human collagen alpha-1 chain (type XIII) gene, exon 1. ACCESSION M20803 J04085 KEYWORDS alternative splicing; collagen. SEGMENT 11 of 11 SOURCE Human DNA, clones cos[D2,D3]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 244) AUTHORS Tikka,L., Pihlajaniemi,T., Henttu,P., Prockop,D.J. and Tryggvason,K. TITLE Unique gene structure for the alpha-1 chain of a human short-chain collagen (type XIII) with alternatively spliced transcripts and translation termination codon at the 5' end of the last exon JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 7491-7495 (1988) STANDARD full staff_entry COMMENT Draft entry and computer readable copy of sequence [1] kindly submitted by L.Tikka, 28-SEP-1988. FEATURES from to/span description pept + 16 18 collagen (type XIII) alpha-1 chain exon 1 /nomgen="COL13A1" /hgml_locus_uid="LF00325" pep$ + 16 18 A collagen (type XIII) alpha-1 chain exon 1 pep$ + 16 18 B collagen (type XIII) alpha-1 chain exon 1 pep$ + 16 18 C collagen (type XIII) alpha-1 chain exon 1 pep$ + 16 18 D collagen (type XIII) alpha-1 chain exon 1 IVS < 1 15 Intron J BASE COUNT 66 A 35 C 42 G 101 T ORIGIN Chromosome 10q21.3-q22.2; about 2.2 kb downstream of segment 10. 1 GTTTTTGTTT TGCAGTGATG CCTCTAACCT TGGATTGGCC TGTGTGTGTG TTTGTACATA 61 GAATATTTAT TTTTATACAG TTTTCACTTT TTGAAAATGC CAGAAGTATG ATGCATCTTA 121 CAGATTATTA AAAAAGAAAG AAAAACCGTT GCATATTTTG TACAGAAAAT ATCAACCTCT 181 TCCCTTTTGT TTACAAGATG TTTTGTATAA GCCTATGTCT CTAGTACATT TTTTGTTTGG 241 TCGT // LOCUS HUMCG1A10 1946 bp ds-DNA PRI 01-SEP-1988 DEFINITION Human alpha-1 type I collagen gene surrounding osteogenesis imperfecta OI type II deletion. ACCESSION M11162 KEYWORDS . SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1946) AUTHORS Barsh,G.S., Roush,C.L., Bonadio,J., Byers,P.H. and Gelinas,R.E. TITLE Intron-mediated recombination may cause a deletion in an alpha-1 type I collagen chain in a lethal form of osteogenesis imperfecta JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 2870-2874 (1985) STANDARD simple staff_review FEATURES from to/span description pept / 86 139 alpha-1 type I collagen protein, exon 30 (AA at 86) /nomgen="COL1A1" /map="17q21.3-q22" /hgml_locus_uid="LG0047H" 265 363 alpha-1 type I collagen protein, exon 29 524 577 alpha-1 type I collagen protein, exon 28 668 766 alpha-1 type I collagen protien, exon 27 1670 1723 alpha-1 type I collagen protein, exon 26 1867 / 1920 alpha-1 type I collagen protein, exon 25 IVS 140 264 intron A IVS 364 523 intron B IVS 578 667 intron C IVS 767 1669 intron D IVS 1724 1866 intron E mut 222 872 deleted in osteogenesis imperfecta phenotype BASE COUNT 390 A 598 C 542 G 416 T ORIGIN 1 CCCTGCCCGC CCCCTCCCGC TCCACCCTCA TTGCCTGGCT GGTGCCTGTG TGTCGCGGAG 61 TCACTGGCCT CCTCTCCTCC TGCAGGGTGG ACCTGGTAGC CGTGGTTTCC CTGGCGCAGA 121 TGGTGTTGCT GGTCCCAAGG TAACCTCTCC TTGCGGCCGG GGCTGACCCT GCCGCTCCCT 181 GGGCATCTTC TTCTTCCTCT TTTGGCCCGT GGCAAAGAGC CACAAACTTG AGACCCTAAC 241 TGTTCCTGTG ACTTCCCCAA CCAGGGTCCC GCTGGTGAAC GTGGTTCTCC TGGCCCCGCT 301 GGCCCCAAAG GATCTCCTGG TGAAGCTGGT CGTCCCGGTG AAGCTGGTCT GCCTGGTGCC 361 AAGGTGAGGC ACCCGGTTTC ACTGGCTTGG CCAGGGCCCT GACCATCCCG TGTAGGTCTG 421 GATGAGGCGT TCTGGATCAG GCCAAGGTCT GCCCTCTGGA GGTCCTCCCC CACCTCCATC 481 ATGCTTCTCC CCAAGTCCCA CTCATACCTC TCTGCCTCCC TAGGGTCTGA CTGGAAGCCC 541 TGGCAGCCCT GGTCCTGATG GCAAAACTGG CCCCCCTGTA ATATCACTCC CCCTGAACCC 601 CCTGGCCATG TCCTGTCTGC CTCCCTGCTG TCCTCACTGC TGCTTTCGTG CCGTGCCCTA 661 TCCTTAGGGT CCCGCCGGTC AAGATGGTCG CCCCGGACCC CCAGGCCCAC CTGGTGCCCG 721 TGGTCAGGCT GGTGTGATGG GATTCCCTGG ACCTAAAGGT GCTGCTGTGA GTATTAGAGT 781 GAGGATGCCA TGAAGGAGCC GAGGGACAAA CGACAGCCTA GACGTGAAGG ATCCTGGGCC 841 TCTGGGCTCA GCTGTGTCCG CTGACCTGCG TGTGGCCACT CACTCTCACT TTTCTGGACC 901 TCAGCCTCCC TTATCTGTAA AATGAAAGCA CTTCTCGGCG GGGCACGGTG CTCATGCCTG 961 TAATCCCAGC ACTTTGGGAG GCCAAGGCGG GCAGACCATG AGGTCAGGAG TTTGAGACCA 1021 GTCGGGCCAA CATAGTGAAA CCACGTCTCT ACTAAAAATA CAAAAGATTA GCTGGGTGTG 1081 GTGGTGTGCA CCTGTAACCC CAGCTAGTCA GGAGGCTAGG GCAGGAGAAT TGCATGAACC 1141 CGGGAGGTGG AGGTTGCAGT GAGCTGAGAT CACGCCATTG CACTCCAGCC TGGGCAACAG 1201 TGCAGAGTTC CATCTCAAAA AAAAAAAAAA AAAAGAAGAA AGAAAGAAAG AAAAAATGAA 1261 ACACTTCTCC AGGCTCCATG ACCACTGCTC TGTCCGTGGA AATAAGTGTT GTTGTGGCCC 1321 TCCACCCCGA CACGTGGGCA TAGGACAGGC CTTTGATATG ATAGGCACCC CCAGTCTTGG 1381 TGGATTCTTT GAGGTCCAAA AGGAGATAGC AGAGAAGAAT GAAACCCTTT GCATGCAGGC 1441 CACACGGCAT CTAACAGGAA AAGCAAGGAG CCTGGAAGGG CATCTTGGGA GGAGTGGGCT 1501 CAGAAAGGGC CCAGCAAGAA GCACCTGCAG GGGCATTCCC CGGGGGCCAA AGCAGGTCTT 1561 TTGAAAAGGA AAGGTCCCTT AAAAAGGTCC CACTCAGAGT CAAATGAGAG GCCCAGAGGC 1621 CCTGGCTTCT CACTTCAGCC CCCTCAACCC TAACTCCCTT TCTCCACAGG GAGAGCCCGG 1681 CAAGGCTGGA GAGCGAGGTG TTCCCGGACC CCCTGGCGCT GTCGTAAGTA TCTCCTTTCC 1741 ATCCCTACCT CCTTCCGATT GCTGCCCCGG CACTTTCGTC TCCCTGCAGG AGGGGTGCTA 1801 GAGGCCACGG TCCTCAGCTG CTCGGGGCCT CCTAACCCTG AGTTCCCCTT TGCTCTCTCC 1861 CTGCAGGGTC CTGCTGGCAA AGATGGAGAG GCTGGAGCTC AGGGACCCCC TGGCCCTGCT 1921 GTGAGTGTCC GCTGATGGGA AGATCT // LOCUS HUMCG1A1P 1950 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human alpha-1(I) collagen gene, exons 1 and 2 (partial). ACCESSION J02829 KEYWORDS collagen. SOURCE Human DNA, clones pAI-1 and pRB-alpha-1. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1950) AUTHORS Rossouw,C.M.S., Vergeer,W.P., du Plooy,S.J., Bernard,M.P., Ramirez,F. and de Wet,W.J. TITLE DNA sequences in the first intron of the human pro-alpha-1(I) collagen gene enhance transcription JOURNAL J. Biol. Chem. 262, 15151-15157 (1987) STANDARD full staff_entry FEATURES from to/span description pept 370 472 alpha-1(I) collagen, exon 1 /nomgen="COL1A1" /map="17q21.3-q22" /hgml_locus_uid="LG0047H" 1920 > 1950 alpha-1(I) collagen, exon 2 pre-msg 251 > 1950 col mRNA and introns IVS 473 1919 col intron A BASE COUNT 362 A 562 C 665 G 360 T 1 others ORIGIN 236 bp upstream of SmaI site; chromosome 17q21.3-q22. 1 GGGGCACCCC TACCCACTGG TTAGCCCACG CCATCCTGAG GACCCAGCTG CACCCCTACC 61 ACAGCACCTC GGGCCTAGGC TGGGCGGGGG GCTGGGGAGG CAGAGCTGCG AAGAGGGGAG 121 ATGTGGGGTG GACTCCCTTC CCTCCTCCTC TTGGCTGGGG CACGGGCGGC CGGCTCCCCC 181 TCTCCGAGGG GCAGGGTTCC TCCCTGCTCT CCATCAGGAC AGTATAAAAG GGGCCCGGGC 241 CAGTCGTCGG AGCAGACGGG AGTTTCTCCT CGGGGTCGGA GCAGGAGGCA CGCGGAGTGT 301 GAGGCCACGC ATGAGCGGAC GCTAACCCCC TCCCCAGCCA CAAAGAGTCT ACATGTCTAG 361 GGTCTAGACA TGTTCAGCTT TGTGGACCTC CGGCTCCTGC TCCTCTTAGC GGCCACCGCC 421 CTCCTGACGC ACGGCCAAGA GGAAGGCCAA GTCGAGGGCC AAGACGAAGA CAGTAAGTCC 481 CAAACTTTTG GGAGTGCAAG GATACTCTAT ATCGCGCCTT GCGCTGGTCC CGGGGGCCGC 541 GGCTTAAAAC GAGACGTGGA TGATCCGGAG ACTCGGGAAT GGAAGGGAGA TGATGAGGGC 601 TCTTCCTCGG CGCCCTGAGA CAGGAGGGAG CTCACCCTGG GGCGAGGTTG GGGTTGAACG 661 CGCCCCGGGA GCGGGAGGTG AGGGTGGAGC GCCCCGTGAG TTGGTGCAAG AGAGAATCCC 721 GAGAGCGCAA CCGGGGAAGT GGGGATCCGG GTGCAGAGTG AGGAAAGTAC GTCGAAGATG 781 GGATGGGGGC GCCGAGCGGG GCATTTGAAG CCCAAGATGT AGAAGCAATC AGGAAGGCCG 841 TGGGATGATT CATAAGGAAA GATTGCCCTC TCTGCGGGCT AGAGTGTTGC TGGGCCGTGG 901 GGGTGCTGGG CAGCCGCGGG AAGGGGGTGC GGAGCGTGGG CGGGTGGAGG ATGAGAAACT 961 TTGGCGCGGA CTCGGCGGGG CGGGGTCCTT GCGCCCCCTG CTGACCGATG CTGAGCACTG 1021 CGTCTCCCGG TCCAACGCTT ACTGGGGCAG GAGCCGGAGC GGGAAGACCC GGGTTATTGC 1081 TGGGTGCGGA CCCCCACCTC TAGATCTGGA AAGTAAAGCC AGGGATGGGG CAGCCCAAGC 1141 CTCTTAAAGA GGTAGTCGGG CCGGTGAGGT CGGCCCCGCC CCGGCCCCAT TGCTTAGCGT 1201 TGCCCGACAC CTAGTGGCCG TCTGGGGAGC CGCTAGCGCG GTGGGAGTGG TTAGCTAACT 1261 TCTGGACTAT TTGCGGACTT TTTGGTTCTT TGGCTAAAAG TGATCTGGAG GCATTGGCTG 1321 GCTTTGGGGG ACGGGGACGG CCCCGAGAGC GGGCTTTTAA GATGTCTAGG GCTGGAGGTT 1381 AGGGTGTCTC CTAATTTTGA GGTACATTTC AAGGGGGGGC CTCCCTTCCA ATCAGCCGCT 1441 CCCATTCTCC TAGCCCCGCC CCCGCCACCC CACCTGCCCA GGGAATGGGG GCGGGATGAG 1501 GGCCTGGACC TTCCCTTCTC TCCTCCCTCG CCCTCCTCCT GTCTCTACCA CGCAGCCACT 1561 CCCCACGAGC CTGCCCTCCC GATGGGGCCC CTCCTATTCT CCCCCCGCCC TCCCCCTCTC 1621 ACCCTGTGGT TTTATTTCAC TTGGCTTCAG CGCCAATGGG CTGAGGTTGG AGTTGGAAGC 1681 CACCGCAGAC TAAGCTTAAA TTCCTGAGAA CTGGAAAGAG TTACAGCCTC CCTGGCCAGG 1741 CGCCTCGGCN GGTCACCCGC GCTGATGAGG AGCAGGCGAG CTTTTAAGGA TTTGAGGAAA 1801 GAAGAACGGG GGGAGGGGCG GGAAGTGAAA AATCCAAGTG CCTCTTAGAC CCGGGGGAAA 1861 GGTGGTTAAG CTGGGGGTTG CAGTCACTAC TGACAACGCC CCTCTTCCGC CTGTCCCAGT 1921 CCCACCAATC ACCTGCGTAC AGAACGGCCT // LOCUS HUMCG2A1 4845 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human alpha-1(II) collagen gene COL2A1, partial cds. ACCESSION J00116 KEYWORDS alpha-collagen; collagen. SOURCE Human DNA, clone CosHco1l. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1191) AUTHORS Weiss,E.H., Cheah,K.S.E., Grosveld,F.G., Dahl,H.-H.M., Solomon,E. and Flavell,R.A. TITLE Isolation and characterization of a human collagen alpha-1(I)-like gene from a cosmid library JOURNAL Nucleic Acids Res. 10, 1981-1994 (1982) STANDARD full staff_review REFERENCE 2 (bases 1 to 4845) AUTHORS Cheah,K.S.E., Stoker,N.G., Griffin,J.R., Grosveld,F.G. and Solomon,E. TITLE Identification and characterization of the human type II collagen gene (COL2A1) JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 2555-2559 (1985) STANDARD full staff_review COMMENT Printed copy of sequence in [1] kindly provided by E.Weiss, 03-MAY-1983. The DNA sequence of clone CosHco1l is highly homologous to chicken alpha-1(II) collagen. Therefore, CosHco1l probably contains the human alpha-1(II) collagen gene. The sequence in [2] is thought to represent about 15% of the estimated gene length and over 30% of the protein-encoding sequence. The entire gene may be about 30kb in length. Exons and introns are located by comparison with other collagen gene sequences and by the "a-g-g-t" splicing rule. A potential polyadenylation signal is located at positions 4806-4811. FEATURES from to/span description pept < 1 18 collagen alpha-1(II), exon 11 (AA at 1) /nomgen="COL2A1" /map="12q14.3" /hgml_locus_uid="LX0121B" 373 426 collagen alpha-1(II), exon 10 599 706 collagen alpha-1(II), exon 9 872 925 collagen alpha-1(II), exon 8 1107 1214 collagen alpha-1(II), exon 7 1459 1512 collagen alpha-1(II), exon 6 1956 2063 collagen alpha-1(II), exon 5 2421 2709 collagen alpha-1(II), exon 4 3164 3351 collagen alpha-1(II), exon 3 3695 3937 collagen alpha-1(II), exon 2 4473 4619 collagen alpha-1(II), exon 1 IVS 19 372 COL2A1 intron J IVS 427 598 COL2A1 intron I IVS 707 871 COL2A1 intron H IVS 926 1106 COL2A1 intron G IVS 1215 1458 COL2A1 intron F IVS 1513 1955 COL2A1 intron E IVS 2064 2420 COL2A1 intron D IVS 2710 3163 COL2A1 intron C IVS 3352 3694 COL2A1 intron B IVS 3938 4472 COL2A1 intron A revision 766 766 c in [2]; t in [1] revision 776 776 c in [2]; t in [1] revision 806 806 t in [2]; c in [1] revision 837 838 gc in [2]; ct in [1] revision 840 842 ttt in [2]; tt in [1] revision 847 850 gacg in [2]; agc in [1] revision 944 944 t in [2]; c in [1] revision 949 949 t in [2]; c in [1] revision 1018 1018 t in [2]; c in [1] revision 1052 1052 g in [2]; c in [1] revision 1086 1086 c in [2]; g in [1] revision 1190 1190 t in [2]; c in [1] BASE COUNT 1010 A 1346 C 1436 G 1053 T ORIGIN 719 bp upstream of EcoRI site; chromosome 12q14.3. 1 GGTGAACCTG GACGAGAGGT GAGCAGTGAG ACCCCCTGGG GTGGCCCTGA TTGGGGAGAG 61 GGGCCCTGTG AGTCTCTGTG CTGGGTCAGC AAGGACAAGC CCCAGTCAGG GCCTCGGAGA 121 AGGGGGCGGC AGCGCTGGCC GACAGGCGAA AGCCTAGGTA CAATGGGAAG GTTGTCGGGG 181 AGAGAGACGG GCATAGAGAC CAAGGGCTGC TTCTGGAAGG AGGAGGGAAA CTTGGTGAGG 241 AAACTTTGGC TTCAAAGTGT GAGTGAGTTG GGCAGAAGAG GAGAGGCCTG GGCTTCTGAG 301 AGGGGCTGGG GGAGCAGAGG GGGAGGTGGA CAGAGGACAG CTCTAGGTGC GTTCTTGTTT 361 CACTTTGTCC AGGGAAGCCC CGGTGCTGAT GGCCCCCCTG GCAGAGATGG CGCTGCTGGA 421 GTCAAGGTGA GTGTCTGGTG TCTGTGTGTG CAGTGGGTTG GGGAGGACAT TGCCTCGGGC 481 CTGACAGGTC AGCTGGGGGT GGCAGGTTGG AACAAGTCTC ATCTCAGCCT AGAAGGACCT 541 TCTGTTCCTG TCTCTTCTGG AACATTCTTC TCTGAGCCTG AGACCTCTCT CCTGACAGGG 601 TAATCGTGGT GAAACCGGTG CTGTGGGAGC TCCTGGAACC CCTGGGCCCC CTGGCTCCCC 661 TGGCCCCGCT GGTCCAACTG GCAAGCAAGG AGACAGAGGA GAAGCTGTAA GTATCCTGGA 721 ATTCAGTAAA AGCCGCCTTC CCCTGCGCGG TGGGGCTGAG GCAGTCCCTG GGTTTCCGCA 781 GTCTCTGGAC TAAGGAGCAG TGGCCTCAGA TGCAGAGGAG GCCCCCACCT GTCCTGGCTT 841 TTCTCTGACG CTGCGCTCAC TCTCTCCTCA GGGTGCACAA GGCCCCATGG GACCCTCAGG 901 ACCAGCTGGA GCCCGGGGAA TCCAGGTGAG TATCCAAGTG TCCTGCACTG AGTCCCCACC 961 AGGGATAGGC TGGGAGGGCA GCCAGCCTCC AGGTGGTTCC TGGCCTCCAG CCCTGTGTTT 1021 CCGGGGATTC CTCAGCTTGG GTGGGACAGG AGGGGGCTCC TGTCCTGGCC CTGACCTGAC 1081 TCAATCGGTG TCTGTCTTGT TCCCAGGGTC CTCAAGGCCC CAGAGGTGAC AAAGGAGAGG 1141 CTGGAGAGCC TGGCGAGAGA GGCCTGAAGG GACACCGTGG CTTCACTGGT CTGCAGGGTC 1201 TGCCCGGCCC TCCTGTGAGT GTCACTGCCT GCGTGGGACT TCCCGAGGCC TCCTGCCACA 1261 CAGAGCCCAC TTGAGCTCCC TGTGCTGCCA GGACAGCTTG GGATCACCCT AAGCAGTTTC 1321 TAGGATTTCC TCAGGGCTGG AGGGAGGAGG AAGTGGAAAG GGAATGGGGC TGGGACATAA 1381 AGCTGTTCCC CCAGCTCCCA GAATATAGAT AGATATGTCT GTGCTGACCG TGGCCTTTTG 1441 CCTCTTCCTT CTACACAGGG TCCTTCTGGA GACCAAGGTG CTTCTGGTCC TGCTGGTCCT 1501 TCTGGCCCTA GAGTAAGTGA CATGGAGTTG GAAGATGGAG GGGGCCCTTC AGAGAGTGTG 1561 GGCCTGTGTT CCCATGGGGA GGGAAATGCT GCTGCTTCTG GGGAAGCTGT GGGCTCAGGG 1621 GTCCTCACTC AGTAATGGGG GCAGGACTGG CTCATGTGCC TATGGCCAGA AAAGCGCCTG 1681 AGGCCACAAT GGCTGTAAGA CAAACATGAA TCAGCCTCTC GCTGTCAGAC AGAACAGCAT 1741 TTTACAAAGA GGAGCTTAGG AGGGTAGGCA AGCCATGGAG CTATCCTGCT GGTTCTTGGC 1801 CAAATAGAGA CCAACTTAGG GTTCCATGAC TGAGCATGTG AAGAACTGGG GGCGGAGTGG 1861 CTGGTGCTAT CAGGACAGCC ACCTACCCAG CCCCAGCGAC TCCCCAGCCT TCCCTGTGGT 1921 GACCACTCTT TCCTCACGAC CTCTCTCTCT TGCAGGGTCC TCCTGGCCCC GTCGGTCCCT 1981 CTGGCAAAGA TGGTGCTAAT GGAATCCCTG GCCCCATTGG GCCTCCTGGT CCCCGTGGAC 2041 GATCAGGCGA AACCGGCCCT GCTGTAAGTG TCCTGACTCC TTCCCTGCTG TCGAGGTGTC 2101 CCTACCATCC GGGAGGCTTG AGCTCTTTTT TGCTCAGGGC CTCTTTTAGG GCATCAGCCT 2161 GCAGCTAACA GTGATGGCAT CCTTTATCCT GAGGTCTCCT CAGAGGTCAC AGGGCCCATG 2221 ATCAGTGCTG GGAAACTGAA GAGAAGGGCT AAGGAAGAAA TAGACATGGT GCTGTGGTTT 2281 CCTTGGTCCT CGCCTGCTAC ACCTCCGCCC CACCCATGGG GCTGGGAAGA GGGACACTCT 2341 AGTACATTCT AGCAAATGGG GATGGACATG GAGGGGCACT TTCACACAAT CCTGGCTGAT 2401 CTCTCTGTTT CCTGCTGCAG GGTCCTCCTG GAAATCCTGG ACCCCCTGGT CCTCCAGGTC 2461 CCCCTGGCCC TGGCATCGAC ATGTCCGCCT TTGCTGGCTT AGGCCCGAGA GAGAAGGGCC 2521 CCGACCCCCT GCAGTACATG CGGGCCGACC AGGCAGCCGG TGGCCTGAGA CAGCATGACG 2581 CCGAGGTGGA TGCCACACTC AAGTCCCTCA ACAACCAGAT TGAGAGCATC CGCAGCCCCG 2641 AGGGCTCCCG CAAGAACCCT GCTCGCACCT GCAGAGACCT GAAACTCTGC CACCCTGAGT 2701 GGAAGAGTGG TAAGCTTGGA GAACAGGATC CCCTGCCCCG GGAAGCAGGG AGTCATCCCT 2761 TAGGCCTAGC AGCAAGGGAG GAGATGCCCC CTAGTACAGG GCAGAGCTGG GCCTGGAAGT 2821 TTCCGCCAGA GGGTTCCTCT CTTATTTCAC AGCAGAGAAG CTGCAGCCCT GGCCCCTGTC 2881 CTGCCATGGC TACCTGGCCG AGGTGACCTC AGGGTGGACT CCATCCACCA GCTGGGCACT 2941 GCTTCTGCTC TCTTTGCATG TGTTCTTCCT TAGGGCTGGA CTTAGCTCAT GCAGATCTCC 3001 CTGCCCCTGC ATCCTCCCAG GTCCCCCTCC TTTCAGGCCA CATGTGAACC TCATCCCTTG 3061 TCCCTGTAGG CCTCTCTGTC TCTTTCAGTC AGGCCTGGGT CTCTCAAGCT TTTGTGTCTG 3121 TGCCTGTCTG AGCCCCCATG GGTGCTGCCT CTTCCCCCTG CAGGAGACTA CTGGATTGAC 3181 CCCAACCAAG GCTGCACCTT GGACGCCATG AAGGTTTTCT GCAACATGGA GACTGGCGAG 3241 ACTTGCGTCT ACCCCAATCC AGCAAACGTT CCCAAGAAGA ACTGGTGGAG CAGCAAGAGC 3301 AAGGAGAAGA AACACATCTG GTTTGGAGAA ACCATCAATG GTGGCTTCCA TGTGAGTACC 3361 TGGGTGCCCT AGATGATGAG CAGAGATGGC TCCTCAAACT CTTTCTTTTC TTTCTCCCTG 3421 GAAGCTTTTA GCACCTTCCC CATATTTTCC TCCAGTTTTC TGTTGGGCTT GAGAGGAGGG 3481 AAAGAGGAGG AAAAGTATTT TTTCCCCACG TGGAGGTGGG AAAAGAGGTC CTCTGAGCTT 3541 GCTCCACTCC TGGAAGCAAA AATGTCCAAC TAGCTCCCTG CTGCCCCAGT ACCCTTGAGG 3601 TCCTTGAACC ATGAACTCTT GGCAGCCCCT ACAGCCCCTG GTCCCATTGA ATGCCAGCTC 3661 CCAGGCCTCA CACTGCCGCT CTCTGCCCCA ACAGTTCAGC TATGGAGATG ACAATCTGGC 3721 TCCCAACACT GCCAACGTCC AGATGACCTT CCTACGCCTG CTGTCCACGG AAGGCTCCCA 3781 GAACATCACC TACCACTGCA AGAACAGCAT TGCCTATCTG GACGAAGCAG CTGGCAACCT 3841 CAAGAAGGCC CTGCTCATCC AGGGCTCCAA TGACGTGGAG ATCCGGGCAG AGGGCAATAG 3901 CAGGTTCACG TACACTGCCC TGAAGGATGG CTGCACGGTG AGTGGGGCTG CCAGAGAGAA 3961 GAGCTGCCTG TGCCCAAACT GCCTGGAGCA GGGCTGAGGG TTGGCCCGCG GCAGCTGTCA 4021 GGTCCTAAAG TGACAGGATC ATCAGAGGCA TGAGTTTGAG GGTCATGTAG AGAAGATAGG 4081 CTGAGTGACA GGTGAGAGAG AGGCACATAT CATTCCATCT TCTCCATTCC CCTGGCTCAG 4141 GGGAACAAAA CCCTACCTGG AACCCAGTGA CTACTGTAGA AGTGTTCTCG CAATGTGTAC 4201 AGGGTGAAGA AGCGGTCACA GGTTGGGAGC TCACTGTGGG GAGTGGGGAA GGAGGGGAAG 4261 GGCAGGGTGG AGAAGGGCCC TGCCGCTAAG GATAGGAGTT GAAGTGGAGA GGCCTTTGGC 4321 AAGCCAAGAA GAGGTCTCAG GAGCCCCCTC AGTGTGGTTC AACCTTGTGG GCTCTGATGC 4381 TCGCCAGTTT GTTCAGTTTT GGGCTTCTGG GCAGCTGGAA CTGGGTAGCA AGGCATCTAC 4441 TGAACAGAGC CTCCTCCTTT TTTCTCCCCT AGAAACATAC CGGTAAGTGG GGCAAGACTG 4501 TTATCGAGTA CCGGTCACAG AAGACCTCAC GCCTCCCCAT CATTGACATT GCACCCATGG 4561 ACATAGGAGG GCCCGAGCAG GAATTCGGTG TGGACATAGG GCCGGTCTGC TTCTTGTAAA 4621 AACCTGAACC CAGAAACAAC ACAATCCGTT GCAAACCCAA AGGACCCAAG TACTTTCCAA 4681 TCTCAGTCAC TCTAGGACTC TGCACTGAAT GGCTGACCTG ACCTGATGTC CATTCATCCC 4741 ACCCTCTCAC AGTTCGGACT TTTCTCCCCT CTCTTTCTAA GAGACCTGAA CTGGGCAGAC 4801 TGCAAAATAA AATCTCGGTG TTCTATTTAT TTATTGTCTT CCTGT // LOCUS HUMCG3B 940 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human chorionic gonadotropin beta-subunit (hCG-beta-3) gene, exon 1. ACCESSION M13504 KEYWORDS gonadotropin. SOURCE Human placental DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 940) AUTHORS Policastro,P.F., Daniels-McQueen,S., Carle,G. and Boime,I. TITLE A map of the hCG-beta-LH-beta gene cluster JOURNAL J. Biol. Chem. 261, 5907-5916 (1986) STANDARD simple staff_review FEATURES from to/span description pept 798 / 812 human chorionic gonadotropin beta-subunit, exon 1 /nomgen="CGB" /map="19q13.3" /hgml_locus_uid="LH0070L" pre-msg 442 > 940 CG mRNA and intron IVS 813 > 940 CG intron A BASE COUNT 176 A 301 C 289 G 174 T ORIGIN 137 bp upstream of SacII site; chromosome 19q13.3 1 CGCGGGCGCG GTGGAGCCAG CGGTCGCCTC TCCGTGCACA TACCCACACT GGAAGAACAA 61 GCCTTGGGGT TGCCGGCTTG ATGGCATCGC GGGGAAGGGA CTAAGTCCAG ATAATGTCCT 121 CCGAGGCTGC GCCCCGCGGG CAGGACACAC CCCCTGCGGG CCTATTCAAT AATCAGTTAA 181 ATCACCTGAA GCACACGCAT TTCCGGGGAC CGCTCCGGGC ATCTTGGCTT GAGGGTAGGG 241 TGGGCGGAGG TCCCTAAGGG AGAGGCGGGC TGAATCCCTC GTTGGGGGGC GCCAGGGTCA 301 AGTGGCTTCC CTGGCAGCAC AGTCACGGGG AGGCCCTCTC TCATTGGGCA GAAGCTAAGT 361 CCGAAGCCGC GCCCCTCCTG GGAGGTTGGA CTGTGGTGCA GGAAAGCCTC AAGTAGAGGA 421 GGGTTGAGGC TTCAGTCCAG CACTTTGCTC GGGTCACGGC CTCCTCTGGC TTCCAAGACC 481 CCACCATAGG CAGAGGCAGG CCTTCCTACA CCCTACTCCC TGTGCCTCCA GCCTCGACTA 541 GTCCCTAGCA CTCGACGACT GAGTCTCTGA GGTCACTTCA CCGTGGTCTC CGCCTCACCC 601 TTGGCGCTGG ACCACTGAGG GGAGAGGGCT GGGGCGCTCC GCTGAGCCAC TCCTGAGCCC 661 CCTGGCCTTG TCTACCTCTT GCCCCCCAAG GGGTTAGTGT CGAGCTCACC CCAGCATCCT 721 ACCACCTCCT GTGGCCTTGC CGCCCCCACA ACCCCGAGGT ATAAAGCCAG GTACACGAGG 781 CAGGGGACGC ACCAAGGATG GAGATGTTCC AGGTAAGACT GCAGGGCCCC TGGGCACCTT 841 CCACCTCCTT CCAGGCAATC ACTGGCATGA GAAGGGGTCA GACCCGTGTG AGCTGTGTAA 901 GGAGGCCTCT TTCTGGAGGA GCGTGACCCC CAGTAAGCTT // LOCUS HUMCG5B 2112 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human chorionic gonadotropin (HCG) gene 5, beta-subunit. ACCESSION X00265 J03720 X01148 KEYWORDS glycoprotein; gonadotropin. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 448 to 2112) AUTHORS Talmadge,K., Vamvakopoulos,N.C. and Fiddes,J.C. TITLE Evolution of the genes for the beta subunits of human chorionic gonadotropin and luteinizing hormone JOURNAL Nature 307, 37-40 (1984) STANDARD simple staff_review REFERENCE 2 (bases 360 to 637) AUTHORS Talmadge,K., Boorstein,W.R., Vamvakopoulos,N.C., Gething,M.-J. and Fiddes,J.C. TITLE Only three of the seven human chorionic gonadotropin beta subunit genes can be expressed in the placenta JOURNAL Nucleic Acids Res. 12, 8415-8436 (1984) STANDARD simple automatic REFERENCE 3 (bases 1 to 1048) AUTHORS Otani,T., Otani,F., Krych,M., Chaplin,D.D. and Boime,I. TITLE Identification of a promoter region in the CG-beta gene cluster JOURNAL J. Biol. Chem. 263, 7322-7329 (1988) STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [3] kindly provided by T. Otani, 26-APR-1988 FEATURES from to/span description pept 999 1013 beta-gonadotropin, exon 1 /nomgen="CGB" /map="19q13.3" /hgml_locus_uid="LH0070L" 1366 1533 beta-gonadotropin, exon 2 1768 2082 beta-gonadotropin, exon 3 sigp 999 1013 beta-gonadotropin signal peptide 1366 1410 beta-gonadotropin signal peptide matp 1411 1533 beta-gonadotropin 1768 2079 beta-gonadotropin pre-msg 634 2099 CGb mRNA and introns IVS 1014 1365 CGb intron A IVS 1534 1767 CGb intron B variant 975 975 c in DNA; g in cDNA BASE COUNT 387 A 638 C 671 G 416 T ORIGIN 446 bp upstream of MstII site; chromosome 19q13.3. 1 CTACAGAAGG CCTTTCAGTA TCTGGGAGCT GGGGTTCAAA TGAGAAATCT TACTTGGTGA 61 GAGCGGGCAG GGGTCGGCTT AGAATATTTT GTTTTGAGAT AATGAGCTAC CGATCACAGG 121 GGGAGTTTAA GCAAGGTTCA ATGAGAAGCG ATCAAGATGC TGCACAGTTC AGCCCTGGGT 181 GGGGAGCTCA AGTCAGGTTT CTAGCCCTCT TCCCTGTGCC AACCTATACC CTACATTGGG 241 AAAGAAACAG ACCTTAAAAT TGTCCAGCTT GATGGCATCG CGGGGAAGGG ACTAAGTCCA 301 GATAATGTCC TCTGAGGCTT CGGCCCCGTG GGCAGGACAC ACCTCCTGCG GGCCTATTCA 361 ATAATCAGTT AAATCACCTG AAGCACACGC ATTTCCGGGG ACCGCTCCGG GCATCCTGGC 421 TTGAGGGTAG AGTGGGCGGA GGTTCCTAAG GGAGAGGTGG GGCTCGGGCT GAATCCCTCG 481 TTGGGGGGCA TCTGGGTCAA GTGGCTTCCC TGGCAGCACA GTCACGGGGA GGCCCTCTCT 541 CATTGGGCAG AAGCTAAGTC CGAAGCCGCG CCCCTCCTGG GAGGTTGAAC TGTGGTGCAG 601 GAAAGCCTCA AGTAGAGGAG GGTTGAGGCT TCAATCCAGC ACTTTGCTCG GGTCACGGCC 661 TCCTCCTGGC TCCCAGGACC CCACCATAGG CAGAGGCAGG CCTTCCTACA CCCTACTCCC 721 TGTGCCTCCA GGCTCGACTA GTCCCTAGCA CTCGACGACT GAGTCTCTGA GGTCACTTCA 781 CCGTGGTCTC CGCCTCACCC TTGGCGCTGG ACCAGTGAGA GGAGAGGGCT GGGGCGCTCC 841 GCTGAGCCAC TCCTGCGCCC CCCTGGCCTT GTCTACCTCT TGCCCCCGAA GGGTTAGTGT 901 CGAGCTCACC CCAGCATCCT ACAACCTCCT GGTGGCCTTG CCGCCCCCAC AACCCCGAGG 961 TATAAAGCCA GGTACACCAG GCAGGGGACG CACCAAGGAT GGAGATGTTC CAGGTAAGAC 1021 TGCAGGGCCC CTGGGCACCT TCCACCTCCT TCCAGGCAAT CACTGGCATG AGAAGGGGCA 1081 GACCAGTGTG AGCTGTGGAA GGAGGCCTCT TTCTGGAGGA GCGTGACCCC CAGTAAGCTT 1141 CAGGTGGGGC AGTTCCTAAG GGTGGGGATC TGAAATGTTG GGGCATCTCA GGTCCTCTGG 1201 GCTGTGGGGT GGACTCTGAA AGGCAGGTGT CCGGGTGGTG GGTCCTGAAT AGGAGATGCC 1261 GGGAAGGGTC TCTGGGTCTT TGTGGGTGGT GTGCCACGTG GGATGGGAAG GCCGGGGCTC 1321 GGGGCTGCGG TCTCAGACCC GGGTGAAGCA GTGTCCTTGT CCCAGGGGCT GCTGCTGTTG 1381 CTGCTGCTGA GCATGGGCGG GACATGGGCA TCCAAGGAGC CGCTTCGGCC ACGGTGCCGC 1441 CCCATCAATG CCACCCTGGC TGTGGAGAAG GAGGGCTGCC CCGTGTGCAT CACCGTCAAC 1501 ACCACCATCT GTGCCGGCTA CTGCCCCACC ATGGTGAGCT GCCCGGGGCC GGGGCAGGTG 1561 CTGCCACCTC AGGGCCAGAC CCACAGAGGC AGCGGGGGAG GAAGGGTGGT CTGCCTCTCT 1621 GGTCAGGGGC TGCGGAATGG GGTGTGGGAG GGCAGGAACA GAGGGCTTCC CGGACCCCTG 1681 AGTCTGAGAC CTGTGGGGGC AACTGGGGAG CTCAGCTGAG GCGCTGGCCC AGGCACATGC 1741 TCATTCCCCC ACTCACACGG CTTCCAGACC CGCGTGCTGC AGGGGGTCCT GCCGGCCCTG 1801 CCTCAGGTGG TGTGCAACTA CCGCGATGTG CGCTTCGAGT CCATCCGGCT CCCTGGCTGC 1861 CCGCGCGGCG TGAACCCCGT GGTCTCCTAC GCCGTGGCTC TCAGCTGTCA ATGTGCACTC 1921 TGCCGCCGCA GCACCACTGA CTGCGGGGGT CCCAAGGACC ACCCCTTGAC CTGTGATGAC 1981 CCCCGCTTCC AGGACTCCTC TTCCTCAAAG GCCCCTCCCC CCAGCCTTCC AAGTCCATCC 2041 CGACTCCCGG GGCCCTCGGA CACCCCGATC CTCCCACAAT AAAGGCTTCT CAATCCGCAC 2101 TCTGGAGGTG TC // LOCUS HUMCG6BA 698 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human chorionic gonadotropin beta-subunit (hCG-beta-6) gene, exon 1. ACCESSION M13505 KEYWORDS gonadotropin. SOURCE Human placental DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 698) AUTHORS Policastro,P.F., Daniels-McQueen,S., Carle,G. and Boime,I. TITLE A map of the hCG-beta-LH-beta gene cluster JOURNAL J. Biol. Chem. 261, 5907-5916 (1986) STANDARD simple staff_review FEATURES from to/span description pept 557 / 571 human chorionic gonadotropin beta-subunit, exon 1 /nomgen="CGB" /map="19q13.3" /hgml_locus_uid="LH0070L" pre-msg 200 > 698 CG mRNA and intron IVS 572 > 698 CG intron A BASE COUNT 135 A 223 C 201 G 139 T ORIGIN Chromosome 19q13.3 1 TTCCTAAGGG AGAGGTGGGT GCTCGGGCTG AATCCCTCGT TGGGGGGCAT CTGGGTCAAG 61 TGGCTTCCCT GGCAGCACAG TCACGGGGAG ACCCTCTCTC ACTGGGCAGA AGCTAAGTCC 121 GAAGCCGCGC CCCTCCTGTT AGGTTGGACT GTGGTGCAGG AAAGGCTCAA GTAGAGGAGA 181 GTTGAGGCTT CAGTCCAGCA CTTTCCCGGG TCACGGCCTC CTCCTGGTTC CCAAGACCCC 241 ACCATAGGCA GAGGCAGGCC TTCCTACACC CTACTCTCTG TGCCTCCAGC CTCGACTAGT 301 CCCTAACACT CGACGACTGA GTCTCAGAGG TCACTTCACC GTGGTCTCCG CCTCATCCTT 361 GGCGCTAGAC CACTGAGGGG AGAGGACTGG GGTGCTCCGC TGAGCCACTC CTGTGCCTCC 421 CTGGCCTTGT CTACTTCTCG CCCCCCGAAG GGTTAGTGTC GAGCTCACTC CAGCATCCTA 481 CAACCTCCTG GGGCCTTGAC GCCCCCACAA CCCCGAGGTA TGAAGCCAGG TACACCAGGC 541 AGGGGACGCA CCAAGGATGG AGATGTTCCA GGTAAGACTG CAGGGCCCCT GGGCACCTTC 601 CACCTCCTTC CAGGCAATCA CTGGCATGAG AAGGGGCAGA CCAGTGTGAG CTGTGGAAGG 661 ACGCCTCTTT CTGGAGGAGC GTGACCCCCA ATAAGCTT // LOCUS HUMCG7B2 984 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human chorionic gonadotropin beta-subunit (hCG-beta-7) gene, exon 1. ACCESSION M13503 KEYWORDS gonadotropin. SEGMENT 2 of 2 SOURCE Human placental DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 984) AUTHORS Policastro,P.F., Daniels-McQueen,S., Carle,G. and Boime,I. TITLE A map of the hCG-beta-LH-beta gene cluster JOURNAL J. Biol. Chem. 261, 5907-5916 (1986) STANDARD simple staff_review FEATURES from to/span description pept 843 / 857 human chorionic gonadotropin beta-subunit, exon 1 /nomgen="CGB" /map="19q13.3" /hgml_locus_uid="LH0070L" pre-msg 485 > 984 CG mRNA and intron IVS 858 > 984 CG intron A BASE COUNT 191 A 313 C 297 G 183 T ORIGIN About 140 bp after segment 1; chromosome 19q13.3 1 CTCGCGCACG AGCCGGCGGG CGCAACCGGC GACGCATACC GGTGCCGACC AGGTGGCCGC 61 ACCATAAGTC ACCGCAGTGT GGAACAGACC AGCCCAGACA CGCCACTCAG CTGCGGCTGG 121 ATCGCGGGGA AGGGACTAAG TCCAGACAAT GTCCTCCGAG GCTGCGCCCC GCGGGCAGGA 181 CACACCTCCT GCGGGCCTAT TCAATAATCA GTTAAATCAC CTGAAGCACA CGCATTTCCG 241 GGGACCGCTC CGGGCATCTT GGCTTGAGGG TAGGGTGGGC GGAGGTTCCT AAGGGAGAGG 301 TGGGTGCTCG GGCTGAATCC CTCGTTGGGG GGCGTCTGGG TCAAGTGGCT TCCCTGGCAG 361 CACAGTCACG GGGAGACCCT CTCTCACTGG GCAGAAGCTA AGTCCGAAGC CGCGCCCCTC 421 CTGTTAGGTT GGACTGTGGT GCAGGAATGG CTCAAGTAGA GGAGGGTTGA GGCTTCAGTC 481 CAGCACCTTT CTCGGGTCAC GGCCTCCTCG TGGTTCCCAA GACCCCACCA TAGGCAGAGG 541 CAGGCCTTCC TACACCCTAC TCTCTGTGCC TCCAGCCTCG ACTAGTCCCT AGCACTCGAC 601 GACTGAGTCT CAGAGGTCAC TTCACCGTGG TCTCCGCCTC ATCCTTGGCG CTAGACCACT 661 GAGGGGAGAG GACTGGGGTG CTCCGCTGAG CCACTCCTGT GCCTCCCTGG CCTTGTCTAC 721 TTCTCGCCCC CCGAGGGGTT AGTGTCCAGC TCACTCCAGC ATCCTACAAC CTCCTGTGGC 781 CTTGACGTCC CCACAAACCC GAGGTATAAA GCCAGGTACA CGAGGCAGGG GACGCACCAA 841 GGATGGAGAT GTTCCAGGTA AGACTGCAGG GCCCCTGGGC ACCTTCCACC TCCTTCCAGG 901 CCATCACTGG CATGAGAAGG GGCAGACCAG TGTGAGCTGT GGAAGGACGC CTCTTTCTGG 961 AGGAGCGTGA CCCCCAATAA GCTT // LOCUS HUMCGBBA1 79 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human chorionic gonadotropin beta subunit gene, exon 1, clone CG-beta-a. ACCESSION K00092 KEYWORDS chorionic gonadotropin; glycoprotein; gonadotropin. SEGMENT 1 of 3 SOURCE Human DNA, libaray of T.maniatis, clone CG-beta-a. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 79) AUTHORS Policastro,P., Ovitt,C.E., Hoshina,M., Fukuoka,H., Boothby,M.R. and Boime,I. TITLE The beta subunit of human chorionic gonadotropin is encoded by multiple genes JOURNAL J. Biol. Chem. 258, 11492-11499 (1983) STANDARD simple staff_review FEATURES from to/span description pept 50 + 64 chorionic gonadotropin beta subunit, exon 1 /nomgen="CGB" /map="19q13.3" /hgml_locus_uid="LH0070L" pre-msg < 50 > 79 CGB mRNA IVS 65 > 79 CGB intron A BASE COUNT 26 A 22 C 22 G 9 T ORIGIN 75 bp upstream of PstI site; chromosome 19q13.3. 1 CCACAACCCG AGGTATAAGC CAGTACACCA AGCAGGGGAC GCACCAAGGA TGGAGATGTT 61 CCAAGTAAGA CTGCAGGCC // LOCUS HUMCGBBA2 203 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human chorionic gonadotropin beta subunit gene, exon 2, clone CG-beta-a. ACCESSION K03182 K00093 KEYWORDS chorionic gonadotropin; glycoprotein; gonadotropin. SEGMENT 2 of 3 SOURCE Human DNA, library of T.Maniatis, clone CG-beta-a. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 203) AUTHORS Policastro,P., Ovitt,C.E., Hoshina,M., Fukuoka,H., Boothby,M.R. and Boime,I. TITLE The beta subunit of human chorionic gonadotropin is encoded by multiple genes JOURNAL J. Biol. Chem. 258, 11492-11499 (1983) STANDARD full staff_review FEATURES from to/span description pept + 21 + 188 chorionic gonadotropin beta subunit, exon 2 /nomgen="CGB" /map="19q13.3" /hgml_locus_uid="LH0070L" pre-msg < 1 > 203 CGB mRNA and intron IVS < 1 20 CGB intron A IVS 189 > 203 CGB intron B BASE COUNT 33 A 66 C 66 G 38 T ORIGIN About 350 bp after segment 1; chromosome 19q13.3. 1 AAGCAGTGTC CTTGTCCCAG GGGCTGCTGC TGTTGCTGCT GCTGAGCATG GGCGGGACAT 61 GGGCATCCAA GGAGATGCTT CGGCCACGGT GCCGCCCCAT CAATGCCACC CTGGCTGTGG 121 AGAAGGAGGG CTGCCCCGTG TGCATCACCG TCAACACCAC CATCTGTGCC GGCTACTGCC 181 CCACCATGGT GAGCTGCCCG GGG // LOCUS HUMCGBBA3 468 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human chorionic gonadotropin beta subunit gene, exon 3, clone CG-beta-a. ACCESSION K03183 K00094 KEYWORDS chorionic gonadotropin; glycoprotein; gonadotropin. SEGMENT 3 of 3 SOURCE Human DNA, library of T.Maniatis, clone CG-beta-a. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 468) AUTHORS Policastro,P., Ovitt,C.E., Hoshina,M., Fukuoka,H., Boothby,M.R. and Boime,I. TITLE The beta subunit of human chorionic gonadotropin is encoded by multiple genes JOURNAL J. Biol. Chem. 258, 11492-11499 (1983) STANDARD full staff_review FEATURES from to/span description pept + 23 337 chorionic gonadotropin beta subunit, exon 3 /nomgen="CGB" /map="19q13.3" /hgml_locus_uid="LH0070L" pre-msg < 1 354 CGB mRNA and intron IVS < 1 22 CGB cds intron B BASE COUNT 75 A 189 C 118 G 84 T 2 others ORIGIN About 235 bp after segment 2; chromosome 19q13.3. 1 CTCCCACTCA CACGGCTTCC AGACCCGCGT GCTGCAGGGG GTCCTGCCGG CCCTGCCTCA 61 GGTGGTGTGC AACTACCGCG ATGTGCGCTT CGAGTCCATC CGGCTCCCTG GCTGCCCGCG 121 CGGCGTGAAC CCCGTGGTCT CCTACGCCGT GGCTCTCAGC TGTCAATGTG CACTCTGCCG 181 CCGCAGCACC ACTGACTGCG GGGGTCCCAA GGACCACCCC TTGACCTGTG ATGACCCCCG 241 CTTCCAGGCC TCCTCTTCCT CAAAGGCCCC TCCCCCCAGC CTTCCAAGTC CATCCCGACT 301 CCCGGGGCCC TCGGACACCC CGATCCTCCC ACAATAAAGG CTTCTCGGCC GCACTCTGGC 361 GGTGTCCTTC CGTGGGCCCA GGGCAACCAC ACACACACAG GGTGGGTCCA ACTNCCAANC 421 ATTTATACAA GCACGTACGA ACTCGTGAAC GGTGCGCGAC CGTGCCCC // LOCUS HUMCGBEL4 107 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human chorionic gonadotropin beta subunit gene, exon 1, clone CG-beta-e. ACCESSION K03187 K00098 KEYWORDS chorionic gonadotropin; glycoprotein; gonadotropin. SEGMENT 4 of 7 SOURCE Human DNA, library of T.Maniatis, clone CG-beta-e. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 107) AUTHORS Policastro,P., Ovitt,C.E., Hoshina,M., Fukuoka,H., Boothby,M.R. and Boime,I. TITLE The beta subunit of human chorionic gonadotropin is encoded by multiple genes JOURNAL J. Biol. Chem. 258, 11492-11499 (1983) STANDARD full staff_review FEATURES from to/span description pept 78 + 92 chorionic gonadotropin beta subunit, exon 1 /nomgen="CGB" /map="19q13.3" /hgml_locus_uid="LH0070L" pre-msg < 78 > 107 CGB mRNA and intron IVS 93 > 107 CGB intron A BASE COUNT 27 A 34 C 29 G 14 T 3 others ORIGIN About 8.1 kb after segment 3; 52 bp upstream of RsaI site. 1 CCACCTCCTA GTGGCCTTGC CGNCCCCACA ACCCCGAGGT ATAANGCCAN GTACACGAGG 61 CAGGGGACGC ACCAAGGATG GAGATGTTCC AAGTAAGACT GCAGGCC // LOCUS HUMCGBEL5 203 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human chorionic gonadotropin beta subunit gene, exon 2, clone CG-beta-e. ACCESSION K03188 K00099 KEYWORDS chorionic gonadotropin; glycoprotein; gonadotropin. SEGMENT 5 of 7 SOURCE Human DNA, library of T.Maniatis, clone CG-beta-e. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 203) AUTHORS Policastro,P., Ovitt,C.E., Hoshina,M., Fukuoka,H., Boothby,M.R. and Boime,I. TITLE The beta subunit of human chorionic gonadotropin is encoded by multiple genes JOURNAL J. Biol. Chem. 258, 11492-11499 (1983) STANDARD full staff_review FEATURES from to/span description pept + 21 + 188 chorionic gonadotropin beta subunit, exon 2 /nomgen="CGB" /map="19q13.3" /hgml_locus_uid="LH0070L" pre-msg < 1 > 203 CGB mRNA and intron IVS < 1 20 CGB intron A IVS 189 > 203 CGB intron B BASE COUNT 32 A 69 C 65 G 37 T ORIGIN About 350 bp after segment 4; chromosome 19q13.3. 1 AAGCAGTGTC CTTGTCCCAG GGGCTGCTGC TGTTGCTGCT GCTGAGCATG GGCGGGACAT 61 GGGCATCCAA GGAGCCGCTT CGGCCACGGT GCCGCCCCAT CAATGCCACC CTGGCTGTGG 121 AGAAGGAGGG CTGCCCCGTG TGCATCACCG TCAACACCAC CATCTGTGCC GGCTACTGCC 181 CCACCATGGT GAGCTGCCCC GGG // LOCUS HUMCGBEL6 475 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human chorionic gonadotropin beta subunit gene, exon 3, clone CG-beta-e. ACCESSION K03189 K00100 KEYWORDS chorionic gonadotropin; glycoprotein; gonadotropin. SEGMENT 6 of 7 SOURCE Human DNA, library of T.Maniatis, clone CG-beta-e. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 475) AUTHORS Policastro,P., Ovitt,C.E., Hoshina,M., Fukuoka,H., Boothby,M.R. and Boime,I. TITLE The beta subunit of human chorionic gonadotropin is encoded by multiple genes JOURNAL J. Biol. Chem. 258, 11492-11499 (1983) STANDARD full staff_review FEATURES from to/span description pept + 23 337 chorionic gonadotropin beta subunit, exon 3 /nomgen="CGB" /map="19q13.3" /hgml_locus_uid="LH0070L" pre-msg < 1 354 CGB mRNA and intron IVS < 1 22 CGB cds intron B BASE COUNT 84 A 177 C 121 G 93 T ORIGIN About 235 bp after segment 5; chromosome 19q13.3. 1 CTCCCACTCA CACGGCTTCC AGACCCGCGT GCTGCAGGGG GTCCTGCCGG CCCTGCCTCA 61 GGTGGTGTGC AACTACCGCG ATGTGCGCTT CGAGTCCATC CGGCTCCCTG GCTGCCCGCG 121 CGGCGTGAAC CCCGTGGTCT CCTACGCCGT GGCTCTCAGC TGTCAATGTG CACTCTGCCG 181 CCGCAGCACC ACTGACTGCG GGGGTCCCAA GGACCACCCC TTGACCTGTG ATGACCCCCG 241 CTTCCAGGAC TCCTCTTCCT CAAAGGCCCC TCCCCCGAGC CTTCCAAGTC CATCCCGACT 301 CCCGGGGCCC TCGGACACCC CGATCCTCCC ACAATAAAGG CTTCTCAATC CGCACTCTGG 361 CGGGTCTTTC TGTGGGCTCA GGGCAACCAC ACACACACAG GATGGGTCCA GCTTCCAAAC 421 CATTTTATAC AGAGTCACAG TATGAGAACT CTGGTAGAAA ACAGTGTGGG GTCGG // LOCUS HUMCGBEL7 83 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human luteinizing hormone beta subunit gene, exon 1, clone pCG-beta-e. ACCESSION K03190 K00101 KEYWORDS chorionic gonadotropin; glycoprotein; gonadotropin. SEGMENT 7 of 7 SOURCE Human DNA, library of T.Maniatis, clone pCG-beta-e. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 83) AUTHORS Policastro,P., Ovitt,C.E., Hoshina,M., Fukuoka,H., Boothby,M.R. and Boime,I. TITLE The beta subunit of human chorionic gonadotropin is encoded by multiple genes JOURNAL J. Biol. Chem. 258, 11492-11499 (1983) STANDARD full staff_review COMMENT [1] states that Boorstein et al (Nature 300, 419-422 (1982)) have identified this gene as a luteinixzing hormone beta subunit gene. FEATURES from to/span description pept 54 / 68 luteinizing hormone beta subunit /nomgen="LHB" /map="19q13.3" /hgml_locus_uid="LA0076A" IVS 69 > 83 LHB cds intron A BASE COUNT 23 A 20 C 24 G 11 T 5 others ORIGIN About 6.4 kb after segment 6; 28 bp upstream of RsaI site. 1 CTTGNGCCNA CAACCCGAGN TATNAAGTAC ACGAGGCAGG GGATGCACCA AGGATGGAGA 61 TGCTCCANGT AAGACTGCAG GCC // LOCUS HUMCGRP21 145 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human CALC-II gene for calcitonin related peptide CGRP-II, exon 2. ACCESSION X04855 KEYWORDS CALC-II gene; calcitonin; calcitonin gene-related peptide; calcitonin gene-related peptide II. SEGMENT 1 of 5 SOURCE Human (Homo sapiens). ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 26 to 120) AUTHORS Steenbergh,P.H. JOURNAL Unpublished (1987) Inst of Molr Biology, Utrecht, NEDERLANDS. STANDARD simple automatic REFERENCE 2 (bases 1 to 145; enum. 1 to 145) AUTHORS Steenbergh,P.H., Hoeppener,J.W.M., Zandberg,J., Visser,A., Lips,C.J.M. and Jansz,H.S. TITLE Structure and expression of the human calcitonin/CGRP genes JOURNAL FEBS Lett. 209, 97-103 (1986) STANDARD simple automatic COMMENT For coresponding cDNA sequence see X02404 *source: clone=Cos2CALC-II; *source: library=pjB8 cosmid from acute lymphatic leukaemia cells; FEATURES from to/span description pept / 26 + 120 CGRP-II (AA at 26) pre-msg < 1 > 145 CGRP-II mRNA and introns IVS < 1 25 intron I IVS 121 > 145 intron II BASE COUNT 25 A 44 C 43 G 33 T ORIGIN 1 TAGTAACGTC ATCCTTCCTT TACAGAGAGG CGGCATGGGT TTCCGGAAGT TCTCCCCCTT 61 CCTGGCTCTC AGTATCTTGG TCCTGTACCA GGCGGGCAGC CTCCAGGCGG CGCCATTCAG 121 GTGAGACAGC CTGGAGCCAG AGGCG // LOCUS HUMCGRP22 188 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human CALC-II gene for calcitonin gene-related peptide CGRP-II, exon3. ACCESSION X04857 KEYWORDS CALC-II gene; calcitonin; calcitonin gene-related peptide; calcitonin gene-related peptide II. SEGMENT 2 of 5 SOURCE Human lymphatic leukaemia cell, cDNA to mRNA, clone Cos2CALC-II. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 26 to 163) AUTHORS Steenbergh,P.H. JOURNAL Unpublished (1987) Inst of Mol Bio, Utrecht, NEDERLANDS. STANDARD simple automatic REFERENCE 2 (bases 1 to 188; enum. 1 to 188) AUTHORS Steenbergh,P.H., Hoeppener,J.W.M., Zandberg,J., Visser,A., Lips,C.J.M. and Jansz,H.S. TITLE Structure and expression of the human calcitonin/CGRP genes JOURNAL FEBS Lett. 209, 97-103 (1986) STANDARD simple automatic COMMENT For corresponding cDNA sequence see X02404 *source: clone=Cos2CALC-II; *source: library=pjB8 cosmid from acute lymphatic leukaemia cells; Data kindly reviewed (05-AUG-1987) by Steenbergh P.H. FEATURES from to/span description pept + 26 + 163 CGRP-II (AA at 28) pre-msg < 1 > 188 CGRP-II mRNA and introns IVS < 1 25 intron II IVS 164 > 188 intron III BASE COUNT 43 A 58 C 59 G 28 T ORIGIN 1 CAAGGAGTTT GCTTCCCTTC CACAGGTCTG CCCTGGAGAG CAGCCCAGAC CCGGCCACAC 61 TCAGTAAAGA GGACGCGCGC CTCCTGCTGG CTGCACTGGT GCAGGACTAT GTGCAGATGA 121 AGGCCAGTGA GCTGAAGCAG GAGCAGGAGA CACAGGGCTC CAGGTGAGGT TCCCCAAGCG 181 CCCAGCAC // LOCUS HUMCGRP24 235 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human gene for calcitonin gene-related peptide CGRP-II, exon 5. ACCESSION X04861 KEYWORDS CALC-II gene; calcitonin; calcitonin gene-related peptide; calcitonin gene-related peptide II. SEGMENT 4 of 5 SOURCE Human lumphatic leukaemia cells, cDNA to mRNA, clone Cos2CALC-II. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 27 to 210) AUTHORS Steenbergh,P.H. JOURNAL Unpublished (1987) Inst of Mol Bio, Utrecht, NEDERLANDS. STANDARD simple automatic REFERENCE 2 (bases 1 to 235; enum. 1 to 235) AUTHORS Steenbergh,P.H., Hoeppener,J.W.M., Zandberg,J., Visser,A., Lips,C.J.M. and Jansz,H.S. TITLE Structure and expression of the human calcitonin/CGRP genes JOURNAL FEBS Lett. 209, 97-103 (1986) STANDARD simple automatic COMMENT For corresponding cDNA sequence see X02404 *source: clone=Cos2CALC-II; *source: library=pjB8 cosmid from acute lymphatic leukaemia cells; Data kindly reviewed (05-AUG-1987) by Steenbergh P.H. EMBL features not translated to GenBank features: key from to description FEATURES from to/span description pept + 26 185 CGRP-II (AA at 3) pre-msg < 1 > 235 CGRP-II mRNA and introns IVS < 1 25 intron IV IVS 211 > 235 intron V BASE COUNT 57 A 64 C 64 G 50 T ORIGIN 1 CTTCTTTCTC TATCTTGCAA ATCAGCTCCG CTGCCCAGAA GAGAGCCTGC AACACTGCCA 61 CCTGTGTGAC TCATCGGCTG GCAGGCTTGC TGAGCAGATC AGGGGGCATG GTGAAGAGCA 121 ACTTCGTGCC CACCAATGTG GGTTCCAAAG CCTTTGGCAG GCGCCGCAGG GACCTTCAAG 181 CCTGAGCAGA TGAATGACTC CAGGAAGAAG GTAACTACCC TAATGCTATG GGATA // LOCUS HUMCGRP25 604 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human gene for calcitonin gene-relared peptide CGRP-II, exon 6. ACCESSION X04863 KEYWORDS CALC-II gene; calcitonin; calcitonin gene-related peptide; calcitonin gene-related peptide II. SEGMENT 5 of 5 SOURCE Human lumphatic leukaemia cells, cDNA to mRNA, clone Cos2CALC-II. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 26 to 580) AUTHORS Steenbergh,P.H. JOURNAL Unpublished (1987) Inst of Mol Bio, Utrecht, NEDERLANDS. STANDARD simple automatic REFERENCE 2 (bases 1 to 604; enum. 1 to 605) AUTHORS Steenbergh,P.H., Hoeppener,J.W.M., Zandberg,J., Visser,A., Lips,C.J.M. and Jansz,H.S. TITLE Structure and expression of the human calcitonin/CGRP genes JOURNAL FEBS Lett. 209, 97-103 (1986) STANDARD simple automatic REFERENCE 3 (bases 1 to 604; enum. 1 to 604) AUTHORS Steenbergh,P.H. JOURNAL Unpublished (1987) Inst of Mol Bio, Utrecht, NEDERLANDS. STANDARD simple automatic COMMENT For corresponding cDNA sequence see X02404 *source: clone=Cos2CALC-II; *source: library=pjB8 cosmid from acute lymphatic leucaemia cells; Data kindly reviewed (05-AUG-1987) by Steenbergh P.H. EMBL features not translated to GenBank features: key from to description IVS 581 >604 intron VI FEATURES from to/span description pre-msg < 1 580 CGRP-II mRNA and introns IVS < 1 25 intron V revision 159 160 tg in [1]; tng in [2] BASE COUNT 176 A 102 C 114 G 212 T ORIGIN 1 CTCTTCTTTT TTCCCCTAAT CTCAGGTTAT CATGAAACTG AACTCACCAT TTCCATTAAT 61 TTCTGTTGGT AAGAACTTGG TGAGAATGCC CCGTGGAAGA TACACATGTT TGCATCCTAA 121 GATACTGAAA AAAGGGCACC TTTGTCACTT GAAAGGAATG AAACTGAATG CAAAATAAGC 181 TAATTCCATA TTTGCTGTGC ATCATTTTTA TATTTAATTC TATGTCCAGT AAAAGTGATG 241 GCATCTCTCA TTGACTTATC TGGTAGCAAA CTGGTTCTTT CGGAGCCATC CTGTTGATCA 301 TGCAGCTCCA CCAAACCTTA GGGGGACGTG AAATCACTGC CTGTTGTGGT CTCCGAGGAC 361 ACATGGTAAT GGTGATGCTG TGCCTTGTTA TCTAAGAACA TGATTGTATA ATTTGTTTAA 421 GAAAATGTCA ATATTGTGCC ATTTGTGAAC TTCATCAAGA TTAAAAGCAT ATTTTGGGTA 481 CATTTGTTTC AAAACCTTGG TGATGCATTA CAACTTGTTT TCTTATGTAA TAATAATGAT 541 GATGATGATG ATAATAATAA ATATTTTTGA GTGCTTACTA TGTATGGGCC AGATATTATT 601 TTGA // LOCUS HUMCGRPA 224 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human calcitonin-related peptide (CGRP) gene, exon X. ACCESSION M28637 KEYWORDS calcitonin-related peptide. SOURCE Human (male) medullary thyroid carcinoma DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 224) AUTHORS Steenbergh,P.H., Hoeppener,J.W.M., Zandberg,J., van de Ven,W.J.M., Jansz,H.S. and Lips,C.J.M. TITLE Calcitonin gene related peptide coding sequence is conserved in the human genome and is expressed in medullary thyroid carcinoma JOURNAL J. Clin. Endocrinol. Metab. 59, 358-360 (1984) STANDARD simple staff_entry FEATURES from to/span description pept / 35 194 calcitonin-related peptide, exon X (AA at 36) /nomgen="CALCB" /map="11p14.2-p12" /hgml_locus_uid="LL0020U" pre-msg < 1 > 218 CALCB mRNA and intron IVS < 1 34 CALCB intron X-1 BASE COUNT 52 A 60 C 61 G 51 T ORIGIN 198 bp upstream of PvuII site; chromosome 11p14.2-p12. 1 AGATCTTCTC TTCTTTCTCC ATCCTGCAAA TCAGAATCAT TGCCCAGAAG AGAGCCTGTG 61 ACACTGCCAC CTGTGTGACT CATCGGCTGG CAGGCTTGCT GAGCAGATCA GGGGGTGTGG 121 TGAAGAACAA CTTTGTGCCC ACCAATGTGG GTTCCAAAGC CTTTGGCAGG CGCCGCAGGG 181 ACCTTCAAGC CTGAGCAGCT GAATGACTCA AGAAGGTGAC TGCC // LOCUS HUMCGRPB4 602 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human beta-CGRP gene encoding calcitonin-like peptide (CGRP), exon 4. ACCESSION X04407 KEYWORDS calcitonin; calcitonin gene-related peptide. SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 602; enum. 1 to 601) AUTHORS Alevizaki,M., Shiraishi,A., Rassool,F.V., Ferrier,G.J.M., MacIntyre,I. and Legon,S. TITLE The calcitonin-like sequence of the beta CGRP gene JOURNAL FEBS Lett. 206, 47-52 (1986) STANDARD simple automatic REFERENCE 2 (bases 1 to 602; enum. 1 to 602) AUTHORS Alevizaki,M. JOURNAL Unpublished (1987) Hammersmith Hospital, Ducane Rd, London, England STANDARD simple automatic COMMENT Data kindly reviewed (28-FEB-1987) by M. Alevizaki EMBL features not translated to GenBank features: key from to description IVS <1 54 intron III MSG 55 567 exon 4 REVISION 571 574 gaac was gac in [1] REVISION 599 600 tc was tac in [1] FEATURES from to/span description IVS < 1 54 intron III pre-msg 55 567 exon 4 unsure 406 406 G may be T revision 405 407 agg in [2]; ang in [1] revision 538 540 tgg in [2]; tcg in [1] revision 540 542 gca in [2]; gga in [1] revision 553 556 caat in [2]; cat in [1] BASE COUNT 158 A 135 C 136 G 173 T ORIGIN Chromosome 11p12-p14.2. 1 CCATGGGGAC AGTCCCTAGT GCATGGTACT GTCTGGCATG TCCTTCCCTT GCAGCTTGAG 61 CAGTCCTAGA TTTAAGTAGC ATATAGTAAT CTGAGTACCT GCTTGCAGGG CACATACTTG 121 CAGTACCTGA AAAACTTTCA TATGTTCCCT GGCATCAACT TCGGGCCTGA AATTCCTGGC 181 AAGAATAGGG ACATAGTCAA CAGCTTGCAG AGGGACCACT ACCCGACTCC AGGGTCCCCC 241 AGATGGCAGC TGAACTTCTC TCAACTCTCC TGATTCCCCT TCTTGCTCCA CTTTATGAAC 301 CTGATGCATG TGGATTCCTC TCTGATTTGT CTTCATGCTG GTATTGGTAT TTTTGCTTAT 361 GACAGAGAAT GTTTTGAAGA CCTCAGGATG GAAGGGAAGA CAGCAGGACT TACTGAACAC 421 GTTAGAGATA AAAGAAAATA AGGGAAGCTT CTTGAGACTG TAGAGGGTGT TATGACAGAG 481 GCATCCAATT TCTGCTTCTA AATGTACTAC GATAAAATAA GCACGTCCTT AATGCCTTGG 541 CATTAGATGA ATCAATCTAT TTTTCTAAAA GGAACTGAGC TGCGGTGCTC ATTGCTCTGG 601 TC // LOCUS HUMCKB1 255 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human creatine kinase B isozyme gene, exon 1. ACCESSION M22354 J03531 KEYWORDS creatine kinase. SEGMENT 1 of 3 SOURCE Human lymphocyte DNA, (library of D.Page), clone 16B2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 255) AUTHORS Daouk,G.H., Kaddurah-Daouk,R., Putney,S., Kingston,R. and Schimmel,P. TITLE Isolation of a functional human gene for brain creatine kinase JOURNAL J. Biol. Chem. 263, 2442-2446 (1988) STANDARD simple staff_entry FEATURES from to/span description pre-msg 179 > 255 CKB mRNA and intron /nomgen="CKBB" /map="14q32.3" /hgml_locus_uid="LR0095R" IVS 247 > 255 CKB intron A BASE COUNT 34 A 85 C 108 G 28 T ORIGIN Chromosome 14q32.3. 1 GCGCGGGGTC CAGCGAGGGG ACAGCTCGGG TGGGCGGCCA GGGTGTTGGG GGCTCGGGCG 61 GCGGACAAAG CGGCGGCACC ACCCCGCGGC GCGGCCAATG GAATGAATGG GCTATAAATA 121 GCCGCCAATG GGCGGCCCGC GTTGTGCCCC TTAAGAGCCG CGGGAGCGCG GAGCGGCCGC 181 TGTTCGCCTG CGTCGCTCCG GGAGCTGCCG ACGGACGGAG CGCCCCCGCC CCCGCCCGGC 241 CGCCCGGTGA GTGGG // LOCUS HUMCKB2 222 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human creatine kinase B isozyme gene, exon 2. ACCESSION M22355 J03531 KEYWORDS creatine kinase. SEGMENT 2 of 3 SOURCE Human lymphocyte DNA, (library of D.Page), clone 16B2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 222) AUTHORS Daouk,G.H., Kaddurah-Daouk,R., Putney,S., Kingston,R. and Schimmel,P. TITLE Isolation of a functional human gene for brain creatine kinase JOURNAL J. Biol. Chem. 263, 2442-2446 (1988) STANDARD simple staff_entry FEATURES from to/span description pept 22 + 213 brain creatine kinase, exon 2 (first expressed exon) /nomgen="CKBB" /map="14q32.3" /hgml_locus_uid="LR0095R" pre-msg < 1 > 222 CKB mRNA and intron IVS < 1 9 CKB intron A IVS 214 > 222 CKB intron B BASE COUNT 46 A 90 C 61 G 25 T ORIGIN About 0.5 kb after segment 1; chromosome 14q32.3. 1 TCCCCGCAGC CCGCCGCCGC CATGCCCTTC TCCAACAGCC ACAACGCACT GAAGCTGCGC 61 TTCCCGGCCG AGGACGAGTT CCCCGACCTG AGCGCCCACA ACAACCACAT GGCCAAGGTG 121 CTGACCCCCG AGCTGTACGC GGAGCTGCGC GCCAAGAGCA CGCCGAGCGG CTTCACGCTG 181 GACGACGTCA TCCAGACAGG CGTGGACAAC CCGGTACGCG AC // LOCUS HUMCKB3 108 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human creatine kinase B isozyme gene, exon 3, partial. ACCESSION M22356 J03531 KEYWORDS creatine kinase. SEGMENT 3 of 3 SOURCE Human lymphocyte DNA, (library of D.Page), clone 16B2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 108) AUTHORS Daouk,G.H., Kaddurah-Daouk,R., Putney,S., Kingston,R. and Schimmel,P. TITLE Isolation of a functional human gene for brain creatine kinase JOURNAL J. Biol. Chem. 263, 2442-2446 (1988) STANDARD simple staff_entry FEATURES from to/span description pept + 10 > 108 brain creatine kinase, exon 3 /nomgen="CKBB" /map="14q32.3" /hgml_locus_uid="LR0095R" pre-msg < 1 > 108 CKB mRNA and intron IVS < 1 9 CKB intron B BASE COUNT 20 A 37 C 33 G 18 T ORIGIN About 0.2 kb after segment 2; chromosome 14q32.3. 1 CCTCCCCAGG GCCACCCGTA CATCATGACC GTGGGCTGCG TGGCGGGCGG CGAGGAGTCC 61 TACGAAGTGT TCAAGGATCT CTTCGACCCC ATCATCGAGG ACCGGCAC // LOCUS HUMCKBB1 483 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human creatine kinase isozyme CK-B gene, exon 1. ACCESSION M21236 J03036 KEYWORDS creatine kinase. SEGMENT 1 of 8 SOURCE Human fibroblast, cDNA to mRNA, and DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 483) AUTHORS Mariman,E.C.M., Broers,C.A.M., Claesen,C.A.A., Tesser,G.I. and Wieringa,B. TITLE Structure and expression of the human creatine kinase B gene JOURNAL Genomics 1, 126-137 (1987) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly provided by E.C.M.Mariman, 02-OCT-1987. FEATURES from to/span description pre-msg 401 > 483 creatine kinase mRNA, exon 1 and intron /nomgen="CKBB" /map="14q32.3" /hgml_locus_uid="LR0095R" IVS 469 > 483 CKB intron A BASE COUNT 68 A 156 C 197 G 62 T ORIGIN 42 bp upstream of SacII site; chromosome 14q32.3. 1 GGAAGCCCCG AAAGCTTTCG CCCGGCCCCT CGCCGCCGCC GCGGGGGCTG GCTGGACTAG 61 GCGGGCAGGC TCGAGGATGC GGATGAACCC AAGCGTCCTC GAGTGCCCGG AGGCTCTCCG 121 CCTCAGTTTC CCGCCCAGAG GCAAGGGCGT GCGAGGGGAT CCAGATATCC AAGGACCTGA 181 GGTTTCGGCC TCGAGGTCTT GGGCGGGGGA CTGGGCAGGC TGCGCGGGGT CCCAGCGAGG 241 GGACAGCTCG GGTGGGCGGC CAGGGTGTTG GGGGCTGTGG GCGGCGGACA AAGCGGCGGC 301 ACCACCGCGG CGCGGGCCAA TGGAATGAAT GGGCTATAAA TAGCCGCCAA TGGGCGGCCC 361 GCGTTGTGCC CCTTAAGAGC CGCGGGAGCG CGGAGCGGCC GCTGTTCGCC TGCGTCGCTC 421 CGGGAGCTGC CGACGGACGG AGCGCCCCCG CCCCCGCCCG GCCGCCCGGT GAGTGGGCCC 481 GGG // LOCUS HUMCKBB2 260 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human creatine kinase isozyme CK-B gene, exon 2. ACCESSION M21237 J03036 KEYWORDS creatine kinase. SEGMENT 2 of 8 SOURCE Human fibroblast, cDNA to mRNA, and DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 260) AUTHORS Mariman,E.C.M., Broers,C.A.M., Claesen,C.A.A., Tesser,G.I. and Wieringa,B. TITLE Structure and expression of the human creatine kinase B gene JOURNAL Genomics 1, 126-137 (1987) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly provided by E.C.M.Mariman, 02-OCT-1987. FEATURES from to/span description pept 53 + 245 creatine kinase, exon 2 (first expressed exon, /nomgen="CKBB" /map="14q32.3" /hgml_locus_uid="LR0095R" pre-msg < 1 > 260 creatine kinase mRNA and introns IVS < 1 40 CKB intron A IVS 246 > 260 CKB intron B BASE COUNT 47 A 110 C 72 G 31 T ORIGIN About 256 bp after segment 1; chromosome 14q32.3. 1 CCGGCGTGCC GGTCCCCTCT GACCCCGCGT CTCCCCGCAG CCCGCCGCCG CCATGCCCTT 61 CTCCAACAGC CACAACGCAC TGAAGCTGCG CTTCCCGGCC GAGGACGAGT TCCCCGACCT 121 GAGCGCCCAC AACAACCACA TGGCCAAGGT GCTGACCCCC GAGCTGTACG CGGAGCTGCG 181 CGCCAAGAGC ACGCCGAGCG GCTTCACGCT GGACGACGTC ATCCAGACAG GCGTGGACAA 241 CCCGGGTACG CGACCCCTCG // LOCUS HUMCKBB3 210 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human creatine kinase isozyme CK-B gene, exon 3. ACCESSION M21238 J03036 KEYWORDS creatine kinase. SEGMENT 3 of 8 SOURCE Human fibroblast, cDNA to mRNA, and DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 210) AUTHORS Mariman,E.C.M., Broers,C.A.M., Claesen,C.A.A., Tesser,G.I. and Wieringa,B. TITLE Structure and expression of the human creatine kinase B gene JOURNAL Genomics 1, 126-137 (1987) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly provided by E.C.M.Mariman, 02-OCT-1987. FEATURES from to/span description pept + 41 + 195 creatine kinase, exon 3 /nomgen="CKBB" /map="14q32.3" /hgml_locus_uid="LR0095R" pre-msg < 1 > 210 creatine kinase mRNA and introns IVS < 1 40 CKB intron B IVS 196 > 210 CKB intron C BASE COUNT 41 A 78 C 62 G 29 T ORIGIN About 119 bp after segment 2; chromosome 14q32.3. 1 CAGTGACGTC ACTGTCCCCG TCCCGCGCCC CCTCCCCCAG GCCACCCGTA CATCATGACC 61 GTGGGCTGCG TGGCGGGCGA CGAGGAGTCC TACGAAGTGT TCAAGGATCT CTTCGACCCC 121 ATCATCGAGG ACCGGCACGG CGGCTACAAG CCCAGCGATG AGCACAAGAC CGACCTCAAC 181 CCCGACAACC TGCAGGTGCG GGGCTGCGGG // LOCUS HUMCKBB4 188 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human creatine kinase isozyme CK-B gene, exon 4. ACCESSION M21239 J03036 KEYWORDS creatine kinase. SEGMENT 4 of 8 SOURCE Human fibroblast, cDNA to mRNA, and DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 188) AUTHORS Mariman,E.C.M., Broers,C.A.M., Claesen,C.A.A., Tesser,G.I. and Wieringa,B. TITLE Structure and expression of the human creatine kinase B gene JOURNAL Genomics 1, 126-137 (1987) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly provided by E.C.M.Mariman, 02-OCT-1987. FEATURES from to/span description pept + 41 + 173 creatine kinase, exon 4 /nomgen="CKBB" /map="14q32.3" /hgml_locus_uid="LR0095R" pre-msg < 1 > 188 creatine kinase mRNA and introns IVS < 1 40 CKB intron C IVS 174 > 188 CKB intron D BASE COUNT 23 A 70 C 70 G 25 T ORIGIN About 72 bp after segment 3; chromosome 14q32.3. 1 GCCGGGGTCT TCGGGCGCTC ACTCCCGTCT CGCCTCCCAG GGCGGCGACG ACCTGGACCC 61 CAACTACGTG CTGAGCTCGC GGGTGCGCAC GGGCCGCAGC ATCCGTGGCT TCTGCCTCCC 121 CCCGCACTGC AGCCGCGGGG AGCGCCGAGC CATCGAGAAG CTCGCGGTGG AAGGTAGGGG 181 CCGGGCGG // LOCUS HUMCKBB5 227 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human creatine kinase isozyme CK-B gene, exon 5. ACCESSION M21240 J03036 KEYWORDS creatine kinase. SEGMENT 5 of 8 SOURCE Human fibroblast, cDNA to mRNA, and DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 227) AUTHORS Mariman,E.C.M., Broers,C.A.M., Claesen,C.A.A., Tesser,G.I. and Wieringa,B. TITLE Structure and expression of the human creatine kinase B gene JOURNAL Genomics 1, 126-137 (1987) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly provided by E.C.M.Mariman, 02-OCT-1987. FEATURES from to/span description pept + 41 + 212 creatine kinase, exon 5 /nomgen="CKBB" /map="14q32.3" /hgml_locus_uid="LR0095R" pre-msg < 1 > 227 creatine kinase mRNA and introns IVS < 1 40 CKB intron D IVS 213 > 227 CKB intron E BASE COUNT 32 A 87 C 74 G 34 T ORIGIN About 368 bp after segment 4; chromosome 14q32.3. 1 CGGGCGCGGG AGCCCAGCGT CCTGAGCGCA CCCCTCGCAG CCCTGTCCAG CCTGGACGGC 61 GACCTGGCGG GCCGATACTA CGCGCTCAAG AGCATGACGG AGGCGGAGCA GCAGCAGCTC 121 ATCGACGACC ACTTCCTCTT CGACAAGCCC GTGTCGCCCC TGCTGTCGGC CTCGGGCATG 181 GCCCGCGACT GGCCCGACGC CCGCGGTATC TGGTGCGTGT CCCTCTG // LOCUS HUMCKBB6 179 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human creatine kinase isozyme CK-B gene, exon 6. ACCESSION M21241 J03036 KEYWORDS creatine kinase. SEGMENT 6 of 8 SOURCE Human fibroblast, cDNA to mRNA, and DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 179) AUTHORS Mariman,E.C.M., Broers,C.A.M., Claesen,C.A.A., Tesser,G.I. and Wieringa,B. TITLE Structure and expression of the human creatine kinase B gene JOURNAL Genomics 1, 126-137 (1987) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly provided by E.C.M.Mariman, 02-OCT-1987. FEATURES from to/span description pept + 41 + 164 creatine kinase, exon 6 /nomgen="CKBB" /map="14q32.3" /hgml_locus_uid="LR0095R" pre-msg < 1 > 179 creatine kinase mRNA and introns IVS < 1 40 CKB intron E IVS 165 > 179 CKB intron F BASE COUNT 37 A 53 C 56 G 33 T ORIGIN About 669 bp after segment 5; chromosome 14q32. 1 GCTTTTTTCT GGGTATGCCC TGAGACCAGC CCTCCCGCAG GCACAATGAC AATAAGACCT 61 TCCTGGTGTG GGTCAACGAG GAGGACCACC TGCGGGTCAT CTCCATGCAG AAGGGGGGCA 121 ACATGAAGGA GGTGTTCACC CGCTTCTGCA CCGGCCTCAC CCAGGTGCCA GGGACGGGG // LOCUS HUMCKBB7 245 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human creatine kinase isozyme CK-B gene, exon 7. ACCESSION M21242 J03036 KEYWORDS creatine kinase. SEGMENT 7 of 8 SOURCE Human fibroblast, cDNA to mRNA, and DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 245) AUTHORS Mariman,E.C.M., Broers,C.A.M., Claesen,C.A.A., Tesser,G.I. and Wieringa,B. TITLE Structure and expression of the human creatine kinase B gene JOURNAL Genomics 1, 126-137 (1987) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly provided by E.C.M.Mariman, 02-OCT-1987. FEATURES from to/span description pept + 41 + 230 creatine kinase, exon 7 /nomgen="CKBB" /map="14q32.3" /hgml_locus_uid="LR0095R" pre-msg < 1 > 245 creatine kinase mRNA and introns IVS < 1 40 CKB intron F IVS 231 > 245 CKB intron G BASE COUNT 54 A 72 C 67 G 52 T ORIGIN About 157 bp after segment 6; chromosome 14q32.3. 1 AGGCAGGCCT TCTCCCTCAT ACCCTCTTCT CCGTCTGCAG ATTGAAACTC TCTTCAAGTC 61 TAAGGACTAT GAGTTCATGT GGAACCCTCA CCTGGGCTAC ATCCTCACCT GCCCATCCAA 121 CCTGGGCACC GGGCTGCGGG CAGGTGTGCA TATCAAGCTG CCCAACCTGG GCAAGCATGA 181 GAAGTTCTCG GAGGTGCTTA AGCGGCTGCG ACTTCAGAAG CGAGGAACAG GTGAGCAGGG 241 CAGGT // LOCUS HUMCKBB8 416 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human creatine kinase isozyme CK-B gene, exon 8. ACCESSION M21243 J03036 KEYWORDS creatine kinase. SEGMENT 8 of 8 SOURCE Human fibroblast, cDNA to mRNA, and DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 416) AUTHORS Mariman,E.C.M., Broers,C.A.M., Claesen,C.A.A., Tesser,G.I. and Wieringa,B. TITLE Structure and expression of the human creatine kinase B gene JOURNAL Genomics 1, 126-137 (1987) STANDARD full staff_entry COMMENT Draft entry and printed copy of sequence for [1] kindly provided by E.C.M.Mariman, 02-OCT-1987. FEATURES from to/span description pept + 41 219 creatine kinase, exon 8 /nomgen="CKBB" /map="14q32.3" /hgml_locus_uid="LR0095R" pre-msg < 1 > 416 creatine kinase mRNA and intron IVS < 1 40 CKB intron G BASE COUNT 72 A 126 C 122 G 96 T ORIGIN About 79 bp after segment 7; chromosome 14q32.3. 1 GCAGCCCTCT TTCCTCCGCC CTGACTTGCT GTCTCCCCAG GCGGTGTGGA CACGGCTGCG 61 GTGGGCGGGG TCTTCGACGT CTCCAACGCT GACCGCCTGG GCTTCTCAGA GGTGGAGCTG 121 GTGCAGATGG TGGTGGACGG AGTGAAGCTG CTCATCGAGA TGGAACAGCG GCTGGAGCAG 181 GGCCAGGCCA TCGACGACCT CATGCCTGCC CAGAAATGAA GCCCGGCCCA CACCCGACAC 241 CAGCCCTGCT GCTTCCTAAC TTATTGCCTG GGCAGTGCCC ACCATGCACC CCTGATGTTC 301 GCCGTCTGGC GAGCCCTTAG CCTTGCTGTA GAGACTTCCG TCACCCTTGG TAGAGTTTAT 361 TTTTTTGATG GCTAAGATAC TGCTGATGCT GAAATAAACT AGGGTTTTGG CCTGCC // LOCUS HUMCKMM1 2903 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human muscle creatine kinase gene (CKMM), 5' flank. ACCESSION M21487 J04435 KEYWORDS muscle creatine kinase. SEGMENT 1 of 8 SOURCE Human adult leukocyte DNA, clones lambda MCK [12,25]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2903) AUTHORS Trask,R.V., Strauss,A.W. and Billadello,J.J. TITLE Developmental regulation and tissue-specific expression of the human muscle creatine kinase gene JOURNAL J. Biol. Chem. 263, 17142-17149 (1988) STANDARD full staff_review COMMENT Draft entry and clean copy of sequence for [1] kindly provided by R.V.Trask, 24-OCT-1988. FEATURES from to/span description pre-msg 2621 > 2903 muscle creatine kinase mRNA and intron (EC 2.7.3.2) /nomgen="CKMM" /map="19q13" /hgml_locus_uid="LM0095M" IVS 2677 > 2903 CKMM, intron A BASE COUNT 507 A 932 C 769 G 695 T ORIGIN 1 bp upstream of BamHI site; Chromosome 19q13. 1 GGATCCTTCC TCCTTGGCCT CCCAAAGTGC TGGGATTACA GGTGTGAGCC ACTGCACCTG 61 GCCTATTACC CTTCTCAGGC TCTGGAGTCC ATCCTTCTGC TCTGTCTCCC TCAGTTCAAT 121 TGTTTTTTGT TTTTTGTTTT TTTTTTAGAC ACAGTCTCGC TCTGTCACCA AGGCTGGAGT 181 GCAGCAGTGC GATCACAGCT CACCGCAGCC TCACCTCCCA GGCTCAAGTG ATCCTCCCAT 241 CTCGGCCTCT GAGTAGCTGA GACTATAGGT GTGTCCACAT GTCCGGCTAA TTTTTGTATT 301 TTTAGTAGAG ACAGGGTTTC ACCGCGTTGG CCAGGGTGGT CTTGAACTCC TGAGCTCAAG 361 CAATCCTCCT GCCTCAGCCT CCTTGTTTTG ATTTTTAGAT CCCACAAATA ACTTGTGATG 421 TTTGTCTTTC TATACCTGGT TCATTTAACA TTTTCTTTTT CTTTTCTTTT CTTTTTTTTT 481 TTTTTTGTGA GACTGAGTCT TGCTCTGTCA CTCAGGCTGG AGGGCAATGG TGCATCTCAG 541 CTCACTGCAA CCTCCACCTC CTAGGTTCAA GCAATTCTTA TGCCTCAGCC TCCTGGCTAG 601 CTGGGATTAC AGGCGTGTGT CACCATGCCA GGCTAATTTT TGTACTTTTA GTAGAGATGG 661 GGTTTCACCA TGTTGGCCAG GCTGGTCTTG AACTCCTGGC CTCAAGTGAT CCACCCGCCT 721 CCGCCTCTGC CTCCCAAAGT GCTGGGATTA CGGGCCTGAG CCACTGTGCC CGGCCCATCT 781 AACATTTTCA CTGTCAATCA CAATGGGATT AAAACTCCTC CCACAGCCCC TAGGGACCAT 841 GGGTCTGCTC CTGTCTCCCC TCCAACCTCA TCTTCTTCCT CCCACTCTCT CCTTGGCCCC 901 ATCTGCTCCA GTCCCCTGGC CTCCTTCCTG TCTGTCCTCA GATGTGCCCA GCCATTCTCA 961 CCTCAGCGCC TTTGCACCTG CTGTTCCCCC CAGAGCCGCA CATGGCTGGC TCCCTGTTCT 1021 CCTTCAGGTC TCTGCTCAGA TGTCATCTTC CCAAAGAGGC CTGCCTCGAC CTCCCCTGCT 1081 GCTGTGCCGT CCCCTCATCT GTGACCCTCT TGCACTATCA CCTCCAGGAC GGCGGGGGTT 1141 TTGTGTTTTG TTGTAGCCTC AGGAAGTGCC TGATAGATCC CTGTTTCGAG ACCAGTTCCA 1201 TTTGGTTTTC TGGGCCTCAG TTTCCGTAAC CGTGAAGGAG ACCCTCGGCA ATCTGAGCTT 1261 GCTGGGAAAG GGCTGGGCCC CATGTAAATA TTTCTAAAGC ACCCCTCTCC CCTCCCCCCT 1321 CAGATCAGGA GTCTGAGGGA GAGGCACAGA GGCTCCCTTT CTCTAAGCCA GTCCTCACCT 1381 GCCTAAGAAG ATGTGAAGGA GACCCAGGAG ACCCTGGGAT AGGGAGGAAC TCAGAGGGAA 1441 GGGACATTCT TTTCTTCGTC GCAATCCTGG GAGCTCCCTG GAGGAGGAGA CCCGATCAGC 1501 CTGCAATCCT GGCGCGTCCC AGGAGGAGAA AGCGGCTTCC TCTATACTGT ACTCTCCTCC 1561 ACAGAACCCC CCTCTCAGCC CTGGAAGTCC TTGCTCACAG CCGAGGCGCC GAGAGCGCTT 1621 GCTCTGCCCA GATCTCGGCG AGTCTGCGCC CGCGCTCTGA ACGGCGTCGC TGCCCAGCCC 1681 CCTTCCCCGG GAGGTGGGAG CGGCCACCCA GGGCCCCGTG GCTGCCCTTG TAAGGAGGCG 1741 AGGCCGAGGA CACCCGAGAC GCCCGGTTAT AATTAACCAG GACACGTGGC GAACCCCCCT 1801 CCAACACCTG CCCCCGAACC CCCCCATACC CAGCGCCTCG GGTCTCGGCC TTTGCGGCAG 1861 AGGAGACAGC AAAGCGCCCT CTAAAAATAA CTCCTTTCCC GGCGACCGAG ACCCTCCCTG 1921 TCCCCGCACA GCGAAATCTC CCAGTGGCAC CGAGGGGGCG AGGGTTAAGT GGGGGGGAGG 1981 GTGACCACCG CCTCCCACCC TTGCCCTGAG TTTGAATCTC TCCAACTCAG CCAGCCTCAG 2041 TTTCCCCTCC ACTCAGTCCC TAGGAGGAAG GGGCGCCCAA GCGGGTTTCT GGGGTTAGAC 2101 TGCCCTCCAT TGCAATTGGT CCTTCTCCCG GCCTCTGCTT CCTCCAGCTC ACAGGGTATC 2161 TGCTCCTCCT GGAGCCACAC CTTGGTTCCC CGAGGTGCCG CTGGGACTCG GGTAGGGGTG 2221 AGGGCCCAGG GGCGACAGGG GGAGCCGAGG GCCACAGGAA GGGCTGGTGG CTGAAGGAGA 2281 CTCAGGGGCC AGGGGACGGT GGCTTCTACG TGCTTGGGAC GTTCCCAGCC ACCGTCCCAT 2341 GTTCCCGGCG GGGGCCAGCT GTCCCCACCG CCAGCCCAAC TCAGCACTTG GTTAGGGTAT 2401 CAGCTTGGTG GGGGCGTGAG CCCAGCCCTG GGGCGCTCAG CCCATACAAG GCCATGGGGC 2461 TGGGCGCAAA GCATGCCTGG GTTCAGGGTG GGTATGGTGC CGGAGCAGGG AGGTGAGAGG 2521 CTCAGCTGCC CTCCAGAACT CCTCCCTGGG GACAACCCCT CCCAGCCAAT AGCACAGCCT 2581 AGGTCCCCCT ATATAAGGCC ACGGCTGCTG GCCCTTCCTT TGGGTCAGTG TCACCTCCAG 2641 GATACAGACA GCCCCCCTTC AGCCCAGCCC AGCCAGGTAC TGCACGGGGC GGGAATCTGG 2701 GTGGGGGCCA GAGTAGGGGA TTTCTGTGGG TGCTAGAGGC TTGGCTTGGG AAAGGGTCTG 2761 TGTGTCACCC CTTGCTCCAC CAACATCCTC CTATACAAAG GCAGGTCGGT GCGTGGGAAG 2821 GTTGACCCTT GTGTGTCTGG GAGGCCCCTC CATCTGTGAG GCTGCCTGAA CCCCCACTGG 2881 GACCTGTGAT TTCTGCGGCA CAG // LOCUS HUMCKMM2 481 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human muscle creatine kinase gene (CKMM), exon 2. ACCESSION M21488 J04435 KEYWORDS muscle creatine kinase. SEGMENT 2 of 8 SOURCE Human adult leukocyte DNA, clones lambda MCK [12,25]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 481) AUTHORS Trask,R.V., Strauss,A.W. and Billadello,J.J. TITLE Developmental regulation and tissue-specific expression of the human muscle creatine kinase gene JOURNAL J. Biol. Chem. 263, 17142-17149 (1988) STANDARD full staff_review COMMENT Draft entry and clean copy of sequence for [1] kindly provided by R.V.Trask, 24-OCT-1988. FEATURES from to/span description pept 89 + 281 muscle creatine kinase, exon 2 (first coding exon) (EC 2.7.3.2) /nomgen="CKMM" /map="19q13" /hgml_locus_uid="LM0095M" pre-msg < 1 > 481 CKMM mRNA + introns IVS < 1 70 CKMM, intron A IVS 282 > 481 CKMM, intron B BASE COUNT 145 A 134 C 114 G 88 T ORIGIN About 3.2 kb after segment 1; chromosome 19q13. 1 AAAAAAGAAG AAGAAGAAGA AAAAAGATGC CAGCTAATCT TCAATCTTCC CCTCCTGCCC 61 TGCCCATCAG GTCTCCTACA CCGCCACCAT GCCATTCGGT AACACCCACA ACAAGTTCAA 121 GCTGAATTAC AAGCCTGAGG AGGAGTACCC CGACCTCAGC AAACATAACA ACCACATGGC 181 CAAGGTACTG ACCCTTGAAC TCTACAAGAA GCTGCGGGAC AAGGAGACTC CATCTGGCTT 241 CACTGTAGAC GATGTCATCC AGACAGGAGT GGACAACCCA GGTGAGCCTC CCCAGTGGAG 301 CACTGAAGGG GCTACATGGG GGCTCTGGAG CTGCCGCCAT GCCCCAATCC CACCACCTTG 361 AAGGGGGGGG GCCTCAGTTT TCTCATCTGT AAAATGGGGC TGTTGTGGGA ATCAATGCAT 421 TAATATACAT AAAGGCTTGG GACAGTGCGA GGTACTCAGC AAACCACGCA ATAAACATGC 481 A // LOCUS HUMCKMM3 242 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human muscle creatine kinase gene (CKMM), exon 3. ACCESSION M21489 J04435 KEYWORDS muscle creatine kinase. SEGMENT 3 of 8 SOURCE Human adult leukocyte DNA, clones lambda MCK [12,25]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 242) AUTHORS Trask,R.V., Strauss,A.W. and Billadello,J.J. TITLE Developmental regulation and tissue-specific expression of the human muscle creatine kinase gene JOURNAL J. Biol. Chem. 263, 17142-17149 (1988) STANDARD full staff_review COMMENT Draft entry and clean cop of sequence for [1] kindly provided by R.V.Trask, 24-OCT-1988. FEATURES from to/span description pept + 33 + 187 muscle creatine kinase, exon 3 (EC 2.7.3.2) /nomgen="CKMM" /map="19q13" /hgml_locus_uid="LM0095M" pre-msg < 1 > 242 CKMM mRNA + introns IVS < 1 32 CKMM, intron B IVS 188 > 242 CKMM, intron C BASE COUNT 60 A 72 C 69 G 41 T ORIGIN About 1.2 kb after segment 2; chromosome 19q13. 1 AAGCAGCTCT GACCTCACCC CCACCCCTCC AGGTCACCCC TTCATCATGA CCGTGGGCTG 61 CGTGGCTGGT GATGAGGAGT CCTACGAAGT TTTCAAGGAA CTCTTTGACC CCATCATCTC 121 GGATCGCCAC GGGGGCTACA AACCCACTGA CAAGCACAAG ACTGACCTCA ACCATGAAAA 181 CCTCAAGGTC GGTGTCTGCG CAGGAGGGGA GGGCCGAGGG GTGAGGAGGA GAGAGATCAC 241 AG // LOCUS HUMCKMM4 269 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human muscle creatine kinase gene (CKMM), exon 4. ACCESSION M21490 J04435 KEYWORDS muscle creatine kinase. SEGMENT 4 of 8 SOURCE Human adult leukocyte DNA, clones lambda MCK [12,25]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 269) AUTHORS Trask,R.V., Strauss,A.W. and Billadello,J.J. TITLE Developmental regulation and tissue-specific expression of the human muscle creatine kinase gene JOURNAL J. Biol. Chem. 263, 17142-17149 (1988) STANDARD full staff_review COMMENT Draft entry and clean copy of sequence for [1] kindly provided by R.V.Trask, 24-OCT-1988. FEATURES from to/span description pept + 12 + 144 muscle creatine kinase, exon 4 (EC 2.7.3.2) /nomgen="CKMM" /map="19q13" /hgml_locus_uid="LM0095M" pre-msg < 1 > 269 CKMM mRNA + introns IVS < 1 11 CKMM, intron C IVS 145 > 269 CKMM, intron D BASE COUNT 41 A 92 C 84 G 52 T ORIGIN About 2.1 kb after segment 3; chromosome 19q13. 1 CCGCTCTCCA GGGTGGAGAC GACCTGGACC CTAACTACGT GCTCAGCAGC CGCGTCCGCA 61 CTGGCCGCAG CATCAAGGGC TACACGTTGC CCCCACACTG CTCCCGTGGC GAGCGCCGGG 121 CGGTGGAGAA GCTCTCTGTG GAAGGTGAGC CCCCCTACCC CGTCCACTGC TGGGGTCACC 181 ACCCTGGCTC TGTTCCTCCA TGGGCAGGTC CTCGGGTCTC TTTGGGCCCC GCTTTTCAAA 241 GTAGGGGAAG AGCTTGGTAA TCGTGGGGC // LOCUS HUMCKMM5 247 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human muscle creatine kinase gene (CKMM), exon 5. ACCESSION M21491 J04435 KEYWORDS muscle creatine kinase. SEGMENT 5 of 8 SOURCE Human adult leukocyte DNA, clones lambda MCK [12,25]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 247) AUTHORS Trask,R.V., Strauss,A.W. and Billadello,J.J. TITLE Developmental regulation and tissue-specific expression of the human muscle creatine kinase gene JOURNAL J. Biol. Chem. 263, 17142-17149 (1988) STANDARD full staff_review COMMENT Draft entry and clean copy of sequence for [1] kindly provided by R.V.Trask, 24-OCT-1988. FEATURES from to/span description pept + 47 + 218 muscle creatine kinase, exon 5 (EC 2.7.3.2) /nomgen="CKMM" /map="19q13" /hgml_locus_uid="LM0095M" pre-msg < 1 > 247 CKMM mRNA + introns IVS < 1 46 CKMM, intron D IVS 219 > 247 CKMM, intron E BASE COUNT 44 A 87 C 71 G 45 T ORIGIN About 3.5 kb after segment 4; chromosome 19q13. 1 TTGAGGTGGG ACAGGCTGCT GACCACTCCC TCGTGTGTCC CCATAGCTCT CAACAGCCTG 61 ACGGGCGAGT TCAAAGGGAA GTACTACCCT CTGAAGAGCA TGACGGAGAA GGAGCAGCAG 121 CAGCTCATCG ATGACCACTT CCTGTTCGAC AAGCCCGTGT CCCCGCTGCT GCTGGCCTCA 181 GGCATGGCCC GCGACTGGCC CGACGCCCGT GGCATCTGGT GAGGCCCCTC CCCGCCCCCT 241 GGCTTCC // LOCUS HUMCKMM6 290 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human muscle creatine kinase gene (CKMM), exon 6. ACCESSION M21492 J04435 KEYWORDS muscle creatine kinase. SEGMENT 6 of 8 SOURCE Human adult leukocyte DNA, clones lambda MCK [12,25]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 290) AUTHORS Trask,R.V., Strauss,A.W. and Billadello,J.J. TITLE Developmental regulation and tissue-specific expression of the human muscle creatine kinase gene JOURNAL J. Biol. Chem. 263, 17142-17149 (1988) STANDARD full staff_review COMMENT Draft entry and clean copy of sequence for [1] kindly provided by R.V.Trask, 24-OCT-1988. FEATURES from to/span description pept + 151 + 274 muscle creatine kinase, exon 6 (EC 2.7.3.2) /nomgen="CKMM" /map="19q13" /hgml_locus_uid="LM0095M" pre-msg < 1 > 290 CKMM mRNA + introns IVS < 1 150 CKMM, intron E IVS 275 > 290 CKMM, intron F BASE COUNT 50 A 81 C 95 G 64 T ORIGIN About 3.4 kb after segment 5; chromosome 19q13. 1 GCTTCTGCGT TTTGAGGAGG CAGATGCCAC CCACCTCCCA CCCCTCAGGC CCTCTTCCCC 61 CATCGGTGGG CAGATGTTTT GGGGGAGGGC TGGGAATGGG GTTCCCACGG GGCTGACACC 121 TTGGCCTCTT GCTGCGGCAC CTTACTCTAG GCACAATGAC AACAAGAGCT TCCTGGTGTG 181 GGTGAACGAG GAGGATCACC TCCGGGTCAT CTCCATGGAG AAGGGGGGCA ACATGAAGGA 241 GGTTTTCCGC CGCTTCTGCG TAGGGCTGCA GAAGGTGGGT GTCTGCCCTT // LOCUS HUMCKMM7 221 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human muscle creatine kinase gene (CKMM), exon 7. ACCESSION M21493 J04435 KEYWORDS muscle creatine kinase. SEGMENT 7 of 8 SOURCE Human adult leukocyte DNA, clones lambda MCK [12,25]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 221) AUTHORS Trask,R.V., Strauss,A.W. and Billadello,J.J. TITLE Developmental regulation and tissue-specific expression of the human muscle creatine kinase gene JOURNAL J. Biol. Chem. 263, 17142-17149 (1988) STANDARD full staff_review COMMENT Draft entry and clean copy of sequence for [1] kindly provided by R.V.Trask, 24-OCT-1988. FEATURES from to/span description pept + 13 + 202 muscle creatine kinase, exon 7 (EC 2.7.3.2) /nomgen="CKMM" /map="19q13" /hgml_locus_uid="LM0095M" pre-msg < 1 > 221 CKMM mRNA + introns IVS < 1 12 CKMM, intron F IVS 203 > 221 CKMM, intron G BASE COUNT 48 A 67 C 62 G 44 T ORIGIN About 0.8 kb after segment 6; chromosome 19q13. 1 TCTGAACCTC AGATTGAGGA GATCTTTAAG AAAGCTGGCC ACCCCTTCAT GTGGAACCAG 61 CACCTGGGCT ACGTGCTCAC CTGCCCATCC AACCTGGGCA CTGGGCTGCG TGGAGGCGTG 121 CATGTGAAGC TGGCGCACCT GAGCAAGCAC CCCAAGTTCG AGGAGATCCT CACCCGCCTG 181 CGTCTGCAGA AGAGGGGTAC AGGTACGGCT CCACATCTTT T // LOCUS HUMCKMM8 676 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human muscle creatine kinase gene (CKMM), exon 8. ACCESSION M21494 J04435 KEYWORDS muscle creatine kinase. SEGMENT 8 of 8 SOURCE Human adult leukocyte DNA, clones lambda MCK [12,25]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 676) AUTHORS Trask,R.V., Strauss,A.W. and Billadello,J.J. TITLE Developmental regulation and tissue-specific expression of the human muscle creatine kinase gene JOURNAL J. Biol. Chem. 263, 17142-17149 (1988) STANDARD full staff_review COMMENT Draft entry and clean copy of sequence for [1] kindly provided by R.V.Trask, 24-OCT-1988. FEATURES from to/span description pept + 148 326 muscle creatine kinase, exon 8 (EC 2.7.3.2) /nomgen="CKMM" /map="19q13" /hgml_locus_uid="LM0095M" pre-msg < 1 663 CKMM mRNA + introns IVS < 1 147 CKMM, intron G BASE COUNT 137 A 216 C 176 G 147 T ORIGIN About 0.8 kb after segment 7; chromosome 19q13. 1 ACGAATGAAC AGGCATTACC TGTGTGCAGG TAACCTGTTA CACAGTAGGC ACTCACTACA 61 TGCGGCTGTT GCATTTGGCA CGGGGGCACG TCCCAGCCCC TCTGCACCTT CGCTGCCTCT 121 TCACGCCCCT GTCCCTTGCT CCCACAGGTG GCGTGGACAC AGCTGCCGTG GGCTCAGTAT 181 TTGACGTGTC CAACGCTGAT CGGCTGGGCT CGTCCGAAGT AGAACAGGTG CAGCTGGTGG 241 TGGATGGTGT GAAGCTCATG GTGGAAATGG AGAAGAAGTT GGAGAAAGGC CAGTCCATTG 301 ACGACATGAT CCCCGCCCAG AAGTAGGCGC CTGCCCACCT GCCACCGACT GCTGGAACCC 361 AGCCAGTGGG AGGGCCTGGC CCACCAGAGT CCTGCTCCCT CACTCCTCGC CCCGCCCCCT 421 GTCCCAGAGT CCCACCTGGG GGCTCTCCCC ACCCTTCTCA GAGTTCCAGT TTCAACCAGA 481 GTTCCAACCA ATGGGCTCCA TCCTCTGGAT TCTGGCCAAT GAAATATCTC CCTGGCAGGG 541 TCCTCTTCTT TTCCCAGAGC TCCACCCCAA CCAGGAGCTC TAGTTAATGG AGAGCTCCCA 601 GCACACTCGG AGCTTGTGCT TTGTCTCCAC GCAAAGCGAT AAATAAAAGC ATTGGTGGCC 661 TTAGCTTTGT GCGATA // LOCUS HUMCKMT 6896 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human mitochondrial creatine kinase gene, complete cds. ACCESSION J04469 KEYWORDS creatine kinase. SOURCE Human (adult) placenta DNA, clone lambda-hCK39. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 6896) AUTHORS Haas,R.C., Korenfeld,C., Zhang,Z., Perryman,B., Roman,D. and Strauss,A.W. TITLE Isolation and characterization of the gene and cDNA encoding human mitochondrial creatine kinase JOURNAL J. Biol. Chem. 264, 2890-2897 (1989) STANDARD full staff_entry COMMENT Computer-readable copy of sequence for [1] kindly provided by A.W.Strauss, 23-NOV-1988. FEATURES from to/span description pept 404 552 creatine kinase precursor, exon 1 (EC 2.7.3.2) /nomgen="CKMT" /map="15" 1157 1355 creatine kinase precursor, exon 2 1559 1654 creatine kinase precursor, exon 3 2041 2262 creatine kinase precursor, exon 4 2392 2477 creatine kinase precursor, exon 5 2593 2716 creatine kinase precursor, exon 6 4378 4512 creatine kinase precursor, exon 7 5014 5139 creatine kinase precursor, exon 8 5343 5459 creatine kinase precursor, exon 9 sigp 404 517 creatine kinase signal peptide matp 518 552 creatine kinase 1157 1355 creatine kinase 1559 1654 creatine kinase 2041 2262 creatine kinase 2392 2477 creatine kinase 2593 2716 creatine kinase 4378 4512 creatine kinase 5014 5139 creatine kinase 5343 5456 creatine kinase pre-msg 241 > 5573 CK mRNA and introns IVS 553 1156 CK intron A IVS 1356 1558 CK intron B IVS 1655 2040 CK intron C IVS 2263 2391 CK intron D IVS 2478 2592 CK intron E IVS 2717 4377 CK intron F IVS 4513 5013 CK intron G IVS 5140 5342 CK intron H BASE COUNT 1746 A 1718 C 1720 G 1712 T ORIGIN 227 bp upstream of PstI site. 1 CATGCCACAT CCCCGGGGCG GGAGGGGGCT ACATCCCCGG CTTTAGACGC GCGAGTCTCA 61 GGTCCCGCTA ATTACCTGGC GGGTGCTGCC CACCCCTGCC CTCGCGCACC TAGCGCGTGG 121 CAGCGGGAAG GCGGGGCCTG GGGGAGCCCC ACCCCTGGAG ACTGCGGCTG GGGCCTCCCT 181 CTCCTCCGCC CGCCCGCCTG CCACTAGCTC ATTGCGCCTC TCCTGCAGTC TGATTGGGCA 241 CCGGCTCCCA TTCCGGCTCC AGCCTCCAAT CCGACCCCCA TTTCGGCTGC AGCCTCGGAC 301 CTAGCTCCGG CCCTCGGTCT ATCCGGTTGC ATCCTCCCTC CCTGTTCCGG ATCTTATCTT 361 GCGCCAGCGC CTACTCCAGG ATCCCGTAGC CAGACCTCAA GCCATGGCTG GTCCCTTCTC 421 CCGTCTGCTG TCCGCCCGCC CGGGACTCAG GCTCCTGGCT TTGGCCGGAG CGGGGTCTCT 481 AGCCGCTGGG TTTCTGCTCC GACCGGAACC TGTACGAGCT GCCAGTGAAC GACGGAGGCT 541 GTATCCCCCG AGGTAACAGT GCCTGAGGCG CGGGAGGAGG CGGGGGCAGG AGGTGATGGG 601 AACGAAGGTG CGGGTAGAAG TGAGAATCCG GGCAACAGAG AAGGGCTATA ATCACGAAGG 661 CCCTGGAGCT GGAGGGCTGT GCAGTCTGCA GACCTCAGTG GGGTGGGGGT GGGGGCCAAA 721 ACCATAAAGC AAGAACATTC CTGGGGACCT GCCAAGACCA GCTCTGGCCC TACGAGTTCT 781 AGCTGCACTG GCTGCCCAAA TCCCTAATTG TAAAGCCAGG AACTATCCTT TTCGCTCCCC 841 TCCATCTCCT TCCCTCATTT CCTCAATTCC TCTCCTTAGG CTTTTCCCCT CCTCCATCCG 901 TAGTGTTGTG TCATGGGAGG AAAGAACTGA GCAGATCTGA AGAAACTGAG CTGGCCAGCC 961 AGAGGCAACT AGAACTATTA GGAAAGCATA GACTCTGAAA GTCCCTAAAG AGATTACCAA 1021 GGTTTACCCT CTTTCTAATT CCCCTCCTCC CGCGGAGCAA AGCCAGACAT GGCCAACTGG 1081 ACAGCTCCCA GGTAACTGCA CTAGGTCTAG GCGTCTGTGA CCCTCCCTCC ATGGTTACTG 1141 GGTACCCCCT CCCCAGCGCT GAGTACCCAG ACCTCCGAAA GCACAACAAC TGCATGGCCA 1201 GTCACCTGAC CCCAGCAGTC TATGCACGGC TCTGCGACAA GACCACACCC ACTGGTTGGA 1261 CGCTAGATCA GTGTATCCAG ACTGGCGTGG ACAACCCTGG CCACCCCTTC ATCAAGACTG 1321 TGGGCATGGT GGCTGGAGAT GAGGAGACCT ATGAGGTAGG GGGTCCCCAG AGTCTCCCTG 1381 ATGATCCAAT TCATCTTCCC AGTAATCCCA GCTCCTTTCC CTTAAAGACC TCTCACTTTC 1441 CCCCAAGACT CTGAGCCCCC CATACTTAAG TTTTCTGAAC CAGTGAAATC AATGCACAAT 1501 TGAAGTCTGG GGAGGGATTC CCTCTCCTTA ACCATCTCTC CCTCTTAACT CCCCTTAGGT 1561 ATTTGCTGAC CTGTTTGACC CTGTGATCCA AGAGCGACAC AATGGATATG ACCCCCGGAC 1621 AATGAAGCAC ACCACGGATC TAGATGCCAG TAAAGTGAGT TCAAATATCC CACTTCTGAT 1681 TTGCATTGCC TGTGTACAAC ACTCTGTATC TCCAACCCCT TCACCTTATT TCCTGACTCA 1741 TGGTCATTAT ACTGCTGAGC TTTTAATCTT AATGTAAGGA AAGAATCATA TCTTAAGGGG 1801 CAGCATATAT GGAGATGGAA GGATAGATAA GAATGACCAT GACCCAAGGT GGGTGGTTTG 1861 GGGACGGGTC TGCAATGCCC CCTTCAATTC CAGTGCTTTC CCAAAGGGCC TCTTCTTCCA 1921 ATGCATGCAG GAAGAATGCA CACAGAGTCC TCTAATGCCT AAGGAAGGTC TCTCCTTTCC 1981 CAGGGGCCCT CAGTTCCCAC CGTGTTTCTG TGACTTACAT TCATTTCCCT TATCTCCCAG 2041 ATCCGTTCTG GCTACTTTGA TGAGAGGTAT GTATTGTCCT CTAGAGTCAG AACTGGCCGA 2101 AGCATCCGAG GACTCAGTCT GCCTCCAGCT TGCACTCGAG CAGAGCGACG AGAGGTGGAA 2161 CGTGTTGTGG TGGATGCACT GAGTGGCCTG AAGGGTGACC TGGCTGGACG TTACTATAGG 2221 CTCAGTGAGA TGACAGAGGC TGAACAGCAG CAGCTTATTG ATGTGAGGGC CTTAAGAGGG 2281 TGCTGGTTGG TGGGAGCAGA TGGGGAAGGC TGGGCCAGAT GAGACATGGG CTCTGAAAGG 2341 CCCAGGGGCC ACCATGAAGA TTCTTAACCC AAGTCCCGTT ACTCTTCCCA GGACCACTTT 2401 CTGTTTGATA AGCCTGTGTC CCCGTTGCTG ACTGCAGCAG GAATGGCTCG AGACTGGCCA 2461 GATGCTCGTG GAATTTGGTA TGAAGCTGCT CATTACCTCT TTTGTCTTCA TGCCCTCATA 2521 AATGCTTTTT TTCCCTCTAT CTCTCCCAAT TCTTGCCTTG CCTCTTGATC ACTGTCCCTC 2581 TCCGGCCCTC AGGCACAACA ATGAGAAGAG CTTCCTGATC TGGGTGAATG AGGAGGATCA 2641 TACACGGGTG ATCTCCATGG AGAAGGGTGG TAACATGAAG AGAGTGTTTG AAAGATTCTG 2701 CCGAGGCCTC AAAGAGGTTA GAGAAGACTA TGTAGGGGAG CTAGGTGGGA GGACATAAGG 2761 AAAACCAAAG AGTAGCATAA ATAGATTATG TAATTTACCA ACCAACCCAG GACATGTCTT 2821 ATAGTAAAAA GGACTATCTA GGACTCACTC CAGGACTAAA GGTGTAAACC AGCTGGGACC 2881 ATACTGGGAA AACCAGGACA TGTGGTCACA CTAAGATTAG GAAAAGAAAG AGTGTCAGGA 2941 ATCTTAGGAA GTGAACAAGG CTTTTGACAG AGAGTGCAAA GAAGGAATAA ATGAGATGGC 3001 ACGTCAGTGC CTGGGATGTG TGCAGTGGGA TGGTGAGGTG TGCAGATAAG GAAAACATTC 3061 GAGCTTAGAT TGATGTTGGC GGGGAGAGGT TGCTGTGTTC ATGACTCTAA TATAACCACC 3121 CAGTTCTGAG ACAAGGTAGG CCTTGACTCT GGATTCTATC ATTCTTGTTA AAGTTTCGGG 3181 TCTAGGCTTT AAGTTGAGAG TTCGGAGAGA GACTGGGGAA GGTGGAGGAT AGAATGGTTC 3241 GAGTTCTAGA ATATGTGGCT CTAGATGAGA GGTTGAACTG AATCATCAAT CCTACATGGA 3301 TTGGGTCTCC GTATTCAAGT CTACATTAGA AATCCCCATA AACTCAATTC AATTCTTACT 3361 GTATGTTCTC AAACATACAG TTCTATTTTA GGTTTGCAAA GAAAAAGAGC TCCTCTTTTA 3421 GATTCTGAGA AGTTTCTACT ATTTTTGGCA AGTAATAGAT AACATATTCT GACTATGAGT 3481 GGGTAGGGAA GTACCTTTAA ATTATATGCC TCAGTTTCCT CATCTGTAAA ATTGGGATAA 3541 TGAGATTTTC TACATTTTAG GTTGTTGTGG GGATTAAGTG AAATACAGGT AAAGTACTTG 3601 GTCCACAGTA AGTGCTTAAT AAGTGTTAAA GTGTTAGCTG CAATATTATT CTGGATGGAA 3661 GAGTTTCCCC CCATGTTCAG CATGTAAGAT ATCCCCTATG GCATGGTTCC TTCTGAACTA 3721 TAAAGAGGAT CCCTTTACTC ATGTTGGGTT GTGGTCTTTG TGACCATCAT TCTGCTAGAT 3781 CCCTTGTCTC TTGAACTCTA ATAGTCATCT TCATGACTAC ATGGTTAAGT GAAGCCAAAC 3841 GCCTTCCCCC CGCCCCCTAT TCCTATGAAT CTGGCTTTTC TGCTCTGTTT TCATCTTTCT 3901 CTGCATTCAC ACAGGTGCTC CGTTCACAGC TAACAGAATG TTATCTTACC TCTTCCTGGC 3961 AAAGCTTACA CCTTCATCTT CTGTCTGAAG GGACCCTTCT AAGCTCTAGG CTCATTAGCA 4021 AAGCAAAGAT AATCGATGCA TGCAGACCTC ATTGAATAAT CAGTCATCTC TCAGTTCAGT 4081 TTACCACCTC TGTTCATTTC CCTAGATCAT CCTTAATACA CCACTCCTTC GAGTTTTCTT 4141 CTTCCACATA AGATATTTTT TCACAATCTC ATTATTATGC ACATCATAAT TTTGCATCAT 4201 GCATGCATGA AAACAATAAC AAACCTTTTT CATTTAAAAA AAGACCAATG TCATTCATTC 4261 ACAGCCAAGT TTCTGTTCTA GACATATTTC TAGTGTTCTT GTGGGTCTAG CTAAGGGAGG 4321 GTCCAGGGTT AATGAAATAT CCCTGATTTT TCGTTAACAA AACCTTTGTG GACTCAGGTG 4381 GAGAGACTTA TCCAAGAACG TGGCTGGGAG TTCATGTGGA ATGAGCGTTT GGGATACATC 4441 TTGACCTGTC CATCTAACCT GGGCACTGGA CTTCGGGCAG GAGTGCACAT CAAACTGCCC 4501 CTGCTAAGCA AAGTAAAGGA GTTGTGGGGT TACAGAGGGG TGTGAGTAAG GAAGGGTGGG 4561 TTGTGGATGG GGAGGGAGTG GACCCTTTGG AAAGGAGCCA AACATGTTGT GGCTAAAGGG 4621 TCAGAGGACA GGCCAGGCAC AGTGGCTCAT GCCTCTAATC CCAACACTTG GGAGGCCAAG 4681 GCAGGCAGAT TACTTGAGCC CAGGAGTTCA AGACCAGCCT GGGCAACCTG GTGAAACCCC 4741 ATCTCTACCT ACAAATACAA AAGTTAGCTG GGTGTAGTGG AGGCTGAGGT GAGAGGATCA 4801 CTTAAGCCTG GGAAGTCGAG GCTTCAGTGA GCTGTGATCA CTCCAGCCTG GGTGACAGAG 4861 AGAGACCCTG TCTAAAAAAA ATTAAAAAAG AAAAAAGAAA AAAGGAAAAA AAAAGTTCAG 4921 GAGACAGAGC TCTGAGCAGG TTCAGGGCTC TTTCAGGTAG GACCTAGTCT CTGCCTCTAT 4981 TGACCCTGCT CCCAATCCCT ATCTCCTCTC TAGGATAGCC GCTTCCCAAA GATCCTGGAG 5041 AACCTAAGAC TCCAAAAACG TGGTACTGGA GGAGTGGACA CTGCTGCTAC AGGCGGTGTC 5101 TTTGATATTT CTAATTTGGA CCGACTAGGC AAATCAGAGG TGAGATCCTA AGGGATTAGG 5161 ACAAGGAGAG GTATAGGTCT GCGAGGGCCG AAATATGGCA GTGAGTGAGC CTCCGGGATG 5221 TAACATAATC TGAAATGAAA TTCAGGTTGA GTGGGAGGCA ATTGGAAATG AGCAGGCAAG 5281 TCAGTCAGTG ATAAAGAAAA ACTCAGACTG TAGGAAGCAG ATCAAAGATT AGTGTCCCTT 5341 AGGTGGAGCT GGTGCAACTG GTCATCGATG GAGTAAACTA TTTGATTGAT TGTGAACGGC 5401 GTCTGGAGAG AGGCCAGGAT ATCCGCATCC CCACACCTGT CATCCACACC AAGCATTAAC 5461 TCCCCATCGC CAGCTGATGA CTCAAGATTC CCAGGAGTTT TGCTCATTCT AATGATGGCC 5521 CATTCTACTT GCTCTGGACC TGCCCCCGCA TCCCCTGCCT CCATCCTAGT AAAGACTCCT 5581 TGCTATGCTG CAGCTGTCTG TGTTACTTCT AATGGTGGGG TGAGGAGGGA GCAGCCTTCA 5641 GGAAATGAAA AGAGGCAGTG GGATTATTTA TGATGGAAAG AGACTCCAGA TATGGCAACC 5701 CAGGAACACT GATTCTCAGG TGGGTGGAAA GCATTAACAT TTTACCCATA TTCCTCATCA 5761 GCTTCTGAAA ATAATCAGGA TGCACTTCTG TTTGCACTTT ATTCATTATG ACTTAAGATT 5821 TCTCTCCCCA CAATCTCCTT CTACTGTAGA GACAGGCTCA TAGCAGGTGG CCAAGGAAGC 5881 TGATAGTCAA TACCAGGGAC CAGGAAGGTC GTGACCAGTC CTGGAGGCCC CAGGCTGTAC 5941 TTCGACCTAT AATAGACAGG GAATGGGAGT AATATCACAA CTCAGCTCTC CAGGAGCATT 6001 GATACTTGGA AATTAGCGCT CTGCCTGTAG ACTCCTTCAC TCCAGGGATC TCCCTGGGTG 6061 CACTCTAAGA GCCAGACAGC ACCAAATTAG GGGTTTGATT CTGGGTCAGG AGATGGAGGA 6121 TCAAGCTGTG CAGCTGGGAA CTCACCTTGC TGTTCTGGGC TCTCCTTTCC CTCATGTTGG 6181 GCCCATGCAA CTGCTCGTCG CTGCTCAGGA CTCAGAAAGG CCATTTGCTC AGGAGTGACA 6241 GCCACAGCCT GAGCACTGGT GAGACTAGAT AGTTGGATGG GACTAAACAC CACCTGAGGG 6301 CAGGGGTAGG AATCAGTGCA TGCATGTAGT CCCCATTGGG CCCTGGCTCT CCTGTGGTCA 6361 CCCCAGTCCA TTAATACTTA CAGCAAATTT AGGAGGAGGG ATGACAGAAA TGGCAAGAGG 6421 AGTAACGCCC TGGATCTGTC CCCGCAGCAG TGCTGAAAGA GCCAGGTCTG GGATCCCAGC 6481 TGTTGAAGCA AGTGGCATCC AAACATTGTC TTAGACTGAC CTTCCCTCTC TTCAAACCTA 6541 TAGACCTTCT CTAACTACTC CCAAAGTGCC CTATCATAGA CCTTCCCCAA TATGTCTCTA 6601 GCCCCTTATT TAAACACCCT CAGGCCCCCA CCTTAAGAAT TGCAGGGCAG TCTTCCATCC 6661 AGTCCACCCA TGGTATAGAA ACCAAACCAA CTTGCACCAG CAGTGGCCCA GCTCCCCACC 6721 TGCTATGGTG CCAATTTCAG TGAAGATCTC AGGCCCCCAG TTACTGATTG GGCCAAACCC 6781 ACCAGGCAGT ACAAGTAGGT GGGCCAGAAC CTCCAGTTGT TCCTCAGAGC ACTGCAGATG 6841 CAGGGTGCCG AGGAAGAGAG CTGCTTGGCT GTAGAACAGT GGGAAGGAAG GAAGAA // LOCUS HUMCNPB1 749 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human blue cone photoreceptor pigment gene, exon 1. ACCESSION M13295 KEYWORDS apoprotein; blue cone protein; blue pigment protein; membrane protein; opsin. SEGMENT 1 of 5 SOURCE Human (male) DNA, clones gJHN[11,12,14,23]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 749) AUTHORS Nathans,J., Thomas,D. and Hogness,D.S. TITLE Molecular genetics of human color vision: The genes encoding blue, green, and red pigments JOURNAL Science 232, 193-202 (1986) STANDARD full staff_review FEATURES from to/span description pept 410 + 739 blue pigment protein, exon 1 /nomgen="BCP" /map="7q22-qter" /hgml_locus_uid="LZ0026E" pre-msg 403 > 749 BCP mRNA IVS 740 > 749 BCP intron A BASE COUNT 174 A 178 C 192 G 205 T ORIGIN Chromosome 7q22-qter. 1 GGGCAGATGA GTTGAGGAAA ACTTAACTGA TACAGTTGTG CCAGAAGCCA AAATAAGAGG 61 CGTGCCCTTT CTATAGCCCC ATTAAAAGAA CAAAAAAGTG GAAGCATCTT CAGTGAATAT 121 GGGTCAGCAC CTCCCAGACC TCAGGGAGTC CACTTCTGTT CATCCCAGCA CCCAGCATTG 181 CATATCCAGA TTATTTGAGC CCAATCTCTT ATCCTCTGAA GAACACAATC GGCTTTGGGG 241 CCACAGAAGG TTTAGGTAGT GGTTTAGGGA TTTCTAATCC CAAACTTTGT CCTTGGGAGG 301 TTTAGGATTA GTATTGATCA TTCACAGAGC CCAAGTGTTT TTAGAGGAGG GGTTTTGTGG 361 GGTGGGAGGA TCACCTATAA GAGGACTCAG AGGAGGGTGT GGGGCATCCA TGAGAAAAAT 421 GTCGGAGGAA GAGTTTTATC TGTTCAAAAA TATCTCTTCA GTGGGGCCGT GGGATGGGCC 481 TCAGTACCAC ATTGCCCCTG TCTGGGCCTT CTACCTCCAG GCAGCTTTCA TGGGCACTGT 541 CTTCCTTATA GGGTTCCCAC TCAATGCCAT GGTGCTGGTG GCCACACTGC GCTACAAAAA 601 GTTGCGGCAG CCCCTCAACT ACATTCTGGT CAACGTGTCC TTCGGAGGCT TCCTCCTCTG 661 CATCTTCTCT GTCTTCCCTG TCTTCGTCGC CAGCTGTAAC GGATACTTCG TCTTCGGTCG 721 CCATGTTTGT GCTTTGGAGG TACTGCAGG // LOCUS HUMCNPB2 182 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human blue cone photoreceptor pigment gene, exon 2. ACCESSION M13296 KEYWORDS apoprotein; blue cone protein; blue pigment protein; membrane protein; opsin. SEGMENT 2 of 5 SOURCE Human (male) DNA, clones gJHN[11,12,14,23]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 182) AUTHORS Nathans,J., Thomas,D. and Hogness,D.S. TITLE Molecular genetics of human color vision: The genes encoding blue, green, and red pigments JOURNAL Science 232, 193-202 (1986) STANDARD full staff_review FEATURES from to/span description pept + 11 + 172 blue pigment protein, exon 2 /nomgen="BCP" /map="7q22-qter" /hgml_locus_uid="LZ0026E" pre-msg < 1 > 182 BCP mRNA IVS < 1 10 BCP intron A IVS 173 > 182 BCP intron B BASE COUNT 30 A 52 C 51 G 49 T ORIGIN About 285 bp after segment 1. 1 TTCACCACAG GGCTTCCTGG GCACTGTAGC AGGTCTGGTT ACAGGATGGT CACTGGCCTT 61 CCTGGCCTTT GAGCGCTACA TTGTCATCTG TAAGCCCTTC GGCAACTTCC GCTTCAGCTC 121 CAAGCATGCA CTGACGGTGG TCCTGGCTAC CTGGACCATT GGTATTGGCG TCGTGAGAGT 181 GC // LOCUS HUMCNPB3 182 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human blue cone photoreceptor pigment gene, exon 3. ACCESSION M13297 KEYWORDS apoprotein; blue cone protein; blue pigment protein; membrane protein; opsin. SEGMENT 3 of 5 SOURCE Human (male) DNA, clones gJHN[11,12,14,23]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 182) AUTHORS Nathans,J., Thomas,D. and Hogness,D.S. TITLE Molecular genetics of human color vision: The genes encoding blue, green, and red pigments JOURNAL Science 232, 193-202 (1986) STANDARD full staff_review FEATURES from to/span description pept + 11 + 172 blue pigment protein, exon 3 /nomgen="BCP" /map="7q22-qter" /hgml_locus_uid="LZ0026E" pre-msg < 1 > 182 BCP mRNA IVS < 1 10 BCP intron B IVS 173 > 182 BCP intron C BASE COUNT 24 A 61 C 42 G 55 T ORIGIN About 322 bp after segment 2; chromosome 7q22-qter. 1 TCCTTTGCAG TCCATCCCAC CCTTCTTTGG CTGGAGCCGG TTCATCCCTG AGGGCCTGCA 61 GTGTTCCTGT GGCCCTGACT GGTACACCGT GGGCACCAAA TACCGCAGCG AGTCCTATAC 121 GTGGTTCCTC TTCATCTTCT GCTTCATTGT GCCTCTCTCC CTCATCTGCT TCGTGAGTGG 181 CA // LOCUS HUMCNPB4 290 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human blue cone photoreceptor pigment gene, exon 4. ACCESSION M13298 KEYWORDS apoprotein; blue cone protein; blue pigment protein; membrane protein; opsin. SEGMENT 4 of 5 SOURCE Human (male) DNA, clones gJHN[11,12,14,23]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 290) AUTHORS Nathans,J., Thomas,D. and Hogness,D.S. TITLE Molecular genetics of human color vision: The genes encoding blue, green, and red pigments JOURNAL Science 232, 193-202 (1986) STANDARD full staff_review FEATURES from to/span description pept + 11 + 280 blue pigment protein, exon 4 /nomgen="BCP" /map="7q22-qter" /hgml_locus_uid="LZ0026E" pre-msg < 1 > 290 BCP mRNA IVS < 1 10 BCP intron C IVS 281 > 290 BCP intron D BASE COUNT 63 A 84 C 71 G 72 T ORIGIN About 606 bp after segment 3; chromosome 7q22-qter. 1 TCCACCCCAG TCCTACACTC AGCTGCTGAG GGCCCTGAAA GCTGTTGCAG CTCAGCAGCA 61 GGAGTCAGCT ACGACCCAGA AGGCTGAACG GGAGGTGAGC CGCATGGTGG TTGTGATGGT 121 AGGATCCTTC TGTGTCTGCT ACGTGCCCTA CGCGGCCTTC GCCATGTACA TGGTCAACAA 181 CCGTAACCAT GGGCTGGACT TACGGCTTGT CACCATTCCT TCATTCTTCT CCAAGAGTGC 241 TTGCATCTAC AATCCCATCA TCTACTGCTT CATGAATAAG GTAAAGCTCT // LOCUS HUMCNPB5 377 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human blue cone photoreceptor pigment gene, exon 5. ACCESSION M13299 KEYWORDS apoprotein; blue cone protein; blue pigment protein; membrane protein; opsin. SEGMENT 5 of 5 SOURCE Human (male) DNA, clones gJHN[11,12,14]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 377) AUTHORS Nathans,J., Thomas,D. and Hogness,D.S. TITLE Molecular genetics of human color vision: The genes encoding blue, green, and red pigments JOURNAL Science 232, 193-202 (1986) STANDARD full staff_review COMMENT A polyadenylation signal is located at positions 167-172. FEATURES from to/span description pept + 11 133 blue pigment protein, exon 5 /nomgen="BCP" /map="7q22-qter" /hgml_locus_uid="LZ0026E" pre-msg < 1 > 290 BCP mRNA IVS < 1 10 BCP intron D BASE COUNT 110 A 89 C 68 G 110 T ORIGIN About 987 bp after segment 4; chromosome 7q22-qter. 1 TTCTCTCCAG CAGTTCCAAG CTTGCATCAT GAAGATGGTG TGTGGGAAGG CCATGACAGA 61 TGAATCCGAC ACATGCAGCT CCCAGAAAAC AGAAGTTTCT ACTGTCTCGT CTACCCAAGT 121 TGGCCCCAAC TGAGGACCCA ATATTGGCCT GTTTGCAACA GCTAGAATTA AATTTTACTT 181 TTAAGTAAGT TTCTATTGTC TCCGTCAGAA ACCAAACTAC TAAAAACACA AAAAAGATGG 241 TAAAAGGAGT GATGGCAGTT TGGGGAGTCA ATTTTTCATT TTCTTACTAT TGCCTTCTTG 301 CCTACAAAGC TACTGTTTCC ACTGGTCTAT TTCAGACCAC CCAAAGGCCA TTTCAACAAT 361 CATCAGTTTC TACTCCT // LOCUS HUMCNPG1 609 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human green cone photoreceptor pigment gene 1, exon 1. ACCESSION M13306 KEYWORDS apoprotein; green cone protein; green pigment protein; membrane protein; opsin. SEGMENT 1 of 6 SOURCE Human DNA, clone gJHN43; retina, cDNA to mRNA, clone hs2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 609) AUTHORS Nathans,J., Thomas,D. and Hogness,D.S. TITLE Molecular genetics of human color vision: The genes encoding blue, green, and red pigments JOURNAL Science 232, 193-202 (1986) STANDARD full staff_review COMMENT A poly adenylation signal is located at positions 212-217. FEATURES from to/span description pept 495 + 599 green pigment protein, exon 1 /nomgen="GCP" /map="Xq28" /hgml_locus_uid="LM0071N" pre-msg 454 > 609 GCP1 mRNA IVS 600 > 609 GCP1 intron A BASE COUNT 130 A 183 C 174 G 122 T ORIGIN 160 bp upstream of BstNI site; chromosome Xq28. 1 TTATTTAGTA GAAACGGGGT TTCACCATGT TAGTCAGGCT GGTCGGGAAC TCCTGACCTC 61 AGGAGATCTA CCCGCCTTGG CCTCCCAAAG TGCTGGGATT ACAGGCGTGT GCCACTGTGC 121 CCAGCCACTT TTTTTTAGAC AGAGTCTTGG TCTGTTGCCC AGGCTAGAGT TCAGTGGCGC 181 CATCTCAGCT CACTGCAACC TCCGCCTCCC AGATTCAAGC GATTCTCCTG CCTCGACCTC 241 CCAGTAGCTG GGATTACAGG TTTCCAGCAA ATCCCTCTGA GCCGCCCCCG GGGGCTCGCC 301 TCAGGAGCAA GGAAGCAAGG GGTGGGAGGA GGAGGTCTAA GTCCCAGGCC CAATTAAGAG 361 ATCAGATGGT GTAGGATTTG GGAGCTTTTA AGGTGAAGAG GCCCGGGCTG ATCCCACTGG 421 CCGGTATAAA GCGCCGTGAC CCTCAGGTGA CGCACCAGGG CCGGCTGCCG TCGGGGACAG 481 GGCTTTCCAT AGCCATGGCC CAGCAGTGGA GCCTCCAAAG GCTCGCAGGC CGCCATCCGC 541 AGGACAGCTA TGAGGACAGC ACCCAGTCCA GCATCTTCAC CTACACCAAC AGCAACTCCG 601 TGAGCCAGC // LOCUS HUMCNPG2 290 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human green cone photoreceptor pigment gene 1, exon 2. ACCESSION K03490 KEYWORDS apoprotein; green cone protein; green pigment protein; membrane protein; opsin. SEGMENT 2 of 6 SOURCE Human DNA, clone gJHN21; retina, cDNA to mRNA, clone hs2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 290) AUTHORS Nathans,J., Thomas,D. and Hogness,D.S. TITLE Molecular genetics of human color vision: The genes encoding blue, green, and red pigments JOURNAL Science 232, 193-202 (1986) STANDARD full staff_review COMMENT A poly adenylation signal is located at positions 212-217. FEATURES from to/span description pept + 11 + 280 green pigment protein, exon 2 /nomgen="GCP" /map="Xq28" /hgml_locus_uid="LM0071N" pre-msg < 1 > 290 GCP1 mRNA IVS < 1 10 GCP1 intron A IVS 281 > 290 GCP1 intron B variant 212 212 c in DNA; t in cDNA BASE COUNT 58 A 89 C 77 G 66 T ORIGIN About 4.65 kb after segment 1; chromosome Xq28. 1 CTGCCCTCAG ACCAGAGGCC CCTTCGAAGG CCCGAATTAC CACATCGCTC CCAGATGGGT 61 GTACCACCTC ACCAGTGTCT GGATGATCTT TGTGGTCATT GCATCCGTTT TCACAAATGG 121 GCTTGTGCTG GCGGCCACCA TGAAGTTCAA GAAGCTGCGC CACCCGCTGA ACTGGATCCT 181 GGTGAACCTG GCGGTCGCTG ACCTGGCAGA GACCGTCATC GCCAGCACTA TCAGCGTTGT 241 GAACCAGGTC TATGGCTACT TCGTGCTGGG CCACCCTATG GTAAGCCAGT // LOCUS HUMCNPG3 182 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human green cone photoreceptor pigment gene 1, exon 3. ACCESSION K03491 KEYWORDS apoprotein; green cone protein; green pigment protein; membrane protein; opsin. SEGMENT 3 of 6 SOURCE Human DNA, clone gJHN21; retina, cDNA to mRNA, clone hs2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 182) AUTHORS Nathans,J., Thomas,D. and Hogness,D.S. TITLE Molecular genetics of human color vision: The genes encoding blue, green, and red pigments JOURNAL Science 232, 193-202 (1986) STANDARD full staff_review FEATURES from to/span description pept + 11 + 172 green pigment protein, exon 3 /nomgen="GCP" /map="Xq28" /hgml_locus_uid="LM0071N" pre-msg < 1 > 182 GCP1 mRNA IVS < 1 10 GCP1 intron B IVS 173 > 182 GCP1 intron C variant 88 88 a in DNA; g in cDNA variant 92 92 a in DNA; c in cDNA variant 100 100 c in DNA; g in cDNA BASE COUNT 28 A 46 C 57 G 51 T ORIGIN About 1.99 kb after segment 2; chromosome Xq28. 1 CTCCCCATAG TGTGTCCTGG AGGGCTACAC CGTCTCCCTG TGTGGGATCA CAGGTCTCTG 61 GTCTCTGGCC ATCATTTCCT GGGAGAGATG GATGGTGGTC TGCAAGCCCT TTGGCAATGT 121 GAGATTTGAT GCCAAGCTGG CCATCGTGGG CATTGCCTTC TCCTGGATCT GGGTAAGGGT 181 GC // LOCUS HUMCNPG4 182 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human green cone photoreceptor pigment gene 1, exon 4. ACCESSION K03492 KEYWORDS apoprotein; green cone protein; green pigment protein; membrane protein; opsin. SEGMENT 4 of 6 SOURCE Human DNA, clone gJHN21; retina, cDNA to mRNA, clone hs2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 182) AUTHORS Nathans,J., Thomas,D. and Hogness,D.S. TITLE Molecular genetics of human color vision: The genes encoding blue, green, and red pigments JOURNAL Science 232, 193-202 (1986) STANDARD full staff_review FEATURES from to/span description pept + 11 + 172 green pigment protein, exon 4 /nomgen="GCP" /map="Xq28" /hgml_locus_uid="LM0071N" pre-msg < 1 > 182 GCP1 mRNA IVS < 1 10 GCP1 intron C IVS 173 > 182 GCP1 intron D BASE COUNT 29 A 64 C 48 G 41 T ORIGIN About 1.48 kb after segment 3; chromosome Xq28. 1 TTCTCTCCAG GCTGCTGTGT GGACAGCCCC GCCCATCTTT GGTTGGAGCA GGTACTGGCC 61 CCACGGCCTG AAGACTTCAT GCGGCCCAGA CGTGTTCAGC GGCAGCTCGT ACCCCGGGGT 121 GCAGTCTTAC ATGATTGTCC TCATGGTCAC CTGCTGCATC ACCCCACTCA GCGTAAGCCC 181 CC // LOCUS HUMCNPG5 290 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human green cone photoreceptor pigment gene 1, exon 5. ACCESSION K03493 KEYWORDS apoprotein; green cone protein; green pigment protein; membrane protein; opsin. SEGMENT 5 of 6 SOURCE Human DNA, clone gJHN21; retina, cDNA to mRNA, clone hs2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 290) AUTHORS Nathans,J., Thomas,D. and Hogness,D.S. TITLE Molecular genetics of human color vision: The genes encoding blue, green, and red pigments JOURNAL Science 232, 193-202 (1986) STANDARD full staff_review FEATURES from to/span description pept + 11 + 280 green pigment protein, exon 5 /nomgen="GCP" /map="Xq28" /hgml_locus_uid="LM0071N" pre-msg < 1 > 290 GCP1 mRNA IVS < 1 10 GCP1 intron D IVS 281 > 290 GCP1 intron E BASE COUNT 58 A 89 C 72 G 71 T ORIGIN About 1.65 kb after segment 4; chromosome Xq28. 1 TCTCCCTTAG ATCATCGTGC TCTGCTACCT CCAAGTGTGG CTGGCCATCC GAGCGGTGGC 61 AAAGCAGCAG AAAGAGTCTG AATCCACCCA GAAGGCAGAG AAGGAAGTGA CGCGCATGGT 121 GGTGGTGATG GTCCTGGCAT TCTGCTTCTG CTGGGGACCA TACGCCTTCT TCGCATGCTT 181 TGCTGCTGCC AACCCTGGCT ACCCCTTCCA CCCTTTGATG GCTGCCCTGC CGGCCTTCTT 241 TGCCAAAAGT GCCACTATCT ACAACCCCGT TATCTATGTC GTAAGCAACA // LOCUS HUMCNPG6 208 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human green cone photoreceptor pigment gene 1, exon 6. ACCESSION K03494 KEYWORDS apoprotein; green cone protein; green pigment protein; membrane protein; opsin. SEGMENT 6 of 6 SOURCE Human DNA, clone gJHN21; retina, cDNA to mRNA, clone hs2. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 208) AUTHORS Nathans,J., Thomas,D. and Hogness,D.S. TITLE Molecular genetics of human color vision: The genes encoding blue, green, and red pigments JOURNAL Science 232, 193-202 (1986) STANDARD full staff_review FEATURES from to/span description pept + 11 136 green pigment protein, exon 6 /nomgen="GCP" /map="Xq28" /hgml_locus_uid="LM0071N" pre-msg < 1 > 136 GCP1 mRNA IVS < 1 10 GCP1 intron E BASE COUNT 33 A 74 C 45 G 56 T ORIGIN About 2.27 kb after segment 5; chromosome Xq28. 1 GTCCTTCCAG TTTATGAACC GGCAGTTTCG AAACTGCATC TTGCAGCTTT TCGGGAAGAA 61 GGTTGACGAT GGCTCTGAAC TCTCCAGCGC CTCCAAAACG GAGGTCTCAT CTGTGTCCTC 121 GGTATCGCCT GCATGAGGTC TGCCTCCTAC CCATCCCGCC CACCGGGGCT TTGGCCACCT 181 CTCCTTTCCC CCTCCTTCTC CATCCCTG // LOCUS HUMCNPR1 609 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human red cone photoreceptor pigment gene, exon 1. ACCESSION M13300 KEYWORDS apoprotein; membrane protein; opsin; red cone protein; red pigment protein. SEGMENT 1 of 6 SOURCE Human DNA, clone gJHN33; retina, cDNA to mRNA, clones hs[4,7]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 609) AUTHORS Nathans,J., Thomas,D. and Hogness,D.S. TITLE Molecular genetics of human color vision: The genes encoding blue, green, and red pigments JOURNAL Science 232, 193-202 (1986) STANDARD full staff_review FEATURES from to/span description pept 495 + 599 red pigment protein, exon 1 /nomgen="RCP" /map="Xq28" /hgml_locus_uid="LT0068A" pre-msg 454 > 609 RCP mRNA IVS 600 > 609 RCP intron A BASE COUNT 143 A 185 C 186 G 95 T ORIGIN 1 bp upstream of BamHI site. 1 GGATCCGGTT CCAGGCCTCG GCCCTAAATA GTCTCCCTGG GCTTTCAAGA GAACCACATG 61 AGAAAGGAGG ATTCGGGCTC TGAGCAGTTT CACCACCCAC CCCCCAGTCT GCAAATCCTG 121 ACCCGAGGGT CCACCTGCCC CAAAGGCGGA CGCAGGACAG TAGAAGGGAA CAGAGAACAC 181 ATAAACACAG AGAGGGCCAC AGCGGCTCCC ACAGTCACCG CCACCTTCCT GGCGGGGATG 241 GGTGGGGCGT CTGAGTTTGG TTCCCAGCAA ATCCCTCTGA GCCGCCCTTG CGGGCTCGCC 301 TCAGGAGCAG GGGAGCAAGA GGTGGGAGGA GGAGGTCTAA GTCCCAGGCC CAATTAAGAG 361 ATCAGGTAGT GTAGGGTTTG GGAGCTTTTA AGGTGAAGAG GCCCGGGCTG ATCCCACAGG 421 CCAGTATAAA GCGCCGTGAC CCTCAGGTGA TGCGCCAGGG CCGGCTGCCG TCGGGGACAG 481 GGCTTTCCAT AGCCATGGCC CAGCAGTGGA GCCTCCAAAG GCTCGCAGGC CGCCATCCGC 541 AGGACAGCTA TGAGGACAGC ACCCAGTCCA GCATCTTCAC CTACACCAAC AGCAACTCCG 601 TGAGCCAGC // LOCUS HUMCNPR2 290 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human red cone photoreceptor pigment gene, exon 2. ACCESSION M13301 KEYWORDS apoprotein; membrane protein; opsin; red cone protein; red pigment protein. SEGMENT 2 of 6 SOURCE Human DNA, clone gJHN33; retina, cDNA to mRNA, clones hs[4,7]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 290) AUTHORS Nathans,J., Thomas,D. and Hogness,D.S. TITLE Molecular genetics of human color vision: The genes encoding blue, green, and red pigments JOURNAL Science 232, 193-202 (1986) STANDARD full staff_review FEATURES from to/span description pept + 11 + 280 red pigment protein, exon 2 /nomgen="RCP" /map="Xq28" /hgml_locus_uid="LT0068A" pre-msg < 1 > 290 RCP mRNA and intron IVS < 1 10 RCP intron A IVS 281 > 290 RCP intron B variant 193 193 a in DNA; g in cDNA variant 236 236 a in DNA; g in cDNA variant 252 252 c in DNA; a in cDNA BASE COUNT 60 A 91 C 74 G 65 T ORIGIN About 6.6 kb after segment 1; chromosome Xq28. 1 CTGCCCTCAG ACCAGAGGCC CCTTCGAAGG CCCGAATTAC CACATCGCTC CCAGATGGGT 61 GTACCACCTC ACCAGTGTCT GGATGATCTT TGTGGTCACT GCATCCGTTT TCACAAATGG 121 GCTTGTGCTG GCGGCCACCA TGAAGTTCAA GAAGCTGCGC CACCCGCTGA ACTGGATCCT 181 GGTGAACCTG GCAGTCGCTG ACCTAGCAGA GACCGTCATC GCCAGCACTA TCAGCATTGT 241 GAACCAGGTC TCTGGCTACT TCGTGCTGGG CCACCCTATG GTAAGCCAGT // LOCUS HUMCNPR3 182 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human red cone photoreceptor pigment gene, exon 3. ACCESSION M13302 KEYWORDS apoprotein; membrane protein; opsin; red cone protein; red pigment protein. SEGMENT 3 of 6 SOURCE Human DNA, clone gJHN53; retina, cDNA to mRNA, clones hs[4,7]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 182) AUTHORS Nathans,J., Thomas,D. and Hogness,D.S. TITLE Molecular genetics of human color vision: The genes encoding blue, green, and red pigments JOURNAL Science 232, 193-202 (1986) STANDARD full staff_review FEATURES from to/span description pept + 11 + 172 red pigment protein, exon 3 /nomgen="RCP" /map="Xq28" /hgml_locus_uid="LT0068A" pre-msg < 1 > 182 RCP mRNA and intron IVS < 1 10 RCP intron B IVS 173 > 182 RCP intron C variant 88 88 g in DNA; g or a in cDNA variant 92 92 c in DNA; c or a in cDNA variant 100 100 g in DNA; g or c in cDNA BASE COUNT 26 A 46 C 59 G 51 T ORIGIN About 2.0 kb after segment 2; chromosome Xq28. 1 CTCCCCATAG TGTGTCCTGG AGGGCTACAC CGTCTCCCTG TGTGGGATCA CAGGTCTCTG 61 GTCTCTGGCC ATCATTTCCT GGGAGAGGTG GCTGGTGGTG TGCAAGCCCT TTGGCAATGT 121 GAGATTTGAT GCCAAGCTGG CCATCGTGGG CATTGCCTTC TCCTGGATCT GGGTAAGGGT 181 GC // LOCUS HUMCNPR4 182 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human red cone photoreceptor pigment gene, exon 4. ACCESSION M13303 KEYWORDS apoprotein; membrane protein; opsin; red cone protein; red pigment protein. SEGMENT 4 of 6 SOURCE Human DNA, clone gJHN53; retina, cDNA to mRNA, clones hs[4,7]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 182) AUTHORS Nathans,J., Thomas,D. and Hogness,D.S. TITLE Molecular genetics of human color vision: The genes encoding blue, green, and red pigments JOURNAL Science 232, 193-202 (1986) STANDARD full staff_review FEATURES from to/span description pept + 11 + 172 red pigment protein, exon 4 /nomgen="RCP" /map="Xq28" /hgml_locus_uid="LT0068A" pre-msg < 1 > 182 RCP mRNA and intron IVS < 1 10 RCP intron C IVS 173 > 182 RCP intron D variant 11 11 t in DNA; g in cDNA BASE COUNT 28 A 63 C 47 G 44 T ORIGIN About 1.5 kb after segment 3; chromosome Xq28. 1 TTCTCTCCAG TCTGCTGTGT GGACAGCCCC GCCCATCTTT GGTTGGAGCA GGTACTGGCC 61 CCACGGCCTG AAGACTTCAT GCGGCCCAGA CGTGTTCAGC GGCAGCTCGT ACCCCGGGGT 121 GCAGTCTTAC ATGATTGTCC TCATGGTCAC CTGCTGCATC ATCCCACTCG CTGTAAGCCC 181 CC // LOCUS HUMCNPR5 290 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human red cone photoreceptor pigment gene, exon 5. ACCESSION M13304 KEYWORDS apoprotein; membrane protein; opsin; red cone protein; red pigment protein. SEGMENT 5 of 6 SOURCE Human DNA, clone gJHN53; retina, cDNA to mRNA, clones hs[4,7]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 290) AUTHORS Nathans,J., Thomas,D. and Hogness,D.S. TITLE Molecular genetics of human color vision: The genes encoding blue, green, and red pigments JOURNAL Science 232, 193-202 (1986) STANDARD full staff_review FEATURES from to/span description pept + 11 + 280 red pigment protein, exon 5 /nomgen="RCP" /map="Xq28" /hgml_locus_uid="LT0068A" pre-msg < 1 > 290 RCP mRNA and intron IVS < 1 10 RCP intron D IVS 281 > 290 RCP intron E BASE COUNT 61 A 87 C 71 G 71 T ORIGIN About 1.6 kb after segment 4; chromosome Xq28. 1 TCTCCCTTAG ATCATCATGC TCTGCTACCT CCAAGTGTGG CTGGCCATCC GAGCGGTGGC 61 AAAGCAGCAG AAAGAGTCTG AATCCACCCA GAAGGCAGAG AAGGAAGTGA CGCGCATGGT 121 GGTGGTGATG ATCTTTGCGT ACTGCGTCTG CTGGGGACCC TACACCTTCT TCGCATGCTT 181 TGCTGCTGCC AACCCTGGTT ACGCCTTCCA CCCTTTGATG GCTGCCCTGC CGGCCTACTT 241 TGCCAAAAGT GCCACTATCT ACAACCCCGT TATCTATGTC GTAAGCAACA // LOCUS HUMCNPR6 388 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human red cone photoreceptor pigment gene, exon 6. ACCESSION M13305 KEYWORDS apoprotein; membrane protein; opsin; red cone protein; red pigment protein. SEGMENT 6 of 6 SOURCE Human DNA, clone gJHN53; retina, cDNA to mRNA, clones hs[4,7]. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 388) AUTHORS Nathans,J., Thomas,D. and Hogness,D.S. TITLE Molecular genetics of human color vision: The genes encoding blue, green, and red pigments JOURNAL Science 232, 193-202 (1986) STANDARD full staff_review COMMENT A poly adenylation signal is located at positions 212-217. FEATURES from to/span description pept + 11 136 red pigment protein, exon 6 /nomgen="RCP" /map="Xq28" /hgml_locus_uid="LT0068A" pre-msg < 1 240 RCP mRNA IVS < 1 10 RCP intron E BASE COUNT 81 A 113 C 93 G 101 T ORIGIN About 2.3 kb after segment 5; chromosome Xq28. 1 GTCCTTCCAG TTTATGAACC GGCAGTTTCG AAACTGCATC TTGCAGCTTT TCGGGAAGAA 61 GGTTGACGAT GGCTCTGAAC TCTCCAGCGC CTCCAAAACG GAGGTCTCAT CTGTGTCCTC 121 GGTATCGCCT GCATGAGGTC TGCCTCCTAC CCATCCCGCC CACCGGGGCT TTGGCCACCT 181 CTCCTTTCCC CCTCCTTCTC CATCCCTGTA AAATAAATGT AATTTATCTT TGCCAAAACC 241 AACAAAGTCA CAGAGGCTTT CACTGCAGTG TGGGACCACC TGAGCCTCTG CGTGTGCAGG 301 CACTGGGTCT CGAGAGGGTG CTTGGGGGAT AAAGAGGAGA GAGCGCTTCA TAGACTTTAA 361 GTTTTCCCGA GCCTCATGTC TACCGATG // LOCUS HUMCOL1A01 2181 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 1, and alpha-2 collagen type IV gene, exons 2 and 3. ACCESSION J04217 J05039 KEYWORDS collagen. SEGMENT 1 of 42 SOURCE Human (ATCC 577281) DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 2181) AUTHORS Soininen,R., Huotari,M., Hostikka,S.L., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence [1] kindly submitted by K.Tryggvason, 04-OCT-1988. FEATURES from to/span description pept 994 951 (c) alpha-2 collagen type IV, exon 2 (first expressed exon) /nomgen="COL4A2" /map="13q34" /hgml_locus_uid="LJ0118J" 829 / 773 (c) alpha-2 collagen type IV, exon 3 pept 1869 + 1952 alpha-1 collagen type IV, exon 1 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZ0163F" pre-msg 1613 < 772 (c) alpha-2 collagen type IV mRNA and introns IVS 1739 1039 (c) alpha-2 intron A IVS 949 830 (c) alpha-2 intron B IVS 773 < 1 (c) alpha-2 intron C pre-msg 1740 > 2181 alpha-1 collagen type IV mRNA and intron IVS 1953 > 2181 alpha-1 intron A BASE COUNT 383 A 754 C 683 G 361 T ORIGIN Chromosome 13q34. 1 GAATTCACCT ACCAAACCTC TTCATTTTAG CCATGGTTAC ATAAAGCTCA GTAAAAGTGG 61 CTCCTACTCT TTAAACAAAA CGTATAAAAT GGACAAAGCC TCGCAGTGCA GTCGCCAGCT 121 GCACCCACGG TATCGGCAGG GCTGTGCACG GCCGGCGTGG AGGGGCAGGC TCGGCGAACA 181 GGATCCCAGT GTGGCGCCCG GAGCAGCTCT TCCAGGGCCG GAACAGGACT TGGGAGAAAA 241 CGCGGTTTTT CTGCCATTGT TCCCTCGCAA TGCTTCAATT CAAACAATAC TAAGGTTGTC 301 TCGTTCGCCT CCTCGCCCCG CCCCTTTACC CAGAGACTCC CATTTAAAAA CACAAAAAAC 361 CCTCAGTCCG TGACAGAGAC TCCCCACCAG CTCCCGAGTT CCAGGAAAAC TTTTTTCGCC 421 CTAAGCCTGG CGTCTCACTG CCTAGTGACT GACACACTTT CCTGGCCTCT ACGGCTGCAG 481 CTTTGCCCAA AGCAACGAAA ACGGTGCCTC AGGCTACATT TTAAGTTCCC AGTTCCCGTG 541 AAAGGCCGAC GCTTGCAAAA CACAGGATTG ACGCTGGGAA CCCAAGGGGA AGGAAAGGTG 601 GAAAGCATGA TGAGGGAAGA GCAAGCTTTC TCAGAACCTT GGTGGGCAGC CTGGCGAACA 661 TCCACACGCA CACACACACT CGGGAGCGCA CGGACGAGCT GCCTTCTCCA AGGCCACTCA 721 AACGTCCCAA CCACTCTCGG GCGCGCAAGT CCAAGCGCGG GAGCCAGGAC TTACCGCCAA 781 GACGCTCTGG GCGAGGAACC CCACGGTCAC TGTCCCCAGC AGCAGCCACC TGCATGGGAA 841 AGGGAGGAAG AGAGACGTGA ACGTGCGGGC CCGGCTTGTC AGTCCCCGAG AGTTACACCG 901 AAGGGTCCAT GCGCGCGTGA CCCACGGGGA CCAGGCAGAA AGTCGCTTAC CGCCGTAGGG 961 CAGGGCCGGC CACCGCGCGC TGGTCTCTCC CCATGCTGGC GGTTCGTCCA CTCTGGGCCC 1021 CGGTCAGTCC CACTTAGCCT AGAGGCGACA GACAAAGCGA GTTTAGCGCA GGATGAGGGA 1081 GGCAGCCCAT CCTCACCGCC GGTCTCGGTC CGCGAGACGC GGGGACAGCG CGGTGCGCGG 1141 CCCGCATGCA GGGGTGACCG GAGGCCGTCC CCCCCACGAC CCGGAAAGAA GGAGAGCCTC 1201 CCGTTAGGCC CCTGTGGGTG CTCCTTGGCC GAGGAGCTCG GTCTTCGCTC TCCCACCCTC 1261 CCCCTTTCTA CTCCCAAGCA GGAGAGCGTG CAGCCCTAGC CTGCACAAGG CGCTCCAAGT 1321 CGGTGCTCTC GGGCCAGGGT CTGGGCGCCG CTCGGGGGTC GCTCTCACCT CGGGCTCGGC 1381 TTCAGGGGCT GCTGCCCGAA CGCATTGGCC CTTCCAGAAG CACCCGCCGG CGGCACACCG 1441 GCAGGGCGGC CGGCCTTGCT GGGCTCTCTG GGCGCCGGCC CCGGGGGCTC GGGCGGCCCC 1501 TTTCGGTCCT CGGCCTGGAT CCGCGAGCGC GCGGGCGCAA GGGGTTGGGA CGCGGCAGCC 1561 TCTTGAGTGC GGAGCGCGGA GCCCTGGTGT CCCGGCGCAC GGCAGCCACA CTCCCGGGCC 1621 GCGCGCTCCC GCCGCCTCTT ACCCGCGCCG CAGGGTCCTC CCCTTTGAGG CGCCGCCCGC 1681 GCACCGCCGG GGGGGAGGGG GCAGCGCCAA CAAATTGGGG AGCTCGGCCC GCCGCGCTCA 1741 GGTCTCCGCT TGGAGCCGCC GCACCCGGGA CGGTGCGTAT CGCTGGAAGT CCGGCCTTCC 1801 GAGAGCTAGC TGTCCGCCGC GGCCCCCGCA CGCCGGGCAG CCGTCCCTCG CGCCTCGGGC 1861 GCGCCACCAT GGGGCCCCGG CTCAGCGTCT GGCTGCTGCT GCTGCCCGCC GCCCTTCTGC 1921 TCCACGAGGA GCACAGCCGG GCCGCTGCGA AGGTGAGTTC CCGGCCAGCT CCGCTCCCGG 1981 CGTCCCGCCC CGAGCTTGGG CGCCCCGAGA GGCCCCTTTG TCCGCGCCTG GACCCGTCCG 2041 CCTGCCCCTC GGGGGTCGCG CGTGGCACGG CCAGGTGCAT TCTCTGGGCC GGGGTTCGTT 2101 GGGGGTCCCT GTAGGCTACG ATCGCGCATT GGTGGACCGA GCCTCCTTTG TTATGGTATG 2161 GGTACTGGAG AGTTAAGGAA T // LOCUS HUMCOL1A03 241 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exons 3 and 4. ACCESSION M26540 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 3 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 241) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_review REFERENCE 2 (bases 1 to 241) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 9 98 alpha-1 collagen type IV, exon 3 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" 197 + 241 alpha-1 collagen type IV, exon 4 pre-msg < 1 > 241 alpha-1 collagen type IV mRNA and introns IVS < 1 8 alpha-1 intron B IVS 99 196 alpha-1 intron C BASE COUNT 64 A 44 C 81 G 52 T ORIGIN About 0.2 kb after segment 4; chromosome 13q34. 1 TTCTGTAGGG TGAAAGAGGC CTCCCGGGGT TACAAGGTGT CATTGGGTTT CCTGGAATGC 61 AAGGACCTGA GGGGCCACAG GGACCACCAG GACAAAAGGT AAGCACGGCT GTGGGATTGG 121 GGGTGGGTGG ATGTAAGATT GCCCGATTCT TGAGATAGCG GTGATCAATG ACAATGGCAT 181 TTCTATTCTG TTCCAGGGTG ATACTGGAGA ACCAGGACTA CCTGGAACAA AAGGGACAAG 241 A // LOCUS HUMCOL1A04 452 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exons 5 and 6. ACCESSION M26542 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 4 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 452) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 452) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 55 99 alpha-1 collagen type IV, exon 5 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" 194 + 256 alpha-1 collagen type IV, exon 6 pre-msg < 1 > 452 alpha-1 collagen type IV mRNA and introns IVS < 1 54 alpha-1 intron D IVS 100 193 alpha-1 intron E IVS 257 > 452 alpha-1 intron F BASE COUNT 111 A 102 C 96 G 143 T ORIGIN About 0.6 kb after segment 6; chromosome 13q34. 1 GAGAAGTGAG GCTTCCCGGC TCTTCACTGA CACGCTTTTA CTTCCGCCTC ACAGGGACCT 61 CCGGGAGCAT CTGGCTACCC TGGAAACCCA GGACTTCCCG TATGTATAGA AAACGTGCTC 121 TACTTCTTTT ATGAAATATT CTTCCATCAG GCGATTTTCT GCCTGGGTTA AATTTTCACT 181 TTCTTCATTG CAGGGAATTC CTGGCCAAGA CGGCCCGCCA GGCCCCCCAG GTATTCCAGG 241 ATGCAATGGC ACAAAGGTAA ATCCAGAACC GAGACCCTCC TTTTTGTGTG TGTTTACGTA 301 ATTTTTGCAT ATTAAGGAGT CAGGTAGTGT GATTCTGTTA ATAGAGTTTT ATTTGCCACA 361 ATTGGAAAGT TGCTTGTCTT AAAGTTTGCT TTATTTAGTA AGGAAATACA GTTTTCCCAT 421 ATTTAGTGTA CCAGAAAGAT ATAATTGGTC CC // LOCUS HUMCOL1A05 405 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exons 7 and 8. ACCESSION M26543 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 5 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 405) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 405) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 41 94 alpha-1 collagen type IV, exon 7 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" 290 + 316 alpha-1 collagen type IV, exon 8 pre-msg < 1 > 405 alpha-1 collagen type IV mRNA and introns IVS < 1 40 alpha-1 intron F IVS 95 289 alpha-1 intron G IVS 317 > 405 alpha-1 intron H BASE COUNT 107 A 79 C 71 G 148 T ORIGIN About 1.2 kb after segment 7; chromosome 13q34. 1 CTGCCACACA CAGTGCGTAT GAATGATGCT CTGTTCCCAG GGGGAGAGAG GGCCGCTCGG 61 GCCTCCTGGC TTGCCTGGTT TCGCAGGAAA TCCCGTGAGT AGAGGTTATT TAGGGCAGAA 121 CTTCTTTCTT TTAGTTATGA CTTCTCTCTT TTCATTCCAT TTCCTTTTCT TTCTTTCTTT 181 GCTTATGTTT TAATAACTTA TATTTTATAT ATAATATTAC ATGATAACAA TTTTAATGCT 241 AATGATATTT TGAGAAAAAC AAAACTAACA CCAAAACTTT CTTTTTTAGG GACCACCAGG 301 CTTACCAGGG ATGAAGGTAC ATTTTATTTT ATTTTGCTTT ATTGGATTCA TTACAGGGAA 361 TCTGTTTACA ACAAGTGCCA TAGAAGCACA TTCTCTTTAA ACCAC // LOCUS HUMCOL1A06 358 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exons 9 and 10. ACCESSION M26544 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 6 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 358) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 358) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 18 101 alpha-1 collagen type IV, exon 9 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" 188 + 250 alpha-1 collagen type IV, exon 10 pre-msg < 1 > 358 alpha-1 collagen type IV mRNA and introns IVS < 1 17 alpha-1 intron H IVS 102 187 alpha-1 intron I IVS 251 > 358 alpha-1 intron J BASE COUNT 91 A 86 C 80 G 101 T ORIGIN About 0.7 kb after segment 10; chromosome 13q34. 1 CTTTCTTCAT CCTCTAGGGT GATCCAGGTG AGATACTTGG CCATGTGCCC GGGATGCTGT 61 TGAAAGGTGA AAGAGGATTT CCCGGAATCC CAGGGACTCC AGTAAGCATT TGCTGAATCA 121 TTATGAACAT GTGCCACCTC ACCCTCCCAG TTCGCATCTG AACTCATTTC TTTCTCATGC 181 ATTCTAGGGC CCACCAGGAC TGCCAGGGCT TCAAGGTCCT GTTGGGCCTC CAGGATTTAC 241 CGGACCACCA GTAAGTTTTG GGGGCTGTCT CTCCGAGGCA ATCATTTAAA AAACAACTAT 301 ATTGAGGTTT GAAAATAAAT AATAATTTAG TAAAAACTTT ACACAGTTGT GGTCGCTC // LOCUS HUMCOL1A07 246 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 11. ACCESSION M26545 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 7 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 246) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 246) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 211 + 246 alpha-1 collagen type IV, exon 11 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" pre-msg < 1 > 246 alpha-1 collagen type IV mRNA and intron IVS < 1 210 alpha-1 intron J BASE COUNT 69 A 55 C 67 G 55 T ORIGIN About 0.7 kb after segment 11; chromosome 13q34. 1 GGGAGCAGGG AGATGGATTG GTATTGGTAT GGTCACACCT GAGGGTCCTG CACCTGCTAG 61 GAGTGGGGAG GGAGGGGAAG CCCAGGAAAA ATATTTCACA AATAAATTAT CATGTTGACC 121 AGAGAATCTT AAGATAACGT CAGCCTGAAG AAGGGCTTAA AGCTCCCTAG AGTTCTGAAT 181 CCTATTTAAT TAAAATGCTC TCTCTCCCAG GGTCCCCCAG GCCCTCCCGG CCCTCCAGGT 241 GAAAAG // LOCUS HUMCOL1A08 251 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 12. ACCESSION M26546 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 8 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 251) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 251) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 167 + 208 alpha-1 collagen type IV, exon 12 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" pre-msg < 1 > 251 alpha-1 collagen type IV mRNA and introns IVS < 1 166 alpha-1 intron K IVS 209 > 251 alpha-1 intron L BASE COUNT 79 A 41 C 55 G 76 T ORIGIN About 1.4 kb after segment 12; chromosome 13q34. 1 GAATTCCAGC ATGATCAGTT ACTTTCCAGA AAATTTTCAT GAGAGTTATG GGACAAAGCT 61 ATTGCCTGAA ATCTACCATC TTATTGCATT TTGTGTGAGG TTTGTAGAAT CGACTTTGAA 121 CGAAGTAGAC AAATCTAATA ATGAGAGCCT AATTTTTAAT CCACAGGGAC AAATGGGCTT 181 AAGTTTTCAA GGACCAAAAG GTGACAAGGT GAGTGCATAT TGCTCTGGAG TGCCTTTCCA 241 TGTTCGGAAA A // LOCUS HUMCOL1A09 366 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 13. ACCESSION M26547 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 9 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 366) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 366) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 1 + 87 alpha-1 collagen type IV, exon 13 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" pre-msg < 1 > 366 alpha-1 collagen type IV mRNA and intron IVS 88 > 366 alpha-1 intron M BASE COUNT 112 A 71 C 101 G 82 T ORIGIN About 0.7 kb after segment 13; chromosome 13q34. 1 GGTGACCAAG GGGTCAGTGG GCCTCCAGGA GTACCAGGAC AAGCTCAAGT TCAAGAAAAA 61 GGAGACTTCG CCACCAAGGG AGAAAAGGTA TGAATGGGCT TCATCAGTGA ATACTGATTT 121 TCTAATTTTG GACAAATTCC AAAAGACAAA GAGAAAATGG GTCAAAACTC AGTTTTTCAG 181 CTACATTCAA TCCTGGTCTG GAAAACAGAT TTATAGACTG TTCAGTCCAT AAAATACGAG 241 CCCCTTTGAG AGCAGAGACA GTGGAACTGG ATTGGTCCAA GCCTGATCTG AGGCTCACCC 301 GGCTGGATCA GGGCTTCTTA AATGTGCCCC AAGTTGGAGA TGGCTTGGTC AGGAGGAGGG 361 AGTGGG // LOCUS HUMCOL1A10 264 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exons 14 and 15. ACCESSION M26537 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 10 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 264) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 264) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 14 40 alpha-1 collagen type IV, exon 14 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" 192 + 242 alpha-1 collagen type IV, exon 15 pre-msg < 1 > 264 alpha-1 collagen type IV mRNA and introns IVS < 1 13 alpha-1 intron M IVS 41 191 alpha-1 intron N IVS 243 > 264 alpha-1 intron O BASE COUNT 78 A 51 C 49 G 86 T ORIGIN About 1.6 kb after segment 15; chromosome 13q34. 1 GAATTCCTTT CAGGGCCAAA AAGGTGAACC TGGATTTCAG GTCAGTACTC ACTTTCTGCC 61 TATCATTTTT AGGTCCAAAG ACATAAAGAT TTAAGTTTCA AGTTTTTTTC ACCTTAAGTT 121 TAATATCTAT CTAATTTAAA TTTTGCCTTA AATAGACGAA CTAAATCTCT TTGTCTTTTT 181 ATTCCTTTAA GGGGATGCCA GGGGTCGGAG AGAAAGGTGA ACCCGGAAAA CCAGGACCCA 241 GAGTAAGTGC CTTTTCCAAA GTCC // LOCUS HUMCOL1A11 246 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exons 16 and 17. ACCESSION M26538 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 11 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 246) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 246) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 1 45 alpha-1 collagen type IV, exon 16 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" 132 + 185 alpha-1 collagen type IV, exon 17 pre-msg < 1 > 246 alpha-1 collagen type IV mRNA and introns IVS 46 131 alpha-1 intron O IVS 186 > 246 alpha-1 intron P BASE COUNT 56 A 66 C 62 G 62 T ORIGIN About 2 kb after segment 17; chromosome 13q34. 1 GGCAAACCCG GAAAAGATGG TGACAAAGGG GAAAAAGGGA GTCCCGTAAG TGTTTCTCTG 61 TGATTTTTAC AAGCAGGCTC ACTGTTCCAC CACTGGCCTC TGACGGTTAC TGCTTCTCTC 121 TTCGGTTTCA GGGTTTTCCT GGTGAACCCG GGTACCCAGG ACTCATAGGC CGCCAGGGCC 181 CGCAGGTAAA CGCAGGCACT ATTGTGAGCC CTTTATCTTC ACTTTCCAAT TCAACCCAGA 241 AGTCTG // LOCUS HUMCOL1A12 89 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 18. ACCESSION M26548 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 12 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 89) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 89) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 10 + 51 alpha-1 collagen type IV, exon 18 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" pre-msg < 1 > 89 alpha-1 collagen type IV mRNA and introns IVS < 1 9 alpha-1 intron P IVS 52 > 89 alpha-1 intron Q BASE COUNT 18 A 18 C 31 G 22 T ORIGIN About 2.5 kb after segment 18; chromosome 13q34. 1 CTTTTGTAGG GAGAAAAGGG TGAAGCAGGT CCTCCTGGCC CACCTGGAAT TGTGAGTAAG 61 AGTGCTGTCC CTGGGTCTGT GAGAGCGCT // LOCUS HUMCOL1A13 400 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 19. ACCESSION M26549 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 13 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 400) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 400) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 13 + 97 alpha-1 collagen type IV, exon 19 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" pre-msg < 1 > 400 alpha-1 collagen type IV mRNA and introns IVS < 1 12 alpha-1 intron Q IVS 98 > 400 alpha-1 intron R BASE COUNT 41 A 168 C 50 G 141 T ORIGIN About 1 kb after segment 19; chromosome 13q34. 1 TTCTCGATTC AGGTTATAGG CACAGGACCT TTGGGAGAAA AAGGAGAGAG GGGCTACCCT 61 GGAACTCCGG GGCCAAGAGG AGAGCCAGGC CCAAAAGGTA GCTTTTTCTT GCCTTTTTTT 121 CCCCCCTCCT CCTACTCCCC CTCCTTCTCT CCTCCTCCTC TTCCTCCTCC TCGTCCTCTT 181 CCTCCTCCTC CTCCTGTTCC TCCTCCTCCT TCCTCCTCCC CCTCCTTCTC TTCCTCCGCC 241 CCCTCCTTCT CTTCTCTTCC TCCTCCCCCT CCTCCTCTTT CTCCTCTTTC TCCTCTTCCT 301 CCTCCTCCTC CTTCTCTCTC TCTCCCCACG CTTTCTATTT CTTCCCCTCT TTTTGTCTAG 361 TTTTGACCTG TGTTCCTCCA TATTTTGGAA GCATCTGCAG // LOCUS HUMCOL1A14 279 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 20. ACCESSION M26551 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 14 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 279) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 279) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 152 + 187 alpha-1 collagen type IV, exon 20 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" pre-msg < 1 > 279 alpha-1 collagen type IV mRNA and introns IVS < 1 151 alpha-1 intron R IVS 188 > 279 alpha-1 intron S BASE COUNT 68 A 69 C 52 G 90 T ORIGIN About 1.3 kb after segment 20; chromosome 13q34. 1 CCTCGGGTTG GTCTCCCGCT CTGTTTTCTA CGTTCCCTTT GGCCATGGAC ACTTACCAAT 61 GCACCAAGCA AGGCTTTCAG TAGAGAAATA AGAAGTGTAT ACAAACGGGT GAACTGTTCT 121 GTTTGATCCA ACACTCTCTT TCTCTTTTCA GGTTTCCCAG GACTACCAGG CCAACCCGGA 181 CCTCCAGGTG AGACTCCTTA ACTGATTTGT TATAGCATAC TTGCACACCT TCTGTGCTGT 241 TTATATTCTG AATATATAAT TTATCACGAA TGTTAGATA // LOCUS HUMCOL1A15 191 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 21. ACCESSION M26552 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 15 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 191) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 191) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 1 + 165 alpha-1 collagen type IV, exon 21 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" pre-msg < 1 > 191 alpha-1 collagen type IV mRNA and intron IVS 166 > 191 alpha-1 intron T BASE COUNT 37 A 56 C 58 G 40 T ORIGIN About 3.8 kb after segment 21; chromosome 13q34. 1 GCCTCCCTGT ACCTGGGCAG GCTGGTGCCC CTGGCTTCCC TGGTGAAAGA GGAGAAAAAG 61 GTGACCGAGG ATTTCCTGGT ACATCTCTGC CAGGACCAAG TGGAAGAGAT GGGCTCCCGG 121 GTCCTCCTGG TTCCCCCGGG CCCCCTGGGC AGCCTGGCTA CACAAGTGAG TTCCCTCAGA 181 AATGGTTTAA T // LOCUS HUMCOL1A16 508 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 22. ACCESSION M26553 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 16 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 508) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 508) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 405 + 500 alpha-1 collagen type IV, exon 22 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" pre-msg < 1 > 508 alpha-1 collagen type IV mRNA and introns IVS < 1 404 alpha-1 intron T IVS 501 > 508 alpha-1 intron U BASE COUNT 152 A 79 C 131 G 146 T ORIGIN About 1.3 kb after segment 22; chromosome 13q34. 1 TTTTCTAGTC TCAGCTTTGA CTAGATGCAT GACCCTGGAC AAATCACTTA CCTTTCTTGA 61 GCCTCAGTTT CTTCATTTGT AAAATAAGGG TGATGGTAGG CACTTCGCAA GATGAAATGA 121 AATAGTATAC AGGACAGGGC TGATGCACCT ACAGATACTT CAGAATTTGT GAAATAGAAA 181 TGTTGGCAGT GATGGCTTGG GATGGTGAGT GGGTGGTGGG TGGTGTGTGG TGATTATGTT 241 TTGCAAGTGG TGGATAGCCT AAGATGATTT TGAAACAAAC TAAAGCTGAA TTTCTAACAC 301 TTATCAAATA GCCGTAAAAT ATTTTGGTTT TATAATAGAG GTTGAGTTAG GAGAGGGTAA 361 AAACACCTAG CTATGATCCT ATAACTCTGA CAACTCAATT TCAGATGGAA TTGTGGAATG 421 TCAGCCCGGA CCTCCAGGTG ACCAGGGTCC TCCTGGAATT CCAGGGCAGC CAGGATTTAT 481 AGGCGAAATT GGAGAGAAAG GTAAGAAA // LOCUS HUMCOL1A17 233 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 23. ACCESSION M26554 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 17 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 233) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 233) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 74 + 157 alpha-1 collagen type IV, exon 23 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" pre-msg < 1 > 233 alpha-1 collagen type IV mRNA and introns IVS < 1 73 alpha-1 intron U IVS 158 > 233 alpha-1 intron V BASE COUNT 53 A 49 C 65 G 66 T ORIGIN About 1.5 kb after segment 23; chromosome 13q34. 1 CTGTTCTCTC AGTGCACTGT GGAGACTTCT GGTCACATGT TACACGTAAG CACCTTTGTG 61 TTTTCTTTTT TAGGTCAAAA AGGAGAGAGT TGCCTCATCT GTGATATAGA CGGATATCGG 121 GGGCCTCCCG GGCCACAGGG ACCCCCGGGA GAAATAGGTA AGACCCACAT GTGAAAAGGA 181 CCAGGTCAGA GTTTGCTTTG GTGTGTTGGC ATCTTTCCTG TGAAAGGGAT TTC // LOCUS HUMCOL1A18 370 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 24. ACCESSION M26555 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 18 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 370) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 370) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 279 + 349 alpha-1 collagen type IV, exon 24 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" pre-msg < 1 > 370 alpha-1 collagen type IV mRNA and introns IVS < 1 278 alpha-1 intron V IVS 350 > 370 alpha-1 intron W BASE COUNT 70 A 76 C 103 G 121 T ORIGIN About 5.5 kb after segment 24; chromosome 13q34. 1 GAATTCATGT TTTTCCTTTA AGTGGTCCCT GGGTGTGTGG GCAGGGGGGC AGGTGTGGGG 61 TGCTGGAAGC ATTCCTGCTT CCGGGTGTAC ATGTGCGTCG GGTCACCTAT TCCCTTTCTG 121 AGTCCGTCTT GGGCATTTTA GTTATTACTC TCTTTGGATT ATCCAAGCAA TCATCCACTT 181 CCTATACAAA TTTTTATGCT TTTGAGTCTT AAGAAAAGGT TTTGAGAAGT GTCTCATGAA 241 TGCTGTGTTG CTGTCTTTTT CCCTCTCTCT CACGAAAGGT TTCCCAGGGC AGCCAGGGGC 301 CAAGGGCGAC AGAGGTTTGC CTGGCAGAGA TGGTGTTGCA GGAGTGCCAG TAAGTAAACC 361 TGTCTGAGTT // LOCUS HUMCOL1A19 285 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 25. ACCESSION M26556 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 19 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 285) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 285) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 73 + 264 alpha-1 collagen type IV, exon 25 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" pre-msg < 1 > 285 alpha-1 collagen type IV mRNA and introns IVS < 1 72 alpha-1 intron W IVS 265 > 285 alpha-1 intron X BASE COUNT 56 A 72 C 88 G 69 T ORIGIN About 0.4 kb after segment 25; chromosome 13q34. 1 CTTGTGTTAC AGTCACACTG GTGTGCTTCA TGGATGGGTT CTTCTGTGGA TCACAAGGCT 61 TTGTTCTTAC AGGGCCCTCA AGGTACACCA GGGCTGATAG GCCAGCCAGG AGCCAAGGGG 121 GAGCCTGGTG AGTTTTATTT CGACTTGCGG CTCAAAGGTG ACAAAGGAGA CCCAGGCTTT 181 CCAGGACAGC CCGGCATGCC AGGGAGAGCG GGTTCTCCTG GAAGAGATGG CCATCCGGGT 241 CTTCCTGGCC CCAAGGGCTC GCCGGTATGT TTATCTCCAG ACTTT // LOCUS HUMCOL1A20 408 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 26. ACCESSION M26557 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 20 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 408) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 408) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 52 + 220 alpha-1 collagen type IV, exon 26 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" pre-msg < 1 > 408 alpha-1 collagen type IV mRNA and introns IVS < 1 51 alpha-1 intron X IVS 221 > 408 alpha-1 intron Y BASE COUNT 73 A 107 C 130 G 98 T ORIGIN About 3.2 kb after segment 26; chromosome 13q34. 1 GGGTGTTGGT GTGTCTCTGA TAACGGTGTC TTTTTCTTCT TCTCACATCA GGGTTCTGTA 61 GGATTGAAAG GAGAGCGTGG CCCCCCTGGA GGAGTTGGAT TCCCAGGCAG TCGTGGTGAC 121 ACCGGCCCCC CTGGGCCTCC AGGATATGGT CCTGCTGGTC CCATTGGTGA CAAAGGACAA 181 GCAGGCTTTC CTGGAGGCCC TGGATCCCCA GGCCTGCCAG GTGAGGCCTG AGAAACTCAT 241 GCAGCGTGAA GTTGTAAACG GAGGTTAGGG CAGGCACCTC GGGTGCACAC AGAGGCCTGG 301 GGCTCGCATA GGCCAGGGCG CACACTTGGC TTTGATCCTT CCAGGAATTG GGGCAGGACC 361 TGGGCAAGCC TTCAGCCTTT TGTGCCTCCC TTTCCTCATG TATGTGAT // LOCUS HUMCOL1A21 500 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exons 27 and 28. ACCESSION M26539 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 21 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 500) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 500) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 136 228 alpha-1 collagen type IV, exon 27 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" 315 + 419 alpha-1 collagen type IV, exon 28 pre-msg < 1 > 500 alpha-1 collagen type IV mRNA and introns IVS < 1 135 alpha-1 intron Y IVS 229 314 alpha-1 intron Z IVS 420 > 500 alpha-1 intron AA BASE COUNT 97 A 149 C 140 G 114 T ORIGIN About 1.4 kb after segment 28; chromosome 13q34. 1 AGGCAGAGGC TCTGCTCTCC GGGCACTCGT GGTGCTGGGT AGGATGTGGC GTGCAGCCCT 61 CCACTGCGTG TCCTCCCTGT GTTCCGTGTG GTCCTCATTC CTTCATCGTT TGTCTTTGCT 121 CTTTTTTTCC TCTAGGTCCA AAGGGTGAAC CAGGAAAAAT TGTTCCTTTA CCAGGCCCCC 181 CTGGAGCAGA AGGACTGCCG GGGTCCCCAG GCTTCCCAGG TCCCCAAGGT ACGATTGGAA 241 TTCCAGCTCA CCGGGATACC ACGAGAGTTT CCTCACGTTT TTCTCACGCT GACTGTTGGT 301 CTTTCTCACC ACAGGAGACC GAGGCTTTCC CGGAACCCCA GGAAGGCCAG GCCTGCCAGG 361 AGAGAAGGGC GCTGTGGGCC AGCCAGGCAT TGGATTTCCA GGGCCCCCCG GCCCCAAAGG 421 TAACCCTGCC AGACGGACCC GAGCCTTTGG TCCACTGTGA TTTGTGAGAA AAGAGGAGGT 481 GGTAGACATC AAAAACACCA // LOCUS HUMCOL1A22 151 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 29. ACCESSION M26558 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 22 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 151) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 151) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 1 + 98 alpha-1 collagen type IV, exon 29 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" pre-msg < 1 > 151 alpha-1 collagen type IV mRNA and intron IVS 99 > 151 alpha-1 intron AB BASE COUNT 27 A 44 C 44 G 36 T ORIGIN About 1.7 kb after segment 29; chromosome 13q34. 1 GTGTTGACGG CTTACCTGGA GACATGGGGC CACCGGGGAC TCCAGGTCGC CCGGGATTTA 61 ATGGCTTACC TGGGAACCCA GGTGTGCAGG GCCAGAAGGT GAGTGCCAAG CATTCCCTCC 121 TGACCCTTCT CCCCCATCCT TGTTATTAGT T // LOCUS HUMCOL1A23 318 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 30. ACCESSION M26559 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 23 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 318) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 318) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 152 + 302 alpha-1 collagen type IV, exon 30 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" pre-msg < 1 > 318 alpha-1 collagen type IV mRNA and introns IVS < 1 151 alpha-1 intron AB IVS 303 > 318 alpha-1 intron AC BASE COUNT 77 A 65 C 89 G 87 T ORIGIN About 0.2 kb after segment 30; chromosome 13q34. 1 AAGCTTTGCA AACGCTTGAA AAGGGTTGAG CAGATAATAT TCTTACTAAA TGCATTGGCA 61 GTATCTGTTA AGTCTTCACT TACGCTATTG GTTTGATAAA AATACTGACT TTGCAAATCA 121 ATTTGCTTCT CTGTGCTTTG GGGATTTTCA GGGAGAGCCT GGAGTTGGTC TACCGGGACT 181 CAAAGGTTTG CCAGGTCTTC CCGGCATTCC TGGCACACCC GGGGAGAAGG GGAGCATTGG 241 GGTACCAGGC GTTCCTGGAG AACATGGAGC GATCGGACCC CCTGGGCTTC AGGGGATCAG 301 AGGTAACTTC ATGCAGAT // LOCUS HUMCOL1A24 220 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 31. ACCESSION M26560 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 24 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 220) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 220) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 15 + 128 alpha-1 collagen type IV, exon 31 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" pre-msg < 1 > 220 alpha-1 collagen type IV mRNA and introns IVS < 1 14 alpha-1 intron AC IVS 129 > 220 alpha-1 intron AD BASE COUNT 44 A 55 C 69 G 52 T ORIGIN About 0.7 kb after segment 31; 13q34. 1 TTTTTCTTCT CCAGGTGAAC CGGGACCTCC TGGATTGCCA GGCTCCGTGG GGTCTCCAGG 61 AGTTCCAGGA ATAGGCCCCC CTGGAGCTAG GGGTCCCCCT GGAGGACAGG GACCACCGGG 121 GTTGTCAGGT GAGTGACATG CTTCTAGAAT TATCTGTTCC CAAAAGCATG CTTTTCTGAA 181 GCATGGGGGA TGAGGCCGGG GTCCCAATAA AGTGAAGCTT // LOCUS HUMCOL1A25 193 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 32. ACCESSION M26561 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 25 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 193) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 193) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 1 + 168 alpha-1 collagen type IV, exon 32 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" pre-msg < 1 > 193 alpha-1 collagen type IV mRNA and intron IVS 169 > 193 alpha-1 intron AE BASE COUNT 40 A 50 C 61 G 42 T ORIGIN About 0.4 kb after segment 32; chromosome 13q34. 1 GCCCTCCTGG AATAAAAGGA GAGAAGGGTT TCCCCGGATT CCCTGGACTG GACATGCCGG 61 GCCCTAAAGG AGATAAAGGG GCTCAAGGAC TCCCTGGCAT AACGGGACAG TCGGGGCTCC 121 CTGGCCTTCC TGGACAGCAG GGGGCTCCTG GGATTCCTGG GTTTCCAGGT AAGTGATTTT 181 TGAACTTCTG CCT // LOCUS HUMCOL1A26 194 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 33. ACCESSION M26562 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 26 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 194) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 194) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 17 + 106 alpha-1 collagen type IV, exon 33 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" pre-msg < 1 > 194 alpha-1 collagen type IV mRNA and introns IVS < 1 16 alpha-1 intron AE IVS 107 > 194 alpha-1 intron AF BASE COUNT 45 A 43 C 57 G 49 T ORIGIN About 1.1 kb after segment 33; chromosome 13q334. 1 TACGTTTTGG CCACAGGTTC CAAGGGAGAA ATGGGCGTCA TGGGGACCCC CGGGCAGCCG 61 GGCTCACCAG GACCAGTGGG TGCTCCTGGA TTACCGGGTG AAAAAGGTAG GGGAGCTGAT 121 AACCCTAGAA GCTCATTTTG TGTCTATTTC TCGATGCCAC TTGGGAAATT CTCAAGTTGT 181 CATAGATCAT ATTT // LOCUS HUMCOL1A27 843 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exons 34, 35 and 36. ACCESSION M26536 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 27 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 843) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 843) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 100 252 alpha-1 collagen type IV, exon 34 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" 414 512 alpha-1 collagen type IV, exon 35 625 + 714 alpha-1 collagen type IV, exon 36 pre-msg < 1 > 843 alpha-1 collagen type IV mRNA and introns IVS < 1 99 alpha-1 intron AF IVS 257 413 alpha-1 intron AG IVS 513 624 alpha-1 intron AH IVS 715 > 843 alpha-1 intron AI BASE COUNT 196 A 206 C 247 G 194 T ORIGIN About 1.1 kb after segment 36; chromosome 13q34. 1 GAATTCATGC TGCTGTCCCT GTCAACAAGA ACGTGAGCCT TTTGAGCACC TCTGTGACAC 61 TGGCCCCAGG CTCACCAGTG CCTTCTCTTT CCTCTGCAGG GGACCATGGC TTTCCGGGCT 121 CCTCAGGACC CAGGGGAGAC CCTGGCTTGA AAGGTGATAA GGGGGATGTC GGTCTCCCTG 181 GCAAGCCTGG CTCCATGGAT AAGGTGGACA TGGGCAGCAT GAAGGGCCAG AAAGGAGACC 241 AAGGAGAGAA AGGCAAGTCC AGGAGGCCTT CTCATGGCCA AGCTCGGAAC CACAATGTGC 301 CTTTCCTGGG TTATCGGGTC CTCCATAAAT AATAACCAAA CCAACCTGCG TTTCTGCAGC 361 AACTTGCTTG CAAGCAAATG CGGGTCATTG CATTTTTCTT CTCTGATGTG TAGGACAAAT 421 TGGACCAATT GGTGAGAAGG GATCCCGAGG AGACCCTGGG ACCCCAGGAG TGCCTGGAAA 481 GGACGGGCAG GCAGGACAGC CTGGGCAGCC AGGTACAGTG TGGAGCTCCG GTCCCCAGTG 541 TGGCAAGGGC TCAGGATGAG GCCTGTGTTC CCCAGCTCTT TCTGTAGATG TCAACCTGTC 601 AGACTGCTTT CTTTATGGTT CTAGGACCTA AAGGTGATCC AGGTATAAGT GGAACCCCAG 661 GTGCTCCAGG ACTTCCGGGA CCAAAAGGAT CTGTTGGTGG AATGGGCTTG CCAGGTAAAC 721 TGGATTTAGA AGAGGATTCT TTAGCATGTG TGCGTGTGAT ACTGGTAGGG CATGTCTGCA 781 GAATGAGATC CTCTTCAGCT GAGACGGACA GAATCCAGGC ACCTGTGTCT TTCACATGAA 841 TTC // LOCUS HUMCOL1A28 448 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 37. ACCESSION M26563 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 28 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 448) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 448) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 226 + 365 alpha-1 collagen type IV, exon 37 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" pre-msg < 1 > 448 alpha-1 collagen type IV mRNA and introns IVS < 1 225 alpha-1 intron AI IVS 366 > 448 alpha-1 intron AJ BASE COUNT 118 A 120 C 123 G 87 T ORIGIN About 0.3 kb after segment 37; chromosome 13q34. 1 CCTGCATCCC TTGCCCACAA CGTTCACCTG CTTACCTGAG ACAAAGTGCA AAAGGAGAGA 61 AAGGGCAGGC AGGCCCACCT GGCATAGGCA TCCCAGGACT TGCGTGGTGA AAAGGTAACC 121 AGCGCTCTGC GTGGAAAGGT CCCCTTCTCC CACTCAGTGG AGATGCTGCC TGAGAAATCT 181 CCTAGCTAAG CCCAAATCAA GTTTCAATTT GGTTTGTGTT TATAGGAACA CCTGGAGAGA 241 AAGGTGTGCC TGGCATCCCT GGCCCACAAG GTTCACCTGG CTTACCTGGA GACAAAGGTG 301 CAAAAGGAGA GAAAGGGCAG GCAGGCCCAC CTGGCATAGG CATCCCAGGA CTGCGTGGTG 361 AAAAGGTAAC CAGCGCTCTG GCTGGAAAGG TCCCCTTCTC CCAGCTCAGT GGAGATGCTG 421 CCTGAGAAAT GCTCAGCAAA TATCTCAG // LOCUS HUMCOL1A29 763 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exons 38 and 39. ACCESSION M26541 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 29 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 763) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 763) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 278 404 alpha-1 collagen type IV, exon 38 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" 503 + 583 alpha-1, exon 39 pre-msg < 1 > 763 alpha-1 collagen type IV mRNA and introns IVS < 1 277 alpha-1 intron AJ IVS 405 502 alpha-1 intron AK IVS 584 > 763 alpha-1 intron AL BASE COUNT 208 A 149 C 225 G 181 T ORIGIN About 0.6 kb after segment 39; chromosome 13q34. 1 TGTCAAGGCT AAGGTGGGAG ACAGCTGTCA GTGGGATAAA ACAGAGAGGA TGGAGGGCAA 61 GAGGAATGGC GGGTAGGAAA CCAGATGGTT GGCTCTACTG CTCTTAAGTT TTCCTCTCCA 121 GGTCAGAAGT GGACTTTGAT TCTTGATGAG AAATCTGGAA AGGAGTTTTG CAATGCAATA 181 TTCATTGCAC CAAAATGTAT ACCATTATCT ATAACATGCC TACAGATGCA TGCCCATGGA 241 CAAATGTAAA GTTAATATGC CTGACTTTTA CTTGCAGGGA GATCAAGGGA TAGCGGGTTT 301 CCCAGGAAGC CCTGGAGAGA AGGGAGAAAA AGGAAGCATT GGGATCCCAG GAATGCCAGG 361 GTCCCCAGGC CTTAAAGGGT CTCCCGGGAG TGTTGGCTAT CCAGGTAGGT GAAGGGGGGC 421 TTCTCTTGGA TTTGGGGAAT CTAAGAAATT CAAAGAAAAG CCCAAACTTA TGGATGTTCT 481 GGTGTTTTGT TTTTTTCTTT AGGAAGTCCT GGGCTACCTG GAGAAAAAGG TGACAAAGGC 541 CTCCCAGGAT TGGATGGCAT CCCTGGTGTC AAAGGAGAAG CAGGTAGAGG GCCCTCTTTG 601 GGACACACCC GGGCACATAG AGAACAGTGG CCAGGGGAAT GGGTAGGCCC CAAGTGAAGT 661 TCCATCACAG ACGGGGGTAG CAGGGGTAGG CCTAACTGTC CCTGTCTCCC ATCTTCAGAT 721 GGCCAAACTG ATCTAGTCAG GTTACAGATC TTCTCTCAGG TCT // LOCUS HUMCOL1A30 288 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 40. ACCESSION M26564 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 30 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 288) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 288) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 1 + 99 alpha-1 collagen type IV, exon 40 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" pre-msg < 1 > 288 alpha-1 collagen type IV mRNA and intron IVS 100 > 288 alpha-1 intron AM BASE COUNT 74 A 72 C 93 G 49 T ORIGIN About 1.5 kb after segment 40; chromosome 13q34. 1 GTCTTCCTGG GACTCCTGGC CCCACAGGCC CAGCTGGCCA GAAAGGGGAG CCAGGCAGTG 61 ATGGAATCCC GGGGTCAGCA GGAGAGAAGG GTGAACCAGG TATGGCCCAA CGCCCCATTC 121 CCTATCAGAA TCAGCAGTGT CCACTCCCAG AGACCTGCAG ACTGCCTGTG TTCAGTGAAT 181 AAAGCTGGGC ACAGAATGGC CCCTAGACTG CACATCCCTG CAACTACTTA AACACTGATG 241 TGGATGTGTG GATAGAGTGG GATGAGTGGA GGGGAAGAAG GGAAGGGG // LOCUS HUMCOL1A31 197 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 41. ACCESSION M26565 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 31 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 197) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 197) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 1 + 51 alpha-1 collagen type IV, exon 41 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" pre-msg < 1 > 197 alpha-1 collagen type IV mRNA and introns IVS 52 > 197 alpha-1 intron AN BASE COUNT 47 A 50 C 59 G 41 T ORIGIN About 2.0 kb after segment 41; chromosome 13q34. 1 GTCTACCAGG AAGAGGATTC CCAGGGTTTC CAGGGGCCAA AGGAGACAAA GGTAATCTTT 61 GCTCACTGTG CTCTTCCTTT TCAACCAACC GCTGCTGCCT GATTAGCAGA ACAAGGGGCA 121 GTGTCTTCAG GATGAAGCCC CATCCCTGTT GGCCAGGCAG CAACGGGATG GAGGATGGGT 181 GCTGTGCCCA GACGAGA // LOCUS HUMCOL1A32 283 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 42. ACCESSION M26566 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 32 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 283) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 283) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 48 + 233 alpha-1 collagen type IV, exon 42 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" pre-msg < 1 > 283 alpha-1 collagen type IV mRNA and introns IVS < 1 47 alpha-1 intron AN IVS 234 > 283 alpha-1 intron AO BASE COUNT 57 A 80 C 94 G 52 T ORIGIN About 0.8 kb after segment 42; chromosome 13q34. 1 TTCTGCTGCT GCGTTTCCCA TCACCTCAAA CTGTTTTTCT TGTTCAGGTT CAAAGGGTGA 61 GGTGGGTTTC CCAGGATTAG CCGGGAGCCC AGGAATTCCT GGATCCAAAG GAGAGCAAGG 121 ATTCATGGGT CCTCCGGGGC CCCAGGGACA GCCGGGGTTA CCGGGATCCC CAGGCCATGC 181 CACGGAGGGG CCCAAAGGAG ACCGCGGACC TCAGGGCCAG CCTGGCCTGC CAGGTAAGGG 241 CATCTCGCCA GGGGCCAGGG CTGAGGACTG GGACAGAATT CAC // LOCUS HUMCOL1A33 346 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 43. ACCESSION M26567 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 33 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 346) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 346) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 213 + 346 alpha-1 collagen type IV, exon 43 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" pre-msg < 1 > 346 alpha-1 collagen type IV mRNA and intron IVS < 1 212 alpha-1 intron AO BASE COUNT 57 A 108 C 87 G 94 T ORIGIN About 2.67 kb after segment 43; chromosome 13q34. 1 TTTCTATTCC CTCCCTCCCT CCCTTCCTCC CTTTCCTCCC TCCCTCCTTC CTTCCTTCCT 61 GGACCTGCCT CGATTTCTGT CTCAAGGGTT GTCACTGTTG GTCCAGTGTT GTATCAGTGA 121 GGGGCCGTGG GTGGCAGTAT TGATAGCCCC ACAGCCCAGG GACACTTGTG CATATTTAAC 181 CCCAAATGTG TGGCTTTCTT CTCTTCCATT AGGACTTCCG GGACCCATGG GGCCTCCAGG 241 GCTTCCTGGG ATTGATGGAG TTAAAGGTGA CAAAGGAAAT CCAGGCTGGC CAGGAGCACC 301 CGGTGTCCCA GGGCCCAAGG GAGACCCTGG ATTCCAGGGC ATGCCT // LOCUS HUMCOL1A34 334 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 44. ACCESSION M26568 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 34 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 334) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 334) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 262 + 334 alpha-1 collagen type IV, exon 44 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" pre-msg < 1 > 334 alpha-1 collagen type IV mRNA and intron IVS < 1 261 alpha-1 intron AO BASE COUNT 84 A 78 C 81 G 91 T ORIGIN About 0.96 kb after segment 44; chromosome 13q34. 1 CCTCAGGCGC TTCCCCTGTG GATGGACAAG CAGAGGAGAT ACATGGGAGG GAAACAAATG 61 CTTCTACTTT CCAGCTTCCC GATTGAGGCA CATGCAAAGG TGTTGTTATT CTGTCATTTA 121 AGAAACCACA AGGCACCATT TGTTCACAAA TGTGCTTTCC AGAACATTGA GGGGCTGTTT 181 CCATTTTCCA ACTAGCAGGT TACTGGAGAG ACTTAACCTA CATATCCTGA GCAATTCCCA 241 ACCTCCTCTT TGTGTGTCTA GGGTATTGGT GGCTCTCCAG GAATCACAGG CTCTAAGGGT 301 GATATGGGGC CTCCAGGAGT TCCAGGATTT CAAG // LOCUS HUMCOL1A35 233 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 45. ACCESSION M26569 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 35 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 233) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 233) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 39 + 110 alpha-1 collagen type IV, exon 45 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" pre-msg < 1 > 233 alpha-1 collagen type IV mRNA and introns IVS < 1 38 alpha-1 intron AP IVS 111 > 233 alpha-1 intron AQ BASE COUNT 66 A 40 C 61 G 66 T ORIGIN About 1.39 kb after segment 45; chromosome 13q34. 1 GAATTCTGTG CAAGACAGTG CTTTCCCTCT GCTTTTAGGT CCAAAAGGTC TTCCTGGCCT 61 CCAGGGAATT AAAGGTGATC AAGGCGATCA AGGCGTCCCG GGAGCTAAAG GTAGGAGAGT 121 TTGTTGATCT GTGGAACCCT TACTGGTGCT TTGTGAAAAT GTAAAAGCCA GGAATGCACA 181 GAATTGGGGT GTTTGGTTTT TCATATGTGA AATGACTCAA AAATCATTAA AAA // LOCUS HUMCOL1A36 213 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 46. ACCESSION M26570 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 36 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 213) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 213) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 1 + 129 alpha-1 collagen type IV, exon 46 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" pre-msg < 1 > 213 alpha-1 collagen type IV mRNA and intron IVS 130 > 213 alpha-1 intron AR BASE COUNT 43 A 62 C 70 G 38 T ORIGIN About 1.5 kb after segment 46; chromosome 13q34. 1 GTCTCCCGGG TCCTCCTGGC CCCCCAGGTC CTTACGACAT CATCAAAGGG GAGCCCGGGC 61 TCCCTGGTCC TGAGGGCCCC CCAGGGCTGA AAGGGCTTCA GGGACTGCCA GGCCCGAAAG 121 GCCAGCAAGG TGAGAAGGCT TGGCTGTGCA GGGGGTATGG GGAGCCCAGA GGAGTGGTCA 181 GAGTTCTCCT ACCCATCTGA TCTAAAATAA GTT // LOCUS HUMCOL1A38 313 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 48. ACCESSION M26572 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 38 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 313) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 313) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 101 + 313 alpha-1 collagen type IV, exon 48 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" pre-msg < 1 > 313 alpha-1 collagen type IV mRNA and intron IVS < 1 100 alpha-1 intron AS BASE COUNT 83 A 80 C 69 G 81 T ORIGIN About 0.96 kb after segment 48; chromosome 13q34. 1 TGGAAATATA AAATATTAAA AACTCTTTGG ACAAATCATT TTAAAGCCTT TAGGGACCTG 61 TTTGTAATTA GGGAAACCTA ATTTCTCTCC TATCTTCCAG GTCCAAGAGG ATTTCCAGGT 121 CCACCAGGCC CCGATGGGTT GCCAGGATCC ATGGGGCCCC CAGGCACCCC ATCTGTTGAT 181 CACGGCTTCC TTGTGACCAG GCATAGTCAA ACAATAGATG ACCCACAGTG TCCTTCTGGG 241 ACCAAAATTC TTTACCACGG GTACTCTTTG CTCTACGTGC AAGGCAATGA ACGGGCCCAT 301 GGACAGGACT TGG // LOCUS HUMCOL1A39 791 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 49. ACCESSION M26573 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 39 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 791) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 791) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 212 + 389 alpha-1 collagen type IV, exon 49 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" pre-msg < 1 > 791 alpha-1 collagen type IV mRNA and introns IVS < 1 211 alpha-1 intron AT IVS 390 > 791 alpha-1 intron AU BASE COUNT 214 A 163 C 177 G 237 T ORIGIN About 13 kb after segment 49; chromosome 13q34. 1 TTCAGCTTAG GAAAAGGTTT CTTTGTGCTG GTAAACTGGT TTTGTTTCCT TTCATCAAAA 61 ATGTCCTACA TAATCCTTTA TACATTTCCA TTTCAGTGAT TATGTTGATT ATATGTTTTT 121 GCTTGTTGTC CCAGCTGAAA TCCACTTTAC ATAGAGATAG ATTGTGAATT ATAAAGGAAA 181 CATTCTCACA ATTGTCTTCT TCCTTGTCTA GGCACGGCCG GCAGCTGCCT GCGCAAGTTC 241 AGCACAATGC CCTTCCTGTT CTGCAATATT AACAACGTGT GCAACTTTGC ATCACGAAAT 301 GACTACTCGT ACTGGCTGTC CACCCCTGAG CCCATGCCCA TGTCAATGGC ACCCATCACG 361 GGGGAAAACA TAAGACCATT TATTAGTAGG TGAGTCGAGC ATCTGTAGGA ACAAATGCAA 421 AATTGAAGAG GAAAAGTCTC TAGTCTGGAC AAGCATATGT TTTCTATTTT TCTTCCAAAA 481 CGTTGAATCC ATTGCCATTT AGCTTATTAT TTGTGATAAT TCCGGAGTTT TGTAATGGAA 541 ATCTACCATA AAAATTATTT GACATAAAAG TCAGTTGGCT GGGCACAGTG GCTCATGCCG 601 TAATCCCAAC ACTTTGGGAA GCCGAGGCGA GAGGGTTGCT TGAGCTCAGG AGTTCAAGAC 661 CAGCCTGGGC AAGATGACGA GACTTCATCT CCACAAAAAT ACAAAAGTTA GCTGGGCGTG 721 GTGGTGTTGA GCCTTGTGGT TCCCAGCTAC GAGGGAGGCT GAGGTGGGAG ATCCATTAAG 781 CCTGGGATGT T // LOCUS HUMCOL1A40 313 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 50. ACCESSION M26574 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 40 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 313) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 313) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 157 + 271 alpha-1 collagen type IV, exon 50 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" pre-msg < 1 > 313 alpha-1 collagen type IV mRNA and introns IVS < 1 156 alpha-1 AU IVS 272 > 313 alpha-1 AV BASE COUNT 61 A 80 C 96 G 76 T ORIGIN About ? kb after segment 38. 1 CCTTAAAGGC CAAATGGGGC AACAATGTTT TCTACTTAAT ATGTAATTGC ACCAGAAAGA 61 CTGAGTGAGG ATGTTGAGTG TAGGTAACGG GGCAGTGCAT GAGGGCGCCT CTGCCCTGCA 121 CCCCGGCTGT GCTGAGTGTC TCTGCTCCAC TTCCAGGTGT GCTGTGTGTG AGGCGCCTGC 181 CATGGTGATG GCCGTGCACA GCCAGACCAT TCAGATCCCA CCGTGCCCCA GCGGGTGGTC 241 CTCGCTGTGG ATCGGCTACT CTTTTGTGAT GGTAAGTGTC TGGGGAGAAG CCATATTTCC 301 CGGGAAGAGC CCC // LOCUS HUMCOL1A41 437 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 51. ACCESSION M26575 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 41 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 437) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 437) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 154 + 326 alpha-1 collagen type IV, exon 51 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" pre-msg < 1 > 437 alpha-1 collagen type IV mRNA and introns IVS < 1 153 alpha-1 AV IVS 327 > 437 alpha-1 AW BASE COUNT 102 A 113 C 109 G 113 T ORIGIN About 1.9 kb after segment 51; chromosome 13q34. 1 ATTAGGACCC AGGCTTTATA CTTTATGCTT CCGGCTCCGT TAATTGTTGT GTGGAATTAT 61 GAGCGGATAA CCAATTTCAC ACAGGAAAAC GCTATGACCA TGATTACGAA TTCGAGCTCC 121 GGTACCCGGG GATCCTCTAC AGTCGACCTG CAGCACACCA GCGCTGGTGC AGAAGGCTCT 181 GGCCAAGCCC TGGCGTCCCC CGGCTCCTGC CTGGAGGAGT TTAGAAGTGC GCCATTCATC 241 GAGTGTCACG GCCGTGGGAC CTGCAATTAC TACGCAAACG CTTACAGCTT TTGGCTCGCC 301 ACCATAGAGA GGAGCGAGAT GTTCAAGTAA GTGGGAGCAC TGCTTTTTTG CAGGCTGCTG 361 GCCCCTTTGT GACTTTATTT TTTAATCATC CGCGATCCTA ACACTTCCAA TGCAAAGTGT 421 CAAAGATGGT TAGCAAT // LOCUS HUMCOL1A42 1409 bp ds-DNA PRI 15-DEC-1989 DEFINITION Human alpha-1 collagen type IV gene, exon 52. ACCESSION M26576 J05039 KEYWORDS collagen; fibronectin; proteoglycan. SEGMENT 42 of 42 SOURCE Human DNA, (libraries of E.Fritsch, R.Poulsom, and Clontech), clones F[16,21,22] and E5. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 1409) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha-1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264, 13565-13571 (1989) STANDARD full staff_entry REFERENCE 2 (bases 1 to 1409) AUTHORS Tryggvason,K. JOURNAL Unpublished (1989) see COMMENT for author address. STANDARD full staff_entry COMMENT Draft entry and computer-readable sequence for [1] kindly provided by K.Tryggvason, 27-JUL-1989. K.Tryggvason, University of Oulu, Department of Biochemistry, University of Oulu 90570 Oulu, Finland FEATURES from to/span description pept + 19 100 alpha-1 collagen type IV, exon 52 /nomgen="COL4A1" /map="13q34" /hgml_locus_uid="LZO163F" matp + 19 97 alpha-1 collagen type IV pre-msg < 1 1404 alpha-1 collagen type IV mRNA and intron IVS < 1 18 alpha-1 intron AW BASE COUNT 433 A 277 C 245 G 454 T ORIGIN Chromosome 13q34. 1 CCTGTGTCGT GTTTTTAGGA AGCCTACGCC GTCCACCTTG AAGGCAGGGG AGCTGCGCAC 61 GCACGTCAGC CGCTGCCAAG TCTGTATGAG AAGAACATAA TGAAGCCTGA CTCAGCTAAT 121 GTCACAACAT GGTGCTACTT CTTCTTCTTT TTGTTAACAG CAACGAACCC TAGAAATATA 181 TCCTGTGTAC CTCACTGTCC AATATGAAAA CCGTAAAGTG CCTTATAGGA ATTTGCGTAA 241 CTAACACACC CTGCTTCATT GACCTCTACT TGCTGAAGGA GAAAAAGACA GCGATAAGCT 301 TCAATAGTGG CATACCAAAT GGCACTTTTG ATGAAATAAA ATATCAATAT TTTCTGCAAT 361 CCAATGCACT GATGTGTGAA GTGAGAACTC CATCAGAAAA CCAAAGGGTG CTAGGAGGTG 421 TGGGTGCCTT CCATACTGTT TGCCCATTTT CATTCTTGTA TTATAATTAA TTTTCTACCC 481 CCAGAGATAA ATGTTTGTTT ATATCACTGT CTAGCTGTTT CAAAATTTAG GTCCCTTGGT 541 CTGTACAAAT AATAGCAATG TAAAAATGGT TTTTTGAACC TCCAAATGGA ATTACAGACT 601 CAGTAGCCAT ATCTTCCAAC CCCCCAGTAT AAATTTCTGT CTTTCTGCTA TGTGTGGTAC 661 TTTGCAGCTG CTTTTGCAGA AATCACAATT TTCCTGTGGA ATAAAGATGG TCCAAAAATA 721 GTCAAAAATT AAATATATAT ATATATTAGT AATTTATATA GATGTCAGCA ATTAGGCAGA 781 TCAAGGTTTA GTTTAACTTC CACTGTTAAA ATAAAGCTTA CATAGTTTTC TTCCTTTGAA 841 AGACTGTGCT GTCCTTTAAC ATAGGTTTTT AAAGACTAGG ATATTGAATG TGAAACATCC 901 GTTTTCATTG TTCACTTCTA AACCAAAAAT TATGTGTTGC CAAAACCAAA CCCAGGTTCA 961 TGAATATGGT GTCTATTATA GTGAAACATG TACTTTGAGC TTATTGTTTT TATTCTGTAT 1021 TAAATATTTT CAGGGTTTTA AACACTAATC ACAAACTGAA TGACTTGACT TCAAAAGCAA 1081 CAACCTTAAA GGCCGTCATT TCATTAGTAT TCCTCATTCT GCATCCTGGC TTGAAAAACA 1141 GCTCTGTTGA ATCACAGTAT CAGTATTTTC ACACGTAAGC ACATTCGGGC CATTTCCGTG 1201 GTTTCTCATG AGCTGTGTTC ACAGACCTCA GCAGGGCATC GCATGGACCG CAGGAGGGCA 1261 GATTCGGACC ACTAGGCCTG AAATGACATT TCACTAAAAG TCTCCAAAAC ATTTCTAAGA 1321 CTACTAAGGC CTTTTATGTA ATTTCTTTAA ATGTGTATTT CTTAAGAATT CAAATTTGTA 1381 ATAAAACTAT TTGTATAAAA ATTAAGCTT // LOCUS HUMCOL1AA 582 bp ds-DNA PRI 15-JUN-1989 DEFINITION Human procollagen type 1 alpha-2 gene, exons 27,28 and 29. ACCESSION M21353 KEYWORDS alpha-2 type 1 collagen; collagen; collagen type 1. SOURCE Human skin fibroblast DNA, clone NJ-3. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 582) AUTHORS Tromp,G. and Prockop,D.J. TITLE Single base mutation in the pro-alpha-2(I) collagen gene that causes efficient splicing of RNA from exon 27 to exon 29 and synthesis of a shortened but in-frame pro-alpha-2(I) chain JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85, 5254-5258 (1988) STANDARD simple staff_review COMMENT Mutation at bp 233 (a to g) causes shortened pro-alpha 2 (I) chains that lack most or all of the 18 amino acids encoded by exon 28; splicing occurs from the last codon of exon 27 to the first codon of exon 28. FEATURES from to/span description pept / 31 84 collagen type 1 alpha-2 precursor, exon 27 (AA at 31) /nomgen="COL1A2" /map="7q21.3-q22.1" /hgml_locus_uid="LP0002V" 235 288 collagen type 1 alpha-2 precursor, exon 28 503 / 556 collagen type 1 alpha-2 precursor, exon 29 pre-msg < 1 > 582 collagen type 1 mRNA and introns IVS < 1 30 collagen type 1 intron 26 IVS 85 234 collagen type 1 intron 27 IVS 289 502 collagen type 1 intron 28 IVS 557 > 582 collagen type 1 intron 29 mut 233 233 a in normal; g in mutant BASE COUNT 152 A 141 C 118 G 171 T ORIGIN Chromosome 7q21.3-q22.1. 1 ACCATCAGCC TTTCTGTTAA ATATTTTTAG GGTGCTCCAG GTCCTGATGG AAACAATGGT 61 GCTCAGGGAC CTCCTGGACC ACAGGTGAGT ATTTCTCCCA CTCTTGTGCT CTTCTGCACT 121 AGAATGTATA TAGTCCTCAA ACTGGCCATC TCCATTTTCA GTCCAAAAGT TATACAGCTA 181 GACAACAGTG GTGACATACG TTGCTATTTA TGCTCTCTTT CCTGTCACTT TCAGGGTGTT 241 CAAGGTGGAA AAGGTGAACA GGGTCCCGCT GGTCCTCCAG GCTTCCAGGT AAGTCAACTC 301 AAGCATATAC AATACTGCCT TTGGTCAGCC TATTGAGCTG TAAATCACCA TACCGTACCT 361 CTCTTCTCCA CCACAATAAT GCTTAATAAC ATACAATCGT GCTCATGTTG ATATTTGGTA 421 GCCACCACCC CCAAACTCAA TTATTAGCAA ATCTCCTGAA CGTAGCCATG GGATTGAGAT 481 TTGTATTTCT TTTCATTTTT AGGGTCTGCC TGGCCCCTCA GGTCCCGCTG GTGAAGTTGG 541 CAAACCAGGA GAAAGGGTGA GTAAAACAAG TAATAGTAAG TA // LOCUS HUMCOLA01 74 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human alpha-1 collagen type II gene, exon 1. ACCESSION M24938 M23757 KEYWORDS collagen. SEGMENT 1 of 7 SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 74) AUTHORS Sangiorgi,F.O., Benson-Chanda,V., de Wet,W.J., Sobel,M.E., Tsipouras,P. and Ramirez,F. TITLE Isolation and partial characterization of the entire human pro-Alpha-1 (II) collagen gene JOURNAL Nucleic Acids Res. 13, 2207-2225 (1985) STANDARD simple staff_entry FEATURES from to/span description pept / 1 + 68 alpha-1 collagen type II, exon 1 (AA at 1) /nomgen="COL2A1" /map="12q14.3" /hgml_locus_uid="LX0121B" IVS 69 > 74 COL1A2 intron A BASE COUNT 7 A 24 C 24 G 19 T ORIGIN Chromosome 12q14.3. 1 TCCCCAGTCG CTGGTGCTGC TGACGCTGCT CGTCGCCGCT GTCCTTCGGT GTCAGGGCCA 61 GGATGTCCGT AAGT // LOCUS HUMCOLA02 260 bp ds-DNA PRI 15-SEP-1989 DEFINITION Human alpha-1 collagen type II gene, exons 2 and 3. ACCESSION M23759 KEYWORDS collagen. SEGMENT 2 of 7 SOURCE Human DNA. ORGANISM Homo sapiens Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia; Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae. REFERENCE 1 (bases 1 to 260) AUTHORS Sangiorgi,F.O., Benson-Chanda,V., de Wet,W.J., Sobel,M.E., Tsipouras,P. and Ramirez,F. TITLE Isolation and partial characterization of the entire human pro-Alpha-1 (II) collagen gene JOURNAL Nucleic Acids Res. 13, 2207-2225 (1985) STANDARD simple staff_entry FEATURES from to/span description pept + 3 19 alpha-1 collagen type II, exon 2 (AA at 5) /nomgen="COL2A1" /map="12q14.3" /hgml_locus_uid="LX0121B" 222 / 254 alpha-1 collagen type II, exon 3 IVS < 1 2 COL1A2 intron A IVS 20 221 COL1A2 intron B IVS 255 > 260 COL1A2 intron C BASE COUNT 51 A 43 C 77 G 85 T 4 others ORIGIN Undetermined number of bp after segment 1; chromosome 12q14.3. 1 AGGGCAACCA GGACCAAAGG TAAGGGCTTT CTTCTTTTTC TTTTTTCGTG TTTTTTTGGC 61 TTTGTGTTTC GCTCGGG