LOCUS MMCD14 2404 bp DNA ROD 21-AUG-1997 DEFINITION Mouse CD14 gene. ACCESSION X13987 NID g50336 KEYWORDS CD14 gene. SOURCE house mouse. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 2404) AUTHORS Yamamoto,S. TITLE Direct Submission JOURNAL Submitted (11-JAN-1989) Yamamoto S., Dept of Pathology, Medical College of Oita, Idaigaoka 1-1506, hazamamachi, Oita Gun, Oita, Japan REFERENCE 2 (bases 1 to 2404) AUTHORS Matsuura,K., Setoguchi,M., Nasu,N., Higuchi,Y., Yoshida,S., Akizuki,S. and Yamamoto,S. TITLE Nucleotide and amino acid sequences of the mouse CD14 gene JOURNAL Nucleic Acids Res. 17 (5), 2132 (1989) MEDLINE 89183627 COMMENT Data kindly reviewed (19-Apr-1989) by Yamamoto S. FEATURES Location/Qualifiers source 1..2404 /organism="Mus musculus" /strain="Balb/c" /db_xref="taxon:10090" /clone="MS7GEN1" /haplotype="H-2 d" /tissue_type="liver" /clone_lib="mouse liver genomic" exon 624..739 /number=1 gene 737..1932 /gene="CD14" CDS join(737..739,835..1932) /gene="CD14" /codon_start=1 /db_xref="PID:e334829" /db_xref="PID:g2342527" /translation="MERVLGLLLLLLVHASPAPPEPCELDEESCSCNFSDPKPDWSSA FNCLGAADVELYGGGRSLEYLLKRVDTEADLGQFTDIIKSLSLKRLTVRAARIPSRIL FGALRVLGISGLQELTLENLEVTGTAPPPLLEATGPDLNILNLRNVSWATRDAWLAEL QQWLKPGLKVLSIAQAHSLNFSCEQVRVFPALSTLDLSDNPELGERGLISALCPLKFP TLQVLALRNAGMETPSGVCSALAAARVQLQGLDLSHNSLRDAAGAPSCDWPSQLNSLN LSFTGLKQVPKGLPAKLSVLDLSYNRLDRNPSPDELPQVGNLSLKGNPFLDSESHSEK FNSGVVTAGAPSSQAVALSGTLALLLGDRLFV" sig_peptide join(737..739,835..876) /gene="CD14" intron 740..834 /gene="CD14" /number=1 exon 835..>1932 /gene="CD14" /number=2 mat_peptide 877..1929 /gene="CD14" BASE COUNT 601 a 588 c 609 g 606 t ORIGIN 1 cctagcattt gggaggcaga ggcaggagga aaatcatgcg tttcaggcta ggctagattg 61 ggttactaga ctgagatatc atggggagaa tggagaggta gagagtggga gaagaatgaa 121 ttaataaaga actgaataag atgggaagaa gggagaatta tttttcatat taactctcaa 181 ctttgagctt tattctctgc ctggaatcta tagataagtt cacaatcttt ccacaaatgt 241 ccaattacat tcaaagaaaa tcaagagctg gatttgaacg gtgggaaatt gctagcaact 301 aagactaggg gaaatggagg tgaatcaatg ggactgagca acagaataat gatctaaggc 361 actaggtgtg attcactctt ttcctgtacg caccagacaa gtccggggct cataggtcat 421 cctcctggca cagaatgccc taatgccact ctgaattctt cctgtttttc gtccctccct 481 aaaaaacact tccttgcaat atttactaga agtgagtagg gctgttagga ggaagagaag 541 tggagacgca attagaattc acagaggaag ggacagggtg acaccccagg attacataaa 601 tttacagggg ctgccgaatt ggtcgaacaa gcccgtggaa cctggaagcc agagaacacc 661 atcgctgtaa aggaaagaaa ctgaagcttt tctcggagcc tatctgggct gctcaaactt 721 tcagaatcta ccgaccatgg tgagtcagac agactgtctt ggggtggaac tggagccaac 781 ctgaggaatc tcagggtctg gcaggagtct ccctgacccc tactttctcc tcaggagcgt 841 gtgcttggct tgttgctgtt gcttctggtg cacgcctctc ccgccccacc agagccctgc 901 gagctagacg aggaaagttg ttcctgcaac ttctcagatc cgaagccaga ttggtccagc 961 gctttcaatt gtttgggggc ggcagatgtg gaattgtacg gcggcggccg cagcctggaa 1021 taccttctaa agcgtgtgga cacggaagca gatctggggc agttcactga tattatcaag 1081 tctctgtcct taaagcggct tacggtgcgg gccgcgcgga ttcctagtcg gattctattc 1141 ggagccctgc gtgtgctcgg gatttccggc ctccaggaac tgactcttga aaatctcgag 1201 gtaaccggca ccgcgccgcc accgcttctg gaagccaccg gacccgatct caacatcttg 1261 aacctccgca acgtgtcgtg ggcaacaagg gatgcctggc tcgcagaact gcagcagtgg 1321 ctaaagcctg gactcaaggt actgagtatt gcccaagcac actcactcaa cttttcctgc 1381 gaacaggtcc gcgtcttccc tgccctctcc accttagacc tgtctgacaa tcctgaattg 1441 ggcgagagag gactgatctc agccctctgt cccctcaagt tcccgaccct ccaagtttta 1501 gcgctgcgta acgcggggat ggagacgccc agcggcgtgt gctctgcgct ggccgcagca 1561 agggtacagc tgcaaggact agaccttagt cacaattcac tgcgggatgc tgcaggcgct 1621 ccgagttgtg actggcccag tcagctaaac tcgctcaatc tgtctttcac tgggctgaag 1681 caggtaccta aagggctgcc agccaagctc agcgtgctgg atctcagtta caacaggctg 1741 gataggaacc ctagcccaga tgagctgccc caagtgggga acctgtcact taaaggaaat 1801 ccctttttgg actctgaatc ccactcggag aagtttaact ctggcgtagt caccgccgga 1861 gctccatcat cccaagcagt ggccttgtca ggaactctgg ctttgctcct aggagatcgc 1921 ctctttgttt aaggaacatt tgcatcctcc tggtttctga gggtcctcgt caacgaatcc 1981 tctgctttaa atttattaaa atcttaatcc acgatgtaag gaaagaaagg cagtcaagat 2041 ggttcagtgg gtaaaagcca gcaaacttga cccctgattt taaccctcag gatccacacg 2101 gaaggggaaa actcactcct gaaagttgtc catctgtgct cacaaataaa tattttttaa 2161 aataacaatg tgtttgttgg ttttgttttt gtttgggttt tgttgtggtt ttgtttgttt 2221 tgttttgttt ttgagacagt ctggctatgt atccttggct ggcctcaaac tcataaagat 2281 caagatcggc ctgcctctac ctccaaatgc tctggttaaa gggatgtgcc tccatgccca 2341 gttgaagtca tcctgaacca cgagtccagg ccactcactc tttactaaga tctttactaa 2401 gtat // LOCUS MUS21OHA1 3307 bp DNA ROD 15-MAR-1990 DEFINITION Mouse steroid 21-hydroxylase A (21-OHase A) gene, complete cds. ACCESSION M15009 NID g191497 KEYWORDS steroid 21-hydroxylase. SEGMENT 1 of 2 SOURCE Mouse (strain BALB/c) DNA, clone 21-OH-A. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 3307) AUTHORS Chaplin,D.D., Galbraith,L.J., Seidman,J.G., White,P.C. and Parker,K.L. TITLE Nucleotide sequence analysis of murine 21-hydroxylase genes: Mutations affecting gene expression JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83, 9601-9605 (1986) MEDLINE 87092295 FEATURES Location/Qualifiers source 1..3307 /organism="Mus musculus" /db_xref="taxon:10090" prim_transcript 150..2782 /note="21-OHase A mRNA and introns" CDS join(160..361,430..519,872..1014,1120..1215,1302..1403, 1492..1578,1689..1889,2088..2257,2326..2429,2514..2782) /note="steroid 21 hydroxylase A" /codon_start=1 /db_xref="PID:g191500" /translation="MLLPGLLLLLLLLAGTRWLWGQWKLRKLHLLRLAPGFLHFLQPN LPIYLLGLTQKLGPIYRIRLGLQDVVVLNSNRTIEEALIQKWVDFAGRPHMLNGKMDL DLSLGDYSLMWKAHKKLSRSALMLGMRDSMEPLIEQLTQEFCERMRAQAGTPVAIHKE FSFLTCSIISCLTFGDKDSTLVQTLHDCVQDLLQAWNHWSIQILTIIPLLRFLPNPGL QKLKQIQESRDHIVKQQLKQHKESLVAGQWKDMIDYMLQGVEKQRDGKDEERLHEGHV HMSVVDLFIGGTETTATTLSWAVAFLLHHPEIQKRLQAELDLKLGPGSQLLYRNRMQL PLLMATIAEVLRLRPVVPLALPHRATRASSISGYDIPKDMVIIPNIQGANLDEMVWEL PSKFWPDRFLEPGKNPRTPSFGCGARVCLGEPLARLELFVVLARLLQAFTLLPPPDGT LPSLQPQPYAGINLPIPPFQVRLQPRNLAPQDQGERP" exon <160..361 /note="steroid 21 hydroxylase A, (EC 1.14.99.10)" /number=1 intron 362..429 /note="21-OHase A, intron A" exon 430..519 /number=2 intron 520..871 /note="21-OHase A, intron B" exon 872..1014 /number=3 intron 1015..1119 /note="21-OHase A, intron C" exon 1120..1215 /number=4 intron 1216..1301 /note="21-OHase A, intron D" exon 1302..1403 /number=5 intron 1404..1491 /note="21-OHase A, intron E" exon 1492..1578 /number=6 intron 1579..1688 /note="21-OHase A, intron F" exon 1689..1889 /number=7 intron 1890..2087 /note="21-OHase A, intron G" exon 2088..2257 /number=8 intron 2258..2325 /note="21-OHase A, intron H" exon 2326..2429 /number=9 intron 2430..2513 /note="21-OHase A, intron I" exon 2514..>2782 /note="steroid 21 hydroxylase A" /number=10 BASE COUNT 633 a 1026 c 876 g 772 t ORIGIN 1 attctccaag gctgatgggg actgtgccaa tgtgaaaaca tactgttctg tgttggggac 61 aggaagggac ctgaagcaaa ggtcagagcc acagcagaac aaaggactgg agttgggggc 121 tataaaaggc catatcaggg ccctcacaag tgctgggcca tgctgctacc tgggctgctg 181 ctgctgttgc tgctgctagc tggcacccgc tggctgtggg gccaatggaa gttgcggaag 241 ctgcacctcc tccgtctggc cccgggtttt ctgcacttcc tacagcctaa ccttcccatt 301 tacctgcttg gcctcactca gaaactcggg cccatctaca ggatccgctt ggggctgcaa 361 ggtaagcaag ccagtcttct gtgttgatgt gacccctctt ccctgcctga cattgtcctt 421 tcccctcaga tgtggtggtg ctaaattcta acagaaccat tgaggaggcc ttgatccaaa 481 agtgggtgga ctttgctggc cgaccccata tgctaaatgg taagggttag gacccttgct 541 cagtttcccc tcccttcccc cccccccccc cgtaaacatg gtgctgtgag attgtggcag 601 agaaggcttc ctcggtgagg cacgatggct cctcttcctc ttcctcttcc tcctcctcct 661 tctccttctc ctcttcctcc tcttcctcct gttcttcctc ctcctccacc actccctctt 721 ctttaagatt aggtcttact atagagccct tgttgggctg aaaaccacta tgtagaaaag 781 gctagtctta aacttacaga gatccacctg gtgcgtttgg cctccttggt agcttgtggt 841 catgtctttc tttctaccat ctctcctgca ggaaagatgg acttggacct gtccctgggg 901 gattactctc taatgtggaa ggcccacaag aaactctctc gctcagccct gatgctgggc 961 atgcgagact ccatggagcc tctgatagag cagctgaccc aggagttctg tgaggtgggg 1021 ctagctcttg tagtagcctg actctcttgg ggtcagcctc cctctcccgc cagtacccct 1081 ctggctccac agcctctgca gcctgtctct cctccacagc gcatgcgagc ccaggctggc 1141 acccccgtgg ccatccataa ggaattctcc ttcctcactt gtagtatcat ctcctgcctc 1201 acttttggag acaaggtcca tgttaccctc ttgcccaccc ctcagccccc tctggccctc 1261 tgcgggtctt gaactcaaag catgccctcc tgttccggca ggacagcacg ttggtacaga 1321 cccttcacga ctgtgtccag gacttgttgc aagcctggaa ccactggtcc atccaaatct 1381 tgacgataat tccccttctc agggtgagga gttgcagccc agtctcccct gggttgtggg 1441 ggaggggacc accagtctcc accctgcagc tgactctctt tcctgcccca gttcctcccc 1501 aacccaggcc tccagaagct gaagcagatc caagagagtc gggaccatat tgtcaagcag 1561 cagctgaagc agcacaaggt gggcgttgca gtggcaccag ctcctctggc ctcagcgatg 1621 atgtgcctgc cacccccacc cccaccgact gccaccactg ctggacacac agcttcccct 1681 gcctttagga aagcctggtt gcaggccaat ggaaagacat gattgactac atgctccagg 1741 gagtggagaa gcaaagggat ggcaaagacg aagagcggct ccacgagggg cacgtgcaca 1801 tgtcggtggt ggacctgttc atcggcggca ccgagaccac ggctaccacg ctctcctggg 1861 ctgtggcttt cctgcttcac caccctgagg tgcactgtgg gaagactgct ccttctggaa 1921 ccttgatcca gttgccctgc tcttgactgc ttggcctggg ctatgctcac agagggggct 1981 ctagggtcag ctagatggca ggagggagga gctgcaggct gggcagctgt gagccacctg 2041 gggcaaagtc tggtccctag agctcagcct tgccccctcc ttctcagatc cagaagcgac 2101 tgcaggccga gttagacctc aagctgggcc caggctccca gctcctgtac aggaaccgaa 2161 tgcagctgcc tctgctcatg gccaccattg ccgaggtgct gcgtttgcgg cctgtggtgc 2221 ccttggcctt gccccatcgt gcaactaggg ctagcaggtg attccctggg ggtaagggtt 2281 gagggaggaa gaccttgaca gctcctgact ccacctttcc ctcagcatct ccggctatga 2341 catccctaag gatatggtca tcatccccaa catccaaggc gccaacctgg atgagatggt 2401 ttgggaactg cccagcaagt tctggccagg tatgaggcac acaggcccgg gagcgggaga 2461 gggccggcct ctgctgggac aaacttctct agctctccat ctactaccca cagatcgctt 2521 cctggaacct gggaagaatc ccagaacacc atcctttggc tgtggggcac gcgtgtgcct 2581 gggagagcct ctggcacggc tggagctctt tgtggtgctg gctcgtctgc tgcaggcctt 2641 cactctgctg cctccaccag atggaaccct gccttccctg cagccccagc cttatgctgg 2701 catcaatctc ccgattcctc ctttccaggt gcggctgcag cccagaaacc tggcgcccca 2761 agaccagggt gagcgtcctt gacaggatag gacgagtctc tttaaagttt ctcctttatt 2821 gagttccccc cccccccccc cgtaaagatg gtgctgtgag attgtggcag agaaggcttc 2881 ctcggtgagg cacggtggct cttctctact ggggtgtgag tgaggtaggc agcaggctca 2941 ggtgccctgg tgggggcctg gaagtttctg ggacggagct tcatttccgt gaagggcaca 3001 gaaaactcga agcccttcca gtggtaccag ctcactccct gaggagaggg ttgtcaaggg 3061 aagagagtca agacagcgcc atatcaatgc cacctcctcc agccccaccc tgctctgtag 3121 gtgactccag ggtccagcca tctcccccat gatccctgag ctggtgcctt ctgcccagta 3181 ataaatggtc tcttcagatt tcctcagcaa ggttggcatg agtgtctcat ttcacagttg 3241 agcttctgag gcatgagttc agcccttcat gtgagaggac ttcagcccgg gtcacccaga 3301 tcccaat // LOCUS MUSOXYNEUI 2003 bp DNA ROD 11-MAR-1992 DEFINITION Mouse oxytocin-neurophysin I gene, complete cds. ACCESSION M88355 NID g200167 KEYWORDS neurophysin I; oxytocin. SOURCE Mus musculus (strain B10.A) DNA. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 2003) AUTHORS Hara,Y., Battey,J. and Gainer,H. TITLE Structure of mouse vasopressin and oxytocin genes JOURNAL Brain Res. Mol. Brain Res. 8 (4), 319-324 (1990) MEDLINE 91101513 FEATURES Location/Qualifiers source 1..2003 /organism="Mus musculus" /strain="B10.A" /db_xref="taxon:10090" TATA_signal 1018..1023 /gene="neurophysin I/oxytocin" exon 1045..1205 /gene="neurophysin I/oxytocin" /number=1 sig_peptide 1086..1142 /gene="neurophysin I/oxytocin" CDS join(1086..1205,1467..1668,1752..1807) /gene="neurophysin I/oxytocin" /codon_start=1 /product="oxytocin-neurophysin I" /db_xref="PID:g200168" /translation="MACPSLACCLLGLLALTSACYIQNCPLGGKRAVLDLDMRKCLPC GPGGKGRCFGPSICCADELGCFVGTAEALRCQEENYLPSPCQSGQKPCGSGGRCAATG ICCSPDGCRTDPACDPESAFSER" mat_peptide 1143..1169 /gene="neurophysin I/oxytocin" /product="oxytocin" gene join(1179..1205,1467..1668,1752..1801) /gene="neurophysin I/oxytocin" mat_peptide join(1179..1205,1467..1668,1752..1801) /gene="neurophysin I/oxytocin" /product="neurophysin I" intron 1206..1466 /gene="neurophysin I/oxytocin" /number=1 exon 1467..1668 /gene="neurophysin I/oxytocin" /number=2 intron 1669..1751 /gene="neurophysin I/oxytocin" /number=2 exon 1752..1926 /gene="neurophysin I/oxytocin" /number=3 polyA_signal 1906..1911 /gene="neurophysin I/oxytocin" BASE COUNT 460 a 601 c 495 g 447 t ORIGIN 1 ggatccagca cctcttctgg tcgccaagga aacctgcgtg cacaaatgta cacacacaaa 61 attaaaattg aaatgcaaaa actgttccca aaatgtactg ctaccatcat gcgggggact 121 tgccccaccc aacgtcgctc acacactagg caagtactct gctactgggg ctgcatgtaa 181 caccttccca tgcagaccta tgcagacctg cagcccaaac ctgaaatgta cccagagcct 241 gcccaacctg attgcaccaa aatgggcgaa ccatcatatg tggcccacct gagaagggta 301 tgaccttggc acaaatggcc ttgcctgtag cctgaggcca cctgttggcc acactccagc 361 agtctgatgg cccactgtcc tctcaaacag gagtctaggc acctagtgtg gtagtggata 421 actaaactca gcatttggga gacagaagca gatggagcgc tgtgagttca aggccagcct 481 ggtctacaca gcaggttcta atacagagtt ttaaatacta gtgtaatttt ccttttgctg 541 taatttttct ttctttttat tttaatttgg tctgttaact ctgttttggt tttgatctct 601 actgagactt tgttttacgg gctagttaga agagctgagg tgcattcaga gattgaacaa 661 gacgccatct ttttccccat ctaagtttca ggtccatgta agggctccct cactcactgg 721 ccctatcctg ccttattctg agatattgga tatctgtgaa aaacagctcc tggctagggc 781 gcacctccaa cccctcccaa gtctctctag cctcttgtag cctaggccac cccttccagg 841 ctgcttctct tttgagttcc aggtcattag cagagacgat gaccttgacc ctagcccaga 901 ccctgcaaat gaagggcctg cctctaaaca gcgtggaaca atttcaccca agagaccttc 961 tgtgaccagt catgctgtca ccctctttag acagtgctcc accatggcag tgccagacat 1021 aaaaaggtcg gtctgggccg gagaaaccat cacctacagc ggatctcaga ctgagcacca 1081 tcgccatggc ctgccccagt ctcgcttgct gcctgcttgg cttactggct ctgacctcgg 1141 cctgctacat ccagaactgc cccctgggcg gcaagagggc tgtgctggac ctggatatgc 1201 gcaaggttag tctccccgac cctgtccctt cccttcccgt tctggcgatg ctaaggacca 1261 gagaagctct cccacctaca gagagcattc ccgcacactt gccagcccta ccaaggcctc 1321 gcgtgggaac ccagggcttt gggaagtgtt aggctccctc ttgacgccgt gaaggtaacg 1381 acaatgccgg agcacccact gcccctcgct ctgccacagt ccggattcgg attgtgcacg 1441 gcgcccaccc gcatccttcc ccacagtgtc tcccctgcgg cccgggcggc aaaggacgct 1501 gcttcggacc aagcatctgc tgcgcggacg agctgggctg cttcgtgggc accgccgagg 1561 cgctgcgctg ccaggaggag aactacctgc cttcgccctg ccagtctggc cagaagccct 1621 gcgggagcgg aggccgctgc gccgccacag gcatctgctg cagcccgggt gagcaggagg 1681 gggcccagca ggtgacccgg caaggagccg tcgggtttgc agctcagaac actgacccat 1741 ttctcttgca gatggctgcc gcacagaccc cgcctgcgac cctgagtctg ccttctcgga 1801 gcgctgagcc cactttctgg gaataccttt agcgcgcttc cttcgttccc catggccact 1861 gccagaaaaa aaaaaaaaaa agaaaagaaa agaaaagaaa agaaaaataa agtagatttc 1921 ctcttcaaac ttgactggtg tctaattgtc ggaaacggga gggaggaaag gcaccgggaa 1981 cgccgtgctc ttggcatctt gta // LOCUS MUSAPE 4856 bp DNA ROD 26-SEP-1998 DEFINITION Mus musculus gene for apolipoprotein, exons 1,2,3,4, complete cds. ACCESSION D00466 NID g220334 KEYWORDS apolipoprotein E; B2 repetitive sequence. SOURCE Mus musculus (strain:BALB/c) DNA. ORGANISM Mus musculus Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. REFERENCE 1 (sites) AUTHORS Rajavashisth,T.B., Kaptein,J.S., Reue,K.L. and Lusis,A.J. TITLE Evolution of apolipoprotein E: mouse sequence and evidence for an 11-nucleotide ancestral unit JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82 (23), 8085-8089 (1985) MEDLINE 86068046 REFERENCE 2 (bases 1 to 4856) AUTHORS Horiuchi,K., Tajima,S., Menju,M. and Yamamoto,A. TITLE Structure and expression of mouse apolipoprotein E gene JOURNAL J. Biochem. 106 (1), 98-103 (1989) MEDLINE 89380144 COMMENT Submitted in computer readable form by S. Tajima on 12-Sep-1989. FEATURES Location/Qualifiers source 1..4856 /organism="Mus musculus" /note="5' end of Sau3AI site." /strain="BALB/c" /db_xref="taxon:10090" GC_signal 926..931 GC_signal 946..951 TATA_signal 965..971 exon 997..1038 /number=1 prim_transcript 997..3781 /note="apolipoprotein mRNA and introns" intron 1039..1797 /number=1 exon 1798..1863 /number=2 CDS join(1821..1863,2403..2571,2949..3672) /codon_start=1 /product="apolipoprotein" /db_xref="PID:d1000815" /db_xref="PID:g220335" /translation="MKALWAVLLVTLLTGCLAEGEPEVTDQLEWQSNQPWEQALNRFW DYLRWVQTLSDQVQEELQSSQVTQELTALMEDTMTEVKAYKKELEEQLGPVAEETRAR LGKEVQAAQARLGADMEDLRNRLGQYRNEVHTMLGQSTEEIRARLSTHLRKMRKRLMR DAEDLQKRLAVYKAGAREGAERGVSAIRERLGPLVEQGRQRTANLGAGAAQPLRDRAQ AFGDRIRGRLEEVGNQARDRLEEVREHMEEVRSKMEEQTQQIRLQAEIFQARLKGWFE PIVEDMHRQWANLMEKIQASVATNPIITPVAQENQ" intron 1864..2402 /number=2 repeat_region 2162..2197 /note="CA repeat" /rpt_type=tandem /rpt_unit=2162..2163 exon 2403..2571 /number=3 variation 2518 /citation=[1] /replace="t" intron 2572..2948 /number=3 exon 2949..3781 /number=4 variation 3225 /citation=[1] /replace="t" polyA_signal 3762..3767 repeat_unit 4167..4376 /note="rodent repeat(B2)" BASE COUNT 1108 a 1291 c 1267 g 1190 t ORIGIN 1 gatcttcctg cctcagcttt gcatatggct agcactatag acccatgttc cagtgaatga 61 cttatggctt gtcttttttt tttttttttt ttttttttat gtgcattagt gttttgcctg 121 catgtatgcc ttcgtgaggg tagcagatct tggtgttaca gttgtgagct gctgtgtggg 181 tgctgcgatt ttgaacctag gtcctgtgaa atcgagtcag tgctcctaac ctctgagtca 241 tctctccagc tcctgctctt ctgcttttat gaggaaaaag aaaagagaag tggcttgaga 301 gtggaaaatg cacatgcagg ggtgcacacc tgcagtcccc agcatgctac agcagaggca 361 gaaggacctt tgtgggttag agggcagcct gagaatctta tctcaaaaca actttttaaa 421 atgtgctctg taggggtagc tcttccctcc caaggtgaca cattgcaatc gccagaaaca 481 gatcaggagc atcaacgctt ggtttcccag ggcttggctt aatgtatggc ttcaaaccca 541 tcgggagcca ccactgaaca gctcctgaag gaactggagc acgtcccagc cttggaatgg 601 aaagagttca cctgtggtgg aggaatcaac aacgagggat cccagaacaa cgatcttcac 661 cccagaagct gagcctctta gcccccaccc acccatttcc agtttaggct gaccagctct 721 tttctttaca atgcaccaga cccgcggaaa gggaaggagc ggttctcagt gcccagtacc 781 aaggcctgga ttattcaatg aggtgtccgc tccctttgtt ggcggggagg ggagcggggg 841 tcacaaggca tccaaactcc acctctttcc tctgccctgc tgtgaagggg gagagaacaa 901 cccgcctcgt gacagcgtgc acagcccgcc ctagccctga ggagggggcg gacaggggga 961 gtcctataat tggaccggtc tgggatccga tcccctgctc agaccctgga ggctaaggac 1021 ttgtttcgga aggagctggt aagacaagct gggctgggga ttcacccagg gaccttggta 1081 ggatgtgggc tgggaacctt gagatccccc ggagtccagg aaacaggcac aagaattgga 1141 aaagcaggca gcacgataga agtcttgggg gacaaactaa ggactcgagg taactagcct 1201 ttgccagagt cagagcaggt ggaggggtta cctccaggaa ggagtacggg actgtcggtg 1261 ccacggcgta ccggctcaac taggaaccag tcctatggcg aaaaaactcg ggatgagcct 1321 taggctgctt tttatataaa tacctactga tttccatcac agtccccaag taacccggac 1381 tggtttcaaa ctgtggctcc tcatggctga gctccctaag ttctgtagtt gtgggagggt 1441 accacttcgc agggatggag gacgattaaa aatcgtgtta aattaacaca aaatggaaag 1501 caggacttag ccgggaagaa agaggaatgt aagctggacc acccgctggc cctctgtgaa 1561 gtggaatttg aaccctagga gagggagctg gaatttttgg cagcggatcc acccggggtg 1621 ccgagatagc gaactcggca aggggaggta aacagacctt tgggaagagc gggtgctctg 1681 ttttggagat gtttgtgatg gctcacagat ctgagaaggg aagatggggt tctctgggtg 1741 gccggagtcc ctccaccccc gccccctggt gttcaaagac aatttttccc tccgcagact 1801 ggccaatcac aattgcgaag atgaaggctc tgtgggccgt gctgttggtc acattgctga 1861 caggtatgga gcaaggactt gctgtggtgc cgctttttct gctcctctgt ggactctatt 1921 ctagccctag atctcttctg ctggtgggtt gaggctgagg cggttctgag acctctctga 1981 gattcaatag ccctgagcag ctgttttata tactctttag gccttgattt cccctaacta 2041 taagaggatt gatggcttag tgggtaaagg tgcttatcat aaagcttgaa gacttaactc 2101 tgagaccaac atagtggaag gagagagcca attccctcta gaactctgac ctccaagggt 2161 acacacacac acacacacac acacacacac acacacaata aaaagttgat ttcttagttg 2221 ctcttaaaag tgggtataca ttttctaatt ccacacctgc ctagtctcgg ctctgaacta 2281 cataggatac aagacagtgc atgcttgtag tatctaaaca gactccacag cctccagacc 2341 cacttcaaag agacccaaaa agactgtagg tcctgaccca gccttaaact tactctacac 2401 aggatgccta gccgagggag agccggaggt gacagatcag ctcgagtggc aaagcaacca 2461 accctgggag caggccctga accgcttctg ggattacctg cgctgggtgc agacgctgtc 2521 tgaccaggtc caggaagagc tgcagagctc ccaagtcaca caagaactga cgtgagtgtc 2581 cagctctttc accctcggca ggcaccagct gatccagggt tgcctcctat ctgggtcccc 2641 agccccttct tgtttccttt ctcaattagt gtgtagccca ggttggcctt gaatcctcct 2701 gccttcttta gccttctgga tgctgggagg aacagacatt tattacttgc ttggtcgatt 2761 ggcttttggc ttcttgagac aggatcccat tctgtaactc aagctggctt cgaaggctct 2821 gcaattctta tgccgcagct tctcaacttc tgggaacaca agcgagtacc atcacctctt 2881 gcctctgtgg tttctggccc cttctgtcct gccttcatct ccttcctgtg tttcctctgg 2941 gcctgcaggg cactgatgga ggacactatg acggaagtaa aggcttacaa aaaggagctg 3001 gaggaacagc tgggtccagt ggcggaggag acacgggcca ggctgggcaa agaggtgcag 3061 gcggcacagg cccgactcgg agccgacatg gaggatctac gcaaccgact cgggcagtac 3121 cgcaacgagg tgcacaccat gctgggccag agcacagagg agatacgggc gcggctctcc 3181 acacacctgc gcaagatgcg caagcgcttg atgcgggatg ccgaggatct gcagaagcgc 3241 ctagctgtgt acaaggcagg ggcacgcgag ggcgccgagc gcggtgtgag tgccatccgt 3301 gagcgcctgg ggcctctggt ggagcaaggt cgccagcgca ctgccaacct aggcgctggg 3361 gccgcccagc ctctgcgcga tcgcgcccag gcttttggtg accgcatccg agggcggctg 3421 gaggaagtgg gcaaccaggc ccgtgaccgc ctagaggagg tgcgtgagca catggaggag 3481 gtgcgctcca agatggagga acagacccag caaatacgcc tgcaggcgga gatcttccag 3541 gcccgcctca agggctggtt cgagccaata gtggaagaca tgcatcgcca gtgggcaaac 3601 ctgatggaga agatacaggc ctctgtggct accaacccca tcatcacccc agtggcccag 3661 gagaatcaat gagtatcctt ctcctgtcct gcaacaacat ccatatccag ccaggtggcc 3721 ctgtctcaag cacctctctg gccctctggt ggcccttgct taataaagat tctccgagca 3781 cattctgagt ctctgtgagt gattccaatc agcttcagcc tcagtttatt gttttttgcc 3841 ttacctagca cacattccat ggccctgtca ctatctgtag agggaggtgg ttttgcagca 3901 atagaaatga agcctaggac ctagcaacat aaaagaacaa gtgatctacc actgagccac 3961 gcccacagcc tcactggggg attctaggca ggggctctac cactgagcca cccgcagccc 4021 tcactgggga atcatatcta ccactgagtc acgcccctcc agcccctcac tacggaattc 4081 tagtcagtag ctctaccact gagccacacc cacagcctct ggggctcttc accgccccct 4141 acccctggat tctaggcatg ggcatcattt tatttattta tttatttaag atttgtttat 4201 cttatgtata aggtacactg cagctgtctt caggcgtcag gtcccattac agatggttgt 4261 gagccaccat gtggttgctg ggaattgaac tcaggactta tagaagagta gtcagtgctc 4321 ttaactgctg agccatctct ccagcaccca gtacaggctc ttctatttag ctatatccac 4381 ccttcttttt agtctgaaat aggatctcaa ctgatttcct tgcactcctc tagcctagtt 4441 ggtcttgaat attgaatctt gtttcaatca atctctacag gaactgagaa aggcatgtac 4501 acttcatgtg ggtcagttgg gctacttttc ccaacttccc aagcacccac tcgacagcta 4561 tgccttgaat caatcaacat gtaagagacc agggtcgcca ggacggagat ttacttttct 4621 ggttgtctta tctctcctcc tccgctctag tcttatctga ccaccctctc cttgccttgt 4681 ctctcctctt tttcccttct aggcttcctt ttctggcttc ctgttttcct gatcctctgt 4741 tatctcaccc tcccgcggtt tcttttgctc tgggcctttg gttggcggtt tctacggttt 4801 ctacgtggct tttggaacct cagcctttct cccttgctct gaagttagct ggatcc // LOCUS MUSTKM 2939 bp DNA ROD 24-SEP-1992 DEFINITION Mouse thymidine kinase gene, complete cds. ACCESSION M68489 NID g202078 KEYWORDS thymidine kinase. SOURCE Mouse (strain C57BL/10J) liver DNA, clone Mtk116. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 2939) AUTHORS Gudas,J.M., Fridovich-Keil,J.L., Datta,M.W., Bryan,J. and Pardee,A.B. TITLE Characterization of the murine thymidine kinase-encoding gene and analysis of transcription start point heterogeneity JOURNAL Gene 118, 205-216 (1992) MEDLINE 92380505 FEATURES Location/Qualifiers source 1..2939 /organism="Mus musculus" /strain="C57BL/10J" /db_xref="taxon:10090" /tissue_type="liver" exon 1..238 /gene="thymidine kinase" /number=1 mRNA join(1..238,320..351,844..954,1180..1273,1496..1585, 1737..1856,2162..2775) /gene="thymidine kinase" /product="thymidine kinase" protein_bind 77..85 /bound_moiety="Sp1" protein_bind 90..106 /bound_moiety="MT2" protein_bind 130..142 /bound_moiety="MT3" gene join(173..238,320..351,844..954,1180..1273,1496..1585, 1737..1856,2162..2350) /gene="thymidine kinase" CDS join(173..238,320..351,844..954,1180..1273,1496..1585, 1737..1856,2162..2350) /gene="thymidine kinase" /codon_start=1 /product="thymidine kinase" /db_xref="PID:g202079" /translation="MSYINLPTVLPSSPSKTRGQIQVILGPMFSGKSTELMRRVRRFQ IAQYKCLVIKYAKDTRYSNSFSTHDRNTMDALPACMLRDVTQEALGVAVIGIDEGQFF PDIVDFCEMMANEGKTVIVAALDGTFQRKAFGSILNLVPLAESVVKLTAVCMECFREA AYTKRLGLEKEVEVIGGADKYHSVCRLCYFKKSSAQTAGSDNKNCLVLGQPGEALVVR KLFASQQVLQYNSAN" intron 239..319 /gene="thymidine kinase" /number=1 exon 320..351 /gene="thymidine kinase" /number=2 intron 352..843 /gene="thymidine kinase" /number=2 exon 844..954 /gene="thymidine kinase" /number=3 intron 955..1179 /gene="thymidine kinase" /number=3 exon 1180..1273 /gene="thymidine kinase" /number=4 intron 1274..1495 /gene="thymidine kinase" /number=4 exon 1496..1585 /gene="thymidine kinase" /number=5 intron 1586..1736 /gene="thymidine kinase" /number=5 exon 1737..1856 /gene="thymidine kinase" /number=6 intron 1857..2161 /gene="thymidine kinase" /number=6 exon 2162..2775 /gene="thymidine kinase" /number=7 polyA_signal 2755..2760 /gene="thymidine kinase" BASE COUNT 641 a 883 c 742 g 673 t ORIGIN 1 ccatggcaga tccggaggga tggtcgagct ccaggctttt cacgtagctg agaggtggga 61 cgagtcttgt cttcgtcccg cccccttttg agttcgcggg caaatgcgag cagtaagtcg 121 aaatttttcc acccacggac tctcggtgct aactaaggtt tgcacagcag ccatgagcta 181 catcaatctg cccaccgtgc tgcccagctc ccccagcaag actcgggggc agattcaggt 241 gcggggtctg gagtggggtc gggtggtggc tggtgcgggg tgtacaaggg ggagacccct 301 taaatgactt ctcttctagg tgattctcgg gcccatgttc tcagggaaaa ggtaatgaat 361 gggccttcgg ggcggtgggc ctccctctct ctctgcagcc tctctctctg caggccctgg 421 ctctcctctc ccccctcccc catcttccag gagcctctga ctaggcggca aacgcgctcg 481 acctctgtgt gtacattccc gcccaaaatt cgatttcctg gattgctgcc atcctcctgc 541 ccaagccttt tctcttgggg aaccctatct ttgctgaggc ggaccacatg ggggtacaat 601 tgaagacccc agcccacatt acccaggagc cccaccgaac tctcatgtgt ccctgggagt 661 taggcacctt ctctgacagt ttcctcattc attccccatg cgagctccta agctggagcc 721 aattaactct gaggtgaggt ctaaactctg gagtcccacc cagaataaca ggctaccgac 781 gctaggtggt agtggcagcc cgccttttgc tcgggcctca tgtgctctgg ttaattcttg 841 cagcacagag ctgatgagaa gagtccggcg cttccagatc gcccagtaca agtgcctggt 901 catcaagtat gccaaagaca cgcgctatag caacagcttc tccacacatg atcggtcagt 961 cctacccctg agcctgccct tcggaaaccc ctaagcctcg ggggcacact gactgctagc 1021 ctctagtgga tatgaataag tgctcaaaac tactagatac tgtccagtgt catgtaccac 1081 gagctggaga ctgtcactgt agagctgcag attaggtgac actatagaat actcaagctt 1141 ggtctctgcc cttgctcatt cttggccttc tcttcccagg aacaccatgg acgcattgcc 1201 agcctgcatg ctccgcgatg tgacccagga ggccttgggt gtggccgtca ttggcatcga 1261 tgaggggcag tttgtaagtc agcctgcagc ataacctcac cctgacccct gacccttcct 1321 gccctgctga ctcagaagtc cctgtcatta acttgtcttt actggggtgc ggatccaatt 1381 cgccctatag ctgagtcgat acaatcactg gcgtcgtttg catgctcagc tcccccacgg 1441 tcttcgtcag gcccttgcct ggggcagata acctcatcct ctggactttt cccagtttcc 1501 tgacattgtg gatttctgtg aaatgatggc caacgagggc aagacagtaa ttgtggcagc 1561 gctggatggg accttccaga ggaaggtaag atgtctgact ctggcttggc acagtgcaga 1621 ggcacagctc ctgacgttgc tctctggccc ccacacacat acacatgtga acacaggcag 1681 aggagggagg tgcctggcta actgaccgca cacacctgct gctgtcttcc gtgcaggctt 1741 tcggcagcat cttgaacctg gtgcccctgg cggagagtgt ggtgaagctc accgctgtgt 1801 gcatggagtg cttccgagaa gctgcctaca cgaagaggct gggcctggag aaagaggtac 1861 acacacacac acacacacac acacacacac acacacacac acacacacac acacacaaca 1921 cacacacaca cacacacaca cacacacaca cacacttact tctgcccaag gagtggggtg 1981 ggcttaggct ctggcttctg agtcccaccc tcacatctcc ctcccaccag ggtgaggtga 2041 ttgggcctgg agaaagaggt atacacacac acacacacac acacacacac ttctgcccaa 2101 ggagtggggt gggcttaggc tccggcttct gagtcccgcc ctcacatctc cctcccacca 2161 ggtggaggtg attggcggag ccgacaagta tcactccgtg tgccgcctgt gctactttaa 2221 gaagtcttca gcccagactg ctggctcaga caacaagaac tgtctggtgc tggggcagcc 2281 gggagaggcc ttggttgtca ggaagctctt tgcctctcag caagtcctac aatacaactc 2341 tgccaactga ggggacctga ggcctgccag ctcctaccca ggttggactc tcagagagca 2401 gggggagcgc gggcctgcca ttcctaatgg acaatgtacc ttgaacaggc tgccactcgc 2461 tgaagccgtt ttcagttccc ttcttgattg ccaagatgcc tcaatgcaga ctggagccca 2521 gaccctgcct ggtggctagc ggtcccgtgt tcagccaaag gtgaggacag agctgtccag 2581 cattgtgaca ctggtggggc tagtttcttc cttgttcgtg gctgggtttc agtctcagag 2641 ccccaccctc accaaggctc cacgcctctc acagctcccc catttatgcc taaacattct 2701 ctcctcagaa cctcagctct tagtgagcca cttttcttgt gcaaaatgaa caatattaaa 2761 gtttactact aatgagaacg tgtttctcct tagcctgggt ttccctaact tgcaaccggc 2821 acccacatga cttgggggta gaaatgtgtt ttgtagtacc aatggtctca ccacagccca 2881 agaagaacag tcccccacat ttctatctgg ttggtttcgt gacaaaaaat ggcaaagaa // LOCUS MMPCNAG 4970 bp DNA ROD 07-MAY-1991 DEFINITION Murine PCNA gene for proliferating cell nuclear antigen (DNA polymerase delta auxiliary protein). ACCESSION X57800 NID g53601 KEYWORDS PCNA gene; proliferating cell nuclear antigen. SOURCE house mouse. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 4970) AUTHORS Matsukage,A. TITLE Direct Submission JOURNAL Submitted (11-FEB-1991) A. Matsukage, Aichi Cancer Center Research Institute, 1-1 Kanokoden, Chikusaku, Nagoya 464, Japan REFERENCE 2 (bases 1 to 4970) AUTHORS Yamaguchi,M., Hayashi,Y., Hirose,F., Matsuoka,S., Moriuchi,T., Shiroishi,T., Moriwaki,K. and Matsukage,A. TITLE Molecular cloning and structural analysis of mouse gene and pseudogenes for proliferating cell nuclear antigen JOURNAL Nucleic Acids Res. 19 (9), 2403-2410 (1991) MEDLINE 91252282 COMMENT See X57798 and X57799 for related sequences. FEATURES Location/Qualifiers source 1..4970 /organism="Mus musculus" /strain="ssp. domesticus; strain C57BL" /db_xref="taxon:10090" /dev_stage="adult" /tissue_type="spleen" GC_signal 828..833 GC_signal 848..853 mRNA join(964..1328,2144..2241,2381..2448,2701..2895, 4241..4364,4443..4853) /gene="PCNA" gene 964..4853 /gene="PCNA" misc_feature 964 /gene="PCNA" /note="cap-site" /evidence=experimental exon 964..1328 /gene="PCNA" /number=1 CDS join(1108..1328,2144..2241,2381..2448,2701..2895, 4241..4364,4443..4522) /gene="PCNA" /codon_start=1 /product="proliferating cell nuclear antigen (DNA polymerase delta auxiliary protein)" /db_xref="PID:g53602" /db_xref="SWISS-PROT:P17918" /translation="MFEARLIQGSILKKVLEALKDLINEACWDVSSGGVNLQSMDSSH VSLVQLTLRSEGFDTYRCDRNLAMGVNLTSMSKILKCAGNEDIITLRAEDNADTLALV FEAPNQEKVSDYEMKLMDLDVEQLGIPEQEYSCVIKMPSGEFARICRDLSHIGDAVVI SCAKNGVKFSASGELGNGNIKLSQTSNVDKEEEAVTIEMNEPVHLTFALRYLNFFTKA TPLSPTVTLSMSADVPLVVEYKIADMGHLKYYLAPKIEDEEAS" intron 1329..2143 /gene="PCNA" /number=1 exon 2144..2241 /gene="PCNA" /number=2 intron 2242..2380 /gene="PCNA" /number=2 exon 2381..2448 /gene="PCNA" /number=3 intron 2449..2700 /gene="PCNA" /number=3 exon 2701..2895 /gene="PCNA" /number=4 intron 2896..4240 /gene="PCNA" /number=4 exon 4241..4364 /gene="PCNA" /number=5 intron 4365..4442 /gene="PCNA" /number=5 exon 4443..4853 /gene="PCNA" /number=6 polyA_signal 4635..4640 /gene="PCNA" polyA_signal 4828..4833 /gene="PCNA" BASE COUNT 1385 a 1000 c 1118 g 1467 t ORIGIN 1 aagctttctt cacaactcaa ccaacttcta tacagttaga agaggcattc caaatggaaa 61 ctatcaagtt tcacattaaa gtattacact ttcattatca aatatatttt acctaccacc 121 aaaaccaaaa gaaaaaaaca tgagaaattg aaaaactcta tatccttttg taaatgatgt 181 taatctacct gcagtcagtt cttccccatc ctcataatca ataggaaaaa caaaaaaaca 241 aaaaaacttt agtagaacaa aatgctgcag cccagggaca tatgaaagac gctccacctg 301 caacagctaa cattttcagg atggggctca ggctcgggtt catcttcagc ctaaaattaa 361 gatgaaaata ctagtaatat ctacctttag aaaaatgtag agaaccagac ataaaaggaa 421 aataattacg tgatacatat ggacttggta tcttctttgg agaagcgttc acgttaagag 481 gtattttttt agtctttaga gagaaaagtt cagagcgacg cacacagaaa agtgatatga 541 aatggggggg ggggtagggg tgttaaaata tggtggcctc tttattactc gatattttgc 601 agcgtatttc ttacgttagg gaaaacgctc cgtagtgttt aaaatactct ccagcttcaa 661 ggcaggccgc cgcgcacagc tcgatttgcc tgtgacttcc acttccgtgg cgcggaaact 721 tcctaaggat ggaaactgca gcctaaactc ccacaaactt gggcggtgac gacagcctac 781 gcgaaccccg tgatgcccct cgcctcccag gctcctaccc cgcagccccg cctttgcata 841 cgcggtgggg cgggccttgc tcaaaccacg ggtacgattg gtccttgagg agaggtgggt 901 ggatcagcgc tgtggcgtca tgacctcgcg cagggaaaag gcgcgcgcct aggaagccgc 961 ggcattagac ggttgcgcgc gcagagggtt ggtagttgtc gctgtaggcc ttcgctgccg 1021 cttctgcatc gtgaatcggg ggaccttggc agccagacct cgttcctctt agagtagctc 1081 tcatctagtc gccacaactc cgccaccatg tttgaggcac gcctgatcca gggctccatc 1141 ctgaagaagg tgctggaggc tctcaaagac ctcatcaatg aggcctgctg ggacgtcagc 1201 tcgggcggcg tgaacctgca gagcatggac tcgtctcacg tctccttggt acagcttact 1261 ctgcgctccg aaggcttcga cacataccgc tgcgaccgca acctagccat gggcgtgaac 1321 ctcaccaggt gagcgggtgg cgggagcggg gccccactct tcccgcttcc gctcttggcg 1381 gggctgtgac tctgcacgct cattggctgg cttggccatc cgcgctttct gattggtcta 1441 tggtgtcggg ggcagccctc accaaagcgc gcggttccga aaagcccgcg ctggcagtgg 1501 cgcccactct gtttccgcgc caaagccaca aagcgggagt ccgcgggaaa atgagtgctc 1561 cggagctgtg ctcattaaat gcctgcagct ttgagtggct ggtcttagcg cctaataaac 1621 gagtcttagt gcaaatgtaa tgtcgactta gagtgacaat agaccttttc ttgacttcca 1681 gagtctcact gcgcatcatg gatttgaggg gaaatctgtc agttttagct tttaactttg 1741 ctacagctac ctaggttagt gcctcctgta tacgtgttca aggacagtgt gtgacttatt 1801 ttagtacaga tacatggatt agtgccactt gtatacattt tgaaagattt acgaaaaggc 1861 cagacgtgat ggggcacatt ctccagtaca ctagaaacca aggacacccc gctcaaaaag 1921 atgctttctc gaatgttggc ttttagtgca ttttactaag tcggttttaa gaatcacata 1981 tacccggtaa tttgcttcac ccctgagaga gtttggggta cccttagccc ctttaacagt 2041 tctccaaccg tgagtgtgaa atggtacaac ttgtaattgc tttttaaaat atagatgtgg 2101 attacatgtt gataaagcct gtcttttttt ttttgggggg tagcatgtcc aaaattctaa 2161 aatgtgctgg taatgaagac atcattacat taagggctga agataatgca gacaccttag 2221 cactagtatt cgaagcacca agtaagttaa acacctttaa aactcggagt tacgtgttgt 2281 ttctgtttct caaaaccaaa aaaaatatta acaatattgt aaattccatc atagatagga 2341 ccgtgtggtg tgcttggtaa cattttcctt cttttggtag atcaagagaa agtttcagac 2401 tatgaaatga agttaatgga cttagatgtg gagcaacttg gaatcccagt gagttacctt 2461 gtttctgatt gtgtgttacc ctgctgtgat accagctgat gcgtgttctg agtggagtgg 2521 tggtattggg gatgaatggc acactgccat ttcactaaac cacagcagtc taaagttgat 2581 tgagttttaa agaaaccaga agtcttgcat tctgagttct ggttaagatg ctaaatcttg 2641 agaacatgaa gctgagcctt cccccttttc tagactgacc tttaacttgt gggtttacag 2701 gaacaggagt acagctgtgt aataaagatg ccgtcgggtg aatttgcacg tatatgccga 2761 gaccttagcc acattggaga tgctgttgtg atatcctgtg caaagaatgg ggtgaagttt 2821 tctgcaagtg gagagcttgg caatgggaac attaagttgt cacaaacaag taatgtggat 2881 aaagaagagg aggcggtgag tagtaagggg gcgtccagtt aggtgtctga agcagggatg 2941 gagcctcggc ttttgttttt atttattcat tcattttgag atggagtctt gagtagacca 3001 agctatctta gagctcagag acgactccat aagcttttac aggtagcatt tggaaagcta 3061 agtgtacagc cttttgcttc ctggaaatac tcttggcaaa taagtgaggg ttggcaagtg 3121 agcaaaagaa aatggttggg ggtgtatgta gctttatgtg ttgcaggttc aagagtattt 3181 gcagtcccaa gggaaataag aaagacttca caaaatgtgg aaagagttgt attaaatgct 3241 cttgacagtt acatccatag agaaagctgg gcatgatgtc tcaaacccac aactgatgta 3301 ctcaaagcta cagcaggaag attctcagct taaagtcaac ctggcagaaa atctagctca 3361 aaaagaatga ggaagaaatt gggaaggcaa aggaagatgt tctccgagtc ctcctcattc 3421 aagtagaaca tactaggcct ctttaatttc taagtatccc tgaatcgagg ctttttctca 3481 ggaatccaat gtatatttca tggctacact tttttttttc ttttttaagt tttgctagct 3541 agcctgagca tcagaattac acacagaagt ctgaactaaa taggattttt agggtttagt 3601 atagtgaaat tcagagtgct tctgcaagta tttaaggtaa atataaggtg ttacttggcc 3661 tctgcatgaa tttaaagtaa atgaaagtgt aagaattcga acatagataa acacacaacc 3721 caagaactag ttcttaacct taatctgctg aattatttct acttccatat caacttcagc 3781 tcctcagttc tcaaatactg acatgtaatt catcagtatt tgtctgatgt gcaagcattt 3841 ccacaacaaa agaaattaag gaatttttca gtatccacat gttcaaggat tgggaattga 3901 ataaaattga taatcataca atgaagactg gtttactgtc ccttagcttg cattcagctg 3961 ttggttcttg tttttggaag tggttatgtg ttatcttcct ctcttcatgc attctttgca 4021 agagaagtat gtacaatctg aataggaaca actttctcct ttgttttgat tgcttggggt 4081 gtggctctac aggatgggca agctagactt ttttcttctt tagtcaaggt tttcatcaac 4141 ccttccaaaa tgataactat ttgttttgct ttgtggtata ataccgtgat ctaatagtgt 4201 gagtttctga tgtctacagt gagcctgttt tctcctctag gtaaccatag agatgaatga 4261 gcctgttcac ctaacgtttg ctctgaggta cctgaacttt ttcacaaaag ccactccact 4321 gtctcctaca gtaacactca gtatgtctgc agatgtgccc cttggtaaga tgataagttt 4381 gaacattgtt ttgtaatgtg gtatttatag tattcggtgg tttaattttt cctgtctttc 4441 agttgtagag tataaaattg ctgacatggg acacttaaag tattatttgg ctcccaagat 4501 tgaagatgag gaagcatctt aggcattgct agaaattgag aaaactaaac ctttgaagat 4561 tgctcctgag atgccagcgt gtcctgaggt cttttctgtc accaagtttg tacctgagta 4621 ttcttaaata ttaaaataaa atatgtagat atcttctgta aataacctac tttcttttct 4681 ctccattctc cataatttgc ttaaagaata agctccaaag taaaaactag ttttgttaac 4741 atgaatgttt ctgctttaca aatactggtg attttccatc aatgatcttg acgctaaatg 4801 cagttttaag aaatattgtt caatttaaat aaagttaaca atttgaaaag tcatagagca 4861 gtgccgcctt tgtaccatgg ctactgaata gttataccct gttaggtaaa tgggcttttg 4921 ttctctcttc catcgtggag atagcaataa aaaaatgttt ctcttgggtt // LOCUS MUSGAD45 3100 bp DNA ROD 23-JUL-1998 DEFINITION Mus musculus GADD45 protein (gadd45) gene, complete cds. ACCESSION U00937 NID g392933 KEYWORDS . SOURCE house mouse. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 3100) AUTHORS Alimzhanov,M.B., Kuprash,D.V., Turetskaya,R.L., Osipovich,O.A., Borodulina,O.R., Osovskaia,V.S., Chumakov,R.M. and Nedospasov,S.A. TITLE Cloning and characterisics of murine genes coding for the human GADD45 analog -- a protein induced in response to DNA damage JOURNAL Dokl. Akad. Nauk 333, 788-791 (1993) MEDLINE 94154610 REFERENCE 2 (bases 1 to 3100) AUTHORS Kuprash,D.V. TITLE Direct Submission JOURNAL Submitted (19-AUG-1993) Dmitry V. Kuprash, Engelhardt Institute of Molecular Biology, 32 Vavilova Street, Moscow, Russia FEATURES Location/Qualifiers source 1..3100 /organism="Mus musculus" /strain="129SV" /db_xref="taxon:10090" /sex="female" /tissue_type="hepatic" /dev_stage="adult" CDS join(891..934,1024..1125,1220..1457,2362..2475) /codon_start=1 /product="GADD45 protein" /db_xref="PID:g392934" /translation="MTLEEFSAAEQKTERMDTVGDALEEVLSKARSQRTITVGVYEAA KLLNVDPDNVVLCLLAADEDDDRDVALQIHFTLIRAFCCENDINILRVSNPGRLAELL LLENDAGPAESGGAAQTPDLHCVLVTNPHSSQWKDPALSQLICFCRESRYMDQWVPVI NLPER" protein_bind 1547..1566 /note="putative p53 binding site" /bound_moiety="p53 oncoprotein" polyA_signal 3012..3019 BASE COUNT 784 a 759 c 782 g 775 t ORIGIN 1 gttaaacatg cgatcaccag ctccccacct tacatggagt aggcagaaat ttcaccaact 61 ttttgtggaa aagattccaa cttggtactg tgccaccact acacacaaat gcgatttttt 121 tttttttttt cttctccctg gagtttcaaa ttctcacaac caaccaaaac caaatcaaac 181 cagccatcag ggcaccaaaa gactactact gggaataagc aggtttacct gaaacttgca 241 tataaacact gaagcaagct ttgcccggca tgtctggaca cttaaaaata tgcttactaa 301 caacagctag actggagtgc attaatccct ggaagtaaaa tttaaaaagc agccccttaa 361 cacaaagggc acgaaacgac aatcaaaaga tgtccacaac cagaaaaaca attggcaggc 421 tgtttcagtc agagctgagg acacttgaac cactgcaaag ccctccgcac cagaaggctc 481 cacaaactac ctagagctgg ctcaggaccc caccaaccgg gtttcagcac caccttcgtc 541 ccgtgggctc cctcccgcgt acgtccgccc ctttccgctc aactctgcct tgctttggtg 601 gagcgagggt ctgggtgtca acgtctgcta atttgcataa cccaatggcc tgactgcatg 661 caaatgaagc tggacctggt tggctgaggg ctagccccat atctccggaa tcgggccctt 721 tgtcctccag tggccccgag gcagcagtgc agagttcccc agcgaggcta ggcgagcagc 781 cggccggccg gagcggagaa gggagggtgg gagcgagcgc agagccggcg ccgcgcactg 841 tgggggccag gagcagcccg cgcgccgagg gagggactcg cacttgcaat atgactttgg 901 aggaattctc ggctgcagag cagaagaccg aaaggtaagt gtgcctgccg actcggtggc 961 cgccgcccct cccgcctcgc gtccggggac ccggctgacc cgctctcacc ctgccgcccg 1021 caggatggac acggtgggcg atgccctgga ggaagtgctc agcaaggctc ggagtcagcg 1081 caccattacg gtcggcgtgt acgaggctgc caagctgctc aacgtgtaag tggcccgcgt 1141 tccccgctcg cccacccacc cgcctctacc ccgcgccccc ctgagcgtat gcaactcacc 1201 gctgccccgc tgcccgcaga gaccccgata acgtggtact gtgcctgctg gctgctgacg 1261 aagacgacga ccgggatgtg gctctgcaga tccatttcac cctcatccgt gcgttctgct 1321 gcgagaacga catcaacatc ctgcgggtca gcaacccggg tcggctagct gagctgctgc 1381 tactggagaa cgacgcgggc ccggcggaga gcgggggcgc cgcgcagacc ccggacctgc 1441 actgtgtgct ggtgacggta agagacccgg ggctgcagcc agaatctgct ggggtggacc 1501 ttgtccgagc tgaacgctga tctcgcgggg gttgtgatag ggtacggagc gtgtctaagc 1561 tcgtgggtgg cctccagcgg cggaagatat ccctgtgagt cagcaagctg cccagctgct 1621 acctctgctt acctctgcac aacctcgcgt ggctaattct ttgagcagaa cagattagat 1681 aaagccaaat aaattcccgg tttacccttc gttaagaagt tagcttcatt cttcatttgc 1741 tgtcaaagct aaaggtagaa gtggtgtagg agacagacct tattaattcc ctggatatag 1801 atagatactc tgaagtgtac cgagtggaat tcttttcttt gctgctcttg agtttgttta 1861 aacgagctcc ctcctcccca ccgcttaaaa atgtgtggtg ggttgatagt gttttcagaa 1921 agaaccgtgc tgtctaggtc ttctggtttt ttgtttttgt actctttctc ttctgacttg 1981 tgtagggaag tgactgcctg gttacttgtt gctgccagta aaatgtgcta ttcatcgtca 2041 gggaagataa gagttcagca gctgcctgcc ctgcagccct caccgtgcac acagcctgag 2101 aatgcttttc aggagtcttt tacttatgac ccaccagggt acatgcttgg aagggaggtc 2161 ttaagaatgg taacctcatg cagaatgggg tagttaatat gcaaaactct gtgtagttcc 2221 gtgcacgtta caggtgttta gactatgcta gatgctgttt ttaacagaat gttcggggtt 2281 tggccctttc aaaagtagca caattagtag cagaagctga ttattttccg gggaagtaat 2341 tttttcaaaa tctgttcaca gaacccacat tcatcacaat ggaaggatcc tgccttaagt 2401 caacttattt gtttttgccg ggaaagtcgc tacatggatc agtgggtgcc cgtgattaat 2461 ctcccggaac ggtgatggca tccgaatgga aataactgaa ccaaattgca ctgaagtttt 2521 gaaatacctt tgtagttact caagcagtca ctccccacgc tgatgcaagg attacagaaa 2581 ctgatgtcaa ggggccgagt tcaactgcac gagggctcag agatgacttt gcagagggag 2641 agagaggtga gcctgaagaa ggaagctgcg agaaaagaga aatccaaggc aaaagggaca 2701 aaaactacaa agcactgcaa gaaagaaaac tgctaattta ggatggccag gttactttca 2761 aataagccaa atattgcttt gttgaaactt taaatgtata gcaatagttt gggtattttt 2821 tttctttttt ttttttggtc tttatgccct caaataaaag gaaagtaaag aggattaatc 2881 atattttcaa gccacagttt aaatgtattt tgatgagatg ttaaattctc agaagtttta 2941 ttataaatct tactaagtta ttttatgatg tgaaaggtta tttatgataa agtttttgaa 3001 gcacattatc taaaataaac tggtatggaa taattgtgtc atttctcaac gtgtggtttg 3061 tgttttacaa tattgattat tggacattct attgaagctt // LOCUS MUSHEPGFA 6751 bp DNA ROD 07-OCT-1994 DEFINITION Mouse hepatocyte growth factor-like protein gene, complete cds. ACCESSION M74180 NID g193831 KEYWORDS hepatocyte growth factor-like protein. SOURCE Mus musculus (strain BALB/c) DNA. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 6751) AUTHORS Degen,S.J., Stuart,L.A., Han,S. and Jamison,C.S. TITLE Characterization of the mouse cDNA and gene coding for a hepatocyte growth factor-like protein: expression during development JOURNAL Biochemistry 30 (40), 9781-9791 (1991) MEDLINE 92002017 REFERENCE 2 (bases 1 to 6751) AUTHORS Degen,S.J. TITLE Direct Submission JOURNAL Submitted (02-DEC-1991) S.J.F. Degen, Division of Basic Science Research, Children's Hospital Research Foundation, Cincinnati, OH 45229-3039, USA FEATURES Location/Qualifiers source 1..6751 /organism="Mus musculus" /strain="BALB/c" /sub_species="domesticus" /db_xref="taxon:10090" /tissue_type="liver" CAAT_signal 1096..1100 TATA_signal 1153..1158 mRNA join(1192..1337,1951..2098,2191..2303,2388..2502, 2587..2723,2805..2925,3007..3125,3269..3464,3624..3754, 3833..3935,4058..4194,4283..4318,4398..4506,4668..4745, 4865..5011,5092..5198,5297..5436,5578..5804) exon 1192..1337 /number=1 CDS join(1286..1337,1951..2098,2191..2303,2388..2502, 2587..2723,2805..2925,3007..3125,3269..3464,3624..3754, 3833..3935,4058..4194,4283..4318,4398..4506,4668..4745, 4865..5011,5092..5198,5297..5436,5578..5739) /codon_start=1 /product="hepatocyte growth factor-like protein" /db_xref="PID:g193832" /translation="MGWLPLLLLLVQCSRALGQRSPLNDFQLFRGTELRNLLHTAVPG PWQEDVADAEECARRCGPLLDCRAFHYNMSSHGCQLLPWTQHSLHTQLYHSSLCHLFQ KKDYVRTCIMDNGVSYRGTVARTAGGLPCQAWSRRFPNDHKYTPTPKNGLEENFCRNP DGDPRGPWCYTTNRSVRFQSCGIKTCREAVCVLCNGEDYRGEVDVTESGRECQRWDLQ HPHSHPFQPEKFLDKDLKDNYCRNPDGSERPWCYTTDPNVEREFCDLPSCGPNLPPTV KGSKSQRRNKGKALNCFRGKGEDYRGTTNTTSAGVPCQRWDAQSPHQHRFVPEKYACK DLRENFCRNPDGSEAPWCFTSRPGLRMAFCHQIPRCTEELVPEGCYHGSGEQYRGSVS KTRKGVQCQHWSSETPHKPQFTPTSAPQAGLEANFCRNPDGDSHGPWCYTLDPDILFD YCALQRCDDDQPPSILDPPDQVVFEKCGKRVDKSNKLRVVGGHPGNSPWTVSLRNRQG QHFCGGSLVKEQWVLTARQCIWSCHEPLTGYEVWLGTINQNPQPGEANLQRVPVAKAV CGPAGSQLVLLKLERPVILNHHVALICLPPEQYVVPPGTKCEIAGWGESIGTSNNTVL HVASMNVISNQECNTKYRGHIQESEICTQGLVVPVGACEGDYGGPLACYTHDCWVLQG LIIPNRVCARPRWPAIFTRVSVFVDWINKVMQLE" exon 1951..2098 /number=2 exon 2191..2303 /number=3 exon 2388..2502 /number=4 exon 2587..2723 /number=5 exon 2805..2925 /number=6 exon 3007..3125 /number=7 exon 3269..3464 /number=8 exon 3624..3754 /number=9 exon 3833..3935 /number=10 exon 4058..4194 /number=11 exon 4283..4318 /number=12 exon 4398..4506 /number=13 exon 4668..4745 /number=14 exon 4865..5011 /number=15 exon 5092..5198 /number=16 exon 5297..5436 /number=17 exon 5578..5804 /number=18 polyA_signal 5780..5785 BASE COUNT 1543 a 1800 c 1919 g 1489 t ORIGIN 1 agatctgatc ggccaggggc tcgaggggag tcaccgaacc cgcccggctc atagccaggc 61 cgcctctcac tcacccccgg cctcagcctc cgcgaccggc tcacaacatc cgcccagctt 121 ttcggctacg gcacccgtcc aggccaaacc gcgtgctcgc tcgagcgctg ctccagccgc 181 gcacgcgcat atgcacagac cgcaacaggc tggcagaaaa ccctcctccg tctcctacca 241 aggtgtttac ccgttttgcc tgatggtcca cctgtttcgc ccccaccttt cctagcccag 301 ccgtagcagg gactatgttc taatcggtcc ctaggtccac ctgtcttaac tcctaccttg 361 cctggaggag gcctgaccca catgcagcct gaaagaccac ttctgacagc agatttgcta 421 cctgtcacag ccgcgcacgc cccctccaga tggtcattga caccagatcc aatgggcagg 481 gttgcttagc ttaccctggt ttgacacttc tgaggggcga tgggatggat gctcctcgga 541 tgtgctgcta ggggtgtagg ctgactgccc tacagctggg actcagctga taaggcagct 601 tgaacaggga gaggcagcat tgggactggg gaaattgcag tcctcacttt acaagaagaa 661 actgaggccc agaaaagtat aatccagggg tctgggaaat cttggcaact cctgtatagc 721 agagtctttt ggcatagaag tgtcagtggt gatggcagcc actgtggtca ctagactctt 781 gacatgtgac ccgtgtaact gaaaatttca gtttttcact ttgtaaatcg taatcacata 841 gagtctgact actgtgatgg gtaccacacc tctacagtaa agcaggcacc agggactcca 901 tgcaacttct ggagcgcgtg tagcaacagc atgcgacctc agggatagat ggtggcagga 961 agacagtgga gtgatcttgg caagtctggg gattgcatag agtagacggg ctctgcctca 1021 gggacaccta acgtttccac acagaaccct cctaagtcct gcctaccaca cagagaggcc 1081 tctcaggatc cagctgcaat gagacagcac tcgagggcct caaacctagg ctccacctag 1141 caactgtcac cctatgtgtc agtcaagtcc aggcaggttc agagaggggg tgtggagcca 1201 gagtcaccca atcctgaagg gacagatttc accatttccg ggatggggct gtggtgggtc 1261 accgtgcagc ctccagctta ggagaatggg gtggctccca cttctgctgc ttctggtaca 1321 gtgttcaagg gctcttggtg agtgtcaccc accctgatcc cagtctgcct tcacgaggga 1381 gttcacccct ggtctacata gctattctca ttgagagttt acttttcttt gggtccggga 1441 tcagtgacct tggcctgttg agcagagctg agaaggcctg ggaattcaaa tacacacagt 1501 ctgatcagga ctacattaga gcatactgta gcccagaggc agtctttcaa ccagagaaac 1561 tatccaaccc agaaggcagg gctcctaagc ccgatgcacc actgtaactt atgcctttat 1621 tctggtgaga ggccagactt ggggccttcc ccaggaagtg tccaagcatt ctcatctgag 1681 gggtgagaag gggcaagtgt cacaaggcca acacactgtc acccaaattc tcatggagtg 1741 gatgtggtag accagagccc agtgccaggt ctcctagcag atgggcaata atcactgtat 1801 ctgggcctcc ccagctcact ggcatgaagg gacttgctgg gcccttgaaa atatacataa 1861 ggcctgcccc aaagaccttg tattagattc cctaaatgaa caaaagatag ggtgtgttaa 1921 agtactaatg cgctcatgct caccacgcag ggcagcgctc accactgaat gacttccagc 1981 tgttccgggg cacagagtta aggaacctgt tacacacagc ggtgccgggg ccatggcagg 2041 aggatgtggc agatgctgag gagtgtgcta ggcgctgtgg gccccttctg gactgtcggt 2101 gagtggctaa gtagcctaga tatggctgag ggcatgagaa tctgggttgc cagttaactt 2161 tgtgtctgcc accccccccc ccttctccag ggccttccac tacaacatga gcagccatgg 2221 ttgccagctg ctgccgtgga cccagcactc gctgcacaca cagctatacc actcgagtct 2281 gtgccatctc ttccagaaga aaggcaagtg gtggtgagga ggggaaacag gctgagtaac 2341 aggggccacg aggctcaggc ctgttgacct tcctccattg cttccagatt atgtgcggac 2401 ctgcattatg gacaatgggg tcagctaccg gggcactgtg gccaggacag ctggtggcct 2461 gccctgccaa gcctggagtc gcaggttccc caatgaccac aagtgagtca gacacttcag 2521 gtcagaccgt taggcctgaa gcagtattcc cccagtgtgc actgtagtaa gaatctttgt 2581 ctacaggtat acgcccacgc caaagaatgg cctggaagag aacttctgta ggaaccctga 2641 tggggatccc agaggtccct ggtgctacac aacaaaccgc agtgtgcgtt tccagagctg 2701 tggcatcaaa acctgcaggg agggtaagcg gctggggtca atcaagccta aggagggagt 2761 gataggcctg cccccactta gaagtgcatt ggccctgttt ccagctgttt gtgttctgtg 2821 caacggtgag gattaccgtg gcgaggtaga cgttacagag tcagggcggg agtgtcaacg 2881 ctgggacctg cagcaccccc actcgcaccc tttccagcct gaaaagtatg taggcagaat 2941 ccttattttg agggtggggc tcagctctac tgggactgag tcccagagtc ttgttactgc 3001 tttcaggttc ctagacaaag atctgaaaga caactattgt cgtaatccgg acggatctga 3061 gcggccctgg tgctacacca cagacccgaa tgttgagcga gaattctgcg acctgcccag 3121 ttgcggtagg ctgcagggtc agggtctagg aaggagcttg gaaaaaactg gcgggcacgg 3181 ttcaactggg agaggtacta gggaagttag gcgtgggtag agagcaaagc ctgctgagta 3241 ccagagacca attccagttt tcggtcaggg cctaacctgc ctccgaccgt caaaggatcc 3301 aagtcacagc ggcgcaacaa gggcaaggct cttaactgct tccgcggaaa aggtgaagac 3361 tatcgaggca caaccaatac cacctctgcg ggcgtgccct gccagcggtg ggatgcgcag 3421 agtccacacc agcaccgctt tgtgccagag aaatatgctt gcaagtgagg tgacaggccg 3481 gagcagggag agtgcacctg tgggtggagg cagagcgtat gcgaaggtgg gacctggggg 3541 cggagtcaga ggttccagcc tactgcgggt tggctggtgg gctaggtggg accccactct 3601 cgataaggga agtgactact cagggacctt cgtgagaatt tctgccggaa tcctgatggc 3661 tccgaggcgc cttggtgctt cacatctcga cctggtttgc gcatggcctt ctgccaccag 3721 atcccacgct gcactgaaga actggtgcca gagggtgagg ctggagcggg ggtacagaat 3781 ctgggcagga atcaacccag ggctgaccac cgctcttgcc tgcccaccac aggatgctac 3841 cacggctcag gtgaacagta tcgtggctca gtcagcaaga cgcgcaaggg cgttcagtgc 3901 cagcactggt cctctgagac accgcacaag ccacagtgag tgtgtgctat gtgcagatag 3961 ggccttaact ctagggcaga ataccttaag ttcttgtgag cctaaagagg gtctaagtgg 4021 cctgatgtgt ccccctacct cctgccccta catctagatt tacacccacc tcggcaccgc 4081 aggcgggact ggaggccaac ttctgcagga atcctgatgg ggatagccat gggccctggt 4141 gctatacctt ggacccggat atcctgtttg actactgtgc cctacagcgc tgtggttagt 4201 gcttaagact tccccttgtc tgggtttcaa acctcacctc catagactgg ctcccttaac 4261 ctgagtgaac ttgatcttgc agatgatgac cagccaccat ccattctgga ccccccaggt 4321 atggggttgg gccaattgtg ggtacacagt ctttgaccct gaccctcact gaaggtttca 4381 tcctgcccca tccccagacc aggtggtgtt tgaaaagtgt ggcaagagag ttgacaagag 4441 taataaactt cgtgtggtgg gaggccatcc tgggaactcc ccatggacgg tcagcttgcg 4501 gaatcggtga ggcctaagcg cttatctcaa ggagtggagg ctggaaactc tgtggcttta 4561 tcagtagaag atggatgcct ggccttgtac caaaaggtcc ttgtcagaaa tgacagtcta 4621 gcatgtgtcc caggactcag tgtggcttct catctttact cctctagaca gggccagcat 4681 ttctgtgggg gctccctagt gaaggagcag tgggtactga ctgcccggca atgcatctgg 4741 tcatggtgag cagactgggg actcctagcc tacctctccc tgccattgtc tgtcccacaa 4801 gcaaactaaa ttgtgacagc tgattgggag tcaagcatga actagcagag tctctttctc 4861 ccagccacga acctctcaca ggatacgagg tatggttggg tacaattaac cagaacccac 4921 agcctggaga ggcaaacctg cagagggtcc cagtggccaa ggcagtgtgc ggccctgcag 4981 gctcccagct tgttctgctc aagctggaga ggtatgtgga tgtgttgaga gggtgtgagg 5041 cagggctagc ctcatggtca taggtcctga aaaccctcat tcccactaaa gacctgtgat 5101 cctgaaccat cacgtggccc tgatttgcct gcctcctgaa cagtatgtgg tacctccagg 5161 gaccaagtgt gagatcgcag gctggggtga atccatcggt aagagcacag tgcatagaca 5221 tggactgcta tgggccggga ggtccagcac tggttttggc tcaagggtcc cctccttatc 5281 attgtctgta cttcaggtac aagcaataac acagtccttc atgtggcctc gatgaatgtc 5341 atctccaacc aggaatgtaa cacgaagtac cgaggacaca tacaagagag tgagatatgc 5401 acccagggac tggtggtccc tgtgggggct tgtgaggtca gtgggagagc ccctgggcca 5461 gcctgggaag ggcttgggag ctgaaattat agtacttgat tgccaagggg gtgggatgtc 5521 aggagagggt agtcactgcc gaggtccaga gccttcaccc gtttttctac ctgccagggt 5581 gactacgggg gcccacttgc ctgctatacc catgactgct gggtcctaca gggacttatc 5641 atcccgaaca gagtgtgtgc acggccccgc tggccagcta tcttcacacg ggtgtctgtg 5701 ttcgtggact ggattaacaa ggtcatgcag ctggagtagg cctgcttttg agcccttaga 5761 gatgtcaaga cttctcaaac ataaagcggc cttttctctc tgtctgtata gagtgcttct 5821 tagtttctgt ctctagggaa ggtgttgact ccttgcaaga ggctgtgtgg cttaagacca 5881 gcacactcta ggctaagtgc tctgatccca gaacaacttc aaaaggtatg tactgtgtgt 5941 gggcagggtg caccatcttc cagaggcact cctgggaatg caaggacagt gcagaagttc 6001 ccagcccatg gaccagagca gaaagagtga tgtaggtcta caccagtccc gtttggctag 6061 gacaggcagg ggttgagtct ctcatggctt ctctctgtca catgacaggg atgaatacac 6121 tgtggatatc aaaccaagga cctagggttt ctgaacccca aggtagaggc tggggctggg 6181 gatggcttgt acaaagtacc agcacagacc aggctctgtg tcctccttta ttatgattag 6241 agtccatagt cctctgccca ctcattcgga gtccagagcc caggaaacct ctaggcagtt 6301 ctgccagatc ctggggctta ccgaagagca aagttcgaga cggactgccc agctcacaaa 6361 gagcaacagg gcttcagctg cccaagtgtg tgtgtagcca aagcacagtg ttcatgaagc 6421 tgtctgattc cacctccacc tctgacagcg catgggtgct cttgggatac agcaggagcc 6481 tgtatgagca gcaacacatg acattggagg gtcctgtcct gtttacctgc caccagctgc 6541 ccaactatcc tgtacactca ccggacaggc acattccggg ccttgagggc atggtaatac 6601 tccagaccct gcttgaaggg tacacgccgg tcctcctggc ccagcatcag taacactggt 6661 gtctttacct aggtgtatgg gaggcaagga gctgtggcga gctgagctct ggactctgga 6721 ggaatgggtg gcacaaggat acctgggtac c // LOCUS MMU18295 2893 bp DNA ROD 15-JUL-1995 DEFINITION Mus musculus histone H1(0) gene, complete cds. ACCESSION U18295 NID g897829 KEYWORDS . SOURCE house mouse. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 2893) AUTHORS Dong,Y., Liu,D. and Skoultchi,A.I. TITLE An upstream control region required for inducible transcription of the mouse H1(zero) histone gene during terminal differentiation JOURNAL Mol. Cell. Biol. 15 (4), 1889-1900 (1995) MEDLINE 95198706 REFERENCE 2 (bases 1 to 2893) AUTHORS Dong,Y. TITLE Direct Submission JOURNAL Submitted (07-DEC-1994) Yonghe Dong, Cell Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461, USA FEATURES Location/Qualifiers source 1..2893 /organism="Mus musculus" /strain="BALB/c" /db_xref="taxon:10090" /clone_lib="Charon 4A" CDS 1016..1600 /codon_start=1 /product="histone H1(0)" /db_xref="PID:g897830" /translation="MTENSTSAPAAKPKRAKASKKSTDHPKYSDMIVAAIQAEKNRAG SSRQSIQKYIKSHYKVGENADSQIKLSIKRLVTTGVLKQTKGVGASGSFRLAKGDEPK RSVAFKKTKKEVKKVATPKKAAKPKKAASKAPSKKPKATPVKKAKKKPAATPKKAKKP KVVKVKPVKASKPKKAKTVKPKAKSSAKRASKKK" BASE COUNT 667 a 757 c 890 g 579 t ORIGIN 1 aagctttggg gggctcgggc tcagagagca atcaactcgc cgaggtccgg gggtgcatcg 61 aggggagaag gcgactcaag agggggaagg gacaaagttt gaaagtgccc ttccaactct 121 ttcagggaaa gtttccttga gggcctcgca ggcagagagg gggtaggggt gtcagaggag 181 ctgtggtcgg gtgacccccg agaggtgacc ggaccctggg gaggcgaccc ctcccccctt 241 gtcccgactc ggcccggctg ggagtcgccc agctcagggc cgcgtgtgtt agttggggcc 301 gcttctgagc cgctccgcca ggttggggga ccccgcaggc atccgggagt ggggctcctg 361 gtagtcccag gggggagctt tccgacgggg ggtctcaagc cccggggctc cccgctccgc 421 agagccagcc ccccgcaaag gggaaatgtg ccggcgcccc agcgctcctc ggcgccgttt 481 gaggctcgtg gcgggcgcgg ggggcggcgg ccggggcgcc agggtcccca gaagggaagc 541 ctagggagcc gggcagagac ctcgagagac ggcgggcagc gggaaggccc gggccgccgg 601 gaaaggtcga gtttgctcgg cggaagaaac acagatggcg gcagcgcggc gccattccgg 661 gccgggagca ggcagccagc agcccagtcc tcaccgcggt ccgcccgccg ccgctaaata 721 cccggatgcg ccgcccgagc gccagacgca gagctgggaa aagggaggct gtggaggcgg 781 aggcagcgcc aggcgccggg cgagcgaccg aacggtgggg gctgggagcg cagagcagct 841 cgcgacccgc gccgggagga caggagccac gcgtagcccg cgtccccggc agccgcactt 901 gcgtctggcc tgctagtgga gcgggagagc agatcgcgag tcaggttctg cacagcctcc 961 ggcgagggct ggcccatcgg aaggctcctt gaacagtggg agcaggccgg ccaccatgac 1021 cgagaactcc acctccgccc cggcggcgaa gcccaaacgg gccaaggctt ccaagaagtc 1081 cacggaccac cccaagtatt cagacatgat cgtggctgct atccaggcag agaagaaccg 1141 tgccggctcc tcgcgccagt ccatccaaaa gtatatcaag agccactaca aggtgggtga 1201 gaacgccgac tcccagatca agttgtccat caagcgccta gtgaccaccg gtgttctcaa 1261 gcaaaccaaa ggggtgggcg cctcggggtc cttcaggctg gccaagggcg atgagcccaa 1321 aaggtcggtg gctttcaaga agaccaagaa ggaagtcaag aaagtggcca ctccaaagaa 1381 ggcagccaag cccaagaagg ctgcctccaa agccccaagc aagaaaccca aagccacccc 1441 tgtcaagaag gccaagaaga agccggctgc cacgcccaag aaagccaaaa agcccaaggt 1501 tgtcaaagtc aaaccagtca aggcctccaa acccaagaag gccaaaaccg tgaagcccaa 1561 agccaagtcg agtgccaaga gggccagcaa gaagaagtga agactttgct tggggacact 1621 ccttcctccc tcctgttttc tgtaaataca tttcctcact tgattccatc tgcaaccctt 1681 tgcccattct attctgactt tattaaagag gacagagttt ggatccctca tacagacatt 1741 gtggaatgac tcctttttcc ttaacctatt gtgcaaggac agcaaacaga cctcatcttt 1801 gtaatgatgg agacgtactt ttttcttgat ttgatattaa ccttcttacg gggttaggga 1861 tgggaggggg gaggatgtgt gtttcagtcg gtggtttgtt tactatgaag gaactggcaa 1921 agttctggct aggtgagggg acccaggaac taaggtttgt cttccaagac tttcttagac 1981 tgcttgtccc tcgtgagctt ttcaaaacct ttgatgggga gcaggtgaca ccccacctag 2041 ctggccaagg agggagggaa aattccctgg ggctgcccta ccaacggtgg taagttggag 2101 acctggttgc tttttctctt ctgccctagt gcctccccat tgtctaaagg ggcaaagggg 2161 tccaagtgac agctggttag agaagccata gcttctcaca accaggatct agccattggg 2221 aaggaggggt ctttttcagt agtctctggt taaatgcgag tggacttagg gggaggggtg 2281 ggtaatcagc caagtgcctc agtgtgccta tggaaacttg ggttttttcc acacgattga 2341 tggattgcgt cctagcagga ctttgtacgt ttcctttctt ccctttcctg tgtaagatgt 2401 ggctttgctt ggtgccgctt caggtctacc agctgccact taaaaccctc caacctcttt 2461 tactctttga gttttttttc taagtagcgg aggaggggga gaggcaggga gtggactgta 2521 agacatactc cagttgattc gaatttgcta ggtagcttta gagaggcagg attgtgtgca 2581 tgtgtgtata tgtatatatc catatctaag actaggactt agtctcactt cgggagctgg 2641 gagaaaaaaa tctgtacagt tgtctttctc ttattttaat aaaattagaa actcgcgcac 2701 cctaccccac ccctttttaa acaagtgtaa ctagtgcccg ggagaaatta ctgtggttgt 2761 aattttaaaa ctttaaaata aaactggaaa ggaaaaaatc tgagtggtgc gtttttcttt 2821 aactggagga gagaggagcg atctgacggg caatgtgggg acgccacgtt cgcgccagag 2881 aatagcaaag ctt // LOCUS MMU01213 3279 bp DNA ROD 03-AUG-1994 DEFINITION Mus musculus 129 olfactory marker protein (OMP) gene, complete cds. ACCESSION U01213 NID g457940 KEYWORDS . SOURCE house mouse. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 3279) AUTHORS Buiakova,O.I., Rama Krishna,N., Getchell,T.V. and Margolis,F.L. TITLE Human and rodent OMP genes: Conservation of structural and regulatory motifs and cellular localization JOURNAL Genomics 20, 452-462 (1994) MEDLINE 94307732 REFERENCE 2 (bases 1 to 3279) AUTHORS Margolis,F.L. TITLE Direct Submission JOURNAL Submitted (02-SEP-1993) Frank L. Margolis, Roche Institute of Molecular Biology, 340 Kingsland Street, Nutley, NJ 07110-1199 USA FEATURES Location/Qualifiers source 1..3279 /organism="Mus musculus" /strain="129" /db_xref="taxon:10090" /clone="MOMP1, MOMP2" /clone_lib="genomic library in Lambda Dash II from R.Kinloch" /chromosome="7" enhancer 130..140 /note="distal Olf-1 binding site" enhancer 234..258 /note="UBE, potential NF-1 binding site" enhancer 654..664 /note="proximal Olf-1 binding site" CDS 891..1382 /note="intronless open reading frame" /codon_start=1 /product="olfactory marker protein" /db_xref="PID:g520741" /translation="MAEDGPQKQQLEMPLVLDQDLTQQMRLRVESLKQRGEKKQDGEK LIRPAESVYRLDFIQQQKLQFDHWNVVLDKPGKVTITGTSQNWTPDLTNLMTRQLLDP AAIFWRKEDSDAMDWNEADALEFGERLSDLAKIRKVMYFLITFGEGVEPANLKASVVF NQL" polyA_signal 2937..2942 BASE COUNT 650 a 893 c 961 g 775 t ORIGIN 1 atctctgtct ccaccactca gaggcactca cagactccag ttctgccatc tgtccacata 61 cactgcctgg gttccacctc ccactgacat tcccttgtag gtccccagct tcttccctgg 121 cctcacgtct cccatgggag gtggaggatc agtttaggcg gaatggctgg taggattttg 181 gtggacgtga gagccaatcc tgtggctatg tggttggatc gatcaaacca cggcctctgg 241 gagccgagcc agccgtctgt ctggcagatg atttgggatt tgagagctgc aggttcagat 301 gggaggtgac agtgggctgg gtcctgatgg tgataaagga gagggagaca ccagggcacc 361 tgacaggacc tgacaggggc tatgacagag tggggtgggg ggtgcggagg aggaggcaac 421 catggaaagt tggcttggct gactacagaa aactgaaatg tgtgccaccg gtgctacccc 481 gccctgccac ctctttcctg gacagtcttc ggttacctcc atgtgtctat aacctcacct 541 atctcccaac agcgctgtgg agtattccat tcttcacaaa caagcaaagc tccagcttgc 601 cactaccact gtagtcaagg tggttgccac agcagttgat atcagtgctc tggtccccag 661 ggagcccatc accctccagc ctgcctacag cacagcttta ccagttagga ggcagttgga 721 cacacacact cctgtgtccc ctgttctgag aactgggtgg ggccagaaag gctggaaagg 781 gaggcgggcc ttcaggtggc ctcttctctt ggcatcggag gatccagccc acttgattcc 841 ctgacgctgg tggtagtggt ggcagtggca atcgctgtag cacttgggcc atggcagagg 901 atgggccgca gaagcagcag ctggagatgc cgctggttct ggaccaggac ctgacccagc 961 agatgcggct ccgagtagag agcctgaagc agcgtgggga gaagaagcag gatggtgaga 1021 agctgatccg gccggctgag tccgtctacc gcctcgattt catccagcag cagaagctgc 1081 agttcgatca ctggaacgtg gttctggaca agcccggcaa ggtcaccatc acgggcacct 1141 cgcagaactg gacgcccgac ctcaccaacc tcatgacacg ccagctgctg gaccccgccg 1201 ccatcttctg gcgcaaggaa gactccgacg ccatggattg gaatgaggca gacgccctgg 1261 agtttgggga gcgcctttct gatctggcca agatccgcaa ggtcatgtat ttcctcatca 1321 cctttggcga gggcgtggag cctgccaacc taaaggcctc tgtggtgttt aaccagctct 1381 gatgacagcc ctggctgccc tacccctggc cccacctctc ccttgcctgg atctccttcc 1441 tcatgtgtat ttgggggaca ttcttctagc tgctcctcct gtgctcatct tggccagagt 1501 tcccccgagt gctacatccc ctccttttcc ctggtgccag tgctgcggct cacagtgatg 1561 tcccatggct ccgtagtcta gatctagaag ccggatgctg ctactataga ctgtagaggc 1621 cttttgggtc cacgtgggaa gatggatggg ccccctgtgg tgaagagcgg gactgagaga 1681 taaagagact gaccaagaga tgcaaacggc cagcactgat tcctcccttc agggacggga 1741 gactgagact ggacaggaac accttccggg gaacctggca agaaggcgtt tgccctgctg 1801 gccaaagctg gagccaggag gcgaatgccc agcctctggc agcaggaagg ttctcctccc 1861 agtgtcggca gcagcccgct gtgaccttag ggccttcaag acactgggca ggatgacagc 1921 ggggcttgat ctgactgctt ttccaggtct gggcctggtt tttatggaga agtgagagag 1981 tgtgtagaaa ctgaaacaac tctagccacc cacgctcata tgggtattga gagatggcat 2041 aactatttgt atggatgtgg gcctgagggc tagtcttggt gaggagtaag gctaacttta 2101 gtttaattat tgagctggta ctggcttgtg ggcttggtgg aggtgatcct gactgaggcg 2161 tccttggtgc agtgcttttt gaactgggag actgagactc gaatggtgta gcagagttag 2221 aggggtccag ggctctgagc tagcaacagt gatgtccctg ttaggaaggc tggcatttgc 2281 tgctcgctgg tgttgtgccc tgctgtcacc cccctgggca tatcctggct gttctcctgg 2341 agtgcagacc cctaagtaag gcttgggtgg gggcagttag gatgcctgac gtctgaagtg 2401 ggctggagct atctgactgt gatgcctaaa ctgacaggaa aacggtggca cagttagcag 2461 gttcagctct accccaagtc tcattgtccc tcgccttgca catcctgaaa gccttccatt 2521 gcctgttacc tagcatcagc cagaggtacc tcagcagtgt cccctgactg tctcaaggct 2581 gcctccctcg ggcatactga aggtaggatc tgtcccagct ggtgagctgc caggactgca 2641 aaccccagct caggtgcagg attctggagg caggagatag gctgtggtac cggtgtctct 2701 tgagccggtg cctctgctcc ataacatgct tgccgaagca ctggccggtg cttctggatt 2761 ctgctgactc tagggagcca cacccagaca gtgcctctgc ctttctgctt ctcttcctga 2821 cctctcccta cagctttaga gacccctttg gttcacactg cctgtgcccc aactctgcct 2881 cactcggatc cgtctgccct gtggggacat gagtgtctct gttgtgcctg tttcacaata 2941 aagactgtgt gccctcccct ctgtggtgtg gtgtgtgtgc ctccgtggtg tggtttgcac 3001 atcttgctgc aagcccatag catcagaatc cttctctcat gggccctgta gctctgagca 3061 actccaccct gccagccttg aggatgaggc cgagtcgtga gatctctcat gaggattgag 3121 tttcacctgt cagccaggtt tcctggctgc cctgcaggta ccaatcctct agggtatgaa 3181 agagcatgct aaagctatgc ttggggcagg ggagtgtagc gggtaggact gatactaatt 3241 tagcttggtc ttggtcactg tttggctgtg ccctctaga // LOCUS MUSCTNC 4194 bp DNA ROD 15-MAR-1990 DEFINITION M.musculus slow/cardiac troponin C (cTnC) gene, complete cds. ACCESSION J04971 NID g192837 KEYWORDS troponin C. SOURCE M.musculus (strain BALB/c) DNA, clone MGcTnC7. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 4194) AUTHORS Parmacek,M.S. and Leiden,J.M. TITLE Structure and expression of the murine slow/cardiac troponin C gene JOURNAL J. Biol. Chem. 264, 13217-13225 (1989) MEDLINE 89327294 COMMENT Draft entry and computer-readable sequence for [1] kindly submitted by J.M.Leiden, 13-JUN-1989. FEATURES Location/Qualifiers source 1..4194 /organism="Mus musculus" /db_xref="taxon:10090" TATA_signal 782..786 prim_transcript 805..4194 /note="cTnC mRNA and introns" CDS join(849..872,2291..2321,2548..2694,3049..3163,3766..3902, 3990..4021) /partial /note="slow/cardiac troponin C" /codon_start=1 /db_xref="PID:g387137" /translation="MDDIYKAAVEQLTEEQKNEFKAAFDIFVLGAEDGCISTKELGKV MRMLGQNPTPEELQEMIDEVDEDGSGTVDFDEFLVMMVRCMKDDSKGKSEEELSDLFR MFDKNADGYIDLDELKMMLQATGETITEDDIEELMKDGDKNNDGRIDYDEFLEFMKGV E" exon <849..872 /note="slow/cardiac troponin C" /number=1 intron 873..2290 /note="cTnC intron A" exon 2291..2321 /number=2 intron 2322..2547 /note="cTnC intron B" exon 2548..2694 /number=3 intron 2695..3048 /note="cTnC intron C" exon 3049..3163 /number=4 intron 3164..3765 /note="cTnC intron D" exon 3766..3902 /number=5 intron 3903..3989 /note="cTnC intron E" exon 3990..>4021 /note="slow/cardiac troponin C" /number=6 BASE COUNT 967 a 1014 c 1176 g 1037 t ORIGIN 1 ctcacagaat gcagctgact caaacccgaa gtgaacactg attgttgtcc ttgaaagatg 61 gaaggttgcg gcgtggtaga tcccgctgaa gggcttcagg ccagcctatt ctttgtgggc 121 cagtgacttc ttccattacc agtgtcatat tatactctga accttgcata ggaagtgttt 181 cttggcagga cttacactgc cagcctctgt tggcatgcta ccatctctca gggggtgtaa 241 tataaggtag ggattctccg gtcagcttga cctcaggcaa gtcactgtga ttccatagaa 301 gaccgaactg tttacttcac atttcctgtg tgctcatatt ttcattcatt caggaaactt 361 tttcctgagg ttgttgagga tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtattgc 421 atttaggtgc ttgctcccaa acctgaaaac ttgagttcag tcctgggttc ctgggaacct 481 aggtagcaga gagagataga tgactcctgc aagttgtcct ttgacttcca catgcactca 541 taaataaaag caatacattc tttttttttt cttgagactt ccctgggggt ggggtggggt 601 ctggggaaca gatggcaaga gacctggtcc ttgcagtcct tcatagctcc tggccctagg 661 cagatgatac agggactttg gccagcctga gattacaggg accagggagg gggtggagga 721 tattccaggc agtggtgggc tgggctggga atgtagcagg aggtggaggg aggcagggct 781 atataagccc aagcagaggg ctggctggct ggcaacccca gtagcctgtc ctgtgagctg 841 tcgccagaat ggatgacatc tacaaagctg cggtgagaga tgaatcttta aggggttagg 901 gtgggtaggt ccattggggt tcccttagct aaggatggag gacaggggta cactagggaa 961 gtgatggggg actcaaaagg gtccccacac acctgtaacc cagccccttg agctgacatt 1021 tgggggatgc agaaagggta gcaattcagg ggtttgtcaa tgtggctagg ggtatggtct 1081 cagctttcag catcctgtgt ctccctctgt ttctaacctg tcactgtctt gagtgtattt 1141 gtgtttttat tctttagaca tctgttggtc tttcacaagg tctttccatc tctgttcctt 1201 cctcacgatg tctctgagtc tttgtctctc tgtgtgtctc caggtctgcg cctcgcaccc 1261 cctcccaagc ttccatttct cacaatctgc ttcttacata ccagatctgc ctctctcctc 1321 tagcccctga gctggatgag gaggctctag aaggtgggag ggaattgtga gagggctgga 1381 gaacttgagt cctgggatgg gaaagatgct ttgaaggtac aggcctgagg agagggaggg 1441 aagaaggaat ggcttgttgg aacctcagtc cctggcaggc tcaggggtta ggtaggagtt 1501 gagagtgata agagtcaggg ttagagaacc agtgacccta agactgttcg atttcctggg 1561 gacgctgagg gacttttgat atacctgctt cagagacaag aaaatcctag caaacagctg 1621 ttctcaggaa attctctccc aacccccgga atatggcccc acttgtgagc agggaagggc 1681 tgcagctggt tcctatggta tccttggcta tcttgtagga gtctgtggag ctggagtagg 1741 ggggtggccc tccacccagt gcccatacaa ggcctggcgg ggctggggac taatttggct 1801 cctgaggcag caccagtagg ccaggggcag ataacactgc cccaccccct gcataccaaa 1861 gtccccagca caatcaccag gtttaacttt gtcccccttt aaaaatagct cagtggccac 1921 cctggtcagg ttacagtggg tggctttgct cgcccgcaca ttcgttttat tgtctcaacc 1981 tgagggacag ctgtctctca ggccatgcag cttaagtttc attaggatga cataaaggac 2041 atgcaatggt caatgactat tgtcactcag acaccttggt cctgggaggg gtgcctgcct 2101 gtcaccttgc cctagcccag cctctgagaa accatctggg tcaccttagg gcagggaagg 2161 aggactttgt aacctgggat atattctcct tctgttatca ttcttcccac ctgtagaatg 2221 gggtcagtct ctaagtctgc taaggcctga gcttgggtcc tgaggtcctg agttcccctc 2281 ttctctctag gtagaacagt tgacagagga gcagaagaat ggtaagtgtc tcccattgag 2341 gggacagcag ggctgggggg gcgtagaggg gggacgcctt tccaggaatg aaccccagat 2401 tagaggctat aatctgaggg ttctccttgc tgtgacctca aagaactctc gaagggaggt 2461 ggacaatttc ctggtggccc cagggacctg gattcctagt gtcctcctgg ctgctgatcc 2521 aatcttctca ccctctgttg gacccagagt tcaaggctgc ctttgatatc tttgtcttgg 2581 gcgcggagga tggctgcatc agcaccaagg agctgggcaa ggtgatgagg atgctgggcc 2641 agaaccccac acctgaggag ctgcaggaga tgatcgacga agtagacgag gatggtgagt 2701 cccttcaccc actccagtct ccacacgtgt atgtagcacc ctgggttcac tggcccagga 2761 gctacctccc ctccatatct tcaccatctc atccagaaac ttggcaagta gctcatactg 2821 tgattccaag aaggctgagg tagtagatct gctgtgagtt tgaagtcagt ctgattgggt 2881 cacatagctc tctatggtcc agtccatctt ggactgcaga gcaaggttct atctcaaaca 2941 agaggagtgg gcaaaggaac acactagccc tatagtgtgg ggaggagggg ggggcgtgga 3001 ttgctggaat gctaactgtt ctctctccgg ccctgctgcc taccacaggc agtggcacag 3061 tggacttcga tgagtttctt gtcatgatgg ttcggtgcat gaaggacgac agcaaaggga 3121 agtctgagga ggagctgtcg gatctcttcc gcatgtttga caagtgagga cttgcttggc 3181 ttctgaccct ggcccaagca gagaggagaa ggggtggttt cagctgggca gtggtggcgc 3241 acacctttaa tcccagcact cgggagatag aggcaagtga atctctgagt ttgaggtcaa 3301 cctagtctac agagtgagtt ccaggacagc tagggctgca tagaaaaacc ctgtcttgaa 3361 aagaacaaaa gcaaaaacaa acaagaaaga tagaaagaaa gaaagagaga gagagagaga 3421 gagagagaga gagaaggaag gaaggaagga aggaagaaag aaagaaagaa agaaagaaag 3481 aaagaaagaa agaaagaaag aaagaaagaa agaatagttt cgggtagaca ccctgcagtc 3541 tttgcatcca atgagaggtg agagatagaa tgtgatctcc taatgcccct gccctgcctc 3601 ctggggttgt atatgtgggt ttggctcttt ccctcctccc cttccgtctc ccccccctcc 3661 acatcctccc ctaccctctc cctccctcct ccccctctcc tctcccctcc ccctcctccg 3721 catcctcctg ggctctgaga cgctccttgc cctccctccc tgtagaaacg ctgatggcta 3781 cattgactta gatgagctga agatgatgct gcaggccaca ggtgagacca ttacggaaga 3841 tgacattgaa gagctcatga aggacggtga caagaacaac gatggccgaa ttgactatga 3901 cggtgagtgg gtgggaggac caggtgccct cttgttcatc tttcacaggg ctcctaccac 3961 tgacacatac atcaggccct cttccacaga gttcctggaa ttcatgaagg gtgtggagta 4021 gatgctggtc ttgcacggtt gcctgcgcct gttctccccc tccacccaga ccccgtggta 4081 ggagtgcagc tgggctctct agactctgag cctgcctgtg tccttgaacc ttggccttcc 4141 ggactttctc tccccattcc tgtcctgggg aacgcaaata aatccttgct cccc // LOCUS MMLYL1 3678 bp DNA ROD 02-AUG-1991 DEFINITION Mouse Lyl-1 gene. ACCESSION X55055 NID g52962 KEYWORDS Lyl-1 gene. SOURCE house mouse. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 3678) AUTHORS Kuo,S.S. TITLE Direct Submission JOURNAL Submitted (01-FEB-1991) S.S. Kuo, Stanford University, School of Medicine, Lab of Experimental Medicine, Dept of Pathology, Stanford CA 94305, U S A REFERENCE 2 (bases 1 to 3678) AUTHORS Kuo,S.S., Mellentin,J.D., Copeland,N.G., Gilbert,D.J., Jenkins,N.A. and Cleary,M.L. TITLE Structure, chromosome mapping, and expression of the mouse Lyl-1 gene JOURNAL Oncogene 6 (6), 961-968 (1991) MEDLINE 91296401 COMMENT Sequence is homologous to human LYL1 gene and is a candidate oncogene. FEATURES Location/Qualifiers source 1..3678 /organism="Mus musculus" /strain="BioA" /db_xref="taxon:10090" /germline /dev_stage="adult" /tissue_type="liver" /clone_lib="BioA liver J1 mouse genomic" /clone="M18, M4" /chromosome="8" /map="30.9 +/- 0.9 cM" exon 591..938 /gene="Lyl-1" /number=1 gene 591..3257 /gene="Lyl-1" mRNA join(591..938,1591..1947,2039..2128,2844..3257) /gene="Lyl-1" intron 939..1590 /gene="Lyl-1" /number=1 exon 1591..1947 /gene="Lyl-1" /number=2 CDS join(1615..1947,2039..2128,2844..3257) /gene="Lyl-1" /codon_start=1 /db_xref="PID:g52963" /db_xref="SWISS-PROT:P27792" /translation="MCPPQARAEVGSAMTEKTEMVCASSPAPAPPSKPASPGPLSTEE VDHRNTCTPWLPPGVPVINLGHTRPIGAAMPTTELSAFRPSLLQLTALGRAPPTLAVH YHPHPFLNRVYIGPAGPFSIFPNSRLKRRPSHSELDLADGHQPQKVARRVFTNSRERW RQQHVNGAFAELRKLLPTHPPDRKLSKNEVLRLAMKYIGFLVRLLRDQTAVLTSGPSA PGSRKPPARRGVEGSARFGAGHRVEAARSQPVLPGDCDGDPNGSVRPIKLEQTSLSPE VR" intron 1948..2038 /gene="Lyl-1" /number=2 exon 2039..2128 /gene="Lyl-1" /number=3 intron 2129..2843 /gene="Lyl-1" /number=3 exon 2844..3257 /gene="Lyl-1" /number=4 BASE COUNT 851 a 1043 c 1049 g 733 t 2 others ORIGIN 1 gaattcagaa gatcactcca gaatacctgg cagtgaggga taccacctag ttgtggtatg 61 gctgagtagc atcttgggga acttgggagc aacttctggg aaactcagat atggctttga 121 cactcatctc acccgcctgg gcttggaaat acgttagcag tgccacaggt tcgggacacc 181 tgcacctcgg tgtcaggaca cggcaaggcc acattaccag aattgtctcc ttccggagcc 241 ctattcctgg gtgtgtgtgt gggggggggg gtgcagtcca cagggggatc tgtaaaaaaa 301 gaggaactag gctgaggctc atgggtcatg gggcctcaga ggaccggaaa ggggcccctt 361 ccacccccat cagcataggg ggtgggttca aagctagacc aacctcagag gacatcagag 421 gcagccttgc cacctttccc tttgcacacg tttggcnctg ggctcctgag ggttgggggg 481 tcttccagtg ctgtgggagc agcaaacccc gaaggtgact agccggggtg gatcaggagg 541 tgcgcgctcc gtgatccctg cccgctggtt tcctccgggg tcagcattgc ttcttatcag 601 ccgcggccag gcagccagac ccttatctgc acaggcccag catcctctgg cctctgcgcc 661 agcgggtaag agggaggaag ccaggctgtc gggggcgggg gaaagggcgg gtcggtccag 721 gagcagctca ctttctcttg gaggtaccca ccacgacgcc tccgaagcta ctcaggccct 781 ccggagcccg ggaggtggcg cagcccagct ctgaggccca acgggaggga ttacctggag 841 cacaccggcc gaactggaca gtggaaactg tctcctgacc tggactgaca aacctgacca 901 cacagccggg ttctccaagc tgtggaaact gctacagggt aagaaagacg ctttaaacat 961 taaggagtgt ctgtgcctgt attcggtgtt tgttacttca tttatttatt cattcactgg 1021 aagtgaggag gcactgaaat gtcctcagtt ggggaaaaca gggcagcaat gtgggaagtc 1081 acacactggc aggatacccc agaccttaga gcccagcaaa cgtggtaagt agggtgctga 1141 tggggttggg ggccagtaag aggaactttc agcaagggta gaggcccctg gaatagccat 1201 ggagaggtgg cagagggtga ctcttggaga cttaggaggt cttagagtta gtccggagat 1261 tcctatctta ttcccagctg caggtaggaa gagattctga agatgggaac tagttcagag 1321 actgaagatg cccattacaa acaattcata gggtcaacgc tgcctagtat ctgatagaca 1381 ccccctcttg tggaggaggc tcccctcttg tggaggaggc cctcctttta ggtcctagca 1441 gtttctagga ccacttttct acctttgatt attgaacccc caatcccttt ctcgcctaag 1501 ggtgctctta cattgaaaag ctcaggttaa acaggtggga gaaagatggt ctttggcctt 1561 ccttcccctt aacttcactc tctcccacag gttagcattg tatccgtggg gtccatgtgc 1621 ccgccccagg ccagagcaga ggtgggttcc gccatgactg agaaaactga gatggtatgt 1681 gcctccagcc cagcacctgc ccctccctct aagccggcct cacctgggcc cctttctaca 1741 gaggaggtgg accaccgaaa cacgtgcact ccctggttgc ctcccggcgt gccagtgata 1801 aacctgggac acaccaggcc cataggggca gccatgccca ccacagagct cagtgccttt 1861 cggccctcct tgctgcagct aactgccttg ggaagagctc cacctaccct ggctgtacat 1921 taccaccctc accccttcct caacaggtca gtgggcagct cggactatgg gcagtgctta 1981 ggtttggggg ccagggtctt tgcctacaaa gtgttcatga cccacaaccc cttacagtgt 2041 ctacattggg ccagcaggac ccttcagcat cttccctaac agccggctga agcgcagacc 2101 aagccatagt gagctggact tggctgacgg tgagtttgtg tctctggtgt gtacttgtgg 2161 gtgcctgcag ctcatgtctg attggcctgt actctgccac aggaagaagt atttgggaag 2221 ttcacataca ccccaacaca ggacaactca cacaagtgat agctaatgca agactacagt 2281 acggagccta gcttggggga ttaccctgta atccagactg acacaggaga accagttcag 2341 aagccaacct gggctatgtg agtccccaac aagcaagcat tataaccaga gacatggcca 2401 caataggata tggcagccac aagatgacac tccccaaccc ttgagaacac agtcatgaga 2461 gacaaccttg ggttaggatc acacagagct atgagaccac agtcaacctc agaaccccag 2521 ccacacatac aacagccaca accacatcgc atgatcccac acaaccacag ccatcagata 2581 ggtcaccagt caccctcagg cccacagccg gcaagaacag gacctttgag aacccagaaa 2641 tacacttgct aggaatgaaa ccacagcatg gacctctcaa gggatcacat cgctgtcatc 2701 aaggcttaat cagactcagc agttccgatg tctacacagc ccacagtctg ggttcaagcc 2761 ccacgtgggt gctagtggtg ttcagtgagg acagcagtgg gcgccctctg tcctctacgc 2821 tgatgctatc ccttgggtcc tcagggcacc aaccccagaa ggtggctcgg cgagtgttca 2881 ccaacagccg tgagcgctgg cggcagcagc acgttaacgg cgccttcgca gagctcagga 2941 agctgctgcc cacccacccg cccgaccgga agctgagcaa gaacgaggtg ctgcgcctgg 3001 ccatgaagta tataggcttc ctggtgcggc tgctgcgaga ccaaacggct gtgctgacct 3061 ccggccccag cgctcccggg tcccgcaagc cacctgcgcg caggggcgtg gagggcagcg 3121 cacgcttcgg ggccgggcac agagtagagg ctgcacgctc acagcccgtg cttcctgggg 3181 actgtgatgg cgaccccaat gggtcagtga gacccatcaa gttggagcag acgtccctga 3241 gtcctgaggt gcggtgaccc aaggcagagg cgcctcacct gctgttcagt gaactccctg 3301 taaaaggatc tcaggtcgcc cagatgagga aacgccctgt agctctggaa gggtgaccgc 3361 gacgccaggg atccccagac ttttaagaaa aatctcccac aggctacttg cagtgctttg 3421 ccagtgtcct cttgccaggc cgaggccaga gaatcgacaa ggccaagtag ccatgggatc 3481 ccagaggtcc tgagggaggt agggaagggt ctggaagcct ccgcgcttct catcggcgcc 3541 ccctgctgga ggcgggaaga ccctagacaa gtgtgtggga ggaacctacc ccatgaagcc 3601 caactggttg cgcttgatcc gsaacgttgt tgccattgct gcaggcgcag aactggtagg 3661 tatggaagat ctctagaa // LOCUS MUSTHY1GC 3257 bp DNA ROD 01-SEP-1988 DEFINITION Mouse Thy-1.2 gene, clones pcT108 and pcT34. ACCESSION M11160 NID g202034 KEYWORDS . SOURCE Mouse (Strain C57B1/6) DNA. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 3257) AUTHORS Chang,H.-C., Seki,T., Moriuchi,T. and Silver,J. TITLE Isolation and characterization of mouse Thy-1 genomic clones JOURNAL Proc. Natl. Acad. Sci. U.S.A. 82, 3819-3823 (1985) MEDLINE 85216583 FEATURES Location/Qualifiers source 1..3257 /organism="Mus musculus" /db_xref="taxon:10090" prim_transcript <1..3130 /note="Thy-1 mRNA and introns" exon <555..591 /note="Thy-1 antigen precursor" /number=1 CDS join(555..591,1182..1520,1907..2019) /note="Thy-1 antigen precursor" /codon_start=1 /db_xref="PID:g202035" /translation="MNPAISVALLLSVLQVSRGQKVTSLTACLVNQNLRLDCRHENNT KDNSIQHEFSLTREKRKHVLSGTLGIPEHTYRSRVTLSNQPYIKVLTLANFTTKDEGD YFCELQVSGANPMSSNKSISVYRDKLVKCGGISLLVQNTSWMLLLLLSLSLLQALDFI SL" intron 592..1181 /note="Thy-1 intron A" exon 1182..1520 /number=2 intron 1521..1906 /note="Thy-1 intron B" exon 1907..>2019 /note="Thy-1 antigen precursor" /number=3 BASE COUNT 737 a 945 c 826 g 749 t ORIGIN 1 ttctgttaga caggctggcc tggaaatcca tctgcctgcc tctgcctctc tgcctctctg 61 cctctctgcc tctctctctg cctctctctg cctctctctg cccctctctc tgcccctctc 121 tgcccctctc tgcccctctc tgccgccctc tgccgccctc tgccttctgc cctctgccct 181 cgcctctggc ctctgccctc tgccctcgct ctggcctctg gcctctgcct cttgagtgct 241 ggaatcaaag gtctgagctc tgtaggtctt aagttccaga agaaagtaat gaagtcaccc 301 agcagggagg tgctcaggga cagcacagac acacacccag gacataggct cccacttcct 361 tggctttctc tgagtggcaa aggccttagg cagtgtcact ccctaagaga aggggataaa 421 gagaggggct gaggtattca tcatgtgctc cgtggatctc aagccctcaa ggtaaatggg 481 gacccacctg tcctaccagc tggctgacct gtagctttcc ccaccacaga atccaagtcg 541 gaactcttgg caccatgaac ccagccatca gcgtcgctct cctgctctca ggtactgggc 601 aagggtcagg gctggcattc taaggaatct ggcttcctcc catcccggga agtagcctct 661 ttgccatagt ctcaggggca caggtggttg ggaggtgcgg gggtggggag tggggaggag 721 cctcaacctc accagtggtg gtctttgaca tattagaaac tccataatgg atctaggaac 781 tcctctgctg ggtggtggtg gttgtggtac acacctttaa tctcagcact caggaggcag 841 agtcaggtgg atctgttagt ctgaagccag ctggtctaca gagcaaattc caggacagcc 901 agagctattc tcaagataga gaatcccttt cttgaaaaaa ccatttaaaa acaaaaacaa 961 aagcaacaca ctcctttgat ctcctgttct tgaaacacat tgttgggacc cagaacttca 1021 gtagattgat ggaagttgga gtctgcaagt ggtggaacat cccaccaata cctcaagggc 1081 gagtgcaaac cccacatccc cccagctcaa gctcactttt cctgcaggtg ggaggcccgg 1141 gtctgtgtct ccccaaattc agagaaggca ctgctgtgca gtcttgcagg tgtcccgagg 1201 gcagaaggtg accagcctga cagcctgcct ggtgaaccaa aaccttcgcc tggactgccg 1261 ccatgagaat aacaccaagg ataactccat ccagcatgag ttcagcctga cccgagagaa 1321 gaggaagcac gtgctctcag gcacccttgg gatacccgag cacacgtacc gctcccgcgt 1381 caccctctcc aaccagccct atatcaaggt ccttacccta gccaacttca ccaccaagga 1441 tgagggcgac tacttttgtg agcttcaagt ctcgggcgcg aatcccatga gctccaataa 1501 aagtatcagt gtgtatagag gtgagactgg ttcccagaaa gataaaatgt ctaggttagc 1561 taggctgggg tagccaataa aaaaaaaaaa aaaaaaaaaa aaaaaaacag gcacctccat 1621 tacccttccc ctaactgctg gtctcctggg aaactgctgc tgtctatgtg agtggggcaa 1681 gattaggggc cagaaagggg gagcttgtag taaaagcaca gttgaggaaa ctaaatggga 1741 aaggcagtac agtggtgatt cttgtggtgt ggaggttctg ttacagcatc cggtggagcc 1801 gctaagatga gaaagcgcca gctagctgcc ttgaacagct gacacctgtc tttgcccgcc 1861 tgagtcctga tctcccctcc tcccggcacc ccttctctat ccacagacaa gctggtcaag 1921 tgtggcggca taagcctgct ggttcagaac acatcctgga tgctgctgct gctgctttcc 1981 ctctccctcc tccaagccct ggacttcatt tctctgtgac tggttgggcc caaggagaaa 2041 caggggccct cgaggagccc ctcgggtcct tcctctgcag aggtcttgct tctcccggtc 2101 agctgactcc ctccccaagt ccttccaata tctcagaaca tggggagaaa cggggacctt 2161 gtccctccta aggaacccca gtgctgcatg ccatcatccc ccccaccctc gcccccaccc 2221 ccgccacttc tccctccatg cataccacta gctgtcattt tgtactctgt atttattcta 2281 gggctgcttc tgattattta gtttgttctt tccctggaga cctgttagaa cataagggcg 2341 tatggtgggt aggggaggca ggatatcagt ccctggggcg agttcctccc tgccaaggaa 2401 gccagatgcc tgaaagagat atggatgagg gaagttggac tgtgcctgta cctggtacag 2461 tcatactctg tggggaatca tcggggaggg ggggggggct caagatggga gagctctgct 2521 agcctttgtg gaccatccaa tgaggatgag ggcttagatt ctaccaggtc attctcagcc 2581 accacacaca agcgctctgc catcactgaa gaagccccct agggccttgg gccagggcac 2641 actcagtaaa gatgcaggtt cagtcaggga atgatgggga aaggggtagg aggtggggga 2701 gggatcaccc cctcctctaa aacacgagcc tgctgtctcc aaaggcctct gcctgtagtg 2761 agggtggcag aagaagacaa ggagccagaa ctctgactcc aggatctaag tccgtgcagg 2821 aaggggatcc tagaaccatc cggttggacc cagcttacca agggagagcc tttattcttc 2881 tttccctctg cccctctgtg ccagcccctc ttgctgtccc tgatccccag acagacgaga 2941 gtcttgcaaa cagcctgttc caagacctcc taatctcagg ggcaggcggt ggagctgaga 3001 tccggcgtgc acactttttg gttgatagct ttcccaagga tcctctcccc cactggcagc 3061 tctgcctgtc ccatcaccat gtataatacc accactgcta cagcatctca ccgaggaaag 3121 aaaaatgcac aataaaacca agcctctgga gtgtgtcctg gtgtctgtct cttctgtgtc 3181 ctggcgtctg tctcttctgt gttcttccaa ggtcagaaac aaaaaccaca cacttcgcct 3241 ggattggctc ggctgag // LOCUS MUSXRCC1G 37349 bp DNA ROD 30-JAN-1995 DEFINITION Mouse XRCC1 DNA repair gene, genomic. ACCESSION L34078 NID g642119 KEYWORDS B1 repeat; B2 repeat; DNA repair protein; tandem satellite array. SOURCE Mus musculus (strain B6/CBAF1J) DNA. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 37349) AUTHORS Lamerdin,J.E., Carrano,A.V., Thompson,L.H., Montgomery,M.A., Stilwagen,S.A., Scheidecker,L. and Tebbs,R.S. TITLE Genomic sequence comparison of the human and mouse XRCC1 DNA repair gene regions JOURNAL Genomics (1995) In press FEATURES Location/Qualifiers source 1..37349 /organism="Mus musculus" /strain="B6/CBAF1J" /db_xref="taxon:10090" /cell_line="L5178Y" /map="19q13.2" satellite complement(1808..2005) /note="human chromosome 19-specific pE670 repeat" exon 10126..10176 /partial /gene="XRCC1" /note="exon 1; G00-120-737" gene 10126..36154 /gene="XRCC1" #Egen indsættelse: CDS join(10126..10176,10608..10700,22554..22670, 28352..28510,29035..29109,29205..29316, 29674..29783,29861..29960,30130..30388, 30626..30742,33032..33125,33265..33394, 33688..33742,33831..33967,35025..35115, 35870..35945,36041..36154) exon 10608..10700 /partial /gene="XRCC1" /note="exon 2; G00-120-737" satellite 16295..16551 /partial /gene="XRCC1" /note="human chromosome 19-specific pE670 repeat" exon 22554..22670 /partial /gene="XRCC1" /note="exon 3; G00-120-737" repeat_region 25369..25574 /partial exon 28352..28510 /partial /gene="XRCC1" /note="exon 4; G00-120-737" exon 29035..29109 /partial /gene="XRCC1" /note="exon 5; G00-120-737" exon 29205..29316 /partial /gene="XRCC1" /note="exon 6; G00-120-737" exon 29674..29783 /partial /gene="XRCC1" /note="exon 7; G00-120-737" exon 29861..29960 /partial /gene="XRCC1" /note="exon 8; G00-120-737" exon 30130..30388 /partial /gene="XRCC1" /note="exon 9; G00-120-737" exon 30626..30742 /partial /gene="XRCC1" /note="exon 10; G00-120-737" exon 33032..33125 /partial /gene="XRCC1" /note="exon 11; G00-120-737" exon 33265..33394 /partial /gene="XRCC1" /note="exon 12; G00-120-737" exon 33688..33742 /partial /gene="XRCC1" /note="exon 13; G00-120-737" exon 33831..33967 /partial /gene="XRCC1" /note="exon 14; G00-120-737" exon 35025..35115 /partial /gene="XRCC1" /note="exon 15; G00-120-737" exon 35870..35945 /partial /gene="XRCC1" /note="exon 16; G00-120-737" exon 36041..36154 /partial /gene="XRCC1" /note="exon 17; G00-120-737" BASE COUNT 8732 a 9452 c 9558 g 9607 t ORIGIN 1 gatcatagtt caacaataca gtctattgtg aaaaatcata acagcatgaa agtgagacat 61 tgcaaccaca tttgggaagc tgagaaatct ttgtttcacc ccaggaccac agcgctatgg 121 aatagtgttt caaacagtag gtttcctcgc ctcatttaac atagtctagg cactcctcca 181 cagccatgtc cagggtgaac gtaaattctg tcagttaata ttaaccatca cagcaggcat 241 ctaagaactt gctctgttac atgtgcacat gtgtggctaa gcatgcacac actggcatgt 301 gaatgtgctc acacacatac ccagctccca ggtactgcag ccccctctga gaagcttgac 361 actcaaatgc tactcctcta ctgcctgggc cccacttccc ttggtctggc tacagtaaca 421 gccaaactcg gtctcctatt aagtgtcctg agctctctaa accctccatg tcttttctgg 481 tattttgaca tagcttaggc ttgccttcac ctcggcatgt agctggtgat ggccttgaaa 541 tcctgatcct cctgcctgca tctctccagt gccagggtca ggcacgtggc atgatgccct 601 gctctcactg tggcttgctt tggtgccacc tggcatcccc gtcctgtgct ttgctgctct 661 cccttcttga ggcaggaggc ccatgctaga tgcatcgaag gagcatattg tagccattgt 721 gtgtgtcctg gtcagctcca taaacttgta cagggcactt ttgtgttttg tgattccttg 781 cccatttttg ttctgttact tgagatagag gatccaggct gtctcaaaca cttggcgcca 841 ggtgcttctc cggtcctagc ctctccttgt cctgggttac aggtagaggt ggctgccttt 901 taaactatga tctcccaccc ccacccctga gatagggcct tactctacag ctctgggcgt 961 ctaaagagct gggattacag gtgtaaatga ctacatatgt tgcccataca atcttttgtt 1021 aggattttca ggcttttcca gaatgtaagc aaagcagata gccaggcccc tggaagcctc 1081 caaataggaa aggttcagtc tccctcgtca caggccacct gctgccagca gggtgctccc 1141 atgaggcacc tcgggtctga aagagcctgg gcgcaggaac tctgggaaat gtcagtcctc 1201 catgctggaa ttgcagctac acaatcttgg cattgccctt gaaagacgag caggtttcaa 1261 acgttggtga atggatgagg tgccctcctc aacatggaga gagtctcaca cttctggctc 1321 atgaaggagc tgtctgttgg aatgtggggg acaaatctgt ctggtaatcc tgtggtgtgt 1381 tttggaaaaa agaataaaag ggtaggaatg ctattatgtg tgtcttctat ttaatttatt 1441 aattttaatt gtgtgggtgt gtgctgtgaa agatgcagtt ttcacctgct gagcatcttc 1501 cctgccgggc aaggtggtga ttttcaaatg cttttaagtt tgtgatggcc aagcagctga 1561 actcttgagc cagcgtgacc cactgtcccc cagtccagcc tctccgtaga caagcacatg 1621 gatgccctct ctcagtctgg ctctggattt tagccacagg gacgacagca aatgtgatgc 1681 aatcagaggc ttcaagagtg cgcatgaatt gaacttcctc tctgctatcc ttgggctccc 1741 tgctgtcact gccacactga aaagcctaca aagctaaggg ttgggaatgt attcccttag 1801 aacctcccag tgaggggctg ggggtgtggc tcagtggtag agctcctgcc tagaatcccc 1861 cagtgagggg atgggggcgt ggctcagtgg tagagcccct gcctagaatt ccccagtgag 1921 gggctggggg cgtggctctg tggtagagcc cctgcctaga atcccccagt gaggggctgg 1981 gggcgtggcc cagtggtaga gcacctgcct agcatattca aggctctagg tctatcccca 2041 ctactgacac aaaattgacg gaagctttcg gtatgtagtc tgggattcat cctcctccgg 2101 cctcagcctc ccaagcactg ggtttgcagg tgtactctac cacattctgc tacaatctta 2161 aactctgagc caagcgatgg tagtgtactc aggaggcaga ggcgggtgga tctctcgtct 2221 acagagtgag ttccaggaca gccatagcta cacagagaac ccatgcctca aaaaacaaac 2281 aaacaaaaat aaaaaaccac caccaacaac aaaacaaccc cccgttacat ttgagctgtg 2341 caatcttcaa attgatacat gcccccccaa gccccagcct ttgcctcaag actcatctca 2401 agaacataaa tgctactgag tgcctgtcgc caccatcgag cagacaaaat gagaggcagc 2461 ctctcacagg ccaaggtgct cacacttgct gtttctgaaa ttgcccagct ctctgccttc 2521 cgactgggag agaaacctac tcctggttag gtatggctat gtgactagct ctgtccctgg 2581 agctgagagt gttgagtcac ttctgggtca cacagtggtt aatcacacag gtgaggcatt 2641 tctaggcact tccctttggc aaaggcattg gctgatccat acctggaatg agcagaaacc 2701 cgctttggat tatgctgaac ttgtagagca aaggcaagca ccagcctttt atgaatgggt 2761 gctcattcag caaacacacc cgctctcccg gtgaaactcc catgctgatg ttggctttgg 2821 cctggtgact caccttagtc aatggaatgt taccagctct aaccccatca acaacaataa 2881 caaaaaccct aaatgcactt gactttcctg cagtgccagg gatctgctgg aaggaggata 2941 tgtggaagtc cagcggggag ctagttgtgg atcaaaccag ctaggagagc tatggtaggc 3001 aagctacagg cccttagaaa caagattgcg ggctggtgag atggctcagt gggtaagagc 3061 accgactgct cttccgaagg tccagagttc aaatcccagc aaccacatgg tggctcacaa 3121 ccatccacaa caagatcctc ttctggagtg tctgaaaaca gctacagtgt acttacacat 3181 attaataaat aaatctttaa aaaaaaaaga aacaagattg cttgttgtct taaatctcag 3241 tgtggggatt gtttgttaca tggcattatt gttgcaagag ctgactcata caaagttgtt 3301 actgtagcat aatgtagcca gctcagattg gtgtcaacat tagcacctgc ccgctaatgg 3361 cccttctgta agtggcagct gtcacgatga ttaattagta tttactgaga cagactctct 3421 gaagaaccat catttttttt ttttaaggta gcacccactg ctctcatatc tgctttcaaa 3481 ggaggttttt catcacacag ttctgaagtt ctcactgtta atagccacat gttaccaaca 3541 tggacatgga gatgaggcag gagatgtgac cagaactcac atcaggattc tcaccatcct 3601 agctgtttga cttaaggcct cagtttgtcc acttgtaacg tggaataata tcaatttcat 3661 ctgggcgggg ctgttcacag ccataatccc aaccttccag atggtgaggc agagagctca 3721 cctcgagttt gaggctggac tggactacac tacataatga ggccatctca aaggacgaca 3781 gcagagcagc aaatcaattt catggagctg tgatcaggaa atgactagaa agtataaaac 3841 acttagcagg gtgcctctta ctataaagcg ggggctactt gattttcttt ttccctttgc 3901 tttataaaag ggcaggagtc aggatgacag ccctgcctct gtccctccat ttctctctct 3961 ctcttctgca ctgagtctca ctgtgtgact ctggctggcc ttgaactaac aaagatccac 4021 ctgcccctgc ctcccgagtg ctgggattaa aaacatacac catcacacct gatgcatact 4081 gggatgaagg caagagctat tatatcgggt ttttttttct tttttctttt tcttttcttt 4141 ttttcttttg gttggctgtt ctggaactca ctctgcagac cagattggcc tcgaactcag 4201 aaatctgcct gcctctgcct cccaagtgct tggattaaag gcgtgcgcca ccacccggcc 4261 ttcctgtttg tttttgagac agggtctgcc tatgcagcct cagttcaggc cagtcctgtg 4321 ctcatgatcc tcctacctca gcctcagtcc atgctgccac atagggaccc tctcctcttt 4381 gaatcaggaa aagggctccg aaaagaacac tgagacctgt actgtttaca acccaccttc 4441 ccaagatgca tctgtcacaa ggaatgaggt ttgtactgtc gatctgtttt aatttacagg 4501 cagctggaaa gctgcagggc tcctcaagtt acatacttct cggttcttca ctctcccctg 4561 ccaggagggt aaggggctgg aaggcagtct gcccggcgca ggaagtagag atagaaggct 4621 gaagggacct cagcccctgc cttggtgtgg caggcgctct ctgtagcaca gcccctcgtg 4681 gcaaatttcg tgtggataat acctgtgacg acatttcaga atggagtcga cttctgtgtg 4741 acccaggagt ctaacacaca catacatccc attaacacac acacacacca ccaccaccac 4801 caccaccacc accaccacca ccaccaccat catccagtac tacggaactt cctcatcacc 4861 tgtaatgggg aaagtgtgga tctctatttc tgagacaccc cctcccccaa tcaaagcacc 4921 actcaccagc ctgcacattg ccagcaaaat agatgcagtg tgtttcccgg cccacgcagc 4981 gagctgcctg ggttcctgga caggtctcct ggaagggcgc aatgcaggag ggacacatca 5041 ggccattctc tgtccgattg ttcaagggag ctgtgaagaa agacgttgca taaagcaggc 5101 tggaaggctt gctaccccaa caccctctga accctgggaa acagaagtta gttggtagag 5161 tttgcaggta aggaaacgga ggcctggaag tagtcaaggg tggtgtgcca gatgtcacag 5221 gacaggcaca ctgttcttga atggtactta cggggcacag agccactgtt gcaaccgtca 5281 ctttgacagc agtgggcgtt ggacaccatg tagtcactgg gagtcatggt ggttgatacg 5341 aagcctgagt agcagtcttt gtacttcatg caggccttga aggtgttcac tgactttcgg 5401 ccctctgcag agtagggggc aggagtggag ggatggaagt gctgtcaatc acttcttcct 5461 cacccctccc catccctctc cagatcacca tggcatctgc actcatatgt gcatacaaac 5521 acatatgcat attcacgtat acatacatta aaataaacct ttgagacagg gtctcactat 5581 gtagctctga cctacaactc actgtagacc aggctggcct tgaactcaca gagatcctcc 5641 tgcctctgcc tccagggtgt tggcattaaa ggtgtgtgct accacacttg gctaaaataa 5701 gtaaaaaaaa aaattttttt tttttttttt tttttttttt tggtttttcg agacagggtt 5761 tctctgtggt gtgtggtgtg tggtggcgca cgctttgatc ccaacacttg ggaggcagag 5821 gcaggtggat ttctgagttc gaggccagcc tggtctacag agtgagttcc aggacagcca 5881 aggctagaca gagaaaccct gtctcaaaaa aacaaaaaca aaaacaaaaa caaaaacaga 5941 agagcattgg gtgagagaaa cgactcgcca ggatgtcatt aaagaagttg gaaaaatcat 6001 ttacgtggta cacctcgaag ctgaggataa agcccttgag caagagctca gcttgctgaa 6061 tgggttaaag gacatcatgg ggcggggtgg ggggtgcggg agagaaggtt cagtggttaa 6121 gagcactggc tgttcttcct agacgaccag atttaacaac tgtctgtatg taactccagt 6181 tacaggggat ctgacaccaa tgtacataaa attattaaaa aagagaggtg accttcagga 6241 ctgcacacac acacacatac acacacacgt ttaatgttaa aaaggccatg acagaagcca 6301 cggccttcct ttgttacagc cttccgcccg tccgtctctc agctggaatc actgtcctat 6361 aactaaggct ctctcttacc attgaaaatg aacatccaac ccaatcgtcc ttatttcccc 6421 atcatctctg aggtctgggg tgctctcttt gatttaggct agcctctccc aatgtacttc 6481 tgggcctccc catttccatc ctctccctcc ttcctcctct taccccaccc ctccacacct 6541 ctcccttgcc ttctttgctc tctctcatcc cccttcctct cccttcccct ccctcaccct 6601 ccctttccct ttcttccttc agccttcctt cttcctccct ccacctctcc ctctccagat 6661 ctttcccctt cctctctccc ccttcttccc cttcctcctc ctcctcctcc tcctcctcct 6721 cctcctcctc ctcctcctcc tcctctgcca tctgtacctc ccactttctt ctgtgcttgt 6781 ttttaagtca gggtcttgct gtacagctca ggctggtttc agacctgtga ctatcctctt 6841 gctcatcacc ctatcccggc acctctactt tctctagatt tatttgctcc attcctcagg 6901 gtctttctct catcaggcag ctcctttctg ttctctcctg accctaacac catctcctct 6961 tcctttgcgc ccaacagatt tgcttaagag cacaaactgc tggaggctgg agaggtggct 7021 cagtggttaa gagcactggc tgctcttcta gaggtcctga gttcaattcc cagcaaccat 7081 atggtggctc acagacatct gtaacaggat ctgatgccct cttctggtgt gtctgatgac 7141 agctacagtg tattcatata aatgaataaa ccttaaaaaa tcaaatatta aaaaaaaaaa 7201 aaaaagcacc aactgcactt gcagaagatc caaactcagc tcgtctcacc ggtgtctcga 7261 aactgcctga aacgtttgct ccaagggatc tttctcctcc aggcccccat aaacactgtg 7321 ctttcttgca tagccccccc ccccacacac acaaataatt aattacaaat agaatttttc 7381 ttaaaaacaa aaagaaaaac cagtttccct cctgctactt tcttgtgtcc tctcatagtt 7441 acctcaccaa gctcctcata ttttatgtgc atgagtatat atatgtaagt gctttacaag 7501 tgtgcaatac cagtggtaac cagaagaggg cataaaatcc cccgagattt tagttacaga 7561 ccattgtgag ccaccatgtt ggtactggga attgaactct ggtccagtga aattgactgg 7621 cgctttatct cttccttctt ccatttcaac cctcccctga ttttcttcct acatttcact 7681 gtgactatcc cccacctaca gttagaggtc cctcccaatc ccttccgctg aagccacacc 7741 ctcctacttg tgctggactc gctcactagg accacgcatg cgtctttgcc gtcttcacag 7801 gtcttcatct tcccgctgca tgtgtgcccc gagcctttgc acacctcaca agttagaggg 7861 caccctgcag caaggagagt ggagaacagg gtatatggtc atggttaaag aaggtcttca 7921 gaccgtgact tgggctttgg agaggtgact tttggggagg gaccgggctc tttctagttt 7981 tccatgggtg ggacattgag aactgtccag ggaactggct atagagtcaa ggaagccgtg 8041 gcgatgccag tgaagtggcg tccagggcag gaaatgactc taccacagag cctgttttac 8101 ttaataatta caaggcttgg ggactttgga tgccagggtc ccctgagcaa gctcagacgc 8161 agggatagtg gcagaggtgc tatggagctt ttgcaaaatt tcaggtccag ctggttcttg 8221 gggtgtggaa cttccctccc ttaggaaggt accacggaag tctggcagct caggcagagg 8281 tacctgactc tgaggtctgg gtcctacatt tgaaagggag actgaggccc agaggtatcg 8341 ctggcatcag ttccttggtc ctgacagagg aggaacctag ccatttgggt cacagggaca 8401 gtgaagctgg cttggagtac tgtgtctaag taaaagagta agatgaggtc ctgaatccct 8461 ctaggtccag agagaaaggc catctctctc cttattacta attttaaagt ttgcctttga 8521 tggccaagct ctctcatctg ctttcctaga attcagatgt catgttctga ggcccttcct 8581 agaggacaga ccatgcaggg cccatcaagc cttctcagac cccaggtttt cacctcttta 8641 ttctcctaaa tctcatttct caggctatcc ctctcttgag gacattgccc aatgccctct 8701 gaggtactgg ggaagaggac caggtccctt tgtcacttac cgagacccag gagggtacag 8761 agcagtgtga aggccagcag gaaggtcctg tgtcttctaa acagaatcat ggtgtgtgaa 8821 cctgtagtcc caggtcctca gcccttataa gaagctgggg aagggcgggt ctggtcccac 8881 cttcaccctg gggtgagaaa tagctgagct cagaacccat gggagtcttc cacagcctcc 8941 gggtcagagc cacggaacca gcatccacaa gaaaactgaa ccacttggac ccagcccttt 9001 tctgcacctg ggcccaaatt tcttccacct ggacccagaa gctcagttat gtatccaagc 9061 accttttttt aattgttgtt gttcacaaat cttaaggtag gagtgggtca tctgattccc 9121 tgagggtgca gagcaacttt ggagactggc atccatcctc ttctcctttc ccaaggaggc 9181 gcccagcgta ggttgggtcc taaccgggga gcgcccagca tgtctagacc cgtcctgcgg 9241 gggccgggcg ccttgtggcc tttcctttct ctttcgtgtg cctgcctcac cttcatgctt 9301 ctgtattcga gctgtgtcca gctccaaccc cacccaggga tgtgcccctg ggtcttgagg 9361 ggactgaaca aaagaggtcc tgtcagcaga cagagacggg caaagttgga gacaaatgct 9421 tacaaggcat tggggaacca aggtatttgt ggacgcacgc aagtatattg tgcacaattt 9481 gtattaagat ctttctccct ctcttttggt gttaggtcat gagctaagag aggtctgaga 9541 tctttgtcgg ccatctgtct gtcgatgttc gtctctgtgt gtccatccgt ccgtccgtct 9601 gtttgtctgc atttggagag agaaggggaa cagagcgtgt ccgcaacaac aggtgccaga 9661 gatcttattt caaacaggat tcaggcttca ggaaactctc agcttcagcg tttctgagca 9721 aggctctggt ttggttggtg acctcacggt ccaatgacca gggcaaatta cacgtaggac 9781 ccaatcactg aggccgctct tgttgctagg ttcccaggaa gccgagtctc cacgttatca 9841 ggcggctagg ccggagagtg gtggaacctg agagaggcgg ggccggaggg ggaatgcctg 9901 ttgctaaggg aacccggcgc ctcgctcgcg gagaggccca accgagcatg cgcagtgttg 9961 acgtgtgcgc cggcgcgccg cggtttgaaa ggcccgagcc ttgcgcgctt gcgcactgag 10021 tcccgtgcag ggcgcgcgcg ctcccccaac cttctccggg ggccccccgg agctgcaacc 10081 cttcttcttc attccctgga cggtcgggcg tccacgggcg tggacatgcc ggagatcagc 10141 ctccgccacg tcgtgtcctg cagcagccag gactcggtga ggggctgacg tgggagcagc 10201 ggggagacag gggcttcctg agggatgctg aggactttgg aagcgcaaag ggggggggga 10261 tcttggcggg attctgagcg gggctccggg tggggtttcc agaggctgga cgtgcattcc 10321 gagagagacg ggcagttagg gaagtttggg gccactgtgg agagtaagga aaaagagggg 10381 cctgggaagt tctaactgtg aagaaaaact cggtgaaagg aaaactcgag gcaggggtgt 10441 gtgtatgcgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgg tgggactggc gtgaagtcaa 10501 gtgtggagag tcggctagct cttgggagtt taaacattca aaaataatct tatttttaaa 10561 agtcagcttt atttaaagaa cttccacctt tccacccttg cctgcagacc cattgtgccg 10621 aaaatcttct caaggcggac acttaccgga agtggcgggc agccaaggca ggagagaaga 10681 ccatctctgt ggtcctacag gtgatgctgc catctcccac ctcctcattc ccggggcagt 10741 gagacccccg ctcacattcc actgccagct ggctggaggt tgagtcgttt cctttctgta 10801 gtagagagct tgctcggcta acttgcctgg aaaatgggct tagaaatccc tgtgcctcca 10861 ggaatccttg ggtaaccctg agtaggttga ctctacttct gggcctcagt ttcatcagct 10921 gggacagtgt cagccattac tgttaactgt gtcccccggg gccagaaggt ggccaagaag 10981 agtatctcca taaatcagtt tcattgtaaa gacgaccgac atggaagggt ggatggctca 11041 ggcctgtaat cctggttgct tgggaagttg agataggata acaagttcaa ggccagcctg 11101 cacaacttag aacctgtctg gaaatgagaa ataactaccc tcacgaggac tattaacata 11161 ggattaggca gatagctcgg ctgttcaggg cttgcctgga gcatgtcaga cccttccttg 11221 gttccatccc tagtacctca gagagtaaag gaatgaaacg tgtagggaag ggaagttttg 11281 tttgattggt tttattgttt tgtttgtgag atagagtctc tctacggagc cctggttgtc 11341 ctggaactca ctctatagac catgctggcc tccaacttac agagatcctc ctgcctctgc 11401 tttcagagtg ctgggattaa aagcatgtac caccccaccc tattctcctt ttaaaatttt 11461 tcttaaattg aagctagtct tgaactcctg atccttgtct gtctatctgg ggcagaggag 11521 ttccaagtgt ataccatcat gcctagcatc tgtcagtttg cctttccttc atttagctta 11581 caacacacac acacacacac actcagtact gtgcaggcta ggcaagaggt tactaattga 11641 gccatgtccc tatctttctg actttgacta cttttaaatt tattttaagc tgggcagtgg 11701 tcatggcaca cctttaatcc cagcagttgg gaggcagaag caggtggatc tctgagtttg 11761 aggccaacct ggtctacaga gtgagttccg ggacagccag agctacatag agaagctccc 11821 atggggagtt agggagttgt ttctgacagg tacagaaact tgaggaaggg gctagagaga 11881 tggctcagtg ggtaagagtg ccaaccggtc tcccagagga ctgagttcag ttcttagcac 11941 ctgtgtctga tggcatacag caacagcctg taagcctagc cccagaggct ctcatacctc 12001 tggtctctgg ggcttcatcc ccctttcata cacatattaa aaacaataaa aatgtatctc 12061 agggctggag agatggctca gcggttaaga gcactggctg ctcttccaga ggtcctgagt 12121 tcaattccca gcaaccacat ggtggctcac aaccatctgt aatggaatcc gatgtcctct 12181 tctcgtgtgt ctgaagacag ctacagtgta ctaatataca taaaataaat gaataaatta 12241 aaaaaatgta tcttaaagaa aaatcagtgg gagaaggaaa ttagatgctt ttaagcacct 12301 gcccactgcc agttgggtgt tgaccccgtg attctctgag tgtcctactg taaggggctt 12361 ggttaccact cacaaacggc agtgccaaga ctcagtccac actaagagtg tgggtgcttt 12421 ggacagcttc gtcagcctgt tgtattcata tgcaccactg agctgcaatc cccagccgcc 12481 accgccacca ttatctcttc ctcctccccg gagtgcttcg gtccccagta tgggtcactg 12541 cccagctcac acactgactt tccttgatgt tagccacatc cctcagttct gagtggagaa 12601 ggactttagt ggggacaaag ttgacctgac tattcctata cttaacacac tcttggcctt 12661 tttcttgctg agtagttttg ggttacagga tgttttgagg attatagaga tgttttcaga 12721 aaagaaccat aacattttct ggccaatctg aggtatttat gcctttttaa atccctaatc 12781 agagactggt aatgaaaact aacattgttc cccattgtaa cctgccagtc agcaaccatc 12841 atgcctcaca tggcttgatt cattccctcc ttgtaaagtc ctgtgaccca cagactgtca 12901 ttatccccat tttacagata aggaaaccga ggcccagaga aattatgaaa cccagcacct 12961 gtcttttaaa ctgtgtgaga accagattgc ttagcattta gtttggggat ctggattcaa 13021 atcctagctc ttactcactc tgacctcagg caaatgtttt taatttctct aaacataccc 13081 attgaaaata tagacaccca aggctggggt ggtggttaaa acctggtgac ataatgtctg 13141 cagaagtact ctgtctacat gtgggggctg ggataaaggg aggggttggt gatttcatta 13201 tgggatgttg attcttcagg ccttcatcag gtgacccccc ccgcttcccc cctcccccga 13261 tacccctaca tatactgtgg gaaataatgt tcagctctat aagagtggtg agaacagaat 13321 gcagcatcca gggtcctgag gaggctcagc ggggagggtc ctcgtggcca agcctgacag 13381 cttgagttcc ctcgttggac tccattttaa gagaattgcc ccacagagct tgtcctctgc 13441 ttccacatgc acgctcatcc ccacctgtgt acatcagaat aaatgtccgg ctggagagat 13501 gctgtgggta agagcactgg ctgctccttc agagaacctg ggtttgattc ccagcatcca 13561 catggtggtt tgtaaccttt tgtaactaca aatccaaggg atcaggtgtc ctttctagcc 13621 tctgagaggc aaaataccca tatgcataag aataaaaatt ttaagtcgtt aagaaaacat 13681 gggagacaac tagaggtcat atgcaagcta cagacagtgg tgtagaaggt ggtacttgga 13741 ggggttcaca tgtcacgttt tagctttgtg gcaaaggtca agagtcttag tagcatcatt 13801 tgttccctca ttcatttatt ccacaagtgt gctctaagct cttgctctac gtggacactc 13861 tgtgctcccg ggtctgtaca gctgtgtgca gtcggtgatg aacatgctca cacctggagg 13921 atgggggtgt tgctggacta gaggaggctt gcctagcata ccccaggccc tgggtttgaa 13981 ccccaacacc cccaaaattg gaagcagtag tgacttcggt ctgcaattct ggtactcagg 14041 agcagcagga tcagaaactc aaggtcatct gctacatagc aagcttgagg tcagctgggg 14101 ttgcatgaga ccccagcatg gaaagctgac aggcttggtg aagtgaccct gtattggatg 14161 atgagccatg agtgctgtgg aggaagtggg gcagggtgag tgagtggtgc atccagccgg 14221 agatggtgtt gtcaggtttt tatttaaata tgaggactaa tctctctttt gaaaactatt 14281 cttagttgag tatggtggtg catgccttta atttcagcac gggggaggca gaggcaggtg 14341 gatatctgtg aattcaagac cagcctagtc cacagagtaa agccttaaca gccaggacta 14401 catagtgaga ctctgtctca aaccatgccc tgtccccatt ataaagaaga tttattttat 14461 gcatgtatat catgacgtgc atgcctggtg ctttcgaggt cagacaaggg cattggatcc 14521 cctgaaagtg gagttaggaa tgttggcatg agccatgtag atgctgggag ccaaacccca 14581 gtcctctaca agagcaagca gtaagtgccc actggtgact ttttaaagct ctgtttgttt 14641 gtttgtttgg tggtggtggt ttttgagaca gtctctctgc agagccctcg gtggaacttg 14701 caatgaagac caggctggtc tcgaactcaa agagatccac ttgcctgctt ctgagtgcta 14761 gggttaaagt tgtgaaactt tttttaaact acatttagca tttagagtgt gtgtgtgtgt 14821 gtgtgtgtgt gtgtgtgtgt ctttagaggt cagagggcac tcgcaggagt cactctgcct 14881 ccactctgag tttctaggcc agcatctttg ctcctcgagc cgtctctgca gcctggagtc 14941 tggtgacaga agtctggtga cagagtctgg tgacagaagt ctggtgaccc tgaacagaag 15001 agaagttcat tcccagggaa gggacagagg tcagctgtgt ccgtctggca cgtgagaggt 15061 cggtcaagga ggccagagag caaggtaggg ccagggcttc gccagggctt ctgcatagga 15121 aagataggac ccttcttgtt cccgttgaga ataaacagag cccatcctgt cagacccctg 15181 ggacaatccg attgcacttg acactggcta atttatgggc aactgagttc gaccagggtt 15241 agctttggtt tcttttcagt gacaacaagg gagagatttc tggacaccaa gacagtagac 15301 tcttgtgtgc tcacctattt ccctgtgtgc tctctaaaca ctgttacaca acattggtat 15361 tgccaaacag ggctcagccc tctgcttgtg tcagctccac acacctcttt gtcagccgtg 15421 gctactctgt gcccagctgg tcacctgctt ccagacaggg agttaaatct ttgccagccc 15481 ctcctggcct ggaggggatg ctgggatagg tccaactgga acactgctgt atcacctgag 15541 tgtcagccgt acaatggaac acatcatacc tgctttgtta ttgctgtttg agacagggtc 15601 tcattgtatg atacagccca ggctggctcc aactctgatc ccctgcatat cacatgtgtt 15661 gggatgtcag gtatgagcca ctgtagttgg tgtgattttt ttttccatgt tttttcttat 15721 ttttaaagat ttatgtatgc acatctgcac accaaaagag ggcactggat gccatggagc 15781 tacgtttaca gacagttgtg agctaccatg tgggtgctgg gaattgaact tgggacctct 15841 ggaagaagtc tgcactggaa gctagtactc ttaaccattg agccatctct ctgccccccc 15901 cctcccctcc cctgttttga gtcacggtcc tactttgtaa ctatggccag tttagaactc 15961 actgtgtaga ccaggctggt cttggatgca cacagatttt cctgcctctg ttttctgaat 16021 gctaggatta aaggcgtaca ccaccagacc tggcagatat ttgttttatg tgtatgagtg 16081 tttgtctgca tgtatgtttg tacatatgcc tagtcgctga agaggccaga ggagggcatc 16141 aaatcccctg gaactggagt tacagatggt tgtgaggtgc tatgtgggtg ctgggcatca 16201 agcctgggtc ctctggaaga ccttgtgctt tcacctgctg agctatatcc cagccccctc 16261 actgggggat tcgaggcagg ggctctacca ctgagccacg cccccagccc tcactggggg 16321 attctaggca ggggctctac cactgagcca cgcccccagc acctcactga gggattctag 16381 gcaggggctc tatcactgag ccacgccccc agccctcact gggggattct aggcagaggc 16441 tctaccactg agccatgtcc ccagcccctc actgggggat tctaggcagg ggctctacca 16501 ctgagccaca ccccagcccc tcactggggg attctaggca ggagctcttt cactgggcca 16561 cgcccccaga ccttcagtgt tttgagagag acattcactg tgtgtgtagc tcagactggc 16621 cttgccttgt gatcctccag cttcctctgt agtgctgagg ttgctggtgt gtgctgctgt 16681 tctgctcttt gttactagta tcattttgag acagggtctt actctgtacc tcaggttggc 16741 ctggaattcc ttgtgacctc ctgcccctgc ctcctacatt ccagtccatc actctcctta 16801 cggagccttt tttccagccc ctctctctct tttttaagtc ttgcatacag gacagctttc 16861 aactagccga ctctcagagc tttgcctagt ttctgtgtct gatcagtatc atgtaccatg 16921 tgtgcataga aaaattaaaa aaatacaaac aaatgtgcaa gtatagaata ccaggttggg 16981 gctgtttgtg ctgcccactt ggtatgttgc ctgcctagcg tacatagtat tccgagtttg 17041 atccctggca ttgcattaac ctgggaagag aaggagacag gatccaaagt cgggacattg 17101 ttggctatat agaaagtttg aggctagcct gggttacatg agacctagtg gtgggggaga 17161 aatatatata tatatttata tacaagtcga agccacagga atggctcagt agtagtggag 17221 agagataaat caggctaagg aatagaggga ccccttgtgg gagatggcag atgctacttt 17281 gtttggtctg aggcattgct gtctaaatct aggtcggaag gaaggcttga gcagtgccga 17341 ggtctgaggc atgttcccga ttctggggtg gcagctgcga aggccctgag caggaggcct 17401 cagcctttac tggatataag ataggtgatg ggagcacaca ggacgttgtc tgggctcagt 17461 atccagaagg gtctgtgctg aagcactgta agctatggct ggctctgttc tcctgggctt 17521 tgcctacaca gctgtggtgt aaccacctcc tgcctgaggc tgctcatcta atcgcagcac 17581 ctgcacttac accctgctag aagagcagat cttacatctc ttccacagtt agagatgaga 17641 gcactcagtt ttgctttgtt ttcttttgtg gtgctgaata ttgaaccagg atcctgtgca 17701 ggccagacaa gtgctctacc attgtcccac atcccagacc cctctctctg agacagtctt 17761 gctgagttaa ctcctgctct tgaacttgcc ctgtggctca tgcagacctt gacctctgaa 17821 ttctcctgct tttacctacc aagtcgctgg ggttgcaggc tcaactggct gcccttgtga 17881 ctttgcggca tcttagcttc ctgtggagtg aagccactgg gactcccctt ggaaacatta 17941 actgtagttt agaaatgtca gggctggatt tgaacacaga ccactttgtt caagtcaagg 18001 ttcctttctg ccacggtgct gtagtgccct gactggtctg agagggcaga agggaacagc 18061 agatggtccc agagcagagg agggccatgg gtggaagtac ataggcagtt aaaaatttgt 18121 ttttgcagct gggcagtggt gccgaacgcc tttaatccca gcacttgaga ggcagaggca 18181 ggcagatttc tgagttcgag gccagcctgg tctgcagagt gagttccagg acagccaggg 18241 ctatacagag aaaccctttc tgggaaaaaa aaaattgttt ttgctacaaa aaaaaaaaaa 18301 aaaaaagatt tatttttaag tatctgtatg aggggtaggt gtgtgttagt gcagaaacca 18361 gaagcattgg ttttctcagg acctagagtt acaggatgtt atgaacctct tctgagtatg 18421 aagagctcta acaactgagc catctctgca gctctgacct tgaacttctg atcctcttgc 18481 tgttatagct aggctgttct cacttaagga aaaactatga tgtttttatt gaacccccca 18541 gactgactac atgtgtaaag aggtgggcct ctagctgaga gtaagcaggg agcttataaa 18601 gggaaccacc acacgtttgg taggtcccat tggtgtagct gggatggctg ttaccaagca 18661 tgacaggtat gaacttttcg taagaataag gcacagctgt gccaccatgg tatcactcag 18721 gcagaacaac tggggagctg tggtctctgg gaagtgctgt gcatacccga gtgtcaggct 18781 gacctcagtt ttagccagat tgagccaggt caggcttgag cgcacgtgtt tttctcattt 18841 cactgcgtcc acaccaagtg ctgaaattac aggtgtgctt cattcagtat gctgattatg 18901 cgtgactggg catcgaactc agggtttcct gcctcctggc tccagctgag ccagccacat 18961 ctgaagccag gacatcttcg aaggctgtga tagcgcctca tccgaggacc ctgttaaaga 19021 gccccagact ggacgcttag cattgcctgt cctcactgcc ccacacccac acacctttca 19081 gtgtcttcag cctcaggctg gagccacagg tgagtggagt cttcataaat agctctttct 19141 ttcgatgaat aatgagcctg tatcagctaa tggttcctgt tcctgcataa tagattcagt 19201 aagaagcctg gagcttttct aggccagaga ctgacaaaac aactcctgtc aaagggagac 19261 agggacaggc aaagagtctc ctgcctgcaa ttctaacccc ctggaaggtg aagccagcag 19321 gatcccctta agttcaaggt gtgcctggtt attaagtgac acctgacaaa agaaaactcc 19381 aactaggcat gtgaaaatgg agctgagtgt ggcggctcat gcttgtaatc ccagctgtta 19441 gattgtgtgt agcagtggcc ttgtgggtca caactgttta cagtggccaa gtctcaaaaa 19501 acgaaatggg gtgggggagg ctggagagtt ggctcagcag ttataagcac ttgttcttat 19561 agaggaccag ggtttgattc ccagcaccca caccgcagct cacaactatc ttaactctag 19621 ttccagaggt tctgacactc tccccttggc ctctgtgggg ccaggaagca gatgggacgg 19681 acgtaggcat acgtccaggc aaaatgttca tatgcaagga ataaataaac ttaagaaaga 19741 aaaggaacta cagcctcacg tttggactgg atggcagttt caagctgtct cagagaaatg 19801 actccagccc tacccggccc tcctgtgatg tacaggagag tcactgctgc cgcctccaat 19861 tacagcagcc tctgggttct atatagtgtt tgtttcttat ttaaatggat gctaaaattg 19921 gtttcactct ttttgttcac ttatttttag ttctaaaatc tttagctgaa caacaacaaa 19981 aaaatcctta aaactttttt tttttcaaat ttttattaga tattttcttc atttacattt 20041 caaatgctat ccggaaagtc ccctataccc ttccccctta aaacttcttt tttttttttt 20101 aaagatttat ttattattat atgtaagtac actgtagctg tcttcagaca caccagaaga 20161 gggagtcaga tcttgttacg gatggttgtg agccaccatg tggttgctgg gatttgaact 20221 ctggaccttc ggaagagcag tcgggtgctc ttacccacta tgtgtatgaa tgtgtcttca 20281 tgcatgccac atgtgtgcct atgcccaggg aggctagaag agcgtgccgg atccccgaga 20341 accagagtta ctggtggttg tgagcagcct gacgtaggtg ctggaacgga attcagtccc 20401 tgtgtaagag cagcctgtgc gcgtgtaact tcgagggtcc gtgtctcagg tccctttagt 20461 acatttgagt gtgtcttgtg aacaaaaatt cctgaaaaga agttgtgtct gcagagactg 20521 gtagactgat tagaatttca gtgttgtggc aagagtggaa tcttgtcgct tcagaggccg 20581 attactttct cttgggctgg ttttagcttc aggcagcctt tccttgcctt actccatctg 20641 aactgactcc tcgtaataag taaaaccctc tcggaagctt ttaatgtagc agaacactag 20701 cctccaccag gagagctaag aatatcatca gagagcatct tgggcctata aaaagccccc 20761 tggtgtaccc acatctctgg tatacccgtc agtgtatttt ttaagttgct ctaatttggg 20821 ggcttagtgg ttaagagcac tggctgctct gagttcaatt cccagcaacc acatggtggc 20881 tcacaaccat ctgtaatggg atcctttctg tctgatgccc tcttctaaca tgcaggcata 20941 catgcaaata gagcagtcat atgcataaat aaatcttggg aaaagaagcc tcaatttatg 21001 tctttctttg ttctgcgaaa ctataataca cttaatgaaa ctgcttccac attggaacat 21061 ggagtttgtg gtaacctaaa tctgcgatcc taggccttgg tcactcagaa tggttcctga 21121 ataaactagt tccttaaact ctttaagatg agtcgggtgt ggtggcacgc ttttaatccc 21181 agcactcggg aggcagaggc aggtggattt ctgagttcga ggccagcctg gtctacagag 21241 tgagttccag gacagccagg gctatagaga aaccctgtct cgaaaaaaca aaaacaaaca 21301 aacaaaaaaa aaccctcttt aagatgagag ctgtggtttt cacatcgact ctgtgtgtgg 21361 gcacatgtgg gcctcctccc accgtataga tcttggtgct tgaatttagg tagtcaggct 21421 tggtggcagg ttcctgtaac tgctgggtca ttgtacagga ccaggaataa aggaatctca 21481 agcctagaca tcaacccttg ccgcagttag aaatatggca tttctgaatc tcagcctggc 21541 tcagggggca gcgaatgagt accagagctg ggatccccag cacccacagg aaagttgggt 21601 gggcatggta gcctgcctgc ctgagatccc agtgctcctg aggcagagat ggaatccctg 21661 aaaataaagt ggagagccat cggggaagat gcctgacatc aacctctggc ctccacatgc 21721 actcagacac agacacgtga atacacaaac atgcgcacta cacataagct ctctcgcttg 21781 tgcgcacaca ccccacttct gtattgttag ctatccctta aagctatcac ctgttttgat 21841 ttctttttta tatttgagac aaggtctcat gtggtcctga ctagcctaca atttgctgtg 21901 tagctaaggg tgacctgaaa cttgtgattc ttctgccttc acctcccaaa tgctgggtta 21961 tacggtgtca ccaagcccca gctttgcctt tcttcatctt tttcagtgag actaatttag 22021 tcacaggagt caccaacatt aaaacaacgc ttgctgacag ctgctaagta cctactgtgt 22081 gccctgacct tcagccatgt tgattatgcg gtgatccttg atctaatgag ggcgcacagc 22141 tagtgtgtgg cagagctggg ttcaaatgca gggctccgtg gtatgcaaac cagacacccc 22201 tggcaggcag tagttagtta gtccaggaac ttctgtagtt atgtgtcctc tgtatgacaa 22261 cttgcacata actgtgtctg tgactattgg cacccaggag agctccgtga caagcagtag 22321 tagactgatc agtgtgggag tgtggggtgt gtgctgggga agcaggtgtg tagtgagcat 22381 ccattcagtg aattcttagt gccatgtctg ggctgcctgc tagaaacaga tcactctact 22441 cactacttag ttccggtccc cactgacaca ccggtgacat ctctttctgc cccctgttcc 22501 aaagcaagcc cagggcagaa ccaagcactc agcatccttt ctctgcccca cagttggaga 22561 aggaggagca gatccacagt gtggacattg gcaatgatgg ctcggccttt gtggaggtgc 22621 tagtgggcag ctcggctgga ggggccacag ctggtgaaca ggactatgag gtaagcaggg 22681 ttggagaggc cgacaccccg tgtctctatc actaacaccc ccccacctca ccacccccat 22741 ctccatcaca acaccccccc acacctcacc cctatgtctc catcactagc ccccccaaca 22801 cacacaccag ccgccacatg tgattacatt agtgtggaga gccacccacc ctgcctgtct 22861 ccggagggtc ctgggtatgc cgaggagaca gagcaggcat gggggaaagc tgggctgcct 22921 ggattgtggt caaataagga acaagaggcc ctcacttgta aaattcaatt gcagcttagg 22981 tagtggcctg ggaactgggg ttccacagtg tcatgactct ggtaacctac aaaatacagg 23041 ccctttatga ttgatactta attttttcct tgagatgggt ctttcctttt gcccaggctg 23101 gcattgactc ctagactcaa gcagttctgc ctcagcctcc caggaagctg ggactactat 23161 agtgtgctct gtcaagcctg cctcatagga ctggcagagg ccactgtagc tcagagtatt 23221 agcctactgt ttttccagta aagtgatggc ggtgaaaact gcactcacca tgaggcagtt 23281 gacaagtctg agcctaagcc agtgaagaac gaagggtgtg tgcgcgcgtg cgtgcgtgcg 23341 tgcgtgcgtg cgtgcgtgcg tgcgtgcgtg catgtatgtg cgtgtatgtg tatgtgagag 23401 agagggcaga gccaggtcca gccttctgat gttagactac cccagcgttg gtctttgttg 23461 cacagctgtg cctgttgaaa ataaagatgt tttgaatgtt ttgccgcctc cccctccctt 23521 ttgttttttt taaagatagg gtctcactga gccctgattg gttttgaact cacatgtagc 23581 cgaagctggt cttaaatccc cgatcctccc accccatcta atctgactaa gtgctgccat 23641 tacagctgtg agcggccttc tttcccctct gtcctgccga ctggcacctc agggtactgt 23701 caggttcaag gggtgaatgc aggccccttg tcatcttcct ggccatagag agaacgttgt 23761 gtctttcgct gttaaatatg acgatgactg tagggtgggt tcttagctgg tgggttgccc 23821 caagacccca agaccatcag aaaacacaga tacttacatt acaattcaag acaatagtaa 23881 aattacagtt atgaagtaca atgaaaataa ttgtatggtt gaggtcacca caacatgagg 23941 ggccttggga aggctgaggg ccactgctgt aggtttttgt tttgtttttt taagctgagg 24001 aacttcttgt ttgttcctag ttaatggagt tgtttgtttg cttgcttgct tgcttgcttg 24061 cttgcttgct ttttaaagat agggtcttgc tatatagctc tggctgtctg ggaattcccc 24121 ttgtagaaca ggcttgtctc gcactcacag agatcttgct gagtgctggg gttgggagct 24181 cttttcctga gtgctgtctt ggtctgttag gctgggttgc cctcctcact cggcccttcc 24241 tcatccctgc ggccctggat ggcagcctgt gccacatgcc ctcctctctg ggttgagctg 24301 cttctttcct cacactggaa atgggttgca gggccttcta catgttagag aagactacca 24361 cggaggcccg tccccacctg ggctccataa gatctgtttt tgcagaatga ggttgcttgc 24421 accaaaacca cttttattct caaaattata tgccaagtca gcagattttg gtattactcc 24481 aaaacagttc tttgcccagt tctgctcact cttaagtgac ctgagtcact tgttagccca 24541 ggggtcagac agctcacagt cagctgcagc cccagcttca agggatccca cacttctggg 24601 ttccacagcc ctgcaggcac ctacacgcaa atgcacatgt ctgcacatcg acacacatgc 24661 acataaatac agtcaaacta gagacagctg gaagagcaaa gaaaggacaa ataagcagat 24721 cagggggttg gccagggttg cctcctcctc caggtcactg cttctgcgaa tcagcatgct 24781 tcttcccctc catgcagatc acgcaaagga agactggctt ggaggtaatt cgagctactc 24841 tggaagaggg agaaggggtt tgcaagtcaa ggccagttta gggtgctaca tagagcactt 24901 gaggccagca tgagcaagtt ggccaaaatt ttaaaattaa aacaggaaga aaaatggggt 24961 gcaagttgag gctggagctc catggtggag tagtgtttgc ttagaagtct tcattgtgag 25021 ccaagggtgt gcctgagcag ctctttacat ttctgtgtgc atgagcctaa aactcctttg 25081 cattcctcct tatatacatg ctgcctgttc ctcatgttcc gtgtaaagga gggcacagat 25141 actggattgg aaataacttg tagactggga atgtggtctc agaggccact tccttggaaa 25201 agctggttgg cttttctcct gccactactg cattcccttc ttcctctctc ttagtccccc 25261 attctctctg agtcctaccc agcccatcct cagccattgg ctactgccat ctttactgat 25321 agattaaatt ccagctggga gcaaggacct tcagtgtcaa tagacacaga ttcccgatca 25381 gagcatcaga accaccctct acagccctgc ctagaatccc cctgggaggg gctggggatg 25441 tggctcagtg gtagagctcc tgcctagaat ccccctggga ggggctgggg gcgtggctca 25501 gtggtagagc ccctgcctag aatccccctg ggaggggctg ggggcgttgg ctcagtggta 25561 gagcccctgc ctagcatttg tgaaaatcta gctttcatct ccaatacaaa aaattgtata 25621 agagagattc actagagttg aaggtagccc agactagagt aaaggttgtc tttggaagga 25681 agcagaaaag caatgtcaca accagagtgg tttttttctt tacagtgaat aaggcgagac 25741 atttggggtt ccctgccact tagtgactta gtgttggtaa cagctcccag ctgttggttg 25801 tacacacaca cacacacaca cacacacaca cacacgctca ccatgcaaaa tccagcccag 25861 tagttcactt gagattctgg ccctttgtaa caagtgcaga agaagccagc gagtggcttc 25921 tgttctgtat gtttttgttg ttgtttcttt ttcacaaccc cagctgagca ccaagaggca 25981 tagtgttata ttttttaacc atagtaaaag gcttttattt tctaatttta taaatttttc 26041 agtatgtcag ttggaaaaag aggtgaaata aatagaataa gcctaactga acattgctta 26101 cagcatcagg gaacacctgt tctgtcctca ggagggaagg ctagccccag gcagactcag 26161 cactgctctg acagtttggg ctgggggagg tagcttgatt ggtagctttc tttcggattt 26221 ggcccagtgg tcagaagtga ctgcggttgc tctatctcat cagtggcttc ggaagcagca 26281 agtacagcag gtggcttgag aaacaccgtg gccttggttt gcctccccct cccgtcattc 26341 ctatttcctc ctccatttcc ctgcatcctc tgccccaccc tgggatgttt tttgaggaaa 26401 ggtcttatgt agaccaaact tacctgacct tgcatgctta acctcctgcc agcaccccca 26461 agtgccagtg acaggcttgc aggatcgggc ctcacactgg ctccccaatc ccctgcttca 26521 ctgacctgga acctagttct tcagttaagg gtggccgtgc cttctgaccc tccgacctcc 26581 acctcctaag gaattggatt tcaagtgttt ggcaccatgt tccagttatg cttgctgtac 26641 agggattggg ctccagcttc ctgtgtgcta ggcaaacgcc aactgagcca catcctttcc 26701 ccccactccc ttactgtgta gctcagactg gccttgaact agtgatcttg cctctgcctc 26761 ctgagtatta gaactatagg cccgcccccc acccgacccc tcttctttgt gttctgattg 26821 gccagagctg tatcttggtc caactgctgg cagaggggtg gggcttgctt gcactggtca 26881 tgattccatt ggtttgggca cacagctttt tcatggtcca gaactatgtg gtcagttctg 26941 attgtcatct gttccttcat ctgcgtcagt atctccattt gttttcttga gctgatctta 27001 agcaggccaa ttgtgtgttt gagggtgtga gtatatgctg ccttcctgag tcagagtgtg 27061 tggtttgtgg gttctgggac ccctatgggc cacccactcg ttcttacttc tgtgatgggc 27121 ctgccatcac ctgcagccag agaatgtgca cattattttg tggactgaat ttttttatgt 27181 aagaatgtag ttaccagctg ggcatggtgg cgcagccctt taatcccagc actcgggagg 27241 cagaggcagg cgaatttctg agttcgaggc cagcctggtc tacagagtga gttccagaga 27301 aaccctgtct cgaaaaacca aaaaaaaaaa aaaaaagaat gaaagaatgt agttaccaac 27361 tgttactgta agcctgggtt ctgatgaaga gaagacagag ctgttgtccc cactggggtg 27421 aggaacagaa agcaggttgg aatgatagaa acatctgtca cagtgcggaa gggtcacggg 27481 ggctgagcag gtgtgccctt ccctttctac tgctggtggc cccaagagag gggttagtat 27541 ggggtgctga aaactgggtt catccttgca tcaggaggag ctggggtgat gctcctgtag 27601 atgggcaagg ttgcagcgtg cagcctctct agctcctcct gctgacggag cctggtaagg 27661 cacagctggc caccagagct gtctccacag agctgcccca gcctgagggt gtgggccgga 27721 agcatcactt aattcttgat gctgctgtgc tccatctttg gatttgatct tatttctgac 27781 atcctgtaac tcctggcctt tgagctaact caaccacgca tgtccatagg gcacttgagt 27841 actgtacctc tcaccaagat gcctctgtgg gacagtccct tagggcatgt ggtcctctgt 27901 actgcctgat tttacatgaa aatttgtctg gactgcatat ggcctggtga cctcactatg 27961 gagaactgaa atcagcatgt ctggggacaa ccccacatca tgcaaacagc cctgttaaac 28021 tcagcaaacc ttaaccaaga cacaaaatca ggagagcttg ttgggaagaa gaaggggtcc 28081 ccagggagtt ggagagggtc aaggaagggt gggtgaatat gatcaaaaca aattgcatac 28141 atgcatggat agtgtccaag aacaatgcaa aaattgaaaa aaaaaaatct gtcacggcag 28201 cagttgttac cgaagttgga ggagtaaggc ggagagcagt ggagaagaca cccagcagtc 28261 ctcccctgcc tccacatggt acatgtgccc acacacccat gcccatgcac ccagctccta 28321 gaggtgtcat cttacactgt cttctaccta ggtccttctg gtcacctcct ctttcatgtc 28381 cccctctgag agccgaagtg gctcaaatcc caaccgtgtt cgcatttttg gacctgacaa 28441 gttagtccgg gcagcagcgg agaagcgctg ggaccgtgtt aaaattgtgt gcagccagcc 28501 atacagcaag gtacatggtg taaaagctga ggcatgccca gggcaagggg aggtctgcgg 28561 aggcccatcc tgagctcggg actgctcagg tcaggtcctc ctgggacagg cagcttggtt 28621 ctctggaggg ggtgagctga ggcccgtcag gctctgtttg gtatcactgt tactgtcata 28681 ctggggcccg gaactcaggc tcctcctact tctgcttctg catccagaca gcggaggagc 28741 aaagagcccg gggtagtttt gcattccaga agtggctcca gacagagatt gggcacacag 28801 aagtggactc gctacctgcc attggcagtg agggtcaaag agcggtgcag gttgaggggt 28861 gagccctgac tttggtccac agttaggggt cagtgaggtt taatgtcagc atcaccacca 28921 ttgctccctg ggcaccagct tcaactgagc cataaagggc ttccctaagt gaaggggcat 28981 gagagcctag ctttgtccta ttcagacacc ctcacaccca gccctctgtt ccaggactcg 29041 ccatacggcc tgagttttgt gaagtttcac agccctcctg acaaagatga ggcagaggct 29101 acatctcagg taagttgtac ctggtacccc ccagagcctc tccctgcctc tccacccctg 29161 cctgccagca ttcttctcac agcactgacc ttgtgggacc ttagaaggtg acagtgacca 29221 agctcggcca gtttcgtgtg aaagaggagg atgacagtgc caactccctg aagccagggg 29281 ctctcttctt cagtcgtatc aacaagacgt catcaggtgc ctgtgggact cggggtgcta 29341 gggatggcgt tggtggagtg cttacttagc atagagccca atcctgaaat ctccgctcag 29401 gggagacaag gcaggagggc cagctgtgta aggtctcctc tgctaagtag gaagtcttga 29461 gttcagccta ggatgtatga tgcctgagag ctgagggcat ggccaggtct ctaggtctgg 29521 ggaggagggc tggggcctgg actactgagt caggggtaca atctaaactt ctgggtcacc 29581 tctggatgtg agagtgactt gatacctgat agatgggagc taaggtgact gtggccctgc 29641 cctctcaccc tcagtccact tgtccttcca tagcctctac gagtgaccca gcaggaccca 29701 gctatgcagc agctacactg caggcctcta gtgcagcctc ctcagcctct ccagtcccca 29761 aggttgtggg cagctcttcc aaggtgaggt catcagacgc tggggcagtg gagaatgagg 29821 agagccggaa gccaacccat ctctacctcc tcagcctcag gagcctccca aagggaagag 29881 aaaactggac ttgagtctag aagacaggaa acctcccagc aaaccctcgg cagggccatc 29941 caccctcaag agacccaaat gtaagcaaac tggattcctt gattctgtag tgaccctggc 30001 ttgtgtgtca tgagggaagg gctgtggtct tgaacctgcc atcctaagaa gggaggcgga 30061 aagcctgact tcctgtaaga gctggggctc gatggggcac caacctaatt tctgttcttt 30121 tctccccagt gtctgtccct agtcgtactc cagctgcagc tccagcctct accccagcac 30181 agagagccgt cccagggaag ccccggggag aaggcacaga gcctaggggg gctcgcactg 30241 gaccccaaga gcttggcaag atcctgcaag gggtggtggt ggtgctcagt ggcttccaga 30301 accccttccg ctcagagctc cgggacaagg ccctggaact gggggccaag tatcggccag 30361 actggacccc agacagcaca catctcatgt aggcctctga cctgccacac ggttccctgc 30421 cctgccctgc cctaccctgc atgcccccgc tagcttcatc tgtccctgtt ctgctatcct 30481 ccctccctgc ttctttgtgc cctccagccc ccacttcttt acccccttag ttctcagctt 30541 tctgcgtccc tccctaagct ccatctctgt tctgtctcag ctgtccatca gatcatgtct 30601 gacattcctg ttttgcatct cccagctgtg cctttgccaa cactcccaaa tacagccagg 30661 tcctgggcct tggaggccgg attgtgcgta aagagtgggt gctggactgt caccacatgc 30721 ggcgccggct gccctcccgg aggtgaggcc ccacgacctg cctcttaggc agcgcacaca 30781 cctgcctctt agcccagagc aggcaatgaa gaccctgggg cctgctgggt agctcaggcg 30841 atctcaaacc aacataagcc gttttcaaag acgggctggg cctgcagatg ggaagtcact 30901 agcctgggtg ggccactgga ttttgtaatg acggctgctt tgtgcacatt gtgggttgtg 30961 gctgtgcaga ggtggaaggg agcctcaggg ctgtatgcca ttcctcagaa ctgatccttt 31021 ggcatccttt gggtgccagg cagagctgtg actgaagtga ggacgctggg gttgagccct 31081 gtgctcactt cagcgagtgg gtcagaagtg ttagagccca gctttgagga gctcaggagg 31141 ggcacagtga tcgcgggatg agaagcatgg gtagcagcca ggcagagggc atgcagggct 31201 gcagagcaag cagggatggg cagcatgagc atccatctcc tgacgcagtg ggcgggagaa 31261 gaaccactgg ctgccaacag caggagctgt ggagctggag agaagccgac tccgatccca 31321 gtgaggtcgt ggcttgagtc tgtttaataa gcagggctaa ccctggcaca ttcatgtcgg 31381 ggactgtgag gggaggtggg tgggttggtc agttgatctg gaagtggtgg ggactccaca 31441 tcctgcttgg attccacaga tccctgggag ctagttatct agcttcagta ttgtgatcat 31501 cctgaacaga tgtgcggagg gtgtctgtgg gcgcctttat tctctgaggt gtgtaaggat 31561 agttagctag tgtgccacag tagggacttg aatctaggac acacacacac acagtaccaa 31621 gaatcaaaaa tcttaggatg gttgaaaatg aagagatggc tcagtggtta agagcattgg 31681 ttgttctttt agaagacctt gactcagttt ccagcactca catagcaact ctcaaccatc 31741 tgtaactgca gtctcagagt gattgatgac ttctggctgt tataggcacc aagcacacac 31801 atatgatgca tagacatatg tgcaggcaaa gcattcatac gtgtaaaata aaaaagttag 31861 tctttaaata tatacaagtt gcttatataa aatgctagag tatttacata aaaactgcat 31921 agaatctttt catacattta aaaaagatta ttatttaaat tatgtgtatg tgtatctgtg 31981 cggtggtgca catgtgtgtt caggtacgtg aggagaccag ttgtgtcaaa tcccctatag 32041 ctagagttac aggcagttgt gagcacctga tgtgggtcct ggggaaccag atttggatcc 32101 tttgaacagt ataagctcta aacattgagc cgtctctcca gcctcacgta ctgtgagtgt 32161 ggatgtctaa gaactgaacc caaggcttcc cacatgctaa gccacacccc cagcccctca 32221 ctgggggttc tagtcagggc tctaccaccg tgctgtatat ccctagcctt tttggttttg 32281 atttggagtg taggagggtc ttactaagtt gttcagtctg gacttcaatt cactctgttc 32341 ccaggccatc ctgcctctca gcctcttgag tagttgagat tagagacctg ctcactaggc 32401 cagtgtgctc ttacatgttt ttgggttttt tgttttttgt ttttgtttgt tttgtttttc 32461 gagacagggt ttctctgtat agccctggtt gtcctggaac tcactctgta gactaggctg 32521 gcctcgaact cagaaatccg cctgcctctg cctcccgagt gctgggatta aaggtgtgca 32581 ccaccgcccg gcgctcttcc atgttttgaa tcattctgtg gtttgaatcc agcactatgt 32641 aaatgccatg taaatcttta gggaatgacg agcaaacgga gtctgaatgt tgagtgcaga 32701 cacagacttt ctggtctgtg attggttggg tgtacagatg cagagacccc agatgtgggg 32761 ttccacctga atttgcattt gagactttcc cccattgctt ggcattctcc tttgatcctc 32821 aaagagtaga cccccccccc ccattgctct ttcttgcctt aaatggaaca gatcttcccc 32881 atgagggaaa gagcctgggg tggctgaact gcctggcacg tagtggagcc cctctagggc 32941 tgggaggggt cagagggcag caggattcct tctctatccc cacctccctc tacttctcag 33001 tcctctgcct gaggcccagt tcctccccta ggtacctcat ggcaggcctt ggatccagca 33061 gtgaggacga aggggactct cacagtgaga gcggcgaaga cgaagctccc aagcttcccc 33121 aaaaggtctg atgcccctat ggggccagga gggaatactg ggtgccaacc aggtctgctg 33181 aatacctcag cttcctgctt ctccccgctg ctggctgtcc agagcctcac actgccccaa 33241 cctcacctag cccttcttcc tcagcggccc cagcccaaag ccaagaccca ggcagcagga 33301 cccagctcac ccccgaggcc accaacccca aaagagacca aagcaccgtc accaggaccc 33361 caggacaata gtgacactga gggggaagag tcaggttaga atctgaggga gaggtctaca 33421 cgctttatgt ctgagggagg aggagggcta gggccttgag ccctgggttt gggggcaggg 33481 ctggtgtcta gctttctggg tctgaggggg agggctgtat ggaggtgaag gagtagagga 33541 ggggccactg tgtaccctga aaactagatt gagagggcaa ctgaagtcaa gttcataggc 33601 cctggctgta ctctcgggtg ccattctggg aggggtgtta ctagatgtcc caggaaccaa 33661 gcctgttgac tgggttgcta ctggcagaag gacgggacaa cggggcagaa gattctgggg 33721 acaccgagga tgaactgagg aggtaggatt gaagggggta cgtctgcctg gccttgggtg 33781 tgggtgtctg ctccctaagt gtttaccctc cactactctt gatcgcccag ggtggccaaa 33841 cagagggaac aaaggcagcc cccagcccca gaggagaatg gcgaggaccc gtatgcaggc 33901 tccacagatg agaacacaga cagtgagacc ccctcagagg ctgacctgcc aatcccagag 33961 ctcccaggtt agacatccct gactctgtga catgtcccct aggtcccctg ggtctctgct 34021 gtgtcatgca ccacttgctc cctttcctgt gactcccact ttacccctgc tgacagcttc 34081 ctgctctggc aagcacctgt gtgacattgc tgtagtgacc ttcccagtgt gagcaacctc 34141 atttcctgac ttgtgattca ggacttacca tctcgcccct aaacgcagct ctgtctgtca 34201 tcagctctgc tgcccctgga cacctgagtg ctgggcacag tgtgtttgca ctcattactg 34261 ataacgttaa gccagctctg tttgtgcgtg tgctattgag catgattctt ccctgtttgg 34321 tttttatgag acagggtctc agggagccca ggctagtctg aaacctgtag ctttccatct 34381 gcctcagcct cccaggttct tagattacac gtgtgccagt acaccctgcc taccagaaag 34441 ctgctaagaa gggtgtctga gccccctggg gtggaggaga tcctgctgtg ggggttcttg 34501 cctataatcc cagtacttgg gatgtcaagg cagaaggatc acaagttcaa agccagccag 34561 gctactaaga ccctttttaa agaaaagaat ggagggtcag gagagcagga gagaaaagag 34621 cagtcactct caagcagtga gggtcagggt ccctgcagga gccctttcca ggcttgtggg 34681 caatgtcagg agagtattgt gaacccccaa tgaggggagt gaatccttgg acatcgtgtg 34741 gaaataagaa gaaaactagc tcagaggcca gtgtgggagc cagctgaggt gaagtggtgg 34801 ccagcctggg ccaaggccat aagagcccca tggccgtgtg ctttagctac tgcttagcac 34861 caggcagagg cctgcaggag cttcctggtg acttggatga atgccttagt cacggccctc 34921 cctcctgagc accttatcct tggtgcctct gcctcgccct ccctctgtgc ttctgagtac 34981 tcaggagctt ggtcagtgcc tcacttttcc catctctccc gcagacttct tcgagggcaa 35041 acacttcttc ctgtatggcg agttccctgg ggatgagagg aggaggctca tccgctacgt 35101 gaccgctttc aatgggtggg ttgtggggtg ggtgggtaca gagggtatgg gctgcgggag 35161 gatggcgagg atggcgtcag ggcccagtgg catgaccata gctgcatcgt cctgcacagt 35221 cctgctgccc acgatgcttg tcaccctgca gggcccttcc cccacccctc ctgagggcaa 35281 ggctgctgcg tggaaggcac aggtctcaca ggcagctgca gggggcagag gtcaagctaa 35341 aagccaggct cacaaacaag aaagaggaag aaaagagctg gagagatggc tgctcttcca 35401 gaggtcctaa gttcagttcc cagccaccac tggtggctca ggaccatcta taaaggaatc 35461 tgatgccctc tactggcatg taggtgtaca tgcagacaga tgcttaaaaa gatacgtgga 35521 aataaagaag agaggggtta gagaggatgg cgcagggtta agagaactga ctgctcttcc 35581 agaggtcctg agttcaattc cccgcaacca cgtggtggct cacaaccatc tgtaatgggc 35641 tgcagtgctc tcttctggca tgcaggtata caggcagaac actgtgcata gtgaataaat 35701 cagaggcggg gggtggggat atttgtccag aatgagagcc agaaagctac tgctgtagcc 35761 agcagcgtgg cccgtccgag tgtgtgggca gctgtagctg gcagcgtggc cacctgagtg 35821 tgcaagcagg ctgctcacag ctgccccacc cttccctttc ccccaccagc gagctcgagg 35881 actatatgaa tgagcgggtc cagttcgtca tcacggccca ggagtgggac cccaactttg 35941 aggaggtgag ccctggaggg gcagacatgg agaaccccag cccactttgg cattccctct 36001 gtcttccgat tatattccca agtggaccct tttcccgtag gccctgatgg aaaacccttc 36061 cctggccttc gtgcggcccc ggtggatcta cagttgtaat gagaagcaga agttactccc 36121 ccatcagctc tatggggtgg tgccccaggc ctgagtgtac acctgtgtac gtgcagatgt 36181 gcacatgtat gcacaggcct gagtgcacac gtgtgtacac acaagagttt aataaaggag 36241 actttgtttg aagcaacggt ttctctctta ggagactcaa ggttctgcta gcctgggttg 36301 ccccatgttc ctgtcccagg gtctgggaag agaagagtgg ggccctccac cagagatctg 36361 ctaagtagct ccccgcaccc ctctgtagtt ctgtgctgac aggagccagg gagttggaac 36421 tgtgccactc agtgctgtcc cccaaataga tctggtccat cctgtcacac atctatggtc 36481 agtgagctgg aggcattgat gcttatcaca tgtgccagac tgaccaggtc ccttccctgg 36541 tgtcaccagc gtcgccagca gggtttgcat ccaactcggg aggcaggctg ggtgggagga 36601 aggctgttct taggaagttg cccaaattct ggagccccca aggctcagct gggaatggaa 36661 cagagctgtt agggggccaa agcagagggg gatggagtcc gggtggggtg ggggtggggc 36721 agaagccaca tggtgggagt agcctgaggc tgctcacagc aaggggctct ggggcacagt 36781 cctcgcgtgc tgtccattgt ggccaccagt caggtggcca tttcatataa agaaagaaag 36841 gcatagggcc cagttgttaa aggtggaagc aggaggatct ctaagcattt gaggcctgca 36901 tgaggcgcat tgtgagacct tacccacccc cttccctatg aaaagctgga atagggaggg 36961 agagacggat gcaggcaggg aggccaactt cagggcccca gtcccacatt tcaaaacact 37021 ccatctctag aaaagttctt tggatgccct ggagggccgt gaactccagg ctgaggggtt 37081 tggacttcat tctggggtgg gacctgaagg gactttagac agggcaaagg tcagacagag 37141 tcagtcatgg tgagacctga ccctcgggta acctcctctt ggatgtttcc taactttaac 37201 caaccaaggg ttaccgagct ctcccccaac ccccgactgt ctcatgactt cacactggcc 37261 gtttctgggc tcagtcagtg ggtcagtgac tcgtccattc cactgcatcg acctttctcc 37321 ctgtcactct ccgacactgc agccagatc // LOCUS MMAMH 2870 bp DNA ROD 22-JAN-1992 DEFINITION M.musculus mAmh gene for anti-Mullerian hormone. ACCESSION X63240 NID g49945 KEYWORDS anti-Muellerian hormone; mAmh gene. SOURCE house mouse. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 2870) AUTHORS Munsterberg,A. and Lovell-Badge,R. TITLE Expression of the mouse anti-mullerian hormone gene suggests a role in both male and female sexual differentiation JOURNAL Development 113 (2), 613-624 (1991) MEDLINE 92146272 REFERENCE 2 (bases 1 to 2870) AUTHORS Muensterberg,A. TITLE Direct Submission JOURNAL Submitted (14-JAN-1992) A. Muensterberg, MRC, National Institute for Medical Res., The Ridgeway, Mill Hill, London NW7 1AA, UK FEATURES Location/Qualifiers source 1..2870 /organism="Mus musculus" /strain="129" /db_xref="taxon:10090" /dev_stage="14.5 dpc" /tissue_type="testis" /cell_type="Sertoli cell" TATA_signal 352..356 exon 379..789 /gene="mAmh" /number=1 mRNA join(379..789,1162..1304,1482..1590,1678..1837, 1937..>2789) /gene="mAmh" gene 379..2789 /gene="mAmh" CDS join(387..789,1162..1304,1482..1590,1678..1837,1937..2789) /gene="mAmh" /codon_start=1 /product="anti-Mullerian hormone" /db_xref="PID:g49946" /db_xref="SWISS-PROT:P27106" /translation="MQGPHLSPLVLLLATMGAVLQPEAVENLATNTRGLIFLEDELWP PSSPPEPLCLVTVRGEGNTSRASLRVVGGLNSYEYAFLEAVQESRWGPQDLATFGVSS TDSQATLPALQRLGAWLGETGEQQLLVLHLAEVIWEPELLLKFQEPPPGGASRWEQAL LVLYSGPGPQVTVTGTGLRGTQNLCPTRDTRYLVLTVDFPAGAWSGFGLILTLQPSRE GATLSIDQLQAFLFGSDSRCFTRMTPTLVVLPPAEPSPQPAHGQLDTMPFPQPGLSLE PEALPHSADPFLETLTRLVRALRGPLTQASNTQLALDPGALASFPQGLVNLSDPAALG RLLDWEEPLLLLLSPTAATEREPIRLHGPASAPWAAGLQRRVAVELQAAASELRDLPG LPPTAPPLLARLLALCPNDSRSSGDPLRALLLLKALQGLRAEWHGREGRGRTRAQRGD KGQDGPCALRELSVDLRAERSVLIPETYQANNCQGACRWPQSDRNPRYGNHVVLLLKM QARGAALGRLPCCVPTAYAGKLLISLSEERISADHVPNMVATECGCR" intron 790..1161 /gene="mAmh" /number=1 exon 1162..1304 /gene="mAmh" /number=2 intron 1305..1481 /gene="mAmh" /number=2 exon 1482..1590 /gene="mAmh" /number=3 intron 1591..1677 /gene="mAmh" /number=3 exon 1678..1837 /gene="mAmh" /number=4 intron 1838..1936 /gene="mAmh" /number=4 exon 1937..>2789 /gene="mAmh" /number=5 polyA_signal 2841..2846 BASE COUNT 548 a 907 c 871 g 544 t ORIGIN 1 ccagtccttg gaaggttttc tatttgtcct gccttgagcc tattaaacag cctcccccgt 61 gtctccctgt ctgctgtgtg tttggtagtg gggagggtgg atactgcttg tctctgttga 121 agctgtggtg acctggggcg tcctccaggt gggctcccca gggagatggg agctactcaa 181 ggacagctca ggcctctgca gttatgggcc cagctctgag gacagaaagc cctttgagac 241 agtcgcctcc cacctgctgg gcatgaaaag tgccaggcac tgtcccccaa ggtcaccttt 301 ggtgttgata ggggcgtccc tcccaagcaa gcaatctggc tcagccatac atataagcag 361 ggccacccgg accttgctgt accaccatgc aggggccaca cctctctcca ctggtactgc 421 tgctagcgac tatgggggct gtgttacagc ctgaggcggt tgaaaacctg gccaccaata 481 ccaggggcct catcttcctt gaagatgagc tctggccccc cagcagcccc cctgaacctt 541 tgtgcctggt gacagtgaga ggagagggga acacaagcag agcttccctg agggtggtgg 601 ggggtctgaa cagctatgag tatgccttcc tggaggctgt ccaggagtct cgctggggac 661 cccaagacct ggccaccttc ggagtctcca gcactgactc ccaggctacc ctgcctgccc 721 tgcagcgcct tggggcctgg ctaggggaga ctggagaaca gcagttgcta gtcctacatc 781 tggctgaagg tacgtgatgg actctgtctc agcgcccatg atgccttggg actgagtgag 841 ttagggcaga agctggggga ggggggtttg ggggaggggg cctcgctgga ttgctaggca 901 gaaccttata caccttcagg aactgtagtg ggcaagatgg caggacacac cccaatgatc 961 acaccttcca gacagactga cagcagagct gctgtgtcct tacaagcaag gtctccacac 1021 cttggatcag ggatccctca gcctacatgg ccccagaaga tctccagtga agagtccagc 1081 tgagggaatt tctgtaggac agatcttgag ggactgaagc taatggcctc agagactcag 1141 tcaaggaata tttgcccaca gtgatatggg agcccgagct cttgctgaag ttccaagagc 1201 ctccacctgg gggagccagc cgctgggagc aggccctgtt agtgctatac tctggaccag 1261 gcccccaggt cacagtcaca gggactggac tgcggggcac acaggtacca gagactggca 1321 tgggcccatc ccctaacagc cttctggttg aggttgtgcc aaatggaagg gttggtgggg 1381 ttaggagtac acctgggagg ctattcccag tatggcaggg gtggtgcctg tagcctgacc 1441 atcttcccca gtcctgctga caagagtccc tgtgtccaca gaacctctgc cctactcggg 1501 acacccgcta tttggtgcta accgtggact tcccagcggg ggcctggagc ggcttcgggc 1561 tcatcttaac ccttcaacca agcagagaag gtaggtcctg gcagagggga ggggacagag 1621 tggaaggaag ctgcccttaa ccccagccct cagccagcac gtgcccaccc tctacaggtg 1681 ccaccctgag catcgatcag ctgcaagcct ttctatttgg ctctgattcc cgctgtttca 1741 cgcggatgac tcccaccctg gtggtgctgc cacccgccga gccgtcaccg cagccagcac 1801 acggccagct ggacaccatg cctttcccgc agcctgggtg cgcacaggga tggaggagga 1861 actgggacgt tggagggggt agttggtcca ccatagcctc ttgtgctcac agctggccca 1921 cttcgttcta ttccagactg tccctggagc ctgaggccct gccacacagc gccgacccct 1981 tcctagagac cctcactcgc ttggttcgtg ctctgcgggg acctctgacc caggcttcga 2041 acacgcaact ggccctggac cctggtgcgc tggccagctt cccacagggc ctggtcaacc 2101 tgtcagaccc cgcagcactg ggacgcctgc tcgactggga ggaaccccta ttactgctgc 2161 tgtcacccac tgcggccacg gagagggaac ctatccggct gcacggcccc gcttctgctc 2221 cctgggcagc gggcctgcaa cgcagggtgg cagtggagct gcaggcggca gcctcagagc 2281 tgcgggacct cccgggtctg ccacccacag ctcccccgct gctggcgcgc ctgctagcgc 2341 tgtgtcccaa cgactcccgc agctccgggg acccgctgcg cgcgctgctg ctgctaaagg 2401 cgctgcaggg cttacgtgcc gagtggcatg ggcgggaagg gcgtgggaga acgcgggcgc 2461 agcgcgggga caagggacaa gacgggcctt gcgcgctgcg cgagctgagt gtagatctgc 2521 gcgcggagcg ttcagtgctc atcccggaga cctaccaagc caacaactgc caaggcgcct 2581 gccggtggcc gcagtctgac cgtaatccgc gctacgggaa ccacgtggtg ctgctgctaa 2641 aaatgcaggc tcgcggggct gccctgggcc gcctgccctg ctgcgtgccc actgcctacg 2701 cgggcaagct gctcatcagc ctgtccgagg agcgcatcag cgccgaccac gtgcccaaca 2761 tggtagccac cgagtgcggc tgccggtgac gcccgccctc ctcctcccct cccccccccc 2821 ccgcccccag tcagcgccct aataaagatc agcaaacact caaaaaaaaa // LOCUS MMU20156 920 bp DNA ROD 08-MAR-1996 DEFINITION Mus musculus macrophage migration inhibitory factor (MIF) gene, complete cds. ACCESSION U20156 NID g896043 KEYWORDS . SOURCE house mouse. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 920) AUTHORS Kozak,C.A., Adamson,M.C., Buckler,C.E., Segovia,L., Paralkar,V. and Wistow,G. TITLE Genomic cloning of mouse MIF (macrophage inhibitory factor) and genetic mapping of the human and mouse expressed gene and nine mouse pseudogenes JOURNAL Genomics 27 (3), 405-411 (1995) MEDLINE 96047324 REFERENCE 2 (bases 1 to 920) AUTHORS Wistow,G.J. TITLE Direct Submission JOURNAL Submitted (20-JAN-1995) Graeme J. Wistow, National Eye Institute, Section on Mol. Structure and Function, National Institutes of Health, Bethesda, MD 20892-2730, USA COMMENT On Jul 12, 1995 this sequence version replaced gi:841245. FEATURES Location/Qualifiers source 1..920 /organism="Mus musculus" /strain="129/Sv" /db_xref="taxon:10090" exon 1..200 /gene="MIF" gene join(1..200,402..574,718..904) /gene="MIF" mRNA join(1..200,402..574,718..904) /gene="MIF" CDS join(93..200,402..574,718..784) /gene="MIF" /codon_start=1 /product="macrophage migration inhibitory factor" /db_xref="PID:g841246" /translation="MPMFIVNTNVPRASVPEGFLSELTQQLAQATGKPAQYIAVHVVP DQLMTFSGTNDPCALCSLHSIGKIGGAQNRNYSKLLCGLLSDRLHISPDRVYINYYDM NAANVGWNGSTFA" exon 402..574 /gene="MIF" exon 718..904 /gene="MIF" BASE COUNT 166 a 284 c 289 g 181 t ORIGIN 1 actgagctgg gtcacgtagc tcaggtccct ggcttgggtc acaccgcgct ttgtaccgtc 61 ctccggtcca cgctcgcagt ctctccgcca ccatgcctat gttcatcgtg aacaccaatg 121 ttccccgcgc ctccgtgcca gaggggtttc tgtcggagct cacccagcag ctggcgcagg 181 ccaccggcaa gcccgcacag gtttgcaggg aggacacagg aagagagtag ggtggggtgg 241 gccggcccga cgtgtgagga gggatggggc tggaagccaa ggtgtgccgg cgggtggcgg 301 ctggagctct ccggaagacc tgtggccctg taggcagtct ttcaggcggt ctaacagtgt 361 gtctgtatcc ctcccgcctc gccgcccctc cccccaccca gtacatcgca gtgcacgtgg 421 tcccggacca gctcatgact tttagcggca cgaacgatcc ctgcgccctc tgcagcctgc 481 acagcatcgg caagatcggt ggtgcccaga accgcaacta cagtaagctg ctgtgtggcc 541 tgctgtccga tcgcctgcac atcagcccgg accggtgcgt gggggacgag gggaggaggg 601 ggaggagggg cactgggagg tcagcaggca aagagggggg ggcgttcaga ggacactggc 661 acgcagcgcg ctctcctaga ccacgtgctt agctgagcca ggctttcatt ttctcagggt 721 ctacatcaac tattacgaca tgaacgctgc caacgtgggc tggaacggtt ccaccttcgc 781 ttgagtcctg gccccactta cctgcaccgc tgttctttga gcctcgctcc acgtagtgtt 841 ctgtgtttat ccaccggtag cgatgcccac cttccagccg ggagaaataa atggtttata 901 agagaccacg gttgcctcag // LOCUS MMMYOD1 2627 bp DNA ROD 27-JAN-1992 DEFINITION M.musculus myoD1 gene for MyoD1 protein. ACCESSION X61655 NID g53301 KEYWORDS MyoD1 gene; MyoD1 protein. SOURCE house mouse. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 2627) AUTHORS Zingg,J.M. TITLE Direct Submission JOURNAL Submitted (27-AUG-1991) J.M. Zingg, Friedrich Miescher Inst, Ciba Geigy AG, R-1060.524, Postfach, 4002 Basel, SWITZERLAND REFERENCE 2 (bases 1 to 2627) AUTHORS Zingg,J.M., Alva,G.P. and Jost,J.P. TITLE Characterisation of a genomic clone covering the structural mouse MyoD1 gene and its promoter region JOURNAL Nucleic Acids Res. 19 (23), 6433-6439 (1991) MEDLINE 92093599 FEATURES Location/Qualifiers source 1..2627 /organism="Mus musculus" /strain="DBA/2J" /db_xref="taxon:10090" /dev_stage="adult" /clone_lib="Clontech, ML 1009d" TATA_signal 585..592 mRNA join(654..1442,1882..1960,2289..>2539) /gene="myoD1" gene 654..2539 /gene="myoD1" exon 654..1442 /gene="myoD1" /number=1 CDS join(816..1442,1882..1960,2289..2539) /gene="myoD1" /codon_start=1 /product="MyoD1 protein" /db_xref="PID:g53302" /db_xref="SWISS-PROT:P10085" /translation="MELLSPPLRDIDLTGPDGSLCSFETADDFYDDPCFDSPDLRFFE DLDPRLVHVGALLKPEEHAHFSTAVHPGPGAREDEHVRAPSGHHQAGRCLLWACKACK RKTTNADRRKAATMRERRRLSKVNEAFETLKRCTSSNPNQRLPKVEILRNAIRYIEGL QALLRDQDAAPPGAAAFYAPGPLPPGRGSEHYSGDSDASSPRSNCSDGMMDYSGPPSG PRRQNGYDTAYYSEAVRESRPGKSAAVSSLDCLSSIVERISTDSPAAPALLLADAPPE SPPGPPEGASLSDTEQGTQTPSPDAAPQCPAGSNPNAIYQVL" intron 1443..1881 /gene="myoD1" /number=1 exon 1882..1960 /gene="myoD1" /number=2 intron 1961..2288 /gene="myoD1" /number=2 exon 2289..>2539 /gene="myoD1" /number=3 BASE COUNT 507 a 829 c 742 g 549 t ORIGIN 1 aagcttcccg tggaagaaca gatattctct gagccattcc cagatggaga atgaccaaat 61 agcactgcca ccgattcatt tgtgccaggc ttgctcacct agaccttctg agtctcactg 121 tctcttgccc actttgtcct tggctcaact tctctgggtg tgtcatcatt tctccactct 181 tctctagaac tttcattgtc ccgtagcctt gagtctctct ccaaacctcc tgcaatctga 241 tttctaactc ctatcctttg cctggtctcc cacaagtcac cagagtggag tccgaggtca 301 gctccgaagt gagcactgag gtcagtacag gctggaggag tagacactgg agaggcttgg 361 gcaggctgca ccagatagcc aagtgctacc gcgtgtggct gccagtctct ctgccctcct 421 tcctagctag gcagctgccc cagcacagag tcgcgggagg gggcactccc tggcccagtg 481 gctaccctgg ggaccccaag ctccgcccta ctacactcct attggcttga ggcgcccccg 541 cccccagcct ccctttccag ctcccgggct tttaggctac cctggataaa tagcccaggg 601 cgcctggcgc gaagctaggg gccaggacgc cccaggacac gactgctttc ttcaccacac 661 ctctgacagg acaggacagg gaggaggggt agaggacagc cggtgtgcat tccaacccac 721 agaacctttg tcattgtact gttggggttc cggagtggca gaaagttaag acgactctca 781 cggcttgggt tgaggctgga cccaggaact gggatatgga gcttctatcg ccgccactcc 841 gggacataga cttgacaggc cccgacggct ctctctgctc ctttgagaca gcagacgact 901 tctatgatga tccgtgtttc gactcaccag acctgcgctt ttttgaggac ctggacccgc 961 gcctggtgca cgtgggagcc ctcctgaaac cggaggagca cgcacacttc tctactgcgg 1021 tgcacccagg cccaggcgct cgtgaggatg agcatgtgcg cgcgcccagc gggcaccacc 1081 aggcgggtcg ctgcttgctg tgggcctgca aggcgtgcaa gcgcaagacc accaacgctg 1141 atcgccgcaa ggccgccacc atgcgcgagc gccgccgcct gagcaaagtg aatgaggcct 1201 tcgagacgct caagcgctgc acgtccagca acccgaacca gcggctaccc aaggtggaga 1261 tcctgcgcaa cgccatccgc tacatcgaag gtctgcaggc tctgctgcgc gaccaggacg 1321 ccgcgccccc tggcgccgct gccttctacg cacctggacc gctgccccca ggccgtggca 1381 gcgagcacta cagtggcgac tcagacgcgt ccagcccgcg ctccaactgc tctgatggca 1441 tggtaaggcg gtggactcag gaggatgagc aatggagcgg cgccttgggt atctgcaaca 1501 ggtttccgag gcccttgggg tgggggtgtc cctcatacct agatgctcct ggcatctgac 1561 actggagtcg ctttggagac ccagggcatc tatgattctg ccgattgggg gtggaacact 1621 gctgcgcaga ccccgggata tgcttttcct tctcattatt accctaatgc agattattgt 1681 tcctgagtga ctgtccactc tcagtttggc cccgcatgcg acagcttcca gtgtgtggct 1741 ggctcctagc acctggggct gacccagtcc tggaaccagc agctgagact aagggagtga 1801 gggaggggtg atgacaagga gtgttgcttg agacccactc gggccctgta gacctaactc 1861 tgttatcctt gctattcgca gatggattac agcggccccc caagcggccc ccggcggcag 1921 aatggctacg acaccgccta ctacagtgag gcggtgcgcg gtgcgtattc tcagctgttc 1981 ccagctagca ggcctttatc ggccttctgt gtcccccttg aaactttcct cgctccctag 2041 gcttagtatc ctcttcctgc ctccaccaca tacatacccg taccttggga tggtgggggg 2101 tgggggggag gctggggggg gagcattggg ggaggggaca aagaactatg atgcacacct 2161 tcctctcctt tctccttcca gtctagcaag tcctcagttt cccttttgct acaaagctcc 2221 gtgcctatgg gcaggagact tgagaagggc cgcaagtttg gattactaac cttccactcc 2281 cctcacagag tccaggccag ggaagagtgc ggctgtgtcg agcctcgact gcctgtccag 2341 catagtggag cgcatctcca cagacagccc cgctgcgcct gcgctgcttt tggcagatgc 2401 accaccagag tcgcctccgg gtccgccaga gggggcatcc ctaagcgaca cagaacaggg 2461 aacccagacc ccgtctcccg acgccgcccc tcagtgtcct gcaggctcaa accccaatgc 2521 gatttatcag gtgctttgag agatcgactg cagcagcaga gggcgcacca ccgtaggcac 2581 tcctggggat ggtgcccctg gttcttcacg cccaaaagat gaagctt // LOCUS MMOESTEOP 5782 bp DNA ROD 17-FEB-1997 DEFINITION Murine gene for osteopontin. ACCESSION X51834 NID g53520 KEYWORDS calcium binding protein; osteopontin. SOURCE house mouse. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 315; 321 to 5782) AUTHORS Yamamoto,S. TITLE Direct Submission JOURNAL Submitted (09-JAN-1990) Yamamoto S., Dept of Pathology Medical College of Oita, Idaigaoka 1-1, Hasamamachi Oita Gun, Oita, Japan REFERENCE 2 (bases 1 to 5782) AUTHORS Miyazaki,Y., Setoguchi,M., Yoshida,S., Higuchi,Y., Akizuki,S. and Yamamoto,S. TITLE The mouse osteopontin gene. Expression in monocytic lineages and complete nucleotide sequence JOURNAL J. Biol. Chem. 265 (24), 14432-14438 (1990) MEDLINE 90354433 REFERENCE 3 (bases 1 to 5782) AUTHORS Yamamoto,S. TITLE Direct Submission JOURNAL Submitted (21-NOV-1990) to the EMBL/GenBank/DDBJ databases COMMENT **map; chromosome=5. FEATURES Location/Qualifiers source 1..5782 /organism="Mus musculus" /strain="BALB/c" /db_xref="taxon:10090" /clone="12G26" /haplotype="H-2 d" /tissue_type="liver" misc_feature 1..114 /note="poly (CT) repeat" misc_feature 154..159 /note="IRF-1 binding sequence" misc_feature 173..178 /note="IRF-1 binding sequence" old_sequence 315..321 /citation=[1] /replace="tacat" misc_feature 386..447 /note="poly (TA) repeat" misc_feature 456..461 /note="glucocorticoid responsive element" misc_feature 532..539 /note="Ig octamer" misc_feature 536..541 /note="IRF-1 binding sequence" misc_feature 672..676 /note="GF-1 binding sequence" TATA_signal 707..714 /note="pot. TATA box" TATA_signal 717..723 /note="pot. TATA box" exon 743..992 /number=1 mRNA join(743..992,1091..1126,2202..2282,2779..2820,3819..4100, 4788..5608) prim_transcript 743..5608 TATA_signal 772..778 /note="pot. TATA box" TATA_signal 786..792 /note="pot. TATA box" CDS join(939..992,1091..1126,2202..2282,2779..2820,3819..4100, 4788..5177) /codon_start=1 /product="osteopontin" /db_xref="PID:g297546" /db_xref="MGD:13576" /db_xref="SWISS-PROT:P10923" /translation="MRLAVICFCLFGIASSLPVKVTDSGSSEEKLYSLHPDPIATWLV PDPSQKQNLLAPQNAVSSEEKDDFKQETLPSNSNESHDHMDDDDDDDDDDGDHAESED SVDSDESDESHHSDESDETFTASTQADTFTPIVPTVDVPNGRGDSLAYGLRSKSRSFQ VSDEQYPDATDEDLTSHMKSGESKESLDVIPVAQLLSMPSDQDNNGKGSHESSQLDEP SLETHRLEHSKESQESADQSDVIDSQASSKASLEHQSHKFHSHKDKLVLDPKSKEDDR YLKFRISHELESSSSEVN" sig_peptide 939..986 mat_peptide join(987..992,1091..1126,2202..2282,2779..2820,3819..4100, 4788..5174) /product="osteopontin" intron 993..1090 /number=1 exon 1091..1126 /number=2 intron 1127..2201 /number=2 exon 2202..2282 /number=3 intron 2283..2778 /number=3 exon 2779..2820 /number=4 intron 2821..3818 /number=4 exon 3819..4100 /number=5 intron 4101..4787 /number=5 exon 4788..5608 /number=6 polyA_signal 5448..5453 polyA_signal 5588..5593 BASE COUNT 1808 a 1079 c 1071 g 1824 t ORIGIN 1 tctctctctc cctcctctct ctctccctcc tctctctctc cctcctctct cttcctctct 61 ctctctctcc ctctcccccc ttctctccct ctctcctctc tccctctctc tcctgtcaca 121 cacacattgt aaagttctat gtatctttgg aacaaatgtc tgcatcattt taaagtggct 181 ggatgactgt ggagaccagt aaaggaggat cctctatatt aatataaacc agatacagtt 241 atttttagaa aactgtctca cttcaaaaga gaaccttcct taatttctcg cttctgcact 301 tggcttttac agttttcact aaggtccctg tgtgataaca cagactcatc aacatgcaca 361 gaaaagctgt tgcctgctct cagtgtttta tttttattat tattattgtt attattatta 421 ttattattat tattattatt attattagtc caaatagaac atcttactca aattcaaaga 481 tatctttgtt tctttcagct ttgtataatg taagttaaaa tcacatttga aatgcaaatg 541 gaaaaagcaa ttttctttta tcattctatt ttctcttttt cttccttgca gaaagtactc 601 tcatggtagt tcgttgcttt aattactcga ttactgtgaa acaaatcttg gtgtggagct 661 ctagagacat cagatagtgg gggcataagc tctgagactc cagacgtaaa gtttggatga 721 atagagttct gaaattgccc ttttccttgc taaggatgag aggtggagag gtagaaaagg 781 cacacaaata ttgactcact gaaattttct ctgagatgta gaaagattcc ataaattatt 841 ggtgacttgg tggtgatcta gtggtgccaa gagtgtgttt gaacctgaca agacatcaac 901 tgtgcctcat aaaatatgtt gcaggactaa ctacgaccat gagattggca gtgatttgct 961 tttgcctgtt tggcattgcc tcctccctcc cggtgggtac agctgaatct tagagagaat 1021 tcttgaaaat aaatgaattt tagctttagc acactcacac gcatttcctt gtcacttttc 1081 tgttttaaag gtgaaagtga ctgattctgg cagctcagag gagaaggtaa gcacctccgg 1141 gttgatcatt taactgaaga atgttcgcca gcgtcgtaaa ggttgctagg aacagcaatt 1201 ttttgttgcc attttattca tctatataat tagacctgag tatttcaaag tacttactat 1261 gtattttaga cagagacagt accaaatcgg ccattgtttt taataaacag attctcttat 1321 aattagattt ttaatataat aaaagcatgt tctagacaat tgctacttac ataaagcagg 1381 tcacaggacc ttgggtgact ggttctttcc tgagtcagca tgcttggaaa atcctgcagg 1441 ctgactttag aaggtcactc agatgtcagc tggagctgat catatgacta tgatatttac 1501 aataccatcc aagtccatct cgaaaagaaa agtccatttg tgctgtttgg atctattagc 1561 tgactttaga tcacattttc ctaaatgaga atgaaaacca aaccagtacc tttacatgtt 1621 tcaacataag accagcagca ttggcatacc tgtgcctcca cgtaggtaag aatatgctgg 1681 tgtaacacag accctgtgct tggacaaaaa ggcacacaga gttcaggtcc agtgtagagc 1741 gtcattgcag tttcattgct cagcgaaaac agtcagggaa gcacttgtgc acagaatgtg 1801 tgagctggtg gtggggctgc ctgtcttcca acagtctgga gaacatgggt gctcaactac 1861 aataacaaga tgggagaatt aacatttcca tacaggaaag agagaccata actagtactt 1921 taaaatgtca caagagaaat atttttaaat acaaggatgc attttaaaaa gaagtaacag 1981 tataagtcat caaatgggat ttaggatgta gccgaaaagc tcaaaagtca gttttctaga 2041 atttcagcga tgtaggagca gaagaaaata tttaaaaagc ctttaagaat atactcctaa 2101 agatgagcac tcgttgttca tgttaaggtt atggcttaca gagctctgtg tttctttttt 2161 taataatagc aaatgcatac tttttttttt ttttgaatta gctttacagc ctgcacccag 2221 atcctatagc cacatggctg gtgcctgacc catctcagaa gcagaatctc cttgcgccac 2281 aggtattgtg ttttaatttc tcaaaactga acagctgggc tggattgatg gcagagctgt 2341 tcttacagag gacccaggtt cagttcccag aacctagata tcagttcaca ttcttcttta 2401 ctccattcca ggggtctgtt gtagtcctct gatctccttg gggctcggca gatacatggt 2461 acactactac gcatgcaagc aaaacagtta cacacaaaaa ataaagtaaa caaacaataa 2521 atgaacaaaa ccctaaactg ctaattaatg agcagctact aatcaagcca cacaggctag 2581 gtagccagag cttcttgctt agaagtaagt gagtgtctag aggaccaatg gatactttgc 2641 atccattgta acctttgtat ctattgtatc atctggcagg ctgaaatatg acacctgcat 2701 taggttgaaa agggtaactt tagaattaag caactgatca tgtatgttgt gtactgatat 2761 gtgaccttcc tttttcagaa tgctgtgtcc tctgaagaaa aggatgactt taagcaagaa 2821 gtaagttctc acattcactg agacctactt ttagagacac cgaggaagcc accacatcat 2881 tttcttcaga ttgtcagtcc cttagaaagt atccaagtaa atctccattc aaaaacttgc 2941 ttagaggggc tctgtgttca ttcatcacca aaatacacag gctcttcaga gaagtttagt 3001 gttacagaaa gtgcctactc gtgcctgcta tttgaaactt taagcctctt gtaaaatatg 3061 aaaatacgaa aagaagttta cagtatttta actaatattt gatgactata tcatcaatgc 3121 ttagccaagc caagatactt taaagtgttt atgttgataa tattaaagat tatttaataa 3181 gtgttctttc cctatttgct gactaaataa ataaaaattt ccctcaagat taaatttaac 3241 atagagacaa tttatgataa gaaaaactag atcaaaagaa ccataacctt tgtagatttg 3301 tagattaata gattttaatc tctagttcac tgtatggatt ttggctacag tttgtacacc 3361 catgtataac agagccattt cttaactacc tttgataaca gggaggcaat tatagatttt 3421 gatctttata ttaatataca gatgatctga tgtcttaggg aactatgttt cagattaaag 3481 aaactcaatc tagcaatttt ttgaatttag taaattaata atgtatgcta tttacaagga 3541 aatatgtgtt atcaatatga atgttttact tttaagttta acttgaaaat atctaagaaa 3601 gtgatatttt atttaaaaga caaaaaaagt gcctcagaat gttatttcat ttatttgtgt 3661 gttaaaaagc aaacaagagg cccgtttcat tagcttaaaa accatagaat gtaatgtaaa 3721 cacaaagact ttgaaattca attaaaatgt ttcaacgaag ctattgtagt ctaaaaccca 3781 agcgctaatt tactgagcca acttgctttg tgatttagac tcttccaagc aattccaatg 3841 aaagccatga ccacatggac gacgatgatg acgatgatga tgacgatgga gaccatgcag 3901 agagcgagga ttctgtggac tcggatgaat ctgacgaatc tcaccattcg gatgagtctg 3961 atgagacctt cactgctagt acacaagcag acactttcac tccaatcgtc cctacagtcg 4021 atgtccccaa cggccgaggt gatagcttgg cttatggact gaggtcaaag tctaggagtt 4081 tccaggtttc tgatgaacag gtaaatgatg aacagacatg ttgaagggct atggcttgat 4141 ctcacttcta ggaaaccaga gtctgcagat tcactcactc ttcatcaagc actgtacatt 4201 tatatgaaat ctactgtatg acctttgaat acagagcttt ttttttttat tttaatggtc 4261 ctgtttaaaa gtgtgtgtga tgtcttttta attttaccta tcacacacaa tgtcatcttt 4321 ccatatccaa agtcttagca atatttaaga gcaatttatg ttattaaata gctactttgg 4381 aaaatagcaa ttatatgtta aaaataatgg ccatatatag aaaacagtca tatataatgg 4441 aagctaagtg ggtaagagga attgtaaagt cactgttaca atgaacttag tccggcttag 4501 acttagcctt ggcgttactg ttcagtttaa ctttaacgct caatgccttt tcatcagaga 4561 tagaacctta taggttttaa cacacacaaa gatgatatcc ctattaaacc agaaatgatt 4621 gatatgaagc tgactggtga aatataaagt acacactaat attatttttc ttcagtcaaa 4681 attttaaaaa tattcagtac agtttgacct aaaagttaga gatcaaagca ttatgatgct 4741 cttgttcctg gctgtttata taattcttca atttgttctt gattcagtat cctgatgcca 4801 cagatgagga cctcacctct cacatgaaga gcggtgagtc taaggagtcc ctcgatgtca 4861 tccctgttgc ccagcttctg agcatgccct ctgatcagga caacaacgga aagggcagcc 4921 atgagtcaag tcagctggat gaaccaagtc tggaaacaca cagacttgag cattccaaag 4981 agagccagga gagtgccgat cagtcggatg tgatcgatag tcaagcaagt tccaaagcca 5041 gcctggaaca tcagagccac aagtttcaca gccacaagga caagctagtc ctagacccta 5101 agagtaagga agatgatagg tatctgaaat tccgaatttc tcatgaatta gagagttcat 5161 cttctgaggt caactaaaga agaggcaaaa acacagttcc ttactttgca tttagtaaaa 5221 acaagaaaaa gtgttagtga gggttaagca ggaatactaa ctgctcattt ctcagttcag 5281 tggatatatg tatgtagaga aagagaggta atattttggg ctcttagctt agtctgttgt 5341 ttcatgcaaa caccgttgta accaaaagct tctgcacttt gcttctgttc ttcctgtaca 5401 agaaatgcaa acggccactg cattttaatg attgttattc ttttatgaat aaaatgtatg 5461 tagaaacaag caaatttact gaaacaagca gaattaaaag agaaactgta acagtctata 5521 tcactatacc cttttagttt tataattagc atatattttg ttgtgattat ttttttttgt 5581 tggtgtgaat aaatcttgta acgaatgtaa ggagtctggt ggtgtcaatt attctttatt 5641 agtttctcaa aattattttg taatgtctaa aatataacca tttcaatgga catgcccagt 5701 ttaattaaag ccatttaggg ttttaatggt cagaggaaca cagattaccc gggagttgac 5761 ggcagccaga acccaccgaa tt // LOCUS MUSHSP7A2 3518 bp DNA ROD 26-MAR-1994 DEFINITION Mouse heat shock protein 70.1 (hsp70.1) gene, complete cds. ACCESSION M35021 NID g194022 KEYWORDS heat shock protein 70.1. SOURCE Mouse (strain AJ) kidney DNA, clone pM[1.2,2.3]. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 3518) AUTHORS Hunt,C. and Calderwood,S.B. TITLE Characterization and sequence of a mouse hsp70 gene and its expression in mouse cell lines JOURNAL Gene 87, 199-204 (1990) MEDLINE 90236310 FEATURES Location/Qualifiers source 1..3518 /organism="Mus musculus" /strain="AJ" /db_xref="taxon:10090" /tissue_type="kidney" CAAT_signal 507..512 /note="inverted CCAAT box" TATA_signal 547..552 CDS 806..2734 /note="hsp70.1" /codon_start=1 /db_xref="PID:g387211" /translation="MAKNTAIGIDLGTTYSCVGVFQHGKVEIIANDQGNRTTPSYVAF TDTERLIGDAAKNQVALNPQNTVFDAKRLIGRKFGDAVVQSDMKHWPFQVVNDGDKPK VQVNYKGESRSFFPEEISSMVLTKMKEIAEAYLGHPVTNAVITVPAYFNDSQRQATKD AGVIAGLNVLRIINEPTAAAIAYGLDRTGKGERNVLIFDLGGGTFDVSILTIDDGIFE VKATAGDTHLGGEDFDNRLVSHFVEEFKRKHKKDISQNKRAVRRLRTACERAKRTLSS STQASLEIDSLFEGIDFYTSITRARFEELCSDLFRGTLEPVEKALRDAKMDKAQIHDL VLVGGSTRIPKVQKLLQDFFNGRDLNKSINPDEAVAYGAAVQAAILMGDKSENVQDLL LLDVAPLSLGLETAGGVMTALIKRNSTIPTKQTQTFTTYSDNQPGVLIQVYEGERAMT RDNNLLGRFELSGIPPAPRGVPQIEVTFDIDANGILNVTATDKSTGKANKITITNDKG RLSKEEIERMVQEAERYKAEDEVQRDRVAAKNALESYAFNMKSAVEDEGLKGKLSEAD KKKVLDKCQEVISWLDSNTLADKEEFVHKREELERVCSPIISGLYQGAGAPGAGGFGA QAPPKGASGSGPTIEEVD" BASE COUNT 827 a 993 c 1014 g 684 t ORIGIN 1 gatctcttct atttccctat tcaaacctaa aatgaagagg gagggggaga catggacaag 61 caagcattcc acaggcgccc ctgcccaacg ctgtcactca aaccaggacc caatcacaga 121 ctttttagcc aagccttatc ccgcctctct tgagaaactt tctgcgtccg ccatcctgta 181 ggaaggattt gtacacttta aactccctcc ctggtctgag tcccacactc tcaccaccca 241 gcaccttcag gagctgaccc ttaacagctt cacccacagg gaccccgaag ttgcgtcgcc 301 tccgcaacag tgtcaatagc agcaccagca cttccccaca ccctccccct caggaatccg 361 tactctctag cgaaccccag aaacctctgg agagttctgg acaagggcgg aacccacaac 421 tccgattact caagggaggc ggggaagctc caccagacgc gaaactgctg gaagattcct 481 ggccccaagg cctcctccgg ctcgctgatt ggcccagcgg agagtgggcg gggccggtga 541 agactcctta aaggcgcagg gcggcgagca gggcaccaga cgctgacagc tactcagaat 601 caaatctggt tccatccaga gacaagcgaa gacaagagaa gcagagcgag cggcgcgttc 661 ccgatcctcg gccaggacca gccttcccca gagcatccac gccgcggagc gcaaccttcc 721 caggagcatc cctgccgcgg agcgcaactt tccccggagc atccacgccg cggagcgcag 781 ccttccagaa gcagagcgcg gcgccatggc caagaacacg gcgatcggca tcgacctggg 841 caccacctac tcgtgcgtgg gcgtgttcca gcacggcaag gtggagatca tcgccaacga 901 ccagggcaac cgcacgaccc ccagctacgt ggccttcacc gacaccgagc gcctcatcgg 961 ggacgccgcc aagaaccagg tggcgctgaa cccgcagaac accgtgttcg acgcgaagcg 1021 gctgatcggc cgcaagttcg gcgatgcggt ggtgcagtcc gacatgaagc actggccctt 1081 ccaggtggtg aacgacggcg acaagcccaa ggtgcaggtg aactacaagg gcgagagccg 1141 gtcgttcttc ccggaggaga tctcgtccat ggtgctgacg aagatgaagg agatcgctga 1201 ggcgtacctg ggccacccgg tgaccaacgc ggtgatcacg gtgcccgcct acttcaacga 1261 ctctcagcgg caggccacca aggacgcggg cgtgatcgcc ggtctaaacg tgctgcggat 1321 catcaacgag cccacggcgg ccgccatcgc ctacgggctg gaccggaccg gcaagggcga 1381 gcgcaacgtg ctcatcttcg acctgggggg cggcacgttc gacgtgtcca tcctgacgat 1441 cgacgacggc atcttcgagg tgaaggccac ggcgggcgac acgcacctgg gaggggagga 1501 cttcgacaac cggctggtga gccacttcgt ggaggagttc aagaggaagc acaagaagga 1561 catcagccag aacaagcgcg cggtgcggcg gctgcgcacg gcgtgtgaga gggccaagag 1621 gacgctgtcg tccagcaccc aggccagcct ggagatcgac tctctgttcg agggcatcga 1681 cttctacaca tccatcacgc gggcgcggtt cgaagagctg tgctcggacc tgttccgcgg 1741 cacgctggag cccgtggaga aggccctgcg cgacgccaag atggacaagg cgcagatcca 1801 cgacctggtg ctggtgggcg gctcgacgcg catccccaag gtgcagaagc tgctgcagga 1861 cttcttcaac gggcgcgacc tgaacaagag catcaacccg gacgaggcgg tggcctacgg 1921 ggcggcggtg caggcggcca tcctgatggg ggacaagtcg gagaacgtgc aggacctgct 1981 gctgctggac gtggcgccgc tgtcgctggg cctggagact gcgggcggcg tgatgacggc 2041 gctcatcaag cgcaactcca ccatccccac caagcagacg cagaccttca ccacctactc 2101 ggacaaccag cccggggtgc tgatccaggt gtacgagggc gagagggcca tgacgcgcga 2161 caacaacctg ctggggcgct tcgagctgag cggcatcccg ccggcgccca ggggcgtgcc 2221 gcagatcgag gtgaccttcg acatcgacgc caacggcatc ctgaacgtca cggccaccga 2281 caagagcacc ggcaaggcca acaagatcac catcaccaac gacaagggcc gcctgagcaa 2341 ggaggagatc gagcgcatgg tgcaggaggc cgagcgctac aaggccgagg acgaggtgca 2401 gcgcgacagg gtggccgcca agaacgcgct cgagtcctat gccttcaaca tgaagagcgc 2461 cgtggaggac gagggtctca agggcaagct cagcgaggct gacaagaaga aggtgctgga 2521 caagtgccag gaggtcatct cctggctgga ctccaacacg ctggccgaca aggaggagtt 2581 cgtgcacaag cgggaggagc tggagcgggt gtgcagcccc atcatcagtg ggctgtacca 2641 gggtgcgggt gctcctgggg ctgggggctt cggggcccag gcgccgccga aaggagcctc 2701 tggctcagga cccaccatcg aggaggtgga ttagaggcct ctgctggctc tcccggtgtg 2761 gtctagaaaa cagactcttt gcacttgata gctgcttggg caccgattac tgtcaaggtt 2821 atttaaagtc ttcttcatgg ttcagtttaa agttacagtc tttcttaagg taattgcgtt 2881 gactgttaaa ttttgtatgc atatatatat atatatatat atatatatat atattcaaat 2941 atattcaaag taatgttggg agcagcactg tgcactgtac caggggatta tgttttatag 3001 ctaatgatgt gtaaagtcta aagatttttt tgtaattttt atatcagtgt tccagtagcc 3061 tgggaagaca tatagtctag ctgcccagtt ccctggagat ggtcatctct aagacaaagt 3121 gtcttaaaca aacgtcttgg cactgtgtac tacataactt tactcttttg tacttaaaac 3181 tttatctgct tgtccatgtt aaggttttgt ggtataacca gtatgttctt tgcatttaat 3241 ctaagtaggt taaagatggt gtatccttcc tgcatacatg tctacactgc caccctgtgt 3301 acattttttt ctttgcatca ctacaaacta atgaaaaaaa cttttatgac ttaaatattc 3361 aaaataaaag gttacaagta tattttgtct gtttgtatgt tggaagggct aatggattct 3421 gggcttctgt ggatttctta agtttttttt aagatttatt attatatgtg aacacattgt 3481 agctatcttc agacacacca gaaaagggca tcagatct // LOCUS MUSMETIII 2649 bp DNA ROD 01-JUL-1992 DEFINITION Mouse metallothionein-III gene, complete cds. ACCESSION M93310 NID g199133 KEYWORDS metallothionein. SOURCE Mus musculus DNA. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 2649) AUTHORS Palmiter,R.D., Findley,S.D., Whitmore,T.E. and Durnam,D.M. TITLE MT-III, a brain-specific member of the metallothionein gene family JOURNAL Proc. Natl. Acad. Sci. U.S.A. 89, 6333-6337 (1992) MEDLINE 92335292 FEATURES Location/Qualifiers source 1..2649 /organism="Mus musculus" /db_xref="taxon:10090" TATA_signal 777..783 exon 808..894 CDS join(864..894,1089..1154,1965..2074) /codon_start=1 /product="metallothionein-III" /db_xref="PID:g199134" /translation="MDPETCPCPTGGSCTCSDKCKCKGCKCTNCKKSCCSCCPAGCEK CAKDCVCKGEEGAKAEAEKCSCCQ" intron 895..1088 exon 1089..1154 intron 1155..1964 exon 1965..2212 polyA_signal 2191..2196 BASE COUNT 603 a 744 c 681 g 621 t ORIGIN 1 gctagcatcg gtggtgtgca cctttgatcc cagcagctgg caggtggata cagaagggat 61 taagagttta ggtcttcctt ggctgcaaaa caatttgtat aattccttgg ttacataaaa 121 aaaaagacta gactggacta cctaagactc tgtctcaata acaaagagaa gcaacctcga 181 actacctcca aacagagaat ggtggtccct agtgggtgtc tctgtaaaat ttatctgggc 241 taaggaggtg ctcagggtcc tgtagaaggg ctttatagga aatattggga cagaaaaggc 301 cacgagtctt cccaccaaac tgcaggtgtc aggtaggatc tgtcaggtct cccctttcta 361 gccttcttca agcatcttgg gagcatcttt gctgctgctg ctgctgctgc tgctgctgct 421 gctgctgctg ctgctgctgc tgctgctgct gctgctgctg ctgctgagat gatcagcagc 481 aggctcactg ctcagcatcc cgtttggacc aaactgatca agacaatctg ggagaagagg 541 gagaagaacc tacgggaagg ggcaatacta atttgtctct ctcaacttgc aaagatggta 601 ctgcgcaggc accttcaggg agacggctgc agtggcaagg agtggacagc ggacaggcta 661 ctttggtcct acactcagtg gagactcggg acagcagtgc acacacacgg agagcaggcg 721 ctgtgcgtgc ataggggcgg gcgccaagtc gctgcttgcg cgcccccgcc tggggctata 781 aaagccttgc cacctgctgc cctggctacg tagcgcatcc gcttgccggg aggaaccaag 841 ctacggcggc tgctggactg gatatggacc ctgagacctg cccctgtcct actggtgagc 901 cccttccccc tcctgcagca ctttgccctt tcctggcaaa gaacccactc ctctgtcttc 961 actcaaggac atttggggga ggagtccctt ccccctaccc ccatctttaa ccctgtgatg 1021 atgataatct tcatttaggc atggggacgc caggtttccc tagtataatt cttcgtgtgc 1081 tctcttaggt ggttcctgca cctgctcgga caaatgcaag tgcaagggct gcaaatgcac 1141 gaactgcaag aagagtgagt gcaccccccc acccccaacc cctgccataa cctcccgccg 1201 cgcccacccc acccccacca gacactatga agcaaagctt cctgctgcag accttcagat 1261 accgaggcta ttacaaccaa tgttaattaa cccaatgtaa gggggatatt ttgatgaatt 1321 tccggaagct attaagatag ccgctttgaa agctgagccc cactcttcga tttttccaga 1381 gcccgaggat ccccaaggaa gaatgggggg gtcagcttta aagttgtagt gccttggccc 1441 ttcttaggct ttgtggggtc tcctcaatct gctctttcca gatttcagcg atgtaatatg 1501 cttggggtga ggtgtagagg taggaccagc gctttcttct tctcgagtgt ggctatgcat 1561 agatctcacg gatgtgctgg cctcgacatt atccctgccc cttccctgct gcaattccgt 1621 ctcagtacct tttacattgg cttgcctttt ccaccagacc taactctccc tctggcccag 1681 gcctgagccc agcattccca gggaattccc tagcacccac ccaaagagct gtgtgctagt 1741 tatagggtga ctctctacag aggcccggca gtcactgcag taccatggat gctgagtcag 1801 acctgatgtg gcagtatgaa gggagagcca cctgtcctgg gcaaagcctg atgactgtcc 1861 agcctcccat actgctttcc tctgcccttt gatggagaca gatgccgaca tcagactggg 1921 cacatatgtg cgcccgcgca cacacacaca cattttatct acaggctgct gctcctgctg 1981 ccctgccgga tgtgagaagt gtgccaagga ctgtgtgtgc aaaggtgaag agggggccaa 2041 ggcagaggcc gagaaatgca gctgctgcca gtgaggaccc agaccctccc acacagccta 2101 tgtaaatagt gctgggtgtc cctggtgggg cacaactgtt gtcttccccc cccccccccc 2161 ccccccgccg gctgcctgct ccggggtgtg aataaatccc atgcacaaca tgaacccaag 2221 actggtctct ttttcaagtg cgaaggatgt ggaagggtgg gggaggccac tcaagccgga 2281 gatttaggct tcccacctgt ttgggaccgg gacagagccc tggagacagt acagacgtgc 2341 atgtgctcac acacatgtgc acacacacag ataccgtcat gtgacactcc attctcctct 2401 gaagtccacc taagctcaga gagggaagat ggcaggtcag gcccatagag gtttatctgc 2461 ccaacctaga aacctttatc atctgtatag aggggctaca gggaattaaa atcagactct 2521 ggcaccaaac gcttgtgatt tctgtaatgt tctagttgtt cctccaaaaa ctgccagctt 2581 ttatatactg gaggagaatc caggctaccc cggactttat cttggcagaa ctgtgtctcc 2641 aaggtcacc // LOCUS MMU21795 5267 bp DNA ROD 25-MAR-1995 DEFINITION Mus musculus common cytokine receptor gamma chain gene, complete cds. ACCESSION U21795 NID g727349 KEYWORDS . SOURCE house mouse. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 5267) AUTHORS Cao,X., Kozak,C.A., Liu,Y.J., Noguchi,M., O'Connell,E. and Leonard,W.J. TITLE Characterization of cDNAs encoding the murine interleukin 2 receptor (IL-2R) gamma chain: chromosomal mapping and tissue specificity of IL-2R gamma chain expression JOURNAL Proc. Natl. Acad. Sci. U.S.A. 90 (18), 8464-8468 (1993) MEDLINE 93391374 REFERENCE 2 (bases 1 to 5267) AUTHORS Cao,X., Shores,E.W., Hu-Li,J., Anver,M.R., Kelsall,B.J., Russell,S.M., Drago,J., Noguchi,M., Grinburg,A., Bloom,E.T., Paul,W.E., Katz,S.I., Love,P.E. and Leonard,W.J. TITLE Defective lymphoid development in mice lacking expression of the common cytokine receptor gamma chain JOURNAL Immunity (1995) In press REFERENCE 3 (bases 1 to 5267) AUTHORS Leonard,W.J. TITLE Direct Submission JOURNAL Submitted (27-FEB-1995) Laboratory of Molecular Immunology, NHLBI, NIH, Bldg. 10, Rm. 7N244, Mail Stop 1674, Bethesda, MD 20892, USA FEATURES Location/Qualifiers source 1..5267 /organism="Mus musculus" /strain="129" /db_xref="taxon:10090" /map="between Rsvp and Plp" /chromosome="X" promoter 1..1194 mRNA join(1195..1333,1524..1677,1894..2078,2269..2411, 3048..3210,3665..3761,3949..4015,4353..5267) CDS join(1219..1333,1524..1677,1894..2078,2269..2411, 3048..3210,3665..3761,3949..4015,4353..4538) /note="IL-2R; interleukin-2 receptor gamma chain" /codon_start=1 /product="common cytokine receptor gamma chain" /db_xref="PID:g727350" /translation="MLKLLLSPRSFLVLQLLLLRAGWSSKVLMSSANEDIKADLILTS TAPEHLSAPTLPLPEVQCFVFNIEYMNCTWNSSSEPQATNLTLHYRYKVSDNNTFQEC SHYLFSKEITSGCQIQKEDIQLYQTFVVQLQDPQKPQRRAVQKLNLQNLVIPRAPENL TLSNLSESQLELRWKSRHIKERCLQYLVQYRSNRDRSWTELIVNHEPRFSLPSVDELK RYTFRVRSRYNPICGSSQQWSKWSQPVHWGSHTVEENPSLFALEAVLIPVGTMGLIIT LIFVYCWLERMPPIPPIKNLEDLVTEYQGNFSAWSGVSKGLTESLQPDYSERFCHVSE IPPKGGALGEGPGGSPCSLHSPYWPPPCYSLKPEA" polyA_signal 4981..4986 BASE COUNT 1438 a 1144 c 1347 g 1338 t ORIGIN 1 cttttaaagc agaatctgaa aaaattagtg taattttacc agagagaaaa tagagtagca 61 aatcaggcag aggcaaagag cagctggctg gttcctacct ttgtttaact gtgttttgga 121 gaatctccac atctatgctg tagaactcat tacagaacat tgtattattt atatgttccc 181 cacttatctc tgagcttcta aaatgatgat gtcttatttg tcttatgttc tcagaacata 241 agcactgtac cgagcacata ttaaagactc aataaatgtt ggctggataa acaatttcag 301 taaatggctt ctccaatcaa ccctgtgctc tgagggaagg taatctggcc acagaatgaa 361 ggatggactg gagagcagag gcccttgaga aaggggacca gtttgtgggt tacgggaata 421 atcatgactg gaggtaatga aaggctgatt tagcacagtg gctgtggtta acagaaagga 481 ggaaactgct gggagaaata ccgcagaaac agggctgatt ggattctcag tgtgagagag 541 aaggggggga catgaaaagg atcctgaggt ttcaagtcgg gcaggtgatg atgctattta 601 ttaagcagga caaggaagac agaaagacaa gcagatttgc agggagctag gaagtctgaa 661 gttagtacta gttagtactg ctctcccatc ttagaagccg gctctcacta gggtcaacag 721 ggaccaaact caggcagcag ttaggggtgg ctattctagt ttggattagg tcagaggaaa 781 gacagctatg ttaagtaccc acatgaatca tgtcagtact ttccatccat cctccctaga 841 ctggagaact ttgacagatg tttaagatag cctagaggga aaaggtggct gggaatgatg 901 gtgtgtggtg ggggtgggtg ttcagcagag ccttcctgga cctagggtcc tgaagggtct 961 tgattctgtg acactgttgt tgatcaagta tataatcttg acagaacatc accttagagc 1021 agaacccaaa tctccctggg gacttagctt atgtcactga acacatttac caaccccccc 1081 tctctctaca gcgtggtttc taaggttctt tccaccggaa gctacgacaa aaggaaatgt 1141 atgggtgggg agggcttgtg ggagagtggt tcagggttct gacacagact acaacccaga 1201 gaaagaagag caagcaccat gttgaaacta ttattgtcac ctagatcctt cttagtcctt 1261 cagctgctcc tgctgagggc agggtggagc tccaaggtcc tcatgtccag tgcgaatgaa 1321 gacatcaaag ctggtaggaa acctaggacc agagggagtt gttgagagga aggctatggg 1381 gaaagggcct gctagtgctc actataatga ctaaaacgaa gtgtgcagag ggggagggga 1441 aggctctctg cctcactgct gcttcttctg accaagaatt ctttttcttt cactccacta 1501 tttcattttc ttcccaaact tagatttgat cctgacttct acagcccctg aacacctcag 1561 tgctcctact ctgccccttc cagaggttca gtgctttgtg ttcaacatag agtacatgaa 1621 ttgcacttgg aatagcagtt ctgagcctca ggcaaccaac ctcacgctgc actataggta 1681 tgagaagggg gagggtagca cgggaagaag aaaagggagg ttagctggga gagactgctt 1741 gaggacccaa tcaagtgggt agccagctct tcaggaaccc taccagtttc tcatgggatg 1801 cattgtcagt tcagaccaga tgaggctagc taatgggcat atgcatgccc atgtttggcc 1861 catcattctt ttgccttgtt aacccttctc taggtacaag gtatctgata ataatacatt 1921 ccaggagtgc agtcactatt tgttctccaa agagattact tctggctgtc agatacaaaa 1981 agaagatatc cagctctacc agacatttgt tgtccagctc caggaccccc agaaacccca 2041 gaggcgagct gtacagaagc taaacctaca gaatcttggt aatcgggaaa gaagtagcca 2101 agagagcagg gagcttaaag acactggagt ttatagattg ttggccatgg gcagaaaaga 2161 gaagataggg gggttgggat ggggaaggga ggagggataa ggggaattac ctccaagatc 2221 ctgacttgtc taggccaggg caatgaccac acacatacat atctccagtg atcccacggg 2281 ctccagaaaa tctaacactc agcaatctga gtgaatccca gctagagctg agatggaaaa 2341 gcagacatat taaagaacgc tgtttacaat acttggtgca gtaccggagc aacagagatc 2401 gaagctggac ggtgagtgac ttggcgttgt ggatgcagtg gctaaggcca agcaagagaa 2461 gaaatggttc aactagccag acagaagaat ctaataccag gtcccagttc tctgcccccc 2521 aactcctctg tcttcacttt cacttttttt ttctgatctc accagtgagc tttcactttg 2581 cctctcccac catcctcttc ctgatgttag acaaaaagga agccagtgag tgggtctgga 2641 ggggagtcaa gttagagaag gggagcagta tgttctgatt agtcgggaga ggagcaaagt 2701 acaactagga gggaccagaa ctggaaagaa caagaaatta gaagagccgg gtgttggtgg 2761 cgcacgcctt taatcccagc acttgggagg cagaggcagg cggatttctg agtttgaggc 2821 cagcctggtc tacaaagtga gttccaggac agccagggct acacagagaa accctatctc 2881 ggaaaaacca aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaagg aaagaaagaa 2941 agaaatgaga agaatctgaa gatgacacat tgaggtatag gcatgaatgt tcccattctt 3001 gcttccttta ttgataacga tctatccctc acccttcttc ttcctaggaa ctaatagtga 3061 atcatgaacc tagattctcc ctgcctagtg tggatgagct gaaacggtac acatttcggg 3121 ttcggagccg ctataaccca atctgtggaa gttctcaaca gtggagtaaa tggagccagc 3181 ctgtccactg ggggagtcat actgtagagg gtaaagtggc ccaaatgcat gacctctaaa 3241 tcattcagcc agtaccctag cctttgtaac aacacactgt cactatcgtt tgctttgtct 3301 ctctagccct aagcctctac cccccccctt aactgtttaa cctcagtctc tatgaagcag 3361 ggttttctat ggagggatga gaaggggtgc agctgaaata ggaggccaaa ggatgctcta 3421 gggggatatg atatgtgtaa gaggtctctg taggattcca tagactctgc agagtggaga 3481 tgagcttgga tctgggctgc cctctttgat ttagcttatt catctgtaaa atgtagataa 3541 acattgttcc tccctctcag gagctgtgtg gaaagtaagc ataaggagtg tgtttggcgt 3601 ggtgcctggc atccatgtct tatgagtacc acaatgcaag ggtcattttc cttgtttgtt 3661 acagagaatc cttccttgtt tgcactggaa gctgtgctta tccctgttgg caccatgggg 3721 ttgattatta ccctgatctt tgtgtactgt tggttggaac ggtgagattt caggaagccc 3781 ttaaaatgag gtgggggtag tggggttatt tcaagatctc cagggtagat catttaagag 3841 ctatgataag agtgttgaga ggaagctaga ggttccatgc tggccagttg gataaagggt 3901 aataaaggat ctttcattta atgctgctcc ttctttctct ctgctcagaa tgcctccaat 3961 tccccccatc aagaatctag aggatctggt tactgaatac caagggaact tttcggtgag 4021 aacactacca tacacatact acagtttatc aactgccaac tgccagtcag caagacagac 4081 ttgggggggg ggcagtgcac agcggaggga ggaggggccc tgtaccatca ggatgtggct 4141 gaccaaatgg agggtgggct aggcagagag acggagccaa agagatttgt gtgatacaga 4201 cggaaactac agggcattag gagccctgga gcccagatag cccttctgta atcacaatgg 4261 ttacagattt gtgagagatc cgttggccca gagcctgggt cttttgcttc ctgcccctaa 4321 ttgacctctg acctggagct atctgtcttt aggcctggag tggtgtgtct aaagggctga 4381 ctgagagtct gcagccagac tacagtgaac ggttctgcca cgtcagcgag attcccccca 4441 aaggaggggc cctaggagag gggcctggag gttctccttg cagcctgcat agcccttact 4501 ggcctccccc atgttattct ctgaagccgg aagcctgaac atcaatcctt tgatggaacc 4561 tcaaagtcct atagtcctaa gtgacgctaa cctcccctac tcaccttggc aatctggatc 4621 caatgctcac tgccttccct tggggctaag tttcgatttc ctgtcccatg taactgcttt 4681 tctgttccat atgccctact tgagagtgtc ccttgccctc tttccctgca caagccctcc 4741 catgcccagc ctaacacctt tccactttct ttgaagagag tcttaccctg tagcccaggg 4801 tggctgggag ctcactatgt aggccaggtt ggctccaact cacaggctat cctcccacct 4861 ctgcctcata agagttgggg ttactggcat gcaccaccac acccagcatg gtccttctct 4921 tttataggat tctccctccc tttttctacc tatgattcaa ctgtttccaa atcaacaaga 4981 aataaagttt ttaaccaatg atcatcaaga atgtctgtta caggggttgg ggagcagggg 5041 aaatatggcg aacaatgaag ttgggaggaa tggggtgagg gtaggccaaa ggggtgggat 5101 agtcatactg aggaaggaag aggaccacag attagctgaa aggagtactg ttttggctac 5161 ttactattct ctggtaaatt tctgggctca ctccaggggt cagtttagag ttatctttgc 5221 catgttggcc aattcctcga agctccttta cagatcatcc agtgtct // LOCUS MUSROM1X 2787 bp DNA ROD 14-JUL-1993 DEFINITION Mouse rod outer segment membrane protein 1 (Rom1) gene exons 1-3, complete cds. ACCESSION M96760 NID g293778 KEYWORDS disk morphogenesis; disk rim; peripherin-related protein; rod photoreceptor; transmembrane protein. SOURCE Mus musculus (strain BALB/c, sub_species domesticus) (library: lambda EMBL3) adult liver DNA. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 2787) AUTHORS Bascom,R.A., Schappert,K.T. and McInnes,R.R. TITLE Cloning of the human and murine ROM1 genes: genomic organization and sequence conservation JOURNAL Hum. Mol. Genet. 2, 385-391 (1993) MEDLINE 93278386 FEATURES Location/Qualifiers source 1..2787 /organism="Mus musculus" /strain="BALB/c" /sub_species="domesticus" /db_xref="taxon:10090" /dev_stage="adult" /tissue_type="liver" /tissue_lib="lambda EMBL3" exon 637..1241 /gene="Rom1" /number=1 mRNA join(637..1241,1601..1847,1938..2434) /gene="Rom1" gene join(652..1241,1601..1847,1938..2156) /gene="Rom1" CDS join(652..1241,1601..1847,1938..2156) /gene="Rom1" /codon_start=1 /product="rod outer segment membrane protein 1" /db_xref="PID:g293779" /translation="MAPVLPVVLPLQPRIRLAQGIWLLSWLLALVGGLTLLCSGHLLV QLGHLGTFLAPSCSFPALPQTALAAGTVALGTGLGGAGASRASLDAAQYPPWRGVLTP LLAVGTAAGGGLLTLALGLALALPVSLNQGLEEGLEAALAHYKDTEVPGRCQAKRLMD ELQLRYHCCGRHGYKDWFGVQWVSNRYLDPSDQDVVDRIQSNVEGLYLIDGVPFSCCN PHSPRPCLQSQLSDPYAHPLFDPRQPNLNLWAQGCHEVLLEHLQGLSGTLGSILAVTL LLQILVLLGLRYLQTALEGLGGVIDGEGEAQGYLFPGGLKDILKTAWLQGGLAHKPAP EEAPPDEEPPKEVLAEA" intron 1242..1600 /gene="Rom1" /number=1 exon 1601..1847 /gene="Rom1" /number=2 intron 1848..1937 /gene="Rom1" /number=2 exon 1938..2434 /gene="Rom1" /number=3 BASE COUNT 577 a 771 c 816 g 623 t ORIGIN 1 cgaggggagc cgtgccaccc ctcggggcgc tgacgagcgt gggggtcagg agttagaggg 61 gggcgctgtg gaaccggttc ttttactctc tcggggttag gctcccgggg tgcggtgacg 121 ggccgtccgg agcccctgcg ctgcagcaca aacaggctcg ggacacggag tcgccagcaa 181 gcgcgggagg cgggccggag ggggggggcg ggccgcaggt aacccttccc tgtccgggac 241 ccgaggctca ggccgacggg gacacacggc tgacccttca ccgcccaggc tcttgcccgg 301 cttccggccg caggctccca gaggacctgg gaggggacgt agagcttgag atctgggtct 361 cttaagcctg aagcccagcg taccaaactc acttccgcga aagggcaggg cggggtcacg 421 aaatgggcct taagcccggg ggcggggcct ttcctaaagg gcaaggctaa gaagcatcct 481 gaatcggagc ctgcagaggg ttattaaggc tagagtggaa ggatgtccag ggtattgggg 541 tcagggtggc attagccctg ttgaagccag gctgggctga ctcagcatcc tgcctgccct 601 agccaacccc aagccccgac ggcttctcac tcccttgggc agagatggga gatggcgccg 661 gtgctgcccg tggtgctgcc cctccaaccc cgtatccgtt tggcacaggg catctggctc 721 ctctcctggc tgctggcatt ggtcggtggc ctcaccctcc tttgtagcgg gcaccttctg 781 gtacagctgg ggcaccttgg caccttcctg gcaccctctt gttcattccc tgctctgccc 841 cagactgccc tggcagcggg aacggtggct ctaggcacag ggctaggagg cgcaggagcc 901 agccgggcaa gtctggatgc agctcaatac cccccctgga gaggggtctt gacgccactg 961 ctagcggttg gcacagctgc aggtgggggg ctgctgaccc ttgccctggg gctagccctg 1021 gctttgccag taagtctcaa ccagggactg gaggagggcc tggaggctgc cttggctcac 1081 tacaaggaca cagaggtgcc tggacgctgt caggccaaac gtctgatgga tgagttgcag 1141 ttgaggtacc attgctgcgg gcgccatggc tacaaggatt ggtttggtgt tcagtgggtc 1201 agcaaccgtt acttggaccc cagtgaccaa gatgtagttg agtaagtgac tgatttgctt 1261 cttttgtctt cctcttcctc cttagccttg agattcttac ctggacgtgc gataaataga 1321 gcacggtggc tgagaggctg ggttataatt ctatttatca gcccagggca tgttgctaaa 1381 ccgctgtggg cccattcctc atctgtaaaa tggagataac tatgcccgtc tttatcttcc 1441 agacagactc ctatcttaca gagttgtgga gataaatgaa ttatgcaagc ttgaatagtg 1501 aagaactcag taaatgttgt ataatcattg ttctcaactg cccctgctct cagatgctcc 1561 tctctgccgc ctcatgacta tcttgtctat gcctttgcag ccggatccag agcaatgtgg 1621 aaggcttata tctgattgat ggcgtcccct tctcctgttg taatccccac tcaccccggc 1681 cttgcctgca aagccaactc tcggacccct atgcccatcc actcttcgat cctcggcagc 1741 ccaacctaaa cctctgggcc caagggtgcc atgaagtgct gctggaacac ctgcagggtt 1801 tatcaggcac actgggaagt attctggctg tcaccttatt gctgcaggtt agtcagctaa 1861 gggccctgac tcctactttc agctctgtga ccccaaaccc actgacaatc accctttgca 1921 cttgcttccc cccacagatt ctagtgctcc ttggtttgcg gtatttgcag acagcgctgg 1981 agggccttgg aggagtcatt gatggggaag gagaggccca gggctatctt tttcctggtg 2041 ggctgaaaga catactgaaa actgcatggc tacagggagg gcttgcccac aagccagcac 2101 ctgaggaggc cccaccagat gaagaacctc ccaaggaagt tctagctgag gcctagaaac 2161 ctgaagaggt gggggtggga aggggaaaga tggacaaatg tggaaaactt acaattcgtt 2221 actcagactc agggaaggaa aggtccctgg ggttatagga gttaagagca agcggagctg 2281 gggctggaat gagaagtcca ggtgtcctgg aacccagctg tctcgttgga aaccaccaag 2341 gttgagaaca gatgagggct acagggaagt gacaaccaaa ggactagaga atgtttctag 2401 catccagatt ctacaataaa gtttgtggac agaaatcact gttctcccgc tctcttcctc 2461 tctctgtctc tgtctctctc tcttggctgg gagcctgaaa gactctgcag aggaggaacc 2521 cagcctatgt aagaacatgt acctgatctc ttgggtaggt ctgccaaatg aaccaaacca 2581 tgaagcccct ccggttgaga ttctttattc tagaggtagg aaggggatcg catgctcagg 2641 tggggagggt accagcccag ctcctctagc cccccactgc atgcttgtcc ccaataagtt 2701 acccttttcc cacagctacc ctcctttctg agtccatcag ttccttgctc cgccccagat 2761 gggtcgatct ggctcactag agaagct // LOCUS MMU63716 2324 bp DNA ROD 02-FEB-1997 DEFINITION Mus musculus cytochrome C oxidase subunit VIa heart isoform (coxVIaH) gene, nuclear gene encoding mitochondrial protein, complete cds. ACCESSION U63716 NID g1813481 KEYWORDS . SOURCE house mouse. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 2324) AUTHORS Wan,B. and Moreadith,R.W. TITLE Structural characterization and regulatory element analysis of the heart isoform of cytochrome c oxidase VIa JOURNAL J. Biol. Chem. 270 (44), 26433-26440 (1995) MEDLINE 96064721 REFERENCE 2 (bases 1 to 2324) AUTHORS Schmidt,T.A., Jaradat,S.A., Goodman,M., Lomax,M.I. and Grossman,L.I. TITLE Molecular evolution of cytochrome C oxidase subunits: rate variation among subunit VIa isoforms JOURNAL Unpublished REFERENCE 3 (bases 1 to 2324) AUTHORS Jaradat,S.A. and Grossman,L.I. TITLE Direct Submission JOURNAL Submitted (12-JUL-1996) Molecular Medicine and Genetics, Wayne State University, 540 E. Canfield Ave, Detroit, MI 48201, USA FEATURES Location/Qualifiers source 1..2324 /organism="Mus musculus" /db_xref="taxon:10090" gene 1143..1699 /gene="coxVIaH" CDS join(1143..1215,1402..1538,1616..1699) /gene="coxVIaH" /note="similar to cytochrome C oxidase VIa heart isoform encoded by GenBank Accession Number U34801" /codon_start=1 /product="cytochrome C oxidase subunit VIa heart isoform" /db_xref="PID:g1813482" /translation="MALPLKVLSRSMASAAKGDHGGAGANTWRLLTFVLALPGVALCS LNCWMHAGHHERPEFIPYHHLRIRTKPFAWGDGNHTLFHNPHVNPLPTGYEHP" BASE COUNT 600 a 579 c 539 g 606 t ORIGIN 1 gctcagtggg ggaacattta attagcatgc acacagccct gggattcaat tcccagataa 61 ttttttaaat gggcacctaa ctagttttaa gcatattact tgtctactct gtgctagcta 121 ttgagaacag aatattaaat gaagtcattc cgtgccacta tgaaaaaatg gtgcatattt 181 atttagatag acagtccaga gttgcccagg atagctttag tacatattta tttagataga 241 cagtccaggg ctggcaagat ggctcagtgg ttaagagcac tgactgctct tccaaaggtc 301 ctgagttcaa atctcagcaa ccacatggtg gctcacaacc acccgtaatg agatctgatg 361 ccctcttctg gtgctagaag tcagctacag tgtacttatg tataataata aatgaagctt 421 tgggctggag caagcaggga cttagtgatc agagttgacc agagcgagca aaggtcctaa 481 aaattcaatc cacaacaacc acatgaaggc tcacaaccat ctgtagagct actgtatact 541 cacatatata aaacaaataa atctttagat agacagtcca gtgtagccca ggatagcttt 601 gaccttggta cgtagtccaa gatggccttg aactcctgac ccttctgcct ccatgtccca 661 agtgctggct ggcattataa ataagtgcgc cacactgaga ttgtgtagca gtagggaaag 721 aactcaggac attatgggta cttggtaggc agacactcca ggcggtctac caactgagct 781 acactcctac ccccacccca ccccactgta aaatttgttt taaagacaag atcttaggta 841 gttcaagctg tgatcaaact cattaagttg acaacagtgg ccttgaactc tgatcctcca 901 gcttacacat ttcaagtttg tatcacttta aatcttctct tgccaaaata aaaggcacat 961 tgcatcagct gcctgaagag gatctcctgc cagtcaagac ccactttaga tagagaaagc 1021 ccctaaaaat agccatccat gtgcgggctc aacaggtgat tggctctgag aggggaggag 1081 agcctctcga ctgggtgaag gagacagaga aggacagtgc cattcctagc ctccctttga 1141 caatggctct gcctctaaag gtcctgagcc ggagcatggc cagcgcagcc aaaggagacc 1201 atggaggggc aggaggtaag tggggccagg cctgactctg atttcgtagc ctagccgatc 1261 tgttctcttt gcccggcttc cccctccttc tctctcctgg ctcctctctc ctgatctacc 1321 aggcacttgg ctttcagacc tctaggccca gggatagttt ttgcctatga atctactcac 1381 accccacctt tgctcctcca gccaacacct ggcgcctcct gacctttgtg ctggctcttc 1441 ccggcgtagc cctctgctcc cttaactgct ggatgcacgc tggccaccac gagcgcccag 1501 agttcatccc gtatcaccac ctccgcatcc gaaccaaggt acgccagagg atgagctgca 1561 ggggggattg cggggaggga ctgtggtggg tgacctgtct gccttctctc tgcagccctt 1621 cgcctggggg gacggcaacc acacgctttt ccacaatccc cacgtcaatc ctttgcccac 1681 cggttatgag cacccttgat gtctcagcag acacgctctg ccagcaatct tcaaattggc 1741 cttctgcaca ccggctctga gagccactga ggttccagtg gacagttcca agctcaataa 1801 aggtgtggaa gttttgtgtc ctctggctct ttgggaacag gatggtggaa ggggctgggc 1861 aggctcttgg gcagttggta tctgggttcc agttattttt ttttttttat ataaggaaaa 1921 tgtgatgttt ccacctgcat tttcatttta ttttttaaaa atgtatacaa gcaatacatt 1981 gaatattgag catattactc agtcttgtgt tagttcttat ccatatgtaa atttatgctg 2041 accccattcc tgctcttgca taaaaagtgg atcagcactg aaatctgaca catgggcaaa 2101 ccataaaaat ctcccagctc aggaggtagg gcatcatttc tgggaagttt cccaggctag 2161 cccagagctg aagagaactt gttcccatgc cagtagattc ataggcaaaa gcaaaaccaa 2221 tctcgtaatt gtggagatat acaggaaagt agattcctag gacgtgagat gtggaggagt 2281 ctcatggttg gaatctgcct ggccggctgg gtcaggatct gcag // LOCUS MUSALCALB 3045 bp DNA ROD 15-OCT-1992 DEFINITION Mouse alpha-lactalbumin gene, complete cds. ACCESSION M87863 NID g191870 KEYWORDS alpha-lactalbumin; lactoprotein; lactose-synthetase. SOURCE Mus musculus (strain BALB/c, sub_species domesticus) liver DNA. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 3045) AUTHORS Vilotte,J.-L. and Soulier,S. TITLE Isolation and characterization of the mouse alpha-lactalbumin-encoding gene: interspecies comparison, tissue- and stage-specific expression JOURNAL Gene 119, 287-292 (1992) MEDLINE 93013000 FEATURES Location/Qualifiers source 1..3045 /organism="Mus musculus" /strain="BALB/c" /sub_species="domesticus" /db_xref="taxon:10090" /tissue_type="liver" TATA_signal 538..546 exon 561..731 /number=1 CDS join(596..731,1033..1191,1660..1735,2516..2576) /codon_start=1 /product="alpha-lactalbumin" /db_xref="PID:g191871" /translation="MMHFVPLFLVCILSLPAFQATELTKCKVSHAIKDIDGYQGISLL EWACVLFHTSGYDTQAVVNDNGSTEYGLFQISDRFWCKSSEFPESENICGISCDKLLD DELDDDIACAKKILAIKGIDYWKAYKPMCSEKLEQWRCEKP" intron 732..1032 /number=1 exon 1033..1191 /number=2 intron 1192..1659 /number=2 exon 1660..1735 /number=3 intron 1736..2515 /number=3 exon 2516..2865 polyA_signal 2847..2852 BASE COUNT 688 a 737 c 711 g 908 t 1 others ORIGIN 1 ggatccaagt agtagttgag tctcatgcta aatgccacca tgttccatcc cttttcccaa 61 ggctctcagt tatgagtctc catatcaagg ggctttcctg gactttgtcc tatggctagg 121 ttggacagac aaatatcacc tttgatccta ggatgtgata catccccttt ccacgttctg 181 tatgtgttta ggggtaagca tggagttggc tgtagccaac tgtgttttcc agtcacctcc 241 cttgtattgt ctctgaagcc tcctttgttc caaaagtagg ttaaggaaat cctgcttcct 301 ggaagcagcc ctaaaagaaa tgaaggttta ccagagccaa gtgagaagct gggtcatgtg 361 tggaattatg tgggaagaaa acaatacttg gtattgactg gatcgaggag atggggggag 421 ggtggcagga tggagggagg ctggcaggct cagggtttct attttggcat aagcatctct 481 tcatcattgt cttcctagag agaaggcccg gtgccaggag gccagaggcc ttcttcatac 541 ataaaagcag atgaagtgag cggtgtctgc attacaaggt ccaggagcag tcaaaatgat 601 gcatttcgtt cctttgttcc tggtgtgtat tttgtcgttg cctgcctttc aagccacaga 661 gcttacaaaa tgcaaggtgt cccatgccat taaagacata gatggctatc aaggcatctc 721 tttgcttgaa tgtgagttca tactacgtcc ctgcttcctt ccattcccac ctctcccttc 781 tcctccttct ctgtctcctc gtggtgtggt ctcgtacctc ttcttgttct gtcttgacct 841 cttgtattca attacttctc ctgccctgcc tgtctcccat tgtctggcct ttctcatttc 901 accaagataa tctggacttc accacttttg gagattggtt gggaagcccc cccccccccc 961 ccgggaaacc agtgtttcaa ttatggtact ctgagacatc tctgagaaag ttctttcttc 1021 tgtctccctc aggggcctgt gttttatttc ataccagtgg ctacgacaca caagctgttg 1081 tcaacgacaa cggcagcaca gagtacggac tcttccagat cagtgacaga ttttggtgta 1141 aaagtagtga gttccccgag tcggagaaca tctgtggcat ctcctgtgac agtgagtatc 1201 ctctttcacc acactcttct gtatctctac agcctgcctc cgagtcctcc caggcagcct 1261 ccttgttggt acccagcatg tctcaggtgt gctgggtacc atatgacttg agatagcagt 1321 gcttagtaag aacctggtca ctggtcatga tctccacatg cccagagagt cccctaaggt 1381 tcagggaagg gtagtttggg acacttgtga gtcttgaagt ctgatacgct gtattttcag 1441 ggcgtgatga atctgatgct gtctcgttga ggtgtcctag ggaagggaat ggagttccca 1501 tggaggggag gcattatcag actggccgtt tcatatagaa agtctctagg ccaaaccttc 1561 aactcagaac cagttcccct gactctgggr ccagaggaac cagtccactg agctgtactt 1621 cttatttgtc ttgttccttc ctacactctg ttataacaga gttattggat gacgagttgg 1681 atgatgacat agcgtgtgcc aagaagatcc tggctatcaa aggaatcgac tactggtaag 1741 tcgtcgtctt tgctctcttc tgacctccct ctccagacca tttaattttg gatggagtct 1801 ccaaaacaaa actcatctag agtcccaaag gtctacattt aaaatgcttc ttcgtgtcat 1861 ttttgtgtga caacttcccc gatatgaagt tccatgttgg aagactgatg tccaggactc 1921 tcttcccctg gccagttctc atccttaaag cccttgtggt attattccac atctttccta 1981 gatctgaact cttattgatt ttattattat tattttggct tgttgagaga ggtttttgtg 2041 agtgagtgtt tgtgtgtgta tgtgtgtgtc tgtggtatgt gtgtgtagcc ctggctatcc 2101 tcgaactcac tccgtagacc aggctggcct caaactcaca gagatttgct tacctatgcc 2161 ttccgagtgc tcagattaaa gccatgctcc actagtccta ttattattat tgttattgtt 2221 gttgttatta ttatagtgag cctcattcca cttagaagcc catgccattt cttctctaca 2281 atctttcacc tcaccctgag gacgaaccta aatgcagctt tatgctagtc actgacaacc 2341 atcattaaga ggggagaaag tgtgtgtgtg tgcacgtgcg catgtgagat agctgagcaa 2401 agtatgatga tatacacgta tgaaaatgtc ataacggaac agttatttgg catagttaag 2461 ttaaatagaa agttccagcc agcccattca ttttgtccat ttcttttgtg atcaggaaag 2521 cctacaagcc catgtgctct gagaagcttg aacagtggcg ttgtgagaag ccctgagccc 2581 cccccccccc cccccccgtc cttgctgctc ctgccccgtg gtcaggaatg cctcttccct 2641 aaggctacct cagcttggct cttgctattc ctgtgaagat gatctgcctc tgagccttgt 2701 accctgtagt gacaccaccg gactctagag gacttttttt tccctatggg agtgtgactg 2761 gcgcactgga ctgcaaaccc ttgcttagtg acggcgaggg tctcgatggg ggttttacaa 2821 aatcgagaga gccctctcct gtcccaaata aagggccaga cttgatgtgt ctgtcggtgt 2881 tttctttcta cagggaaagg gtggaaatag ggcagggctc tgaagttcca ggtcgtagcc 2941 actccacccg ctgctgaagg aggagagctt accacaggaa catgaacagc attatctgta 3001 tcaatttgtt ggccatcagc aaccacaggg gatacaagtt gtcga // LOCUS MMU02884 8765 bp DNA ROD 03-FEB-1995 DEFINITION Mus musculus mammary-derived growth inhibitor (MDGI) gene, complete cds. ACCESSION U02884 NID g409956 KEYWORDS . SOURCE house mouse. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 8765) AUTHORS Treuner,M., Kozak,C.A., Gallahan,D., Grosse,R. and Muller,T. TITLE Cloning and characterization of the mouse gene encoding mammary-derived growth inhibitor/heart-fatty acid-binding protein JOURNAL Gene 147 (2), 237-242 (1994) MEDLINE 95011621 REFERENCE 2 (bases 1 to 8765) AUTHORS Mueller,T. TITLE Direct Submission JOURNAL Submitted (26-OCT-1993) Thomas Mueller, Max-Delbrueck-Centre for Molecular Medicine, Robert-Roessle-Str. 10, Berlin, Germany FEATURES Location/Qualifiers source 1..8765 /organism="Mus musculus" /strain="ICR SWISS" /db_xref="taxon:10090" /tissue_type="liver" TATA_signal 1490..1495 5'UTR 1516..1556 gene join(1557..1629,5071..5243,6734..6835,7952..8005) /gene="MDGI" CDS join(1557..1629,5071..5243,6734..6835,7952..8005) /gene="MDGI" /note="related to heart fatty acid binding protein" /codon_start=1 /product="mammary-derived growth inhibitor" /db_xref="PID:g409957" /translation="MADAFVGTWKLVDSKNFDDYMKSLGVGFATRQVASMTKPTTIIE KNGDTITIKTQSTFKNTEINFQLGIEFDEVTADDRKVKSLVTLDGGKLIHVQKWNGQE TTLTRELVDGKLILTLTHGSVVSTRTYEKEA" 3'UTR 8003..8243 polyA_signal 8219..8224 BASE COUNT 2153 a 2279 c 2343 g 1990 t ORIGIN 1 gagctcagaa attcatttat gtccctcttc ctctgaatcc ccaagtcagg acaggtccct 61 cctagagtca ctcccagagg gagtacttta gcctgtacct tgatccatgt ggacttctcc 121 aaagcagggc ccgcacaaac caggagctta ctttgagaga gtatcttgcc ttggctcaga 181 ctggccccca actcaggaat ctcttgcctc caggtgcagg tgccgtatta actgttggct 241 caagccagag cagcggcaca cagctgtaac ccaggagaca gaggcaggag aatctttagt 301 tcaaggtcaa cccctgctac acattgagat gctgacttgg taataattaa acagaaactg 361 ctgaactaag gaataggctc cactgaggtt cccttactca cctgtaaaaa ggggatgata 421 ccacctacca acgaaaaagt tgagtgtgac catgccctga agtaggctac aaccatcaat 481 agtcgggtct tatttaataa cgtactttaa ggtgacaagc agtctagtgg cagaagtcag 541 gggaaaaaac tgacttcagc agagggtcgc ggctttccgg gagttaaggt ggccgaggcc 601 ggaagaaccc tctgaataga caaattgtct tcgcggagtg aagaacgacc ctggcacaag 661 ctcagaggtc agtaaataaa gcctgaagcg ctttcaggca gcggcgacgg gtgggactgc 721 ggagaaaggc gcaggcggga gacattccgc agggaggggc tagcacgtgt ggggctagca 781 tgagggaagc aaggtcacgt tctccgccag caggtgaggc gctgggcagc tcagccatcc 841 gcggtgtcca aggcaactct tttccacttg tctggtagga gcaagagggc tcaaaggcca 901 ctagaccatg ctctctgtcc aggctccaat tcttttttac ttacggcgac cgcgtcattc 961 ctctccgagc ctctgagcct cttctacaag aagaggacat aggaccgttg agatgggttt 1021 ttgggtaaag gcccttgctg tcaagccttg acaaccccag tttgatacgt gggacccaca 1081 cggtggaagc agagaaggga ctcccgcgag ttacaacgaa cgccccagtc tcccacccct 1141 tccccataag tacgcctaca cgagcataca caatataaga ataaaaccac agcgaattaa 1201 aaaacaaggc ggcagaagga tcaagcggcg tttctccagc gtggcaccag ctcaagggcg 1261 agtttccttt cagtatggcc gggggatgct ctacttgggt tgcgggaagc gccccgcagc 1321 caggccaggg atgggttaga tggcaccaac aggaccgcgg gcgccgctga cgtaggcgac 1381 gggagggctg tgggggatgg gccccagccc tttgcgggag tgcaagcccc ggcttcctat 1441 ttcgggagcg aggggtgtgg gccactttca tcatgtgatg cgagggctat ttaaagaggc 1501 tgtccagccg ggagctgcgg ttctcagtgc ctgctcgcct cctcactcat cgcaccatgg 1561 cggacgcctt tgtcggtacc tggaagctag tggacagcaa gaattttgat gactacatga 1621 agtcactcgg tgagcgaacg aacggcgcag gatctagggt caggagggcc ggcaaggcgg 1681 tcttggcgct gagctcccag ggggagtgcc cccatgtgcc tcccgcaagc tcctagccag 1741 tccagacagg gaatactgag gtgcggaggg tggcctgggc tgaagccact ccactccacc 1801 ccaccccacc ccggcctcct gggagggggg tgtcgcggtc caagcttggc gagcctcgta 1861 gctggagggg aagggtagag gcagctgtgg ccgcagaggt ccgggatggg aggctttcta 1921 ggaagcagtg taggtgatcc ggaggtggaa aggggaggga aagaagggcg ggaggctggc 1981 cgcaggagaa ggcaaagagg agcatggtgg tccagaaatt gaattccgaa agggaataga 2041 gcagctagga gtgtacagag cctggaggaa gactaaagaa aatcagtgaa ttccatctgg 2101 gaagaggtga agatacagcc aggcagtcag caacaagccc tacccctcca tgttggggta 2161 gtgaagaggc ctctctctgg aagatgccct ggttctcacc agcctgacct tcacctacag 2221 tgtgtgcagc cacccctggg atcagtcgga gacgctgctg ctagagcagg gcaagacgac 2281 cactacacat aggcttcccg gccgcaccag tcggccaccg gatcagtgct ggggataggg 2341 tgaagaaagc ctgggatcga gcagagggtg tcagaagaaa ggtgaagagc tatgaaggga 2401 gagtgtggct tggggctggg gaaattgtgt ggtgtgggcg gtgacacaac gcctttaacc 2461 agcactctgg gaagcagagg caggtgaatc tcccgagttc taggcctggt ctatacagag 2521 agaattccag gacagccagg actacacaga gaaaccatgt cttgaaaaaa aaaagaaagg 2581 aagagtccca tgatttactt aataggaaga cagcttggga cacatgagct catcgcctca 2641 taggaaagcc caggatttct ttttgaagac tgaactagag ccttgtgcat gccccctact 2701 gctgagttat actccccact cacacacaca cacacacaca ccctcttttt actctgtgta 2761 acaggttctc actaacttcc ccaggttggc tttgatcttg taacctgcca tctcggcctt 2821 ccaaatagtt gagaacccag aactacctag agttcttccc atttcaacag tggggaatgt 2881 cacatgaacc acttatccaa gacggcccca gcccttcctt tcttgccttg agcttagata 2941 aagacctcta cctgcggagt ccctggctat atcatcctgg tctaggaggc tggggcaggg 3001 aaaacaggac tgtgtcatgc ctgagctagc ttccactccg tcttccccgg gaaggagggc 3061 tggaatcgga catgttgagg gatgtgtgta gttgcctctc acctacttcc agctcttctc 3121 tgaaacaggc ccacaaagca atttgtcctt ttggtttggg gaatggaacc caaggccctt 3181 tgcccgtgct aagcaagcac tgcgactgct gaaccacatc tccagacagg gctcccaccg 3241 gcacctaccc taccctgagg ctctccagga aggcagctgg tcttgtcttt taagacaggg 3301 ttttactgtg tatccctggc tctcctggaa ctccctaagg agaccaggtc ctgttgtctc 3361 tgcctctccc gcactggggt tagaggtatg agcccacacc cagctagttg gctgtcaagt 3421 tagggagtac tagattacct gggcttagtt ctgtttcact gagctgccgc tgccttccta 3481 gacttctttt tgcctcaggg catgttgtct tcaggccatt gttctgtggg tgctgaaccc 3541 agctttagtc aggggactga aattctattt agcctaaaaa tatcgacagg ctgaaggcca 3601 gtaaagtcta gatgcacccc agcttccagc agtaactggc ttcactggca ccctacacct 3661 acctgtaggt gggtcctggg aaaacagctt ggttcactgc tgaagcccaa ctcacacaga 3721 taccagtagg caaagccaag cctctcactt cttgctgtgt agccaaagct cctgacttat 3781 cctatagaac caaaggttct taggacaaag cagcccagcc tagtttaagt gacttcaagc 3841 acagatggtg gcttcaaggg tagagtatgt tattccaaga atgatatagt gagactaaaa 3901 gagagtttgg gaatctatgt cactaaactc ggattattta tttacttagc ctttttgaga 3961 cagggtttca ctatagccca tactctggaa gctacgcagc ccaagctggc cttgaattct 4021 cggcatttcc cctgctccag tcccctgcct cttgagtgag attctagggg agtcaccatg 4081 cttggcccgt ttgactttgg ctaagacagc accgtgtctc agcctcgttt gtaaatggaa 4141 actataaagc ttaggtttag ggctgtccca aggatacttg gccaccactt agagcttgtg 4201 cagtgtgcac agcttgcagc agatgcacaa taccatagcc ttatattggg ctgcctcctg 4261 ccacatcgct gaggatggct cagagtgtgc tggggccaga cgacaggtag tcaaccatgg 4321 aagattccag gaaagctact aacccaaagc accaaaggct tgacccaagg ggtctgtgaa 4381 ctttacctgc ttgagggaca cctgggacct tgcctaggac tcagatccaa tgattatgtc 4441 aggagtctcc ccagggactt ccaagtcatg cagttgtcgc tacttttttc agccctctac 4501 gtctgtggta gacaagactc ctttgtatct ctaaccagga ggctttgaaa ctgacgctgc 4561 catacagacg gcagagagca ctgctgtctc agttttctgg gtggaaatgg gagacgaccc 4621 ttgtccaggg gactctagaa ggcagttgac gatctcttgg ttcttcagtc ctgttctgtg 4681 tgttcaggag actagaagcc agcgggtacc cagctctgga gcgacacagt gcttagcagc 4741 ttccatctga attgtgaccc tgttcacatc tacaaaatct tccaggggca ggactgcgtg 4801 aggggctggc tatgtggctc acaccccatg gtaccttcct tgccgctgtt gccgaccctg 4861 ccctgcatcc acatgtctca tatttgacat gagtttaaca gctgccgagc taagggaaag 4921 agctaaggga atgagaaaat ggccccacac aaagacgtgg gccactgagg atcgccttcc 4981 ccatgcctag gccacaaact tctctttggg tctatataac acgtctctac cctacctaat 5041 gtgtcctcaa ctatctgccc ctgcccttag gtgtgggctt tgccaccagg caggtggcta 5101 gcatgaccaa gcctactacc atcatcgaga agaacgggga tactatcacc ataaagacac 5161 aaagtacctt caagaacaca gagatcaact ttcagctggg aatagagttc gacgaggtga 5221 cagcagatga ccggaaggtc aaggtgagtc agagaaaggg gatggagggc actggatgga 5281 acaccacagg gtaagaggct ggccctctta gctccttggc tttttaaccc caaggggcag 5341 gttcataaag cctgctcagt gctgcgatgg cccagggact aagtataagc tctggcctgt 5401 tttcttctca ccttcctggg aaggatctat cagctgtcac tggagtgggc agcagagcca 5461 agatattctt ccgaccttgt gcctccagct ccagctgggt gaccttgtac accaaggtac 5521 ccagtggctc tgaggacatc agcagccatg gctgtgacct gaagtgtttt aggaatgatg 5581 cccagaagtc agtgctctac agcgaggtca tcctggctct ggcagcagag gaggcagctg 5641 ggaacaacaa gctgagttct atccacaggt ttcctgctct gtagctccca ggctagacca 5701 ctgaccggaa caaactgttt ccagtgccac cagttgacag gcacctgcgt ttaccaacag 5761 gagtgctgtt gggtgtccat agtctcttca agtatttatt taagggtatg ggagggagtg 5821 tctacctcca gtgatgctca gttatctcat ggtggttatc ttcttagcag cttgctcaaa 5881 atctcccagc actgggccac gcacagaact aagtaacatc tgtatctata ggttgtcctt 5941 atacagtatc ctctacatca caaccctaaa attcatctct tcagctcttt cctcctcaca 6001 gttgccccca gaaagaacag ggtaggtacc aaccagtctt gcagttacag aggcgctaaa 6061 gcccagccca ggaccacaca gcaagctgga agtgaagatc cccaaggcgc cctgctgcac 6121 cctgctccac cacatgtccc acctacctac atctttgaac ttgccatctt ccttgagaga 6181 ccagatttca cattaaatat aacagtggtc cactgggatg tctggacaat ggggaatgaa 6241 gattccagaa ggtggaatat agacaggaaa gagatgagtg acagccatca cctttctaga 6301 ctcctaccga caaaccctgc ttagctctcc ttctgacctg caactcccaa attcctaaag 6361 cagatagatt tggggcagct gcccaacacg aactagctga gataggctgg cagcaccaag 6421 gaccgaatgg tcactggagc tagagctaga gaacactcca aggatgcctg ggtccttggt 6481 ccaaggacct tgcaattgcc gctcttttgc cagttaaggg aggtcacatg gtagaaacaa 6541 aagtctaatt tgccaatata tcaaggctga gctgcctgcc atgcccactg ccagagggcc 6601 ccgcccctgc agagcaggaa gcttgggcag ctgggctgga ggtgggctgt ctgctgatcc 6661 cagcaccatc actggttagg ttcctggttt gtcctggctc tggtcacagc ttaggctctg 6721 tctctttcca cagtcactgg tgacgctgga cggaggcaaa ctcatccatg tgcagaagtg 6781 gaacgggcag gagacaacac taactaggga gctagttgac gggaaactca tcctggtaag 6841 atgggcagtg acgggcctta aagcaacaag taccacacct tctccagccc gactaggaca 6901 agagaggcag gggtggggtc ctggctgtgg atttacacag gtcttggttc aagcatcagt 6961 ctaaaggcta tctgacaaca cataaccttc aagggccact gaaatggggg ctgctgggag 7021 gccagtgtat tcagagtcca aagcactggc caaatgggaa gaacgcagta ggcacccaca 7081 aacacttcct cctgcttgct gttccctggg aagaggcaga aagcaggacc agtagatccc 7141 aggctggaga ggagcagctg ccatcctctc ccctcctgag tagaaatgca ggagtctgca 7201 ggaccaagtg tgtgcctggt gggcactgcc agggagcagc cccctcctta ctcacaattt 7261 tgtctactgc tcctggcttc ctggaaatat ctattgtcta gctaggaggt gatacaacca 7321 ggcctgcaga tctgctttgg gacaagtttg aaagcatccc tggaatagtc cctacatcct 7381 cgtaggactg tacctgggcc aggagcatgg accacaatgt cagcatcaca gacctgggtc 7441 gctcagtgga acccactgat tgcatattca catgagactg gggttcccac tgagttagaa 7501 gtgaagggcc tagctctgga agggaggtaa caaagaggat catgttcact aggtagctag 7561 agtaaggtga ggccatggct gggctctaca gtgccagaat actgcatgtt aggggaggga 7621 ggaaaagttg gcagcttagc agtttctcaa ggctctgttc ttatcacccg tttgtctgcc 7681 acatacacca ggcaccccct taggcaggtg ctgaaatgaa cacaaaggag gaacaggaat 7741 gtactggatc ttgaccagtt tacctcctct ctcaagggcc tatttttccc ccaaatctct 7801 aaaatgctaa ttataacatc ttaaaagatt tgtatcagaa aaaaaagtaa agtgcctggc 7861 acacagtagg tgctcaagtg ctggtcagga tgagggtggg gagcactccc tcctctgctc 7921 tgccccatct gaaacctgtc tttctttcta gactctcact catggcagtg tggtgagcac 7981 tcggacttat gagaaggagg cgtgacctgg ctgctccgtc actgaccgcc cgctcctctg 8041 ccaactggcc acccctcagc tcagcaccat gctgcctcat ggttttcccc tctgacattt 8101 tgtataaaca ttcttgggtt gggatttttc tggagatacg gggcatcagc ctggacccag 8161 ttcctactat gtatgtggtt tattttttaa aactgtatcc aaagggtgct ccaaggtcaa 8221 taaagcagaa ccaaggccac ccagttgtct gtctttggtc ctcctttcct gtgtgtcagg 8281 ttgaaatgaa ggcctatagg tcacctggga agcagcactg tcaaggagcc gagtggacag 8341 gctcaaggct cagttaggga acagtagcac ctatgtaata cccttacact gacctgccaa 8401 ggctcagaga agctagctgt cattctagca tctatgcaag cccttacact ggcctgccca 8461 tggcagagca gctggctgtc actgtgtggc tatttcacat tcatcctgca cagacattcc 8521 tggatttgct gtatggtgtg ctgtggtcac cctctctcta gagtacaggc tcaggacatc 8581 aaggtccagg tgtgaacaac tgtggtggga ggtgactgct aagagtcgcc cactcatgcc 8641 cagcaagtcc ccagggttac aaatacaagg gaaagcggtc atcactatgg aagagaaggt 8701 ttatgagtaa tctcaagaaa gctgtgacct gctggtggcc ttggctttgt gcctcggctt 8761 ggcct // LOCUS MUSMK3A 1994 bp DNA ROD 15-NOV-1992 DEFINITION Mouse intronless potassium channel gene MK3. ACCESSION M30441 NID g199712 KEYWORDS potassium channel protein. SOURCE Mus musculus (strain AKR) DNA. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 1994) AUTHORS Chandy,K.G., Williams,C.B., Spencer,R.H., Aguilar,B.A., Ghanshani,S., Tempel,B.L. and Gutman,G.A. JOURNAL Unpublished (1990) REFERENCE 2 (sites) AUTHORS Chandy,K.G., Williams,C.B., Spencer,R.H., Aguilar,B.A., Ghanshani,S., Tempel,B.L. and Gutman,G.A. TITLE A family of three mouse potassium channel genes with intronless coding regions JOURNAL Science 247, 943-975 (1990) COMMENT [2] sites; for [1]. Authorin Submission [1] kindly submitted by Gutman,G.A., 05-DEC-1989 MK1, MK2 and MK3 represent three members of a family of mouse genes encoding potassium channel proteins, related to the Drosophila shaker locus. Each mouse protein is encoded by a single, uninterrupted exon, although one (or more) introns may be present in the 5' untranslated region (MK1, MK2). MK3 closely resembles the published rat cDNA sequence RCK3 (Stuhmer et al., EMBO J. 8:3235, 1989). FEATURES Location/Qualifiers source 1..1994 /organism="Mus musculus" /strain="AKR" /db_xref="taxon:10090" /cell_line="L47.1" CDS 150..1736 /codon_start=1 /product="potassium channel protein" /db_xref="PID:g199713" /translation="MTVVPGDHLLEPEAAGGGGGDPPQGGCGSGGGGGGCDRYEPLPP ALPAAGEQDCCGERVVINISGLRFETQLKTLCQFPETLLGDPKRRMRYFDPLRNEYFF DRNRPSFDAILYYYQSGGRIRRPVNVPIDIFSEEIRFYQLGEEAMEKFREDEGFLREE ERPLPRRDFQRQVWLLFEYPESSGPARGIAIVSVLVILISIVIFCLETLPEFRDEKDY PASPSQDVFEAANNSTSGAPSGASSFSDPFFVVETLCIIWFSFELLVRFFACPSKATF SRNIMNLIDIVAIIPYFITLGTELAERQGNGQQAMSLAILRVIRLVRVFRIFKLSRHS KGLQILGQTLKASMRELGLLIFFLFIGVILFSSAAYFAEADDPSSGFNSIPDAFWWAV VTMTTVGYGDMHPVTIGGKIVGSLCAIAGVLTIALPVPVIVSNFNYFYHRETEGEEQA QYMHVGSCQHLSSSAEELRKARSNSTLSKSEYMVIEEGGMNQSAFPQTPFKTGNSTAT CTTNNNPNSCVNIKKIFTDV" BASE COUNT 394 a 612 c 534 g 454 t ORIGIN 1 agccgccgct agggaaggaa agcaccgccg cctcccgcgc tcgaccgccg cagccctcca 61 cccatcaccg cgcccaccct gcaccggacc ccgcaggagg cggcgcgcgc atcctgcaga 121 gccccggcca cgccgagctg ccgccagaca tgaccgtggt gcccggggac cacctgctgg 181 agccagaggc ggcgggaggc ggtggcgggg acccgcctca gggaggctgt ggcagtggcg 241 gcggcggtgg cggctgcgac cgctacgagc cactgccacc cgcgctgccc gccgcgggcg 301 agcaagattg ctgcggcgag cgtgtggtca tcaacatctc cgggctgcgc ttcgagacgc 361 agctcaagac cctctgccag ttccccgaga cactgctggg cgaccccaag cggcgcatgc 421 ggtactttga cccactccgc aatgagtact tcttcgaccg caaccgaccc agcttcgacg 481 ccatcctcta ctactaccag tccgggggcc gcattcgccg gccggtcaac gtgcccatcg 541 acatcttctc cgaggagatc cgcttttacc agctgggtga ggaggccatg gaaaagttcc 601 gtgaggatga gggcttcctg cgggaggagg agcgacccct gccccgccgt gacttccagc 661 gccaggtgtg gctgctcttc gaatatccgg agagctccgg gccggcccgg ggcattgcca 721 ttgtgtcagt gctggtcatt ctcatctcca ttgtcatctt ctgcttggag acgcttcccg 781 agtttcgcga tgagaaagac tatcccgcct ccccgtcgca ggacgtgttt gaggctgcca 841 acaacagcac gtcgggggcc ccttctggag cctccagctt ctcggacccc ttcttcgtgg 901 tggagacctt gtgcatcatc tggttctcct ttgagcttct ggtgcggttc tttgcttgcc 961 ccagtaaagc caccttctcc agaaatatca tgaacttgat agacattgtg gccatcattc 1021 cttattttat cactctgggc actgagctgg ctgaacgaca aggtaatggg cagcaggcca 1081 tgtcgctggc catcctaaga gtcatccgcc tagtaagggt tttccgcatc ttcaagctct 1141 cccgccattc taaggggctg cagatcctag gacagacgct gaaggcttcc atgcgggagc 1201 tggggctgct catattcttc ctcttcattg gggtcatcct tttctccagt gcagcttact 1261 ttgctgaggc agacgaccct tcttcgggtt ttaacagtat cccggatgcc ttctggtggg 1321 cagtagtaac catgacaact gttggttatg gtgatatgca cccagtgacc ataggaggca 1381 agattgtggg ctctctttgt gccatcgcag gtgtcttgac cattgcattg ccagttcctg 1441 tgattgtttc caacttcaac tacttctacc accgggagac agaaggggaa gagcaagccc 1501 agtacatgca cgtgggcagt tgccagcacc tctcctcttc agccgaggag ctccgaaaag 1561 cccggagtaa ctccactctg agtaagtcgg agtatatggt gatcgaagag gggggtatga 1621 accagagcgc cttcccgcag acccccttca aaacgggcaa ctccacagcc acttgcacca 1681 cgaacaataa ccccaactcc tgtgtcaaca tcaagaagat attcactgat gtctaatata 1741 tgatacggtt gccaattctg tgcccagtat tgtgtggaac atgccccctt ggtctgtgta 1801 tgcccttgat ttatacattt ccagaccact catcaaggaa agtacaagaa gtgaggaagc 1861 acacttcatt ctccctattg cttcatactg aaacaggtgc ctgtttttgc aagtgggctg 1921 cattctctca gctctttttt tctctctctc cctgtctctt aattttgtga ccaacaaact 1981 tacattaagc gtgg // LOCUS MMENO3G 5472 bp DNA ROD 09-OCT-1991 DEFINITION M.musculus gene for beta-enolase. ACCESSION X61600 NID g50848 KEYWORDS 2-phospho-D-glycerate hydrolase; beta-enolase gene; enolase beta subunit; glycolytic enzyme. SOURCE house mouse. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 5472) AUTHORS Lamande,N. TITLE Direct Submission JOURNAL Submitted (26-SEP-1991) N. Lamande, College de France, Lab de Biochimie Cellulaire, 11 Place M. Berthelot, 75231 Paris cedex 05, FRANCE REFERENCE 2 (bases 1 to 5472) AUTHORS Lamande,N., Brosset,S., Keller,A., Lucas,M. and Lazar,M. JOURNAL Unpublished FEATURES Location/Qualifiers source 1..5472 /organism="Mus musculus" /sub_strain="BALB/c" /db_xref="taxon:10090" /dev_stage="adult" /tissue_type="liver" /clone_lib="genomic DNA in EMBL-3" mRNA join(1..64,667..753,960..1055,1184..1242,1390..1459, 1820..1953,3453..3675,3790..3987,4182..4383,4756..4864, 5009..5067,5158..5306) /gene="M ENO 3" gene 1..5306 /gene="M ENO 3" exon 1..64 /gene="M ENO 3" /number=1 intron 65..666 /gene="M ENO 3" /number=1 exon 667..753 /gene="M ENO 3" /number=2 CDS join(669..753,960..1055,1184..1242,1390..1459,1820..1953, 3453..3675,3790..3987,4182..4383,4756..4864,5009..5067, 5158..5227) /gene="M ENO 3" /EC_number="4.2.1.11" /note="beta-enolase subunit" /codon_start=1 /product="enolase" /db_xref="PID:g50849" /db_xref="SWISS-PROT:P21550" /translation="MAMQKIFAREILDSRGNPTVEVDLHTAKGRFRAAVPSGASTGIY EALELRDGDKARYLGKGVLKAVEHINKTLGPALLEKKLSVVDQEKVDKFMIELDGTEN KSKFGANAILGVSLAVCKAGAAEKGVPLYRHIADLAGNPDLVLPVPAFNVINGGSHAG NKLAMQEFMILPVGASSFKEAMRIGAEVYHHLKGVIKAKYGKDATNVGDEGGFAPNIL ENNEALELLKTAIQAAGYPDKVVIGMDVAASEFYRNGKYDLDFKSPDDPARHISGEKL GELYKNFIQNYPVVSIEDPFDQDDWATWTSFLSGVDIQIVGDDLTVTNPKRIAQAVEK KACNCLLLKVNQIGSVTESIQACKLAQSNGWGVMVSHRSGETEDTFIADLVVGLCTGQ IKTGAPCRSERLAKYNQLMRIEEALGDKAVFAGRKFRNPKAK" intron 754..959 /gene="M ENO 3" /number=2 exon 960..1055 /gene="M ENO 3" /number=3 intron 1056..1183 /gene="M ENO 3" /number=3 exon 1184..1242 /gene="M ENO 3" /number=4 intron 1243..1389 /gene="M ENO 3" /number=4 exon 1390..1459 /gene="M ENO 3" /number=5 intron 1460..1819 /gene="M ENO 3" /number=5 exon 1820..1953 /gene="M ENO 3" /number=6 intron 1954..3452 /gene="M ENO 3" /number=6 exon 3453..3675 /gene="M ENO 3" /number=7 intron 3676..3789 /gene="M ENO 3" /number=7 exon 3790..3987 /gene="M ENO 3" /number=8 intron 3988..4181 /gene="M ENO 3" /number=8 exon 4182..4383 /gene="M ENO 3" /number=9 intron 4384..4755 /gene="M ENO 3" /number=9 exon 4756..4864 /gene="M ENO 3" /number=10 intron 4865..5008 /gene="M ENO 3" /number=10 exon 5009..5067 /gene="M ENO 3" /number=11 intron 5068..5157 /gene="M ENO 3" /number=11 exon 5158..5306 /gene="M ENO 3" /number=12 BASE COUNT 1287 a 1457 c 1399 g 1329 t ORIGIN 1 gacactgtcc cagctgctac ctagaggaga cactccactc aaagttccaa ggaaggcttt 61 ccaggtttgg taaaaggtta catggtgtgc aggagtgggg gtggtgctgg cttggaaaaa 121 gtgaggagaa tttgggcttt tttttcttcc taggactgag ggtacttgag agaggtctgg 181 ttggaggaag cagaattaaa atactgaggg ttactgacag atacaggaaa gaaggctctg 241 aagaatggga aagtagataa aatacagttg agccaggcag agagcctggg gtctaggagg 301 acttgggaat gggacaatgg gaaccagata ggtttgaggg ggaggaagat gtagcaagca 361 gagtacttaa tcataaagtg caagaaggtg taagctccgt gaggtgttca gggagtggtg 421 tacttgggag ttataaagtt atgaaccctc gattctcttg attggaaggg tggaacaaga 481 gctgttctga gtggggggaa gggaccagtc tgccttcctc tttggtctgt gaccttttta 541 tggggtattt ttagctccag cacctgcctt cttcgggtgg agaagactct taaaagggca 601 agggatttct agttccttaa gggatcaact gtccactctt gctcactcac atctcctgtg 661 gtgcagccat ggccatgcaa aaaatcttcg cccgggaaat cctggactcc aggggcaacc 721 ccacggtgga ggtggacctg cacacagcca agggtaacac aggcttgttt tgcctcagaa 781 aaccctcaac ctcgtcctgt ccttcatcag agattccctc ccctcccccc ttctcttcct 841 ggctccaagg ctgggggaag caggggtttc tctgtattta ctcctattct tgggtccagg 901 ggaggtgacc ccaggccttt gtgggtggac tccttctgat ctgcagttcc tccccccagg 961 tcgattccga gcagctgtgc ccagtggagc ttccacgggt atctatgaag cactggaact 1021 ccgagatgga gacaaagcac gatacctggg gaaaggtgag ccaagactaa cccagcacgg 1081 aaggagcccg tgtgggctga ctcaggacag ggatagggga ctggaactca aagccaaggg 1141 cctaacccgc ttagaaatct gaaccccatt ctctggcctc caggagtgct gaaggctgtg 1201 gaacacatca acaagactct aggtcctgct ctgctggaaa aggcaagtgg aggagccagc 1261 tcccccttcc ctcctccctg tgaccccgtg ccttccctca ggtagcccca ctctgagcat 1321 tctcacttat ccttcatcct gcacattacc tgactgagtc ctgagaaatc ccacctctgc 1381 ctcttccaga aactaagtgt tgtggatcaa gaaaaagttg acaagttcat gattgagctg 1441 gacgggaccg agaataagtg tgagtgaggt gccagaggtg ggaagggtgt ggggtacagc 1501 gagggtggaa gagaaaccac tgacagctgc ccgggctctt tctgtcccac atcttggggc 1561 acaccgtaac tgggtagcct tttattcatc ccacaggcat ttcctgaggc ctggcagaag 1621 acacatgtga aactcttgga ccagggatga gggggtgaac atcccagaga gcacaggcca 1681 ggaaagagag atgagtacaa ctcttaatgt tgagtggtct aggaatcttc ttccgggggt 1741 gaccagagga aatatccaag gacttaatcc ccaaacctgt aagagtcctc atgctttctc 1801 cccgcttgcc tccctctagc caagtttggg gccaacgcca tcctgggtgt gtccctggct 1861 gtctgcaagg ctggagcagc tgagaaaggg gtccctctct accgacacat cgcagatctt 1921 gcaggcaatc ccgacctcgt actccctgtg cctgtgaggt gctggctgtg ctgaagctct 1981 aggacagctg cccctcccta gctaggaatg tttcagcagc ccccaccccc acccccagaa 2041 aatctacctt ttcagatcct gctttccaaa gaagcatttc ctgaaatgaa atctttccta 2101 ctgtgcccca gtccctcctt cgttccacca tctaaagcta agcctctgag gcttccgtcc 2161 ctacgaccag gtcccttcca ggctcaggcc attcctgtcc agagctcatt ccgtctcctg 2221 tctcactttc gcagtcagtc agatttgaaa gcgtcttcag ggcaaagctc tcagtacatc 2281 tttgtattct ctgagactac ttgaaacatt gccctcttaa tagtagcccc ttgaattttg 2341 cagaaagagc cccaagtgtg caaagtcctg ggtgaatgtg tgctgtgcca tgagccagcc 2401 ttgtcaaagg ccccagggct cagacccttg gaggaatggc aaagcttccc cctaggctaa 2461 ggttgagaga cttgatgaag ccttcttcct ttcctcggtg accaagttgc acgtgttgag 2521 agtgtgtcca ccatagctcc ttcccctttg gtgtttacag tctgaagact gctcctctac 2581 aaagactgct cttaccagga tcccctaaac ctgttcttgt ggttcagctg tgtgccataa 2641 accctgcctc cttgtttatc tgtatgcagt aacccacttc tctgttcagc tctatataat 2701 aaacacgctg ggctctgtgt agtaaacacg ttgagctgaa gtgctgaggt tttcccatca 2761 gagagcccac tctgtccagt ccccagcttt tctgtctgtg tgtccatcct ctgtcttttc 2821 ttcattccct ttgctgtccc cgtcacttcc attcttgcag ctctgcgtgg gccggaatat 2881 gttaccatac tggagatacc tggctgggtt cagaccctgg cctttcccat tttctagttg 2941 tgtgatttta ggcaagttat ttaacctctg tccctcactt ttttcatctg taaagaagaa 3001 ataagagagc agagttaagc cgggcagtgc tggcgcacgc tttaatccca gcactcagga 3061 ggcagaggca ggcggatttc tgagttcgag gccagccagg gatacacaga gaaaccctgt 3121 ctcgaaaaaa caaaaacaaa caacaaacaa acaaacgcaa caaaaaagag tgtacagtta 3181 atagaagtat tttgagagtg aaacaagtca tgcatttaga atgccataga attattcctt 3241 gtcataagta tatttgattt cccttaactg tcagagcagt agggaagtaa gcactgttag 3301 cattttgtag aaagagagaa aggctcttag gagtgcagtg ctgcatccta gaccattgaa 3361 ctagggagca gagcgcagct gtcacctcca gttaccctct cacctttgac ccctatgcca 3421 cccagctctg agcaccaact tctgtgtccc aggcctttaa tgtgatcaac ggcggctctc 3481 atgctggaaa caagctggcc atgcaggagt tcatgattct gccagtggga gccagctctt 3541 tcaaggaagc catgcgcatc ggcgctgagg tctaccacca cctcaagggg gtcatcaagg 3601 ccaagtatgg gaaggacgcc accaacgtgg gggatgaggg tggctttgca cccaacatcc 3661 tggagaacaa tgagggtcag tgctgaacat cctggggcag agtgcctgag tgcccctgag 3721 gtgggggtaa agagaggctg cagatacagc catactggag tctcaggtca tctcttgctc 3781 ctctcccagc cctggagctg ctaaagacag ccatccaggc agccggttac ccggacaagg 3841 tggtgatcgg catggatgta gctgcgtctg aattctaccg caacggcaag tatgatctgg 3901 acttcaagtc acccgatgac cctgccaggc acatcagtgg ggagaagctt ggggagctgt 3961 acaagaactt catccagaac tatcccggtg agacggccgg gtgctccccc cgcccttgtc 4021 ttctgcggtt acctagcatg gctctcagca tttggctttg tcacaattcc tgtccttccc 4081 atctttagct catccgtagc ctccatttaa cctctgctac ctgtcacccc tctatcaggc 4141 ccctgtctct aaccccatct gtgctcccaa ctccctacca gtggtctcca ttgaggaccc 4201 ctttgaccag gatgactggg ccacatggac ctcattcctc tctggggtgg acatccagat 4261 tgtgggagat gacctcacgg taaccaaccc caagaggatt gctcaggctg tggagaagaa 4321 ggcctgcaat tgcctgctcc tgaaggtcaa ccagatcggc tccgtgacgg agtccatcca 4381 ggcgtgagtg cctcctgact caggcccccg ggcccccccc ccccccggct ccagccactg 4441 ctgctgctct tccatgggtg cctttcccag ggcccctgac ctactcaagc ctcagaaccc 4501 ctaggctatg actgaccccc aacttggctc taagactttt gccattgtgc tctgccatgt 4561 cccttaacct cccttctgtt ttgtgttctt ctgcttgcca tcttctgacc actctaatcc 4621 aagatcagaa atctctcctc cttcccaaag aacttagtca ctgcccaaac aaagggaaag 4681 ttctatcttg cctgcccctg actttcttat tgaggtggtc acataaggtc tgcatcaaat 4741 gcctttccca cacagctgta aacttgcaca atctaatggc tggggagtga tggtgagcca 4801 ccgctctggg gagaccgaag acactttcat cgctgacctt gtggtgggac tctgcacagg 4861 acaggtactc ggggttctct ctactgagta tctaaccaag tttccttggt caccttggta 4921 tcacttgggc catctaagtt ggataccact acagccatgg ctcttttggg gcccaaacca 4981 acccctccaa acccttttcc cacctcagat caagactggt gctccctgcc gttcagagcg 5041 tctggcaaaa tacaaccagc ttatgaggta caatttgtgc agaggcctag accgtgggga 5101 gaggaaggcg ctcctgggtt agaaagtagg agctctgatt ttctttttgt attgtaggat 5161 tgaggaggct cttggggaca aagctgtctt tgctggaaga aagttccgta atccaaaggc 5221 caaatgagga gctggagact ccaggctttc acaggaaaga cacaggcctt caagcccttc 5281 tcccagaaat aaacactgcc aaaccagacc atgctgtgtt ccttaggagg gaatgaatga 5341 caagctcgcc aggaaggaca gggctggggt ggggtggggg tgggaggacc agggaagaag 5401 gcagagaata tcatgtgacc tgctgtgaca aggaagtaag ggatttgaaa gctgagtgga 5461 atgtagaatg cg // LOCUS MUSPPIA 576 bp DNA ROD 13-AUG-1992 DEFINITION Mouse 19 kDa protein gene, complete cds, clone D4S234E. ACCESSION M98530 NID g200461 KEYWORDS 19 kDa protein. SOURCE Mus musculus DNA. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 576) AUTHORS Carlock,L., Vo,T., Wisniewski,D. and Lorincz,M. TITLE The Identification of a neuron-specific gene that maps adjacent to the Huntington's Disease marker D4S10 that shows homology to protein phosphatase inhibitors JOURNAL Unpublished (1992) FEATURES Location/Qualifiers source 1..576 /organism="Mus musculus" /db_xref="taxon:10090" gene 1..558 /gene="19 kDa protein" CDS 1..558 /gene="19 kDa protein" /codon_start=1 /function="unknown" /product="19kDa protein" /db_xref="PID:g200462" /translation="MVKLGNNFAEKGTKQPLLEDGFDTIPLMTPLDVNQLQFPPPDKV VVKTKTEYEPDRKKGKARPPKIAEFTVSITEGVTERFKVSVLVLFALAFLTCVVFLVV YKVYKYDRAYPDGFVLKNTQCIPEGLESYYTEQDSSDREKFYTVINHYKLAKQSITRS VSPWMSVLSEEKLSEQETEAAEKSA" BASE COUNT 150 a 153 c 152 g 121 t ORIGIN 1 atggtgaagt tggggaataa tttcgcagag aagggcacca agcagccact gctggaggat 61 ggcttcgaca ccattccttt gatgacgccc ctcgatgtca accagctgca gttcccaccc 121 ccagataagg tcgtggtgaa aactaagact gaatatgaac ctgatcgcaa aaaaggaaaa 181 gcacgtcctc ccaagatagc cgagttcacc gtcagcatca ccgagggtgt caccgagagg 241 tttaaggtct ccgtgctggt cctctttgcc ctggccttcc tcacctgtgt cgtcttcctg 301 gttgtctaca aagtgtacaa gtatgaccgc gcctaccctg atgggtttgt cttgaagaac 361 acccagtgca tcccagaagg cttggagagc tactacacgg agcaagactc cagtgaccgg 421 gagaaatttt acactgtcat aaaccactac aagctggcca agcagagcat cacgcgctcc 481 gtgtcgccat ggatgtcagt tctgtcagaa gagaagctgt cggaacagga gaccgaagct 541 gcagagaagt cagcttagca agcggggcag gttcct // LOCUS MMU60528 5416 bp DNA ROD 08-AUG-1996 DEFINITION Mus musculus guanylin precursor gene, promoter region and complete cds. ACCESSION U60528 U09741 NID g1480667 KEYWORDS . SOURCE house mouse. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 5416) AUTHORS Sciaky,D., Kosiba,J.L. and Cohen,M.B. TITLE Genomic sequence of the murine guanylin gene JOURNAL Genomics 24 (3), 583-587 (1994) MEDLINE 95229161 REFERENCE 2 (bases 1 to 5416) AUTHORS Hill,O., Kuhn,M., Zucht,H.D., Cetin,Y., Kulaksiz,H., Adermann,K., Klock,G., Rechkemmer,G., Forssmann,W.G. and Magert,H.J. TITLE Analysis of the human guanylin gene and the processing and cellular localization of the peptide JOURNAL Proc. Natl. Acad. Sci. U.S.A. 92 (6), 2046-2050 (1995) MEDLINE 95199289 REFERENCE 3 (bases 1 to 5416) AUTHORS Sciaky,D., Jenkins,N.A., Gilbert,D.J., Copeland,N.G., Sonoda,G., Testa,J.R. and Cohen,M.B. TITLE Mapping of guanylin to murine chromosome 4 and human chromosome 1p34-p35 JOURNAL Genomics 26 (2), 427-429 (1995) MEDLINE 95324946 REFERENCE 4 (bases 1 to 5416) AUTHORS Cohen,M.B., Sciaky,D., Hochman,J.A., Hawkins,J.A. and Witte,D.P. TITLE Guanylin mRNA expression in human intestine and in human intestinal cell lines JOURNAL Unpublished REFERENCE 5 (bases 1 to 5416) AUTHORS Sciaky,D. TITLE Direct Submission JOURNAL Submitted (13-MAY-1994) Daniela Sciaky, Div. of Ped. Gastroent. and Nutrition, Children's Hospital Medical Center, 3333 Burnet Avenue, Cincinnati, OH 45229-3039, USA REFERENCE 6 (bases 1 to 5416) AUTHORS Cohen,M.B., Sciaky,D., Hochman,J.A., Hawkins,J.A. and Witte,D.P. TITLE Direct Submission JOURNAL Submitted (10-JUN-1996) Div. of Ped. Gastroenterology and Nutrition, Children's Hospital Medical Center, 3333 Burnet Avenue, Cincinnati, OH 45229, USA COMMENT On Aug 8, 1996 this sequence version replaced gi:665921. FEATURES Location/Qualifiers source 1..5416 /organism="Mus musculus" /strain="129/Sv" /db_xref="taxon:10090" /chromosome="4" /clone="lambda/mgg10" promoter 1..2697 protein_bind 2644..2658 /bound_moiety="HNF-1" 5'UTR 2698..2733 exon 2698..2808 /number=1 CDS join(2734..2808,3583..3793,4192..4256) /codon_start=1 /product="guanylin precursor" /db_xref="PID:g1480668" /translation="MNACVLSVLCLLGALAVLVEGVTVQDGDLSFPLESVKKLKGLRE VQEPRLVSHKKFAPRLLQPVAPQLCSSHSALPEALRPVCEKPNAEEILQRLEAIAQDP NTCEICAYAACTGC" intron 2809..3582 exon 3583..3793 /number=2 intron 3794..4191 exon 4192..4456 /number=3 mat_peptide 4209..4253 /product="guanylin" BASE COUNT 1281 a 1541 c 1307 g 1287 t ORIGIN 1 gaattcagat gaagctgcca ctggggctgt ctagctctgg catcccctcc taccccagtg 61 tagtgctttg agatcctggt tgaatgagat actcctgggg gtggggggag gggctgacct 121 tgaatgccag ccaggtttgg gacagtagct ggttgtccct agatggtgct gtggctcaga 181 aggccatgga agtgccctgg cttgcagaag atgatgagat gaggtatgct ggaagctgag 241 gacaaggctc gggattcagg atgcaagctt gggcttccag aggtacagct aaggtccaat 301 ctctagcagg tttagtggca ggcagatgag gtcttatgtc tcctgtaaaa aggctgaagc 361 tggccccagg ttaggcaggc accatagtgg gaaagagaag agtaagttcc tacttctcag 421 agcagagttt gggggtgggg tgttggatag ggccagaccc aaaggaatag gacctaatag 481 caacttcctc tttcccatgg tgtggaactg cgtcatcctg aaggtcagat ggagcagctc 541 ctgggaaaaa agagggcagg agaggaaacc ctattccctt cccccagtcg ctgccacact 601 gagggaagac accttgagtc agaaaaaaag ggttccccta acccagtccc accctaaagc 661 caaatcaaat cccagcacag ccttaggtcc cacgtcatcc cagacctaac ccaaatcata 721 aacccagccc cgccccatcc atcaacttca ctgcaaacta ttcaattgta accctacaag 781 gaatccgaac ccagcccagc agcctccaca ctcgagctct atcctaacca gcatccactt 841 ccaaacccat cagactcctc ctaactcagc ccctgaccag gcagctgctg gtcctcatgt 901 tagcccagct cgcccctgtc accccatgca ccccgggaga gtcccctgct ctttctgcct 961 catatcagta ttcgaaacat cagtaagttt atctccaagc cagcacacgg gtactctgca 1021 actgtgaacc cctgctcatc agcacccttg ctaggccctg aaggtcacct ctcaacagac 1081 caatgtggag catgtggact gcacatgtat tgtctgctgg cacaccgaac agacacacac 1141 ccctccacac ccctccacac tgtttcctga agaacacatg aacctcctca caaatatgca 1201 taaacagaca acttcaggat atatgcacat acacacactt tagtgtgtgc acacatcccc 1261 agcacatagg cacatagact tcttagtgta cacatgaata taaccaagat atacacataa 1321 ctcagtgtgt catactctgg catacatagg cacacaacac ttcagtgcac acacaaacac 1381 ctttgcacac atgcctttca gaatgcctac atgtgccctt caatacacac ctcatgtact 1441 cacaaacatc tctgtacaca caaagacccc ttcagtgtac acacctcagc atacatgcaa 1501 tctgcatctc agtgcctgca ggcatttgca tatgaccccg tagtaggggt tggggaaata 1561 gacaacagaa gtccagccta gaacagcttg gtaagaacaa atgtttcttg accttgtccc 1621 agggttaggt actttctagt ttcttcttac aagggtcctc tccagcccct tctttagtac 1681 tgttctcaca tctagggatg ccgtccctca ctgggaatgc aagcagtgtg gtgaatcggc 1741 aggaaggaga ggatccgtca agactgcttt ctctaaggtc ctggggcttg gacactctag 1801 gtcctgcagt gatatatgac ttgccccaat cccagcagag gttagattag agggacccgg 1861 atccacacag acccatctca gccactcggg gcagggcagg gcaggacaga gggggccata 1921 ctgcctaagg accgactgct ctgactgggc taggcaggta gccatcagtg gagaccagga 1981 gccgctgctg ctgttcaggc acagcaactt cttcatcgag ggcttcctgt gtgccaggaa 2041 ctgcatggaa cacttaggtg ctcatagtct ccttagcacg aaggcatctg caaccaggga 2101 ggtattcaca ttggtctgca cagagaacct gatagggcaa cgatgtggag tggaagtcac 2161 atcgaaaaca ggccgctggt ggggtttgag tgcaggcctg tgaatagcaa agttgaatcc 2221 cagagaccaa ggctggactc atctggcttt tgtgttcccg ttctcccccc tctctgggct 2281 cacttactct tcccttcccc ctgaacattc caggctcagc tcatttcttg ctcctcctaa 2341 cagggtcttc tttgaatggg tgggaggtaa ctcttctgtg ctccccaccc ccatgctaat 2401 ggccctgtct ctgttgccac tgtagtcaca gtcaccttgc tgggcctctt tcttcaaggg 2461 caaggctgga tctgccatcc tggtcgggag taaatccagg acccagccca gctgtgagtc 2521 acagtagtgc ccagggagag gccagggctc tgccaggctc tgttgatctc ttatctccca 2581 ggctccactt ggccttatct gatggggctc aacaggcagc agatgaggct gacaaggccc 2641 cagggttact gagtaacccc aacctcttta aaagccccac tgtttacccc aggcactagt 2701 actggcctgt tctctgcatt gcatactgct accatgaatg cctgtgtgct ctctgtgctg 2761 tgcctcctgg gtgccttggc tgtcctggta gaaggggtca ctgtgcaggt atgaccccat 2821 ccctacttag tctagcccca tccttggaat cctgaaacga gggatgggca tcttctggga 2881 cctgtcgtcc cccccccccc ccccgcctcc cgagacctgc cagccagcca tattttccct 2941 tcaaacatgg aagaggaaaa cgttcaagga gacagcaaga aataccagct gggaaagact 3001 atagcaataa gtgcctgcag tctggctgct ccagtgtccc attgtgactc ggcaggcccg 3061 ggatgaaagg gttggtggca tttcacagaa cagaatcatg agcttcacta aagtccccaa 3121 acagccaaag ctgatggctg aaaagggagg tggctgggct caaatctagg catccattgc 3181 acccgatggg aacctagatg gtcaggcata cctactggtg acaccagaca caagctgggt 3241 tgtcagcaaa cagggatcct attctctaaa ggacgcccct gggttccttg tattgcctct 3301 gataggctgg aggtggaagc cctcgggcag gaaaagatag gctgggctag catgaccacg 3361 gataagggtg ccagttcacg ctcatatggc acccacagca tccctggctc tacagaagcg 3421 gctccattga tcacgtggta gatagtatgc agtacttacg gcctctagca aacccagggt 3481 cacagaggtc tgacctcacg ttgttcctta gattgtctca agcaagccac ccaggctagg 3541 gctgcctaca gcctggcctc atgctctggc tgcctctttc aggatggaga cctctccttt 3601 cctctggagt cagtgaaaaa acttaagggt ctccgcgagg tacaggagcc cagactggtg 3661 agtcacaaga agtttgctcc ccggcttctt cagcccgtgg caccacagct atgtagtagc 3721 cactctgcac ttccagaagc actgaggccg gtctgcgaga aacccaacgc cgaggagatc 3781 ctgcagaggc taggtgagtg cgctcttggc tagtgtggct aggacatcct ctgattggtt 3841 ggggcttgag gaggaggtgt gctttgattg gctagttctg agtggcactg gttgaagcct 3901 tttagaaagg ccttcgttca gaagcagggg ttaaaaggag tcctgcatcc catcttcggg 3961 tcatgtctaa caactccgtt tctgttctgg taaaaagctg cactctgatt ggttggcttc 4021 aataattcat acgccccatt tgaggcagaa agtctagagg gtcagaggag atcagcctgc 4081 tgtgattatc ctggtggctg gtgacagata tccaaactct agtggaagag aggcagggga 4141 tatgagaaaa tggtggccgt ccctcaccaa gacttcttct tccttgacca gaggccattg 4201 ctcaggaccc aaacacatgt gagatctgtg cctacgctgc ctgtacggga tgctagagtg 4261 acatcgcttg cctttctctc agcccatgtg gaagcccctc tatcttctag aagtcacgag 4321 gctaacaaac tcataccctc agcatctatt gccctgccac agcggggagg aggctgagga 4381 gacagctacc tggagaagcc ttgcctgagg gctatgttaa ccctcagcca ataaaccaaa 4441 cttcagaact ccgcctggtg ccttcgtttc ttccctagct ctccctcagt gcagtgaaca 4501 cccacctgcc ctcagaacca tgggcttcag ggtccaaagt agaaggttcc acttccttgc 4561 caagttcccc caaccttgag actaccatca cacacaggta cagggaccta ccagagaggg 4621 cagggttttg gcttttctca cctgtctcct ggggtcacta acttgccgct aagccacatg 4681 tgaagacagg attttttttt tttttttttt tttttttttt tggtttctcg agacagggtt 4741 tctctgtata gtcctggctg tcctggaact cactttgtag accaggctgc ctctgtctcc 4801 caagtgctgg gattaaagac gtgcgccacc atgcccagcg aggacaggaa tttgtctgag 4861 ctggtttggg tgtcccgtgc tcacaggcct ttgggccacc aaaggatctg gcttcctcaa 4921 accagcggtg ctcttctgcc ctttctgtgc tcccattcaa agcaagcatt cccatcctgt 4981 cctctccacc tttctgtgtg cctagttagg ttaccaaccc aagctccatg gctactttcc 5041 ccatggcctt tcctgatctt ccctgagaag actgtgtcct gtggtcaccc gtatctctag 5101 aacaactcta ggcaccttat tgataccttc tgtggaggag gaagttgttc caacctggct 5161 caggcttaag tctacacccc ctccccactg cccaaccttc cccccccccc cacttttgcc 5221 tgctcaaaga acaacagtgt gagaggaaaa gccaggattc caggtcccag cccctgatct 5281 tccacatcct gtgtgatgct aggaaaacca catctctgtg ccttaatttc cttctctgta 5341 acaaaactag ctctatacca tgagtgatac attttaaatc ccccagttca gacctaatat 5401 aaacctagag gaattc // LOCUS MMATPB2 7179 bp DNA ROD 20-MAY-1992 DEFINITION Mouse Na/K-ATPase beta 2 subunit gene. ACCESSION X56007 NID g50051 KEYWORDS Na/K-ATPase beta 2 subunit. SOURCE house mouse. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 7179) AUTHORS Magyar,J.P. TITLE Direct Submission JOURNAL Submitted (08-OCT-1990) Magyar J.P., Swiss Federal Institute of Technology, Dept of Neurobiology, ETH Hoenggerberg HPM E26, CH- 8093 Zurich, Switzerland REMARK revised by [3] REFERENCE 2 (bases 1 to 7179) AUTHORS Magyar,J.P. and Schachner,M. TITLE Genomic structure of the adhesion molecule on glia (AMOG, Na/K-ATPase beta 2 subunit) JOURNAL Nucleic Acids Res. 18 (22), 6695-6696 (1990) MEDLINE 91067470 REFERENCE 3 (bases 1 to 7179) AUTHORS Magyar,J.P. TITLE Direct Submission JOURNAL Submitted (20-MAY-1992) to the EMBL/GenBank/DDBJ databases FEATURES Location/Qualifiers source 1..7179 /organism="Mus musculus" /db_xref="taxon:10090" CAAT_signal 452..455 /note="hypoth. CAAT-box for ORF" TATA_signal 641..646 /note="hypoth. TATAA box for ORF" misc_feature 677 /note="ORF transcription initiation site" mat_peptide 962..1360 /note="open reading frame" CAAT_signal 1585..1588 GC_signal 1680..1685 GC_signal 1771..1776 TATA_signal 1791..1802 gene 1831..6561 /gene="ATPb2" exon 1831..2526 /gene="ATPb2" /number=1 mRNA join(1831..2526,4264..4392,4707..4811,4913..5118, 5329..5385,6186..6284) /gene="ATPb2" misc_feature 1831 /gene="ATPb2" /note="ATPb2 transcription initiation site" CDS join(2415..2526,4264..4392,4707..4811,4913..5118, 5329..5385,6186..6284,6397..6561) /gene="ATPb2" /codon_start=1 /product="Na /K-ATPase beta 2 subunit" /db_xref="PID:g50053" /db_xref="SWISS-PROT:P14231" /translation="MVIQKEKKSCGQVVEEWKEFVWNPRTHQFMGRTGTSWAFILLFY LVFYGFLTAMFSLTMWVMLQTVSDHTPKYQDRLATPGLMIRPKTENLDVIVNISDTES WGQHVQKLNKFLEPYNDSIQAQKNDVCRPGRYYEQPDNGVLNYPKRACQFNRTQLGDC SGIGDPTHYGYSTGQPCVFIKMNRVINFYAGANQSMNVTCVGKRDEDAENLGHFVMFP ANGSIDLMYFPYYGKKFHVNYTQPLVAVKFLNVTPNVEVNVECRINAANIATDDERDK FAGRVAFKLRINKT" mat_peptide join(2415..2526,4264..4392,4707..4811,4913..5118, 5329..5385,6186..6284,6397..6558) /gene="ATPb2" /product="Na /K-ATPase beta 2 subunit" intron 2527..4263 /gene="ATPb2" /number=1 exon 4264..4392 /gene="ATPb2" /number=2 intron 4393..4706 /gene="ATPb2" /number=2 exon 4707..4811 /gene="ATPb2" /number=3 intron 4812..4912 /gene="ATPb2" /number=3 exon 4913..5118 /gene="ATPb2" /number=4 intron 5119..5328 /gene="ATPb2" /number=4 exon 5329..5385 /gene="ATPb2" /number=5 intron 5386..6185 /gene="ATPb2" /number=5 exon 6186..6284 /gene="ATPb2" /number=6 intron 6285..6396 /gene="ATPb2" /number=6 misc_feature 6397 /gene="ATPb2" /note="exon 7 start site" BASE COUNT 1524 a 1965 c 1803 g 1887 t ORIGIN 1 agctttattt ttgtgacatg atttctctat tagcccaggc tggcttcaaa cattatcctc 61 acaccttcag tttgggctac tagcatggat taacatatct ggctcttctt tctgctgcct 121 cctcctcctc ctcctcctcc tcctcctcct cttcctcctc ctcctcttcc tccttcttct 181 tctgagttag ggtcttgtta tgctgtcctg aatgaccttg aactcctgac ctcaatcaat 241 ccttttaccc caggatccca ggtagctgag tccagatatg gccagcatgc ctggaagatc 301 cattaaactt aattatgtca tttcacaatg ctagaattac ttccgtgata cttggagtaa 361 aattggaatt acagtcagac atagtcatga acacctataa tttcagcaga ggaaggcaac 421 ccttggagct ggatatggtg gtacacacct gcaatcccag tattcaggag tggaaaccac 481 cttggctgct tcagaccagt gtcccagacc agcctcagat aaacaggatc ctgtctcaag 541 aaaagaaaaa taaaaaccaa ccaaccaacc aaccaaacaa acaaacaaaa cacattacta 601 gcatcaccga gagtaataaa atatcatagt ggcatatttt tataaaccca gaatgtgggg 661 ggcagaggca ggagcatctc ttcaaggcta gccttgtgta caggatccac atggtagaag 721 aaaaaaacca attcctctaa gtatggactg tggtgttcct gccacagaaa gaaataaaga 781 taatggcctt cttcctcttc ctcttcctct tcctcttcct cttcctcttc ctcttcctct 841 ccttctcctt ctccttctcc ttctccttct ccttcgattt atttattatt atatgtaagt 901 acactgtagc tgtcttcaga cacacaaggt cagatctcat tatggatggt tgtgagccac 961 catgtgttgt tgggatttga actcaggacc ttcagaagag cagtcagtgc tcttaactgc 1021 tgagccatct ctccagcctg cccctcttct tcttaagcta ttgggctgca acatcactcc 1081 aagggaaccc aggttccaga gctggggcat caggtgtagg tgctccagca agcgtgggct 1141 ggagatcaga cctggtgggg aggggcggct ggaaggcagg acccatgcag acctggctgt 1201 agatcaggga cacagggaaa ttttggggct caaacaagtt tggaaagatg gcagggaccc 1261 tggtcatgga gaacaggtag gttggacaaa ggagccagtg gaaccgagaa agaaaggggc 1321 gtcctgtggt ggcttctatc cctctctatg ttctagtttc taggctcttt cttggtgtct 1381 ggcgcttgct tttctttatt cctctcacat tcagccctct gttccggagg ctttcgaagt 1441 aactctatct tagtctcttt ttaaattccc tgccccgggc cccccacttt tgagtacgta 1501 tcagctcgtc tttcagtttg agcctctcag gaggtgttct cttctttagg tctacccaga 1561 ctctcaccct cttgtcttct tttccaatct tgtttcagta ctgtacctgg ggtctggggt 1621 aggaagaacc cgctaggacc aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa ggccccaggg 1681 ggcgggtcca atccatcttc atttgcatat gtatgaggtc tcctgagcga gcaacgaaga 1741 ggcggggttc ctgaggggtg ggtgggaagg gggcggacac cctcagagtc acagactata 1801 aagactgtgc gcagggttcg cgctcccgca ctgctgagga gcggagcctc cgattggggg 1861 gcccctatcc ttgtctttcc cccacaactg cttgtccccg ccccagctcc cactggctgg 1921 gccaactgtg gtgtggctgt cgccgtttgt cggaggagcc ttaatcggcc acctgccgcg 1981 ttctacagcc tctttgcatc tcggatttcg gggcggctcc ctcccccatc tttccctgct 2041 tctttgcacc cccgcttttt ctgcgtttcg ctcgaatttt ggagccgtct ggttttgcac 2101 ccctttttgt tttcttctag gcggtgtgtg gggtgaaggt gcctcggcat ccctttccct 2161 ggttatcttt ccctgttttg acagcccccc tttctatcgc agtcgggggg cctagcctcg 2221 gtgcaccttc gccgcactgc cagcagacca cagtgcgtgg ctgtgcaccc cggaatttgc 2281 agcagctgta tatctgactg gagtctccct gcctgctccc gcgcgcattt ggtgcgtgga 2341 gggcttcagc gcgcgacgcc ccggcttctc cgcagccccc agcagcgcgc cgggactcgc 2401 cccgtgcctc caagatggtc atccagaaag agaagaagag ctgcgggcag gtggttgagg 2461 agtggaagga gttcgtgtgg aacccgcgga cgcaccagtt catggggcgc accgggacca 2521 gctggggtac gcggggctgg cgcgggaaag gtggtagctg accgctcggc gatgcctttg 2581 gggcgcaggg tcccgcgggc gtcgcccagc tcccctgccg ggtccctggc gtccagcccc 2641 cactgtcggg ctttggatcg ggaggggccc cgaatcagcg gtctaatctc tctgactggc 2701 cgctgcggag gcggagaaag taggtcactg ccgcctgccc gccccccgcg gagcccctcg 2761 ggcggggggt cgcgggctct gcgcgcgtgt ccgtgccacg gcgctcccgc tccggctcag 2821 gccctgcggc tgcacacggt catcatccct ctccccgaag agtgccctaa ctctccctct 2881 ggctctcact agctagccaa cctcgtttat ttttagctct cacccacccc cttggaccct 2941 gggaacattc atgaggaggc ggatctggca ggggggtctt gggagggggg gttcttaaca 3001 gcggaagttg tttgtctgta ttccaccatt gggcgtttgg agtcctctgt ggctgctttg 3061 tgggtggggg gctgccagga gggatggtat ttttccaggc ttgaggtttt cagatgtcag 3121 aggtggaggg agtaatggtc actgtgctga tatgggtaca tcctcaggca gggtgtttac 3181 atgggactca gatgccttgg aagaactctc tacccttacc tcatgggtgt gcaaggactt 3241 aggtagtcag gggctgctga tgggtacccg gtgaggtaag ggtgactgac agtcttgcaa 3301 gacagtgcca agctggagaa ttgaagcaca agggattctg tgtgacataa tttacgacat 3361 atccacatcg attggacaca gtcttcatca gaccacccac ctgaggctta ggggaccatg 3421 aaaggagatc aatgcagccc ttgtcctcaa ggaacccaag accctgtgga tgtgaaggat 3481 caacgatgga tgcttgggct actcagaagc ctccacagag gaactgtctt cttaggtaga 3541 aagactgaga gttcagggag attggaacag catgtcctcc tcactggagg cttaggatag 3601 gagtcctgag agatgatgtc cagggaaggc ttctaggaga ttctgtcttc tccttgactc 3661 tggccaagat cctgtgtgag ttaaaggtgg gggtgactgc tggccattag cccctcattt 3721 caggggctgc ttcctcaggg cgagagacag acagacagac agacagatca ctgagctgtc 3781 ctcaggcaac acacacacag atcttgctac ctgagtcctc tgaggtgata agaatttggc 3841 tgagagtgct gcgcagttac caggttgcat ccagatcctt gtgctctgag acggctctaa 3901 cttcgagttc cgatacggca ggaacagatg ctgccttttc aggaccgtgc agggttgaga 3961 agttggaaat gcattagccg cagaatgaca gtggcctatt cttggagttg tcacaacaac 4021 tacagtttag gtctcttgtg gcctccaaaa ggtcagactt cactgtgccc caggaggagt 4081 ctgggagtca gggctggagc tggaccctgc ctgtttccag cttgtttcct ttagctccca 4141 ggggaaggct ctaagatatc ctgtcgctcc ttggcctatc cccgtacaca gcttgagagg 4201 aaggaggaag agaggggttg gtggaagatg tgggagctgc tgactctagt tcctctatcc 4261 tagccttcat cctcctcttc tacctcgtct tctatggttt cctcacggcc atgttcagcc 4321 tcaccatgtg ggtaatgctg cagaccgtct ctgaccatac ccccaagtac caggatcgac 4381 tggccacacc aggtgagaga agagtagttt tccccttacc ggtcactgga actactgtgt 4441 ccttaaggcc tcacaaggaa cgtatagttc tttccaggag ttggagtggg gagttcttga 4501 acttggcctt ccttcccatg ccctgcagcc gggagtcatg tgatttggag actggcagat 4561 gccacaggag attgctctcc caggaaggca gttttccaga tctttgcccc cacctgaccc 4621 tggtgttctt catatctgtc catttccctt gatgtgtttc tcttcatctt atgtgactct 4681 ctagtcttct ctcctgttgg ctctaggctt gatgattcga cccaagactg agaaccttga 4741 tgtcattgtc aacattagtg acactgaaag ctggggtcag catgttcaga agctcaacaa 4801 gttcttggaa cgtgagtgtg gggctagtcc aggaccttgg gggaaggaat ccaggaccct 4861 ggaagtgaga catttggcct ctgaccttct ctgtctgcct cccacctcct agcttacaac 4921 gactccatcc aagcacagaa gaatgatgtc tgccgtccag ggcgatatta tgagcaacct 4981 gataatgggg ttctgaacta cccaaaacgt gcctgccagt tcaaccggac ccaactgggc 5041 gattgctctg gcattgggga ccctacccac tatggttaca gcaccgggca gccctgtgtc 5101 ttcatcaaaa tgaatcgggt acccatgata ttggtctccc gggaggaggg agctggggcc 5161 accatctgtt tactaatgtg tcctttcatt ggggttaatg ggcattaaag agatttttgg 5221 tagtttgttt taagaggtgg ggctattgga agctaagccc cagagctgaa aggctagacg 5281 ccaagatgtg gcaacttctt ctaaatccac cctccctcct ttctctaggt catcaacttc 5341 tatgcagggg caaaccagag catgaatgtc acttgtgttg gcaaggtgag tgtgggggcc 5401 ctccttacct gcccacctgg ttagacttcc tggtttctga gtgcttcacc catatctccc 5461 tatctttttg tgctttcaga ggccacagta tagggacaag ggggtaagag tgggcgccta 5521 tgcagtttta gctctaagag gctcttagcc ctattgcttc tctctaggat aaatgagagc 5581 ctgctgtcct ggagatagac ctatcccttc ctgcaccaaa gctctgacct ctggttcctt 5641 ccctgtcaac tttttcttac atctcagttg tctgggtttc ttccactctc cccatcatgc 5701 cttgtttctc agttccctca gtctgctagc tactgctcag ttagcaccct ttgctacaac 5761 tagttgtcct tggaaccctg cagccaactc tgtcctctct agaaactctc ctccttccca 5821 ctgagccttg actgtttatc tgttctttct tggctctgct ccagagactg attcccaagg 5881 acggggtaag aacttgggga ttgatggtgg agttagaagg ccctcaccgt gttgtcagca 5941 cccttagaag acctagtctg atgggagata ggccacccct atctgcagac atgcagatag 6001 gaacatgtgt gcatgcgcac acacaaatgc acacacagct acctgagcag atgcacagct 6061 caaagaaaac aagtttgaca gggataattt gggatgaagg aggtacagaa ggaagtcttg 6121 tgagcgcttc caggtgcctg ctgttcctaa catcctctcc ccttaacctt cctgcacccc 6181 cacagagaga tgaagatgct gagaaccttg gccactttgt catgttccct gctaatggca 6241 gcattgatct gatgtacttt ccctactatg gcaaaaagtt ccatgtaagt cccacctcgg 6301 aaggtccttg acggtggctc ctgaatgaaa aaatggtgtt cttgggaaag acgccaaggt 6361 accagaggct tacgtttttt tcttctcctg gcccaggtaa actatactca gcctttggtg 6421 gctgtaaagt tcctgaatgt gacccccaac gtggaggtga atgttgaatg ccgcatcaac 6481 gctgccaata ttgccacaga cgatgagcgg gacaagttcg ctggccgtgt ggccttcaaa 6541 ctccggatca acaaaacctg aggctcccca cccccacccg cccacactct cctgtggatg 6601 cttctggaat gtccttgacc ctgcctgatc cctccctcac ccaccccaaa ggtatttttt 6661 ataatagagc tatgacttgt ctgagcctca caccctttcc tcaacttctc tacctagcct 6721 gatgcccaca caatttccaa catcttccaa ccttagctta gccagagaca gagaggagtc 6781 gggagttttc tagttcggga accggagttg tcactcagcg acagaggact tgcctagcaa 6841 gcacgagggc ctcagcattg ttggaggttt ttcctagttt gagtttatga atgagatgcc 6901 cttacagctc ctgtttcagt ttctactccc atccccttag aggtacagga aatggtctca 6961 tccacccagc ctctacccca caagatccct cgaacccgtt tcagccactt gcttcccagg 7021 ggtgaacact gtccttcttc cttttacaag gttctagcca ctttcctttc atctcttcac 7081 acttctgtca ccatagccag tatcttggtg gctttgactt ctggttcctc cagcagttct 7141 gccctctcct ctccctgatc cgttgacctg caggtcgac // LOCUS MMCRPG 2140 bp DNA ROD 09-APR-1993 DEFINITION Murine crp gene for C-reactive protein. ACCESSION X13588 NID g50571 KEYWORDS C-reactive protein; C-reactive protein gene; crp gene. SOURCE house mouse. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 2140) AUTHORS Ohnishi,S., Maeda,S., Nishiguchi,S., Arao,T. and Shimada,K. TITLE Structure of the mouse C-reactive protein gene JOURNAL Biochem. Biophys. Res. Commun. 156 (2), 814-822 (1988) MEDLINE 89050112 FEATURES Location/Qualifiers source 1..2140 /organism="Mus musculus" /db_xref="taxon:10090" /clone="Lm mP-10" misc_feature 32..39 /note="TRE-like sequence" misc_feature 33..40 /note="TRE-like sequence" misc_feature 62..67 /note="HNF1-like sequence" misc_feature 84..93 /note="HSE-like sequence" CAAT_signal 161..165 misc_feature 162..167 /note="HNF1-like sequence" TATA_signal 200..205 exon 227..374 /number=1 precursor_RNA 227..2122 /note="primary transcript" misc_feature 234..243 /note="HSE-like sequence" misc_feature 252..264 /note="HSE-like sequence" sig_peptide 311..370 CDS join(311..374,588..1201) /codon_start=1 /product="C-reactive protein" /db_xref="PID:g295904" /db_xref="SWISS-PROT:P14847" /translation="MEKLLWCLLIMISFSRTFGHEDMFKKAFVFPKESDTSYVSLEAE SKKPLNTFTVCLHFYTALSTVRSFSVFSYATKKNSNDILIFWNKDKQYTFGVGGAEVR FMVSEIPEAPTHICASWESATGIVEFWIDGKAKVRKSLHKGYTVGPDASIILGQEQDS YGGDFDAKQSLVGDIGDVNMWDFVLSPEQINTVYVGGTLSPNVLNWRALNYKAQGDVF IKPQLWS" mat_peptide join(371..374,588..1198) /product="C-reactive protein" intron 375..587 /number=1 exon 588..2122 /number=2 polyA_signal 2102..2107 polyA_site 2122 /note="polyA site" BASE COUNT 560 a 452 c 438 g 690 t ORIGIN 1 ctttctcatt tttcctgtca cacagaagct ggtgattcag gggtcacagg agtttgtaat 61 aaataaccca cattgatttc tctgttctag aatgattttt tttttgcttc cctttctccc 121 agtggtctga cgtttacccc aagaggcagt gttaggaaat catttacaaa gtggttcagc 181 ccctccatct gctatagtta taaatctgag gatgggctgg gcccgaggca ggcgttccag 241 gactccttgt ccttgatctt tcagacaaaa cactgtcctc ttagtccaga tcccagcagc 301 atccatagcc atggagaagc tactctggtg ccttctgatc atgatcagct tctctcggac 361 ttttggtcat gaaggtagga gctatcataa agatcttttc cctatgggag aatggttgga 421 acttaatatt ttgcataagg aatcaaggat caggatcagg gtagctgtgt atttatgtaa 481 cctgggagag gaccagatga cccttgatcc caaactctac ctgtaaggga ggaataagtc 541 ttcattatct gagaaactac ttactttctt ggttttctgt ttcacagaca tgtttaaaaa 601 ggcctttgta tttcccaagg agtcagatac ttcctatgtg tctctggaag cagagtcaaa 661 gaagccactg aacaccttta ctgtgtgtct ccatttctac actgctctga gcacagtgcg 721 cagcttcagt gtcttctctt atgctaccaa gaagaactct aacgacattc tcatattttg 781 gaataaggat aaacagtata cttttggagt gggtggtgct gaagtacgat tcatggtttc 841 agagattcct gaggctccaa cacacatctg tgccagctgg gagtctgcta cggggattgt 901 agagttctgg attgatggga aagccaaggt gcggaaaagt ctgcacaagg gctacactgt 961 ggggccagat gcaagcatca tcttggggca ggagcaggac tcgtatggcg gtgactttga 1021 tgcaaagcag tctttggtgg gagacatcgg agatgtgaac atgtgggatt ttgtgctatc 1081 tccagaacag atcaacacag tctatgttgg tgggacactc agccccaatg ttttgaactg 1141 gcgggcactg aactataaag cacagggtga tgtgtttatt aagccgcagc tgtggtcctg 1201 acctactgtt gtgaaccctg aagcacctcc tgggattaca ttctctccct tgtctcgggt 1261 tatgaacctt ttagccccag cagatgttgt aggtctgttc tgtgaatatg gcctttcact 1321 tctctgcttt gtggtcctca gcactagagc acggaattta aatggaaggc ttccagcata 1381 agcatcccac taggactcta ccaaagagaa tctgacttac ccatggtttt atatatatat 1441 gtaaatatcc atatatatat acatatatac atatatatat atatacatac atatatatat 1501 atatatatat atataattga aaaaatttca gacataattc ttctccctca catagatgag 1561 aaaatagatg cacagaaagg agaataattt tttattgttt ttgttttata atgtcatctt 1621 gagtgctgta tttacatact ttctatccct ccctcttcag atcctttcct atccttccaa 1681 attctctctc aaattcatga tgtcttatta ttagtcttat gcatatatac atatgcataa 1741 tacctatcat ctatcaatca atctatctac ctatctatca tctattcatc agtcatccat 1801 cttactgatt acatttagtg cttcttgtat tttgttgaag actggacact ggataatcta 1861 tcaggagggc ccctccctga agactgattg tccttttctc agcagccact gattacctct 1921 agctcttcat atagggttct gtctttgtga aatttcttct gtccatgttg catgtcaatt 1981 ggtgtcagta tgcaggtctt gtttgggcaa cctagagtga tggagcactg actacactgt 2041 gctcagaatc agttcttttc tggaataaaa tctgtacctg aacttcccca gtccatgagt 2101 caataaagtc acctttggct tgaatgaatt tgagcagttt // LOCUS MUSENDOBA 7879 bp DNA ROD 26-AUG-1994 DEFINITION Mus musculus cytokeratin (endoB) gene, complete cds. ACCESSION M22832 NID g340757 KEYWORDS cytokeratin. SOURCE Mus musculus (strain 129/SvJ) liver DNA. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 7879) AUTHORS Ichinose,Y., Morita,T., Zhang,F., Srimahasongcram,S., Tondella,M.L.C., Matsumota,M., Nozaki,M. and Matsushiro,A. TITLE Nucleotide sequence and structure of the mouse cytokeratin endoB gene JOURNAL Gene 70, 85-95 (1988) MEDLINE 89196920 FEATURES Location/Qualifiers source 1..7879 /organism="Mus musculus" /strain="129/SvJ" /db_xref="taxon:10090" /tissue_type="liver" repeat_region 1877..2008 /rpt_family="B1 repeat" TATA_signal 2263..2269 exon 2289..2742 /number=1 CDS join(2347..2742,3535..3617,3892..4048,4688..4852, 4964..5089,5365..5588,5910..6030) /standard_name="endoB" /codon_start=1 /product="cytokeratin" /db_xref="PID:g532610" /translation="MSFTTRSTTFSTNYRSLGSVRTPSQRVRPASSAASVYAGAGGSG SRISVSRSVWGGSVGSAGLAGMGGIQTEKETMQDLNDRLASYLDKVKSLETENRRLES KIREHLEKKGPQGVRDWGHYFKIIEDLRAQIFANSVDNARIVLQIDNARLAADDFRVK YETELAMRQSVESDIHGLRKVVDDTNITRLQLETEIEALKEELLFMKKNHEEEVQGLE AQIASSGLTVEVDAPKSQDLSKIMADIRAQYEALAQKNREELDKYWSQQIEESTTVVT TKSAEIRDAETTLTELRRTLQTLEIDLDSMKNQNINLENSLGDVEARYKAQMEQLNGV LLHLESELAQTRAEGQRQAQEYEALLNIKVKLEAEIATYRRLLEDGEDFSLNDALDSS NSMQTVQKTTTRKIVDGRVVSETNDTRVLRH" intron 2743..3534 /number=1 exon 3535..3617 /number=2 intron 3618..3891 /number=2 exon 3892..4048 /number=3 intron 4049..4687 /number=3 exon 4688..4852 /number=4 intron 4853..4963 /number=4 exon 4964..5089 /number=5 intron 5090..5364 /number=5 exon 5365..5588 /number=6 intron 5589..5909 /number=6 exon 5910..>6073 /number=7 polyA_signal 6068..6073 repeat_region 7099..7221 /rpt_family="B1 repeat" repeat_region 7530..7638 /rpt_family="B1 repeat" BASE COUNT 2066 a 1930 c 2063 g 1820 t ORIGIN 1 gaattctagg gtcttctagt cattactgaa agaactgaaa gtccaccagt ggttggcggt 61 gacttgaagc ttccttcccg tcccctcctg acggaggcct cactctgcaa taagtaactc 121 ctatttcttc agcagtgagt gctgtattaa cattcccgtg cactgcctcc tagaagatag 181 acaaacgatt cttttttttt tttcttctac ccatagaagt gaaaaacaaa acggaagttc 241 agtttagtga ctttctgaag tcacccaacc agcaaagtat ccaagttgga taacaaacgt 301 aaagcttaaa aacagcgaaa agttgcccat cggtacactt cattgtatag ttctataaaa 361 tagagcaatt gtggaactcg ggtggggggg gtgggggggg gagagtttgt aaaattgaaa 421 gcctgggatc tccaaacaca ccccctcccc aggagctttt aggcaaagcc agtgggcatt 481 tgtttgtatg ttattttgca ttccatttgc ctggctgtct ctggcaacac ccaggctccc 541 cagccaagtc ccacggagtc gctgaggttt tccggtccag aaggaggcag gcttcgaaca 601 atccatcaca cccagcatgc agtcagccct gtgtttgctt ccctctagtc tgatctgctc 661 acctaggttc ttcgtagccc ctccccctcc cccttgctct tctgatttta cttttctgtg 721 tgctcgcact cacacagtca gttcagctgg ccgctcgctc gataaggatc aataggtcca 781 ctccagagca aagagcatgg ccacaggtat agcaaacacc aagtcctgcc ggaaaaacgg 841 gctcctggtg ctgaccgatt ccaacctcag ggtttctccc tcctcattca gcaggagtgg 901 tcatccagct ggcgaggagg gggtgcgggg gtgcagaggg gctctttccc ccagaaaaca 961 agcaggaaag ccagaaagca ttataaaaat tgaccaggaa ggaccgaaga aaaggcccgg 1021 ggagcaggag aaggggacag atgcaagggg gtggcagagg ttgaggtggg ggatggagag 1081 gcaggcaatc tggttcaaac ttagctgttt ctacatatct ggtctatgga ctcagacccc 1141 aacctgtcca gacacagtat aatctacagg aaggacagtg gttctctctt ccttgctgtt 1201 cccatagcta ctagaaggtg tctagcaaat actggttaaa tctatgtgct cttgggttgg 1261 ctgtgttctc agctggtaaa gtggtctgcc tgaggagtat gaggccttgg gtttggtcct 1321 cagcaccctg taacctgaaa gtgatggagt ctcatgcctg ggatcccagc acttcaaaac 1381 tgggacggga agatcaggag ttgaaggcca tcctcagcta catggaaagt tcaagaccag 1441 cccgagctac atgagaccct gtctcaaaaa agaaggagga ggaggaggag gaggaggagg 1501 taaatggtag gtgtccagac atagggatac ctcttctaca taatcacaaa ggccccgata 1561 aagcctccag ggaatgagtt gaatgaggac atgtgtaccc acgcgtttga ggatatgtat 1621 ttccacggtt ccctatggag ccacagcagt gtcttggaac catggggccg gagcagaaaa 1681 accaagagac ctgccctcag gttcgtgttt ctcttctttt tgagctccca attttctttc 1741 cccaggattc ctggggttct aggactgccc ccagacttaa gctggcgaca tgccacaaaa 1801 tcttcaccct agtgaacccc tcttctgtct gtgtaaatgg agtttgtttg tttgtttgtt 1861 tgtttgtttg ttttcgagat aaggtttctc tgtgtagccc tggctgtcct ggaactcact 1921 ctgtagccca ggctggcctc gaactcagaa attcgcctgc ctctgcctcc caagtgctgg 1981 gataaaggcg tgcgccacca cgccccggaa taaacggata tttaaaacgc cattttttgc 2041 cctgaaaaca gacactcagg ggctaacacc acctgctatc ctttgtgcac cccttagtta 2101 tccactccct gtgtcggtga ggatggcagg tatgctgagg gcgtgtgcag aagtggagaa 2161 gggctgcgac ctgttaggtc acaggaatcc cgatcctggc gcccccggct ccggtggctg 2221 gggcgtggcc tgctggggga ggtccctacc tcctcccgcg gcgatataac aacaggtcga 2281 ggactgccac cctccgcggc ggaactcctg ttctggtctc tcgcttcgct ctcctctcca 2341 gacaagatga gcttcacaac tcgctccacc accttctcca ccaactaccg gtccctgggc 2401 tctgtgcgaa ctcccagcca gcgggtccgg cctgccagca gcgcagccag cgtctatgca 2461 ggtgctgggg gctccgggtc ccggatatcc gtgtcccgct ctgtctgggg tggctctgtg 2521 gggtccgcag gcctggcggg aatgggtgga atccagaccg agaaggagac catgcaagac 2581 ctgaacgatc gcctggccag ctacctagac aaggtgaaga gcctggaaac tgagaacagg 2641 agactggaga gcaaaatccg ggaacatctg gagaagaagg ggccccaggg cgtcagagac 2701 tggggccact acttcaagat catcgaagac ctgagggctc aggtaagggg cctaaggggg 2761 gaggggcagt agctccgggc tgttctgtct atccagtgtg cccacacccc caaacttcaa 2821 tagagtgtgg gctgttgact ccctccactc tttccacatt caccacatcc ccccacacac 2881 actctatggg acggagttag gaatactctg gactctcacc cacaacctgt aggaattcca 2941 agcgtagtgg ttacacacgc gagccggctt cccccaagca gtgtagtggg aagagaacag 3001 gattggaaaa aaaataaaat aaaaaggtgt gtgtggggga atcataaaat gaactcagac 3061 actcacttct tttccagtag gaacagttag agggttaagc ggatgtggct aagggtgagt 3121 catctaggag taaacaggag cttacctgta ggaggggcca gcgggacaag tggggggggg 3181 gcgcgtccca ggacgggggg gggggggggg ggggttagca ggtgcacctg gagaagatgc 3241 caggagagga tcagagaaat cttgttggaa gctgctcttt tgtaagcaat ccagatagtg 3301 caggccctgg cagaggggca aggaatccag gaagggagag gaggctcagg tgggaaggga 3361 agacagaaat acacgcctct gactgttcct gggactggga tggattcact gaaacacaag 3421 acgtgttttc tcatccccca cttctccaca cacacacaca cacacacaca cacacacaca 3481 ccctcatgtg acccggaact ctcaaagcga ctctctccac tttaatccac ccagatcttt 3541 gcgaattctg tggacaatgc ccgcatcgtc ttgcagatcg acaatgcccg ccttgccgcc 3601 gatgacttta gagtcaagta agtccggggc tgcagctgga gctaagaatg gcctactccc 3661 agaactgtat tagagtaggg ttccttcctt tccgaagttt attcaggtta gggttcattg 3721 actactgagt tagctcttca tctgtgcagc ccagtttaag caaacgctcc aaggccaaga 3781 atcgtgctaa gggaatgact ccgtgactgt ttatggtggg cacggggcta gtctatggag 3841 gtggcactgg ttgagaaggt tcaggccctc tgatcacctc cactccttca ggtatgagac 3901 agaactagcc atgcgccagt ctgtggagag cgacatccat ggactccgca aggtggtaga 3961 tgacaccaac atcacaaggc tgcagctgga gacagaaatc gaggcactca aggaagaact 4021 tctgttcatg aagaagaatc atgaagaggt aagccgggcc actgaccagg cctaggagct 4081 aagggccaag actaggcggg ccaggctgcc cccaaagcag cctagcatgg aagcagaagc 4141 cccggctgtt agcctgtttt gattaagagc agagccattg gctcacacat gttctatggc 4201 tcacctaagt tacactattg agagtcaaga taacattaag ccagttaagg cctggcttgt 4261 gcatcactac ctcctggaaa gctgagacag gaggattgag ttcaatgcca tcctgtctta 4321 gtctagtaaa acagagattg tgtgagtaag accctgagtt ggaggcaaag gtggggactg 4381 ccagtcttag gagctggtgc tcatgtgggc cttctggaag aagcttaagg taagatctgt 4441 ggaaggaaaa agcagctaca acgaggagag gatatcacca gcgagtggca ggttcaggct 4501 gtcaaggagt ggcattccca accagctgga gcagaagccc agtcttaggc ttgccaaggg 4561 tttcctctgc tctcttctgc ctttacactc aacatcacct tctagaaaag gccatcagta 4621 atgaaaggct gggctgaact gactttaagt gtcacctgcc accccccttt atttctgcct 4681 tctttaggaa gtccaaggtc tggaagccca gattgccagc tctggattga ctgtggaagt 4741 ggatgccccc aaatctcagg acctcagcaa gatcatggcg gacatccgcg cccagtatga 4801 agcgctggct cagaagaacc gcgaggaact ggacaagtac tggtctcagc aggtacaaac 4861 taggaaaggc tagctgcccg cccccactcc gcttaaggta agctctgtgt gaaggggagg 4921 tgccaaggac agcaggtcat gactgaccct cctcaccctt cagattgagg agagtaccac 4981 agttgtcacc accaagtctg ccgaaatcag ggacgctgag accacactca cggagctgag 5041 acgcaccctc cagaccttgg agattgactt ggactccatg aaaaaccagg tgagcatcct 5101 ccagtcacct gctgcagttc tgctaggccc agacctcaga cacttggtgg ccccagcccc 5161 aactgtctca cccctaagcc tgctccacac ccccattccc aatcaagcga ttccctaaga 5221 tgaatcactt ctatttgatt tttttttcca ttgctcccac actgtagcct agcatttgat 5281 agacaagagg catcaagacc ctttcaaaga agggttgagc aggctaagag tagagactca 5341 ctctcccatc tctgtgtcct gcagaacatc aacttggaga acagcctcgg ggatgtggag 5401 gcccgataca aggcacagat ggagcagctc aatggggtcc ttctgcatct ggagtcagag 5461 ctggcacaaa ctcgggcaga gggccagcgc caggcccagg aatatgaagc cctcttgaac 5521 atcaaggtga agcttgaggc agagattgcc acctaccgcc gcttgctgga ggatggagaa 5581 gatttcaggt gagtggtacc ctgctaacca cacaagtgag gggcttttct ccactcccgg 5641 agctttggga agtgccaagg tctgtttgtt ggtaaccact caccagtggc ctcaagtgag 5701 tgagttatgg gctccctccc gtgagaaaag aatgattaca tattatccct ggctgtccat 5761 gggatggcac cgagtgcctc ttttctggaa gaaatggctg aggagacttt gggggggtaa 5821 ggaagggcct cagggaaccc ttcttccaca tactctgtct cccctctatt ctgggctgtt 5881 tataactaag gcttggtcat ctgttacagt ctcaacgatg ccctggactc cagcaactcc 5941 atgcaaactg tgcagaagac aactacccgt aagatcgtgg atggcagagt ggtgtccgag 6001 actaatgaca ccagagttct gaggcactga ggcagagaag gagggaaccc ctgggaactg 6061 agggaccaat aaaagttgag agctcactgg acatcacttt gtgtcttcct tggcagtctt 6121 cctgcttgcg cacacaaaca ttaggcagga gtctcttgac cctagagttt cacacgactg 6181 gcctgtggga gaggaaaagt ctgggttctt gagacatcta ggagacgaag taggagatat 6241 cactgtccta ggtggaggtg aaagtgtaat aaaattagaa cctcccattc aagagacctg 6301 gtgaagaagc cagtggtggt gatgtacaca catggtttag gttgaagcag aaggatcgtg 6361 agttcaaggc tggtctgggc tagatagcaa gttccagggc aacctgcatc acatgcaaat 6421 atcctgctaa aacaagagct gggtatcggt tgtgcgcttc taaaatctct gagcttggta 6481 gtggcgagag gatcaagaat ttgaggtgat catctgctac atagcaaatc tgaaaccagc 6541 ctgggctaca tgagaccttg tctcataagg gagtctagga agggacttgg ggaagccaag 6601 catgttgaca caggcctata atcccagcac ctgggaaaga aacagaagca ggaacatctt 6661 gagttcaaag ccagtctaca gcaccctctc tgaagaaaaa aaaagaaaga gagaaaagag 6721 aaaaaaagaa aaaatgtgta attctccagc agagtccaga gaaaactcgc cccctgttct 6781 actacttttt cctttgccta aatcttagtt tttcccaccc aaagattctc aaaccccatc 6841 ccacactcac tacccagaaa cctgtgctcc cctgggctct gctctgacac tcagtgaaga 6901 cacactcttc agaggggtgt ggccagaagt gaggagacat gattagagaa catgaagcaa 6961 tgtacatggt ggatctcagg taggcaccag ggtgggcacc atgcaggtaa aaataggagc 7021 cccatggcac cattctgggc tttgttttgg agctatagga caaagcaatg gaaacaacac 7081 atgtctgaaa taacatgacc aggcatggtg gcatacacct gcaacaggag gcagaggcag 7141 gcggatctct gtgagtttga aaccagcctg gtctacatag ccagttcatg acagtcaatg 7201 ctacatggag aggtcctgtc tttactacta atagtagtaa ataatatgat ttaaaataaa 7261 aataacaatc tgtttcaatt ggggggataa tgtatcttta ttagtttcaa gaattataga 7321 agttaatata ttgtccaaaa ggaaagaggg aaataaaatt gaaaacaaat ctctgttccc 7381 ccaaagtact ttatcccagt tgaatacttc gttaggacag gcatatacaa atattttatt 7441 ttacttcata gatgtagggt tatactataa gttctgcacc ttaattgttt caattctaca 7501 tccttcactt aacaatgtat cttgaaatgg gggatgtggt ggctcaggtc tgtaatccca 7561 gccctctgta ggctaaagtg ggaggtagga ttactcagag ttcaggacca gtgtggttac 7621 aaagtaaagc cttcacctca ctcatttgaa cagctgcttg cagctataga atggagtaca 7681 aagttactta acccaaatac tctagttcct agagatgtct tggtctctag cagttttaat 7741 ttagcaaaag tctgcagctg atatctatgt gtacttttgc actccatgag catgtctgtg 7801 gagtaattta gttaaaaaat cgtaccattt aaacacacat gtgaaggact tactttaggc 7861 caggccctgt gctaagctt // LOCUS MMU77364 7627 bp DNA ROD 23-JAN-1997 DEFINITION Mus musculus homeodomain-containing transcription factor (Hoxd4) gene, complete cds. ACCESSION U77364 X65950 S46532 NID g1791004 KEYWORDS . SOURCE house mouse. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. REFERENCE 1 (bases 5193 to 7627) AUTHORS Featherstone,M.S., Baron,A., Gaunt,S.J., Mattei,M.G. and Duboule,D. TITLE Hox-5.1 defines a homeobox-containing gene locus on mouse chromosome 2 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 85 (13), 4760-4764 (1988) MEDLINE 88263027 REFERENCE 2 (bases 2999 to 7627) AUTHORS Popperl,H. and Featherstone,M.S. TITLE An autoregulatory element of the murine Hox-4.2 gene JOURNAL EMBO J. 11 (10), 3673-3680 (1992) MEDLINE 93010959 REFERENCE 3 (bases 1 to 7627) AUTHORS Popperl,H. and Featherstone,M.S. TITLE Identification of a retinoic acid response element upstream of the murine Hox-4.2 gene JOURNAL Mol. Cell. Biol. 13 (1), 257-265 (1993) MEDLINE 93109309 REFERENCE 4 (bases 1 to 7627) AUTHORS Folberg,A. and Featherstone,M.S. TITLE Characterization and retinoic acid responsiveness of the murine Hoxd4 transcription unit JOURNAL Unpublished REFERENCE 5 (bases 1 to 7627) AUTHORS Featherstone,M.S., Fischer,H., Folberg,A. and Popperl,H. TITLE Direct Submission JOURNAL Submitted (04-NOV-1996) McGill Cancer Center, McGill University, 3655 Drummond Room 714, Montreal, Qc H3G 1Y6, Canada COMMENT On Jan 23, 1997 this sequence version replaced gi:51417. FEATURES Location/Qualifiers source 1..7627 /organism="Mus musculus" /strain="129Sv" /db_xref="taxon:10090" /chromosome="2" /map="2D" mRNA join(189..388,763..841,1540..1575,3581..4097,4995..5824, 6383..7280) /gene="Hoxd4" /note="alternative mRNA produced by alternative promoter usage and alternative transcription start site" /product="homeodomain-containing transcription factor" gene 189..7280 /gene="Hoxd4" mRNA join(238..388,763..841,1540..1575,3581..4097,4995..5824, 6383..7280) /gene="Hoxd4" /note="alternative mRNA produced by alternative promoter usage and alternative transcription start site" /product="homeodomain-containing transcription factor" mRNA join(258..388,763..841,1540..1575,3581..4097,4995..5824, 6383..7280) /gene="Hoxd4" /note="alternative mRNA produced by alternative promoter usage and alternative transcription start site" /product="homeodomain-containing transcription factor" mRNA join(266..388,763..841,1540..1575,3581..4097,4995..5824, 6383..7280) /gene="Hoxd4" /note="alternative mRNA produced by alternative promoter usage and alternative transcription start site" /product="homeodomain-containing transcription factor" mRNA join(269..388,763..841,1540..1575,3581..4097,4995..5824, 6383..7280) /gene="Hoxd4" /note="alternative mRNA produced by alternative promoter usage and alternative transcription start site" /product="homeodomain-containing transcription factor" mRNA join(272..388,763..841,1540..1575,3581..4097,4995..5824, 6383..7280) /gene="Hoxd4" /note="alternative mRNA produced by alternative promoter usage and alternative transcription start site" /product="homeodomain-containing transcription factor" mRNA join(274..388,763..841,1540..1575,3581..4097,4995..5824, 6383..7280) /gene="Hoxd4" /note="alternative mRNA produced by alternative promoter usage and alternative transcription start site" /product="homeodomain-containing transcription factor" mRNA join(279..388,763..841,1540..1575,3581..4097,4995..5824, 6383..7280) /gene="Hoxd4" /note="alternative mRNA produced by alternative promoter usage and alternative transcription start site" /product="homeodomain-containing transcription factor" mRNA join(281..388,763..841,1540..1575,3581..4097,4995..5824, 6383..7280) /gene="Hoxd4" /note="alternative mRNA produced by alternative promoter usage and alternative transcription start site" /product="homeodomain-containing transcription factor" mRNA join(301..388,763..841,1540..1575,3581..4097,4995..5824, 6383..7280) /gene="Hoxd4" /note="alternative mRNA produced by alternative promoter usage and alternative transcription start site" /product="homeodomain-containing transcription factor" mRNA join(302..388,763..841,1540..1575,3581..4097,4995..5824, 6383..7280) /gene="Hoxd4" /note="alternative mRNA produced by alternative promoter usage and alternative transcription start site" /product="homeodomain-containing transcription factor" enhancer 2873..2889 /gene="Hoxd4" /note="retinoic acid response element (RARE)" enhancer 3468..3684 /gene="Hoxd4" /note="auto-regulatory element" mRNA join(4256..5824,6383..7280) /gene="Hoxd4" /note="alternative mRNA produced by alternative promoter usage" /product="homeodomain-containing transcription factor" mRNA join(4257..5824,6383..7280) /gene="Hoxd4" /note="alternative mRNA produced by alternative promoter usage amd alternative transcription start site" /product="homeodomain-containing transcription factor" mRNA join(4258..5824,6383..7280) /gene="Hoxd4" /note="alternative mRNA produced by alternative promoter usage amd alternative transcription start site" /product="homeodomain-containing transcription factor" mRNA join(4259..5824,6383..7280) /gene="Hoxd4" /note="alternative mRNA produced by alternative promoter usage amd alternative transcription start site" /product="homeodomain-containing transcription factor" CDS join(5398..5824,6383..6708) /gene="Hoxd4" /codon_start=1 /product="homeodomain-containing transcription factor" /db_xref="PID:g1791005" /translation="MAMSSYMVNSKYVDPKFPPCEEYLQGGYLGEQGADYYGSGAQGA DFQPSGLYPRPDFGEQPFGGGGPGPGSALPARGHGQEPSGPGSHYGAPGEPCPAPPPA PLPGARACSQPTGPKQPPPGTALKQPAVVYPWMKKVHVNSVNPNYTGGEPKRSRTAYT RQQVLELEKEFHFNRYLTRRRRIEIAHTLCLSERQIKIWFQNRRMKWKKDHKLPNTKG RSSSSSSCSSSAAPGQHLQPMAKDHHTDLTTL" misc_feature 6409..6591 /gene="Hoxd4" /note="encodes homeobox domain" polyA_signal 6973..6978 /gene="Hoxd4" polyA_signal 7275..7280 /gene="Hoxd4" BASE COUNT 1878 a 1825 c 2074 g 1850 t ORIGIN 1 tgaaaagatg caagttgaaa agcccggggt attgatatga atatgacctc tctcgcattc 61 tggatatttg gaactattta aatccaaggg aacagttgca gtgaaccttt atttagtgaa 121 taaaagttta aagccaaagg ttatttatgg ttctgcagag ccgggacctg agtggctatt 181 tataagcacg tgattccaat aaactttgtt ttatggcttg agagttgaca agccaaaata 241 taattcccac cataaattag gttaagagca tacaagtgca gatgcttggt tcctgtagca 301 gcgataggca agtgagaggc tccagccagc gccccgggct caggagacag ctctcctcta 361 agcttctcta ccaacgaaaa gaaacccagt aagtcactca ttactttgtc cacaataact 421 ctagagttgt ggggcagggg gtctgagccc agccttctct gcttggggaa agcctatcta 481 ttgaagttca ttttaatttt agcctaagcc taacttaata aattgaatgg ggggtcttgg 541 catttccctc tcctgtgtat acctggaacg ttgggaagtg agggccttct accccaaaag 601 caggacttac acttgcctct cagtagtgga gcaacaactt taaaagtgga agaaagcaat 661 gatggggggt ggggaggaga ggggaccggt gtggagatga gttacctctg aggtagttca 721 gagcgggttt gccccccctt ttctttttct ttctgttcct agaagactaa aaacttgaag 781 atggacatga gtgccctggg aaccactgtt ctctttgatc ggtctgtagg ttctcctgca 841 ggtatgttta aaaaattaat gtcaaacaga tagaatgaga tcatatttta gaccatgtct 901 gtaagatcta gatcattttc ctttaaacac atcaatttcc ctttctggtt atcttttggg 961 ttctcatttc ccctcggaag gaaatgactg ttttgaattg gagacatgga aaaaaaaaat 1021 atgtccctgc ctgccatgcc tcctctttga ggagaagcag gtttggacgc tggggtggaa 1081 ttaaaaacca acctgaaact ggaaaaggag gtctgaactg tttccagagg gagcagaatt 1141 ggggtcggga caaatagaat gtctgcaggc tctcaccagt gggctcgcac ccaccctccc 1201 acaccagaag gaaatgcaca aaggcctatt ttctgacttc ccatttcccc cctggggaaa 1261 ccctgcctca ggcagtctga gtcaggccgt ataactggtt gggttttgcc caagggtgcg 1321 tgaagtaggg taggtgacca ggacagactc ccatagccct tagctctgcc tttgaggatg 1381 ctatcagagg aaaactttgt gcagtacccg aaggcggtca ccagaagcaa ggagttctag 1441 gtattggtgg aatgtctggg ccttgtcacg ttctgcatgt cgcctatgga tgctttattg 1501 tcctgtgaaa tttgagagac cccactttca ttgcaacagg ttttggagcg tcagggagaa 1561 cgagggagaa ccactgtaag taccttccct tttcttccga ggaacgaaag ggtagtgcta 1621 agactgggag gaggtgggcg gaaggtccgc gccagcggtg atgggagggt taaggagtag 1681 cggagtggcg ctgcctcccc aggccaaaac ggactcgctt cgtggaggga tggaagctct 1741 aggctgaccc tttaggaagt atttctggga cctagttcac agtcacctgc tcaagtaaac 1801 gcaccgcagg ggaccgccct tgtcttcccg gacctctctg gaccatacag gcctttgcca 1861 ggcttctctg gctcgcacag gtgcagcaga cttgcagcta attagcagcc caccctatca 1921 acgaactgaa ctacgggcgg gccagcatcc acaactgcta gaattgaccc aatcaacacc 1981 aaggcaacgg gggtccctct ctgctgtcca gccattattc tctctgccca gtctatctca 2041 gccttctctt gctgtatata ggttttaatt taaaatttga tttattacag acaaaaacaa 2101 acatataacc cttgatggtc ttgggtgtgt tggattggcc tgtttccagg aagggctaca 2161 acttctgagc ttatgtagac attcctctcc tgtccctttc tgggtcctag cttcactggt 2221 gtgggtccca gtttgggata cccggtgaac aaaaagagta ctttctcttt ccctccgttt 2281 ctccttagct gcagttgaag aatttctact ctactccaga gtccagtgga attgggtggg 2341 atcagaggca gggactccag ttctttggct ctgaagtttg aagctgagaa tgctggtgac 2401 atcttctggg tgtcattaga agggttgtca ctagaaggag agtcaaggaa atagtagagg 2461 cttgcactca ctgcaaacca caggtagttt aattttaatg gaacagtagc agattggtaa 2521 ggaccacagg gaccaggcat cctccctctc ttgtgagatg tcttcccctt ccctggtcag 2581 agaagctgag actgggctca gggcaggttt ctttgctggc catggtgaaa aatctggact 2641 ccccagactt tctctggcct tgcccctgcc ctccttcctt ggcacccttg gctgtgaggg 2701 gctcatctct gctgaggctg cttgcttgct ggcgccttgt agctgcccac agcacaaaaa 2761 gccggagaga tcaaaagcat ttgtatgggc agttgagagg gaggtgaata tttaacgctt 2821 ttgttcatca ataacttgtt ggctttgacc tgtctgaaca agtcgagcaa taaggtgaaa 2881 tgcaggtcac aatgtctaac aaatatgaaa atgtgtgtac tcatcttggt cctcactcaa 2941 cccatcggac ttccccaatt tgtctgggta aaacagaggt aggcccagtg tccccagaaa 3001 ggcctgtcaa cgtgtccatg ggtgaaaggg ctggcattag gaggaactgg ggagagtctt 3061 taaacaaacg gattttaacc cccttttttt tttttagaac tgacacctgg gaaatccctg 3121 caaaatagtg gaagtgaaag atggtgggag ggcatggaag ggcaaagtac agggtcaaag 3181 tgagcaccta attttgccag atatgatgag gggagtctcc ctgccccacc ccatcaaaat 3241 caaggcatgg ggcttgaaag tgatttttcg aattatatag gaagggatca aaacaatact 3301 tagaaatgtg gacatcttca atctgttatc aggcatccat aagctcggcc caggccaggg 3361 catccctctg cccctgaaat cagcccagcc ccagagtgca tctgggctct tgatgccctg 3421 aattgagggg caagcccagg acagcggagg cgctggggcg gcgaaattcg cgaacttttg 3481 tactcttctg tgctgctgtc ggcaaagcaa aaataatgaa actttgtgat atgtttgtaa 3541 atgatttcga atgacccctc gccctgccct tcaccattag ctcgacagtc tcagcccagg 3601 gaggctcggt ggccgcaaac agcgcctgcc tccgggtctg cagcagcagt catagcagca 3661 gcagaagcag cagcagcggg atccgaaagt ccggcggggc tgagggccca gaaaggtaaa 3721 ttgctcgaat tctaggagga ctagctgtga ccactcactt ggcttaggtt ctggtgaaga 3781 ctattccttt tgcctttttc aaaactccct ttctgctgct gagggccttt gcttcttttt 3841 gggaaatacc acttgataag ggggcgggag atttgggttc tgagaagggg caattaagcc 3901 gccatctctg gggtgaaaaa aaaactcccc ttcttttctt ttcttttttt tttttttaag 3961 aagaagaagg tcctggctgc tcagctactt ggtggccccc atcctggcaa gccgatgagg 4021 gaactcattg ctgtagcgag cagcaatact tagaaatgca ccgagcctac ctgcaccatc 4081 tctgaaagcc aggcttggtg agttcctggt ggctgtgctg aagagatcag gtctggcagc 4141 tgcctgaatg tctgctctgg gtaggacccg aggttgtaac gttgtctata tataccctgt 4201 agaaccgaat ttgtgtggta cccacatagt cacagattcg attctagggg aatatatggt 4261 cgatgcaaaa acttcatata tctccgacat ggccagagac tgaggcgcgg agagtactgg 4321 cagcccagcc ggcaagccac ctctttcttt ttcattacct tagactccag gatgatcgac 4381 tttgcctggt gaaattgtac agccttttcc tggagaaact cctatagata ttccagtagg 4441 tgctcaagtg gttgtctgag tgtaaagcca gagagccagg ttttagggta ttagagagtg 4501 aggtgaaagg ccagaaagct ccagcagaag ccctgctctg aaggagcatg agacacaaag 4561 gtacccaaat ggggctcacc tgcccttgtg tgggaagaag acaaactgaa agataggcca 4621 tcggagaccg ttggggtaca tttaggcaag gttaaggtcc ccctccccac catcccaata 4681 tgacccaacc cctcagccac cagactcaga actgtgaggg cctcctgcag aaagatccaa 4741 cttcaactag aaacacccca gcctgcgcat tggcagaaaa gcatagtggt tactcattgc 4801 ctaactcaat tcaggtccgc cagattctgg taacttttgg gtgaccctga tgaagacaaa 4861 gccaagacag gtctttgtat gtcagagtcc aggactcctg ccagctggcc gggcagcctg 4921 cctgcctgcc taggactcta cccaactcgg aggccacatt gcccacactc attcttcctt 4981 attttgcacc ccagtaggag aattcactct tctttgaact atgtccattc tgggggctgc 5041 ttcaggcgat ggtgagtaag gagggcatgg gaggttagca aagtcagtgg aagacaaaag 5101 ccgagattac ctgccggaaa ttgggacccc ggggatagaa ttagaactct attagcatct 5161 gtcagggact ctcaaatgtg gcatggcaag tcacttgatt acacgtatgt tatttagtta 5221 aatttgtgaa aattatgaga tgctcaccaa cccggtgata aacatgcttt cttcctattg 5281 gctggcctgg tcacatggcc gcccaacttt attcagttga cagcaagtag gagggcccaa 5341 tggaaggaga aaaaagacaa cacgagaaaa attagtattt tctaccttct gaaattaatg 5401 gccatgagtt cgtatatggt gaactctaag tacgtggacc ccaagtttcc tccgtgcgag 5461 gaatatttgc agggtggcta cctaggcgag cagggagccg actactacgg cagcggtgca 5521 cagggcgccg acttccagcc ctcggggctc tacccacggc ccgacttcgg cgagcaaccc 5581 ttcggaggcg gcgggcccgg gcctggctca gcgctgcccg cgaggggtca cggacaggag 5641 ccgagcggcc cgggaagtca ttacggtgcc ccgggagagc cgtgcccagc gcccccgcct 5701 gcgcccctgc ctggcgctcg ggcctgcagc cagcccaccg gccccaagca gccgcccccc 5761 gggacagcac tcaagcagcc cgctgtggtc tacccttgga tgaagaaggt gcacgtgaat 5821 tcgggtaagg agtgggggca gcgacctcac agccctcccg caccactcct tgcatctttt 5881 ggcctgcacc ccccacccca ggggttgcga atgggtaggg gcttgtgcag cttccttggg 5941 cgcccgcaat tactctctcc ataaattttt atagctgagg gagcaggccg ggaccatgtg 6001 gctggctgct tggctgtggg cgcaaaaggg ggtggggatg gggatggggt ggggggagga 6061 ctccatttcc acagcagagg gaacgcagag caaaacaagg acctttccaa aatgttggcg 6121 ataccgagct cttggtgggc cagtggcaag ggagaacctt tgaggaggac ctgtgagcgt 6181 ggggagtgtg tgcatggaag ggcttggggt gggagtgtgt gtatgtttag ggagagggcg 6241 ggagggagga agctggccag cttggagcac agggggggcc cgagggctgc gagacgcgcc 6301 aggaagtgag ggcagaggca agcctggggc ctaactagtg gccgggctct gacctggctg 6361 tcttgtatgt tttgtctcat agtgaacccc aactacaccg gcggggagcc caagcgctcc 6421 cggacggcct acaccagaca gcaagtccta gaactggaaa aggaatttca ttttaacagg 6481 tatctgacca ggcgccgtcg gattgaaatc gctcacaccc tgtgtctgtc tgagcgccag 6541 atcaagatct ggttccagaa ccggaggatg aagtggaaaa aagaccacaa actgcccaac 6601 accaagggca ggtcttcttc ctcatcttcc tgctcctctt cagctgcccc gggccagcat 6661 ttgcagccaa tggccaagga ccaccatacg gacctgacga ccttatagaa gtggggacct 6721 tgggcctatc ccttcttgca ctctaggttg agcgaagctg cggggtcagg cggggcctgc 6781 tgtcacctca ctgggctcta aggtactgtg ggggtggacc tgggacctgc aggccaccct 6841 cggactaggt taccttcctg cccgagggca gcccccctcc ctaaagtggg gaagggtggg 6901 agggtgggcg ggcttcttta agtagattat catatggcag gagctactga gaacataaac 6961 ccttggcgag tcattaaact cctgaaaatc tctgctggtg gattgaattt gcaaatgaag 7021 gttggaggct ttaccccagt gggaaccggg atggtctcct ccacaccacc cccttcaggg 7081 agctgtgtgg cttagtaact gtgggggagt tgaggcagaa ggcagccagt gctaggtgct 7141 ttcagtgcaa gcaggagagg ctgagccaga atcccagaac tttctggatt atctatagaa 7201 tttcttctgt gatttaaaaa aaaaaaaaaa agaaagaaag aaattgtttg cttccagtgg 7261 cccatgccca aagaaataaa tccaagaagg aaacgggtta ttttatgttt agtttatatt 7321 tgcttaaata tttatttgtt cggggaatgg actgcagaaa ataaccgaca cctgcgtgcc 7381 aaaaacctta agatggtgaa tagtcttaag cttcagggaa ctgggttaga aagctgtgga 7441 ctcctttctg gccctattcc ggattttgag ctcctccctc ctaaactacc caccctggag 7501 atgtgtggag gaggccctta agccttcaag agggaagctg aggagccagg agaagcgagt 7561 ttttgaaaat atgtttccga atgtgcttct atagcactgt gtttacaagc agttccgggt 7621 ggatgaa // LOCUS MUSVASNEU 3494 bp DNA ROD 11-MAR-1992 DEFINITION Mouse vasopressin-neurophysin II gene, complete cds. ACCESSION M88354 NID g202341 KEYWORDS neurophysin II; vasopressin. SOURCE Mus musculus (strain B10.A) DNA. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 3494) AUTHORS Hara,Y., Battey,J. and Gainer,H. TITLE Structure of mouse vasopressin and oxytocin genes JOURNAL Brain Res. Mol. Brain Res. 8 (4), 319-324 (1990) MEDLINE 91101513 FEATURES Location/Qualifiers source 1..3494 /organism="Mus musculus" /strain="B10.A" /db_xref="taxon:10090" TATA_signal 1442..1447 exon 1461..1624 /gene="neurophysin II/vasopressin" /number=1 CDS join(1493..1624,2770..2971,3176..3348) /gene="neurophysin II/vasopressin" /codon_start=1 /product="neurophysin II/vasopressin" /db_xref="PID:g202342" /translation="MLARMLNTTLSACFLSLLAFSSACYFQNCPRGGKRAISDMELRQ CLPCGPGGKGRCFGPSICCADELGCFVGTAEALRCQEENYLPSPCQSGQKPCGSGGRC AAVGICCSDESCVAEPECHDGFFRLTRAREPSNATQLDGPARALLLRLVQLAGTRESV DSAKPRVY" sig_peptide 1493..1561 /gene="neurophysin II/vasopressin" mat_peptide 1562..1588 /gene="neurophysin II/vasopressin" /product="vasopressin" gene join(1598..1624,2770..2971,3176..3225) /gene="neurophysin II/vasopressin" mat_peptide join(1598..1624,2770..2971,3176..3225) /gene="neurophysin II/vasopressin" /product="neurophysin II" intron 1625..2769 /gene="neurophysin II/vasopressin" /number=1 exon 2770..2971 /gene="neurophysin II/vasopressin" /number=2 intron 2972..3175 /gene="neurophysin II/vasopressin" /number=2 exon 3176..3400 /gene="neurophysin II/vasopressin" /number=3 mat_peptide 3229..3345 /gene="neurophysin II/vasopressin" /product="glycoprotein" polyA_signal 3385..3390 BASE COUNT 749 a 992 c 984 g 769 t ORIGIN 1 cccgggaggg tggcaagagc gatggggatg ggagggccct ggctatgcat ggtctagaag 61 ccgtgggcta ggtgtgcata ggtcaagcat aggccaacta atctgggccc caaaccataa 121 agtttttctg gtgcctgaag taaaggagtc tgtaaagcct agagcagaat gcggtgatgc 181 acgcctgtaa ccctagccac tggagaggca gagacaaaaa gatgaagagc tcaaagcagc 241 tttcagctgt agtaagttca aggccagcct ggactaaact agactgcctt agaaacaaac 301 aactgactta cagtctaaat caggaattac accagctttc tcagactggc tctgtttagc 361 tgggtctcct cccatttcct gtcctagaat aacacccact tccaatcctg cccttagacc 421 tgagatattc ccaacctcag ggcctggggt ctccccaaag ctctctttcc tctttacggc 481 tgtgggtctc accaaggact atctctgagc tctatcctac accctagcct ctaccctaga 541 aggcctggaa ctcacagaaa ctctcctgcc tctgcttcca atggctggga ctaaaggcac 601 gcgtcacgac tggccttctt ttttatgttt tttaatattg agacagtgtc tcagccaagt 661 tactcagtct ggccttgaac ttttgacctt tctactttag cttcccaagt ggctctgacc 721 acaggcccat gaagccagat cctctgtgca agctttgcac cttactgagc ctggagttaa 781 cattgccacc atagctttcc catttgtgtc cttagtagaa acaatgggca ctattcccaa 841 gctctcccca cccccggctc tacagggtta tgcatgggat agaaacatcc tgggtgcccc 901 cgaagcagcc atgcctggca cagggcagac ctttcactgt gttcagcctt gactccatga 961 ctgtggccgt tagcccatga gactgcaagt gggaatttct ttctaaagcc cacctgatat 1021 gggtgcttcc tctcatccta taccacaact aacaacctgc ccaccccttc tggtgctgac 1081 cctgctccat acgtgccagc ccttgctgga tgggggcctg tggacccctt tagtctgctg 1141 agagcagctg gaatattcaa ctatgatttc caggtgaccc tccagtcggc tcacctcact 1201 gatcgcacag caccaatcac tgtgggcagt ggctcctgtc agacggtggc cggtgacagc 1261 ctgcatggct ggctcccctc ctccaccacc ctctgcactg acacgcccac gtgtgtcccc 1321 agatgcctga atcactgctg acagcttggg acctgtcggc tgaggctcct ggggagccac 1381 tggggagggg gttagcagcc acgttgtcgc ctcctaggga acacctgcag acataaatag 1441 gcagcccagc ccgccaaggc agcagagcct gagctgcaca cagtgcccac ctatgctcgc 1501 caggatgctc aacactacgc tctccgcttg tttcctgagc ctgctggcct tctcctccgc 1561 ctgctacttc cagaactgcc caagaggcgg caagagggcc atctctgaca tggagctgag 1621 acaggtacca ctgtggtccc tttaggctgc tggcagatgc tgtagggaca ggggctagga 1681 gagagggaaa tgttatctga gcagtcagac tttatgggag gttcctggaa ggaggcagta 1741 tcttacagca gagtagatgg actacccaga agggtgagag gggaccaggt gctagagaag 1801 ccgcataaag gatactgtcc ccaggcaggg gatatgccag aaaatgagag acacttcctt 1861 atgactgggc ttgggatgag aacaggttaa actgggtgcc ctggactcct ctgcacaccc 1921 ggaggttgag gactgggcag attatgcaaa actattcttg cctgaattca aatcctttcc 1981 cacccagctc agcctccctt ggtgcctttt ctagccagca gtgccagctt cttcctgtcc 2041 acagaaggtg gccaatgccc catgcccaag tggagcattt cccgcatcga acctcagcct 2101 cttgctcaga tctgttgtat tgtatgttca gctgtgagtc tgcctgcccc tctggcagag 2161 tttgagggaa tctagctact aggctcaatt ctggtcaggc gatgggtggc tgaatgttga 2221 gttgttgaac aagtttcgag tgggcaggta ggcagctcct gtagtctgcc ttccctttgc 2281 agagttcctt tggaggtgtg tccgggacct aatttagtcc ttgccaccta ccaactaaga 2341 cattataggt tggcgggagg taaaggctca tatgaagcca ccagcgtggg gcagaggtga 2401 gagcaaagcc agaagacgag tgagctatct agatactctg tggggagtga gaatctaggg 2461 atgtgtagga ggaccatctg aatgacggag aggtaagcct ccgagagatg gctgcacacc 2521 agtgacactg agaactgagg aaggtctccc tcaagtgttg ccccgcagcg agagggtttt 2581 gagacctcat gagctgacca ctgatctttc tgatgtccca gccggttaga ttttcactct 2641 tgcccttacc gctgcttcgt cctggacatc gccagagcac cagcaacgca aagcagcagg 2701 tgacactagg ttcccaccgc ccctcttggc ctcgttccag ctgacctccc ccaccccttt 2761 cctccacagt gtctcccctg cggcccgggc ggcaaaggac gctgcttcgg accaagcatc 2821 tgctgcgcgg acgagctggg ctgcttcgtg ggcaccgccg aggcgctgcg ctgccaggag 2881 gagaactacc tgccctcgcc ctgccagtcc ggccagaagc cctgcgggag cgggggccgc 2941 tgcgccgccg tgggcatctg ctgcagcgac ggtgcgcaca gggccgggcg ggcatcgcat 3001 ggagtggggt gggaggcaaa ggcggggggc taagtggggg gctggggaat caggacagga 3061 agtggagggt gagaatgaag ggagtcgagg gttggaaggt agcagggggg attgttggga 3121 ccgcaccctt ggagctgagc ccaaggcgcc tgcgctcaca gagctcttcc ttcagagagc 3181 tgcgtggccg agcccgagtg ccacgacggt tttttccgcc tcacccgcgc tcgggagcca 3241 agcaacgcca cacagctgga cggccctgct cgggcgctgc tgctaaggct ggtacagctg 3301 gctgggacac gggagtccgt ggattctgcc aagccccggg tctactgagc catcgccccc 3361 acgcctcgcc cctacagcat ggaaaataaa cttttaaaaa ctgcaccctg gtgtctgtct 3421 ctatttctgg ggtggggaga agaggggagg gagaggcgtt ggagtgggaa ctttctactc 3481 tgccctggct aacc // LOCUS MMHISTH1 1943 bp DNA ROD 22-AUG-1996 DEFINITION M.musculus gene for histone H1. ACCESSION Z38128 NID g558678 KEYWORDS histone H1. SOURCE house mouse. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 1943) AUTHORS Drabent,B., Franke,K., Bode,C., Kosciessa,U., Bouterfa,H., Hameister,H. and Doenecke,D. TITLE Isolation of two murine H1 histone genes and chromosomal mapping of the H1 gene complement JOURNAL Mamm. Genome 6 (8), 505-511 (1995) MEDLINE 96014286 REFERENCE 2 (bases 1 to 1943) AUTHORS Franke,K. TITLE Direct Submission JOURNAL Submitted (18-OCT-1994) Franke K., Universitaet Goettingen, Biochemie und Molekulare Zellbiologie, Humboldtallee 23, GOETTINGEN, GERMANY, D-37073 FEATURES Location/Qualifiers source 1..1943 /organism="Mus musculus" /strain="Balb/c" /db_xref="taxon:10090" /clone="LiM B.2.1" /tissue_type="Liver" /clone_lib="lambda EMBL3 Sp6/T7" /chromosome="13" misc_signal 566..572 /standard_name="Histone H1 Box" /note="putative" /function="Histone H1 5' consensus signal" CAAT_signal 636..640 /note="putative" TATA_signal 660..665 /note="putative" misc_feature 689 /function="transcription start" /evidence=experimental misc_feature 690 /function="transcription start" /evidence=experimental misc_feature 691 /function="transcription start" /evidence=experimental misc_feature 692 /function="transcription start" /evidence=experimental gene 744..1409 /gene="mH1.3" CDS 744..1409 /gene="mH1.3" /codon_start=1 /evidence=experimental /product="histone H1" /db_xref="PID:g558679" /db_xref="SWISS-PROT:P43277" /translation="MSETAPAAPAAPAPVEKTPVKKKAKKTGAAAGKRKASGPPVSEL ITKAVAASKERSGVSLAALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGA SGSFKLNKKAASGEAKPKAKKAGAAKAKKPAGAAKKPKKATGAATPKKTAKKTPKKAK KPAAAAGAKKVSKSPKKVKAAKPKKAAKSPAKAKAPKAKASKPKASKPKATKAKKAAP RKK" misc_RNA 1441..1456 /standard_name="Histone Dyad Element" /note="putative" /function="Histone 3'-end formation signal" BASE COUNT 521 a 480 c 488 g 454 t ORIGIN 1 gatctacaat agggatggca cagactcatc acgtctccag aaaatggcta aatcactaaa 61 acaggaaaac cgaaggggaa ttacacactg gtatctcaaa tatcattaaa aaatccaatc 121 gatattttct acaataatca ttcccccggt atgcactgct ttcccatcta gtgccaattc 181 ctacagcttg gtgcagatag ccctggaagc ttgtgttgtt gctttctcga aaaactcaaa 241 tttccctatg tatttccgtg aaaacaagtt atttcataga gtcctgaaat gataagaatg 301 tgaggaacca tgtcctgaaa caaaagtttg ctaatgaccg cgagagaagt agtattttta 361 aggaaaacct taaagttgtc ccaattttcg tcaagtttga aagagagttg gggcaatcag 421 agaagtttca gggtgccgat tttgaacata ggtgcacaaa ttaaggcaag aatgtagcct 481 acaggctctc ggatcagttt cccccaagtc cctaattatt tcgtgcccat attttttata 541 tttttatact tttttgaggg gcaacaaaca cagccacaag gcaaagctga agatcctttc 601 tctggcaacg cggcgcacgg cgacggcgca gggaaccaat caccacgcag cttctctcat 661 ataaacccag agcctgcagc actgggaaca accttctctg actgtttgtg cttacttttt 721 gctttactag taaagcttag aacatgtccg agaccgctcc cgcggcgcct gctgcccctg 781 cacctgtgga gaagacacct gtgaagaaga aggcgaagaa gaccggcgcc gctgctggga 841 agcgcaaggc gtccggaccc ccggtgtccg agctcatcac caaggctgtg gccgcctcca 901 aggagcgcag cggcgtgtcc ctggctgcgc tcaagaaggc gctggcggcc gcggggtacg 961 atgtggagaa gaacaacagc cgcatcaagc tcgggctgaa gagcctggtg agcaagggta 1021 ccctggtgca gaccaagggc accggcgcct ccggctcctt caaactcaac aagaaggcgg 1081 cttccggtga ggctaagccc aaggctaaga aggcaggcgc ggccaaggcc aagaagcctg 1141 cgggagcagc caagaagcct aagaaggcga ctggtgctgc cacacccaaa aagacggcca 1201 agaagactcc gaagaaggcg aagaagcctg cggcggctgc cggcgccaag aaagtttcca 1261 agagtcccaa gaaggtgaag gctgctaagc ccaagaaggc agcaaagagt ccagccaagg 1321 ccaaggctcc caaggctaag gcttccaagc ctaaagcttc taagccgaag gccaccaagg 1381 caaagaaggc tgcccctcgc aagaagtaga gtggtgcgtc ctgctttgaa aatctcaaac 1441 ggctcttttc agagccaccc acaacctcat tcaaaagagc tgagcctttt tctggttttc 1501 tcatggtatg tccgctgact ctggctatgt gttccgaagc agatcaattc tgtgcacttt 1561 gttatgggta catttaggag catctataaa ctctagttga gcgatagatt gtatcatttt 1621 cagctctagc tggacttctg gccggcagtt ttttgcttct gtgtgtgctc cacacacgtg 1681 ttcatattcc aagagtttgg cgggaagtcc tttctaatgt cgcctaggcg tgataaggca 1741 ttaggggggt tctctaggaa gctactctca caatagtggc tacaggctgg agcctctgtt 1801 ttaagttttc caatccggct acacaaggtg gttttgagaa ttaaccaatc aagccgtact 1861 ttcagtcatg ggtgttgcac caatcagaga ttgtgatatt aagatttaca tttacgtact 1921 gggacgctgg cgttttttct ctg // LOCUS MUSPNMT 3144 bp DNA ROD 28-JUN-1995 DEFINITION Mouse phenylethanolamine N-methyltransferase gene, complete cds. ACCESSION L12687 NID g293767 KEYWORDS phenylethanolamine N-methyltransferase; transgene; transgenic mouse line. SOURCE Mus musculus (strain C57/BL6) DNA. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 3144) AUTHORS Quaife,C.J., Hoyle,G.W., Froelick,G.J., Findley,S.D., Baetge,E.E., Behringer,R.R., Hammang,J.P., Brinster,R.L. and Palmiter,R.D. TITLE Visualization and ablation of phenylethanolamine N-methyltransferase producing cells in transgenic mice JOURNAL Transgenic Res. 3 (6), 388-400 (1994) MEDLINE 95093482 FEATURES Location/Qualifiers source 1..3144 /organism="Mus musculus" /strain="C57/BL6" /db_xref="taxon:10090" CDS join(1430..1664,2135..2342,2451..2895) /standard_name="PNMT" /note="putative" /codon_start=1 /product="phenylethanolamine N-methyltransferase" /db_xref="PID:g293768" /translation="MNGGSDLKHATGSGSDPKHAAEMDPDSDAGQVAVALAYQRFEPR AYLRNNYAPPRGDLSNPDGVGPWKLRCMAQVFATGEVSGRVLIDIGSGPTIYQLLSAC AHFEDITMTDFLEVNRQELGLWLREEPGAFDWSVYSQHACLIEDKGESWQEKERQLRA RVKRVLPIDVHKPQPLGTPSLVPLPADALVSAFCLEAVSPDLTSFQRALHHITTLLRP GGHLLLIGALEESWYLAGEARLSVVPVSEEEVREALVLGGYEVRELRTYIMPAHLCTG VDDVKGIFFAWAQKMEVQV" exon <1430..1664 exon 2135..2342 exon 2451..>2895 BASE COUNT 664 a 787 c 1047 g 646 t ORIGIN 1 gagctccacc gatatctgtg gaggaggtta tggaaggggc tggcaccagg gccgtccttg 61 gctgtgcctt ggggtgtgga tggggtcagt gaccctaagg cctgtcagtt gtagatccag 121 acagaatcaa tccttggctg gcatcaggtg tcccactgtc cctggcctgg gtgggaggac 181 agggtttaag ttcctgtctg tgacctctgc agctgttgtg atgatcccca tcccagcctg 241 ggtgtctggc ctttgggata aggaagggac actgggtagg actggataga agaccaggac 301 tatcttagca gaggctagta accctcccac cccagaaaga cataggactt tcttaggact 361 taaagggtct ctgctttagc aagactgggg atgctgttgc cagggtagct tgccattttg 421 agaacatggg aaggaagaca ggcagattat gttccagata cctggagcac taagcaaggc 481 ctgaaggcca ggccctgcca tcttcctagt ggggacacga ttgttgactg aggggtcggg 541 atcagggtag gggtgtgggg atgtcctgta cctgcgaaac tgtacctgcg aaatgccaag 601 agtgcgcatg cgctgcccct ctagcggccg tggcagtacc aagaacatgc tctgtactct 661 ttgttcctac ctgagtccag tgtcctggac ctggtaggaa catcctgaac caaccatgct 721 tgcctggccc ccagataagc agcacatagc cctagaggcc tgcaggggat gcccggatgt 781 tctgcttgct aaaagcatta gcacggctca ccttccttat ctctgctgcc atccgatgct 841 cagggcagag acctgctcag gacccagggt cctcaagaca gaggccagaa cagagtgtcc 901 tttctggagg agggaagggg tgctgcaaga tagagatggg ttagaggtct ggaggtaggg 961 atggtattat aaagaagagg agtttgtaag gggtgccccg agagagggga ggaagtctgg 1021 gaaggatgct gggactggga acacaggcta atttagacct cgggaaagag aaggggtgag 1081 atgctgggca aagagtcttt gggaggttgg agaggagctg gggtaccgct ggaactgagg 1141 ctgggggtgt agggcagccc tggagcgatc aggggctggg ggggggggtg gagggtttgt 1201 tgagcaagtg gaaagcaagg gtggggcagg agcgatgttc taaagggcgc cccccacgcc 1261 cccgcgcgtc tgttgctcag acactaactg agatggatgg agtgacagag atgtggtggc 1321 ctcaggcgcc tcatccctca gcagccaccc acccctgtga cggaggggtc cgggcggggg 1381 gacccagtgg tagataaagg gatgggggag aggaggtctc aacagaagca tgaacggtgg 1441 ctcagacctg aagcacgcta cagggagtgg ctcagacccg aagcacgctg cagagatgga 1501 ccctgactcc gacgctggcc aggtagctgt cgccttggct taccagcgct tcgagccccg 1561 cgcctatctc cgcaacaact acgcgcctcc tcgtggagac ctgagcaacc ctgatggcgt 1621 cgggccttgg aagctgcgct gcatggcaca agtctttgct accggtgagc accggaaacg 1681 gaggcatgag agagcagaac tcatggggaa aggtgcctac ctagcaggca cagacaggag 1741 cgggcctgta gtcttagcga ctagggacgc tgaggcttgg aggaccactt gaggccggga 1801 attggatgac tgactagaca gtttagagtc tttttctttc aaggggagga tgtttaagat 1861 tgagagacgg atggggccag gtgagagaga tagggagagg gagggggcgg gagggggagg 1921 ttggagaggc tcattccttg atgtctcggg atcgtgatct cccccactaa atcgtaacta 1981 gagctagttt ctgaggggtg ggacccgagt cagaggagcc ctgtcagcac agcacttgct 2041 ccagatacta ccctgaagaa gtgctgagga gagcacgagg ggaaggggcg gctgaggcag 2101 gagccactgg ttatccctgc ctctgctcca ccaggtgagg tgtcgggacg ggttctcatt 2161 gatattggct ccggccccac catatatcag ctgctcagtg cctgtgccca ctttgaggac 2221 atcaccatga cagacttctt ggaagtcaac cgtcaggagc tgggactctg gctgcgagag 2281 gagccaggag cctttgactg gagtgtgtat agtcagcatg cctgcctcat tgaggacaag 2341 gggtgagagc tggactggca gcttcgtagc agtgggtggc tggggggggg gctgcagaag 2401 gctgagtctt tggggtagtc ctgagccccg ccttgtgccc ccctgtacag tgagtcctgg 2461 caggagaaag aacgccagct tcgagcgagg gtgaagcgag tcctgcctat cgatgtgcac 2521 aagccccagc ccctgggaac tcccagtctg gtccctctgc ctgccgacgc cttggtctct 2581 gccttctgcc tggaggctgt gagcccagat cttactagct tccagcgggc tttgcatcac 2641 atcaccacac tgctgaggcc cgggggtcat ctcctcctca ttggggccct ggaggagtcc 2701 tggtaccttg ctggggaggc caggctgtct gtggtgccag tgtctgagga ggaggtgagg 2761 gaggccctgg tccttggtgg ttacgaggtc cgagagcttc gcacctacat catgcctgcc 2821 cacctctgca cgggggtaga tgatgtcaaa ggcatcttct ttgcctgggc ccagaagatg 2881 gaggtgcagg tgtgagggcc ccaaaaatgc caggtgtcct gtgcctccca aagtccttat 2941 cacctgaagt ggaacctaat aaagtgacag ttcccggctg tgactgtgct ggctttggaa 3001 gaacaacccc tagcttttct cttctccctg gaagcctccg tgaaggctgc tttctacacc 3061 acccatcccc ccaccccagg aaggccatgc tcaggaggcc acacggcacc tgctgtatgc 3121 caggtggctt atgacccagc ctca // LOCUS MUSPGK2 2147 bp DNA ROD 02-NOV-1992 DEFINITION Mouse testis-specific phosphoglycerate kinase (pgk-2) gene, complete cds. ACCESSION M17299 NID g200325 KEYWORDS phosphoglycerate kinase. SOURCE Mus musculus (strain RC3H/He) DNA. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 2147) AUTHORS Boer,P.H., Adra,C.N., Lau,Y.-F.C. and McBurney,M.W. TITLE The testis-specific phosphoglycerate kinase gene pgk-2 is a recruited retroposon JOURNAL Mol. Cell. Biol. 7, 3107-3112 (1987) MEDLINE 88038861 COMMENT Draft entry and printed copy of sequence for [1] kindly provided by P.H.Boer, 23-OCT-1987. FEATURES Location/Qualifiers source 1..2147 /organism="Mus musculus" /strain="RC3H/He" /db_xref="taxon:10090" mRNA 405..1970 /gene="pgk-2" gene 405..1970 /gene="pgk-2" CDS 462..1715 /gene="pgk-2" /codon_start=1 /product="testis-specific phosphoglycerate kinase" /db_xref="PID:g200326" /translation="MALSAKLTLDKVDLKGKRVIMRVDFNVPMKNNQITNNQRIKAAI PSIKHCLDNGAKSVVLMSHLGRPDGIPMPDKYSLEPVADELKSLLNKDVIFLKDCVGP EVEQACANPDNGSIILLENLRFHVEEEGKGKDSSGKKISADPAKVEAFRASLSKLGDV YVNDAFGTAHRAHSSMVGVNLPQKASGFLMKKELDYFSKALEKPERPFLAILGGAKVK DKIQLIKNMLDKVNFMIIGGGMAYTFLKELKNMQIGASLFDEEGATIVKEIMEKAEKN GVKIVFPVDFVTGDKFDENAKVGQATIESGIPSGWMGLDCGPESIKINAQIVAQAKLI VWNGPIGVFEWDAFAKGTKALMDEVVKATSNGCVTIIGGGDTATCCAKWGTEDKVSHV STGGGASLELLEGKILPGVEALSNM" BASE COUNT 644 a 422 c 492 g 589 t ORIGIN 5 bp upstream of PstI site. 1 ctgcagagga ttttccacag tataattaga gattggatgt ggggaagaat tatgtttttt 61 tttttttctt tttggtaatc ttatggttgg gtatgcttta ttttctaatt gatttgaaag 121 aggatataga aacaaaagac aggagaaaaa taatcctggc cccttctgat acttatttga 181 ggctttctaa tttcccaact ctaaccccaa gctctccgtt tactgtttag tattctaggc 241 tggcagtttg agtctgtacc aggcaaaaaa cgttccaaat caagatagac aggatggaga 301 accaatcaca gagctggatt tcctttcaaa ttctaccaat ggctattgtg caggagactt 361 tgaactcaca aagaaaggcg gggccaagac ttaagcgtta aaaatcacca ccaagccagc 421 ctcccagcag cagtaaagag gctgttgtgc ataccatcaa gatggctctt tctgctaagt 481 tgactctgga caaagtggat cttaagggaa aaagagtaat catgagagta gacttcaacg 541 ttcccatgaa gaataaccaa attacaaaca accagagaat caaggctgcc atcccaagta 601 tcaagcactg tctggacaat ggagccaagt ccgtagttct catgagtcac ctcggtcggc 661 ctgatggtat ccctatgcca gacaagtatt cattagagcc tgttgctgat gagctcaagt 721 ccctgctgaa caaggacgtc atattcttga aggactgtgt gggccctgaa gtagagcaag 781 cctgtgccaa cccagataat gggtctatca tcctgctgga gaacctgcgc ttccatgtgg 841 aggaagaagg taagggtaaa gattcttctg gaaaaaagat tagtgctgac cctgctaaag 901 tagaagcctt ccgagcatca ctgtctaaac ttggcgatgt ctatgtcaac gatgcatttg 961 gcactgcaca tcgggctcac agttctatgg tcggagtaaa tttgccccag aaggcatctg 1021 gtttccttat gaagaaggaa ctggattatt tttccaaggc tttagaaaag ccagagaggc 1081 ccttcctggc tatccttggt ggagccaaag tgaaagacaa gatccaactc attaaaaata 1141 tgttagacaa agtcaatttc atgattattg gtggtggaat ggcttacacc ttcctgaaag 1201 aactcaagaa catgcagatt ggtgcttcct tgtttgatga agagggagcc acgattgtta 1261 aagagatcat ggaaaaagca gaaaagaatg gtgtaaagat agtttttcct gttgactttg 1321 ttactggtga caagtttgat gagaatgcta aagttggaca agccactata gaatctggta 1381 taccatctgg ttggatgggc ttggactgtg gccctgagag cattaaaatc aatgctcaaa 1441 ttgtggccca agcaaagctg atagtttgga atggacctat tggggtattt gaatgggatg 1501 cctttgctaa aggaaccaaa gctctcatgg atgaagttgt aaaggccacc tccaatggct 1561 gtgtcaccat tataggagga ggagatactg ctacttgctg cgccaaatgg ggcactgaag 1621 acaaggtcag ccatgtgagc acaggaggtg gggcaagtct tgagcttctg gaaggtaaaa 1681 tccttccagg ggtagaggcc ctcagcaaca tgtaattgtc ataatgtact tgcttcctgt 1741 ttcctgcgca caggaccaga accaactcaa cctaacctat atctcaacat ttgttaacct 1801 ctactatgaa tcaagacgcc cgtatgtgct gcgtgtgcca tcaatatcac attcagcaag 1861 tcttaattct gtcatcatca tttgttagtc tcttcaagat ctcatcagga tttcccacag 1921 tccttcctag ggaggaaaca ttctcatgtc aactattaaa gaagtgagct aagtaagttg 1981 aatgtattgt ctttacttca aattcatttc tctctggata tgggagacaa catacagtct 2041 gtgataggag aaggataggg taaaggctgt gagagtttta aatggcaaaa gtgacccaaa 2101 ttaacagaac atcattatgt aaatataact cgaattataa ttagatt // LOCUS MMNUCLEO 11478 bp DNA ROD 27-AUG-1998 DEFINITION Mouse nucleolin gene. ACCESSION X07699 NID g53453 KEYWORDS B1 repetitive sequence; B2 repetitive sequence; nucleolar protein; nucleolin; RNA binding protein. SOURCE house mouse. ORGANISM Mus musculus Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 11478) AUTHORS Bourbon,H.M., Lapeyre,B. and Amalric,F. TITLE Structure of the mouse nucleolin gene. The complete sequence reveals that each RNA binding domain is encoded by two independent exons JOURNAL J. Mol. Biol. 200 (4), 627-638 (1988) MEDLINE 88316930 REFERENCE 2 (bases 1 to 11478) AUTHORS Amalric,F. TITLE Direct Submission JOURNAL Submitted (12-JUL-1988) to the EMBL/GenBank/DDBJ databases FEATURES Location/Qualifiers source 1..11478 /organism="Mus musculus" /strain="Balb/c, 125/c" /db_xref="taxon:10090" /clone="lambda HB3, HBc21" /clone_lib="Charon 4A" misc_feature 770..913 /note="B1 repetitive sequence" misc_feature 928..1009 /note="B1 repetitive sequence" misc_feature 1676..1685 /note="pot.G/C box 1" misc_feature 1702..1711 /note="pot.G/C box 2" misc_feature 1717..1726 /note="pot.G/C box 3" misc_feature 1739..1748 /note="pot.G/C box 4" CAAT_signal 1773..1780 CAAT_signal 1802..1808 /note="alternative put.CAAT-box" TATA_signal 1854..1859 exon 1876..2041 /number=1 CDS join(2024..2041,3024..3140,3797..4292,4508..4693, 4792..4878,5056..5197,6105..6229,7393..7516,7967..8121, 8614..8737,8928..9049,9754..9880,10212..10435, 10551..10627) /codon_start=1 /product="nucleolin" /db_xref="PID:g53454" /db_xref="SWISS-PROT:P09405" /translation="MVKLAKAGKTHGEAKKMAPPPKEVEEDSEDEEMSEDEDDSSGEE EVVIPQKKGKKATTTPAKKVVVSQTKKAAVPTPAKKAAVTPGKKAVATPAKKNITPAK VIPTPGKKGAAQAKALVPTPGKKGAATPAKGAKNGKNAKKEDSDEDEDEEDEDDSDED EDDEEEDEFEPPIVKGVKPAKAAPAAPASEDEEDDEDEDDEEDDDEEEEDDSEEEVME ITTAKGKKTPAKVVPMKAKSVAEEEDDEEEDEDDEDEDDEEEDDEDDDEEEEEEEPVK AAPGKRKKEMTKQKEAPEAKKQKVEGSEPTTPFNLFIGNLNPNKSVNELKFAISELFA KNDLAVVDVRTGTNRKFGYVDFESAEDLEKALELTGLKVFGNEIKLEKPKGRDSKKVR AARTLLAKNLSFNITEDELKEVFEDAMEIRLVSQDGKSKGIAYIEFKSEADAEKNLEE KQGAEIDGRSVSLYYTGEKGQRQERTGKTSTWSGESKTLVLSNLSYSATKETLEEVFE KATFIKVPQNPHGKPKGYAFIEFASFEDAKEALNSCNKMEIEGRTIRLELQGSNSRSQ PSKTLFVKGLSEDTTEETLKESFEGSVRARIVTDRETGSSKGFGFVDFNSEEDAKAAK EAMEDGEIDGNKVTLDWAKPKGEGGFGGRGGGRGGFGGRGGGRGGRGGFGGRGRGGFG GRGGFRGGRGGGGDFKPQGKKTKFE" intron 2042..3023 /number=1 exon 3024..3140 /number=2 intron 3141..3796 /number=2 misc_feature complement(3361..3406) /note="B1 repetitive sequence" misc_feature complement(3520..3680) /note="B2 repetitive sequence" exon 3797..4292 /number=3 intron 4293..4507 /number=3 exon 4508..4693 /number=4 intron 4694..4791 /number=4 exon 4792..4878 /number=5 intron 4879..5055 /number=5 exon 5056..5197 /number=6 intron 5198..6104 /number=6 misc_feature complement(5665..5968) /note="B2 repetitive sequence" exon 6105..6229 /number=7 intron 6230..7392 /number=7 misc_feature complement(6460..6573) /note="B1 repetitive sequence" misc_feature 6782..6895 /note="B1 repetitive sequence" misc_feature complement(7143..7185) /note="B1 repetitive sequence" exon 7393..7516 /number=8 intron 7517..7966 /number=8 exon 7967..8121 /number=9 intron 8122..8613 /number=9 exon 8614..8737 /number=10 intron 8738..8927 /number=10 exon 8928..9049 /number=11 intron 9050..9753 /number=11 misc_feature 9383..9450 /note="B2 repetitive sequence" misc_feature complement(9459..9590) /note="B1 repetitive sequence" misc_feature 9603..9667 /note="B2 repetitive sequence" exon 9754..9880 /number=12 intron 9881..10211 /number=12 exon 10212..10435 /number=13 intron 10436..10550 /number=13 exon 10551..>10627 /number=14 misc_feature 11005..11103 /note="B1 repetitive sequence" misc_feature 11122..11245 /note="B1 repetitive sequence" misc_feature 11319..11439 /note="B1 repetitive sequence" BASE COUNT 3169 a 2293 c 2825 g 3191 t ORIGIN 1 ggatccaata tgatatggtt atattcaatt ataatttgta taagcatcac acatggatta 61 ggtgttgatt ccacagaaac accaaatcct gcaagctgtc cctgctgctg gtgcactcaa 121 acttctgttc actccagtta gcaagtaatt tccaacacta tcatatgcca ggcaatgctt 181 taggtaccaa gaataaagca gacaacagtc tgtgctgtct ggggaagggg caaatgaaca 241 ataaatggcc aagaaaagta ctaggcagct gatatctgag agtgtaacta accttgggtg 301 caagcagagg tctcttagtg gcactgtaca ccagagggct gaggatagcc catacagata 361 gattggattg acaaaggaag ccatgtcacc caagtcattt cacactcaca gctcatcttc 421 ttgtctcctc ccgaatgcct ttagcttgct ctgcttggaa atatctagag tcttcctcac 481 accaccttcc tacctctcta cccaagagtc tcccttggtt ccataatata cagtacaagg 541 tgtgaccggt ctcgctataa agacaatgcc cccacaccac aactcgggtt cgccatctcc 601 tgccctgaca agagcgcact ggtgtgtagc tgtggctgcc ctccaccgcg ttgggaaggc 661 cgtagtaagt aagcatcaca cagtgactat ttctacttat ctccaaaggc ccgagttcag 721 ataaaaaaac ttggggactc aacacatccc actcgcatta aaatctcccg aggttgaagt 781 caggtatgat ggcatatgac tttaatcaca gcactaggga ggcaggggga tctctgagtt 841 caaggccagc ctggactaca gcgtgagtcc cagggcagcc aggaatacac agagaaaacc 901 tgtctcataa aataacatct cctaggtcgg gccaggcgtg ccgatacagg tcgaggctaa 961 catggtctac atgtgtgctc taggacagta aagaataaag tcctatctca actccggacg 1021 aaaactccac aggtgatttc tagtggatgt tgttagctga tgaagggcgc ctgtgtgcta 1081 tgcttggaac cagttggttg caatccattt aaagcaacgc cagacaactc tctcgttccc 1141 cccgccatgg agggaacctg tgagcgtggt cacaggcggg ccgagtactt ctccccacca 1201 caccaggaag tcacctctct caacctggag ttatacctac cgcgagaggt caccgacatt 1261 acatggatcg cttgtgcact gctcgtacac acacacacgc acaactgctt ttattaggag 1321 ctctcaggaa agcggggact cgcatcatag ccaagaagcc gttcgcgact ccgcggagaa 1381 caggccgagg cccgctcatc agcccgaggg aaccctaggc cttccggcgt tcttcagcag 1441 gaccacgcgg cggggggaaa gcaccgagaa acgcccagac cacctgagca tcgccgccca 1501 tgctgcctcg gaacacctga gggaatccgg gccacgccgc cacctacccg cgcctcacac 1561 acaagccgcg ccaaactcgc ccgtcccact gcgcaggcgt ggggagagcc cggcgctcag 1621 gagcattcaa aggacccgca cactgaccag tcaacgtccg cctgggccca aacgagtccc 1681 gccctcagga gcagacgtca ggccccgccc ctaagagccc cgccctcgtc aagccttagg 1741 cccgcctcac gctgccccgc gttaaggctc gcggttggct ccggggacgg acgcgagctc 1801 tggttggtgg agccgaagtc acgaggaccc ccttcgtcgc ctttccagag gcgattactg 1861 ggcaggctca gtcttttgcc tcagacgcta gctgtagctg gcaggcggtt gtacgtgctc 1921 cagagtcgtc ggtacccgct actgcagtcg ctttcgtgtg gcttccgctg agctcttccg 1981 agctgctcgc tctccacacg cgccgccgcc gtaatccgcc accatggtga agctcgcaaa 2041 ggtaagaggc cttggcgcgc cgacgcggac gactaggccc ctgctttcgg agggacgcgc 2101 gcgcgcccgc ccgtccgtcg cggaggggag gagggcttgc gcgcaatccc gggcgcgttc 2161 gagggcgcca tgctgggggg gaaagtctcg cgcgactagc gggaggtctc gcggtgcttg 2221 ccctctgact tagggggatg agaagagcgg aggcaggttt ccgggagggc gatatcgagg 2281 gttcggatgt agcgggcggg aggggacggt gtgaggagag atcggaggag ctgagagcgg 2341 ataggggcac ggcgtgggaa gagagggcca accttaggcg gcgagcggtc ccggggcccc 2401 gcctccccgc gcacgtgctc tggtgcgcgc ccgccacgtg ctctgcggag ccccgcacgt 2461 gtcgcgcgac ccggggcagt gggggagtgt ctgtagtacc ccggaaaggg ggacggcagc 2521 gtggggatgg atgggtggcc cggcgatctg ctgtctctgc cggtgaccgg gatggacacg 2581 tggtggaccc ctgaggtggc ggcgtggtga ctccacgtgg tggggctgga agcgagagaa 2641 agtgggaagc agttgggtta cgtggtgctg ctttaagagg tgatttcgag ataccccctt 2701 ccccagcaaa taacttaaag ggatcccttt aactgggttt tttttttttt tttttttttt 2761 ttttttgtgg aagatgccag aaaatagatg gccaggatta ggagacttta taacctgtgg 2821 ctgtttcttg gtgtagagtt ctgtctgctc agttatctgt gagaaggaaa aaaaaattat 2881 gcgcggttcg cagaaaaaac tgccaggaga atgccatgcc tggccaagaa gaagtcttta 2941 tgcttgtgtc ctttagtaag aaaaaggtgg tggccaaagg caaagtgact gaaaatgcgt 3001 gcaatttttg tgtgcgtttg taggctggca aaacccacgg tgaggccaag aaaatggctc 3061 ctcctccaaa ggaggtggaa gaggatagtg aagatgaaga aatgtcagaa gatgaagatg 3121 acagcagtgg agaagaggag gtaagaagct atttgcagcg aattaaaccg gtggaattga 3181 atgtctggaa gtcttagaaa tacaggatat gtagtaaatg gttgaatggc aagccccttc 3241 ctccctccct ccctccctcc ctccctccct ccctccctcc cttccttcct tccttccttc 3301 cttccttcct tccttctgca agacagtcgg caaaacaggg gacagaaaca ggcagaattt 3361 tgagttccag gcaagcaggg agtagtacat agtgaaacct tgtctcaaga ccgttgttat 3421 ggtcatgctc aatcagattt cttagaaaag ctcaggtgct gagtccagtt tttttttttt 3481 taaagtattg aaagccatgt ctccttattt cagggtttaa tgtttatctt tgtgtgtgcg 3541 cgcacccatt aagcatgctt ggtaccctca tactagaatg tacttggatc ccctggaact 3601 ggagttaaag ccacatgtga atgttacatg ttacaagagt aacacatgtg cttaactttt 3661 gagtcatctc tccagttctt ggttgtttgt tttttttttt ttaagcctat ctaatgtcca 3721 ttttcttgtg ctcaaagtta gtctcttaat gtagcattgg gtataaagga atgcttatga 3781 tttgtttgct ttcaaggttg tcatccctca gaaaaaaggc aaaaaggcta ccacaacccc 3841 agcaaagaag gtggttgttt cacaaacaaa aaaggctgca gttcccacac cagctaagaa 3901 agcagctgtg accccaggca aaaaggcagt agccacacca gctaagaaaa acattacacc 3961 agccaaagtc attccaacac cgggtaagaa gggagctgca caagcaaaag cgttggtacc 4021 aactcctggt aaaaagggag ctgccactcc agctaagggg gctaagaacg gtaagaatgc 4081 caagaaggaa gacagtgatg aggatgaaga tgaagaggat gaagatgata gcgatgagga 4141 tgaagatgat gaggaagagg atgagtttga gccaccaata gtaaaaggag tgaagccagc 4201 aaaagcagct cctgctgctc ctgcctcaga ggatgaggaa gatgatgagg atgaagatga 4261 tgaggaagat gatgatgaag aggaggaaga tggtgagtta gatcttagga tatttagggt 4321 actgcatgta cattccctca ctgtttcatt agattaaaaa ctcattttgt gctcttagtt 4381 ctttccataa cttaataggt tttcatttgc taagtagttt ttgttttttt taagtatttg 4441 tagcatttat cttgtctgga ttggtaggta gcaaatacat ttgcctgatt tgccatcttt 4501 cttccagact ctgaggaaga agttatggag atcacaacag ccaaaggaaa gaaaactcct 4561 gcaaaagttg ttcctatgaa agccaagagt gtggctgagg aggaggatga tgaggaagag 4621 gatgaagatg acgaggatga ggatgatgag gaagaggatg acgaagatga tgatgaggaa 4681 gaagaggagg aaggtaacca tattaacttt taaagtatgc tgacctaagt aaggcttact 4741 ggctatgcta aagtgtctgc ttactcatga atggcatttt aaaacatcta gaacctgtta 4801 aagcagcacc tggaaaacgg aagaaggaga tgaccaagca gaaagaagcc cctgaagcca 4861 agaaacagaa agtagaaggt aagcctgcaa aactggggaa acagatcaga gtagcactag 4921 cacaagtgat gagtgacaaa gggacttaat actgaaccat ggggttgaaa tgaaatatgc 4981 tgatgtgctt tatagtttat gatgaaattt gttgtgtgct taagtgggct gaaagttcat 5041 tttttgtgtg tgcaggctca gaaccaacta cacctttcaa tctgttcatt ggaaacctta 5101 atccaaacaa gtctgttaat gaattaaaat ttgccatcag tgaacttttt gctaaaaatg 5161 atcttgctgt tgtggatgtc agaactggta caaataggta agttttaatt gaatgttaca 5221 tgtgtatcag ctagaatttt tagtttccag ttgtattctt ccctgccttt aaacatgggg 5281 ctatatctaa ctatgttagt aaaagtcagt tgtctcctct cgtggcctta agtacagtta 5341 aggagctgca gtaagaaaga ctatagtatt gaactaaatg atcgagtcat agggcctgca 5401 atttgaagtt cctgtgtttg acttgataaa gataaaataa aatttaaaga agaaaagata 5461 ttaaacacat aaaattttgt gcagtatcta caactatgga tctgcatagt catatgcttt 5521 tagctaaaag tattctctgt acttttagcg gggtccatgc tagctactgc tgttagttac 5581 aatatactga atgaagaaat cgaggtgaat ttgttgtaat gtcttggtac atggacttgt 5641 tttgtttttt gttttttttt ctttaagatt tgtttatgta tatgagcaca ctgtagctgt 5701 ccagatggtt ttgagccttc atgtggttgt tgggaattga atttttagga tctcagctcg 5761 ctctgctctc agtccttgct tgctctggcc caaagattta tttgttgtta tacataagta 5821 cactgtagct gacttaagat gcatcagaag agggcattag gtctcattat gggtggtggt 5881 gagccaccat gtggttgctg ggatttgaac tcaggacctt cagaagagca gtcaatgtgc 5941 ttacccgctg agccatctct ccagcctttg gacttggttt tatggaagat aagggtgatc 6001 tagttttatt tttgttagtg ctgtagatgc tctgtgtgtg tgccacatgg tataagtgca 6061 gatcaccttc tcatacctgt aatcttgttt tttccatctt caaggaaatt tggttatgtg 6121 gactttgagt ctgctgaaga cctagaaaag gccttggagc tcactggttt aaaagtgttt 6181 ggcaatgaaa ttaaactaga aaaaccaaaa ggaagagata gtaagaaagg tatgtaaggg 6241 ggtctgggtg actggatact aacagactta ggcagtctgg tgcctcttcc ttagtttcat 6301 cctcattgtg aaccaatgag atgtcatagg tcatgtgctt gttgacaggt ttgattcctg 6361 ggatatataa tgtcagggct gacaggagga atagcttagt gagtaaagat gcttgctgca 6421 aaatgtttga tctctagaag ccacatgaag agagaagaac ctttaatccc agcatttggg 6481 agacagaggc aggcagattt cagagttcga ggccagcctg gtctacagag tgagttccag 6541 gacatccagg gctacacaga gaaaccctgt ctcggaaaaa aagttttagc ttatcctctg 6601 accacatgtg tatcgtgaca tgcttgaagc ttacctatct cttaaatgaa ttcttgatcc 6661 ctatattttg agtttcagaa tttggatttt aagtgtttgt ttcttagttg tgctgaaaat 6721 tgaacgtggg cttttcacat gctaggcaaa tttgttgggt ttttttgttt gtttttttct 6781 caagacaggg tttttctgtg tagccctggc tgtcctggag accaagctag ccttgaactc 6841 agaaatctgc ctgcctccca agtgctggga ttaaaggcgt gagtcaccac tgccctgcta 6901 ggcaatcact cttaaaactg ctacatatcc tctgtcccct tttgctcatt ttacaaggtt 6961 gctgtgtgct caatctgcag tctatgttat atgcttactg gatctaggct tttgatgtag 7021 aatgaaccat atgagtgatg aggtatctta gagatggaaa ctaagtctaa atagacttgt 7081 tccatataca acttaataca tatggtctaa ggaacatgat atacatgtaa acaagtagga 7141 aggagataag tctggtgtcc agggaagcca ggagagcctc atctgaaact ggacaggggt 7201 ttgtgagtca tcaggtgaca attgaacatg ggtactcttc atgcaaatgg ttagtaacca 7261 ttgagccacc tctccatccc tttataccat tttttttttt agcatatatc cttgtacttt 7321 ataggaatta tttgctttat tctcttgtga cttgtaaatt gatgtactta attaaatctt 7381 tttccaacat agttcgagct gcaagaacac ttctagccaa aaacctctct ttcaacatca 7441 ctgaggatga attaaaggaa gtgtttgaag acgccatgga gatcagatta gtcagccagg 7501 atgggaaaag taaagggtag ttttgtgtct ttgagtgtta aagttttatt aagtttagtg 7561 tcttcttccc ctctccttgt ccttgacagt ctctagtctg ctgcttcaaa cttaacttca 7621 taaccagaaa attgaatatc tggtcctctg gcctctacct cccaagttct gacattaaaa 7681 atgctcaact gatgggtttt gaaggtgtga tcaattttac aggctcaatc caaattggca 7741 tcttttgcca caagtactat ccttccctat tttatgagag aaatgtgatt ctaggcagtt 7801 cagtctattg tgttggctct ttttcctcct caccagttta aaggatgaag atgagctaat 7861 acatagtaaa agaacagtaa aagcacatgt gactaagtcc ttcatgtctg atgcttgagt 7921 aatattttct ctaacgtagt aactgaattg tcttgtactc tttcaggatt gcttatattg 7981 aatttaagtc tgaagctgat gcagagaaaa atttggaaga aaagcagggg gcagaaattg 8041 atggacgatc tgtttcactc tactatactg gagagaaagg tcaaaggcaa gagagaactg 8101 gaaagaccag cacttggagt ggtaagttaa agggtttatt gtgtagtggg aacaggaatc 8161 atttgtatct ttgtatttta agtaattggt tacctacaat tagttcacct ttgttcatat 8221 agctgatgtt tagtcttcat gagtgaaagc tatttgaaat catttccttt ggagtatagt 8281 aggcaaataa agctttttgt tgggtatgtt ttgtacttta aatggcttaa actattttag 8341 aaaatagtgt aagacaacaa agaacagtta tctaattaga atgaaaatga aaggagcaaa 8401 gaaggcatta ctgtatataa tggatataca ctggtggttc tagaattatg gtatatggta 8461 catggttgaa gtgccattgt ttcagttaac attccagtaa ccttgtggat taggttggag 8521 acatgcttta taggtgaccc acttactgag tgtttaaata tacacagaca tactctaaca 8581 taccttgcta atgtgttatc tttgtatttg caggtgaatc aaagactttg gttttaagta 8641 acctttccta cagtgcaaca aaagaaactc ttgaggaagt atttgagaaa gcaactttta 8701 tcaaagtgcc ccagaaccca catggcaaac ctaaagggta aaataatttt tacgttagat 8761 gtgggctgga catacatact cttacgtata agagtaagac tgtcctgtta gcttaaaaaa 8821 aaactaaagt tttagctata caaagggcag taaatattga tagtaaatta catgctgatg 8881 ccaagtgttt ctaagcttta ttctgagaac tgactttcaa ccttcaggta tgcatttata 8941 gaatttgctt catttgaaga tgctaaagaa gctttaaatt cctgtaataa aatggaaatt 9001 gagggcagaa caatcaggct ggagttgcaa ggatccaatt cgagaagtcg taagtccttt 9061 gacatgatat gacttggttg ggtgattttt ttttttattt ttatgtgcct atatgctcat 9121 ttggggctgt ctttatgttg ttgctgagaa aatgacaact ggatatgatg actgattacc 9181 tgagaaataa ttgatgaaat ctcaagaaaa ttcctctaga tagtcaagtt ctgatccagc 9241 tatgtcaact caaagcagca accttgattg ccctctgagt acgctttttt tttgatccag 9301 tgtagtcttt ttttttttaa ccttaatttc ttgtgttaat tgctttttct ggtaaaaggg 9361 ggaaaaaaag acataacaaa atcagtgtaa gggaaggctc agtggttgag cactgagagg 9421 acctgggttc aaatcccagc actcacatgg cacctatcga gacaggattt ctctgtgtag 9481 tcctgcctgt cgtggaactc actctgtaga ccaggctggc ctcaaactca gaaatctgcc 9541 tgcctctgcc tcctaggtgc tgggattaaa ggtgtgcgcc accactgccc aaccctgtct 9601 gtaactctta agatctgaca tagattgcag acaaaacact aatgcacata aaaaaatttt 9661 tttttttaaa aaaggaatct acttcagctg aatgtggcag tatggcagta ttcaccaagg 9721 gttcatagtg aaacaggaat ttttctcttc cagaaccatc caaaactctg tttgtcaaag 9781 gtctgtctga ggataccact gaagagacct taaaagaatc atttgagggc tctgttcgtg 9841 caagaatagt cactgatcgg gaaactggtt cttccaaagg gtaagaaggc gtagtagtgt 9901 tgctgctttt tagtgaattc tgcatggaga acttgggtct gcagtatctt ctcattgagc 9961 tcctttctgt ccatcagtga tagattatgg attcgcacga gaagaagaga gaattcacag 10021 aactggcact tatcttctgt ttttgcagaa gtatatttgg ctgttgtgtg agacattatg 10081 agatactggc gattttctcg acctgaagag tactttggtc actctacttg ggtgacttgg 10141 tacttattgt gttactttaa aatgtgttta cttaatgggt gaggtttttt tgtttttctt 10201 ttctgtttta ggtttggttt tgtagacttt aatagtgagg aagatgccaa agctgccaag 10261 gaggccatgg aagatggaga aattgacgga aacaaagtta ccttggactg ggccaaacct 10321 aagggtgaag gtggctttgg tggtcgaggt ggaggcagag gaggtttcgg aggcagaggt 10381 ggaggcagag gtggaagagg tggatttgga ggaagaggcc ggggaggctt tggaggtaag 10441 gaagggaaag gaactggaaa cggattccta aacctgtgtc ctaaccaacc accttaaatg 10501 ggaaggtcag tcctaattgt atcacccttt gatgtttttc cttcctatag gtagaggagg 10561 cttccgaggc ggcagaggag gagggggaga cttcaagcca caaggaaaga agacgaagtt 10621 tgaatagttc cttccatccc attctttccc tcttcatttt aaagaaagga ctctggggtt 10681 tttactctgt tacctgttca atgacagagc cttctaagga cattccaaga cagtaaagat 10741 cctaaactct gttgccagac ttaatacttc ctagaggtta catttggatt gttgcttaag 10801 agcttggcaa gaccaactct gcccagtggg aatttcagtt gtgttcctac tgctttcaag 10861 aattcccagc tacaaatgtg gaaattggtg gggaacagct tcatcatgat tctgatctgg 10921 aagatttaat ttgtatctta gaagggtgtg tactggaaaa gtggttgata gagcacatgg 10981 atcagtgctg gaaataccat acaagagctt gtgtggtatg taattctatt cccagcataa 11041 taagaacaaa cagtacagtg caatctgaaa tgacctggac aaggaatggt acccattggg 11101 ttgaaaaatt cacatagcca ggatggttgc acatgccttt aaccccagca cttgggaggg 11161 agaggctggt agatctgagt ttgatctggt gtacagagag ttccagaaca gccagggctg 11221 tttattagag aaaaaaccct gtcttgaaaa aacaagaaaa aaaaacctca cagacaacca 11281 aatacagagt tttagccact gagtaccaag tctcagcagc aagctgggag tgacgtaatc 11341 caatcccatg ggaggctaaa gcagtctatg acttcaatgc cagttgggtc tgcagaactg 11401 agttgctgga tagtgaggga tacttagaaa ctttgtcttg aaaaaaaaaa caagacctgt 11461 caaatttgag agaagctt // LOCUS MMU09189 6530 bp DNA ROD 30-NOV-1995 DEFINITION Mus musculus loricrin gene, complete cds. ACCESSION U09189 NID g520479 KEYWORDS . SOURCE mouse. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 6530) AUTHORS DiSepio,D., Jones,A., Longley,M.A., Bundman,D., Rothnagel,J.A. and Roop,D.R. TITLE The proximal promoter of the mouse loricrin gene contains a functional AP-1 element and directs keratinocyte-specific but not differentiation-specific expression JOURNAL J. Biol. Chem. 270 (18), 10792-10799 (1995) MEDLINE 95256248 REFERENCE 2 (bases 1 to 6530) AUTHORS Joseph A. Rothnagel. TITLE Direct Submission JOURNAL Submitted (22-APR-1994) Joseph A. Rothnagel, Cell Biology, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA FEATURES Location/Qualifiers source 1..6530 /organism="Mus musculus" /strain="BALB/c" /db_xref="taxon:10090" /clone_lib="Clontech ML1030j" /chromosome="3" exon 1539..1585 intron 1586..2676 exon 2677..4380 CDS 2700..4145 /codon_start=1 /product="loricrin" /db_xref="PID:g520480" /translation="MSHQKKQPTPCPPVGCGKTSGGGGGGGGYYSGGGSGCGGGSSGG GSSCGGGGGGSYGGGSSCGGGGGSGGGVKYSGGGGGSSCGGGYSGGGGGSSCGGGYSG GGGGSSCGGGYSGGGGGSSCGGGSYSGGGSSCGGGGGSGGGVKYSGGGGGGGSSCGGG SSGGGGGGSSCGGGSGGGGSYCGGSSGGGSSGGCGGGSGGGKYSGGGGGSSCGGGYSG GGGSSGGSSCGGGYSGGGGSSCGGGGGYSGGGGTSCGGGSSGGGGGGSSQQYQCQSYG GGSSGGSSCGGGYSGGGGSSCGGGYSGGGGSSCGGGSSGGGSSCGGSGGGGYSGGGGG SCGGGSSGGGGGYYSSQQTSQTSCAPQQSYGGGSSGGGGSCGGGSSGGGGGGGCYSSG GGGSSGGCGGGYSGGGGGCGGGSSGGSGGGCGGGSSGGSGGGCGGGYSGGGGGGSSCG GGSSGGGSGGGKGVPVCHQTQQKQAPTWPCK" BASE COUNT 1635 a 1507 c 1696 g 1692 t ORIGIN 1 ggatcctgat atagctgtct cttttgagac tatcccgggg cctagcaaac acagaagtgg 61 atgctcacag tcagggattg ggtgaatcac agggccccca atgttggagc tagagaaaga 121 acccagggag ctggggggat cacctgagtt catactgtcc aaactgaaac aagtggcaca 181 agtttctgag agccaaagtc taatcaggat cgtttagatc attaatgctc ccccataatt 241 aagacaattt ctgattagaa ttattctttc aacacagctg ggtggaacaa ggttcaacag 301 tggtatctta atagcaactg agttccaatg atgaaagaaa ggaaaaacac tatgttcttc 361 atacacagag gggggctgct cttggcccta gggtcatcag agaactgagt aaatcttata 421 ggaaaatagt taagatgtct tcacacacct cctttccaat agggttcaag ggcaggcatg 481 attggaagga aaagtgttct gtcatgtgag aaaagagcaa aagtattaat atcacatact 541 atgtagtaca ttcatatttc ataacttcca ttttcatgtt tctgtgaaat aaattatagg 601 attcctgctt ggtagaccaa atggggatca gacagctcaa caatgaacaa gtactcagta 661 actgccctgt tggtggcatt gcatgaacta ctgtgctttg cccatggtga catagcttga 721 aatagtaatg gaagacctga acccaactga gatctctaag tacattccac tctatggtgg 781 catctcagag gtcagagtca ctgtgcagcg ccataggaca tcagaatcaa agggtcatgg 841 tgaaaaggct gccagggtct gtcttgttag ttctcacctt tgtaagtaaa gtcagtagtc 901 agtaacaaag atcaaaacac ctgctctcac aaggaataac ttaaagtaga ctaaagtcat 961 gctagttaca gtgctgtctt ttccgtggta ccatcccaaa ctgggagctg gggactcacg 1021 aactctcaca accaataaag taagcagaac agaagcaacc caatgaagtg ttcatgaaac 1081 tggaatggag aaattgtggc ataagagatg gattctaaaa ttttgagaat ttccaagata 1141 atgaaattaa aaccaaacat caaaattgga aagatacaac tgaactagct tctatgtctt 1201 agacaatgtc ttagatctct agattccgta aggctgcttc acaagtctgc aacctagtcc 1261 tctagaatag ccctctggtt atggcacgca acctatacag aagttttgaa aacaatttct 1321 gccatccaca ctgctggcca tctctaatga ccaacctgct cactgttaca tcagagaagt 1381 ggccagtcat acaccaaact gcctatccct atcccaagaa tttgaaatct tcatgaatgg 1441 gtcaatcctt cccctgcaat cacagggagg aggtgcctga tcaatagatg agtcagagca 1501 ggacaagagt ataaaacaca ggagcaccag tgtccctcac atcagcatca cctccttccc 1561 tcactcatct tccctggtgc ttcaggtaag tgtgggctct cctggctgtc tggtctctcc 1621 agttggcctt gctcagcttg cagagaggtt aaggaacaga gcctttctcc cctttggaag 1681 gtactctgtt caaattgaga agggctttag gaaagcactg ggagagtggt aagctggtgc 1741 tgggcagatg atgtgtctgg tcttctgggc agaatgttaa aacttcacaa agatatgact 1801 atctcctact tctctggcac cctgggagct gagggttaga atactggatg actgcagtgg 1861 caggcctcca tgggctggat gaaccttttg aacctgccag aagtggctga atacactatc 1921 aggaagggag agggacgata agtcatagaa tggtgctgat gggagatttg agaagccaca 1981 aaaacccaag ctctgcttta tgagggcaga tgttctgaca gataaatgac ttgtgaggtg 2041 ctgaactaca cagcttccta ttagctacag ctaattggag tctaccaaat ttagactcct 2101 gcatatctca aaaagatgtc tactttcttc tggttagatg tactggtcca aaaggttcag 2161 agttcttcca tttgtttgca gacaggacca cagtagagct gtcttgtcta ataattggcc 2221 cttggaggat atctcactca ataggacaga tcaagagttt aaactaagga ctttatacag 2281 gaaatgctaa tgtccaaaca aatcttttct tattgtgctg ggagtggata aaatccacgt 2341 ggaatttttg caactttcta ctgaatttaa agaatcagca ctgggacttg ggagcaccct 2401 tagacatgga gtgtttatta atgtaagatc aaaagcaggt gggaatgtgg gggttctgct 2461 tcccaaatca catagtagaa gaaaggcaga gttgagggaa aagggggtca ctattaacgg 2521 gacttttgaa gagctaacca gtccaggaat ggagtccaga cacctagtct gcataaagct 2581 aggagtcaga agtatgttgg catggatgca tctgccacct tcacagcgtc ctcttgctgc 2641 tgttggtcta atgttgctct tctgctcttc ttccagggtt ccccttctcc ttaaacaaga 2701 tgtctcacca gaaaaagcag cccactccct gccctcctgt gggttgtgga aagacctctg 2761 gtggaggagg aggcggcggc ggctattata gcggtggcgg ctctggctgc ggaggcggct 2821 catctggagg aggctctagc tgtggaggcg gaggcggtgg ttcctatgga ggtggttcca 2881 gctgcggcgg tggaggcggc tccggtgggg gcgtcaagta ctccggaggc ggcggtggct 2941 ctagctgcgg cggcggctac tccggaggcg gtggtggctc tagctgcggc ggtggctact 3001 ctgggggcgg cggcggctcc agctgcggag gtggctactc cggaggcggc ggcggctcca 3061 gctgcggcgg cggcagctac tccgggggtg gctccagctg tggaggcggt ggcggctctg 3121 gtgggggcgt caagtactcc ggaggtggtg gcggcggcgg ctctagctgc ggcggcggct 3181 cctccggggg cggcggcggc ggctccagct gcggaggcgg atcaggaggc ggcggctcct 3241 actgcggagg ctcctctgga ggcggcagct ccggtggctg cggcggcggt tccggaggcg 3301 gcaagtactc tggtggcggc ggtggctcca gctgcggagg cggctattcc ggcggcggtg 3361 gaagcagcgg cggctctagc tgtggcggcg gctactcagg tggcggtgga tccagctgcg 3421 gcggcggcgg cggctattcc ggtggcggcg gcacgagctg cggaggtggt tcctccggtg 3481 gcggcggcgg cggatcgtcc caacagtatc agtgccagag ctacggaggc ggttctagcg 3541 gtggctccag ctgcggcggc ggctactccg ggggcggagg ctccagctgc ggtggcggct 3601 actccggggg cggaggctct agctgcggag gcggctcctc tggtggtggc tccagttgcg 3661 gcggcagcgg cggcggcggc tattccggtg gtggcggtgg cagctgcggc ggcggctcct 3721 ctggcggcgg agggggctat tactcctctc agcagaccag tcagacctcc tgcgcccccc 3781 agcagagcta cggagggggc tcttccggag gaggtggtag ctgtggaggt ggctcctctg 3841 gcggcggtgg cggcggtggc tgctactcca gcggtggtgg cggcagcagc ggtggctgcg 3901 gtggaggcta ctccggaggc ggcggtggct gtggcggcgg ctcttccggg ggcagcggcg 3961 gtggctgcgg aggtggctct tccggaggca gcggcggtgg ctgcggagga ggctactccg 4021 gaggcggagg cggtggctcc agctgcggag gcggctcctc tggtggcggc tctggaggtg 4081 gcaagggtgt gccagtctgc caccagaccc agcagaagca ggcgcctacc tggccgtgca 4141 agtaaggtca ccgggttgca acggagacaa cagagctgga agagttctcc gtgggcgccg 4201 atgggcttaa ctttctcatg aatttgcctg aggtttccaa acccttcaca ttttaagcgc 4261 cccttccccc agaagaagcc attgagtcgc tcaaggtgta tcctgttctg cagatttttc 4321 atcttggttt ctgaatgact acctcccaat tctagtgtct cctcagtcaa taaatttgct 4381 attcatgaga atctctgagt ttgctgtagt ctttgtagct tgcaaattta ctcagttcat 4441 tctgtgtttg ctttttccat tcattagttc acatttaaat tcactgaaca agtgttctat 4501 cccaaggtgg gggagtagat agatggaatg gggcaaagga tgaccaaggt tgtgaacagt 4561 ctggggtgtg gcttaaaaat catgagatgg tcctcaaaca ccaagaaaag tcttcactgg 4621 acatcctaca catcactgaa attgggcctg cgcaggcaat ttctagcagt gcagagttca 4681 ctctccaagt tctggaagca ggatggctct cagattaggt tagctaccag aggtccaagt 4741 ccactgacat gttctgacct aagaagaagg acattcaccc ctgaacaaaa gacccctgcc 4801 catgcgatct tccggaacac tataactact ttccttactc atgacccatg atagagcttt 4861 gaggcaaaga tacaaaccct ctatgtcttc tcaagattgc cagttcttca ttaagcctga 4921 taccttctta ccagcgcacg tctcctgaat actgataaag tctggttttg ttagtctgtt 4981 agaaaaatat tatatcagat aatcaagatc ctctacagtg tgtgagacag tttactgagc 5041 atctatagag atagaaggca gccctcttga aggattgaac gcgtacgttt cgtccaattt 5101 gagaaggtac atcgtaagta tttaagatgc ttaacatcag tatcacagag gtcactggaa 5161 acattagggg cctcctgatt agcaagcata aagctagagt tgctcaaagg catgtgtaac 5221 aaccatcccc tggccagatc ctgttttaca gtcagatttt atcagcttta ggtaaatgct 5281 aacttactga cttactcaag ttaattttgc tatactaaaa agccaatgtg ccttcctaca 5341 tttagctaat gatagaaata aaaagatttc atctcactct tccatttgga gtcatcacta 5401 ccttcatcat ttgcatcaga gatagagcat gccaagtagc aacctcagtg acacagtagt 5461 cttaccacca catttttatg gattaaatgt atttttttta gcatggttat atgtgcatat 5521 aatacactct gattactcac ttccctatcc tttcttactc ctccccatcc caacctgtat 5581 caatccttac cttccctaca attcccttta ccatgttttt gttagttttg ttggtttgtt 5641 ttgtgaccca ctgagctaac cagggccatc tgtatgacca tgggtttgga ttctgatgga 5701 atcccactgg gtacacaact gaaactagtg actccccttc acagaatcta tcagtagaca 5761 ataattcaac agggaatggt ggggctctct ccatccttgg ctaactgttg acaggacagt 5821 cttgtgcagg cctagtgcag acaaccatag ttgctgtgag ctcatgtttg caatggctgt 5881 gttatacata ggagatagta ttttggagcc attatccatg tctggctctt atattccacc 5941 ttctctttta ggatgttcct tgagtctttg aggaatgttt tggttagaac cgagtgctca 6001 gttgtcattt attttcagaa tcttgagcat caaaggatac ataagatatt atattatagg 6061 atactaaatt tttgtacaga tttttcatat acccttcata ttggttaacc ataatcccca 6121 atttttctct cctctaacac tccactgctc ccataccaga tgaaaccttt caactccatg 6181 tattttccct ctttgctttc attttatcta tattgtatga tctcaactcc cttaatctat 6241 ctcactacca ataacccttt tctaaactgg tagcctacaa ctttagttcc agtacttgat 6301 gcagaagtag atggagcaat gtgaactcat gctcagcctg gtctatggaa tgggttacaa 6361 gccagcccgg actatgtaat aggaccctgt ctcaaaaaca actaaaccaa acaaacaaac 6421 aaacaaagaa caaacaaaca aacaaaccaa aaatctcaac catttctagt ttttctagtt 6481 tttacttgaa catcaagtta agcataacta aagtttcaaa aataggatcc // LOCUS MUSALPBCRY 4181 bp DNA ROD 22-MAY-1995 DEFINITION Mouse alpha-B2-crystallin gene, complete cds. ACCESSION M73741 NID g191890 KEYWORDS alpha-B-crystallin. SOURCE Mus musculus (strain BALB/c, sub_species domesticus) (library: lambda Charon 28 #10277) DNA. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 4181) AUTHORS Frederikse,P.H., Dubin,R.A., Haynes,J.I. II. and Piatigorsky,J. TITLE Structure and alternate tissue-preferred transcription initiation of the mouse alpha B-crystallin/small heat shock protein gene JOURNAL Nucleic Acids Res. 22 (25), 5686-5694 (1994) MEDLINE 95140633 FEATURES Location/Qualifiers source 1..4181 /organism="Mus musculus" /strain="BALB/c" /sub_species="domesticus" /db_xref="taxon:10090" /germline /tissue_lib="lambda Charon 28 #10277" exon 193..912 /gene="alpha(B)-crystallin" /note="exon I in lung mRNA" /number=1 gene join(193..912,1961..2083,3741..4082) /gene="alpha(B)-crystallin" mRNA join(193..912,1961..2083,3741..4082) /gene="alpha(B)-crystallin" /note="lung mRNA" /product="alpha(B)-2-crystallin" TATA_signal 639..645 /gene="alpha(B)-crystallin" mRNA join(667..912,1961..2083,3741..4082) /gene="alpha(B)-crystallin" /note="lens mRNA" /product="alpha(B)-2-crystallin" exon 667..912 /gene="alpha(B)-crystallin" /note="exon I in lens mRNA" /number=1 CDS join(712..912,1961..2083,3741..3944) /gene="alpha(B)-crystallin" /note="lung mRNA" /codon_start=1 /product="alpha(B)-2-crystallin" /db_xref="PID:g191891" /translation="MDIAIHHPWIRRPFFPFHSPSRLFDQFFGEHLLESDLFSTATSL SPFYLRPPSFLRAPSWIDTGLSEMRLEKDRFSVNLDVKHFSPEELKVKVLGDVIEVHG KHEERQDEHGFISREFHRKYRIPADVDPLTITSSLSSDGVLTVNGPRKQVSGPERTIP ITREEKPAVAAAPKK" intron 913..1960 /gene="alpha(B)-crystallin" /number=1 repeat_region 1285..1324 /gene="alpha(B)-crystallin" exon 1961..2083 /gene="alpha(B)-crystallin" /note="exon I in lens mRNA" /number=2 intron 2084..3740 /gene="alpha(B)-crystallin" /number=2 misc_feature 2817..3015 /gene="alpha(B)-crystallin" /note="B2 repetitive element" misc_feature 3028..3185 /gene="alpha(B)-crystallin" /note="B1 repetitive element" exon 3741..4082 /gene="alpha(B)-crystallin" /note="exon I in lens mRNA" /number=3 polyA_signal 4061..4066 /gene="alpha(B)-crystallin" polyA_site 4082 /gene="alpha(B)-crystallin" BASE COUNT 1058 a 1023 c 957 g 1143 t ORIGIN 1 ctgcaggagg gcaaggagag gactagttgg gccttcacca gttgtacatt ccacatcacc 61 ctttgtcctt ctcagtctca ggcactgtgc acattacttt aaaaaaaaaa aaagacttaa 121 tgttctatga gccacatagc atccacaatg caagaaacat tttctgtctt tgtaaggtca 181 gtgtcttctg aacctagatc agctcagggt tccagtcaga cacctagttc tgctctcctc 241 taggactcca caaagagtta atgtccctgg ggctaagcct aggaagattc cagtccctgc 301 ccaggcccaa gatagttgct ggctcaattc ccctggcatg cgagactgga gaggaggagg 361 ggcccaccag cagctgcttg ggattccagg ctccatccta gctccagaga acaaggatgg 421 ggtgggtgcc actgggtgtg gacagagagc tagtgaaaca agaccatgac aagtcaccgg 481 tcagctcagc cctgcctgtg tttctctttt cttagctcag tgagtaccgg gtatgtgtca 541 ccctgccaaa tccctgatca caagtctcca tgaactggcg gtgagctggg ataataaaac 601 ccctgacctc accattccag aagcttcaga agactgcata tataaggggc cggctggagc 661 tgctgctgaa ggagttgacc agccaaccga ctctgcattc atctagccac aatggacatc 721 gccatccacc acccctggat ccggcgcccc ttcttcccct tccactcccc aagccgcctc 781 ttcgaccagt tcttcggaga gcacctgttg gagtctgacc tcttctcaac agccacttcc 841 ctgagcccct tctaccttcg gccaccctcc ttcctgcggg cacccagctg gattgacacc 901 ggactctcag aggtgagtct gcctatgcca gggcaggggt gagtgtccgt cctggaccct 961 cagtcccttt tccctttcac ccacccccaa gccattttag gcatctgtga gtgtgtgcta 1021 aggtacagag gggaaatgaa gctactgttt tcttctttct tcgggcacct gtttctgttt 1081 gatgctcggt ttcctggtcc atttttctgt gcatggtgag ggtcatagtc tgtctcctaa 1141 tgatggagca ggatgcttat ttctgttatt tctggtttgc ttcagctcct ttacttacct 1201 cttcaatagt catggtctgg gaggatttta tcatctcaaa ctgtaaagat cagatagaaa 1261 cccactcatc ctggactttt tctgctctct ctctctctct ctctctctct ctctctctct 1321 ctctcatccc ttgtctatga ttttctgggt tccttggggc tgagacagtg cactaacttc 1381 agtccatgcc ttaagaggag tgactgtccg accactctcg ggaaaccctg acttatccta 1441 actgggtggg agataagatt aacagagctg tcacagataa cactctggtt taaaaatatt 1501 cgagtgtgag gaaacaggag ctgagtgagc aagggctttg gaaggacagg accagcagaa 1561 cattccagat cgagtgggtt ggaaacttgg cagggagctg aacaaggaaa agggggcctt 1621 tgtcttatag acaaatgaca aagccagcat tggggagcga ggcagcagat gccatgccca 1681 gatccatcct tgtgactagt cccctaatga cctgtcttct tccctgtgcc tctaagaaaa 1741 gtttgggact gatgaccaag aggcgtcctt tgcatatcac tacggtagct cacatgcacc 1801 agagtcgcta ttcaaagggg agggggcgtg ttttccccca gcagtcaagt ggagcatgaa 1861 ttacctggac aggaagtaaa tctgctgaac gatgagcctt ctccagctta ataaacgttc 1921 tactatctcc tctcctcctc ctgtcccctt gttattctag atgcgtttgg agaaggacag 1981 attctctgtg aatctggacg tgaagcactt ctctccggag gaactcaaag tcaaggttct 2041 gggggacgtg attgaggtcc acggcaagca cgaagaacgc caggtgtgtg gacctctccg 2101 tcctcttttg tgaatccact ttgtgcacac tgggtgccag gtactgccac cagcctctga 2161 ggctggcaga attccaggcc ccaaacataa gtgactagat cgggagttac agatcaatgg 2221 ctgggaagct aagctagggc tcgctgtgct gatagtgttg tggttgtgct gagagaagtt 2281 ttcccttgct gttccttttg attgccttgt cttgtccagt gacactagga tatcagaaca 2341 gttcccaaaa ttccaaagca caccttaggc caattaaatc tgaagtttta ggactataac 2401 tcaagcatca acatgaaaaa acaaacaaac aaaagcccaa aaaactcctt gggtcctttc 2461 agtactacaa actggcaacc gttgacctag gcacaggatg ttatagataa tggttgttgt 2521 gcataatgac cataaccgca gggttgaagg cagtaagtct gtggctttgt gatgttcggg 2581 attccatgtt gcttctgctc agtttaatgc taagctactt ggttggtctg cttttcagag 2641 cttcttcagt cttatggctg tgaggaagcc agttaattga gatttgtttg tatggatcct 2701 tgggattcag aaggcatcag aattagcatc aacaattctt cttgcatttt tggccatttt 2761 tttccttgac tccctaaaaa ggtgttatga tttcttctta aaaattaatg acttgggggc 2821 tggagagatg actcagtagt taagagcagt taagactgct tgctcttcca gaggtcctgc 2881 gttcaagtcc cagcaacctc atattggttt acagccatct ataatgagat ctgatgccct 2941 cttctggcat gcaggtgtac atgaagatag agcactcata tacataaaac aaataaataa 3001 atctttttaa aaaaatcaat agctcacccg ggcagtggtg gcgcatgcct ttaatcccag 3061 cacttgggag gcagaggcag gtagattctg agcttgaagc ctgcctggtc tatagaatga 3121 gttccaggag aactggggct caacagagaa accatactta tctcaaacaa aacaaaaaac 3181 aaaaattaat aactcaattt tgctcagaat ttaggactgg accacagtgc cacacataga 3241 gtcatcttat ataatatcag caaccccatg ggctaggaac atttggagac attcctgcac 3301 aattcggaac atttatgggc ctcctaaagg atttttggca tcagacagat cctcaatgcc 3361 aaagtgggag tccaaactct atgggtccag ccaccatgga ttcattcctt tgtggaaagt 3421 gccccgaatt gtaactgtgt ctcgagatag gaagatcaat gaagttcagg actgtttcag 3481 atttcctaga ctctgattgg acagtttatg ctagccttaa acctggacaa acagctctag 3541 acttagctat ctgcatattt acagtgttgg agatgaactc ggggcctttc accactagac 3601 tgtatcccca gctccggata tctcttttta tcttagggcc ttttgatcta acccaaggga 3661 gagggatgtc tgaacttggg gcaaatggtg gtatttcctg ttcttatttc tctgctttgt 3721 cctttattct cttggattag gacgaacatg gcttcatctc cagggagttc cacaggaagt 3781 accggatccc agccgatgtg gatcctctca ccatcacttc atccctgtca tctgatggag 3841 tcctcactgt gaatggacca aggaaacagg tgtctggccc tgagcgcacc attcccatca 3901 cccgtgaaga gaagcctgct gtcgccgcag cccctaagaa gtagatcccc tttcctcatt 3961 gagttttttt taaaacaagg aagtttccca tcagtgattg aaaatctgtg actagtgctg 4021 aagcttatta atgctaaggg ctggcccaga ttattaagct aataaaaata tcattcagca 4081 acagacctgc ctcgtgtttg caaactcaag tgtgttttaa ataaatctgc aaatgtaaca 4141 gatctactaa ttcccaaact atgattcagg ggactccaag g // LOCUS MUSUPAA 9950 bp DNA ROD 15-MAR-1990 DEFINITION Mouse Murine urokinase-type plasminogen activator protein gene, complete cds. ACCESSION M17922 NID g202296 KEYWORDS urokinase-type plasminogen activator gene. SOURCE Mouse (strain Balb/c) DNA. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 9950) AUTHORS Degen,S.J.F., Heckel,J.L., Reich,E. and Degen,J.L. TITLE The murine urokinase-type plasminogen activator gene JOURNAL Biochemistry 26, 8270-8279 (1987) MEDLINE 88163489 COMMENT Draft entry and computer readable sequence [1] kindly submitted by S.J.F.Degen 12-DEC-1987. FEATURES Location/Qualifiers source 1..9950 /organism="Mus musculus" /db_xref="taxon:10090" prim_transcript 2182..8875 /note="uPA, mRNA and introns" intron 2253..2570 /note="uPA, intron A" exon <2601..2657 /note="urokinase-type plasminogen activator, (first expressed exon)" /number=2 CDS join(2601..2657,3142..3172,3310..3417,4044..4218, 4615..4706,4850..5072,5293..5441,6016..6156,6463..6611, 7790..7966) /note="urokinase-type plasminogen activator" /codon_start=1 /db_xref="PID:g202297" /translation="MKVWLASLFLCALVVKNSEGGSVLGAPDESNCGCQNGGVCVSYK YFSRIRRCSCPRKFQGEHCEIDASKTCYHGNGDSYRGKANTDTKGRPCLAWNAPAVLQ KPYNAHRPDAISLGLGKHNYCRNPDNQKRPWCYVQIGLRQFVQECMVHDCSLSKKPSS SVDQQGFQCGQKALRPRFKIVGGEFTEVENQPWFAAIYQKNKGGSPPSFKCGGSLISP CWVASAAHCFIQLPKKENYVVYLGQSKESSYNPGEMKFEVEQLILHEYYREDSLAYHN DIALLKIRTSTGQCAQPSRSIQTICLPPRFTDAPFGSDCEITGFGKESESDYLYPKNL KMSVVKLVSHEQCMQPHYYGSEINYKMLCAADPEWKTDSCKGDSGGPLICNIEGRPTL SGIVSWGRGCAEKNKPGVYTRVSHFLDWIQSHIGEEKGLAF" intron 2658..3141 /note="uPA, intron B" exon 3142..3172 /number=3 intron 3173..3309 /note="uPA, intron C" exon 3310..3417 /number=4 intron 3418..4043 /note="uPA, intron D" exon 4044..4218 /number=5 intron 4219..4614 /note="uPA, intron E" exon 4615..4706 /number=6 intron 4707..4849 /note="uPA, intron F" exon 4850..5072 /number=7 intron 5073..5292 /note="uPA, intron G" exon 5293..5441 /number=8 intron 5442..6015 /note="uPA, intron H" exon 6016..6156 /number=9 intron 6157..6462 /note="uPA, intron I" exon 6463..6611 /number=10 intron 6612..7789 /note="uPA, intron J" exon 7790..>7966 /note="urokinase-type plasminogen activator" /number=11 BASE COUNT 2746 a 2178 c 2524 g 2502 t ORIGIN 216 bp upstream of HincII site. 1 tatggaatct ctgattcagt ggtcctgtgg gttgctagca gctcggctgc gcaaggaaaa 61 cgtattctga agaacgatgg tcacctgcct acccaaagct gggtattcaa tgtgtacttt 121 cctatccaga ggttggcatt ctgggccact ggctggggta aagtcaagca gcctcctcct 181 tcctacctcc tggcattctt ttccaaggtc caggttgaca gtaaaatgtt gtaggctcag 241 gcttatggaa ataaagcaaa gctggggcac taatgaatgg tgcgggctca ggcacctctt 301 ctgtggaagg agcatcaaaa tgaccactgg tccttggctc tgaggcacat gttgcagtga 361 ctgacccttg gaatcttttc tgagcactcc ctactttgcc tgaggggtgt gtgcatgtgt 421 gcatgtgtgc aagtgtgtgc atgagggagg gtgctttgtc ttcaagatgt tcagggctca 481 tttacacatg acctgtcttt atataagcct attttattag ataaattatt agataaataa 541 ttgttttcgt cttaacatga tgatgtaaaa tgctgaacac aagtgcattt acagtgtgga 601 attggcaaca gaagagatgg ctaggctcac acccaaaaat aaaatgtgta attatttttt 661 tagaatgact gatacgagga cgtgtttaaa gtttttaaaa cagtgtgaaa tagtattcag 721 tgaaaaataa acattccttc atctcttcct tcagacccac agcctctcca ctacaaggga 781 agagcagaca cccccactgt tgttgcatgt tgatagccca cactaacgag gtttgagtat 841 tccttgcttt tttttcctat cctgttacaa tgtactgttc atgttactgt tcacatcact 901 gactgctctt ccagaggtcc tgagttcaat tcccagcgcc acatggtggc tcatgaccat 961 ctacaatggg atctggtact cttttctggt gtgtctgaag agaggactgt gcactcgcat 1021 acataaaata aataaatctt taaaaaaaag acacccccct tgattgctta tttgtgtctg 1081 tgtgcaacca caataccagt gaggtatgca ctcagtcctg atctatgatc ctctgtattg 1141 gttgtattct tttccagaca aataaataga ttgttcatta taggcatttt tcaaaagacc 1201 cattgacact tccatagcag ctgctgtgcc aagactttgc tcccacaatc ctaacatagt 1261 aaccattact gcttttttaa aaatcataat tatttttgag gatttcacta tgtaaattag 1321 tctgggcttg aactcgcaga gatccacctg cctcttcctc cctagtgttt agataaaaga 1381 cgtgcaccat tacgtctggc ccttgccata aattttaaaa ttgtctttca aagattgacc 1441 ttaaaccaaa cagcaaatct gaataaaatc tgaccttgcc tttcaccttc tgatgacagt 1501 attacactcc ctgacgacaa acttcactct tgtcttctga ttcactgctt gcattagtgg 1561 catttggaga actcagcatt tgacatgtgg gagcctttgt tagtaggtat tttttattgt 1621 aaaggaactg cgacttatac ccctctatca gacatctgaa tcagtgtcgt aggcaggtag 1681 ggggacaggt tggagaagaa ctgattaaac cactaaggaa gaggcttgag atcagcgagc 1741 caatggctgg agcgccttca ccacgctaca ctggggccgc actaggtgaa tgaaagaaag 1801 gaagaatgtt caagccgccg gatcatcgct cgatccagac agactgcgtt aagtttgctc 1861 agctgaaatt ccgtgacttc gtcaaagttg ggaagcaagc gcggtccagt tgcggtggga 1921 tgcaggaaaa ggaaaaggag agagagagag agagagagag agagagagag agagagagag 1981 agagggaggg agggagggag ggagggaggg agggagggag ggagggaggg agggagggag 2041 ggagggagag agagagagag agagagagag agagagagag agagagagag agagagagag 2101 agccgccctc cagggaacct gggcggggcc agggctctgg cgggccctaa taaagggcga 2161 gcagcgccga gcagagcctg tagccccaga gctctgtctg tcatccaacc agtccttgcg 2221 tgtctgccag cgcccttccg ctgcagtcac cggtgagtgc tgttggtctg aagcaagcct 2281 ggcgggatga ggcagccagg gctcccgcat gcctcccttc cccctacctt ggctggcgga 2341 actgtgggca aggtcaccac tccagccctt cgcgcccctc tacagagagg ttccatggtg 2401 ttgtgcggat tcagagcccg cagaggggag agactgcccg gcttggggag gttggtcact 2461 gatggcttgc cccgcagggt acctggagtg gcttccttcc cttggtctgt agtaactctg 2521 ccaccttcga gctgctccgc ttcttgtcct gacttctcct tcctttgcag aactgctgtc 2581 tagagcccag cggcactacc atgaaagtct ggctggcgag cctgttcctc tgcgccttgg 2641 tggtgaaaaa ctctgaagtg agtggtctcg ctgctttagc accatcagga aggggcttgc 2701 aggatccctt aagcagcatc aggggaaaaa tgggggctgc acggggaact taggcatcaa 2761 aggcaggtcc aggctttccc aggaaatagg acaatgtatc agtggagggc ttgtgcaccc 2821 aaagaggttt gcactatctg gcaagggagg aagaagccac ggggagtacc ttagcccaag 2881 ggcacctggt ttgtgtgaag tttgcttaag tcagtccatg tctgggtgct ggctaggaat 2941 aaacagaaag gggagagaca gacaggggtg gggtgggaga aagagagaga gagagagaga 3001 gagagagaga gagagagaga gagagagaga gagagagaga atattatgag tgaatgaata 3061 tcactggaag ggattttgag gtggggacct gtttatcctg aacatgaatt ccctagagca 3121 tgtcaccttc atctcttgca gggtggcagt gtacttggag ctcctgatga atgtgagtat 3181 ctgcttcctt gcacaatagt tggctgcaca gagacccttg aaaaacctta ggagacatac 3241 cctccctgtc cctgctgaaa ggctggctcc ccacttgatc cttgcttacc cctcctttgc 3301 atttgctagc aaactgtggc tgtcagaacg gaggtgtatg cgtgtcctac aagtacttct 3361 ccagaattcg ccgatgcagc tgcccaagga aattccaggg ggagcactgt gagataggta 3421 tggggatttg gacttggaat gtgggagtgg gggaggacca gagatcttag aacagggaca 3481 gatgggtggg atgcagaagc aggcagaagc tggccttgga ggtgtgggtc tgtgagccca 3541 gcacttagga ggagactgaa gcctggcctc catagtaagt ccttgtctca aaaggcgggc 3601 gaggcgcagc tagagagaca ggtgagtgat taagaacact ggctgctctt ctagacatcc 3661 tatagtttga gtttcagcac ccacagaggt gggttacagc catctgtacc cccagtccca 3721 gggaatctga tgccctcttc tgacctcttc cagcccaagt gacatacatg gtgtacaagt 3781 atacataaag gaaaaacact catacacata aaataagcaa acaaacaaac aaaacaggca 3841 gcagaagttg ggagtcacac acacacacac acacacacac acacacacac acacacactc 3901 atgaagcagt ggctatacaa gtgtgaaaaa aagggtgaat ctccctcata tcacctgaca 3961 ggtctgaaac cgtgtcacct ctgaaatgcc tgtccaaacc tcatcccttt tctaatactc 4021 tgcactcctc aaatcatttc tagatgcatc aaaaacctgc tatcatggaa atggtgactc 4081 ttaccgagga aaggccaaca ctgataccaa aggtcggccc tgcctggcct ggaatgcgcc 4141 tgctgtcctt cagaaaccct acaatgccca cagacctgat gctattagcc taggcctggg 4201 gaaacacaat tactgcaggt aggtggtgac tgagtaccaa gaatccttcc caagggggat 4261 agggaggtgg ctcagcagtt aagagcacag actgcctttc cagagcacct agtttgattc 4321 ccagggcagc tcgtgacagt ctttaacacc tgttctagag gatccgatgc cctcttctgg 4381 cctcactggg caggcatgca ctgtcatgaa tataggaaaa cacttataca cattaaaaac 4441 aacatccctt cccccatcgt ggcctcttag aaacctttgt tatcaccatg gtatacctgg 4501 gatgggaatc ctggcacaag aatccaggtc tctggttgag cctttgttgg aagggaggat 4561 acagagaaga cattcgggct tggcatgaca ttccctatct ctttgtgtta ccaggaaccc 4621 tgacaaccag aagcgaccct ggtgctatgt gcagattggc ctaaggcagt ttgtccaaga 4681 atgcatggtg catgactgct ctcttagtga gtgtcgctga ctgcttatga caacggggtg 4741 ggaagagaca aactctattg tcactgcagg agggatgaga agtgaggttg gcctcagaga 4801 ctcttcatca ttgctgtctc ccccaaacat gtgtctcttt cttttctagg caaaaagcct 4861 tcttcgtctg tagaccaaca aggcttccag tgtggccaga aggctctaag gccccgcttt 4921 aagattgttg ggggagaatt cactgaggtg gagaaccagc cctggttcgc agccatctac 4981 cagaagaaca agggaggaag tcctccctcc tttaaatgtg gtgggagtct catcagtcct 5041 tgctgggtgg ccagtgccgc acactgcttc atgtacgtcc atccctttgt cccttctctc 5101 tgactcttcc acccaacccc aagactgtcc ttcctccttc cctatggacg gttacaatgt 5161 cattctcctg ctaaccctct aaccatgcag cttgtggtct tgggtacaag taatactttg 5221 aggcctctgg ggtggagtgg agagagtgac cggactttgt gagaccaggc tgacatgttt 5281 catttctcat agtcaactcc caaagaagga aaactacgtt gtctacctgg gtcagtcgaa 5341 ggagagctcc tataatcctg gagagatgaa gtttgaggtg gagcagctca tcttgcacga 5401 atactacagg gaagacagcc tggcctacca taatgatatt ggtgagcaga aagcttagtt 5461 atcagaaagg ctaaagtagt ggtgggaaat gttgggggac ttgaagcccg ggatttatat 5521 aacgagacgg atgaggaaga gtgcagaatg agatacatga gaagctgagg ggtgtgggga 5581 tcctctgtgg agaccttgaa tttcccaaac agatagattc ttctaagtag aaacaatctt 5641 acaggcatac ggcttaggct gagaatgccc tgtttgtaca aagtaggatg gatgcttctt 5701 ctctgtatac cagaatatag aaggtataaa gcaaagcctt ggctggattt cagctcagct 5761 ccctcagcag gaaacaacct gttcagctgt atatggtaga ttttgttgcc cgaacatctg 5821 tcatctgatg aaataaagca tttggagaat gtggcagggg aggcttcagg gtaacaagat 5881 accagcagac cttttggatc tctgtgactc ccatgccacg agtatagatc aatgctcagc 5941 attggtaggg gagagatgat gaccatctga cacagtgata acctttcccc tttgaccttt 6001 cccttcccca cccagccttg ctgaagatac gtaccagcac aggccaatgt gcacagccat 6061 ccaggtccat acagaccatc tgcctgcccc caaggtttac tgatgctccg tttggttcag 6121 actgtgagat cactggcttt ggaaaagagt ctgaaagtag tgacagatga agctcactga 6181 gagagtctgg gggagtgtta tggtccagag caaagagcag actatcaaag gaagactgtg 6241 gaaacaggac tggaaacatt atggagggcc agggatagag tagggggaga tgggcaagca 6301 agtcaaacag ggtgtgaaca attgtgagtg aagtaaaaga ctcagattgg agaaacaaga 6361 acaagagctt ttcatagctg ggatatgttt tttatcttca cccctgcaga gagtctcatt 6421 tatagacaca tcttaatgca aacatctgtt tgttccatct aggtgactat ctctatccaa 6481 agaacctgaa aatgtctgtt gtaaagcttg tttctcatga acagtgtatg cagccccact 6541 actatggctc tgaaattaat tataaaatgc tgtgtgctgc ggacccagag tggaaaacag 6601 attcctgcaa ggtaagactc tcaagcaccc ctctttatca ccccaactcc ccagagctct 6661 tggatttgat ctaacaaccc tggggagtct ctttccagcc aacaatctaa gaatcaagga 6721 cttaggtctt tgggagcttg tcccaatact tataggttca aacgttgggc atgagtccct 6781 gtgctatatg cgttttagac taaaagggac caagactgct aaaaaaaaaa taacccagac 6841 atggtagagc atacctataa cctagcactc ttagcatttt gaatgctgag acaggaggat 6901 catgagttta aggccagccc aaactacaga gtgagaattc aaggcctgtg ccacatagca 6961 agatcctttc tcaaataaaa caaggaaagc acaaaccata aaaaccaaga caatagcaac 7021 aaagggatgg gtgcctagct cagtggttta gtgcttgctt actatgctca aagtcctgag 7081 ttcaagtctc aactcaggga ctggagatca taagaaaaac taagagtctg gggacgtggc 7141 ttagttggta gagtacttac ccagcatgaa agaagccctg gatccagtcc tcggcactgt 7201 atgtaatgac ccaggcctgc agtatcagcc cttgaggggg tcagaatcag tttataagtt 7261 cttcagttac agagtgagtt caaagccagc ctgaaagaca tgggcttatg agactctgtc 7321 ccaatctgaa agaacacaac caaccaacca accaccacca ccaacagcaa aatatagata 7381 ctattcaaat cacttctggg cctttggcaa gacaagtgaa atcaacataa ttctattgtt 7441 caggatcgca gtgaattacc aaagatcagg taggaaagga aggagaagtc ttaaagagac 7501 tatgaactgg taaataaaga gacggaagga aaaaggaagc atgtgtcagt tggaaaaaac 7561 aaaactaaga ctgagcatgc tgtgtgccaa ggcgaggaat agcagggtct gggaaagcac 7621 tggagagtgg gagaggaaag ctaagacttt ttactcttga ttcggtagaa aatggggagt 7681 tgcgaatgtc tctgactctg ggaacctctc cccgttctct cccgtggctg ggtagtggcc 7741 cttccctcag ttcttccagg gcttcacctc tttatctttg gcttcccagg gcgattctgg 7801 aggaccgctt atctgtaaca tcgaaggccg cccaactctg agtgggattg tgagctgggg 7861 ccgaggatgt gcagagaaaa acaagcccgg tgtctacacg agggtctcac acttcctgga 7921 ctggattcaa tcccacattg gagaagagaa aggtctggcc ttctgatggc cctcaggtag 7981 ctgagggaag aaacagatgg gtcacttgtt cccatgctga ccgtcctctc tgcaacagag 8041 tcgtcaaatg gagggaagaa gctgaaaaga caggttttgc attgatcctc tgctgtgctg 8101 cccaccaggg tgagcgccaa tagcattacc ctcagacaca ggcctgggtg ctggccatcc 8161 agaccctccc gaccaggatg gaaagttggt cctgactcag gatgctatag accaggagtt 8221 gcctttttat ggactaaagc catctgcagt ttagaaaaca tctcctgggc aagtgtagga 8281 ggagagctgt ttcccttaat gggtcattca tgagatctgc tgttgggaaa taaatgattt 8341 cccaattagg aagtgcaaca gctgaggtat tgtgagggtg cttgtccaat atgagaacgg 8401 tagcttgagg agtagagaca ctaacggctt gagggaacag ctctagcatc ccatgaatgg 8461 atcaggaaat gttatatttg tgtgtatgtt tgttcactct gcacaggctg tgagtataag 8521 cctgagcaaa agctggtgta tttctgtatc taactgcaag tctaggtatt tccctaactc 8581 cagactgtga tgcggggcca tttggtcttc catgtgatgc tccacgtgaa tgtatcattc 8641 ccgggcgtga cccgtgacta gcactaaatg tcggtttcac tttttatata gatgtccact 8701 tcttggccag ttatcttttt tttttttttt tttttttttt actaattagc ctagttcatc 8761 caatcctcac tgggtggggt aaggaccact tctacatact taatatttaa taattatgtt 8821 ctgctatttt tatttatatc tatttttata attctgagta aaggtgatca ataaatgtga 8881 tttttctgaa gattctggtt tctccatgat tcttgtgtga cagggaagag ggggacatta 8941 aaaggaagaa aataatgagg gctacgtgca tcttagtttc atttggggtt tgcttggact 9001 ttttttggat gagaatgcat ggatgaggct gctgatccaa gccaggcacg gtcctagtcc 9061 acctgaaggc taaatgaaga ttggtgcaaa ttcaaggtca gcctgacgat gtggttattt 9121 caaggccagc taggctacat agcaagacat tgtctttaaa aaaaaatgcg caagaaagaa 9181 aagaaaaaaa tctgattcaa acaaagcagc tgagtcggtg ctgtcgacgg ggtcaggtaa 9241 tgaagatact tgtgtttgca gctcttggtc ccccgctgaa actacttgta acgcttctgg 9301 cctctgtagg caccaacacc catgcacaca cacagatgat tacaaataag tcttacagaa 9361 gaaaacatga aaaaaatcag tgtctcacac ctgtcatccc agcaagtgag aggctgaggc 9421 aagaagactc ctgtgagttt gaagccaatt ttgctacaaa gctttagtct taaaacagac 9481 aaaataaaac aaaaagtggg gtggtagtgg tatgccttta atctcagcag aggcagaggt 9541 tcgaggcctg acttgtctac agagtgagtt caggacagcc aggagctaca catagaaacc 9601 ttgtctcaaa ataacaataa aataataata acaacaaaac caataaaact aaaccattgt 9661 gaatctggga ttccagaaag caaacatact tttccatcat ctgtgtgtag gctgatgcta 9721 aatttccgct gtgctaatgg agcttatctg cacttaatgt ggccttggga aggtacagaa 9781 ggagagttcc agggttggcc ttcatagcac ctaagttaca aaacaggcca caggctgcgg 9841 cttggtaagc ggtgttcggg ttgagctgca gctcacaggt gcttcctcag cctggtgcta 9901 ttgggcagag tacctcgttt attattaatt aattaattaa ttaattaatt // LOCUS MUSACASA 4007 bp DNA ROD 10-OCT-1991 DEFINITION Mouse skeletal alpha-actin gene, complete cds. ACCESSION M12347 NID g191572 KEYWORDS actin; alpha-actin. SOURCE Mouse (BALB/c) DNA, library of M.Steinmetz. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 4007) AUTHORS Hu,M.C.-T., Sharp,S.B. and Davidson,N. TITLE The complete sequence of the mouse skeletal alpha-actin gene reveals several conserved and inverted repeat sequences outside of the protein-coding region JOURNAL Mol. Cell. Biol. 6, 15-25 (1986) MEDLINE 87064281 COMMENT Draft entry and sequence in computer-readable form for [1] kindly provided by M.C.-T.Hu, 24-JUN-1986. FEATURES Location/Qualifiers source 1..4007 /organism="Mus musculus" /strain="BALB/c" /sub_species="domesticus" /db_xref="taxon:10090" /tissue_lib="of M.Steinmetz" mRNA join(754..811,1773..1913,2012..2336,2477..2638,2732..2923, 3056..3237,3312..3697) /gene="alpha-actin" /product="alpha-actin" gene join(754..811,1773..1913,2012..2336,2477..2638,2732..2923, 3056..3237,3312..3697) /gene="alpha-actin" exon 754..811 /partial /gene="alpha-actin" /number=1 intron 812..1772 /gene="alpha-actin" /number=1 exon 1773..1913 /partial /gene="alpha-actin" /number=2 CDS join(1785..1913,2012..2336,2477..2638,2732..2923, 3056..3237,3312..3455) /partial /gene="alpha-actin" /codon_start=1 /product="alpha-actin" /db_xref="PID:g387081" /translation="MCDEDETTALVCDNGSGLVKAGFAGDDAPRAVFPSIVGRPRHQG VMVGMGQKDSYVGDEAQSKRGILTLKYPIEHGIITNWDDMEKIWHHTFYNELRVAPEE HPTLLTEAPLNPKANREKMTQIMFETFNVPAMYVAIQAVLSLYASGRTTGIVLDSGDG VTHNVPIYEGYALPHAIMRLDLAGRDLTDYLMKILTERGYSFVTTAEREIVRDIKEKL CYVALDFENEMATAASSSSLEKSYELPDGQVITIGNERFRCPETLFQPSFIGMESAGI HETTYNSIMKCDIDIRKDLYANNVMSGGTTMYPGIADRMQKEITALAPSTMKIKIIAP PERKYSVWIGGSILASLSTFQQMWITKQEYDEAGPSIVHRKCF" intron 1914..2011 /gene="alpha-actin" /number=2 exon 2012..2336 /gene="alpha-actin" /number=3 intron 2337..2476 /gene="alpha-actin" /number=3 exon 2477..2638 /gene="alpha-actin" /number=4 intron 2639..2731 /gene="alpha-actin" /number=4 exon 2732..2923 /gene="alpha-actin" /number=5 intron 2924..3055 /gene="alpha-actin" /number=5 exon 3056..3237 /gene="alpha-actin" /number=6 intron 3238..3311 /gene="alpha-actin" /number=6 exon 3312..3697 /partial /gene="alpha-actin" /number=7 BASE COUNT 849 a 1151 c 1094 g 913 t ORIGIN 5 bp upstream of PstI site. 1 ctgcagagag agccaaggct tcttaatgcc ctggctccca ttctgtgtgc cagggcacat 61 agtccccatg cctcctctca ctacctcctc cccctcttcc accagttgtc tgcccagtga 121 cagctgcata tcctacactg actgacagcc ttgtgggggt gaatgggggt gatgtgtcaa 181 atctctggat tgggggagct tcaaagtggg aaagaaaatg gagttcaaat gtggggctta 241 ttttccatcc ctacctggag cccatgactc tcccggctca cctgaccaca ggctacctcc 301 cctgacttaa gcatcaaggc ttagtagtct gagttaagaa ccataaatgg ggtgcattgt 361 ggcaggtcag caatcgtgtg tccaggtggg cagatctggg gagacctttc aaacaggtaa 421 atcttgggaa gtacagacca gcggtcaaag cagtgacctt tggcccagca cagcccttcc 481 gtgagccttg gagccagttg ggaggggcag acagctgggg atactctcca tatacggcct 541 ggtccggtcc tagctacctg ggccagggca gtcctctcct tctttggtca gtgcaggaga 601 cccgggcggg acccaggctg agaaccagcc gaaggaaggg actctagtgc ccgacaccca 661 aatatggctt gggaagggca gcaacattct tcggggcggt gtggggagag ctcccgggac 721 tatataaaaa cctgtgcaag gggacaggcg gtcacacgga cgtaagcctc acttcctacc 781 ctcggcaccc agggcagagt cagagcagca ggtagggtgg aggtggggag ggtgacctgg 841 agacccagca aagaaagcta ttgagccttg gttgtattta gcactgagtt ctggaaattt 901 ctccaaactc acatccagcc cattttgtga ctgggcattt aggatatgcc tgggggtctg 961 acatctatcc attaccaccc gcagactcct ccctcccctc ttactgggga cctaaatcca 1021 agtcctgcaa gtgaacaagc cggtctccta ctgagccaca cgcagcctct ggttggttga 1081 gatttctttc cggcttttca ttcccctttc ccctctttct ctgctgggat caaatctggg 1141 ctcttgtgat gcaagaggtt ggctggatct cccactgagc tacacccagc tcctgggaga 1201 ctgatttgaa gatgagcttg gagagagtca agcctggtct tacgcttggg gcgactccct 1261 ttgagatcaa ggcttttcta aagggcagca accctggcta ctcttctctg aaatgggaga 1321 caggcggaag ggtttcttct gagtttggga tgctgtgagt gttcaatagc ttttccatag 1381 ttgtgacaat atcgccttcc ccgggcagaa tcctcgaccc tctcacagga caggcaggtg 1441 gtgggcagcg tgccttaata cctccttcac agatgagatt gtgagcagag aggataggac 1501 tagtgctagc tgtggtgtgt gtggtggcct acaatgaatg tagagggtct caggatatcc 1561 cctgcatgcc tagttggagg attagtgtga tccacagaca gctcccccat agcgacacag 1621 ccttaagctg aagagcatcc tctgaccccc gtgcaaggaa ggggaagggg gaaaggagtt 1681 tttggatctt caaagtgaag atgggttaag cggacctcaa cctaaccccc ccccatcaca 1741 tatacacagc actcaacacc ttgtctttgc agaaactaga caccatgtgc gacgaagacg 1801 agaccaccgc tcttgtgtgt gacaacggct ctggcctggt gaaagctggc tttgccgggg 1861 atgatgcccc cagggctgtg ttcccatcca tcgtgggccg accccgtcac caggtcaggc 1921 tgctggcagg gaaagatagg ctctctgaat ccagccaatg ttctcctcac ccctggccgt 1981 agtaacaagt gtctgatgtc tctatctgca gggtgtcatg gtaggtatgg gtcagaagga 2041 ctcctacgtg ggtgatgagg cccagagcaa gcgaggtatc ctgaccctga agtaccccat 2101 tgaacatggc atcatcacca actgggacga catggagaag atctggcacc acaccttcta 2161 caatgagctg cgtgtggccc ctgaggagca cccgactctg ctcaccgagg cccccctgaa 2221 ccccaaagct aaccgggaga agatgactca aatcatgttt gagaccttca acgtgcctgc 2281 catgtatgtg gctatccagg cggtgctgtc cctctatgcc tccggccgta ccaccggtaa 2341 gcgctcacac atggcccacg ctggccctgg taggattgct ccaacattcc agccccgctt 2401 tttaatcctc tgcagtatcc cgacctgtgc cgcctgctgg tcccctccac tcacatactg 2461 cctctcccct gcacaggcat cgtgttggat tctggggacg gtgtcaccca caacgtgccc 2521 atctatgagg gctatgccct gccacacgcc atcatgcgtc tggacctggc cggtcgcgac 2581 ctcactgact acctgatgaa aatcctcact gagcgtggct attccttcgt gaccacaggt 2641 cggtgctccc aacctgctga gggtgggcgg gcagagggtg agcacacgcc cagccttcgc 2701 ctgaggctcc tcactgcttt tgctcttgca gctgaacgtg agattgtgcg cgacatcaaa 2761 gagaagctgt gctatgtggc cctggacttc gagaatgaga tggccaccgc tgcctcttcc 2821 tcctccctgg agaagagcta tgagctgcct gacgggcagg tcatcaccat cggcaatgag 2881 cgtttccgtt gcccggagac gctcttccag ccttccttta tcggtgagcc gccggatccg 2941 ctggtgtgcg gggatcagtt ttccctcgcc ccacaccaca gagtacgggg tctccaccgc 3001 cggtccctta gcccgactct gcggttgctc acactgcctc tctcccggac accaggtatg 3061 gagtctgcgg ggatccatga gaccacctac aacagcatca tgaagtgcga catcgacatc 3121 aggaaggacc tgtacgccaa caacgtcatg tcagggggca ccaccatgta ccctggtatc 3181 gctgaccgca tgcagaagga gatcacagct ctggctccca gcaccatgaa gatcaaggtg 3241 gatgacgtgc ctggtgtggg tggagaccag gggcggggga acacgaggca cgtgacactc 3301 ttgtcttgca gatcatcgcc ccccctgagc gcaagtactc agtgtggatc ggtggctcca 3361 tcctggcctc gctgtccacc ttccagcaga tgtggatcac caagcaggag tacgacgagg 3421 ctggcccctc cattgtgcac cgcaaatgct tctaggcgca ccgcatctgc gttcgcgctc 3481 tctctcctca ggacgacaat cgacaatcgt gctgtggttg cagggtggcc ccgtcctccg 3541 ccgtggctcc atcgccgcca ctgcagccgg cgcctgtttt tgacgtgtac atagattgac 3601 tcgttttacc tcattttgtt atttttcaaa caaagccctg tggaaaggaa atggaaaact 3661 tgaagcatta aagccagcca ttctgttttg ctccaataaa ctgtgtgtgg tctttattta 3721 ctgggagtag gcagtgggca ggcgaaggac ccgtctccac ctctcactgt tcacagactg 3781 ggtgaactcc ttaggaaagg aagttagaag ttaggtctct cacccaaact cctggtttcc 3841 ctctgaatca tagccagtta gttcctatag ggatctaagt ggcttgcctg gtccctggct 3901 gcttggcaat gtatcttccc ctttaaggca ggacacctgc tgcgaactgg cggcgctggc 3961 cgtggtgctg cccgaagcgc ctcttgccag ggctggaaat tggatcc // LOCUS MUSHSP25A 3058 bp DNA ROD 04-AUG-1993 DEFINITION Mus musculus small heat shock protein (HSP25) gene,. ACCESSION L07577 NID g293375 KEYWORDS HSP25 gene; small heat shock protein. SOURCE Mus musculus liver DNA. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 3058) AUTHORS Gaestel,M., Gotthardt,R. and Mueller,T. TITLE Structure and organization of a murine small heat-shock protein Hsp25 JOURNAL Gene 128, 279-283 (1993) MEDLINE 93292999 FEATURES Location/Qualifiers source 1..3058 /organism="Mus musculus" /db_xref="taxon:10090" /tissue_type="liver" misc_signal 497..510 /gene="HSP25" /standard_name="HSE" /note="putative" /function="Heat shock element" misc_signal 517..530 /gene="HSP25" /standard_name="HSE" /note="putative" /function="Heat shock element" GC_signal 583..588 /gene="HSP25" /note="putative" GC_signal 593..598 /gene="HSP25" misc_signal 609..613 /gene="HSP25" /standard_name="ERE" /note="putative" /function="Estrogen responsive element half site" CAAT_signal 618..625 /gene="HSP25" TATA_signal 669..675 /gene="HSP25" misc_signal 690..695 /gene="HSP25" /note="putative" /function="CAP site" prim_transcript 697..2234 /gene="HSP25" exon 697..1130 /gene="HSP25" /number=1 gene join(756..1130,1727..1792,1920..2108) /gene="HSP25" CDS join(756..1130,1727..1792,1920..2108) /gene="HSP25" /codon_start=1 /product="small heat shock protein" /db_xref="PID:g293376" /translation="MTERRVPFSLLRSPSWEPFRDWYPAHSRLFDQAFGVPRLPDEWS QWFSAAGWPGYVRPLPAATAEGPAAVTLAAPAFSRALNRQLSSGVSEIRQTADRWRVS LDVNHFAPEELTVKTKEGVVEITGKHEERQDEHGYISRCFTRKYTLPPGVDPTLVSSS LSPEGTLTVEAPLPKAVTQSAEITIPVTFEARAQIGGPEAGKSEQSGAK" intron 1131..1726 /gene="HSP25" /number=1 exon 1727..1792 /gene="HSP25" /number=2 intron 1793..1919 /gene="HSP25" /number=2 exon 1920..2236 /gene="HSP25" /number=3 polyA_signal 2212..2217 /gene="HSP25" BASE COUNT 795 a 806 c 823 g 634 t ORIGIN 1 ttgtcttcag aaaaaaaaaa aaaaaaaaaa aagcaaagca aaagaaaaca aaaaaaaagc 61 attcaaaata aaaaaagaaa aatcctactt cagagagaga gagagagaga gagagagaga 121 gaggagagag gagagagaaa gagagaaaga agaaagaaag tcagtttctg ggtcaaaaag 181 atactcagtg aacactaagc gcctgccacg aagcctttgc tccccaggag atacactata 241 gaagacagct atcttgccgg tgtcatgaga cctccacagg tgtgccatga catgtgtgtt 301 cccacgggcc tccttctggt tcagaccagg ttcagttgat tcccagttgc gacacaaaag 361 acctaacctc tcctgttatt ctcaataaaa aatggggctg ccccaggcca ccgcccttca 421 gccagcagtg tcctaaaccc cacagtggga atcgctccag ctaccggtat tacgccgtca 481 tttgttttct tcaacaagag aagtttccag atgggggcag aaccttcctg ccccgcctgc 541 ccgccccctt tgcaagctta gggggaggaa tgcagagggg aggggcggcg aggggcggcc 601 cctgagacgg tcattgccat taatagagac ctgaagcacc gcctgctaaa aatacccggc 661 tgggcacaca taaaagcacg ctggggctcc agtccggcac ttctcggatc ctcagcccag 721 tgcttctaga tcctcagcct tgaccagcca agaacatgac cgagcgccgc gtgcccttct 781 cgctgctgcg gagcccgagc tgggaaccat tccgggactg gtaccctgca cacagccgcc 841 tcttcgatca agctttcggg gtgccccggt tgcccgatga gtggtcgcag tggttcagcg 901 ccgctgggtg gcccggatac gtgcgcccgc tgcccgccgc gaccgccgag ggccccgcgg 961 cggtgaccct ggccgcacca gccttcagcc gagcgctcaa ccgacagctc agcagcgggg 1021 tctcggagat ccgacagacg gctgatcgct ggcgcgtgtc cctggacgtc aaccacttcg 1081 ctccggagga gctcacagtg aagaccaagg aaggcgtggt ggagatcact ggtgagtttc 1141 ccttgtgccc agagggacga agctgccgag gcagagtggt ggctgggggg ctgagggtgg 1201 gggctgaaac cctgaggaat agaaccctga gaagttagaa ataacctagg gacccggagc 1261 ccgcatcatt ctcctttgcc tgcttctctt gccccggaac gcgggtgcag gttgctctta 1321 aaagctgtct catcttcgat gatatggacc aacagctggg gatgtagctc agggtagagc 1381 ttcgcctggc ctgccgggac gttgaggctt tgcgttccac ccgcagcaca gaaacaaaat 1441 gaaggaagac caacacacct ctggaagacc tcattccaat agcagcgcag gatcaggtcc 1501 tagacagccg cagagcgttt gagtatacct accctggtat ccaagtctgg gtcagagcca 1561 agcccttcgc atcccaaatc tcagaaggga agtttctgga aagttttaag attccagaca 1621 gtcaggcacc ccgccgcact cgggtttgtt tgcctccctg ggggtcccag ccactcctta 1681 gctaggacag cagagggctg cttctgacct tctgtcccca cccacaggca agcacgaaga 1741 aaggcaggac gaacatggct acatctctcg gtgcttcacc cggaaataca cgtgagttct 1801 gattccttca tggagggcgg ggaggtgggg ggggggggga gcggacccag ggcggagggc 1861 gaagagcccc gggtcaggag ggatgtgtaa cccttgccct gattttctgt gtgtccaggc 1921 tccctccagg tgtggacccc accctagtgt cctcttccct atcccctgag ggcacactta 1981 ccgtggaggc tccgttgccc aaagcagtca cgcagtcagc ggagatcacc attccggtta 2041 ctttcgaggc ccgcgcccaa attgggggcc cagaagctgg gaagtctgaa cagtctggag 2101 ccaagtagaa gccatcagcc tgctgcctat ctcccatagc cattgctggc cacccctctc 2161 tgtcaatctg tgcgctcttt tgatacatac atttacctgc tgtttttctc aaataaaagt 2221 tgcaagctac tgctcaccac cgtctgactc cagagttatt atggtgggct agggatgggt 2281 gtgctaaatt ggaacaccct ttgggtcttt gctaagtgct cactatgggc tcaggcttcc 2341 tgtgggaaaa gagataacct ggagattaaa agtttaccag gggggctgga aaggtggctc 2401 agcgggtaag agcactgact gctcttccga agtcctgagt tcaaatccca gcaaccacat 2461 ggtgctcaca accacccgtg atgaggtctg acaccctctt ctggtgcatc tgaaggcagc 2521 tacggtatac gtacatatat aataaataaa taaatcttaa aaaaaaaaaa agtttaccag 2581 gctgggtaga atgatggtat aattacatct cctgtaattc cagtatccaa aaacggatgc 2641 aaaagcagtg tgaatttgaa gccagcctgg gctacagggt gagttctatg agtgcctggg 2701 ccacacatgt acttgctgtc tcaaaaacaa acagaagagc aaaacaaaac agaagctcag 2761 tagagggttt gcatggcatc cgtgaagtaa tgtttgatcc cccgcattgg ataaactggg 2821 tatggtggcg ccctccagaa gtggagaaag gatcagaaag gagttcaagg ttatcctggg 2881 ctacatggct agtttgaggc cagcctggga cacatgaaaa catctttttt tttttttttt 2941 cacatgaaaa tatcttaaga gtaaaaaaaa aaaaaaaaaa aggatcagag ccagtcagat 3001 ccaaagcaac cccctctctc ccagcctctg gtcctgttct ctatgaacag agaaattt // LOCUS MUSNFORREC 1524 bp DNA ROD 25-JAN-1994 DEFINITION Mouse N-formyl peptide chemotactic receptor gene, complete cds. ACCESSION L22181 NID g347396 KEYWORDS N-formyl peptide chemotactic receptor. SOURCE Mus musculus DNA. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 1524) AUTHORS Gao,J.L. and Murphy,P.M. TITLE Species and subtype variants of the N-formyl peptide chemotactic receptor reveal multiple important functional domains JOURNAL J. Biol. Chem. 268 (34), 25395-25401 (1993) MEDLINE 94064602 FEATURES Location/Qualifiers source 1..1524 /organism="Mus musculus" /db_xref="taxon:10090" /germline CDS 331..1425 /codon_start=1 /product="N-formyl peptide chemotactic receptor" /db_xref="PID:g347397" /translation="MDTNMSLLMNKSAVNLMNVSGSTQSVSAGYIVLDVFSYLIFAVT FVLGVLGNGLVIWVAGFRMKHTVTTISYLNLAIADFCFTSTLPFYIASMVMGGHWPFG WFMCKFIYTVIDINLFGSVFLIALIALDRCICVLHPVWAQNHRTVSLAKKVIIVPWIC AFLLTLPVIIRLTTVPNSRLGPGKTACTFDFSPWTKDPVEKRKVAVTMLTVRGIIRFI IGFSTPMSIVAICYGLITTKIHRQGLIKSSRPLRVLSFVVAAFFLCWCPFQVVALIST IQVRERLKNMTPGIVTALKITSPLAFFNSCLNPMLYVFMGQDFRERLIHSLPASLERA LTEDSAQTSDTGTNLGTNSTSLSENTLNAM" BASE COUNT 405 a 376 c 306 g 437 t ORIGIN 1 ctcatctctg actcacatac atgtgcacat atgtacataa ccacaccaat acatacatac 61 actcatacat gtatagaaaa cacatgcaca tggaaaatcc caaatgaagc caagcattgt 121 gatatgccac tttaatccca acacaagaga gggaagaggc aggcagattt ctatgagttc 181 catatagaaa acgattctat gagttttgtc tatagaaaaa gttctaggct agcaaggatg 241 tgtactgaga ccttgtctca cagagagaaa aaaaaaggta agacaaataa ataaacacac 301 ctctttttta atgttctagg agtctacaag atggacacca acatgtctct cctcatgaac 361 aagtctgcag tgaacctcat gaatgtatct gggagtactc aatcagtatc tgctggctac 421 atcgttctgg atgtcttctc atatttgatc tttgccgtca catttgtcct tggggttctg 481 ggcaacgggc tcgtgatctg ggtggctggt ttccgcatga aacacactgt caccaccatc 541 tcttacttga acttggccat tgctgacttt tgcttcactt ccactttgcc attttacatt 601 gccagcatgg tcatgggagg acattggcca tttggttggt tcatgtgcaa attcatatat 661 actgtaatag acataaacct atttggaagt gtcttcctga ttgccctcat tgcactggac 721 cgctgtattt gtgttctgca tccagtctgg gctcagaacc accgcactgt gagcctagcc 781 aagaaggtaa tcatcgtacc ctggatttgt gcatttcttc ttacattgcc agttatcatt 841 cgtttgacca cagtccctaa tagtagactt ggaccaggga aaacagcctg tactttcgac 901 ttctccccct ggaccaaaga tcctgtagag aagaggaagg tggccgtcac catgctcact 961 gtcagaggaa tcatcaggtt catcattggg ttcagcactc ccatgtccat tgttgccatt 1021 tgctatgggt taataaccac taaaattcac aggcagggcc tgatcaaatc cagccgtcct 1081 ttgcgggttc tctcctttgt tgtggctgcc tttttcctct gctggtgccc atttcaagta 1141 gtggccctca tatccacaat ccaagtccgt gaacggttga agaacatgac tccaggcatt 1201 gtaactgctt tgaaaatcac aagccccttg gctttcttca acagctgcct caatccaatg 1261 ctttatgtct ttatgggcca ggacttcaga gaaagactaa tccactcttt acctgccagc 1321 ctagagaggg ccctgactga ggactcagct cagaccagtg atacaggcac caatttgggg 1381 accaactcta cttccctttc tgaaaacact ttaaatgcaa tgtaaagaac gggctctaac 1441 ttccagcttc atctgctttg agttccactg tgctataggc attccctgtt gaccttcagg 1501 ctacatgctc attaggaaaa cttg // LOCUS MMU73107 29807 bp DNA ROD 30-MAR-1998 DEFINITION Mus musculus adenosine deaminase (ADA) gene, complete cds. ACCESSION U73107 NID g2996609 KEYWORDS . SOURCE house mouse. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 29807) AUTHORS Xu,P., Winston,J.W., Lu,J., Muzny,D.M., Gibbs,R.A. and Kellems,R.E. TITLE The Comparative Sequence Analysis of Murine and Human ADA Genes JOURNAL Unpublished REFERENCE 2 (bases 1 to 29807) AUTHORS Lu,J. and Gibbs,R.A. TITLE Direct Submission JOURNAL Submitted (02-OCT-1996) Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA COMMENT On Mar 30, 1998 this sequence version replaced gi:1657628. FEATURES Location/Qualifiers source 1..29807 /organism="Mus musculus" /db_xref="taxon:10090" /chromosome="2" /map="2H3" /sub_clone="pKS-EaH1.1, pKS-EE8.0, pKS-SH7.6 and pKS-EE10.4" /cell_line="B1-50" repeat_region complement(342..497) /rpt_family="B1_01144" gene 4139..27658 /gene="ADA" promoter 4139..4289 /gene="ADA" exon 4290..4414 /gene="ADA" /number=1 CDS join(4382..4414,16248..16309,18999..19121,21481..21624, 22106..22221,23246..23373,23926..23997,24079..24180, 24402..24466,26231..26360,26832..26915) /gene="ADA" /codon_start=1 /product="adenosine deaminase" /db_xref="PID:g1657629" /translation="MAQTPAFNKPKVELHVHLDGAIKPETILYFGKKRGIALPADTVE ELRNIIGMDKPLSLPGFLAKFDYYMPVIAGCREAIKRIAYEFVEMKAKEGVVYVEVRY SPHLLANSKVDPMPWNQTEGDVTPDDVVDLVNQGLQEGEQAFGIKVRSILCCMRHQPS WSLEVLELCKKYNQKTVVAMDLAGDETIEGSSLFPGHVEAYEGAVKNGIHRTVHAGEV GSPEVVREAVDILKTERVGHGYHTIEDEALYNRLLKENMHFEVCPWSSYLTGAWDPKT THAVVRFKNDKANYSLNTDDPLIFKSTLDTDYQMTKKDMGFTEEEFKRLNINAAKSSF LPEEEKKELLERLYREYQ" repeat_region complement(6909..6954) /rpt_family="B2_00120" repeat_region 10467..10614 /rpt_family="B1_00266" repeat_region 14425..14584 /rpt_family="B1_00918" exon 16248..16309 /gene="ADA" /number=2 repeat_region complement(18754..18952) /rpt_family="B2_00227" exon 18999..19121 /gene="ADA" /number=3 repeat_region 21113..21187 /rpt_family="B1_01105" exon 21481..21624 /gene="ADA" /number=4 exon 22106..22221 /gene="ADA" /number=5 exon 23246..23373 /gene="ADA" /number=6 exon 23926..23997 /gene="ADA" /number=7 exon 24079..24180 /gene="ADA" /number=8 exon 24402..24466 /gene="ADA" /number=9 repeat_region 25339..25452 /rpt_family="B1_00848" repeat_region 25635..25699 /rpt_family="B2_00191" exon 26231..26360 /gene="ADA" /number=10 exon 26832..26931 /gene="ADA" /number=11 exon 27427..27658 /gene="ADA" /number=12 repeat_region complement(27851..28008) /rpt_family="B2_00096" misc_feature 28320..28777 /note="similar to EST sequences with GenBank Accession Numbers W71509 and AA031028" misc_feature 28778..29022 /note="similar to EST sequence with GenBank Accession Number W65153" misc_feature 29594..29807 /note="similar to EST sequence with GenBank Accession Number W65788" BASE COUNT 6916 a 7563 c 7708 g 7618 t 2 others ORIGIN 1 gccgacttta gatgttccta aactacattt cccagcccat tccacccctc tctgtctctg 61 tgacccctga tccagctcta ccctactaca atgaccccta ctgacttaaa tgtgcttctt 121 ctgagcctgg cttccgcaaa cttcctcttc tgactggctg tggtcaacct gacatccagg 181 gcgagtgctg taaattcatc aggactggag gggaccaccc acccagaata ctgaagcagc 241 aagaccctca gggatcaggg accagggatc agcattgggc ctggtatttg ttgcatacat 301 cctctttgct acagtagtca ccagtgaaag attgtagang tttcctttct ttttcttttt 361 attttttagg tttttagttt ttgagacagg ggctctctct ctacatagct ctgtccttga 421 aactcacata gactgactgg cctttgaaat cacagaaatc tgcctgcttc cacctcccaa 481 gtgcttgcat taaaggctgg cctagcagat tgcagatttt ttaaagtgaa gcacacctgc 541 ccaaccccca cccccacaca ttgttacttt gttacactgg ctggctattg ggttaccaat 601 gttattagca aaacctaact tgtcttagct ggaaatcact cttaagcaat ttccatagtg 661 caattttcta tctggaactg gacttacata tatttgaggc agagtctcac tctgtagccc 721 ttaattttgt ggcaatcttg cttccatctc ctaagcacta gaaatacagg catgaactac 781 tttgtcctgt ttgggattag ccctgagcca ctccaccctg gttggggtta gctcttgaaa 841 cttggttttg cacagaggct aggagagcca gagttccagt gtgcttgaga atctatctca 901 gtaagaaact gtgctatgag caaaatgtgt ttgggggaag gcatctctcc ccatccctcc 961 tcccactttg ggcctggtgt gtgttgtgtg gagaagatca aagccagctt gtagcaactg 1021 gaaggaccaa gtgacacact cagagtatgg tggaaagtct gagagacttg tctacaaagg 1081 tgtgcactat gcctatgggc tgcccaagta accctggcat ccctctacac ggtgaaagct 1141 tagttatgtt gagctaacta cgggtactga aaagagagtg aaatcagact taaacaatac 1201 cagaacaaca gtggggaagg ggatggggaa atacagtgct tccctgccac tcaagaagtc 1261 tggggtttgt tctttgcaaa aatatataca caaacacacc ctttgatgga ggagagtgtg 1321 tgtgtaagcc cctcgctccc caggccctca gttgggttaa ctgcataccc caacagctcc 1381 cagaggctga gtgcagagct cagattcatc acctatctgt tgctaggctt tctcattgtc 1441 ctgccttttt cttcattctc cttgtggtgt ttccttcaat ctcttcaaat aagcctcaaa 1501 ctttgtctca ggatctgctg ataggaaaac cctaatggag gcaacggggg cctggggaaa 1561 acagccacac atgaggcagt tcttggggtc actgccttgg agacggggac aaaaggggag 1621 gtgttggggg aatagaaggg aggggtagcc tgtctttatg gccttgccag aaagacagtt 1681 ctgcttgatg actggggagt gcagggaata aacagcctca cttctccatc ttgtcacctt 1741 ccccccgccc cctgcctgtg ccacaggcaa aagagtccat ggtgttgtgg acactgtaag 1801 gttcctttga cagaaggaca gattgtgtac ccagtggtga ggcctaccaa aggcttgtag 1861 cccactccac tcagctattt tccagaagtt tggtcctgcc tagagagcgg aatttcaaac 1921 actccagaat ctagacactt tccacatctg gtcaggatgg agtcacacat tctgcagggc 1981 tgtggcaggt ggaagtccct gccaccagat acggtagctg atgagaaaca cctgccacag 2041 ctttgtaagt aaggtatgaa ggctgggctt ccctggacac tgaagtcacc tgggttgctg 2101 aagggctgac tttacagtgt ctgacatagt gggattttag gggtggctga aagctgggct 2161 tggctcagca accaactgga tttcccacca gtgacggtgt cttctagtgg ctcaagttcc 2221 cagagcaaaa acaagaggtc ccagtggaag ttgaaagagc cccaaatccc tccagagaag 2281 gaagttcagt agcctcctct gccacattct aagagcaagt ccttcaggac ttcccagggc 2341 aagactccat caggcaaaga caggatcagc aaaggcttgt cacttttggc ggaccaggaa 2401 gtcatttgca aagaatgagt tggatggact ttggagaaag atttgacaca gcccaggccc 2461 agctatagag tcagaatcaa attcaagtgc caccctccct tgcattgcac taatcttcca 2521 ggagcccagt ggcctgtgcc ctcttggggc ctcggccact tcatttgtag actgataaca 2581 gtttctgctt cacagggtgt agacagattg aatggaatga tgttttcaaa gtgcttccag 2641 gacaagggct agggagggtg tgtccacatg tgatggtgta tggtcatgta tctgaagtaa 2701 caaagaaacc tcccacaaag ctacaatgtg ttggaaatgt ttctccttct ttgcaacaga 2761 atctcaaaaa tctcaataat agctcagcat cttcctgtct cagcttggtg aatactaaga 2821 ttacagacac aagtattccc atccctgtct tagaagtacc attttgatcc accctctctc 2881 tcactcccac aaacaactga atcccaaatt gtattctgct gcttacaagt agcagggatc 2941 tgagtccatc ctcttgccca gagtcgccac ggtctcacac ctggatcttt cctggtgtac 3001 cacgctttct ccagctcctt tcatcttgtc ccctcatgtc aggacacccg ggcatcccca 3061 cctcaaggtg cgcacaagtt acttaaggaa cttgctacaa tatagccctg catcccgccc 3121 ccaaaatccc accaaaccta gagtatggtt ctaaacagct cacctgtgaa gtctccctgg 3181 ctaaatttct agacttggga cagctggagt tttccaaaca gaagtgtgat gctgtactga 3241 aaccactgac aaactttcag tgacatctct ctgagcttcc acctcgcctg ctggcctcag 3301 ctcccctctc cccttcccat cccctcctct ttatttgtaa gtttgtttag tttcagatga 3361 gaattgcatc tcacctagtt ctgggatacc accattaaat ggatgtttcc atgtgcccta 3421 ggcttcccac agagtcagca tggcccaccc cagggatgac aggactgaga tttaagagtg 3481 tttcccttgc ttcccatttc tcctccctga tgcatatgtt gttatttata tgttctagaa 3541 gttgaccata gtatgaagtt ttctgcagcg tatttttttc tgcccccctt tcactactgt 3601 gtctgagcac atgtgctgtg ctttgtagct gaaactggct ttattgctgc agaaaccagt 3661 ccactgtatt tacccacagc actgatgtga gcattctaaa tacatctcga tacgtgggca 3721 tatttctcca gcgtaactgc cccaggagag atgaactgtg tgttcctgtc caccccctgt 3781 ctccagccac cctggagact tagttctcag gagtctctct cacacacaac agtgttctct 3841 gcatcccacc cgccctcacc tggtgaactc cggcagtcgc cgctaaatct ccctaattac 3901 acacttcttc tgccttgtga ttctgcaaca agtgggtcta tccctcaaaa tccagcccca 3961 taaggcttca ggactgtgtg gctccagctt cagcctgcac aaagtaggcg cccaagcaac 4021 actggaagcc tcggtactga aggggcccgg aaggggcagg tgaggacatt ggagtcgcgt 4081 ctgcaggggg ctcacctggg agcttcctag ggtgtagcca gcagggaagg tctggggttc 4141 agaattccgg gaaatgcgcg ccagagttgc aggcgggggg gggggggggg ggcggggccg 4201 tgtctccgga aggcggggtc tctctgtggg cgtagcgtgg gcggggctgt gccggggcag 4261 cccggtaaaa aagagcgtgg cgggccgcgg tctctgagag ccatcgggaa gcgaccctgc 4321 cagcgagcca acgcagaccc agagagcttc ggcggagaga accgggaaca cgctcggaac 4381 catggcccag acacccgcat tcaacaaacc caaagtaagc accgaggggc tccgttgcca 4441 gggttctgtc ccgggctgtc ccggggctta gcggggccca gcctttggcg cctttaacct 4501 agaagcatgg agtggcaggg ggactcccgc aggcatctcc cctcgaccca ggccttagct 4561 tgcttccggg atgtcgagcg agagacgatg tggcagggag tgtccagaac ctgggggtgt 4621 ctctggtcgg ccttcgggtt cggctgctgt ctatgcgaac ctgggagtgc ctccagtcgg 4681 ccttcgggtt cggctgctgc ctatgccctg tgccctggag gtctcagcct cgctgtctgc 4741 caatgggcat ccagtgcggc ggggctgcac agctgtgtgg gactgggcta ggacctgggt 4801 gtctgagccc cagtagaatg gggcccaggg tctctagctg ttaaatgttc agtgtatggc 4861 tttatactta agtgttatga ttactttgtg ggcaacaggt aacctaggtt tgtgggtgcg 4921 cccgtgggaa aatctatgat ccaaaccaga aaaggaaggg atagaggctt cagggtgcca 4981 ggaggaaccc ctacacatac tgaccgtttg gccatatggg tttatttggg atgaagtttt 5041 agcccttgac cccagaggag aaccctttat ctgtctttct gcaagctgtg gcttcttgga 5101 aacagggaga ctccaggtcc ccaaggccag atttgcagcc cttacagatt ctgtctagtc 5161 agccaggcaa attgaactgg tcagcagagt gtgggactga gaactcaggg ggagggatca 5221 gagacagtca cccttagact tacccctcca agaaacagat gctgagtggg gggcggggtg 5281 gcagacgtat gaatcccgtg tgcatgttgt gtcatatatg cgtgcatgga gggagcggga 5341 gggaagatgg gcagtgggcc tgtattccat gcacttacca tagggaacac actctgcccc 5401 tctagctaga ggctagaagg gcagggcaag tcttcctacc caaccaatgc ctgctgcaca 5461 tcttgtctgg tggctcctga ccacagttgg tgctcttaga catcaaaggg tgagttttct 5521 tttgatggtc tgaattctcc ctacccttgg tcaggggaac catctctcct gatggaaacc 5581 taagagagtg actaaaggtg agcgacaggt tacacttctg tggcacctac ccttttgtaa 5641 cagctcttgg gggatggggg gagtgccaat atatcatgag ccctggtatt gtcatctgac 5701 ccatgctggg atgtcattcc ttccctcggg gctcttagac tccacacctg gagaaaagga 5761 ctgttagcat tggtgtctgg atgttggaag gtgacgtcag caggaaggta gcatgctggg 5821 ctgctacctc tgaggtggga caggtctcgt gtcatggtga ggaagaaatg ctgctatggc 5881 tggcagagtg ggaaggagcc actgtggtca gcgcctggcc agcctgcgta actgatgcta 5941 ccaggggccc agcatactgt aactgcacac aactagtgcc ctgacttggt gaggtagctg 6001 atgctgtctc catggataga tggggaagca ggcacagagc ttaattagga tgtagccaag 6061 ctgggagaga agcccacatt gagggtcagc ttcagaatcc cattccctaa gccaggcagc 6121 atagtacaga gccggaattc ctattatttg gaagactgag acaggaggat ttttgagttc 6181 caggcctgaa ctatatagca agacctttgt tttggatgaa agaacaaaac ctaagaagca 6241 gaaattctat gtccccatct ttcatgggtg atctctgcaa acgctttagg gataagagga 6301 tatctacctg agcctttgct gggttctttg gtttctcatg cctacaccag ttgggactgg 6361 ggcttggaca acttcaaggt gagcagatga aatgggccag cgtttgtttt tgtttcttct 6421 atgtggtatg tgtgagctaa ggcactgtgt agggacaggc ttgagtgtca ggtttcaccc 6481 tccagcatgt ttgagataca gtgtcttggt tgttcacccc tggaactcca ggctagctgc 6541 ctcgtgggtt tctaggaatt ctcctgtctc tgcctcacat ctcacagtgg gggtgctggg 6601 gatggcaaat ggcagttacc atgcctggct tctgtgcggg ctgtgtggat ttaagtcaga 6661 tccacacact tgtgttgcac gctctttacc tgctgagcct tctccccaac cttgcccttc 6721 agatatagat atacacgcaa gtcacctggc tctaactgtg actggggagg cttgggtggc 6781 actgagatac tgttgtcatc atgccctgcc agcaatgtaa aactgctggt ccatcagcct 6841 gcgtggagaa gaaagacccc tgctttgact gggtgctggc ttccttagta cttgctgcac 6901 ctgaagattg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg gtatgtacac 6961 gtgcatgcat gacccagtaa cccttatcac tctgtgccca cttgaagcag gactagaaaa 7021 tcccatggtt cccgttgctg ctgtatgtaa agatgcatgc tgacctcctg cctctggctg 7081 aggctctttg gagtggttgt gagtaccagg aacctttagc agtcattcac aattctatga 7141 atgactgagc aggcagctgt ctgcatccca ttcacaagtg gagttgtcca aggactagag 7201 agggaaaagg cttcctcaaa gcaacagtct ctgtgcactt cacacctctg agggaactgg 7261 cccaggttgg aaggttgttt ctaattgggt ctttcctcct atgcaggttc atagagagca 7321 tgagcccagt gaatctttgc tttctcagac ttttgaggtt tgttctgtac caagtgtagt 7381 tctagctgat ttgtgtgagt taagttgctg gggaccagcc tcggctgcag gcagggtctg 7441 gagaagggtg gatgcaagag cagcaagggg aagcaaggat cagacatggg atgtggggga 7501 cgggacagag agaaagcaca catcgcatct cactatggct tctcctatgc caacaacttt 7561 tgtatctttc aggctttatt tatacataaa aaaatacaga aattttattt gttgattaca 7621 caacttttca aaatacattt tcttgtaaag ttttccttga tcacataaga atattgaaag 7681 aggaacaaag agaacaaaca gaaaatttat gagaactaaa aaatgaatat taagatcaaa 7741 tcaattttct ttaaatgcag tatctccttc cttaagattg gtgtcataac gccttgcagt 7801 tacagaagtg ctaggcaggc tggctctatt tgaatttgat tctatttcag tttggcataa 7861 gattaatagt tcaaaagtta tgtcctacta cctattatat caaacttggc tttgtgcaag 7921 agttcagtat ctaaccctta tttaattctt cctttctttc ttctatattt cttttatatt 7981 tggcataagg ccactgtcct ttttcattga ttcttaaact attctacagc ccctttgctg 8041 aagcaaggac agtctatcag taaagttctg gtctgacaag tctattcagg aaacaggcgg 8101 ttcacaaagg atgtcttcct gtgtttctgt catgaacctt tgtttcaata caatttctat 8161 ctgttactct tcgtaaagaa cagcagaaca tcagaattcc taaacctgct tgcttttgcc 8221 ttcaggaagg tacaaaggct tcataaactt gcatgaccaa aatctcttcc tctgcataca 8281 agttcctagt ttctaaagct gaatattaaa aattcagcaa ttcaattaca gctggtcagc 8341 ttctaaaatg tccctaagtt tttcattctt gtttacacta gctttctgct atcccaggtc 8401 tagccttcgg agagaggcct ttaacagtca gaaacagaag actagcaaaa ggtgtgcagc 8461 gaatatccat caagggcagt gaagggtcag tccagattgt ccaagcaccg gaactttgtc 8521 aaacaaggcc tccaggatgg ttagcatcac cattaagtca cacttcattc acactagcct 8581 ttgggataga tcctgctatc ttcatttact taggaggaaa ctgactcaga acttgccaaa 8641 gtagcatggc cagaaagcag ctgagctggc ctgcaactca gactgcccta aaactgaagt 8701 tggtgctatg aactgctttg ctaattcatc tttcattttc tttttgttaa aaaaaagggg 8761 gggtaggcat gggagtagga agatgttaca gagcaaggag tcagagtctg caccttggcc 8821 ttcagactag atcagggtag agccccctgc ttctcagctt gacagcatca agtcctcagg 8881 cacaccctga agctcagcca ggacagagaa acaactgttg agggctttgg ggaaaagcta 8941 cacgcccatg gcttctgcta cagtgcctgg tgggtagcac aggctctcaa gtgtgtgctt 9001 ccttccggag gttgcaatga gtgcttcctg gactccggga caccctcatt tcgtgtctta 9061 cgagacaaag ccccaggtgc gcctctttgg attcttgatg tctctcattt accagagcca 9121 cctgttctac ccctgggtga gagcagctca cccattttga ctcatgtggc agttgttggg 9181 gaggggacag aaggagattt ggtttcatag aagcctttgt ttgacagaag cagaatgaag 9241 ggccagaaag gggaagggga aagaggtctg tggttacaca ggaaggcagg gaagccagcc 9301 agaccccatc cagggaggct gtcctgcaaa ctgagaggat tgttcatctg cctaggtgtc 9361 ctgtagtggt acccccttag ggttagggaa ggcagcacag tgggatagcc ttgaataagg 9421 ggaacctcaa aatgaggtta aatgctatat ttaccacgag gctaaggata tgggtgagcc 9481 gtgccccaga gtgcccgtag ggctgtgtat acctgtaccc agctcacaga caccacatgt 9541 gggcctgctt agaatgggta tgccactggc agctgctgtc tctagggacc agcaacatcc 9601 tggtcgcatt tccccatcat ctgcctagca acttgcattg caggatgtag gaacatattg 9661 ccatgggggt gggagttggg gggggggctt ccaactccaa ctttagggtc atacaagtct 9721 ctggctgtgg cattcattat tttgtaatac tgttgtgact taaggcaagt tcctgttgag 9781 ccttggtctt atcacctata aagtgggggt aaatgggctg aatttctaga ctgttgagtc 9841 agtcataaga cttctcccaa atggagatac ttcttagtga cttggggact tctggggtag 9901 ctcatgaata gcagtggaga gctcagccag gccctaaggt ttcaggaaac acagaggctg 9961 tttggctcct tccagtctac ttagagctac aggagcccag aaagggcaga gcttagaggg 10021 cttcctgaag gaggtggtgc tgagacagct tcttggccca aggctgcctt gtttgatggt 10081 ttcatcttaa aagctagatg agaggccctt cctagggcat gagtactgac ccttgcccca 10141 ctcatcccct tgccagaagc agagatagga aggagggact ggggcttgct ctgtgggcac 10201 ctgtggctcc tctgtgatct ggctgtagtg ctcagggtct ggcttctctg cccacccagg 10261 gtctctacca cagttgccat tgctgacatt gagcctgggg tgggggaggg gtgaatcagt 10321 gacactcgcc agaagcagag cgcatttggc tgggccgggt gctgtgctag gaccctgaga 10381 cacagcaggg tgacacagac aggtccctga tctctgagag gcaaaagtgg gttgaaaaca 10441 gaacagggag caggtgtggc cactccctta atcccagagc agagagacag gtaagttaga 10501 ggtcagcctg gtcttcacag tatgtttgag aacaactaag gctagagaga ccctctctca 10561 aaaatacata catacataca tacatacata catacataca tacatacata catagaaacg 10621 gaagtggggg acgagaaaaa aagaacaggg ttgtgtgttt taggggcctg gaagcatgag 10681 ccacactgtc ctattgggac aaactccctc taggcctgaa ccagctatgg aagcctgtga 10741 ccaaggaatc tgagatttag cttcctgacc tggttctgtc ccaacatgtc ttgtgacctg 10801 gtgtgacctg gaacgctggg ggttgtgact gtgcttctca ggccctcacc tatgatgctg 10861 acgcctctcg gtgcttcncc cagttcctgg acacagtaat tatctgttag ttactaagct 10921 gtttcctgtc cccagcagca gtgggagggc ttcctgctat tctgacagat aatgggcaac 10981 tatgagccag gcttgcagag tgacgaagat ggagagatgc atgatactcc tctccaggag 11041 gtcaggaggc tggacggagg cttggcgcca gggtcagggt ggtgctgtgg ccatgaacta 11101 gacctggcct tagaggcaga aagacttggg ctctgccttt cctgtactgc aggggttctc 11161 aacttcccta atgctgtgac cctttaatac agttcctcat gttgtggtaa cccccagcca 11221 taccattatt ttcattgcta cttcataact aattgtgtta ctgttatgaa ttgtagtata 11281 agtatatgac gtgcaggata gctgatatgg gacccctgtg ggagtctcga cccattggtt 11341 gagaaccgct gatctcctgg cagtctgtgc aggcctgggg aggtccttaa acagctctaa 11401 gctttgcttc acaaatccaa ttgcaatacg aactaggtct aaggtgaccc ttgcctagcg 11461 atgaagagag actcagatgg agggtctaat gacagctagg gctctgaacg cctcagcagc 11521 cgttgccata gaattcctct tgtttgcatg aggccggcag gagctcaaga gcagactgac 11581 tgatcctgaa tctttagaga ttcctttgag cctcagtttt ctcttgtgta aaatgggtgt 11641 ggaatctttg ccttcaagga gacaatacat actcacacaa tagacactca aatgttaaac 11701 ttcctatctg ttccttttcc cagaatatga cccggaggac aggcttgtac ctcaaagctt 11761 ggccccccct tggtcatctg gggtggtagg ttggcttctc catggcggct ttctggaatg 11821 ttcactgttt aaaatacagg ctgtctgtgg agatggccca ggcttctgct tccttcgccc 11881 ctcagcctgc tccatatggc agtttctttg gagcatggtg ggcagttgga cccagaggga 11941 ccattgctcc agcctgcttc tctgagaccc tcctctggga actgtcatgt tttggactcc 12001 tcctgcttgc caagcaccta ctgggtgcac agtgtggcac actctgtgca ggagctggct 12061 ctcatctgga cacattagca gtcctctgtg aaggttaact tccagcccgt tctccaccct 12121 ggaagccaaa gaaagctttt aaaaggtgtc tctcctattt gaaaatattt ttgttgaaaa 12181 tccatctctg tgagccatta ctggcaggcc tcctgattcc aacacttggt aactcctgga 12241 tccacgcttc tctcagcctg ctctgccctg ggaggtctcc tttgaaccag ccttccggac 12301 atacattcta tatggtaccc cttgatagct tattctcttt actgtgtggt tctgggttct 12361 gtgggtcatt ggaacctggg actcaaacct caggagggct cttgttctgc taagcaggac 12421 aagacggaac agtccaagag ggcgtggatg cttctagagc tggctcactg gctgtctgcc 12481 tagcgcttcc cagagccact ctgctcacaa tggtctcctg gggcctggca ggtagagcag 12541 tcacaccctg actattgtca cagctacact acagcaggga gcagtgtgga ggcatctcat 12601 cttgacttgt gctttcccca gagcgacctt acagacccct gacacccact gatgaagacg 12661 caggacccag agaccctgcc ctgacccaat gtccgatctc ccagctcaca gaccagaacc 12721 gctatgaccc acatcttctc cctgcatccc cctgagctaa ctgtcagcca ttcaaaagca 12781 tggttcatag aagaaccatg aagattttct cctcttcctt tcttatccgt gttgtttcct 12841 agcctgccct ggctcaggcc tctggcacag ggctccaggc cacttttctt aaccttgagt 12901 gcatgttaag aaaaacaaac aacaacaaca aaatttttaa aacggaacag ggctgagggt 12961 gtgacttagc agagggccag gcctgctcaa gggcctagct tccatccctg gcacccccaa 13021 aattacagaa gcacacagtt gtgcctgaac acacccagag ggctgaggca ggtctgaccc 13081 cagttgaagt ccagacatca actttccaac actaccagtg tcctgaaggg ttcttctagc 13141 ctgcgttctt agactaaggg acacagatct gtcctcctag tctcgacctc aggctatatg 13201 agattcccgt ggtggctgca ataggtgacc acgttctaag tggctctaaa gccacaaaac 13261 acactgggct ccatccccag caccacaaag cccaaataaa ggcagttact acatgacttg 13321 ctcccaaccc acactctgcc attctgcagg ccacaagtct aaaatcaaag tgccaggcag 13381 ggccttgctc accctacaag ctcagtgggt cagacctttg cttctctgtc ctctggtagt 13441 gtcaggcact ctgggtttat ggcctcgcag ctcctatctc tgcctctgtg gtcacatagc 13501 tgctgctttt tctgagcacc agatcttctc tgcctctgct actaggacag gagagtacat 13561 tcagggcctg cccagagggt ctaggatagg tttttccaag attatgaatg actccatgtg 13621 ctaccatcta agatagtgga tgactccatg tgttaccatc taaagtcctt gctactggtc 13681 ctggggcttg agatgtgact ccagggtccc atgattggcc caaggttctg acccagggag 13741 gctactgctt ctcgttgatg agccaccagg actggcctct atatgctgtg tcttccagtc 13801 tcctcctatc tctggtctct attctccaga ccctgcctcc ctagcactaa gtgctggctt 13861 aagccagggc ctggtcctag accccagcct caaactctta tctgtggtag cttctctctc 13921 tctcagccta cttctctcga ggtaccttaa gctggcctct ctgtgctctc accagaaaac 13981 ccatctttct gtggtcacgc agcccttccg ggggggttct tccctactat gctcagtctc 14041 agcgtttccc ttcctcccgt acacccccac tttcccctgt cgcagggctt ctgcgaagac 14101 tgttatctcc atggaatatt cttctgcccc gcctcctcac cttcccctgc ccagatggtt 14161 cttttgcacc attcagtgct tggcttggct tttctaggag tccgtttgct gcctggccgg 14221 actgtgagtt catggcacag ttagtgtttc ctgtgggagc atcttggttc ataactatac 14281 attttttcat gtgtctttag tttgtccctt gctggactgt gtgtgcagaa ggaacaagac 14341 tttgtccctc tttcttactc ctgtagcccc agcaacaagc aagggcttca ctatttttcc 14401 tcaaatgaaa agctcctaag tgctgggtgt ggtggctcat gcccacagtc ctgacacctg 14461 gcaggctaat gccggtgaat tgcaagtttg aggccagctt gagctaataa tgaatctcgg 14521 gccagcctgg gctacatagt gagtaagatc atgctcagaa aacaatttga gagataaaca 14581 atcaggaaga ccaagggtca tccttggcta ttatagtgaa tttaatgcta gcctgaacta 14641 catgatacta tagtttgttt aagtgataaa tggcacgatt ttatctattg ccatatgcct 14701 agagcttcgt gactgaccat ccccatggcc cctattcatg aagacctaga ggacggtgta 14761 ctgttcttct ccaggtgaca tgagaaccca gaggcacaga gtgaaatact ggagttatct 14821 ggttactcta aggtcttgcc agctagaggc tttggcacct ctggcccctg ttgccctggc 14881 cctggtttct cttgctggaa gcaggctcag tgacagttcc tgggggctgt cccttggacc 14941 agaagctccc aaatgttcct ccttccagcc aggcctgcca accttcttat ctcagccacc 15001 aatttccaga aatggcccca ccctggcaaa cgccactact gggtctccct tagtccagga 15061 agcttccccc aagacttgga tcaggaagat gacaaagtcc cttaatgtaa tagctgttga 15121 aaacaccatc tcccagagat gcttcctttg ggccacacct ctccacattc actgtttaaa 15181 atctctgtca gggtttttgt atgagagcga gctatcctac actgttgaca cttgggccat 15241 ctgagagcag ccatacaaat gtagatctgt agtctagaca catggactct acagagcccc 15301 tgcttgtggc ctcagctttg tgaacaacgt gtcacacata ggactggcca aataagggat 15361 ggtcaccaag accctttcta gaggatcaga aagacgtagt tgtcaatatg ttgcctggta 15421 agacgggcct gtctagcacc tgtttgtgaa acagagcctg gtatttgtat atttgcattt 15481 actttgcagt gctaaagttg gagcacaggg ccttgcacat gcaaatgtca cgtatgcccc 15541 actggagtct tttatttcgg tgtgtgtttg tgagcacacg tgcgtgtgag tgcaggtatg 15601 ggtgagcgtg tccatatgtg cctgcatgtg aaggctagag gtcagcatca ctcaagtgcc 15661 acccacctta ttctttgaga cagggtctca cacagggatc tgggggctta ttgattaggc 15721 taaactgttt ggctagtgaa ccccagggac ccccccccca tctgccctcc tctggaatta 15781 caggtgtgca gccccacact tgatgtgatt gctgcctgag ctcagcttgt acagcaagta 15841 ccaagctatc tcctaaggcc cctgagcctg tgagtctgca gttggctctt tctgttaaac 15901 tcaagagttg actgagtgaa gtcacctacg gcagtggctc tgatgtggtg gtggctgacc 15961 cccactgaga ccatggctgg cctcgtggct tgctgtgtaa cttaggacaa ggcgcttaaa 16021 cctctctgaa ccccagaacc ccaagctgtg aaccacacgg caccttgact gcccccaaag 16081 cgcacttaca ggtgagctta ggcgaaagcc tgtgtgtgaa gagtttaagg gtactgatta 16141 aggtccagga aaagggtgac ttctgaggag ctcctcgggc tctgtggtgg cttctgaggt 16201 gtcctctggc tctgtggtat ctcacgctct ttttctgtcc cttgcaggta gagttacacg 16261 tccacctgga tggagccatc aagccagaaa ccatcttata ctttggcaag taagtccaag 16321 gacaaccaca gaccttccca ggattgcaga gcgtgtacag ctcttcttgg gggtccctgg 16381 agattcccag ggctctctcc gttccctgtg ggaagcaagc caagccttga tggaagacgg 16441 tgatgctctg gagtctttcc actccagccc ccataggttg aggtgggaag ggagcatcct 16501 tcctctcggg cggtcaggaa aagtgagccc ctccccccag catcttcact catgctgtta 16561 agcattctgt gtagactgga aggacccagc tgtctcccac tgtctccagt gtctagtttc 16621 gactgcctct gtcccataag acagcctcac aggtgtccag gtgctacttg ggaatttgtc 16681 tcccaactct tgagcttagc tcccaagtcc ctggagagtt ctgttaggca tctttcctcc 16741 gtgttgttct tccacagcct tttcattccc tgggtcatgg tgtgctttgc ctaggctgtg 16801 gtccactgag ttaagttagt gactccttcc cacagtgagg taggcagttt tcgtgcctgg 16861 cccgatcctc tttgtagata ggaagatgga gtcatggcag ccaaaggctt cctccagagc 16921 atacggcagg gttttatagt tcaggagtcc gacagagtcc cagctactct caccaatgaa 16981 accagctatt tctatgaggt cagccatgag aaatgagagg ctcgggacat attagcttgc 17041 cccgtaaagg tcactcccag atgttagtat gggtatctat gggtctagtg tcaggggaac 17101 acctctggcc ttatctctgg gtggcctttg gtcatttggc tggtcgatat cgccccaggg 17161 cctcgctgag ctcacttggg tgggacactg agatgggttc caggcagtca ttattaataa 17221 tgtattagcc aggtcatgga ggtgtgggca gccagggtag ggtggagtga aagagcaaca 17281 cgtgggtagt gggggctgct gagattttcc tagcctcagt ggccccccac aggggagggg 17341 cggctcccct ggaatattca agcagcagag aagaggccct ggcttgcggt gtccaagcca 17401 gtgtcccatt cttggtgact gcattgtcaa ccatctcgcc ttcctccagc tccctggctt 17461 ctgttctttc tccatgatgc ctcttacctc cttcccaggc ccttcaccct gtgggctctc 17521 tctaaagcac actgaggatc cagccacgcc ctgttgccgt agttgtagaa ctctgagcca 17581 ccactgtccc ttctcttgtc tatctgcagt ctgtctccag gtgcctacca tagccatcct 17641 ctgggcttta gggctgcagt ctgaccattg tttctaaaca cagttaccag tgtgtccctc 17701 tgaaggctcc ccaggatcac catggctgga tgtggctata ggagggtgac tctgaggcaa 17761 tctggcccgc agcaaaaagt ttaatctgga gggaccaggt ggatattcgg aacctccccc 17821 tttgtgctag gaccatagca gagggaggta ctgtattccc acagtgcatg atgggaacct 17881 agcaggactg gtgaggattc gtgactcaca agcccgagca ggagacatgg gtgtctccta 17941 ctggaaaatt ccaggaatgt gggcttattt tattttattt cattgtggtg ctgggaatgg 18001 aacccagtgc cttctgcatg ctatgaagct acagtctaag cccctcaata gcttacacac 18061 acacacacac aataataaca acaatagtga tgataaaata gaagaaacca aggcctagag 18121 ggatccaatg ggctggatct gaatcagagt gggcagagac ctcagtgcat gcgcaggagg 18181 aggaaacctc cccaagttca caccagcttc tgcagtccat ggggaggcag agaccctgat 18241 gcaggaagaa gactccaagg cagaggtggt ggtgctggat tgggggcggg gtgggaggat 18301 tatgacgctg cacagtagct attgtgtctg atagcttctg tctgggcata tgtccaactg 18361 cctctgcctg cccctggacc tgatgaggaa gtcacgggct tgccccggta ctcaccctgt 18421 tagtcccttg agatgaagtg acttgtctgg ctattccacc ccaaatgcat ttaatcctgg 18481 aagtcaagca gagctgggcc tggttgttct tggatgggag atgagtgact tttttttttt 18541 ttcatggctt ctggttagaa gtttttggcg gtggggggag ctctgctctg ttctctatgc 18601 cttggagttc ggagtccgca cccatcccat gggaggaggt agatggaagg atcccctgtt 18661 tcttactggg gcttccagtc tgttccatgc tgccctgggg ggccaggctg gatcccgccc 18721 tggcatctca ctgttttcat ttgtttttgg ttttgttttt aaatgcatgt attttaaaaa 18781 attcatttta tgtgttgcta gtgttttgcc cccaagtata tgtgtaacac acacatgcct 18841 gatgccagca aaggaggcct gaaggcattg gatcccctgg aattagagtt acagctggtc 18901 atgggcctcc atgtgggtct cgtcttctgc aagaacagcc agtgtgctct tacccaccaa 18961 gccctggtgc agcccctcac ccttgacttt atttttagga agagaggcat cgccctcccg 19021 gcagatacag tggaggagct gcgcaacatt atcggcatgg acaagcccct ctcgctccca 19081 ggcttcctgg ccaagtttga ctactacatg cctgtgattg cgtaagttgc tccccaaccc 19141 ttgtgcccca cagtagcatc catccctata accaaggtca ggcctgagct gctgctgtac 19201 aaggcactct gcacctctaa ccctgcttct taccatagtg tgcacatgag ccgacctctc 19261 ctgtgccttt gggctcctga tctcattccg tccagaggtg gcttctgtct cagggcagag 19321 gaggctctgc ttgctgaagc cccgtgctgg ccgggctatc taggatgtcc tgttcagaat 19381 tcctgcagcc actgctcttt agacccatgc caccattcca gatctcatag aaagagccct 19441 agtcttaagg taaactgagg catgggtgca gaacagggaa gggtctaagc caggtccctc 19501 attccccagg ctagagcttt ggaagggaga ccctggagtg ggacatggaa gtggagttgg 19561 gtctttcttt gatgtcttga gtcactgggt gagtgaagac cattactagt ctctgagaag 19621 gcccaggaat ggtcactaga gagtgccaga gaaaggactc atgactggaa atctaagcca 19681 gccttctgac ccaaataaac caacctttct ccacatctct cagagcaaac accaaaatcc 19741 acaggacagg tgtacaagcc ctacaacttt gagctcctgc tttcatgggt gctgactcct 19801 cccccacccc accccacccc cacctcagat cctctgtccc tgttctctca acctgcacca 19861 ctgttctcta aaagcccacg gtttgtcttc tgcaggtggg gtctttctgg ccgtgctgtg 19921 gcaggccaca tggcctccta gctttacctg tatcggtagc attcctatgt cacagattgt 19981 atgttttccc cctatttaca cctggcttcc ctgtcacgag gcaatggaaa ggcagggatc 20041 ctagctcctg ttcactgtct agtaggagtc ttggcccagc atgagggctc aatacccact 20101 tgaaggacgg atcagactgt agatagctat gtgagagctg tggggggatg gagctaggaa 20161 agacagtgag caagctcccc gctatggcag agctgagacc cgatcctccc tcacccatgg 20221 aacccactcc taggagtgac tccactgcct cagagacagg ccctctcttc tcccccaagg 20281 gctgtcgggg gcacttgctt gccttctcta cattggaggg agtcctaact aggcccatcc 20341 actagcattt gatcactgcc acagaggcct tcacaggtaa cttgtgcctt gctcatggag 20401 tccacactcg acagcaacag caattaaaac ccttccgggc tgtgtctgat cacttcttgg 20461 ccttttggct gagaccaagg gtagcatttg tctgcctgtc caactgccgg cctgggtgag 20521 gtttcggatg ctcccttccc ttacagtaac cctcatcaaa gggtaattga ctagccagga 20581 tcacacagac ttcatagtga gtccatgact ggagaccaag tcctcccatc ccagcgtccc 20641 tctggattca ccatttagct tccctgccag ccttccgtcc attattcatc taatatatgg 20701 ctactgtgtc ctaggcactt cccttggctg aggtatcaca gccatgacaa gatagacaaa 20761 gagatccttg tctcatagga gccaagggga gatgtgatat agcccctaaa aaaagcatgt 20821 ggcatctaat gtcaagtggg gattgaggcg tggggaaagg atgctgaggg gaggtggttt 20881 gaggaggccc gggcagaggg ccttaagctg gcattaggtg ggggcaggtg gagtgcagga 20941 ggctggagca aaggggagac cacgacccaa agtcctgagg tgggagggac aagctcaagg 21001 gtccttggcc agaggaagac tgtgcagtgc agcacaaggc gccaaggaag ctggtgactg 21061 tgacatttgg gctggcatgg agtcagagct atacaactgg caggcatggg ggctcagaag 21121 agaagaatga actttggttt cacacacaca cacacacaca cacacacaca cacacacaca 21181 cacacaccat tttttaaatc actctcactg ctttgtggag agtggagtac tttgggacaa 21241 actggtcgac atgcctgtta gaggcaaagt ggtcacacaa gggtaagggg gtcagaatga 21301 gtttgtagtt caaacgtaca gcttcccggg aggcttggct ctcatgggca ggtgcccagg 21361 cattgagagc ctgtgagctg gaaacaggct ggctcggagg acagttgtag ttacctcgtt 21421 ggctactaga gctcccaagg agctgaggaa ggttgccaac ctgtgttctt cccttcccag 21481 gggctgcaga gaggccatca agaggatcgc ctacgagttt gtggagatga aggcaaagga 21541 gggcgtggtc tatgtggaag tgcgctatag cccacacctg ctggccaatt ccaaggtgga 21601 cccaatgccc tggaaccaga ctgagtgagt gacatcactg gagggggctg tgctgagcgg 21661 ggctctgagc tgaggatgga gtgcttagag ccctggcctg gtccatggac tcagagcgac 21721 tcagctcagt cctaagtgca cgatccctat tcctctgctt gaggctgtgt ctctccccta 21781 agtgaatact agctagttgc catgactctg cctggcttag ccggtgactt ccaaagtcat 21841 ttgctcaatg aatgagtctg gcccgttgcg gtcttctggt cgaccccttg cacctttaag 21901 tctcttgaac cccctattcc ttccctctga gccatgattc tgatacacca catgggaagt 21961 gggaattgaa caggcccaaa ttcaatcccc tctccatcta gaaatagaaa gggctgtgtg 22021 acatcactac atccctgctc cagttccatg gctgcccatg gtcttccctt ggcctaaagt 22081 cctccctctt cctctctcca cacagagggg acgtcacccc tgatgacgtt gtggatcttg 22141 tgaaccaggg cctgcaggag ggagagcaag catttggcat caaggtccgg tccattctgt 22201 gctgcatgcg ccaccagccc agtgagtacc gccgcaccct gctggctgcc tggcctagaa 22261 caaggctgga ccgactatcc cagcgtcccc cacctcgtat ttctagagtt ttctaaaaaa 22321 cacctgtgaa cttttggtga ctctggtgag tctccttaac aggaaatctg ggacttgaca 22381 gccacgtgat ccacaggaag ctcacagcca aaggaaccta aacaatagga aaacgtgtaa 22441 taccatttaa gacttatggt ctggccctgc cttatctggg ccataaagac tcttgatggg 22501 tctgatttct gaggtatccg ggggtcctcc cccactaccg gtcaagtctc caaggctcaa 22561 cctgcagagc cctttctgtg acccagggca ggtcacatca cccctcgtga gctctttcac 22621 acagtgtctg gtgatggtca cactcacttc ttgcccccac cctaaggatt agtgggaacc 22681 aaagccccgg gctgatgtta ggttacaggg tctgttataa tagggcatgg cctgtctgtc 22741 tcttcattaa ccctcagtgt agcctgggga ctgaactcag gtcattaggc ttagggcttt 22801 aactgctgag ccatcttgtt ggcccacatc ataataactt tatgaggctc ctaatataaa 22861 agtgcatgat gcattagaaa taaacgaata ttcccatgaa ttatattcta ggctaaatgt 22921 taggctctga tttcctatct tctcagtggg gtaagtatgc catgggagtt ggtaagcgcc 22981 actggtctag cctgtccatc acttactgaa cccagcccac gtcagcagac actcgccatg 23041 accttgtgca gtggttggtg gtgtccctct gggaaagggc actctgaggc atgggtgagg 23101 aggacacagc agtcaggata tgaagacaag acactggatc ttatatcttc tctggcctca 23161 gatgaggaat tagggcagcc agtgggtggc tggcaggccc tactccatcc ctgtttgatg 23221 tggccccgtt ggacctctct tgcaggctgg tcccttgagg tgttggagct gtgtaagaag 23281 tacaatcaga agaccgtggt ggctatggac ttggctgggg atgagaccat tgaaggaagt 23341 agcctcttcc caggccacgt ggaagcctat gaggtgggcc tgagaagggg agggtggccc 23401 tgggggagct tgggtagtaa gcttggggtc ttctgaatcc ttagatatgt cttggttggg 23461 gctctggact taaaacagga cattacctaa agccccacta ggtgttggga gagttatgtt 23521 tgagtctgca tctgcctatg tattcctgtg tgaccttgga cagggctgct ttttatctgg 23581 acctcattgc cttgattcta aacttcttgt taggccttgg tcatgttaac ttggctttct 23641 tgagctccgc cctaaagcag tgtaatagca tttgctttgg actattagac aatgtgttta 23701 aacccttcaa acagcatcag tgcacagaag ctacagagta aaacgtttcc actctactcc 23761 tgacttggga aactagacca cactgtccta tacacttctt caagtagcct agcctgattc 23821 tggggacagt gacacttgcc aaaagtcaga cccctaaatg tggcagagac agagctcgct 23881 agggtcctct cagagccacg cccaaactgt ctcttcccct tccagggcgc agtaaagaat 23941 ggcattcatc ggaccgtcca cgctggcgag gtgggctctc ctgaggttgt gcgtgaggta 24001 aggagccagt gaccccgggc ctcttcttcc tgattctgtt cctgtccctg gactcacctc 24061 ctctctgctt ctccacaggc tgtggacatc ctcaagacag agagggtggg acatggttat 24121 cacaccatcg aggatgaagc tctctacaac agactactga aagaaaacat gcactttgag 24181 gtgagacgcc aaggcagaga gagtgagctc tggctacccc gtgcctttca gacagaggca 24241 ggacaggcag gctgagtggg agtggccaca tggtggaggt ttcgtggggc ttgaggcaat 24301 gaagcactaa agctatccag aatagaacct cagcaggtgg ctcagccctg accagtctgg 24361 ccccgggccc actatgccag ccacacacct gccccttgca ggtctgcccc tggtccagct 24421 acctcacagg cgcctgggat cccaaaacga cgcatgcggt tgttcggtga gatctggttc 24481 ctgggaccca tttggttttg atctggaaag gaaggggagg ggatggggac ctgaccatcc 24541 tggttcttag tcatggagtc actctgactg gctgggaagc aggcattgct gtccttgtcc 24601 aacaactgag gaccgtgaag ctcagatgtg ccaacctgac tcctcaccca ctgactaaca 24661 gagctgggcc cattgcccat cggccatgag tgtctacctt cttgttactc gactctgact 24721 tttgggctgc gggaagatgc gtctcatctg tctggatgtt cactcttatt gatgcagcca 24781 ggtgtcccct gaaggccagc ttgaaagctt ttccctagag caccttgccc atgctacctg 24841 actcccatca gcaactaaaa acaaggcctt cctggccaat gaccttatca ggcttcagga 24901 ggagaacaaa gttgtctttt cccaaggcag gatagaggtg ggctgccaat ggccaggaca 24961 gcactggtct gtcagctaca cttacacaga tggaaggact gtgctcttat aacaagaggc 25021 tcgtgggctt tcttagtgtg catggtgcct gtggaatggc tcaatggtca ctctaggagg 25081 cacacaagcc tattgcacac aggctgggga agggtcgaat catctgcgca aatacaaagg 25141 ttgaagagca aagactctgg gagatggtgg atcaagatga ggagggaaga gaacctgtaa 25201 cagtgacagg cagaaacaga aggaaagcca aagacctgtc aaggtgccca gggctgggag 25261 acacctcagt ggataaggga gcttgccgtg caagcgtgag accagggttc agactgctag 25321 cacccacata aaaagccagg catggccgtg cacacctgtg accttagcac tagggagggg 25381 agtggagaca ggaggattgc tggcctcaaa atagtgagct tcaggttcca tgagagtccc 25441 tatctcaaag aagtaaggtg aggactatca gagaaagaca cccaatgtct tcctccagct 25501 ccatgcactc ctgaacacac acacaaaaac cacaaaaacc aaagttgcaa atgacttttt 25561 taaaaagtat ggggtcagag atatccaggt ataatgcttt gagaaaagtc aagtgtcaaa 25621 ttctgggact ggtgagatgg ctcagtaggt aaaggtgctt tcctcagagc gtgacaacct 25681 gagtttgatc ccaggaacct atatggtaga aacaggaaag cgacggccat aaattgtcct 25741 ttgacctcca atctcatgcc ccatggcatg cttaaataaa taagaatttg aaagcgttga 25801 aaactgctct ccgagcaagg aaggttggag ctgccagaaa ccagtcaagt tcaacaagat 25861 ggagggcagg gttgtgccag gaagtttcca tgctgtattg gaagagcctt gtggcaaggg 25921 actgaggact gactaggaga ggagggtagg atgctagcca tgctccacca ctgagagggg 25981 gaaggctaga gagcatggta gcttaaggct gtcaccgtct ccctgtcacc tcaggctgtt 26041 ctgcctctgc ctctgagctg gcttttccca gttttataaa tgtttcctct ttaatacgag 26101 aatgcaaccc tttgtgttgt ctaaggttgt ataaagatgg aagagggagg tggtggaagg 26161 gcagtgatgg ttcttggagt gaagaggctc tctctctctc tcttttcttc ctgcctggcc 26221 cctcccccag cttcaagaat gataaggcca actactcact caacacagac gaccccctca 26281 tcttcaagtc caccctagac actgactacc agatgaccaa gaaagacatg ggcttcactg 26341 aggaggagtt caagcgactg gtgagtatgt gtgagctatg agcctgacac tggcccaggt 26401 gtgtgtgtgt gtgtatatgt gtgtgtgtgt gcgcgcgcgc gcgcgtgcac acacacgcac 26461 gtgagtgcat atattgtgtg tagattgcct agaacctctg gagaccaaga atgggttggc 26521 ctgacatgcc aagctcagcc tagtaccaaa agccttccac agagatgcct tggcagtagc 26581 atgctggctg agggacgagg caacttagcc tttgcttggg ttgagcccct tagaagaccc 26641 ctctcccaag acaggtctga ctgtggacgt ttttgtgaca tttaggttgg ttgtaggctc 26701 agaccttggt cttgggttct gatgtcctga gtgagtccct gcaaggatct gtttccccca 26761 ctatgatgcc cttgcccttg ctaacagggc tgcttccttc cttgtcctga ctccatgttt 26821 cccccttcta gaacatcaac gcagcgaagt caagcttcct cccagaggaa gagaagaagg 26881 aacttctgga acggctctac agagaatacc aatagccacc acagactgac ggtacgcttg 26941 tgcagggcgc aataaccacc ccaccacact gtccctcctt aactctgtgc gattgtggca 27001 gaagtccttg ggcaggagca cacctctgca gggttacagc caccacccta tgctgtccct 27061 ccacaacttt ctgtgtggct tgtagcagaa atctttccca gagtagggac agccctgaaa 27121 aaatggagac tgtctggctt ctgggcacag tgctcagctt cttgtcaggg tggctatttc 27181 ctgatcatta tgcaggatag gctcgggcag cctggagtgt gacttgaaca gcagttggcc 27241 cagtcccagt tccatgagtc cctttatttc cttatggcct tggcaagcca cacattccct 27301 tgcttgaagc ccgtcttgct gtctatggaa ggaagttatt actctctggc ccggatttct 27361 gtgctttcta ccatgcctta catgtcatga gacctgacct ttctatttct ctgacttgac 27421 cagcagggcg ggtcccctga agatggcaag gccacttctc tgagcctcat cctgtggata 27481 aagtctttac aactctgaca tattgacctt cattccttcc agaccttgga gaggccaggt 27541 ctgtcctctg attggatatc ctggctaggt cccaggggac ttgacaatca tgcacatgaa 27601 ttgaaaacct tccttctaaa gctaaaatta tggtgttcaa taaagcagct ggtgactggt 27661 atcttgcagc acatggtgaa tatggtctcg gggctgctgg ctaggatgct aagaaaggag 27721 gagccctggg ccctacgctg agtgtcaggc tggggagcca gggtctcttt cctgcagaag 27781 cgattctttc ccagaggggc tgttggagca gatgctcctg aactctccgc ccctttaacc 27841 agtcctttgg atttattttt attattttta aatatttaat tatgtttatg tatatgggtg 27901 ttttgcctgc ttgtatgtat atgcatcatg tgtgcctggt gccagaggtc agaaaagggt 27961 accacctccc ctggtactgg aatttaggta gttctaaccc accatgtgcc catgtgccca 28021 ccagtggaca gcaagtgaga gccgactctc tcttcctacc ctgtcagccc cagggatgaa 28081 ctcaggctgc caggctgggt agtaagtctc tacccactga gcatctcaca ggcccttttc 28141 aggctgtggc aggtgtgggc tgtgggtggg tgaggtgagg aaggactcat gagatgccac 28201 atgaggggtt gtgctggtca ccacaggata gagggcagca gtggggccct gcagggtgag 28261 gatgtgggga tgcagagccc agctgtgcag ttgatagtta aaagaaaaga tcataagcca 28321 gtcacacaaa ccaccagcat ttatttcctg ggcctaagta tcagtctctt ggtacaaagg 28381 gcaaagaatt ggtcagttag ttttgagtcc aaatcctttc ccaaaactgg atctgtgagg 28441 ctccatgagg aaacttagag ctgcatagat catcagaact gagctttaaa cggctttcaa 28501 aacaaaacca aaaccaaaaa ccaaaacaga agaaaagtga ttgtcagctc ccctctctct 28561 ctgaaaaagt tatgttttta aaaaatgtaa aagaggcttt aaaaaagggc taggaagatc 28621 ttcctgacag acaggctggc tggagggtga gtggtgacag gacagtccct gtccaggtgg 28681 cggctgctca cctaggggca gggcctggct cgagccccga ctagctgaat aaataacatc 28741 tcatggtgct ggtgtcttca gagtgaacct gggcaccgtg gggaggtggg gaagggtgga 28801 gcctgtgctg tgggtggagg acttggctgg agtgggggcc acagaggact ccatgaggct 28861 ctcagtccgc ctggctggcc agggctccag tccgagaggt acagcaggtc tcttccagcc 28921 ttgcgaccac aggcagactc aggatgaggt gttcgcatca ctgctctcag gctggctgct 28981 ggcttccttg tcgggggtac ttccctctgc ctgtccttct gcaggggaaa agaagagtgc 29041 attttcttct ttcacccatt tacatgtgag aagcacaggg ggcctggaag gagcagccac 29101 actgatgacc ggggccagcc ttggaagtag gctaatgggt tcgttaggga gggcgtggcc 29161 aaggccggag agacatttaa cctgctctca ggtctccttt taccatggga aggctgccag 29221 ggctgctggt atactaaaac tcagtacagg tcaggactac agacctgctg atcagcatgt 29281 cctctctaga gggagaatgt taagttccca gcaggacgta tgtgcttagc aaatgggagt 29341 ttcctcaaac accatatagg agctgaaacc agtgggcagc tcatgcgtgc ccactgtggc 29401 ctgcacatcc taaggtagag gatacctgac tctgaatacc actggcagca tccagccctg 29461 ctctattcca gccagccaga aacatgggct ggaataaggg acagcagcag gctggggagc 29521 cacctggcct ggcctccaag tccctacctg atctggtcct cactcgaagc ttctctgaag 29581 ccctgatgga aggtgggacc agcagcttct gccccaccca cttgctagca gcttgtattg 29641 ttattcagcc ctctcagggc cagcatcctc cacagatgct ggttatggac aaaactcaac 29701 cagcaatgca gtctcaagcc ccagggatct tggtgccact gaaggtgcat gtcattctgc 29761 cattaggcct gtctttaagt ctatgttgag ctctgggtct ggaattc // LOCUS MUSIAP 5293 bp DNA ROD 02-DEC-1991 DEFINITION Mouse intestinal alkaline phosphatase (IAP) gene, complete cds. ACCESSION M61705 M35029 NID g194048 KEYWORDS intestinal alkaline phosphatase. SOURCE Mus musculus DNA. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 5293) AUTHORS Manes,T., Glade,K., Ziomek,C.A. and Millan,J.L. TITLE Genomic structure and comparison of mouse tissue-specific alkaline phosphatase genes JOURNAL Genomics 8, 541-554 (1990) MEDLINE 91139124 COMMENT From EMBL entry MMIAP; dated 24-JUL-1991. FEATURES Location/Qualifiers source 1..5293 /organism="Mus musculus" /db_xref="taxon:10090" mRNA join(<511..577,662..778,895..1010,1128..1302,1381..1553, 1815..1949,2031..2103,2233..2367,2441..2632,2833..2946, 3033..>3415) /gene="IAP" /product="intestinal alkaline phosphatase" exon <511..577 /gene="IAP" /number=1 /product="intestinal alkaline phosphatase" gene join(<511..577,662..778,895..1010,1128..1302,1381..1553, 1815..1949,2031..2103,2233..2367,2441..2632,2833..2946, 3033..>3415) /gene="IAP" CDS join(511..577,662..778,895..1010,1128..1302,1381..1553, 1815..1949,2031..2103,2233..2367,2441..2632,2833..2946, 3033..3415) /gene="IAP" /codon_start=1 /product="intestinal alkaline phosphatase" /db_xref="PID:g194049" /translation="MQGPWVLLLLGLRLQLSLSVIPVEEENPAFWNKKAAEALDAAKK LQPIQTSAKNLIIFLGDGMGVPTVTATRILKGQLEGHLGPETPLAMDRFPYMALSKTY SVDRQVPDSASTATAYLCGVKTNYKTIGLSAAARFDQCNTTFGNEVFSVMYRAKKAGK SVGVVTTTRVQHASPSGTYVHTVNRNWYGDADMPASALREGCKDIATQLISNMDINVI LGGGRKYMFPAGTPDPEYPNDANETGTRLDGRNLVQEWLSKHQGSQYVWNREQLIQKA QDPSVTYLMGLFEPVDTKFDIQRDPLMDPSLKDMTETAVKVLSRNPKGFYLFVEGGRI DRGHHLGTAYLALTEAVMFDLAIERASQLTSERDTLTIVTADHSHVFSFGGYTLRGTS IFGLAPLNALDGKPYTSILYGNGPGYVGTGERPNVTAAESSGSSYRRQAAVPVKSETH GGEDVAIFARGPQAHLVHGVQEQNYIAHVMASAGCLEPYTDCGLAPPADESQTTTTTR QTTITTTTTTTTTTTTPVHNSARSLGPATAPLALALLAGMLMLLLGAPAES" exon 662..778 /gene="IAP" /number=2 /product="intestinal alkaline phosphatase" exon 895..1010 /gene="IAP" /number=3 /product="intestinal alkaline phosphatase" exon 1128..1302 /gene="IAP" /number=4 /product="intestinal alkaline phosphatase" exon 1381..1553 /gene="IAP" /number=5 /product="intestinal alkaline phosphatase" exon 1815..1949 /gene="IAP" /number=6 /product="intestinal alkaline phosphatase" exon 2031..2103 /gene="IAP" /number=7 /product="intestinal alkaline phosphatase" exon 2233..2367 /gene="IAP" /number=8 /product="intestinal alkaline phosphatase" exon 2441..2632 /gene="IAP" /number=8 /product="intestinal alkaline phosphatase" exon 2833..2946 /gene="IAP" /number=9 /product="intestinal alkaline phosphatase" exon 3033..>3415 /gene="IAP" /number=10 /product="intestinal alkaline phosphatase" BASE COUNT 1260 a 1467 c 1372 g 1194 t ORIGIN 1 aagcttaatt gggggccaag tagacagcag gacattcagt gtgccttgtt tcctttgtct 61 tttggctcca ggtatcagca agccaaacaa aggcccctca tctaagctgt gttcttcagg 121 cctacctcca gcgcccagaa tgagcctatt ggcccccaca gctctcagga gcaagagtga 181 tgtacaggac attgtgagca agaagtgggt gctgcaaact gcataacccc cctcctaccg 241 gcaagacacc gagtgctcac acagagctta ctcgtaggac ttgccagctg gttaagacac 301 accctgccat tttctctaac aagcaggagt tcagttcagt tcacagggat ggggtgggac 361 caggatggcc actttgatca catgggaggg gcgtggtgtt gtgcagttag gaacaaagtc 421 tccccctatt taagtccagc gctctgtgct ttagttgatc cctggtgtct cgtgtctttg 481 tctgctgctg tcccgccacc agccccagcc atgcagggac cctgggtgct gctgctgctg 541 ggcctcaggc tacagctgtc ccttagtgtc attccaggta atgaggctcc ttccaatgaa 601 caccccattc ccacccatgg acccttcatg ctgacccttc ctctgctatt cccttggcca 661 gtggaggagg agaacccggc cttctggaac aagaaggcag ccgaggccct ggatgctgcc 721 aagaagctgc agcccattca gacatcagct aagaacctca tcatcttcct gggtgacggt 781 gagtgtgtga gcgaggcctg ccaccctggg gcccttgtac tccaagtacc cagggccact 841 ggtgggtacg gacaggcctc agggttcagt cctgacgagg ttctgctcct tcaggaatgg 901 gggtaccaac agtgacagcc accaggatcc taaagggaca gttggaaggt catctaggac 961 ctgagacacc cctagccatg gaccgcttcc catatatggc tctgtccaag gtgagttctt 1021 agccacatct gaaatgactg atgggatcca gggcaaggga ggcagagagg ctcgggtgaa 1081 gaaataaatg tctgctttga gcccagttgg ggtgtctctg tccccagaca tacagtgtgg 1141 acagacaggt tccagacagt gcaagcacgg ccaccgccta cctgtgtggg gtcaagacca 1201 actacaagac catcggcttg agtgcagccg cgagattcga ccagtgcaac accacatttg 1261 gcaatgaggt cttctcagtg atgtaccgtg ccaagaaagc aggtgagttg gagccaggct 1321 cagctatggg gggcaagcct aggggactgg atgtctcacc ctgacctttg ccgtcttcag 1381 gaaaatccgt aggtgtggtg accaccacca gagtgcagca cgcctctccc tcgggcacat 1441 atgttcacac agtgaaccgc aattggtatg gggatgctga catgcctgcc tctgcgctgc 1501 gggaaggttg caaggacatt gctacacaac tcatctccaa catggacatt aatgtaagga 1561 taagcatgtc aaagggagag ggtaagggga gggagaggag gagaaggagg gggagggagg 1621 gggaggtcag gggggtcaag gggggaaggg gtggtcccag gcaaaccttg tagactgaac 1681 tccctggatc ttctggggtc tttgagggcc gggtagttca gttcccacat acctggtgag 1741 gagctaggga ctggcaggaa aaggaggcag aagacaacct aaagttcacc ttccttcatc 1801 ctctctgacc acaggtgatc cttggtgggg ggcgaaaata catgtttcct gctggaaccc 1861 cagaccccga gtatccaaat gatgctaatg agactggaac cagattggat ggcaggaatc 1921 tggtgcagga atggctgtca aagcaccagg tgaccgactg cagaatatta gtgatacagt 1981 ggagaccagg gaagggcttt gaaccttacc agttgcttat gtccctctag ggatcccagt 2041 atgtttggaa tcgtgaacaa ctcattcaga aggcccagga tccgtcagtg acatacctca 2101 tgggtaatgg ccccacactt cctgcactgg tacacctcac atggcaacca ctgatcctct 2161 gtgtatatat gtaccgtgac cccactgcca agcttggtgg tcaccagtat atattttggt 2221 tttgtacctc aggcctcttt gagcctgtag acacaaaatt tgatattcaa cgagatcccc 2281 tgatggaccc atctctgaag gatatgacag agacggccgt gaaagtgcta agcaggaacc 2341 ccaaaggctt ttatctcttt gtggagggtg agtctccaag ctcccatgga aagaggggac 2401 aatggacagg gacaggctca agctcactgg cttcctgcag ggggccgaat cgaccgtggt 2461 caccatctgg gcacagctta tctggcgctg actgaggctg tgatgttcga cttagccatc 2521 gagagggcca gccagctcac tagtgaacgc gacactctga ccatagtcac tgctgaccac 2581 tcccatgtct tctcctttgg tggctacaca cttcgaggga cctccatctt cggtaggttc 2641 gggaacagtg gcaggctgtc aattacgtac agaatacttc tgagccatcg ttttctctgt 2701 ctgtaaaatg gacagaaatg gcacctgcct tgtggggatc tagcaacgac tgaaccactg 2761 gccaggcaaa aggcgggggc tcgtctaagc atcattcttg gcaggaaaaa gtgtccctct 2821 tcccccatgc agggctggct cccctcaatg ctctggacgg caagccctac acctccatcc 2881 tgtatggcaa cggcccaggc tatgtcggta caggggaaag acccaacgtc accgccgctg 2941 aaagcagtga gtgcggtggg gtggcttgcc tgaaggtcgg gtagaggtga ctcagatcag 3001 agtcctctcc cttaacatct tgtccctacc aggtggctca tcgtaccgca ggcaggctgc 3061 tgtgccggtg aagtcggaga cccacggcgg ggaggacgtg gcgatattcg cgcgtggccc 3121 gcaggcgcac ttggtgcacg gggtgcagga gcagaactac atcgcgcacg tcatggcctc 3181 tgcaggctgc ctggagccct acaccgactg cggcttggca ccccctgcag atgaaagcca 3241 gaccaccacg acaacccgcc agaccaccat caccaccacc accaccacca ccaccaccac 3301 aaccaccccg gtccataaca gcgccagaag cctgggccca gccaccgccc cgctggctct 3361 ggcgctgctg gccggaatgc tgatgctact actaggggct cctgcggagt cctaaactcc 3421 agcacatcta ggctccaccc actaggtccc acgccctcac ctggtccttc ccttccctga 3481 cctcagtgct ccctgcattc tccctgcggg ctctacccca ggatcctctc tctgtctttc 3541 tgctactggc ctcatgtcta gccctacctt gcattgcagc ttccaggttc ctcctaccca 3601 ggcactcaca aaggccaatc acctctgagc tagcagccag cctcagaccc cacagagtta 3661 cttctcccca ggcagcatga ccaccaaggc cttggacctc ccggggcaat ccggactctc 3721 cttttgccct catccatcag cccctagaaa aagataggat cccgcaataa tttgtggagg 3781 accaaacatg cacctgccca ttggcacttc ctccgagctt gaatccatct tacaggctct 3841 gtacccagga ctaaggcaca agagaacaca gagagaggct gtcttcccac tactcctcgg 3901 tctaatctgc tggcaggtgg caaggctacg gtgctgggta ccctagccag cctttgacat 3961 agttcttcct cgatgtctct ggaccagctc cacattcaaa accatcatgg ctcagccata 4021 ccaacccaca gagcgaagat tctgaaatcg ttcagccctt tcatgtctat tgcccagcta 4081 ggagattcaa agagctgtac cccaccccac tctcaggtca tctcaggttg cacctaaatt 4141 tctgaactga gaaaagtccc taacttccca ggtctgcatt cccctgggga gagtcaagtc 4201 aataataaaa gaatgtattc aatacaatag caatagtcat tttctttttc ttcggctcaa 4261 aaccagagcc tagtgcctgc taggaacgtg ctctgccact gatccatagc cccatatcat 4321 ctcctcccct cccctctcct cctccctctt ctccttcccc tcctcctcct atgactctgt 4381 agcccaagct ggcctcaaat ttatgacagt ccacttgcta cagtctccca gatgctggat 4441 tttaagtgtg agccacactc ctagcatctt agtaggacct ttgcagaagg aaagcctgaa 4501 gtgtctggag cactgagttc agatggggga ggggtaatag tggagcctca gttggagaga 4561 gacagccagc tgagcaagat cctgaatgag gtgaaggcct gagccaacac cacacagcag 4621 tgctaatccc ccacccccca ggccagcgat cagctggaag gttgcaacga ctgggtcaga 4681 gagggtggct gggacagagg atgcaaagct ggagctgcaa ggagctgtgg gaggagagga 4741 agaactttaa aatccatggc agtgtggtca caagcctttg aataagaatt caggacgtgg 4801 tactttttct attgcaggaa atatgcaatc ttttcccctt ttttcctgtt ttttttttcc 4861 atggggggtg ggaatgggtg ttagatatag gagctggtca gccagagggg agatgcagac 4921 cctaaccatc tctgacttgc attggaactt ggtggagcac caccccagta tagttcttgg 4981 cccctgtcta acctgcccaa tgaggacatt tgaaggaatt acgtaaaggt ggattaagct 5041 gtgtttctca gtaagttttg caacactaca aatttatctg tacatttatg aaggtacaaa 5101 aacacacttt gctcccacta gtaatattag gaagattgaa tatgcatcct tatttgctaa 5161 aatcttgatt taacactgtg aaacatcaat tcgaaatctt ggctctcgga gtagtttatt 5221 tcaattccgg attttagtgg ctgtcgagaa aatatgggag ctgaatggaa aaaggccatc 5281 gttaacaaag ctt // LOCUS MUSH1EH2B 3605 bp DNA ROD 04-AUG-1994 DEFINITION Mouse histone H1e gene, complete cds and histone H2b pseudogene. ACCESSION L26163 NID g418019 KEYWORDS histone H1e; histone H2b; pseudogene. SOURCE Mus musculus (strain BALB/c, sub_species domesticus) (library: lambda Charon 35; lambda Charon 4A) spleen DNA. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 3605) AUTHORS Brown,D.T. and Sittman,D.B. TITLE Identification through Overexpression and Tagging of the Variant Type of the Mouse H1e and H1c Genes JOURNAL J. Biol. Chem. 268, 713-718 (1993) MEDLINE 93107085 REFERENCE 2 (bases 1 to 3605) AUTHORS Dong,Y., Sirotkin,A.M., Yang,Y.-S., Brown,D.T., Sittman,D.B. and Skoultchi,A.I. TITLE Isolation and characterization of two replication-dependent mouse H1 histone genes JOURNAL Nucleic Acids Res. 22, 1421-1428 (1994) MEDLINE 94248041 FEATURES Location/Qualifiers source 1..3605 /organism="Mus musculus" /strain="BALB/c" /sub_species="domesticus" /db_xref="taxon:10090" /tissue_type="spleen" /tissue_lib="lambda Charon 35; lambda Charon 4A" misc_signal 1153..1178 /note="putative" /citation=[2] /function="H1-consensus signal" CAAT_signal 1231..1240 /note="putative" /citation=[2] TATA_signal 1257..1262 /note="putative" /citation=[2] mRNA 1284..2069 /citation=[2] /evidence=experimental CDS 1347..2006 /citation=[2] /citation=[1] /codon_start=1 /evidence=experimental /product="histone H1e" /db_xref="PID:g418020" /translation="MSETAPAAPAAPAPAEKTPVKKKARKAAGGAKRKTSGPPVSELI TKAVAASKERSGVSLAALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGAS GSFKLNKKAASGEAKPKAKRAGAAKAKKPAGAAKKPKKAAGTATAKKSTKKTPKKAKK PAAAAGAKKAKSPKKAKATKAKKAPKSPAKAKTVKPKAAKPKTSKPKAAKPKKTAAKK K" misc_signal 2044..2088 /note="putative" /citation=[2] /function="histone 3'-end formation signal" misc_RNA 2044..2065 /note="putative" /citation=[2] /function="histone dyad element" CDS 2795..3121 /note="putative" /citation=[2] /pseudo /codon_start=1 /product="histone H2b" BASE COUNT 1062 a 835 c 827 g 881 t ORIGIN 1 catcccagaa ataattactg acactggatt atcattgatc tcctagtcgt taccttggtt 61 gaacagctac ataaataaaa caacccattt aacaccttca aggcatgtct ttgaatctgt 121 ttgaatgtct tctcaaatat gattctaaga ttaaatctta acccctcatt taccctgagg 181 agatccacga aacattattc ccatgtttcc ataaggagta gccatatatg gagaacagct 241 gttcagtaac ggggctagga ttctgtaacg aaggcagtag agatgttctg agctatattt 301 ctcatgtgtg tttctcttga tcaatggctc agttatctag ttttaatata ccacattaca 361 atttagtcct aaaacatgtc agaagaacta tgcaagaccc aagagaaatt cctcaaagta 421 aatcccacct tcatttccaa agacgttgaa aacagcattt atatgtttct tggaaattag 481 tcatttttat gtctgcacca taagtatgtg ttgtactctc ccccgacccc agtactctga 541 atttgaggta cctcaaccca aaggagtagt ttgtccagag aaatagaaaa gcaacagccc 601 cgcccccaac ccatgggaac agaaagaaaa gaattatact aagaaattat tttaaaactt 661 aggagaagcg gcagaaataa aaccatagat atatagactc atggcctatg agagaagaat 721 ttaactcaaa atggtaaata atctgttaag cattgaacta aaccagtttg atgaactcaa 781 atgccctctg ctcaataggc aggactctcg gaggagcctg tgttacttcc ttcacttaag 841 tgcagatttg taataaaaat cataatgcca aaggcaaggt tttgggctac acaagaagct 901 aaccacttgg agaatcgtct ttgatgggtc agaaaactgt acagtccaag atcagtttat 961 aatttacgaa gaaaatagaa agttttgttt ctctgagttg aaatttgctg aacaccgcag 1021 aaatattgca agtttttggc agaaggctta atgctttgct cactttttga gatgtgcatg 1081 aagcccgaga cttcgggaac attatttttt aaaaatcggg caattcgatg tagggaattt 1141 tgacattttt ggcttttttt gaggtgtaac aaacacaact ccgcatctga gaggacactc 1201 cgcggctgcc agcgaggcgg gctggacagc gcaccaatca cggcgcacgt ccgccctata 1261 taaacgggcg ggcgcagcgc cgcggcccga gtcctggcca gtccctctgc ttccggctcg 1321 agttctctct cctcacacgc ttcgccatgt ccgagaccgc gcctgctgcg cccgccgcac 1381 cggcccccgc cgagaagaca cccgtcaaga agaaggcccg caaggccgca ggtggcgcga 1441 agcgcaaaac ctccggaccc ccggtgtccg aactcatcac caaggctgtg gccgcctcca 1501 aggagcgcag cggcgtgtcc ctggctgcgc tcaagaaggc gctggcggcc gcggggtacg 1561 atgtggagaa gaacaacagc cgcatcaagc tcggcctgaa gagcctggtg agcaagggta 1621 ccctggtgca gaccaagggc accggcgcct ccggctcctt caaactcaac aagaaggcgg 1681 cttccggtga ggctaagccc aaagcaaaaa gggcaggcgc ggccaaggcg aagaagcctg 1741 cgggcgcagc caagaagccc aagaaggctg cggggacagc caccgccaag aagagcacca 1801 agaagactcc aaagaaagcg aagaagccag ctgcagctgc aggagccaaa aaagctaaga 1861 gcccgaagaa ggcaaaggca actaaggcta agaaggcgcc caagagccct gccaaggcga 1921 aaacggtaaa gcctaaagcg gccaaaccaa aaacctccaa gcctaaggca gctaagccaa 1981 agaaaaccgc agccaagaaa aagtaaattt tctttggcca actgcttaga agcccaacaa 2041 cccaaaggct cttttcagag ccacccacaa ctctcagtaa aagagctgtt gcacattcac 2101 agggggtgtg gttagggtgg gaacaaacct tagtgggctg agcttagcag aggaggctgg 2161 ggatatgtgg gtgtgcgcaa tggtgttaac tctagcttag ctatctgaag gtctgtacat 2221 aatctcctgt atgtgtggtg gtggtgggta agttcatatc ccacaagtga cccttagaga 2281 ccatgggaat gaagtagttt actcccggct gccacgtcga taacttttaa gggagatagc 2341 agaatgattt agagatagaa tcttaagcat tgagcgattc agtgcgtaaa gagcaggatc 2401 gatgtcctga caaccccacc cgcatctcat ctcagttctg actgctgaaa tcaagcatac 2461 aaacccaact agagtacttt tttttctgtg ttcccccatg tttgatagcg atcgtgctta 2521 aataggttag ctatagcaga tttgtgcagt ttaattcgta cgtcccgatt ggtgaaagcg 2581 aagtgacaag acagccaatg agacagtaga ccttcaagtg tcgtttacgt tcctgttcgt 2641 gacgaaacac taaaactaac ccaatcctga aagaaatctt attaacctca tttgaatact 2701 gcgtaatgta aatgaatgtg gtgttaactg gtgacggagt aggtatttta ccaaatgttt 2761 agtgtgccca agggagggag ttaaatttta cacaatccct gagctggcga agtccgagcc 2821 tgccccacag aaggattctt agaagctgcc atcaacaaac agaaggacaa ggagcgcaaa 2881 cgctactgaa aggagagctt ctagcaagtg tacaacgtgc tgaagcaggt gagccccaac 2941 atgagcatct tatccaaggc catgggcatc aattacattt taaagcacat tgcctgcatc 3001 gtgccgcatt acaaaaagcg agtaaccgtc atatctcagt tgatccagac tgccattctc 3061 ctgcattgcc caggaagctg gcagagcacc aagcctgtta ccaagaaaac aaaccaagtg 3121 cgcatttgca cgaatgattt caaggacttt aaagagccag gcattctttc aacaaatgag 3181 ttgttgttgt ttcaagtatc cttaaactta aaactatgcg gttttttggt ttttgtttgt 3241 ttgtttgttt gtttgtttgt tttttctcaa tctgttggac atgtgatcaa gaagtatagg 3301 ccaagttagc ccggaatttt caatccttcc caagtgaaat ttattcccgt gtactaccat 3361 gcctgccagg gttctcatgc ttcaaacttg tgtttatgtg aactcggtgg tgccctggaa 3421 atgagggtga tcgtaatggt ttcagggaac cacacactca ctggcacata catccacatg 3481 gcatacaagt tcagaatagt taatttgcta tcaattatag gtactaaaat taactaaaga 3541 tgttttaagc atgcagctct ttatctgtgc gcttacgtat ccaagttcct cttctagagg 3601 atccc // LOCUS MUSINT1A 5607 bp DNA ROD 21-DEC-1990 DEFINITION Mouse mammary proto-oncogene Wnt-1 (int-1), complete cds. ACCESSION K02593 M34750 NID g198421 KEYWORDS Wnt-1 oncogene; c-myc proto-oncogene; int-1 oncogene; proto-oncogene. SOURCE Mouse (BALB/c) DNA. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1098 to 5606) AUTHORS Van Ooyen,A. and Nusse,R. TITLE Structure and nucleotide sequence of the putative mammary oncogene int-1; Proviral insertions leave the protein-encoding domain intact JOURNAL Cell 39, 233-240 (1984) MEDLINE 85024897 REFERENCE 2 (bases 1 to 5607) AUTHORS Nusse,R., Theunissen,H., Wagenaar,E., Rijsewijk,F., Gennissen,A., Otte,A., Schuuring,E. and van Ooyen,A. TITLE The Wnt-1 (int-1) oncogene promoter and its mechanism of activation by insertion of proviral DNA of the mouse mammary tumor virus JOURNAL Mol. Cell. Biol. 10, 4170-4179 (1990) MEDLINE 90318383 COMMENT Draft entry and computer readable sequence for [2] kindly submitted by R.Nusse, 31-MAY-1990. FEATURES Location/Qualifiers source 1..5607 /organism="Mus musculus" /db_xref="taxon:10090" /map="15" misc_feature 1133..1134 /note="MMTV insertion site in tumor 17 [1]" misc_feature 1394..1395 /note="MMTV insertion site in tumor 35 [1]" mRNA 1428..5398 /note="Wnt-1 mRNA (alt.)" TATA_signal 1562..1566 TATA_signal 1573..1577 mRNA 1595..5398 /note="Wnt-1 mRNA (alt.)" misc_feature 1631..1632 /note="MMTV insertion site in tumor 102 [1]" exon 1779..1882 /partial /gene="Wnt-1" /note="Wnt-1 protein" /number=1 gene join(1779..1882,2452..2705,3279..3544,4002..4489) /gene="Wnt-1" exon <1779..1882 /gene="Wnt-1" /note="Wnt-1 protein" /number=1 CDS join(1779..1882,2452..2705,3279..3544,4002..4490) /partial /gene="Wnt-1" /note="secretory glycoprotein" /codon_start=1 /db_xref="PID:g387388" /translation="MGLWALLPSWVSTTLLLALTALPAALAANSSGRWWGIVNIASST NLLTDSKSLQLVLEPSLQLLSRKQRRLIRQNPGILHSVSGGLQSAVRECKWQFRNRRW NCPTAPGPHLFGKIVNRGCRETAFIFAITSAGVTHSVARSCSEGSIESCTCDYRRRGP GGPDWHWGGCSDNIDFGRLFGREFVDSGEKGRDLRFLMNLHNNEAGRTTVFSEMRQEC KCHGMSGSCTVRTCWMRLPTLRAVGDVLRDRFDGASRVLYGNRGSNRASRAELLRLEP EDPAHKPPSPHDLVYFEKSPNFCTYSGRLGTAGTAGRACNSSSPALDGCELLCCGRGH RTRTQRVTERCNCTFHWCCHVSCRNCTHTRVLHECL" intron 1883..2451 /note="Wnt-1 cds intron A" exon 2452..2705 /gene="Wnt-1" /note="Wnt-1 protein" /number=2 intron 2706..3278 /note="Wnt-1 cds intron B" exon 3279..3544 /gene="Wnt-1" /note="Wnt-1 protein" /number=3 intron 3545..4001 /note="Wnt-1 cds intron C" exon 4002..>4490 /note="Wnt-1 protein" /number=4 misc_feature 4495..4496 /note="MMTV insertion site in tumor 53 [1]" BASE COUNT 1188 a 1681 c 1519 g 1219 t ORIGIN Chromosome 15. 1 atgtatgtat gtatgtatgt atgtatgtat acgtgcgtgc acctgtgtgt gcttggtgtc 61 agtggggctc agacatcacc tgattccctg gaactggagt tacaggtggc tataagccac 121 cacttgggtg ctgagaacag agtccgggcc tctggcagag cagtcagtgc ttttagccac 181 tgagccactc tcatcccccc aattatgttc atcttgagtt gggcaggtac ggtggcggaa 241 taggcctgta atcccagcag tcactggacc atcatgggtt ctacatatta aacctttatg 301 ttaggtaggg tcacacagca agatccggtc acaaaaccag caacaacaaa aaccaaaagg 361 agccagcttc ttcccacaag cattctttcc ctcaggtctt cagctccatc tgacagctac 421 tcggctggtg gtcctatcct ttctgagcct agttgccaga gaaacaagcc cggttcatct 481 tcatgactag cacatctaat gataagcaca ggttgactca aggtgccata gagtgacact 541 aggtacccag agcgacagaa tgacacctat gagtgcacgt cgttaatcac aaacacacac 601 acacacacac acacacacac acacacacac tcatgcaccc acctgcaaac acaattgcag 661 ccttctggac gtctcctgtc acagccccac ctccttcctg atacactgcg ttaagtggtg 721 actgtaacaa aatgacttca tgctctccct gtcctgagcc aaattacaca attatttgga 781 aagggctcaa aatgttcttc gttagaagtt tctggataca ccaatacaca ggagcgtgca 841 ccctcagaac acatgtacac tttgacttaa tctcacgggt gacacaccga cgcttacact 901 ccccctagcc cacagaggca aactgctggg cgcttctgag tttctcactg ccaccagctc 961 ggtttgctca gcctaccccc gcaccccgcg cccgggaatc cctgaccaca gctccaccca 1021 tgctctgtct ccttcttttc cttctctgtc cagccgtcgg ggttcctggg tgaggaagtg 1081 tctccacgga gtcgctggct agaaccacaa ctttcatcct gccattcaga atagggaaga 1141 gaagagacca cagcgtaggg gggacagagg agacggactt cgagaggaca gccccaccgg 1201 cgcgtgtggg ggaggcaatc caggctgcaa acaggttgtc cccagcgcat tgtccccgcg 1261 ccccctggcg gatgctggtc cccgacgggc tccggacgcg cagaagagtg aggccggcgc 1321 gcgtgggagg ccatcccaag gggaggggtc ggcggccagt gcagacctgg aggcggggcc 1381 accaggcagg gggcgggggt gagccccgac ggttagcctg tcagctcttt gctcagaccg 1441 gcaagagcca cagcttcgct cgccactcat tgtctgtggc cctgaccagt gcgccctggt 1501 gcttttagtg ccgcccgggc ccggaggggc agcctcttct cactgcagtc agcgccgcaa 1561 ctataagagg cctataagag gcggtgcctc ccgcagtggc tgcttcagcc cagcagccag 1621 gacagcgaac catgctgcct gcggcccgcc tccagactta ttagagccag cctgggaact 1681 cgcatcactg ccctcaccgc tgtgtccagt cccaccgtcg cggacagcaa ccacagtcgt 1741 cagaaccgca gcacagaacc agcaaggcca ggcaggccat ggggctctgg gcgctgctgc 1801 ccagctgggt ttctactacg ttgctactgg cactgaccgc tctgcccgca gccctggctg 1861 ccaacagtag tggccgatgg tggtaagtga gctagtacgg ggtccgccac ttgtcctggg 1921 gcaaagagcc aggcacgggc cttacccagc tcccacgctg tggggatcac caacctacag 1981 acccccctcg tgcattgtga cttcacatcc agggtgctca cacctagaac tagctctgct 2041 gaagtggggc acatcattgg catgcagaag cccagataca ccaggctcag agaccattcc 2101 catttaatac gaccccgttt ctgctgagca acaggtccca acctcgctgt ggtgggtgct 2161 caggtgtccc ttaggtcttg aaccaaaaaa aaaaaaaaaa aaaaaaaaaa accagatatt 2221 agctttgagg tgagggagtg gaattcctaa gtttttcaag gtgggcaagg ctgcaggtgg 2281 ggtttctcct cgggggctga cttgaagaaa ggaagagcta aggtagccat gccttttctg 2341 tccactcact agactctgga gctcagggcc aggcaaggat agggtggtac agcctgtatg 2401 gttaggatgc aggtcccctc ccctggactg aacccttatg catcccgcca ggggcatcgt 2461 gaacatagcc tcctccacga acctgttgac ggattccaag agtctgcagc tggtgctcga 2521 gcccagtctg cagctgctga gccgcaagca gcggcgactg atccgacaga acccggggat 2581 cctgcacagc gtgagtggag ggctccagag cgctgtgcga gagtgcaaat ggcaattccg 2641 aaaccgccgc tggaactgcc ccactgctcc ggggccccac ctcttcggca agatcgtcaa 2701 ccgaggtggg tgcccaggaa agcgacgctt ccgggattaa gggaaaagca gggtcatctc 2761 cagggcatag gcgggcgaag gcagggaaga catcccaggg ttatatgtga tcaaactgag 2821 aatcgcctgg tgccggcagt taccgtaggt cagcaccaga ttctttctag ccttgcgttg 2881 tgagcatgat ctttaacgtt gctggccact ggcccacaga aagggaattc cggatcgtgg 2941 gcgctgggcg acagctgttt ttccctagcc ttcctcaaag gtacctggga agctgatctc 3001 tgagggctag ctagggttgt gcttcgcacc cagcaaagtt tgcactgcca atactagtag 3061 cgatcttggc tatgcagatt tgttctactt gggaatctcc ccttggagct gctctgctag 3121 ggctctggag tctcagtaaa gcttagagag gagggcattc catgcttcgc acacatgact 3181 ccaaggatgt tggactgtag ggtaccaagt cttccaaaca gggtgctgag ttggccccac 3241 gccttctctc aactgatgcg gggtcgcttc acccacaggc tgccgagaaa cagcgttcat 3301 cttcgcaatc acctccgccg gggtcacaca ttccgtggcg cgctcctgct ccgaaggctc 3361 catcgagtcc tgcacctgcg actaccggcg gcgcggccct gggggccccg actggcactg 3421 ggggggctgc agtgacaaca tcgattttgg tcgcctcttt ggccgagagt tcgtggactc 3481 cggggagaag gggcgggacc tacgcttcct catgaacctt cacaacaacg aggcagggcg 3541 aacggtacgt cggtgtgtcc ggaaccaatg gcaggggaga tgtaagacag gtgcacgggg 3601 acagaggcac agggaggggc ttcccgagag agtgggactc taggagggaa gacagagaag 3661 aggtggtggt tgagggcaaa gaggttcctg agctgatgac agaacagaag agattagcag 3721 gctatcaaca cgtgggatgt attgagatgg ctccatggca cacttttgaa agataaaagt 3781 gacttgctgg cgtggagcag agtctggccg aatgtcccta tctcagcggg ccattttgca 3841 cttcctctct cccgagctta gtcacacctg gaccttggct gaagtttcca cagcatcgac 3901 gtgacccggg tggggtgggg gtggggaagt atgggtggtg gttcgtggga tgttggcttt 3961 gaccttttct tccctcctcc cctcgtcccc tcctccccca gaccgtgttc tctgagatgc 4021 gccaagagtg caaatgccac gggatgtccg gctcctgcac ggtgcgcacg tgttggatgc 4081 ggctgcccac gctgcgcgct gtgggcgacg tgctgcgcga ccgcttcgac ggcgcctccc 4141 gcgtccttta cggcaaccga ggcagcaacc gcgcctcgcg ggcggagctg ctgcgcctgg 4201 agcccgaaga ccccgcgcac aagcctccct cccctcacga cctcgtctac ttcgagaaat 4261 cgcccaactt ctgcacgtac agtggccgcc tgggcacagc tggcacagct ggacgagctt 4321 gcaacagctc gtctcccgcg ctggacggct gtgagctgct gtgctgtggc cgaggccacc 4381 gcacgcgcac gcagcgcgtc acggagcgct gcaactgcac cttccactgg tgctgccacg 4441 tcagctgccg caactgcacg cacacgcgcg ttctgcacga gtgtctatga ggtgccgcgc 4501 ctccgggaac gggaacgctc tcttccagtt ctcagacaca ctcgctggtc ctgatgtttg 4561 cccaccctac cgcgtccagc cacagtccca gggttcatag cgatccatct ctcccacctc 4621 ctacctgggg actcctgaaa ccacttgcct gagtcggctc gaaccctttt gccatcctga 4681 gggccctgac ccagcctacc tccctccctc tttgagggag actccttttg cactgccccc 4741 caatttggcc agagggtgag agaaagattc ttcttctggg gtgggggtgg ggaggtcaac 4801 tcttgaaggt gttgcggttc ctgatgtatt ttgcgctgtg acctctttgg gtattatcac 4861 ctttccttgt ctctcgggtc cctataggtc ccttgagttc tctaaccagc acctctgggc 4921 ttcaaggcct ttcccctccc acctgtagct gaagagtttc cgagttgaaa gggcacggaa 4981 agctaagtgg gaaaggaggt tgctggaccc agcagcaaaa ccctacattc tccttgtctc 5041 tgcctcggag ccattgaaca gctgtgaacc atgcctccct cagcctcctc ccaccccttc 5101 ctgtcctgcc tcctcatcac tgtgtaaata atttgcaccg aaatgtggcc gcagagccac 5161 gcgttcggtt atgtaaataa aactatttat tgtgctgggt tccagcctgg gttgcagaga 5221 ccaccctcac cccacctcac tgctcctctg ttctgctcgc cagtcctttt gttatccgac 5281 cttttttctc ttttacccag cttctcatag gcgcccttgc ccaccggatc agtatttcct 5341 tccactgtag ctattagtgg ctcctcgccc ccaccaatgt agtatcttcc tctgaggaat 5401 aaaatatcta tttttatcaa cgactctggt ccttgaatcc agaacacagc atggcttcca 5461 acgtcctctt cccttccaat ggacttgctt ctcttctcat agccaaacaa aagagataga 5521 gttgttgaag atctcttttc cagggcctga gcaaggaccc tgagatcctg acccttggat 5581 gaccctaaat gagaccaact agggatc // LOCUS MUSALIFA 8757 bp DNA ROD 10-APR-1991 DEFINITION Mouse leukemia inhibitory factor (LIF) gene, complete cds. ACCESSION M63419 J05435 NID g191877 KEYWORDS glycoprotein; leukemia inhibitory factor. SOURCE Mouse DNA. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 8757) AUTHORS Stahl,J., Gearing,D.P., Willson,T.A., Brown,M.A., King,J.A. and Gough,N.M. TITLE Structural organization of the genes for murine and human leukemia inhibitory factor: Evolutionary conservation of coding and non-coding regions JOURNAL J. Biol. Chem. 265, 8833-8841 (1990) MEDLINE 90256813 FEATURES Location/Qualifiers source 1..8757 /organism="Mus musculus" /db_xref="taxon:10090" mRNA join(791..980,2541..2722,3280..>8757) /gene="LIF" exon 791..980 /gene="LIF" /number=1 gene join(791..980,2541..2722,3280..>8757) /gene="LIF" mRNA join((800.801)..980,2541..2722,3280..>8757) /gene="LIF" exon (800.801)..980 /gene="LIF" /number=1 exon (898.899)..980 /gene="LIF" /number=1 mRNA join((898.899)..980,2541..2722,3280..>8757) /gene="LIF" exon 902..980 /gene="LIF" /number=1 mRNA join(902..980,2541..2722,3280..>8757) /gene="LIF" CDS join(962..980,2541..2722,3280..3690) /gene="LIF" /codon_start=1 /product="leukemia inhibitory factor" /db_xref="PID:g191878" /translation="MKVLAAGIVPLLLLVLHWKHGAGSPLPITPVNATCAIRHPCHGN LMNQIKNQLAQLNGSANALFISYYTAQGEPFPNNVEKLCAPNMTDFPSFHGNGTEKTK LVELYRMVAYLSASLTNITRDQKVLNPTAVSLQVKLNATIDVMRGLLSNVLCRLCNKY RVGHVDVPPVPDHSDKEAFQRKKLGCQLLGTYKQVISVVVQAF" sig_peptide join(962..980,2541..2563) /gene="LIF" exon 2541..2722 /gene="LIF" /number=2 mat_peptide join(2564..2722,3280..3687) /gene="LIF" /product="leukemia inhibitory factor" exon 3280..>8757 /gene="LIF" /number=3 BASE COUNT 1906 a 2360 c 2397 g 2072 t 22 others ORIGIN 1 gagtcacctt tgtactctga ggccaagcac taggggcttc caaagagtgt cagccattcc 61 taggcaccca gaggaagcag gagccccatt ttgcccagcc tccctgatca ttgctttagg 121 ggtcccttca acgtagtaag gctgagttct tgcccagata gttcccagat agacgataca 181 acctcatact cttaccagtc ctccatagag ggggcatgct gatggctggg ggtgggcaag 241 accctctggc acactggcag caagcccctt tgtgtcttct ctttacccat tggcaatgga 301 caggctggag aaaggccacc cctaaggtga ctcccagcgt ccacattctg cttcacttta 361 tggctgtcta aagttcaaaa gccccggccc cccttctgag aaactgcttc tcagtggaac 421 cccagcctga ctccttttgc tgccccgcaa agccccacca ccaccagggg acacaaaggt 481 aacctccagt ctgttgccac cctcccaaac ccagttgaaa ctggaacgtc tgaagggggg 541 cacaggctga ggacccctct gaatccccct gagacccttt ccccaccaga cccatctgca 601 aaaaacgccc agagcaaacc acttaggaaa accacagggc ggttttttgt tgttgttgaa 661 gacttcatta taattttatc aatcaaattc ttagaagaag gaaaaagtct gccctcccca 721 ccctcccccc tcactcttcc cccctccccc ttcactctca ctttcttcca ttcataattt 781 cctatgatgc acctcaaaca acttcctgga ctggggatcc cggctaaata tagctgtttc 841 tctctgtctt acaacacagg ctccagtata taaatcaggc aaattcccca tttgagcatg 901 aacttctgaa aacggcctgc atctaaggtc tcctccaagg ccctctagag tccagcccat 961 aatgaaggtc ttggccgcag gtaaatccat gcgccgggcc gcgatttaag agtcccggct 1021 gcgggcgcgc cggcagcctg gggcgctggg cgagcacccg cgggacaccc gccccccgcg 1081 ccactcggac acttggggcg cccgcgcagt ccgggcccgc cggaggcgct gggtggcggc 1141 cgggagcgag cgccgcacat ggtccgggca ccgcgcgccc agcccgccgg ggccagcagg 1201 ttggcggtga gggacctggc ccggcgtagc cttgctccac cgccttctct cctcgcccca 1261 ctgctctcct gtgctgctct tacttctccc gtgtcgccta gatttaccct ccctctttct 1321 ttttctttct ttccgctttc tcttttccaa ctgcgtcccg ggctgctccc tgggaggggc 1381 gcaggcggct gagcagcttg caaactccgg ccaggaccgg cgcaggtgcg gcttccgtct 1441 gctagtccct ggaaagctgt gattggcgcg agatgagatg caggtatggg tacccccgca 1501 gctattccgg ggccctaaca gtgtttggga tgctgagatc gatagagact ccaggggagt 1561 aaacttaagg ttctagggtc tttttagtcc caggatgggg agttcaggtg ggaggtctcc 1621 agcctccctt aagagatgac ttcagcctct atcaccttct ggtgtcttta gcctggtgtt 1681 ggaaaggctg gagctggggt gagggggccg gtgaagctaa aagtgaagga cgggcaggcg 1741 ccccggggga ggggtgatcc tcgacaggcc cggacagctc ccggctctcc tgacaccttt 1801 cgctttcctc ttgcgtgtcc gcctgcgacc tttccccacc ccggcctctt tcctggttgc 1861 accacttcct ctcattccaa agtatgtgga tttttttggg cggggggggg gctgtagcaa 1921 gcaggggtga aagatgattt ggttggagag tgggcacctg cagaacaaac aggtggaaag 1981 ttctctatag ggcctcattg aggaagatgg gattaaaggc aatcgtttta ggaagccaga 2041 gtctagtggc agttttaaga gatggtctac agaaacccca gctcccagcc tgcacggcct 2101 gcaaggggtt cagctcagag ttctaggctc cagtgggcca atggacacca accaagttaa 2161 gaatctggcc ggtcctggaa aggggaggac ggtttatttc tcttcccctc tcctggacct 2221 aataccatcc atctagggcc aggaggatga ggctagatgg ttgagtgccc ccccccccag 2281 acctagcttt cagttggtac aaatggctcc caccttgata ccttgcctct taatccagtg 2341 gcatagatgg ggaaactgag gcctgggagc agggaaagag aaagccagag agccagggaa 2401 cctcagtaga gcaggagact aaaggctgat ggagggtggg agggggcagg aagggcctcg 2461 ctgaggtgcc cccccaaccc ccgtgccgtg ccctcagcca ggcttcttgc ctttccagtc 2521 accctccgtg tctcttccag ggattgtgcc cttactgctg ctggttctgc actggaaaca 2581 cggggcaggg agccctcttc ccatcacccc tgtaaatgcc acctgtgcca tacgccaccc 2641 atgccacggc aacctcatga accagatcaa gaatcaactg gcacagctca atggcagcgc 2701 caatgctctc ttcatttcct atgtaagtta ctttcctggg gtactgagga aggggggggc 2761 tgcctggccc ggaggggtgc ccttcagagc tggaagagcg ctgtgggaac ccatggctcc 2821 ctccccacac cctagccaaa gcacagagac tggtgggcac cactcgccag caagttgggg 2881 tgagcggcgg ggactgtgct ttctgtcttg tcccatggct cagggtacca aagaagaggc 2941 tatgcagtga atggacaggg aggtgtcatt gaaagcagtg tgtgtggggg gcccaggaag 3001 aggctggggt gactgaagtg caagtgtatg tggtgttctg gctgaggtga cacctgcgac 3061 atgccacatt tcctctatcc atttatgtca ccgtgacctt ggtgagtgag ttcacatttc 3121 tgatcattgc tgactaatga ttctagttgc ctacagggca gcaagtggag tccccatgtc 3181 acaggtgggg aaacagaagt gcaagagctt gccccaaggg ttggtggcgg gctagaacac 3241 tcaccctgac tcccacatca cctctcctct cttctgcagt acacagctca aggngagccg 3301 tttcccaaca acgtggaaaa gctatgtgcg cctaacatga cagacttccc atctttccat 3361 ggcaacggga cagagaagac caagttggtg gagctgtatc ggatggtcgc atacctgagc 3421 gcctccctga ccaatatcac ccgggaccag aaggtcctga accccactgc cgtgagcctc 3481 caggtcaagc tcaatgctac tatagacgtc atgaggggcc tcctcagcaa tgtgctttgc 3541 cgtctgtgca acaagtaccg tgtgggccac gtggatgtgc cacctgtccc cgaccactct 3601 gacaaagaag ccttccaaag gaaaaagttg ggttgccagc ttctggggac atacaagcaa 3661 gtcataagtg tggtggtcca ggccttctag agaggaggtc ttgaatgtac catggactga 3721 gggacctcag gagcaggatc cggaggtggg gagggggctc aaaatgtgct ggggtttggg 3781 acattgttaa atgcaaaacg gggctgctgg cagaccccag ggatttccag gtactcactg 3841 cactctgggc tgggccatga tggaatctgg caaagttgaa acttccatag gcagagcttc 3901 tatacagccc agcaccagct agaaatggca atgagggtgt tggtctgaga gatttctgtc 3961 tcactcactc actcactcac tctcactcac tcactcactc actcactcag ccccttgctt 4021 gctgggtgta gaacaagctg ccacaagttg tctacagcag acagcaaagg gctgggaagt 4081 gtcctagacc cctacagagt caccatcatc tggtcctttg ctgtctctca gagaaacttt 4141 ggaaggcttg gttgggatgt gagagagcta aggggactgg gatccagaag gaatcctttt 4201 attttatttt attttatttt attttatttt attttataag ttttgtgggt ggaagggtac 4261 cctggggtgg aatgatggaa tgtgtcttct cttgagttgg atgagagagt tcaggcttag 4321 agactgtcag atggaagagt ctacctcacc agtgttcagc tcccacagaa gcacagcggc 4381 cagcttccag ttgtcaaagc ctgacgaact cggttagctt ctatgcagtt ccccccacag 4441 cctggcgtgg ttggggtctg ccagctggac ctagaggtga ggtgtgtgca ggcaggaaga 4501 ggcaggctgc aaaggcaggt tcccagagtc ctcccgggga aggacctcta actgtctagg 4561 agtcagggaa ggagcaaggc agccagccat tgctgaggca gtagccgact gcagctctca 4621 tctgcttctc aacccctgag aacaggtgat cttgagcaga cagacaggta gcataaagta 4681 gaatgtcggg tctgaggccc cggaggtcgc aaaggtactt gaaggggacc agagggctgt 4741 cttgggtccc tggagcatgg agaagcagaa cttgaggtca gggtctcagg gaagatgagg 4801 cccagagtgc tgtgtttgat ccagcacagc tgtctattta ttactatgtc ctatttatat 4861 taacttattg gtgctttaaa tggcaaagtt aattccccga aatggtatga ggctccttcc 4921 atgggagctg gggccgagac tctccaccta gtggggcctg gtctggaggc acatgattgt 4981 tacaggtgca gctcatgggt caaatcagag agctggctag ctcctctgtc tcccactgtg 5041 actcactttt agggtgtcag ggtcccccag aaaaagctgg gccagtttgt ctctctgctt 5101 ctgtctctgt ctctccgagt ctgtctctgt ctgtctctgt gtctctgtct ctttgtctct 5161 ctctgtctct gtctctctct ctctctcccc gccccccccc cttctccctc tggtctccaa 5221 gggggtggaa cagtttcttg ttgttttgtc ccactgagct ctctggcacc ccctagattc 5281 ctgctatgcg gtgcaccatt cataatgaag tgaatggctc tggaaccttg ggcaaaactg 5341 attccttcct caaatcgtag ctgaggagtg ctgaaacatc ctgacccggc acccagcgtg 5401 ctttcgacca gcatggaagc tcctcgggtg gcccgaacac ccacagaggg tgaatacagg 5461 aggttggagc agtgcaggcc ctgaactggg cctgaacagc tgcccagtgc gccagagaag 5521 gggagatcaa ggcccgagac gcctgggaca cagaccagga agctgtggtc cttgcttcat 5581 cgctgccttc ccactcccgc ccatgtctgg gctcccaggc agggaatccg atctgatctc 5641 tcctttgtgc tgaggccagg caagcagagg aacgccctcg atctgggagc agggtaggga 5701 ggaaggcagc caagctgggg cagtggctga ctacagagct agctgcctgc ctctcaggct 5761 ctgaacaggg cggtccttag cagttcagca gtgggattct gcttcacgcg gttttgcacc 5821 tttctctgtc actctctaag cactttacct ggacggcagg tggacaggcc ctggagctct 5881 ggcttaggaa aggcctggaa ccatagatgc agcaaggaga ctatggtggg ggccacgcgt 5941 gtcagcgaca aagttactcc accgtactcc tgttgctgcg tcaggctcat ctcaggactg 6001 gctgcccttc tccaagctga gagtcaattt gtctaaaagc caagatgatg ccacagcctg 6061 gggcctgttg ggctttgtca tcacttcaca tttgtatgga cttggactct ctgctccgcc 6121 cacctggcag ctttgaaggc tcagggacca atggactctc tccgtgcacg cccccgtccc 6181 cccaacgcaa ccacctacct gcgtcttact ccatcagttg cccagcatcc cagaacctta 6241 gagcctttgg ggaaaacaga ctttaggggc aggtagttgc tcacctgaca tctttcacct 6301 ggaagcattg acttccaccg agcatagtag gtagtgtgtc tggaccagag aaaaagggat 6361 ggggcatttt gcagtttatc cagagagaag caaaggggcc tttatttatt atttaaaact 6421 tcaaacctga aagcactgag agtttactgg tctgcccccc tccccccact cttgtctatt 6481 tctgtgtcct tgatccccga ctcaagcaac ccagctctgc tttgcctgct ctctggagca 6541 gacatggtat gtgggccagg accccggagt cttgcatggt agcggcttca gaagggaaat 6601 gatatggctg tctgcattcg gatgactccc cagtcccagc ccagcctctc ctttgcactg 6661 ctgctctccc tctttccttt cctttggaag ggacttggcc ttgggtgaca aattcctctt 6721 tgatgaatgt accctgtggg aatgtttcat actgacagat tatttttatt tattcaatgt 6781 catatttaaa atatttattt tttatactga aggagtgtct ttttttttta aagaaaaaat 6841 gaaataataa agaactcatt cttgttgagc cttctggagt gcattgattt ggtgagggct 6901 tgtgggggaa cctcctgttg ggactatagg aggattggaa gatgagattt cctccatcgg 6961 tccaggaggg gtaggagggt ggtatttggg cccagaataa cagtgagtgg tgtttagaat 7021 atgaggtgcc actaagagag gggaccgatg gcttccatgg gtactggtgt gacatctcac 7081 taagcacaac agccagtttg ttcccggtta gctgaggaga gaactgaagc ccagagcgag 7141 gaactgaata tccagagtaa tctaaaatgt cgaggtcacc cctcccagtg agaggtgacc 7201 ccagctagtc cccagccaat gggtgtgggc ccactcagct cccaaccctc atgttctctc 7261 tgcgtggagt ttcagggatt tggaaatggc gtctgtctgg agctccgtca cccaggaagg 7321 gtccacaggc acaggagcag catcaacatt ggtccagagg ctcctatcag aatcacacga 7381 gacattgtgc cctaagtggg ctaactaagc agccatgaca gcatttctgc aaggcttttt 7441 tggggggggt accgatggct ctcagggagt gcttcctccc annnagagat gccttctcca 7501 cagcnccctt tctctacctt ggggaacaat ggggtcaaac tgttagacac aatgtgtctg 7561 gtaggtcccc taaatatctc ctgtactttg ctctggggtt tgaatcttgg gctccccctg 7621 agagattcag ttactggaaa gcannnnnnn nnnnnnnnaa gctttttagg cattggtaag 7681 tccaaggcag gaggttacag aatcatnngg caggcatctt cagccagcat ctcatcttct 7741 ctctgccagg ataggactga ctgcctccca cttctccagc tctgcttagt atgggcttcg 7801 cagggagctg gtgaggaatg tggagtccgt ggagtccagt gtcttgcctc gatgtatgag 7861 aggctgcagc ttaaaaacca gccagactcc tgggcatctg tggatatgtg gagtgagagg 7921 aaggctaccc tcaccagccc ctcatgacat cacccactaa cccagcaaag tggcctggtg 7981 gcgaccaaca tcgggaaagg ttctgaacat ccccttgctg gaagcccacg tgtcctttcc 8041 tcctgcgggc agatgagtgc tgccccgccc aaacaggccc ttatccatcc ttaaacctta 8101 tctacagttc agggaagagc agattgaaga tgcttagaag cacagtcagg agctgcaggg 8161 gtgtggacct aaggcagcct ctggggttgg tttctcaccc agggctttgg ttcttgggtt 8221 gaatccacca cctgcctgcc ctctgggaga ggccctggcc tggcctgatg gaacaatgga 8281 ctattaaagg ccattaggtc acactgagag tactgtgaca atcactgtgt gataggaaga 8341 agcggtggtg ggcctgctgt gctggaaagg ttggctgggg tccttgcctc tctccatcac 8401 cgtttatctc tcaatctatg aacctgtgag agcaatcaga gaggagagcg gccctcttta 8461 cttaaagtgg tgcccgaggg agctaaggga cagaaagaca gagatgtcat gtgaattgca 8521 tatctcgcct ggcctagctt gtttagccaa ggatggcagg acaggaagca gggtgaatgt 8581 cacagcccag ctgcctgctt tgctgtgaga gtctcatggc tccctgatgg ttgtggaagg 8641 gaaggttctg ctaactctga gtccaaagga tgccaggaca atctggaaga aggtcaacca 8701 atatcccaac aattgcttcc ctgatggtaa ctgagtccca ggtcccagga ccggatc // LOCUS MMTNFBG 3219 bp DNA ROD 08-MAY-1993 DEFINITION Mouse tumor necrosis factor-beta (lymphotoxin) gene. ACCESSION Y00137 NID g54842 KEYWORDS lymphotoxin; signal peptide; tumor necrosis factor. SOURCE house mouse. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 3219) AUTHORS Gray,P.W., Chen,E., Li,C.B., Tang,W.L. and Ruddle,N. TITLE The murine tumor necrosis factor-beta (lymphotoxin) gene sequence JOURNAL Nucleic Acids Res. 15 (9), 3937 (1987) MEDLINE 87231097 FEATURES Location/Qualifiers source 1..3219 /organism="Mus musculus" /db_xref="taxon:10090" TATA_signal 1153..1158 exon 1179..1285 /number=1 prim_transcript 1179..3121 intron 1286..1622 /number=1 exon 1623..1727 /number=2 CDS join(1632..1727,1811..1910,2135..2547) /codon_start=1 /product="lymphotoxin" /db_xref="PID:g54843" /db_xref="SWISS-PROT:P09225" /translation="MTLLGRLHLLRVLGTPPVFLLGLLLALPLGAQGLSGVRFSAART AHPLPQKHLTHGILKPAAHLVGYPSKQNSLLWRASTDRAFLRHGFSLSNNSLLIPTSG LYFVYSQVVFSGESCSPRAIPTPIYLAHEVQLFSSQYPFHVPLLSAQKSVYPGLQGPW VRSMYQGAVFLLSKGDQLSTHTDGISHLHFSPSSVFFGAFAL" misc_feature 1632..1727 /note="precursor polypeptide" sig_peptide join(1632..1727,1811..1813) intron 1728..1810 /number=2 exon 1811..1910 /number=3 mat_peptide join(1814..1910,2135..2544) /product="lymphotoxin" intron 1911..2134 /number=3 exon 2135..3121 /number=4 polyA_signal 3100..3105 polyA_site 3121 /note="polyA site" BASE COUNT 724 a 971 c 744 g 780 t ORIGIN 1 tgaaagctcc ctctgtacag agcattggaa gcctggggtg tacatttggg gttacatgat 61 cttggggttc taagagaata cccccaaatc atcttccaga cctggaacat tctaggacag 121 ggttctcaac cttcctaact ccatgaccct ttaatacagt tcctcatgtt gtggtgaccc 181 caaccataca attattttcg ttgctatttc ataactgtaa tttcgctgct attatgaaca 241 taatgtaaat atttgtttta aatagaggtt tgccaaagag accttgccac aggttgagac 301 tgccgctcca gagagtaagg gacacattaa aattgttaca caccagatcc cccaaatttg 361 gggagagggc actgtaatgg aacttcttga cattaaactg gcagataaac tggcagaaaa 421 aaaaaaaaaa aagctgggca gtggtggcac acacctttaa tcccagcact tgggaggcag 481 aggcaggcgg atttctgagt tctaggccag cctggtcgac agagtgagtt tcaggacagc 541 cagggctaca cagagaaacc ctgtctcgaa aaaagcaaaa aaaaaaaaaa aaaactggca 601 gatgaccaga aaatacagat atattggaat aactgtgact tgaaccccca aagacaagag 661 aggaaatagg cctgaagggg cggcaggcat gtcaagcatc cagagccctg ggttcgaacc 721 tgaaaaaaca aaggtgccgc taaccacatg tggcttcgga gccctccaga catgaccatg 781 atcgacagag agggaaatgt gcagagaagc ctgtgagcag tcaagggtgc agaagtgata 841 taaaccatca ctcttcaggg aaccaggctt ccagtcacag cccagctgta ccctctccac 901 gaattgctcg gccgttcact ggaactcctg ggcctgaccc agctccctgc tagtccctgc 961 ggcccacagt tccccggacc cgactccctt tcccagaacg cagtagtcta agcccttagc 1021 ctgcggttct ctcctaggcc ccagcctttc ctgccttcga ctgaaacagc agcatcttct 1081 aagcctgggg cttccccaag ccccagcccc gacctagaac ccgcccgctg cctgccacac 1141 tgccgcttcc tctataaagg gacccgacgc cagcgcccag gaccccgcac agcaggtgag 1201 cctctcctac cctgtctcct tgggcttacc ctggtatcag gcatccctca ggatccccag 1261 ccttaatggg tctggtcctc ctgtcgtggc tttgattttt ggtctgttcc tgtggcggcc 1321 ttatcagtct ctctctctct ctctctctct ctctctctct ctctctctct ctctctctct 1381 ctctctctct ctctttctct ctctctgcct ctgttagcca ttgtctgttt ctatggtgga 1441 gctttcctct tcccctctgt ctctccttat ccctgctcac ttcagggttc ccctgcctgt 1501 ccccttttct gtctgtcgcc ctgtctctca gggtggctgt ctcagctggg aggtaaggtc 1561 tgtcttcctc tgtgtgcccc gcctccgcta cacacacaca ctctctctct ctctctcagc 1621 aggttctcca catgacactg ctcggccgtc tccacctctt gagggtgctt ggcacccctc 1681 ctgtcttcct cctggggctg ctgctggccc tgcctctagg ggcccaggtg aggcagcaag 1741 agattggggg tgctggggtg gcctagctaa ctcagagtcc tagagtcctc tccactctct 1801 tctgtcccag ggactctctg gtgtccgctt ctccgctgcc aggacagccc atccactccc 1861 tcagaagcac ttgacccatg gcatcctgaa acctgctgct caccttgttg gtaaacttct 1921 gcctccagag gagaggtcca gtccctgcct tttgtcctac ttgcccaggg gcccaggcga 1981 tcttcccatc tccccacacc aacttttctt accctaaggg caggcacccc actcccaatc 2041 tccctaccaa ccatcccact tgtccagtgc ctgctcctca gggatgggga cctctgatct 2101 tgatagcccc ccaatgtctt gtgcctcttc ccagggtacc ccagcaagca gaactcactg 2161 ctctggagag caagcacgga tcgtgccttt ctccgacatg gcttctcttt gagcaacaac 2221 tccctcctga tccccaccag tggcctctac tttgtctact cccaggtggt tttctctgga 2281 gaaagctgct cccccagggc cattcccact cccatctacc tggcacacga ggtccagctc 2341 ttttcctccc aatacccctt ccatgtgcct ctcctcagtg cgcagaagtc tgtgtatccg 2401 ggacttcaag gaccgtgggt gcgctcaatg taccaggggg ctgtgttcct gctcagtaag 2461 ggagaccagc tgtccaccca caccgacggc atctcccatc tacacttcag ccccagcagt 2521 gtattctttg gagcctttgc actgtagatt ctaaagaaac ccaagaattg gattccaggc 2581 ctccatcctg accgttgttt caagggtcac atccccacag tctccagcct tccccactaa 2641 aataacctgg agctctcacg ggagtctgag acacttcagg ggactacatc ttccccaggg 2701 ccactccaga tgctcagggg acgactcaag cctacctaga agttcctgca cagagcaggg 2761 tttttgtggg tctaggtcgg acagagacct ggacatgaag gagggacaga catgggagag 2821 gtggctggga acaggggaag gttgactatt tatggagaga aaagttaagt tatttattta 2881 tagagaatag aaagagggga aaaatagaaa gccgtcagat gacaactagg tcccagacac 2941 aaaggtgtct cacctcagac aggacccatc taagagagag atggcgagag aattagatgt 3001 gggtgaccaa ggggttctag aagaaagcac gaagctctaa aagccagcca ctgcttggct 3061 agacatccac agggaccccc tgcaccatct gtgaaaccca ataaacctct tttctctgag 3121 attctgtctg cttgtgtctg tcttgcgttg ggggagaaac ttcctggtct ctttaaggag 3181 tggagcaggg gacagaggcc tcagttggcc atgggatcc // LOCUS MMINT2 8283 bp DNA ROD 29-NOV-1994 DEFINITION Mouse int-2 gene. ACCESSION Y00848 M26284 X68450 NID g52716 KEYWORDS int-2 gene; oncogene. SOURCE house mouse. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 7869) AUTHORS Dickson,C. TITLE Direct Submission JOURNAL Submitted (16-MAY-1988) Dickson C., Imperial Cancer Research Fund, P.O.Box 123, Lincolns Inn Fields, London WC2A 3PX REFERENCE 2 (bases 1 to 7869) AUTHORS Moore,R., Casey,G., Brookes,S., Dixon,M., Peters,G. and Dickson,C. TITLE Sequence, topography and protein coding potential of mouse int-2: a putative oncogene activated by mouse mammary tumour virus JOURNAL EMBO J. 5 (5), 919-924 (1986) MEDLINE 86247582 REFERENCE 3 (bases 801 to 2000) AUTHORS Smith,R., Peters,G. and Dickson,C. TITLE Multiple RNAs expressed from the int-2 gene in mouse embryonal carcinoma cell lines encode a protein with homology to fibroblast growth factors JOURNAL EMBO J. 7 (4), 1013-1022 (1988) MEDLINE 88296404 REFERENCE 4 (bases 1 to 8283) AUTHORS Peters,G. TITLE Direct Submission JOURNAL Submitted (14-SEP-1992) G. Peters, Imperial Cancer Research Fund, PO Box 123, 44 Lincoln's Inn Fields, London WC2A 3PX, UK REFERENCE 5 (bases 1 to 8283) AUTHORS Clausse,N., Baines,D., Moore,R., Dickson,C. and Peters,G. TITLE Activation of both Wnt-1 and Fgf-3 by insertion of mouse mammary tumor virus downstream in the reverse orientation JOURNAL Unpublished FEATURES Location/Qualifiers source 1..8283 /organism="Mus musculus" /db_xref="taxon:10090" misc_feature 910 /note="exon 1A major start [3]" exon 910..1993 /number=1 misc_feature 924..957 /note="exon 1A multiple minor starts [3]" misc_feature 1084 /note="MMTV-LTR insertion site" misc_feature 1230..1373 /note="exon 1 multiple minor starts [3]" misc_feature 1320 /note="exon 1 (major) start [3]" misc_feature 1671 /note="exon 1B start [3]" CDS join(1774..1993,3737..3840,5641..6054) /codon_start=1 /product="int-2" /db_xref="PID:g599879" /db_xref="SWISS-PROT:P05524" /translation="MGLIWLLLLSLLEPSWPTTGPGTRLRRDAGGRGGVYEHLGGAPR RRKLYCATKYHLQLHPSGRVNGSLENSAYSILEITAVEVGVVAIKGLFSGRYLAMNKR GRLYASDHYNAECEFVERIHELGYNTYASRLYRTGSSGPGAQRQPGAQRPWYVSVNGK GRPRRGFKTRRTQKSSLFLPRVLGHKDHEMVRLLQSSQPRAPGEGSQPRQRRQKKQSP GDHGKMETLSTRATPSTQLHTGGLAVA" intron 1994..3736 /number=1 exon 3737..3840 /number=2 intron 3841..5640 /number=2 exon 5641..7508 /number=3 BASE COUNT 1733 a 2362 c 2253 g 1935 t ORIGIN 1 ggatccagat gccctctgga ttcatttgtg acatcttagg agcttaggtt ggtcttcgag 61 acacagggct gtcccctgta aagcaggttc catcagtgac tccagggttt tagcagttca 121 gtggcgtagt tttcagactg cttaagattt ctcaggggct aggcgtgggg cagagaccct 181 gcagaccctg gctagaacag aggccctggg agacagttga gggtgctcag ctgtggagga 241 catgtgcatt cagcatgctg aagagggcct gctaccctaa atgaggttct cagggtggaa 301 aatgtcccag tgtgtgctag taatagcttc tgtatcgtgg aggtcccccc accccaacac 361 ttctcaggat ctcacatcaa caagggcttc ccgccagact cctcccaaga gctcaggatc 421 aacttaggaa ccaatgttgt gacccatgcc acagggtcct gaggcatgcc tgagggtagt 481 tgctggccag gggaccgttg gaagcacagg atgtggggtg ggaccttttc tgatacctca 541 ggaagtcctt cttctttgag gcccttggca gataaggaac tgaggtcaag agctcagggt 601 ccagggagct attccctgaa cctggactcc ccctcagcct gccaggtggg catgcgggcg 661 ccattgtggg atcttctttc ccttcggttc tgttgccctc tgtatctcca cattcctccc 721 cctctccact gctctcctcc ttctttctct tcagccaaat ctgtcatctg cctcacccac 781 ctttatttct gtctctgcct agatttcctg ggctctaagg ctctgtgact ctattgtctc 841 tgctcctatc tgtgcagtcc cctctctgga ccaccgctgg gacagaccac tcccacttaa 901 ggtgtgcaaa cttcctgtgc taccttctgg tcagttaggt agcttcgagc ttagagtccc 961 tgggccgttg ggccacaaga ctggacgcct gaggaccgct ctgcggtgtg agaagcgacc 1021 tgcacgaacc caaccccggg ctccccacgc accgccctcc gcaattgctc tggattaggc 1081 gccgcctttc atgagcgttt gtcagctaga cttccccgca agttgtttgc gcagacatca 1141 gtcatccaca gtcccattgt gcacccagga ttgatgtaag gcgggagggg gtgacagggc 1201 ctgggggcgg ttgcctttag gcctcgctct ataattttcc gaggcataat tggtctgggg 1261 gcgggggcgg ggcggggacc tttcagaggc gggagggggc tcagggcgca cggcggagga 1321 gcggcggccc gacggctctg gcccgggagc tgtgcgcagg cgacgcccgg cctgagtccc 1381 gcgcccccgc cagggaccac ggccgccttt tgttgtcaag cgccctttct tcagaactgt 1441 gttcggcaaa gaaacacgac ccccattcct gggtgaaaat tcaaagtctt ctttctccat 1501 ctccctctcc tctttcttgt cttctctcca ctatctcttc ccctctcctg ctccacccgt 1561 ccttatctcc atctccacct ctctccacct ctccctgtct cccctcctct tcctccctcc 1621 cctgtctccc acctcttccc ggtccccctc tccctccctt cctctcctag ccacagtcga 1681 gccggcctgg cgcgcgggcg tgtgctccca gcgccgcgcc ttcgtgagac ccgcgctggc 1741 gcagcagccg ctgcgggcgg gcgcgatgcc gggatgggcc tgatctggct tctgctgctc 1801 agcttgctgg aacccagctg gccaactacg gggcccggga cgcgactacg acgcgatgcg 1861 ggcggccgtg gtggcgttta cgagcacctc ggcggggcgc cacggcgccg caagctctac 1921 tgcgctacca agtaccacct ccagctgcac ccaagcggcc gcgtgaacgg cagccttgag 1981 aacagcgcct atagtgagtg ttcaggtccg gacagggagg ggagggtggc ggaagagacc 2041 ggagggctcg gtcccctgcc gccacgtcct agatagaggc agagctgagg tcagtccgaa 2101 gacgccgccg cggtcccagc taccgaactc agcaagctgc cccgcccgtc ccgctccccc 2161 aggagctgcg cgggaggcga ccccggttct gacacgtttc tattactcaa tgaaaagccg 2221 ttggacaagg atttatcgcc ctctttattg ttttgacgac cccggggtgc atggccagtc 2281 cctttcgttc tctttgctct cagccgatgc gcggaggaca ctccactagg cagagctcgg 2341 tttccgcaac gcggcatctg acgacaggct tgtcccgcgt cctgcgccct gcaagagagc 2401 gcgacaggac aaacaggcgt cccctcgtgt tccggatgct gctgacacgc acccacagcc 2461 cgcagctcga tcatacccag ggccagttcc cgagtattgc agggtttggc ttcttcagtt 2521 gggtacccac gagcggagct ggaccaggag cgttgctcgc tgcgtgggag ggaggccgcg 2581 caaaactcac cgcgcccggg gcgcaggaag ccgagcgggg ggcgcgggcg gcggggggga 2641 accgcagctt cctcctgcaa agccgcagcc ccgcgggggc gtcctgggga ctagacctgt 2701 cgggcctgga gcgcggggct cctcatgcgc accactcccc atttgctgga tcccgggagg 2761 tcccagatgt agaggagggg cacattcgct gccacctagg ggcactcagt ctggtgctcg 2821 tctggcgcca gctgtagagg cctcatcctg cacccctcca aaccgtggct agggcagaat 2881 ctcaactgag tccggtaatc tggaccatgc tgggtcccag cggccgctcc ctagaaaggg 2941 agtgaatcct cgacccacta tcccgggggt ctttgacctc cgctccctcc caaaggccac 3001 tatactgttc acggcaaggc tggatgaagt ttgggttgtt tttgtatttt gtttgttttg 3061 tgtgaccaca gtgagtgtgt ggaggtcatt gagccactag ctggagtctg gtctctactt 3121 ccatcatgtg ggttcctcga tcaaactctg gtgatcagtc tctgcaactt ctctgtcatc 3181 tgcgctgaga gattgttaac ttttgtacct actcttaggc ctgggtgagc cctgtcacct 3241 ttcctcagca cccaggccct tttctgtggc aggctacctc tcagcacaca tctttggatc 3301 acttgctgcc agtggggggc tgggctgcaa accctggctt ccctaagagt cttggccacc 3361 tccccttttg ggtctccatc tgctctgggg tttggatctc ccaaaccatt atgtgtggct 3421 taagttactt gatgcttgcg agaccgaggc attcacaaga ttgcacaaaa tacagactgg 3481 gtcgatctgt ccaggttgca cccaaggtgc atggcagggg cagaacattc ccatcatgtc 3541 ttctcctgag gttaggatta aggtctcccg cacagtggca ggatgcccct ttccttacca 3601 cctgctcaat gttttcttcc taccaggact tccaaccaga cctgtcccag cttcctgagc 3661 ctagaggggg aagctggctc aggctgaggt tctgcttagg ttgagacgtt gatggactct 3721 tcctgactct ttccaggcat cctggagatt actgcggtgg aagtgggcgt ggtggccatc 3781 aaagggctct tttctgggcg gtacctggcc atgaacaaga gaggacggct gtatgcttcg 3841 gtgagttcca ttactggtgg gcaggtgctg atggaataac catctggctt gagcatctga 3901 ttgggggaca gaagaggaca tgagagatag acttcttagc ccccaggtca tgccagactc 3961 aagctcgtcc agatccctcc tgggcttctg agtgcctgcc caacacagac tttaggtgct 4021 caggaaaatg ccctttccac ctgtctgaag tcagtctctt tctctaaacg tcgaaatgat 4081 ggagacagga agtgagcatg gcagcagggt cccaaacctc catggtgtga gtagccacca 4141 tagctgcagt tgagggagag caggtcatgg caaacctgcc ctgcatctgt ggaaggagga 4201 aggctggtgc ttggcctgtt cttctcaggg tgggaaggca acagttgtct ggcttagcag 4261 gagactggag aaatggctgt cagtctagta ctggttgcac agcatgagaa cctgagggtc 4321 tccagcaccc atgtatcaaa gccaggataa agtgtggtca tcccagggca gggggaggag 4381 gcaggatggc ttttggggtt ccccagctaa cctagctgaa ttggcctcca cgctcctgga 4441 tgggcaacag gaggagctca ggggtctcca ttctcctgac ctgaggtgtg acagaaacac 4501 actcttgcct gagtgatggg tcttagctgc tgcagagctg ggctggtgat ttccctgact 4561 cttagactct gtgtcctgac ccactctagt gagtgaactc agctgacttt acaatgaacc 4621 tgactatccc ccagaaactc tgccttaggg tgaggttttg ggtacactca atgacagatc 4681 tgcccacagg gaagaacatg atggggggca tgcctgtgct ctcttggctt caggggagca 4741 gggatgatga ggtcaggggt cccactggtt ggagtagaac tttccagata acccatcagg 4801 gatggtccta cagacttgca gacaggtccc agcttctttc agctgctggc tggtgtctaa 4861 ggcctcttcc tgtagacctg tccctgggtg tcctgggtgc gtaatgtggc tgcctatgtg 4921 ctatatccat gggacagtga cattccatag tcaatcccca cctcctggaa gtcttcgacc 4981 acaactgccc acagctccct ctactgtaaa agcaggctga atgaagtcag ctcatccttc 5041 atgacagtct gtccattcat tgtctatcat ccatccatcc atctatccac ccatctactc 5101 agcctgccat ctatccaccc acctacacac tcagtcatct acccacacct ccatctatac 5161 actcgttcat acgtcccaac ctatcatcca tccatctatc cacccatgta ccatccttac 5221 acccaattat ctaaccactg atacatttat ttgtctatcc acctacacag tcactcaccc 5281 acccagcaac atacctagct acccttccac ctattacatc cacctgcctg ttcagccact 5341 caccaacaca tccattcatc caccctctga cacaccagct agccagccac ctccagcatg 5401 ctctgtctac cacataccaa gcactgagcc aagtgcaagg ctgcagcaaa aggggatagt 5461 ggaaggaagt cacttggagc agggcacaga cttggacagg ttggatagcc aagctgccta 5521 gacagtggca ggtgtggact gctggtccag gcagccttag gagaaaggga ggcattgggg 5581 agatgtagcc ccattgacat catggctcca aggctgttga ctgtggcttc tgtctcacag 5641 gatcactaca acgcagagtg tgagtttgtg gaacggatcc atgagctggg ctacaataca 5701 tatgcttccc gcctgtaccg cacagggtcc agtgggccag gagctcaacg gcaacctggt 5761 gcccagagac cttggtacgt gtcggtgaat ggcaagggtc ggccacgcag gggcttcaag 5821 acccgccgca cacaaaagtc ctctctcttc ctgccccgag tgctgggcca caaggaccat 5881 gagatggtgc ggctgctgca gagtagccag ccacgagccc caggagaggg cagccagccc 5941 aggcagcgga ggcagaagaa gcagagccca ggtgatcatg gcaagatgga gactttgtct 6001 accagggcca ctccaagcac ccagctgcat acaggtggac tggctgtggc ctgagtggcc 6061 acctggaaag ccctctgaga ccaactccag tgggcaccca agattcactt ggagccctgg 6121 cctccccacc cttgtctttg ggctggctgc ttgggggacc aagaacttgc atgcctttac 6181 agcttcaaga gcaagtgcca gctgctaagg ggcttgagtc agagactctg gaagactcga 6241 agttcaagat gtatgtggag ttacatgaga gggaattctt attacagggg tcatcctaat 6301 cccggatgga gcccagccaa tggcagtcag tcctggcaac ctgaggcagt ggaccgtgga 6361 gggggctgtg attcacacat taaaggtgtc tttctgtctt gctatttaac agctgggata 6421 cagacttagc atgccctgag gctccaactg agatattctg tggagggtga agaaggcagt 6481 gatctgcggg cttccctgca aggtgtcaga ctggctgtgt cctgagaaag ggctgaggtg 6541 ctacatgcaa gcatgtgttt gtgggttctt atgtctgcaa atacccacaa gcatgtgtgt 6601 agaagggtgt tggtgtattc ctgagaacat gtgtgtgtga agcatgcaca caaactggcc 6661 ctgaactttt gacttccagg cctctgcctc tctgcgcgca cacacacact cgcactcctg 6721 tatatgaagc gtatatgtgt ttctctggga actgttttta tcaggtgaag tacttccttt 6781 gttcttgcta cccacctcca gggctccagg atctccagac agccaaccct aagacaggcc 6841 cagcttcctc tgtatctctg tgatgagaac cttggcatag agctgccctc accctcggga 6901 tagggcttat gttccccgga acgagccagg cacctcaaca gctcctgggg aggaataggg 6961 gactgggaag tgtctgttgg ctaatattta aaactgaccc aacaggagtc tgggccactt 7021 cggggatctg tcccttgccc ttagtcagag gccagtggct tcacactggg cgtgggttgg 7081 gagggaggca ggaacattca tgtcctgctc agttcctatc taccatttga ccctggttga 7141 cacaacacaa ccttttctga aaggttttgc tagatgtgag tagcttacag aaggccatgc 7201 aggctcaggg aaggcaccac gataagccat tagtgtcctg atgaagaaat ttggctgtag 7261 gctgggacac acacacgcac acaagcccca tggggaggca catagacatt cagagcattc 7321 caccagtgag attccatgtg gacctggggg ctaagtcagg gtgaagcttc cacagctaag 7381 tggctggagg ctgccctaaa agctcaggag gcaccgcaag caagccttga aaaaccttac 7441 ccaccagctt gaccttagac ttctggcctt caggctgtga caatacattc ctgctgttta 7501 aagaaccata tggttggtga tgttttgttt gtttctggtt cttttgtgtt ggtgtttttt 7561 gtttgcgggg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgttgcag 7621 tgctagagat aagatctgag gtctagaaca taccacacag gccattgaac cacaccccaa 7681 ggcagctgtg acatttttgc cagcttccct agcaccatag taccaggtat tgggtaaccc 7741 taaaaagaag accccgtaat gcaggaaggt gggggggggg gagcaggtca actcccagac 7801 tcaaagtcat ccaaggaaaa ttctggatct ttgtgagttc aaggctagcc tggcctatat 7861 agagaattcc agaccttcca gggttctgtt gagagactct atcttaattt tttaaaaaaa 7921 tctaaggagg aaagaggaag atactagcct ggtaccactg gcctccatgt acacacacgt 7981 gagcacacac acacacacac acacataaat gcatatacca cacacacagg gagaaagaaa 8041 aagggaggga agagacagag ggagagagag agagaagaga gagagaattt tagaatttca 8101 atctctccaa gatgggcatc agcacctctt tggcccctgc ccttttacct tatgaccccc 8161 gacccttgct caccttcacc actagagcca tgtttgttcc aaactccttt gcacgtgcca 8221 agggcactat tcttccctct gtttgtcccc agattaccca cttctcaaca ttcacttctg 8281 agt // LOCUS MUSOGC 949 bp DNA ROD 17-FEB-1994 DEFINITION Mus musculus osteocalcin gene, complete cds. ACCESSION L24429 NID g455452 KEYWORDS osteocalcin. SOURCE Mus musculus DNA. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 949) AUTHORS Desbois,C., Hogue,D.A. and Karsenty,G. TITLE The mouse osteocalcin gene cluster contains three genes with two separate spatial and temporal patterns of expression JOURNAL J. Biol. Chem. 269, 1183-1190 (1994) MEDLINE 94117426 FEATURES Location/Qualifiers source 1..949 /organism="Mus musculus" /db_xref="taxon:10090" exon 1..112 /number=1 CDS join(49..112,257..289,431..488,695..827) /codon_start=1 /product="osteocalcin" /db_xref="PID:g455453" /translation="MRTLSLLTLLALAALCLSDLTDAKPSGPESDKAFMSKQEGNKVV NRLRRYLGASVPSPDPLEPTREQCELNPACDELSDQYGLKTAYKRIYGITI" intron 113..256 /number=1 exon 257..289 /number=2 intron 290..430 /number=2 exon 431..488 /number=3 intron 489..694 /number=3 exon 695..949 /number=4 BASE COUNT 200 a 279 c 241 g 229 t ORIGIN 1 gaacagacaa gtcccacaca gcagcttggt gcacacctag cagacaccat gaggaccctc 61 tctctgctca ctctgctggc cctggctgcg ctctgtctct ctgacctcac aggtatgtgt 121 cctcctggtt catttctttg ggtaactacc ctcctgaagg tctcacaatc tgctttggga 181 tggcagaggg gaagggacaa cacatgaggg agacagcagg gaggaaacag aactaactac 241 actgtttgct ttacagatgc caagcccagc ggccctgagt ctgacaaagg tactagcagg 301 aagcctggca gggcctcggc ttggcctcac cctgtcccct aagcccccaa atccccttgc 361 cttctgcctg ggtgtcccca cttttcctcc tgaactcaga attacctgac cttgtgtgtc 421 ttctccacag ccttcatgtc caagcaggag ggcaataagg tagtgaacag actccggcgc 481 taccttgggt aagtggccag agcccttagc cttccatatt ggtagggagg agttgtgctg 541 gggtggtttc tgtgacccgc agagggctac acgtgcaggt caatccccat gtccaggacc 601 ctggagcctc ttgtacagtg tgggaagagg gtgtgtgtac cccgtgtata ttaatgccac 661 tgtgtgttgg ttgatgttac tttatgcttc tcagagcctc agtccccagc ccagatcccc 721 tggagcccac ccgggagcag tgtgagctta accctgcttg tgacgagcta tcagaccagt 781 atggcttgaa gaccgcctac aaacgcatct acggtatcac tatttaggac ctgtgctgcc 841 ctaaagccaa actctggcag ctcggctttg gctgctctcc gggacttgat cctccctgtc 901 ctctctctct gccctgcaag tatggatgtc acagcagctc caaaataaa // LOCUS MUSSAPRB 1350 bp DNA ROD 15-MAR-1990 DEFINITION Mouse serum amyloid P component gene, complete cds. ACCESSION M29535 NID g200925 KEYWORDS serum amyloid P component. SOURCE Mouse DNA, clone Lm mP-2. ORGANISM Mus musculus Eukaryotae; mitochondrial eukaryotes; Metazoa; Chordata; Vertebrata; Eutheria; Rodentia; Sciurognathi; Myomorpha; Muridae; Murinae; Mus. REFERENCE 1 (bases 1 to 1350) AUTHORS Nishiguchi,S., Maeda,S., Araki,S. and Shimada,K. TITLE Structure of the mouse serum amyloid p component gene JOURNAL Biochem. Biophys. Res. Commun. 155, 1366-1373 (1988) MEDLINE 89025810 FEATURES Location/Qualifiers source 1..1350 /organism="Mus musculus" /db_xref="taxon:10090" prim_transcript 219..1296 /note="SAPR mRNA and intron" CDS join(371..437,548..1155) /note="serum amyloid P component precursor" /codon_start=1 /db_xref="PID:g200926" /translation="MDKLLLWMFVFTSLLSEAFCQTDLKRKVFVFPRESETDHVKLIP HLEKPLQNFTLCFRTYSDLSRSQSLFSYSVKGRDNELLIYKEKVGEYSLYIGQSKVTV RGMEEYLSPVHLCTTWESSSGIVEFWVNGKPWVKKSLQREYTVKAPPSIVLGQEQDNY GGGFQRSQSFVGEFSDLYMWDYVLTPQDILFVYRDSPVNPNILNWQALNYEINGYVVI RPRVWD" sig_peptide 371..430 /note="serum amyloid P component signal peptide" exon <371..437 /note="serum amyloid P component precursor" /number=1 mat_peptide 431..434 /note="serum amyloid P component" intron 438..547 /note="SAPR intron" exon 548..>1155 /note="serum amyloid P component precursor" /number=2 BASE COUNT 378 a 295 c 283 g 394 t ORIGIN 343 bp upstream of PvuII site. 1 agaatggaga actcatgtga ccaacctggt ctcttgttct atctgtagga ccttgaagat 61 ttgcagctct tcccttcccg gcaatgaaat ctgggtcaca ggggttaaag ctctagcact 121 ggcttgtgtc taatgaataa ctgttattga tttcccagca caggggctaa tcatttattt 181 tctaacaaca gctctaatta ttagcagaac gaaggaggat ctgggagtac ctcacatggt 241 attacttctc tccacccttc attatcatcc aaggcacata caaaacctga aatctgaaaa 301 gcatagggag acaccacact tttgttccac acccaagtaa cagctgctgc tgtcataccc 361 tgggccaagc atggacaagc tgctgctttg gatgtttgtc ttcaccagcc ttctttcaga 421 agccttttgt cagacaggta agatgctttg gctgctgtca gggagcataa aatgagaaaa 481 gatgaatctg agatattgtt tgtctgaatt tgttgaaatg cattatattt cttattttat 541 ctcacagacc tcaagaggaa agtatttgtg ttccccagag aatctgaaac tgatcatgtg 601 aagctgatcc cacatctaga gaaacctctg cagaatttta cactgtgttt ccgaacctac 661 agtgaccttt cccgctctca gagtcttttc tcctacagtg tcaagggcag agacaatgag 721 ctactaattt ataaagaaaa agttggagaa tacagcctat acatcggaca atcaaaagtc 781 acagtccgtg gtatggaaga atacctttct ccagtacacc tatgtaccac ttgggagtcc 841 tcctctggca ttgttgaatt ttgggtcaat ggaaagcctt gggtaaaaaa gtctctgcag 901 agggaataca ctgtgaaagc cccacccagt atagtcctgg gacaggagca ggataactat 961 ggaggagggt ttcaaaggtc acagtccttt gtaggagagt tttcagattt atacatgtgg 1021 gactatgtgc tgaccccaca agacattcta tttgtgtaca gagattcccc tgtcaatcct 1081 aatattttga attggcaggc tcttaactat gaaataaatg gctacgtagt catcaggccc 1141 cgtgtctggg attgagatct tacaacaaaa cctcatggac atcagatggc cgatgtgtaa 1201 gaggtcaagg cggcagagtt cactctatct ggagcttttt cttctttgtg aacatcttgt 1261 atacatatct gccaaataaa aatcctctcc aattccacct gtattggttt gctgcttgac 1321 ataatgtcaa acgctttctc ttggctctat // LOCUS MUSCRKNB 45