BLAST PubMed Nucleotide Protein Genome Structure PopSet Taxonomy Help

FASTA view

Sequence feature view of the region: gi|16127994:2855116-2857677

LOCUS       16127994     2562 bp    DNA             BCT       26-OCT-2001
DEFINITION  Gene from: Escherichia coli K12, complete genome.
ACCESSION   16127994
VERSION     16127994
KEYWORDS    .
SOURCE      Escherichia coli K12.
  ORGANISM  Escherichia coli K12
            Bacteria; Proteobacteria; gamma subdivision; Enterobacteriaceae;
            Escherichia.
REFERENCE   1  (bases 2855116 to 2857677)
  AUTHORS   Blattner,F.R., Plunkett,G. III, Bloch,C.A., Perna,N.T., Burland,V.,
            Riley,M., Collado-Vides,J., Glasner,J.D., Rode,C.K., Mayhew,G.F.,
            Gregor,J., Davis,N.W., Kirkpatrick,H.A., Goeden,M.A., Rose,D.J.,
            Mau,B. and Shao,Y.
  TITLE     The complete genome sequence of Escherichia coli K-12
  JOURNAL   Science 277 (5331), 1453-1474 (1997)
  MEDLINE   97426617
REFERENCE   2  (bases 2855116 to 2857677)
  AUTHORS   Blattner,F.R.
  TITLE     Direct Submission
  JOURNAL   Submitted (16-JAN-1997) Guy Plunkett III, Laboratory of Genetics,
            University of Wisconsin, 445 Henry Mall, Madison, WI 53706, USA.
            Email: ecoli@genetics.wisc.edu Phone: 608-262-2534 Fax:
            608-263-7459
REFERENCE   3  (bases 2855116 to 2857677)
  AUTHORS   Blattner,F.R.
  TITLE     Direct Submission
  JOURNAL   Submitted (02-SEP-1997) Guy Plunkett III, Laboratory of Genetics,
            University of Wisconsin, 445 Henry Mall, Madison, WI 53706, USA.
            Email: ecoli@genetics.wisc.edu Phone: 608-262-2534 Fax:
            608-263-7459
REFERENCE   4  (bases 2855116 to 2857677)
  AUTHORS   NCBI Microbial Genomes Annotation Project.
  TITLE     Direct Submission
  JOURNAL   Submitted (26-SEP-2001) National Center for Biotechnology
            Information, NIH, Bethesda, MD 20894, USA
COMMENT     PROVISIONAL REFSEQ: This record has not yet been subject to final
            NCBI review. The reference sequence was derived from U00096.
            This sequence was determined by the E. coli Genome Project at the
            University of Wisconsin-Madison (Frederick R. Blattner, director). 
            Supported by NIH grants HG00301 and HG01428 (from the Human Genome
            Project and NCHGR). The entire sequence was independently
            determined from E. coli K-12 strain MG1655. Predicted open reading
            frames were determined using GeneMark software, kindly supplied by
            Mark Borodovsky, Georgia Institute of Technology, Atlanta, GA,
            30332 [e-mail: mark@amber.gatech.edu].  Open reading frames that
            have been correlated with genetic loci are being annotated with CG
            Site Nos., unique ID nos. for the genes in the E. coli Genetic
            Stock Center (CGSC) database at Yale University, kindly supplied by
            Mary Berlyn. A public version of the database is accessible
            (http://cgsc.biology.yale.edu). Annotation of the genome is an
            ongoing task whose goal is to make the genome sequence more useful
            by correlating it with other data.  Comments to the authors are
            appreciated. Updated information will be available at the E. coli
            Genome Project's World Wide Web site
            (http://www.genetics.wisc.edu). *** The E. coli K-12 sequence and
            its annotations are periodically updated; this is version M54. No
            sequence changes. Annotation updates: updated gene identifications
            and products; all new functional assignments courtesy of Monica
            Riley; added promoters, protein binding sites, and repeated
            sequences described in reference 1. The unique numeric identifiers
            beginning with a lowercase 'b' assigned to each gene (protein- or
            RNA-encoding) are now designated as gene synonyms instead of
            labels. This should allow them to be searched for in Entrez as gene
            names.
            COMPLETENESS: full length.
FEATURES             Location/Qualifiers
     gene            1..2562
                     /gene="mutS"
                     /note="b2733"
     CDS             1..2562
                     /gene="mutS"
                     /function="enzyme; DNA - replication, repair,
                     restriction/modification"
                     /note="o853; 100 pct identical to MUTS_ECOLI SW: P23909"
                     /codon_start=1
                     /transl_table=11
                     /product="methyl-directed mismatch repair"
                     /protein_id="NP_417213.1"
                     /db_xref="GI:16130640"
                     /translation="MSAIENFDAHTPMMQQYLRLKAQHPEILLFYRMGDFYELFYDDA
                     KRASQLLDISLTKRGASAGEPIPMAGIPYHAVENYLAKLVNQGESVAICEQIGDPATS
                     KGPVERKVVRIVTPGTISDEALLQERQDNLLAAIWQDSKGFGYATLDISSGRFRLSEP
                     ADRETMAAELQRTNPAELLYAEDFAEMSLIEGRRGLRRRPLWEFEIDTARQQLNLQFG
                     TRDLVGFGVENAPRGLCAAGCLLQYAKDTQRTTLPHIRSITMEREQDSIIMDAATRRN
                     LEITQNLAGGAENTLASVLDCTVTPMGSRMLKRWLHMPVRDTRVLLERQQTIGALQDF
                     TAGLQPVLRQVGDLERILARLALRTARPRDLARMRHAFQQLPELRAQLETVDSAPVQA
                     LREKMGEFAELRDLLERAIIDTPPVLVRDGGVIASGYNEELDEWRALADGATDYLERL
                     EVRERERTGLDTLKVGFNAVHGYYIQISRGQSHLAPINYMRRQTLKNAERYIIPELKE
                     YEDKVLTSKGKALALEKQLYEELFDLLLPHLEALQQSASALAELDVLVNLAERAYTLN
                     YTCPTFIDKPGIRITEGRHPVVEQVLNEPFIANPLNLSPQRRMLIITGPNMGGKSTYM
                     RQTALIALMAYIGSYVPAQKVEIGPIDRIFTRVGAADDLASGRSTFMVEMTETANILH
                     NATEYSLVLMDEIGRGTSTYDGLSLAWACAENLANKIKALTLFATHYFELTQLPEKME
                     GVANVHLDALEHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQKLRELES
                     ISPNAAATQVDGTQMSLLSVPEETSPAVEALENLDPDSLTPRQALEWIYRLKSLV"
BASE COUNT      590 a    690 c    748 g    534 t
ORIGIN      
        1 atgagtgcaa tagaaaattt cgacgcccat acgcccatga tgcagcagta tctcaggctg
       61 aaagcccagc atcccgagat cctgctgttt taccggatgg gtgattttta tgaactgttt
      121 tatgacgacg caaaacgcgc gtcgcaactg ctggatattt cactgaccaa acgcggtgct
      181 tcggcgggag agccgatccc gatggcgggg attccctacc atgcggtgga aaactatctc
      241 gccaaactgg tgaatcaggg agagtccgtt gccatctgcg aacaaattgg cgatccggcg
      301 accagcaaag gtccggttga gcgcaaagtt gtgcgtatcg ttacgccagg caccatcagc
      361 gatgaagccc tgttgcagga gcgtcaggac aacctgctgg cggctatctg gcaggacagc
      421 aaaggtttcg gctacgcgac gctggatatc agttccgggc gttttcgcct gagcgaaccg
      481 gctgaccgcg aaacgatggc ggcagaactg caacgcacta atcctgcgga actgctgtat
      541 gcagaagatt ttgctgaaat gtcgttaatt gaaggccgtc gcggcctgcg ccgtcgcccg
      601 ctgtgggagt ttgaaatcga caccgcgcgc cagcagttga atctgcaatt tgggacccgc
      661 gatctggtcg gttttggcgt cgagaacgcg ccgcgcggac tttgtgctgc cggttgtctg
      721 ttgcagtatg cgaaagatac ccaacgtacg actctgccgc atattcgttc catcaccatg
      781 gaacgtgagc aggacagcat cattatggat gccgcgacgc gtcgtaatct ggaaatcacc
      841 cagaacctgg cgggtggtgc ggaaaatacg ctggcttctg tgctcgactg caccgtcacg
      901 ccgatgggca gccgtatgct gaaacgctgg ctgcatatgc cagtgcgcga tacccgcgtg
      961 ttgcttgagc gccagcaaac tattggcgca ttgcaggatt tcaccgccgg gctacagccg
     1021 gtactgcgtc aggtcggcga cctggaacgt attctggcac gtctggcttt acgaactgct
     1081 cgcccacgcg atctggcccg tatgcgccac gctttccagc aactgccgga gctgcgtgcg
     1141 cagttagaaa ctgtcgatag tgcaccggta caggcgctac gtgagaagat gggcgagttt
     1201 gccgagctgc gcgatctgct ggagcgagca atcatcgaca caccgccggt gctggtacgc
     1261 gacggtggtg ttatcgcatc gggctataac gaagagctgg atgagtggcg cgcgctggct
     1321 gacggcgcga ccgattatct ggagcgtctg gaagtccgcg agcgtgaacg taccggcctg
     1381 gacacgctga aagttggctt taatgcggtg cacggctact acattcaaat cagccgtggg
     1441 caaagccatc tggcacccat caactacatg cgtcgccaga cgctgaaaaa cgccgagcgc
     1501 tacatcattc cagagctaaa agagtacgaa gataaagttc tcacctcaaa aggcaaagca
     1561 ctggcactgg aaaaacagct ttatgaagag ctgttcgacc tgctgttgcc gcatctggaa
     1621 gcgttgcaac agagcgcgag cgcgctggcg gaactcgacg tgctggttaa cctggcggaa
     1681 cgggcctata ccctgaacta cacctgcccg accttcattg ataaaccggg cattcgcatt
     1741 accgaaggtc gccatccggt agttgaacaa gtactgaatg agccatttat cgccaacccg
     1801 ctgaatctgt cgccgcagcg ccgcatgttg atcatcaccg gtccgaacat gggcggtaaa
     1861 agtacctata tgcgccagac cgcactgatt gcgctgatgg cctacatcgg cagctatgta
     1921 ccggcacaaa aagtcgagat tggacctatc gatcgcatct ttacccgcgt aggcgcggca
     1981 gatgacctgg cgtccgggcg ctcaaccttt atggtggaga tgactgaaac cgccaatatt
     2041 ttacataacg ccaccgaata cagtctggtg ttaatggatg agatcgggcg tggaacgtcc
     2101 acctacgatg gtctgtcgct ggcgtgggcg tgcgcggaaa atctggcgaa taagattaag
     2161 gcattgacgt tatttgctac ccactatttc gagctgaccc agttaccgga gaaaatggaa
     2221 ggcgtcgcta acgtgcatct cgatgcactg gagcacggcg acaccattgc ctttatgcac
     2281 agcgtgcagg atggcgcggc gagcaaaagc tacggcctgg cggttgcagc tctggcaggc
     2341 gtgccaaaag aggttattaa gcgcgcacgg caaaagctgc gtgagctgga aagcatttcg
     2401 ccgaacgccg ccgctacgca agtggatggt acgcaaatgt ctttgctgtc agtaccagaa
     2461 gaaacttcgc ctgcggtcga agctctggaa aatcttgatc cggattcact caccccgcgt
     2521 caggcgctgg agtggattta tcgcttgaag agcctggtgt aa
//