Sequence feature view of the region: gi|16127994:2855116-2857677
LOCUS 16127994 2562 bp DNA BCT 26-OCT-2001
DEFINITION Gene from: Escherichia coli K12, complete genome.
ACCESSION 16127994
VERSION 16127994
KEYWORDS .
SOURCE Escherichia coli K12.
ORGANISM Escherichia coli K12
Bacteria; Proteobacteria; gamma subdivision; Enterobacteriaceae;
Escherichia.
REFERENCE 1 (bases 2855116 to 2857677)
AUTHORS Blattner,F.R., Plunkett,G. III, Bloch,C.A., Perna,N.T., Burland,V.,
Riley,M., Collado-Vides,J., Glasner,J.D., Rode,C.K., Mayhew,G.F.,
Gregor,J., Davis,N.W., Kirkpatrick,H.A., Goeden,M.A., Rose,D.J.,
Mau,B. and Shao,Y.
TITLE The complete genome sequence of Escherichia coli K-12
JOURNAL Science 277 (5331), 1453-1474 (1997)
MEDLINE 97426617
REFERENCE 2 (bases 2855116 to 2857677)
AUTHORS Blattner,F.R.
TITLE Direct Submission
JOURNAL Submitted (16-JAN-1997) Guy Plunkett III, Laboratory of Genetics,
University of Wisconsin, 445 Henry Mall, Madison, WI 53706, USA.
Email: ecoli@genetics.wisc.edu Phone: 608-262-2534 Fax:
608-263-7459
REFERENCE 3 (bases 2855116 to 2857677)
AUTHORS Blattner,F.R.
TITLE Direct Submission
JOURNAL Submitted (02-SEP-1997) Guy Plunkett III, Laboratory of Genetics,
University of Wisconsin, 445 Henry Mall, Madison, WI 53706, USA.
Email: ecoli@genetics.wisc.edu Phone: 608-262-2534 Fax:
608-263-7459
REFERENCE 4 (bases 2855116 to 2857677)
AUTHORS NCBI Microbial Genomes Annotation Project.
TITLE Direct Submission
JOURNAL Submitted (26-SEP-2001) National Center for Biotechnology
Information, NIH, Bethesda, MD 20894, USA
COMMENT PROVISIONAL REFSEQ: This record has not yet been subject to final
NCBI review. The reference sequence was derived from U00096.
This sequence was determined by the E. coli Genome Project at the
University of Wisconsin-Madison (Frederick R. Blattner, director).
Supported by NIH grants HG00301 and HG01428 (from the Human Genome
Project and NCHGR). The entire sequence was independently
determined from E. coli K-12 strain MG1655. Predicted open reading
frames were determined using GeneMark software, kindly supplied by
Mark Borodovsky, Georgia Institute of Technology, Atlanta, GA,
30332 [e-mail: mark@amber.gatech.edu]. Open reading frames that
have been correlated with genetic loci are being annotated with CG
Site Nos., unique ID nos. for the genes in the E. coli Genetic
Stock Center (CGSC) database at Yale University, kindly supplied by
Mary Berlyn. A public version of the database is accessible
(http://cgsc.biology.yale.edu). Annotation of the genome is an
ongoing task whose goal is to make the genome sequence more useful
by correlating it with other data. Comments to the authors are
appreciated. Updated information will be available at the E. coli
Genome Project's World Wide Web site
(http://www.genetics.wisc.edu). *** The E. coli K-12 sequence and
its annotations are periodically updated; this is version M54. No
sequence changes. Annotation updates: updated gene identifications
and products; all new functional assignments courtesy of Monica
Riley; added promoters, protein binding sites, and repeated
sequences described in reference 1. The unique numeric identifiers
beginning with a lowercase 'b' assigned to each gene (protein- or
RNA-encoding) are now designated as gene synonyms instead of
labels. This should allow them to be searched for in Entrez as gene
names.
COMPLETENESS: full length.
FEATURES Location/Qualifiers
gene 1..2562
/gene="mutS"
/note="b2733"
CDS 1..2562
/gene="mutS"
/function="enzyme; DNA - replication, repair,
restriction/modification"
/note="o853; 100 pct identical to MUTS_ECOLI SW: P23909"
/codon_start=1
/transl_table=11
/product="methyl-directed mismatch repair"
/protein_id="NP_417213.1"
/db_xref="GI:16130640"
/translation="MSAIENFDAHTPMMQQYLRLKAQHPEILLFYRMGDFYELFYDDA
KRASQLLDISLTKRGASAGEPIPMAGIPYHAVENYLAKLVNQGESVAICEQIGDPATS
KGPVERKVVRIVTPGTISDEALLQERQDNLLAAIWQDSKGFGYATLDISSGRFRLSEP
ADRETMAAELQRTNPAELLYAEDFAEMSLIEGRRGLRRRPLWEFEIDTARQQLNLQFG
TRDLVGFGVENAPRGLCAAGCLLQYAKDTQRTTLPHIRSITMEREQDSIIMDAATRRN
LEITQNLAGGAENTLASVLDCTVTPMGSRMLKRWLHMPVRDTRVLLERQQTIGALQDF
TAGLQPVLRQVGDLERILARLALRTARPRDLARMRHAFQQLPELRAQLETVDSAPVQA
LREKMGEFAELRDLLERAIIDTPPVLVRDGGVIASGYNEELDEWRALADGATDYLERL
EVRERERTGLDTLKVGFNAVHGYYIQISRGQSHLAPINYMRRQTLKNAERYIIPELKE
YEDKVLTSKGKALALEKQLYEELFDLLLPHLEALQQSASALAELDVLVNLAERAYTLN
YTCPTFIDKPGIRITEGRHPVVEQVLNEPFIANPLNLSPQRRMLIITGPNMGGKSTYM
RQTALIALMAYIGSYVPAQKVEIGPIDRIFTRVGAADDLASGRSTFMVEMTETANILH
NATEYSLVLMDEIGRGTSTYDGLSLAWACAENLANKIKALTLFATHYFELTQLPEKME
GVANVHLDALEHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQKLRELES
ISPNAAATQVDGTQMSLLSVPEETSPAVEALENLDPDSLTPRQALEWIYRLKSLV"
BASE COUNT 590 a 690 c 748 g 534 t
ORIGIN
1 atgagtgcaa tagaaaattt cgacgcccat acgcccatga tgcagcagta tctcaggctg
61 aaagcccagc atcccgagat cctgctgttt taccggatgg gtgattttta tgaactgttt
121 tatgacgacg caaaacgcgc gtcgcaactg ctggatattt cactgaccaa acgcggtgct
181 tcggcgggag agccgatccc gatggcgggg attccctacc atgcggtgga aaactatctc
241 gccaaactgg tgaatcaggg agagtccgtt gccatctgcg aacaaattgg cgatccggcg
301 accagcaaag gtccggttga gcgcaaagtt gtgcgtatcg ttacgccagg caccatcagc
361 gatgaagccc tgttgcagga gcgtcaggac aacctgctgg cggctatctg gcaggacagc
421 aaaggtttcg gctacgcgac gctggatatc agttccgggc gttttcgcct gagcgaaccg
481 gctgaccgcg aaacgatggc ggcagaactg caacgcacta atcctgcgga actgctgtat
541 gcagaagatt ttgctgaaat gtcgttaatt gaaggccgtc gcggcctgcg ccgtcgcccg
601 ctgtgggagt ttgaaatcga caccgcgcgc cagcagttga atctgcaatt tgggacccgc
661 gatctggtcg gttttggcgt cgagaacgcg ccgcgcggac tttgtgctgc cggttgtctg
721 ttgcagtatg cgaaagatac ccaacgtacg actctgccgc atattcgttc catcaccatg
781 gaacgtgagc aggacagcat cattatggat gccgcgacgc gtcgtaatct ggaaatcacc
841 cagaacctgg cgggtggtgc ggaaaatacg ctggcttctg tgctcgactg caccgtcacg
901 ccgatgggca gccgtatgct gaaacgctgg ctgcatatgc cagtgcgcga tacccgcgtg
961 ttgcttgagc gccagcaaac tattggcgca ttgcaggatt tcaccgccgg gctacagccg
1021 gtactgcgtc aggtcggcga cctggaacgt attctggcac gtctggcttt acgaactgct
1081 cgcccacgcg atctggcccg tatgcgccac gctttccagc aactgccgga gctgcgtgcg
1141 cagttagaaa ctgtcgatag tgcaccggta caggcgctac gtgagaagat gggcgagttt
1201 gccgagctgc gcgatctgct ggagcgagca atcatcgaca caccgccggt gctggtacgc
1261 gacggtggtg ttatcgcatc gggctataac gaagagctgg atgagtggcg cgcgctggct
1321 gacggcgcga ccgattatct ggagcgtctg gaagtccgcg agcgtgaacg taccggcctg
1381 gacacgctga aagttggctt taatgcggtg cacggctact acattcaaat cagccgtggg
1441 caaagccatc tggcacccat caactacatg cgtcgccaga cgctgaaaaa cgccgagcgc
1501 tacatcattc cagagctaaa agagtacgaa gataaagttc tcacctcaaa aggcaaagca
1561 ctggcactgg aaaaacagct ttatgaagag ctgttcgacc tgctgttgcc gcatctggaa
1621 gcgttgcaac agagcgcgag cgcgctggcg gaactcgacg tgctggttaa cctggcggaa
1681 cgggcctata ccctgaacta cacctgcccg accttcattg ataaaccggg cattcgcatt
1741 accgaaggtc gccatccggt agttgaacaa gtactgaatg agccatttat cgccaacccg
1801 ctgaatctgt cgccgcagcg ccgcatgttg atcatcaccg gtccgaacat gggcggtaaa
1861 agtacctata tgcgccagac cgcactgatt gcgctgatgg cctacatcgg cagctatgta
1921 ccggcacaaa aagtcgagat tggacctatc gatcgcatct ttacccgcgt aggcgcggca
1981 gatgacctgg cgtccgggcg ctcaaccttt atggtggaga tgactgaaac cgccaatatt
2041 ttacataacg ccaccgaata cagtctggtg ttaatggatg agatcgggcg tggaacgtcc
2101 acctacgatg gtctgtcgct ggcgtgggcg tgcgcggaaa atctggcgaa taagattaag
2161 gcattgacgt tatttgctac ccactatttc gagctgaccc agttaccgga gaaaatggaa
2221 ggcgtcgcta acgtgcatct cgatgcactg gagcacggcg acaccattgc ctttatgcac
2281 agcgtgcagg atggcgcggc gagcaaaagc tacggcctgg cggttgcagc tctggcaggc
2341 gtgccaaaag aggttattaa gcgcgcacgg caaaagctgc gtgagctgga aagcatttcg
2401 ccgaacgccg ccgctacgca agtggatggt acgcaaatgt ctttgctgtc agtaccagaa
2461 gaaacttcgc ctgcggtcga agctctggaa aatcttgatc cggattcact caccccgcgt
2521 caggcgctgg agtggattta tcgcttgaag agcctggtgt aa
//
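
The record above can be sanity-checked with a short script. The following is a minimal sketch, not part of the original record: it strips the position numbers from ORIGIN lines, counts bases, adds up the reported BASE COUNT figures against the 2562 bp length on the LOCUS line, and translates the first ten codons for comparison with the start of the /translation qualifier. The function names (`parse_origin`, `base_count`, `translate`) are hypothetical helpers chosen for illustration, and only the first ORIGIN line is embedded here to keep the example short.

```python
from collections import Counter

# Standard amino-acid assignments, with codons ordered t, c, a, g at each position.
# Assumption: /transl_table=11 assigns the same amino acids as the standard
# table for these codons, so the standard assignments are used here.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def parse_origin(lines):
    """Join ORIGIN sequence lines, dropping position numbers and spaces."""
    chunks = []
    for line in lines:
        parts = line.split()
        # The first token is the running position (1, 61, 121, ...);
        # the remaining tokens are groups of up to 10 bases.
        if parts and parts[0].isdigit():
            chunks.extend(parts[1:])
    return "".join(chunks).lower()


def base_count(seq):
    """Return counts of a, c, g and t in the sequence."""
    counts = Counter(seq)
    return {base: counts.get(base, 0) for base in "acgt"}


def translate(seq):
    """Translate a DNA sequence from its first base (codon_start=1)."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First ORIGIN line of the record, copied verbatim.
origin_lines = [
    "1 atgagtgcaa tagaaaattt cgacgcccat acgcccatga tgcagcagta tctcaggctg",
]
seq = parse_origin(origin_lines)
counts = base_count(seq)

# Figures reported on the BASE COUNT line of the record.
reported = {"a": 590, "c": 690, "g": 748, "t": 534}
reported_total = sum(reported.values())

protein = translate(seq[:30])
print(len(seq), reported_total, protein)  # prints: 60 2562 MSAIENFDAH
```

To check the whole record, pass every line between ORIGIN and `//` to `parse_origin`; the result of `base_count` can then be compared directly with the `reported` dictionary, and the full translation with the /translation qualifier.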