Key for assignment 3
1. How many hits will you get if you search for genes associated with colon
cancer in the human genome?
Search results for query "colon cancer" -> 17 hits
2. How many loci will you find if you search LocusLink for human in
GenBank?
3. Give the locus ID and position of MLH1.
LocusID -> 4292
Position -> 3p21.3
4. Find the %ID of the nucleotide sequence for its possible orthologs in
mouse.
Click on the HomoloGene data for 4292 (MLH1) -> Click "more" for Hs.57301
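The %ID reported for an ortholog pair is a percent identity over an alignment. As a minimal sketch of that arithmetic (not the procedure HomoloGene itself uses), the hypothetical function `percent_identity` below counts identical positions between two sequences that have already been aligned, skipping any column with a gap. The two short sequences are made up for illustration and are not MLH1 data; tools differ in how they treat gaps, so a value computed this way may not match a database figure exactly.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of identical residues over gap-free alignment columns.

    Assumes both strings come from the same pairwise alignment,
    so they must have equal length.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = 0
    compared = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap or b == gap:
            continue  # skip columns with a gap in either sequence
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        return 0.0  # nothing comparable, report 0 rather than divide by zero
    return 100.0 * matches / compared


# Toy alignment (made-up sequences, for illustration only)
toy_a = "ACGTTGCA-ACGT"
toy_b = "ACGTAGCATACGT"
print(round(percent_identity(toy_a, toy_b), 2))
```

Excluding gap columns from the denominator is one common convention; counting them as mismatches instead would give a lower figure.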
5. Find the total number of mutations of MLH1 reported in the Human Gene
Mutation Database.
147 (now 149)
Click on 4292 -> Click on HGMD (Human Gene Mutation Database)
6. Give the DNA sequence of MLH1.
>gi|13905125|gb|BC006850.1|BC006850 Homo sapiens, mutL (E. coli) homolog 1 (colon cancer, nonpolyposis type 2), clone MGC:5172 IMAGE:3451538, mRNA, complete cds
GGCACTTCCGTTGAGCATCTAGACGTTTCCTTGGCTCTTCTGGCGCCAAAATGTCGTTCGTGGCAGGGGTTATTCGGCGG
CTGGACGAGACAGTGGTGAACCGCATCGCGGCGGGGGAAGTTATCCAGCGGCCAGCTAATGCTATCAAAGAGATGATTGA
GAACTGTTTAGATGCAAAATCCACAAGTATTCAAGTGATTGTTAAAGAGGGAGGCCTGAAGTTGATTCAGATCCAAGACA
ATGGCACCGGGATCAGGAAAGAAGATCTGGATATTGTATGTGAAAGGTTCACTACTAGTAAACTGCAGTCCTTTGAGGAT
TTAGCCAGTATTTCTACCTATGGCTTTCGAGGTGAGGCTTTGGCCAGCATAAGCCATGTGGCTCATGTTACTATTACAAC
GAAAACAGCTGATGGAAAGTGTGCATACAGAGCAAGTTACTCAGATGGAAAACTGAAAGCCCCTCCTAAACCATGTGCTG
GCAATCAAGGGACCCAGATCACGGTGGAGGACCTTTTTTACAACATAGCCACGAGGAGAAAAGCTTTAAAAAATCCAAGT
GAAGAATATGGGAAAATTTTGGAAGTTGTTGGCAGGTATTCAGTACACAATGCAGGCATTAGTTTCTCAGTTAAAAAACA
AGGAGAGACAGTAGCTGATGTTAGGACACTACCCAATGCCTCAACCGTGGACAATATTCGCTCCATCTTTGGAAATGCTG
TTAGTCGAGAACTGATAGAAATTGGATGTGAGGATAAAACCCTAGCCTTCAAAATGAATGGTTACATATCCAATGCAAAC
TACTCAGTGAAGAAGTGCATCTTCTTACTCTTCATCAACCATCGTCTGGTAGAATCAACTTCCTTGAGAAAAGCCATAGA
AACAGTGTATGCAGCCTATTTGCCCAAAAACACACACCCATTCCTGTACCTCAGTTTAGAAATCAGTCCCCAGAATGTGG
ATGTTAATGTGCACCCCACAAAGCATGAAGTTCACTTCCTGCACGAGGAGAGCATCCTGGAGCGGGTGCAGCAGCACATC
GAGAGCAAGCTCCTGGGCTCCAATTCCTCCAGGATGTACTTCACCCAGACTTTGCTACCAGGACTTGCTGGCCCCTCTGG
GGAGATGGTTAAATCCACAACAAGTCTGACCTCGTCTTCTACTTCTGGAAGTAGTGATAAGGTCTATGCCCACCAGATGG
TTCGTACAGATTCCCGGGAACAGAAGCTTGATGCATTTCTGCAGCCTCTGAGCAAACCCCTGTCCAGTCAGCCCCAGGCC
ATTGTCACAGAGGATAAGACAGATATTTCTAGTGGCAGGGCTAGGCAGCAAGATGAGGAGATGCTTGAACTCCCAGCCCC
TGCTGAAGTGGCTGCCAAAAATCAGAGCTTGGAGGGGGATACAACAAAGGGGACTTCAGAAATGTCAGAGAAGAGAGGAC
CTACTTCCAGCAACCCCAGAAAGAGACATCGGGAAGATTCTGATGTGGAAATGGTGGAAGATGATTCCCGAAAGGAAATG
ACTGCAGCTTGTACCCCCCGGAGAAGGATCATTAACCTCACTAGTGTTTTGAGTCTCCAGGAAGAAATTAATGAGCAGGG
ACATGAGGTTCTCCGGGAGATGTTGCATAACCACTCCTTCGTGGGCTGTGTGAATCCTCAGTGGGCCTTGGCACAGCATC
AAACCAAGTTATACCTTCTCAACACCACCAAGCTTAGTGAAGAACTGTTCTACCAGATACTCATTTATGATTTTGCCAAT
TTTGGTGTTCTCAGGTTATCGGAGCCAGCACCGCTCTTTGACCTTGCCATGCTTGCCTTAGATAGTCCAGAGAGTGGCTG
GACAGAGGAAGATGGTCCCAAAGAAGGACTTGCTGAATACATTGTTGAGTTTCTGAAGAAGAAGGCTGAGATGCTTGCAG
ACTATTTCTCTTTGGAAATTGATGAGGAAGGGAACCTGATTGGATTACCCCTTCTGATTGACAACTATGTGCCCCCTTTG
GAGGGACTGCCTATCTTCATTCTTCGACTAGCCACTGAGGTGAATTGGGACGAAGAAAAGGAATGTTTTGAAAGCCTCAG
TAAAGAATGCGCTATGTTCTATTCCATCCGGAAGCAGTACATATCTGAGGAGTCGACCCTCTCAGGCCAGCAGAGTGAAG
TGCCTGGCTCCATTCCAAACTCCTGGAAGTGGACTGTGGAACACATTGTCTATAAAGCCTTGCGCTCACACATTCTGCCT
CCTAAACATTTCACAGAAGATGGAAATATCCTGCAGCTTGCTAACCTGCCTGATCTATACAAAGTCTTTGAGAGGTGTTA
AATATGGTTATTTATGCACTGTGGGATGTGTTCTTCTTTCTCTGTATTCCGATACAAAGTGTTGTATCAAAGTGTGATAT
ACAAAGTGTACCAACATAAGTGTTGGTAGCACTTAAGACTTATACTTGCCTTCTGATAGTATTCCTTTATACACAGTGGA
TTGATTATAAATAAATAGATGTGTCTTAACATAAAAAAAAAAAAAAAAAA
Click on 4292 -> GenBank Sequences -> Click on Nucleotide BC006850
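Once a record such as BC006850 has been saved in FASTA format, its sequence can be checked programmatically. The following is a rough sketch, assuming the FASTA header occupies a single line starting with ">" and all following lines are sequence; `parse_fasta` and `gc_content` are hypothetical helper names, and the example uses only the first sequence line of the record above, not the full mRNA.

```python
def parse_fasta(text):
    """Return a list of (header, sequence) tuples from FASTA-formatted text."""
    records = []
    header = None
    chunks = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # ignore blank lines
        if line.startswith(">"):
            if header is not None:
                records.append((header, "".join(chunks)))
            header = line[1:]  # drop the leading ">"
            chunks = []
        elif header is not None:
            chunks.append(line.upper())
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


def gc_content(seq):
    """Percentage of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq.upper() if base in "GC")
    return 100.0 * gc / len(seq)


# Truncated excerpt: header plus the first sequence line only
example = """>gi|13905125|gb|BC006850.1|BC006850 (excerpt)
GGCACTTCCGTTGAGCATCTAGACGTTTCCTTGGCTCTTCTGGCGCCAAAATGTCGTTCGTGGCAGGGGTTATTCGGCGG
"""
for header, seq in parse_fasta(example):
    print(header, len(seq), round(gc_content(seq), 1))
```

Pasting the complete record in place of the excerpt would report the length and GC content of the full sequence.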
7. Give the DNA sequence of the E. coli mismatch repair gene mutS.
>gi|146905|gb|M64730.1|ECOMUTS Escherichia coli DNA mismatch repair protein (fdv) gene, complete cds
AACTGCAAATTGCCGGACAGATCTGCCTGTCCGGCATACTATTCATGAGGTTTTTTCGGACGATATTTTTCCGGCAGTTC
TGGCACCGGACGCTTGTCATCGATGAGATGACGCACGGTTAAGATCGGATGACGCCACAGCATTCTCGGCCCGGCCCAAC
GCATAATCTGTTTCATCTCTTCACGCTTTGCAGGCTGGTAACAGTGCACCGGACACTGCTTACAGGCTGGTTTCTCTTCG
CCGAACACACATTTATCCAGCCGCTTTTGCGCGTAAACAAACAACGCCTCGTAATGCTCCGGCTCCGCTGACGCCTGCGG
GCATTTCGCTTGATAAAGATCGATCATTTTTTTAATCGTCAGTTTTTCACGAGAGATACGCTTGCCGGACATGCTGCCTC
CACCTCATTAAGATGTATTTATATTACATCTTAATCTTAAAGGGCACTATGACTCCAAAGAAGAAGGGTTAGCCAACCGA
TACAATTTTGCGTACTTGCTTCATAAGCATCACGCAAAAGCTGCAAAACAGCATCTTTCCCGGAACCAGCATCAAGAACT
CGCCGTTCGCTTCTTCCCCTGAAATGATTAACTCCGGTATCATGTGCGCCTTATGTGATTACAACGAAAATAAAAACCAT
CACACCCCATTTAATATCAGGGAACCGGACATAACCCCATGAGTGCAATAGAAAATTTCGACGCCCATACGCCCATGATG
CAGCAGTATCTCAGGCTGAAAGCCCAGCATCCCGAGATCCTGCTGTTTTACCGGATGGGTGATTTTTATGAACTGTTTTA
TGACGACGCAAAACGCGCGTCGCAACTGCTGGATATTTCACTGACCAAACGCGGTGCTTCGGCGGGAGAGCCGATCCCGA
TGGCGGGGATTCCCTACCATGCGGTGGAAAACTATCTCGCCAAACTGGTGAATCAGGGAGAGTCCGTTGCCATCTGCGAA
CAAATTGGCGATCCGGCGACCAGCAAAGGTCCGGTTGAGCGCAAAGTTGTGCGTATCGTTACGCCAGGCACCATCAGCGA
TGAAGCCCTGTTGCAGGAGCGTCAGGACAACCTGCTGGCGGCTATCTGGCAGGACAGCAAAGGTTTCGGCTACGCGACGC
TGGATATCAGTTCCGGGCGTTTTCGCCTGAGCGAACCGGCTGACCGCGAAACGATGGCGGCAGAACTGCAACGCACTAAT
CCTGCGGAACTGCTGTATGCAGAAGATTTTGCTGAAATGTCGTTAATTGAAGGCCGTCGCGGCCTGCGCCGTCGCCCGCT
GTGGGAGTTTGAAATCGACACCGCGCGCCAGCAGTTGAATCTGCAATTTGGGACCCGCGATCTGGTCGGTTTTGGCGTCG
AGAACGCGCCGCGCGGACTTTGTGCTGCCGGTTGTCTGTTGCAGTATGCGAAAGATACCCAACGTACGACTCTGCCGCAT
ATTCGTTCCATCACCATGGAACGTGAGCAGGACAGCATCATTATGGATGCCGCGACGCGTCGTAATCTGGAAATCACCCA
GAACCTGGCGGGTGGTGCGGAAAATACGCTGGCTTCTGTGCTCGACTGCACCGTCACGCCGATGGGCAGCCGTATGCTGA
AACGCTGGCTGCATATGCCAGTGCGCGATACCCGCGTGTTGCTTGAGCGCCAGCAAACTATTGGCGCATTGCAGGATTTC
ACCGCCGGGCTACAGCCGGTACTGCGTCAGGTCGGCGACCTGGAACGTATTCTGGCACGTCTGGCTTTACGAACTGCTCG
CCCACGCGATCTGGCCCGTATGCGCCACGCTTTCCAGCAACTGCCGGAGCTGCGTGCGCAGTTAGAAACTGTCGATAGTG
CACCGGTACAGGCGCTACGTGAGAAGATGGGCGAGTTTGCCGAGCTGCGCGATCTGCTGGAGCGAGCAATCATCGACACA
CCGCCGGTGCTGGTACGCGACGGTGGTGTTATCGCATCGGGCTATAACGAAGAGCTGGATGAGTGGCGCGCGCTGGCTGA
CGGCGCGACCGATTATCTGGAGCGTCTGGAAGTCCGCGAGCGTGAACGTACCGGCCTGGACACGCTGAAAGTTGGCTTTA
ATGCGGTGCACGGCTACTACATTCAAATCAGCCGTGGGCAAAGCCATCTGGCACCCATCAACTACATGCGTCGCCAGACG
CTGAAAAACGCCGAGCGCTACATCATTCCAGAGCTAAAAGAGTACGAAGATAAAGTTCTCACCTCAAAAGGCAAAGCACT
GGCACTGGAAAAACAGCTTTATGAAGAGCTGTTCGACCTGCTGTTGCCGCATCTGGAAGCGTTGCAACAGAGCGCGAGCG
CGCTGGCGGAACTCGACGTGCTGGTTAACCTGGCGGAACGGGCCTATACCCTGAACTACACCTGCCCGACCTTCATTGAT
AAACCGGGCATTCGCATTACCGAAGGTCGCCATCCGGTAGTTGAACAAGTACTGAATGAGCCATTTATCGCCAACCCGCT
GAATCTGTCGCCGCAGCGCCGCATGTTGATCATCACCGGTCCGAACATGGGCGGTAAAAGTACCTATATGCGCCAGACCG
CACTGATTGCGCTGATGGCCTACATCGGCAGCTATGTACCGGCACAAAAAGTCGAGATTGGACCTATCGATCGCATCTTT
ACCCGCGTAGGCGCGGCAGATGACCTGGCGTCCGGGCGCTCAACCTTTATGGTGGAGATGACTGAAACCGCCAATATTTT
ACATAACGCCACCGAATACAGTCTGGTGTTAATGGATGAGATCGGGCGTGGAACGTCCACCTACGATGGTCTGTCGCTGG
CGTGGGCGTGCGCGGAAAATCTGGCGAATAAGATTAAGGCATTGACGTTATTTGCTACCCACTATTTCGAGCTGACCCAG
TTACCGGAGAAAATGGAAGGCGTCGCTAACGTGCATCTCGATGCACTGGAGCACGGCGACACCATTGCCTTTATGCACAG
CGTGCAGGATGGCGCGGCGAGCAAAAGCTACGGCCTGGCGGTTGCAGCTCTGGCAGGCGTGCCAAAAGAGGTTATTAAGC
GCGCACGGCAAAAGCTGCGTGAGCTGGAAAGCATTTCGCCGAACGCCGCCGCTACGCAAGTGGATGGTACGCAAATGTCT
TTGCTGTCAGTACCAGAAGAAACTTCGCCTGCGGTCGAAGCTCTGGAAAATCTTGATCCGGATTCACTCACCCCGCGTCA
GGCGCTGGAGTGGATTTATCGCTTGAAGAGCCTGGTGTAATAACAATTCCCGATAGTCTTTTGCTATCGGGAATATTAAC
GACAACTGACGAATAAAATAAAAACACCCTGTATAATAGGAAAGCTT
Entrez -> Search for the mutS gene (by keyword) -> page 3 -> M64730
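As an alternative to paging through keyword results, a record can be requested directly by accession. The sketch below only builds a request URL for NCBI's E-utilities `efetch` service and does not contact the network; using E-utilities here is an assumption on top of the click path above, and `build_efetch_url` is a hypothetical helper name.

```python
from urllib.parse import urlencode

# Base address of the NCBI E-utilities efetch service
EFETCH_BASE = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def build_efetch_url(accession, db="nucleotide", rettype="fasta"):
    """Build an efetch URL that returns the record as plain text."""
    params = {
        "db": db,            # Entrez database to query
        "id": accession,     # accession number, e.g. M64730
        "rettype": rettype,  # requested record format
        "retmode": "text",   # plain text rather than XML
    }
    return EFETCH_BASE + "?" + urlencode(params)


print(build_efetch_url("M64730"))
```

Opening the printed URL in a browser, or passing it to `urllib.request.urlopen`, should return the record in FASTA format, provided the service is reachable.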
PCL 2001/11/24