Key for assignment 3


1. How many hits do you get when you search for genes associated with colon cancer in the human genome?

Search results for query "colon cancer" -> 17 hits

2. How many loci will you find if you search LocusLink for human in GenBank?

21 loci found

3. Give the locus ID and position of MLH1.

LocusID -> 4292

Position  -> 3p21.3

 

4. Find the %ID of the nucleotide sequence for the possible mouse orthologs of MLH1.

 90.5%

Click on the HomoloGene data of 4292 MLH1 -> Click "more" of Hs.57301
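
For reference, a %ID value like the one above can be recomputed by hand from a pairwise alignment. The sketch below is only an illustration: it assumes %ID means matching positions divided by aligned positions with no gap in either sequence, which may not be exactly how HomoloGene computes its figure. The function name and the toy sequences are made up for the example.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns where neither sequence has a gap.

    Assumption: gapped columns ('-') are ignored rather than counted as
    mismatches. Other tools may define %ID differently.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # skip columns containing a gap
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Toy aligned sequences (hypothetical, not real MLH1 data): 9 of 10 match.
print(percent_identity("ATGTCGTTCG", "ATGTCTTTCG"))
```

The input must already be aligned (equal length, gaps written as '-'); the function does not perform the alignment itself.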

5. Find the total number of mutations of MLH1 reported in human gene mutation database.

147 (now 149)

Click on 4292 -> Click on HGMD (Human Gene Mutation Database)

6. Give the DNA sequence of MLH1.

>gi|13905125|gb|BC006850.1|BC006850 Homo sapiens, mutL (E. coli) homolog 1 (colon cancer, nonpolyposis type 2), clone MGC:5172 IMAGE:3451538, mRNA, complete cds
GGCACTTCCGTTGAGCATCTAGACGTTTCCTTGGCTCTTCTGGCGCCAAAATGTCGTTCGTGGCAGGGGTTATTCGGCGG
CTGGACGAGACAGTGGTGAACCGCATCGCGGCGGGGGAAGTTATCCAGCGGCCAGCTAATGCTATCAAAGAGATGATTGA
GAACTGTTTAGATGCAAAATCCACAAGTATTCAAGTGATTGTTAAAGAGGGAGGCCTGAAGTTGATTCAGATCCAAGACA
ATGGCACCGGGATCAGGAAAGAAGATCTGGATATTGTATGTGAAAGGTTCACTACTAGTAAACTGCAGTCCTTTGAGGAT
TTAGCCAGTATTTCTACCTATGGCTTTCGAGGTGAGGCTTTGGCCAGCATAAGCCATGTGGCTCATGTTACTATTACAAC
GAAAACAGCTGATGGAAAGTGTGCATACAGAGCAAGTTACTCAGATGGAAAACTGAAAGCCCCTCCTAAACCATGTGCTG
GCAATCAAGGGACCCAGATCACGGTGGAGGACCTTTTTTACAACATAGCCACGAGGAGAAAAGCTTTAAAAAATCCAAGT
GAAGAATATGGGAAAATTTTGGAAGTTGTTGGCAGGTATTCAGTACACAATGCAGGCATTAGTTTCTCAGTTAAAAAACA
AGGAGAGACAGTAGCTGATGTTAGGACACTACCCAATGCCTCAACCGTGGACAATATTCGCTCCATCTTTGGAAATGCTG
TTAGTCGAGAACTGATAGAAATTGGATGTGAGGATAAAACCCTAGCCTTCAAAATGAATGGTTACATATCCAATGCAAAC
TACTCAGTGAAGAAGTGCATCTTCTTACTCTTCATCAACCATCGTCTGGTAGAATCAACTTCCTTGAGAAAAGCCATAGA
AACAGTGTATGCAGCCTATTTGCCCAAAAACACACACCCATTCCTGTACCTCAGTTTAGAAATCAGTCCCCAGAATGTGG
ATGTTAATGTGCACCCCACAAAGCATGAAGTTCACTTCCTGCACGAGGAGAGCATCCTGGAGCGGGTGCAGCAGCACATC
GAGAGCAAGCTCCTGGGCTCCAATTCCTCCAGGATGTACTTCACCCAGACTTTGCTACCAGGACTTGCTGGCCCCTCTGG
GGAGATGGTTAAATCCACAACAAGTCTGACCTCGTCTTCTACTTCTGGAAGTAGTGATAAGGTCTATGCCCACCAGATGG
TTCGTACAGATTCCCGGGAACAGAAGCTTGATGCATTTCTGCAGCCTCTGAGCAAACCCCTGTCCAGTCAGCCCCAGGCC
ATTGTCACAGAGGATAAGACAGATATTTCTAGTGGCAGGGCTAGGCAGCAAGATGAGGAGATGCTTGAACTCCCAGCCCC
TGCTGAAGTGGCTGCCAAAAATCAGAGCTTGGAGGGGGATACAACAAAGGGGACTTCAGAAATGTCAGAGAAGAGAGGAC
CTACTTCCAGCAACCCCAGAAAGAGACATCGGGAAGATTCTGATGTGGAAATGGTGGAAGATGATTCCCGAAAGGAAATG
ACTGCAGCTTGTACCCCCCGGAGAAGGATCATTAACCTCACTAGTGTTTTGAGTCTCCAGGAAGAAATTAATGAGCAGGG
ACATGAGGTTCTCCGGGAGATGTTGCATAACCACTCCTTCGTGGGCTGTGTGAATCCTCAGTGGGCCTTGGCACAGCATC
AAACCAAGTTATACCTTCTCAACACCACCAAGCTTAGTGAAGAACTGTTCTACCAGATACTCATTTATGATTTTGCCAAT
TTTGGTGTTCTCAGGTTATCGGAGCCAGCACCGCTCTTTGACCTTGCCATGCTTGCCTTAGATAGTCCAGAGAGTGGCTG
GACAGAGGAAGATGGTCCCAAAGAAGGACTTGCTGAATACATTGTTGAGTTTCTGAAGAAGAAGGCTGAGATGCTTGCAG
ACTATTTCTCTTTGGAAATTGATGAGGAAGGGAACCTGATTGGATTACCCCTTCTGATTGACAACTATGTGCCCCCTTTG
GAGGGACTGCCTATCTTCATTCTTCGACTAGCCACTGAGGTGAATTGGGACGAAGAAAAGGAATGTTTTGAAAGCCTCAG
TAAAGAATGCGCTATGTTCTATTCCATCCGGAAGCAGTACATATCTGAGGAGTCGACCCTCTCAGGCCAGCAGAGTGAAG
TGCCTGGCTCCATTCCAAACTCCTGGAAGTGGACTGTGGAACACATTGTCTATAAAGCCTTGCGCTCACACATTCTGCCT
CCTAAACATTTCACAGAAGATGGAAATATCCTGCAGCTTGCTAACCTGCCTGATCTATACAAAGTCTTTGAGAGGTGTTA
AATATGGTTATTTATGCACTGTGGGATGTGTTCTTCTTTCTCTGTATTCCGATACAAAGTGTTGTATCAAAGTGTGATAT
ACAAAGTGTACCAACATAAGTGTTGGTAGCACTTAAGACTTATACTTGCCTTCTGATAGTATTCCTTTATACACAGTGGA
TTGATTATAAATAAATAGATGTGTCTTAACATAAAAAAAAAAAAAAAAAA
 

Click on 4292 -> GenBank Sequences -> Click on Nucleotide BC006850
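
A downloaded FASTA record like the one above can be sanity-checked with a few lines of code. This is a minimal sketch using only the standard library; the function names are hypothetical, and the header parsing assumes the "gi|number|gb|accession|..." layout seen in this key. The example record is a shortened header plus a truncated excerpt of the sequence, not the full entry.

```python
def parse_fasta(text):
    """Return a list of (header, sequence) tuples from FASTA-formatted text."""
    records = []
    header = None
    chunks = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # ignore blank lines
        if line.startswith(">"):
            if header is not None:
                records.append((header, "".join(chunks)))
            header = line[1:]
            chunks = []
        elif header is not None:
            chunks.append(line.upper())
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


def accession_from_header(header):
    """Pull the versioned accession out of a 'gi|...|gb|ACCESSION|...' header."""
    fields = header.split("|")
    # Assumed layout: 'gi', GI number, database tag, accession.version, description
    if len(fields) >= 4 and fields[0] == "gi":
        return fields[3]
    # Fallback: treat the first whitespace-delimited token as the identifier
    return header.split()[0]


# Shortened header and truncated excerpt of the BC006850 record (illustration only).
example = """>gi|13905125|gb|BC006850.1|BC006850 Homo sapiens, mutL (E. coli) homolog 1
GGCACTTCCGTTGAGCATCTAGACGTTTCC
TTGGCTCTTCTGGCGCCAAAATGTCGTTCG
"""

for header, seq in parse_fasta(example):
    print(accession_from_header(header), len(seq))
```

The same two functions work on the mutS record (M64730) if its text is pasted in place of the example.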

 

7. Give the DNA sequence of E. coli mismatch repair gene mutS.

>gi|146905|gb|M64730.1|ECOMUTS Escherichia coli DNA mismatch repair protein (fdv) gene, complete cds
AACTGCAAATTGCCGGACAGATCTGCCTGTCCGGCATACTATTCATGAGGTTTTTTCGGACGATATTTTTCCGGCAGTTC
TGGCACCGGACGCTTGTCATCGATGAGATGACGCACGGTTAAGATCGGATGACGCCACAGCATTCTCGGCCCGGCCCAAC
GCATAATCTGTTTCATCTCTTCACGCTTTGCAGGCTGGTAACAGTGCACCGGACACTGCTTACAGGCTGGTTTCTCTTCG
CCGAACACACATTTATCCAGCCGCTTTTGCGCGTAAACAAACAACGCCTCGTAATGCTCCGGCTCCGCTGACGCCTGCGG
GCATTTCGCTTGATAAAGATCGATCATTTTTTTAATCGTCAGTTTTTCACGAGAGATACGCTTGCCGGACATGCTGCCTC
CACCTCATTAAGATGTATTTATATTACATCTTAATCTTAAAGGGCACTATGACTCCAAAGAAGAAGGGTTAGCCAACCGA
TACAATTTTGCGTACTTGCTTCATAAGCATCACGCAAAAGCTGCAAAACAGCATCTTTCCCGGAACCAGCATCAAGAACT
CGCCGTTCGCTTCTTCCCCTGAAATGATTAACTCCGGTATCATGTGCGCCTTATGTGATTACAACGAAAATAAAAACCAT
CACACCCCATTTAATATCAGGGAACCGGACATAACCCCATGAGTGCAATAGAAAATTTCGACGCCCATACGCCCATGATG
CAGCAGTATCTCAGGCTGAAAGCCCAGCATCCCGAGATCCTGCTGTTTTACCGGATGGGTGATTTTTATGAACTGTTTTA
TGACGACGCAAAACGCGCGTCGCAACTGCTGGATATTTCACTGACCAAACGCGGTGCTTCGGCGGGAGAGCCGATCCCGA
TGGCGGGGATTCCCTACCATGCGGTGGAAAACTATCTCGCCAAACTGGTGAATCAGGGAGAGTCCGTTGCCATCTGCGAA
CAAATTGGCGATCCGGCGACCAGCAAAGGTCCGGTTGAGCGCAAAGTTGTGCGTATCGTTACGCCAGGCACCATCAGCGA
TGAAGCCCTGTTGCAGGAGCGTCAGGACAACCTGCTGGCGGCTATCTGGCAGGACAGCAAAGGTTTCGGCTACGCGACGC
TGGATATCAGTTCCGGGCGTTTTCGCCTGAGCGAACCGGCTGACCGCGAAACGATGGCGGCAGAACTGCAACGCACTAAT
CCTGCGGAACTGCTGTATGCAGAAGATTTTGCTGAAATGTCGTTAATTGAAGGCCGTCGCGGCCTGCGCCGTCGCCCGCT
GTGGGAGTTTGAAATCGACACCGCGCGCCAGCAGTTGAATCTGCAATTTGGGACCCGCGATCTGGTCGGTTTTGGCGTCG
AGAACGCGCCGCGCGGACTTTGTGCTGCCGGTTGTCTGTTGCAGTATGCGAAAGATACCCAACGTACGACTCTGCCGCAT
ATTCGTTCCATCACCATGGAACGTGAGCAGGACAGCATCATTATGGATGCCGCGACGCGTCGTAATCTGGAAATCACCCA
GAACCTGGCGGGTGGTGCGGAAAATACGCTGGCTTCTGTGCTCGACTGCACCGTCACGCCGATGGGCAGCCGTATGCTGA
AACGCTGGCTGCATATGCCAGTGCGCGATACCCGCGTGTTGCTTGAGCGCCAGCAAACTATTGGCGCATTGCAGGATTTC
ACCGCCGGGCTACAGCCGGTACTGCGTCAGGTCGGCGACCTGGAACGTATTCTGGCACGTCTGGCTTTACGAACTGCTCG
CCCACGCGATCTGGCCCGTATGCGCCACGCTTTCCAGCAACTGCCGGAGCTGCGTGCGCAGTTAGAAACTGTCGATAGTG
CACCGGTACAGGCGCTACGTGAGAAGATGGGCGAGTTTGCCGAGCTGCGCGATCTGCTGGAGCGAGCAATCATCGACACA
CCGCCGGTGCTGGTACGCGACGGTGGTGTTATCGCATCGGGCTATAACGAAGAGCTGGATGAGTGGCGCGCGCTGGCTGA
CGGCGCGACCGATTATCTGGAGCGTCTGGAAGTCCGCGAGCGTGAACGTACCGGCCTGGACACGCTGAAAGTTGGCTTTA
ATGCGGTGCACGGCTACTACATTCAAATCAGCCGTGGGCAAAGCCATCTGGCACCCATCAACTACATGCGTCGCCAGACG
CTGAAAAACGCCGAGCGCTACATCATTCCAGAGCTAAAAGAGTACGAAGATAAAGTTCTCACCTCAAAAGGCAAAGCACT
GGCACTGGAAAAACAGCTTTATGAAGAGCTGTTCGACCTGCTGTTGCCGCATCTGGAAGCGTTGCAACAGAGCGCGAGCG
CGCTGGCGGAACTCGACGTGCTGGTTAACCTGGCGGAACGGGCCTATACCCTGAACTACACCTGCCCGACCTTCATTGAT
AAACCGGGCATTCGCATTACCGAAGGTCGCCATCCGGTAGTTGAACAAGTACTGAATGAGCCATTTATCGCCAACCCGCT
GAATCTGTCGCCGCAGCGCCGCATGTTGATCATCACCGGTCCGAACATGGGCGGTAAAAGTACCTATATGCGCCAGACCG
CACTGATTGCGCTGATGGCCTACATCGGCAGCTATGTACCGGCACAAAAAGTCGAGATTGGACCTATCGATCGCATCTTT
ACCCGCGTAGGCGCGGCAGATGACCTGGCGTCCGGGCGCTCAACCTTTATGGTGGAGATGACTGAAACCGCCAATATTTT
ACATAACGCCACCGAATACAGTCTGGTGTTAATGGATGAGATCGGGCGTGGAACGTCCACCTACGATGGTCTGTCGCTGG
CGTGGGCGTGCGCGGAAAATCTGGCGAATAAGATTAAGGCATTGACGTTATTTGCTACCCACTATTTCGAGCTGACCCAG
TTACCGGAGAAAATGGAAGGCGTCGCTAACGTGCATCTCGATGCACTGGAGCACGGCGACACCATTGCCTTTATGCACAG
CGTGCAGGATGGCGCGGCGAGCAAAAGCTACGGCCTGGCGGTTGCAGCTCTGGCAGGCGTGCCAAAAGAGGTTATTAAGC
GCGCACGGCAAAAGCTGCGTGAGCTGGAAAGCATTTCGCCGAACGCCGCCGCTACGCAAGTGGATGGTACGCAAATGTCT
TTGCTGTCAGTACCAGAAGAAACTTCGCCTGCGGTCGAAGCTCTGGAAAATCTTGATCCGGATTCACTCACCCCGCGTCA
GGCGCTGGAGTGGATTTATCGCTTGAAGAGCCTGGTGTAATAACAATTCCCGATAGTCTTTTGCTATCGGGAATATTAAC
GACAACTGACGAATAAAATAAAAACACCCTGTATAATAGGAAAGCTT
 

Entrez -> Search mutS gene (by keyword) -> page 3 -> M64730



PCL 2001/11/24