720 a
535 c
594 g
635 t
ORIGIN
1 cttggctctt ctggcgccaa aatgtcgttc gtggcagggg ttattcggcg gctggacgag
61 acagtggtga accgcatcgc ggcgggggaa gttatccagc ggccagctaa tgctatcaaa
121 gagatgattg agaactgttt agatgcaaaa tccacaagta ttcaagtgat tgttaaagag
181 ggaggcctga agttgattca gatccaagac aatggcaccg ggatcaggaa agaagatctg
241 gatattgtat gtgaaaggtt cactactagt aaactgcagt cctttgagga tttagccagt
301 atttctacct atggctttcg aggtgaggct ttggccagca taagccatgt ggctcatgtt
361 actattacaa cgaaaacagc tgatggaaag tgtgcataca gagcaagtta ctcagatgga
421 aaactgaaag cccctcctaa accatgtgct ggcaatcaag ggacccagat cacggtggag
481 gacctttttt acaacatagc cacgaggaga aaagctttaa aaaatccaag tgaagaatat
541 gggaaaattt tggaagttgt tggcaggtat tcagtacaca atgcaggcat tagtttctca
601 gttaaaaaac aaggagagac agtagctgat gttaggacac tacccaatgc ctcaaccgtg
661 gacaatattc gctccatctt tggaaatgct gttagtcgag aactgataga aattggatgt
721 gaggataaaa ccctagcctt caaaatgaat ggttacatat ccaatgcaaa ctactcagtg
781 aagaagtgca tcttcttact cttcatcaac catcgtctgg tagaatcaac ttccttgaga
841 aaagccatag aaacagtgta tgcagcctat ttgcccaaaa acacacaccc attcctgtac
901 ctcagtttag aaatcagtcc ccagaatgtg gatgttaatg tgcaccccac aaagcatgaa
961 gttcacttcc tgcacgagga gagcatcctg gagcgggtgc agcagcacat cgagagcaag
1021 ctcctgggct ccaattcctc caggatgtac ttcacccaga ctttgctacc aggacttgct
1081 ggcccctctg gggagatggt taaatccaca acaagtctga cctcgtcttc tacttctgga
1141 agtagtgata aggtctatgc ccaccagatg gttcgtacag attcccggga acagaagctt
1201 gatgcatttc tgcagcctct gagcaaaccc ctgtccagtc agccccaggc cattgtcaca
1261 gaggataaga cagatatttc tagtggcagg gctaggcagc aagatgagga gatgcttgaa
1321 ctcccagccc ctgctgaagt ggctgccaaa aatcagagct tggaggggga tacaacaaag
1381 gggacttcag aaatgtcaga gaagagagga cctacttcca gcaaccccag aaagagacat
1441 cgggaagatt ctgatgtgga aatggtggaa gatgattccc gaaaggaaat gactgcagct
1501 tgtacccccc ggagaaggat cattaacctc actagtgttt tgagtctcca ggaagaaatt
1561 aatgagcagg gacatgaggt tctccgggag atgttgcata accactcctt cgtgggctgt
1621 gtgaatcctc agtgggcctt ggcacagcat caaaccaagt tataccttct caacaccacc
1681 aagcttagtg aagaactgtt ctaccagata ctcatttatg attttgccaa ttttggtgtt
1741 ctcaggttat cggagccagc accgctcttt gaccttgcca tgcttgcctt agatagtcca
1801 gagagtggct ggacagagga agatggtccc aaagaaggac ttgctgaata cattgttgag
1861 tttctgaaga agaaggctga gatgcttgca gactatttct ctttggaaat tgatgaggaa
1921 gggaacctga ttggattacc ccttctgatt gacaactatg tgcccccttt ggagggactg
1981 cctatcttca ttcttcgact agccactgag gtgaattggg acgaagaaaa ggaatgtttt
2041 gaaagcctca gtaaagaatg cgctatgttc tattccatcc ggaagcagta catatctgag
2101 gagtcgaccc tctcaggcca gcagagtgaa gtgcctggct ccattccaaa ctcctggaag
2161 tggactgtgg aacacattgt ctataaagcc ttgcgctcac acattctgcc tcctaaacat
2221 ttcacagaag atggaaatat cctgcagctt gctaacctgc ctgatctata caaagtcttt
2281 gagaggtgtt aaatatggtt atttatgcac tgtgggatgt gttcttcttt ctctgtattc
2341 cgatacaaag tgttgtatca aagtgtgata tacaaagtgt accaacataa gtgttggtag
2401 cacttaagac ttatacttgc cttctgatag tattccttta tacacagtgg attgattata
2461 aataaataga tgtgtcttaa cata
802 a
892 c
896 g
737 t
ORIGIN

1 aactgcaaat tgccggacag atctgcctgt ccggcatact attcatgagg ttttttcgga
61 cgatattttt ccggcagttc tggcaccgga cgcttgtcat cgatgagatg acgcacggtt
121 aagatcggat gacgccacag cattctcggc ccggcccaac gcataatctg tttcatctct
181 tcacgctttg caggctggta acagtgcacc ggacactgct tacaggctgg tttctcttcg
241 ccgaacacac atttatccag ccgcttttgc gcgtaaacaa acaacgcctc gtaatgctcc
301 ggctccgctg acgcctgcgg gcatttcgct tgataaagat cgatcatttt tttaatcgtc
361 agtttttcac gagagatacg cttgccggac atgctgcctc cacctcatta agatgtattt
421 atattacatc ttaatcttaa agggcactat gactccaaag aagaagggtt agccaaccga
481 tacaattttg cgtacttgct tcataagcat cacgcaaaag ctgcaaaaca gcatctttcc
541 cggaaccagc atcaagaact cgccgttcgc ttcttcccct gaaatgatta actccggtat
601 catgtgcgcc ttatgtgatt acaacgaaaa taaaaaccat cacaccccat ttaatatcag
661 ggaaccggac ataaccccat gagtgcaata gaaaatttcg acgcccatac gcccatgatg
721 cagcagtatc tcaggctgaa agcccagcat cccgagatcc tgctgtttta ccggatgggt
781 gatttttatg aactgtttta tgacgacgca aaacgcgcgt cgcaactgct ggatatttca
841 ctgaccaaac gcggtgcttc ggcgggagag ccgatcccga tggcggggat tccctaccat
901 gcggtggaaa actatctcgc caaactggtg aatcagggag agtccgttgc catctgcgaa
961 caaattggcg atccggcgac cagcaaaggt ccggttgagc gcaaagttgt gcgtatcgtt
1021 acgccaggca ccatcagcga tgaagccctg ttgcaggagc gtcaggacaa cctgctggcg
1081 gctatctggc aggacagcaa aggtttcggc tacgcgacgc tggatatcag ttccgggcgt
1141 tttcgcctga gcgaaccggc tgaccgcgaa acgatggcgg cagaactgca acgcactaat
1201 cctgcggaac tgctgtatgc agaagatttt gctgaaatgt cgttaattga aggccgtcgc
1261 ggcctgcgcc gtcgcccgct gtgggagttt gaaatcgaca ccgcgcgcca gcagttgaat
1321 ctgcaatttg ggacccgcga tctggtcggt tttggcgtcg agaacgcgcc gcgcggactt
1381 tgtgctgccg gttgtctgtt gcagtatgcg aaagataccc aacgtacgac tctgccgcat
1441 attcgttcca tcaccatgga acgtgagcag gacagcatca ttatggatgc cgcgacgcgt
1501 cgtaatctgg aaatcaccca gaacctggcg ggtggtgcgg aaaatacgct ggcttctgtg
1561 ctcgactgca ccgtcacgcc gatgggcagc cgtatgctga aacgctggct gcatatgcca
1621 gtgcgcgata cccgcgtgtt gcttgagcgc cagcaaacta ttggcgcatt gcaggatttc
1681 accgccgggc tacagccggt actgcgtcag gtcggcgacc tggaacgtat tctggcacgt
1741 ctggctttac gaactgctcg cccacgcgat ctggcccgta tgcgccacgc tttccagcaa
1801 ctgccggagc tgcgtgcgca gttagaaact gtcgatagtg caccggtaca ggcgctacgt
1861 gagaagatgg gcgagtttgc cgagctgcgc gatctgctgg agcgagcaat catcgacaca
1921 ccgccggtgc tggtacgcga cggtggtgtt atcgcatcgg gctataacga agagctggat
1981 gagtggcgcg cgctggctga cggcgcgacc gattatctgg agcgtctgga agtccgcgag
2041 cgtgaacgta ccggcctgga cacgctgaaa gttggcttta atgcggtgca cggctactac
2101 attcaaatca gccgtgggca aagccatctg gcacccatca actacatgcg tcgccagacg
2161 ctgaaaaacg ccgagcgcta catcattcca gagctaaaag agtacgaaga taaagttctc
2221 acctcaaaag gcaaagcact ggcactggaa aaacagcttt atgaagagct gttcgacctg
2281 ctgttgccgc atctggaagc gttgcaacag agcgcgagcg cgctggcgga actcgacgtg
2341 ctggttaacc tggcggaacg ggcctatacc ctgaactaca cctgcccgac cttcattgat
2401 aaaccgggca ttcgcattac cgaaggtcgc catccggtag ttgaacaagt actgaatgag
2461 ccatttatcg ccaacccgct gaatctgtcg ccgcagcgcc gcatgttgat catcaccggt
2521 ccgaacatgg gcggtaaaag tacctatatg cgccagaccg cactgattgc gctgatggcc
2581 tacatcggca gctatgtacc ggcacaaaaa gtcgagattg gacctatcga tcgcatcttt
2641 acccgcgtag gcgcggcaga tgacctggcg tccgggcgct caacctttat ggtggagatg
2701 actgaaaccg ccaatatttt acataacgcc accgaataca gtctggtgtt aatggatgag
2761 atcgggcgtg gaacgtccac ctacgatggt ctgtcgctgg cgtgggcgtg cgcggaaaat
2821 ctggcgaata agattaaggc attgacgtta tttgctaccc actatttcga gctgacccag
2881 ttaccggaga aaatggaagg cgtcgctaac gtgcatctcg atgcactgga gcacggcgac
2941 accattgcct ttatgcacag cgtgcaggat ggcgcggcga gcaaaagcta cggcctggcg
3001 gttgcagctc tggcaggcgt gccaaaagag gttattaagc gcgcacggca aaagctgcgt
3061 gagctggaaa gcatttcgcc gaacgccgcc gctacgcaag tggatggtac gcaaatgtct
3121 ttgctgtcag taccagaaga aacttcgcct gcggtcgaag ctctggaaaa tcttgatccg
3181 gattcactca ccccgcgtca ggcgctggag tggatttatc gcttgaagag cctggtgtaa
3241 taacaattcc cgatagtctt ttgctatcgg gaatattaac gacaactgac gaataaaata
3301 aaaacaccct gtataatagg aaagctt

 

Assignment 3

Search genes associated with colon cancer in human genome.

MLH1 was identified as a locus frequently mutated in hereditary nonpolyposis colon cancer (HNPCC). When cloned, it was discovered to be a human homolog of the E. coli mismatch repair gene mutS.

1. How many hits will you get if you search genes associated with colon cancer in human genome?

Ans : 2020

2. How many loci will you find if you search locus link for human in Genebank?

Ans : 21

3. Give the locus ID and position of MLH1.

Ans : Locus ID : 4292 . Position : 3p21.3

4. Find the %ID of nucleotide sequence for its possible orthologs in mouse.

Ans : 90.5 %

5. Find the total number of mutations of MLH1 reported in human gene mutation database.

Ans : 147

6. Give the DNA sequence of MLH1.

Ans : BASE COUNT



 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

7. Give the DNA sequence of E. coli mismatch repair gene mutS.

Ans : BASE COUNT


 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

[Answer]