HOMEWORK#4

                                            Retrieving DNA sequence / GenBank

                                                            due on 12/10

1. Sho-Hua cloned a gene in the lab. The DNA sequence is listed here:

AATTAATTCTCTCAATCAAATCCAATTTTCTCCCTATAAAAACCCTAAGGTCCTATA

GTGTTCTATATCCAACACTAGCTCCTACTCCCTAAAGCATTTATTATATCTTCCCCT

AGCTAGATACTTCATTCCACAAATAGTTTGCAGCTTTTTCTTTTCCTCTAAAACAAT

GGAAATAGCTGGGAAAATTGCATGCTTTGTGGTATTGTGCATGGTGGTAGCTGCACC

CTGCGCAGAAGCCATAACCTGTGGCCAGGTTACGTCGAATTTGGCACCTTGTCTTGC

TTATCTTAGAAACACGGGGCCTCTGGGACGTTGTTGCGGTGGCGTTAAGGCTCTGGT

GAATTCTGCAAGGACCACAGAAGATCGTCAAATTGCATGCACTTGCCTGAAATCAGC

TGCAGGTGCTATTTCTGGAATCAATTTGGGCAAAGCTGCTGGTCTCCCTAGTACTTG

TGGTGTCAATATTCCTTACAAGATCAGCCCTTCCACTGACTGCTCCAAGTACCTCAC

TTTTTTTCTCTCTCATGCTATTCTTATCCTTATATTCTATCTGCTTCATTTTCGCTT

ATCTTTTAAATTTTTTATTCGGAATCTTTATACCA

Please help him to identify the gene and its protein sequence.

N.tabacum ltp1 gene

protein sequence:

M E I A G K I A C F V V L C M V V A A P C A E A I T C G Q V T S N L A P C
L A Y L R N T G P L G R C C G G V K A L V N S A R T T E D R Q I A C T
C L K S A A G A I S G I N L G K A A G L P S T C G V N I P Y K I S P S T D
C S K Y L T F F L S H A I L I L I F Y L L H F R L S F K F F I R N L Y T


2. How many nucleotide and protein sequence of Lycopersicon
     esculentum were know?
    Please find its class II small heat shock protein mRNA,
     complete cds.

814 nucleotide sequences

1088 protein sequences

Its class II small heat shock protein mRNA, complete cds :

Base count 229a 117c 183g 209t

origin:
1 tacggctgcg agaagacgac agaaggggac tgcaattaca aatcaaacca aaattgacaa
61 atttcacgca caaaatcaca atatccaaaa atttctcaat actgaaaatg gatttgaggt
121 tgttgggtat cgataacaca ccactcttcc acactctcca ccatatgatg gaagctgccg
181 gtgaagattc cgacaagtct gtcaatgcac catcaaggaa ctatgttcgt gatgctaagg
241 ccatggctgc tacaccagcg gatgtgaagg agtatcctaa ttcgtatgtt tttgttgtgg
301 atatgccagg gttgaaatct ggagatatca aagtgcaggt ggaagaagac aatgtgctgt
361 tgattagtgg tgaaaggaag agggaagaag agaaagaagg tgcaaagttt attaggatgg
421 agagaagggt tgggaaattc atgaggaagt ttagtctgcc agagaatgcg aatactgatg
481 caatttctgc agtttgtcaa gatggagttc tgactgttac tgttcagaaa ttgcctcctc
541 ctgagccaaa gaaacccaaa acaattgagg tgaaagttgc ttgaagttat ggactctgtt
601 ttgatggttt gtggtatgat gtagtagaaa taaagttgta ggagtagtga acttttcctt
661 tcatctttct gctatgtttt cacgtctgtt tgaatgttac aatagccatg ggtattgttt
721 gttttgatgc caaaaaaa