HOMEWORK #4
Retrieving DNA sequence / GenBank
due on 12/10
1. Sho-Hua cloned a gene in the lab. The DNA sequence is
listed here:
AATTAATTCTCTCAATCAAATCCAATTTTCTCCCTATAAAAACCCTAAGGTCCTATA
GTGTTCTATATCCAACACTAGCTCCTACTCCCTAAAGCATTTATTATATCTTCCCCT
AGCTAGATACTTCATTCCACAAATAGTTTGCAGCTTTTTCTTTTCCTCTAAAACAAT
GGAAATAGCTGGGAAAATTGCATGCTTTGTGGTATTGTGCATGGTGGTAGCTGCACC
CTGCGCAGAAGCCATAACCTGTGGCCAGGTTACGTCGAATTTGGCACCTTGTCTTGC
TTATCTTAGAAACACGGGGCCTCTGGGACGTTGTTGCGGTGGCGTTAAGGCTCTGGT
GAATTCTGCAAGGACCACAGAAGATCGTCAAATTGCATGCACTTGCCTGAAATCAGC
TGCAGGTGCTATTTCTGGAATCAATTTGGGCAAAGCTGCTGGTCTCCCTAGTACTTG
TGGTGTCAATATTCCTTACAAGATCAGCCCTTCCACTGACTGCTCCAAGTACCTCAC
TTTTTTTCTCTCTCATGCTATTCTTATCCTTATATTCTATCTGCTTCATTTTCGCTT
ATCTTTTAAATTTTTTATTCGGAATCTTTATACCA
Please help him identify the gene and its protein sequence.
N. tabacum ltp1 gene
Protein sequence:
M E I A G K I A C F V V L C M V V A A P C A E A I T C G Q V T S N L A P C
L A Y L R N T G P L G R C C G G V K A L V N S A R T T E D R Q I A C T
C L K S A A G A I S G I N L G K A A G L P S T C G V N I P Y K I S P S T D
C S K Y L T F F L S H A I L I L I F Y L L H F R L S F K F F I R N L Y T
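The translation above can be checked with a short script. The sketch below is one possible check, not necessarily how the answer was obtained. It assumes the coding region starts at the first ATG and follows the standard genetic code. The names `CODON_TABLE`, `translate_from_first_atg`, and `fragment` are made up for illustration. `fragment` is the last two bases of the third sequence line ("AT") joined to the whole fourth line, because the start codon straddles that line break.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_from_first_atg(dna):
    """Translate from the first ATG until a stop codon or the end of the sequence."""
    dna = "".join(dna.split()).upper()  # drop whitespace and line breaks
    start = dna.find("ATG")
    if start == -1:
        return ""
    protein = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# Assumption: only a fragment around the start codon is used here, not the full clone.
fragment = "AT" "GGAAATAGCTGGGAAAATTGCATGCTTTGTGGTATTGTGCATGGTGGTAGCTGCACC"
print(" ".join(translate_from_first_atg(fragment)))
# prints: M E I A G K I A C F V V L C M V V A A
```

The printed residues match the start of the protein sequence listed above. Passing the full DNA sequence as one string would extend the check to the whole answer.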
2. How many nucleotide and protein sequences of Lycopersicon
esculentum are known?
Please find its class II small heat shock protein mRNA,
complete cds.
814 nucleotide sequences
1088 protein sequences
Its class II small heat shock protein mRNA, complete cds:
BASE COUNT 229 a 117 c 183 g 209 t
ORIGIN
1 tacggctgcg agaagacgac agaaggggac tgcaattaca aatcaaacca aaattgacaa
61 atttcacgca caaaatcaca atatccaaaa atttctcaat actgaaaatg gatttgaggt
121 tgttgggtat cgataacaca ccactcttcc acactctcca ccatatgatg gaagctgccg
181 gtgaagattc cgacaagtct gtcaatgcac catcaaggaa ctatgttcgt gatgctaagg
241 ccatggctgc tacaccagcg gatgtgaagg agtatcctaa ttcgtatgtt tttgttgtgg
301 atatgccagg gttgaaatct ggagatatca aagtgcaggt ggaagaagac aatgtgctgt
361 tgattagtgg tgaaaggaag agggaagaag agaaagaagg tgcaaagttt attaggatgg
421 agagaagggt tgggaaattc atgaggaagt ttagtctgcc agagaatgcg aatactgatg
481 caatttctgc agtttgtcaa gatggagttc tgactgttac tgttcagaaa ttgcctcctc
541 ctgagccaaa gaaacccaaa acaattgagg tgaaagttgc ttgaagttat ggactctgtt
601 ttgatggttt gtggtatgat gtagtagaaa taaagttgta ggagtagtga acttttcctt
661 tcatctttct gctatgtttt cacgtctgtt tgaatgttac aatagccatg ggtattgttt
721 gttttgatgc caaaaaaa
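The base count line can be cross-checked against the sequence itself. As a quick sanity check, 229 + 117 + 183 + 209 = 738, which agrees with the last sequence line starting at position 721 and holding 18 bases. The sketch below counts bases in GenBank-style sequence lines. The function name `count_bases` is hypothetical, and only the final line of the record is used as sample input.

```python
from collections import Counter


def count_bases(origin_lines):
    """Count bases in GenBank-style sequence lines, ignoring positions and spaces."""
    counts = Counter()
    for line in origin_lines:
        # Keep only letters: this drops the leading position number and the blanks.
        counts.update(ch for ch in line.lower() if ch.isalpha())
    return counts


# Assumption: sample input is just the last sequence line of the record above.
last_line = ["721 gttttgatgc caaaaaaa"]
counts = count_bases(last_line)
print(counts["a"], counts["c"], counts["g"], counts["t"])
# prints: 8 2 3 5
```

To check the full record, pass all numbered sequence lines (and only those, since header text would also be counted as letters) and compare the result with the reported base count.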