Selected Sequence(s)

  • Rattus norvegicus mismatch repair protein (MLH1) mRNA, complete
  • Drosophila melanogaster mutL homolog (Mlh1) gene, complete cds_
  • Mus musculus MutL homolog 1 protein (MLH1) mRNA, complete cds.
  • Saccharomyces cerevisiae DNA mismatch repair (MLH1) gene, complete
  • DNA MISMATCH REPAIR PROTEIN MLH1 (MUTL PROTEIN HOMOLOG 1) [Homo sapiens (Human)]
  • Caenorhabditis elegans cosmid T28A8, complete sequence_

    Fasta label (*)    Workbench label
    GENPEPT:1724118    Rattus norvegicus mismatch repair protein (MLH1) mRNA, complete
    GENPEPT:3192877    Drosophila melanogaster mutL homolog (Mlh1) gene, complete cds_
    GENPEPT:7595954    Mus musculus MutL homolog 1 protein (MLH1) mRNA, complete cds.
    GENPEPT:460627     Saccharomyces cerevisiae DNA mismatch repair (MLH1) gene, complete
    MLH1_HUMAN         DNA MISMATCH REPAIR PROTEIN MLH1 (MUTL PROTEIN HOMOLOG 1) [Homo sapiens (Human)]
    GENPEPT:3880333    Caenorhabditis elegans cosmid T28A8, complete sequence_

    (*) ClustalW truncates Fasta labels at the first space (e.g. ">abc def" becomes ">abc").
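
    As a rough illustration of the footnote above, the following Python sketch
    mimics that truncation for a Fasta header line. The helper name
    clustal_label is hypothetical. The colon-to-underscore substitution is only
    an assumption inferred from this page, where GENPEPT:7595954 appears in the
    alignment as GENPEPT_7595954.

```python
def clustal_label(header: str) -> str:
    """Return the short label kept from a Fasta header line (sketch only)."""
    # Drop the leading '>' marker and surrounding whitespace.
    text = header.lstrip(">").strip()
    # Keep only the text before the first whitespace character.
    label = text.split(None, 1)[0] if text else ""
    # Assumption: colons are displayed as underscores, as seen in this output.
    return label.replace(":", "_")


if __name__ == "__main__":
    print(clustal_label(">abc def"))
    print(clustal_label(">GENPEPT:7595954 Mus musculus MutL homolog 1 protein"))
```

    Giving each sequence a unique first word in its Fasta header avoids
    ambiguous labels in the alignment.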


    Sequence alignment

    Consensus key (see documentation for details)
    * - single, fully conserved residue
    : - conservation of strong groups
    . - conservation of weak groups
      - no consensus
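
    The key above can be sketched in Python as a per-column lookup. This is a
    minimal sketch, not ClustalW's own code: the function name consensus_symbol
    is hypothetical, and the strong and weak residue groups are assumed to be
    those listed in the ClustalW 1.8x documentation, so they should be checked
    against the version in use. Columns containing a gap are assumed to get no
    consensus symbol, which matches the alignment shown on this page.

```python
# Assumption: residue groups as listed in the ClustalW 1.8x documentation.
STRONG_GROUPS = ["STA", "NEQK", "NHQK", "NDEQ", "QHRK",
                 "MILV", "MILF", "HY", "FYW"]
WEAK_GROUPS = ["CSA", "ATV", "SAG", "STNK", "STPA", "SGND",
               "SNDEQK", "NDEQHK", "NEQHRK", "FVLIM", "HFY"]


def consensus_symbol(column: str) -> str:
    """Return '*', ':', '.' or ' ' for one alignment column."""
    residues = set(column.upper())
    # Assumption: any gap in the column means no consensus symbol.
    if "-" in residues:
        return " "
    # A single distinct residue means the column is fully conserved.
    if len(residues) == 1:
        return "*"
    # All residues fall inside one strong group.
    if any(residues <= set(group) for group in STRONG_GROUPS):
        return ":"
    # All residues fall inside one weak group.
    if any(residues <= set(group) for group in WEAK_GROUPS):
        return "."
    return " "


if __name__ == "__main__":
    # Last five columns of the first alignment block on this page,
    # read top to bottom (CLDAK, CLDAK, CLDAK, SLDAQ, SIDAN, SLDAG).
    columns = ["CCCSSS", "LLLLIL", "DDDDDD", "AAAAAA", "KKKQNG"]
    print(repr("".join(consensus_symbol(c) for c in columns)))  # prints '.:** '
```

    The printed string agrees with the end of the consensus line under the
    first alignment block.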
    
    
    CLUSTAL W (1.81) multiple sequence alignment
    
    
    GENPEPT_7595954      -----------------MAFVAGVIRRLDETVVNRIAAGEVIQRPANAIKEMIENCLDAK
    GENPEPT_1724118      -----------------MSFVAGVIRRLDETVVNRIAAGEVIQRPANAIKEMTENCLDAK
    MLH1_HUMAN           -----------------MSFVAGVIRRLDETVVNRIAAGEVIQRPANAIKEMIENCLDAK
    GENPEPT_3192877      ---------------MAEYLQPGVIRKLDEVVVNRIAAGEIIQRPANALKELLENSLDAQ
    GENPEPT_460627       --------------------MSLRIKALDASVVNKIAAGEIIISPVNALKEMMENSIDAN
    GENPEPT_3880333      MWHCGYRTRNCDEFSKIEFSLMGLIQRLPQDVVNRMAAGEVLARPCNAIKELVENSLDAG
                                                 *: *   ***::****::  * **:**: **.:** 
    
    GENPEPT_7595954      STNIQVVVKEGGLKLIQIQDNGTGIRKEDLDIVCERFTTSKLQTFEDLASISTYGFRGEA
    GENPEPT_1724118      STNIQVIVREGGLKLIQIQDNGTGIRKEDLDIVCERFTTSKLQTFEDLAMISTYGFRGEA
    MLH1_HUMAN           STSIQVIVKEGGLKLIQIQDNGTGIRKEDLDIVCERFTTSKLQSFEDLASISTYGFRGEA
    GENPEPT_3192877      STHIQVQVKAGGLKLLQIQDNGTGIRREDLAIVCERFTTSKLTRFEDLSQIATFGFRGEA
    GENPEPT_460627       ATMIDILVKEGGIKVLQITDNGSGINKADLPILCERFTTSKLQKFEDLSQIQTYGFRGEA
    GENPEPT_3880333      ATEIMVNMQNGGLKLLQVSDNGKGIEREDFALVCERFATSKLQKFEDLMHMKTYGFRGEA
                         :* * : :: **:*::*: ***.**.: *: ::****:****  ****  : *:******
    
    GENPEPT_7595954      LASISHVAHVTITTKTADGKCAYRASYSDGKLQAPPKPCAGNQGTLITVEDLFYNIITRR
    GENPEPT_1724118      LASISHVAHVTITTKTADGKCAYRASYSDGKLQAPPKPCAGNQGTLITVEDLFYNIITRK
    MLH1_HUMAN           LASISHVAHVTITTKTADGKCAYRASYSDGKLKAPPKPCAGNQGTQITVEDLFYNIATRR
    GENPEPT_3192877      LASISHVAHLSIQTKTAKEKCGYKATYADGKLQGQPKPCAGNQGTIICIEDLFYNMPQRR
    GENPEPT_460627       LASISHVARVTVTTKVKEDRCAWRVSYAEGKMLESPKPVAGKDGTTILVEDLFFNIPSRL
    GENPEPT_3880333      LASLSHVAKVNIVSKRADAKCAYQANFLDGKMTADTKPAAGKNGTCITATDLFYNLPTRR
                         ***:****::.: :*  . :*.::..: :**:   .** **::** *   ***:*:  * 
    
    GENPEPT_7595954      KALKNPSEEYGKILEVVGRYSIHNSGISFSVKKQGETVSDVRTLPNATTVDNIRSIFGNA
    GENPEPT_1724118      KALKNPSEEYGKILEVVGRYSIHNSGISFSVKKQGETVSDVRTLPNATTVDNIRSIFGNA
    MLH1_HUMAN           KALKNPSEEYGKILEVVGRYSVHNAGISFSVKKQGETVADVRTLPNASTVDNIRSIFGNA
    GENPEPT_3192877      QALRSPAEEFQRLSEVLARYAVHNPRVGFTLRKQGDAQPALRTPVASSRSENIRIIYGAA
    GENPEPT_460627       RALRSHNDEYSKILDVVGRYAIHSKDIGFSCKKFGDSNYSLSVKPSYTVQDRIRTVFNKS
    GENPEPT_3880333      NKMTTHGEEAKMVNDTLLRFAIHRPDVSFALRQ--NQAGDFRTKGDGNFRDVVCNLLGRD
                         . : .  :*   : :.: *:::*   :.*: ::  :    . .    .  : :  : .  
    
    GENPEPT_7595954      VSRELIEVG-CEDKTLAFK-MNGYISNANYSVKKCI----------FLLFINHRLVESAA
    GENPEPT_1724118      VSRELIEVG-CEDKTLAFK-MNGYISNANYSVKKCI----------FLLFINHRLVESAA
    MLH1_HUMAN           VSRELIEIG-CEDKTLAFK-MNGYISNANYSVKKCI----------FLLFINHRLVESTS
    GENPEPT_3192877      ISKELLEFS-HRDEVYKFE-AECLITQVNYSAKKCQ----------MLLFINQRLVESTA
    GENPEPT_460627       VASNLITFHISKVEDLNLESVDGKVCNLNFISKKSIS---------LIFFINNRLVTCDL
    GENPEPT_3880333      VADTILPLS-LNSTRLKFT-FTGHISKPIASATAAIAQNRKTSRSFFSVFINGRSVRCDI
                         ::  :: .   .     :      : :     . .           : .*** * * .  
    
    GENPEPT_7595954      LRKAIETVYAAYLPKNTHPFLYLSLEISPQNVDVNVHPTKHEVHFLHEESILQRVQQHIE
    GENPEPT_1724118      LKKAIEAVYAAYLPKNTHPFLYLILEISPQNVDVNVHPTKHEVHFLHEESILERVQQHIE
    MLH1_HUMAN           LRKAIETVYAAYLPKNTHPFLYLSLEISPQNVDVNVHPTKHEVHFLHEESILERVQQHIE
    GENPEPT_3192877      LRTSVDSIYATYLPRGHHPFVYMSLTLPPQNLDVNVHPTKHEVHFLYQEEIVDSIKQQVE
    GENPEPT_460627       LRRALNSVYSNYLPKGFRPFIYLGIVIDPAAVDVNVHPTKREVRFLSQDEIIEKIANQLH
    GENPEPT_3880333      LKHPIDEVLG--ARQLHAQFCALHLQIDETRIDVNVHPTKNSVIFLEKEEIIEEIRAYFE
                         *: .:: : .    :    *  : : :    :********..* ** ::.*:: :   ..
    
    GENPEPT_7595954      SKLLGSNSSRMYFTQTLLPGLAG------PSGEAARPTTGVASSSTSGSGDKVYAYQMVR
    GENPEPT_1724118      SKLLGSNSSRMYFTQTLLPGLAG------PSGEAVKSTTGIASSSTSGSGDKVHAYQMVR
    MLH1_HUMAN           SKLLGSNSSRMYFTQTLLPGLAG------PSGEMVKSTTSLTSSSTSGSSDKVYAHQMVR
    GENPEPT_3192877      ARLLGSNATRTFYKQLRLPGAP-----------------DLDETQLADKTQRIYPKEMVR
    GENPEPT_460627       AELSAIDTSRTFKASSISTNKPESLIPFNDTIESDRNRKSLRQAQVVENSYTTANSQLRK
    GENPEPT_3880333      KVIGEIFGFEALDVEKPEEEQPD--------IENLVMIPMSQSLKSIEAIRKPDTKPEFK
                           :      .    .      .                    . .              :
    
    GENPEPT_7595954      TDSRDQKLDAFLQPVSSLVPSQPQDPAPVRGARTEGSPERATREDEEMLALPAPAEAAAE
    GENPEPT_1724118      TDSRDQKLDAFMQPVSRRLPSQPQD--PVPGNRTEGSPEKAMQKDQEISELPAPMEAAAD
    MLH1_HUMAN           TDSREQKLDAFLQPLSKPLSSQPQ--AIVTEDKTDISSGRARQQDEEMLELPAPAEVAAK
    GENPEPT_3192877      TDSTEQKLDKFLAPLVK-------------------------------------------
    GENPEPT_460627       AKRQENKLVRIDASQAKITSFLSSS--QQFNFEGSSTKRQLSEPKVTNVSHSQEAEKLTL
    GENPEPT_3880333      SSPSAWKSDKKRVDYMEVRTDAKERKIDEFVTRGGAVGPTTSNDDIFGGSGILKRARTED
                         :.    *                                                     
    
    GENPEPT_7595954      SENLERESLMETSDAAQKAAPTSSPGSSRKRHREDSDVEMVENASGKEMTAACYPRRRII
    GENPEPT_1724118      SASLERESVIGASEVVAPQRHPSSPGSSRKRHPEDSDVEMMENDSRKEMTAACYPRRRII
    MLH1_HUMAN           NQSLEGDTTKGTSEMSEKRGPTSS--NPRKRHREDSDVEMVEDDSRKEMTAACTPRRRII
    GENPEPT_3192877      ----------------SDSGVSSSSSQEASRLPEES------------FRVTAAKKSREV
    GENPEPT_460627       NESEQPRDANTINDNDLKDQPKKKQKLGDYKVPSIADDEKNALPISKDGYIRVPKERVNV
    GENPEPT_3880333      STGGEKEPEDLNTDFDDVSMVSLVSTADGRRLNESQD-----LGEDDDVDFEYGKTHREF
                                                       :  .                         .
    
    GENPEPT_7595954      NLTSVLSLQEEISERCHETLREILRNHSFVGCVNPQW--ALAQHQTKLYLLNTTKLSEEL
    GENPEPT_1724118      NLTSVLSLQEEINDRGHETLREMLRNHTFVGCVNPQW--ALAQHQTKLYLLNTTKLSEEL
    MLH1_HUMAN           NLTSVLSLQEEINEQGHEVLREMLHNHSFVGCVNPQW--ALAQHQTKLYLLNTTKLSEEL
    GENPEPT_3192877      RLSSVLDMRKRVERQCSVQLRSTLKNLVYVGCVDERR--ALFQHETRLYMCNTRSFSEEL
    GENPEPT_460627       NLTSIKKLREKVDDSIHRELTDIFANLNYVGVVDEERRLAAIQHDLKLFLIDYGSVCYEL
    GENPEPT_3880333      HFESIEVLRKEIIANSSQSLREMFKTSTFVGSINVKQ--VLIQFGTSLYHLDFSTVLREF
                         .: *:  :::.:       * . : .  :** :: .   .  *.   *:  :  ..  *:
    
    GENPEPT_7595954      FYQILIYDFANFGVLRLSEPAPLFDLAMLALDSPESGWTEDDGPKEGLA-----EYIVEF
    GENPEPT_1724118      FYQILIYDFANFGVLRLPEPAPLFDFAMLALDSPESGWTEEDGPKEGLA-----EYIVEF
    MLH1_HUMAN           FYQILIYDFANFGVLRLSEPAPLFDLAMLALDSPESGWTEEDGPKEGLA-----EYIVEF
    GENPEPT_3192877      FYQRMIYEFQNCSEITICPPLPLKELLILSLESRAAGWTPEDEDKAELA-----DGAADI
    GENPEPT_460627       FYQIGLTDFANFGKINLQSTNVSDDIVLYNLLSEFDELN-DDASK---------EKIISK
    GENPEPT_3880333      FYQISVFSFGNYGSYRLDE-EPPAIIEILELLGELSTREPNYAAFEVFANVENRFAAEKL
                         ***  : .* * .   :        : :  * .       :                 . 
    
    GENPEPT_7595954      LKKKAEMLADYFSVEIDEEGN--------LIGLPLLIDSYVPPLEGLPIFILRLATEVNW
    GENPEPT_1724118      LKKKAKMLADYFSVEIDEEGN--------LIGLPLLIDSYVPPLEGLPIFILRLATEVNW
    MLH1_HUMAN           LKKKAEMLADYFSLEIDEEGN--------LIGLPLLIDNYVPPLEGLPIFILRLATEVNW
    GENPEPT_3192877      LLKKAPIMREYFGLRISEDGM--------LESLPSLLHQHRPCVAHLPVYLLRLATEVDW
    GENPEPT_460627       IWDMSSMLNEYYSIELVNDGLDNDLKSVKLKSLPLLLKGYIPSLVKLPFFIYRLGKEVDW
    GENPEPT_3880333      LAEHADLLHDYFAIKLDQLENGR----LHITEIPSLVHYFVPQLEKLPFLIATLVLNVDY
                         : . : :: :*:.:.: :           :  :* *:. . * :  **. :  *  :*::
    
    GENPEPT_7595954      DEEKECFESLSKECAMFYSIRKQYILEESTLSGQQSDMPGSTSKPWKWT--VEHIIYKAF
    GENPEPT_1724118      DEE-ECFESLSKECAVFYSIRKQYILEESALSGQQSDMPGSPSKPWKWT--VEHIIYKAF
    MLH1_HUMAN           DEEKECFES
    
    