; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018384 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018384
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDNA mismatch repair protein MSH5
Genome locationchr5:25352436..25362253
RNA-Seq ExpressionLag0018384
SyntenyLag0018384
Gene Ontology termsGO:0009987 - cellular process (biological process)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]2.8e-7836.63Show/hide
Query:  MRSNKVVNLFPLDLEIDRTLRTIRRDKRLAEAMTHQDEAPKAIRDFLQLVLPIENSGIVYAPIQATNFELKTGLIQMARNNSFKGHPSEDPHSHLRSFSE
        MR  +  ++ P+D EI+RTLR++RR+K LA A   ++  P+ ++D+++ V+    S I+  PI A NFELK  LI M +   F G P +DP+ HL  F E
Subjt:  MRSNKVVNLFPLDLEIDRTLRTIRRDKRLAEAMTHQDEAPKAIRDFLQLVLPIENSGIVYAPIQATNFELKTGLIQMARNNSFKGHPSEDPHSHLRSFSE

Query:  ICGTVKMNGVPANAIGLRLFPFFYKTRQR-------------------------------------IGSNQS-----------------RRA--------
        IC TVK+NGV  + I LRLFPF  + + R                                     IG  +                  RR         
Subjt:  ICGTVKMNGVPANAIGLRLFPFFYKTRQR-------------------------------------IGSNQS-----------------RRA--------

Query:  --ALLFYNGLNPSTKTVLDTSAGGSFLSKKVTKAKDLLEEMAATSYQWPTEREAISKKAGIYELDELSSLKAQMASLTNALNKLTSSEVVKSISTLAEGY
            +FYNGLN  T+T++D ++GG+ +SK    A  LLEEMA+ +YQWPTER    K AGI++L+ +++L AQ+A+L++ ++ LT+  + +S   LA   
Subjt:  --ALLFYNGLNPSTKTVLDTSAGGSFLSKKVTKAKDLLEEMAATSYQWPTEREAISKKAGIYELDELSSLKAQMASLTNALNKLTSSEVVKSISTLAEGY

Query:  SKKEGQDV--EEVQYVGNKPFT---QGVPNFYHPSLRNHENFSYSNTKNVL--KLPPGFASSSVPEKKNNLEEMVALFIKEQRVLNV-------NLQTTV
              +   E+VQYV N+ +      +PN+YHP LRNHEN SY NTKNVL  + PPGF  S   E+K +LE+ +  F++E             N++T  
Subjt:  SKKEGQDV--EEVQYVGNKPFT---QGVPNFYHPSLRNHENFSYSNTKNVL--KLPPGFASSSVPEKKNNLEEMVALFIKEQRVLNV-------NLQTTV

Query:  NNHDTTLKNMEVQIRHIASAVNALQKGKFPSDTEPNPREQCKMVTLRSGRKDERQD-------------DRDREKLNEEEVVPCNHHDKGLHISPPKRRG
        +N    +KN+EVQI  +A+ +NA Q+G FPS+TE NP+EQCK +TLRSG++ ER                + + K+ E+E+V  N   +    +P     
Subjt:  NNHDTTLKNMEVQIRHIASAVNALQKGKFPSDTEPNPREQCKMVTLRSGRKDERQD-------------DRDREKLNEEEVVPCNHHDKGLHISPPKRRG

Query:  ECPTFDYRELPFPQRFKNVKLDEQFQKFLEMF-KLTVNIPLVDALK
        + P      LP+PQRF+  KLD+QF KFL++F K+ +NIP  DAL+
Subjt:  ECPTFDYRELPFPQRFKNVKLDEQFQKFLEMF-KLTVNIPLVDALK

WP_217833153.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002]2.9e-7541.67Show/hide
Query:  LIKYLTFHFEGYRRLRFLLIDWEYCMRSNKVVNLFPLDLEIDRTLRTIRRDKRLAEAMTHQDEAPKAIRDFLQLVLPIENSGIVYAPIQATNFELKTGLI
        +I YL F+F       + L  W    R N   NL PLD EIDRT R   R   L +     +E PKAIRD+ Q  LP    GI+  PI   NFELK GLI
Subjt:  LIKYLTFHFEGYRRLRFLLIDWEYCMRSNKVVNLFPLDLEIDRTLRTIRRDKRLAEAMTHQDEAPKAIRDFLQLVLPIENSGIVYAPIQATNFELKTGLI

Query:  QMARNNSFKGHPSEDPHSHLRSFSEICGTVKMNGVPANAIGLRLFPFFYKTRQR---------------------------IGSNQSRRAAL--------
        QMAR  +F+G  +EDPH HLRSF EICGTVKMNGV  +AI LRLFPF  + R +                              +Q  R  +        
Subjt:  QMARNNSFKGHPSEDPHSHLRSFSEICGTVKMNGVPANAIGLRLFPFFYKTRQR---------------------------IGSNQSRRAAL--------

Query:  -----------------------------LFYNGLNPSTKTVLDTSAGGSFLSKKVTKAKDLLEEMAATSYQWPTEREA--ISKKAGIYELDELSSLKAQ
                                     LFYNGL  STK++LD +AGGS  SK   +A  +LE++A TSY WP ER +  I K AG+YE+DE++SLKAQ
Subjt:  -----------------------------LFYNGLNPSTKTVLDTSAGGSFLSKKVTKAKDLLEEMAATSYQWPTEREA--ISKKAGIYELDELSSLKAQ

Query:  MASLTNALNKLTSSEVVK----SISTLAEGYSKKEGQ-DVEEVQYVGNKPFT----QGVPNFYHPSLRNHENFSYSNTKNVLKLPPGFASSSVPEKKNNL
        MASLTNAL+KLT+    +    SI++LA   S+     D E   YV    +     Q +P  YHP+LRNHENFSY+N KNVL+ P GF + +   K ++L
Subjt:  MASLTNALNKLTSSEVVK----SISTLAEGYSKKEGQ-DVEEVQYVGNKPFT----QGVPNFYHPSLRNHENFSYSNTKNVLKLPPGFASSSVPEKKNNL

Query:  EEMVALFIKEQRVLNVNLQ-------TTVNNHDTTLKNMEVQIRHIASAVNALQKGKFPSDTEPNPREQCKMVTLRSGRK
        E+++  F+KE R     L+       +TV +    L+N+EVQ+  + +++  +QKGKFPS  E NPRE+CK VTLRSG+K
Subjt:  EEMVALFIKEQRVLNVNLQ-------TTVNNHDTTLKNMEVQIRHIASAVNALQKGKFPSDTEPNPREQCKMVTLRSGRK

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]8.4e-7537.13Show/hide
Query:  QDEAPKAIRDFLQLVLPIENSGIVYAPIQATNFELKTGLIQMARNNSFKGHPSEDPHSHLRSFSEICGTVKMNGVPANAIGLRLFPFFYKTRQR------
        Q+  P+ ++D+++ ++    SGI    I A NFELK  LI M +   F G P +DP+ HL  F EIC T+KMNGV  + I LRLFPF  + + R      
Subjt:  QDEAPKAIRDFLQLVLPIENSGIVYAPIQATNFELKTGLIQMARNNSFKGHPSEDPHSHLRSFSEICGTVKMNGVPANAIGLRLFPFFYKTRQR------

Query:  ----IGSNQ-------------SRRAAL-----------------------------------------LFYNGLNPSTKTVLDTSAGGSFLSKKVTKAK
            I S Q             ++ A L                                         +FYNGLN  T+T++D ++GG+ +SK    A 
Subjt:  ----IGSNQ-------------SRRAAL-----------------------------------------LFYNGLNPSTKTVLDTSAGGSFLSKKVTKAK

Query:  DLLEEMAATSYQWPTEREAISKKAGIYELDELSSLKAQMASLTNALNKLTSSEVVKSISTLAEGYSKKEGQDV--EEVQYVGNKPFT---QGVPNFYHPS
         LLEEMA+ +YQWPTER    K AGI+EL+  ++L AQ+ASL++ ++ LT+  + +    +A         +   E+VQY+ N+ +      +PN+YHP 
Subjt:  DLLEEMAATSYQWPTEREAISKKAGIYELDELSSLKAQMASLTNALNKLTSSEVVKSISTLAEGYSKKEGQDV--EEVQYVGNKPFT---QGVPNFYHPS

Query:  LRNHENFSYSNTKNVLKLPPGFASSSVPEKKNNLEEMVALFIKEQRVLNV-------NLQTTVNNHDTTLKNMEVQIRHIASAVNALQKGKFPSDTEPNP
        LRNHENFSY NTKNVL+ PPGF  S   EKK +LE+ +  F++E +           N++T  +N   T+KN+EVQI  +A+ +NA Q+G FPS+TE NP
Subjt:  LRNHENFSYSNTKNVLKLPPGFASSSVPEKKNNLEEMVALFIKEQRVLNV-------NLQTTVNNHDTTLKNMEVQIRHIASAVNALQKGKFPSDTEPNP

Query:  REQCKMVTLRSGRKDERQDDRDRE-------------KLNEEEVVPCNHHDKGLHISPPKRRGECPTFDYRELPFPQRFKNVKLDEQFQKFLEMF-KLTV
        +EQCK +TLRSGR+ ER   ++ E             K+ EEE+V     +  +   P     + P      LP+PQRF+  KLD+QF KFL++F K+ +
Subjt:  REQCKMVTLRSGRKDERQDDRDRE-------------KLNEEEVVPCNHHDKGLHISPPKRRGECPTFDYRELPFPQRFKNVKLDEQFQKFLEMF-KLTV

Query:  NIPLVDALK
        NIP  DAL+
Subjt:  NIPLVDALK

XP_038903565.1 DNA mismatch repair protein MSH5 isoform X2 [Benincasa hispida]1.6e-7353.19Show/hide
Query:  RVGVSYDDSSIRQLHVLEIWEDGSMEYPLIELVKYQANPLMIYSSTKCEESFLAALQRSDGMSEAPTVKLVKSSILSYEQAWHR----------------
        RVGVSY DSSIRQLHVLE+WEDGSMEYPLI+LVKYQA PLMIY+STK EESFLAALQRSDGMSEAPTVKLVKSSI SYEQAWHR                
Subjt:  RVGVSYDDSSIRQLHVLEIWEDGSMEYPLIELVKYQANPLMIYSSTKCEESFLAALQRSDGMSEAPTVKLVKSSILSYEQAWHR----------------

Query:  -------------------------------------------------------------------------------------------FSVFGMMNK
                                                                                                   FSVFGMMNK
Subjt:  -------------------------------------------------------------------------------------------FSVFGMMNK

Query:  ------------------------------ISVFLSSDELMHSLRKTLKIVKDIPHILKKFNSPSSTYSSGDWTAFLKSACSLLHVNKIFEVGISENLGD
                                      IS F+SSDELMHSLR+TLKIVKDIPHILKKFNSPSSTYSSGDWTAFLKS CSLLHVNKIFEVG+SENL +
Subjt:  ------------------------------ISVFLSSDELMHSLRKTLKIVKDIPHILKKFNSPSSTYSSGDWTAFLKSACSLLHVNKIFEVGISENLGD

Query:  NMQYLNLDIVEKANSCISTELAYVYELVI
        NM+YLNLDIVEKAN+CI+TELAYVYELVI
Subjt:  NMQYLNLDIVEKANSCISTELAYVYELVI

XP_038903566.1 DNA mismatch repair protein MSH5 isoform X3 [Benincasa hispida]1.6e-7353.19Show/hide
Query:  RVGVSYDDSSIRQLHVLEIWEDGSMEYPLIELVKYQANPLMIYSSTKCEESFLAALQRSDGMSEAPTVKLVKSSILSYEQAWHR----------------
        RVGVSY DSSIRQLHVLE+WEDGSMEYPLI+LVKYQA PLMIY+STK EESFLAALQRSDGMSEAPTVKLVKSSI SYEQAWHR                
Subjt:  RVGVSYDDSSIRQLHVLEIWEDGSMEYPLIELVKYQANPLMIYSSTKCEESFLAALQRSDGMSEAPTVKLVKSSILSYEQAWHR----------------

Query:  -------------------------------------------------------------------------------------------FSVFGMMNK
                                                                                                   FSVFGMMNK
Subjt:  -------------------------------------------------------------------------------------------FSVFGMMNK

Query:  ------------------------------ISVFLSSDELMHSLRKTLKIVKDIPHILKKFNSPSSTYSSGDWTAFLKSACSLLHVNKIFEVGISENLGD
                                      IS F+SSDELMHSLR+TLKIVKDIPHILKKFNSPSSTYSSGDWTAFLKS CSLLHVNKIFEVG+SENL +
Subjt:  ------------------------------ISVFLSSDELMHSLRKTLKIVKDIPHILKKFNSPSSTYSSGDWTAFLKSACSLLHVNKIFEVGISENLGD

Query:  NMQYLNLDIVEKANSCISTELAYVYELVI
        NM+YLNLDIVEKAN+CI+TELAYVYELVI
Subjt:  NMQYLNLDIVEKANSCISTELAYVYELVI

TrEMBL top hitse value%identityAlignment
A0A0A0L665 DNA_MISMATCH_REPAIR_2 domain-containing protein1.5e-7751.94Show/hide
Query:  MASSLLRVGVSYDDSSIRQLHVLEIWEDGSMEYPLIELVKYQANPLMIYSSTKCEESFLAALQRSDGMSEAPTVKLVKSSILSYEQAWHR----------
        +AS LLRVGVSY DSSIRQLHVLE+WEDGS+EYPLI+LVKYQA PLMIY+STK EESFLAALQRSDGMSEAPTVKLVKSSI SYEQAWHR          
Subjt:  MASSLLRVGVSYDDSSIRQLHVLEIWEDGSMEYPLIELVKYQANPLMIYSSTKCEESFLAALQRSDGMSEAPTVKLVKSSILSYEQAWHR----------

Query:  -------------------------------------------------------------------------------------------------FSV
                                                                                                         FSV
Subjt:  -------------------------------------------------------------------------------------------------FSV

Query:  FGMMNK------------------------------ISVFLSSDELMHSLRKTLKIVKDIPHILKKFNSPSSTYSSGDWTAFLKSACSLLHVNKIFEVGI
        FGMMNK                              IS F+SSDELMHSLR+TLKIVKDIPHILKKFNSPSSTYSSGDWTAFLKS CSLLHVNKIFEVG+
Subjt:  FGMMNK------------------------------ISVFLSSDELMHSLRKTLKIVKDIPHILKKFNSPSSTYSSGDWTAFLKSACSLLHVNKIFEVGI

Query:  SENLGDNMQYLNLDIVEKANSCISTELAYVYELVIISYFP--GSLSLSYALQ-MYITIIE
        SENL +NM+Y NLDIVEKAN+CI+TELAYVYELVI+SYFP  G L +S + +  Y TI++
Subjt:  SENLGDNMQYLNLDIVEKANSCISTELAYVYELVIISYFP--GSLSLSYALQ-MYITIIE

A0A6J1DCL3 DNA mismatch repair protein MSH52.9e-7352.89Show/hide
Query:  RVGVSYDDSSIRQLHVLEIWEDGSMEYPLIELVKYQANPLMIYSSTKCEESFLAALQRSDGMSEAPTVKLVKSSILSYEQAWHR----------------
        RVGVSY DSSIRQLHVLE+WEDGSM++PLIELVKYQA PLMIY+STK EESFLAALQRSDGMSEAPT+KLVKSSI SYEQAWHR                
Subjt:  RVGVSYDDSSIRQLHVLEIWEDGSMEYPLIELVKYQANPLMIYSSTKCEESFLAALQRSDGMSEAPTVKLVKSSILSYEQAWHR----------------

Query:  -------------------------------------------------------------------------------------------FSVFGMMNK
                                                                                                   FSVFGMMNK
Subjt:  -------------------------------------------------------------------------------------------FSVFGMMNK

Query:  ------------------------------ISVFLSSDELMHSLRKTLKIVKDIPHILKKFNSPSSTYSSGDWTAFLKSACSLLHVNKIFEVGISENLGD
                                      IS FLSS+ELMHSLR+TLKIVKDIPHILKKFNSPSSTYSSGDWTAFLKS CSLLHVNKIFEVG+SENL D
Subjt:  ------------------------------ISVFLSSDELMHSLRKTLKIVKDIPHILKKFNSPSSTYSSGDWTAFLKSACSLLHVNKIFEVGISENLGD

Query:  NMQYLNLDIVEKANSCISTELAYVYELVI
        NM+YLNLD+VEKANSCI+TELAYVYELVI
Subjt:  NMQYLNLDIVEKANSCISTELAYVYELVI

A0A6J1F178 DNA mismatch repair protein MSH5 isoform X42.9e-7352.1Show/hide
Query:  ASSLLRVGVSYDDSSIRQLHVLEIWEDGSMEYPLIELVKYQANPLMIYSSTKCEESFLAALQRSDGMSEAPTVKLVKSSILSYEQAWHR-----------
        AS LLRVGVSY DSSIRQLHVL++WEDGSMEYPLI+LVKYQA PLMIY+STK EESFLAALQRSDG+SEAPTVKLVKSSI SYEQAWHR           
Subjt:  ASSLLRVGVSYDDSSIRQLHVLEIWEDGSMEYPLIELVKYQANPLMIYSSTKCEESFLAALQRSDGMSEAPTVKLVKSSILSYEQAWHR-----------

Query:  ------------------------------------------------------------------------------------------------FSVF
                                                                                                        FSVF
Subjt:  ------------------------------------------------------------------------------------------------FSVF

Query:  GMMNK------------------------------ISVFLSSDELMHSLRKTLKIVKDIPHILKKFNSPSSTYSSGDWTAFLKSACSLLHVNKIFEVGIS
        GMMNK                              I+ F+SS+ELMHSLR+TLK VKDIPHILKKFNSPSSTYSSGDWT+FLKS CSLLHVNKIFEVG+S
Subjt:  GMMNK------------------------------ISVFLSSDELMHSLRKTLKIVKDIPHILKKFNSPSSTYSSGDWTAFLKSACSLLHVNKIFEVGIS

Query:  ENLGDNMQYLNLDIVEKANSCISTELAYVYELVI
        ENL DNM+YLNLDIVEKA+SCI+TELAYVYELVI
Subjt:  ENLGDNMQYLNLDIVEKANSCISTELAYVYELVI

A0A6J1L1D5 DNA mismatch repair protein MSH5 isoform X13.8e-7351.8Show/hide
Query:  ASSLLRVGVSYDDSSIRQLHVLEIWEDGSMEYPLIELVKYQANPLMIYSSTKCEESFLAALQRSDGMSEAPTVKLVKSSILSYEQAWHR-----------
        AS LLRVGVSY DSSIRQLHVL++WEDGSMEYPL++LVKYQA PLMIY+STK EESFLAALQRSDG+SEAPTVKLVKSSI SYEQAWHR           
Subjt:  ASSLLRVGVSYDDSSIRQLHVLEIWEDGSMEYPLIELVKYQANPLMIYSSTKCEESFLAALQRSDGMSEAPTVKLVKSSILSYEQAWHR-----------

Query:  ------------------------------------------------------------------------------------------------FSVF
                                                                                                        FSVF
Subjt:  ------------------------------------------------------------------------------------------------FSVF

Query:  GMMNK------------------------------ISVFLSSDELMHSLRKTLKIVKDIPHILKKFNSPSSTYSSGDWTAFLKSACSLLHVNKIFEVGIS
        GMMNK                              I+ F+SS+ELMHSLR+TLK VKDIPHILKKFNSPSSTYSSGDWT+FLKS CSLLHVNKIFEVG+S
Subjt:  GMMNK------------------------------ISVFLSSDELMHSLRKTLKIVKDIPHILKKFNSPSSTYSSGDWTAFLKSACSLLHVNKIFEVGIS

Query:  ENLGDNMQYLNLDIVEKANSCISTELAYVYELVI
        ENL DNM++LNLDIVEKANSCI+TELAYVYELVI
Subjt:  ENLGDNMQYLNLDIVEKANSCISTELAYVYELVI

A0A6J1L5Q4 DNA mismatch repair protein MSH5 isoform X23.8e-7351.8Show/hide
Query:  ASSLLRVGVSYDDSSIRQLHVLEIWEDGSMEYPLIELVKYQANPLMIYSSTKCEESFLAALQRSDGMSEAPTVKLVKSSILSYEQAWHR-----------
        AS LLRVGVSY DSSIRQLHVL++WEDGSMEYPL++LVKYQA PLMIY+STK EESFLAALQRSDG+SEAPTVKLVKSSI SYEQAWHR           
Subjt:  ASSLLRVGVSYDDSSIRQLHVLEIWEDGSMEYPLIELVKYQANPLMIYSSTKCEESFLAALQRSDGMSEAPTVKLVKSSILSYEQAWHR-----------

Query:  ------------------------------------------------------------------------------------------------FSVF
                                                                                                        FSVF
Subjt:  ------------------------------------------------------------------------------------------------FSVF

Query:  GMMNK------------------------------ISVFLSSDELMHSLRKTLKIVKDIPHILKKFNSPSSTYSSGDWTAFLKSACSLLHVNKIFEVGIS
        GMMNK                              I+ F+SS+ELMHSLR+TLK VKDIPHILKKFNSPSSTYSSGDWT+FLKS CSLLHVNKIFEVG+S
Subjt:  GMMNK------------------------------ISVFLSSDELMHSLRKTLKIVKDIPHILKKFNSPSSTYSSGDWTAFLKSACSLLHVNKIFEVGIS

Query:  ENLGDNMQYLNLDIVEKANSCISTELAYVYELVI
        ENL DNM++LNLDIVEKANSCI+TELAYVYELVI
Subjt:  ENLGDNMQYLNLDIVEKANSCISTELAYVYELVI

SwissProt top hitse value%identityAlignment
F4JEP5 DNA mismatch repair protein MSH57.9e-5241.34Show/hide
Query:  RVGVSYDDSSIRQLHVLEIWEDGSMEYPLIELVKYQANPLMIYSSTKCEESFLAALQRSDGMSEAPTVKLVKSSILSYEQAWHR----------------
        RVGVSY D S+RQLHVLE WE+   ++ LI +VKYQA P +IY+STK EESF+AALQ++DG  E   VKLVKSS  SYEQAWHR                
Subjt:  RVGVSYDDSSIRQLHVLEIWEDGSMEYPLIELVKYQANPLMIYSSTKCEESFLAALQRSDGMSEAPTVKLVKSSILSYEQAWHR----------------

Query:  -------------------------------------------------------------------------------------------FSVFGMMNK
                                                                                                   FSVFGMMNK
Subjt:  -------------------------------------------------------------------------------------------FSVFGMMNK

Query:  ------------------------------ISVFLSSDELMHSLRKTLKIVKDIPHILKKFNSPSSTYSSGDWTAFLKSACSLLHVNKIFEVGISENLGD
                                      IS F+SS ELM SLR+TLK VKDI H+LKKFNSP+S  +S DWTAFLKS  +LLHVNKIFEVG+SE+L +
Subjt:  ------------------------------ISVFLSSDELMHSLRKTLKIVKDIPHILKKFNSPSSTYSSGDWTAFLKSACSLLHVNKIFEVGISENLGD

Query:  NMQYLNLDIVEKANSCISTELAYVYELVI
        +M+  NLDI+EKA  CISTEL YVYELVI
Subjt:  NMQYLNLDIVEKANSCISTELAYVYELVI

Q6L4V0 DNA mismatch repair protein MSH52.9e-4637.8Show/hide
Query:  RVGVSYDDSSIRQLHVLEIWEDGSMEYPLIELVKYQANPLMIYSSTKCEESFLAALQRSDGMSEAPTVKLVKSSILSYEQAWHR----------------
        RVG++Y DSS+ QL VLEIWED + ++PLI+LVKYQ+ P  IY+STK +E+ L ALQR+D   EAP VKL+KSS  SYEQAWHR                
Subjt:  RVGVSYDDSSIRQLHVLEIWEDGSMEYPLIELVKYQANPLMIYSSTKCEESFLAALQRSDGMSEAPTVKLVKSSILSYEQAWHR----------------

Query:  ------------------------------------------------------------------------------------------FSVFGMMNK-
                                                                                                  FSVFGM+NK 
Subjt:  ------------------------------------------------------------------------------------------FSVFGMMNK-

Query:  -----------------------------ISVFLSSDELMHSLRKTLKIVKDIPHILKKFNSPSSTYSSGDWTAFLKSACSLLHVNKIFEVGISENLGDN
                                     IS FL  +++M +LR TLK V+DIPH+LKKFNSPSS  +S DW AFLK  CSLLH+NKIFEVGISE+L   
Subjt:  -----------------------------ISVFLSSDELMHSLRKTLKIVKDIPHILKKFNSPSSTYSSGDWTAFLKSACSLLHVNKIFEVGISENLGDN

Query:  MQYLNLDIVEKANSCISTELAYVYELVI
        +Q++N+D+V KANS I+ EL YV +LV+
Subjt:  MQYLNLDIVEKANSCISTELAYVYELVI

Arabidopsis top hitse value%identityAlignment
AT3G20475.1 MUTS-homologue 55.6e-5341.34Show/hide
Query:  RVGVSYDDSSIRQLHVLEIWEDGSMEYPLIELVKYQANPLMIYSSTKCEESFLAALQRSDGMSEAPTVKLVKSSILSYEQAWHR----------------
        RVGVSY D S+RQLHVLE WE+   ++ LI +VKYQA P +IY+STK EESF+AALQ++DG  E   VKLVKSS  SYEQAWHR                
Subjt:  RVGVSYDDSSIRQLHVLEIWEDGSMEYPLIELVKYQANPLMIYSSTKCEESFLAALQRSDGMSEAPTVKLVKSSILSYEQAWHR----------------

Query:  -------------------------------------------------------------------------------------------FSVFGMMNK
                                                                                                   FSVFGMMNK
Subjt:  -------------------------------------------------------------------------------------------FSVFGMMNK

Query:  ------------------------------ISVFLSSDELMHSLRKTLKIVKDIPHILKKFNSPSSTYSSGDWTAFLKSACSLLHVNKIFEVGISENLGD
                                      IS F+SS ELM SLR+TLK VKDI H+LKKFNSP+S  +S DWTAFLKS  +LLHVNKIFEVG+SE+L +
Subjt:  ------------------------------ISVFLSSDELMHSLRKTLKIVKDIPHILKKFNSPSSTYSSGDWTAFLKSACSLLHVNKIFEVGISENLGD

Query:  NMQYLNLDIVEKANSCISTELAYVYELVI
        +M+  NLDI+EKA  CISTEL YVYELVI
Subjt:  NMQYLNLDIVEKANSCISTELAYVYELVI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCACGGAGTCTGTCCGACGGCGTACAAACTTCAATCGACGTCAATTTTGGGAGAGGAGCCTTCGTGGAGGAGTGAAGTTCATTTTAATGGCAGGGTCGAAGGTGGA
GAGAGTGAAGAGAAGAAGAGAAACAAACATGGCAAGTTCTCTTCTCAGAGTGGGAGTTTCATACGATGATTCTAGCATCCGTCAGCTTCATGTGCTGGAAATTTGGGAAG
ATGGCAGCATGGAATATCCTCTGATTGAACTAGTGAAGTATCAAGCTAACCCCCTAATGATATATTCTAGCACTAAATGTGAGGAGTCTTTCTTGGCTGCTTTGCAACGG
AGTGATGGCATGTCTGAGGCTCCTACAGTGAAGCTTGTGAAGAGTTCAATTTTAAGCTATGAACAGGCCTGGCACAGGTTCTCTGTATTTGGCATGATGAATAAGATATC
AGTCTTTCTTTCTTCTGATGAATTGATGCATTCTTTACGCAAAACTCTAAAGATTGTCAAGGACATTCCTCATATACTCAAGAAATTCAATTCTCCGAGCTCAACCTATT
CTTCTGGTGATTGGACAGCATTCTTGAAGAGTGCTTGCTCACTTTTGCACGTGAATAAGATATTTGAAGTTGGCATATCAGAGAATCTTGGAGATAACATGCAGTACTTG
AATTTGGACATTGTTGAGAAGGCCAATTCATGCATTTCAACAGAGTTGGCGTATGTTTATGAATTAGTCATTATTTCTTACTTCCCTGGGTCATTATCCCTTTCCTATGC
TCTTCAAATGTACATTACTATTATTGAATTTCTTATGAGCTCATTGGATTGCCCTTTAAGCACTTCTGTAATGGACCTCATAAAATATTTGACTTTTCATTTTGAAGGTT
ATCGGCGTCTTAGATTTCTTTTGATTGATTGGGAGTATTGTATGCGAAGCAACAAGGTGGTTAATTTGTTTCCGCTAGATCTCGAAATTGACAGGACTCTTAGAACCATT
CGTAGAGATAAAAGATTAGCAGAAGCAATGACCCATCAAGATGAAGCTCCCAAGGCAATCAGAGACTTCTTACAGCTAGTTCTTCCCATCGAGAATTCTGGAATTGTTTA
CGCCCCAATCCAAGCTACCAATTTTGAGTTAAAGACAGGATTGATTCAGATGGCACGCAATAACTCTTTTAAGGGACATCCTTCTGAGGACCCACACTCACATCTGCGAT
CATTCTCTGAAATTTGTGGGACGGTAAAGATGAACGGAGTTCCGGCAAACGCTATAGGACTGAGGCTCTTCCCATTTTTCTACAAGACAAGGCAAAGGATTGGCTCGAAT
CAGTCGAGACGAGCAGCATTATTGTTTTACAATGGATTGAATCCCTCCACCAAGACAGTCCTAGATACATCAGCAGGAGGAAGTTTTCTTTCCAAGAAAGTAACGAAAGC
CAAAGATCTATTGGAAGAAATGGCGGCAACAAGTTATCAGTGGCCGACCGAGAGGGAAGCAATTTCAAAGAAGGCTGGAATTTATGAATTGGATGAGTTGAGTTCGTTGA
AGGCACAGATGGCATCTCTGACCAATGCGCTGAACAAGCTGACTTCATCTGAGGTGGTCAAATCCATTTCCACCTTAGCTGAAGGTTATTCAAAGAAGGAAGGTCAAGAT
GTGGAGGAAGTCCAGTACGTGGGAAACAAACCATTTACTCAAGGAGTACCGAACTTCTACCACCCCAGCCTGCGCAATCACGAGAACTTCTCATATTCAAATACGAAGAA
TGTTTTGAAGCTGCCGCCAGGTTTTGCATCATCCAGTGTGCCTGAAAAGAAGAATAACTTGGAGGAGATGGTGGCTCTGTTTATTAAAGAACAAAGAGTGTTGAACGTGA
ATCTCCAGACGACAGTTAATAACCATGACACAACTCTGAAAAACATGGAAGTTCAAATAAGACATATCGCTTCAGCGGTGAATGCCCTTCAGAAGGGAAAATTTCCTAGC
GACACTGAACCTAACCCAAGAGAGCAGTGCAAGATGGTAACACTAAGAAGTGGAAGAAAAGATGAAAGGCAAGATGACAGGGATAGAGAGAAACTGAACGAGGAAGAAGT
GGTTCCATGCAACCATCATGACAAAGGCTTGCACATAAGCCCGCCCAAGCGGAGGGGCGAATGTCCTACCTTTGACTACAGGGAGTTACCTTTTCCTCAACGCTTTAAAA
ATGTTAAATTAGATGAGCAGTTTCAGAAATTCCTAGAAATGTTTAAGTTGACTGTGAATATTCCCTTAGTAGATGCCTTGAAGATTGGTTTAACAGGAATGAAGGACACC
GACGTCACTCTCCAGCTTGCGATAGATCGATTACCCACCCGATGGGAGGACAAAGAGGTACCTATTATTCTAGGCAGTCCCTTCTTAACGACTGGTAAGGCTGAGATTAG
TGTGCATACAGTTGATGAAAGATCTACAGTTGATGATTTACCTTCTTTTGAGAATGAATTGAATTTACCTGAAATTATTGATTTTGATGATAATATTGATTTTCCTGACA
TTGAGGATGAGCATGAATTGCATAAAAAAGATCACTTGATAGATAATTTTGAGTTTGATCATGATAACACTGAATCTATCGAGTCTGATCTTGATAGTACTGAATGCATG
AATCCTGATAATGAACACCTTAGCAAATCCCAAAATTCCAAAAATCCTATTTTTCTGGATCCAAAGGAGGAGGATGAAGCGCTCATAGCTCAGGGACGTCTCAGTATCCC
TCTTTATGCTTTACTGTTTTCAAAACATTGGGGACAATGTTTAGTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCACGGAGTCTGTCCGACGGCGTACAAACTTCAATCGACGTCAATTTTGGGAGAGGAGCCTTCGTGGAGGAGTGAAGTTCATTTTAATGGCAGGGTCGAAGGTGGA
GAGAGTGAAGAGAAGAAGAGAAACAAACATGGCAAGTTCTCTTCTCAGAGTGGGAGTTTCATACGATGATTCTAGCATCCGTCAGCTTCATGTGCTGGAAATTTGGGAAG
ATGGCAGCATGGAATATCCTCTGATTGAACTAGTGAAGTATCAAGCTAACCCCCTAATGATATATTCTAGCACTAAATGTGAGGAGTCTTTCTTGGCTGCTTTGCAACGG
AGTGATGGCATGTCTGAGGCTCCTACAGTGAAGCTTGTGAAGAGTTCAATTTTAAGCTATGAACAGGCCTGGCACAGGTTCTCTGTATTTGGCATGATGAATAAGATATC
AGTCTTTCTTTCTTCTGATGAATTGATGCATTCTTTACGCAAAACTCTAAAGATTGTCAAGGACATTCCTCATATACTCAAGAAATTCAATTCTCCGAGCTCAACCTATT
CTTCTGGTGATTGGACAGCATTCTTGAAGAGTGCTTGCTCACTTTTGCACGTGAATAAGATATTTGAAGTTGGCATATCAGAGAATCTTGGAGATAACATGCAGTACTTG
AATTTGGACATTGTTGAGAAGGCCAATTCATGCATTTCAACAGAGTTGGCGTATGTTTATGAATTAGTCATTATTTCTTACTTCCCTGGGTCATTATCCCTTTCCTATGC
TCTTCAAATGTACATTACTATTATTGAATTTCTTATGAGCTCATTGGATTGCCCTTTAAGCACTTCTGTAATGGACCTCATAAAATATTTGACTTTTCATTTTGAAGGTT
ATCGGCGTCTTAGATTTCTTTTGATTGATTGGGAGTATTGTATGCGAAGCAACAAGGTGGTTAATTTGTTTCCGCTAGATCTCGAAATTGACAGGACTCTTAGAACCATT
CGTAGAGATAAAAGATTAGCAGAAGCAATGACCCATCAAGATGAAGCTCCCAAGGCAATCAGAGACTTCTTACAGCTAGTTCTTCCCATCGAGAATTCTGGAATTGTTTA
CGCCCCAATCCAAGCTACCAATTTTGAGTTAAAGACAGGATTGATTCAGATGGCACGCAATAACTCTTTTAAGGGACATCCTTCTGAGGACCCACACTCACATCTGCGAT
CATTCTCTGAAATTTGTGGGACGGTAAAGATGAACGGAGTTCCGGCAAACGCTATAGGACTGAGGCTCTTCCCATTTTTCTACAAGACAAGGCAAAGGATTGGCTCGAAT
CAGTCGAGACGAGCAGCATTATTGTTTTACAATGGATTGAATCCCTCCACCAAGACAGTCCTAGATACATCAGCAGGAGGAAGTTTTCTTTCCAAGAAAGTAACGAAAGC
CAAAGATCTATTGGAAGAAATGGCGGCAACAAGTTATCAGTGGCCGACCGAGAGGGAAGCAATTTCAAAGAAGGCTGGAATTTATGAATTGGATGAGTTGAGTTCGTTGA
AGGCACAGATGGCATCTCTGACCAATGCGCTGAACAAGCTGACTTCATCTGAGGTGGTCAAATCCATTTCCACCTTAGCTGAAGGTTATTCAAAGAAGGAAGGTCAAGAT
GTGGAGGAAGTCCAGTACGTGGGAAACAAACCATTTACTCAAGGAGTACCGAACTTCTACCACCCCAGCCTGCGCAATCACGAGAACTTCTCATATTCAAATACGAAGAA
TGTTTTGAAGCTGCCGCCAGGTTTTGCATCATCCAGTGTGCCTGAAAAGAAGAATAACTTGGAGGAGATGGTGGCTCTGTTTATTAAAGAACAAAGAGTGTTGAACGTGA
ATCTCCAGACGACAGTTAATAACCATGACACAACTCTGAAAAACATGGAAGTTCAAATAAGACATATCGCTTCAGCGGTGAATGCCCTTCAGAAGGGAAAATTTCCTAGC
GACACTGAACCTAACCCAAGAGAGCAGTGCAAGATGGTAACACTAAGAAGTGGAAGAAAAGATGAAAGGCAAGATGACAGGGATAGAGAGAAACTGAACGAGGAAGAAGT
GGTTCCATGCAACCATCATGACAAAGGCTTGCACATAAGCCCGCCCAAGCGGAGGGGCGAATGTCCTACCTTTGACTACAGGGAGTTACCTTTTCCTCAACGCTTTAAAA
ATGTTAAATTAGATGAGCAGTTTCAGAAATTCCTAGAAATGTTTAAGTTGACTGTGAATATTCCCTTAGTAGATGCCTTGAAGATTGGTTTAACAGGAATGAAGGACACC
GACGTCACTCTCCAGCTTGCGATAGATCGATTACCCACCCGATGGGAGGACAAAGAGGTACCTATTATTCTAGGCAGTCCCTTCTTAACGACTGGTAAGGCTGAGATTAG
TGTGCATACAGTTGATGAAAGATCTACAGTTGATGATTTACCTTCTTTTGAGAATGAATTGAATTTACCTGAAATTATTGATTTTGATGATAATATTGATTTTCCTGACA
TTGAGGATGAGCATGAATTGCATAAAAAAGATCACTTGATAGATAATTTTGAGTTTGATCATGATAACACTGAATCTATCGAGTCTGATCTTGATAGTACTGAATGCATG
AATCCTGATAATGAACACCTTAGCAAATCCCAAAATTCCAAAAATCCTATTTTTCTGGATCCAAAGGAGGAGGATGAAGCGCTCATAGCTCAGGGACGTCTCAGTATCCC
TCTTTATGCTTTACTGTTTTCAAAACATTGGGGACAATGTTTAGTTTAA
Protein sequenceShow/hide protein sequence
MATESVRRRTNFNRRQFWERSLRGGVKFILMAGSKVERVKRRRETNMASSLLRVGVSYDDSSIRQLHVLEIWEDGSMEYPLIELVKYQANPLMIYSSTKCEESFLAALQR
SDGMSEAPTVKLVKSSILSYEQAWHRFSVFGMMNKISVFLSSDELMHSLRKTLKIVKDIPHILKKFNSPSSTYSSGDWTAFLKSACSLLHVNKIFEVGISENLGDNMQYL
NLDIVEKANSCISTELAYVYELVIISYFPGSLSLSYALQMYITIIEFLMSSLDCPLSTSVMDLIKYLTFHFEGYRRLRFLLIDWEYCMRSNKVVNLFPLDLEIDRTLRTI
RRDKRLAEAMTHQDEAPKAIRDFLQLVLPIENSGIVYAPIQATNFELKTGLIQMARNNSFKGHPSEDPHSHLRSFSEICGTVKMNGVPANAIGLRLFPFFYKTRQRIGSN
QSRRAALLFYNGLNPSTKTVLDTSAGGSFLSKKVTKAKDLLEEMAATSYQWPTEREAISKKAGIYELDELSSLKAQMASLTNALNKLTSSEVVKSISTLAEGYSKKEGQD
VEEVQYVGNKPFTQGVPNFYHPSLRNHENFSYSNTKNVLKLPPGFASSSVPEKKNNLEEMVALFIKEQRVLNVNLQTTVNNHDTTLKNMEVQIRHIASAVNALQKGKFPS
DTEPNPREQCKMVTLRSGRKDERQDDRDREKLNEEEVVPCNHHDKGLHISPPKRRGECPTFDYRELPFPQRFKNVKLDEQFQKFLEMFKLTVNIPLVDALKIGLTGMKDT
DVTLQLAIDRLPTRWEDKEVPIILGSPFLTTGKAEISVHTVDERSTVDDLPSFENELNLPEIIDFDDNIDFPDIEDEHELHKKDHLIDNFEFDHDNTESIESDLDSTECM
NPDNEHLSKSQNSKNPIFLDPKEEDEALIAQGRLSIPLYALLFSKHWGQCLV