; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr013024 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr013024
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionAcidic endochitinase
Genome locationtig00153648:23919..44115
RNA-Seq ExpressionSgr013024
SyntenySgr013024
Gene Ontology termsGO:0005975 - carbohydrate metabolic process (biological process)
GO:0004553 - hydrolase activity, hydrolyzing O-glycosyl compounds (molecular function)
InterPro domainsIPR001223 - Glycoside hydrolase family 18, catalytic domain
IPR001579 - Glycosyl hydrolases family 18 (GH18) active site
IPR017853 - Glycoside hydrolase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF2312204.1 hypothetical protein GH714_028494 [Hevea brasiliensis]3.2e-12747.44Show/hide
Query:  GIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIKIFLSIGGGAGNYRLSSAEDAREVATYIW
        GIA+YWGQNGNEG+L + C+T  Y YVNIAFL  FG+GQ P++NLAGHC+P+ GGC  +S  I++CQ QGIK+ LS+GGG G+Y L+S  DA+ VA Y+W
Subjt:  GIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIKIFLSIGGGAGNYRLSSAEDAREVATYIW

Query:  NNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYSSD---VLLAAAPQCFYPDYYLDNAIKTGLFDYVWVQFYNNPPCQYSAGN-------
        NN+LGG SSSRPLGDAVLDGIDFDIE GS LYWD LA+ L  YS     V L AAPQC +PD YL  A+ TGLFDYVWVQFYNNPPCQYS+GN       
Subjt:  NNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYSSD---VLLAAAPQCFYPDYYLDNAIKTGLFDYVWVQFYNNPPCQYSAGN-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------YRLSSADDAKQVADYLWNNYLGGTSASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKN
                                                 Y L+S  DAK VADYLWNN+LGG S+SRPLGDAVLDGIDF I  GS LYWD LA+ L  
Subjt:  -----------------------------------------YRLSSADDAKQVADYLWNNYLGGTSASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKN

Query:  YSSD---VLLAAAPQCFYPDYYLDTAINTGLFDYVWVQFYNNPPCQYSPGNSQNLLNSWLTWSNT-NAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPV
        YS     V L AAPQC +PD YL TA+NTGLFDYVWVQFYNNPPCQYS GN  N++NSW  W+ + NA  +FLGLPA+P+AA + GY+PP VLI+Q LP 
Subjt:  YSSD---VLLAAAPQCFYPDYYLDTAINTGLFDYVWVQFYNNPPCQYSPGNSQNLLNSWLTWSNT-NAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPV

Query:  IKTDPKYGGVMLWNRYFDVITKYSDAI
        IK  P+YGGVMLW++++D    YS +I
Subjt:  IKTDPKYGGVMLWNRYFDVITKYSDAI

KAF9667422.1 hypothetical protein SADUNF_Sadunf15G0021400 [Salix dunnii]1.0e-12552.38Show/hide
Query:  SVTFLAFFFLNSLFESAHAAGIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIKIFLSIGGG
        ++T L    + SL + ++ AGIA+YWGQNG E SLA  CA+ NY++VN+AFL SFG+ + P LNLAG C+ +FG C   S EI++CQS+GIK+ LSIGGG
Subjt:  SVTFLAFFFLNSLFESAHAAGIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIKIFLSIGGG

Query:  AGNYRLSSAEDAREVATYIWNNYLGG-ASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYSSD--VLLAAAPQCFYPDYYLDNAIKTGLFDYVWVQ
         GNY LSSA+DA +VA YIWNN+LGG  SS RPLGDAVLDG+DF I+  S  +WD LA+AL  +S    V LAAAPQC +PD +LD AIKTGLFDYVWVQ
Subjt:  AGNYRLSSAEDAREVATYIWNNYLGG-ASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYSSD--VLLAAAPQCFYPDYYLDNAIKTGLFDYVWVQ

Query:  FYNNPPCQYSAGNYRL-------------------------------SSADDAK---------------------------------QVADYLWNNYLGG
        F+NNPPCQYS     L                                SADD                                    V +Y+WN++LGG
Subjt:  FYNNPPCQYSAGNYRL-------------------------------SSADDAK---------------------------------QVADYLWNNYLGG

Query:  TSASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKNYSSD--VLLAAAPQCFYPDYYLDTAINTGLFDYVWVQFYNNPPCQYSPGNSQNLLNSWLTWSNT
         S S PLGDA+LDG+DF I  GS  +WD LA+AL  +S    V LAAAPQC +PD +LDTAI TGLFDYVWVQFYNNP CQYS G++ NLL +W  W   
Subjt:  TSASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKNYSSD--VLLAAAPQCFYPDYYLDTAINTGLFDYVWVQFYNNPPCQYSPGNSQNLLNSWLTWSNT

Query:  NAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPVIKTDPKYGGVMLWNRYFDVITKYSDAIK
         +  +FLGLPA+P+AAP+GG+I    LIT+ LP IK+ PKYGGVMLW++ FD    YSDAIK
Subjt:  NAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPVIKTDPKYGGVMLWNRYFDVITKYSDAIK

KAF9667427.1 hypothetical protein SADUNF_Sadunf15G0021900 [Salix dunnii]1.0e-12544.33Show/hide
Query:  SVTFLAFFFLNSLFESAHAAGIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIKIFLSIGGG
        ++T L    + SL + ++ AGIA+YWGQ+G EGSLA+ C T NY++VN+AFL +FG+GQ P LNLA HCDPS G C   S EIK+CQS+GIK+ LSIGGG
Subjt:  SVTFLAFFFLNSLFESAHAAGIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIKIFLSIGGG

Query:  AGNYRLSSAEDAREVATYIWNNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYSSDVLLAAAPQCFYPDYYLDNAIKTGLFDYVWVQFYN
           Y LSSA+DA +VA YIWNN+LGG SSSRPLGDA+LDG+DFDIE GS  +WD LA+ALK ++  + LAAAPQC +PD +LD AIKTGLFDYVWVQFYN
Subjt:  AGNYRLSSAEDAREVATYIWNNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYSSDVLLAAAPQCFYPDYYLDNAIKTGLFDYVWVQFYN

Query:  NPPCQYS---------------------------------------------------------------------------------------------
        NP CQYS                                                                                             
Subjt:  NPPCQYS---------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------AGNYRL
                                                                                                      AG+Y L
Subjt:  ----------------------------------------------------------------------------------------------AGNYRL

Query:  SSADDAKQVADYLWNNYLGGTSASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKNYSSD--VLLAAAPQCFYPDYYLDTAINTGLFDYVWVQFYNNPPC
        SSADDA QVA+Y+W+N+LGG S+SRPLGDA+LDG+DF I AGS  +WD LA+AL  +S    V LAAAPQC +PD  LDTAI TGLFDYVWVQFYNNPPC
Subjt:  SSADDAKQVADYLWNNYLGGTSASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKNYSSD--VLLAAAPQCFYPDYYLDTAINTGLFDYVWVQFYNNPPC

Query:  QYSPGNSQNLLNSWLTWSNTNAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPVIKTDPKYGGVMLWNRYFDVITKYSDAIK
        QYS G++ NLLN+W  W       VFLGLPA+ +AAP+GG+IP   LI+Q LP IK+ PKYGGVMLW++ FD    YSDAIK
Subjt:  QYSPGNSQNLLNSWLTWSNTNAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPVIKTDPKYGGVMLWNRYFDVITKYSDAIK

PHT40860.1 hypothetical protein CQW23_19714 [Capsicum baccatum]9.7e-12443.8Show/hide
Query:  LFESAHAAGIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIKIFLSIGGGAGNYRLSSAEDA
        L  S  A GIA+YWGQNG EG+LAE CAT NY +V IAFL +FG+GQ P +NLAGHCDPS G C  LS +IK+CQ++GIK+ LSIGGGAG+Y L+SA+DA
Subjt:  LFESAHAAGIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIKIFLSIGGGAGNYRLSSAEDA

Query:  REVATYIWNNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYSS---DVLLAAAPQCFYPDYYLDNAIKTGLFDYVWVQFYNNPPCQYS--
        R+VATY+WNN+LGG S++RPLGDAVLDGIDFDIEGG+NLYWDVLA++L  YSS    V L AAPQC +PD ++ NA+KTGLFDYVWVQFYNNPPCQYS  
Subjt:  REVATYIWNNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYSS---DVLLAAAPQCFYPDYYLDNAIKTGLFDYVWVQFYNNPPCQYS--

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------A
                                                                                                           A
Subjt:  ---------------------------------------------------------------------------------------------------A

Query:  GNYRLSSADDAKQVADYLWNNYLGGTSASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKNYSS---DVLLAAAPQCFYPDYYLDTAINTGLFDYVWVQF
        G+Y L+SADDAKQVA YLWNN+LGG S +RPLGDAVLDGIDF I  G+NLYWDVLA++L  YSS    V L AAPQC +PD ++  A+ TGLFDYVWVQF
Subjt:  GNYRLSSADDAKQVADYLWNNYLGGTSASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKNYSS---DVLLAAAPQCFYPDYYLDTAINTGLFDYVWVQF

Query:  YNNPPCQYSPGNSQNLLNSWLTW-SNTNAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPVIKTDPKYGGVMLWNRYFDVITKYSDAIK
        YNNPPCQYS  +  NL  +W  W ++  A  +FLGLPA+P AA   G+IP   L +Q LP IKT  KYGGVMLW++Y+D  T YS +IK
Subjt:  YNNPPCQYSPGNSQNLLNSWLTW-SNTNAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPVIKTDPKYGGVMLWNRYFDVITKYSDAIK

PPR95643.1 hypothetical protein GOBAR_AA25025 [Gossypium barbadense]9.1e-13051.72Show/hide
Query:  ISVTFLAFFFLNSLFESAHAAGIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIKIFLSIGG
        ISV F+ +  +  L   ++  GI++YWGQNGNEG+LA+ CAT NY+YVNIAFL +FG+GQ P +NLAGHCDP   GC  LS++IK+CQ++G+K+ LSIGG
Subjt:  ISVTFLAFFFLNSLFESAHAAGIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIKIFLSIGG

Query:  GAGNYRLSSAEDAREVATYIWNNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYS---SDVLLAAAPQCFYPDYYLDNAIKTGLFDYVWV
        GAG+Y L+S++DAR+VATY+WNN+LGG SSSRPLG A+LDGIDFDIEGG+  YWD LA+ L  YS     V L AAPQC +PD ++ NA+KTGLFDYVWV
Subjt:  GAGNYRLSSAEDAREVATYIWNNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYS---SDVLLAAAPQCFYPDYYLDNAIKTGLFDYVWV

Query:  QFYNNPPCQYSAGNYR--------------------------------------------------------------LSSADDAKQVADYLWNNYLGGT
        QFYNNPPCQYS+ +                                                                    +DA+QVA YLWNN+LGGT
Subjt:  QFYNNPPCQYSAGNYR--------------------------------------------------------------LSSADDAKQVADYLWNNYLGGT

Query:  SASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKNYS---SDVLLAAAPQCFYPDYYLDTAINTGLFDYVWVQFYNNPPCQYSPGNSQNLLNSWLTW-SN
        S+SRPLG A+LDGIDF I  G+  YWD LA+ L  YS     V L AAPQC +PD ++  A+ TGLFDYVWVQFYNNPPCQYS  +  NL ++W  W S+
Subjt:  SASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKNYS---SDVLLAAAPQCFYPDYYLDTAINTGLFDYVWVQFYNNPPCQYSPGNSQNLLNSWLTW-SN

Query:  TNAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPVIKTDPKYGGVMLWNRYFDVITKYSDAIKPFI
          A  +FLGLPA+PDAA + G+IP   L ++ LP IK   KYGGVMLW++Y+D  + YS  IK  +
Subjt:  TNAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPVIKTDPKYGGVMLWNRYFDVITKYSDAIKPFI

TrEMBL top hitse value%identityAlignment
A0A2G2W6L4 Uncharacterized protein4.7e-12443.8Show/hide
Query:  LFESAHAAGIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIKIFLSIGGGAGNYRLSSAEDA
        L  S  A GIA+YWGQNG EG+LAE CAT NY +V IAFL +FG+GQ P +NLAGHCDPS G C  LS +IK+CQ++GIK+ LSIGGGAG+Y L+SA+DA
Subjt:  LFESAHAAGIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIKIFLSIGGGAGNYRLSSAEDA

Query:  REVATYIWNNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYSS---DVLLAAAPQCFYPDYYLDNAIKTGLFDYVWVQFYNNPPCQYS--
        R+VATY+WNN+LGG S++RPLGDAVLDGIDFDIEGG+NLYWDVLA++L  YSS    V L AAPQC +PD ++ NA+KTGLFDYVWVQFYNNPPCQYS  
Subjt:  REVATYIWNNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYSS---DVLLAAAPQCFYPDYYLDNAIKTGLFDYVWVQFYNNPPCQYS--

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------A
                                                                                                           A
Subjt:  ---------------------------------------------------------------------------------------------------A

Query:  GNYRLSSADDAKQVADYLWNNYLGGTSASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKNYSS---DVLLAAAPQCFYPDYYLDTAINTGLFDYVWVQF
        G+Y L+SADDAKQVA YLWNN+LGG S +RPLGDAVLDGIDF I  G+NLYWDVLA++L  YSS    V L AAPQC +PD ++  A+ TGLFDYVWVQF
Subjt:  GNYRLSSADDAKQVADYLWNNYLGGTSASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKNYSS---DVLLAAAPQCFYPDYYLDTAINTGLFDYVWVQF

Query:  YNNPPCQYSPGNSQNLLNSWLTW-SNTNAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPVIKTDPKYGGVMLWNRYFDVITKYSDAIK
        YNNPPCQYS  +  NL  +W  W ++  A  +FLGLPA+P AA   G+IP   L +Q LP IKT  KYGGVMLW++Y+D  T YS +IK
Subjt:  YNNPPCQYSPGNSQNLLNSWLTW-SNTNAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPVIKTDPKYGGVMLWNRYFDVITKYSDAIK

A0A2G2YXU9 Acidic endochitinase1.4e-12343.63Show/hide
Query:  LFESAHAAGIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIKIFLSIGGGAGNYRLSSAEDA
        L  S  A GIA+YWGQNG EG+LAE CAT NY +V IAFL +FG+GQ P +NLAGHCDPS G C  LS +IK+CQ++GIK+ LSIGGGAG+Y L+SA+DA
Subjt:  LFESAHAAGIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIKIFLSIGGGAGNYRLSSAEDA

Query:  REVATYIWNNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYSS---DVLLAAAPQCFYPDYYLDNAIKTGLFDYVWVQFYNNPPCQYS--
        R+VATY+WNN+LGG S++RPLGDAVLDGIDFDIEGG+NLYWDVLA++L  YSS    V L AAPQC +PD ++ NA+KTGLFDYVWVQFYNNPPCQYS  
Subjt:  REVATYIWNNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYSS---DVLLAAAPQCFYPDYYLDNAIKTGLFDYVWVQFYNNPPCQYS--

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------A
                                                                                                           A
Subjt:  ---------------------------------------------------------------------------------------------------A

Query:  GNYRLSSADDAKQVADYLWNNYLGGTSASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKNYSS---DVLLAAAPQCFYPDYYLDTAINTGLFDYVWVQF
        G+Y L+SADDA+QVA YLWNN+LGG S +RPLGDAVLDGIDF I  G+NLYWDVLA++L  YSS    V L AAPQC +PD ++  A+ TGLFDYVWVQF
Subjt:  GNYRLSSADDAKQVADYLWNNYLGGTSASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKNYSS---DVLLAAAPQCFYPDYYLDTAINTGLFDYVWVQF

Query:  YNNPPCQYSPGNSQNLLNSWLTW-SNTNAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPVIKTDPKYGGVMLWNRYFDVITKYSDAIK
        YNNPPCQYS  +  NL  +W  W ++  A  +FLGLPA+P AA   G+IP   L +Q LP IKT  KYGGVMLW++Y+D  T YS +IK
Subjt:  YNNPPCQYSPGNSQNLLNSWLTW-SNTNAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPVIKTDPKYGGVMLWNRYFDVITKYSDAIK

A0A2P5WX23 Glyco_hydro_18 domain-containing protein4.4e-13051.72Show/hide
Query:  ISVTFLAFFFLNSLFESAHAAGIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIKIFLSIGG
        ISV F+ +  +  L   ++  GI++YWGQNGNEG+LA+ CAT NY+YVNIAFL +FG+GQ P +NLAGHCDP   GC  LS++IK+CQ++G+K+ LSIGG
Subjt:  ISVTFLAFFFLNSLFESAHAAGIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIKIFLSIGG

Query:  GAGNYRLSSAEDAREVATYIWNNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYS---SDVLLAAAPQCFYPDYYLDNAIKTGLFDYVWV
        GAG+Y L+S++DAR+VATY+WNN+LGG SSSRPLG A+LDGIDFDIEGG+  YWD LA+ L  YS     V L AAPQC +PD ++ NA+KTGLFDYVWV
Subjt:  GAGNYRLSSAEDAREVATYIWNNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYS---SDVLLAAAPQCFYPDYYLDNAIKTGLFDYVWV

Query:  QFYNNPPCQYSAGNYR--------------------------------------------------------------LSSADDAKQVADYLWNNYLGGT
        QFYNNPPCQYS+ +                                                                    +DA+QVA YLWNN+LGGT
Subjt:  QFYNNPPCQYSAGNYR--------------------------------------------------------------LSSADDAKQVADYLWNNYLGGT

Query:  SASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKNYS---SDVLLAAAPQCFYPDYYLDTAINTGLFDYVWVQFYNNPPCQYSPGNSQNLLNSWLTW-SN
        S+SRPLG A+LDGIDF I  G+  YWD LA+ L  YS     V L AAPQC +PD ++  A+ TGLFDYVWVQFYNNPPCQYS  +  NL ++W  W S+
Subjt:  SASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKNYS---SDVLLAAAPQCFYPDYYLDTAINTGLFDYVWVQFYNNPPCQYSPGNSQNLLNSWLTW-SN

Query:  TNAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPVIKTDPKYGGVMLWNRYFDVITKYSDAIKPFI
          A  +FLGLPA+PDAA + G+IP   L ++ LP IK   KYGGVMLW++Y+D  + YS  IK  +
Subjt:  TNAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPVIKTDPKYGGVMLWNRYFDVITKYSDAIKPFI

A0A6A6MFZ3 Uncharacterized protein1.6e-12747.44Show/hide
Query:  GIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIKIFLSIGGGAGNYRLSSAEDAREVATYIW
        GIA+YWGQNGNEG+L + C+T  Y YVNIAFL  FG+GQ P++NLAGHC+P+ GGC  +S  I++CQ QGIK+ LS+GGG G+Y L+S  DA+ VA Y+W
Subjt:  GIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIKIFLSIGGGAGNYRLSSAEDAREVATYIW

Query:  NNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYSSD---VLLAAAPQCFYPDYYLDNAIKTGLFDYVWVQFYNNPPCQYSAGN-------
        NN+LGG SSSRPLGDAVLDGIDFDIE GS LYWD LA+ L  YS     V L AAPQC +PD YL  A+ TGLFDYVWVQFYNNPPCQYS+GN       
Subjt:  NNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYSSD---VLLAAAPQCFYPDYYLDNAIKTGLFDYVWVQFYNNPPCQYSAGN-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------YRLSSADDAKQVADYLWNNYLGGTSASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKN
                                                 Y L+S  DAK VADYLWNN+LGG S+SRPLGDAVLDGIDF I  GS LYWD LA+ L  
Subjt:  -----------------------------------------YRLSSADDAKQVADYLWNNYLGGTSASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKN

Query:  YSSD---VLLAAAPQCFYPDYYLDTAINTGLFDYVWVQFYNNPPCQYSPGNSQNLLNSWLTWSNT-NAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPV
        YS     V L AAPQC +PD YL TA+NTGLFDYVWVQFYNNPPCQYS GN  N++NSW  W+ + NA  +FLGLPA+P+AA + GY+PP VLI+Q LP 
Subjt:  YSSD---VLLAAAPQCFYPDYYLDTAINTGLFDYVWVQFYNNPPCQYSPGNSQNLLNSWLTWSNT-NAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPV

Query:  IKTDPKYGGVMLWNRYFDVITKYSDAI
        IK  P+YGGVMLW++++D    YS +I
Subjt:  IKTDPKYGGVMLWNRYFDVITKYSDAI

A0A6N2LR29 Protein kinase domain-containing protein1.0e-12345.26Show/hide
Query:  SLFESAHAAGIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIKIFLSIGGGAGNYRLSSAED
        SL + ++   IA+YWGQ+G EGSLA+ CA+ NY++VN+AFL +FG+GQ P LNLAGHCDPS G C   S EIK+CQSQGIK+ LSIGGGAGNY LSSA+D
Subjt:  SLFESAHAAGIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIKIFLSIGGGAGNYRLSSAED

Query:  AREVATYIWNNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYSSDVLLAAAPQCFYPDYYLDNAIKTGLFDYVWVQFYNNPPCQYS----
        A +VA YIWNN+LGG S  RPLGDA+LDG+DF IE GS  +WD LA+AL  +S  + LAAAPQC +PD  LD AIKTGLFDYVWVQFYNNP CQYS    
Subjt:  AREVATYIWNNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYSSDVLLAAAPQCFYPDYYLDNAIKTGLFDYVWVQFYNNPPCQYS----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------AGNYR------------------------------------------------------------LSSADDAKQVADYLWNNYLGGTSA
                    GNY+                                                            LSSADDA QVA Y+WNN+LGG S 
Subjt:  -----------AGNYR------------------------------------------------------------LSSADDAKQVADYLWNNYLGGTSA

Query:  SRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKNYSSD--VLLAAAPQCFYPDYYLDTAINTGLFDYVWVQFYNNPPCQYSPGNSQNLLNSWLTWSNTNAK
         RPLGDA+LDG+DF I AGS  +WD LA+AL  +S    V LAAAPQC +PD  LDTAI TGLFDYVWVQ YNNP CQYS G++ NLLN+W  W      
Subjt:  SRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKNYSSD--VLLAAAPQCFYPDYYLDTAINTGLFDYVWVQFYNNPPCQYSPGNSQNLLNSWLTWSNTNAK

Query:  LVFLGLPASPDAAPTGGYIPPQVLITQSLPVIKTDPKYGGVMLWNRYFDVITKYSDAIK
         VFLGLPA+ +AAP+GG+IP   LIT+ LP IK+ PKYGGVMLW++ +D    YSDAIK
Subjt:  LVFLGLPASPDAAPTGGYIPPQVLITQSLPVIKTDPKYGGVMLWNRYFDVITKYSDAIK

SwissProt top hitse value%identityAlignment
P17541 Acidic endochitinase2.3e-9145.89Show/hide
Query:  MANHHRSISVTFLAFFFLNSLFESAHAAGIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIK
        MA H   I+ T   FF L+S+F S+ AAGIA+YWGQNGNEGSLA  CAT NY++VNIAFL SFGSGQ P LNLAGHC+P   GC +LS EI +C+SQ +K
Subjt:  MANHHRSISVTFLAFFFLNSLFESAHAAGIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIK

Query:  IFLSIGGGAGNYRLSSAEDAREVATYIWNNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYSSDVLLAAAPQCFYPDYYLDNAIKTGLFD
        + LSIGGGAG+Y LSSA+DA++VA +IWN+YLGG S SRPLG AVLDG+DFDIE GS  +WDVLAQ LK++   V+L+AAPQC  PD +LD AIKTGLFD
Subjt:  IFLSIGGGAGNYRLSSAEDAREVATYIWNNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYSSDVLLAAAPQCFYPDYYLDNAIKTGLFD

Query:  YVWVQFYNNPPCQYSAGNYRLSSADDAKQVADYLWNNYLGGTSASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKNYSSDVLLAAAPQCFYPDYYLDTA
         VWVQFYNNPPC ++                                                                                     
Subjt:  YVWVQFYNNPPCQYSAGNYRLSSADDAKQVADYLWNNYLGGTSASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKNYSSDVLLAAAPQCFYPDYYLDTA

Query:  INTGLFDYVWVQFYNNPPCQYSPGNSQNLLNSWLTWSNTNAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPVIKTDPKYGGVMLWNRYFDVITKYSDAI
                                N+ NLL+SW  W+      +++GLPA+ +AAP+GG+IP  VLI+Q LP IK    YGGVMLW++ FD    YSD+I
Subjt:  INTGLFDYVWVQFYNNPPCQYSPGNSQNLLNSWLTWSNTNAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPVIKTDPKYGGVMLWNRYFDVITKYSDAI

Query:  K
        K
Subjt:  K

P23472 Hevamine-A5.6e-9044.67Show/hide
Query:  MANHHRSISVTFLAFFFLNSLFESAH--AAGIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQG
        MA   ++I +  LA   ++ +  S+H    GIA+YWGQNGNEG+L + C+T  Y YVNIAFL  FG+GQ P++NLAGHC+P+ GGC  +S  I++CQ QG
Subjt:  MANHHRSISVTFLAFFFLNSLFESAH--AAGIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQG

Query:  IKIFLSIGGGAGNYRLSSAEDAREVATYIWNNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYSSDVLLAAAPQCFYPDYYLDNAIKTGL
        IK+ LS+GGG G+Y L+S  DA+ VA Y+WNN+LGG SSSRPLGDAVLDGIDFDIE GS LYWD LA+ L  YS                          
Subjt:  IKIFLSIGGGAGNYRLSSAEDAREVATYIWNNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYSSDVLLAAAPQCFYPDYYLDNAIKTGL

Query:  FDYVWVQFYNNPPCQYSAGNYRLSSADDAKQVADYLWNNYLGGTSASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKNYSSDVLLAAAPQCFYPDYYLD
                                                                        G  +Y                L AAPQC +PD YL 
Subjt:  FDYVWVQFYNNPPCQYSAGNYRLSSADDAKQVADYLWNNYLGGTSASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKNYSSDVLLAAAPQCFYPDYYLD

Query:  TAINTGLFDYVWVQFYNNPPCQYSPGNSQNLLNSWLTWSNT-NAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPVIKTDPKYGGVMLWNRYFDVITKYS
        TA+NTGLFDYVWVQFYNNPPCQYS GN  N++NSW  W+ + NA  +FLGLPA+P+AA + GY+PP VLI++ LP IK  PKYGGVMLW++++D    YS
Subjt:  TAINTGLFDYVWVQFYNNPPCQYSPGNSQNLLNSWLTWSNT-NAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPVIKTDPKYGGVMLWNRYFDVITKYS

Query:  DAI
         +I
Subjt:  DAI

P29024 Acidic endochitinase1.9e-9045.32Show/hide
Query:  RSISVTFLAFFFLNSLFESAHAAGIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIKIFLSI
        + +S   L   F+ S F+ +HA GI+VYWGQNGNEGSLA+AC T NYKYVNIAFL++FG GQ P+LNLAGHC+PS   C   S +IK CQS+ IK+ LS+
Subjt:  RSISVTFLAFFFLNSLFESAHAAGIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIKIFLSI

Query:  GGGAGNYRLSSAEDAREVATYIWNNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYSSDVLLAAAPQCFYPDYYLDNAIKTGLFDYVWVQ
        GG +G+Y L+SA+DA +VA YIWNN+LGG SSSRPLGDA+LDG+DFDIE G+  +WD LA+ALK ++S +LL AAPQC                      
Subjt:  GGGAGNYRLSSAEDAREVATYIWNNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYSSDVLLAAAPQCFYPDYYLDNAIKTGLFDYVWVQ

Query:  FYNNPPCQYSAGNYRLSSADDAKQVADYLWNNYLGGTSASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKNYSSDVLLAAAPQCFYPDYYLDTAINTGL
                                                 P+                                            PD +LDTAI TGL
Subjt:  FYNNPPCQYSAGNYRLSSADDAKQVADYLWNNYLGGTSASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKNYSSDVLLAAAPQCFYPDYYLDTAINTGL

Query:  FDYVWVQFYNNPPCQYSPGNSQNLLNSWLTWSNTNAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPVIKTDPKYGGVMLWNRYFDVITKYSDAI
        FD VWVQFYNNPPCQYS GN+ +L++SW  W+++ AK +FLG+PAS  AA   G+IP  VL +Q LP IK   KYGGVMLW+R+ D  + YS AI
Subjt:  FDYVWVQFYNNPPCQYSPGNSQNLLNSWLTWSNTNAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPVIKTDPKYGGVMLWNRYFDVITKYSDAI

P36908 Acidic endochitinase2.7e-9246.39Show/hide
Query:  AFFFLNSLFESAHAAGIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIKIFLSIGGGAGNYR
        +   ++ L +S++AAGIAVYWGQNGNEGSL +AC T+NY++VNIAFL +FG+GQ P++NLAGHCDPS  GC   S EI+ACQ++GIK+ LS+GGGAG+Y 
Subjt:  AFFFLNSLFESAHAAGIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIKIFLSIGGGAGNYR

Query:  LSSAEDAREVATYIWNNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYSSDVLLAAAPQCFYPDYYLDNAIKTGLFDYVWVQFYNNPPCQ
        L+SAE+A  +A Y+WNN+LGG S+SRPLGDAVLDGIDFDIE G   ++D LA+AL  +S                                         
Subjt:  LSSAEDAREVATYIWNNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYSSDVLLAAAPQCFYPDYYLDNAIKTGLFDYVWVQFYNNPPCQ

Query:  YSAGNYRLSSADDAKQVADYLWNNYLGGTSASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKNYSSDVLLAAAPQCFYPDYYLDTAINTGLFDYVWVQF
                                                                            V L+AAPQC YPD +LD+AI TGLFDYVWVQF
Subjt:  YSAGNYRLSSADDAKQVADYLWNNYLGGTSASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKNYSSDVLLAAAPQCFYPDYYLDTAINTGLFDYVWVQF

Query:  YNNPPCQYSPGNSQNLLNSWLTWSNTNAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPVIKTDPKYGGVMLWNRYFDVITKYSDAIK
        YNNP CQYS GN  NL+N+W  W+++ AK VFLG+PAS  AAP+GG IP  VL +Q LP IKT PKYGGVM+W+R+ D  + YS+AIK
Subjt:  YNNPPCQYSPGNSQNLLNSWLTWSNTNAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPVIKTDPKYGGVMLWNRYFDVITKYSDAIK

P51614 Acidic endochitinase6.8e-8846.21Show/hide
Query:  SLFESAHAAGIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIKIFLSIGGGAGNYRLSSAED
        +L ++++A GIA+YWGQNGNEG+L + C T  Y YVNIAFL  FG+GQ P +NLAGHC+P+  GC  +S  I+ CQ++GIK+ LSIGGGAG+Y LSS+ D
Subjt:  SLFESAHAAGIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIKIFLSIGGGAGNYRLSSAED

Query:  AREVATYIWNNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYSSDVLLAAAPQCFYPDYYLDNAIKTGLFDYVWVQFYNNPPCQYSAGNY
        A+ VA Y+WNN+LGG SSSRPLGDAVLDGIDFDIE GS L+WD LA+A                                                    
Subjt:  AREVATYIWNNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYSSDVLLAAAPQCFYPDYYLDNAIKTGLFDYVWVQFYNNPPCQYSAGNY

Query:  RLSSADDAKQVADYLWNNYLGGTSASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKNYSSDVLLAAAPQCFYPDYYLDTAINTGLFDYVWVQFYNNPPC
                                         L  I+F    G  +Y                L AAPQC +PD    TA+NTGLFDYVWVQFYNNPPC
Subjt:  RLSSADDAKQVADYLWNNYLGGTSASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKNYSSDVLLAAAPQCFYPDYYLDTAINTGLFDYVWVQFYNNPPC

Query:  QYSPGNSQNLLNSWLTW-SNTNAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPVIKTDPKYGGVMLWNRYFDVITKYSDAIK
        QYS GN+ NLLNSW  W S+ N+   F+GLPAS  AA   G+IP  VL +Q LPVIK  PKYGGVMLW++Y+D  + YS +IK
Subjt:  QYSPGNSQNLLNSWLTW-SNTNAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPVIKTDPKYGGVMLWNRYFDVITKYSDAIK

Arabidopsis top hitse value%identityAlignment
AT5G24090.1 chitinase A7.5e-8241.62Show/hide
Query:  VTFLAFFFLNSLFESAHAA--GIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIKIFLSIGG
        V +  FF   SL + + A+  GIA+YWGQNGNEG+L+  CAT  Y YVN+AFL  FG+GQ P LNLAGHC+P+   C +  +++K CQS+GIK+ LS+GG
Subjt:  VTFLAFFFLNSLFESAHAA--GIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIKIFLSIGG

Query:  GAGNYRLSSAEDAREVATYIWNNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYSSDVLLAAAPQCFYPDYYLDNAIKTGLFDYVWVQFY
        G GNY + S EDA+ +A Y+WNN+LGG SSSRPLGDAVLDGIDF+IE GS  +WD LA+ L  +S                                   
Subjt:  GAGNYRLSSAEDAREVATYIWNNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYSSDVLLAAAPQCFYPDYYLDNAIKTGLFDYVWVQFY

Query:  NNPPCQYSAGNYRLSSADDAKQVADYLWNNYLGGTSASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKNYSSDVLLAAAPQCFYPDYYLDTAINTGLFD
                                                               G  +Y                L  APQC +PD  + +A+NT  FD
Subjt:  NNPPCQYSAGNYRLSSADDAKQVADYLWNNYLGGTSASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKNYSSDVLLAAAPQCFYPDYYLDTAINTGLFD

Query:  YVWVQFYNNPPCQYSPGNSQNLLNSWLTWSNT-NAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPVIKTDPKYGGVMLWNRYFDVITKYSDAI
        YVW+QFYNNPPC YS GN+QNL +SW  W+ +  A+  FLGLPA+P+AA + GYIPP VL +Q LP +K   KYGGVMLW++++D    YS +I
Subjt:  YVWVQFYNNPPCQYSPGNSQNLLNSWLTWSNT-NAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPVIKTDPKYGGVMLWNRYFDVITKYSDAI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAACCACCACAGATCAATCTCAGTAACCTTCTTAGCTTTCTTCTTCCTCAATTCTCTGTTCGAGTCCGCCCATGCAGCGGGCATTGCAGTCTATTGGGGCCAAAA
TGGCAATGAAGGCAGCCTTGCCGAAGCATGCGCTACTTCAAACTACAAATATGTAAACATAGCCTTCTTGTACTCCTTCGGTAGTGGCCAAATGCCGCGGCTGAATCTTG
CAGGCCACTGCGACCCAAGCTTCGGCGGCTGCCAATACCTAAGCGCTGAGATCAAAGCTTGCCAAAGTCAAGGCATCAAAATCTTTCTCTCCATCGGAGGCGGTGCCGGA
AACTACCGACTTTCCTCGGCCGAAGATGCGAGAGAAGTGGCTACCTACATCTGGAACAACTACCTCGGCGGCGCATCCTCTTCACGGCCCCTCGGAGATGCCGTCTTGGA
CGGCATAGATTTCGATATTGAAGGTGGTTCGAATCTATATTGGGATGTGCTTGCTCAAGCTCTGAAAGATTATAGCAGTGATGTTCTTCTTGCAGCAGCTCCACAATGCT
TTTATCCAGACTATTATTTGGACAATGCTATAAAAACAGGTCTATTTGATTACGTTTGGGTACAATTCTATAACAACCCTCCATGTCAATACTCTGCCGGAAACTACCGA
CTTTCCTCGGCCGACGATGCGAAACAAGTGGCTGACTACCTCTGGAACAACTACCTCGGCGGCACATCCGCTTCACGGCCGCTCGGAGATGCCGTCTTGGACGGCATCGA
TTTCTATATTGCAGCTGGTTCGAATCTGTATTGGGATGTGCTTGCTCAAGCTCTGAAAAATTACAGCAGTGATGTTCTTCTTGCAGCAGCTCCACAATGCTTTTATCCAG
ACTATTATTTGGACACTGCTATAAATACAGGTCTATTTGATTACGTTTGGGTACAATTCTATAACAATCCTCCATGTCAATATTCTCCTGGAAACAGTCAAAATCTATTG
AACTCATGGCTAACATGGTCAAACACCAATGCTAAGTTGGTGTTTTTAGGTTTGCCTGCTTCTCCAGATGCTGCACCAACTGGTGGATACATTCCTCCACAAGTGCTTAT
TACTCAAAGTCTTCCAGTCATCAAGACTGATCCTAAGTATGGAGGAGTTATGCTATGGAACAGATATTTTGATGTTATTACTAAATATAGTGATGCCATCAAGCCTTTTA
TTAATGTTAATAATGTGCAATTGCTAAAGCAAGCTTTTGAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGAACCACCACAGATCAATCTCAGTAACCTTCTTAGCTTTCTTCTTCCTCAATTCTCTGTTCGAGTCCGCCCATGCAGCGGGCATTGCAGTCTATTGGGGCCAAAA
TGGCAATGAAGGCAGCCTTGCCGAAGCATGCGCTACTTCAAACTACAAATATGTAAACATAGCCTTCTTGTACTCCTTCGGTAGTGGCCAAATGCCGCGGCTGAATCTTG
CAGGCCACTGCGACCCAAGCTTCGGCGGCTGCCAATACCTAAGCGCTGAGATCAAAGCTTGCCAAAGTCAAGGCATCAAAATCTTTCTCTCCATCGGAGGCGGTGCCGGA
AACTACCGACTTTCCTCGGCCGAAGATGCGAGAGAAGTGGCTACCTACATCTGGAACAACTACCTCGGCGGCGCATCCTCTTCACGGCCCCTCGGAGATGCCGTCTTGGA
CGGCATAGATTTCGATATTGAAGGTGGTTCGAATCTATATTGGGATGTGCTTGCTCAAGCTCTGAAAGATTATAGCAGTGATGTTCTTCTTGCAGCAGCTCCACAATGCT
TTTATCCAGACTATTATTTGGACAATGCTATAAAAACAGGTCTATTTGATTACGTTTGGGTACAATTCTATAACAACCCTCCATGTCAATACTCTGCCGGAAACTACCGA
CTTTCCTCGGCCGACGATGCGAAACAAGTGGCTGACTACCTCTGGAACAACTACCTCGGCGGCACATCCGCTTCACGGCCGCTCGGAGATGCCGTCTTGGACGGCATCGA
TTTCTATATTGCAGCTGGTTCGAATCTGTATTGGGATGTGCTTGCTCAAGCTCTGAAAAATTACAGCAGTGATGTTCTTCTTGCAGCAGCTCCACAATGCTTTTATCCAG
ACTATTATTTGGACACTGCTATAAATACAGGTCTATTTGATTACGTTTGGGTACAATTCTATAACAATCCTCCATGTCAATATTCTCCTGGAAACAGTCAAAATCTATTG
AACTCATGGCTAACATGGTCAAACACCAATGCTAAGTTGGTGTTTTTAGGTTTGCCTGCTTCTCCAGATGCTGCACCAACTGGTGGATACATTCCTCCACAAGTGCTTAT
TACTCAAAGTCTTCCAGTCATCAAGACTGATCCTAAGTATGGAGGAGTTATGCTATGGAACAGATATTTTGATGTTATTACTAAATATAGTGATGCCATCAAGCCTTTTA
TTAATGTTAATAATGTGCAATTGCTAAAGCAAGCTTTTGAGTAA
Protein sequenceShow/hide protein sequence
MANHHRSISVTFLAFFFLNSLFESAHAAGIAVYWGQNGNEGSLAEACATSNYKYVNIAFLYSFGSGQMPRLNLAGHCDPSFGGCQYLSAEIKACQSQGIKIFLSIGGGAG
NYRLSSAEDAREVATYIWNNYLGGASSSRPLGDAVLDGIDFDIEGGSNLYWDVLAQALKDYSSDVLLAAAPQCFYPDYYLDNAIKTGLFDYVWVQFYNNPPCQYSAGNYR
LSSADDAKQVADYLWNNYLGGTSASRPLGDAVLDGIDFYIAAGSNLYWDVLAQALKNYSSDVLLAAAPQCFYPDYYLDTAINTGLFDYVWVQFYNNPPCQYSPGNSQNLL
NSWLTWSNTNAKLVFLGLPASPDAAPTGGYIPPQVLITQSLPVIKTDPKYGGVMLWNRYFDVITKYSDAIKPFINVNNVQLLKQAFE