; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr013776 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr013776
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPathogenesis-related thaumatin superfamily protein
Genome locationtig00153996:144823..148997
RNA-Seq ExpressionSgr013776
SyntenySgr013776
Gene Ontology termsNA
InterPro domainsIPR001938 - Thaumatin family
IPR017949 - Thaumatin, conserved site
IPR032675 - Leucine-rich repeat domain superfamily
IPR037176 - Osmotin/thaumatin-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022155220.1 TMV resistance protein N-like isoform X1 [Momordica charantia]1.3e-11457.87Show/hide
Query:  MGREIVREESPKEPKRRSRLFLNDEVLDVLKKQQGADVTEGLSLKLPRSSVKFVQGSALIKMQKLRLLQLNYAHIVGGYFEHLSQELRWLCWHGFPLNFL
        MGREIV  ES KEP +RSR+F+ND+VLDVL+KQ+GADVTEGLSLKL  SS K     A  +MQKLRLLQLNYA I G  F+H+SQELRW+CWHGFPL  L
Subjt:  MGREIVREESPKEPKRRSRLFLNDEVLDVLKKQQGADVTEGLSLKLPRSSVKFVQGSALIKMQKLRLLQLNYAHIVGGYFEHLSQELRWLCWHGFPLNFL

Query:  PENFYLEKIVAIDLRYSQL-RFFGKKSKF-LEKLTILNLSRSHSLRHTPNFLKLPNLEKLKLKDCKSLVKVDDSIGSLKKLVFINLKGCICLNNLPVTLC
        P++F+LEK VAIDLRYS L +FF K+ KF LEKLTILNLS S  L  TPNF KLP+L+KL+L+DCKSLVKVDDSIG L+KL FINLK CICLN LP   C
Subjt:  PENFYLEKIVAIDLRYSQL-RFFGKKSKF-LEKLTILNLSRSHSLRHTPNFLKLPNLEKLKLKDCKSLVKVDDSIGSLKKLVFINLKGCICLNNLPVTLC

Query:  ELKFLGTLILSGCSKLESLPKNLEKMESLTTLTAEDTCIKQLPSTILGLKKLKCLILCRY----------------------------------------
        ELK L  LILSGCSKLE LP  L KMESLTTL A+ TCI++LPSTI+GLKKLKCL + R                                         
Subjt:  ELKFLGTLILSGCSKLESLPKNLEKMESLTTLTAEDTCIKQLPSTILGLKKLKCLILCRY----------------------------------------

Query:  ---------DAIPKDIGSLVSLVELDLQDNDFHGQPSSISHLSKLETLCLNDCIKLKKIPNLTPNLKTLIASKCKALESISDLSK-IEYMDILQVDDCPN
                 DAIPKDIGSLVSL+ LDL  NDF   PSSIS LS L  L  + C KL++I NL P  + +   KCKALE+IS+L K +E ++ + V +CP 
Subjt:  ---------DAIPKDIGSLVSLVELDLQDNDFHGQPSSISHLSKLETLCLNDCIKLKKIPNLTPNLKTLIASKCKALESISDLSK-IEYMDILQVDDCPN

Query:  LVEISGFDKLLPLIAYIYMEGS-SKMKQSFKE
        +VEIS FD ++P I  I M G  SK+ QSFK+
Subjt:  LVEISGFDKLLPLIAYIYMEGS-SKMKQSFKE

XP_022155243.1 TMV resistance protein N [Momordica charantia]1.3e-12256.33Show/hide
Query:  MGREIVREESPKEPKRRSRLFLNDEVLDVLKKQQGADVTEGLSLKLPRSSVKFVQGSALIKMQKLRLLQLNYAHIVGGYFEHLSQELRWLCWHGFPLNFL
        MGREIVRE+ PKEP+R SRL L+DEVL VL +Q+G D  EGL+LKLPR S + +   A  +MQ LRLLQLN+ ++  G F+HLSQELRWLCWHGFPL FL
Subjt:  MGREIVREESPKEPKRRSRLFLNDEVLDVLKKQQGADVTEGLSLKLPRSSVKFVQGSALIKMQKLRLLQLNYAHIVGGYFEHLSQELRWLCWHGFPLNFL

Query:  PENFYLEKIVAIDLRYSQLRFFGKKSKFLEKLTILNLSRSHSLRHTPNFLKLPNLEKLKLKDCKSLVKVDDSIGSLKKLVFINLKGCICLNNLPVTLCEL
        P++F++EK+VAIDLRYS +RFF K+SKFLEKL ILNLS SH L HTP+FLKLP+LEKLKLKDCK+LV++  SIG LK LVFINLK C  L +LP +  +L
Subjt:  PENFYLEKIVAIDLRYSQLRFFGKKSKFLEKLTILNLSRSHSLRHTPNFLKLPNLEKLKLKDCKSLVKVDDSIGSLKKLVFINLKGCICLNNLPVTLCEL

Query:  KFLGTLILSGCSKLESLPKNLEKMESLTTLTAEDTCIKQLPSTILGLKKLKCLILC--------------------------------------------
        K L TL +SGCSK+ +LP++L +++SL TL A+DT I+Q+PSTI+ LK LK L LC                                            
Subjt:  KFLGTLILSGCSKLESLPKNLEKMESLTTLTAEDTCIKQLPSTILGLKKLKCLILC--------------------------------------------

Query:  ------RYDAIPKDIGSLVSLVELDLQDNDFHGQPSSISHLSKLETLCLNDCIKLKKIPNLTPNLKTLIASKCKALESISDLSKIEYMDILQVDDCPNLV
                + IPKDIGSLVSL+ELDL+DN FH  PSSIS LSKL+TL L+ C +L+ IP+L P LK+L AS C +LES SDLS+++ M+ L V +CP LV
Subjt:  ------RYDAIPKDIGSLVSLVELDLQDNDFHGQPSSISHLSKLETLCLNDCIKLKKIPNLTPNLKTLIASKCKALESISDLSKIEYMDILQVDDCPNLV

Query:  EISGFDKLLPLIAYIYMEGSSKMKQSFKENILQGW-MISGFG
        EI G +KLL  I  I+MEG SKM  SFKE ILQGW ++SGFG
Subjt:  EISGFDKLLPLIAYIYMEGSSKMKQSFKENILQGW-MISGFG

XP_022983687.1 TMV resistance protein N [Cucurbita maxima]7.7e-11553.18Show/hide
Query:  MGREIVREESPKEPKRRSRLFLNDEVLDVLKKQQGADVTEGLSLKLPRSSVKFVQGSALIKMQKLRLLQLNYAHIVGGYFEHLSQELRWLCWHGFPLNFL
        MGREIVRE+ PKEP++ SRL L++EV+ VL + +G    EGLSLKLPR S + +   A  +MQ LRLLQLN+ ++ G  F+H+SQE+RWLCWHGFPL FL
Subjt:  MGREIVREESPKEPKRRSRLFLNDEVLDVLKKQQGADVTEGLSLKLPRSSVKFVQGSALIKMQKLRLLQLNYAHIVGGYFEHLSQELRWLCWHGFPLNFL

Query:  PENFYLEKIVAIDLRYSQLRFFGKKSKFLEKLTILNLSRSHSLRHTPNFLKLPNLEKLKLKDCKSLVKVDDSIGSLKKLVFINLKGCICLNNLPVTLCEL
        P++F++EK+VA+DLRYSQ+RFF K+SKFL+ L  LNLS SH L +TP+F KLPNLE LKLKDCKSLV++  +IG LK L+ INLK C CL +LP     L
Subjt:  PENFYLEKIVAIDLRYSQLRFFGKKSKFLEKLTILNLSRSHSLRHTPNFLKLPNLEKLKLKDCKSLVKVDDSIGSLKKLVFINLKGCICLNNLPVTLCEL

Query:  KFLGTLILSGCSKLESLPKNLEKMESLTTLTAEDTCIKQLPSTILGLKKLKCLILC---------------------RY---------------------
        K L TLILSGCSKL +LP++L +M SL TLTA+DT I+++PSTI+ LKKLK L LC                     +Y                     
Subjt:  KFLGTLILSGCSKLESLPKNLEKMESLTTLTAEDTCIKQLPSTILGLKKLKCLILC---------------------RY---------------------

Query:  -------DAIPKDIGSLVSLVELDLQDNDFHGQPSSISHLSKLETLCLNDCIKLKKIPNLTPNLKTLIASKCKALESISDLSKIEYMDILQVDDCPNLVE
               + IPKDIGSL+SL ELDL+DN FH  PSSIS L KLETL L+ C +L+ IP+L P+L +L AS C +LE   DLS ++ M  L V +CP L++
Subjt:  -------DAIPKDIGSLVSLVELDLQDNDFHGQPSSISHLSKLETLCLNDCIKLKKIPNLTPNLKTLIASKCKALESISDLSKIEYMDILQVDDCPNLVE

Query:  ISGFDKLLPLIAYIYMEGSSKMKQSFKENILQGWMISGFG
        I G + LL  I  I+MEG S M  SFKE+IL GW +SGFG
Subjt:  ISGFDKLLPLIAYIYMEGSSKMKQSFKENILQGWMISGFG

XP_023528161.1 TMV resistance protein N [Cucurbita pepo subsp. pepo]2.6e-11553.41Show/hide
Query:  MGREIVREESPKEPKRRSRLFLNDEVLDVLKKQQGADVTEGLSLKLPRSSVKFVQGSALIKMQKLRLLQLNYAHIVGGYFEHLSQELRWLCWHGFPLNFL
        MGREIVRE+ PKEP+R SRL L++EVL VL + +G    EGLSLKLPR S + +   A  +MQ LRLLQLN+ ++ G  F+H+SQE+RWLCWHGFPL FL
Subjt:  MGREIVREESPKEPKRRSRLFLNDEVLDVLKKQQGADVTEGLSLKLPRSSVKFVQGSALIKMQKLRLLQLNYAHIVGGYFEHLSQELRWLCWHGFPLNFL

Query:  PENFYLEKIVAIDLRYSQLRFFGKKSKFLEKLTILNLSRSHSLRHTPNFLKLPNLEKLKLKDCKSLVKVDDSIGSLKKLVFINLKGCICLNNLPVTLCEL
        P++F++EK+VA+DLRYSQ+RFF K+SKFL+ L  LNLS SH L +TP+F KLPNLE LKLKDCKSLV++  +IG LK L+ INLK C CL +LP     L
Subjt:  PENFYLEKIVAIDLRYSQLRFFGKKSKFLEKLTILNLSRSHSLRHTPNFLKLPNLEKLKLKDCKSLVKVDDSIGSLKKLVFINLKGCICLNNLPVTLCEL

Query:  KFLGTLILSGCSKLESLPKNLEKMESLTTLTAEDTCIKQLPSTILGLKKLKCLILC---------------------RY---------------------
        K L TLILSGCSKL +LP++L +M SL TLTA+DT I+++PSTI+ LK LK L LC                     +Y                     
Subjt:  KFLGTLILSGCSKLESLPKNLEKMESLTTLTAEDTCIKQLPSTILGLKKLKCLILC---------------------RY---------------------

Query:  -------DAIPKDIGSLVSLVELDLQDNDFHGQPSSISHLSKLETLCLNDCIKLKKIPNLTPNLKTLIASKCKALESISDLSKIEYMDILQVDDCPNLVE
               + IPKDIGSL+SL ELDL+DN FH  PSSIS L KLETL L+ C +L+ IP+L P+L +L AS C +LE   DLS ++ M  L V +CP L++
Subjt:  -------DAIPKDIGSLVSLVELDLQDNDFHGQPSSISHLSKLETLCLNDCIKLKKIPNLTPNLKTLIASKCKALESISDLSKIEYMDILQVDDCPNLVE

Query:  ISGFDKLLPLIAYIYMEGSSKMKQSFKENILQGWMISGFG
        I G + LL  I  I+MEG S +  SFKENIL GW +SGFG
Subjt:  ISGFDKLLPLIAYIYMEGSSKMKQSFKENILQGWMISGFG

XP_038905549.1 disease resistance protein RPV1-like isoform X1 [Benincasa hispida]2.4e-11653.51Show/hide
Query:  MGREIVREESPKEPKRRSRLFLNDEVLDVLKKQQGADVTEGLSLKLPRSSVKFVQGSALIKMQKLRLLQLNYAHIVGGYFEHLSQELRWLCWHGFPLNFL
        MGREIV E  PK P+R +RLF+++EVL VL +Q+G D TEGLSLKLPR S + +   A  +MQ LRLLQLN+ +I G  F+H+SQE+RWLCWHGFPL F+
Subjt:  MGREIVREESPKEPKRRSRLFLNDEVLDVLKKQQGADVTEGLSLKLPRSSVKFVQGSALIKMQKLRLLQLNYAHIVGGYFEHLSQELRWLCWHGFPLNFL

Query:  PENFYLEKIVAIDLRYSQLRFFGKKS-KFLEKLTILNLSRSHSLRHTPNFLKLPNLEKLKLKDCKSLVKVDDSIGSLKKLVFINLKGCICLNNLPVTLCE
        P++F+++K+VA+DLRYSQ+RFF K S KFL+ L  LNLS SH L HTPNF KLPNLEKLKLKDCKSLV++  +IG LK L+ +NLK C CLN+LP +   
Subjt:  PENFYLEKIVAIDLRYSQLRFFGKKS-KFLEKLTILNLSRSHSLRHTPNFLKLPNLEKLKLKDCKSLVKVDDSIGSLKKLVFINLKGCICLNNLPVTLCE

Query:  LKFLGTLILSGCSKLESLPKNLEKMESLTTLTAEDTCIKQLPSTILGLKKLKCLILC-------------------------------------------
        LK L TLILSGCS L +LP++L +M+SL TL A++T I+++PSTI  LK LK L LC                                           
Subjt:  LKFLGTLILSGCSKLESLPKNLEKMESLTTLTAEDTCIKQLPSTILGLKKLKCLILC-------------------------------------------

Query:  ------RYDAIPKDIGSLVSLVELDLQDNDFHGQPSSISHLSKLETLCLNDCIKLKKIPNLTPNLKTLIASKCKALESISDLSKIEYMDILQVDDCPNLV
                + IPKDIGSLVSL ELDL +N FH  PSSIS L KLETL L+ C +L++IPNL P+L +L AS C +LE  SDLS ++ M  L + +CP LV
Subjt:  ------RYDAIPKDIGSLVSLVELDLQDNDFHGQPSSISHLSKLETLCLNDCIKLKKIPNLTPNLKTLIASKCKALESISDLSKIEYMDILQVDDCPNLV

Query:  EISGFDKLLPLIAYIYMEGSSKMKQSFKENILQGWMISGFG
        ++ G DKLL  I  I+MEG S M  SFKE ILQ W +SGFG
Subjt:  EISGFDKLLPLIAYIYMEGSSKMKQSFKENILQGWMISGFG

TrEMBL top hitse value%identityAlignment
A0A0A0LG09 TIR domain-containing protein9.2e-11452.03Show/hide
Query:  MGREIVREESPKEPKRRSRLFLNDEVLDVLKKQQGADVTEGLSLKLPRSSVKFVQGSALIKMQKLRLLQLNYAHIVGGYFEHLSQELRWLCWHGFPLNFL
        MGREIVRE  PK P+R SRLFL++EVL VL +Q+G D TEGLSLKLPR S + +   A  +MQKLRLLQLN+   V G F+H+S+E+RW+CWHGFPL FL
Subjt:  MGREIVREESPKEPKRRSRLFLNDEVLDVLKKQQGADVTEGLSLKLPRSSVKFVQGSALIKMQKLRLLQLNYAHIVGGYFEHLSQELRWLCWHGFPLNFL

Query:  PENFYLEKIVAIDLRYSQLRFFGKKSKFLEKLTILNLSRSHSLRHTPNFLKLPNLEKLKLKDCKSLVKVDDSIGSLKKLVFINLKGCICLNNLPVTLCEL
        P+ F+++K+VA+DLRYSQ+RFF K+SKFL+ L  LNL  SH L HTPNF KLPNLE L LKDCK+L+++  +IG LK L+ +NLK C  LN+LP +   L
Subjt:  PENFYLEKIVAIDLRYSQLRFFGKKSKFLEKLTILNLSRSHSLRHTPNFLKLPNLEKLKLKDCKSLVKVDDSIGSLKKLVFINLKGCICLNNLPVTLCEL

Query:  KFLGTLILSGCSKLESLPKNLEKMESLTTLTAEDTCIKQLPSTILGLKKLKCLILC--------------------------------------------
        K L TLI+SGCSKL SLP++L ++ SL TL A++T I+++P+TI+ LK LK L LC                                            
Subjt:  KFLGTLILSGCSKLESLPKNLEKMESLTTLTAEDTCIKQLPSTILGLKKLKCLILC--------------------------------------------

Query:  ---------RYDAIPKDIGSLVSLVELDLQDNDFHGQPSSISHLSKLETLCLNDCIKLKKIPNLTPNLKTLIASKCKALESISDLSKIEYMDILQVDDCP
                   + IPKDIGSL SL ELDL +N FH  PS+IS L KLETL L++C +L+ IPNL P+L +L AS C +LE  SDLS ++ M  L + +CP
Subjt:  ---------RYDAIPKDIGSLVSLVELDLQDNDFHGQPSSISHLSKLETLCLNDCIKLKKIPNLTPNLKTLIASKCKALESISDLSKIEYMDILQVDDCP

Query:  NLVEISGFDKLLPLIAYIYMEGSSKMKQSFKENILQGWMISGFG
         L+EI G DKLL  I  I+MEG S M  SFK+ ILQGW +SGFG
Subjt:  NLVEISGFDKLLPLIAYIYMEGSSKMKQSFKENILQGWMISGFG

A0A6J1DLU8 TMV resistance protein N-like isoform X16.3e-11557.87Show/hide
Query:  MGREIVREESPKEPKRRSRLFLNDEVLDVLKKQQGADVTEGLSLKLPRSSVKFVQGSALIKMQKLRLLQLNYAHIVGGYFEHLSQELRWLCWHGFPLNFL
        MGREIV  ES KEP +RSR+F+ND+VLDVL+KQ+GADVTEGLSLKL  SS K     A  +MQKLRLLQLNYA I G  F+H+SQELRW+CWHGFPL  L
Subjt:  MGREIVREESPKEPKRRSRLFLNDEVLDVLKKQQGADVTEGLSLKLPRSSVKFVQGSALIKMQKLRLLQLNYAHIVGGYFEHLSQELRWLCWHGFPLNFL

Query:  PENFYLEKIVAIDLRYSQL-RFFGKKSKF-LEKLTILNLSRSHSLRHTPNFLKLPNLEKLKLKDCKSLVKVDDSIGSLKKLVFINLKGCICLNNLPVTLC
        P++F+LEK VAIDLRYS L +FF K+ KF LEKLTILNLS S  L  TPNF KLP+L+KL+L+DCKSLVKVDDSIG L+KL FINLK CICLN LP   C
Subjt:  PENFYLEKIVAIDLRYSQL-RFFGKKSKF-LEKLTILNLSRSHSLRHTPNFLKLPNLEKLKLKDCKSLVKVDDSIGSLKKLVFINLKGCICLNNLPVTLC

Query:  ELKFLGTLILSGCSKLESLPKNLEKMESLTTLTAEDTCIKQLPSTILGLKKLKCLILCRY----------------------------------------
        ELK L  LILSGCSKLE LP  L KMESLTTL A+ TCI++LPSTI+GLKKLKCL + R                                         
Subjt:  ELKFLGTLILSGCSKLESLPKNLEKMESLTTLTAEDTCIKQLPSTILGLKKLKCLILCRY----------------------------------------

Query:  ---------DAIPKDIGSLVSLVELDLQDNDFHGQPSSISHLSKLETLCLNDCIKLKKIPNLTPNLKTLIASKCKALESISDLSK-IEYMDILQVDDCPN
                 DAIPKDIGSLVSL+ LDL  NDF   PSSIS LS L  L  + C KL++I NL P  + +   KCKALE+IS+L K +E ++ + V +CP 
Subjt:  ---------DAIPKDIGSLVSLVELDLQDNDFHGQPSSISHLSKLETLCLNDCIKLKKIPNLTPNLKTLIASKCKALESISDLSK-IEYMDILQVDDCPN

Query:  LVEISGFDKLLPLIAYIYMEGS-SKMKQSFKE
        +VEIS FD ++P I  I M G  SK+ QSFK+
Subjt:  LVEISGFDKLLPLIAYIYMEGS-SKMKQSFKE

A0A6J1DMG1 TMV resistance protein N6.3e-12356.33Show/hide
Query:  MGREIVREESPKEPKRRSRLFLNDEVLDVLKKQQGADVTEGLSLKLPRSSVKFVQGSALIKMQKLRLLQLNYAHIVGGYFEHLSQELRWLCWHGFPLNFL
        MGREIVRE+ PKEP+R SRL L+DEVL VL +Q+G D  EGL+LKLPR S + +   A  +MQ LRLLQLN+ ++  G F+HLSQELRWLCWHGFPL FL
Subjt:  MGREIVREESPKEPKRRSRLFLNDEVLDVLKKQQGADVTEGLSLKLPRSSVKFVQGSALIKMQKLRLLQLNYAHIVGGYFEHLSQELRWLCWHGFPLNFL

Query:  PENFYLEKIVAIDLRYSQLRFFGKKSKFLEKLTILNLSRSHSLRHTPNFLKLPNLEKLKLKDCKSLVKVDDSIGSLKKLVFINLKGCICLNNLPVTLCEL
        P++F++EK+VAIDLRYS +RFF K+SKFLEKL ILNLS SH L HTP+FLKLP+LEKLKLKDCK+LV++  SIG LK LVFINLK C  L +LP +  +L
Subjt:  PENFYLEKIVAIDLRYSQLRFFGKKSKFLEKLTILNLSRSHSLRHTPNFLKLPNLEKLKLKDCKSLVKVDDSIGSLKKLVFINLKGCICLNNLPVTLCEL

Query:  KFLGTLILSGCSKLESLPKNLEKMESLTTLTAEDTCIKQLPSTILGLKKLKCLILC--------------------------------------------
        K L TL +SGCSK+ +LP++L +++SL TL A+DT I+Q+PSTI+ LK LK L LC                                            
Subjt:  KFLGTLILSGCSKLESLPKNLEKMESLTTLTAEDTCIKQLPSTILGLKKLKCLILC--------------------------------------------

Query:  ------RYDAIPKDIGSLVSLVELDLQDNDFHGQPSSISHLSKLETLCLNDCIKLKKIPNLTPNLKTLIASKCKALESISDLSKIEYMDILQVDDCPNLV
                + IPKDIGSLVSL+ELDL+DN FH  PSSIS LSKL+TL L+ C +L+ IP+L P LK+L AS C +LES SDLS+++ M+ L V +CP LV
Subjt:  ------RYDAIPKDIGSLVSLVELDLQDNDFHGQPSSISHLSKLETLCLNDCIKLKKIPNLTPNLKTLIASKCKALESISDLSKIEYMDILQVDDCPNLV

Query:  EISGFDKLLPLIAYIYMEGSSKMKQSFKENILQGW-MISGFG
        EI G +KLL  I  I+MEG SKM  SFKE ILQGW ++SGFG
Subjt:  EISGFDKLLPLIAYIYMEGSSKMKQSFKENILQGW-MISGFG

A0A6J1DR26 TMV resistance protein N-like isoform X26.3e-11557.87Show/hide
Query:  MGREIVREESPKEPKRRSRLFLNDEVLDVLKKQQGADVTEGLSLKLPRSSVKFVQGSALIKMQKLRLLQLNYAHIVGGYFEHLSQELRWLCWHGFPLNFL
        MGREIV  ES KEP +RSR+F+ND+VLDVL+KQ+GADVTEGLSLKL  SS K     A  +MQKLRLLQLNYA I G  F+H+SQELRW+CWHGFPL  L
Subjt:  MGREIVREESPKEPKRRSRLFLNDEVLDVLKKQQGADVTEGLSLKLPRSSVKFVQGSALIKMQKLRLLQLNYAHIVGGYFEHLSQELRWLCWHGFPLNFL

Query:  PENFYLEKIVAIDLRYSQL-RFFGKKSKF-LEKLTILNLSRSHSLRHTPNFLKLPNLEKLKLKDCKSLVKVDDSIGSLKKLVFINLKGCICLNNLPVTLC
        P++F+LEK VAIDLRYS L +FF K+ KF LEKLTILNLS S  L  TPNF KLP+L+KL+L+DCKSLVKVDDSIG L+KL FINLK CICLN LP   C
Subjt:  PENFYLEKIVAIDLRYSQL-RFFGKKSKF-LEKLTILNLSRSHSLRHTPNFLKLPNLEKLKLKDCKSLVKVDDSIGSLKKLVFINLKGCICLNNLPVTLC

Query:  ELKFLGTLILSGCSKLESLPKNLEKMESLTTLTAEDTCIKQLPSTILGLKKLKCLILCRY----------------------------------------
        ELK L  LILSGCSKLE LP  L KMESLTTL A+ TCI++LPSTI+GLKKLKCL + R                                         
Subjt:  ELKFLGTLILSGCSKLESLPKNLEKMESLTTLTAEDTCIKQLPSTILGLKKLKCLILCRY----------------------------------------

Query:  ---------DAIPKDIGSLVSLVELDLQDNDFHGQPSSISHLSKLETLCLNDCIKLKKIPNLTPNLKTLIASKCKALESISDLSK-IEYMDILQVDDCPN
                 DAIPKDIGSLVSL+ LDL  NDF   PSSIS LS L  L  + C KL++I NL P  + +   KCKALE+IS+L K +E ++ + V +CP 
Subjt:  ---------DAIPKDIGSLVSLVELDLQDNDFHGQPSSISHLSKLETLCLNDCIKLKKIPNLTPNLKTLIASKCKALESISDLSK-IEYMDILQVDDCPN

Query:  LVEISGFDKLLPLIAYIYMEGS-SKMKQSFKE
        +VEIS FD ++P I  I M G  SK+ QSFK+
Subjt:  LVEISGFDKLLPLIAYIYMEGS-SKMKQSFKE

A0A6J1J021 TMV resistance protein N3.7e-11553.18Show/hide
Query:  MGREIVREESPKEPKRRSRLFLNDEVLDVLKKQQGADVTEGLSLKLPRSSVKFVQGSALIKMQKLRLLQLNYAHIVGGYFEHLSQELRWLCWHGFPLNFL
        MGREIVRE+ PKEP++ SRL L++EV+ VL + +G    EGLSLKLPR S + +   A  +MQ LRLLQLN+ ++ G  F+H+SQE+RWLCWHGFPL FL
Subjt:  MGREIVREESPKEPKRRSRLFLNDEVLDVLKKQQGADVTEGLSLKLPRSSVKFVQGSALIKMQKLRLLQLNYAHIVGGYFEHLSQELRWLCWHGFPLNFL

Query:  PENFYLEKIVAIDLRYSQLRFFGKKSKFLEKLTILNLSRSHSLRHTPNFLKLPNLEKLKLKDCKSLVKVDDSIGSLKKLVFINLKGCICLNNLPVTLCEL
        P++F++EK+VA+DLRYSQ+RFF K+SKFL+ L  LNLS SH L +TP+F KLPNLE LKLKDCKSLV++  +IG LK L+ INLK C CL +LP     L
Subjt:  PENFYLEKIVAIDLRYSQLRFFGKKSKFLEKLTILNLSRSHSLRHTPNFLKLPNLEKLKLKDCKSLVKVDDSIGSLKKLVFINLKGCICLNNLPVTLCEL

Query:  KFLGTLILSGCSKLESLPKNLEKMESLTTLTAEDTCIKQLPSTILGLKKLKCLILC---------------------RY---------------------
        K L TLILSGCSKL +LP++L +M SL TLTA+DT I+++PSTI+ LKKLK L LC                     +Y                     
Subjt:  KFLGTLILSGCSKLESLPKNLEKMESLTTLTAEDTCIKQLPSTILGLKKLKCLILC---------------------RY---------------------

Query:  -------DAIPKDIGSLVSLVELDLQDNDFHGQPSSISHLSKLETLCLNDCIKLKKIPNLTPNLKTLIASKCKALESISDLSKIEYMDILQVDDCPNLVE
               + IPKDIGSL+SL ELDL+DN FH  PSSIS L KLETL L+ C +L+ IP+L P+L +L AS C +LE   DLS ++ M  L V +CP L++
Subjt:  -------DAIPKDIGSLVSLVELDLQDNDFHGQPSSISHLSKLETLCLNDCIKLKKIPNLTPNLKTLIASKCKALESISDLSKIEYMDILQVDDCPNLVE

Query:  ISGFDKLLPLIAYIYMEGSSKMKQSFKENILQGWMISGFG
        I G + LL  I  I+MEG S M  SFKE+IL GW +SGFG
Subjt:  ISGFDKLLPLIAYIYMEGSSKMKQSFKENILQGWMISGFG

SwissProt top hitse value%identityAlignment
O80327 Thaumatin-like protein 12.9e-5650.21Show/hide
Query:  FFDASLGVHAATITVKNNCPTTIWPAVLTTNPGQPQPSTTGFQLVSGASRRFDVPAPWVGRVWARTRCS-TTAGNFTCLTGDCGSGQVACGGAGGLPPAT
        F     GV++A  T  N CP T+WP  L T  G PQ  +TGF+L SGAS    V APW GR W R+ CS  ++G F C TGDCGSGQ++C GAG  PPA+
Subjt:  FFDASLGVHAATITVKNNCPTTIWPAVLTTNPGQPQPSTTGFQLVSGASRRFDVPAPWVGRVWARTRCS-TTAGNFTCLTGDCGSGQVACGGAGGLPPAT

Query:  LAEFTLSSSGSTQASFYSLSLA---------AAQGLAG----RRAVLQMNAVCPVELQLKASGSGGEVIGCKGACSALGQARYCCTGEFNSPDKCKPTAY
        L E TL+++G     FY +SL          A +G +G          +N VCP EL  K  GS G VIGCK AC AL Q +YCCTG + +PD C PT +
Subjt:  LAEFTLSSSGSTQASFYSLSLA---------AAQGLAG----RRAVLQMNAVCPVELQLKASGSGGEVIGCKGACSALGQARYCCTGEFNSPDKCKPTAY

Query:  SKIFKAQCPEAYSYAYDDEGSTFSCGNQPDYVITF
        SK+FK QCP+AYSYAYDD+ STF+C   P+Y ITF
Subjt:  SKIFKAQCPEAYSYAYDDEGSTFSCGNQPDYVITF

P50694 Glucan endo-1,3-beta-glucosidase3.5e-5451.09Show/hide
Query:  GVHAATITVKNNCPTTIWPAVLTTNPGQPQPSTTGFQLVSGASRRFDVPAPWVGRVWARTRCSTTA-GNFTCLTGDCGSGQVACGGAGGLPPATLAEFTL
        G HAATI+ KNNCP  +WP  LT++  +PQ STTGF+L S AS + D P PW GR WART CST A G F C T DC SGQV C G G +PPATLAEF +
Subjt:  GVHAATITVKNNCPTTIWPAVLTTNPGQPQPSTTGFQLVSGASRRFDVPAPWVGRVWARTRCSTTA-GNFTCLTGDCGSGQVACGGAGGLPPATLAEFTL

Query:  SSSGSTQASFYSLSL---------AAAQGLAG----RRAVLQMNAVCPVELQLKASGSGGEVIGCKGACSALGQARYCCTGEFNSPDKCKPTAYSKIFKA
         + G     FY +SL            QG  G          +NAVCP ELQ K  GS G V+ C  AC   G  +YCCT   N+P+ C PT YS+IF  
Subjt:  SSSGSTQASFYSLSL---------AAAQGLAG----RRAVLQMNAVCPVELQLKASGSGGEVIGCKGACSALGQARYCCTGEFNSPDKCKPTAYSKIFKA

Query:  QCPEAYSYAYDDEGSTFSCGNQPDYVITF
         CP+AYSYAYDD+  TF+C   P+Y ITF
Subjt:  QCPEAYSYAYDDEGSTFSCGNQPDYVITF

P83332 Thaumatin-like protein 11.1e-5550.41Show/hide
Query:  TNLWLSFFFDASLGVHAATITVKNNCPTTIWPAVLTTNPGQPQPSTTGFQLVSGASRRFDVPAPWVGRVWARTRCSTTA-GNFTCLTGDCGSGQVACGGA
        T L + FF     G HAA IT  N C  T+WP  LT +  +PQ S TGF+L +G SR  D P+PW GR + RTRCST A G FTC T DCGSGQV+C G 
Subjt:  TNLWLSFFFDASLGVHAATITVKNNCPTTIWPAVLTTNPGQPQPSTTGFQLVSGASRRFDVPAPWVGRVWARTRCSTTA-GNFTCLTGDCGSGQVACGGA

Query:  GGLPPATLAEFTLSSSGSTQASFYSLSL---------AAAQGLAGR----RAVLQMNAVCPVELQLKASGSGGEVIGCKGACSALGQARYCCTGEFNSPD
        G  PPATL E T++S+G     FY +SL          A QG  G+         +N VCP  LQ+K  GS G VI CK AC A  Q +YCCT   + P+
Subjt:  GGLPPATLAEFTLSSSGSTQASFYSLSL---------AAAQGLAGR----RAVLQMNAVCPVELQLKASGSGGEVIGCKGACSALGQARYCCTGEFNSPD

Query:  KCKPTAYSKIFKAQCPEAYSYAYDDEGSTFSCGNQPDYVITF
         C P  YSK+FK QCP+AYSYAYDD+ STF+C  +P Y+ITF
Subjt:  KCKPTAYSKIFKAQCPEAYSYAYDDEGSTFSCGNQPDYVITF

Q9FSG7 Thaumatin-like protein 1a3.3e-6054.15Show/hide
Query:  GVHAATITVKNNCPTTIWPAVLTTNPGQPQPSTTGFQLVSGASRRFDVPAPWVGRVWARTRCST-TAGNFTCLTGDCGSGQVACGGAGGLPPATLAEFTL
        G HAA IT  NNCP T+WP  LT +  +PQ S TGF+L S ASR  D P+PW GR W RTRCST  AG FTC T DCGSGQVAC GAG +PPATL E T+
Subjt:  GVHAATITVKNNCPTTIWPAVLTTNPGQPQPSTTGFQLVSGASRRFDVPAPWVGRVWARTRCST-TAGNFTCLTGDCGSGQVACGGAGGLPPATLAEFTL

Query:  SSSGSTQASFYSLSL---------AAAQGLAGR----RAVLQMNAVCPVELQLKASGSGGEVIGCKGACSALGQARYCCTGEFNSPDKCKPTAYSKIFKA
        +++G     +Y +SL          A QG  G          +N VCP  LQ+KA  + G VI CK AC A G ++YCCT   N+P+ C PT YS+IF+ 
Subjt:  SSSGSTQASFYSLSL---------AAAQGLAGR----RAVLQMNAVCPVELQLKASGSGGEVIGCKGACSALGQARYCCTGEFNSPDKCKPTAYSKIFKA

Query:  QCPEAYSYAYDDEGSTFSCGNQPDYVITF
        QCP+AYSYAYDD+ STF+C   PDYVITF
Subjt:  QCPEAYSYAYDDEGSTFSCGNQPDYVITF

Q9SMH2 Thaumatin-like protein 11.4e-5550.21Show/hide
Query:  LWLSFFFDASLGVHAATITVKNNCPTTIWPAVLTTNPGQPQPSTTGFQLVSGASRRFDVPAPWVGRVWARTRCSTTAGNFTCLTGDCGSGQVACGGAGGL
        L L+ FF +  G H+A IT  NNCP TIWP  LT++  +PQ   TGF L S AS    V APW GR WARTRC+T +G FTC T DC +GQVAC G G +
Subjt:  LWLSFFFDASLGVHAATITVKNNCPTTIWPAVLTTNPGQPQPSTTGFQLVSGASRRFDVPAPWVGRVWARTRCSTTAGNFTCLTGDCGSGQVACGGAGGL

Query:  PPATLAEFTLSSSGSTQASFYSLSL---------AAAQGLAG----RRAVLQMNAVCPVELQLKASGSGGEVIGCKGACSALGQARYCCTGEFNSPDKCK
        PPA+L E  ++++      FY +SL          A +G  G          +NAVCP ELQ+K  GS   V+ CK AC+A  Q +YCCTG F++   C 
Subjt:  PPATLAEFTLSSSGSTQASFYSLSL---------AAAQGLAG----RRAVLQMNAVCPVELQLKASGSGGEVIGCKGACSALGQARYCCTGEFNSPDKCK

Query:  PTAYSKIFKAQCPEAYSYAYDDEGSTFSCGNQPDYVITF
         T YS+IFK QCP+AYSYAYDD  STF+C   PDYVITF
Subjt:  PTAYSKIFKAQCPEAYSYAYDDEGSTFSCGNQPDYVITF

Arabidopsis top hitse value%identityAlignment
AT1G20030.1 Pathogenesis-related thaumatin superfamily protein6.8e-5346.9Show/hide
Query:  TITVKNNCPTTIWPAVLTTNPGQPQPSTTGFQLVSGASRRFDVPAPWVGRVWARTRCSTTA-GNFTCLTGDCGSGQVACGGAGGLPPATLAEFTLSSSGS
        + T  N C  T+WP +L+     P P TTGF L+ G +R  + P+ W GR W RT CST + G F+C TGDCGSG++ C GAG  PPATLAEFTL  SG 
Subjt:  TITVKNNCPTTIWPAVLTTNPGQPQPSTTGFQLVSGASRRFDVPAPWVGRVWARTRCSTTA-GNFTCLTGDCGSGQVACGGAGGLPPATLAEFTLSSSGS

Query:  TQASFYSLSL--------------AAAQGLAGRRAVLQMNAVCPVELQLKA-SGSGGEVIGCKGACSALGQARYCCTGEFNSPDKCKPTAYSKIFKAQCP
            FY +SL               + Q  +    V+ +N  CP EL++ +  G     + CK AC A  Q  YCC+G F SPD CKP+ YS+IFK+ CP
Subjt:  TQASFYSLSL--------------AAAQGLAGRRAVLQMNAVCPVELQLKA-SGSGGEVIGCKGACSALGQARYCCTGEFNSPDKCKPTAYSKIFKAQCP

Query:  EAYSYAYDDEGSTFSCGNQPDYVITF
         AYSYAYDD+ STF+C   P+YVITF
Subjt:  EAYSYAYDDEGSTFSCGNQPDYVITF

AT1G20030.2 Pathogenesis-related thaumatin superfamily protein1.6e-5446.31Show/hide
Query:  LSFFFDAS----LGVHAATITVKNNCPTTIWPAVLTTNPGQPQPSTTGFQLVSGASRRFDVPAPWVGRVWARTRCSTTA-GNFTCLTGDCGSGQVACGGA
        L+FFF  S     GV + + T  N C  T+WP +L+     P P TTGF L+ G +R  + P+ W GR W RT CST + G F+C TGDCGSG++ C GA
Subjt:  LSFFFDAS----LGVHAATITVKNNCPTTIWPAVLTTNPGQPQPSTTGFQLVSGASRRFDVPAPWVGRVWARTRCSTTA-GNFTCLTGDCGSGQVACGGA

Query:  GGLPPATLAEFTLSSSGSTQASFYSLSL--------------AAAQGLAGRRAVLQMNAVCPVELQLKA-SGSGGEVIGCKGACSALGQARYCCTGEFNS
        G  PPATLAEFTL  SG     FY +SL               + Q  +    V+ +N  CP EL++ +  G     + CK AC A  Q  YCC+G F S
Subjt:  GGLPPATLAEFTLSSSGSTQASFYSLSL--------------AAAQGLAGRRAVLQMNAVCPVELQLKA-SGSGGEVIGCKGACSALGQARYCCTGEFNS

Query:  PDKCKPTAYSKIFKAQCPEAYSYAYDDEGSTFSCGNQPDYVITF
        PD CKP+ YS+IFK+ CP AYSYAYDD+ STF+C   P+YVITF
Subjt:  PDKCKPTAYSKIFKAQCPEAYSYAYDDEGSTFSCGNQPDYVITF

AT1G75800.1 Pathogenesis-related thaumatin superfamily protein3.1e-5348.05Show/hide
Query:  GVHAATITVKNNCPTTIWPAVLTTNPGQPQPSTTGFQLVSGASRRFDVPAPWVGRVWARTRCST-TAGNFTCLTGDCGSGQVACGGAGGLPPATLAEFTL
        GV + +  + N C  T+WP +L +N G P   TTGF L  G  R    P  W GR W RT+CST T G FTCLTGDCGSG + C G+G  PPATLAEFTL
Subjt:  GVHAATITVKNNCPTTIWPAVLTTNPGQPQPSTTGFQLVSGASRRFDVPAPWVGRVWARTRCST-TAGNFTCLTGDCGSGQVACGGAGGLPPATLAEFTL

Query:  SSSGSTQASFYSLS---------LAAAQGLAGRR-----AVLQMNAVCPVELQLKA-SGSGGEVIGCKGACSALGQARYCCTGEFNSPDKCKPTAYSKIF
           GS    FY +S         L A QG +G        V+ +N  CP EL++ +  G G + +GCK AC A     YCC+G   +PD CKP++YS +F
Subjt:  SSSGSTQASFYSLS---------LAAAQGLAGRR-----AVLQMNAVCPVELQLKA-SGSGGEVIGCKGACSALGQARYCCTGEFNSPDKCKPTAYSKIF

Query:  KAQCPEAYSYAYDDEGSTFSCGNQPDYVITF
        K  CP AYSYAYDD+ STF+C   P+YVITF
Subjt:  KAQCPEAYSYAYDDEGSTFSCGNQPDYVITF

AT5G36930.1 Disease resistance protein (TIR-NBS-LRR class) family1.2e-6840.37Show/hide
Query:  MGREIVREESPKEPKRRSRLFLNDEVLDVLKKQQGADVTEGLSLKLPRSSVKFVQGSALIKMQKLRLLQLNYAHIVGGYFEHLSQELRWLCWHGFPLNFL
        MGR+IVRE SPK+   RSRL+ +++V+ VLKK+ G +  EGLSLK      ++ +  A  KMQ+LRLL+L Y  + G Y EH  ++LRWLCWHGF L   
Subjt:  MGREIVREESPKEPKRRSRLFLNDEVLDVLKKQQGADVTEGLSLKLPRSSVKFVQGSALIKMQKLRLLQLNYAHIVGGYFEHLSQELRWLCWHGFPLNFL

Query:  PENFYLEKIVAIDLRYSQLRFFGKKS---KFLEKLTILNLSRSHSLRHTPNFLKLPNLEKLKLKDCKSLVKVDDSIGSL-KKLVFINLKGCICLNNLPVT
        P N  LE + A+DL+YS L+ F K     +    +  L+LS S  LR TP+F   PN+EKL L +CKSLV V  SIG L KKLV +NL  CI L+ LP  
Subjt:  PENFYLEKIVAIDLRYSQLRFFGKKS---KFLEKLTILNLSRSHSLRHTPNFLKLPNLEKLKLKDCKSLVKVDDSIGSL-KKLVFINLKGCICLNNLPVT

Query:  LCELKFLGTLILSGCSKLESLPKNLEKMESLTTLTAEDTCIKQLPSTILGLKKLKCLIL--CR-------------------------------------
        + +LK L +L LS CSKLE L   L ++ESLTTL A+ T ++++PSTI  LKKLK L L  C+                                     
Subjt:  LCELKFLGTLILSGCSKLESLPKNLEKMESLTTLTAEDTCIKQLPSTILGLKKLKCLIL--CR-------------------------------------

Query:  ------YDAIPKDIGSLVSLVELDLQDNDFHGQPSSISHLSKLETLCLNDCIKLKKIPNLTPNLKTLIASKCKALESISDLSKIEYMDILQVDDCPNLVE
               + IP+DIGSL  L +LDL+ N F   P+  + L  L  L L+DC KL+ I +L  +L  L   KC  L+   D+SK   +  LQ++DC +L E
Subjt:  ------YDAIPKDIGSLVSLVELDLQDNDFHGQPSSISHLSKLETLCLNDCIKLKKIPNLTPNLKTLIASKCKALESISDLSKIEYMDILQVDDCPNLVE

Query:  ISGFDKLLPLIAYIYMEGSSKMKQSFKEN-ILQGWM
        I G       +++I ++G          N +L+ W+
Subjt:  ISGFDKLLPLIAYIYMEGSSKMKQSFKEN-ILQGWM

AT5G36930.2 Disease resistance protein (TIR-NBS-LRR class) family1.2e-6840.37Show/hide
Query:  MGREIVREESPKEPKRRSRLFLNDEVLDVLKKQQGADVTEGLSLKLPRSSVKFVQGSALIKMQKLRLLQLNYAHIVGGYFEHLSQELRWLCWHGFPLNFL
        MGR+IVRE SPK+   RSRL+ +++V+ VLKK+ G +  EGLSLK      ++ +  A  KMQ+LRLL+L Y  + G Y EH  ++LRWLCWHGF L   
Subjt:  MGREIVREESPKEPKRRSRLFLNDEVLDVLKKQQGADVTEGLSLKLPRSSVKFVQGSALIKMQKLRLLQLNYAHIVGGYFEHLSQELRWLCWHGFPLNFL

Query:  PENFYLEKIVAIDLRYSQLRFFGKKS---KFLEKLTILNLSRSHSLRHTPNFLKLPNLEKLKLKDCKSLVKVDDSIGSL-KKLVFINLKGCICLNNLPVT
        P N  LE + A+DL+YS L+ F K     +    +  L+LS S  LR TP+F   PN+EKL L +CKSLV V  SIG L KKLV +NL  CI L+ LP  
Subjt:  PENFYLEKIVAIDLRYSQLRFFGKKS---KFLEKLTILNLSRSHSLRHTPNFLKLPNLEKLKLKDCKSLVKVDDSIGSL-KKLVFINLKGCICLNNLPVT

Query:  LCELKFLGTLILSGCSKLESLPKNLEKMESLTTLTAEDTCIKQLPSTILGLKKLKCLIL--CR-------------------------------------
        + +LK L +L LS CSKLE L   L ++ESLTTL A+ T ++++PSTI  LKKLK L L  C+                                     
Subjt:  LCELKFLGTLILSGCSKLESLPKNLEKMESLTTLTAEDTCIKQLPSTILGLKKLKCLIL--CR-------------------------------------

Query:  ------YDAIPKDIGSLVSLVELDLQDNDFHGQPSSISHLSKLETLCLNDCIKLKKIPNLTPNLKTLIASKCKALESISDLSKIEYMDILQVDDCPNLVE
               + IP+DIGSL  L +LDL+ N F   P+  + L  L  L L+DC KL+ I +L  +L  L   KC  L+   D+SK   +  LQ++DC +L E
Subjt:  ------YDAIPKDIGSLVSLVELDLQDNDFHGQPSSISHLSKLETLCLNDCIKLKKIPNLTPNLKTLIASKCKALESISDLSKIEYMDILQVDDCPNLVE

Query:  ISGFDKLLPLIAYIYMEGSSKMKQSFKEN-ILQGWM
        I G       +++I ++G          N +L+ W+
Subjt:  ISGFDKLLPLIAYIYMEGSSKMKQSFKEN-ILQGWM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGAGAGATCGTTCGAGAAGAGTCTCCAAAAGAGCCTAAAAGGCGAAGTAGACTTTTTCTTAATGATGAAGTGCTTGATGTACTTAAGAAACAACAGGGAGCTGA
TGTAACTGAAGGTCTATCTTTAAAGTTGCCAAGATCTAGCGTGAAATTTGTGCAAGGAAGTGCACTTATTAAAATGCAAAAATTGAGGTTACTTCAACTTAATTATGCAC
ACATTGTTGGAGGCTACTTCGAACATCTTTCTCAAGAGTTAAGATGGCTTTGTTGGCATGGATTTCCTTTGAACTTCCTACCAGAAAACTTTTATTTGGAGAAAATCGTA
GCTATTGACTTAAGATATAGCCAACTCAGATTCTTTGGGAAGAAATCAAAGTTCCTGGAGAAGTTGACAATTCTTAATCTTAGCCGCTCTCATTCCCTGAGACATACGCC
AAATTTCTTGAAACTCCCAAATCTTGAGAAATTAAAACTCAAAGACTGCAAGAGTTTGGTTAAGGTGGATGATTCCATTGGAAGTCTTAAAAAACTTGTTTTCATAAATT
TGAAAGGTTGCATATGCCTGAATAACCTTCCAGTGACTCTATGTGAGTTAAAATTCCTTGGGACGCTTATTCTTTCTGGCTGCTCAAAGTTAGAAAGTCTGCCTAAGAAT
TTGGAGAAGATGGAATCATTGACAACTCTCACTGCAGAGGATACTTGTATAAAACAATTGCCCTCTACAATTTTGGGGTTGAAGAAACTCAAATGTTTAATTCTATGTAG
ATATGATGCAATTCCCAAAGATATTGGGAGTCTGGTGTCTCTAGTGGAATTGGATCTACAAGACAATGACTTTCATGGCCAACCATCAAGCATAAGTCACCTTTCAAAAC
TTGAAACTCTTTGTTTGAATGATTGCATAAAGCTTAAGAAAATACCAAATCTAACCCCTAATTTGAAAACTTTGATTGCATCAAAGTGCAAGGCATTGGAAAGCATATCA
GACCTATCAAAGATCGAGTATATGGACATCTTGCAGGTTGATGATTGCCCGAACCTAGTTGAGATTTCAGGCTTTGATAAGTTGTTGCCATTAATTGCTTACATTTATAT
GGAAGGGAGTAGCAAGATGAAACAATCATTCAAAGAAAACATCTTACAGGGATGGATGATTAGTGGATTTGGTGTTGCCGAGTTGCTGAAGGGTGGATTGGATGCTATTG
ATGTTCCAATTTGTGATTATACAACTAACTTATGGCTTTCTTTCTTCTTCGATGCTTCTTTAGGGGTTCATGCCGCGACAATCACCGTCAAAAACAACTGCCCCACCACC
ATATGGCCGGCTGTATTGACGACCAATCCAGGGCAGCCACAACCGTCGACCACCGGATTTCAGTTAGTTTCCGGAGCATCCAGGCGGTTTGACGTCCCCGCTCCTTGGGT
AGGTCGAGTCTGGGCACGAACAAGATGCTCCACCACCGCCGGAAACTTCACTTGCCTCACCGGAGACTGCGGCTCAGGTCAAGTCGCTTGCGGCGGCGCAGGAGGACTTC
CGCCTGCAACTCTAGCGGAATTCACCCTCTCTTCTAGTGGAAGCACTCAAGCCTCGTTCTACTCTCTCAGCCTCGCGGCGGCGCAGGGACTTGCCGGTCGTCGAGCTGTG
TTGCAGATGAACGCTGTTTGTCCCGTTGAGTTGCAACTGAAGGCGTCCGGCTCCGGCGGCGAAGTGATCGGCTGCAAGGGCGCTTGTTCTGCGTTGGGGCAGGCTCGATA
CTGTTGCACTGGAGAATTCAACAGCCCCGACAAGTGCAAGCCCACGGCGTATTCGAAGATTTTCAAGGCCCAATGCCCTGAGGCTTACAGCTATGCATATGATGACGAAG
GCAGCACTTTCTCATGCGGAAATCAACCAGATTATGTCATCACCTTCTATGGGTTTGACACAAAGATCTCTGTTTGTAGAGGCCCGATTCTACTTGATATTTCCAAGGTC
AAAATTTTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGAAGAGAGATCGTTCGAGAAGAGTCTCCAAAAGAGCCTAAAAGGCGAAGTAGACTTTTTCTTAATGATGAAGTGCTTGATGTACTTAAGAAACAACAGGGAGCTGA
TGTAACTGAAGGTCTATCTTTAAAGTTGCCAAGATCTAGCGTGAAATTTGTGCAAGGAAGTGCACTTATTAAAATGCAAAAATTGAGGTTACTTCAACTTAATTATGCAC
ACATTGTTGGAGGCTACTTCGAACATCTTTCTCAAGAGTTAAGATGGCTTTGTTGGCATGGATTTCCTTTGAACTTCCTACCAGAAAACTTTTATTTGGAGAAAATCGTA
GCTATTGACTTAAGATATAGCCAACTCAGATTCTTTGGGAAGAAATCAAAGTTCCTGGAGAAGTTGACAATTCTTAATCTTAGCCGCTCTCATTCCCTGAGACATACGCC
AAATTTCTTGAAACTCCCAAATCTTGAGAAATTAAAACTCAAAGACTGCAAGAGTTTGGTTAAGGTGGATGATTCCATTGGAAGTCTTAAAAAACTTGTTTTCATAAATT
TGAAAGGTTGCATATGCCTGAATAACCTTCCAGTGACTCTATGTGAGTTAAAATTCCTTGGGACGCTTATTCTTTCTGGCTGCTCAAAGTTAGAAAGTCTGCCTAAGAAT
TTGGAGAAGATGGAATCATTGACAACTCTCACTGCAGAGGATACTTGTATAAAACAATTGCCCTCTACAATTTTGGGGTTGAAGAAACTCAAATGTTTAATTCTATGTAG
ATATGATGCAATTCCCAAAGATATTGGGAGTCTGGTGTCTCTAGTGGAATTGGATCTACAAGACAATGACTTTCATGGCCAACCATCAAGCATAAGTCACCTTTCAAAAC
TTGAAACTCTTTGTTTGAATGATTGCATAAAGCTTAAGAAAATACCAAATCTAACCCCTAATTTGAAAACTTTGATTGCATCAAAGTGCAAGGCATTGGAAAGCATATCA
GACCTATCAAAGATCGAGTATATGGACATCTTGCAGGTTGATGATTGCCCGAACCTAGTTGAGATTTCAGGCTTTGATAAGTTGTTGCCATTAATTGCTTACATTTATAT
GGAAGGGAGTAGCAAGATGAAACAATCATTCAAAGAAAACATCTTACAGGGATGGATGATTAGTGGATTTGGTGTTGCCGAGTTGCTGAAGGGTGGATTGGATGCTATTG
ATGTTCCAATTTGTGATTATACAACTAACTTATGGCTTTCTTTCTTCTTCGATGCTTCTTTAGGGGTTCATGCCGCGACAATCACCGTCAAAAACAACTGCCCCACCACC
ATATGGCCGGCTGTATTGACGACCAATCCAGGGCAGCCACAACCGTCGACCACCGGATTTCAGTTAGTTTCCGGAGCATCCAGGCGGTTTGACGTCCCCGCTCCTTGGGT
AGGTCGAGTCTGGGCACGAACAAGATGCTCCACCACCGCCGGAAACTTCACTTGCCTCACCGGAGACTGCGGCTCAGGTCAAGTCGCTTGCGGCGGCGCAGGAGGACTTC
CGCCTGCAACTCTAGCGGAATTCACCCTCTCTTCTAGTGGAAGCACTCAAGCCTCGTTCTACTCTCTCAGCCTCGCGGCGGCGCAGGGACTTGCCGGTCGTCGAGCTGTG
TTGCAGATGAACGCTGTTTGTCCCGTTGAGTTGCAACTGAAGGCGTCCGGCTCCGGCGGCGAAGTGATCGGCTGCAAGGGCGCTTGTTCTGCGTTGGGGCAGGCTCGATA
CTGTTGCACTGGAGAATTCAACAGCCCCGACAAGTGCAAGCCCACGGCGTATTCGAAGATTTTCAAGGCCCAATGCCCTGAGGCTTACAGCTATGCATATGATGACGAAG
GCAGCACTTTCTCATGCGGAAATCAACCAGATTATGTCATCACCTTCTATGGGTTTGACACAAAGATCTCTGTTTGTAGAGGCCCGATTCTACTTGATATTTCCAAGGTC
AAAATTTTGTAG
Protein sequenceShow/hide protein sequence
MGREIVREESPKEPKRRSRLFLNDEVLDVLKKQQGADVTEGLSLKLPRSSVKFVQGSALIKMQKLRLLQLNYAHIVGGYFEHLSQELRWLCWHGFPLNFLPENFYLEKIV
AIDLRYSQLRFFGKKSKFLEKLTILNLSRSHSLRHTPNFLKLPNLEKLKLKDCKSLVKVDDSIGSLKKLVFINLKGCICLNNLPVTLCELKFLGTLILSGCSKLESLPKN
LEKMESLTTLTAEDTCIKQLPSTILGLKKLKCLILCRYDAIPKDIGSLVSLVELDLQDNDFHGQPSSISHLSKLETLCLNDCIKLKKIPNLTPNLKTLIASKCKALESIS
DLSKIEYMDILQVDDCPNLVEISGFDKLLPLIAYIYMEGSSKMKQSFKENILQGWMISGFGVAELLKGGLDAIDVPICDYTTNLWLSFFFDASLGVHAATITVKNNCPTT
IWPAVLTTNPGQPQPSTTGFQLVSGASRRFDVPAPWVGRVWARTRCSTTAGNFTCLTGDCGSGQVACGGAGGLPPATLAEFTLSSSGSTQASFYSLSLAAAQGLAGRRAV
LQMNAVCPVELQLKASGSGGEVIGCKGACSALGQARYCCTGEFNSPDKCKPTAYSKIFKAQCPEAYSYAYDDEGSTFSCGNQPDYVITFYGFDTKISVCRGPILLDISKV
KIL