CuGenDBv2

MC01g1179 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC01g1179
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Thaumatin
Genome location: MC01:17081190..17087055
RNA-Seq Expression: MC01g1179
Synteny: MC01g1179
Gene Ontology terms:
    GO:0006952 - defense response (biological process)
    GO:0005576 - extracellular region (cellular component)
    GO:0016021 - integral component of membrane (cellular component)
InterPro domains:
    IPR001938 - Thaumatin family
    IPR017949 - Thaumatin, conserved site
    IPR037176 - Osmotin/thaumatin-like superfamily
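The genome location field above uses a `seqid:start..end` layout. As a minimal sketch, the following Python snippet splits such a string into its parts and reports the span. The names `GenomeLocation` and `parse_location` are hypothetical helpers introduced for illustration, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'seqid:start..end' location string."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'MC01:17081190..17087055'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return GenomeLocation(seqid, int(start), int(end))


loc = parse_location("MC01:17081190..17087055")
print(loc.seqid, loc.start, loc.end, loc.span)
```

If the coordinates turn out to be 0-based or half-open, only the `span` property needs to change.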


Homology
GenBank top hits (e value, % identity, alignment)
XP_022926390.1 thaumatin-like protein 1 isoform X1 [Cucurbita moschata] (e value: 1.45e-190; % identity: 81.4)
Query:  MGLFCHSHHSSSFFSSLVVLLLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDC
        M L CH  HS  F SSLV+LLL KEVS ATF F+NKCDFTVWPGILASAGSPKLETTGFEL++ +SRSL+APTGWSGRFWGRT C+FD SGRGSCNTGDC
Subjt:  MGLFCHSHHSSSFFSSLVVLLLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDC

Query:  GSGEIECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGGACKSACEAFAAPEYCCSGAYNSP
        GSGE+ECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCA+TGCIT++NQLCP ELKA+GGGACKSACEAFA PEYCCSGAYNSP
Subjt:  GSGEIECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGGACKSACEAFAAPEYCCSGAYNSP

Query:  ATCRPSMYSQMFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSTPSRKSSRDSSPVIPDPTTNPTGSEPVIGTGAGVGMGSDSIQGAALSSSWLADMA
        ATCRPS+YS MFKSACP+SYSYAYDDATSTFTC GADYTITFCPSTPSRKSSRDS P++P+PTT  +GSEP+  TG+G+  GSDSIQGAALS+SWLADMA
Subjt:  ATCRPSMYSQMFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSTPSRKSSRDSSPVIPDPTTNPTGSEPVIGTGAGVGMGSDSIQGAALSSSWLADMA

Query:  IGDSAIANQPSFKFQSLSTLIFSLVLSL
        IG+S++  + S  FQSL+ LI  L+L L
Subjt:  IGDSAIANQPSFKFQSLSTLIFSLVLSL

XP_022977635.1 thaumatin-like protein 1 isoform X2 [Cucurbita maxima] (e value: 2.20e-188; % identity: 80.3)
Query:  MGLFCHSHHSSSFFSSLVVLLLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDC
        MGLF   HHSS FFS+L+VLLLVKEVSGATF F+NKC+FTVWPGILASAGSPKLE+TGFEL Q SSRS QAPT WSGRFWGRT+CN D SGR  CNTGDC
Subjt:  MGLFCHSHHSSSFFSSLVVLLLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDC

Query:  GSGEIECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGGACKSACEAFAAPEYCCSGAYNSP
        GSGEIECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMI+EGTGGSGTC+STGCITDLNQLCP ELKA+GGGACKSACEAFA+PEYCCSG YNSP
Subjt:  GSGEIECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGGACKSACEAFAAPEYCCSGAYNSP

Query:  ATCRPSMYSQMFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSTPSRKSSRDSSPVIPDPTTNPTGSEPVIGTGAGVGMGSDSIQGAALSSSWLADMA
        ATCRPS+YS+MFKSACP+SYSYAYDDATSTFTCNGADYTITFCPS PS+KSSRDSSP+IP+PTT   GSEP    G+   +GS  +QGA LS+SWLADMA
Subjt:  ATCRPSMYSQMFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSTPSRKSSRDSSPVIPDPTTNPTGSEPVIGTGAGVGMGSDSIQGAALSSSWLADMA

Query:  IGDSAIANQPSFKFQSLSTLIFSLVLSLLL
        IG S++A +P+  FQ +S LIF +V+ L+L
Subjt:  IGDSAIANQPSFKFQSLSTLIFSLVLSLLL

XP_023517221.1 thaumatin-like protein 1 [Cucurbita pepo subsp. pepo] (e value: 2.80e-188; % identity: 80.79)
Query:  MGLFCHSHHSSSFFSSLVVLLLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDC
        M L CH  HS  F SSLV+LLL KEVS ATF F+NKCDFTVWPGILASAGSPKLETTGFEL++ +SRSL+APTGWSGRFWGRT C+FD SGRGSCNTGDC
Subjt:  MGLFCHSHHSSSFFSSLVVLLLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDC

Query:  GSGEIECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGGACKSACEAFAAPEYCCSGAYNSP
        GSGE+ECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCA+TGCIT++NQLCP ELKA+GGGACKSACEAFA PEYCCSGAYNSP
Subjt:  GSGEIECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGGACKSACEAFAAPEYCCSGAYNSP

Query:  ATCRPSMYSQMFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSTPSRKSSRDSSPVIPDPTTNPTGSEPVIGTGAGVGMGSDSIQGAALSSSWLADMA
        ATCRPS+YS MFKSACP+SYSYAYDDATSTFTC GADYTITFCPSTPSRKSSRDS P++P+ TT  +GSEP+  TG+G+  GSD IQGAALS+SWLADMA
Subjt:  ATCRPSMYSQMFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSTPSRKSSRDSSPVIPDPTTNPTGSEPVIGTGAGVGMGSDSIQGAALSSSWLADMA

Query:  IGDSAIANQPSFKFQSLSTLIFSLVLSL
        IG+S++A + S  FQ L+ LI  L+L L
Subjt:  IGDSAIANQPSFKFQSLSTLIFSLVLSL

XP_023544101.1 thaumatin-like protein 1 isoform X2 [Cucurbita pepo subsp. pepo] (e value: 5.41e-189; % identity: 80.61)
Query:  MGLFCHSHHSSSFFSSLVVLLLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDC
        MGLF   HHSS FFS+L+VLLLVKEVSGATF F+NKC+FTVWPGILASAGSPKLE+TGFEL Q +SRS QAPT WSGRFWGRT+CN D S R SCNTGDC
Subjt:  MGLFCHSHHSSSFFSSLVVLLLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDC

Query:  GSGEIECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGGACKSACEAFAAPEYCCSGAYNSP
        GSGEIECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMI+EGTGGSGTC+STGCITDLNQLCP ELKA+GGGACKSACEAFA+PEYCCSG YNSP
Subjt:  GSGEIECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGGACKSACEAFAAPEYCCSGAYNSP

Query:  ATCRPSMYSQMFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSTPSRKSSRDSSPVIPDPTTNPTGSEPVIGTGAGVGMGSDSIQGAALSSSWLADMA
        ATC+PS+YS+MFKSACP+SYSYAYDDATSTFTCNGADYTITFCPS PS+KSSRDS PVIP+PTT  +GSEP    G+  G+GS  +QGA LS+SWLADMA
Subjt:  ATCRPSMYSQMFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSTPSRKSSRDSSPVIPDPTTNPTGSEPVIGTGAGVGMGSDSIQGAALSSSWLADMA

Query:  IGDSAIANQPSFKFQSLSTLIFSLVLSLLL
        IG S++A +P   FQ +S LIF +VLSL+L
Subjt:  IGDSAIANQPSFKFQSLSTLIFSLVLSLLL

XP_038880897.1 thaumatin-like protein 1 [Benincasa hispida] (e value: 5.14e-196; % identity: 84.04)
Query:  MGLFCHSHHSSSFFSSLVVLLLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDC
        MGLFC+ +HSS FFS+++VLLL+KE SGATF F+NKCDFTVWPGILASAGSPKLE TGFELE  SSRSLQA TGWSGRFWGRT+CNFD SGRGSCNTGDC
Subjt:  MGLFCHSHHSSSFFSSLVVLLLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDC

Query:  GSGEIECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGGACKSACEAFAAPEYCCSGAYNSP
        GSGEIECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCP ELKA+ GGACKSACEAFA PEYCCSG YNSP
Subjt:  GSGEIECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGGACKSACEAFAAPEYCCSGAYNSP

Query:  ATCRPSMYSQMFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSTPSRKSSRDSSPVIPDPTTNPTGSEPVIGTGAGVGMGSDSIQGAALSSSWLADMA
        ATCRPSMYSQMFKSACPKSYSYAYDDATSTFTCNGADYTITFCPS+PSRKSSR SSP+IP+P T+ +GSEP+  TG+G   GS +IQGAALSSSWLADMA
Subjt:  ATCRPSMYSQMFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSTPSRKSSRDSSPVIPDPTTNPTGSEPVIGTGAGVGMGSDSIQGAALSSSWLADMA

Query:  IG-DSAIANQPSFKFQSLSTLIFSLVLSLLLS
        IG DS+IA +PS   Q  S LI S+ LSL+LS
Subjt:  IG-DSAIANQPSFKFQSLSTLIFSLVLSLLLS

TrEMBL top hits (e value, % identity, alignment)
A0A1S3B2S8 thaumatin-like protein 1 (e value: 5.74e-187; % identity: 81.02)
Query:  MGLFCHSHHSSSFFSSLVVLLLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDC
        MG  C+  HSS  FS+L+VLLL+KEVSGATF F+NKCD+TVWPGILAS+GSPKLETTGFEL+  +SRSLQA TGWSGRFWGRT CNFD SGRGSCNTGDC
Subjt:  MGLFCHSHHSSSFFSSLVVLLLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDC

Query:  GSGEIECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGGACKSACEAFAAPEYCCSGAYNSP
        GSGEIECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCP ELKA+ GGACKSACEAF  PEYCCSG YNSP
Subjt:  GSGEIECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGGACKSACEAFAAPEYCCSGAYNSP

Query:  ATCRPSMYSQMFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSTPSRKSSRDSSPVIPDPTTNPTGSEPVIGTGAGVGMGSDSIQGAALSSSWLADMA
        ATCRPSMYSQMFKSACPKSYSYAYDDATSTFTCNGADYTITFCPS+PSRKSS DSSP++P+  T  T SEP   + +G       IQGAALSSSWLA+MA
Subjt:  ATCRPSMYSQMFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSTPSRKSSRDSSPVIPDPTTNPTGSEPVIGTGAGVGMGSDSIQGAALSSSWLADMA

Query:  IG-DSAIANQPSFKFQSLSTLIFSLVLSLLLS
        IG DS++A +PS  FQ +S LIF+LVLSL+LS
Subjt:  IG-DSAIANQPSFKFQSLSTLIFSLVLSLLLS

A0A6J1EKZ4 thaumatin-like protein 1 isoform X1 (e value: 7.04e-191; % identity: 81.4)
Query:  MGLFCHSHHSSSFFSSLVVLLLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDC
        M L CH  HS  F SSLV+LLL KEVS ATF F+NKCDFTVWPGILASAGSPKLETTGFEL++ +SRSL+APTGWSGRFWGRT C+FD SGRGSCNTGDC
Subjt:  MGLFCHSHHSSSFFSSLVVLLLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDC

Query:  GSGEIECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGGACKSACEAFAAPEYCCSGAYNSP
        GSGE+ECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCA+TGCIT++NQLCP ELKA+GGGACKSACEAFA PEYCCSGAYNSP
Subjt:  GSGEIECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGGACKSACEAFAAPEYCCSGAYNSP

Query:  ATCRPSMYSQMFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSTPSRKSSRDSSPVIPDPTTNPTGSEPVIGTGAGVGMGSDSIQGAALSSSWLADMA
        ATCRPS+YS MFKSACP+SYSYAYDDATSTFTC GADYTITFCPSTPSRKSSRDS P++P+PTT  +GSEP+  TG+G+  GSDSIQGAALS+SWLADMA
Subjt:  ATCRPSMYSQMFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSTPSRKSSRDSSPVIPDPTTNPTGSEPVIGTGAGVGMGSDSIQGAALSSSWLADMA

Query:  IGDSAIANQPSFKFQSLSTLIFSLVLSL
        IG+S++  + S  FQSL+ LI  L+L L
Subjt:  IGDSAIANQPSFKFQSLSTLIFSLVLSL

A0A6J1GCX1 thaumatin-like protein 1 isoform X2 (e value: 3.05e-188; % identity: 80.79)
Query:  MGLFCHSHHSSSFFSSLVVLLLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDC
        MGLF   H SS FFS+L+VLLLVKEVSGATF F+NKC+FTVWPGILASAGSPKLE+TGFEL Q SSRS QAPT WSGRFWGRT+CN + SGR  CNTGDC
Subjt:  MGLFCHSHHSSSFFSSLVVLLLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDC

Query:  GSGEIECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGGACKSACEAFAAPEYCCSGAYNSP
        GSGEIECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMI+EGTGGSGTC+STGCITDLNQLCP ELKA+GGGACKSACEAFA+PEYCCSG YNSP
Subjt:  GSGEIECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGGACKSACEAFAAPEYCCSGAYNSP

Query:  ATCRPSMYSQMFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSTPSRKSSRDSSPVIPDPTTNPTGSEPVIGTGAGVGMGSDSIQGAALSSSWLADMA
        ATC+PS+YS+MFKSACP+SYSYAYDDATSTFTCNGADYTITFCPS PS+KSSRDSSPVIP+PTT  +GSEP    G+  G+GS  +QGA LS+SWLADMA
Subjt:  ATCRPSMYSQMFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSTPSRKSSRDSSPVIPDPTTNPTGSEPVIGTGAGVGMGSDSIQGAALSSSWLADMA

Query:  IGDSAIANQPSFKFQSLSTLIFSLVLSL
        IG S+IA +P+  FQ  S++IF +VLSL
Subjt:  IGDSAIANQPSFKFQSLSTLIFSLVLSL

A0A6J1IMW8 thaumatin-like protein 1 isoform X2 (e value: 1.06e-188; % identity: 80.3)
Query:  MGLFCHSHHSSSFFSSLVVLLLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDC
        MGLF   HHSS FFS+L+VLLLVKEVSGATF F+NKC+FTVWPGILASAGSPKLE+TGFEL Q SSRS QAPT WSGRFWGRT+CN D SGR  CNTGDC
Subjt:  MGLFCHSHHSSSFFSSLVVLLLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDC

Query:  GSGEIECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGGACKSACEAFAAPEYCCSGAYNSP
        GSGEIECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMI+EGTGGSGTC+STGCITDLNQLCP ELKA+GGGACKSACEAFA+PEYCCSG YNSP
Subjt:  GSGEIECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGGACKSACEAFAAPEYCCSGAYNSP

Query:  ATCRPSMYSQMFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSTPSRKSSRDSSPVIPDPTTNPTGSEPVIGTGAGVGMGSDSIQGAALSSSWLADMA
        ATCRPS+YS+MFKSACP+SYSYAYDDATSTFTCNGADYTITFCPS PS+KSSRDSSP+IP+PTT   GSEP    G+   +GS  +QGA LS+SWLADMA
Subjt:  ATCRPSMYSQMFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSTPSRKSSRDSSPVIPDPTTNPTGSEPVIGTGAGVGMGSDSIQGAALSSSWLADMA

Query:  IGDSAIANQPSFKFQSLSTLIFSLVLSLLL
        IG S++A +P+  FQ +S LIF +V+ L+L
Subjt:  IGDSAIANQPSFKFQSLSTLIFSLVLSLLL

A0A6J1KQK9 thaumatin-like protein 1 isoform X1 (e value: 1.75e-188; % identity: 80.79)
Query:  MGLFCHSHHSSSFFSSLVVLLLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDC
        M L CH  HS  F SSLV+LLL KEVS ATF F+NKCDFTVWPGILASAGSPKLETTGFEL++ +SRSL+APTGWSGRFWGRT C+FD SGRGSC+TGDC
Subjt:  MGLFCHSHHSSSFFSSLVVLLLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDC

Query:  GSGEIECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGGACKSACEAFAAPEYCCSGAYNSP
         SGE+ECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCA+TGCIT++NQLCP ELKA+GGGACKSACEAFA PEYCCSGAYNSP
Subjt:  GSGEIECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGGACKSACEAFAAPEYCCSGAYNSP

Query:  ATCRPSMYSQMFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSTPSRKSSRDSSPVIPDPTTNPTGSEPVIGTGAGVGMGSDSIQGAALSSSWLADMA
        ATCRPS+YS MFKSACP+SYSYAYDDATSTFTC GADYTITFCPSTPSRKSSRDS P++P+PTT  +GSEPV  TG+G+  GSDSI+GA LS+SWLADM 
Subjt:  ATCRPSMYSQMFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSTPSRKSSRDSSPVIPDPTTNPTGSEPVIGTGAGVGMGSDSIQGAALSSSWLADMA

Query:  IGDSAIANQPSFKFQSLSTLIFSLVLSL
        IG+S+IA + S  FQSL+ LI  L+L L
Subjt:  IGDSAIANQPSFKFQSLSTLIFSLVLSL

SwissProt top hits (e value, % identity, alignment)
A0A1P8B554 Thaumatin-like protein 1 (e value: 8.1e-92; % identity: 66.93)
Query:  HSHHSSSF----FSSLVVLLLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDS-SGRGSCNTGDC
        HSH S  F    F     L LV    GAT   +N+C FTVWPGIL+++GS  + TTGFEL    SRS QAP  WSGRFW RT CNF+S +G+G+C TGDC
Subjt:  HSHHSSSF----FSSLVVLLLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDS-SGRGSCNTGDC

Query:  GSGEIECNGAGATPPATLAEFTLGSG------SQDFYDVSLVDGYNLPMIVEGTGGS-GTCASTGCITDLNQLCPTELKADGGGACKSACEAFAAPEYCC
        GS ++ECNGAGA PPATLAEFT+GSG       QDFYDVSLVDGYN+PM+VE +GGS GTC +TGC+TDLNQ CPTEL+   G ACKSACEAF +PEYCC
Subjt:  GSGEIECNGAGATPPATLAEFTLGSG------SQDFYDVSLVDGYNLPMIVEGTGGS-GTCASTGCITDLNQLCPTELKADGGGACKSACEAFAAPEYCC

Query:  SGAYNSPATCRPSMYSQMFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSTP
        SGAY SP  C+PSMYS++FKSACP+SYSYA+DDATSTFTC  ADYTITFCPS P
Subjt:  SGAYNSPATCRPSMYSQMFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSTP

O80327 Thaumatin-like protein 1 (e value: 5.1e-70; % identity: 55.37)
Query:  FSSLVVLLLV-----KEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDCGSGEIECN
        F +L+ L+LV       V  A F F NKC  TVWPG L   G P+L +TGFEL   +S SL     WSGRFWGR+ C+ DSSG+  C+TGDCGSG+I CN
Subjt:  FSSLVVLLLV-----KEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDCGSGEIECN

Query:  GAGATPPATLAEFTLG-SGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGG----ACKSACEAFAAPEYCCSGAYNSPATC
        GAGA+PPA+L E TL  +G QDFYDVSLVDG+NLP+ +   GGSG C ST C  ++N +CP EL   G       CKSAC A   P+YCC+GAY +P TC
Subjt:  GAGATPPATLAEFTLG-SGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGG----ACKSACEAFAAPEYCCSGAYNSPATC

Query:  RPSMYSQMFKSACPKSYSYAYDDATSTFTC-NGADYTITFCP
         P+ +S++FK+ CP++YSYAYDD +STFTC  G +Y ITFCP
Subjt:  RPSMYSQMFKSACPKSYSYAYDDATSTFTC-NGADYTITFCP

P28493 Pathogenesis-related protein 5 (e value: 1.1e-67; % identity: 57.32)
Query:  SSLVVLLLVKEVSG-----ATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDCGSGEIECNG
        SS+ +L LV   SG       F   N C  TVW G LA  G PKL   GFEL   +SR L AP GWSGRFW RT CNFD+SG G C TGDCG   + CNG
Subjt:  SSLVVLLLVKEVSG-----ATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDCGSGEIECNG

Query:  AGATPPATLAEFTL-GSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKA---DGGGACKSACEAFAAPEYCCSGAYNSPATCRP
         G  PP TLAEFTL G G +DFYDVSLVDGYN+ + +  +GGSG C   GC++DLN  CP  LK    +   ACKSACE F   +YCC GA + P TC P
Subjt:  AGATPPATLAEFTL-GSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKA---DGGGACKSACEAFAAPEYCCSGAYNSPATCRP

Query:  SMYSQMFKSACPKSYSYAYDDATSTFTCNGADYTITFCP
        + YS++FK+ACP +YSYAYDD TSTFTC GA+Y ITFCP
Subjt:  SMYSQMFKSACPKSYSYAYDDATSTFTCNGADYTITFCP

P50699 Thaumatin-like protein (e value: 9.6e-69; % identity: 56.52)
Query:  LLLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDCGSGEIECNGAGATPPATLA
        LLL+   S +T  F NKC   VWPGI  SAG   L   GF+L  + + SLQ P  WSGRFWGR  C FD SGRG C TGDCG G + CNGAG  PPATLA
Subjt:  LLLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDCGSGEIECNGAGATPPATLA

Query:  EFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGG-----ACKSACEAFAAPEYCCSGAYNSPATCRPSMYSQMFKS
        E TLG    DFYDVSLVDGYNL M +    GSG C+  GC++DLNQ+CP  L+          ACKSAC AF +P+YCC+G + +P +C+P+ YS++FK 
Subjt:  EFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGG-----ACKSACEAFAAPEYCCSGAYNSPATCRPSMYSQMFKS

Query:  ACPKSYSYAYDDATSTFTCNGADYTITFCP
        ACPK+YSYAYDD TS  TC+ A+Y +TFCP
Subjt:  ACPKSYSYAYDDATSTFTCNGADYTITFCP

Q5DWG1 Pathogenesis-related thaumatin-like protein 3.5 (e value: 9.9e-74; % identity: 60.71)
Query:  SGAT-FRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDCGSGEIECNGAGATPPATLAEFTLGS
        +GAT F  +NKC +TVWPG L+ +GS  L   GF L    S  L A + WSGRFWGRT C+FD+SG+GSC TGDCG+  + C  AG TPP +LAEFTL  
Subjt:  SGAT-FRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDCGSGEIECNGAGATPPATLAEFTLGS

Query:  GSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGG---ACKSACEAFAAPEYCCSGAYNSPATCRPSMYSQMFKSACPKSYSY
        G +DFYDVSLVDGYN+P+ +   GG+G C + GC++DL   CP EL     G   ACKSAC AF+ PEYCC+G + SP TC PS YSQ+FKSACP +YSY
Subjt:  GSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGG---ACKSACEAFAAPEYCCSGAYNSPATCRPSMYSQMFKSACPKSYSY

Query:  AYDDATSTFTCNGADYTITFCPST
        AYDDATSTFTC+ ADYTITFCPS+
Subjt:  AYDDATSTFTCNGADYTITFCPST

Arabidopsis top hits (e value, % identity, alignment)
AT1G20030.2 Pathogenesis-related thaumatin superfamily protein (e value: 9.5e-80; % identity: 57.55)
Query:  FSSLVVLLLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDCGSGEIECNGAGAT
        F  L+  L    V   +F F NKCD+TVWPGIL++AG   L TTGF L +  +R++ AP+ W GRFWGRT C+ DS G+ SC TGDCGSG+IEC+GAGA 
Subjt:  FSSLVVLLLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDCGSGEIECNGAGAT

Query:  PPATLAEFTL-GSGSQDFYDVSLVDGYNLPMIVEGTGGSG-TCASTGCITDLNQLCPTELK---ADG----GGACKSACEAFAAPEYCCSGAYNSPATCR
        PPATLAEFTL GSG  DFYDVSLVDGYN+ M+V   GGSG  C+STGC+ DLN  CP+EL+    DG      ACKSACEAF  PEYCCSGA+ SP TC+
Subjt:  PPATLAEFTL-GSGSQDFYDVSLVDGYNLPMIVEGTGGSG-TCASTGCITDLNQLCPTELK---ADG----GGACKSACEAFAAPEYCCSGAYNSPATCR

Query:  PSMYSQMFKSACPKSYSYAYDDATSTFTC-NGADYTITFCPS-TPSRKSSRDSSP---VIPDPTTNPTGSEPVIGTGA
        PS YS++FKSACP++YSYAYDD +STFTC    +Y ITFCPS   S KS+ + S        P++  T S  ++  GA
Subjt:  PSMYSQMFKSACPKSYSYAYDDATSTFTC-NGADYTITFCPS-TPSRKSSRDSSP---VIPDPTTNPTGSEPVIGTGA

AT1G75800.1 Pathogenesis-related thaumatin superfamily protein (e value: 1.2e-82; % identity: 53.46)
Query:  LLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDCGSGEIECNGAGATPPATLAE
        L V  V   +F  +NKC++TVWPG+L++AG P L TTGF L++   R++ APT W GRFWGRT C+ D+ G+ +C TGDCGSG +EC+G+GATPPATLAE
Subjt:  LLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDCGSGEIECNGAGATPPATLAE

Query:  FTL-GSGSQDFYDVSLVDGYNLPMIVEGTGGSG-TCASTGCITDLNQLCPTELKA---DGGG----ACKSACEAFAAPEYCCSGAYNSPATCRPSMYSQM
        FTL GS   DFYDVSLVDGYN+PM+V   GGSG  C+STGC+ DLN  CP+ELK    DG G     CKSACEAF  PEYCCSGA+ +P TC+PS YS M
Subjt:  FTL-GSGSQDFYDVSLVDGYNLPMIVEGTGGSG-TCASTGCITDLNQLCPTELKA---DGGG----ACKSACEAFAAPEYCCSGAYNSPATCRPSMYSQM

Query:  FKSACPKSYSYAYDDATSTFTC-NGADYTITFCPS-TPSRKSSRDSSPVIPDPTTNPTGSEPVIGTGAGVGMGSDSIQGAALSSSWLADMAIGDSAIANQ
        FK+ACP++YSYAYDD +STFTC    +Y ITFCP+   S+KSS+D SP  P PTT PT      GT +    G  S   + + +S + + A+  +     
Subjt:  FKSACPKSYSYAYDDATSTFTC-NGADYTITFCPS-TPSRKSSRDSSPVIPDPTTNPTGSEPVIGTGAGVGMGSDSIQGAALSSSWLADMAIGDSAIANQ

Query:  PSFKFQSLSTLIFSLVLS
        PS    SL  +  +L L+
Subjt:  PSFKFQSLSTLIFSLVLS

AT4G24180.1 THAUMATIN-LIKE PROTEIN 1 (e value: 5.7e-93; % identity: 66.93)
Query:  HSHHSSSF----FSSLVVLLLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDS-SGRGSCNTGDC
        HSH S  F    F     L LV    GAT   +N+C FTVWPGIL+++GS  + TTGFEL    SRS QAP  WSGRFW RT CNF+S +G+G+C TGDC
Subjt:  HSHHSSSF----FSSLVVLLLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDS-SGRGSCNTGDC

Query:  GSGEIECNGAGATPPATLAEFTLGSG------SQDFYDVSLVDGYNLPMIVEGTGGS-GTCASTGCITDLNQLCPTELKADGGGACKSACEAFAAPEYCC
        GS ++ECNGAGA PPATLAEFT+GSG       QDFYDVSLVDGYN+PM+VE +GGS GTC +TGC+TDLNQ CPTEL+   G ACKSACEAF +PEYCC
Subjt:  GSGEIECNGAGATPPATLAEFTLGSG------SQDFYDVSLVDGYNLPMIVEGTGGS-GTCASTGCITDLNQLCPTELKADGGGACKSACEAFAAPEYCC

Query:  SGAYNSPATCRPSMYSQMFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSTP
        SGAY SP  C+PSMYS++FKSACP+SYSYA+DDATSTFTC  ADYTITFCPS P
Subjt:  SGAYNSPATCRPSMYSQMFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSTP

AT4G38660.1 Pathogenesis-related thaumatin superfamily protein (e value: 3.6e-103; % identity: 60.44)
Query:  SSSFFSSLVVLLLVKEVS-GATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDCGSGEIECN
        SS+   S  VLLL    S G+TF F N+C +TVWPGIL++AGSP L TTGFEL + +SRSLQAPTGWSGRFW RT C FDSSG G+C TGDCGS  +EC 
Subjt:  SSSFFSSLVVLLLVKEVS-GATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDCGSGEIECN

Query:  GAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGGACKSACEAFAAPEYCCSGAYNSPATCRPSMY
        G GA PP TLAEFTLG+G  DFYDVSLVDGYN+PMIVE  GGSG CASTGC TDLN  CP EL+   G ACKSAC AF +PEYCCSGAY +P++CRPS+Y
Subjt:  GAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGGACKSACEAFAAPEYCCSGAYNSPATCRPSMY

Query:  SQMFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSTPSRKSSRDSSPVIPDPTTNPTGSEPVIGTGAGV-----------------GMGSDSIQGAAL
        S+MFK+ACP+SYSYAYDDATSTFTC G DYT+TFCPS+PS+KS+   SP + D ++   GS+PV G+  G                  G GS+   G  +
Subjt:  SQMFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSTPSRKSSRDSSPVIPDPTTNPTGSEPVIGTGAGV-----------------GMGSDSIQGAAL

Query:  --SSSWLADMAIGDSA
            SW+A +A+G+++
Subjt:  --SSSWLADMAIGDSA

AT4G38660.2 Pathogenesis-related thaumatin superfamily protein (e value: 3.0e-102; % identity: 60.73)
Query:  LVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDCGSGEIECNGAGATPPATLAEF
        ++K   G+TF F N+C +TVWPGIL++AGSP L TTGFEL + +SRSLQAPTGWSGRFW RT C FDSSG G+C TGDCGS  +EC G GA PP TLAEF
Subjt:  LVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDCGSGEIECNGAGATPPATLAEF

Query:  TLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGGACKSACEAFAAPEYCCSGAYNSPATCRPSMYSQMFKSACPKSYS
        TLG+G  DFYDVSLVDGYN+PMIVE  GGSG CASTGC TDLN  CP EL+   G ACKSAC AF +PEYCCSGAY +P++CRPS+YS+MFK+ACP+SYS
Subjt:  TLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGGACKSACEAFAAPEYCCSGAYNSPATCRPSMYSQMFKSACPKSYS

Query:  YAYDDATSTFTCNGADYTITFCPSTPSRKSSRDSSPVIPDPTTNPTGSEPVIGTGAGV-----------------GMGSDSIQGAAL--SSSWLADMAIG
        YAYDDATSTFTC G DYT+TFCPS+PS+KS+   SP + D ++   GS+PV G+  G                  G GS+   G  +    SW+A +A+G
Subjt:  YAYDDATSTFTCNGADYTITFCPSTPSRKSSRDSSPVIPDPTTNPTGSEPVIGTGAGV-----------------GMGSDSIQGAAL--SSSWLADMAIG

Query:  DSA
        +++
Subjt:  DSA
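In the alignments above, the unlabeled middle line of each block marks identical residues with a letter, conservative substitutions with `+`, and mismatches or gaps with a space. As a rough sketch, the Python function below counts identities and positives from one query line and its middle line. The function name `midline_stats` is hypothetical, both strings are assumed to have their 8-character `Query:  ` prefix already removed, and the percentage it returns is per segment, so it need not match the whole-hit % identity reported in this record.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives for one BLAST-style alignment segment.

    Assumptions (not stated by the source page): `query` and `midline` are the
    same length, a letter in the midline marks an identical residue, and '+'
    marks a conservative substitution.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # An identity is a midline letter that repeats the query residue.
    identities = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    positives = identities + midline.count("+")
    length = len(query)
    percent = 100.0 * identities / length if length else 0.0
    return {
        "identities": identities,
        "positives": positives,
        "length": length,
        "percent_identity": percent,
    }


# Toy segment, not taken from the record above.
print(midline_stats("ACDE", "A+ E"))
```

Summing `identities` and `length` over every segment of a hit before dividing gives a whole-alignment figure.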


Sequences
CDS sequence
ATGGGATTGTTCTGCCATTCCCACCACTCTTCTTCCTTCTTCTCGAGCTTGGTTGTGCTTCTTCTTGTCAAAGAGGTTTCCGGTGCTACATTCAGGTTCATTAACAAGTG
CGATTTCACCGTCTGGCCGGGAATTCTCGCCAGCGCCGGCAGCCCAAAACTGGAGACCACCGGCTTCGAATTAGAACAGGACAGTTCCCGCTCATTACAAGCTCCAACTG
GTTGGTCGGGTCGGTTCTGGGGCCGGACGAGCTGCAATTTCGACAGCTCCGGCCGTGGCAGCTGCAACACCGGCGACTGCGGCTCCGGCGAGATTGAGTGCAACGGCGCC
GGCGCCACGCCTCCGGCAACCTTAGCCGAGTTCACGCTCGGCTCCGGCTCGCAGGATTTCTACGATGTCAGCCTCGTCGATGGCTACAATTTGCCGATGATCGTCGAGGG
AACGGGTGGGTCAGGGACGTGCGCGTCCACTGGCTGCATTACGGATCTGAACCAGCTATGCCCGACGGAGCTGAAGGCGGACGGCGGCGGCGCGTGCAAGAGCGCTTGTG
AAGCCTTTGCGGCGCCGGAATACTGTTGCAGTGGCGCGTACAATTCGCCGGCGACATGCCGGCCGTCAATGTACTCGCAAATGTTCAAATCGGCGTGCCCCAAATCGTAC
AGCTACGCTTATGATGACGCCACCAGCACATTCACTTGCAATGGAGCAGATTATACCATCACATTTTGCCCATCTACCCCAAGCCGAAAATCTTCGAGGGATTCATCGCC
GGTGATACCAGACCCGACGACAAACCCGACGGGGTCGGAGCCCGTGATAGGGACGGGGGCGGGGGTGGGGATGGGATCAGATTCCATACAAGGAGCAGCTCTGAGTAGTT
CATGGCTTGCAGATATGGCCATAGGGGATTCTGCCATTGCCAATCAACCGAGCTTCAAATTTCAATCTCTGTCAACTTTGATCTTCTCTCTTGTTCTATCTCTCCTTCTC
TCGTAG
mRNA sequence
TGCCCAAAAAAAAAAATTATTTGACAACCTACTAATTTTTAAAGAATATTATGTAAAATTCAAATTTTGAAATTAAATAAATAATAATAATAGGTTAAAACCAGACAATA
AATGAATGAACGAGGGGAAGACGAAGTGGTACCCATGGGCCGTAACTTCTACTCTAAATTTAGGGTCGTTCTTTAGAGGGAAAAAAAGGAAAAAGAAAAAGGAAAAAGAA
AAAGGAAAAAAAGGGTGAAGATGATTGGTCGGTGAGTGAGAGTGATGTCGGTGACGGGGCACTTCAATTTCCCTTCGATTGATCACGTTTCATTATAAATAACAGAGCAT
CGCCATGTCCATGCTTAAGGTTCTCTCACCACTCTCTCCTTCTACTTTCCTTTCCCCTTTCCTTTTTATTCCTTCTTTTTCAAAACAATTTTACAGGAAATAATCTCTAT
CGCCTTCTCCAATATTCCTTCTCTCTCATCCCCATTTTTCTCTCCCATAAACCCCTGTTTCCGCCATTGCTCTCTCTCTCTCTCTGCATCTTCACCTTTTCTCTTCTCTC
TCCTAACTTCAAAAATCTCATAACTCAAATCCCACTTCGCAAAATTCATCTTAACCATACCCATTCTTACAAATCTGTGATATCAAGCAATTGTTTCAGCAGAATCAGTG
AGCTTTAGCACGAGGATAAAACAGAGCATTTCAATGGGATTGTTCTGCCATTCCCACCACTCTTCTTCCTTCTTCTCGAGCTTGGTTGTGCTTCTTCTTGTCAAAGAGGT
TTCCGGTGCTACATTCAGGTTCATTAACAAGTGCGATTTCACCGTCTGGCCGGGAATTCTCGCCAGCGCCGGCAGCCCAAAACTGGAGACCACCGGCTTCGAATTAGAAC
AGGACAGTTCCCGCTCATTACAAGCTCCAACTGGTTGGTCGGGTCGGTTCTGGGGCCGGACGAGCTGCAATTTCGACAGCTCCGGCCGTGGCAGCTGCAACACCGGCGAC
TGCGGCTCCGGCGAGATTGAGTGCAACGGCGCCGGCGCCACGCCTCCGGCAACCTTAGCCGAGTTCACGCTCGGCTCCGGCTCGCAGGATTTCTACGATGTCAGCCTCGT
CGATGGCTACAATTTGCCGATGATCGTCGAGGGAACGGGTGGGTCAGGGACGTGCGCGTCCACTGGCTGCATTACGGATCTGAACCAGCTATGCCCGACGGAGCTGAAGG
CGGACGGCGGCGGCGCGTGCAAGAGCGCTTGTGAAGCCTTTGCGGCGCCGGAATACTGTTGCAGTGGCGCGTACAATTCGCCGGCGACATGCCGGCCGTCAATGTACTCG
CAAATGTTCAAATCGGCGTGCCCCAAATCGTACAGCTACGCTTATGATGACGCCACCAGCACATTCACTTGCAATGGAGCAGATTATACCATCACATTTTGCCCATCTAC
CCCAAGCCGAAAATCTTCGAGGGATTCATCGCCGGTGATACCAGACCCGACGACAAACCCGACGGGGTCGGAGCCCGTGATAGGGACGGGGGCGGGGGTGGGGATGGGAT
CAGATTCCATACAAGGAGCAGCTCTGAGTAGTTCATGGCTTGCAGATATGGCCATAGGGGATTCTGCCATTGCCAATCAACCGAGCTTCAAATTTCAATCTCTGTCAACT
TTGATCTTCTCTCTTGTTCTATCTCTCCTTCTCTCGTAGTAGAAGGAGAAAGAAAGGGAATTAATTGACCAAGTCCAGATTGCAAACCATTAACTCGAGAATCTAGGAAA
CTTTATGATTGATCTATGTACAAAATTTAAGGCAAAAAGATGTCGCTTTTTTTTTTTTATTTAGAGGTAGGGGGAAAATAAAATAATAAAAAGGCATTTGATTTGGTTTG
AGGATTTGTTGAATTTAGATGATTCCGCAGCTTGAAAGGTGAGTGGGAATTTGGAATACTAGAGCCACAAAGTTTACACCTTTATGACAACACAAATAATGGGTGAATTT
TTATGTAAGTTGGCAAGTTGGACTTGGCAAAAATGACCCACAGCTACAATTTACATATAAACAATCAAACTAAAATCTGCCCGACATGTAAAATGTTGAGGGAAAGAATG
AAATATGCTCTCTTTTAAGTTTTGGATTGATTAGGATGAATTCCTAGTTTCAAAATGATAAAATTGGATATATGCAAAACTTGGCTAGTTGGTGCTAGCCAAACAAAATA
ATGGGTTGTCATAGGCAATTTTCATGATTTTTCCTTCATTAGTACTCTATTTTATTCCATCTAAGATATCAGATTAAACTATTCAAAAATAAATAGTGAGTTTGAAATGA
CACTTTTTTAAGCTAAAACATTCTTTTACCAACATTTTTAGAAGAAACACTATATAACTGTTTTTAAAATGTTTTTAGGATCTTTAAAATCACTTTTATTATCTAACCTA
CCACCATAAAAATGGCTAAAAACATATTATACCCTTCTAAAATTCATCCCGAACTCATCCAAAGAAGTGTCTAACCAAGACAAGTCAACTAAGCTGAGGTTACAAAAAAC
CCTAAGTTGTGAAAATCTAGGGAAAGAAAATGCTTAAAAACGTGTTTTACCCTTCCAAAAATCATCCCAAACTCACCCAAAAATGTGTCTAACAGGCTAAGTCGACTAAG
CTCGGGTGACCGAAAACCAAGTTACGAAAGTCTAGGGGAGAGAAAAGGGCAACGACTAACGAATGATCATAAAAAGGGAGAAAGAAGAAAGAGCATTTGATTAGTTTAAT
GTGGTGAATGAATGATAAGTTGGGGAGTATGGGAGATTGAATGGGAAGCAGCACATGGAGAGCAGCATGGGGTTGATGAGGCCAAATGGGTGTTGATTGGACGGCATGAA
GGAAAGATTCTTCTCGCTTTGGTATCTCTACTTTATACCTGAAAACCCACACTCGATGCCCCCAATAACACACTTCTTCATGTTTTTTTCCCTTTCTTTCTTTCTTTTTT
TTTTCTGTAATCTTTTCTGTTTTAATTTTAATGTGTATTTTGTCATAATCCCATGCAAGTTAGACAAGAATTTCATGTGTTTAAATTGTTTTTAAAAGGAGGGTATCTTC
AAGAGTTATTCTCAAATTTTGTTTTGTTGTTTCGAATAAAAGTTGTATTTTGAAAGGAGTGGTTAGGTGATTTTTATAGGAATTGACTTTGGTATTTGAACAATGTGGGA
TTATGATAAGAATTTGATACGATTTGCTAAAGTGAAGACTGAGGAATTAAGATGAGGGGATTAGAGAAATGGAAAGGAGTAGTAGGAAGGAAGGATTCTTGAACAAGTTG
GGTGTGAGGGAGAAGCAGAGAAAAAGCACCAAAAACAAAAGCAAAGATGGAAAAATGTTCCTTTTTCTTTACTGAAGATAAGAGGGGAAGAGAATTGGAGGCCATTCCTA
AGTGGGGAGTGGAGACTTTTTTCTTTTTTTTTTTTCGGGGTTCTGTTTCTACTACTAAAATCCCAAACCCCATTTATCGATTTTCAATTTTAAACCAATCCAACATGTTC
TTTCTCTTTCTCACTCATAATCAATCATTTTTCACTCTCTTTCTACTCTTTTTTTGTTCTTTTTTCCTCGGGATATTTTTCTGAAATATCTTCCCGTCAGTTTCATTTAT
TCAAAATTTCAAATTTTGGATATCAAAACTATTTTTTCTCTATTAGTTCAACAAAGATAGAAGATAAGAGATCGAAACTTTAACCTTATGAAAGGTTAATAGATAACTTT
ATCACTGAGCTATATTTGAATTTGGCAGATGCACATGGTTTTCTCAACTAATTTAGCATGATGTTACCGTTCTCATCATTTAATTCATCCCTAAATATTATTGAATAGAT
AAGAGAAAATGAAAATTATAAAGCGCAAAACAAATCACCCTAAAATAAAAGGCATAGTTGTAATCGAATATCATAGAAAATAATAGAAACTACCATTAAGTTGGAGAAAC
GATATGAGTACCCTTGAAATTATAAGAATAAAGAAAGACAATTACGCCTTCTCAAAATATAGTAAATAACCATTACCTCAATAAACATATGCCTTTACTTCAAACTGAAC
ATGTTTTGCACTCATTCCAAGCTGAACATCTAACACCAAAATTAATTAGTCATATTCCTAAAATAGTAAAAGAGCTATAATAAGAACCACAACATTACTTTTAAAAGATA
GTCAAATATCCTAATTAATGCCTCACATTTAATTTACATCCCACATCAACTCACACTGGATGTAAGTAGGTACAAGTCTAAAGATTGAACCCACTTTTTCATAACATGGC
AGCAAAATAGTGTCATTTTGTCATTTTTATTAAATCTCACCCTCATAGTTAGTCTTATAAAATATTGGAAAGGAAGACCCTAAATGATCACTTTCACATGAATGGTGTAG
AGTTATTTGACCCATCAAATAGCTTTTGACATTGTTGCATGCATCTATGTGGAAGGCAATATCCAATGCTTTTGGGATTGTTTTTGTATCATATAATAACTTTAAGGAAC
CTACCTCTTCGTCAAAAGTTTCCAAGCACAAACTAAACCATCCATAATGCCACAAATTAAGGGCAAGGCAACAGACACCATGTTTACATATTAATAATCCCAGAATTTGT
GGAGCTGAAAATATGTGCAGATGTCAGCATATAGAAATCATAGAGAAATTTGGTTCCGGAATTTGGAAGGCCTAAATTTGAGAAGTTTGGATTTGTGTAACCTATATTTA
GATTATGCAAAAGAATATGTTTTTTGACAGTTTGAGTTTATATTCCTTGTCTTGAGTTTTTAACTTTCTGATATGCAGAAGCTAATATATAGATTGAGAGGTGCTGGCTT
TAGAAACTAAACCATATAATGTTTCTATTTCTAACTCCTGAGGCTGACAAATTTGCCTTGTAAAAGCAAACTCAGAAACTAAAGTAGGCTCCACTTTCCAACTTTAACAC
TATACAAACAACCGGTGAATGAAGCATCTCCTAAAGGATTGGACAAGACAGCCTTTGAGGACAGCCGCTTTGGGTGTAGGACCAGAAGCTGAAGGGGATCCATCTGAGAC
ACCACCCCCACCAGTTGGGGGTGAAATGCTCCATCCACAATCACAAGTACAGTAGTATGGGCCAAACCAGCATCAAGGGTGCTAGGAAACAAATAATCGTGGCGGCTCCA
CCAGGACGGCGGTGCACAGTTAAAAGTAAGGAACAAAGCTGTAAGAAAGAGCAGCTACCGCAGTTATAACCGTGGTGGCCCTTTGCATGTTGAGTAATGGATCTAAAAGC
AGAGGTTGTTGATGAGCTTAATCCAATCCCAACAACCTTGTGCCTAAGCCAAGCCATAAGAGGAGCTAAAAAACTATTTGATAGATCGTTAGGCGCCAAGATTCGACAAG
GTCCAACCAGAACCCAGAAGCATCACAAAAATATATGGTTGTCGGAACAAGTCCATGGGCTACCTGCCGGAGGCAAATACTGTGTGGCATATATTGGGAATGGTGGACAT
GAGACACCATGCTGTGTCTCTTGCTGTTCCTATAGGTTTCCCAGCTGCTCCAGTATACCTAATACCTGGTACAAAGTAACTCCCTTTCTT
Protein sequence
MGLFCHSHHSSSFFSSLVVLLLVKEVSGATFRFINKCDFTVWPGILASAGSPKLETTGFELEQDSSRSLQAPTGWSGRFWGRTSCNFDSSGRGSCNTGDCGSGEIECNGA
GATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMIVEGTGGSGTCASTGCITDLNQLCPTELKADGGGACKSACEAFAAPEYCCSGAYNSPATCRPSMYSQMFKSACPKSY
SYAYDDATSTFTCNGADYTITFCPSTPSRKSSRDSSPVIPDPTTNPTGSEPVIGTGAGVGMGSDSIQGAALSSSWLADMAIGDSAIANQPSFKFQSLSTLIFSLVLSLLL
S
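The CDS and protein sequences above can be cross-checked by translating the CDS. The sketch below is one way to do that in plain Python, assuming the standard nuclear genetic code (NCBI translation table 1), which the record does not name explicitly. `translate` and `CODON_TABLE` are illustrative names, not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into one-letter amino acids ('*' marks a stop codon)."""
    # Drop whitespace and line breaks so a multi-line sequence can be pasted in.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # Codons with ambiguous bases are reported as 'X'.
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First six codons of the CDS listed above.
print(translate("ATGGGATTGTTCTGCCAT"))  # MGLFCH
```

Passing the full CDS and comparing the result, minus the trailing `*`, with the listed protein sequence would confirm that the two entries agree.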