; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0017435 (gene) of Chayote v1 genome

Gene IDSed0017435
OrganismSechium edule (Chayote v1)
DescriptionThaumatin
Genome locationLG10:8748061..8749790
RNA-Seq ExpressionSed0017435
SyntenySed0017435
Gene Ontology termsGO:0006952 - defense response (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR001938 - Thaumatin family
IPR017949 - Thaumatin, conserved site
IPR037176 - Osmotin/thaumatin-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022926390.1 thaumatin-like protein 1 isoform X1 [Cucurbita moschata]4.0e-14983.13Show/hide
Query:  MGLFCHFQHFNPFFSTLILLLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDC
        M L CHF+H   F S+L+LLLL KEVS ATF FVNKCDFTVWPGILAS+GSPKL TTGF+L+ GNSRSL+APTGWSGRFWGRT CDFDGSGRG CNTGDC
Subjt:  MGLFCHFQHFNPFFSTLILLLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDC

Query:  GSGEEECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGGACRSACEAFATAEYCCSGAFNTP
        GSGE ECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPM+VEGTGGSG CA+TGCIT+VNQLCP EL+AEGGGAC+SACEAFAT EYCCSGA+N+P
Subjt:  GSGEEECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGGACRSACEAFATAEYCCSGAFNTP

Query:  ATCRPSIYSEVFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSSPSRKSSRDSPPMVPEPTTEQSGSDSITG----SGPGSIQGAAFSSSWLADMAIG
        ATCRPSIYS +FKSACP+SYSYAYDDATSTFTC GADYTITFCPS+PSRKSSRDSPPMVPEPTTEQSGS+ ITG    SG  SIQGAA S+SWLADMAIG
Subjt:  ATCRPSIYSEVFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSSPSRKSSRDSPPMVPEPTTEQSGSDSITG----SGPGSIQGAAFSSSWLADMAIG

Query:  NSSTATKSSLAFQCLSIMILVLSLVL
        NSS  TKSSL FQ L+I+I +L LVL
Subjt:  NSSTATKSSLAFQCLSIMILVLSLVL

XP_022926391.1 thaumatin-like protein 1 isoform X2 [Cucurbita moschata]6.0e-14581.6Show/hide
Query:  MGLFCHFQHFNPFFSTLILLLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDC
        M L CHF+H   F S+L+LLLL KEVS ATF FVNKCDFTVWPGILAS+GSPKL TTGF+L+ GNSRSL+APTGWSGRFWGRT CDFDGSGRG CNTGDC
Subjt:  MGLFCHFQHFNPFFSTLILLLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDC

Query:  GSGEEECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGGACRSACEAFATAEYCCSGAFNTP
        GSGE ECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPM+VEGTGGSG CA+TGCIT+VNQLCP EL+AEGGGAC+SACEAFAT EYCCSGA+N+P
Subjt:  GSGEEECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGGACRSACEAFATAEYCCSGAFNTP

Query:  ATCRPSIYSEVFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSSPSRKSSRDSPPMVPEPTTEQSGSDSITG----SGPGSIQGAAFSSSWLADMAIG
        ATCRPSIYS +FKSACP+SYSYAYDDATSTFTC GADYTITFCPS+P     RDSPPMVPEPTTEQSGS+ ITG    SG  SIQGAA S+SWLADMAIG
Subjt:  ATCRPSIYSEVFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSSPSRKSSRDSPPMVPEPTTEQSGSDSITG----SGPGSIQGAAFSSSWLADMAIG

Query:  NSSTATKSSLAFQCLSIMILVLSLVL
        NSS  TKSSL FQ L+I+I +L LVL
Subjt:  NSSTATKSSLAFQCLSIMILVLSLVL

XP_023003911.1 thaumatin-like protein 1 isoform X1 [Cucurbita maxima]6.4e-14781.9Show/hide
Query:  MGLFCHFQHFNPFFSTLILLLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDC
        M L CHF+H   F S+L+LLLL KEVS ATF FVNKCDFTVWPGILAS+GSPKL TTGF+L+ GNSRSL+APTGWSGRFWGRT CDFDGSGRG C+TGDC
Subjt:  MGLFCHFQHFNPFFSTLILLLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDC

Query:  GSGEEECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGGACRSACEAFATAEYCCSGAFNTP
         SGE ECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPM+VEGTGGSG CA+TGCIT+VNQLCP EL+AEGGGAC+SACEAFAT EYCCSGA+N+P
Subjt:  GSGEEECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGGACRSACEAFATAEYCCSGAFNTP

Query:  ATCRPSIYSEVFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSSPSRKSSRDSPPMVPEPTTEQSGSDSITG----SGPGSIQGAAFSSSWLADMAIG
        ATCRPSIYS +FKSACP+SYSYAYDDATSTFTC GADYTITFCPS+PSRKSSRDSPPMVPEPTTEQSGS+ +TG    SG  SI+GA  S+SWLADM IG
Subjt:  ATCRPSIYSEVFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSSPSRKSSRDSPPMVPEPTTEQSGSDSITG----SGPGSIQGAAFSSSWLADMAIG

Query:  NSSTATKSSLAFQCLSIMILVLSLVL
        NSS ATKSSL FQ L+I+I VL LVL
Subjt:  NSSTATKSSLAFQCLSIMILVLSLVL

XP_023517221.1 thaumatin-like protein 1 [Cucurbita pepo subsp. pepo]4.5e-14882.82Show/hide
Query:  MGLFCHFQHFNPFFSTLILLLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDC
        M L CHF+H   F S+L+LLLL KEVS ATF FVNKCDFTVWPGILAS+GSPKL TTGF+L+ GNSRSL+APTGWSGRFWGRT CDFDGSGRG CNTGDC
Subjt:  MGLFCHFQHFNPFFSTLILLLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDC

Query:  GSGEEECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGGACRSACEAFATAEYCCSGAFNTP
        GSGE ECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPM+VEGTGGSG CA+TGCIT+VNQLCP EL+AEGGGAC+SACEAFAT EYCCSGA+N+P
Subjt:  GSGEEECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGGACRSACEAFATAEYCCSGAFNTP

Query:  ATCRPSIYSEVFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSSPSRKSSRDSPPMVPEPTTEQSGSDSITGSGPGS----IQGAAFSSSWLADMAIG
        ATCRPSIYS +FKSACP+SYSYAYDDATSTFTC GADYTITFCPS+PSRKSSRDSPPMVPE TTEQSGS+ ITGSG  S    IQGAA S+SWLADMAIG
Subjt:  ATCRPSIYSEVFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSSPSRKSSRDSPPMVPEPTTEQSGSDSITGSGPGS----IQGAAFSSSWLADMAIG

Query:  NSSTATKSSLAFQCLSIMILVLSLVL
        NSS ATKSSL FQ L+I+I +L L+L
Subjt:  NSSTATKSSLAFQCLSIMILVLSLVL

XP_038880897.1 thaumatin-like protein 1 [Benincasa hispida]9.3e-14680.79Show/hide
Query:  MGLFCHFQHFNPFFSTLILLLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDC
        MGLFC+F H +PFFST+I+LLL+KE SGATFTFVNKCDFTVWPGILAS+GSPKL  TGF+LE G+SRSLQA TGWSGRFWGRT C+FD SGRG CNTGDC
Subjt:  MGLFCHFQHFNPFFSTLILLLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDC

Query:  GSGEEECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGGACRSACEAFATAEYCCSGAFNTP
        GSGE ECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPM+VEGTGGSG CASTGCITD+NQLCPAEL+AE GGAC+SACEAFAT EYCCSG +N+P
Subjt:  GSGEEECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGGACRSACEAFATAEYCCSGAFNTP

Query:  ATCRPSIYSEVFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSSPSRKSSRDSPPMVPEPTTEQSGSDSITG----SGPGSIQGAAFSSSWLADMAI-
        ATCRPS+YS++FKSACPKSYSYAYDDATSTFTCNGADYTITFCPSSPSRKSSR S PM+PEP T++SGS+ +TG    SG G+IQGAA SSSWLADMAI 
Subjt:  ATCRPSIYSEVFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSSPSRKSSRDSPPMVPEPTTEQSGSDSITG----SGPGSIQGAAFSSSWLADMAI-

Query:  GNSSTATKSSLAFQCLSIMILVLSLVLI
        G+SS ATK SL  Q  SI+IL ++L L+
Subjt:  GNSSTATKSSLAFQCLSIMILVLSLVLI

TrEMBL top hitse value%identityAlignment
A0A6J1EES9 thaumatin-like protein 1 isoform X22.9e-14581.6Show/hide
Query:  MGLFCHFQHFNPFFSTLILLLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDC
        M L CHF+H   F S+L+LLLL KEVS ATF FVNKCDFTVWPGILAS+GSPKL TTGF+L+ GNSRSL+APTGWSGRFWGRT CDFDGSGRG CNTGDC
Subjt:  MGLFCHFQHFNPFFSTLILLLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDC

Query:  GSGEEECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGGACRSACEAFATAEYCCSGAFNTP
        GSGE ECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPM+VEGTGGSG CA+TGCIT+VNQLCP EL+AEGGGAC+SACEAFAT EYCCSGA+N+P
Subjt:  GSGEEECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGGACRSACEAFATAEYCCSGAFNTP

Query:  ATCRPSIYSEVFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSSPSRKSSRDSPPMVPEPTTEQSGSDSITG----SGPGSIQGAAFSSSWLADMAIG
        ATCRPSIYS +FKSACP+SYSYAYDDATSTFTC GADYTITFCPS+P     RDSPPMVPEPTTEQSGS+ ITG    SG  SIQGAA S+SWLADMAIG
Subjt:  ATCRPSIYSEVFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSSPSRKSSRDSPPMVPEPTTEQSGSDSITG----SGPGSIQGAAFSSSWLADMAIG

Query:  NSSTATKSSLAFQCLSIMILVLSLVL
        NSS  TKSSL FQ L+I+I +L LVL
Subjt:  NSSTATKSSLAFQCLSIMILVLSLVL

A0A6J1EKZ4 thaumatin-like protein 1 isoform X12.0e-14983.13Show/hide
Query:  MGLFCHFQHFNPFFSTLILLLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDC
        M L CHF+H   F S+L+LLLL KEVS ATF FVNKCDFTVWPGILAS+GSPKL TTGF+L+ GNSRSL+APTGWSGRFWGRT CDFDGSGRG CNTGDC
Subjt:  MGLFCHFQHFNPFFSTLILLLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDC

Query:  GSGEEECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGGACRSACEAFATAEYCCSGAFNTP
        GSGE ECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPM+VEGTGGSG CA+TGCIT+VNQLCP EL+AEGGGAC+SACEAFAT EYCCSGA+N+P
Subjt:  GSGEEECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGGACRSACEAFATAEYCCSGAFNTP

Query:  ATCRPSIYSEVFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSSPSRKSSRDSPPMVPEPTTEQSGSDSITG----SGPGSIQGAAFSSSWLADMAIG
        ATCRPSIYS +FKSACP+SYSYAYDDATSTFTC GADYTITFCPS+PSRKSSRDSPPMVPEPTTEQSGS+ ITG    SG  SIQGAA S+SWLADMAIG
Subjt:  ATCRPSIYSEVFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSSPSRKSSRDSPPMVPEPTTEQSGSDSITG----SGPGSIQGAAFSSSWLADMAIG

Query:  NSSTATKSSLAFQCLSIMILVLSLVL
        NSS  TKSSL FQ L+I+I +L LVL
Subjt:  NSSTATKSSLAFQCLSIMILVLSLVL

A0A6J1IMW8 thaumatin-like protein 1 isoform X21.5e-14177.95Show/hide
Query:  MGLFCHFQHFNPFFSTLILLLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDC
        MGLF  F H +PFFSTLI+LLL+KEVSGATFTFVNKC+FTVWPGILAS+GSPKL +TGF+L  G+SRS QAPT WSGRFWGRT C+ DGSGR  CNTGDC
Subjt:  MGLFCHFQHFNPFFSTLILLLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDC

Query:  GSGEEECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGGACRSACEAFATAEYCCSGAFNTP
        GSGE ECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPM++EGTGGSG C+STGCITD+NQLCPAEL+AEGGGAC+SACEAFA+ EYCCSG +N+P
Subjt:  GSGEEECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGGACRSACEAFATAEYCCSGAFNTP

Query:  ATCRPSIYSEVFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSSPSRKSSRDSPPMVPEPTTEQSGSDSITGSGP----GSIQGAAFSSSWLADMAIG
        ATCRPS+YSE+FKSACP+SYSYAYDDATSTFTCNGADYTITFCPS PS+KSSRDS PM+PEPTTEQ GS+  T  G     GS+QGA  S+SWLADMAIG
Subjt:  ATCRPSIYSEVFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSSPSRKSSRDSPPMVPEPTTEQSGSDSITGSGP----GSIQGAAFSSSWLADMAIG

Query:  NSSTATKSSLAFQCLS--IMILVLSLVLINE
         SS A K +LAFQ +S  I ++V+ LVL+ E
Subjt:  NSSTATKSSLAFQCLS--IMILVLSLVLINE

A0A6J1KNY0 thaumatin-like protein 1 isoform X24.7e-14380.37Show/hide
Query:  MGLFCHFQHFNPFFSTLILLLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDC
        M L CHF+H   F S+L+LLLL KEVS ATF FVNKCDFTVWPGILAS+GSPKL TTGF+L+ GNSRSL+APTGWSGRFWGRT CDFDGSGRG C+TGDC
Subjt:  MGLFCHFQHFNPFFSTLILLLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDC

Query:  GSGEEECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGGACRSACEAFATAEYCCSGAFNTP
         SGE ECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPM+VEGTGGSG CA+TGCIT+VNQLCP EL+AEGGGAC+SACEAFAT EYCCSGA+N+P
Subjt:  GSGEEECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGGACRSACEAFATAEYCCSGAFNTP

Query:  ATCRPSIYSEVFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSSPSRKSSRDSPPMVPEPTTEQSGSDSITG----SGPGSIQGAAFSSSWLADMAIG
        ATCRPSIYS +FKSACP+SYSYAYDDATSTFTC GADYTITFCPS+P     RDSPPMVPEPTTEQSGS+ +TG    SG  SI+GA  S+SWLADM IG
Subjt:  ATCRPSIYSEVFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSSPSRKSSRDSPPMVPEPTTEQSGSDSITG----SGPGSIQGAAFSSSWLADMAIG

Query:  NSSTATKSSLAFQCLSIMILVLSLVL
        NSS ATKSSL FQ L+I+I VL LVL
Subjt:  NSSTATKSSLAFQCLSIMILVLSLVL

A0A6J1KQK9 thaumatin-like protein 1 isoform X13.1e-14781.9Show/hide
Query:  MGLFCHFQHFNPFFSTLILLLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDC
        M L CHF+H   F S+L+LLLL KEVS ATF FVNKCDFTVWPGILAS+GSPKL TTGF+L+ GNSRSL+APTGWSGRFWGRT CDFDGSGRG C+TGDC
Subjt:  MGLFCHFQHFNPFFSTLILLLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDC

Query:  GSGEEECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGGACRSACEAFATAEYCCSGAFNTP
         SGE ECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPM+VEGTGGSG CA+TGCIT+VNQLCP EL+AEGGGAC+SACEAFAT EYCCSGA+N+P
Subjt:  GSGEEECNGAGATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGGACRSACEAFATAEYCCSGAFNTP

Query:  ATCRPSIYSEVFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSSPSRKSSRDSPPMVPEPTTEQSGSDSITG----SGPGSIQGAAFSSSWLADMAIG
        ATCRPSIYS +FKSACP+SYSYAYDDATSTFTC GADYTITFCPS+PSRKSSRDSPPMVPEPTTEQSGS+ +TG    SG  SI+GA  S+SWLADM IG
Subjt:  ATCRPSIYSEVFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSSPSRKSSRDSPPMVPEPTTEQSGSDSITG----SGPGSIQGAAFSSSWLADMAIG

Query:  NSSTATKSSLAFQCLSIMILVLSLVL
        NSS ATKSSL FQ L+I+I VL LVL
Subjt:  NSSTATKSSLAFQCLSIMILVLSLVL

SwissProt top hitse value%identityAlignment
A0A1P8B554 Thaumatin-like protein 15.4e-8868.42Show/hide
Query:  GATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDG-SGRGRCNTGDCGSGEEECNGAGATPPATLAEFTLGSG
        GAT T VN+C FTVWPGIL++SGS  + TTGF+L  G SRS QAP  WSGRFW RT C+F+  +G+G C TGDCGS + ECNGAGA PPATLAEFT+GSG
Subjt:  GATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDG-SGRGRCNTGDCGSGEEECNGAGATPPATLAEFTLGSG

Query:  ------SQDFYDVSLVDGYNLPMVVEGTGGS-GACASTGCITDVNQLCPAELRAEGGGACRSACEAFATAEYCCSGAFNTPATCRPSIYSEVFKSACPKS
               QDFYDVSLVDGYN+PM+VE +GGS G C +TGC+TD+NQ CP ELR   G AC+SACEAF + EYCCSGA+ +P  C+PS+YSE+FKSACP+S
Subjt:  ------SQDFYDVSLVDGYNLPMVVEGTGGS-GACASTGCITDVNQLCPAELRAEGGGACRSACEAFATAEYCCSGAFNTPATCRPSIYSEVFKSACPKS

Query:  YSYAYDDATSTFTCNGADYTITFCPSSP
        YSYA+DDATSTFTC  ADYTITFCPS P
Subjt:  YSYAYDDATSTFTCNGADYTITFCPSSP

O80327 Thaumatin-like protein 12.5e-6955.51Show/hide
Query:  ILLLLLKEVSG---ATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDCGSGEEECNGAGATP
        ++L+ L E +G   A FTF NKC  TVWPG L   G P+L +TGF+L +G S SL     WSGRFWGR+ C  D SG+ +C+TGDCGSG+  CNGAGA+P
Subjt:  ILLLLLKEVSG---ATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDCGSGEEECNGAGATP

Query:  PATLAEFTLG-SGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGG----ACRSACEAFATAEYCCSGAFNTPATCRPSIYS
        PA+L E TL  +G QDFYDVSLVDG+NLP+ +   GGSG C ST C  ++N +CPAEL  +G       C+SAC A    +YCC+GA+ TP TC P+ +S
Subjt:  PATLAEFTLG-SGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGG----ACRSACEAFATAEYCCSGAFNTPATCRPSIYS

Query:  EVFKSACPKSYSYAYDDATSTFTC-NGADYTITFCP
        +VFK+ CP++YSYAYDD +STFTC  G +Y ITFCP
Subjt:  EVFKSACPKSYSYAYDDATSTFTC-NGADYTITFCP

P50699 Thaumatin-like protein4.7e-6856.96Show/hide
Query:  LLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDCGSGEEECNGAGATPPATLA
        LLLL   S +T  F NKC   VWPGI  S+G   LA  GFKL    + SLQ P  WSGRFWGR  C FD SGRG C TGDCG G   CNGAG  PPATLA
Subjt:  LLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDCGSGEEECNGAGATPPATLA

Query:  EFTLGSGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPA--ELRAEGGG---ACRSACEAFATAEYCCSGAFNTPATCRPSIYSEVFKS
        E TLG    DFYDVSLVDGYNL M +    GSG C+  GC++D+NQ+CP   ++R+  G    AC+SAC AF + +YCC+G F  P +C+P+ YS++FK 
Subjt:  EFTLGSGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPA--ELRAEGGG---ACRSACEAFATAEYCCSGAFNTPATCRPSIYSEVFKS

Query:  ACPKSYSYAYDDATSTFTCNGADYTITFCP
        ACPK+YSYAYDD TS  TC+ A+Y +TFCP
Subjt:  ACPKSYSYAYDDATSTFTCNGADYTITFCP

P83332 Thaumatin-like protein 18.9e-6754.04Show/hide
Query:  TLILLLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDCGSGEEECNGAGATPP
        T + +L       A  TF NKC +TVWPG L     P+L+ TGF+L TG SRS+ AP+ WSGRF+GRT C  D SG+  C T DCGSG+  CNG GA PP
Subjt:  TLILLLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDCGSGEEECNGAGATPP

Query:  ATLAEFTLGS-GSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGG----ACRSACEAFATAEYCCSGAFNTPATCRPSIYSE
        ATL E T+ S G QDFYDVSLVDG+NLPM V   GG+G C ++ C  D+N++CPA L+ +G      AC+SAC AF   +YCC+   + P TC P  YS+
Subjt:  ATLAEFTLGS-GSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGG----ACRSACEAFATAEYCCSGAFNTPATCRPSIYSE

Query:  VFKSACPKSYSYAYDDATSTFTCNGAD-YTITFCP
        +FK+ CP++YSYAYDD +STFTC+G   Y ITFCP
Subjt:  VFKSACPKSYSYAYDDATSTFTCNGAD-YTITFCP

Q5DWG1 Pathogenesis-related thaumatin-like protein 3.51.1e-7259.49Show/hide
Query:  STLILLLLLKEV-SGAT-FTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDCGSGEEECNGAGA
        +TL +++L     +GAT FT VNKC +TVWPG L+ SGS  L   GF L  G S  L A + WSGRFWGRT C FD SG+G C TGDCG+    C  AG 
Subjt:  STLILLLLLKEV-SGAT-FTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDCGSGEEECNGAGA

Query:  TPPATLAEFTLGSGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGG---ACRSACEAFATAEYCCSGAFNTPATCRPSIYS
        TPP +LAEFTL  G +DFYDVSLVDGYN+P+ +   GG+G C + GC++D+   CPAEL     G   AC+SAC AF+T EYCC+G   +P TC PS YS
Subjt:  TPPATLAEFTLGSGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGG---ACRSACEAFATAEYCCSGAFNTPATCRPSIYS

Query:  EVFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSS
        +VFKSACP +YSYAYDDATSTFTC+ ADYTITFCPSS
Subjt:  EVFKSACPKSYSYAYDDATSTFTCNGADYTITFCPSS

Arabidopsis top hitse value%identityAlignment
AT1G20030.2 Pathogenesis-related thaumatin superfamily protein2.8e-7650.79Show/hide
Query:  FSTLILLLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDCGSGEEECNGAGAT
        F  L+  L    V   +FTF NKCD+TVWPGIL+++G   L TTGF L  G +R++ AP+ W GRFWGRT C  D  G+  C TGDCGSG+ EC+GAGA 
Subjt:  FSTLILLLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDCGSGEEECNGAGAT

Query:  PPATLAEFTL-GSGSQDFYDVSLVDGYNLPMVVEGTGGSGA-CASTGCITDVNQLCPAELRAEG-------GGACRSACEAFATAEYCCSGAFNTPATCR
        PPATLAEFTL GSG  DFYDVSLVDGYN+ M+V   GGSG  C+STGC+ D+N  CP+ELR            AC+SACEAF   EYCCSGAF +P TC+
Subjt:  PPATLAEFTL-GSGSQDFYDVSLVDGYNLPMVVEGTGGSGA-CASTGCITDVNQLCPAELRAEG-------GGACRSACEAFATAEYCCSGAFNTPATCR

Query:  PSIYSEVFKSACPKSYSYAYDDATSTFTC-NGADYTITFCPSSPSRKSSRDSPPMVPEPTTEQSGSDSITGSGPGSIQGAAFSSSWLADMAIGNSSTATK
        PS YS +FKSACP++YSYAYDD +STFTC    +Y ITFCPS           P     + E+  ++++T + P S  G+  SS  + + A+  SS  + 
Subjt:  PSIYSEVFKSACPKSYSYAYDDATSTFTC-NGADYTITFCPSSPSRKSSRDSPPMVPEPTTEQSGSDSITGSGPGSIQGAAFSSSWLADMAIGNSSTATK

Query:  SSLAFQCLSIMILVLSL
        S+      +I +++LSL
Subjt:  SSLAFQCLSIMILVLSL

AT1G75800.1 Pathogenesis-related thaumatin superfamily protein8.8e-7851.24Show/hide
Query:  PFFSTLILLLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDCGSGEEECNGAG
        P    +   L +  V   +F  VNKC++TVWPG+L+++G P L TTGF L+ G  R++ APT W GRFWGRT C  D  G+  C TGDCGSG  EC+G+G
Subjt:  PFFSTLILLLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDCGSGEEECNGAG

Query:  ATPPATLAEFTL-GSGSQDFYDVSLVDGYNLPMVVEGTGGSGA-CASTGCITDVNQLCPAELRA---EGGG----ACRSACEAFATAEYCCSGAFNTPAT
        ATPPATLAEFTL GS   DFYDVSLVDGYN+PM+V   GGSG  C+STGC+ D+N  CP+EL+    +G G     C+SACEAF T EYCCSGA  TP T
Subjt:  ATPPATLAEFTL-GSGSQDFYDVSLVDGYNLPMVVEGTGGSGA-CASTGCITDVNQLCPAELRA---EGGG----ACRSACEAFATAEYCCSGAFNTPAT

Query:  CRPSIYSEVFKSACPKSYSYAYDDATSTFTC-NGADYTITFCPS-SPSRKSSRDSPPMVPEPTTEQSGSDSITGSGPGSIQGAAFSSSWLADMAIGNSST
        C+PS YS +FK+ACP++YSYAYDD +STFTC    +Y ITFCP+ + S+KSS+D  P  P+PTT  +G+ S T +G  S   +   +S + + A+  +  
Subjt:  CRPSIYSEVFKSACPKSYSYAYDDATSTFTC-NGADYTITFCPS-SPSRKSSRDSPPMVPEPTTEQSGSDSITGSGPGSIQGAAFSSSWLADMAIGNSST

Query:  ATKSSLAFQCLSIMILVLSLVL
          K S +   LS+  + ++L L
Subjt:  ATKSSLAFQCLSIMILVLSLVL

AT4G24180.1 THAUMATIN-LIKE PROTEIN 13.8e-8968.42Show/hide
Query:  GATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDG-SGRGRCNTGDCGSGEEECNGAGATPPATLAEFTLGSG
        GAT T VN+C FTVWPGIL++SGS  + TTGF+L  G SRS QAP  WSGRFW RT C+F+  +G+G C TGDCGS + ECNGAGA PPATLAEFT+GSG
Subjt:  GATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDG-SGRGRCNTGDCGSGEEECNGAGATPPATLAEFTLGSG

Query:  ------SQDFYDVSLVDGYNLPMVVEGTGGS-GACASTGCITDVNQLCPAELRAEGGGACRSACEAFATAEYCCSGAFNTPATCRPSIYSEVFKSACPKS
               QDFYDVSLVDGYN+PM+VE +GGS G C +TGC+TD+NQ CP ELR   G AC+SACEAF + EYCCSGA+ +P  C+PS+YSE+FKSACP+S
Subjt:  ------SQDFYDVSLVDGYNLPMVVEGTGGS-GACASTGCITDVNQLCPAELRAEGGGACRSACEAFATAEYCCSGAFNTPATCRPSIYSEVFKSACPKS

Query:  YSYAYDDATSTFTCNGADYTITFCPSSP
        YSYA+DDATSTFTC  ADYTITFCPS P
Subjt:  YSYAYDDATSTFTCNGADYTITFCPSSP

AT4G38660.1 Pathogenesis-related thaumatin superfamily protein5.7e-10159.31Show/hide
Query:  STLILLLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDCGSGEEECNGAGATP
        S  +LLL      G+TFTF N+C +TVWPGIL+++GSP L+TTGF+L  G SRSLQAPTGWSGRFW RT C FD SG G C TGDCGS   EC G GA P
Subjt:  STLILLLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDCGSGEEECNGAGATP

Query:  PATLAEFTLGSGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGGACRSACEAFATAEYCCSGAFNTPATCRPSIYSEVFKS
        P TLAEFTLG+G  DFYDVSLVDGYN+PM+VE  GGSG CASTGC TD+N  CPAELR   G AC+SAC AF + EYCCSGA+ TP++CRPS+YSE+FK+
Subjt:  PATLAEFTLGSGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGGACRSACEAFATAEYCCSGAFNTPATCRPSIYSEVFKS

Query:  ACPKSYSYAYDDATSTFTCNGADYTITFCPSSPSRKSSRDSPPMVPEPTTEQSGSDSITGS------------GPGSIQGAAFS-------------SSW
        ACP+SYSYAYDDATSTFTC G DYT+TFCPSSPS+KS+  SPP+    +T Q GSD + GS            G G++ G+  +              SW
Subjt:  ACPKSYSYAYDDATSTFTCNGADYTITFCPSSPSRKSSRDSPPMVPEPTTEQSGSDSITGS------------GPGSIQGAAFS-------------SSW

Query:  LADMAIGNSSTATKSSL
        +A +A+G +S     SL
Subjt:  LADMAIGNSSTATKSSL

AT4G38660.2 Pathogenesis-related thaumatin superfamily protein4.4e-10160Show/hide
Query:  LLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDCGSGEEECNGAGATPPATLAEF
        +LK   G+TFTF N+C +TVWPGIL+++GSP L+TTGF+L  G SRSLQAPTGWSGRFW RT C FD SG G C TGDCGS   EC G GA PP TLAEF
Subjt:  LLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDCGSGEEECNGAGATPPATLAEF

Query:  TLGSGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGGACRSACEAFATAEYCCSGAFNTPATCRPSIYSEVFKSACPKSYS
        TLG+G  DFYDVSLVDGYN+PM+VE  GGSG CASTGC TD+N  CPAELR   G AC+SAC AF + EYCCSGA+ TP++CRPS+YSE+FK+ACP+SYS
Subjt:  TLGSGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGGACRSACEAFATAEYCCSGAFNTPATCRPSIYSEVFKSACPKSYS

Query:  YAYDDATSTFTCNGADYTITFCPSSPSRKSSRDSPPMVPEPTTEQSGSDSITGS------------GPGSIQGAAFS-------------SSWLADMAIG
        YAYDDATSTFTC G DYT+TFCPSSPS+KS+  SPP+    +T Q GSD + GS            G G++ G+  +              SW+A +A+G
Subjt:  YAYDDATSTFTCNGADYTITFCPSSPSRKSSRDSPPMVPEPTTEQSGSDSITGS------------GPGSIQGAAFS-------------SSWLADMAIG

Query:  NSSTATKSSL
         +S     SL
Subjt:  NSSTATKSSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGCTGTTCTGCCATTTCCAGCACTTCAATCCCTTCTTCTCCACCCTCATTCTGCTCCTTCTTCTCAAAGAGGTTTCCGGTGCTACATTCACGTTCGTTAACAAGTG
CGATTTCACCGTCTGGCCGGGAATCCTCGCCAGCTCCGGCAGCCCCAAACTGGCAACCACCGGTTTCAAGTTGGAAACCGGAAATTCCCGATCTCTACAAGCCCCGACGG
GCTGGTCCGGTCGCTTCTGGGGCCGAACCGCCTGCGATTTCGACGGCTCCGGGCGAGGACGCTGCAACACCGGCGACTGCGGCTCCGGCGAGGAGGAGTGCAACGGCGCC
GGAGCCACGCCGCCGGCGACCTTAGCCGAATTCACCCTCGGCTCGGGGTCGCAGGACTTTTACGACGTGAGCCTGGTCGACGGCTACAATTTGCCGATGGTGGTGGAAGG
AACGGGCGGGTCGGGGGCGTGCGCGTCGACGGGGTGCATTACGGACGTGAACCAGCTGTGCCCGGCGGAGCTGAGGGCGGAGGGAGGCGGCGCGTGCAGAAGCGCGTGCG
AGGCGTTTGCGACGGCGGAGTACTGCTGCAGCGGCGCTTTCAACACGCCGGCGACTTGCCGGCCGTCGATTTACTCGGAAGTGTTCAAATCGGCGTGCCCCAAATCGTAT
AGCTACGCGTATGACGACGCCACCAGCACATTCACTTGTAATGGAGCTGATTACACCATCACATTTTGCCCTTCTTCTCCAAGTCGAAAATCTTCGAGGGATTCGCCGCC
AATGGTTCCAGAGCCAACCACAGAGCAATCGGGATCAGATTCAATTACTGGGTCGGGACCGGGATCAATACAAGGGGCAGCTTTTAGTAGTTCATGGCTTGCAGACATGG
CCATAGGGAACTCTTCCACTGCTACAAAATCAAGCTTGGCATTTCAATGTTTGTCAATTATGATCTTGGTTCTATCTCTAGTTCTGATCAATGAATCCAAAAAATCATAG
mRNA sequenceShow/hide mRNA sequence
CGCCTCTCCAATCCTTTTACTTTCTATCTTTCCATTTTATTCCCTCTTTTCATTTTCATTAAATTATTTACATTATTAATCTCCATCTATATTCTTCATCCCCCATTCCC
CATTCCCCATTCCATTTCTCAAACCCCATTTCGTAAAATTCATCTTAACCCATTCTTTCATTTCTGTACAAACACTCTGATTTAGCTTCAAACACTGCATTTTCAATGGG
GCTGTTCTGCCATTTCCAGCACTTCAATCCCTTCTTCTCCACCCTCATTCTGCTCCTTCTTCTCAAAGAGGTTTCCGGTGCTACATTCACGTTCGTTAACAAGTGCGATT
TCACCGTCTGGCCGGGAATCCTCGCCAGCTCCGGCAGCCCCAAACTGGCAACCACCGGTTTCAAGTTGGAAACCGGAAATTCCCGATCTCTACAAGCCCCGACGGGCTGG
TCCGGTCGCTTCTGGGGCCGAACCGCCTGCGATTTCGACGGCTCCGGGCGAGGACGCTGCAACACCGGCGACTGCGGCTCCGGCGAGGAGGAGTGCAACGGCGCCGGAGC
CACGCCGCCGGCGACCTTAGCCGAATTCACCCTCGGCTCGGGGTCGCAGGACTTTTACGACGTGAGCCTGGTCGACGGCTACAATTTGCCGATGGTGGTGGAAGGAACGG
GCGGGTCGGGGGCGTGCGCGTCGACGGGGTGCATTACGGACGTGAACCAGCTGTGCCCGGCGGAGCTGAGGGCGGAGGGAGGCGGCGCGTGCAGAAGCGCGTGCGAGGCG
TTTGCGACGGCGGAGTACTGCTGCAGCGGCGCTTTCAACACGCCGGCGACTTGCCGGCCGTCGATTTACTCGGAAGTGTTCAAATCGGCGTGCCCCAAATCGTATAGCTA
CGCGTATGACGACGCCACCAGCACATTCACTTGTAATGGAGCTGATTACACCATCACATTTTGCCCTTCTTCTCCAAGTCGAAAATCTTCGAGGGATTCGCCGCCAATGG
TTCCAGAGCCAACCACAGAGCAATCGGGATCAGATTCAATTACTGGGTCGGGACCGGGATCAATACAAGGGGCAGCTTTTAGTAGTTCATGGCTTGCAGACATGGCCATA
GGGAACTCTTCCACTGCTACAAAATCAAGCTTGGCATTTCAATGTTTGTCAATTATGATCTTGGTTCTATCTCTAGTTCTGATCAATGAATCCAAAAAATCATAGATTAT
GAACCATTTACTCCAGAGTTTACGAAACTTTATGGTTCATGTCTGTACAAAATTTAAGGCAAAAACATGTTGCTTTTTTTCATTTAAGAGATAGTGAAAAAAAAAAAAAA
AAAGCATTTGATTTGGTTTTAAAGATTTGTTGAATTTGGATGATTCCAAAGATTGAAAGGCAATTTACAAGTGGAATTTTGGAATATCATAGCCACAAAGTTGACACCTT
TGTATGACCATGGCATAATGGATGCATTTTTTATGTTCTTATTTGTTGGGCGTTGGCAAGTGGCCTACACAAAAATGACTCAAAT
Protein sequenceShow/hide protein sequence
MGLFCHFQHFNPFFSTLILLLLLKEVSGATFTFVNKCDFTVWPGILASSGSPKLATTGFKLETGNSRSLQAPTGWSGRFWGRTACDFDGSGRGRCNTGDCGSGEEECNGA
GATPPATLAEFTLGSGSQDFYDVSLVDGYNLPMVVEGTGGSGACASTGCITDVNQLCPAELRAEGGGACRSACEAFATAEYCCSGAFNTPATCRPSIYSEVFKSACPKSY
SYAYDDATSTFTCNGADYTITFCPSSPSRKSSRDSPPMVPEPTTEQSGSDSITGSGPGSIQGAAFSSSWLADMAIGNSSTATKSSLAFQCLSIMILVLSLVLINESKKS