CuGenDBv2

Tan0018898 (gene) of Snake gourd v1 genome

Gene ID: Tan0018898
Organism: Trichosanthes anguina (Snake gourd v1)
Description: thaumatin-like protein 1
Genome location: LG01:27778140..27780970
RNA-Seq Expression: Tan0018898
Synteny: Tan0018898
Gene Ontology terms:
  GO:0006952 - defense response (biological process)
  GO:0016021 - integral component of membrane (cellular component)
InterPro domains:
  IPR001938 - Thaumatin family
  IPR017949 - Thaumatin, conserved site
  IPR037176 - Osmotin/thaumatin-like superfamily
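The genome location in this record is written as `sequence:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention, but not stated on this page), the string can be parsed and the span length computed as shown below. `Location` and `parse_location` are hypothetical helper names introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints count.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string such as 'LG01:27778140..27780970'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("LG01:27778140..27780970")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under that assumption the gene spans more bases than the mRNA contains, which would be consistent with introns, although the page itself does not list the exon structure.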


Homology
GenBank top hits (e value, % identity, alignment)
KAG7028202.1 hypothetical protein SDJN02_09382 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 1.3e-155 | % identity: 86.93
Query:  MDLFENHSVSCFIFWLFSLLLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDC
        MDLF NHSVSC +F LF LL +S GAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKG +R+FQAP  WSGRFWGRTGCSFDGAGRGAC TGDC
Subjt:  MDLFENHSVSCFIFWLFSLLLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDC

Query:  GSGEIECNGAGAAPPATLAEFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKVANGGACSSACDKFGTPEYCCSGAY
        GSGEIECNG GAAPPATL EFTLGSG+A S DFYD SLVDGYNLP+IVEGTGGTGACGSTGCVTDLNRQCPAELKVANGGACSSAC+KFGTPEYCCSGAY
Subjt:  GSGEIECNGAGAAPPATLAEFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKVANGGACSSACDKFGTPEYCCSGAY

Query:  GSPATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSPSLKSSTDSPTKTAGAGSATGGGGGGAVVGQMTPLPENSWLADLAIGGSTR
        GSPATCKP+KYS IFKSACPRSYSYAYDDATSTFTCS ADYTVTFC SSPSLKSSTDSP K AGAG+AT    GG VVGQM PLPENSWLAD+AIG STR
Subjt:  GSPATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSPSLKSSTDSPTKTAGAGSATGGGGGGAVVGQMTPLPENSWLADLAIGGSTR

Query:  STHPDLAFLLFVLIFGSSFFFSFLNLFSS
         THP   FLLFVLIFGSSFFFS+ NLFSS
Subjt:  STHPDLAFLLFVLIFGSSFFFSFLNLFSS

XP_022949172.1 thaumatin-like protein 1 [Cucurbita moschata] | e value: 7.6e-156 | % identity: 86.63
Query:  MDLFENHSVSCFIFWLFSLLLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDC
        MDLF NHSVSC +F LF LL +S GAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKG +R+FQAP  WSGRFWGRTGCSFDGAG GAC TGDC
Subjt:  MDLFENHSVSCFIFWLFSLLLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDC

Query:  GSGEIECNGAGAAPPATLAEFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKVANGGACSSACDKFGTPEYCCSGAY
        GSGEIECNG GAAPPATL EFTLGSG+A S DFYD SLVDGYNLP+IVEGTGGTGACGSTGCVTDLNRQCPAELKVANGGACSSAC+KFGTPEYCCSGAY
Subjt:  GSGEIECNGAGAAPPATLAEFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKVANGGACSSACDKFGTPEYCCSGAY

Query:  GSPATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSPSLKSSTDSPTKTAGAGSATGGGGGGAVVGQMTPLPENSWLADLAIGGSTR
        GSPATCKP+KYS IFKSACPRSYSYAYDD TSTFTCSGADYTVTFCPSSPSLKSSTDSP K AGAG+AT    GG VVGQM PLPENSWLAD+AIG STR
Subjt:  GSPATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSPSLKSSTDSPTKTAGAGSATGGGGGGAVVGQMTPLPENSWLADLAIGGSTR

Query:  STHPDLAFLLFVLIFGSSFFFSFLNLFSS
         THP   FL+FVLIFGSSFFFS+ NLFSS
Subjt:  STHPDLAFLLFVLIFGSSFFFSFLNLFSS

XP_023005926.1 thaumatin-like protein 1 [Cucurbita maxima] | e value: 1.2e-156 | % identity: 87.23
Query:  MDLFENHSVSCFIFWLFSLLLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDC
        MDLF NHSVSC IF LF LL +  GAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKG +R+FQAPA WSGRFWGRTGCSFDGAGRGAC+TGDC
Subjt:  MDLFENHSVSCFIFWLFSLLLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDC

Query:  GSGEIECNGAGAAPPATLAEFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKVANGGACSSACDKFGTPEYCCSGAY
        GSGEIECNG GAAPPATL EFTLGSG+A S DFYD SLVDGYNLP+IVEGTGGTGACGSTGCVTDLNRQCP ELKVANGGACSSAC+KFGTPEYCCSGAY
Subjt:  GSGEIECNGAGAAPPATLAEFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKVANGGACSSACDKFGTPEYCCSGAY

Query:  GSPATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSPSLKSSTDSPTKTAGAGSATGGGGGGAVVGQMTPLPENSWLADLAIGGSTR
        GSPATCKP+KYS IFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSPSLKSSTDSP K AGAG+A+    GG VVGQM PLPENSWLAD+AIG STR
Subjt:  GSPATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSPSLKSSTDSPTKTAGAGSATGGGGGGAVVGQMTPLPENSWLADLAIGGSTR

Query:  STHPDLAFLLFVLIFGSSFFFSFLNLFSS
         THP   FLLFVLIFGSSFFFS+ NLFSS
Subjt:  STHPDLAFLLFVLIFGSSFFFSFLNLFSS

XP_023539979.1 thaumatin-like protein 1 [Cucurbita pepo subsp. pepo] | e value: 2.8e-158 | % identity: 88.15
Query:  MDLFENHSVSCFIFWLFSLLLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDC
        MDLF NHSVSC IF+ F LL +S GAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKG +R+FQAPA WSGRFWGRTGCSFDGAGRGAC TGDC
Subjt:  MDLFENHSVSCFIFWLFSLLLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDC

Query:  GSGEIECNGAGAAPPATLAEFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKVANGGACSSACDKFGTPEYCCSGAY
        GSGEIECNG GAAPPATL EFTLGSG+A S DFYD SLVDGYNLP+IVEGTGGTGACGSTGCVTDLNRQCPAELKVANGGACSSAC+KFGTPEYCCSGAY
Subjt:  GSGEIECNGAGAAPPATLAEFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKVANGGACSSACDKFGTPEYCCSGAY

Query:  GSPATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSPSLKSSTDSPTKTAGAGSATGGGGGGAVVGQMTPLPENSWLADLAIGGSTR
        GSPATCKP+KYS IFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSPSLKSSTDSP KTAGAG+AT    GG VVGQM PLPENSWLAD+AIG STR
Subjt:  GSPATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSPSLKSSTDSPTKTAGAGSATGGGGGGAVVGQMTPLPENSWLADLAIGGSTR

Query:  STHPDLAFLLFVLIFGSSFFFSFLNLFSS
         THP   FLLFVLIFGSSFFFS+ NLFSS
Subjt:  STHPDLAFLLFVLIFGSSFFFSFLNLFSS

XP_038876615.1 thaumatin-like protein 1 isoform X1 [Benincasa hispida] | e value: 3.8e-155 | % identity: 87.61
Query:  MDLFENHSVSCFIFWLFSLLLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDC
        MDLF N SVSC IF +FS+LL+ HGAFGAKFTFVNKCDFTVWPGILSGAGSLKF+TTGFELRKGSS+SFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDC
Subjt:  MDLFENHSVSCFIFWLFSLLLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDC

Query:  GSGEIECNGAGAAPPATLAEFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKVANGGACSSACDKFGTPEYCCSGAY
        GSGEIECNGAGAAPPATLAEFTLGSG A+S DFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELK  NGGAC SAC+KFGTPEYCCSGAY
Subjt:  GSGEIECNGAGAAPPATLAEFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKVANGGACSSACDKFGTPEYCCSGAY

Query:  GSPATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSPSLKSSTDSPTKTA-GAGSATGGGGGGAVVGQMTPLPENSWLADLAIGGST
        GSPATCKP+KYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSPSLKSSTDS  +TA G G+ T G      V Q T LPENSWLADLAIG ST
Subjt:  GSPATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSPSLKSSTDSPTKTA-GAGSATGGGGGGAVVGQMTPLPENSWLADLAIGGST

Query:  RSTHP-DLAFLLFVLIFGSSFFFSFLNLFSS
        R THP DL+FLLFVLIFGSSFFFSFL+ FSS
Subjt:  RSTHP-DLAFLLFVLIFGSSFFFSFLNLFSS

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BJX4 thaumatin-like protein 1 | e value: 6.7e-150 | % identity: 83.94
Query:  MDLFENHSVSCFIFWLFSLLLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDC
        MDLF NHSVSC IF  FS+LL  HGAFGAKFTFVNKCDFTVWPG+LSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRT C+FDG+GRG C TGDC
Subjt:  MDLFENHSVSCFIFWLFSLLLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDC

Query:  GSGEIECNGAGAAPPATLAEFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKVANGGACSSACDKFGTPEYCCSGAY
        GSGEIECNGAGAAPPATLAEFTLGSG   S DFYDVSLVDGYNL MIVEGTGGTGACGSTGCVTDLNRQCP EL+   GGAC SAC+KFGTPEYCCSGAY
Subjt:  GSGEIECNGAGAAPPATLAEFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKVANGGACSSACDKFGTPEYCCSGAY

Query:  GSPATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSPSLKSSTDSPTKTAGAGSATGGGGGGAVVGQMTPLPENSWLADLAIGGSTR
        GSPATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSPSLKSSTDS  +T    + T G     VV QMTPLP++SW+AD+AIGGSTR
Subjt:  GSPATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSPSLKSSTDSPTKTAGAGSATGGGGGGAVVGQMTPLPENSWLADLAIGGSTR

Query:  STHPDLAFLLFVLIFGSSF-FFSFLNLFSS
        +   DL+FLLFVLIFGSS  FFS LNLFSS
Subjt:  STHPDLAFLLFVLIFGSSF-FFSFLNLFSS

A0A6J1CUS7 thaumatin-like protein 1b | e value: 1.5e-141 | % identity: 80.24
Query:  MDLFENHSVSCFIFWLFSLLLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDC
        M+ F ++SVS  +FW+FSLLL+SHGA GAKFTFVNKCDFTVWPGILSGAGSLK DTTGFELR+G SRSFQAPAGWSGRFWGRT C+FDGAGRG C TGDC
Subjt:  MDLFENHSVSCFIFWLFSLLLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDC

Query:  GSGEIECNGAGAAPPATLAEFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKVANGGACSSACDKFGTPEYCCSGAY
        GSGEIECNGAGAAPPATLAEFTLG     S DFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCP ELK  NGGAC SAC+KFGT EYCCSGAY
Subjt:  GSGEIECNGAGAAPPATLAEFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKVANGGACSSACDKFGTPEYCCSGAY

Query:  GSPATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSPSLKSSTDSPTKTAGAGSATGGGGGGAVVGQMTPLPENSWLADLAIGGSTR
        GSPATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSPSLKSSTDS   TA   +A+ G    A       LPENSWL DLA+G S+R
Subjt:  GSPATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSPSLKSSTDSPTKTAGAGSATGGGGGGAVVGQMTPLPENSWLADLAIGGSTR

Query:  STHPDLAFLLFVLIFGSSFFFSFLNLFSS
           PDL FL+ VLI GSSFF S+LNL  S
Subjt:  STHPDLAFLLFVLIFGSSFFFSFLNLFSS

A0A6J1GBA8 thaumatin-like protein 1 | e value: 3.7e-156 | % identity: 86.63
Query:  MDLFENHSVSCFIFWLFSLLLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDC
        MDLF NHSVSC +F LF LL +S GAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKG +R+FQAP  WSGRFWGRTGCSFDGAG GAC TGDC
Subjt:  MDLFENHSVSCFIFWLFSLLLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDC

Query:  GSGEIECNGAGAAPPATLAEFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKVANGGACSSACDKFGTPEYCCSGAY
        GSGEIECNG GAAPPATL EFTLGSG+A S DFYD SLVDGYNLP+IVEGTGGTGACGSTGCVTDLNRQCPAELKVANGGACSSAC+KFGTPEYCCSGAY
Subjt:  GSGEIECNGAGAAPPATLAEFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKVANGGACSSACDKFGTPEYCCSGAY

Query:  GSPATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSPSLKSSTDSPTKTAGAGSATGGGGGGAVVGQMTPLPENSWLADLAIGGSTR
        GSPATCKP+KYS IFKSACPRSYSYAYDD TSTFTCSGADYTVTFCPSSPSLKSSTDSP K AGAG+AT    GG VVGQM PLPENSWLAD+AIG STR
Subjt:  GSPATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSPSLKSSTDSPTKTAGAGSATGGGGGGAVVGQMTPLPENSWLADLAIGGSTR

Query:  STHPDLAFLLFVLIFGSSFFFSFLNLFSS
         THP   FL+FVLIFGSSFFFS+ NLFSS
Subjt:  STHPDLAFLLFVLIFGSSFFFSFLNLFSS

A0A6J1KWD3 thaumatin-like protein 1 | e value: 5.7e-157 | % identity: 87.23
Query:  MDLFENHSVSCFIFWLFSLLLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDC
        MDLF NHSVSC IF LF LL +  GAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKG +R+FQAPA WSGRFWGRTGCSFDGAGRGAC+TGDC
Subjt:  MDLFENHSVSCFIFWLFSLLLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDC

Query:  GSGEIECNGAGAAPPATLAEFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKVANGGACSSACDKFGTPEYCCSGAY
        GSGEIECNG GAAPPATL EFTLGSG+A S DFYD SLVDGYNLP+IVEGTGGTGACGSTGCVTDLNRQCP ELKVANGGACSSAC+KFGTPEYCCSGAY
Subjt:  GSGEIECNGAGAAPPATLAEFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKVANGGACSSACDKFGTPEYCCSGAY

Query:  GSPATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSPSLKSSTDSPTKTAGAGSATGGGGGGAVVGQMTPLPENSWLADLAIGGSTR
        GSPATCKP+KYS IFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSPSLKSSTDSP K AGAG+A+    GG VVGQM PLPENSWLAD+AIG STR
Subjt:  GSPATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSPSLKSSTDSPTKTAGAGSATGGGGGGAVVGQMTPLPENSWLADLAIGGSTR

Query:  STHPDLAFLLFVLIFGSSFFFSFLNLFSS
         THP   FLLFVLIFGSSFFFS+ NLFSS
Subjt:  STHPDLAFLLFVLIFGSSFFFSFLNLFSS

E5RDD8 Zeamatin | e value: 6.7e-150 | % identity: 83.94
Query:  MDLFENHSVSCFIFWLFSLLLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDC
        MDLF NHSVSC IF  FS+LL  HGAFGAKFTFVNKCDFTVWPG+LSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRT C+FDG+GRG C TGDC
Subjt:  MDLFENHSVSCFIFWLFSLLLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDC

Query:  GSGEIECNGAGAAPPATLAEFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKVANGGACSSACDKFGTPEYCCSGAY
        GSGEIECNGAGAAPPATLAEFTLGSG   S DFYDVSLVDGYNL MIVEGTGGTGACGSTGCVTDLNRQCP EL+   GGAC SAC+KFGTPEYCCSGAY
Subjt:  GSGEIECNGAGAAPPATLAEFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKVANGGACSSACDKFGTPEYCCSGAY

Query:  GSPATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSPSLKSSTDSPTKTAGAGSATGGGGGGAVVGQMTPLPENSWLADLAIGGSTR
        GSPATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSPSLKSSTDS  +T    + T G     VV QMTPLP++SW+AD+AIGGSTR
Subjt:  GSPATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSPSLKSSTDSPTKTAGAGSATGGGGGGAVVGQMTPLPENSWLADLAIGGSTR

Query:  STHPDLAFLLFVLIFGSSF-FFSFLNLFSS
        +   DL+FLLFVLIFGSS  FFS LNLFSS
Subjt:  STHPDLAFLLFVLIFGSSF-FFSFLNLFSS

SwissProt top hits (e value, % identity, alignment)
A0A1P8B554 Thaumatin-like protein 1 | e value: 6.8e-91 | % identity: 66.8
Query:  HSVSCFIFWLFSLLLLSH----GAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDG-AGRGACTTGDCG
        HS   F F + S L        G+ GA  T VN+C FTVWPGILS +GS    TTGFEL  G SRSFQAPA WSGRFW RTGC+F+   G+G C TGDCG
Subjt:  HSVSCFIFWLFSLLLLSH----GAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDG-AGRGACTTGDCG

Query:  SGEIECNGAGAAPPATLAEFTLGSG---AAASLDFYDVSLVDGYNLPMIVEGTGGT-GACGSTGCVTDLNRQCPAELKVANGGACSSACDKFGTPEYCCS
        S ++ECNGAGA PPATLAEFT+GSG    A   DFYDVSLVDGYN+PM+VE +GG+ G C +TGCVTDLN++CP EL+  +G AC SAC+ FG+PEYCCS
Subjt:  SGEIECNGAGAAPPATLAEFTLGSG---AAASLDFYDVSLVDGYNLPMIVEGTGGT-GACGSTGCVTDLNRQCPAELKVANGGACSSACDKFGTPEYCCS

Query:  GAYGSPATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSP
        GAY SP  CKPS YSEIFKSACPRSYSYA+DDATSTFTC+ ADYT+TFCPS P
Subjt:  GAYGSPATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSP

O80327 Thaumatin-like protein 1 | e value: 4.7e-68 | % identity: 55.7
Query:  GAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDCGSGEIECNGAGAAPPATLAEFTLG
        G + AKFTF NKC  TVWPG L+G G  +  +TGFEL  G+S S    A WSGRFWGR+ CS D +G+  C+TGDCGSG+I CNGAGA+PPA+L E TL 
Subjt:  GAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDCGSGEIECNGAGAAPPATLAEFTLG

Query:  SGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAEL--KVANGG--ACSSACDKFGTPEYCCSGAYGSPATCKPSKYSEIFKSACP
        +      DFYDVSLVDG+NLP+ +   GG+G C ST C  ++N  CPAEL  K ++G    C SAC     P+YCC+GAYG+P TC P+ +S++FK+ CP
Subjt:  SGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAEL--KVANGG--ACSSACDKFGTPEYCCSGAYGSPATCKPSKYSEIFKSACP

Query:  RSYSYAYDDATSTFTC-SGADYTVTFCP
        ++YSYAYDD +STFTC  G +Y +TFCP
Subjt:  RSYSYAYDDATSTFTC-SGADYTVTFCP

P50699 Thaumatin-like protein | e value: 2.6e-66 | % identity: 55.92
Query:  SVSCFIFWLFSLLLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDCGSGEIEC
        S++ F+F    LLLLSH A  +   F NKC   VWPGI   AG       GF+L    + S Q P  WSGRFWGR GC+FD +GRG C TGDCG G + C
Subjt:  SVSCFIFWLFSLLLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDCGSGEIEC

Query:  NGAGAAPPATLAEFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKV--ANGG---ACSSACDKFGTPEYCCSGAYGS
        NGAG  PPATLAE TLG      LDFYDVSLVDGYNL M +    G+G C   GCV+DLN+ CP  L+V   NG    AC SAC  F +P+YCC+G +G+
Subjt:  NGAGAAPPATLAEFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKV--ANGG---ACSSACDKFGTPEYCCSGAYGS

Query:  PATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCP
        P +CKP+ YS+IFK ACP++YSYAYDD TS  TCS A+Y VTFCP
Subjt:  PATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCP

P83332 Thaumatin-like protein 1 | e value: 5.8e-66 | % identity: 54.94
Query:  LLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDCGSGEIECNGAGAAPPATLA
        +L   GA  AK TF NKC +TVWPG L+G    +   TGFEL  G SRS  AP+ WSGRF+GRT CS D +G+  C T DCGSG++ CNG GAAPPATL 
Subjt:  LLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDCGSGEIECNGAGAAPPATLA

Query:  EFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKVANGG----ACSSACDKFGTPEYCCSGAYGSPATCKPSKYSEIF
        E T+ S      DFYDVSLVDG+NLPM V   GGTG C ++ C  D+N+ CPA L+V        AC SAC  F  P+YCC+     P TC P  YS++F
Subjt:  EFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKVANGG----ACSSACDKFGTPEYCCSGAYGSPATCKPSKYSEIF

Query:  KSACPRSYSYAYDDATSTFTCSGAD-YTVTFCP
        K+ CP++YSYAYDD +STFTCSG   Y +TFCP
Subjt:  KSACPRSYSYAYDDATSTFTCSGAD-YTVTFCP

Q5DWG1 Pathogenesis-related thaumatin-like protein 3.5 | e value: 5.2e-75 | % identity: 62.61
Query:  FTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDCGSGEIECNGAGAAPPATLAEFTLGSGAAAS
        FT VNKC +TVWPG LSG+GS      GF L  G S    A + WSGRFWGRT CSFD +G+G+C TGDCG+  + C  AG  PP +LAEFTLG      
Subjt:  FTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDCGSGEIECNGAGAAPPATLAEFTLGSGAAAS

Query:  LDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKVANGG---ACSSACDKFGTPEYCCSGAYGSPATCKPSKYSEIFKSACPRSYSYAY
         DFYDVSLVDGYN+P+ +   GGTG C + GCV+DL   CPAEL V + G   AC SAC  F TPEYCC+G +GSP TC PSKYS++FKSACP +YSYAY
Subjt:  LDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKVANGG---ACSSACDKFGTPEYCCSGAYGSPATCKPSKYSEIFKSACPRSYSYAY

Query:  DDATSTFTCSGADYTVTFCPSS
        DDATSTFTCS ADYT+TFCPSS
Subjt:  DDATSTFTCSGADYTVTFCPSS

Arabidopsis top hits (e value, % identity, alignment)
AT4G24180.1 THAUMATIN-LIKE PROTEIN 1 | e value: 4.8e-92 | % identity: 66.8
Query:  HSVSCFIFWLFSLLLLSH----GAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDG-AGRGACTTGDCG
        HS   F F + S L        G+ GA  T VN+C FTVWPGILS +GS    TTGFEL  G SRSFQAPA WSGRFW RTGC+F+   G+G C TGDCG
Subjt:  HSVSCFIFWLFSLLLLSH----GAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDG-AGRGACTTGDCG

Query:  SGEIECNGAGAAPPATLAEFTLGSG---AAASLDFYDVSLVDGYNLPMIVEGTGGT-GACGSTGCVTDLNRQCPAELKVANGGACSSACDKFGTPEYCCS
        S ++ECNGAGA PPATLAEFT+GSG    A   DFYDVSLVDGYN+PM+VE +GG+ G C +TGCVTDLN++CP EL+  +G AC SAC+ FG+PEYCCS
Subjt:  SGEIECNGAGAAPPATLAEFTLGSG---AAASLDFYDVSLVDGYNLPMIVEGTGGT-GACGSTGCVTDLNRQCPAELKVANGGACSSACDKFGTPEYCCS

Query:  GAYGSPATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSP
        GAY SP  CKPS YSEIFKSACPRSYSYA+DDATSTFTC+ ADYT+TFCPS P
Subjt:  GAYGSPATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSP

AT4G36010.1 Pathogenesis-related thaumatin superfamily protein | e value: 6.5e-81 | % identity: 60.24
Query:  WLFSLLLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDG-AGRGACTTGDCGSGEIECNGAGAA
        +L  +L+  +G     FT VN+C +TVWPG+LSGAG+    TTGF L    +R    PA WSGR WGRT C+ D   GR  C TGDCGS  +EC+G+GAA
Subjt:  WLFSLLLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDG-AGRGACTTGDCGSGEIECNGAGAA

Query:  PPATLAEFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGG------TGACGSTGCVTDLNRQCPAELKVANGG----ACSSACDKFGTPEYCCSGAYGSP
        PPATLAEFTL    A  LDFYDVSLVDGYN+PM +   GG       G C +TGCV +LN  CPA+LKVA  G    AC SAC+ FGTPEYCCSGA+G+P
Subjt:  PPATLAEFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGG------TGACGSTGCVTDLNRQCPAELKVANGG----ACSSACDKFGTPEYCCSGAYGSP

Query:  ATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPS-SPSLKSST
         TCKPS+YS+ FK+ACPR+YSYAYDD TSTFTC GADY +TFCPS +PS+KS+T
Subjt:  ATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPS-SPSLKSST

AT4G36010.2 Pathogenesis-related thaumatin superfamily protein | e value: 6.5e-81 | % identity: 60.24
Query:  WLFSLLLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDG-AGRGACTTGDCGSGEIECNGAGAA
        +L  +L+  +G     FT VN+C +TVWPG+LSGAG+    TTGF L    +R    PA WSGR WGRT C+ D   GR  C TGDCGS  +EC+G+GAA
Subjt:  WLFSLLLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDG-AGRGACTTGDCGSGEIECNGAGAA

Query:  PPATLAEFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGG------TGACGSTGCVTDLNRQCPAELKVANGG----ACSSACDKFGTPEYCCSGAYGSP
        PPATLAEFTL    A  LDFYDVSLVDGYN+PM +   GG       G C +TGCV +LN  CPA+LKVA  G    AC SAC+ FGTPEYCCSGA+G+P
Subjt:  PPATLAEFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGG------TGACGSTGCVTDLNRQCPAELKVANGG----ACSSACDKFGTPEYCCSGAYGSP

Query:  ATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPS-SPSLKSST
         TCKPS+YS+ FK+ACPR+YSYAYDD TSTFTC GADY +TFCPS +PS+KS+T
Subjt:  ATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPS-SPSLKSST

AT4G38660.1 Pathogenesis-related thaumatin superfamily protein | e value: 6.3e-100 | % identity: 56.73
Query:  MDLFENHSVSCFIFWLFSLLLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDC
        M+L    + S        LLL S G++G+ FTF N+C +TVWPGILS AGS    TTGFEL KG+SRS QAP GWSGRFW RTGC FD +G G C TGDC
Subjt:  MDLFENHSVSCFIFWLFSLLLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDC

Query:  GSGEIECNGAGAAPPATLAEFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKVANGGACSSACDKFGTPEYCCSGAY
        GS  +EC G GAAPP TLAEFTLG+G     DFYDVSLVDGYN+PMIVE  GG+G C STGC TDLN QCPAEL+  +G AC SAC  F +PEYCCSGAY
Subjt:  GSGEIECNGAGAAPPATLAEFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKVANGGACSSACDKFGTPEYCCSGAY

Query:  GSPATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSPSLKSS------TDSPTKTAGAGSATGGGGGGAVVGQMTP-----------
         +P++C+PS YSE+FK+ACPRSYSYAYDDATSTFTC+G DYTVTFCPSSPS KS+      TDS + + G+    G   G A  GQ TP           
Subjt:  GSPATCKPSKYSEIFKSACPRSYSYAYDDATSTFTCSGADYTVTFCPSSPSLKSS------TDSPTKTAGAGSATGGGGGGAVVGQMTP-----------

Query:  ----------LPENSWLADLAIGGSTRSTHPDLAFLLFVLIFGSSFFFS
                  L + SW+A LA+G ++R     L  LL    F   F FS
Subjt:  ----------LPENSWLADLAIGGSTRSTHPDLAFLLFVLIFGSSFFFS

AT4G38660.2 Pathogenesis-related thaumatin superfamily protein | e value: 9.1e-99 | % identity: 58.77
Query:  GAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDCGSGEIECNGAGAAPPATLAEFTLG
        G++G+ FTF N+C +TVWPGILS AGS    TTGFEL KG+SRS QAP GWSGRFW RTGC FD +G G C TGDCGS  +EC G GAAPP TLAEFTLG
Subjt:  GAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDCGSGEIECNGAGAAPPATLAEFTLG

Query:  SGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKVANGGACSSACDKFGTPEYCCSGAYGSPATCKPSKYSEIFKSACPRSYS
        +G     DFYDVSLVDGYN+PMIVE  GG+G C STGC TDLN QCPAEL+  +G AC SAC  F +PEYCCSGAY +P++C+PS YSE+FK+ACPRSYS
Subjt:  SGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKVANGGACSSACDKFGTPEYCCSGAYGSPATCKPSKYSEIFKSACPRSYS

Query:  YAYDDATSTFTCSGADYTVTFCPSSPSLKSS------TDSPTKTAGAGSATGGGGGGAVVGQMTP---------------------LPENSWLADLAIGG
        YAYDDATSTFTC+G DYTVTFCPSSPS KS+      TDS + + G+    G   G A  GQ TP                     L + SW+A LA+G 
Subjt:  YAYDDATSTFTCSGADYTVTFCPSSPSLKSS------TDSPTKTAGAGSATGGGGGGAVVGQMTP---------------------LPENSWLADLAIGG

Query:  STRSTHPDLAFLLFVLIFGSSFFFS
        ++R     L  LL    F   F FS
Subjt:  STRSTHPDLAFLLFVLIFGSSFFFS


Sequences
CDS sequence
ATGGACTTGTTTGAGAATCACTCTGTTTCTTGTTTCATTTTTTGGCTCTTCTCTCTTCTCCTACTTTCACATGGGGCTTTTGGAGCTAAATTTACGTTTGTGAATAAGTG
CGATTTCACTGTCTGGCCGGGAATTCTCTCCGGCGCCGGCAGCTTGAAATTCGACACCACCGGTTTCGAGCTTCGAAAGGGCAGTTCGAGGTCTTTTCAAGCGCCGGCTG
GATGGTCCGGCCGGTTCTGGGGACGAACTGGCTGCAGCTTCGACGGCGCCGGCCGTGGTGCCTGTACTACCGGTGATTGTGGGTCCGGCGAAATTGAGTGCAACGGCGCC
GGAGCTGCCCCACCGGCGACACTGGCGGAGTTCACTCTCGGGTCTGGTGCCGCCGCCTCGCTGGACTTCTACGACGTCAGCCTAGTCGATGGCTACAATTTGCCGATGAT
TGTCGAGGGGACCGGCGGGACTGGCGCGTGTGGGTCGACCGGGTGTGTAACGGACTTGAACCGGCAGTGTCCGGCGGAGCTTAAGGTGGCGAACGGGGGCGCGTGTAGCA
GCGCGTGTGATAAGTTTGGGACGCCGGAGTACTGCTGCAGCGGCGCGTATGGTTCACCGGCGACCTGTAAGCCGTCGAAGTACTCGGAGATATTTAAATCGGCGTGTCCT
CGGTCGTATAGCTACGCCTATGACGACGCCACCAGCACCTTCACTTGCTCCGGCGCCGATTATACTGTCACATTTTGCCCTTCTTCCCCAAGCCTGAAATCATCAACAGA
TTCGCCGACAAAGACGGCGGGAGCGGGGTCGGCAACTGGCGGTGGCGGTGGCGGTGCGGTGGTGGGACAGATGACGCCGCTGCCTGAAAATTCATGGCTGGCAGATTTGG
CCATTGGAGGCTCAACCAGATCAACTCACCCAGATTTGGCTTTTCTTCTGTTTGTGTTGATTTTTGGAAGCTCTTTCTTCTTCTCATTTTTAAATTTATTTTCTTCTTAG
mRNA sequence
CAGAACTCTCCCCCCACTTACACAAAACACAGAGCCCCCACAATTAAACACATAAGTCCCAATCCAAAAAACCCACTTCCATTTTCAACCCCAAAATCCCACATTTGTGA
GAAAAGAAGGGGCTAAAATCAGAAGTGATTAGTGAAAATTTGAAAAGTGATTCCATTTCTCATGGACTTGTTTGAGAATCACTCTGTTTCTTGTTTCATTTTTTGGCTCT
TCTCTCTTCTCCTACTTTCACATGGGGCTTTTGGAGCTAAATTTACGTTTGTGAATAAGTGCGATTTCACTGTCTGGCCGGGAATTCTCTCCGGCGCCGGCAGCTTGAAA
TTCGACACCACCGGTTTCGAGCTTCGAAAGGGCAGTTCGAGGTCTTTTCAAGCGCCGGCTGGATGGTCCGGCCGGTTCTGGGGACGAACTGGCTGCAGCTTCGACGGCGC
CGGCCGTGGTGCCTGTACTACCGGTGATTGTGGGTCCGGCGAAATTGAGTGCAACGGCGCCGGAGCTGCCCCACCGGCGACACTGGCGGAGTTCACTCTCGGGTCTGGTG
CCGCCGCCTCGCTGGACTTCTACGACGTCAGCCTAGTCGATGGCTACAATTTGCCGATGATTGTCGAGGGGACCGGCGGGACTGGCGCGTGTGGGTCGACCGGGTGTGTA
ACGGACTTGAACCGGCAGTGTCCGGCGGAGCTTAAGGTGGCGAACGGGGGCGCGTGTAGCAGCGCGTGTGATAAGTTTGGGACGCCGGAGTACTGCTGCAGCGGCGCGTA
TGGTTCACCGGCGACCTGTAAGCCGTCGAAGTACTCGGAGATATTTAAATCGGCGTGTCCTCGGTCGTATAGCTACGCCTATGACGACGCCACCAGCACCTTCACTTGCT
CCGGCGCCGATTATACTGTCACATTTTGCCCTTCTTCCCCAAGCCTGAAATCATCAACAGATTCGCCGACAAAGACGGCGGGAGCGGGGTCGGCAACTGGCGGTGGCGGT
GGCGGTGCGGTGGTGGGACAGATGACGCCGCTGCCTGAAAATTCATGGCTGGCAGATTTGGCCATTGGAGGCTCAACCAGATCAACTCACCCAGATTTGGCTTTTCTTCT
GTTTGTGTTGATTTTTGGAAGCTCTTTCTTCTTCTCATTTTTAAATTTATTTTCTTCTTAGCATTAAAAATTTGAATTACCATAATGAAGAAGAGAAGAAGATGATGATT
TGGGGAAACAAAATGGATTTAAATTTCAGATCACCAAAGGATCACTTTCCATTTTGCTAGCTGCAATTTTAAAAGTACATTTAAGTGTATATAAAGATGCATGCATTATT
TTTTGTTGACAAAAACAGAGCCTTTTCTATTTACTCTCATCTTTTTCTTTTGGTTGAATTATAACCTTACACTTAAACTTTTAAGTTTGTATCTATTTGATCAGGGTGTC
TTATATGTCTATAAACTTAAGAAATATCTAATAAATTTCTAAACTTTCAATTTTGTATCTAATAAATTTCTAACATATTTAAATATTTTAAAAATTAATTAAACTATTTG
ATATAAAA
Protein sequence
MDLFENHSVSCFIFWLFSLLLLSHGAFGAKFTFVNKCDFTVWPGILSGAGSLKFDTTGFELRKGSSRSFQAPAGWSGRFWGRTGCSFDGAGRGACTTGDCGSGEIECNGA
GAAPPATLAEFTLGSGAAASLDFYDVSLVDGYNLPMIVEGTGGTGACGSTGCVTDLNRQCPAELKVANGGACSSACDKFGTPEYCCSGAYGSPATCKPSKYSEIFKSACP
RSYSYAYDDATSTFTCSGADYTVTFCPSSPSLKSSTDSPTKTAGAGSATGGGGGGAVVGQMTPLPENSWLADLAIGGSTRSTHPDLAFLLFVLIFGSSFFFSFLNLFSS
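The CDS and protein sequences listed above should correspond codon by codon. The following is a minimal sketch of how one might check this, assuming the standard nuclear genetic code (NCBI translation table 1) applies; `translate` is a hypothetical helper written for illustration and is not part of CuGenDB. The example input is the first 30 bases of the CDS shown above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored; unknown codons become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS above.
print(translate("ATGGACTTGTTTGAGAATCACTCTGTTTCT"))
```

To check the whole record, the CDS lines can be concatenated and passed to `translate`, and the result compared against the concatenated protein sequence lines.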