; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC02g0858 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC02g0858
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionthaumatin-like protein 1
Genome locationMC02:6804566..6807043
RNA-Seq ExpressionMC02g0858
SyntenyMC02g0858
Gene Ontology termsGO:0006952 - defense response (biological process)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR001938 - Thaumatin family
IPR017949 - Thaumatin, conserved site
IPR037176 - Osmotin/thaumatin-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022146197.1 thaumatin-like protein 1 [Momordica charantia]1.20e-305100Show/hide
Query:  MCLKWGEIEVEEEVCEKKEMGNINNGKKRENRDEIPKYYYFSCSCSHSHAFSFFFFILSLSSTELNLQPTPPSSFSALFLKSLSMASSYSALIIAFLHLL
        MCLKWGEIEVEEEVCEKKEMGNINNGKKRENRDEIPKYYYFSCSCSHSHAFSFFFFILSLSSTELNLQPTPPSSFSALFLKSLSMASSYSALIIAFLHLL
Subjt:  MCLKWGEIEVEEEVCEKKEMGNINNGKKRENRDEIPKYYYFSCSCSHSHAFSFFFFILSLSSTELNLQPTPPSSFSALFLKSLSMASSYSALIIAFLHLL

Query:  ATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGHVECNGAGATPPATLAEFTL
        ATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGHVECNGAGATPPATLAEFTL
Subjt:  ATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGHVECNGAGATPPATLAEFTL

Query:  GTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYGSPATCKPSIYSEMFKSACPRSYSYA
        GTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYGSPATCKPSIYSEMFKSACPRSYSYA
Subjt:  GTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYGSPATCKPSIYSEMFKSACPRSYSYA

Query:  YDDATSTFTCSGADYTITFCPSSPSQKSSSFPTPTTAETSQGFGSETGSGSGSGSEYGMGYSGASGTDMLGAGASSGSGSGSGTGETMLADGSWLAGLAM
        YDDATSTFTCSGADYTITFCPSSPSQKSSSFPTPTTAETSQGFGSETGSGSGSGSEYGMGYSGASGTDMLGAGASSGSGSGSGTGETMLADGSWLAGLAM
Subjt:  YDDATSTFTCSGADYTITFCPSSPSQKSSSFPTPTTAETSQGFGSETGSGSGSGSEYGMGYSGASGTDMLGAGASSGSGSGSGTGETMLADGSWLAGLAM

Query:  GNSAKTVSSTSTPLWMASFIAILSSLFL
        GNSAKTVSSTSTPLWMASFIAILSSLFL
Subjt:  GNSAKTVSSTSTPLWMASFIAILSSLFL

XP_022943994.1 thaumatin-like protein 1 [Cucurbita moschata]5.33e-19982.82Show/hide
Query:  MASSY---SALIIAFLHLLATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGH
        MASS+   +A I A LHLLA   AATFTFVNRCDFTVWPGILANAG+PTL TTGFELP D+SRSFLAPTGWSGRFWGRT+C+FD SGSGSC+TGDCGSG 
Subjt:  MASSY---SALIIAFLHLLATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGH

Query:  VECNGAGATPPATLAEFTLGTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYGSPATCK
        VECNGAGA PPATLAEFTLGTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCP ELRVGEGDACKSACEAF T EYCCSGAYGSP +CK
Subjt:  VECNGAGATPPATLAEFTLGTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYGSPATCK

Query:  PSIYSEMFKSACPRSYSYAYDDATSTFTCSGADYTITFCPSSPSQKSSSFPTPTTAETSQGFGSETGSGSGSGSEYGMGYSGASGTDMLGAGASSGS---
        PS+YSEMFKSACPRSYSYAYDDATSTFTC+GADYTITFCPSSPSQKSSS PTPTT + SQ +GSE+G  SGSGSEYGMGYSGASGTDMLG GASSGS   
Subjt:  PSIYSEMFKSACPRSYSYAYDDATSTFTCSGADYTITFCPSSPSQKSSSFPTPTTAETSQGFGSETGSGSGSGSEYGMGYSGASGTDMLGAGASSGS---

Query:  -----GSGSGTGETMLADGSWLAGLAMGNSAKTVSSTSTPLWMASFIAILSSLFL
             GS SG+GE MLADGSWLAGLAMG++AKTVS  S  L++ +FI ILSS+FL
Subjt:  -----GSGSGTGETMLADGSWLAGLAMGNSAKTVSSTSTPLWMASFIAILSSLFL

XP_022986123.1 thaumatin-like protein 1 isoform X1 [Cucurbita maxima]8.60e-20083.85Show/hide
Query:  MASSY---SALIIAFLHLLATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGH
        MASS+   +ALI   LHLLA   AATFTFVNRCDFTVWPGILANAG+PTL TTGFELP D+ RSFLAPTGWSGRFWGRT+C+FD SGSGSC+TGDCGSG 
Subjt:  MASSY---SALIIAFLHLLATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGH

Query:  VECNGAGATPPATLAEFTLGTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYGSPATCK
        VECNGAGA PPATLAEFTLGTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCP ELRVGEGDACKSACEAF T EYCCSGAYGSP +CK
Subjt:  VECNGAGATPPATLAEFTLGTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYGSPATCK

Query:  PSIYSEMFKSACPRSYSYAYDDATSTFTCSGADYTITFCPSSPSQKSSSFPTPTTAETSQGFGSETG--SGSGSGSEYGMGYSGASGTDMLGAGASSGS-
        PS+YSEMFKSACPRSYSYAYDDATSTFTC+GADYTITFCPSSPSQKSSSFPTPTT + SQ +GSE+G  SGSGSGSEYGMGYSGASGTDMLG GASSGS 
Subjt:  PSIYSEMFKSACPRSYSYAYDDATSTFTCSGADYTITFCPSSPSQKSSSFPTPTTAETSQGFGSETG--SGSGSGSEYGMGYSGASGTDMLGAGASSGS-

Query:  ---GSGSGTGETMLADGSWLAGLAMGNSAKTVSSTSTPLWMASFIAILSSLFL
           GS SG+GE MLADGSWLAGLAMG++AKTVS  S  L++ +FI ILSS+FL
Subjt:  ---GSGSGTGETMLADGSWLAGLAMGNSAKTVSSTSTPLWMASFIAILSSLFL

XP_023512917.1 thaumatin-like protein 1 [Cucurbita pepo subsp. pepo]4.36e-19883.86Show/hide
Query:  SALIIAFLHLLATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGHVECNGAGA
        +A I A LHLLA   AATFTFVNRCDFTVWPGILANAG+PTL TTGFELP D+SRSFLAPTGWSGRFWGRT+C+FD SGSGSC+TGDCGSG VECNGAGA
Subjt:  SALIIAFLHLLATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGHVECNGAGA

Query:  TPPATLAEFTLGTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYGSPATCKPSIYSEMF
         PPATLAEFTLGTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCP ELRVGEGDACKSACEAF T EYCCSGAYGSP +CKPS+YSEMF
Subjt:  TPPATLAEFTLGTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYGSPATCKPSIYSEMF

Query:  KSACPRSYSYAYDDATSTFTCSGADYTITFCPSSPSQKSSSFPTPTTAETSQGFGSETGSGSGSGSEYGMGYSGASGTDMLGAGASSGS--GSGSGT---
        KSACPRSYSYAYDDATSTFTC+GADYTITFCPSSPSQKSSSFPTPTT + SQ +GSE+G  SGSGSEYGMGYSG+SGTDMLG GASSGS  GSGSG+   
Subjt:  KSACPRSYSYAYDDATSTFTCSGADYTITFCPSSPSQKSSSFPTPTTAETSQGFGSETGSGSGSGSEYGMGYSGASGTDMLGAGASSGS--GSGSGT---

Query:  ---GETMLADGSWLAGLAMGNSAKTVSSTSTPLWMASFIAILSSLFL
           GE MLADGSWLAGLAMG++AKTVS  S  L++ +FI ILSS+FL
Subjt:  ---GETMLADGSWLAGLAMGNSAKTVSSTSTPLWMASFIAILSSLFL

XP_038901689.1 thaumatin-like protein 1 [Benincasa hispida]5.30e-20284.97Show/hide
Query:  SMASSYSALIIAFLHLLATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGHVE
        S A  + +LI AFLHLL TS AATFTFVNRCDFTVWPGILANAG+PTL TTGFELP D+SRSFLAPTGWSGRFWGRTSC+FD S SGSC+TGDCGSG VE
Subjt:  SMASSYSALIIAFLHLLATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGHVE

Query:  CNGAGATPPATLAEFTLGTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYGSPATCKPS
        CNGAGA PPATLAEFTLGTGG DFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCP ELRVGE DACKSACEAF T EYCCSGAYGSP +CKPS
Subjt:  CNGAGATPPATLAEFTLGTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYGSPATCKPS

Query:  IYSEMFKSACPRSYSYAYDDATSTFTCSGADYTITFCPSSPSQKSSSFPTPTTAET-SQGFGSETGSGSGSGSEYGMGYSGASGTDMLGAGASSGSGSGS
        +YSEMFK+ACP+SYSYAYDDATSTFTCSGADYTITFCPSSPSQKSSSFPTPTT +T SQG+GSE+G GS SGSEYGMGYSGASGTDMLG GASSGS  GS
Subjt:  IYSEMFKSACPRSYSYAYDDATSTFTCSGADYTITFCPSSPSQKSSSFPTPTTAET-SQGFGSETGSGSGSGSEYGMGYSGASGTDMLGAGASSGSGSGS

Query:  GTGETMLADGSWLAGLAMGNSAKTVSSTSTPLWMASFIAILSSLFL
        G+GE MLADGSWLAGLAMG+SAKTVS  ST L++ +FI +L+SLFL
Subjt:  GTGETMLADGSWLAGLAMGNSAKTVSSTSTPLWMASFIAILSSLFL

TrEMBL top hitse value%identityAlignment
A0A0A0LMB1 Uncharacterized protein9.60e-19782.1Show/hide
Query:  SMASSYSALIIAFLHLLATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGHVE
        S A  + +LI   L LL TS AATFTFVNRCDFTVWPGILANAG+PTL TTGFELP D+SRSFLAPTGWSGRFWGRTSC+FD S SGSC+TGDCGSG +E
Subjt:  SMASSYSALIIAFLHLLATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGHVE

Query:  CNGAGATPPATLAEFTLGTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYGSPATCKPS
        CNGAGA PPATLAEFTLG GG DFYDVSLVDGYNLPMIVEGTGGSGQCA+TGCSTDLNRQCP ELRVGEGDACKSACEAF T EYCCSGAYGSP +CKPS
Subjt:  CNGAGATPPATLAEFTLGTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYGSPATCKPS

Query:  IYSEMFKSACPRSYSYAYDDATSTFTCSGADYTITFCPSSPSQKSSSFPTPTTAET-SQGFGSETG------SGSGSGSEYGMGYSGASGTDMLGAGASS
        +YSEMFK+ACP+SYSYAYDDATSTFTC+GADYTITFCPSSPSQKSSSFPTPTT +T SQG+GSETG      SGSGSGSEYGMGYSGASGTDMLG GAS+
Subjt:  IYSEMFKSACPRSYSYAYDDATSTFTCSGADYTITFCPSSPSQKSSSFPTPTTAET-SQGFGSETG------SGSGSGSEYGMGYSGASGTDMLGAGASS

Query:  GSGSGSGTGETMLADGSWLAGLAMGNSAKTVSSTSTPLWMASFIAILSSLFL
        GS SGSG+GE MLADGSWLAGLAMG+SA+ VS  S  L++ +F+ ILSSLFL
Subjt:  GSGSGSGTGETMLADGSWLAGLAMGNSAKTVSSTSTPLWMASFIAILSSLFL

A0A1S3C5L8 thaumatin-like protein 1b3.81e-19882.86Show/hide
Query:  SMASSYSALIIAFLHLLATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGHVE
        S A  + +LI   L LL TS AATFTFVNRCDFTVWPGILANAG+PTL TTGFELP D+SRSFLAPTGWSGRFWGRTSC+FD S SGSC+TGDCGSG VE
Subjt:  SMASSYSALIIAFLHLLATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGHVE

Query:  CNGAGATPPATLAEFTLGTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYGSPATCKPS
        CNGAGA PPATLAEFTLGTGG DFYDVSLVDGYNLPMIVEGTGGSGQCA+TGCSTDLNRQCP ELRVG+GDACKSACEAF T EYCCSGAYGSP +CKPS
Subjt:  CNGAGATPPATLAEFTLGTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYGSPATCKPS

Query:  IYSEMFKSACPRSYSYAYDDATSTFTCSGADYTITFCPSSPSQKSSSFPTPTTAET-SQGFGSETG----SGSGSGSEYGMGYSGASGTDMLGAGASSGS
        +YSEMFK+ACP+SYSYAYDDATSTFTC+GADYTITFCPSSPSQKSSSFPTPTT +T SQG+GSETG    SGSGSGSEYGMGYSGASGTDMLG GAS+GS
Subjt:  IYSEMFKSACPRSYSYAYDDATSTFTCSGADYTITFCPSSPSQKSSSFPTPTTAET-SQGFGSETG----SGSGSGSEYGMGYSGASGTDMLGAGASSGS

Query:  GSGSGTGETMLADGSWLAGLAMGNSAKTVSSTSTPLWMASFIAILSSLFL
         SGSG+GE MLADGSWLAGLAMG+SA+ VS  S  L++ +F+ ILSSLFL
Subjt:  GSGSGTGETMLADGSWLAGLAMGNSAKTVSSTSTPLWMASFIAILSSLFL

A0A6J1CYX5 thaumatin-like protein 15.83e-306100Show/hide
Query:  MCLKWGEIEVEEEVCEKKEMGNINNGKKRENRDEIPKYYYFSCSCSHSHAFSFFFFILSLSSTELNLQPTPPSSFSALFLKSLSMASSYSALIIAFLHLL
        MCLKWGEIEVEEEVCEKKEMGNINNGKKRENRDEIPKYYYFSCSCSHSHAFSFFFFILSLSSTELNLQPTPPSSFSALFLKSLSMASSYSALIIAFLHLL
Subjt:  MCLKWGEIEVEEEVCEKKEMGNINNGKKRENRDEIPKYYYFSCSCSHSHAFSFFFFILSLSSTELNLQPTPPSSFSALFLKSLSMASSYSALIIAFLHLL

Query:  ATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGHVECNGAGATPPATLAEFTL
        ATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGHVECNGAGATPPATLAEFTL
Subjt:  ATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGHVECNGAGATPPATLAEFTL

Query:  GTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYGSPATCKPSIYSEMFKSACPRSYSYA
        GTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYGSPATCKPSIYSEMFKSACPRSYSYA
Subjt:  GTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYGSPATCKPSIYSEMFKSACPRSYSYA

Query:  YDDATSTFTCSGADYTITFCPSSPSQKSSSFPTPTTAETSQGFGSETGSGSGSGSEYGMGYSGASGTDMLGAGASSGSGSGSGTGETMLADGSWLAGLAM
        YDDATSTFTCSGADYTITFCPSSPSQKSSSFPTPTTAETSQGFGSETGSGSGSGSEYGMGYSGASGTDMLGAGASSGSGSGSGTGETMLADGSWLAGLAM
Subjt:  YDDATSTFTCSGADYTITFCPSSPSQKSSSFPTPTTAETSQGFGSETGSGSGSGSEYGMGYSGASGTDMLGAGASSGSGSGSGTGETMLADGSWLAGLAM

Query:  GNSAKTVSSTSTPLWMASFIAILSSLFL
        GNSAKTVSSTSTPLWMASFIAILSSLFL
Subjt:  GNSAKTVSSTSTPLWMASFIAILSSLFL

A0A6J1FVS7 thaumatin-like protein 12.58e-19982.82Show/hide
Query:  MASSY---SALIIAFLHLLATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGH
        MASS+   +A I A LHLLA   AATFTFVNRCDFTVWPGILANAG+PTL TTGFELP D+SRSFLAPTGWSGRFWGRT+C+FD SGSGSC+TGDCGSG 
Subjt:  MASSY---SALIIAFLHLLATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGH

Query:  VECNGAGATPPATLAEFTLGTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYGSPATCK
        VECNGAGA PPATLAEFTLGTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCP ELRVGEGDACKSACEAF T EYCCSGAYGSP +CK
Subjt:  VECNGAGATPPATLAEFTLGTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYGSPATCK

Query:  PSIYSEMFKSACPRSYSYAYDDATSTFTCSGADYTITFCPSSPSQKSSSFPTPTTAETSQGFGSETGSGSGSGSEYGMGYSGASGTDMLGAGASSGS---
        PS+YSEMFKSACPRSYSYAYDDATSTFTC+GADYTITFCPSSPSQKSSS PTPTT + SQ +GSE+G  SGSGSEYGMGYSGASGTDMLG GASSGS   
Subjt:  PSIYSEMFKSACPRSYSYAYDDATSTFTCSGADYTITFCPSSPSQKSSSFPTPTTAETSQGFGSETGSGSGSGSEYGMGYSGASGTDMLGAGASSGS---

Query:  -----GSGSGTGETMLADGSWLAGLAMGNSAKTVSSTSTPLWMASFIAILSSLFL
             GS SG+GE MLADGSWLAGLAMG++AKTVS  S  L++ +FI ILSS+FL
Subjt:  -----GSGSGTGETMLADGSWLAGLAMGNSAKTVSSTSTPLWMASFIAILSSLFL

A0A6J1JFK9 thaumatin-like protein 1 isoform X14.16e-20083.85Show/hide
Query:  MASSY---SALIIAFLHLLATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGH
        MASS+   +ALI   LHLLA   AATFTFVNRCDFTVWPGILANAG+PTL TTGFELP D+ RSFLAPTGWSGRFWGRT+C+FD SGSGSC+TGDCGSG 
Subjt:  MASSY---SALIIAFLHLLATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGH

Query:  VECNGAGATPPATLAEFTLGTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYGSPATCK
        VECNGAGA PPATLAEFTLGTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCP ELRVGEGDACKSACEAF T EYCCSGAYGSP +CK
Subjt:  VECNGAGATPPATLAEFTLGTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYGSPATCK

Query:  PSIYSEMFKSACPRSYSYAYDDATSTFTCSGADYTITFCPSSPSQKSSSFPTPTTAETSQGFGSETG--SGSGSGSEYGMGYSGASGTDMLGAGASSGS-
        PS+YSEMFKSACPRSYSYAYDDATSTFTC+GADYTITFCPSSPSQKSSSFPTPTT + SQ +GSE+G  SGSGSGSEYGMGYSGASGTDMLG GASSGS 
Subjt:  PSIYSEMFKSACPRSYSYAYDDATSTFTCSGADYTITFCPSSPSQKSSSFPTPTTAETSQGFGSETG--SGSGSGSEYGMGYSGASGTDMLGAGASSGS-

Query:  ---GSGSGTGETMLADGSWLAGLAMGNSAKTVSSTSTPLWMASFIAILSSLFL
           GS SG+GE MLADGSWLAGLAMG++AKTVS  S  L++ +FI ILSS+FL
Subjt:  ---GSGSGTGETMLADGSWLAGLAMGNSAKTVSSTSTPLWMASFIAILSSLFL

SwissProt top hitse value%identityAlignment
A0A1P8B554 Thaumatin-like protein 12.0e-9067.47Show/hide
Query:  SYSALIIAFLHL----LATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDA-SGSGSCITGDCGSGHV
        S+  +I++FL      L  S  AT T VNRC FTVWPGIL+N+GS  + TTGFEL    SRSF AP  WSGRFW RT C+F++ +G G+C+TGDCGS  V
Subjt:  SYSALIIAFLHL----LATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDA-SGSGSCITGDCGSGHV

Query:  ECNGAGATPPATLAEFTLGTG------GQDFYDVSLVDGYNLPMIVEGTGGS-GQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYG
        ECNGAGA PPATLAEFT+G+G       QDFYDVSLVDGYN+PM+VE +GGS G C +TGC TDLN++CP ELR G G ACKSACEAF + EYCCSGAY 
Subjt:  ECNGAGATPPATLAEFTLGTG------GQDFYDVSLVDGYNLPMIVEGTGGS-GQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYG

Query:  SPATCKPSIYSEMFKSACPRSYSYAYDDATSTFTCSGADYTITFCPSSP
        SP  CKPS+YSE+FKSACPRSYSYA+DDATSTFTC+ ADYTITFCPS P
Subjt:  SPATCKPSIYSEMFKSACPRSYSYAYDDATSTFTCSGADYTITFCPSSP

O80327 Thaumatin-like protein 15.0e-7056.36Show/hide
Query:  LIIAFLHLLATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGHVECNGAGATP
        L++ FL   A   +A FTF N+C  TVWPG L   G P L +TGFEL + +S S      WSGRFWGR+ CS D+SG   C TGDCGSG + CNGAGA+P
Subjt:  LIIAFLHLLATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGHVECNGAGATP

Query:  PATLAEFTLGT-GGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGD----ACKSACEAFETEEYCCSGAYGSPATCKPSIYS
        PA+L E TL T GGQDFYDVSLVDG+NLP+ +   GGSG C ST C+ ++N  CPAEL     D     CKSAC A    +YCC+GAYG+P TC P+ +S
Subjt:  PATLAEFTLGT-GGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGD----ACKSACEAFETEEYCCSGAYGSPATCKPSIYS

Query:  EMFKSACPRSYSYAYDDATSTFTC-SGADYTITFCP
        ++FK+ CP++YSYAYDD +STFTC  G +Y ITFCP
Subjt:  EMFKSACPRSYSYAYDDATSTFTC-SGADYTITFCP

P28493 Pathogenesis-related protein 51.2e-6857.32Show/hide
Query:  SSYSALIIAFLHLLATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGHVECNG
        SS   L + F+      MA  FT  N C  TVW G LA  G P L   GFEL   +SR   AP GWSGRFW RT C+FDASG+G C+TGDCG   + CNG
Subjt:  SSYSALIIAFLHLLATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGHVECNG

Query:  AGATPPATLAEFTL-GTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGD---ACKSACEAFETEEYCCSGAYGSPATCKP
         G  PP TLAEFTL G GG+DFYDVSLVDGYN+ + +  +GGSG C   GC +DLN  CP  L+V + +   ACKSACE F T++YCC GA   P TC P
Subjt:  AGATPPATLAEFTL-GTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGD---ACKSACEAFETEEYCCSGAYGSPATCKP

Query:  SIYSEMFKSACPRSYSYAYDDATSTFTCSGADYTITFCP
        + YS +FK+ACP +YSYAYDD TSTFTC+GA+Y ITFCP
Subjt:  SIYSEMFKSACPRSYSYAYDDATSTFTCSGADYTITFCP

P50699 Thaumatin-like protein1.8e-6754.58Show/hide
Query:  SSYSALIIAFLHLLATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGHVECNG
        +S +  + AFL LL+ + A+T  F N+C   VWPGI  +AG   L   GF+LP + + S   P  WSGRFWGR  C+FD SG G C TGDCG G + CNG
Subjt:  SSYSALIIAFLHLLATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGHVECNG

Query:  AGATPPATLAEFTLGTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGD-----ACKSACEAFETEEYCCSGAYGSPATCK
        AG  PPATLAE TLG    DFYDVSLVDGYNL M +    GSGQC+  GC +DLN+ CP  L+V   +     ACKSAC AF + +YCC+G +G+P +CK
Subjt:  AGATPPATLAEFTLGTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGD-----ACKSACEAFETEEYCCSGAYGSPATCK

Query:  PSIYSEMFKSACPRSYSYAYDDATSTFTCSGADYTITFCP
        P+ YS++FK ACP++YSYAYDD TS  TCS A+Y +TFCP
Subjt:  PSIYSEMFKSACPRSYSYAYDDATSTFTCSGADYTITFCP

Q5DWG1 Pathogenesis-related thaumatin-like protein 3.54.0e-7563.51Show/hide
Query:  AATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGHVECNGAGATPPATLAEFTLGTGG
        A  FT VN+C +TVWPG L+ +GS  L   GF L    S    A + WSGRFWGRT CSFDASG GSCITGDCG+  + C  AG TPP +LAEFTL  G 
Subjt:  AATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGHVECNGAGATPPATLAEFTLGTGG

Query:  QDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRV---GEGDACKSACEAFETEEYCCSGAYGSPATCKPSIYSEMFKSACPRSYSYAY
        +DFYDVSLVDGYN+P+ +   GG+G C + GC +DL   CPAEL V   G+  ACKSAC AF T EYCC+G +GSP TC PS YS++FKSACP +YSYAY
Subjt:  QDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRV---GEGDACKSACEAFETEEYCCSGAYGSPATCKPSIYSEMFKSACPRSYSYAY

Query:  DDATSTFTCSGADYTITFCPSS
        DDATSTFTCS ADYTITFCPSS
Subjt:  DDATSTFTCSGADYTITFCPSS

Arabidopsis top hitse value%identityAlignment
AT1G20030.1 Pathogenesis-related thaumatin superfamily protein3.8e-8160.87Show/hide
Query:  MAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGHVECNGAGATPPATLAEFTL-GT
        M+ +FTF N+CD+TVWPGIL+NAG   L TTGF L    +R+  AP+ W GRFWGRT CS D+ G  SC TGDCGSG +EC+GAGA PPATLAEFTL G+
Subjt:  MAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGHVECNGAGATPPATLAEFTL-GT

Query:  GGQDFYDVSLVDGYNLPMIVEGTGGSGQ-CASTGCSTDLNRQCPAELRVGEGD-------ACKSACEAFETEEYCCSGAYGSPATCKPSIYSEMFKSACP
        GG DFYDVSLVDGYN+ M+V   GGSGQ C+STGC  DLN  CP+ELRV   D       ACKSACEAF   EYCCSGA+GSP TCKPS YS +FKSACP
Subjt:  GGQDFYDVSLVDGYNLPMIVEGTGGSGQ-CASTGCSTDLNRQCPAELRVGEGD-------ACKSACEAFETEEYCCSGAYGSPATCKPSIYSEMFKSACP

Query:  RSYSYAYDDATSTFTCS-GADYTITFCPS-SPSQKSS-SFPTPTTAETSQGFGSETGS----------GSGSGSEY
        R+YSYAYDD +STFTC+   +Y ITFCPS + S KS+    T T   TS   GS T S           SGS S Y
Subjt:  RSYSYAYDDATSTFTCS-GADYTITFCPS-SPSQKSS-SFPTPTTAETSQGFGSETGS----------GSGSGSEY

AT1G20030.2 Pathogenesis-related thaumatin superfamily protein3.8e-8160.87Show/hide
Query:  MAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGHVECNGAGATPPATLAEFTL-GT
        M+ +FTF N+CD+TVWPGIL+NAG   L TTGF L    +R+  AP+ W GRFWGRT CS D+ G  SC TGDCGSG +EC+GAGA PPATLAEFTL G+
Subjt:  MAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGHVECNGAGATPPATLAEFTL-GT

Query:  GGQDFYDVSLVDGYNLPMIVEGTGGSGQ-CASTGCSTDLNRQCPAELRVGEGD-------ACKSACEAFETEEYCCSGAYGSPATCKPSIYSEMFKSACP
        GG DFYDVSLVDGYN+ M+V   GGSGQ C+STGC  DLN  CP+ELRV   D       ACKSACEAF   EYCCSGA+GSP TCKPS YS +FKSACP
Subjt:  GGQDFYDVSLVDGYNLPMIVEGTGGSGQ-CASTGCSTDLNRQCPAELRVGEGD-------ACKSACEAFETEEYCCSGAYGSPATCKPSIYSEMFKSACP

Query:  RSYSYAYDDATSTFTCS-GADYTITFCPS-SPSQKSS-SFPTPTTAETSQGFGSETGS----------GSGSGSEY
        R+YSYAYDD +STFTC+   +Y ITFCPS + S KS+    T T   TS   GS T S           SGS S Y
Subjt:  RSYSYAYDDATSTFTCS-GADYTITFCPS-SPSQKSS-SFPTPTTAETSQGFGSETGS----------GSGSGSEY

AT4G24180.1 THAUMATIN-LIKE PROTEIN 11.4e-9167.47Show/hide
Query:  SYSALIIAFLHL----LATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDA-SGSGSCITGDCGSGHV
        S+  +I++FL      L  S  AT T VNRC FTVWPGIL+N+GS  + TTGFEL    SRSF AP  WSGRFW RT C+F++ +G G+C+TGDCGS  V
Subjt:  SYSALIIAFLHL----LATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDA-SGSGSCITGDCGSGHV

Query:  ECNGAGATPPATLAEFTLGTG------GQDFYDVSLVDGYNLPMIVEGTGGS-GQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYG
        ECNGAGA PPATLAEFT+G+G       QDFYDVSLVDGYN+PM+VE +GGS G C +TGC TDLN++CP ELR G G ACKSACEAF + EYCCSGAY 
Subjt:  ECNGAGATPPATLAEFTLGTG------GQDFYDVSLVDGYNLPMIVEGTGGS-GQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYG

Query:  SPATCKPSIYSEMFKSACPRSYSYAYDDATSTFTCSGADYTITFCPSSP
        SP  CKPS+YSE+FKSACPRSYSYA+DDATSTFTC+ ADYTITFCPS P
Subjt:  SPATCKPSIYSEMFKSACPRSYSYAYDDATSTFTCSGADYTITFCPSSP

AT4G38660.1 Pathogenesis-related thaumatin superfamily protein2.8e-11665.12Show/hide
Query:  LKSLSMASSYSALIIAFLHLLAT---SMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGD
        + +LS   + S L ++F  LL     S  +TFTF NRC +TVWPGIL+NAGSPTL TTGFELP  +SRS  APTGWSGRFW RT C FD+SGSG+C TGD
Subjt:  LKSLSMASSYSALIIAFLHLLAT---SMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGD

Query:  CGSGHVECNGAGATPPATLAEFTLGTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYGS
        CGS  VEC G GA PP TLAEFTLGTGG DFYDVSLVDGYN+PMIVE  GGSGQCASTGC+TDLN QCPAELR G+GDACKSAC AF + EYCCSGAY +
Subjt:  CGSGHVECNGAGATPPATLAEFTLGTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYGS

Query:  PATCKPSIYSEMFKSACPRSYSYAYDDATSTFTCSGADYTITFCPSSPSQKSSSFPTPTTAETSQGFGSETGSGSGSGSEYGMGYSGASGTDMLGAG---
        P++C+PS+YSEMFK+ACPRSYSYAYDDATSTFTC+G DYT+TFCPSSPSQKS+S+  P T  +S   GS+   GS +      GY+G  G    G G   
Subjt:  PATCKPSIYSEMFKSACPRSYSYAYDDATSTFTCSGADYTITFCPSSPSQKSSSFPTPTTAETSQGFGSETGSGSGSGSEYGMGYSGASGTDMLGAG---

Query:  ASSGSGSGSGTGETMLADGSWLAGLAMGNSAKTVSSTSTPLWMA
         S G+GS  GTGETML DGSW+AGLAMG +++    + T L  A
Subjt:  ASSGSGSGSGTGETMLADGSWLAGLAMGNSAKTVSSTSTPLWMA

AT4G38660.2 Pathogenesis-related thaumatin superfamily protein2.1e-11667.7Show/hide
Query:  LLATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGHVECNGAGATPPATLAEF
        +L  S  +TFTF NRC +TVWPGIL+NAGSPTL TTGFELP  +SRS  APTGWSGRFW RT C FD+SGSG+C TGDCGS  VEC G GA PP TLAEF
Subjt:  LLATSMAATFTFVNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGHVECNGAGATPPATLAEF

Query:  TLGTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYGSPATCKPSIYSEMFKSACPRSYS
        TLGTGG DFYDVSLVDGYN+PMIVE  GGSGQCASTGC+TDLN QCPAELR G+GDACKSAC AF + EYCCSGAY +P++C+PS+YSEMFK+ACPRSYS
Subjt:  TLGTGGQDFYDVSLVDGYNLPMIVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYGSPATCKPSIYSEMFKSACPRSYS

Query:  YAYDDATSTFTCSGADYTITFCPSSPSQKSSSFPTPTTAETSQGFGSETGSGSGSGSEYGMGYSGASGTDMLGAG---ASSGSGSGSGTGETMLADGSWL
        YAYDDATSTFTC+G DYT+TFCPSSPSQKS+S+  P T  +S   GS+   GS +      GY+G  G    G G    S G+GS  GTGETML DGSW+
Subjt:  YAYDDATSTFTCSGADYTITFCPSSPSQKSSSFPTPTTAETSQGFGSETGSGSGSGSEYGMGYSGASGTDMLGAG---ASSGSGSGSGTGETMLADGSWL

Query:  AGLAMGNSAKTVSSTSTPLWMA
        AGLAMG +++    + T L  A
Subjt:  AGLAMGNSAKTVSSTSTPLWMA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTCTAAAATGGGGGGAAATAGAGGTAGAAGAAGAAGTGTGTGAAAAAAAAGAAATGGGAAACATCAATAATGGAAAGAAGAGAGAAAATAGAGACGAAATCCCCAA
ATATTATTATTTTTCCTGCTCCTGCTCACATTCACATGCCTTCTCCTTCTTCTTTTTCATTCTCTCTCTCTCTTCTACTGAACTGAATCTCCAACCAACGCCTCCTTCTT
CATTCTCTGCTCTGTTTCTCAAATCCCTTTCCATGGCTTCCTCCTATTCCGCTCTCATTATAGCTTTTCTTCATCTACTCGCCACTTCCATGGCTGCAACCTTCACCTTC
GTCAACAGGTGCGATTTCACCGTCTGGCCCGGCATTCTCGCCAATGCCGGCAGCCCCACTCTCCAGACCACCGGATTCGAGCTCCCCACAGACTCTTCCCGCTCTTTTCT
AGCTCCCACCGGCTGGTCCGGCCGATTCTGGGGCCGGACTTCCTGTTCCTTCGACGCGTCCGGTTCCGGATCCTGCATCACCGGCGACTGCGGCTCCGGCCACGTTGAGT
GCAACGGCGCCGGAGCCACCCCGCCCGCCACTCTCGCCGAGTTCACTCTCGGTACCGGCGGCCAGGACTTCTACGACGTCAGCCTCGTCGACGGCTACAATCTACCCATG
ATAGTCGAGGGAACCGGCGGGTCGGGTCAGTGCGCGTCCACGGGTTGCTCCACGGATCTGAACCGGCAGTGTCCGGCGGAGCTGAGGGTCGGCGAAGGGGACGCGTGTAA
GAGCGCGTGCGAGGCGTTTGAGACGGAAGAGTACTGTTGCAGCGGCGCGTACGGTTCACCCGCCACCTGCAAACCCTCAATCTACTCGGAAATGTTCAAATCGGCTTGCC
CTAGATCCTACAGCTACGCCTACGACGACGCCACCAGCACCTTCACCTGCTCCGGCGCCGACTACACCATTACATTCTGCCCCTCATCTCCAAGCCAAAAATCATCAAGC
TTTCCTACCCCAACAACAGCAGAAACATCTCAAGGGTTCGGATCTGAGACTGGGTCTGGGTCGGGGTCGGGGTCGGAATATGGGATGGGGTATTCAGGAGCATCTGGAAC
TGATATGTTAGGGGCAGGTGCCAGTTCAGGTTCAGGATCGGGGTCGGGAACTGGGGAAACTATGCTAGCAGATGGTTCATGGTTGGCTGGTTTGGCCATGGGAAACTCAG
CAAAAACAGTATCGTCTACATCCACTCCACTGTGGATGGCATCATTTATAGCCATTCTATCTTCTCTCTTCTTGTAG
mRNA sequenceShow/hide mRNA sequence
TCTTCTTTATGAGTTAAATTATAAAATTAGTAGGTCTTCCTTTTTTCCTTCTAGAAAATTCAAACTTAACTAATTTTTGGTCTAAGATAGATGCTGTAACCAGTTGAACT
ATAATGACATGTCTAATAGACGTTTAAATTTTGCATCTAATGAATATCTAACCTAACAGCGATAACCTTATAATTTAATCTATAACCCACAATGCAAACACGACATCGGA
GGTTCAAACCTCCGTATCCAAATTTCTTACCATGTGTCTAAAATGGGGGGAAATAGAGGTAGAAGAAGAAGTGTGTGAAAAAAAAGAAATGGGAAACATCAATAATGGAA
AGAAGAGAGAAAATAGAGACGAAATCCCCAAATATTATTATTTTTCCTGCTCCTGCTCACATTCACATGCCTTCTCCTTCTTCTTTTTCATTCTCTCTCTCTCTTCTACT
GAACTGAATCTCCAACCAACGCCTCCTTCTTCATTCTCTGCTCTGTTTCTCAAATCCCTTTCCATGGCTTCCTCCTATTCCGCTCTCATTATAGCTTTTCTTCATCTACT
CGCCACTTCCATGGCTGCAACCTTCACCTTCGTCAACAGGTGCGATTTCACCGTCTGGCCCGGCATTCTCGCCAATGCCGGCAGCCCCACTCTCCAGACCACCGGATTCG
AGCTCCCCACAGACTCTTCCCGCTCTTTTCTAGCTCCCACCGGCTGGTCCGGCCGATTCTGGGGCCGGACTTCCTGTTCCTTCGACGCGTCCGGTTCCGGATCCTGCATC
ACCGGCGACTGCGGCTCCGGCCACGTTGAGTGCAACGGCGCCGGAGCCACCCCGCCCGCCACTCTCGCCGAGTTCACTCTCGGTACCGGCGGCCAGGACTTCTACGACGT
CAGCCTCGTCGACGGCTACAATCTACCCATGATAGTCGAGGGAACCGGCGGGTCGGGTCAGTGCGCGTCCACGGGTTGCTCCACGGATCTGAACCGGCAGTGTCCGGCGG
AGCTGAGGGTCGGCGAAGGGGACGCGTGTAAGAGCGCGTGCGAGGCGTTTGAGACGGAAGAGTACTGTTGCAGCGGCGCGTACGGTTCACCCGCCACCTGCAAACCCTCA
ATCTACTCGGAAATGTTCAAATCGGCTTGCCCTAGATCCTACAGCTACGCCTACGACGACGCCACCAGCACCTTCACCTGCTCCGGCGCCGACTACACCATTACATTCTG
CCCCTCATCTCCAAGCCAAAAATCATCAAGCTTTCCTACCCCAACAACAGCAGAAACATCTCAAGGGTTCGGATCTGAGACTGGGTCTGGGTCGGGGTCGGGGTCGGAAT
ATGGGATGGGGTATTCAGGAGCATCTGGAACTGATATGTTAGGGGCAGGTGCCAGTTCAGGTTCAGGATCGGGGTCGGGAACTGGGGAAACTATGCTAGCAGATGGTTCA
TGGTTGGCTGGTTTGGCCATGGGAAACTCAGCAAAAACAGTATCGTCTACATCCACTCCACTGTGGATGGCATCATTTATAGCCATTCTATCTTCTCTCTTCTTGTAGCC
CTCTCCAATTTCCCCCAACTTATGGCTTCAGCGTCAGGATTTGCAGTTGGTAGGAACGGAAAGGATCTTCCCCAGATTGTAAAATGCCATCTTCTTGAATGATTCATTTT
CTTTTTAGTTTGAGAATGTTTTGGTGCTCTTTTTTCATTTGGGTTTTGTTGTTCCCTTCCAAAGCTGTCATTGGGTTTCTGATTTAGATTGGAAGTTTGTTCCTTTTGAT
TACTTGAATCAGGAAGGAAAGAAATGTTTCTGAATCAGAACATTGTTTGTGGTGTGAACAAGATGACAAGCATGGCCGGTTTTGTAACAGAGAAGGATCGCTTGGCCTGT
AATGCAAGTCACAGGTCACAGCTTTGGCTTTTTGTATGTCAGCTGAGGGAAAAAAAATGAAAAAGATGCAACTTTTCAAAGTCCCTTTAAGTGTACTATTATTTTGGTTT
AGTGGAATGTGAAATTTTGTCAAGCAGTGGAGAAAGACAGATCATTTAAGTTGGG
Protein sequenceShow/hide protein sequence
MCLKWGEIEVEEEVCEKKEMGNINNGKKRENRDEIPKYYYFSCSCSHSHAFSFFFFILSLSSTELNLQPTPPSSFSALFLKSLSMASSYSALIIAFLHLLATSMAATFTF
VNRCDFTVWPGILANAGSPTLQTTGFELPTDSSRSFLAPTGWSGRFWGRTSCSFDASGSGSCITGDCGSGHVECNGAGATPPATLAEFTLGTGGQDFYDVSLVDGYNLPM
IVEGTGGSGQCASTGCSTDLNRQCPAELRVGEGDACKSACEAFETEEYCCSGAYGSPATCKPSIYSEMFKSACPRSYSYAYDDATSTFTCSGADYTITFCPSSPSQKSSS
FPTPTTAETSQGFGSETGSGSGSGSEYGMGYSGASGTDMLGAGASSGSGSGSGTGETMLADGSWLAGLAMGNSAKTVSSTSTPLWMASFIAILSSLFL