; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS006573 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS006573
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionThaumatin
Genome locationscaffold404:1437327..1458599
RNA-Seq ExpressionMS006573
SyntenyMS006573
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR001938 - Thaumatin family
IPR037176 - Osmotin/thaumatin-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
OMO73718.1 Thaumatin [Corchorus olitorius]1.2e-14056.43Show/hide
Query:  VHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLA
        V +ATF   NNCP ++WP  L SG G  Q SSTGF+L S AS T+D+ APW+GRIW RT+C  D+  +F C T DC SG ++CNGAG IPPA+L EFTLA
Subjt:  VHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLA

Query:  PNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQA
         +GG DFYDVSLVDGFNLP  I   GG+G+C++T+C ANVN VCP ELQV+ GDGSVI CKSACLAFN+PQYCCT  F   S                  
Subjt:  PNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQA

Query:  YSYAYDDKTSTFTCSVVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCN
                           ATF + NNC  TIWPA LT G+G  Q+  TG +L S AS   D+P PW+ R WART+C  D++ +F C TGDCASG I+CN
Subjt:  YSYAYDDKTSTFTCSVVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCN

Query:  GAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGG-TGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSK
        GAGG+PP +LAEFTLA N G DFYD+SLVDGFN+P SI   GG    C +  C AN+N  CP ELQV++ DGSV+ C SAC AFN+PQYCCT  F  P  
Subjt:  GAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGG-TGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSK

Query:  CAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP
        C  + YS  FK QCPQAYSYAYDDK+   +C G PNYV+TFCP
Subjt:  CAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP

RDY04111.1 hypothetical protein CR513_12220, partial [Mucuna pruriens]6.5e-15057.11Show/hide
Query:  NNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYD
        N C  T+WP TLT G  + QLS+TGF+L SGAS +VD+P+PW+GR WART C   S++ FSC TGDCASG + CNGAGG PPATL E T+A NGG DFYD
Subjt:  NNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYD

Query:  VSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQAYSYAYDDKT
        VS VDGFN+P SI   GG+G C++++C + +N VCP +LQV+  DGSVI CKSACLAF   QYCCT + N    C  + YS  F  QCP AYSYAYDDK 
Subjt:  VSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQAYSYAYDDKT

Query:  STFTCS----------VVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISC
         TFTCS          V   A     N CP T+WP TLT G  + QLS +GF+L +GAS +VD+P+PW+GR W RT C  ++  +FSC T DC SG ++C
Subjt:  STFTCS----------VVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISC

Query:  NGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSK
        NGAG  PPATL E T+A NGG D+YDVS VDGFN+P S++  GG+G+C++++C  N+N  CP ELQ++  DG+VIGCKSACL FN+PQYCCT + + P  
Subjt:  NGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSK

Query:  CAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP
        C  + +S  F+ QC +AYSYAYDDK STFTCS  P+YV+TFCP
Subjt:  CAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP

RYR71254.1 hypothetical protein Ahy_A02g005535 isoform A [Arachis hypogaea]4.5e-13551.06Show/hide
Query:  VVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTL
        V   A     NNCP T+WP T  SG+   QLSSTGF+L SGAS T+++P+ W+G+ WART C  +++  FSC T DC +  + C GAG   PA+L E T 
Subjt:  VVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTL

Query:  A-PNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCP
           NG  DFYDVS VDGFN+P+S++  GG+G C + +C  N+N  CP ELQ +  D SV+GCKSAC+ FN P+YCC  + N P  C  + YS  F  QCP
Subjt:  A-PNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCP

Query:  QAYSYAYDDKTSTFTCS--------------------------VVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIW
         AYSYAYDDK  TFTCS                          V H A   + N C  T+WP +  + +  +QLS+TGF+L SG S+TVDVPAPW+G+ W
Subjt:  QAYSYAYDDKTSTFTCS--------------------------VVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIW

Query:  ARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGS
        ART C  +++  FSC T DC +  + C+GAG   PA+L EFT+A NGG DFYDVS VDGFN+P+SI   GG+G C   +C AN+N  CP  LQ +  DGS
Subjt:  ARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGS

Query:  VIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP
        VIGCKSAC+ F  P+YCCT + N P+ C  + YS  F NQCP AYSYAYDDK  TFTCSGSPNY + FCP
Subjt:  VIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP

RZC77122.1 hypothetical protein C5167_001310 [Papaver somniferum]8.0e-13260.75Show/hide
Query:  KNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFY
        KNNCP T+WP TLT GSG SQLS TGF+L SGAS +VD PA W+GR WART C   SS R +C T DCASG+  CNGAG IPPATL EFTL  +GG DFY
Subjt:  KNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFY

Query:  DVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQAYSYAYDDK
        DVS VDGFNLPASI   GG   C ST C +N+N VCP EL V+   GSV+ CKSACLA  +PQYCCT  FN    C  + YS IFK+ CPQAYSYAYDD+
Subjt:  DVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQAYSYAYDDK

Query:  TSTFTCSVVHAATFVVK--NNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIP
        +STFTC+    +  ++      P          G   SQLS TGF L  GAS +VD PA W+GR WART C  DSS R  C T DCASG + CNGAG IP
Subjt:  TSTFTCSVVHAATFVVK--NNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIP

Query:  PATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYS
        PATL EFTL  NGG DFYDVSLVDGFNLPASI      G C ST C  NVN VCP +L VR   GSVI CKSAC AF +PQYCCT  FN P+ C  + YS
Subjt:  PATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYS

XP_019420884.1 PREDICTED: uncharacterized protein LOC109331061 isoform X2 [Lupinus angustifolius]1.4e-13153.08Show/hide
Query:  HAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSS-RFSCETGDCASGSISCNGAGGIPPATLAEFTLA
        ++ TF + N C  T+WP  LT G+G   LS+TGF L  G S T+ +PA W+GRIW RT C  D+++ +FSC TGDC S +  C G G  PPATLAEFTL 
Subjt:  HAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSS-RFSCETGDCASGSISCNGAGGIPPATLAEFTLA

Query:  PNGGMDFYDVSLVDGFNLPASIATVGGT--GECQSTACSANVNGVCPTELQ-VRSGDG-SVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQ
          GG+DFYDVS VDG+NLP  +   GGT  G C +T C  ++N  CPTEL+ V SG+G   + CKSAC AF +PQYCC+  +  P  C  S YS  FK+ 
Subjt:  PNGGMDFYDVSLVDGFNLPASIATVGGT--GECQSTACSANVNGVCPTELQ-VRSGDG-SVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQ

Query:  CPQAYSYAYDDKTSTFTCSVV--------HAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCE
        CP AYSYAYDD TSTFTC+            ATF   N C  T+WP  L    G+  L +TGF+L  G SR+   PA W+GR WART C  D S R +C 
Subjt:  CPQAYSYAYDDKTSTFTCSVV--------HAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCE

Query:  TGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQY
        T DC SG I+CNGAG  PPATLAEFTL   G MD+YDVSLVDG+NLP  +A  GG+G C +T C  ++N  CP+EL+V  GD     CKSAC AF + +Y
Subjt:  TGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQY

Query:  CCTAEFNDPSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP
        CC  EF++PS C  S YS +FK+ CP++YSYAYDD TSTFTC+G+ +Y +TFCP
Subjt:  CCTAEFNDPSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP

TrEMBL top hitse value%identityAlignment
A0A1R3HTS8 Thaumatin5.9e-14156.43Show/hide
Query:  VHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLA
        V +ATF   NNCP ++WP  L SG G  Q SSTGF+L S AS T+D+ APW+GRIW RT+C  D+  +F C T DC SG ++CNGAG IPPA+L EFTLA
Subjt:  VHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLA

Query:  PNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQA
         +GG DFYDVSLVDGFNLP  I   GG+G+C++T+C ANVN VCP ELQV+ GDGSVI CKSACLAFN+PQYCCT  F   S                  
Subjt:  PNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQA

Query:  YSYAYDDKTSTFTCSVVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCN
                           ATF + NNC  TIWPA LT G+G  Q+  TG +L S AS   D+P PW+ R WART+C  D++ +F C TGDCASG I+CN
Subjt:  YSYAYDDKTSTFTCSVVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCN

Query:  GAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGG-TGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSK
        GAGG+PP +LAEFTLA N G DFYD+SLVDGFN+P SI   GG    C +  C AN+N  CP ELQV++ DGSV+ C SAC AFN+PQYCCT  F  P  
Subjt:  GAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGG-TGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSK

Query:  CAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP
        C  + YS  FK QCPQAYSYAYDDK+   +C G PNYV+TFCP
Subjt:  CAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP

A0A371HMT6 Uncharacterized protein (Fragment)3.1e-15057.11Show/hide
Query:  NNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYD
        N C  T+WP TLT G  + QLS+TGF+L SGAS +VD+P+PW+GR WART C   S++ FSC TGDCASG + CNGAGG PPATL E T+A NGG DFYD
Subjt:  NNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYD

Query:  VSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQAYSYAYDDKT
        VS VDGFN+P SI   GG+G C++++C + +N VCP +LQV+  DGSVI CKSACLAF   QYCCT + N    C  + YS  F  QCP AYSYAYDDK 
Subjt:  VSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQAYSYAYDDKT

Query:  STFTCS----------VVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISC
         TFTCS          V   A     N CP T+WP TLT G  + QLS +GF+L +GAS +VD+P+PW+GR W RT C  ++  +FSC T DC SG ++C
Subjt:  STFTCS----------VVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISC

Query:  NGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSK
        NGAG  PPATL E T+A NGG D+YDVS VDGFN+P S++  GG+G+C++++C  N+N  CP ELQ++  DG+VIGCKSACL FN+PQYCCT + + P  
Subjt:  NGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSK

Query:  CAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP
        C  + +S  F+ QC +AYSYAYDDK STFTCS  P+YV+TFCP
Subjt:  CAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP

A0A445E6Y1 Uncharacterized protein2.2e-13551.06Show/hide
Query:  VVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTL
        V   A     NNCP T+WP T  SG+   QLSSTGF+L SGAS T+++P+ W+G+ WART C  +++  FSC T DC +  + C GAG   PA+L E T 
Subjt:  VVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTL

Query:  A-PNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCP
           NG  DFYDVS VDGFN+P+S++  GG+G C + +C  N+N  CP ELQ +  D SV+GCKSAC+ FN P+YCC  + N P  C  + YS  F  QCP
Subjt:  A-PNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCP

Query:  QAYSYAYDDKTSTFTCS--------------------------VVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIW
         AYSYAYDDK  TFTCS                          V H A   + N C  T+WP +  + +  +QLS+TGF+L SG S+TVDVPAPW+G+ W
Subjt:  QAYSYAYDDKTSTFTCS--------------------------VVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIW

Query:  ARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGS
        ART C  +++  FSC T DC +  + C+GAG   PA+L EFT+A NGG DFYDVS VDGFN+P+SI   GG+G C   +C AN+N  CP  LQ +  DGS
Subjt:  ARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGS

Query:  VIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP
        VIGCKSAC+ F  P+YCCT + N P+ C  + YS  F NQCP AYSYAYDDK  TFTCSGSPNY + FCP
Subjt:  VIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP

A0A6N2MKB6 Uncharacterized protein2.0e-14959.47Show/hide
Query:  AATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPN
        + TF   N CP T+WP TLT+ +G+ QLSSTGF L +GAS ++           ART+C   +S +F C T DCASG I CNGAG IPPA+LAEFTL  +
Subjt:  AATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPN

Query:  GGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQAYS
        GG DFYD+SLVDGFN+P SI   GG+G CQST+C+ANVN VC   L VR  DG+VI CKSAC AFN+PQYCCT  ++ P  C  +QYS+ FK +CPQAYS
Subjt:  GGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQAYS

Query:  YAYDDKTSTFTCSV---------VHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCA
        YAYDD++STFTC V           + TF   N CP T+WP TLT+ +G+ QLSSTGF L +GAS ++  PA W+GR WART+C   +S +F C T DCA
Subjt:  YAYDDKTSTFTCSV---------VHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCA

Query:  SGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAE
        SG I CNGAG IPPA+LAEFTL  +GG DFYD+SLVDGFN+P SI   GG   CQST+C+ANVN VC   L VR  DG+VI CKSAC+AFN+PQYCCT  
Subjt:  SGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAE

Query:  FNDPSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP
         N P  C  +QYS+ FK QCPQAYSYAYDDK+STFTC    NY++TFCP
Subjt:  FNDPSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP

A0A7N2M7N9 Uncharacterized protein3.4e-15254.86Show/hide
Query:  TVVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFT
        T  H+A     NNCP+TIWP TLTS   + QLS+TGF+LLS AS T+DV APW GR WART+C  +S++ F+CET DC +G ++CNG G IPPA+L E  
Subjt:  TVVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFT

Query:  LAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKN---
        +A N GMD+YDVSLVDGFNLP S+AT GGTG+C++++C A+VN VCP ELQV   DGSV+ CKSAC AFN+PQYCCT  F+ P  C  ++YS+   +   
Subjt:  LAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKN---

Query:  ----------------------QCPQAYSYAYDD------------KTSTFTCSVV-------------HAATFVVKNNCPQTIWPATLTSGSGQSQLSS
                              +C Q  S  Y              KT+  T   V             H+A     NNCP T+WP TLTS   + QLS+
Subjt:  ----------------------QCPQAYSYAYDD------------KTSTFTCSVV-------------HAATFVVKNNCPQTIWPATLTSGSGQSQLSS

Query:  TGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQ
        TGF+L S AS  +DV APW GR WART C  DSS +FSC T +C+SG +SCNG G +PPA+L E  +A +GGMDFYDVSLVDGFNLP S+AT GGTGEC+
Subjt:  TGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQ

Query:  STACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP
        +++C ANVN  CP ELQV+  DGSVI CKSAC AFN+PQYCCT   N P  C  + YS IF+NQCPQAYSYAYDD+ STFTCSG+PNYV+TFCP
Subjt:  STACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP

SwissProt top hitse value%identityAlignment
O80327 Thaumatin-like protein 12.4e-8664.16Show/hide
Query:  VHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLA
        V++A F   N CP T+WP TLT G G  QL STGF+L SGAS ++ V APW+GR W R+ C +DSS +F C TGDC SG ISCNGAG  PPA+L E TLA
Subjt:  VHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLA

Query:  PNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQA
         NGG DFYDVSLVDGFNLP  +A  GG+G+C ST+C+AN+N VCP EL  +  DGSVIGCKSACLA N+PQYCCT  +  P  C  + +S +FKNQCPQA
Subjt:  PNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQA

Query:  YSYAYDDKTSTFTCSGSPNYVVTFCP
        YSYAYDDK+STFTC G PNY +TFCP
Subjt:  YSYAYDDKTSTFTCSGSPNYVVTFCP

P50694 Glucan endo-1,3-beta-glucosidase1.5e-8061.33Show/hide
Query:  HAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAP
        HAAT   KNNCP  +WP TLTS   + QLS+TGF+L S AS  +D P PW GR WART C  D+S +F C T DCASG + CNG G IPPATLAEF +  
Subjt:  HAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAP

Query:  NGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQAY
         GG DFYDVSLVDGFNLP S+   GGTG+C++ +C ANVN VCP+ELQ +  DGSV+ C SAC+ F  PQYCCT   N P  C  + YS IF N CP AY
Subjt:  NGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQAY

Query:  SYAYDDKTSTFTCSGSPNYVVTFCP
        SYAYDDK  TFTC+G PNY +TFCP
Subjt:  SYAYDDKTSTFTCSGSPNYVVTFCP

P83332 Thaumatin-like protein 14.6e-8261.4Show/hide
Query:  SVVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFT
        S  HAA     N C  T+WP TLT G  + QLS TGF+L +G SR+VD P+PW+GR + RTRC  D+S +F+C T DC SG +SCNG G  PPATL E T
Subjt:  SVVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFT

Query:  LAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCP
        +A NGG DFYDVSLVDGFNLP S+A  GGTG+C+++ C A++N VCP  LQV+  DGSVI CKSACLAFN+P+YCCT   + P  C    YS +FK QCP
Subjt:  LAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCP

Query:  QAYSYAYDDKTSTFTCSGSPNYVVTFCP
        QAYSYAYDDK+STFTCSG P Y++TFCP
Subjt:  QAYSYAYDDKTSTFTCSGSPNYVVTFCP

Q9FSG7 Thaumatin-like protein 1a1.1e-8664.04Show/hide
Query:  SVVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFT
        S  HAA     NNCP T+WP TLT G  + QLS TGF+L S ASR+VD P+PW+GR W RTRC  D++ +F+CET DC SG ++CNGAG +PPATL E T
Subjt:  SVVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFT

Query:  LAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCP
        +A NGG D+YDVSLVDGFNLP S+A  GGTGEC+ ++C ANVN VCP  LQV++ DGSVI CKSACLAF + +YCCT   N P  C  ++YS IF+ QCP
Subjt:  LAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCP

Query:  QAYSYAYDDKTSTFTCSGSPNYVVTFCP
        QAYSYAYDDK STFTCSG P+YV+TFCP
Subjt:  QAYSYAYDDKTSTFTCSGSPNYVVTFCP

Q9SMH2 Thaumatin-like protein 13.2e-8362.77Show/hide
Query:  FTCSVVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLA
        F  S  H+A     NNCP+TIWP TLTS   + QL +TGF L S AS T+ V APW GR WARTRC   +S +F+CET DC++G ++CNG G IPPA+L 
Subjt:  FTCSVVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLA

Query:  EFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKN
        E  +A N GMDFYDVSLVDG+NLP S+AT GGTG+C++T+C ANVN VCP ELQV+  D SV+ CKSAC AFN+PQYCCT  F+    C  ++YS IFK 
Subjt:  EFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKN

Query:  QCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP
        QCPQAYSYAYDD TSTFTCSG+P+YV+TFCP
Subjt:  QCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP

Arabidopsis top hitse value%identityAlignment
AT1G20030.1 Pathogenesis-related thaumatin superfamily protein5.2e-7356.64Show/hide
Query:  TFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGG
        +F   N C  T+WP  L S +G S L +TGF LL G +RT++ P+ W GR W RT C  DS  +FSC TGDC SG I C+GAG  PPATLAEFTL  +GG
Subjt:  TFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGG

Query:  MDFYDVSLVDGFNLPASIATVGGTGE-CQSTACSANVNGVCPTELQVRSGDGS---VIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQA
        +DFYDVSLVDG+N+   +   GG+G+ C ST C  ++NG CP+EL+V S DG     + CKSAC AF +P+YCC+  F  P  C  S YS IFK+ CP+A
Subjt:  MDFYDVSLVDGFNLPASIATVGGTGE-CQSTACSANVNGVCPTELQVRSGDGS---VIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQA

Query:  YSYAYDDKTSTFTCSGSPNYVVTFCP
        YSYAYDDK+STFTC+ SPNYV+TFCP
Subjt:  YSYAYDDKTSTFTCSGSPNYVVTFCP

AT1G20030.2 Pathogenesis-related thaumatin superfamily protein2.4e-7356.03Show/hide
Query:  SVVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFT
        S V + +F   N C  T+WP  L S +G S L +TGF LL G +RT++ P+ W GR W RT C  DS  +FSC TGDC SG I C+GAG  PPATLAEFT
Subjt:  SVVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFT

Query:  LAPNGGMDFYDVSLVDGFNLPASIATVGGTGE-CQSTACSANVNGVCPTELQVRSGDGS---VIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFK
        L  +GG+DFYDVSLVDG+N+   +   GG+G+ C ST C  ++NG CP+EL+V S DG     + CKSAC AF +P+YCC+  F  P  C  S YS IFK
Subjt:  LAPNGGMDFYDVSLVDGFNLPASIATVGGTGE-CQSTACSANVNGVCPTELQVRSGDGS---VIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFK

Query:  NQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP
        + CP+AYSYAYDDK+STFTC+ SPNYV+TFCP
Subjt:  NQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP

AT1G75800.1 Pathogenesis-related thaumatin superfamily protein1.1e-7053.45Show/hide
Query:  SVVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFT
        S V + +F++ N C  T+WP  L S +G   L +TGF L  G  RT+  P  W GR W RT+C  D+  +F+C TGDC SG++ C+G+G  PPATLAEFT
Subjt:  SVVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFT

Query:  LAPNGGMDFYDVSLVDGFNLPASIATVGGTG-ECQSTACSANVNGVCPTELQVRSGDG---SVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFK
        L  + G+DFYDVSLVDG+N+P  +A  GG+G  C ST C  ++NG CP+EL+V S DG     +GCKSAC AF  P+YCC+     P  C  S YSL+FK
Subjt:  LAPNGGMDFYDVSLVDGFNLPASIATVGGTG-ECQSTACSANVNGVCPTELQVRSGDG---SVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFK

Query:  NQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP
          CP+AYSYAYDD++STFTC+ SPNYV+TFCP
Subjt:  NQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP

AT4G18250.1 receptor serine/threonine kinase, putative2.5e-8338.34Show/hide
Query:  VVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTL
        V+  +   ++N C  T+WP  + S +  SQ+S TGF L  G +R +  P+ W G I ART C +DS+  FSC TGDC SG+I C G  G  P T   F  
Subjt:  VVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTL

Query:  APNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQ
             M+ Y +S+  G+NLP  +     +  C S  C   +   CP +L   S + +++ C S C+ F+ P+ CCT +F     C  + Y+  F+  CP 
Subjt:  APNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQ

Query:  AYSYAYDDKTSTFTC--SVVHAATFV-VKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGS
        A+ YAYDD  ST TC  S  +  T + ++N C  TIWP      S +SQ+S+TGF L +G  R ++ P+ W G I ART C  DS+  FSC TGDC SG 
Subjt:  AYSYAYDDKTSTFTC--SVVHAATFV-VKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGS

Query:  ISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFND
        I C G     P T   F +  +G ++ Y +SL  G+NLP ++  V     C S+ C  ++N  CP +L+  S  G ++ CKSAC      + CCT  F  
Subjt:  ISCNGAGGIPPATLAEFTLAPNGGMDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFND

Query:  PSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP
           C  + Y   F   CP AYSY +    STFTC+ S +YV+TFCP
Subjt:  PSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP

AT4G36010.1 Pathogenesis-related thaumatin superfamily protein5.6e-6751.93Show/hide
Query:  VHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSS-RFSCETGDCASGSISCNGAGGIPPATLAEFTL
        V + TF + N C  T+WP  L SG+G S L +TGF L    +R + +PA W+GRIW RT C  D+++ RF+C TGDC S ++ C+G+G  PPATLAEFTL
Subjt:  VHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSS-RFSCETGDCASGSISCNGAGGIPPATLAEFTL

Query:  APNGGMDFYDVSLVDGFNLPASIATVGG------TGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIF
            G+DFYDVSLVDG+N+P +I   GG       G C +T C A +NG CP +L+V +     + CKSAC AF  P+YCC+  F  P  C  S+YS  F
Subjt:  APNGGMDFYDVSLVDGFNLPASIATVGG------TGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIF

Query:  KNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP
        KN CP+AYSYAYDD TSTFTC G+ +YV+TFCP
Subjt:  KNQCPQAYSYAYDDKTSTFTCSGSPNYVVTFCP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TGTGTTTGTAAAACAGTAGTCCATGCAGCCACATTTGTTGTCAAAAACAATTGTCCACAGACCATATGGCCCGCAACACTAACCAGCGGCAGCGGACAATCCCAACTCTC
CTCCACTGGATTTCAGTTGCTTTCTGGAGCGTCGAGGACCGTAGATGTGCCTGCGCCGTGGACGGGTAGGATCTGGGCTCGAACACGCTGCTTCGTCGATAGCTCTTCAA
GATTTTCGTGCGAAACTGGAGACTGTGCCTCCGGCTCCATCTCTTGCAATGGCGCAGGAGGAATTCCACCGGCCACACTGGCCGAGTTCACATTAGCCCCCAATGGAGGC
ATGGATTTCTACGACGTCAGCCTCGTCGACGGCTTCAACCTGCCGGCTTCCATAGCCACAGTGGGTGGAACAGGCGAATGCCAATCCACAGCTTGTTCTGCAAATGTAAA
CGGAGTTTGCCCAACGGAGTTGCAAGTCAGGTCAGGAGATGGGAGCGTGATTGGGTGTAAGAGCGCATGCCTTGCGTTTAATGAACCACAGTATTGTTGCACCGCGGAAT
TCAACGACCCCAGCAAGTGCGCTCACAGCCAGTACTCGTTGATCTTCAAGAACCAATGCCCTCAGGCTTACAGCTACGCTTATGACGACAAAACCAGCACCTTCACATGC
AGCGTAGTCCATGCAGCCACATTTGTTGTCAAAAACAATTGTCCACAGACCATATGGCCCGCAACACTAACCAGCGGCAGCGGACAATCCCAACTCTCCTCCACTGGATT
TCAGTTGCTTTCTGGAGCGTCGAGGACCGTAGATGTGCCTGCACCGTGGACGGGTAGGATCTGGGCTCGAACGCGCTGCTTCGTCGATAGCTCTTCAAGATTTTCGTGCG
AAACTGGAGACTGTGCCTCCGGCTCCATCTCTTGCAATGGCGCAGGAGGAATTCCACCGGCCACACTGGCCGAGTTCACATTAGCCCCCAATGGAGGCATGGATTTCTAC
GACGTCAGCCTCGTCGACGGCTTCAACCTGCCGGCTTCCATAGCCACAGTGGGTGGAACAGGAGAATGCCAATCCACAGCTTGTTCTGCAAATGTAAACGGGGTTTGCCC
AACGGAGTTGCAAGTCAGGTCAGGAGATGGGAGCGTGATTGGGTGTAAGAGCGCATGCCTTGCGTTTAATGAACCACAGTATTGTTGCACCGCGGAATTCAACGACCCCA
GCAAGTGCGCTCACAGCCAGTACTCGTTGATCTTCAAGAACCAATGCCCTCAGGCTTACAGCTACGCTTATGACGACAAAACCAGCACCTTCACATGCAGCGGTTCGCCC
AATTATGTTGTTACCTTCTGCCCG
mRNA sequenceShow/hide mRNA sequence
TGTGTTTGTAAAACAGTAGTCCATGCAGCCACATTTGTTGTCAAAAACAATTGTCCACAGACCATATGGCCCGCAACACTAACCAGCGGCAGCGGACAATCCCAACTCTC
CTCCACTGGATTTCAGTTGCTTTCTGGAGCGTCGAGGACCGTAGATGTGCCTGCGCCGTGGACGGGTAGGATCTGGGCTCGAACACGCTGCTTCGTCGATAGCTCTTCAA
GATTTTCGTGCGAAACTGGAGACTGTGCCTCCGGCTCCATCTCTTGCAATGGCGCAGGAGGAATTCCACCGGCCACACTGGCCGAGTTCACATTAGCCCCCAATGGAGGC
ATGGATTTCTACGACGTCAGCCTCGTCGACGGCTTCAACCTGCCGGCTTCCATAGCCACAGTGGGTGGAACAGGCGAATGCCAATCCACAGCTTGTTCTGCAAATGTAAA
CGGAGTTTGCCCAACGGAGTTGCAAGTCAGGTCAGGAGATGGGAGCGTGATTGGGTGTAAGAGCGCATGCCTTGCGTTTAATGAACCACAGTATTGTTGCACCGCGGAAT
TCAACGACCCCAGCAAGTGCGCTCACAGCCAGTACTCGTTGATCTTCAAGAACCAATGCCCTCAGGCTTACAGCTACGCTTATGACGACAAAACCAGCACCTTCACATGC
AGCGTAGTCCATGCAGCCACATTTGTTGTCAAAAACAATTGTCCACAGACCATATGGCCCGCAACACTAACCAGCGGCAGCGGACAATCCCAACTCTCCTCCACTGGATT
TCAGTTGCTTTCTGGAGCGTCGAGGACCGTAGATGTGCCTGCACCGTGGACGGGTAGGATCTGGGCTCGAACGCGCTGCTTCGTCGATAGCTCTTCAAGATTTTCGTGCG
AAACTGGAGACTGTGCCTCCGGCTCCATCTCTTGCAATGGCGCAGGAGGAATTCCACCGGCCACACTGGCCGAGTTCACATTAGCCCCCAATGGAGGCATGGATTTCTAC
GACGTCAGCCTCGTCGACGGCTTCAACCTGCCGGCTTCCATAGCCACAGTGGGTGGAACAGGAGAATGCCAATCCACAGCTTGTTCTGCAAATGTAAACGGGGTTTGCCC
AACGGAGTTGCAAGTCAGGTCAGGAGATGGGAGCGTGATTGGGTGTAAGAGCGCATGCCTTGCGTTTAATGAACCACAGTATTGTTGCACCGCGGAATTCAACGACCCCA
GCAAGTGCGCTCACAGCCAGTACTCGTTGATCTTCAAGAACCAATGCCCTCAGGCTTACAGCTACGCTTATGACGACAAAACCAGCACCTTCACATGCAGCGGTTCGCCC
AATTATGTTGTTACCTTCTGCCCG
Protein sequenceShow/hide protein sequence
CVCKTVVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGG
MDFYDVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTC
SVVHAATFVVKNNCPQTIWPATLTSGSGQSQLSSTGFQLLSGASRTVDVPAPWTGRIWARTRCFVDSSSRFSCETGDCASGSISCNGAGGIPPATLAEFTLAPNGGMDFY
DVSLVDGFNLPASIATVGGTGECQSTACSANVNGVCPTELQVRSGDGSVIGCKSACLAFNEPQYCCTAEFNDPSKCAHSQYSLIFKNQCPQAYSYAYDDKTSTFTCSGSP
NYVVTFCP