; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg000023 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg000023
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionTrypsin family protein
Genome locationscaffold1:402956..407710
RNA-Seq ExpressionSpg000023
SyntenySpg000023
Gene Ontology termsGO:0043231 - intracellular membrane-bounded organelle (cellular component)
InterPro domainsIPR009003 - Peptidase S1, PA clan


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052425.1 uncharacterized protein E6C27_scaffold120G00200 [Cucumis melo var. makuwa]2.6e-20185.41Show/hide
Query:  FSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTW----------------------VQLSKAVQE-----PGNLGPGVYLGAVERATSF
        FSYYGAPAATPKEE+YTELVDGLRGSDPT+GSGSQVASQETYGT                       V L    Q+     P +LGPGVYLGAVERATSF
Subjt:  FSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTW----------------------VQLSKAVQE-----PGNLGPGVYLGAVERATSF

Query:  ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF
        ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF
Subjt:  ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF

Query:  LVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQVHEQRNNSVGGIDSTVAE
        LVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITT+DGLQVHEQRNNSVGGIDSTVAE
Subjt:  LVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQVHEQRNNSVGGIDSTVAE

Query:  SCLDRLPLKYRLKENSDPLGLCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEERQEMKNLSSTRNGYDSEVSVSLQ
        SCLDR+PLKYRLKENS+PLG  VQQISPEGESSQG+ISPFKHA FHIENG+E+ PSVELQFIP LTS+SPLHQKNEE QE+KNL++ R G+DSEVSVSLQ
Subjt:  SCLDRLPLKYRLKENSDPLGLCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEERQEMKNLSSTRNGYDSEVSVSLQ

Query:  LG--EPEAKRRKHSDCLSSIKESST
        LG  EPEAKRRK  DCLSSIKESS+
Subjt:  LG--EPEAKRRKHSDCLSSIKESST

XP_008439446.1 PREDICTED: uncharacterized protein LOC103484249 isoform X1 [Cucumis melo]2.8e-20085.25Show/hide
Query:  FSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTW----------------------VQLSKAVQE-----PGNLGPGVYLGAVERATSF
        FSYYGAPAATPKEE+YTELVDGLRGSDPT+GSGSQVASQETYGT                       V L    Q+     P +LGPGVYLGAVERATSF
Subjt:  FSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTW----------------------VQLSKAVQE-----PGNLGPGVYLGAVERATSF

Query:  ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF
        ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF
Subjt:  ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF

Query:  LVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ--VHEQRNNSVGGIDSTV
        LVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITT+DGLQ  VHEQRNNSVGGIDSTV
Subjt:  LVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ--VHEQRNNSVGGIDSTV

Query:  AESCLDRLPLKYRLKENSDPLGLCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEERQEMKNLSSTRNGYDSEVSVS
        AESCLDR+PLKYRLKENS+PLG  VQQISPEGESSQG+ISPFKHA FHIENG+E+ PSVELQFIP LTS+SPLHQKN+E QE+KNLS+ R GYDSEVSVS
Subjt:  AESCLDRLPLKYRLKENSDPLGLCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEERQEMKNLSSTRNGYDSEVSVS

Query:  LQLG--EPEAKRRKHSDCLSSIKESST
        LQLG  EPEAKRRK  DCLSSIKESS+
Subjt:  LQLG--EPEAKRRKHSDCLSSIKESST

XP_008439448.1 PREDICTED: uncharacterized protein LOC103484249 isoform X2 [Cucumis melo]8.8e-20285.65Show/hide
Query:  FSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTW----------------------VQLSKAVQE-----PGNLGPGVYLGAVERATSF
        FSYYGAPAATPKEE+YTELVDGLRGSDPT+GSGSQVASQETYGT                       V L    Q+     P +LGPGVYLGAVERATSF
Subjt:  FSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTW----------------------VQLSKAVQE-----PGNLGPGVYLGAVERATSF

Query:  ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF
        ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF
Subjt:  ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF

Query:  LVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQVHEQRNNSVGGIDSTVAE
        LVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITT+DGLQVHEQRNNSVGGIDSTVAE
Subjt:  LVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQVHEQRNNSVGGIDSTVAE

Query:  SCLDRLPLKYRLKENSDPLGLCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEERQEMKNLSSTRNGYDSEVSVSLQ
        SCLDR+PLKYRLKENS+PLG  VQQISPEGESSQG+ISPFKHA FHIENG+E+ PSVELQFIP LTS+SPLHQKN+E QE+KNLS+ R GYDSEVSVSLQ
Subjt:  SCLDRLPLKYRLKENSDPLGLCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEERQEMKNLSSTRNGYDSEVSVSLQ

Query:  LG--EPEAKRRKHSDCLSSIKESST
        LG  EPEAKRRK  DCLSSIKESS+
Subjt:  LG--EPEAKRRKHSDCLSSIKESST

XP_022978960.1 uncharacterized protein LOC111478754 isoform X2 [Cucurbita maxima]4.1e-19985.55Show/hide
Query:  FSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTW----------------------VQLSKAVQE-----PGNLGPGVYLGAVERATSF
        FSYYGAPAATPKEE+YTELVDGLRGSDPTIGSGSQVASQETYGT                       V L    Q+     P +LGPGVYLGAVERATSF
Subjt:  FSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTW----------------------VQLSKAVQE-----PGNLGPGVYLGAVERATSF

Query:  ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF
        ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNV+TFVKGVGEI DVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF
Subjt:  ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF

Query:  LVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQVHEQRNNSVGGIDSTVAE
        LVVGDDQQTFDLEGDSGSLILLTG D+EKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITT+DGLQVHEQRNNSVGGIDSTVAE
Subjt:  LVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQVHEQRNNSVGGIDSTVAE

Query:  SCLDRLPLKYRLKENSDPLGLCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEERQEMKNLSSTRNGYDSEVSVSLQ
        SCLDR+PL YRLKENS+PLGL VQ+ISPEGESSQGLISPFKHAA  IENGFE+ PSVELQFIP L SSSPLHQKNEERQE+K LS+ RNGYD EVSVSL+
Subjt:  SCLDRLPLKYRLKENSDPLGLCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEERQEMKNLSSTRNGYDSEVSVSLQ

Query:  LGEPEAKRRKHSDCLSSIKESS
        LGEPEAKRRKH D LSSIKESS
Subjt:  LGEPEAKRRKHSDCLSSIKESS

XP_023543759.1 uncharacterized protein LOC111803536 isoform X2 [Cucurbita pepo subsp. pepo]1.7e-20085.78Show/hide
Query:  FSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTW----------------------VQLSKAVQE-----PGNLGPGVYLGAVERATSF
        FSYYGAPAATPKEE+YTELVDGLRGSDPTIGSGSQVASQETYGT                       V L    Q+     P +LGPGVYLGAVERATSF
Subjt:  FSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTW----------------------VQLSKAVQE-----PGNLGPGVYLGAVERATSF

Query:  ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF
        ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNV+TFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF
Subjt:  ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF

Query:  LVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQVHEQRNNSVGGIDSTVAE
        LVVGDDQQTFDLEGDSGSLILLTG DEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITT+DGLQVHEQRNNSVGGIDSTVAE
Subjt:  LVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQVHEQRNNSVGGIDSTVAE

Query:  SCLDRLPLKYRLKENSDPLGLCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEERQEMKNLSSTRNGYDSEVSVSLQ
        SC DR+PL YRL+ENS+PLGL VQ+ISPEGESSQGLISPFKHAA  IENGFE+ PSVELQFIP L SSSPLHQKNEERQE+KNLS+ RNGYD EVSVSL+
Subjt:  SCLDRLPLKYRLKENSDPLGLCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEERQEMKNLSSTRNGYDSEVSVSLQ

Query:  LGEPEAKRRKHSDCLSSIKESS
        LGEPEAKRRKH D LSSIKESS
Subjt:  LGEPEAKRRKHSDCLSSIKESS

TrEMBL top hitse value%identityAlignment
A0A1S3AYD6 uncharacterized protein LOC103484249 isoform X11.4e-20085.25Show/hide
Query:  FSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTW----------------------VQLSKAVQE-----PGNLGPGVYLGAVERATSF
        FSYYGAPAATPKEE+YTELVDGLRGSDPT+GSGSQVASQETYGT                       V L    Q+     P +LGPGVYLGAVERATSF
Subjt:  FSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTW----------------------VQLSKAVQE-----PGNLGPGVYLGAVERATSF

Query:  ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF
        ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF
Subjt:  ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF

Query:  LVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ--VHEQRNNSVGGIDSTV
        LVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITT+DGLQ  VHEQRNNSVGGIDSTV
Subjt:  LVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ--VHEQRNNSVGGIDSTV

Query:  AESCLDRLPLKYRLKENSDPLGLCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEERQEMKNLSSTRNGYDSEVSVS
        AESCLDR+PLKYRLKENS+PLG  VQQISPEGESSQG+ISPFKHA FHIENG+E+ PSVELQFIP LTS+SPLHQKN+E QE+KNLS+ R GYDSEVSVS
Subjt:  AESCLDRLPLKYRLKENSDPLGLCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEERQEMKNLSSTRNGYDSEVSVS

Query:  LQLG--EPEAKRRKHSDCLSSIKESST
        LQLG  EPEAKRRK  DCLSSIKESS+
Subjt:  LQLG--EPEAKRRKHSDCLSSIKESST

A0A1S3AYT3 uncharacterized protein LOC103484249 isoform X24.3e-20285.65Show/hide
Query:  FSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTW----------------------VQLSKAVQE-----PGNLGPGVYLGAVERATSF
        FSYYGAPAATPKEE+YTELVDGLRGSDPT+GSGSQVASQETYGT                       V L    Q+     P +LGPGVYLGAVERATSF
Subjt:  FSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTW----------------------VQLSKAVQE-----PGNLGPGVYLGAVERATSF

Query:  ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF
        ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF
Subjt:  ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF

Query:  LVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQVHEQRNNSVGGIDSTVAE
        LVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITT+DGLQVHEQRNNSVGGIDSTVAE
Subjt:  LVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQVHEQRNNSVGGIDSTVAE

Query:  SCLDRLPLKYRLKENSDPLGLCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEERQEMKNLSSTRNGYDSEVSVSLQ
        SCLDR+PLKYRLKENS+PLG  VQQISPEGESSQG+ISPFKHA FHIENG+E+ PSVELQFIP LTS+SPLHQKN+E QE+KNLS+ R GYDSEVSVSLQ
Subjt:  SCLDRLPLKYRLKENSDPLGLCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEERQEMKNLSSTRNGYDSEVSVSLQ

Query:  LG--EPEAKRRKHSDCLSSIKESST
        LG  EPEAKRRK  DCLSSIKESS+
Subjt:  LG--EPEAKRRKHSDCLSSIKESST

A0A5A7UFD1 Uncharacterized protein1.2e-20185.41Show/hide
Query:  FSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTW----------------------VQLSKAVQE-----PGNLGPGVYLGAVERATSF
        FSYYGAPAATPKEE+YTELVDGLRGSDPT+GSGSQVASQETYGT                       V L    Q+     P +LGPGVYLGAVERATSF
Subjt:  FSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTW----------------------VQLSKAVQE-----PGNLGPGVYLGAVERATSF

Query:  ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF
        ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF
Subjt:  ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF

Query:  LVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQVHEQRNNSVGGIDSTVAE
        LVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITT+DGLQVHEQRNNSVGGIDSTVAE
Subjt:  LVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQVHEQRNNSVGGIDSTVAE

Query:  SCLDRLPLKYRLKENSDPLGLCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEERQEMKNLSSTRNGYDSEVSVSLQ
        SCLDR+PLKYRLKENS+PLG  VQQISPEGESSQG+ISPFKHA FHIENG+E+ PSVELQFIP LTS+SPLHQKNEE QE+KNL++ R G+DSEVSVSLQ
Subjt:  SCLDRLPLKYRLKENSDPLGLCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEERQEMKNLSSTRNGYDSEVSVSLQ

Query:  LG--EPEAKRRKHSDCLSSIKESST
        LG  EPEAKRRK  DCLSSIKESS+
Subjt:  LG--EPEAKRRKHSDCLSSIKESST

A0A6J1ECL8 uncharacterized protein LOC111432974 isoform X23.4e-19985.55Show/hide
Query:  FSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTW----------------------VQLSKAVQE-----PGNLGPGVYLGAVERATSF
        FSYYGAPAATPKEE+YTELVDGLRGSDPTIGSGSQVASQETYGT                       V L    Q+     P +LGPGVYLGAVERATSF
Subjt:  FSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTW----------------------VQLSKAVQE-----PGNLGPGVYLGAVERATSF

Query:  ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF
        ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNV+TFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF
Subjt:  ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF

Query:  LVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQVHEQRNNSVGGIDSTVAE
        LVVGDDQQTFDLEGDSGSLILLTG DEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITT+DGLQVHEQRNNSVGGIDSTVAE
Subjt:  LVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQVHEQRNNSVGGIDSTVAE

Query:  SCLDRLPLKYRLKENSDPLGLCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEERQEMKNLSSTRNGYDSEVSVSLQ
        SC DR+PL YRLKENS+PLGL VQ+ISPEGESSQGLISPFK AA  IENGFE+ PSVELQFIP L SSS LHQKNEERQE+KNLS+ RNGYD EVSVSL+
Subjt:  SCLDRLPLKYRLKENSDPLGLCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEERQEMKNLSSTRNGYDSEVSVSLQ

Query:  LGEPEAKRRKHSDCLSSIKESS
        LGEPEAKRRKH D LSSIKESS
Subjt:  LGEPEAKRRKHSDCLSSIKESS

A0A6J1IRT3 uncharacterized protein LOC111478754 isoform X22.0e-19985.55Show/hide
Query:  FSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTW----------------------VQLSKAVQE-----PGNLGPGVYLGAVERATSF
        FSYYGAPAATPKEE+YTELVDGLRGSDPTIGSGSQVASQETYGT                       V L    Q+     P +LGPGVYLGAVERATSF
Subjt:  FSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTW----------------------VQLSKAVQE-----PGNLGPGVYLGAVERATSF

Query:  ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF
        ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNV+TFVKGVGEI DVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF
Subjt:  ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF

Query:  LVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQVHEQRNNSVGGIDSTVAE
        LVVGDDQQTFDLEGDSGSLILLTG D+EKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITT+DGLQVHEQRNNSVGGIDSTVAE
Subjt:  LVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQVHEQRNNSVGGIDSTVAE

Query:  SCLDRLPLKYRLKENSDPLGLCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEERQEMKNLSSTRNGYDSEVSVSLQ
        SCLDR+PL YRLKENS+PLGL VQ+ISPEGESSQGLISPFKHAA  IENGFE+ PSVELQFIP L SSSPLHQKNEERQE+K LS+ RNGYD EVSVSL+
Subjt:  SCLDRLPLKYRLKENSDPLGLCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEERQEMKNLSSTRNGYDSEVSVSLQ

Query:  LGEPEAKRRKHSDCLSSIKESS
        LGEPEAKRRKH D LSSIKESS
Subjt:  LGEPEAKRRKHSDCLSSIKESS

SwissProt top hitse value%identityAlignment
B4XT64 Protein NARROW LEAF 12.9e-11555.53Show/hide
Query:  FSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTW----------------------VQLSKAVQE-----PGNLGPGVYLGAVERATSF
        FSYYGAPA TPKE++++ELVD L GSD  IGSGSQVAS ET+GT                       V L    Q+     P NLGPGVYLGAVERATSF
Subjt:  FSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTW----------------------VQLSKAVQE-----PGNLGPGVYLGAVERATSF

Query:  ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF
        ITDDVWYGI+AGTNPETFVRADGAFIPFA+DF+++ V T V+GVG+IGDV  IDLQ P+NSLIGR+V KVGRSSG T GT+MAYALEYND KGICFFTD 
Subjt:  ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF

Query:  LVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ--VHEQRNNSVGGIDSTV
        LVVG+++QTFDLEGDSGSLI+LT QD EKPRP+GIIWGGTANRGRLKL     PENWTSGVDLGRLLD LELD+I T++ LQ  V +QR   V  + S V
Subjt:  LVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ--VHEQRNNSVGGIDSTV

Query:  AESCLDRLPL-KYRLKENSDPLGLCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSV----ELQFIPGLTSSSPLHQKNEERQEMKNLSSTRNGYDS
         ES    + + + +++E  +PLG+ +QQ+     ++ G              G E + +V    E QFI      SP+    +  + + NL+   N  + 
Subjt:  AESCLDRLPL-KYRLKENSDPLGLCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSV----ELQFIPGLTSSSPLHQKNEERQEMKNLSSTRNGYDS

Query:  EVSVSLQLGEPEAKRRKHSDCLSSI
        E+++SL LG+ E KR + SD  SS+
Subjt:  EVSVSLQLGEPEAKRRKHSDCLSSI

Arabidopsis top hitse value%identityAlignment
AT2G35155.1 Trypsin family protein4.0e-12859.62Show/hide
Query:  FSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTW----------------------VQLSKAVQE-----PGNLGPGVYLGAVERATSF
        F YYGAPAATPKE++Y ELVDGLRGSDP IGSGSQVASQETYGT                       V L    Q+     P +LGPGVYLGAVERATSF
Subjt:  FSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTW----------------------VQLSKAVQE-----PGNLGPGVYLGAVERATSF

Query:  ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF
        ITDD WYGIFAGTNPETFVRADGAFIPFAEDFN +NV T +KG+GEIGDV+ IDLQSPI+SLIG++V+KVGRSSG T GTIMAYALEYND KGICF TDF
Subjt:  ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF

Query:  LVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ----VHEQRNNSVGGIDS
        LV+G++QQTFDLEGDSGSLILLTG + +KPRPVGIIWGGTANRGRLKL  GQ PENWTSGVDLGRLLDLLELDLIT++  L+      E+RN SV  +DS
Subjt:  LVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ----VHEQRNNSVGGIDS

Query:  TVAESCLDRLPLKYRLKENSDPLGLCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVE--LQFIPGLTSSSPLHQKNEERQEMKNLSSTRNGYDSE
        TV++S               DP+        P G+       PF    FHIE   +    VE  +   P   + S    K +E  ++ NL + +N  + E
Subjt:  TVAESCLDRLPLKYRLKENSDPLGLCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVE--LQFIPGLTSSSPLHQKNEERQEMKNLSSTRNGYDSE

Query:  VSVSLQLGEPEAKRRK
        V++SL LGEP+ K+ K
Subjt:  VSVSLQLGEPEAKRRK

AT3G12950.1 Trypsin family protein8.2e-10552.51Show/hide
Query:  FSYYGAP--AATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTW----------------------VQLSKAVQE-----PGNLGPGVYLGAVERAT
        FSY+G P    TPK+   T++VD L+GSDP IGSGSQVASQET GT                       V L    Q+     P  LGPGVYLGAVERAT
Subjt:  FSYYGAP--AATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTW----------------------VQLSKAVQE-----PGNLGPGVYLGAVERAT

Query:  SFITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVK-GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFF
        SFITDD+W+GIFAGTNPETFVRADGAFIPFA+D++++ V T VK GVGEIG+V  I+LQSP+ SL+G++V+KVGRSSGLT GT++AYALEYND +G+CF 
Subjt:  SFITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVK-GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFF

Query:  TDFLVVGDDQQT-FDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ--VHEQRNNSVGGI
        TDFLVVG++ ++ FDLEGDSGSLI++ G  EEK RP+GIIWGGT +RGRLKLKVG+ PE+WT+GVDLGRLL  L+LDLITT +GL+  V EQR  S  G+
Subjt:  TDFLVVGDDQQT-FDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ--VHEQRNNSVGGI

Query:  DSTVAESCLDRLPLKYRLKENSDPLGLCVQQISPEGESSQGLISPFKHAAFHIENGFEM---APSVELQFIPGLTSSSPLHQKNEERQEMKNLSSTRNGY
         S VA+S    + LK   KE   P            E  +  + P +     +E   E    APSVE QF+P  +         E  +E      T    
Subjt:  DSTVAESCLDRLPLKYRLKENSDPLGLCVQQISPEGESSQGLISPFKHAAFHIENGFEM---APSVELQFIPGLTSSSPLHQKNEERQEMKNLSSTRNGY

Query:  DSEVSVSLQLGEPEAKRRK
        D ++ V L+LG+  AKRR+
Subjt:  DSEVSVSLQLGEPEAKRRK

AT5G45030.1 Trypsin family protein6.0e-13262.74Show/hide
Query:  FSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTW----------------------VQLSKAVQE-----PGNLGPGVYLGAVERATSF
        F YYGAPA TPKE++YTELVD LRGS  +IGSGSQVASQETYGT                       V L    Q+     P +LGPGVYLGAVERATSF
Subjt:  FSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTW----------------------VQLSKAVQE-----PGNLGPGVYLGAVERATSF

Query:  ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF
        ITDD+WYGIFAGTNPETFVRADGAFIPFAEDFN NNV T VKG+GEIGD++  DLQSP+NSLIGRKV+KVGRSSGLT GTIMAYALEYND KGICF TDF
Subjt:  ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF

Query:  LVVGDDQQTFDLEGDSGSLILLTGQDE--EKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ--VHEQRNNSV-GGID
        LVVG++QQTFDLEGDSGSLILL   DE  EKPRPVGIIWGGTANRGRLKLKVG+ PENWTSGVDLGR+L+LLELDLIT+++GLQ  V EQRN  +   +D
Subjt:  LVVGDDQQTFDLEGDSGSLILLTGQDE--EKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ--VHEQRNNSV-GGID

Query:  STVAESCLDRLPL-KYRLKENSDPLGLCVQQISPEGESSQGLISPFKHAAFHIENGFE-MAPSVELQFIPGLTSS-SPLHQK--NEERQEMKNLSSTR-N
        STV ES      + + +  EN +P+ L VQQ+  E ++S        H  F IE+  E +A   E QFIP  +++ S LHQK    E  E KNLSS + +
Subjt:  STVAESCLDRLPL-KYRLKENSDPLGLCVQQISPEGESSQGLISPFKHAAFHIENGFE-MAPSVELQFIPGLTSS-SPLHQK--NEERQEMKNLSSTR-N

Query:  GYDSEVSVSLQLGEPEAKRRKHSD
            E+  SLQLGE + K+RK +D
Subjt:  GYDSEVSVSLQLGEPEAKRRKHSD

AT5G45030.2 Trypsin family protein6.0e-13262.74Show/hide
Query:  FSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTW----------------------VQLSKAVQE-----PGNLGPGVYLGAVERATSF
        F YYGAPA TPKE++YTELVD LRGS  +IGSGSQVASQETYGT                       V L    Q+     P +LGPGVYLGAVERATSF
Subjt:  FSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTW----------------------VQLSKAVQE-----PGNLGPGVYLGAVERATSF

Query:  ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF
        ITDD+WYGIFAGTNPETFVRADGAFIPFAEDFN NNV T VKG+GEIGD++  DLQSP+NSLIGRKV+KVGRSSGLT GTIMAYALEYND KGICF TDF
Subjt:  ITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDF

Query:  LVVGDDQQTFDLEGDSGSLILLTGQDE--EKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ--VHEQRNNSV-GGID
        LVVG++QQTFDLEGDSGSLILL   DE  EKPRPVGIIWGGTANRGRLKLKVG+ PENWTSGVDLGR+L+LLELDLIT+++GLQ  V EQRN  +   +D
Subjt:  LVVGDDQQTFDLEGDSGSLILLTGQDE--EKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ--VHEQRNNSV-GGID

Query:  STVAESCLDRLPL-KYRLKENSDPLGLCVQQISPEGESSQGLISPFKHAAFHIENGFE-MAPSVELQFIPGLTSS-SPLHQK--NEERQEMKNLSSTR-N
        STV ES      + + +  EN +P+ L VQQ+  E ++S        H  F IE+  E +A   E QFIP  +++ S LHQK    E  E KNLSS + +
Subjt:  STVAESCLDRLPL-KYRLKENSDPLGLCVQQISPEGESSQGLISPFKHAAFHIENGFE-MAPSVELQFIPGLTSS-SPLHQK--NEERQEMKNLSSTR-N

Query:  GYDSEVSVSLQLGEPEAKRRKHSD
            E+  SLQLGE + K+RK +D
Subjt:  GYDSEVSVSLQLGEPEAKRRKHSD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCATTGGGGTCAGGGACCTGGAGGTATATGGTGTGATGTTGATGTTGTGGAGGTTCTCCTATTATGGTGCTCCGGCAGCTACACCTAAAGAAGAAATATACACAGA
GCTTGTTGATGGCCTGAGGGGAAGTGATCCAACAATTGGTTCTGGTTCCCAGGTTGCTAGCCAAGAAACTTATGGGACTTGGGTGCAATTGTCAAAAGCCGTACAGGAAC
CCGGCAACCTTGGACCTGGTGTATATCTGGGTGCTGTGGAGAGAGCAACATCGTTTATCACTGATGATGTCTGGTATGGCATCTTTGCTGGAACAAATCCAGAAACATTT
GTGCGAGCTGATGGAGCGTTCATTCCCTTCGCCGAAGATTTCAACATGAATAACGTCGTTACATTTGTAAAAGGCGTCGGTGAGATTGGTGATGTCAACAAAATAGACCT
GCAGTCCCCGATCAACAGTCTCATTGGACGAAAAGTGATCAAGGTTGGAAGAAGTTCGGGCTTGACCAGAGGGACTATAATGGCATATGCCCTCGAGTATAACGATGTAA
AAGGGATTTGTTTCTTCACCGACTTTCTTGTTGTTGGAGATGACCAGCAGACGTTTGACCTTGAAGGTGATAGTGGAAGCCTTATTCTTTTAACTGGTCAGGATGAGGAA
AAACCACGTCCAGTTGGGATTATCTGGGGAGGAACAGCTAATCGAGGTCGGCTGAAATTAAAAGTTGGTCAGCCTCCAGAGAATTGGACCAGTGGAGTTGATCTTGGACG
CCTTCTTGATCTCCTTGAGCTCGATCTTATTACAACAAGTGATGGTTTACAAGTGCATGAACAAAGGAACAATTCAGTTGGAGGGATTGATTCTACTGTTGCGGAGTCCT
GTCTCGATCGGCTGCCGTTAAAATATAGACTTAAAGAGAACTCCGATCCACTTGGCTTATGTGTCCAGCAAATTTCTCCTGAAGGTGAATCCTCCCAGGGGCTGATCTCA
CCTTTTAAGCATGCTGCATTCCACATAGAAAACGGGTTTGAGATGGCTCCAAGTGTCGAACTCCAGTTTATACCAGGATTAACCAGCAGCTCTCCGCTGCATCAGAAGAA
CGAAGAACGCCAAGAGATGAAAAATCTGTCCTCCACGAGAAATGGCTATGATAGCGAGGTATCAGTTTCACTGCAGTTGGGTGAGCCAGAAGCAAAGAGAAGGAAGCACT
CGGATTGTCTTTCAAGTATCAAAGAGTCATCAACATGA
mRNA sequenceShow/hide mRNA sequence
ATGATCATTGGGGTCAGGGACCTGGAGGTATATGGTGTGATGTTGATGTTGTGGAGGTTCTCCTATTATGGTGCTCCGGCAGCTACACCTAAAGAAGAAATATACACAGA
GCTTGTTGATGGCCTGAGGGGAAGTGATCCAACAATTGGTTCTGGTTCCCAGGTTGCTAGCCAAGAAACTTATGGGACTTGGGTGCAATTGTCAAAAGCCGTACAGGAAC
CCGGCAACCTTGGACCTGGTGTATATCTGGGTGCTGTGGAGAGAGCAACATCGTTTATCACTGATGATGTCTGGTATGGCATCTTTGCTGGAACAAATCCAGAAACATTT
GTGCGAGCTGATGGAGCGTTCATTCCCTTCGCCGAAGATTTCAACATGAATAACGTCGTTACATTTGTAAAAGGCGTCGGTGAGATTGGTGATGTCAACAAAATAGACCT
GCAGTCCCCGATCAACAGTCTCATTGGACGAAAAGTGATCAAGGTTGGAAGAAGTTCGGGCTTGACCAGAGGGACTATAATGGCATATGCCCTCGAGTATAACGATGTAA
AAGGGATTTGTTTCTTCACCGACTTTCTTGTTGTTGGAGATGACCAGCAGACGTTTGACCTTGAAGGTGATAGTGGAAGCCTTATTCTTTTAACTGGTCAGGATGAGGAA
AAACCACGTCCAGTTGGGATTATCTGGGGAGGAACAGCTAATCGAGGTCGGCTGAAATTAAAAGTTGGTCAGCCTCCAGAGAATTGGACCAGTGGAGTTGATCTTGGACG
CCTTCTTGATCTCCTTGAGCTCGATCTTATTACAACAAGTGATGGTTTACAAGTGCATGAACAAAGGAACAATTCAGTTGGAGGGATTGATTCTACTGTTGCGGAGTCCT
GTCTCGATCGGCTGCCGTTAAAATATAGACTTAAAGAGAACTCCGATCCACTTGGCTTATGTGTCCAGCAAATTTCTCCTGAAGGTGAATCCTCCCAGGGGCTGATCTCA
CCTTTTAAGCATGCTGCATTCCACATAGAAAACGGGTTTGAGATGGCTCCAAGTGTCGAACTCCAGTTTATACCAGGATTAACCAGCAGCTCTCCGCTGCATCAGAAGAA
CGAAGAACGCCAAGAGATGAAAAATCTGTCCTCCACGAGAAATGGCTATGATAGCGAGGTATCAGTTTCACTGCAGTTGGGTGAGCCAGAAGCAAAGAGAAGGAAGCACT
CGGATTGTCTTTCAAGTATCAAAGAGTCATCAACATGA
Protein sequenceShow/hide protein sequence
MIIGVRDLEVYGVMLMLWRFSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTWVQLSKAVQEPGNLGPGVYLGAVERATSFITDDVWYGIFAGTNPETF
VRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILLTGQDEE
KPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQVHEQRNNSVGGIDSTVAESCLDRLPLKYRLKENSDPLGLCVQQISPEGESSQGLIS
PFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEERQEMKNLSSTRNGYDSEVSVSLQLGEPEAKRRKHSDCLSSIKESST