; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10015783 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10015783
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionTrypsin family protein
Genome locationChr03:229044..237506
RNA-Seq ExpressionHG10015783
SyntenyHG10015783
Gene Ontology termsGO:0043231 - intracellular membrane-bounded organelle (cellular component)
InterPro domainsIPR009003 - Peptidase S1, PA clan


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052425.1 uncharacterized protein E6C27_scaffold120G00200 [Cucumis melo var. makuwa]0.0e+0096.02Show/hide
Query:  MDRTRLDLSFHHAVSTQSEESALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETN-AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
        MDRTRLDL+FHH+VSTQSEESALDLERNYCSHL++PSSSPSPSQCFAPGSQLSETN AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
Subjt:  MDRTRLDLSFHHAVSTQSEESALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETN-AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA

Query:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHMQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDG
        TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVH QWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDG
Subjt:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHMQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDG

Query:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
        LRGSDPT+GSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
Subjt:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD

Query:  GAFIPFAEDFNMNNVITFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
        GAFIPFAEDFNMNNV+TFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
Subjt:  GAFIPFAEDFNMNNVITFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL

Query:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQVHEQRNNSVGGIESTVAESCLDRMPLKYRLKENSEPLGLS
        TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQVHEQRNNSVGGI+STVAESCLDRMPLKYRLKENSEPLG S
Subjt:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQVHEQRNNSVGGIESTVAESCLDRMPLKYRLKENSEPLGLS

Query:  VQQISPEGESSQGMISPFKHAAFHIENGFEATPSVELQFIPRLTSSYPLHQKNEESQKLKNLSALRNGYDSEVSVSLQLG--EPEAKRRKHLDSLSSIKE
        VQQISPEGESSQGMISPFKHA FHIENG+E TPSVELQFIPRLTS+ PLHQKNEE+Q+LKNL+ALR G+DSEVSVSLQLG  EPEAKRRK LD LSSIKE
Subjt:  VQQISPEGESSQGMISPFKHAAFHIENGFEATPSVELQFIPRLTSSYPLHQKNEESQKLKNLSALRNGYDSEVSVSLQLG--EPEAKRRKHLDSLSSIKE

Query:  SSS
        SSS
Subjt:  SSS

XP_008439446.1 PREDICTED: uncharacterized protein LOC103484249 isoform X1 [Cucumis melo]0.0e+0095.87Show/hide
Query:  MDRTRLDLSFHHAVSTQSEESALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETN-AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
        MDRTRLDL+FHH+VSTQSEESALDLERNYCSHL++PSSSPSPSQCFAPGSQLSETN AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
Subjt:  MDRTRLDLSFHHAVSTQSEESALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETN-AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA

Query:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHMQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDG
        TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVH QWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDG
Subjt:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHMQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDG

Query:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
        LRGSDPT+GSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
Subjt:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD

Query:  GAFIPFAEDFNMNNVITFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
        GAFIPFAEDFNMNNV+TFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
Subjt:  GAFIPFAEDFNMNNVITFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL

Query:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQ--VHEQRNNSVGGIESTVAESCLDRMPLKYRLKENSEPLG
        TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQ  VHEQRNNSVGGI+STVAESCLDRMPLKYRLKENSEPLG
Subjt:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQ--VHEQRNNSVGGIESTVAESCLDRMPLKYRLKENSEPLG

Query:  LSVQQISPEGESSQGMISPFKHAAFHIENGFEATPSVELQFIPRLTSSYPLHQKNEESQKLKNLSALRNGYDSEVSVSLQLG--EPEAKRRKHLDSLSSI
         SVQQISPEGESSQGMISPFKHA FHIENG+E TPSVELQFIPRLTS+ PLHQKN+E+Q+LKNLSALR GYDSEVSVSLQLG  EPEAKRRK LD LSSI
Subjt:  LSVQQISPEGESSQGMISPFKHAAFHIENGFEATPSVELQFIPRLTSSYPLHQKNEESQKLKNLSALRNGYDSEVSVSLQLG--EPEAKRRKHLDSLSSI

Query:  KESSS
        KESSS
Subjt:  KESSS

XP_008439448.1 PREDICTED: uncharacterized protein LOC103484249 isoform X2 [Cucumis melo]0.0e+0096.19Show/hide
Query:  MDRTRLDLSFHHAVSTQSEESALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETN-AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
        MDRTRLDL+FHH+VSTQSEESALDLERNYCSHL++PSSSPSPSQCFAPGSQLSETN AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
Subjt:  MDRTRLDLSFHHAVSTQSEESALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETN-AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA

Query:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHMQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDG
        TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVH QWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDG
Subjt:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHMQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDG

Query:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
        LRGSDPT+GSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
Subjt:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD

Query:  GAFIPFAEDFNMNNVITFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
        GAFIPFAEDFNMNNV+TFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
Subjt:  GAFIPFAEDFNMNNVITFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL

Query:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQVHEQRNNSVGGIESTVAESCLDRMPLKYRLKENSEPLGLS
        TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQVHEQRNNSVGGI+STVAESCLDRMPLKYRLKENSEPLG S
Subjt:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQVHEQRNNSVGGIESTVAESCLDRMPLKYRLKENSEPLGLS

Query:  VQQISPEGESSQGMISPFKHAAFHIENGFEATPSVELQFIPRLTSSYPLHQKNEESQKLKNLSALRNGYDSEVSVSLQLG--EPEAKRRKHLDSLSSIKE
        VQQISPEGESSQGMISPFKHA FHIENG+E TPSVELQFIPRLTS+ PLHQKN+E+Q+LKNLSALR GYDSEVSVSLQLG  EPEAKRRK LD LSSIKE
Subjt:  VQQISPEGESSQGMISPFKHAAFHIENGFEATPSVELQFIPRLTSSYPLHQKNEESQKLKNLSALRNGYDSEVSVSLQLG--EPEAKRRKHLDSLSSIKE

Query:  SSS
        SSS
Subjt:  SSS

XP_023543759.1 uncharacterized protein LOC111803536 isoform X2 [Cucurbita pepo subsp. pepo]0.0e+0096.33Show/hide
Query:  MDRTRLDLSFHHAVSTQSEESALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETN-AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
        MDRTRLDLS HH+VSTQSEESALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETN AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
Subjt:  MDRTRLDLSFHHAVSTQSEESALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETN-AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA

Query:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHMQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDG
        TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVH QWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDG
Subjt:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHMQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDG

Query:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
        LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGT+QVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
Subjt:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD

Query:  GAFIPFAEDFNMNNVITFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
        GAFIPFAEDFNMNNVITFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
Subjt:  GAFIPFAEDFNMNNVITFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL

Query:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQVHEQRNNSVGGIESTVAESCLDRMPLKYRLKENSEPLGLS
        TG DEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQVHEQRNNSVGGI+STVAESC DRMPL YRL+ENSEPLGLS
Subjt:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQVHEQRNNSVGGIESTVAESCLDRMPLKYRLKENSEPLGLS

Query:  VQQISPEGESSQGMISPFKHAAFHIENGFEATPSVELQFIPRLTSSYPLHQKNEESQKLKNLSALRNGYDSEVSVSLQLGEPEAKRRKHLDSLSSIKESS
        VQ+ISPEGESSQG+ISPFKHAA  IENGFE TPSVELQFIPRL SS PLHQKNEE Q+LKNLS LRNGYD EVSVSL+LGEPEAKRRKHLDSLSSIKESS
Subjt:  VQQISPEGESSQGMISPFKHAAFHIENGFEATPSVELQFIPRLTSSYPLHQKNEESQKLKNLSALRNGYDSEVSVSLQLGEPEAKRRKHLDSLSSIKESS

XP_038877731.1 LOW QUALITY PROTEIN: protein NARROW LEAF 1-like [Benincasa hispida]0.0e+0097.67Show/hide
Query:  MDRTRLDLSFHHAVSTQSEESALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETNAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRAT
        MDRTRLDLSFHH+VSTQSEESALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETNAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRAT
Subjt:  MDRTRLDLSFHHAVSTQSEESALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETNAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRAT

Query:  TLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHMQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDGL
        TLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVH QWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDGL
Subjt:  TLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHMQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDGL

Query:  RGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRADG
        RGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRADG
Subjt:  RGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRADG

Query:  AFIPFAEDFNMNNVITFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILLT
        AFIPFAEDFNMNNVITFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIM YALEYNDVKGIC FTDFLVVGDDQQTFDLEGDSGSLILLT
Subjt:  AFIPFAEDFNMNNVITFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILLT

Query:  GQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQ--VHEQRNNSVGGIESTVAESCLDRMPLKYRLKENSEPLGL
        GQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDG Q  VHEQRNNSVGGI+STVAESCLDRMPLKYRLKENSE LGL
Subjt:  GQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQ--VHEQRNNSVGGIESTVAESCLDRMPLKYRLKENSEPLGL

Query:  SVQQISPEGESSQGMISPFKHAAFHIENGFEATPSVELQFIPRLTSSYPLHQKNEESQKLKNLSALRNGYDSEVSVSLQLGEPEAKRRKHLDSLSSIKES
        SVQQI PEGESSQGMISPFKHAAFHIENGFE TPSVELQFIPRLTSS  L QKNEESQKLKNLSALRNGYDSEVSVSLQLGEPEAKRRKHLDSLSSIKES
Subjt:  SVQQISPEGESSQGMISPFKHAAFHIENGFEATPSVELQFIPRLTSSYPLHQKNEESQKLKNLSALRNGYDSEVSVSLQLGEPEAKRRKHLDSLSSIKES

Query:  SS
        SS
Subjt:  SS

TrEMBL top hitse value%identityAlignment
A0A1S3AYD6 uncharacterized protein LOC103484249 isoform X10.0e+0095.87Show/hide
Query:  MDRTRLDLSFHHAVSTQSEESALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETN-AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
        MDRTRLDL+FHH+VSTQSEESALDLERNYCSHL++PSSSPSPSQCFAPGSQLSETN AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
Subjt:  MDRTRLDLSFHHAVSTQSEESALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETN-AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA

Query:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHMQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDG
        TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVH QWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDG
Subjt:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHMQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDG

Query:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
        LRGSDPT+GSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
Subjt:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD

Query:  GAFIPFAEDFNMNNVITFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
        GAFIPFAEDFNMNNV+TFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
Subjt:  GAFIPFAEDFNMNNVITFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL

Query:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQ--VHEQRNNSVGGIESTVAESCLDRMPLKYRLKENSEPLG
        TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQ  VHEQRNNSVGGI+STVAESCLDRMPLKYRLKENSEPLG
Subjt:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQ--VHEQRNNSVGGIESTVAESCLDRMPLKYRLKENSEPLG

Query:  LSVQQISPEGESSQGMISPFKHAAFHIENGFEATPSVELQFIPRLTSSYPLHQKNEESQKLKNLSALRNGYDSEVSVSLQLG--EPEAKRRKHLDSLSSI
         SVQQISPEGESSQGMISPFKHA FHIENG+E TPSVELQFIPRLTS+ PLHQKN+E+Q+LKNLSALR GYDSEVSVSLQLG  EPEAKRRK LD LSSI
Subjt:  LSVQQISPEGESSQGMISPFKHAAFHIENGFEATPSVELQFIPRLTSSYPLHQKNEESQKLKNLSALRNGYDSEVSVSLQLG--EPEAKRRKHLDSLSSI

Query:  KESSS
        KESSS
Subjt:  KESSS

A0A1S3AYT3 uncharacterized protein LOC103484249 isoform X20.0e+0096.19Show/hide
Query:  MDRTRLDLSFHHAVSTQSEESALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETN-AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
        MDRTRLDL+FHH+VSTQSEESALDLERNYCSHL++PSSSPSPSQCFAPGSQLSETN AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
Subjt:  MDRTRLDLSFHHAVSTQSEESALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETN-AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA

Query:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHMQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDG
        TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVH QWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDG
Subjt:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHMQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDG

Query:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
        LRGSDPT+GSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
Subjt:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD

Query:  GAFIPFAEDFNMNNVITFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
        GAFIPFAEDFNMNNV+TFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
Subjt:  GAFIPFAEDFNMNNVITFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL

Query:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQVHEQRNNSVGGIESTVAESCLDRMPLKYRLKENSEPLGLS
        TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQVHEQRNNSVGGI+STVAESCLDRMPLKYRLKENSEPLG S
Subjt:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQVHEQRNNSVGGIESTVAESCLDRMPLKYRLKENSEPLGLS

Query:  VQQISPEGESSQGMISPFKHAAFHIENGFEATPSVELQFIPRLTSSYPLHQKNEESQKLKNLSALRNGYDSEVSVSLQLG--EPEAKRRKHLDSLSSIKE
        VQQISPEGESSQGMISPFKHA FHIENG+E TPSVELQFIPRLTS+ PLHQKN+E+Q+LKNLSALR GYDSEVSVSLQLG  EPEAKRRK LD LSSIKE
Subjt:  VQQISPEGESSQGMISPFKHAAFHIENGFEATPSVELQFIPRLTSSYPLHQKNEESQKLKNLSALRNGYDSEVSVSLQLG--EPEAKRRKHLDSLSSIKE

Query:  SSS
        SSS
Subjt:  SSS

A0A5A7UFD1 Uncharacterized protein0.0e+0096.02Show/hide
Query:  MDRTRLDLSFHHAVSTQSEESALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETN-AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
        MDRTRLDL+FHH+VSTQSEESALDLERNYCSHL++PSSSPSPSQCFAPGSQLSETN AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
Subjt:  MDRTRLDLSFHHAVSTQSEESALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETN-AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA

Query:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHMQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDG
        TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVH QWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDG
Subjt:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHMQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDG

Query:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
        LRGSDPT+GSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
Subjt:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD

Query:  GAFIPFAEDFNMNNVITFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
        GAFIPFAEDFNMNNV+TFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
Subjt:  GAFIPFAEDFNMNNVITFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL

Query:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQVHEQRNNSVGGIESTVAESCLDRMPLKYRLKENSEPLGLS
        TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQVHEQRNNSVGGI+STVAESCLDRMPLKYRLKENSEPLG S
Subjt:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQVHEQRNNSVGGIESTVAESCLDRMPLKYRLKENSEPLGLS

Query:  VQQISPEGESSQGMISPFKHAAFHIENGFEATPSVELQFIPRLTSSYPLHQKNEESQKLKNLSALRNGYDSEVSVSLQLG--EPEAKRRKHLDSLSSIKE
        VQQISPEGESSQGMISPFKHA FHIENG+E TPSVELQFIPRLTS+ PLHQKNEE+Q+LKNL+ALR G+DSEVSVSLQLG  EPEAKRRK LD LSSIKE
Subjt:  VQQISPEGESSQGMISPFKHAAFHIENGFEATPSVELQFIPRLTSSYPLHQKNEESQKLKNLSALRNGYDSEVSVSLQLG--EPEAKRRKHLDSLSSIKE

Query:  SSS
        SSS
Subjt:  SSS

A0A6J1ECL8 uncharacterized protein LOC111432974 isoform X20.0e+0096.17Show/hide
Query:  MDRTRLDLSFHHAVSTQSEESALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETN-AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
        MDRTRLDLS HH+VSTQSEESALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETN AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
Subjt:  MDRTRLDLSFHHAVSTQSEESALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETN-AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA

Query:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHMQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDG
        TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVH QWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDG
Subjt:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHMQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDG

Query:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
        LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGT+QVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
Subjt:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD

Query:  GAFIPFAEDFNMNNVITFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
        GAFIPFAEDFNMNNVITFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
Subjt:  GAFIPFAEDFNMNNVITFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL

Query:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQVHEQRNNSVGGIESTVAESCLDRMPLKYRLKENSEPLGLS
        TG DEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQVHEQRNNSVGGI+STVAESC DRMPL YRLKENSEPLGLS
Subjt:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQVHEQRNNSVGGIESTVAESCLDRMPLKYRLKENSEPLGLS

Query:  VQQISPEGESSQGMISPFKHAAFHIENGFEATPSVELQFIPRLTSSYPLHQKNEESQKLKNLSALRNGYDSEVSVSLQLGEPEAKRRKHLDSLSSIKESS
        VQ+ISPEGESSQG+ISPFK AA  IENGFE TPSVELQFIPRL SS  LHQKNEE Q+LKNLS LRNGYD EVSVSL+LGEPEAKRRKHLDSLSSIKESS
Subjt:  VQQISPEGESSQGMISPFKHAAFHIENGFEATPSVELQFIPRLTSSYPLHQKNEESQKLKNLSALRNGYDSEVSVSLQLGEPEAKRRKHLDSLSSIKESS

A0A6J1IRT3 uncharacterized protein LOC111478754 isoform X20.0e+0096Show/hide
Query:  MDRTRLDLSFHHAVSTQSEESALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETN-AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
        MDRTRLDLS HH+VSTQSEESALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETN AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
Subjt:  MDRTRLDLSFHHAVSTQSEESALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETN-AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA

Query:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHMQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDG
        TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVH QWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDG
Subjt:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHMQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDG

Query:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
        LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGT+QVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
Subjt:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD

Query:  GAFIPFAEDFNMNNVITFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
        GAFIPFAEDFNMNNVITFVKGVGEI DVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
Subjt:  GAFIPFAEDFNMNNVITFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL

Query:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQVHEQRNNSVGGIESTVAESCLDRMPLKYRLKENSEPLGLS
        TG D+EKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQVHEQRNNSVGGI+STVAESCLDRMPL YRLKENSEPLGLS
Subjt:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQVHEQRNNSVGGIESTVAESCLDRMPLKYRLKENSEPLGLS

Query:  VQQISPEGESSQGMISPFKHAAFHIENGFEATPSVELQFIPRLTSSYPLHQKNEESQKLKNLSALRNGYDSEVSVSLQLGEPEAKRRKHLDSLSSIKESS
        VQ+ISPEGESSQG+ISPFKHAA  IENGFE  PSVELQFIPRL SS PLHQKNEE Q+LK LS LRNGYD EVSVSL+LGEPEAKRRKHLDSLSSIKESS
Subjt:  VQQISPEGESSQGMISPFKHAAFHIENGFEATPSVELQFIPRLTSSYPLHQKNEESQKLKNLSALRNGYDSEVSVSLQLGEPEAKRRKHLDSLSSIKESS

SwissProt top hitse value%identityAlignment
B4XT64 Protein NARROW LEAF 11.7e-20163.82Show/hide
Query:  QSEESALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETNAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRATTLLELMTIRAFHSKIL
        QSEES+LD++    S    PS  P  S C    +    + AYF WPTS+  + AAE RANYFGNLQKG+LP   GRLP GQ+A +LL+LMTIRAFHSKIL
Subjt:  QSEESALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETNAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRATTLLELMTIRAFHSKIL

Query:  RRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHMQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDGLRGSDPTIGSGSQVASQ
        RRFSLGTA+GFRI+KG LTDIPAI+VFVARKVH +WL+  QCLPA LEGPGG+WCDVDVVEFSYYGAPA TPKE++++ELVD L GSD  IGSGSQVAS 
Subjt:  RRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHMQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDGLRGSDPTIGSGSQVASQ

Query:  ETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVIT
        ET+GTLGAIVK RTG +QVGFLTN HVAVDLDYP+QKMFHPLPP+LGPGVYLGAVERATSFITDDVWYGI+AGTNPETFVRADGAFIPFA+DF+++ V T
Subjt:  ETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVIT

Query:  FVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGG
         V+GVG+IGDV  IDLQ P+NSLIGR+V KVGRSSG T GT+MAYALEYND KGICFFTD LVVG+++QTFDLEGDSGSLI+LT QD EKPRP+GIIWGG
Subjt:  FVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGG

Query:  TANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQ--VHEQRNNSVGGIESTVAESCLDRMPL-KYRLKENSEPLGLSVQQISPEGESSQGM
        TANRGRLKL     PENWTSGVDLGRLLD LELD+I TN+ LQ  V +QR   V  + S V ES    + + + +++E  EPLG+ +QQ+     ++ G 
Subjt:  TANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQ--VHEQRNNSVGGIESTVAESCLDRMPL-KYRLKENSEPLGLSVQQISPEGESSQGM

Query:  ISPFKHAAFHIENGFEATPSV----ELQFIPRLTSSYPLHQKNEESQKLKNLSALRNGYDSEVSVSLQLGEPEAKRRKHLDSLSSI
                     G EA+ +V    E QFI       P+    +  + + NL+   N  + E+++SL LG+ E KR +  DS SS+
Subjt:  ISPFKHAAFHIENGFEATPSV----ELQFIPRLTSSYPLHQKNEESQKLKNLSALRNGYDSEVSVSLQLGEPEAKRRKHLDSLSSI

Arabidopsis top hitse value%identityAlignment
AT2G35155.1 Trypsin family protein6.7e-23370.89Show/hide
Query:  AVSTQSEESALDLERN-YCSHLNMP-SSSPSPSQCFAPGSQLSETNA-YFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRATTLLELMTIR
        A S++SE+SALDLERN +C+HL++P SSSPSP Q F    Q +E+NA YFSWPT SRLND  EDRANYFGNLQKGVLPE +GRLP+GQ+ATTLLELMTIR
Subjt:  AVSTQSEESALDLERN-YCSHLNMP-SSSPSPSQCFAPGSQLSETNA-YFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRATTLLELMTIR

Query:  AFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHMQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDGLRGSDPTIGS
        AFHSKILRRFSLGTA+GFRI +G+LT++PAI+VFVARKVH QWL+ +QCLP+ALEGPGG+WCDVDVVEF YYGAPAATPKE+VY ELVDGLRGSDP IGS
Subjt:  AFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHMQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDGLRGSDPTIGS

Query:  GSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRADGAFIPFAEDF
        GSQVASQETYGTLGAIVKSRTG  QVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDD WYGIFAGTNPETFVRADGAFIPFAEDF
Subjt:  GSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRADGAFIPFAEDF

Query:  NMNNVITFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILLTGQDEEKPRP
        N +NV T +KG+GEIGDV+ IDLQSPI+SLIG++V+KVGRSSG T GTIMAYALEYND KGICF TDFLV+G++QQTFDLEGDSGSLILLTG + +KPRP
Subjt:  NMNNVITFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILLTGQDEEKPRP

Query:  VGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQ----VHEQRNNSVGGIESTVAESCLDRMPLKYRLKENSEPLGLSVQQISP
        VGIIWGGTANRGRLKL  GQ PENWTSGVDLGRLLDLLELDLIT+N  L+      E+RN SV  ++STV++S                    S     P
Subjt:  VGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQ----VHEQRNNSVGGIESTVAESCLDRMPLKYRLKENSEPLGLSVQQISP

Query:  EGESSQGMISPFKHAAFHIENGFEATPSVE--LQFIPRLTSSYPLHQKNEESQKLKNLSALRNGYDSEVSVSLQLGEPEAKRRK
         G+       PF    FHIE   + T  VE  +   P   +      K +E  KL NL AL+N  + EV++SL LGEP+ K+ K
Subjt:  EGESSQGMISPFKHAAFHIENGFEATPSVE--LQFIPRLTSSYPLHQKNEESQKLKNLSALRNGYDSEVSVSLQLGEPEAKRRK

AT3G12950.1 Trypsin family protein8.0e-18662.89Show/hide
Query:  AYFSWPTSSRLNDAAEDRANYFGNLQK------GVLPEILGRLPTGQRATTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHM
        +YFSWPTSSRL++AAE+RANYF NLQK       V PE +   P GQRATTLLELMTIRAFHSK+LR +SLGTAIGFRI++G+LTDIPAIIVFV+RKVH 
Subjt:  AYFSWPTSSRLNDAAEDRANYFGNLQK------GVLPEILGRLPTGQRATTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHM

Query:  QWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAP--AATPKEEVYTELVDGLRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLD
        QWLS +QCLP ALEG GGIWCDVDVVEFSY+G P    TPK+   T++VD L+GSDP IGSGSQVASQET GTLGAIV+S+TG RQVGF+TNRHVAV+LD
Subjt:  QWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAP--AATPKEEVYTELVDGLRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLD

Query:  YPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVITFVK-GVGEIGDVNKIDLQSPINSLIGRKVIKV
        YPSQKMFHPLPP+LGPGVYLGAVERATSFITDD+W+GIFAGTNPETFVRADGAFIPFA+D++++ V T VK GVGEIG+V  I+LQSP+ SL+G++V+KV
Subjt:  YPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVITFVK-GVGEIGDVNKIDLQSPINSLIGRKVIKV

Query:  GRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQT-FDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDL
        GRSSGLT GT++AYALEYND +G+CF TDFLVVG++ ++ FDLEGDSGSLI++ G  EEK RP+GIIWGGT +RGRLKLKVG+ PE+WT+GVDLGRLL  
Subjt:  GRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQT-FDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDL

Query:  LELDLITTNDGLQ--VHEQRNNSVGGIESTVAESCLDRMPLKYRLKENSEP-LGLSVQQISPEGESSQGMISPFKHAAFHIENGFE---ATPSVELQFIP
        L+LDLITT++GL+  V EQR  S  G+ S VA+S              S P + L  ++ SPE E  +  + P +     +E   E     PSVE QF+P
Subjt:  LELDLITTNDGLQ--VHEQRNNSVGGIESTVAESCLDRMPLKYRLKENSEP-LGLSVQQISPEGESSQGMISPFKHAAFHIENGFE---ATPSVELQFIP

Query:  RLTSSYPLHQKNEESQKLKNLSALRNGYDSEVSVSLQLGEPEAKRRK
          +         E +++           D ++ V L+LG+  AKRR+
Subjt:  RLTSSYPLHQKNEESQKLKNLSALRNGYDSEVSVSLQLGEPEAKRRK

AT5G45030.1 Trypsin family protein5.7e-23270.1Show/hide
Query:  MDRTRLDLSFHHAVSTQSEES-ALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETN---AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTG
        M+  RLDL FHH+ S+QS ES ALDL++N  +H+ + SS  SP Q F  G+Q  ET+   AYFSWPTSSRLND+AEDRANYF NLQKGVLPE    LPTG
Subjt:  MDRTRLDLSFHHAVSTQSEES-ALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETN---AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTG

Query:  QRATTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHMQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTEL
        ++ATTLLELM IRAFHSK LRRFSLGTAIGFRI++G+LT+I AI+VFVARKVH QWL+ +QCLP ALEGPGG+WCDVDVVEF YYGAPA TPKE+VYTEL
Subjt:  QRATTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHMQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTEL

Query:  VDGLRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFV
        VD LRGS  +IGSGSQVASQETYGTLGAIVKS+TG RQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDD+WYGIFAGTNPETFV
Subjt:  VDGLRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFV

Query:  RADGAFIPFAEDFNMNNVITFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSL
        RADGAFIPFAEDFN NNV T VKG+GEIGD++  DLQSP+NSLIGRKV+KVGRSSGLT GTIMAYALEYND KGICF TDFLVVG++QQTFDLEGDSGSL
Subjt:  RADGAFIPFAEDFNMNNVITFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSL

Query:  ILLTGQDE--EKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQ--VHEQRNNSV-GGIESTVAESCLDRMPL-KYRLK
        ILL   DE  EKPRPVGIIWGGTANRGRLKLKVG+ PENWTSGVDLGR+L+LLELDLIT+N+GLQ  V EQRN  +   ++STV ES      + + +  
Subjt:  ILLTGQDE--EKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQ--VHEQRNNSV-GGIESTVAESCLDRMPL-KYRLK

Query:  ENSEPLGLSVQQISPEGESSQGMISPFKHAAFHIENGFEATPSV-ELQFIPRLTSS-YPLHQK--NEESQKLKNLSALR-NGYDSEVSVSLQLGEPEAKR
        EN EP+ L+VQQ+  E ++S        H  F IE+  E+   + E QFIP  +++   LHQK    E+ + KNLS+L+ +    E+  SLQLGE + K+
Subjt:  ENSEPLGLSVQQISPEGESSQGMISPFKHAAFHIENGFEATPSV-ELQFIPRLTSS-YPLHQK--NEESQKLKNLSALR-NGYDSEVSVSLQLGEPEAKR

Query:  RKHLDSLSSIKE
        RK  DS    +E
Subjt:  RKHLDSLSSIKE

AT5G45030.2 Trypsin family protein5.7e-23270.1Show/hide
Query:  MDRTRLDLSFHHAVSTQSEES-ALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETN---AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTG
        M+  RLDL FHH+ S+QS ES ALDL++N  +H+ + SS  SP Q F  G+Q  ET+   AYFSWPTSSRLND+AEDRANYF NLQKGVLPE    LPTG
Subjt:  MDRTRLDLSFHHAVSTQSEES-ALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETN---AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTG

Query:  QRATTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHMQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTEL
        ++ATTLLELM IRAFHSK LRRFSLGTAIGFRI++G+LT+I AI+VFVARKVH QWL+ +QCLP ALEGPGG+WCDVDVVEF YYGAPA TPKE+VYTEL
Subjt:  QRATTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHMQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTEL

Query:  VDGLRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFV
        VD LRGS  +IGSGSQVASQETYGTLGAIVKS+TG RQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDD+WYGIFAGTNPETFV
Subjt:  VDGLRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFV

Query:  RADGAFIPFAEDFNMNNVITFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSL
        RADGAFIPFAEDFN NNV T VKG+GEIGD++  DLQSP+NSLIGRKV+KVGRSSGLT GTIMAYALEYND KGICF TDFLVVG++QQTFDLEGDSGSL
Subjt:  RADGAFIPFAEDFNMNNVITFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSL

Query:  ILLTGQDE--EKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQ--VHEQRNNSV-GGIESTVAESCLDRMPL-KYRLK
        ILL   DE  EKPRPVGIIWGGTANRGRLKLKVG+ PENWTSGVDLGR+L+LLELDLIT+N+GLQ  V EQRN  +   ++STV ES      + + +  
Subjt:  ILLTGQDE--EKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTNDGLQ--VHEQRNNSV-GGIESTVAESCLDRMPL-KYRLK

Query:  ENSEPLGLSVQQISPEGESSQGMISPFKHAAFHIENGFEATPSV-ELQFIPRLTSS-YPLHQK--NEESQKLKNLSALR-NGYDSEVSVSLQLGEPEAKR
        EN EP+ L+VQQ+  E ++S        H  F IE+  E+   + E QFIP  +++   LHQK    E+ + KNLS+L+ +    E+  SLQLGE + K+
Subjt:  ENSEPLGLSVQQISPEGESSQGMISPFKHAAFHIENGFEATPSV-ELQFIPRLTSS-YPLHQK--NEESQKLKNLSALR-NGYDSEVSVSLQLGEPEAKR

Query:  RKHLDSLSSIKE
        RK  DS    +E
Subjt:  RKHLDSLSSIKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACAGGACAAGACTGGATTTAAGCTTTCATCATGCAGTATCAACACAATCAGAGGAATCTGCCTTGGACTTAGAAAGGAACTATTGCAGTCATCTTAATATGCCTTC
ATCAAGTCCATCACCTAGTCAATGTTTTGCTCCGGGTAGTCAGCTGTCTGAGACCAATGCTTATTTTTCATGGCCCACTTCCAGCCGTTTAAACGACGCTGCAGAAGATA
GAGCAAACTATTTTGGGAACCTTCAGAAAGGCGTGCTTCCTGAAATTCTGGGCCGGCTTCCCACTGGGCAGCGAGCTACTACTTTGCTCGAGCTAATGACCATAAGGGCA
TTCCATAGCAAGATCTTGCGTCGTTTTAGCCTTGGAACTGCAATAGGATTCCGAATCCAGAAGGGTATGTTGACAGATATCCCTGCTATTATTGTCTTTGTTGCACGCAA
AGTTCACATGCAGTGGCTCAGTGATGTTCAATGTCTACCTGCTGCACTTGAGGGACCTGGAGGTATATGGTGCGATGTTGACGTTGTGGAGTTCTCCTATTATGGGGCAC
CGGCAGCTACACCTAAAGAAGAAGTATACACAGAGCTTGTTGATGGCCTGAGGGGAAGCGATCCAACAATTGGTTCCGGTTCCCAGGTTGCTAGCCAAGAAACTTATGGG
ACTTTGGGTGCAATTGTAAAAAGCCGGACAGGAACCCGGCAAGTTGGTTTCCTTACAAACCGGCATGTTGCAGTTGATTTAGACTACCCTAGTCAGAAAATGTTTCATCC
TTTGCCTCCCAGCCTTGGACCTGGTGTTTATCTGGGTGCTGTGGAGAGAGCAACGTCGTTTATCACTGATGATGTCTGGTATGGCATCTTTGCTGGAACAAATCCAGAAA
CATTTGTGCGAGCTGATGGAGCGTTCATTCCATTTGCTGAAGATTTTAACATGAACAACGTCATTACATTTGTAAAAGGCGTCGGTGAGATTGGTGATGTCAACAAAATA
GACCTGCAATCCCCAATCAACAGTCTTATTGGACGAAAGGTGATCAAGGTTGGAAGAAGTTCAGGCTTGACCAGAGGGACTATAATGGCATATGCCCTCGAGTATAACGA
TGTAAAAGGGATTTGTTTCTTCACTGACTTTCTTGTTGTTGGAGATGACCAACAGACGTTTGACCTTGAAGGTGACAGTGGAAGCCTTATTCTTTTAACTGGTCAGGATG
AGGAAAAGCCACGTCCAGTTGGGATTATCTGGGGAGGAACAGCTAATCGGGGTCGACTGAAATTAAAAGTTGGTCAGCCACCAGAGAATTGGACCAGTGGAGTTGATCTT
GGACGCCTTCTTGATCTCCTTGAGCTCGATCTTATTACTACAAATGATGGTTTACAAGTGCATGAACAAAGGAATAATTCAGTTGGAGGGATTGAGTCTACTGTTGCGGA
GTCCTGTCTTGATCGGATGCCATTAAAATATAGACTTAAAGAGAACTCTGAGCCGCTCGGTTTAAGTGTCCAGCAAATTTCCCCTGAAGGTGAATCCTCTCAGGGGATGA
TCTCACCTTTTAAGCATGCTGCATTTCACATAGAAAATGGGTTTGAGGCGACTCCAAGCGTCGAACTCCAATTTATACCAAGATTAACCAGCAGCTATCCTCTACATCAG
AAGAACGAAGAAAGCCAAAAGTTGAAAAATCTGTCTGCCCTGAGGAATGGCTATGATAGTGAAGTTTCAGTTTCGCTGCAGTTGGGTGAGCCAGAAGCAAAGAGAAGGAA
GCACTTGGATTCTCTTTCAAGTATCAAAGAGTCATCTTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGACAGGACAAGACTGGATTTAAGCTTTCATCATGCAGTATCAACACAATCAGAGGAATCTGCCTTGGACTTAGAAAGGAACTATTGCAGTCATCTTAATATGCCTTC
ATCAAGTCCATCACCTAGTCAATGTTTTGCTCCGGGTAGTCAGCTGTCTGAGACCAATGCTTATTTTTCATGGCCCACTTCCAGCCGTTTAAACGACGCTGCAGAAGATA
GAGCAAACTATTTTGGGAACCTTCAGAAAGGCGTGCTTCCTGAAATTCTGGGCCGGCTTCCCACTGGGCAGCGAGCTACTACTTTGCTCGAGCTAATGACCATAAGGGCA
TTCCATAGCAAGATCTTGCGTCGTTTTAGCCTTGGAACTGCAATAGGATTCCGAATCCAGAAGGGTATGTTGACAGATATCCCTGCTATTATTGTCTTTGTTGCACGCAA
AGTTCACATGCAGTGGCTCAGTGATGTTCAATGTCTACCTGCTGCACTTGAGGGACCTGGAGGTATATGGTGCGATGTTGACGTTGTGGAGTTCTCCTATTATGGGGCAC
CGGCAGCTACACCTAAAGAAGAAGTATACACAGAGCTTGTTGATGGCCTGAGGGGAAGCGATCCAACAATTGGTTCCGGTTCCCAGGTTGCTAGCCAAGAAACTTATGGG
ACTTTGGGTGCAATTGTAAAAAGCCGGACAGGAACCCGGCAAGTTGGTTTCCTTACAAACCGGCATGTTGCAGTTGATTTAGACTACCCTAGTCAGAAAATGTTTCATCC
TTTGCCTCCCAGCCTTGGACCTGGTGTTTATCTGGGTGCTGTGGAGAGAGCAACGTCGTTTATCACTGATGATGTCTGGTATGGCATCTTTGCTGGAACAAATCCAGAAA
CATTTGTGCGAGCTGATGGAGCGTTCATTCCATTTGCTGAAGATTTTAACATGAACAACGTCATTACATTTGTAAAAGGCGTCGGTGAGATTGGTGATGTCAACAAAATA
GACCTGCAATCCCCAATCAACAGTCTTATTGGACGAAAGGTGATCAAGGTTGGAAGAAGTTCAGGCTTGACCAGAGGGACTATAATGGCATATGCCCTCGAGTATAACGA
TGTAAAAGGGATTTGTTTCTTCACTGACTTTCTTGTTGTTGGAGATGACCAACAGACGTTTGACCTTGAAGGTGACAGTGGAAGCCTTATTCTTTTAACTGGTCAGGATG
AGGAAAAGCCACGTCCAGTTGGGATTATCTGGGGAGGAACAGCTAATCGGGGTCGACTGAAATTAAAAGTTGGTCAGCCACCAGAGAATTGGACCAGTGGAGTTGATCTT
GGACGCCTTCTTGATCTCCTTGAGCTCGATCTTATTACTACAAATGATGGTTTACAAGTGCATGAACAAAGGAATAATTCAGTTGGAGGGATTGAGTCTACTGTTGCGGA
GTCCTGTCTTGATCGGATGCCATTAAAATATAGACTTAAAGAGAACTCTGAGCCGCTCGGTTTAAGTGTCCAGCAAATTTCCCCTGAAGGTGAATCCTCTCAGGGGATGA
TCTCACCTTTTAAGCATGCTGCATTTCACATAGAAAATGGGTTTGAGGCGACTCCAAGCGTCGAACTCCAATTTATACCAAGATTAACCAGCAGCTATCCTCTACATCAG
AAGAACGAAGAAAGCCAAAAGTTGAAAAATCTGTCTGCCCTGAGGAATGGCTATGATAGTGAAGTTTCAGTTTCGCTGCAGTTGGGTGAGCCAGAAGCAAAGAGAAGGAA
GCACTTGGATTCTCTTTCAAGTATCAAAGAGTCATCTTCATGA
Protein sequenceShow/hide protein sequence
MDRTRLDLSFHHAVSTQSEESALDLERNYCSHLNMPSSSPSPSQCFAPGSQLSETNAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRATTLLELMTIRA
FHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHMQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEVYTELVDGLRGSDPTIGSGSQVASQETYG
TLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVITFVKGVGEIGDVNKI
DLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDL
GRLLDLLELDLITTNDGLQVHEQRNNSVGGIESTVAESCLDRMPLKYRLKENSEPLGLSVQQISPEGESSQGMISPFKHAAFHIENGFEATPSVELQFIPRLTSSYPLHQ
KNEESQKLKNLSALRNGYDSEVSVSLQLGEPEAKRRKHLDSLSSIKESSS