CuGenDBv2

Sgr019081 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr019081
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Trypsin family protein
Genome location: tig00153285:258301..267163
RNA-Seq Expression: Sgr019081
Synteny: Sgr019081
Gene Ontology terms: GO:0043231 - intracellular membrane-bounded organelle (cellular component)
InterPro domains: IPR009003 - Peptidase S1, PA clan


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_008439446.1 PREDICTED: uncharacterized protein LOC103484249 isoform X1 [Cucumis melo] (e value: 1.2e-309; % identity: 90.02)
Query:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSESNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
        MDRTRLDL+FHHSVSTQSEESALDLERNYCSHL+LPSSSPSPSQCFAPGSQLSE+NAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
Subjt:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSESNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA

Query:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEELYTELVDG
        TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEE+YTELVDG
Subjt:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEELYTELVDG

Query:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP-------
        LRGSDPT+GSGSQVASQETYGTLGAIVKSRTG RQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP       
Subjt:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP-------

Query:  --------------------GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
                            GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
Subjt:  --------------------GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL

Query:  TGQDGDKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQAAVHEQRNTSVGGIDSIVAESSLDRMPLKYRLKENSEPLG
        TGQD +KPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITT+DGLQAAVHEQRN SVGGIDS VAES LDRMPLKYRLKENSEPLG
Subjt:  TGQDGDKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQAAVHEQRNTSVGGIDSIVAESSLDRMPLKYRLKENSEPLG

Query:  LSVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPRLTSSSPLHQKSEESRELKNLSALRNGYDSEISVSLQLG--EPEAKRRKHLGSISTV
         SVQQISPEGESSQG+ISPFKHA FHIENG+E+ PSVELQFIPRLTS+SPLHQK++E++ELKNLSALR GYDSE+SVSLQLG  EPEAKRRK L  +S++
Subjt:  LSVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPRLTSSSPLHQKSEESRELKNLSALRNGYDSEISVSLQLG--EPEAKRRKHLGSISTV

Query:  K
        K
Subjt:  K

XP_022146579.1 uncharacterized protein LOC111015756 isoform X1 [Momordica charantia] (e value: 0.0e+00; % identity: 90.83)
Query:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSESNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
        M+RTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSE+NAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLP+GQRA
Subjt:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSESNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA

Query:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEELYTELVDG
        TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWL+DVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEE+YTELVDG
Subjt:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEELYTELVDG

Query:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP-------
        LRGSDPTIGSGSQVASQETYGTLGAIVKSRTG RQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP       
Subjt:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP-------

Query:  --------------------GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
                            GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
Subjt:  --------------------GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL

Query:  TGQDGDKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQAAVHEQRNTSVGGIDSIVAESSLDRMPLKYRLKENSEPLG
        TGQDG+K RPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQAAVHEQRNTSVGGIDS VAESSL+R+PLKYRLKENSEPLG
Subjt:  TGQDGDKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQAAVHEQRNTSVGGIDSIVAESSLDRMPLKYRLKENSEPLG

Query:  LSVQQISPEGESSQGLISPFKHAAFHIE-NGFEMAPSVELQFIPRLTSSSPLHQKSEESRELKNLSALRNGYDSEISVSLQLGEPEAKRRKHLGSISTVK
        LSVQQISPEGESSQGLISPFKHAAFHIE N FEMAPSVELQF+PRLTSSSP+HQK++ES ELK+LSALRNG+DSE+SVSLQLGEPE KRR+H  S+S++K
Subjt:  LSVQQISPEGESSQGLISPFKHAAFHIE-NGFEMAPSVELQFIPRLTSSSPLHQKSEESRELKNLSALRNGYDSEISVSLQLGEPEAKRRKHLGSISTVK

XP_022146590.1 uncharacterized protein LOC111015756 isoform X2 [Momordica charantia] (e value: 5.2e-309; % identity: 90.5)
Query:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSESNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
        M+RTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSE+NAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLP+GQRA
Subjt:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSESNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA

Query:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEELYTELVDG
        TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWL+DVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEE+YTELVDG
Subjt:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEELYTELVDG

Query:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP-------
        LRGSDPTIGSGSQVASQETYGTLGAIVKSRTG RQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP       
Subjt:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP-------

Query:  --------------------GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
                            GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
Subjt:  --------------------GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL

Query:  TGQDGDKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQAAVHEQRNTSVGGIDSIVAESSLDRMPLKYRLKENSEPLG
        TGQDG+K RPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ  VHEQRNTSVGGIDS VAESSL+R+PLKYRLKENSEPLG
Subjt:  TGQDGDKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQAAVHEQRNTSVGGIDSIVAESSLDRMPLKYRLKENSEPLG

Query:  LSVQQISPEGESSQGLISPFKHAAFHIE-NGFEMAPSVELQFIPRLTSSSPLHQKSEESRELKNLSALRNGYDSEISVSLQLGEPEAKRRKHLGSISTVK
        LSVQQISPEGESSQGLISPFKHAAFHIE N FEMAPSVELQF+PRLTSSSP+HQK++ES ELK+LSALRNG+DSE+SVSLQLGEPE KRR+H  S+S++K
Subjt:  LSVQQISPEGESSQGLISPFKHAAFHIE-NGFEMAPSVELQFIPRLTSSSPLHQKSEESRELKNLSALRNGYDSEISVSLQLGEPEAKRRKHLGSISTVK

XP_023543756.1 uncharacterized protein LOC111803536 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 2.0e-308; % identity: 89.98)
Query:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSESNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
        MDRTRLDLS HHSVSTQSEESALDLERNYCSHLN+PSSSPSPSQCFAPGSQLSE+NAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
Subjt:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSESNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA

Query:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEELYTELVDG
        TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEE+YTELVDG
Subjt:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEELYTELVDG

Query:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP-------
        LRGSDPTIGSGSQVASQETYGTLGAIVKSRTG +QVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP       
Subjt:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP-------

Query:  --------------------GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
                            GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
Subjt:  --------------------GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL

Query:  TGQDGDKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQAAVHEQRNTSVGGIDSIVAESSLDRMPLKYRLKENSEPLG
        TG D +KPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITT+DGLQAAVHEQRN SVGGIDS VAES  DRMPL YRL+ENSEPLG
Subjt:  TGQDGDKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQAAVHEQRNTSVGGIDSIVAESSLDRMPLKYRLKENSEPLG

Query:  LSVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPRLTSSSPLHQKSEESRELKNLSALRNGYDSEISVSLQLGEPEAKRRKHLGSISTVK
        LSVQ+ISPEGESSQGLISPFKHAA  IENGFE+ PSVELQFIPRL SSSPLHQK+EE +ELKNLS LRNGYD E+SVSL+LGEPEAKRRKHL S+S++K
Subjt:  LSVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPRLTSSSPLHQKSEESRELKNLSALRNGYDSEISVSLQLGEPEAKRRKHLGSISTVK

XP_038877731.1 LOW QUALITY PROTEIN: protein NARROW LEAF 1-like [Benincasa hispida] (e value: 2.0e-308; % identity: 90.65)
Query:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSESNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
        MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLN+PSSSPSPSQCFAPGSQLSE+N AYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
Subjt:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSESNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA

Query:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEELYTELVDG
        TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEE+YTELVDG
Subjt:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEELYTELVDG

Query:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP-------
        LRGSDPTIGSGSQVASQETYGTLGAIVKSRTG RQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP       
Subjt:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP-------

Query:  --------------------GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
                            GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIM YALEYNDVKGIC FTDFLVVGDDQQTFDLEGDSGSLILL
Subjt:  --------------------GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL

Query:  TGQDGDKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQAAVHEQRNTSVGGIDSIVAESSLDRMPLKYRLKENSEPLG
        TGQD +KPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITT+DG QAAVHEQRN SVGGIDS VAES LDRMPLKYRLKENSE LG
Subjt:  TGQDGDKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQAAVHEQRNTSVGGIDSIVAESSLDRMPLKYRLKENSEPLG

Query:  LSVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPRLTSSSPLHQKSEESRELKNLSALRNGYDSEISVSLQLGEPEAKRRKHLGSISTVK
        LSVQQI PEGESSQG+ISPFKHAAFHIENGFE+ PSVELQFIPRLTSSS L QK+EES++LKNLSALRNGYDSE+SVSLQLGEPEAKRRKHL S+S++K
Subjt:  LSVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPRLTSSSPLHQKSEESRELKNLSALRNGYDSEISVSLQLGEPEAKRRKHLGSISTVK

TrEMBL top hits (columns: e value, % identity, alignment)
A0A1S3AYD6 uncharacterized protein LOC103484249 isoform X1 (e value: 5.6e-310; % identity: 90.02)
Query:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSESNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
        MDRTRLDL+FHHSVSTQSEESALDLERNYCSHL+LPSSSPSPSQCFAPGSQLSE+NAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
Subjt:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSESNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA

Query:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEELYTELVDG
        TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEE+YTELVDG
Subjt:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEELYTELVDG

Query:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP-------
        LRGSDPT+GSGSQVASQETYGTLGAIVKSRTG RQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP       
Subjt:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP-------

Query:  --------------------GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
                            GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
Subjt:  --------------------GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL

Query:  TGQDGDKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQAAVHEQRNTSVGGIDSIVAESSLDRMPLKYRLKENSEPLG
        TGQD +KPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITT+DGLQAAVHEQRN SVGGIDS VAES LDRMPLKYRLKENSEPLG
Subjt:  TGQDGDKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQAAVHEQRNTSVGGIDSIVAESSLDRMPLKYRLKENSEPLG

Query:  LSVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPRLTSSSPLHQKSEESRELKNLSALRNGYDSEISVSLQLG--EPEAKRRKHLGSISTV
         SVQQISPEGESSQG+ISPFKHA FHIENG+E+ PSVELQFIPRLTS+SPLHQK++E++ELKNLSALR GYDSE+SVSLQLG  EPEAKRRK L  +S++
Subjt:  LSVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPRLTSSSPLHQKSEESRELKNLSALRNGYDSEISVSLQLG--EPEAKRRKHLGSISTV

Query:  K
        K
Subjt:  K

A0A1S3AYT3 uncharacterized protein LOC103484249 isoform X2 (e value: 1.8e-307; % identity: 89.68)
Query:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSESNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
        MDRTRLDL+FHHSVSTQSEESALDLERNYCSHL+LPSSSPSPSQCFAPGSQLSE+NAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
Subjt:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSESNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA

Query:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEELYTELVDG
        TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEE+YTELVDG
Subjt:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEELYTELVDG

Query:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP-------
        LRGSDPT+GSGSQVASQETYGTLGAIVKSRTG RQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP       
Subjt:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP-------

Query:  --------------------GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
                            GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
Subjt:  --------------------GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL

Query:  TGQDGDKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQAAVHEQRNTSVGGIDSIVAESSLDRMPLKYRLKENSEPLG
        TGQD +KPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITT+DGLQ  VHEQRN SVGGIDS VAES LDRMPLKYRLKENSEPLG
Subjt:  TGQDGDKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQAAVHEQRNTSVGGIDSIVAESSLDRMPLKYRLKENSEPLG

Query:  LSVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPRLTSSSPLHQKSEESRELKNLSALRNGYDSEISVSLQLG--EPEAKRRKHLGSISTV
         SVQQISPEGESSQG+ISPFKHA FHIENG+E+ PSVELQFIPRLTS+SPLHQK++E++ELKNLSALR GYDSE+SVSLQLG  EPEAKRRK L  +S++
Subjt:  LSVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPRLTSSSPLHQKSEESRELKNLSALRNGYDSEISVSLQLG--EPEAKRRKHLGSISTV

Query:  K
        K
Subjt:  K

A0A6J1CXN4 uncharacterized protein LOC111015756 isoform X2 (e value: 2.5e-309; % identity: 90.5)
Query:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSESNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
        M+RTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSE+NAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLP+GQRA
Subjt:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSESNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA

Query:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEELYTELVDG
        TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWL+DVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEE+YTELVDG
Subjt:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEELYTELVDG

Query:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP-------
        LRGSDPTIGSGSQVASQETYGTLGAIVKSRTG RQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP       
Subjt:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP-------

Query:  --------------------GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
                            GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
Subjt:  --------------------GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL

Query:  TGQDGDKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQAAVHEQRNTSVGGIDSIVAESSLDRMPLKYRLKENSEPLG
        TGQDG+K RPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ  VHEQRNTSVGGIDS VAESSL+R+PLKYRLKENSEPLG
Subjt:  TGQDGDKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQAAVHEQRNTSVGGIDSIVAESSLDRMPLKYRLKENSEPLG

Query:  LSVQQISPEGESSQGLISPFKHAAFHIE-NGFEMAPSVELQFIPRLTSSSPLHQKSEESRELKNLSALRNGYDSEISVSLQLGEPEAKRRKHLGSISTVK
        LSVQQISPEGESSQGLISPFKHAAFHIE N FEMAPSVELQF+PRLTSSSP+HQK++ES ELK+LSALRNG+DSE+SVSLQLGEPE KRR+H  S+S++K
Subjt:  LSVQQISPEGESSQGLISPFKHAAFHIE-NGFEMAPSVELQFIPRLTSSSPLHQKSEESRELKNLSALRNGYDSEISVSLQLGEPEAKRRKHLGSISTVK

A0A6J1CZZ3 uncharacterized protein LOC111015756 isoform X1 (e value: 0.0e+00; % identity: 90.83)
Query:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSESNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
        M+RTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSE+NAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLP+GQRA
Subjt:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSESNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA

Query:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEELYTELVDG
        TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWL+DVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEE+YTELVDG
Subjt:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEELYTELVDG

Query:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP-------
        LRGSDPTIGSGSQVASQETYGTLGAIVKSRTG RQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP       
Subjt:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP-------

Query:  --------------------GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
                            GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
Subjt:  --------------------GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL

Query:  TGQDGDKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQAAVHEQRNTSVGGIDSIVAESSLDRMPLKYRLKENSEPLG
        TGQDG+K RPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQAAVHEQRNTSVGGIDS VAESSL+R+PLKYRLKENSEPLG
Subjt:  TGQDGDKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQAAVHEQRNTSVGGIDSIVAESSLDRMPLKYRLKENSEPLG

Query:  LSVQQISPEGESSQGLISPFKHAAFHIE-NGFEMAPSVELQFIPRLTSSSPLHQKSEESRELKNLSALRNGYDSEISVSLQLGEPEAKRRKHLGSISTVK
        LSVQQISPEGESSQGLISPFKHAAFHIE N FEMAPSVELQF+PRLTSSSP+HQK++ES ELK+LSALRNG+DSE+SVSLQLGEPE KRR+H  S+S++K
Subjt:  LSVQQISPEGESSQGLISPFKHAAFHIE-NGFEMAPSVELQFIPRLTSSSPLHQKSEESRELKNLSALRNGYDSEISVSLQLGEPEAKRRKHLGSISTVK

A0A6J1IRS8 uncharacterized protein LOC111478754 isoform X1 (e value: 1.4e-307; % identity: 89.98)
Query:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSESNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
        MDRTRLDLS HHSVSTQSEESALDLERNYCSHLN+PSSSPSPSQCFAPGSQLSE+NAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
Subjt:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSESNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA

Query:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEELYTELVDG
        TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEE+YTELVDG
Subjt:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEELYTELVDG

Query:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP-------
        LRGSDPTIGSGSQVASQETYGTLGAIVKSRTG +QVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP       
Subjt:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP-------

Query:  --------------------GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
                            GVGEI DVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
Subjt:  --------------------GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL

Query:  TGQDGDKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQAAVHEQRNTSVGGIDSIVAESSLDRMPLKYRLKENSEPLG
        TG D +KPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITT+DGLQAAVHEQRN SVGGIDS VAES LDRMPL YRLKENSEPLG
Subjt:  TGQDGDKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQAAVHEQRNTSVGGIDSIVAESSLDRMPLKYRLKENSEPLG

Query:  LSVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPRLTSSSPLHQKSEESRELKNLSALRNGYDSEISVSLQLGEPEAKRRKHLGSISTVK
        LSVQ+ISPEGESSQGLISPFKHAA  IENGFE+ PSVELQFIPRL SSSPLHQK+EE +ELK LS LRNGYD E+SVSL+LGEPEAKRRKHL S+S++K
Subjt:  LSVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPRLTSSSPLHQKSEESRELKNLSALRNGYDSEISVSLQLGEPEAKRRKHLGSISTVK

SwissProt top hits (columns: e value, % identity, alignment)
B4XT64 Protein NARROW LEAF 1 (e value: 1.0e-190; % identity: 62.05)
Query:  QSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSESNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRATTLLELMTIRAFHSKI
        QSEES+LD++     H + P  SPS  Q  A G   +E++AAYF WPTS+  + AAE RANYFGNLQKG+LP   GRLP GQ+A +LL+LMTIRAFHSKI
Subjt:  QSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSESNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRATTLLELMTIRAFHSKI

Query:  LRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEELYTELVDGLRGSDPTIGSGSQVAS
        LRRFSLGTA+GFRI+KG LTDIPAI+VFVARKVH++WL+  QCLPA LEGPGG+WCDVDVVEFSYYGAPA TPKE++++ELVD L GSD  IGSGSQVAS
Subjt:  LRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEELYTELVDGLRGSDPTIGSGSQVAS

Query:  QETYGTLGAIVKSRTGNRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP-----------------------
         ET+GTLGAIVK RTGN+QVGFLTN HVAVDLDYP+QKMFHPLPP+LGPGVYLGAVERATSFITDDVWYGI+AGTNP                       
Subjt:  QETYGTLGAIVKSRTGNRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP-----------------------

Query:  ----GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILLTGQDGDKPRPVGIIWG
            GVG+IGDV  IDLQ P+NSLIGR+V KVGRSSG T GT+MAYALEYND KGICFFTD LVVG+++QTFDLEGDSGSLI+LT QDG+KPRP+GIIWG
Subjt:  ----GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILLTGQDGDKPRPVGIIWG

Query:  GTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQAAVHEQRNTSVGGIDSIVAESSLDRMPL-KYRLKENSEPLGLSVQQISPEGESSQG
        GTANRGRLKL     PENWTSGVDLGRLLD LELD+I T++ LQ AV +QR   V  + S V ESS   + + + +++E  EPLG+ +QQ+     ++ G
Subjt:  GTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQAAVHEQRNTSVGGIDSIVAESSLDRMPL-KYRLKENSEPLGLSVQQISPEGESSQG

Query:  LISPFKHAAFHIENGFEMAPSV----ELQFIPRLTSSSPLHQKSEESRELKNLSALRNGYDSEISVSLQLGEPEAKR
                      G E + +V    E QFI      SP+    +  R + NL+   N  + E+++SL LG+ E KR
Subjt:  LISPFKHAAFHIENGFEMAPSV----ELQFIPRLTSSSPLHQKSEESRELKNLSALRNGYDSEISVSLQLGEPEAKR

Arabidopsis top hits (columns: e value, % identity, alignment)
AT2G35155.1 Trypsin family protein (e value: 2.6e-218; % identity: 67.81)
Query:  SVSTQSEESALDLERN-YCSHLNLP-SSSPSPSQCFAPGSQLSESNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRATTLLELMTIR
        + S++SE+SALDLERN +C+HL+LP SSSPSP Q F    Q +ESNA YFSWPT SRLND  EDRANYFGNLQKGVLPE +GRLP+GQ+ATTLLELMTIR
Subjt:  SVSTQSEESALDLERN-YCSHLNLP-SSSPSPSQCFAPGSQLSESNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRATTLLELMTIR

Query:  AFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEELYTELVDGLRGSDPTIGS
        AFHSKILRRFSLGTA+GFRI +G+LT++PAI+VFVARKVHRQWL+ +QCLP+ALEGPGG+WCDVDVVEF YYGAPAATPKE++Y ELVDGLRGSDP IGS
Subjt:  AFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEELYTELVDGLRGSDPTIGS

Query:  GSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP-----------------
        GSQVASQETYGTLGAIVKSRTGN QVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDD WYGIFAGTNP                 
Subjt:  GSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP-----------------

Query:  ----------GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILLTGQDGDKPRP
                  G+GEIGDV+ IDLQSPI+SLIG++V+KVGRSSG T GTIMAYALEYND KGICF TDFLV+G++QQTFDLEGDSGSLILLTG +G KPRP
Subjt:  ----------GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILLTGQDGDKPRP

Query:  VGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ--AAVHEQRNTSVGGIDSIVAESSLDRMPLKYRLKENSEPLGLSVQQISP
        VGIIWGGTANRGRLKL  GQ PENWTSGVDLGRLLDLLELDLIT++  L+  AA  E+RNTSV  +DS V++SS              +P+        P
Subjt:  VGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ--AAVHEQRNTSVGGIDSIVAESSLDRMPLKYRLKENSEPLGLSVQQISP

Query:  EGESSQGLISPFKHAAFHIENGFEMAPSVE--LQFIPRLTSSSPLHQKSEESRELKNLSALRNGYDSEISVSLQLGEPEAKRRK
         G+       PF    FHIE   +    VE  +   P   + S    K +E  +L NL AL+N  + E+++SL LGEP+ K+ K
Subjt:  EGESSQGLISPFKHAAFHIENGFEMAPSVE--LQFIPRLTSSSPLHQKSEESRELKNLSALRNGYDSEISVSLQLGEPEAKRRK

AT3G12950.1 Trypsin family protein (e value: 9.5e-176; % identity: 59.46)
Query:  GSQLSESNAAYFSWPTSSRLNDAAEDRANYFGNLQK------GVLPEILGRLPTGQRATTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAII
        G     + A+YFSWPTSSRL++AAE+RANYF NLQK       V PE +   P GQRATTLLELMTIRAFHSK+LR +SLGTAIGFRI++G+LTDIPAII
Subjt:  GSQLSESNAAYFSWPTSSRLNDAAEDRANYFGNLQK------GVLPEILGRLPTGQRATTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAII

Query:  VFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAP--AATPKEELYTELVDGLRGSDPTIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLT
        VFV+RKVH+QWLS +QCLP ALEG GGIWCDVDVVEFSY+G P    TPK+   T++VD L+GSDP IGSGSQVASQET GTLGAIV+S+TG RQVGF+T
Subjt:  VFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAP--AATPKEELYTELVDGLRGSDPTIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLT

Query:  NRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP----------------------------GVGEIGDVNKIDLQSPINS
        NRHVAV+LDYPSQKMFHPLPP+LGPGVYLGAVERATSFITDD+W+GIFAGTNP                            GVGEIG+V  I+LQSP+ S
Subjt:  NRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP----------------------------GVGEIGDVNKIDLQSPINS

Query:  LIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQT-FDLEGDSGSLILLTGQDGDKPRPVGIIWGGTANRGRLKLKVGQPPENWTSG
        L+G++V+KVGRSSGLT GT++AYALEYND +G+CF TDFLVVG++ ++ FDLEGDSGSLI++ G+  +K RP+GIIWGGT +RGRLKLKVG+ PE+WT+G
Subjt:  LIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQT-FDLEGDSGSLILLTGQDGDKPRPVGIIWGGTANRGRLKLKVGQPPENWTSG

Query:  VDLGRLLDLLELDLITTSDGLQAAVHEQRNTSVGGIDSIVAESSLDRMPLKYRLKENSEPLGLSVQQISPEGESSQGLISPFKHAAFHIENGFEM---AP
        VDLGRLL  L+LDLITT +GL+AAV EQR  S  G+ S+VA+SS   + LK   KE   P            E  +  + P +     +E   E    AP
Subjt:  VDLGRLLDLLELDLITTSDGLQAAVHEQRNTSVGGIDSIVAESSLDRMPLKYRLKENSEPLGLSVQQISPEGESSQGLISPFKHAAFHIENGFEM---AP

Query:  SVELQFIPRLTSSSPLHQKSEESRELKNLSALRNGYDSEISVSLQLGEPEAKRRK
        SVE QF+P  +         E +RE           D ++ V L+LG+  AKRR+
Subjt:  SVELQFIPRLTSSSPLHQKSEESRELKNLSALRNGYDSEISVSLQLGEPEAKRRK

AT5G45030.1 Trypsin family protein (e value: 4.6e-215; % identity: 67.33)
Query:  MDRTRLDLSFHHSVSTQSEES-ALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSESN--AAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTG
        M+  RLDL FHHS S+QS ES ALDL++N  +H+ L SS  SP Q F  G+Q  E++  AAYFSWPTSSRLND+AEDRANYF NLQKGVLPE    LPTG
Subjt:  MDRTRLDLSFHHSVSTQSEES-ALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSESN--AAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTG

Query:  QRATTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEELYTEL
        ++ATTLLELM IRAFHSK LRRFSLGTAIGFRI++G+LT+I AI+VFVARKVH+QWL+ +QCLP ALEGPGG+WCDVDVVEF YYGAPA TPKE++YTEL
Subjt:  QRATTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEELYTEL

Query:  VDGLRGSDPTIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP----
        VD LRGS  +IGSGSQVASQETYGTLGAIVKS+TG RQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDD+WYGIFAGTNP    
Subjt:  VDGLRGSDPTIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP----

Query:  -----------------------GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSL
                               G+GEIGD++  DLQSP+NSLIGRKV+KVGRSSGLT GTIMAYALEYND KGICF TDFLVVG++QQTFDLEGDSGSL
Subjt:  -----------------------GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSL

Query:  ILLTGQD--GDKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQAAVHEQRN-TSVGGIDSIVAESSLDRMPL-KYRLK
        ILL   D   +KPRPVGIIWGGTANRGRLKLKVG+ PENWTSGVDLGR+L+LLELDLIT+++GLQAAV EQRN      +DS V ESS     + + +  
Subjt:  ILLTGQD--GDKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQAAVHEQRN-TSVGGIDSIVAESSLDRMPL-KYRLK

Query:  ENSEPLGLSVQQISPEGESSQGLISPFKHAAFHIENGFE-MAPSVELQFIPRLTSS-SPLHQK--SEESRELKNLSALR-NGYDSEISVSLQLGEPEAKR
        EN EP+ L+VQQ+  E ++S        H  F IE+  E +A   E QFIP  +++ S LHQK    E+ E KNLS+L+ +    EI  SLQLGE + K+
Subjt:  ENSEPLGLSVQQISPEGESSQGLISPFKHAAFHIENGFE-MAPSVELQFIPRLTSS-SPLHQK--SEESRELKNLSALR-NGYDSEISVSLQLGEPEAKR

Query:  RKHLGS
        RK   S
Subjt:  RKHLGS

AT5G45030.2 Trypsin family protein (e value: 4.6e-215; % identity: 67.33)
Query:  MDRTRLDLSFHHSVSTQSEES-ALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSESN--AAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTG
        M+  RLDL FHHS S+QS ES ALDL++N  +H+ L SS  SP Q F  G+Q  E++  AAYFSWPTSSRLND+AEDRANYF NLQKGVLPE    LPTG
Subjt:  MDRTRLDLSFHHSVSTQSEES-ALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSESN--AAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTG

Query:  QRATTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEELYTEL
        ++ATTLLELM IRAFHSK LRRFSLGTAIGFRI++G+LT+I AI+VFVARKVH+QWL+ +QCLP ALEGPGG+WCDVDVVEF YYGAPA TPKE++YTEL
Subjt:  QRATTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEELYTEL

Query:  VDGLRGSDPTIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP----
        VD LRGS  +IGSGSQVASQETYGTLGAIVKS+TG RQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDD+WYGIFAGTNP    
Subjt:  VDGLRGSDPTIGSGSQVASQETYGTLGAIVKSRTGNRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNP----

Query:  -----------------------GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSL
                               G+GEIGD++  DLQSP+NSLIGRKV+KVGRSSGLT GTIMAYALEYND KGICF TDFLVVG++QQTFDLEGDSGSL
Subjt:  -----------------------GVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSL

Query:  ILLTGQD--GDKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQAAVHEQRN-TSVGGIDSIVAESSLDRMPL-KYRLK
        ILL   D   +KPRPVGIIWGGTANRGRLKLKVG+ PENWTSGVDLGR+L+LLELDLIT+++GLQAAV EQRN      +DS V ESS     + + +  
Subjt:  ILLTGQD--GDKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQAAVHEQRN-TSVGGIDSIVAESSLDRMPL-KYRLK

Query:  ENSEPLGLSVQQISPEGESSQGLISPFKHAAFHIENGFE-MAPSVELQFIPRLTSS-SPLHQK--SEESRELKNLSALR-NGYDSEISVSLQLGEPEAKR
        EN EP+ L+VQQ+  E ++S        H  F IE+  E +A   E QFIP  +++ S LHQK    E+ E KNLS+L+ +    EI  SLQLGE + K+
Subjt:  ENSEPLGLSVQQISPEGESSQGLISPFKHAAFHIENGFE-MAPSVELQFIPRLTSS-SPLHQK--SEESRELKNLSALR-NGYDSEISVSLQLGEPEAKR

Query:  RKHLGS
        RK   S
Subjt:  RKHLGS


Sequences
CDS sequence
ATGGACAGGACACGACTGGATTTAAGTTTTCATCACTCAGTATCAACACAGTCAGAGGAATCTGCCTTGGACTTGGAAAGGAACTATTGCAGTCATCTTAATCTGCCTTC
ATCAAGTCCATCACCTAGTCAATGCTTTGCTCCAGGTAGTCAGCTGTCTGAGAGCAATGCTGCTTACTTTTCATGGCCCACTTCCAGCCGTTTAAACGATGCTGCAGAAG
ATAGAGCAAACTATTTTGGGAACCTTCAAAAGGGCGTGCTTCCTGAAATTCTGGGTCGGCTGCCCACTGGGCAGCGAGCTACCACTTTGCTTGAGCTTATGACCATAAGG
GCATTTCATAGCAAGATATTGCGTCGTTTTAGCCTCGGAACTGCAATAGGATTTCGAATTCAGAAGGGTATGTTGACAGATATCCCTGCTATTATTGTCTTTGTTGCACG
AAAAGTTCACAGGCAGTGGCTCAGTGATGTTCAATGTCTACCTGCTGCACTTGAGGGCCCTGGAGGTATATGGTGTGATGTTGATGTTGTGGAGTTCTCCTACTATGGTG
CACCGGCAGCTACACCTAAAGAAGAATTATATACAGAGCTTGTTGATGGCCTGAGGGGAAGTGATCCAACAATTGGTTCTGGTTCCCAGGTTGCTAGCCAAGAAACTTAT
GGCACTTTGGGTGCCATTGTCAAAAGCCGAACAGGAAACCGGCAAGTTGGTTTTCTTACGAACCGACACGTTGCAGTTGATTTAGACTACCCTAGTCAGAAAATGTTTCA
TCCTTTGCCTCCCAGCCTTGGGCCTGGTGTATATCTGGGTGCTGTGGAGAGAGCAACATCGTTTATCACTGATGACGTCTGGTATGGGATCTTTGCTGGAACAAATCCAG
GTGTCGGTGAGATTGGCGATGTCAACAAAATAGACCTGCAATCCCCAATCAACAGTCTCATTGGACGGAAAGTGATCAAGGTTGGAAGAAGTTCTGGCTTGACTAGAGGG
ACTATAATGGCATATGCCCTGGAGTATAATGATGTAAAAGGGATTTGTTTCTTCACCGACTTTCTTGTTGTTGGAGATGACCAGCAGACGTTTGACCTTGAAGGTGATAG
TGGAAGCCTTATTCTTTTAACTGGTCAGGATGGGGACAAGCCACGTCCAGTTGGGATTATTTGGGGAGGAACAGCTAATCGAGGTCGACTGAAATTAAAAGTTGGTCAAC
CCCCAGAGAATTGGACCAGTGGAGTTGATCTTGGACGCCTTCTTGATCTCCTTGAGCTCGATCTTATTACAACAAGTGATGGTTTGCAAGCTGCAGTGCATGAACAAAGG
AATACTTCAGTCGGAGGGATTGATTCTATTGTTGCAGAGTCGTCTCTCGACCGGATGCCATTAAAATATAGACTCAAAGAGAACTCGGAGCCACTTGGTTTGAGTGTCCA
GCAAATTTCTCCTGAAGGTGAGTCCTCTCAGGGGCTGATCTCACCTTTTAAGCATGCTGCGTTTCACATAGAAAATGGGTTTGAGATGGCTCCAAGTGTCGAACTCCAAT
TTATACCAAGATTAACCAGCAGCTCTCCACTGCATCAGAAGAGCGAAGAAAGTCGCGAGTTGAAAAATCTGTCCGCCCTGAGGAATGGCTATGATAGCGAGATATCAGTT
TCGCTGCAGTTGGGTGAGCCAGAAGCAAAGAGAAGGAAGCACTTGGGTTCTATTTCAACTGTAAAAACTGGGACTGGCATATTACAAAGGTTGAAGCCTCAAGGTGGGAG
AGTTGCCATTCTTTTAAACATGTCCTGTGAAGTGAAAAGCTTTGAGGTTAGTTTTCAGTTCATACAACATATTATGGAAGAAGGGTATCGAAGCCGGTCCTTAGCATCTA
AAATATCTATTTACGTGGACCAACTAGGCTGGCGAAAACTGTTGGTTTTTTCCATTCAAGGTGGTGGTGTTGATATTGATGTCAAATGCATGTTCGTATGTCTTGGAAAC
ATCATGTGGTTACAGCTAAAGCATGACAAGATTAGCAACCAGATAAAGAGAAACACCACCAGATTAGCCCAGGAAACCTGCCAGCATGATCTGGGGAAGGGAAAACATAT
TGCCAATGGTGACCCGGGGCCGGGCTTAGAGCCTAGAGGTGGGTTTCTTCCATCAGGTCATCTTGCAGCTTCAATCACTCGATCACCAACAACAACACTGTTGGCTATCG
CATGCAGCTTCAATGGGTATTGCTTTTGCTCCCTACCCAGACCAGATCACACGCATTTCAGATAA
mRNA sequence
ATGGACAGGACACGACTGGATTTAAGTTTTCATCACTCAGTATCAACACAGTCAGAGGAATCTGCCTTGGACTTGGAAAGGAACTATTGCAGTCATCTTAATCTGCCTTC
ATCAAGTCCATCACCTAGTCAATGCTTTGCTCCAGGTAGTCAGCTGTCTGAGAGCAATGCTGCTTACTTTTCATGGCCCACTTCCAGCCGTTTAAACGATGCTGCAGAAG
ATAGAGCAAACTATTTTGGGAACCTTCAAAAGGGCGTGCTTCCTGAAATTCTGGGTCGGCTGCCCACTGGGCAGCGAGCTACCACTTTGCTTGAGCTTATGACCATAAGG
GCATTTCATAGCAAGATATTGCGTCGTTTTAGCCTCGGAACTGCAATAGGATTTCGAATTCAGAAGGGTATGTTGACAGATATCCCTGCTATTATTGTCTTTGTTGCACG
AAAAGTTCACAGGCAGTGGCTCAGTGATGTTCAATGTCTACCTGCTGCACTTGAGGGCCCTGGAGGTATATGGTGTGATGTTGATGTTGTGGAGTTCTCCTACTATGGTG
CACCGGCAGCTACACCTAAAGAAGAATTATATACAGAGCTTGTTGATGGCCTGAGGGGAAGTGATCCAACAATTGGTTCTGGTTCCCAGGTTGCTAGCCAAGAAACTTAT
GGCACTTTGGGTGCCATTGTCAAAAGCCGAACAGGAAACCGGCAAGTTGGTTTTCTTACGAACCGACACGTTGCAGTTGATTTAGACTACCCTAGTCAGAAAATGTTTCA
TCCTTTGCCTCCCAGCCTTGGGCCTGGTGTATATCTGGGTGCTGTGGAGAGAGCAACATCGTTTATCACTGATGACGTCTGGTATGGGATCTTTGCTGGAACAAATCCAG
GTGTCGGTGAGATTGGCGATGTCAACAAAATAGACCTGCAATCCCCAATCAACAGTCTCATTGGACGGAAAGTGATCAAGGTTGGAAGAAGTTCTGGCTTGACTAGAGGG
ACTATAATGGCATATGCCCTGGAGTATAATGATGTAAAAGGGATTTGTTTCTTCACCGACTTTCTTGTTGTTGGAGATGACCAGCAGACGTTTGACCTTGAAGGTGATAG
TGGAAGCCTTATTCTTTTAACTGGTCAGGATGGGGACAAGCCACGTCCAGTTGGGATTATTTGGGGAGGAACAGCTAATCGAGGTCGACTGAAATTAAAAGTTGGTCAAC
CCCCAGAGAATTGGACCAGTGGAGTTGATCTTGGACGCCTTCTTGATCTCCTTGAGCTCGATCTTATTACAACAAGTGATGGTTTGCAAGCTGCAGTGCATGAACAAAGG
AATACTTCAGTCGGAGGGATTGATTCTATTGTTGCAGAGTCGTCTCTCGACCGGATGCCATTAAAATATAGACTCAAAGAGAACTCGGAGCCACTTGGTTTGAGTGTCCA
GCAAATTTCTCCTGAAGGTGAGTCCTCTCAGGGGCTGATCTCACCTTTTAAGCATGCTGCGTTTCACATAGAAAATGGGTTTGAGATGGCTCCAAGTGTCGAACTCCAAT
TTATACCAAGATTAACCAGCAGCTCTCCACTGCATCAGAAGAGCGAAGAAAGTCGCGAGTTGAAAAATCTGTCCGCCCTGAGGAATGGCTATGATAGCGAGATATCAGTT
TCGCTGCAGTTGGGTGAGCCAGAAGCAAAGAGAAGGAAGCACTTGGGTTCTATTTCAACTGTAAAAACTGGGACTGGCATATTACAAAGGTTGAAGCCTCAAGGTGGGAG
AGTTGCCATTCTTTTAAACATGTCCTGTGAAGTGAAAAGCTTTGAGGTTAGTTTTCAGTTCATACAACATATTATGGAAGAAGGGTATCGAAGCCGGTCCTTAGCATCTA
AAATATCTATTTACGTGGACCAACTAGGCTGGCGAAAACTGTTGGTTTTTTCCATTCAAGGTGGTGGTGTTGATATTGATGTCAAATGCATGTTCGTATGTCTTGGAAAC
ATCATGTGGTTACAGCTAAAGCATGACAAGATTAGCAACCAGATAAAGAGAAACACCACCAGATTAGCCCAGGAAACCTGCCAGCATGATCTGGGGAAGGGAAAACATAT
TGCCAATGGTGACCCGGGGCCGGGCTTAGAGCCTAGAGGTGGGTTTCTTCCATCAGGTCATCTTGCAGCTTCAATCACTCGATCACCAACAACAACACTGTTGGCTATCG
CATGCAGCTTCAATGGGTATTGCTTTTGCTCCCTACCCAGACCAGATCACACGCATTTCAGATAA
Protein sequence
MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSESNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRATTLLELMTIR
AFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLSDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEELYTELVDGLRGSDPTIGSGSQVASQETY
GTLGAIVKSRTGNRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRG
TIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILLTGQDGDKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQAAVHEQR
NTSVGGIDSIVAESSLDRMPLKYRLKENSEPLGLSVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPRLTSSSPLHQKSEESRELKNLSALRNGYDSEISV
SLQLGEPEAKRRKHLGSISTVKTGTGILQRLKPQGGRVAILLNMSCEVKSFEVSFQFIQHIMEEGYRSRSLASKISIYVDQLGWRKLLVFSIQGGGVDIDVKCMFVCLGN
IMWLQLKHDKISNQIKRNTTRLAQETCQHDLGKGKHIANGDPGPGLEPRGGFLPSGHLAASITRSPTTTLLAIACSFNGYCFCSLPRPDHTHFR