; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0014156 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0014156
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionProtein NARROW LEAF 1
Genome locationchr1:55747020..55752594
RNA-Seq ExpressionLag0014156
SyntenyLag0014156
Gene Ontology termsGO:0043231 - intracellular membrane-bounded organelle (cellular component)
InterPro domainsIPR009003 - Peptidase S1, PA clan


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052425.1 uncharacterized protein E6C27_scaffold120G00200 [Cucumis melo var. makuwa]0.0e+0095.19Show/hide
Query:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
        MDRTRLDL+FHHSVSTQSEESALDLERNYCSHL+LPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
Subjt:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA

Query:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEIYTELVDG
        TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWL+DVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEE+YTELVDG
Subjt:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEIYTELVDG

Query:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
        LRGSDPT+GSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
Subjt:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD

Query:  GAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
        GAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
Subjt:  GAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL

Query:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQVHEQRNNSVGGIDSTVAESCLDRLPLKYRLKENSESLGLC
        TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITT+DGLQVHEQRNNSVGGIDSTVAESCLDR+PLKYRLKENSE LG  
Subjt:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQVHEQRNNSVGGIDSTVAESCLDRLPLKYRLKENSESLGLC

Query:  VQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEECQEMKNLSSTRNGYDSEVSVSLQLG--EPEAKRRKHSDCLSSIKE
        VQQISPEGESSQG+ISPFKHA FHIENG+E+ PSVELQFIP LTS+SPLHQKNEE QE+KNL++ R G+DSEVSVSLQLG  EPEAKRRK  DCLSSIKE
Subjt:  VQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEECQEMKNLSSTRNGYDSEVSVSLQLG--EPEAKRRKHSDCLSSIKE

Query:  SST
        SS+
Subjt:  SST

XP_004134526.1 protein NARROW LEAF 1 isoform X1 [Cucumis sativus]0.0e+0095.21Show/hide
Query:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
        MDRTRLDL+FHHSVSTQSEESALDLERNYCSHL+LPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
Subjt:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA

Query:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEIYTELVDG
        TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWL+DVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEE+YTELVDG
Subjt:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEIYTELVDG

Query:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
        LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
Subjt:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD

Query:  GAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
        GAFIPFAEDFNMNNVVTFVKGVGE+GDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
Subjt:  GAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL

Query:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ--VHEQRNNSVGGIDSTVAESCLDRLPLKYRLKENSESLG
        TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITT+DGLQ  VHEQRNNSVGGIDSTVAESCLDR+PLKYRLKENSE LG
Subjt:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ--VHEQRNNSVGGIDSTVAESCLDRLPLKYRLKENSESLG

Query:  LCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEECQEMKNLSSTRNGYDSEVSVSLQLG--EPEAKRRKHSDCLSSI
        L VQQISPEGESSQG+ISPFKH AF IENGFE+ PS+ELQFIP LTS+SPL QKNE+ QE+KNLS+ RNGYDSEVSVSLQLG  EPEAKRRKH DCLSSI
Subjt:  LCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEECQEMKNLSSTRNGYDSEVSVSLQLG--EPEAKRRKHSDCLSSI

Query:  KESST
        KESS+
Subjt:  KESST

XP_008439446.1 PREDICTED: uncharacterized protein LOC103484249 isoform X1 [Cucumis melo]0.0e+0095.04Show/hide
Query:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
        MDRTRLDL+FHHSVSTQSEESALDLERNYCSHL+LPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
Subjt:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA

Query:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEIYTELVDG
        TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWL+DVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEE+YTELVDG
Subjt:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEIYTELVDG

Query:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
        LRGSDPT+GSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
Subjt:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD

Query:  GAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
        GAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
Subjt:  GAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL

Query:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ--VHEQRNNSVGGIDSTVAESCLDRLPLKYRLKENSESLG
        TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITT+DGLQ  VHEQRNNSVGGIDSTVAESCLDR+PLKYRLKENSE LG
Subjt:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ--VHEQRNNSVGGIDSTVAESCLDRLPLKYRLKENSESLG

Query:  LCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEECQEMKNLSSTRNGYDSEVSVSLQLG--EPEAKRRKHSDCLSSI
          VQQISPEGESSQG+ISPFKHA FHIENG+E+ PSVELQFIP LTS+SPLHQKN+E QE+KNLS+ R GYDSEVSVSLQLG  EPEAKRRK  DCLSSI
Subjt:  LCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEECQEMKNLSSTRNGYDSEVSVSLQLG--EPEAKRRKHSDCLSSI

Query:  KESST
        KESS+
Subjt:  KESST

XP_008439448.1 PREDICTED: uncharacterized protein LOC103484249 isoform X2 [Cucumis melo]0.0e+0095.36Show/hide
Query:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
        MDRTRLDL+FHHSVSTQSEESALDLERNYCSHL+LPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
Subjt:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA

Query:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEIYTELVDG
        TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWL+DVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEE+YTELVDG
Subjt:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEIYTELVDG

Query:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
        LRGSDPT+GSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
Subjt:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD

Query:  GAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
        GAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
Subjt:  GAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL

Query:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQVHEQRNNSVGGIDSTVAESCLDRLPLKYRLKENSESLGLC
        TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITT+DGLQVHEQRNNSVGGIDSTVAESCLDR+PLKYRLKENSE LG  
Subjt:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQVHEQRNNSVGGIDSTVAESCLDRLPLKYRLKENSESLGLC

Query:  VQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEECQEMKNLSSTRNGYDSEVSVSLQLG--EPEAKRRKHSDCLSSIKE
        VQQISPEGESSQG+ISPFKHA FHIENG+E+ PSVELQFIP LTS+SPLHQKN+E QE+KNLS+ R GYDSEVSVSLQLG  EPEAKRRK  DCLSSIKE
Subjt:  VQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEECQEMKNLSSTRNGYDSEVSVSLQLG--EPEAKRRKHSDCLSSIKE

Query:  SST
        SS+
Subjt:  SST

XP_023543759.1 uncharacterized protein LOC111803536 isoform X2 [Cucurbita pepo subsp. pepo]0.0e+0095.17Show/hide
Query:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
        MDRTRLDLS HHSVSTQSEESALDLERNYCSHLN+PSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
Subjt:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA

Query:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEIYTELVDG
        TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWL+DVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEE+YTELVDG
Subjt:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEIYTELVDG

Query:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
        LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGT+QVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
Subjt:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD

Query:  GAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
        GAFIPFAEDFNMNNV+TFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
Subjt:  GAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL

Query:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQVHEQRNNSVGGIDSTVAESCLDRLPLKYRLKENSESLGLC
        TG DEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITT+DGLQVHEQRNNSVGGIDSTVAESC DR+PL YRL+ENSE LGL 
Subjt:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQVHEQRNNSVGGIDSTVAESCLDRLPLKYRLKENSESLGLC

Query:  VQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEECQEMKNLSSTRNGYDSEVSVSLQLGEPEAKRRKHSDCLSSIKESS
        VQ+ISPEGESSQGLISPFKHAA  IENGFE+ PSVELQFIP L SSSPLHQKNEE QE+KNLS+ RNGYD EVSVSL+LGEPEAKRRKH D LSSIKESS
Subjt:  VQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEECQEMKNLSSTRNGYDSEVSVSLQLGEPEAKRRKHSDCLSSIKESS

TrEMBL top hitse value%identityAlignment
A0A0A0KIK0 Uncharacterized protein0.0e+0095.21Show/hide
Query:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
        MDRTRLDL+FHHSVSTQSEESALDLERNYCSHL+LPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
Subjt:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA

Query:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEIYTELVDG
        TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWL+DVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEE+YTELVDG
Subjt:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEIYTELVDG

Query:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
        LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
Subjt:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD

Query:  GAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
        GAFIPFAEDFNMNNVVTFVKGVGE+GDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
Subjt:  GAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL

Query:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ--VHEQRNNSVGGIDSTVAESCLDRLPLKYRLKENSESLG
        TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITT+DGLQ  VHEQRNNSVGGIDSTVAESCLDR+PLKYRLKENSE LG
Subjt:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ--VHEQRNNSVGGIDSTVAESCLDRLPLKYRLKENSESLG

Query:  LCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEECQEMKNLSSTRNGYDSEVSVSLQLG--EPEAKRRKHSDCLSSI
        L VQQISPEGESSQG+ISPFKH AF IENGFE+ PS+ELQFIP LTS+SPL QKNE+ QE+KNLS+ RNGYDSEVSVSLQLG  EPEAKRRKH DCLSSI
Subjt:  LCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEECQEMKNLSSTRNGYDSEVSVSLQLG--EPEAKRRKHSDCLSSI

Query:  KESST
        KESS+
Subjt:  KESST

A0A1S3AYD6 uncharacterized protein LOC103484249 isoform X10.0e+0095.04Show/hide
Query:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
        MDRTRLDL+FHHSVSTQSEESALDLERNYCSHL+LPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
Subjt:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA

Query:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEIYTELVDG
        TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWL+DVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEE+YTELVDG
Subjt:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEIYTELVDG

Query:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
        LRGSDPT+GSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
Subjt:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD

Query:  GAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
        GAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
Subjt:  GAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL

Query:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ--VHEQRNNSVGGIDSTVAESCLDRLPLKYRLKENSESLG
        TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITT+DGLQ  VHEQRNNSVGGIDSTVAESCLDR+PLKYRLKENSE LG
Subjt:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ--VHEQRNNSVGGIDSTVAESCLDRLPLKYRLKENSESLG

Query:  LCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEECQEMKNLSSTRNGYDSEVSVSLQLG--EPEAKRRKHSDCLSSI
          VQQISPEGESSQG+ISPFKHA FHIENG+E+ PSVELQFIP LTS+SPLHQKN+E QE+KNLS+ R GYDSEVSVSLQLG  EPEAKRRK  DCLSSI
Subjt:  LCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEECQEMKNLSSTRNGYDSEVSVSLQLG--EPEAKRRKHSDCLSSI

Query:  KESST
        KESS+
Subjt:  KESST

A0A1S3AYT3 uncharacterized protein LOC103484249 isoform X20.0e+0095.36Show/hide
Query:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
        MDRTRLDL+FHHSVSTQSEESALDLERNYCSHL+LPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
Subjt:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA

Query:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEIYTELVDG
        TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWL+DVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEE+YTELVDG
Subjt:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEIYTELVDG

Query:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
        LRGSDPT+GSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
Subjt:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD

Query:  GAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
        GAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
Subjt:  GAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL

Query:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQVHEQRNNSVGGIDSTVAESCLDRLPLKYRLKENSESLGLC
        TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITT+DGLQVHEQRNNSVGGIDSTVAESCLDR+PLKYRLKENSE LG  
Subjt:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQVHEQRNNSVGGIDSTVAESCLDRLPLKYRLKENSESLGLC

Query:  VQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEECQEMKNLSSTRNGYDSEVSVSLQLG--EPEAKRRKHSDCLSSIKE
        VQQISPEGESSQG+ISPFKHA FHIENG+E+ PSVELQFIP LTS+SPLHQKN+E QE+KNLS+ R GYDSEVSVSLQLG  EPEAKRRK  DCLSSIKE
Subjt:  VQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEECQEMKNLSSTRNGYDSEVSVSLQLG--EPEAKRRKHSDCLSSIKE

Query:  SST
        SS+
Subjt:  SST

A0A5A7UFD1 Uncharacterized protein0.0e+0095.19Show/hide
Query:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
        MDRTRLDL+FHHSVSTQSEESALDLERNYCSHL+LPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
Subjt:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA

Query:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEIYTELVDG
        TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWL+DVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEE+YTELVDG
Subjt:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEIYTELVDG

Query:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
        LRGSDPT+GSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
Subjt:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD

Query:  GAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
        GAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
Subjt:  GAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL

Query:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQVHEQRNNSVGGIDSTVAESCLDRLPLKYRLKENSESLGLC
        TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITT+DGLQVHEQRNNSVGGIDSTVAESCLDR+PLKYRLKENSE LG  
Subjt:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQVHEQRNNSVGGIDSTVAESCLDRLPLKYRLKENSESLGLC

Query:  VQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEECQEMKNLSSTRNGYDSEVSVSLQLG--EPEAKRRKHSDCLSSIKE
        VQQISPEGESSQG+ISPFKHA FHIENG+E+ PSVELQFIP LTS+SPLHQKNEE QE+KNL++ R G+DSEVSVSLQLG  EPEAKRRK  DCLSSIKE
Subjt:  VQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLHQKNEECQEMKNLSSTRNGYDSEVSVSLQLG--EPEAKRRKHSDCLSSIKE

Query:  SST
        SS+
Subjt:  SST

A0A6J1CXN4 uncharacterized protein LOC111015756 isoform X20.0e+0095.51Show/hide
Query:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA
        M+RTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLP+GQRA
Subjt:  MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRA

Query:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEIYTELVDG
        TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEIYTELVDG
Subjt:  TTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEIYTELVDG

Query:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
        LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD
Subjt:  LRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRAD

Query:  GAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
        GAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL
Subjt:  GAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILL

Query:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQVHEQRNNSVGGIDSTVAESCLDRLPLKYRLKENSESLGLC
        TGQD EK RPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQVHEQRN SVGGIDSTVAES L+R+PLKYRLKENSE LGL 
Subjt:  TGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQVHEQRNNSVGGIDSTVAESCLDRLPLKYRLKENSESLGLC

Query:  VQQISPEGESSQGLISPFKHAAFHIE-NGFEMAPSVELQFIPGLTSSSPLHQKNEECQEMKNLSSTRNGYDSEVSVSLQLGEPEAKRRKHSDCLSSIKES
        VQQISPEGESSQGLISPFKHAAFHIE N FEMAPSVELQF+P LTSSSP+HQKN+E  E+K+LS+ RNG+DSEVSVSLQLGEPE KRR+HSD LSSIKES
Subjt:  VQQISPEGESSQGLISPFKHAAFHIE-NGFEMAPSVELQFIPGLTSSSPLHQKNEECQEMKNLSSTRNGYDSEVSVSLQLGEPEAKRRKHSDCLSSIKES

Query:  ST
        S+
Subjt:  ST

SwissProt top hitse value%identityAlignment
B4XT64 Protein NARROW LEAF 11.6e-20264.22Show/hide
Query:  QSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRATTLLELMTIRAFHSKI
        QSEES+LD++     H + P  SPS  Q  A G   +E +AAYF WPTS+  + AAE RANYFGNLQKG+LP   GRLP GQ+A +LL+LMTIRAFHSKI
Subjt:  QSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRATTLLELMTIRAFHSKI

Query:  LRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVAS
        LRRFSLGTA+GFRI+KG LTDIPAI+VFVARKVH++WLN  QCLPA LEGPGG+WCDVDVVEFSYYGAPA TPKE++++ELVD L GSD  IGSGSQVAS
Subjt:  LRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVAS

Query:  QETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVV
         ET+GTLGAIVK RTG +QVGFLTN HVAVDLDYP+QKMFHPLPP+LGPGVYLGAVERATSFITDDVWYGI+AGTNPETFVRADGAFIPFA+DF+++ V 
Subjt:  QETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVV

Query:  TFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWG
        T V+GVG+IGDV  IDLQ P+NSLIGR+V KVGRSSG T GT+MAYALEYND KGICFFTD LVVG+++QTFDLEGDSGSLI+LT QD EKPRP+GIIWG
Subjt:  TFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWG

Query:  GTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ--VHEQRNNSVGGIDSTVAESCLDRLPL-KYRLKENSESLGLCVQQISPEGESSQG
        GTANRGRLKL     PENWTSGVDLGRLLD LELD+I T++ LQ  V +QR   V  + S V ES    + + + +++E  E LG+ +QQ+     ++ G
Subjt:  GTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ--VHEQRNNSVGGIDSTVAESCLDRLPL-KYRLKENSESLGLCVQQISPEGESSQG

Query:  LISPFKHAAFHIENGFEMAPSV----ELQFIPGLTSSSPLHQKNEECQEMKNLSSTRNGYDSEVSVSLQLGEPEAKRRKHSDCLSSI
                      G E + +V    E QFI      SP+    +  + + NL+   N  + E+++SL LG+ E KR + SD  SS+
Subjt:  LISPFKHAAFHIENGFEMAPSV----ELQFIPGLTSSSPLHQKNEECQEMKNLSSTRNGYDSEVSVSLQLGEPEAKRRKHSDCLSSI

Arabidopsis top hitse value%identityAlignment
AT2G35155.1 Trypsin family protein3.0e-23370.21Show/hide
Query:  SVSTQSEESALDLERN-YCSHLNLP-SSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRATTLLELMTIR
        + S++SE+SALDLERN +C+HL+LP SSSPSP Q F    Q +E+NA YFSWPT SRLND  EDRANYFGNLQKGVLPE +GRLP+GQ+ATTLLELMTIR
Subjt:  SVSTQSEESALDLERN-YCSHLNLP-SSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRATTLLELMTIR

Query:  AFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEIYTELVDGLRGSDPTIGS
        AFHSKILRRFSLGTA+GFRI +G+LT++PAI+VFVARKVHRQWLN +QCLP+ALEGPGG+WCDVDVVEF YYGAPAATPKE++Y ELVDGLRGSDP IGS
Subjt:  AFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEIYTELVDGLRGSDPTIGS

Query:  GSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRADGAFIPFAEDF
        GSQVASQETYGTLGAIVKSRTG  QVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDD WYGIFAGTNPETFVRADGAFIPFAEDF
Subjt:  GSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRADGAFIPFAEDF

Query:  NMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILLTGQDEEKPRP
        N +NV T +KG+GEIGDV+ IDLQSPI+SLIG++V+KVGRSSG T GTIMAYALEYND KGICF TDFLV+G++QQTFDLEGDSGSLILLTG + +KPRP
Subjt:  NMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILLTGQDEEKPRP

Query:  VGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ----VHEQRNNSVGGIDSTVAESCLDRLPLKYRLKENSESLGLCVQQISP
        VGIIWGGTANRGRLKL  GQ PENWTSGVDLGRLLDLLELDLIT++  L+      E+RN SV  +DSTV++S                          P
Subjt:  VGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ----VHEQRNNSVGGIDSTVAESCLDRLPLKYRLKENSESLGLCVQQISP

Query:  EGESSQGLISPFKHAAFHIENGFEMAPSVE--LQFIPGLTSSSPLHQKNEECQEMKNLSSTRNGYDSEVSVSLQLGEPEAKRRK
         G+       PF    FHIE   +    VE  +   P   + S    K +E  ++ NL + +N  + EV++SL LGEP+ K+ K
Subjt:  EGESSQGLISPFKHAAFHIENGFEMAPSVE--LQFIPGLTSSSPLHQKNEECQEMKNLSSTRNGYDSEVSVSLQLGEPEAKRRK

AT3G12950.1 Trypsin family protein1.7e-18862.7Show/hide
Query:  GSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQK------GVLPEILGRLPTGQRATTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAII
        G     T A+YFSWPTSSRL++AAE+RANYF NLQK       V PE +   P GQRATTLLELMTIRAFHSK+LR +SLGTAIGFRI++G+LTDIPAII
Subjt:  GSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQK------GVLPEILGRLPTGQRATTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAII

Query:  VFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAP--AATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLT
        VFV+RKVH+QWL+ +QCLP ALEG GGIWCDVDVVEFSY+G P    TPK+   T++VD L+GSDP IGSGSQVASQET GTLGAIV+S+TG RQVGF+T
Subjt:  VFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAP--AATPKEEIYTELVDGLRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLT

Query:  NRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVK-GVGEIGDVNKIDLQSPINS
        NRHVAV+LDYPSQKMFHPLPP+LGPGVYLGAVERATSFITDD+W+GIFAGTNPETFVRADGAFIPFA+D++++ V T VK GVGEIG+V  I+LQSP+ S
Subjt:  NRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVK-GVGEIGDVNKIDLQSPINS

Query:  LIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQT-FDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSG
        L+G++V+KVGRSSGLT GT++AYALEYND +G+CF TDFLVVG++ ++ FDLEGDSGSLI++ G  EEK RP+GIIWGGT +RGRLKLKVG+ PE+WT+G
Subjt:  LIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQT-FDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSG

Query:  VDLGRLLDLLELDLITTSDGLQ--VHEQRNNSVGGIDSTVAESCLDRLPLKYRLKENSESLGLCVQQISPEGESSQGLISPFKHAAFHIENGFEM---AP
        VDLGRLL  L+LDLITT +GL+  V EQR  S  G+ S VA+S    + LK              ++ SPE E  +  + P +     +E   E    AP
Subjt:  VDLGRLLDLLELDLITTSDGLQ--VHEQRNNSVGGIDSTVAESCLDRLPLKYRLKENSESLGLCVQQISPEGESSQGLISPFKHAAFHIENGFEM---AP

Query:  SVELQFIPGLTSSSPLHQKNEECQEMKNLSSTRNGYDSEVSVSLQLGEPEAKRRK
        SVE QF+P  +         E  +E      T    D ++ V L+LG+  AKRR+
Subjt:  SVELQFIPGLTSSSPLHQKNEECQEMKNLSSTRNGYDSEVSVSLQLGEPEAKRRK

AT5G45030.1 Trypsin family protein3.0e-23371.4Show/hide
Query:  MDRTRLDLSFHHSVSTQSEES-ALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSETN--AAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTG
        M+  RLDL FHHS S+QS ES ALDL++N  +H+ L SS  SP Q F  G+Q  ET+  AAYFSWPTSSRLND+AEDRANYF NLQKGVLPE    LPTG
Subjt:  MDRTRLDLSFHHSVSTQSEES-ALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSETN--AAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTG

Query:  QRATTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEIYTEL
        ++ATTLLELM IRAFHSK LRRFSLGTAIGFRI++G+LT+I AI+VFVARKVH+QWLN +QCLP ALEGPGG+WCDVDVVEF YYGAPA TPKE++YTEL
Subjt:  QRATTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEIYTEL

Query:  VDGLRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFV
        VD LRGS  +IGSGSQVASQETYGTLGAIVKS+TG RQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDD+WYGIFAGTNPETFV
Subjt:  VDGLRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFV

Query:  RADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSL
        RADGAFIPFAEDFN NNV T VKG+GEIGD++  DLQSP+NSLIGRKV+KVGRSSGLT GTIMAYALEYND KGICF TDFLVVG++QQTFDLEGDSGSL
Subjt:  RADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSL

Query:  ILLTGQDE--EKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ--VHEQRNNSV-GGIDSTVAESCLDRLPL-KYRLK
        ILL   DE  EKPRPVGIIWGGTANRGRLKLKVG+ PENWTSGVDLGR+L+LLELDLIT+++GLQ  V EQRN  +   +DSTV ES      + + +  
Subjt:  ILLTGQDE--EKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ--VHEQRNNSV-GGIDSTVAESCLDRLPL-KYRLK

Query:  ENSESLGLCVQQISPEGESSQGLISPFKHAAFHIENGFE-MAPSVELQFIPGLTSS-SPLHQK--NEECQEMKNLSSTR-NGYDSEVSVSLQLGEPEAKR
        EN E + L VQQ+  E ++S        H  F IE+  E +A   E QFIP  +++ S LHQK    E  E KNLSS + +    E+  SLQLGE + K+
Subjt:  ENSESLGLCVQQISPEGESSQGLISPFKHAAFHIENGFE-MAPSVELQFIPGLTSS-SPLHQK--NEECQEMKNLSSTR-NGYDSEVSVSLQLGEPEAKR

Query:  RKHSD
        RK +D
Subjt:  RKHSD

AT5G45030.2 Trypsin family protein3.0e-23371.4Show/hide
Query:  MDRTRLDLSFHHSVSTQSEES-ALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSETN--AAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTG
        M+  RLDL FHHS S+QS ES ALDL++N  +H+ L SS  SP Q F  G+Q  ET+  AAYFSWPTSSRLND+AEDRANYF NLQKGVLPE    LPTG
Subjt:  MDRTRLDLSFHHSVSTQSEES-ALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSETN--AAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTG

Query:  QRATTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEIYTEL
        ++ATTLLELM IRAFHSK LRRFSLGTAIGFRI++G+LT+I AI+VFVARKVH+QWLN +QCLP ALEGPGG+WCDVDVVEF YYGAPA TPKE++YTEL
Subjt:  QRATTLLELMTIRAFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEIYTEL

Query:  VDGLRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFV
        VD LRGS  +IGSGSQVASQETYGTLGAIVKS+TG RQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDD+WYGIFAGTNPETFV
Subjt:  VDGLRGSDPTIGSGSQVASQETYGTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFV

Query:  RADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSL
        RADGAFIPFAEDFN NNV T VKG+GEIGD++  DLQSP+NSLIGRKV+KVGRSSGLT GTIMAYALEYND KGICF TDFLVVG++QQTFDLEGDSGSL
Subjt:  RADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNKIDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSL

Query:  ILLTGQDE--EKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ--VHEQRNNSV-GGIDSTVAESCLDRLPL-KYRLK
        ILL   DE  EKPRPVGIIWGGTANRGRLKLKVG+ PENWTSGVDLGR+L+LLELDLIT+++GLQ  V EQRN  +   +DSTV ES      + + +  
Subjt:  ILLTGQDE--EKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLDLLELDLITTSDGLQ--VHEQRNNSV-GGIDSTVAESCLDRLPL-KYRLK

Query:  ENSESLGLCVQQISPEGESSQGLISPFKHAAFHIENGFE-MAPSVELQFIPGLTSS-SPLHQK--NEECQEMKNLSSTR-NGYDSEVSVSLQLGEPEAKR
        EN E + L VQQ+  E ++S        H  F IE+  E +A   E QFIP  +++ S LHQK    E  E KNLSS + +    E+  SLQLGE + K+
Subjt:  ENSESLGLCVQQISPEGESSQGLISPFKHAAFHIENGFE-MAPSVELQFIPGLTSS-SPLHQK--NEECQEMKNLSSTR-NGYDSEVSVSLQLGEPEAKR

Query:  RKHSD
        RK +D
Subjt:  RKHSD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACAGGACAAGGCTGGATTTAAGTTTTCATCATTCAGTGTCAACACAATCAGAGGAATCTGCCTTGGACTTGGAAAGGAACTATTGCAGTCATCTTAATCTACCTTC
ATCAAGTCCCTCACCTAGTCAATGCTTTGCTCCAGGTAGTCAGCTGTCTGAGACCAACGCTGCTTACTTTTCATGGCCCACTTCCAGCCGTTTAAACGACGCTGCAGAAG
ATAGAGCAAACTATTTTGGGAACCTTCAGAAAGGAGTGCTTCCTGAAATTTTGGGCCGTCTGCCCACTGGGCAGCGAGCTACTACTTTGCTTGAGCTAATGACCATAAGG
GCATTTCATAGCAAGATCTTGCGTCGTTTTAGCCTTGGAACTGCAATAGGATTTCGAATCCAGAAGGGTATGTTGACAGATATCCCTGCTATTATTGTCTTTGTTGCGCG
CAAAGTTCACAGGCAGTGGCTCAATGATGTTCAATGTCTACCCGCTGCACTTGAGGGACCTGGAGGTATATGGTGTGATGTTGATGTTGTGGAGTTCTCCTATTATGGTG
CTCCGGCAGCTACACCTAAAGAAGAAATATACACAGAGCTTGTTGATGGCCTGAGGGGAAGTGATCCAACAATTGGTTCTGGTTCCCAGGTTGCTAGCCAAGAAACTTAT
GGGACTTTGGGTGCAATTGTCAAAAGTCGTACAGGAACCCGGCAAGTTGGTTTCCTTACAAACCGTCATGTTGCAGTCGATTTAGACTACCCTAGTCAGAAAATGTTTCA
TCCTTTGCCTCCCAGCCTTGGACCTGGTGTATATCTGGGTGCTGTGGAGAGAGCAACATCGTTTATCACTGATGATGTCTGGTATGGCATCTTTGCTGGAACAAATCCAG
AAACATTTGTGCGAGCTGATGGAGCGTTCATTCCCTTCGCCGAAGATTTCAACATGAATAACGTCGTTACATTTGTAAAAGGCGTCGGTGAGATTGGTGATGTCAACAAA
ATAGACCTGCAGTCCCCGATCAACAGTCTCATTGGACGAAAAGTGATCAAGGTTGGAAGAAGTTCGGGCTTGACCAGAGGGACTATAATGGCATATGCCCTTGAGTATAA
CGATGTAAAAGGGATTTGTTTCTTCACCGACTTTCTTGTTGTTGGAGATGACCAGCAGACGTTTGACCTTGAAGGTGATAGTGGAAGCCTTATTCTCTTAACTGGTCAGG
ATGAGGAAAAACCACGTCCAGTTGGGATTATCTGGGGAGGAACAGCTAATCGAGGTCGGCTGAAATTAAAAGTTGGTCAGCCTCCAGAGAATTGGACCAGTGGAGTTGAT
CTTGGACGCCTTCTTGATCTCCTTGAGCTCGATCTTATTACAACAAGTGATGGTTTACAAGTGCATGAACAAAGGAACAATTCAGTTGGAGGGATTGATTCTACTGTTGC
GGAGTCCTGTCTCGATCGGCTGCCGTTAAAATATAGACTTAAAGAGAACTCCGAGTCACTTGGTTTATGTGTCCAGCAAATTTCTCCTGAAGGTGAATCCTCCCAGGGGC
TGATCTCACCTTTCAAGCATGCTGCATTCCACATAGAAAACGGGTTTGAGATGGCTCCAAGTGTTGAACTCCAGTTTATACCAGGATTAACCAGCAGCTCTCCGCTGCAT
CAGAAGAACGAAGAATGCCAAGAGATGAAAAATCTGTCCTCCACGAGGAATGGCTATGATAGCGAGGTATCAGTTTCACTGCAGTTGGGTGAGCCAGAAGCAAAGAGAAG
GAAGCACTCGGATTGTCTTTCAAGTATCAAAGAGTCATCAACATGA
mRNA sequenceShow/hide mRNA sequence
ATGGACAGGACAAGGCTGGATTTAAGTTTTCATCATTCAGTGTCAACACAATCAGAGGAATCTGCCTTGGACTTGGAAAGGAACTATTGCAGTCATCTTAATCTACCTTC
ATCAAGTCCCTCACCTAGTCAATGCTTTGCTCCAGGTAGTCAGCTGTCTGAGACCAACGCTGCTTACTTTTCATGGCCCACTTCCAGCCGTTTAAACGACGCTGCAGAAG
ATAGAGCAAACTATTTTGGGAACCTTCAGAAAGGAGTGCTTCCTGAAATTTTGGGCCGTCTGCCCACTGGGCAGCGAGCTACTACTTTGCTTGAGCTAATGACCATAAGG
GCATTTCATAGCAAGATCTTGCGTCGTTTTAGCCTTGGAACTGCAATAGGATTTCGAATCCAGAAGGGTATGTTGACAGATATCCCTGCTATTATTGTCTTTGTTGCGCG
CAAAGTTCACAGGCAGTGGCTCAATGATGTTCAATGTCTACCCGCTGCACTTGAGGGACCTGGAGGTATATGGTGTGATGTTGATGTTGTGGAGTTCTCCTATTATGGTG
CTCCGGCAGCTACACCTAAAGAAGAAATATACACAGAGCTTGTTGATGGCCTGAGGGGAAGTGATCCAACAATTGGTTCTGGTTCCCAGGTTGCTAGCCAAGAAACTTAT
GGGACTTTGGGTGCAATTGTCAAAAGTCGTACAGGAACCCGGCAAGTTGGTTTCCTTACAAACCGTCATGTTGCAGTCGATTTAGACTACCCTAGTCAGAAAATGTTTCA
TCCTTTGCCTCCCAGCCTTGGACCTGGTGTATATCTGGGTGCTGTGGAGAGAGCAACATCGTTTATCACTGATGATGTCTGGTATGGCATCTTTGCTGGAACAAATCCAG
AAACATTTGTGCGAGCTGATGGAGCGTTCATTCCCTTCGCCGAAGATTTCAACATGAATAACGTCGTTACATTTGTAAAAGGCGTCGGTGAGATTGGTGATGTCAACAAA
ATAGACCTGCAGTCCCCGATCAACAGTCTCATTGGACGAAAAGTGATCAAGGTTGGAAGAAGTTCGGGCTTGACCAGAGGGACTATAATGGCATATGCCCTTGAGTATAA
CGATGTAAAAGGGATTTGTTTCTTCACCGACTTTCTTGTTGTTGGAGATGACCAGCAGACGTTTGACCTTGAAGGTGATAGTGGAAGCCTTATTCTCTTAACTGGTCAGG
ATGAGGAAAAACCACGTCCAGTTGGGATTATCTGGGGAGGAACAGCTAATCGAGGTCGGCTGAAATTAAAAGTTGGTCAGCCTCCAGAGAATTGGACCAGTGGAGTTGAT
CTTGGACGCCTTCTTGATCTCCTTGAGCTCGATCTTATTACAACAAGTGATGGTTTACAAGTGCATGAACAAAGGAACAATTCAGTTGGAGGGATTGATTCTACTGTTGC
GGAGTCCTGTCTCGATCGGCTGCCGTTAAAATATAGACTTAAAGAGAACTCCGAGTCACTTGGTTTATGTGTCCAGCAAATTTCTCCTGAAGGTGAATCCTCCCAGGGGC
TGATCTCACCTTTCAAGCATGCTGCATTCCACATAGAAAACGGGTTTGAGATGGCTCCAAGTGTTGAACTCCAGTTTATACCAGGATTAACCAGCAGCTCTCCGCTGCAT
CAGAAGAACGAAGAATGCCAAGAGATGAAAAATCTGTCCTCCACGAGGAATGGCTATGATAGCGAGGTATCAGTTTCACTGCAGTTGGGTGAGCCAGAAGCAAAGAGAAG
GAAGCACTCGGATTGTCTTTCAAGTATCAAAGAGTCATCAACATGA
Protein sequenceShow/hide protein sequence
MDRTRLDLSFHHSVSTQSEESALDLERNYCSHLNLPSSSPSPSQCFAPGSQLSETNAAYFSWPTSSRLNDAAEDRANYFGNLQKGVLPEILGRLPTGQRATTLLELMTIR
AFHSKILRRFSLGTAIGFRIQKGMLTDIPAIIVFVARKVHRQWLNDVQCLPAALEGPGGIWCDVDVVEFSYYGAPAATPKEEIYTELVDGLRGSDPTIGSGSQVASQETY
GTLGAIVKSRTGTRQVGFLTNRHVAVDLDYPSQKMFHPLPPSLGPGVYLGAVERATSFITDDVWYGIFAGTNPETFVRADGAFIPFAEDFNMNNVVTFVKGVGEIGDVNK
IDLQSPINSLIGRKVIKVGRSSGLTRGTIMAYALEYNDVKGICFFTDFLVVGDDQQTFDLEGDSGSLILLTGQDEEKPRPVGIIWGGTANRGRLKLKVGQPPENWTSGVD
LGRLLDLLELDLITTSDGLQVHEQRNNSVGGIDSTVAESCLDRLPLKYRLKENSESLGLCVQQISPEGESSQGLISPFKHAAFHIENGFEMAPSVELQFIPGLTSSSPLH
QKNEECQEMKNLSSTRNGYDSEVSVSLQLGEPEAKRRKHSDCLSSIKESST