; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS004617 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS004617
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPeptidase S1, PA clan
Genome locationscaffold995:725078..729069
RNA-Seq ExpressionMS004617
SyntenyMS004617
Gene Ontology termsGO:0043231 - intracellular membrane-bounded organelle (cellular component)
InterPro domainsIPR009003 - Peptidase S1, PA clan


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607973.1 Protein NARROW LEAF 1, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0096.84Show/hide
Query:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHN RINCSGSTPSEESALDLERNC  HSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDIL PLPKGQRA
Subjt:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA

Query:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
        TTLLELMTIRAFHSKILRCYSLGTAIGFRI+KGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
Subjt:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD

Query:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
        LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Subjt:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD

Query:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
        GAFIPFADDF+MSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
Subjt:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL

Query:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL
        KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLIT+D+G KAAVQEQRTVSATVIGSIVGDSSPPDTTLPK+KSEEKFEPL
Subjt:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL

Query:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE
        GFQIQHMPTEVEPSSA++RPLLETEFHLEAGTS APSVEHQFIPSLFSCSPS  NS+L RAVSQNLSSLRSDCEDI VSL LGDHEAKRRRSDASVSMEE
Subjt:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE

Query:  LK
        LK
Subjt:  LK

XP_022136083.1 uncharacterized protein LOC111007860 isoform X1 [Momordica charantia]0.0e+0099.83Show/hide
Query:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA

Query:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
        TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
Subjt:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD

Query:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
        LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Subjt:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD

Query:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
        GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGK+VVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
Subjt:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL

Query:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL
        KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL
Subjt:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL

Query:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE
        GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE
Subjt:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE

Query:  LK
        LK
Subjt:  LK

XP_022136089.1 uncharacterized protein LOC111007860 isoform X2 [Momordica charantia]0.0e+0099.67Show/hide
Query:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA

Query:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
        TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
Subjt:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD

Query:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
        LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Subjt:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD

Query:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
        GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGK+VVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
Subjt:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL

Query:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL
        KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLK AVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL
Subjt:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL

Query:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE
        GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE
Subjt:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE

Query:  LK
        LK
Subjt:  LK

XP_022940289.1 uncharacterized protein LOC111445958 [Cucurbita moschata]0.0e+0096.68Show/hide
Query:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHN RINCSGSTPSEESALDLERNC  HSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA

Query:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
        TTLLELMTIRAFHSKILRCYSLGTAIGFRI+KGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
Subjt:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD

Query:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
        LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Subjt:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD

Query:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
        GAFIPFADDF+MSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
Subjt:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL

Query:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL
        KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLIT+D+G KAAVQEQRTVSATVIGSIVGDSSPPDTTLPK+KSEEKFEPL
Subjt:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL

Query:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE
        GFQIQHMPTEVEPSSA++RPLLETEFHLEAGTS AP+VEHQFIPSLFSCSPS  NS+L RAVSQNLSSLRS+CEDI VSL LGDHEAKRRRSDASVSMEE
Subjt:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE

Query:  LK
        LK
Subjt:  LK

XP_022981089.1 uncharacterized protein LOC111480347 [Cucurbita maxima]0.0e+0097.01Show/hide
Query:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHN RINCSGSTPSEESALDLERNC  HSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA

Query:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
        TTLLELMTIRAFHSKILRCYSLGTAIGFRI+KGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
Subjt:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD

Query:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
        LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Subjt:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD

Query:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
        GAFIPFADDF+MSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
Subjt:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL

Query:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL
        KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLIT+D+G KAAVQEQRTVSATVIGSIVGDSSPPDTTLPK+KSEEKFEPL
Subjt:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL

Query:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE
        GFQIQHMPTEVEPSSA++RPLLETEFHLEAGTS APSVEHQFIPSLFSCSPS  NS+L RAVSQNLSSLRSDCEDI VSL LGDHEAKRRRSDASVSMEE
Subjt:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE

Query:  LK
        LK
Subjt:  LK

TrEMBL top hitse value%identityAlignment
A0A6J1C3A8 uncharacterized protein LOC111007860 isoform X20.0e+0099.67Show/hide
Query:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA

Query:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
        TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
Subjt:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD

Query:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
        LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Subjt:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD

Query:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
        GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGK+VVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
Subjt:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL

Query:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL
        KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLK AVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL
Subjt:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL

Query:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE
        GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE
Subjt:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE

Query:  LK
        LK
Subjt:  LK

A0A6J1C6M7 uncharacterized protein LOC111007860 isoform X10.0e+0099.83Show/hide
Query:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA

Query:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
        TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
Subjt:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD

Query:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
        LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Subjt:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD

Query:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
        GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGK+VVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
Subjt:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL

Query:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL
        KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL
Subjt:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL

Query:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE
        GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE
Subjt:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE

Query:  LK
        LK
Subjt:  LK

A0A6J1F9R8 uncharacterized protein LOC1114433850.0e+0095.36Show/hide
Query:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTR + RINCSGSTPSEESAL+LERN CSHSNLPSFS PTLQPFASAGQHCESNAAYFSWPTPIR+SV  EERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA

Query:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
         TLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
Subjt:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD

Query:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
        LRGSD CIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Subjt:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD

Query:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
        G+FIPFADDF+MSTVTTSVKGVGE+GDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
Subjt:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL

Query:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL
        KGENRESL+PIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLP+EKSEEKFEPL
Subjt:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL

Query:  GFQIQHMPTEVEPSSAEERP-LLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSME
        GFQIQHM TEVEPSSA+++  LLETEFHLEAGTS APSVEHQFIPSLFSCSPSHQNSSL RAVSQNLSSLR+DCEDICVSLQLGDHEAKR+R D SVSME
Subjt:  GFQIQHMPTEVEPSSAEERP-LLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSME

Query:  ELK
        ELK
Subjt:  ELK

A0A6J1FJM9 uncharacterized protein LOC1114459580.0e+0096.68Show/hide
Query:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHN RINCSGSTPSEESALDLERNC  HSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA

Query:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
        TTLLELMTIRAFHSKILRCYSLGTAIGFRI+KGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
Subjt:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD

Query:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
        LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Subjt:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD

Query:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
        GAFIPFADDF+MSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
Subjt:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL

Query:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL
        KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLIT+D+G KAAVQEQRTVSATVIGSIVGDSSPPDTTLPK+KSEEKFEPL
Subjt:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL

Query:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE
        GFQIQHMPTEVEPSSA++RPLLETEFHLEAGTS AP+VEHQFIPSLFSCSPS  NS+L RAVSQNLSSLRS+CEDI VSL LGDHEAKRRRSDASVSMEE
Subjt:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE

Query:  LK
        LK
Subjt:  LK

A0A6J1J109 uncharacterized protein LOC1114803470.0e+0097.01Show/hide
Query:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHN RINCSGSTPSEESALDLERNC  HSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA

Query:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
        TTLLELMTIRAFHSKILRCYSLGTAIGFRI+KGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
Subjt:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD

Query:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
        LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Subjt:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD

Query:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
        GAFIPFADDF+MSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
Subjt:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL

Query:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL
        KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLIT+D+G KAAVQEQRTVSATVIGSIVGDSSPPDTTLPK+KSEEKFEPL
Subjt:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL

Query:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE
        GFQIQHMPTEVEPSSA++RPLLETEFHLEAGTS APSVEHQFIPSLFSCSPS  NS+L RAVSQNLSSLRSDCEDI VSL LGDHEAKRRRSDASVSMEE
Subjt:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE

Query:  LK
        LK
Subjt:  LK

SwissProt top hitse value%identityAlignment
B4XT64 Protein NARROW LEAF 16.4e-21264.74Show/hide
Query:  SGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRATTLLELMTIRAF
        SG   SEES+LD++     H + P    P++QP AS   H E++AAYF WPT      AAE RANYF NLQKG+LP     LPKGQ+A +LL+LMTIRAF
Subjt:  SGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRATTLLELMTIRAF

Query:  HSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDDLRGSDLCIGSGS
        HSKILR +SLGTA+GFRIRKG LTDIPAILVFV+RKVHK+WL+P QCLP  LEGPGGVWCDVDVVEFSY+GAP   PKEQ+++E+VD L GSD CIGSGS
Subjt:  HSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDDLRGSDLCIGSGS

Query:  QVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRADGAFIPFADDFEM
        QVAS ET+GTLGAIV+ +TG++QVGFLTN HVAVDLDYPNQKMFHPLPP LGPGVYLGAVERATSFITD++WYGI+AG NPETFVRADGAFIPFADDF++
Subjt:  QVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRADGAFIPFADDFEM

Query:  STVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENRESLQPIG
        STVTT V+GVG++GDVK IDLQ P+++LIG+QV KVGRSSG TTGTV+AYALEYNDEKGICF TD LVVGEN+QTFDLEGDSGSLIIL  ++ E  +PIG
Subjt:  STVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENRESLQPIG

Query:  IIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPLGFQIQHMPTEVE
        IIWGGTANRGRLKL     PENWTSGVDLGRLL+ LELD+I ++E L+ AVQ+QR      + S VG+SS     +P+EK EE FEPLG QIQ +P    
Subjt:  IIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPLGFQIQHMPTEVE

Query:  PSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSME
         +S  E          EA  ++    EHQFI +    SP   +    R+++ NL++     E++ +SL LGD E KR RSD+  S++
Subjt:  PSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSME

Arabidopsis top hitse value%identityAlignment
AT2G35155.1 Trypsin family protein7.2e-21966.5Show/hide
Query:  INCSGSTPSEESALDLERN-CCSHSNLPSFSPPT-LQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRATTLLELM
        I  + S+ SE+SALDLERN  C+H +LPS S P+ LQPF    QH ESNA YFSWPT  RL+   E+RANYF NLQKGVLP+ +  LP GQ+ATTLLELM
Subjt:  INCSGSTPSEESALDLERN-CCSHSNLPSFSPPT-LQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRATTLLELM

Query:  TIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDDLRGSDLC
        TIRAFHSKILR +SLGTA+GFRI +GVLT++PAILVFV+RKVH+QWL+P+QCLP+ALEGPGGVWCDVDVVEF Y+GAP   PKEQ+Y E+VD LRGSD C
Subjt:  TIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDDLRGSDLC

Query:  IGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRADGAFIPFA
        IGSGSQVASQETYGTLGAIV+S+TG+ QVGFLTNRHVAVDLDYP+QKMFHPLPP+LGPGVYLGAVERATSFITD+ WYGIFAG NPETFVRADGAFIPFA
Subjt:  IGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRADGAFIPFA

Query:  DDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENRES
        +DF  S VTT +KG+GE+GDV  IDLQSPI +LIGKQVVKVGRSSG TTGT++AYALEYNDEKGICFLTDFLV+GENQQTFDLEGDSGSLI+L G N + 
Subjt:  DDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENRES

Query:  LQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLK--AAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPLGFQIQ
         +P+GIIWGGTANRGRLKL  GQ PENWTSGVDLGRLL+LLELDLITS+  L+  AA +E+R  S T + S V  SSPPD     +K +E FEP      
Subjt:  LQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLK--AAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPLGFQIQ

Query:  HMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLR-SDCEDICVSLQLGDHEAKR
        H+   ++P+   E  +      +   TS   +++ Q IP L                  NL +L+ S  E++ +SL LG+ + K+
Subjt:  HMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLR-SDCEDICVSLQLGDHEAKR

AT3G12950.1 Trypsin family protein2.2e-21269.91Show/hide
Query:  FASAGQHCESNAA-YFSWPTPIRLSVAAEERANYFANLQK------GVLPDILHPLPKGQRATTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDI
        + S GQHCE  AA YFSWPT  RLS AAEERANYF+NLQK       V P+ +   PKGQRATTLLELMTIRAFHSK+LRCYSLGTAIGFRIR+GVLTDI
Subjt:  FASAGQHCESNAA-YFSWPTPIRLSVAAEERANYFANLQK------GVLPDILHPLPKGQRATTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDI

Query:  PAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPN--PAPKEQLYTEIVDDLRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQV
        PAI+VFVSRKVHKQWLSP+QCLPTALEG GG+WCDVDVVEFSYFG P+  P PK+   T+IVD L+GSD  IGSGSQVASQET GTLGAIVRSQTG RQV
Subjt:  PAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPN--PAPKEQLYTEIVDDLRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQV

Query:  GFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRADGAFIPFADDFEMSTVTTSVK-GVGEVGDVKFIDLQS
        GF+TNRHVAV+LDYP+QKMFHPLPP LGPGVYLGAVERATSFITD+LW+GIFAG NPETFVRADGAFIPFADD+++S VTTSVK GVGE+G+VK I+LQS
Subjt:  GFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRADGAFIPFADDFEMSTVTTSVK-GVGEVGDVKFIDLQS

Query:  PISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQT-FDLEGDSGSLIILKGENRESLQPIGIIWGGTANRGRLKLKVGQPPEN
        P+ +L+GKQVVKVGRSSGLTTGTVLAYALEYNDE+G+CFLTDFLVVGEN ++ FDLEGDSGSLI++KGE  E  +PIGIIWGGT +RGRLKLKVG+ PE+
Subjt:  PISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQT-FDLEGDSGSLIILKGENRESLQPIGIIWGGTANRGRLKLKVGQPPEN

Query:  WTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEK--SEEKFE-PLGFQIQHMPTEVEPSSAEERPLLETEFHLEAG
        WT+GVDLGRLL  L+LDLIT+DEGLKAAVQEQR  S T + S+V DSSPP   L KEK   EEK E  LG      P +V+    EER  +ET+      
Subjt:  WTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEK--SEEKFE-PLGFQIQHMPTEVEPSSAEERPLLETEFHLEAG

Query:  TSMAPSVEHQFIPSLF-SCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDAS
           APSVEHQF+P+    CS S    +    +    ++   D  D+CV L+LGD  AKRRR+  +
Subjt:  TSMAPSVEHQFIPSLF-SCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDAS

AT5G45030.1 Trypsin family protein5.0e-22066.94Show/hide
Query:  MEQTRHNGRINCSGSTPSEES-ALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESN--AAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKG
        ME  R + R + S S+ S ES ALDL++N  +H  L S SP  LQPF S  QH E++  AAYFSWPT  RL+ +AE+RANYFANLQKGVLP+    LP G
Subjt:  MEQTRHNGRINCSGSTPSEES-ALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESN--AAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKG

Query:  QRATTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEI
        ++ATTLLELM IRAFHSK LR +SLGTAIGFRIR+GVLT+I AILVFV+RKVHKQWL+P+QCLPTALEGPGGVWCDVDVVEF Y+GAP   PKEQ+YTE+
Subjt:  QRATTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEI

Query:  VDDLRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFV
        VDDLRGS   IGSGSQVASQETYGTLGAIV+S+TG RQVGFLTNRHVAVDLDYP+QKMFHPLPP+LGPGVYLGAVERATSFITD+LWYGIFAG NPETFV
Subjt:  VDDLRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFV

Query:  RADGAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSL
        RADGAFIPFA+DF  + VTT+VKG+GE+GD+   DLQSP+++LIG++VVKVGRSSGLTTGT++AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSL
Subjt:  RADGAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSL

Query:  IILKG--ENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRT-VSATVIGSIVGDSSPPDTTLPKEKSE
        I+L    E  E  +P+GIIWGGTANRGRLKLKVG+ PENWTSGVDLGR+LNLLELDLITS+EGL+AAV EQR  +    + S V +SSP    + + K+ 
Subjt:  IILKG--ENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRT-VSATVIGSIVGDSSPPDTTLPKEKSE

Query:  EKFEPLGFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLF-SCSPSHQN-SSLDRAVSQNLSSLR--SDCEDICVSLQLGDHEA-KR
        E FEP+   +Q +   +E  ++   P  + E  LE   S+A   EHQFIPS   + S  HQ  +  +   S+NLSSL+  S  ++I  SLQLG+ +  KR
Subjt:  EKFEPLGFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLF-SCSPSHQN-SSLDRAVSQNLSSLR--SDCEDICVSLQLGDHEA-KR

Query:  RRSDASVSMEE
        +R+D+    +E
Subjt:  RRSDASVSMEE

AT5G45030.2 Trypsin family protein5.0e-22066.94Show/hide
Query:  MEQTRHNGRINCSGSTPSEES-ALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESN--AAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKG
        ME  R + R + S S+ S ES ALDL++N  +H  L S SP  LQPF S  QH E++  AAYFSWPT  RL+ +AE+RANYFANLQKGVLP+    LP G
Subjt:  MEQTRHNGRINCSGSTPSEES-ALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESN--AAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKG

Query:  QRATTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEI
        ++ATTLLELM IRAFHSK LR +SLGTAIGFRIR+GVLT+I AILVFV+RKVHKQWL+P+QCLPTALEGPGGVWCDVDVVEF Y+GAP   PKEQ+YTE+
Subjt:  QRATTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEI

Query:  VDDLRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFV
        VDDLRGS   IGSGSQVASQETYGTLGAIV+S+TG RQVGFLTNRHVAVDLDYP+QKMFHPLPP+LGPGVYLGAVERATSFITD+LWYGIFAG NPETFV
Subjt:  VDDLRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFV

Query:  RADGAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSL
        RADGAFIPFA+DF  + VTT+VKG+GE+GD+   DLQSP+++LIG++VVKVGRSSGLTTGT++AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSL
Subjt:  RADGAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSL

Query:  IILKG--ENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRT-VSATVIGSIVGDSSPPDTTLPKEKSE
        I+L    E  E  +P+GIIWGGTANRGRLKLKVG+ PENWTSGVDLGR+LNLLELDLITS+EGL+AAV EQR  +    + S V +SSP    + + K+ 
Subjt:  IILKG--ENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRT-VSATVIGSIVGDSSPPDTTLPKEKSE

Query:  EKFEPLGFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLF-SCSPSHQN-SSLDRAVSQNLSSLR--SDCEDICVSLQLGDHEA-KR
        E FEP+   +Q +   +E  ++   P  + E  LE   S+A   EHQFIPS   + S  HQ  +  +   S+NLSSL+  S  ++I  SLQLG+ +  KR
Subjt:  EKFEPLGFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLF-SCSPSHQN-SSLDRAVSQNLSSLR--SDCEDICVSLQLGDHEA-KR

Query:  RRSDASVSMEE
        +R+D+    +E
Subjt:  RRSDASVSMEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCAGACTAGACACAATGGCAGGATCAACTGCTCGGGTTCTACTCCATCCGAGGAATCAGCTCTTGATCTTGAAAGAAATTGCTGCAGTCACTCTAATTTACCTTC
ATTCAGCCCACCTACGCTTCAGCCGTTTGCATCCGCTGGGCAGCATTGCGAGAGCAATGCTGCATACTTTTCATGGCCCACCCCCATTCGGTTAAGCGTCGCTGCTGAGG
AGAGGGCGAACTATTTTGCCAACCTTCAGAAAGGGGTACTACCAGATATCCTTCATCCATTACCAAAAGGGCAGCGAGCAACCACGTTACTTGAGCTCATGACAATAAGA
GCCTTCCATAGCAAGATCCTGCGTTGTTACAGTCTTGGCACGGCCATTGGGTTCCGTATACGAAAGGGCGTGCTGACTGACATTCCTGCCATTCTTGTTTTTGTTTCCAG
GAAAGTGCACAAGCAATGGCTTAGTCCTATCCAATGCCTGCCCACTGCCCTGGAGGGTCCAGGTGGTGTGTGGTGTGATGTGGATGTTGTAGAATTCTCATATTTTGGTG
CACCCAACCCTGCTCCTAAAGAACAGTTGTATACGGAAATTGTCGATGATCTACGTGGCAGTGATCTGTGCATTGGCTCTGGTTCCCAGGTTGCCAGCCAGGAGACTTAT
GGAACCTTGGGTGCTATCGTAAGGAGTCAAACAGGCAGTCGACAAGTTGGTTTTCTCACAAACCGTCATGTCGCAGTTGATTTAGATTATCCAAATCAGAAGATGTTTCA
TCCTCTTCCACCAACGCTCGGGCCAGGGGTGTATCTTGGTGCTGTGGAGAGAGCTACTTCATTCATCACTGATGAGCTTTGGTATGGAATTTTTGCTGGAATAAACCCAG
AGACGTTCGTCAGGGCTGATGGGGCATTTATTCCTTTTGCCGATGATTTTGAGATGTCGACCGTCACTACATCTGTAAAAGGGGTGGGAGAGGTTGGCGACGTGAAGTTT
ATTGACTTGCAGTCGCCTATCAGTACCCTCATAGGGAAGCAGGTGGTGAAAGTTGGGAGAAGTTCTGGCTTGACCACAGGAACTGTTTTGGCTTATGCTCTTGAGTACAA
TGATGAGAAAGGAATATGCTTCTTAACTGATTTTCTCGTGGTTGGTGAGAATCAACAGACTTTCGATCTCGAAGGAGATAGCGGAAGCCTCATCATTTTAAAGGGTGAGA
ATAGGGAGAGTTTGCAACCGATTGGGATCATATGGGGTGGAACTGCAAACCGAGGTCGCCTTAAGTTAAAAGTCGGCCAGCCTCCCGAGAATTGGACAAGTGGGGTTGAT
CTCGGGCGCCTTCTCAATCTGCTCGAACTTGATCTAATCACAAGTGATGAAGGGCTTAAAGCAGCAGTGCAAGAACAGAGAACTGTCTCGGCAACCGTCATCGGGTCCAT
CGTCGGAGACTCCTCTCCTCCCGATACAACATTGCCAAAGGAGAAAAGTGAAGAGAAGTTTGAGCCATTGGGCTTCCAAATCCAGCATATGCCTACAGAAGTGGAACCTT
CTTCAGCCGAGGAGCGGCCGCTCCTGGAGACCGAGTTTCATCTCGAAGCCGGGACGAGCATGGCTCCCAGTGTCGAACATCAGTTCATTCCAAGCCTTTTCAGTTGCTCT
CCCTCGCATCAAAACAGTTCACTGGACCGTGCAGTTTCCCAGAACCTCTCTTCGCTTCGGAGCGACTGCGAAGACATTTGCGTTTCCTTGCAACTGGGCGACCATGAAGC
AAAGAGAAGACGCTCAGATGCTTCTGTTTCCATGGAAGAACTGAAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGCAGACTAGACACAATGGCAGGATCAACTGCTCGGGTTCTACTCCATCCGAGGAATCAGCTCTTGATCTTGAAAGAAATTGCTGCAGTCACTCTAATTTACCTTC
ATTCAGCCCACCTACGCTTCAGCCGTTTGCATCCGCTGGGCAGCATTGCGAGAGCAATGCTGCATACTTTTCATGGCCCACCCCCATTCGGTTAAGCGTCGCTGCTGAGG
AGAGGGCGAACTATTTTGCCAACCTTCAGAAAGGGGTACTACCAGATATCCTTCATCCATTACCAAAAGGGCAGCGAGCAACCACGTTACTTGAGCTCATGACAATAAGA
GCCTTCCATAGCAAGATCCTGCGTTGTTACAGTCTTGGCACGGCCATTGGGTTCCGTATACGAAAGGGCGTGCTGACTGACATTCCTGCCATTCTTGTTTTTGTTTCCAG
GAAAGTGCACAAGCAATGGCTTAGTCCTATCCAATGCCTGCCCACTGCCCTGGAGGGTCCAGGTGGTGTGTGGTGTGATGTGGATGTTGTAGAATTCTCATATTTTGGTG
CACCCAACCCTGCTCCTAAAGAACAGTTGTATACGGAAATTGTCGATGATCTACGTGGCAGTGATCTGTGCATTGGCTCTGGTTCCCAGGTTGCCAGCCAGGAGACTTAT
GGAACCTTGGGTGCTATCGTAAGGAGTCAAACAGGCAGTCGACAAGTTGGTTTTCTCACAAACCGTCATGTCGCAGTTGATTTAGATTATCCAAATCAGAAGATGTTTCA
TCCTCTTCCACCAACGCTCGGGCCAGGGGTGTATCTTGGTGCTGTGGAGAGAGCTACTTCATTCATCACTGATGAGCTTTGGTATGGAATTTTTGCTGGAATAAACCCAG
AGACGTTCGTCAGGGCTGATGGGGCATTTATTCCTTTTGCCGATGATTTTGAGATGTCGACCGTCACTACATCTGTAAAAGGGGTGGGAGAGGTTGGCGACGTGAAGTTT
ATTGACTTGCAGTCGCCTATCAGTACCCTCATAGGGAAGCAGGTGGTGAAAGTTGGGAGAAGTTCTGGCTTGACCACAGGAACTGTTTTGGCTTATGCTCTTGAGTACAA
TGATGAGAAAGGAATATGCTTCTTAACTGATTTTCTCGTGGTTGGTGAGAATCAACAGACTTTCGATCTCGAAGGAGATAGCGGAAGCCTCATCATTTTAAAGGGTGAGA
ATAGGGAGAGTTTGCAACCGATTGGGATCATATGGGGTGGAACTGCAAACCGAGGTCGCCTTAAGTTAAAAGTCGGCCAGCCTCCCGAGAATTGGACAAGTGGGGTTGAT
CTCGGGCGCCTTCTCAATCTGCTCGAACTTGATCTAATCACAAGTGATGAAGGGCTTAAAGCAGCAGTGCAAGAACAGAGAACTGTCTCGGCAACCGTCATCGGGTCCAT
CGTCGGAGACTCCTCTCCTCCCGATACAACATTGCCAAAGGAGAAAAGTGAAGAGAAGTTTGAGCCATTGGGCTTCCAAATCCAGCATATGCCTACAGAAGTGGAACCTT
CTTCAGCCGAGGAGCGGCCGCTCCTGGAGACCGAGTTTCATCTCGAAGCCGGGACGAGCATGGCTCCCAGTGTCGAACATCAGTTCATTCCAAGCCTTTTCAGTTGCTCT
CCCTCGCATCAAAACAGTTCACTGGACCGTGCAGTTTCCCAGAACCTCTCTTCGCTTCGGAGCGACTGCGAAGACATTTGCGTTTCCTTGCAACTGGGCGACCATGAAGC
AAAGAGAAGACGCTCAGATGCTTCTGTTTCCATGGAAGAACTGAAA
Protein sequenceShow/hide protein sequence
MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRATTLLELMTIR
AFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDDLRGSDLCIGSGSQVASQETY
GTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRADGAFIPFADDFEMSTVTTSVKGVGEVGDVKF
IDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVD
LGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPLGFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCS
PSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEELK