; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC11g0756 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC11g0756
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionPeptidase S1, PA clan
Genome locationMC11:6110383..6114373
RNA-Seq ExpressionMC11g0756
SyntenyMC11g0756
Gene Ontology termsGO:0043231 - intracellular membrane-bounded organelle (cellular component)
InterPro domainsIPR009003 - Peptidase S1, PA clan


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607973.1 Protein NARROW LEAF 1, partial [Cucurbita argyrosperma subsp. sororia]0.096.51Show/hide
Query:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHN RINCSGSTPSEESALDLERNC  HSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDIL PLPKGQRA
Subjt:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA

Query:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
        TTLLELMTIRAFHSKILRCYSLGTAIGFRI+KGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
Subjt:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD

Query:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
        LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Subjt:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD

Query:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
        GAFIPFADDF+MSTVTTSVKGVGEVGDVKFIDLQSPISTLIGK+VVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
Subjt:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL

Query:  KGENKESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL
        KGEN+ESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLIT+D+G KAAVQEQRTVSATVIGSIVGDSSPPDTTLPK+KSEEKFEPL
Subjt:  KGENKESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL

Query:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE
        GFQIQHMPTEVEPSSA++RPLLETEFHLEAGTS APSVEHQFIPSLFSCSPS  NS+L RAVSQNLSSLRSDCEDI VSL LGDHEAKRRRSDASVSMEE
Subjt:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE

Query:  LK
        LK
Subjt:  LK

XP_022136083.1 uncharacterized protein LOC111007860 isoform X1 [Momordica charantia]0.099.83Show/hide
Query:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA

Query:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
        TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
Subjt:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD

Query:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
        LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Subjt:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD

Query:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
        GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
Subjt:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL

Query:  KGENKESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL
        KGEN+ESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL
Subjt:  KGENKESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL

Query:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE
        GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE
Subjt:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE

Query:  LK
        LK
Subjt:  LK

XP_022136089.1 uncharacterized protein LOC111007860 isoform X2 [Momordica charantia]0.099.67Show/hide
Query:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA

Query:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
        TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
Subjt:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD

Query:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
        LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Subjt:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD

Query:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
        GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
Subjt:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL

Query:  KGENKESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL
        KGEN+ESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKA VQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL
Subjt:  KGENKESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL

Query:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE
        GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE
Subjt:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE

Query:  LK
        LK
Subjt:  LK

XP_022940289.1 uncharacterized protein LOC111445958 [Cucurbita moschata]0.096.35Show/hide
Query:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHN RINCSGSTPSEESALDLERNC  HSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA

Query:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
        TTLLELMTIRAFHSKILRCYSLGTAIGFRI+KGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
Subjt:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD

Query:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
        LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Subjt:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD

Query:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
        GAFIPFADDF+MSTVTTSVKGVGEVGDVKFIDLQSPISTLIGK+VVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
Subjt:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL

Query:  KGENKESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL
        KGEN+ESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLIT+D+G KAAVQEQRTVSATVIGSIVGDSSPPDTTLPK+KSEEKFEPL
Subjt:  KGENKESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL

Query:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE
        GFQIQHMPTEVEPSSA++RPLLETEFHLEAGTS AP+VEHQFIPSLFSCSPS  NS+L RAVSQNLSSLRS+CEDI VSL LGDHEAKRRRSDASVSMEE
Subjt:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE

Query:  LK
        LK
Subjt:  LK

XP_022981089.1 uncharacterized protein LOC111480347 [Cucurbita maxima]0.096.68Show/hide
Query:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHN RINCSGSTPSEESALDLERNC  HSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA

Query:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
        TTLLELMTIRAFHSKILRCYSLGTAIGFRI+KGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
Subjt:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD

Query:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
        LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Subjt:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD

Query:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
        GAFIPFADDF+MSTVTTSVKGVGEVGDVKFIDLQSPISTLIGK+VVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
Subjt:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL

Query:  KGENKESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL
        KGEN+ESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLIT+D+G KAAVQEQRTVSATVIGSIVGDSSPPDTTLPK+KSEEKFEPL
Subjt:  KGENKESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL

Query:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE
        GFQIQHMPTEVEPSSA++RPLLETEFHLEAGTS APSVEHQFIPSLFSCSPS  NS+L RAVSQNLSSLRSDCEDI VSL LGDHEAKRRRSDASVSMEE
Subjt:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE

Query:  LK
        LK
Subjt:  LK

TrEMBL top hitse value%identityAlignment
A0A6J1C3A8 uncharacterized protein LOC111007860 isoform X20.099.67Show/hide
Query:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA

Query:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
        TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
Subjt:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD

Query:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
        LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Subjt:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD

Query:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
        GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
Subjt:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL

Query:  KGENKESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL
        KGEN+ESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKA VQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL
Subjt:  KGENKESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL

Query:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE
        GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE
Subjt:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE

Query:  LK
        LK
Subjt:  LK

A0A6J1C6M7 uncharacterized protein LOC111007860 isoform X10.099.83Show/hide
Query:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA

Query:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
        TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
Subjt:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD

Query:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
        LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Subjt:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD

Query:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
        GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
Subjt:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL

Query:  KGENKESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL
        KGEN+ESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL
Subjt:  KGENKESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL

Query:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE
        GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE
Subjt:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE

Query:  LK
        LK
Subjt:  LK

A0A6J1F9R8 uncharacterized protein LOC1114433850.095.02Show/hide
Query:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTR + RINCSGSTPSEESAL+LERN CSHSNLPSFS PTLQPFASAGQHCESNAAYFSWPTPIR+SV  EERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA

Query:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
         TLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
Subjt:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD

Query:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
        LRGSD CIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Subjt:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD

Query:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
        G+FIPFADDF+MSTVTTSVKGVGE+GDVKFIDLQSPISTLIGK+VVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
Subjt:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL

Query:  KGENKESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL
        KGEN+ESL+PIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLP+EKSEEKFEPL
Subjt:  KGENKESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL

Query:  GFQIQHMPTEVEPSSAEERPLL-ETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSME
        GFQIQHM TEVEPSSA+++ LL ETEFHLEAGTS APSVEHQFIPSLFSCSPSHQNSSL RAVSQNLSSLR+DCEDICVSLQLGDHEAKR+R D SVSME
Subjt:  GFQIQHMPTEVEPSSAEERPLL-ETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSME

Query:  ELK
        ELK
Subjt:  ELK

A0A6J1FJM9 uncharacterized protein LOC1114459580.096.35Show/hide
Query:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHN RINCSGSTPSEESALDLERNC  HSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA

Query:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
        TTLLELMTIRAFHSKILRCYSLGTAIGFRI+KGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
Subjt:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD

Query:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
        LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Subjt:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD

Query:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
        GAFIPFADDF+MSTVTTSVKGVGEVGDVKFIDLQSPISTLIGK+VVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
Subjt:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL

Query:  KGENKESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL
        KGEN+ESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLIT+D+G KAAVQEQRTVSATVIGSIVGDSSPPDTTLPK+KSEEKFEPL
Subjt:  KGENKESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL

Query:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE
        GFQIQHMPTEVEPSSA++RPLLETEFHLEAGTS AP+VEHQFIPSLFSCSPS  NS+L RAVSQNLSSLRS+CEDI VSL LGDHEAKRRRSDASVSMEE
Subjt:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE

Query:  LK
        LK
Subjt:  LK

A0A6J1J109 uncharacterized protein LOC1114803470.096.68Show/hide
Query:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHN RINCSGSTPSEESALDLERNC  HSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRA

Query:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
        TTLLELMTIRAFHSKILRCYSLGTAIGFRI+KGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
Subjt:  TTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD

Query:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
        LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Subjt:  LRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD

Query:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
        GAFIPFADDF+MSTVTTSVKGVGEVGDVKFIDLQSPISTLIGK+VVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
Subjt:  GAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL

Query:  KGENKESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL
        KGEN+ESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLIT+D+G KAAVQEQRTVSATVIGSIVGDSSPPDTTLPK+KSEEKFEPL
Subjt:  KGENKESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPL

Query:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE
        GFQIQHMPTEVEPSSA++RPLLETEFHLEAGTS APSVEHQFIPSLFSCSPS  NS+L RAVSQNLSSLRSDCEDI VSL LGDHEAKRRRSDASVSMEE
Subjt:  GFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEE

Query:  LK
        LK
Subjt:  LK

SwissProt top hitse value%identityAlignment
B4XT64 Protein NARROW LEAF 11.4e-21164.57Show/hide
Query:  SGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRATTLLELMTIRAF
        SG   SEES+LD++     H + P    P++QP AS   H E++AAYF WPT      AAE RANYF NLQKG+LP     LPKGQ+A +LL+LMTIRAF
Subjt:  SGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRATTLLELMTIRAF

Query:  HSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDDLRGSDLCIGSGS
        HSKILR +SLGTA+GFRIRKG LTDIPAILVFV+RKVHK+WL+P QCLP  LEGPGGVWCDVDVVEFSY+GAP   PKEQ+++E+VD L GSD CIGSGS
Subjt:  HSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDDLRGSDLCIGSGS

Query:  QVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRADGAFIPFADDFEM
        QVAS ET+GTLGAIV+ +TG++QVGFLTN HVAVDLDYPNQKMFHPLPP LGPGVYLGAVERATSFITD++WYGI+AG NPETFVRADGAFIPFADDF++
Subjt:  QVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRADGAFIPFADDFEM

Query:  STVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENKESLQPIG
        STVTT V+GVG++GDVK IDLQ P+++LIG++V KVGRSSG TTGTV+AYALEYNDEKGICF TD LVVGEN+QTFDLEGDSGSLIIL  ++ E  +PIG
Subjt:  STVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENKESLQPIG

Query:  IIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPLGFQIQHMPTEVE
        IIWGGTANRGRLKL     PENWTSGVDLGRLL+ LELD+I ++E L+ AVQ+QR      + S VG+SS     +P+EK EE FEPLG QIQ +P    
Subjt:  IIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPLGFQIQHMPTEVE

Query:  PSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSME
         +S  E          EA  ++    EHQFI +    SP   +    R+++ NL++     E++ +SL LGD E KR RSD+  S++
Subjt:  PSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSME

Arabidopsis top hitse value%identityAlignment
AT2G35155.1 Trypsin family protein2.1e-21866.32Show/hide
Query:  INCSGSTPSEESALDLERN-CCSHSNLPSFSPPT-LQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRATTLLELM
        I  + S+ SE+SALDLERN  C+H +LPS S P+ LQPF    QH ESNA YFSWPT  RL+   E+RANYF NLQKGVLP+ +  LP GQ+ATTLLELM
Subjt:  INCSGSTPSEESALDLERN-CCSHSNLPSFSPPT-LQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRATTLLELM

Query:  TIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDDLRGSDLC
        TIRAFHSKILR +SLGTA+GFRI +GVLT++PAILVFV+RKVH+QWL+P+QCLP+ALEGPGGVWCDVDVVEF Y+GAP   PKEQ+Y E+VD LRGSD C
Subjt:  TIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDDLRGSDLC

Query:  IGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRADGAFIPFA
        IGSGSQVASQETYGTLGAIV+S+TG+ QVGFLTNRHVAVDLDYP+QKMFHPLPP+LGPGVYLGAVERATSFITD+ WYGIFAG NPETFVRADGAFIPFA
Subjt:  IGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRADGAFIPFA

Query:  DDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENKES
        +DF  S VTT +KG+GE+GDV  IDLQSPI +LIGK+VVKVGRSSG TTGT++AYALEYNDEKGICFLTDFLV+GENQQTFDLEGDSGSLI+L G N + 
Subjt:  DDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENKES

Query:  LQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLK--AAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPLGFQIQ
         +P+GIIWGGTANRGRLKL  GQ PENWTSGVDLGRLL+LLELDLITS+  L+  AA +E+R  S T + S V  SSPPD     +K +E FEP      
Subjt:  LQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLK--AAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPLGFQIQ

Query:  HMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLR-SDCEDICVSLQLGDHEAKR
        H+   ++P+   E  +      +   TS   +++ Q IP L                  NL +L+ S  E++ +SL LG+ + K+
Subjt:  HMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCSPSHQNSSLDRAVSQNLSSLR-SDCEDICVSLQLGDHEAKR

AT3G12950.1 Trypsin family protein6.5e-21269.73Show/hide
Query:  FASAGQHCESNAA-YFSWPTPIRLSVAAEERANYFANLQK------GVLPDILHPLPKGQRATTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDI
        + S GQHCE  AA YFSWPT  RLS AAEERANYF+NLQK       V P+ +   PKGQRATTLLELMTIRAFHSK+LRCYSLGTAIGFRIR+GVLTDI
Subjt:  FASAGQHCESNAA-YFSWPTPIRLSVAAEERANYFANLQK------GVLPDILHPLPKGQRATTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDI

Query:  PAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPN--PAPKEQLYTEIVDDLRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQV
        PAI+VFVSRKVHKQWLSP+QCLPTALEG GG+WCDVDVVEFSYFG P+  P PK+   T+IVD L+GSD  IGSGSQVASQET GTLGAIVRSQTG RQV
Subjt:  PAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPN--PAPKEQLYTEIVDDLRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQV

Query:  GFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRADGAFIPFADDFEMSTVTTSVK-GVGEVGDVKFIDLQS
        GF+TNRHVAV+LDYP+QKMFHPLPP LGPGVYLGAVERATSFITD+LW+GIFAG NPETFVRADGAFIPFADD+++S VTTSVK GVGE+G+VK I+LQS
Subjt:  GFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRADGAFIPFADDFEMSTVTTSVK-GVGEVGDVKFIDLQS

Query:  PISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQT-FDLEGDSGSLIILKGENKESLQPIGIIWGGTANRGRLKLKVGQPPEN
        P+ +L+GK+VVKVGRSSGLTTGTVLAYALEYNDE+G+CFLTDFLVVGEN ++ FDLEGDSGSLI++KGE  E  +PIGIIWGGT +RGRLKLKVG+ PE+
Subjt:  PISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQT-FDLEGDSGSLIILKGENKESLQPIGIIWGGTANRGRLKLKVGQPPEN

Query:  WTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEK--SEEKFE-PLGFQIQHMPTEVEPSSAEERPLLETEFHLEAG
        WT+GVDLGRLL  L+LDLIT+DEGLKAAVQEQR  S T + S+V DSSPP   L KEK   EEK E  LG      P +V+    EER  +ET+      
Subjt:  WTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEK--SEEKFE-PLGFQIQHMPTEVEPSSAEERPLLETEFHLEAG

Query:  TSMAPSVEHQFIPSLF-SCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDAS
           APSVEHQF+P+    CS S    +    +    ++   D  D+CV L+LGD  AKRRR+  +
Subjt:  TSMAPSVEHQFIPSLF-SCSPSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDAS

AT5G45030.1 Trypsin family protein3.8e-22066.94Show/hide
Query:  MEQTRHNGRINCSGSTPSEES-ALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESN--AAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKG
        ME  R + R + S S+ S ES ALDL++N  +H  L S SP  LQPF S  QH E++  AAYFSWPT  RL+ +AE+RANYFANLQKGVLP+    LP G
Subjt:  MEQTRHNGRINCSGSTPSEES-ALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESN--AAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKG

Query:  QRATTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEI
        ++ATTLLELM IRAFHSK LR +SLGTAIGFRIR+GVLT+I AILVFV+RKVHKQWL+P+QCLPTALEGPGGVWCDVDVVEF Y+GAP   PKEQ+YTE+
Subjt:  QRATTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEI

Query:  VDDLRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFV
        VDDLRGS   IGSGSQVASQETYGTLGAIV+S+TG RQVGFLTNRHVAVDLDYP+QKMFHPLPP+LGPGVYLGAVERATSFITD+LWYGIFAG NPETFV
Subjt:  VDDLRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFV

Query:  RADGAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSL
        RADGAFIPFA+DF  + VTT+VKG+GE+GD+   DLQSP+++LIG++VVKVGRSSGLTTGT++AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSL
Subjt:  RADGAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSL

Query:  IILKG--ENKESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRT-VSATVIGSIVGDSSPPDTTLPKEKSE
        I+L    E  E  +P+GIIWGGTANRGRLKLKVG+ PENWTSGVDLGR+LNLLELDLITS+EGL+AAV EQR  +    + S V +SSP    + + K+ 
Subjt:  IILKG--ENKESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRT-VSATVIGSIVGDSSPPDTTLPKEKSE

Query:  EKFEPLGFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLF-SCSPSHQN-SSLDRAVSQNLSSLR--SDCEDICVSLQLGDHEA-KR
        E FEP+   +Q +   +E  ++   P  + E  LE   S+A   EHQFIPS   + S  HQ  +  +   S+NLSSL+  S  ++I  SLQLG+ +  KR
Subjt:  EKFEPLGFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLF-SCSPSHQN-SSLDRAVSQNLSSLR--SDCEDICVSLQLGDHEA-KR

Query:  RRSDASVSMEE
        +R+D+    +E
Subjt:  RRSDASVSMEE

AT5G45030.2 Trypsin family protein3.8e-22066.94Show/hide
Query:  MEQTRHNGRINCSGSTPSEES-ALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESN--AAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKG
        ME  R + R + S S+ S ES ALDL++N  +H  L S SP  LQPF S  QH E++  AAYFSWPT  RL+ +AE+RANYFANLQKGVLP+    LP G
Subjt:  MEQTRHNGRINCSGSTPSEES-ALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESN--AAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKG

Query:  QRATTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEI
        ++ATTLLELM IRAFHSK LR +SLGTAIGFRIR+GVLT+I AILVFV+RKVHKQWL+P+QCLPTALEGPGGVWCDVDVVEF Y+GAP   PKEQ+YTE+
Subjt:  QRATTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEI

Query:  VDDLRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFV
        VDDLRGS   IGSGSQVASQETYGTLGAIV+S+TG RQVGFLTNRHVAVDLDYP+QKMFHPLPP+LGPGVYLGAVERATSFITD+LWYGIFAG NPETFV
Subjt:  VDDLRGSDLCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFV

Query:  RADGAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSL
        RADGAFIPFA+DF  + VTT+VKG+GE+GD+   DLQSP+++LIG++VVKVGRSSGLTTGT++AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSL
Subjt:  RADGAFIPFADDFEMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSL

Query:  IILKG--ENKESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRT-VSATVIGSIVGDSSPPDTTLPKEKSE
        I+L    E  E  +P+GIIWGGTANRGRLKLKVG+ PENWTSGVDLGR+LNLLELDLITS+EGL+AAV EQR  +    + S V +SSP    + + K+ 
Subjt:  IILKG--ENKESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQRT-VSATVIGSIVGDSSPPDTTLPKEKSE

Query:  EKFEPLGFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLF-SCSPSHQN-SSLDRAVSQNLSSLR--SDCEDICVSLQLGDHEA-KR
        E FEP+   +Q +   +E  ++   P  + E  LE   S+A   EHQFIPS   + S  HQ  +  +   S+NLSSL+  S  ++I  SLQLG+ +  KR
Subjt:  EKFEPLGFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLF-SCSPSHQN-SSLDRAVSQNLSSLR--SDCEDICVSLQLGDHEA-KR

Query:  RRSDASVSMEE
        +R+D+    +E
Subjt:  RRSDASVSMEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCAGACTAGACACAATGGCAGGATCAACTGCTCGGGTTCTACTCCATCCGAGGAATCAGCTCTTGATCTTGAAAGAAATTGCTGCAGTCACTCTAATTTACCTTC
ATTCAGCCCACCTACGCTTCAGCCGTTTGCATCCGCTGGGCAGCATTGCGAGAGCAATGCTGCATACTTTTCATGGCCCACCCCCATTCGGTTAAGCGTCGCTGCTGAGG
AGAGGGCGAACTATTTTGCCAACCTTCAGAAAGGGGTACTACCAGATATCCTTCATCCGTTACCAAAAGGGCAGCGAGCAACCACGTTACTTGAGCTCATGACAATAAGA
GCCTTCCATAGCAAGATCCTGCGTTGTTACAGTCTTGGCACGGCCATTGGGTTCCGTATACGAAAGGGCGTGCTGACTGACATTCCTGCCATTCTTGTTTTTGTTTCCAG
GAAAGTGCACAAGCAATGGCTTAGTCCTATCCAATGCCTGCCCACTGCCCTGGAGGGTCCAGGTGGTGTGTGGTGTGATGTGGATGTTGTAGAATTCTCATATTTTGGTG
CACCCAACCCTGCTCCTAAAGAACAGTTGTATACGGAAATTGTCGATGATCTACGTGGCAGTGATCTGTGCATTGGCTCTGGTTCCCAGGTTGCCAGCCAGGAGACTTAT
GGAACCTTGGGTGCTATCGTAAGGAGTCAAACAGGCAGTCGACAAGTTGGTTTTCTCACAAACCGTCATGTCGCAGTTGATTTAGATTATCCAAATCAGAAGATGTTTCA
TCCTCTTCCACCAACGCTCGGGCCAGGGGTGTATCTTGGTGCTGTGGAGAGAGCTACTTCATTCATCACTGATGAGCTTTGGTATGGAATTTTTGCTGGAATAAACCCAG
AGACGTTCGTCAGGGCTGATGGGGCATTTATTCCTTTTGCCGATGATTTTGAGATGTCGACCGTCACTACATCTGTAAAAGGGGTGGGAGAGGTTGGCGACGTGAAGTTT
ATTGACTTGCAGTCGCCTATCAGTACCCTCATAGGGAAGCGGGTGGTGAAAGTTGGGAGAAGTTCTGGCTTGACCACAGGAACTGTTTTGGCGTATGCTCTTGAGTACAA
TGATGAGAAAGGAATATGCTTCTTGACTGATTTTCTCGTGGTTGGCGAGAATCAACAGACTTTCGATCTCGAAGGAGATAGCGGAAGCCTCATCATTTTAAAGGGTGAGA
ATAAGGAGAGTTTGCAACCGATTGGGATCATATGGGGTGGAACTGCAAACCGAGGTCGCCTTAAGTTAAAAGTCGGCCAGCCTCCCGAGAATTGGACAAGTGGGGTTGAT
CTCGGGCGCCTTCTCAATCTGCTCGAACTTGATCTAATCACAAGTGATGAAGGGCTTAAAGCAGCAGTGCAAGAACAGAGAACTGTCTCGGCAACCGTCATCGGGTCCAT
TGTCGGAGACTCCTCTCCTCCCGATACAACATTGCCAAAGGAGAAAAGTGAAGAGAAGTTTGAGCCATTGGGCTTCCAAATCCAGCATATGCCTACAGAAGTGGAACCTT
CTTCAGCCGAGGAGCGGCCACTCCTGGAGACCGAGTTTCATCTCGAAGCCGGGACGAGCATGGCTCCCAGTGTCGAACATCAGTTCATTCCAAGCCTTTTCAGTTGCTCT
CCCTCGCATCAAAACAGTTCACTGGACCGTGCAGTTTCCCAGAACCTCTCTTCGCTTCGGAGCGACTGCGAAGACATTTGCGTTTCCTTGCAACTGGGCGACCATGAAGC
AAAGAGAAGACGCTCGGATGCTTCTGTTTCCATGGAAGAACTGAAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGCAGACTAGACACAATGGCAGGATCAACTGCTCGGGTTCTACTCCATCCGAGGAATCAGCTCTTGATCTTGAAAGAAATTGCTGCAGTCACTCTAATTTACCTTC
ATTCAGCCCACCTACGCTTCAGCCGTTTGCATCCGCTGGGCAGCATTGCGAGAGCAATGCTGCATACTTTTCATGGCCCACCCCCATTCGGTTAAGCGTCGCTGCTGAGG
AGAGGGCGAACTATTTTGCCAACCTTCAGAAAGGGGTACTACCAGATATCCTTCATCCGTTACCAAAAGGGCAGCGAGCAACCACGTTACTTGAGCTCATGACAATAAGA
GCCTTCCATAGCAAGATCCTGCGTTGTTACAGTCTTGGCACGGCCATTGGGTTCCGTATACGAAAGGGCGTGCTGACTGACATTCCTGCCATTCTTGTTTTTGTTTCCAG
GAAAGTGCACAAGCAATGGCTTAGTCCTATCCAATGCCTGCCCACTGCCCTGGAGGGTCCAGGTGGTGTGTGGTGTGATGTGGATGTTGTAGAATTCTCATATTTTGGTG
CACCCAACCCTGCTCCTAAAGAACAGTTGTATACGGAAATTGTCGATGATCTACGTGGCAGTGATCTGTGCATTGGCTCTGGTTCCCAGGTTGCCAGCCAGGAGACTTAT
GGAACCTTGGGTGCTATCGTAAGGAGTCAAACAGGCAGTCGACAAGTTGGTTTTCTCACAAACCGTCATGTCGCAGTTGATTTAGATTATCCAAATCAGAAGATGTTTCA
TCCTCTTCCACCAACGCTCGGGCCAGGGGTGTATCTTGGTGCTGTGGAGAGAGCTACTTCATTCATCACTGATGAGCTTTGGTATGGAATTTTTGCTGGAATAAACCCAG
AGACGTTCGTCAGGGCTGATGGGGCATTTATTCCTTTTGCCGATGATTTTGAGATGTCGACCGTCACTACATCTGTAAAAGGGGTGGGAGAGGTTGGCGACGTGAAGTTT
ATTGACTTGCAGTCGCCTATCAGTACCCTCATAGGGAAGCGGGTGGTGAAAGTTGGGAGAAGTTCTGGCTTGACCACAGGAACTGTTTTGGCGTATGCTCTTGAGTACAA
TGATGAGAAAGGAATATGCTTCTTGACTGATTTTCTCGTGGTTGGCGAGAATCAACAGACTTTCGATCTCGAAGGAGATAGCGGAAGCCTCATCATTTTAAAGGGTGAGA
ATAAGGAGAGTTTGCAACCGATTGGGATCATATGGGGTGGAACTGCAAACCGAGGTCGCCTTAAGTTAAAAGTCGGCCAGCCTCCCGAGAATTGGACAAGTGGGGTTGAT
CTCGGGCGCCTTCTCAATCTGCTCGAACTTGATCTAATCACAAGTGATGAAGGGCTTAAAGCAGCAGTGCAAGAACAGAGAACTGTCTCGGCAACCGTCATCGGGTCCAT
TGTCGGAGACTCCTCTCCTCCCGATACAACATTGCCAAAGGAGAAAAGTGAAGAGAAGTTTGAGCCATTGGGCTTCCAAATCCAGCATATGCCTACAGAAGTGGAACCTT
CTTCAGCCGAGGAGCGGCCACTCCTGGAGACCGAGTTTCATCTCGAAGCCGGGACGAGCATGGCTCCCAGTGTCGAACATCAGTTCATTCCAAGCCTTTTCAGTTGCTCT
CCCTCGCATCAAAACAGTTCACTGGACCGTGCAGTTTCCCAGAACCTCTCTTCGCTTCGGAGCGACTGCGAAGACATTTGCGTTTCCTTGCAACTGGGCGACCATGAAGC
AAAGAGAAGACGCTCGGATGCTTCTGTTTCCATGGAAGAACTGAAA
Protein sequenceShow/hide protein sequence
MEQTRHNGRINCSGSTPSEESALDLERNCCSHSNLPSFSPPTLQPFASAGQHCESNAAYFSWPTPIRLSVAAEERANYFANLQKGVLPDILHPLPKGQRATTLLELMTIR
AFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDDLRGSDLCIGSGSQVASQETY
GTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRADGAFIPFADDFEMSTVTTSVKGVGEVGDVKF
IDLQSPISTLIGKRVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENKESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVD
LGRLLNLLELDLITSDEGLKAAVQEQRTVSATVIGSIVGDSSPPDTTLPKEKSEEKFEPLGFQIQHMPTEVEPSSAEERPLLETEFHLEAGTSMAPSVEHQFIPSLFSCS
PSHQNSSLDRAVSQNLSSLRSDCEDICVSLQLGDHEAKRRRSDASVSMEELK