; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0016485 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0016485
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionTrypsin family protein
Genome locationchr07:5396109..5401415
RNA-Seq ExpressionIVF0016485
SyntenyIVF0016485
Gene Ontology termsGO:0043231 - intracellular membrane-bounded organelle (cellular component)
InterPro domainsIPR009003 - Peptidase S1, PA clan


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004144638.1 protein NARROW LEAF 1 [Cucumis sativus]0.094.7Show/hide
Query:  MEQTRHNRRINCSGSIPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHNRRINCSGS PSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNRRINCSGSIPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA

Query:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLN-------SLFYSGARWCLVRCGCCRILIFGAPNPAPKEQLYTEIV
        NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWL+       +L   G  WC V         FGAPNPAPKEQLYTEIV
Subjt:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLN-------SLFYSGARWCLVRCGCCRILIFGAPNPAPKEQLYTEIV

Query:  DDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVR
        DDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVR
Subjt:  DDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVR

Query:  ADGAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI
        ADGAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQS ISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI
Subjt:  ADGAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI

Query:  ILKGENRETLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEEKSE
        ILKGENR+TLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEEKSE
Subjt:  ILKGENRETLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEEKSE

Query:  PLGFQIQHMPTEVEPSTAKDRPLLETEFHLEPGMNRAPSVEHQFIPSLFSCSPPHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRRRSDASVSM
         LGFQIQHMPTEVEPS AKDRPLLETEFHLEPGMNRAPSVEHQFIPSLFSCSP HQNSTLDRAVSQNLS LRSDCED CVSLQLGDHEAKRRRSDASVSM
Subjt:  PLGFQIQHMPTEVEPSTAKDRPLLETEFHLEPGMNRAPSVEHQFIPSLFSCSPPHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRRRSDASVSM

Query:  EELK
        EELK
Subjt:  EELK

XP_008465434.1 PREDICTED: uncharacterized protein LOC103503046 [Cucumis melo]0.096.03Show/hide
Query:  MEQTRHNRRINCSGSIPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHNRRINCSGSIPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNRRINCSGSIPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA

Query:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLN-------SLFYSGARWCLVRCGCCRILIFGAPNPAPKEQLYTEIV
        NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWL+       +L   G  WC V         FGAPNPAPKEQLYTEIV
Subjt:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLN-------SLFYSGARWCLVRCGCCRILIFGAPNPAPKEQLYTEIV

Query:  DDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVR
        DDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVR
Subjt:  DDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVR

Query:  ADGAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI
        ADGAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI
Subjt:  ADGAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI

Query:  ILKGENRETLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEEKSE
        ILKGENRETLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEEKSE
Subjt:  ILKGENRETLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEEKSE

Query:  PLGFQIQHMPTEVEPSTAKDRPLLETEFHLEPGMNRAPSVEHQFIPSLFSCSPPHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRRRSDASVSM
        PLGFQIQHMPTEVEPSTAKDRPLLETEFHLEPGMNRAPSVEHQFIPSLFSCSPPHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRRRSDASVSM
Subjt:  PLGFQIQHMPTEVEPSTAKDRPLLETEFHLEPGMNRAPSVEHQFIPSLFSCSPPHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRRRSDASVSM

Query:  EELK
        EELK
Subjt:  EELK

XP_022136083.1 uncharacterized protein LOC111007860 isoform X1 [Momordica charantia]0.091.06Show/hide
Query:  MEQTRHNRRINCSGSIPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHN RINCSGS PSEESALDLERNCCSHS+LPSFS PTLQPFASAGQH   N AYFSWPTPIRLSV  EERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNRRINCSGSIPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA

Query:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLN-------SLFYSGARWCLVRCGCCRILIFGAPNPAPKEQLYTEIV
         TLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWL+       +L   G  WC V         FGAPNPAPKEQLYTEIV
Subjt:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLN-------SLFYSGARWCLVRCGCCRILIFGAPNPAPKEQLYTEIV

Query:  DDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVR
        DDLRGSD CIGSGSQVASQETYGTLGAIVRSQTG RQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVR
Subjt:  DDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVR

Query:  ADGAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI
        ADGAFIPFADDF+MSTVTTSVKGVG+VGDVKFIDLQS ISTLIGK+VVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI
Subjt:  ADGAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI

Query:  ILKGENRETLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEEKSE
        ILKGENRE+LQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQ TVSATVIGSIVGDSSPPDTTLPKEKSEEK E
Subjt:  ILKGENRETLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEEKSE

Query:  PLGFQIQHMPTEVEPSTAKDRPLLETEFHLEPGMNRAPSVEHQFIPSLFSCSPPHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRRRSDASVSM
        PLGFQIQHMPTEVEPS+A++RPLLETEFHLE G + APSVEHQFIPSLFSCSP HQNS+LDRAVSQNLSSLRSDCED CVSLQLGDHEAKRRRSDASVSM
Subjt:  PLGFQIQHMPTEVEPSTAKDRPLLETEFHLEPGMNRAPSVEHQFIPSLFSCSPPHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRRRSDASVSM

Query:  EELK
        EELK
Subjt:  EELK

XP_022136089.1 uncharacterized protein LOC111007860 isoform X2 [Momordica charantia]0.090.89Show/hide
Query:  MEQTRHNRRINCSGSIPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHN RINCSGS PSEESALDLERNCCSHS+LPSFS PTLQPFASAGQH   N AYFSWPTPIRLSV  EERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNRRINCSGSIPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA

Query:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLN-------SLFYSGARWCLVRCGCCRILIFGAPNPAPKEQLYTEIV
         TLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWL+       +L   G  WC V         FGAPNPAPKEQLYTEIV
Subjt:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLN-------SLFYSGARWCLVRCGCCRILIFGAPNPAPKEQLYTEIV

Query:  DDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVR
        DDLRGSD CIGSGSQVASQETYGTLGAIVRSQTG RQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVR
Subjt:  DDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVR

Query:  ADGAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI
        ADGAFIPFADDF+MSTVTTSVKGVG+VGDVKFIDLQS ISTLIGK+VVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI
Subjt:  ADGAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI

Query:  ILKGENRETLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEEKSE
        ILKGENRE+LQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKA VQEQ TVSATVIGSIVGDSSPPDTTLPKEKSEEK E
Subjt:  ILKGENRETLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEEKSE

Query:  PLGFQIQHMPTEVEPSTAKDRPLLETEFHLEPGMNRAPSVEHQFIPSLFSCSPPHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRRRSDASVSM
        PLGFQIQHMPTEVEPS+A++RPLLETEFHLE G + APSVEHQFIPSLFSCSP HQNS+LDRAVSQNLSSLRSDCED CVSLQLGDHEAKRRRSDASVSM
Subjt:  PLGFQIQHMPTEVEPSTAKDRPLLETEFHLEPGMNRAPSVEHQFIPSLFSCSPPHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRRRSDASVSM

Query:  EELK
        EELK
Subjt:  EELK

XP_038898393.1 protein NARROW LEAF 1 [Benincasa hispida]0.094.04Show/hide
Query:  MEQTRHNRRINCSGSIPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHNRRINCSGS PSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNRRINCSGSIPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA

Query:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLN-------SLFYSGARWCLVRCGCCRILIFGAPNPAPKEQLYTEIV
        NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWL+       +L   G  WC V         FGAPNPAPKEQLYTEIV
Subjt:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLN-------SLFYSGARWCLVRCGCCRILIFGAPNPAPKEQLYTEIV

Query:  DDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVR
        DDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTG RQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVR
Subjt:  DDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVR

Query:  ADGAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI
        ADGAFIPFADDFDMSTVTTSVKGVG+VGDVKFIDLQS ISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI
Subjt:  ADGAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI

Query:  ILKGENRETLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEEKSE
        ILKGENRE+LQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEEKSE
Subjt:  ILKGENRETLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEEKSE

Query:  PLGFQIQHMPTEVEPSTAKDRPLLETEFHLEPGMNRAPSVEHQFIPSLFSCSPPHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRRRSDASVSM
        PLGFQIQHMPTEVEPS+ KDRPLLETEFHLE GMN APSVEHQFIPSLFSCSP HQNSTLDRAVSQNLSSLRSD EDPCVSLQLGDHEAKRRRSDASVS+
Subjt:  PLGFQIQHMPTEVEPSTAKDRPLLETEFHLEPGMNRAPSVEHQFIPSLFSCSPPHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRRRSDASVSM

Query:  EELK
        EELK
Subjt:  EELK

TrEMBL top hitse value%identityAlignment
A0A0A0L2V0 Uncharacterized protein0.0e+0088.4Show/hide
Query:  MPPLTYSLYLLSFSHLQCLLFQTFSFQCLCAEASAQLYFSI----------------------------------------------ELGIHNTILSFYV
        MPPLTYSLYLLSFSHLQCLLFQTF+FQCLCAEASAQLYFS+                                              ELG+HNTILSF+V
Subjt:  MPPLTYSLYLLSFSHLQCLLFQTFSFQCLCAEASAQLYFSI----------------------------------------------ELGIHNTILSFYV

Query:  SHIMEQTRHNRRINCSGSIPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKG
        SHIMEQTRHNRRINCSGS PSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKG
Subjt:  SHIMEQTRHNRRINCSGSIPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKG

Query:  QRANTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLN-------SLFYSGARWCLVRCGCCRILIFGAPNPAPKEQLYT
        QRANTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWL+       +L   G  WC V         FGAPNPAPKEQLYT
Subjt:  QRANTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLN-------SLFYSGARWCLVRCGCCRILIFGAPNPAPKEQLYT

Query:  EIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPET
        EIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPET
Subjt:  EIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPET

Query:  FVRADGAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSG
        FVRADGAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQS ISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSG
Subjt:  FVRADGAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSG

Query:  SLIILKGENRETLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEE
        SLIILKGENR+TLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEE
Subjt:  SLIILKGENRETLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEE

Query:  KSEPLGFQIQHMPTEVEPSTAKDRPLLETEFHLEPGMNRAPSVEHQFIPSLFSCSPPHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRRRSDAS
        KSE LGFQIQHMPTEVEPS AKDRPLLETEFHLEPGMNRAPSVEHQFIPSLFSCSP HQNSTLDRAVSQNLS LRSDCED CVSLQLGDHEAKRRRSDAS
Subjt:  KSEPLGFQIQHMPTEVEPSTAKDRPLLETEFHLEPGMNRAPSVEHQFIPSLFSCSPPHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRRRSDAS

Query:  VSMEELK
        VSMEELK
Subjt:  VSMEELK

A0A1S3CNV7 uncharacterized protein LOC1035030460.0e+0096.03Show/hide
Query:  MEQTRHNRRINCSGSIPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHNRRINCSGSIPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNRRINCSGSIPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA

Query:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLN-------SLFYSGARWCLVRCGCCRILIFGAPNPAPKEQLYTEIV
        NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWL+       +L   G  WC V         FGAPNPAPKEQLYTEIV
Subjt:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLN-------SLFYSGARWCLVRCGCCRILIFGAPNPAPKEQLYTEIV

Query:  DDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVR
        DDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVR
Subjt:  DDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVR

Query:  ADGAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI
        ADGAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI
Subjt:  ADGAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI

Query:  ILKGENRETLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEEKSE
        ILKGENRETLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEEKSE
Subjt:  ILKGENRETLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEEKSE

Query:  PLGFQIQHMPTEVEPSTAKDRPLLETEFHLEPGMNRAPSVEHQFIPSLFSCSPPHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRRRSDASVSM
        PLGFQIQHMPTEVEPSTAKDRPLLETEFHLEPGMNRAPSVEHQFIPSLFSCSPPHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRRRSDASVSM
Subjt:  PLGFQIQHMPTEVEPSTAKDRPLLETEFHLEPGMNRAPSVEHQFIPSLFSCSPPHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRRRSDASVSM

Query:  EELK
        EELK
Subjt:  EELK

A0A5D3CE06 Uncharacterized protein0.0e+0096.03Show/hide
Query:  MEQTRHNRRINCSGSIPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHNRRINCSGSIPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNRRINCSGSIPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA

Query:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLN-------SLFYSGARWCLVRCGCCRILIFGAPNPAPKEQLYTEIV
        NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWL+       +L   G  WC V         FGAPNPAPKEQLYTEIV
Subjt:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLN-------SLFYSGARWCLVRCGCCRILIFGAPNPAPKEQLYTEIV

Query:  DDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVR
        DDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVR
Subjt:  DDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVR

Query:  ADGAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI
        ADGAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI
Subjt:  ADGAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI

Query:  ILKGENRETLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEEKSE
        ILKGENRETLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEEKSE
Subjt:  ILKGENRETLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEEKSE

Query:  PLGFQIQHMPTEVEPSTAKDRPLLETEFHLEPGMNRAPSVEHQFIPSLFSCSPPHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRRRSDASVSM
        PLGFQIQHMPTEVEPSTAKDRPLLETEFHLEPGMNRAPSVEHQFIPSLFSCSPPHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRRRSDASVSM
Subjt:  PLGFQIQHMPTEVEPSTAKDRPLLETEFHLEPGMNRAPSVEHQFIPSLFSCSPPHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRRRSDASVSM

Query:  EELK
        EELK
Subjt:  EELK

A0A6J1C3A8 uncharacterized protein LOC111007860 isoform X20.0e+0090.89Show/hide
Query:  MEQTRHNRRINCSGSIPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHN RINCSGS PSEESALDLERNCCSHS+LPSFS PTLQPFASAGQH   N AYFSWPTPIRLSV  EERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNRRINCSGSIPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA

Query:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLN-------SLFYSGARWCLVRCGCCRILIFGAPNPAPKEQLYTEIV
         TLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWL+       +L   G  WC V         FGAPNPAPKEQLYTEIV
Subjt:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLN-------SLFYSGARWCLVRCGCCRILIFGAPNPAPKEQLYTEIV

Query:  DDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVR
        DDLRGSD CIGSGSQVASQETYGTLGAIVRSQTG RQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVR
Subjt:  DDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVR

Query:  ADGAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI
        ADGAFIPFADDF+MSTVTTSVKGVG+VGDVKFIDLQS ISTLIGK+VVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI
Subjt:  ADGAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI

Query:  ILKGENRETLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEEKSE
        ILKGENRE+LQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLK AVQEQ TVSATVIGSIVGDSSPPDTTLPKEKSEEK E
Subjt:  ILKGENRETLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEEKSE

Query:  PLGFQIQHMPTEVEPSTAKDRPLLETEFHLEPGMNRAPSVEHQFIPSLFSCSPPHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRRRSDASVSM
        PLGFQIQHMPTEVEPS+A++RPLLETEFHLE G + APSVEHQFIPSLFSCSP HQNS+LDRAVSQNLSSLRSDCED CVSLQLGDHEAKRRRSDASVSM
Subjt:  PLGFQIQHMPTEVEPSTAKDRPLLETEFHLEPGMNRAPSVEHQFIPSLFSCSPPHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRRRSDASVSM

Query:  EELK
        EELK
Subjt:  EELK

A0A6J1C6M7 uncharacterized protein LOC111007860 isoform X10.0e+0091.06Show/hide
Query:  MEQTRHNRRINCSGSIPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHN RINCSGS PSEESALDLERNCCSHS+LPSFS PTLQPFASAGQH   N AYFSWPTPIRLSV  EERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNRRINCSGSIPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA

Query:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLN-------SLFYSGARWCLVRCGCCRILIFGAPNPAPKEQLYTEIV
         TLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWL+       +L   G  WC V         FGAPNPAPKEQLYTEIV
Subjt:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLN-------SLFYSGARWCLVRCGCCRILIFGAPNPAPKEQLYTEIV

Query:  DDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVR
        DDLRGSD CIGSGSQVASQETYGTLGAIVRSQTG RQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVR
Subjt:  DDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVR

Query:  ADGAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI
        ADGAFIPFADDF+MSTVTTSVKGVG+VGDVKFIDLQS ISTLIGK+VVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI
Subjt:  ADGAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLI

Query:  ILKGENRETLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEEKSE
        ILKGENRE+LQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQ TVSATVIGSIVGDSSPPDTTLPKEKSEEK E
Subjt:  ILKGENRETLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEEKSE

Query:  PLGFQIQHMPTEVEPSTAKDRPLLETEFHLEPGMNRAPSVEHQFIPSLFSCSPPHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRRRSDASVSM
        PLGFQIQHMPTEVEPS+A++RPLLETEFHLE G + APSVEHQFIPSLFSCSP HQNS+LDRAVSQNLSSLRSDCED CVSLQLGDHEAKRRRSDASVSM
Subjt:  PLGFQIQHMPTEVEPSTAKDRPLLETEFHLEPGMNRAPSVEHQFIPSLFSCSPPHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRRRSDASVSM

Query:  EELK
        EELK
Subjt:  EELK

SwissProt top hitse value%identityAlignment
B4XT64 Protein NARROW LEAF 11.3e-18960.85Show/hide
Query:  SGSIPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLELMTIRAF
        SG   SEES+LD++     H   P   SP++QP AS   H   + AYF WPT        E RANYF NLQKG+LP     LPKGQ+AN+LL+LMTIRAF
Subjt:  SGSIPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLELMTIRAF

Query:  HSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLNS-------LFYSGARWCLVRCGCCRILIFGAPNPAPKEQLYTEIVDDLRGSDPCIGS
        HSKILR +SLGTA+GFRIRKG LTDIPAILVFV+RKVHK+WLN        L   G  WC V         +GAP   PKEQ+++E+VD L GSD CIGS
Subjt:  HSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLNS-------LFYSGARWCLVRCGCCRILIFGAPNPAPKEQLYTEIVDDLRGSDPCIGS

Query:  GSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRADGAFIPFADDF
        GSQVAS ET+GTLGAIV+ +TG +QVGFLTN HVAVDLDYPNQKMFHPLPP LGPGVYLGAVERATSFITD++WYGI+AG NPETFVRADGAFIPFADDF
Subjt:  GSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRADGAFIPFADDF

Query:  DMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENRETLQP
        D+STVTT V+GVG +GDVK IDLQ  +++LIG+QV KVGRSSG TTGTV+AYALEYNDEKGICF TD LVVGEN+QTFDLEGDSGSLIIL  ++ E  +P
Subjt:  DMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENRETLQP

Query:  IGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEEKSEPLGFQIQHMPT-
        IGIIWGGTANRGRLKL     PENWTSGVDLGRLL+ LELD+I ++E L+ AVQ+Q       + S VG+SS     +P+EK EE  EPLG QIQ +P  
Subjt:  IGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEEKSEPLGFQIQHMPT-

Query:  EVEPSTAKDRPLLETEFHLEPGMNRAPSVEHQFIPSLFSCSPPHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRRRSDASVSME
        +V  S  +      T  ++E         EHQFI +    SP   +    R+++ NL++     E+  +SL LGD E KR RSD+  S++
Subjt:  EVEPSTAKDRPLLETEFHLEPGMNRAPSVEHQFIPSLFSCSPPHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRRRSDASVSME

Arabidopsis top hitse value%identityAlignment
AT2G35155.1 Trypsin family protein5.5e-19662.56Show/hide
Query:  RRINCSGSIPSEESALDLERN-CCSHSDLPSFSSPT-LQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLE
        R I  + S  SE+SALDLERN  C+H  LPS SSP+ LQPF    QH   N  YFSWPT  RL+   E+RANYF NLQKGVLP+ +  LP GQ+A TLLE
Subjt:  RRINCSGSIPSEESALDLERN-CCSHSDLPSFSSPT-LQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLE

Query:  LMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLN-------SLFYSGARWCLVRCGCCRILIFGAPNPAPKEQLYTEIVDDLRG
        LMTIRAFHSKILR +SLGTA+GFRI +GVLT++PAILVFV+RKVH+QWLN       +L   G  WC V         +GAP   PKEQ+Y E+VD LRG
Subjt:  LMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLN-------SLFYSGARWCLVRCGCCRILIFGAPNPAPKEQLYTEIVDDLRG

Query:  SDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRADGAF
        SDPCIGSGSQVASQETYGTLGAIV+S+TG  QVGFLTNRHVAVDLDYP+QKMFHPLPP+LGPGVYLGAVERATSFITD+ WYGIFAG NPETFVRADGAF
Subjt:  SDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRADGAF

Query:  IPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGE
        IPFA+DF+ S VTT +KG+G++GDV  IDLQS I +LIGKQVVKVGRSSG TTGT++AYALEYNDEKGICFLTDFLV+GENQQTFDLEGDSGSLI+L G 
Subjt:  IPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGE

Query:  NRETLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLK--AAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEEKSEPLG
        N +  +P+GIIWGGTANRGRLKL  GQ PENWTSGVDLGRLL+LLELDLITS+  L+  AA +E+   S T + S V  SSPPD     +K +E  EP  
Subjt:  NRETLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLK--AAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEEKSEPLG

Query:  FQIQHMPTEVEPSTAKDRPLLETEFHL---EPGMNRAPS-VEHQFIPSLFSCSPPHQNSTLDRAVSQNLSSLRSDCEDPC-VSLQLGDHEAKR
             +P E     A  +P LE E H+      +N + S ++ Q IP L                  NL +L++  E+   +SL LG+ + K+
Subjt:  FQIQHMPTEVEPSTAKDRPLLETEFHL---EPGMNRAPS-VEHQFIPSLFSCSPPHQNSTLDRAVSQNLSSLRSDCEDPC-VSLQLGDHEAKR

AT3G12950.1 Trypsin family protein2.0e-19065.14Show/hide
Query:  FASAGQHFGFNTA-YFSWPTPIRLSVGTEERANYFANLQK------GVLPDILHPLPKGQRANTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDI
        + S GQH  F  A YFSWPT  RLS   EERANYF+NLQK       V P+ +   PKGQRA TLLELMTIRAFHSK+LRCYSLGTAIGFRIR+GVLTDI
Subjt:  FASAGQHFGFNTA-YFSWPTPIRLSVGTEERANYFANLQK------GVLPDILHPLPKGQRANTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDI

Query:  PAILVFVSRKVHKQWLN-------SLFYSGARWCLVRCGCCRILIFGAPN--PAPKEQLYTEIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGR
        PAI+VFVSRKVHKQWL+       +L  +G  WC V         FG P+  P PK+   T+IVD L+GSDP IGSGSQVASQET GTLGAIVRSQTGGR
Subjt:  PAILVFVSRKVHKQWLN-------SLFYSGARWCLVRCGCCRILIFGAPN--PAPKEQLYTEIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGR

Query:  QVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRADGAFIPFADDFDMSTVTTSVK-GVGQVGDVKFIDL
        QVGF+TNRHVAV+LDYP+QKMFHPLPP LGPGVYLGAVERATSFITD+LW+GIFAG NPETFVRADGAFIPFADD+D+S VTTSVK GVG++G+VK I+L
Subjt:  QVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRADGAFIPFADDFDMSTVTTSVK-GVGQVGDVKFIDL

Query:  QSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQT-FDLEGDSGSLIILKGENRETLQPIGIIWGGTANRGRLKLKVGQPP
        QS + +L+GKQVVKVGRSSGLTTGTVLAYALEYNDE+G+CFLTDFLVVGEN ++ FDLEGDSGSLI++KGE  E  +PIGIIWGGT +RGRLKLKVG+ P
Subjt:  QSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQT-FDLEGDSGSLIILKGENRETLQPIGIIWGGTANRGRLKLKVGQPP

Query:  ENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEK--SEEKSE-PLG-FQIQHMPTEVEPSTAKDRPLLETEFHL
        E+WT+GVDLGRLL  L+LDLIT+DEGLKAAVQEQ   S T + S+V DSSPP   L KEK   EEK E  LG  Q+QH+  E           +ET+   
Subjt:  ENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEK--SEEKSE-PLG-FQIQHMPTEVEPSTAKDRPLLETEFHL

Query:  EPGMNRAPSVEHQFIPSLF-SCSPPHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRRRSDAS
              APSVEHQF+P+    CS      T    +    ++   D  D CV L+LGD  AKRRR+  +
Subjt:  EPGMNRAPSVEHQFIPSLF-SCSPPHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRRRSDAS

AT5G45030.1 Trypsin family protein2.3e-19462.05Show/hide
Query:  MEQTRHNRRINCSGSIPSEES-ALDLERNCCSHSDLPSFSSPTLQPFASAGQH--FGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKG
        ME  R + R + S S  S ES ALDL++N  +H  L S SSP LQPF S  QH       AYFSWPT  RL+   E+RANYFANLQKGVLP+    LP G
Subjt:  MEQTRHNRRINCSGSIPSEES-ALDLERNCCSHSDLPSFSSPTLQPFASAGQH--FGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKG

Query:  QRANTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLN-------SLFYSGARWCLVRCGCCRILIFGAPNPAPKEQLYT
        ++A TLLELM IRAFHSK LR +SLGTAIGFRIR+GVLT+I AILVFV+RKVHKQWLN       +L   G  WC V         +GAP   PKEQ+YT
Subjt:  QRANTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLN-------SLFYSGARWCLVRCGCCRILIFGAPNPAPKEQLYT

Query:  EIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPET
        E+VDDLRGS   IGSGSQVASQETYGTLGAIV+S+TG RQVGFLTNRHVAVDLDYP+QKMFHPLPP+LGPGVYLGAVERATSFITD+LWYGIFAG NPET
Subjt:  EIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPET

Query:  FVRADGAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSG
        FVRADGAFIPFA+DF+ + VTT+VKG+G++GD+   DLQS +++LIG++VVKVGRSSGLTTGT++AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSG
Subjt:  FVRADGAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSG

Query:  SLIILKG--ENRETLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQIT-VSATVIGSIVGDSSPPDTTLPKEK
        SLI+L    E  E  +P+GIIWGGTANRGRLKLKVG+ PENWTSGVDLGR+LNLLELDLITS+EGL+AAV EQ   +    + S V +SSP    + + K
Subjt:  SLIILKG--ENRETLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQIT-VSATVIGSIVGDSSPPDTTLPKEK

Query:  SEEKSEPLGFQIQHMPTEVEPSTAKDRPLLETEFHLEPGMNRAPSV-EHQFIPSLF-SCSPPHQN-STLDRAVSQNLSSLR--SDCEDPCVSLQLGDHEA
        + E  EP+   +Q +  E       D   +  EF +E  +     + EHQFIPS   + S  HQ  +  +   S+NLSSL+  S  ++   SLQLG+ + 
Subjt:  SEEKSEPLGFQIQHMPTEVEPSTAKDRPLLETEFHLEPGMNRAPSV-EHQFIPSLF-SCSPPHQN-STLDRAVSQNLSSLR--SDCEDPCVSLQLGDHEA

Query:  -KRRRSDASVSMEE
         KR+R+D+    +E
Subjt:  -KRRRSDASVSMEE

AT5G45030.2 Trypsin family protein2.3e-19462.05Show/hide
Query:  MEQTRHNRRINCSGSIPSEES-ALDLERNCCSHSDLPSFSSPTLQPFASAGQH--FGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKG
        ME  R + R + S S  S ES ALDL++N  +H  L S SSP LQPF S  QH       AYFSWPT  RL+   E+RANYFANLQKGVLP+    LP G
Subjt:  MEQTRHNRRINCSGSIPSEES-ALDLERNCCSHSDLPSFSSPTLQPFASAGQH--FGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKG

Query:  QRANTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLN-------SLFYSGARWCLVRCGCCRILIFGAPNPAPKEQLYT
        ++A TLLELM IRAFHSK LR +SLGTAIGFRIR+GVLT+I AILVFV+RKVHKQWLN       +L   G  WC V         +GAP   PKEQ+YT
Subjt:  QRANTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLN-------SLFYSGARWCLVRCGCCRILIFGAPNPAPKEQLYT

Query:  EIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPET
        E+VDDLRGS   IGSGSQVASQETYGTLGAIV+S+TG RQVGFLTNRHVAVDLDYP+QKMFHPLPP+LGPGVYLGAVERATSFITD+LWYGIFAG NPET
Subjt:  EIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPET

Query:  FVRADGAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSG
        FVRADGAFIPFA+DF+ + VTT+VKG+G++GD+   DLQS +++LIG++VVKVGRSSGLTTGT++AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSG
Subjt:  FVRADGAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSG

Query:  SLIILKG--ENRETLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQIT-VSATVIGSIVGDSSPPDTTLPKEK
        SLI+L    E  E  +P+GIIWGGTANRGRLKLKVG+ PENWTSGVDLGR+LNLLELDLITS+EGL+AAV EQ   +    + S V +SSP    + + K
Subjt:  SLIILKG--ENRETLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQIT-VSATVIGSIVGDSSPPDTTLPKEK

Query:  SEEKSEPLGFQIQHMPTEVEPSTAKDRPLLETEFHLEPGMNRAPSV-EHQFIPSLF-SCSPPHQN-STLDRAVSQNLSSLR--SDCEDPCVSLQLGDHEA
        + E  EP+   +Q +  E       D   +  EF +E  +     + EHQFIPS   + S  HQ  +  +   S+NLSSL+  S  ++   SLQLG+ + 
Subjt:  SEEKSEPLGFQIQHMPTEVEPSTAKDRPLLETEFHLEPGMNRAPSV-EHQFIPSLF-SCSPPHQN-STLDRAVSQNLSSLR--SDCEDPCVSLQLGDHEA

Query:  -KRRRSDASVSMEE
         KR+R+D+    +E
Subjt:  -KRRRSDASVSMEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTCCACTCACTTACTCCCTTTACCTTTTATCTTTTTCCCATCTCCAATGTCTTCTCTTTCAAACCTTCTCTTTTCAATGTTTATGTGCAGAAGCCTCTGCTCAGCT
CTACTTTTCTATTGAACTAGGAATACATAATACCATCTTGTCCTTTTATGTGAGCCATATAATGGAGCAAACAAGACACAATAGGAGGATCAACTGCTCTGGTTCAATCC
CATCCGAGGAATCAGCCCTGGATCTTGAAAGGAATTGCTGCAGTCACTCCGACTTACCTTCTTTCAGCTCACCTACACTTCAGCCATTTGCATCTGCTGGACAGCATTTT
GGGTTCAATACTGCTTACTTTTCATGGCCTACCCCTATCCGGTTAAGCGTTGGTACGGAGGAAAGGGCAAATTATTTTGCAAATCTTCAGAAAGGGGTGCTACCAGATAT
CCTTCATCCGTTACCGAAAGGGCAGCGGGCAAACACATTACTCGAACTCATGACAATAAGAGCCTTCCATAGCAAGATCCTGCGTTGTTACAGTCTTGGAACGGCGATTG
GATTCCGTATTCGAAAGGGTGTGCTCACTGATATTCCTGCTATTCTTGTTTTTGTTTCCAGGAAAGTTCACAAGCAATGGCTTAATTCTTTGTTCTATTCAGGGGCCAGG
TGGTGTCTGGTGCGATGTGGATGTTGTAGAATTCTCATATTTGGTGCACCCAACCCTGCTCCTAAAGAACAGTTGTACACTGAAATTGTTGATGATCTGCGTGGCTCTGA
TCCCTGCATTGGCTCTGGTTCTCAGGTGGCCAGCCAAGAGACCTATGGAACCTTGGGCGCTATAGTAAGAAGTCAAACGGGTGGTCGACAAGTTGGTTTTCTCACAAACC
GTCATGTTGCAGTTGATTTAGATTATCCTAATCAGAAGATGTTTCACCCTCTCCCACCGACACTTGGGCCCGGGGTGTATCTTGGGGCTGTGGAGAGAGCTACTTCATTT
ATCACTGACGAGCTTTGGTATGGAATTTTTGCTGGAATAAACCCAGAGACGTTTGTCAGGGCAGATGGAGCATTTATTCCTTTTGCTGATGATTTTGACATGTCAACTGT
CACTACGTCTGTAAAAGGTGTGGGACAGGTCGGTGACGTGAAGTTTATCGACTTGCAATCATCTATCAGTACCCTCATAGGGAAGCAGGTGGTGAAAGTTGGAAGAAGTT
CTGGCTTGACTACAGGAACTGTTTTGGCCTATGCTCTCGAGTACAATGATGAGAAAGGGATATGCTTCTTAACTGATTTTCTCGTTGTAGGTGAGAATCAACAGACTTTC
GATCTCGAAGGAGATAGTGGAAGCCTCATTATTTTAAAGGGCGAGAATCGAGAGACTTTGCAACCAATTGGGATTATATGGGGTGGAACTGCTAACCGAGGCCGGCTTAA
GTTAAAAGTTGGCCAACCTCCTGAGAATTGGACGAGTGGCGTTGATCTTGGGCGTCTTCTCAATCTGCTTGAACTTGATCTAATCACAAGTGATGAAGGGCTCAAAGCGG
CAGTGCAAGAGCAAATAACTGTATCTGCAACCGTAATCGGGTCAATTGTTGGAGACTCCTCTCCTCCCGATACAACCCTGCCAAAGGAGAAGAGTGAAGAGAAGTCTGAG
CCGTTGGGTTTTCAGATCCAGCATATGCCTACAGAAGTAGAACCTTCTACAGCTAAAGATCGGCCGCTCCTGGAGACCGAGTTTCATCTTGAACCAGGAATGAACAGGGC
TCCCAGTGTCGAACATCAGTTCATTCCAAGCCTTTTCAGCTGCTCTCCCCCTCATCAAAACAGCACTTTGGATCGTGCCGTTTCCCAAAACCTATCTTCGCTTCGAAGCG
ACTGTGAAGACCCTTGTGTTTCCTTGCAATTGGGTGACCATGAAGCCAAGAGACGACGGTCGGATGCTTCTGTTTCCATGGAAGAACTGAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGCCTCCACTCACTTACTCCCTTTACCTTTTATCTTTTTCCCATCTCCAATGTCTTCTCTTTCAAACCTTCTCTTTTCAATGTTTATGTGCAGAAGCCTCTGCTCAGCT
CTACTTTTCTATTGAACTAGGAATACATAATACCATCTTGTCCTTTTATGTGAGCCATATAATGGAGCAAACAAGACACAATAGGAGGATCAACTGCTCTGGTTCAATCC
CATCCGAGGAATCAGCCCTGGATCTTGAAAGGAATTGCTGCAGTCACTCCGACTTACCTTCTTTCAGCTCACCTACACTTCAGCCATTTGCATCTGCTGGACAGCATTTT
GGGTTCAATACTGCTTACTTTTCATGGCCTACCCCTATCCGGTTAAGCGTTGGTACGGAGGAAAGGGCAAATTATTTTGCAAATCTTCAGAAAGGGGTGCTACCAGATAT
CCTTCATCCGTTACCGAAAGGGCAGCGGGCAAACACATTACTCGAACTCATGACAATAAGAGCCTTCCATAGCAAGATCCTGCGTTGTTACAGTCTTGGAACGGCGATTG
GATTCCGTATTCGAAAGGGTGTGCTCACTGATATTCCTGCTATTCTTGTTTTTGTTTCCAGGAAAGTTCACAAGCAATGGCTTAATTCTTTGTTCTATTCAGGGGCCAGG
TGGTGTCTGGTGCGATGTGGATGTTGTAGAATTCTCATATTTGGTGCACCCAACCCTGCTCCTAAAGAACAGTTGTACACTGAAATTGTTGATGATCTGCGTGGCTCTGA
TCCCTGCATTGGCTCTGGTTCTCAGGTGGCCAGCCAAGAGACCTATGGAACCTTGGGCGCTATAGTAAGAAGTCAAACGGGTGGTCGACAAGTTGGTTTTCTCACAAACC
GTCATGTTGCAGTTGATTTAGATTATCCTAATCAGAAGATGTTTCACCCTCTCCCACCGACACTTGGGCCCGGGGTGTATCTTGGGGCTGTGGAGAGAGCTACTTCATTT
ATCACTGACGAGCTTTGGTATGGAATTTTTGCTGGAATAAACCCAGAGACGTTTGTCAGGGCAGATGGAGCATTTATTCCTTTTGCTGATGATTTTGACATGTCAACTGT
CACTACGTCTGTAAAAGGTGTGGGACAGGTCGGTGACGTGAAGTTTATCGACTTGCAATCATCTATCAGTACCCTCATAGGGAAGCAGGTGGTGAAAGTTGGAAGAAGTT
CTGGCTTGACTACAGGAACTGTTTTGGCCTATGCTCTCGAGTACAATGATGAGAAAGGGATATGCTTCTTAACTGATTTTCTCGTTGTAGGTGAGAATCAACAGACTTTC
GATCTCGAAGGAGATAGTGGAAGCCTCATTATTTTAAAGGGCGAGAATCGAGAGACTTTGCAACCAATTGGGATTATATGGGGTGGAACTGCTAACCGAGGCCGGCTTAA
GTTAAAAGTTGGCCAACCTCCTGAGAATTGGACGAGTGGCGTTGATCTTGGGCGTCTTCTCAATCTGCTTGAACTTGATCTAATCACAAGTGATGAAGGGCTCAAAGCGG
CAGTGCAAGAGCAAATAACTGTATCTGCAACCGTAATCGGGTCAATTGTTGGAGACTCCTCTCCTCCCGATACAACCCTGCCAAAGGAGAAGAGTGAAGAGAAGTCTGAG
CCGTTGGGTTTTCAGATCCAGCATATGCCTACAGAAGTAGAACCTTCTACAGCTAAAGATCGGCCGCTCCTGGAGACCGAGTTTCATCTTGAACCAGGAATGAACAGGGC
TCCCAGTGTCGAACATCAGTTCATTCCAAGCCTTTTCAGCTGCTCTCCCCCTCATCAAAACAGCACTTTGGATCGTGCCGTTTCCCAAAACCTATCTTCGCTTCGAAGCG
ACTGTGAAGACCCTTGTGTTTCCTTGCAATTGGGTGACCATGAAGCCAAGAGACGACGGTCGGATGCTTCTGTTTCCATGGAAGAACTGAAATAG
Protein sequenceShow/hide protein sequence
MPPLTYSLYLLSFSHLQCLLFQTFSFQCLCAEASAQLYFSIELGIHNTILSFYVSHIMEQTRHNRRINCSGSIPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHF
GFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLNSLFYSGAR
WCLVRCGCCRILIFGAPNPAPKEQLYTEIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGGRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSF
ITDELWYGIFAGINPETFVRADGAFIPFADDFDMSTVTTSVKGVGQVGDVKFIDLQSSISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTF
DLEGDSGSLIILKGENRETLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQITVSATVIGSIVGDSSPPDTTLPKEKSEEKSE
PLGFQIQHMPTEVEPSTAKDRPLLETEFHLEPGMNRAPSVEHQFIPSLFSCSPPHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRRRSDASVSMEELK