CuGenDBv2

Clc11G17690 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc11G17690
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Peptidase S1, PA clan
Genome location: ClcChr11:28411564..28418316
RNA-Seq Expression: Clc11G17690
Synteny: Clc11G17690
Gene Ontology terms: GO:0043231 - intracellular membrane-bounded organelle (cellular component)
InterPro domains: IPR009003 - Peptidase S1, PA clan
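
The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, the string can be split into its parts and the locus span computed as follows. The helper name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this page does not state explicitly.

```python
import re


def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end).

    Hypothetical helper for illustration; not part of any CuGenDB tool.
    """
    m = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


chrom, start, end = parse_location("ClcChr11:28411564..28418316")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 6,753 bp, which is longer than the CDS listed under Sequences.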


Homology
GenBank top hits (e value, % identity, alignment)
XP_004144638.1 protein NARROW LEAF 1 [Cucumis sativus] (e value: 0.0e+00; % identity: 97.67)
Query:  MEQTRHNRRINCSGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHNRRINCSGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNRRINCSGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA

Query:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
        NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
Subjt:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD

Query:  LRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
        LRGSDPCIGSGSQVASQETYGTLGAIVRSQTG RQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Subjt:  LRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD

Query:  GAFIPFADDFDMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
        GAFIPFADDFDMSTVTTSVKGVG+VGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
Subjt:  GAFIPFADDFDMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL

Query:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQTTVSATIIGSIVGDSSPPDTTLPKEKSEEKSEPL
        KGENR++LQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQ TVSAT+IGSIVGDSSPPDTTLPKEKSEEKSE L
Subjt:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQTTVSATIIGSIVGDSSPPDTTLPKEKSEEKSEPL

Query:  GFQIQHMPIEVEPSSAKDRPLLETEFHLEPGMNMAPSVEHQFIPSFFSCSPSHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRQRSDASVSMEE
        GFQIQHMP EVEP SAKDRPLLETEFHLEPGMN APSVEHQFIPS FSCSPSHQNSTLDRAVSQNLS LRSDCED CVSLQLGDHEAKR+RSDASVSMEE
Subjt:  GFQIQHMPIEVEPSSAKDRPLLETEFHLEPGMNMAPSVEHQFIPSFFSCSPSHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRQRSDASVSMEE

Query:  LK
        LK
Subjt:  LK

XP_008465434.1 PREDICTED: uncharacterized protein LOC103503046 [Cucumis melo] (e value: 0.0e+00; % identity: 97.84)
Query:  MEQTRHNRRINCSGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHNRRINCSGS PSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNRRINCSGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA

Query:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
        NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
Subjt:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD

Query:  LRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
        LRGSDPCIGSGSQVASQETYGTLGAIVRSQTG RQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Subjt:  LRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD

Query:  GAFIPFADDFDMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
        GAFIPFADDFDMSTVTTSVKGVG+VGDVKFIDLQS ISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
Subjt:  GAFIPFADDFDMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL

Query:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQTTVSATIIGSIVGDSSPPDTTLPKEKSEEKSEPL
        KGENRE+LQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQ TVSAT+IGSIVGDSSPPDTTLPKEKSEEKSEPL
Subjt:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQTTVSATIIGSIVGDSSPPDTTLPKEKSEEKSEPL

Query:  GFQIQHMPIEVEPSSAKDRPLLETEFHLEPGMNMAPSVEHQFIPSFFSCSPSHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRQRSDASVSMEE
        GFQIQHMP EVEPS+AKDRPLLETEFHLEPGMN APSVEHQFIPS FSCSP HQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKR+RSDASVSMEE
Subjt:  GFQIQHMPIEVEPSSAKDRPLLETEFHLEPGMNMAPSVEHQFIPSFFSCSPSHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRQRSDASVSMEE

Query:  LK
        LK
Subjt:  LK

XP_022136083.1 uncharacterized protein LOC111007860 isoform X1 [Momordica charantia] (e value: 0.0e+00; % identity: 95.68)
Query:  MEQTRHNRRINCSGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHN RINCSGSTPSEESALDLERNCCSHS+LPSFS PTLQPFASAGQH   N AYFSWPTPIRLSV  EERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNRRINCSGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA

Query:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
         TLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
Subjt:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD

Query:  LRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
        LRGSD CIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Subjt:  LRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD

Query:  GAFIPFADDFDMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
        GAFIPFADDF+MSTVTTSVKGVGEVGDVKFIDLQSPISTLIGK+VVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
Subjt:  GAFIPFADDFDMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL

Query:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQTTVSATIIGSIVGDSSPPDTTLPKEKSEEKSEPL
        KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQ TVSAT+IGSIVGDSSPPDTTLPKEKSEEK EPL
Subjt:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQTTVSATIIGSIVGDSSPPDTTLPKEKSEEKSEPL

Query:  GFQIQHMPIEVEPSSAKDRPLLETEFHLEPGMNMAPSVEHQFIPSFFSCSPSHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRQRSDASVSMEE
        GFQIQHMP EVEPSSA++RPLLETEFHLE G +MAPSVEHQFIPS FSCSPSHQNS+LDRAVSQNLSSLRSDCED CVSLQLGDHEAKR+RSDASVSMEE
Subjt:  GFQIQHMPIEVEPSSAKDRPLLETEFHLEPGMNMAPSVEHQFIPSFFSCSPSHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRQRSDASVSMEE

Query:  LK
        LK
Subjt:  LK

XP_022136089.1 uncharacterized protein LOC111007860 isoform X2 [Momordica charantia] (e value: 0.0e+00; % identity: 95.51)
Query:  MEQTRHNRRINCSGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHN RINCSGSTPSEESALDLERNCCSHS+LPSFS PTLQPFASAGQH   N AYFSWPTPIRLSV  EERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNRRINCSGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA

Query:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
         TLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
Subjt:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD

Query:  LRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
        LRGSD CIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Subjt:  LRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD

Query:  GAFIPFADDFDMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
        GAFIPFADDF+MSTVTTSVKGVGEVGDVKFIDLQSPISTLIGK+VVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
Subjt:  GAFIPFADDFDMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL

Query:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQTTVSATIIGSIVGDSSPPDTTLPKEKSEEKSEPL
        KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLK AVQEQ TVSAT+IGSIVGDSSPPDTTLPKEKSEEK EPL
Subjt:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQTTVSATIIGSIVGDSSPPDTTLPKEKSEEKSEPL

Query:  GFQIQHMPIEVEPSSAKDRPLLETEFHLEPGMNMAPSVEHQFIPSFFSCSPSHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRQRSDASVSMEE
        GFQIQHMP EVEPSSA++RPLLETEFHLE G +MAPSVEHQFIPS FSCSPSHQNS+LDRAVSQNLSSLRSDCED CVSLQLGDHEAKR+RSDASVSMEE
Subjt:  GFQIQHMPIEVEPSSAKDRPLLETEFHLEPGMNMAPSVEHQFIPSFFSCSPSHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRQRSDASVSMEE

Query:  LK
        LK
Subjt:  LK

XP_038898393.1 protein NARROW LEAF 1 [Benincasa hispida] (e value: 0.0e+00; % identity: 98.5)
Query:  MEQTRHNRRINCSGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHNRRINCSGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNRRINCSGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA

Query:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
        NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
Subjt:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD

Query:  LRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
        LRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Subjt:  LRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD

Query:  GAFIPFADDFDMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
        GAFIPFADDFDMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
Subjt:  GAFIPFADDFDMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL

Query:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQTTVSATIIGSIVGDSSPPDTTLPKEKSEEKSEPL
        KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQ TVSAT+IGSIVGDSSPPDTTLPKEKSEEKSEPL
Subjt:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQTTVSATIIGSIVGDSSPPDTTLPKEKSEEKSEPL

Query:  GFQIQHMPIEVEPSSAKDRPLLETEFHLEPGMNMAPSVEHQFIPSFFSCSPSHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRQRSDASVSMEE
        GFQIQHMP EVEPSS KDRPLLETEFHLE GMNMAPSVEHQFIPS FSCSPSHQNSTLDRAVSQNLSSLRSD EDPCVSLQLGDHEAKR+RSDASVS+EE
Subjt:  GFQIQHMPIEVEPSSAKDRPLLETEFHLEPGMNMAPSVEHQFIPSFFSCSPSHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRQRSDASVSMEE

Query:  LK
        LK
Subjt:  LK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L2V0 Uncharacterized protein (e value: 0.0e+00; % identity: 97.69)
Query:  HVSHIMEQTRHNRRINCSGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLP
        HVSHIMEQTRHNRRINCSGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLP
Subjt:  HVSHIMEQTRHNRRINCSGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLP

Query:  KGQRANTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYT
        KGQRANTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYT
Subjt:  KGQRANTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYT

Query:  EIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPET
        EIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTG RQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPET
Subjt:  EIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPET

Query:  FVRADGAFIPFADDFDMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSG
        FVRADGAFIPFADDFDMSTVTTSVKGVG+VGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSG
Subjt:  FVRADGAFIPFADDFDMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSG

Query:  SLIILKGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQTTVSATIIGSIVGDSSPPDTTLPKEKSEE
        SLIILKGENR++LQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQ TVSAT+IGSIVGDSSPPDTTLPKEKSEE
Subjt:  SLIILKGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQTTVSATIIGSIVGDSSPPDTTLPKEKSEE

Query:  KSEPLGFQIQHMPIEVEPSSAKDRPLLETEFHLEPGMNMAPSVEHQFIPSFFSCSPSHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRQRSDAS
        KSE LGFQIQHMP EVEP SAKDRPLLETEFHLEPGMN APSVEHQFIPS FSCSPSHQNSTLDRAVSQNLS LRSDCED CVSLQLGDHEAKR+RSDAS
Subjt:  KSEPLGFQIQHMPIEVEPSSAKDRPLLETEFHLEPGMNMAPSVEHQFIPSFFSCSPSHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRQRSDAS

Query:  VSMEELK
        VSMEELK
Subjt:  VSMEELK

A0A1S3CNV7 uncharacterized protein LOC103503046 (e value: 0.0e+00; % identity: 97.84)
Query:  MEQTRHNRRINCSGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHNRRINCSGS PSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNRRINCSGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA

Query:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
        NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
Subjt:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD

Query:  LRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
        LRGSDPCIGSGSQVASQETYGTLGAIVRSQTG RQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Subjt:  LRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD

Query:  GAFIPFADDFDMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
        GAFIPFADDFDMSTVTTSVKGVG+VGDVKFIDLQS ISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
Subjt:  GAFIPFADDFDMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL

Query:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQTTVSATIIGSIVGDSSPPDTTLPKEKSEEKSEPL
        KGENRE+LQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQ TVSAT+IGSIVGDSSPPDTTLPKEKSEEKSEPL
Subjt:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQTTVSATIIGSIVGDSSPPDTTLPKEKSEEKSEPL

Query:  GFQIQHMPIEVEPSSAKDRPLLETEFHLEPGMNMAPSVEHQFIPSFFSCSPSHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRQRSDASVSMEE
        GFQIQHMP EVEPS+AKDRPLLETEFHLEPGMN APSVEHQFIPS FSCSP HQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKR+RSDASVSMEE
Subjt:  GFQIQHMPIEVEPSSAKDRPLLETEFHLEPGMNMAPSVEHQFIPSFFSCSPSHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRQRSDASVSMEE

Query:  LK
        LK
Subjt:  LK

A0A5D3CE06 Uncharacterized protein (e value: 0.0e+00; % identity: 97.84)
Query:  MEQTRHNRRINCSGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHNRRINCSGS PSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNRRINCSGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA

Query:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
        NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
Subjt:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD

Query:  LRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
        LRGSDPCIGSGSQVASQETYGTLGAIVRSQTG RQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Subjt:  LRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD

Query:  GAFIPFADDFDMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
        GAFIPFADDFDMSTVTTSVKGVG+VGDVKFIDLQS ISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
Subjt:  GAFIPFADDFDMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL

Query:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQTTVSATIIGSIVGDSSPPDTTLPKEKSEEKSEPL
        KGENRE+LQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQ TVSAT+IGSIVGDSSPPDTTLPKEKSEEKSEPL
Subjt:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQTTVSATIIGSIVGDSSPPDTTLPKEKSEEKSEPL

Query:  GFQIQHMPIEVEPSSAKDRPLLETEFHLEPGMNMAPSVEHQFIPSFFSCSPSHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRQRSDASVSMEE
        GFQIQHMP EVEPS+AKDRPLLETEFHLEPGMN APSVEHQFIPS FSCSP HQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKR+RSDASVSMEE
Subjt:  GFQIQHMPIEVEPSSAKDRPLLETEFHLEPGMNMAPSVEHQFIPSFFSCSPSHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRQRSDASVSMEE

Query:  LK
        LK
Subjt:  LK

A0A6J1C3A8 uncharacterized protein LOC111007860 isoform X2 (e value: 0.0e+00; % identity: 95.51)
Query:  MEQTRHNRRINCSGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHN RINCSGSTPSEESALDLERNCCSHS+LPSFS PTLQPFASAGQH   N AYFSWPTPIRLSV  EERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNRRINCSGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA

Query:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
         TLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
Subjt:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD

Query:  LRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
        LRGSD CIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Subjt:  LRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD

Query:  GAFIPFADDFDMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
        GAFIPFADDF+MSTVTTSVKGVGEVGDVKFIDLQSPISTLIGK+VVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
Subjt:  GAFIPFADDFDMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL

Query:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQTTVSATIIGSIVGDSSPPDTTLPKEKSEEKSEPL
        KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLK AVQEQ TVSAT+IGSIVGDSSPPDTTLPKEKSEEK EPL
Subjt:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQTTVSATIIGSIVGDSSPPDTTLPKEKSEEKSEPL

Query:  GFQIQHMPIEVEPSSAKDRPLLETEFHLEPGMNMAPSVEHQFIPSFFSCSPSHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRQRSDASVSMEE
        GFQIQHMP EVEPSSA++RPLLETEFHLE G +MAPSVEHQFIPS FSCSPSHQNS+LDRAVSQNLSSLRSDCED CVSLQLGDHEAKR+RSDASVSMEE
Subjt:  GFQIQHMPIEVEPSSAKDRPLLETEFHLEPGMNMAPSVEHQFIPSFFSCSPSHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRQRSDASVSMEE

Query:  LK
        LK
Subjt:  LK

A0A6J1C6M7 uncharacterized protein LOC111007860 isoform X1 (e value: 0.0e+00; % identity: 95.68)
Query:  MEQTRHNRRINCSGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA
        MEQTRHN RINCSGSTPSEESALDLERNCCSHS+LPSFS PTLQPFASAGQH   N AYFSWPTPIRLSV  EERANYFANLQKGVLPDILHPLPKGQRA
Subjt:  MEQTRHNRRINCSGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRA

Query:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
         TLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD
Subjt:  NTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDD

Query:  LRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
        LRGSD CIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD
Subjt:  LRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRAD

Query:  GAFIPFADDFDMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
        GAFIPFADDF+MSTVTTSVKGVGEVGDVKFIDLQSPISTLIGK+VVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL
Subjt:  GAFIPFADDFDMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIIL

Query:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQTTVSATIIGSIVGDSSPPDTTLPKEKSEEKSEPL
        KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQ TVSAT+IGSIVGDSSPPDTTLPKEKSEEK EPL
Subjt:  KGENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQTTVSATIIGSIVGDSSPPDTTLPKEKSEEKSEPL

Query:  GFQIQHMPIEVEPSSAKDRPLLETEFHLEPGMNMAPSVEHQFIPSFFSCSPSHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRQRSDASVSMEE
        GFQIQHMP EVEPSSA++RPLLETEFHLE G +MAPSVEHQFIPS FSCSPSHQNS+LDRAVSQNLSSLRSDCED CVSLQLGDHEAKR+RSDASVSMEE
Subjt:  GFQIQHMPIEVEPSSAKDRPLLETEFHLEPGMNMAPSVEHQFIPSFFSCSPSHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRQRSDASVSMEE

Query:  LK
        LK
Subjt:  LK

SwissProt top hits (e value, % identity, alignment)
B4XT64 Protein NARROW LEAF 1 (e value: 8.8e-209; % identity: 64.29)
Query:  SGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLELMTIRAF
        SG   SEES+LD++     H   P   SP++QP AS   H   + AYF WPT        E RANYF NLQKG+LP     LPKGQ+AN+LL+LMTIRAF
Subjt:  SGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLELMTIRAF

Query:  HSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDDLRGSDPCIGSGS
        HSKILR +SLGTA+GFRIRKG LTDIPAILVFV+RKVHK+WL+P QCLP  LEGPGGVWCDVDVVEFSY+GAP   PKEQ+++E+VD L GSD CIGSGS
Subjt:  HSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDDLRGSDPCIGSGS

Query:  QVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRADGAFIPFADDFDM
        QVAS ET+GTLGAIV+ +TG++QVGFLTN HVAVDLDYPNQKMFHPLPP LGPGVYLGAVERATSFITD++WYGI+AG NPETFVRADGAFIPFADDFD+
Subjt:  QVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRADGAFIPFADDFDM

Query:  STVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENRESLQPIG
        STVTT V+GVG++GDVK IDLQ P+++LIG+QV KVGRSSG TTGTV+AYALEYNDEKGICF TD LVVGEN+QTFDLEGDSGSLIIL  ++ E  +PIG
Subjt:  STVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENRESLQPIG

Query:  IIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQTTVSATIIGSIVGDSSPPDTTLPKEKSEEKSEPLGFQIQHMP-IEV
        IIWGGTANRGRLKL     PENWTSGVDLGRLL+ LELD+I ++E L+ AVQ+Q       + S VG+SS     +P+EK EE  EPLG QIQ +P  +V
Subjt:  IIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQTTVSATIIGSIVGDSSPPDTTLPKEKSEEKSEPLGFQIQHMP-IEV

Query:  EPSSAKDRPLLETEFHLEPGMNMAPSVEHQFIPSFFSCSPSHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRQRSDASVSME
          S  +      T  ++E         EHQFI +F   SP   +    R+++ NL++     E+  +SL LGD E KR RSD+  S++
Subjt:  EPSSAKDRPLLETEFHLEPGMNMAPSVEHQFIPSFFSCSPSHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRQRSDASVSME

Arabidopsis top hits (e value, % identity, alignment)
AT2G35155.1 Trypsin family protein (e value: 1.1e-214; % identity: 65.25)
Query:  RRINCSGSTPSEESALDLERN-CCSHSDLPSFSSPT-LQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLE
        R I  + S+ SE+SALDLERN  C+H  LPS SSP+ LQPF    QH   N  YFSWPT  RL+   E+RANYF NLQKGVLP+ +  LP GQ+A TLLE
Subjt:  RRINCSGSTPSEESALDLERN-CCSHSDLPSFSSPT-LQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKGQRANTLLE

Query:  LMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDDLRGSD
        LMTIRAFHSKILR +SLGTA+GFRI +GVLT++PAILVFV+RKVH+QWL+P+QCLP+ALEGPGGVWCDVDVVEF Y+GAP   PKEQ+Y E+VD LRGSD
Subjt:  LMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDDLRGSD

Query:  PCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRADGAFIP
        PCIGSGSQVASQETYGTLGAIV+S+TG+ QVGFLTNRHVAVDLDYP+QKMFHPLPP+LGPGVYLGAVERATSFITD+ WYGIFAG NPETFVRADGAFIP
Subjt:  PCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRADGAFIP

Query:  FADDFDMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENR
        FA+DF+ S VTT +KG+GE+GDV  IDLQSPI +LIGKQVVKVGRSSG TTGT++AYALEYNDEKGICFLTDFLV+GENQQTFDLEGDSGSLI+L G N 
Subjt:  FADDFDMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENR

Query:  ESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLK--AAVQEQTTVSATIIGSIVGDSSPPDTTLPKEKSEEKSEPLGFQ
        +  +P+GIIWGGTANRGRLKL  GQ PENWTSGVDLGRLL+LLELDLITS+  L+  AA +E+   S T + S V  SSPPD     +K +E  E     
Subjt:  ESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLK--AAVQEQTTVSATIIGSIVGDSSPPDTTLPKEKSEEKSEPLGFQ

Query:  IQHMPIEVEPSSAKDRPLLETEFHLEPGMNMAPSV-EHQFIPSFFSCSPSHQNSTLDRAVSQNLSSLRSDCEDPC-VSLQLGDHEAKRQR
                        P +  EFH+E  +     V EH FI        +      +     NL +L++  E+   +SL LG+ + K+ +
Subjt:  IQHMPIEVEPSSAKDRPLLETEFHLEPGMNMAPSV-EHQFIPSFFSCSPSHQNSTLDRAVSQNLSSLRSDCEDPC-VSLQLGDHEAKRQR

AT3G12950.1 Trypsin family protein (e value: 2.8e-210; % identity: 69.08)
Query:  FASAGQHFGFNTA-YFSWPTPIRLSVGTEERANYFANLQK------GVLPDILHPLPKGQRANTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDI
        + S GQH  F  A YFSWPT  RLS   EERANYF+NLQK       V P+ +   PKGQRA TLLELMTIRAFHSK+LRCYSLGTAIGFRIR+GVLTDI
Subjt:  FASAGQHFGFNTA-YFSWPTPIRLSVGTEERANYFANLQK------GVLPDILHPLPKGQRANTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDI

Query:  PAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPN--PAPKEQLYTEIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQV
        PAI+VFVSRKVHKQWLSP+QCLPTALEG GG+WCDVDVVEFSYFG P+  P PK+   T+IVD L+GSDP IGSGSQVASQET GTLGAIVRSQTG RQV
Subjt:  PAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPN--PAPKEQLYTEIVDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQV

Query:  GFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRADGAFIPFADDFDMSTVTTSVK-GVGEVGDVKFIDLQS
        GF+TNRHVAV+LDYP+QKMFHPLPP LGPGVYLGAVERATSFITD+LW+GIFAG NPETFVRADGAFIPFADD+D+S VTTSVK GVGE+G+VK I+LQS
Subjt:  GFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRADGAFIPFADDFDMSTVTTSVK-GVGEVGDVKFIDLQS

Query:  PISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQT-FDLEGDSGSLIILKGENRESLQPIGIIWGGTANRGRLKLKVGQPPEN
        P+ +L+GKQVVKVGRSSGLTTGTVLAYALEYNDE+G+CFLTDFLVVGEN ++ FDLEGDSGSLI++KGE  E  +PIGIIWGGT +RGRLKLKVG+ PE+
Subjt:  PISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQT-FDLEGDSGSLIILKGENRESLQPIGIIWGGTANRGRLKLKVGQPPEN

Query:  WTSGVDLGRLLNLLELDLITSDEGLKAAVQEQTTVSATIIGSIVGDSSPPDTTLPKEK--SEEKSE-PLG-FQIQHMPIEVEPSSAKDRPLLETEFHLEP
        WT+GVDLGRLL  L+LDLIT+DEGLKAAVQEQ   S T + S+V DSSPP   L KEK   EEK E  LG  Q+QH+ +E           +ET+     
Subjt:  WTSGVDLGRLLNLLELDLITSDEGLKAAVQEQTTVSATIIGSIVGDSSPPDTTLPKEK--SEEKSE-PLG-FQIQHMPIEVEPSSAKDRPLLETEFHLEP

Query:  GMNMAPSVEHQFIPSFF-SCSPSHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRQRSDAS
            APSVEHQF+P+F   CS S    T    +    ++   D  D CV L+LGD  AKR+R+  +
Subjt:  GMNMAPSVEHQFIPSFF-SCSPSHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRQRSDAS

AT5G45030.1 Trypsin family protein (e value: 1.1e-214; % identity: 65.85)
Query:  MEQTRHNRRINCSGSTPSEES-ALDLERNCCSHSDLPSFSSPTLQPFASAGQH--FGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKG
        ME  R + R + S S+ S ES ALDL++N  +H  L S SSP LQPF S  QH       AYFSWPT  RL+   E+RANYFANLQKGVLP+    LP G
Subjt:  MEQTRHNRRINCSGSTPSEES-ALDLERNCCSHSDLPSFSSPTLQPFASAGQH--FGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKG

Query:  QRANTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEI
        ++A TLLELM IRAFHSK LR +SLGTAIGFRIR+GVLT+I AILVFV+RKVHKQWL+P+QCLPTALEGPGGVWCDVDVVEF Y+GAP   PKEQ+YTE+
Subjt:  QRANTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEI

Query:  VDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFV
        VDDLRGS   IGSGSQVASQETYGTLGAIV+S+TG RQVGFLTNRHVAVDLDYP+QKMFHPLPP+LGPGVYLGAVERATSFITD+LWYGIFAG NPETFV
Subjt:  VDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFV

Query:  RADGAFIPFADDFDMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSL
        RADGAFIPFA+DF+ + VTT+VKG+GE+GD+   DLQSP+++LIG++VVKVGRSSGLTTGT++AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSL
Subjt:  RADGAFIPFADDFDMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSL

Query:  IILKG--ENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQTT-VSATIIGSIVGDSSPPDTTLPKEKSE
        I+L    E  E  +P+GIIWGGTANRGRLKLKVG+ PENWTSGVDLGR+LNLLELDLITS+EGL+AAV EQ   +    + S V +SSP    + + K+ 
Subjt:  IILKG--ENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQTT-VSATIIGSIVGDSSPPDTTLPKEKSE

Query:  EKSEPLGFQIQHMPIEVEPSSAKDRPLLETEFHLEPGM-NMAPSVEHQFIPSFF-SCSPSHQN-STLDRAVSQNLSSLR--SDCEDPCVSLQLGDHEA-K
        E  EP+   +Q + IE + S+      +  EF +E  + ++A   EHQFIPS   + S  HQ  +  +   S+NLSSL+  S  ++   SLQLG+ +  K
Subjt:  EKSEPLGFQIQHMPIEVEPSSAKDRPLLETEFHLEPGM-NMAPSVEHQFIPSFF-SCSPSHQN-STLDRAVSQNLSSLR--SDCEDPCVSLQLGDHEA-K

Query:  RQRSDASVSMEE
        R+R+D+    +E
Subjt:  RQRSDASVSMEE

AT5G45030.2 Trypsin family protein (e value: 1.1e-214; % identity: 65.85)
Query:  MEQTRHNRRINCSGSTPSEES-ALDLERNCCSHSDLPSFSSPTLQPFASAGQH--FGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKG
        ME  R + R + S S+ S ES ALDL++N  +H  L S SSP LQPF S  QH       AYFSWPT  RL+   E+RANYFANLQKGVLP+    LP G
Subjt:  MEQTRHNRRINCSGSTPSEES-ALDLERNCCSHSDLPSFSSPTLQPFASAGQH--FGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKG

Query:  QRANTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEI
        ++A TLLELM IRAFHSK LR +SLGTAIGFRIR+GVLT+I AILVFV+RKVHKQWL+P+QCLPTALEGPGGVWCDVDVVEF Y+GAP   PKEQ+YTE+
Subjt:  QRANTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEI

Query:  VDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFV
        VDDLRGS   IGSGSQVASQETYGTLGAIV+S+TG RQVGFLTNRHVAVDLDYP+QKMFHPLPP+LGPGVYLGAVERATSFITD+LWYGIFAG NPETFV
Subjt:  VDDLRGSDPCIGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFV

Query:  RADGAFIPFADDFDMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSL
        RADGAFIPFA+DF+ + VTT+VKG+GE+GD+   DLQSP+++LIG++VVKVGRSSGLTTGT++AYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSL
Subjt:  RADGAFIPFADDFDMSTVTTSVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSL

Query:  IILKG--ENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQTT-VSATIIGSIVGDSSPPDTTLPKEKSE
        I+L    E  E  +P+GIIWGGTANRGRLKLKVG+ PENWTSGVDLGR+LNLLELDLITS+EGL+AAV EQ   +    + S V +SSP    + + K+ 
Subjt:  IILKG--ENRESLQPIGIIWGGTANRGRLKLKVGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQTT-VSATIIGSIVGDSSPPDTTLPKEKSE

Query:  EKSEPLGFQIQHMPIEVEPSSAKDRPLLETEFHLEPGM-NMAPSVEHQFIPSFF-SCSPSHQN-STLDRAVSQNLSSLR--SDCEDPCVSLQLGDHEA-K
        E  EP+   +Q + IE + S+      +  EF +E  + ++A   EHQFIPS   + S  HQ  +  +   S+NLSSL+  S  ++   SLQLG+ +  K
Subjt:  EKSEPLGFQIQHMPIEVEPSSAKDRPLLETEFHLEPGM-NMAPSVEHQFIPSFF-SCSPSHQN-STLDRAVSQNLSSLR--SDCEDPCVSLQLGDHEA-K

Query:  RQRSDASVSMEE
        R+R+D+    +E
Subjt:  RQRSDASVSMEE


Sequences
CDS sequence
ATGGAAGAATGCAAGCACTGTGGACACGTGAGCCATATCATGGAGCAGACAAGACACAATAGGAGGATCAACTGCTCTGGTTCAACTCCATCCGAGGAATCAGCGCTGGA
TCTTGAAAGAAACTGCTGCAGTCACTCTGATTTACCTTCTTTCAGTTCACCTACACTTCAGCCATTTGCATCTGCTGGACAGCATTTTGGGTTCAATACTGCTTACTTTT
CATGGCCCACCCCTATCCGGTTAAGTGTTGGTACGGAGGAAAGGGCAAATTATTTTGCAAATCTTCAGAAAGGGGTGCTACCAGATATCCTTCATCCGTTACCAAAAGGG
CAGCGGGCAAACACATTACTCGAGCTCATGACAATAAGAGCCTTCCATAGCAAGATCCTGCGTTGTTACAGTCTTGGAACGGCAATTGGATTCCGTATTCGAAAGGGTGT
GCTGACTGATATTCCTGCTATTCTTGTTTTTGTTTCCAGGAAAGTTCACAAGCAATGGCTTAGTCCTATCCAATGCCTGCCCACTGCCCTGGAGGGGCCAGGTGGTGTGT
GGTGCGACGTGGATGTTGTAGAATTCTCATATTTTGGTGCACCCAACCCTGCTCCTAAAGAACAGTTGTACACTGAGATTGTCGATGATCTGCGTGGCTCTGATCCATGC
ATTGGCTCTGGTTCTCAGGTGGCCAGCCAGGAGACCTATGGAACCTTGGGCGCTATAGTAAGAAGTCAAACGGGCAGTCGACAAGTTGGTTTTCTCACAAACCGTCATGT
TGCGGTCGATTTAGATTATCCAAATCAGAAAATGTTTCACCCTCTTCCACCGACACTTGGGCCAGGGGTGTATCTTGGTGCTGTGGAGAGAGCTACTTCATTCATCACTG
ATGAGCTTTGGTACGGAATTTTTGCTGGAATAAACCCAGAGACATTCGTCAGGGCAGATGGGGCATTTATTCCTTTTGCCGATGATTTTGACATGTCAACTGTCACTACA
TCTGTAAAAGGTGTGGGAGAGGTCGGTGACGTGAAGTTTATTGACTTGCAGTCGCCTATCAGTACCCTCATAGGGAAGCAGGTGGTGAAAGTTGGAAGAAGTTCTGGCTT
GACTACAGGAACTGTTTTGGCCTATGCTCTCGAGTACAATGATGAGAAAGGAATATGCTTTTTAACTGATTTTCTCGTTGTAGGTGAGAATCAACAAACTTTCGATCTTG
AAGGAGATAGTGGAAGCCTCATTATTCTAAAGGGTGAGAATCGAGAGAGTTTGCAACCAATTGGGATCATATGGGGTGGAACTGCTAACCGAGGTCGGCTTAAGTTGAAA
GTCGGCCAACCTCCTGAGAATTGGACGAGTGGTGTTGATCTTGGGCGCCTTCTCAATCTGCTTGAACTTGATCTAATCACAAGTGATGAAGGGCTCAAAGCGGCAGTACA
AGAGCAAACAACTGTATCAGCAACCATTATCGGGTCAATTGTTGGAGACTCCTCTCCTCCCGATACAACACTGCCAAAGGAGAAGAGTGAAGAGAAGTCTGAGCCATTGG
GTTTTCAGATCCAGCATATGCCTATAGAAGTAGAGCCTTCTTCAGCTAAAGACCGGCCGCTCCTGGAGACCGAGTTTCATCTTGAACCCGGGATGAACATGGCTCCCAGC
GTCGAACATCAGTTCATTCCAAGCTTTTTCAGTTGCTCTCCCTCCCATCAAAACAGCACTCTGGACCGTGCCGTTTCCCAAAACCTATCTTCGCTCCGAAGCGACTGTGA
AGACCCTTGCGTTTCCTTGCAACTGGGTGACCATGAAGCCAAGAGACAACGCTCGGATGCTTCTGTTTCCATGGAAGAACTGAAATAG
mRNA sequence
ATGGAAGAATGCAAGCACTGTGGACACGTGAGCCATATCATGGAGCAGACAAGACACAATAGGAGGATCAACTGCTCTGGTTCAACTCCATCCGAGGAATCAGCGCTGGA
TCTTGAAAGAAACTGCTGCAGTCACTCTGATTTACCTTCTTTCAGTTCACCTACACTTCAGCCATTTGCATCTGCTGGACAGCATTTTGGGTTCAATACTGCTTACTTTT
CATGGCCCACCCCTATCCGGTTAAGTGTTGGTACGGAGGAAAGGGCAAATTATTTTGCAAATCTTCAGAAAGGGGTGCTACCAGATATCCTTCATCCGTTACCAAAAGGG
CAGCGGGCAAACACATTACTCGAGCTCATGACAATAAGAGCCTTCCATAGCAAGATCCTGCGTTGTTACAGTCTTGGAACGGCAATTGGATTCCGTATTCGAAAGGGTGT
GCTGACTGATATTCCTGCTATTCTTGTTTTTGTTTCCAGGAAAGTTCACAAGCAATGGCTTAGTCCTATCCAATGCCTGCCCACTGCCCTGGAGGGGCCAGGTGGTGTGT
GGTGCGACGTGGATGTTGTAGAATTCTCATATTTTGGTGCACCCAACCCTGCTCCTAAAGAACAGTTGTACACTGAGATTGTCGATGATCTGCGTGGCTCTGATCCATGC
ATTGGCTCTGGTTCTCAGGTGGCCAGCCAGGAGACCTATGGAACCTTGGGCGCTATAGTAAGAAGTCAAACGGGCAGTCGACAAGTTGGTTTTCTCACAAACCGTCATGT
TGCGGTCGATTTAGATTATCCAAATCAGAAAATGTTTCACCCTCTTCCACCGACACTTGGGCCAGGGGTGTATCTTGGTGCTGTGGAGAGAGCTACTTCATTCATCACTG
ATGAGCTTTGGTACGGAATTTTTGCTGGAATAAACCCAGAGACATTCGTCAGGGCAGATGGGGCATTTATTCCTTTTGCCGATGATTTTGACATGTCAACTGTCACTACA
TCTGTAAAAGGTGTGGGAGAGGTCGGTGACGTGAAGTTTATTGACTTGCAGTCGCCTATCAGTACCCTCATAGGGAAGCAGGTGGTGAAAGTTGGAAGAAGTTCTGGCTT
GACTACAGGAACTGTTTTGGCCTATGCTCTCGAGTACAATGATGAGAAAGGAATATGCTTTTTAACTGATTTTCTCGTTGTAGGTGAGAATCAACAAACTTTCGATCTTG
AAGGAGATAGTGGAAGCCTCATTATTCTAAAGGGTGAGAATCGAGAGAGTTTGCAACCAATTGGGATCATATGGGGTGGAACTGCTAACCGAGGTCGGCTTAAGTTGAAA
GTCGGCCAACCTCCTGAGAATTGGACGAGTGGTGTTGATCTTGGGCGCCTTCTCAATCTGCTTGAACTTGATCTAATCACAAGTGATGAAGGGCTCAAAGCGGCAGTACA
AGAGCAAACAACTGTATCAGCAACCATTATCGGGTCAATTGTTGGAGACTCCTCTCCTCCCGATACAACACTGCCAAAGGAGAAGAGTGAAGAGAAGTCTGAGCCATTGG
GTTTTCAGATCCAGCATATGCCTATAGAAGTAGAGCCTTCTTCAGCTAAAGACCGGCCGCTCCTGGAGACCGAGTTTCATCTTGAACCCGGGATGAACATGGCTCCCAGC
GTCGAACATCAGTTCATTCCAAGCTTTTTCAGTTGCTCTCCCTCCCATCAAAACAGCACTCTGGACCGTGCCGTTTCCCAAAACCTATCTTCGCTCCGAAGCGACTGTGA
AGACCCTTGCGTTTCCTTGCAACTGGGTGACCATGAAGCCAAGAGACAACGCTCGGATGCTTCTGTTTCCATGGAAGAACTGAAATAG
Protein sequence
MEECKHCGHVSHIMEQTRHNRRINCSGSTPSEESALDLERNCCSHSDLPSFSSPTLQPFASAGQHFGFNTAYFSWPTPIRLSVGTEERANYFANLQKGVLPDILHPLPKG
QRANTLLELMTIRAFHSKILRCYSLGTAIGFRIRKGVLTDIPAILVFVSRKVHKQWLSPIQCLPTALEGPGGVWCDVDVVEFSYFGAPNPAPKEQLYTEIVDDLRGSDPC
IGSGSQVASQETYGTLGAIVRSQTGSRQVGFLTNRHVAVDLDYPNQKMFHPLPPTLGPGVYLGAVERATSFITDELWYGIFAGINPETFVRADGAFIPFADDFDMSTVTT
SVKGVGEVGDVKFIDLQSPISTLIGKQVVKVGRSSGLTTGTVLAYALEYNDEKGICFLTDFLVVGENQQTFDLEGDSGSLIILKGENRESLQPIGIIWGGTANRGRLKLK
VGQPPENWTSGVDLGRLLNLLELDLITSDEGLKAAVQEQTTVSATIIGSIVGDSSPPDTTLPKEKSEEKSEPLGFQIQHMPIEVEPSSAKDRPLLETEFHLEPGMNMAPS
VEHQFIPSFFSCSPSHQNSTLDRAVSQNLSSLRSDCEDPCVSLQLGDHEAKRQRSDASVSMEELK