; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi07G010830 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi07G010830
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionAspartyl protease family protein 1-like
Genome locationchr07:16210070..16217873
RNA-Seq ExpressionLsi07G010830
SyntenyLsi07G010830
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001461 - Aspartic peptidase A1 family
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR033121 - Peptidase family A1 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052941.1 aspartyl protease family protein 1-like [Cucumis melo var. makuwa]1.2e-23480.66Show/hide
Query:  SSPSTFFLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILPFYALPEEGTVDYYAAMVHTDHFVHSRRL--VEDQPPLTFFSGNQTVRINPLGF
        SS STF LTLC F SIFTFISH SH  GSF+F IHH YS AVR ILPF++ P+EGT+DYYAAMV TDHFVHSRRL  V+D PPLTF SGN+T+RI+PLGF
Subjt:  SSPSTFFLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILPFYALPEEGTVDYYAAMVHTDHFVHSRRL--VEDQPPLTFFSGNQTVRINPLGF

Query:  LYYAEVTVGTPETSYLVALDTGSDLFWLPCDCVNCITGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISLSLQSST-----
        LYYAEVTVGTP   YLVALDTGSDLFWLPCDCVNCITG NT+QGPVNFNIYSPNNSSTSKEVQCSSSLCSH +QCS PSDTCPYQV  LS  +S+     
Subjt:  LYYAEVTVGTPETSYLVALDTGSDLFWLPCDCVNCITGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISLSLQSST-----

Query:  ----------------NCTPLAMCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNLGRKHPT
                        N T    CGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSP Q+ETPFNLGR+HPT
Subjt:  ----------------NCTPLAMCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNLGRKHPT

Query:  YNISITQIGVGGHVSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLISY
        YN+SITQI VGGH+SNLDVA IFDSGTSFTYLNDP YSLFADKFDSM+EEKRYT++SDIPFENCYELSP+QTTFTYPVMNLTMKGGGHFVINHPIVL+S 
Subjt:  YNISITQIGVGGHVSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLISY

Query:  ESTHLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNINNTSQTIEKPRPTNNSSNL
        +S  LFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDE TNNLPV P+P PAAAP TT I PQANSN+NNT+QTIEKPRPTN SS L
Subjt:  ESTHLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNINNTSQTIEKPRPTNNSSNL

Query:  LTSVILTFLMSVMKSLM
         TSVILTFLM V+  L+
Subjt:  LTSVILTFLMSVMKSLM

KAG6596257.1 Aspartyl protease family protein 1, partial [Cucurbita argyrosperma subsp. sororia]1.6e-29569.07Show/hide
Query:  MSSPSTFFLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILPFYALPEEGTVDYYAAMVHTDHFVHSRRLVEDQPPLTFFSGNQTVRINPLGFL
        M+SPS F LTLCVFFS+ +F+S SS ALGSFSF IHHRYS  VR ILP   LPEEGTVDYY AMV  D  +H RRL EDQPPLTF  GN+TVR+NPLGFL
Subjt:  MSSPSTFFLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILPFYALPEEGTVDYYAAMVHTDHFVHSRRLVEDQPPLTFFSGNQTVRINPLGFL

Query:  YYAEVTVGTPETSYLVALDTGSDLFWLPCDCVNCITGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISLSLQSST------
        +YA+VTVGTP+ SYLVALDTGSDLFWLPCDCVNC+T +NTS+G   FNIYSP+NSSTSKEV CSSSLC HA QC SPSD CPY++  LS  +S+      
Subjt:  YYAEVTVGTPETSYLVALDTGSDLFWLPCDCVNCITGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISLSLQSST------

Query:  ---------------NCTPLAMCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNLGRKHPTY
                       N      CG+DQSGAFLS+AAPNGLFGLGIE+VSVPSILAN GL SNSFSLCFGP  MGRIEFGDKGSPGQSETPFN+G +HPTY
Subjt:  ---------------NCTPLAMCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNLGRKHPTY

Query:  NISITQIGVGGHVSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLISYE
        NISITQ+ VGG+VSNLD AA+FDSGTSFTYLN+P YSL ADKFDSM++EKRY  + DIPFENCYELSPNQT F YPVMNLTMKGG HF INHPIV+++ E
Subjt:  NISITQIGVGGHVSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLISYE

Query:  STHLF-CLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPST-TIINPQANSNINNTSQTIEKPRPTNNSSN
        +T  F CLAI+RSD+INIIGQNFMTGYHIVFDREKMVLGWKESNCTGYED KTNNLP+ P+ AP AAP+  T I P+ANS +NN+S+T++KPR  NNS  
Subjt:  STHLF-CLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPST-TIINPQANSNINNTSQTIEKPRPTNNSSN

Query:  LLTSVILTFLMSVMKSLMKDLFGVQESEISQSFSNHLPTSNLPSMAVARCFLPFPLETSKHPLS---ASLFTSSSSTDYSFTVAFHSDSRRPRGFKLPLT
        L +SVIL+                  S    +      +    +MA ARC LP PLET KHP S   AS  +SSSST  SFTVAF+S SRR   F LP++
Subjt:  LLTSVILTFLMSVMKSLMKDLFGVQESEISQSFSNHLPTSNLPSMAVARCFLPFPLETSKHPLS---ASLFTSSSSTDYSFTVAFHSDSRRPRGFKLPLT

Query:  TLCCKMPLRGTFLFLFLELNFNNSWTQTQLLSNLLEVKAKPQDSEATLVPGFFTEFKHLLLPITDRNPFLSEGTRQANMVAIATTAALAKNNGADITVVL
        TLCCK   R                           VKAKP DSEAT+V G FTEFKHLLLPITDRNPFLSEGTRQ    AIATTAALAKNNGADITV+L
Subjt:  TLCCKMPLRGTFLFLFLELNFNNSWTQTQLLSNLLEVKAKPQDSEATLVPGFFTEFKHLLLPITDRNPFLSEGTRQANMVAIATTAALAKNNGADITVVL

Query:  IDEKQKDSFPEHENQLSSIRWHLSEGGFQEFKLLERLGEGSKPTAIIGEVADDLNLDLVVLSMETIHSKHVDANLLAEFIPCAVMLLPL
        IDEKQK+SFPEHENQLSSIRWHLSEGGFQE+KLLERLG+GSKPTAIIGEVADDLNLDLVVLSME +HSKHVDANLLAEFIPC VMLLPL
Subjt:  IDEKQKDSFPEHENQLSSIRWHLSEGGFQEFKLLERLGEGSKPTAIIGEVADDLNLDLVVLSMETIHSKHVDANLLAEFIPCAVMLLPL

TYK11398.1 aspartyl protease family protein 1-like [Cucumis melo var. makuwa]2.2e-23380.66Show/hide
Query:  SSPSTFFLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILPFYALPEEGTVDYYAAMVHTDHFVHSRRL--VEDQPPLTFFSGNQTVRINPLGF
        SS STF LTLC F SIFTFISH SH  GSF+F IHH YS AVR ILPF++ P+EGT+DYYAAMV TDHFVHSRRL  V+D PPLTF SGN+T+RI+PLGF
Subjt:  SSPSTFFLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILPFYALPEEGTVDYYAAMVHTDHFVHSRRL--VEDQPPLTFFSGNQTVRINPLGF

Query:  LYYAEVTVGTPETSYLVALDTGSDLFWLPCDCVNCITGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISLSLQSST-----
        LYYAEVTVGTP   YLVALDTGSDLFWLPCDCVNCITG NT+QGPVNFNIYSPNNSSTSKEVQCSSSLCSH +QCS PSDTCPYQV  LS  +S+     
Subjt:  LYYAEVTVGTPETSYLVALDTGSDLFWLPCDCVNCITGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISLSLQSST-----

Query:  ----------------NCTPLAMCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNLGRKHPT
                        N T    CGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSP Q+ETPFNLGR+HPT
Subjt:  ----------------NCTPLAMCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNLGRKHPT

Query:  YNISITQIGVGGHVSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLISY
        YN+SITQI VGGH+SNLDVA IFDSGTSFTYLNDP YSLFADKFDSM+EEKRYT++SDIPFENCYELSP+QTTFTYPVMNLTMKGGGHFVINHPIVL+S 
Subjt:  YNISITQIGVGGHVSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLISY

Query:  ESTHLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNINNTSQTIEKPRPTNNSSNL
        +S  LFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDE TNNLPV P+P PAAAP TT I PQANSN+NNT+QTIEKPRPTN SS L
Subjt:  ESTHLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNINNTSQTIEKPRPTNNSSNL

Query:  LTSVILTFLMSVMKSLM
         TSVILTFLM V+  L+
Subjt:  LTSVILTFLMSVMKSLM

XP_008448518.1 PREDICTED: aspartyl protease family protein 1-like [Cucumis melo]6.5e-23380.04Show/hide
Query:  SSPSTFFLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILPFYALPEEGTVDYYAAMVHTDHFVHSRRL--VEDQPPLTFFSGNQTVRINPLGF
        SS STF LTLC F SIFTFISH SH  GSF+F IHH YS AVR ILPF++ P+EGT+DYYAAMV TDHFVHSRRL  V+D PPLTF SGN+T+RI+PLGF
Subjt:  SSPSTFFLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILPFYALPEEGTVDYYAAMVHTDHFVHSRRL--VEDQPPLTFFSGNQTVRINPLGF

Query:  LYYAEVTVGTPETSYLVALDTGSDLFWLPCDCVNCITGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISLSLQSST-----
        LYYAEVTVGTP   YLVALDTGSDLFWLPCDCVNCITG NT+QGPVNFNIYSPNNSSTSKEVQCSSSLCSH +QCS PSDTCPYQV  LS  +S+     
Subjt:  LYYAEVTVGTPETSYLVALDTGSDLFWLPCDCVNCITGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISLSLQSST-----

Query:  ----------------NCTPLAMCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNLGRKHPT
                        N T    CGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSP Q+ETPFNLGR+HPT
Subjt:  ----------------NCTPLAMCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNLGRKHPT

Query:  YNISITQIGVGGHVSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLISY
        YN+SITQI VGGH+SNLDVA IFDSGTSFTYLNDP YSLFADKFDSM+EEKRYT++SDIPFENCYELSP+QTTFTYPVMNLTMKGGGHFVINHPIVL+S 
Subjt:  YNISITQIGVGGHVSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLISY

Query:  ESTHLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNC----TGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNINNTSQTIEKPRPTNN
        +S  LFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNC    TGYEDE TNNLPV P+P PAAAP TT I PQANSN+NNT+QTIEKPRPTN 
Subjt:  ESTHLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNC----TGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNINNTSQTIEKPRPTNN

Query:  SSNLLTSVILTFLMSVMKSLM
        SS L TSVILTFLM V+  L+
Subjt:  SSNLLTSVILTFLMSVMKSLM

XP_038906112.1 aspartyl protease family protein 1 [Benincasa hispida]4.2e-24885.91Show/hide
Query:  MSSPSTFFLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILPFYALPEEGTVDYYAAMVHTDHFVHSRRLVEDQPPLTFFSGNQTVRINPLGFL
        M+SPSTFFLTLC FFSIFTFISHSSHALGSF+F IHH YS AVR ILP  ALP+EGT+DYYAAMV TDHFVHSRRLV+DQPPLTFFSGNQT+RINPLGFL
Subjt:  MSSPSTFFLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILPFYALPEEGTVDYYAAMVHTDHFVHSRRLVEDQPPLTFFSGNQTVRINPLGFL

Query:  YYAEVTVGTPETSYLVALDTGSDLFWLPCDCVNCITGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISLSLQSST------
        YYAEVTVGTPE SYLVALDTGSDLFWLPCDCVNCITGFNTSQGPVNFNIYSP+NSSTSKEVQCSSSLC+HA+QCSS SDTCPY+V  LS  +S+      
Subjt:  YYAEVTVGTPETSYLVALDTGSDLFWLPCDCVNCITGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISLSLQSST------

Query:  ---------------NCTPLAMCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNLGRKHPTY
                       N      CGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNLGRKHPTY
Subjt:  ---------------NCTPLAMCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNLGRKHPTY

Query:  NISITQIGVGGHVSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLISYE
        NISITQIGVGGHVSN+DVAAIFDSGTSFTYLNDP YS+FADKFDSMIEEKRYT+ S +PFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHP+VLIS  
Subjt:  NISITQIGVGGHVSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLISYE

Query:  STHLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNINNTSQTIEKPRPTNNSSNLL
         T LFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDE TNNLPV PTPAPAAAP +TIINPQANSNINNTSQ+IEKP+PTN+SSNLL
Subjt:  STHLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNINNTSQTIEKPRPTNNSSNLL

Query:  TSVILTFLMSV
        TSVILTFLMSV
Subjt:  TSVILTFLMSV

TrEMBL top hitse value%identityAlignment
A0A0A0L3M6 Peptidase A1 domain-containing protein3.2e-23380.62Show/hide
Query:  SSPSTFFLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILPFYALPEEGTVDYYAAMVHTDHFVHSRRL--VEDQPPLTFFSGNQTVRINPLGF
        SS STF LTLC FF IF FISH SH  GSF+F IHH YS AVR ILPF++ P+EGT+DYYAAMV TDHFVHSRRL  V+D  PLTF SGN+T+RI+PLGF
Subjt:  SSPSTFFLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILPFYALPEEGTVDYYAAMVHTDHFVHSRRL--VEDQPPLTFFSGNQTVRINPLGF

Query:  LYYAEVTVGTPETSYLVALDTGSDLFWLPCDCVNCITGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISLSLQSST-----
        LYYAEVTVGTP   YLVALDTGSDLFWLPCDCVNCITG NT+QGPVNFNIYSPNNSSTSKEVQCSSSLCSH +QCSSPSDTCPYQV  LS  +S+     
Subjt:  LYYAEVTVGTPETSYLVALDTGSDLFWLPCDCVNCITGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISLSLQSST-----

Query:  ----------------NCTPLAMCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNLGRKHPT
                        N      CGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQ+ETPFNLGR+HPT
Subjt:  ----------------NCTPLAMCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNLGRKHPT

Query:  YNISITQIGVGGHVSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLISY
        YN+SITQIGVGGH+S+LDVA IFDSGTSFTYLNDP YSLFADKF SM+EEK++T++SDIPFENCYELSPNQTTFTYP+MNLTMKGGGHFVINHPIVLIS 
Subjt:  YNISITQIGVGGHVSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLISY

Query:  ESTHLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNINNTSQTIEKPRPTNNSSNL
        ES  LFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDE TNNLPV PTP PAAAP TT I PQANSNINNT+QTIEKPRP+N SS L
Subjt:  ESTHLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNINNTSQTIEKPRPTNNSSNL

Query:  LTSVILTFLMSVMKSL
         TSVILTFL+SV+  L
Subjt:  LTSVILTFLMSVMKSL

A0A1S3BKR5 aspartyl protease family protein 1-like3.2e-23380.04Show/hide
Query:  SSPSTFFLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILPFYALPEEGTVDYYAAMVHTDHFVHSRRL--VEDQPPLTFFSGNQTVRINPLGF
        SS STF LTLC F SIFTFISH SH  GSF+F IHH YS AVR ILPF++ P+EGT+DYYAAMV TDHFVHSRRL  V+D PPLTF SGN+T+RI+PLGF
Subjt:  SSPSTFFLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILPFYALPEEGTVDYYAAMVHTDHFVHSRRL--VEDQPPLTFFSGNQTVRINPLGF

Query:  LYYAEVTVGTPETSYLVALDTGSDLFWLPCDCVNCITGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISLSLQSST-----
        LYYAEVTVGTP   YLVALDTGSDLFWLPCDCVNCITG NT+QGPVNFNIYSPNNSSTSKEVQCSSSLCSH +QCS PSDTCPYQV  LS  +S+     
Subjt:  LYYAEVTVGTPETSYLVALDTGSDLFWLPCDCVNCITGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISLSLQSST-----

Query:  ----------------NCTPLAMCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNLGRKHPT
                        N T    CGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSP Q+ETPFNLGR+HPT
Subjt:  ----------------NCTPLAMCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNLGRKHPT

Query:  YNISITQIGVGGHVSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLISY
        YN+SITQI VGGH+SNLDVA IFDSGTSFTYLNDP YSLFADKFDSM+EEKRYT++SDIPFENCYELSP+QTTFTYPVMNLTMKGGGHFVINHPIVL+S 
Subjt:  YNISITQIGVGGHVSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLISY

Query:  ESTHLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNC----TGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNINNTSQTIEKPRPTNN
        +S  LFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNC    TGYEDE TNNLPV P+P PAAAP TT I PQANSN+NNT+QTIEKPRPTN 
Subjt:  ESTHLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNC----TGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNINNTSQTIEKPRPTNN

Query:  SSNLLTSVILTFLMSVMKSLM
        SS L TSVILTFLM V+  L+
Subjt:  SSNLLTSVILTFLMSVMKSLM

A0A5A7UHC9 Aspartyl protease family protein 1-like5.8e-23580.66Show/hide
Query:  SSPSTFFLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILPFYALPEEGTVDYYAAMVHTDHFVHSRRL--VEDQPPLTFFSGNQTVRINPLGF
        SS STF LTLC F SIFTFISH SH  GSF+F IHH YS AVR ILPF++ P+EGT+DYYAAMV TDHFVHSRRL  V+D PPLTF SGN+T+RI+PLGF
Subjt:  SSPSTFFLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILPFYALPEEGTVDYYAAMVHTDHFVHSRRL--VEDQPPLTFFSGNQTVRINPLGF

Query:  LYYAEVTVGTPETSYLVALDTGSDLFWLPCDCVNCITGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISLSLQSST-----
        LYYAEVTVGTP   YLVALDTGSDLFWLPCDCVNCITG NT+QGPVNFNIYSPNNSSTSKEVQCSSSLCSH +QCS PSDTCPYQV  LS  +S+     
Subjt:  LYYAEVTVGTPETSYLVALDTGSDLFWLPCDCVNCITGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISLSLQSST-----

Query:  ----------------NCTPLAMCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNLGRKHPT
                        N T    CGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSP Q+ETPFNLGR+HPT
Subjt:  ----------------NCTPLAMCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNLGRKHPT

Query:  YNISITQIGVGGHVSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLISY
        YN+SITQI VGGH+SNLDVA IFDSGTSFTYLNDP YSLFADKFDSM+EEKRYT++SDIPFENCYELSP+QTTFTYPVMNLTMKGGGHFVINHPIVL+S 
Subjt:  YNISITQIGVGGHVSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLISY

Query:  ESTHLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNINNTSQTIEKPRPTNNSSNL
        +S  LFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDE TNNLPV P+P PAAAP TT I PQANSN+NNT+QTIEKPRPTN SS L
Subjt:  ESTHLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNINNTSQTIEKPRPTNNSSNL

Query:  LTSVILTFLMSVMKSLM
         TSVILTFLM V+  L+
Subjt:  LTSVILTFLMSVMKSLM

A0A5D3CJM4 Aspartyl protease family protein 1-like1.1e-23380.66Show/hide
Query:  SSPSTFFLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILPFYALPEEGTVDYYAAMVHTDHFVHSRRL--VEDQPPLTFFSGNQTVRINPLGF
        SS STF LTLC F SIFTFISH SH  GSF+F IHH YS AVR ILPF++ P+EGT+DYYAAMV TDHFVHSRRL  V+D PPLTF SGN+T+RI+PLGF
Subjt:  SSPSTFFLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILPFYALPEEGTVDYYAAMVHTDHFVHSRRL--VEDQPPLTFFSGNQTVRINPLGF

Query:  LYYAEVTVGTPETSYLVALDTGSDLFWLPCDCVNCITGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISLSLQSST-----
        LYYAEVTVGTP   YLVALDTGSDLFWLPCDCVNCITG NT+QGPVNFNIYSPNNSSTSKEVQCSSSLCSH +QCS PSDTCPYQV  LS  +S+     
Subjt:  LYYAEVTVGTPETSYLVALDTGSDLFWLPCDCVNCITGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISLSLQSST-----

Query:  ----------------NCTPLAMCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNLGRKHPT
                        N T    CGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSP Q+ETPFNLGR+HPT
Subjt:  ----------------NCTPLAMCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNLGRKHPT

Query:  YNISITQIGVGGHVSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLISY
        YN+SITQI VGGH+SNLDVA IFDSGTSFTYLNDP YSLFADKFDSM+EEKRYT++SDIPFENCYELSP+QTTFTYPVMNLTMKGGGHFVINHPIVL+S 
Subjt:  YNISITQIGVGGHVSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLISY

Query:  ESTHLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNINNTSQTIEKPRPTNNSSNL
        +S  LFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDE TNNLPV P+P PAAAP TT I PQANSN+NNT+QTIEKPRPTN SS L
Subjt:  ESTHLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNINNTSQTIEKPRPTNNSSNL

Query:  LTSVILTFLMSVMKSLM
         TSVILTFLM V+  L+
Subjt:  LTSVILTFLMSVMKSLM

A0A6J1FTX0 aspartyl protease family protein 1-like isoform X16.2e-20571.88Show/hide
Query:  MSSPSTFFLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILPFYALPEEGTVDYYAAMVHTDHFVHSRRLVEDQPPLTFFSGNQTVRINPLGFL
        M+SPS F LTLCVFFS+F+F+S SS ALGSFSF IHHRYS  VR ILP   LPEEGTVDYY AMV  D  +H RRL EDQPPLTF  GN+TVR+NPLGFL
Subjt:  MSSPSTFFLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILPFYALPEEGTVDYYAAMVHTDHFVHSRRLVEDQPPLTFFSGNQTVRINPLGFL

Query:  YYAEVTVGTPETSYLVALDTGSDLFWLPCDCVNCITGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISLSLQSST------
        +YA+VTVGTP+ SYLVALDTGSDLFWLPCDCVNC+T +NTS+G   FNIYSP+NSSTSKEV CSSSLC HA QC SPSD CPY+V  LS  +S+      
Subjt:  YYAEVTVGTPETSYLVALDTGSDLFWLPCDCVNCITGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISLSLQSST------

Query:  ---------------NCTPLAMCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNLGRKHPTY
                       N      CG+DQSGAFLS+AAPNGLFGLGIE+VSVPSILAN GL SNSFSLCFGP  MGRIEFGDKGSPGQSETPFN+G +HPTY
Subjt:  ---------------NCTPLAMCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNLGRKHPTY

Query:  NISITQIGVGGHVSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLISYE
        NISITQ+ VGG+VSNLD AA+FDSGTSFTYLN+P YSL ADKFDSM++EKRY  + DIPFENCYELSPNQT F YPVMNLTM+GG HF INHPIV+++ E
Subjt:  NISITQIGVGGHVSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLISYE

Query:  STHLF-CLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPST-TIINPQANSNINNTSQTIEKPRPTNNSSN
        +T  F CLAI+RSD+INIIGQNFMTGYHIVFDREKMVLGWKESNCTGYED KTNNLP+ P+ AP AAP+  T I P+ANS +NN+S+T++KPR  NNS  
Subjt:  STHLF-CLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPST-TIINPQANSNINNTSQTIEKPRPTNNSSN

Query:  LLTSVILTFLMS
        L +SVIL  LM+
Subjt:  LLTSVILTFLMS

SwissProt top hitse value%identityAlignment
Q4V3D2 Aspartic proteinase 361.4e-2327.07Show/hide
Query:  GSFSFPIHHRYSHAVRHILPFYALPEEGTVDYYAAMVHTDHFVHSRRLVEDQPPLTFFSGNQTVRINPLGFLYYAEVTVGTPETSYLVALDTGSDLFWLP
        G+F F + H+++               G     + +   D F H+R L     PL    G  + R + +G LY+ ++ +G+P   Y V +DTGSD+ W+ 
Subjt:  GSFSFPIHHRYSHAVRHILPFYALPEEGTVDYYAAMVHTDHFVHSRRLVEDQPPLTFFSGNQTVRINPLGFLYYAEVTVGTPETSYLVALDTGSDLFWLP

Query:  C-DCVNCITGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCS---HAEQCSSPSDTCPYQVI--------------SLSLQSST---NCTPLAM-----C
        C  C  C     T  G +  ++Y    SSTSK V C    CS    +E C +    C Y V+              +++L+  T      PLA      C
Subjt:  C-DCVNCITGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCS---HAEQCSSPSDTCPYQVI--------------SLSLQSST---NCTPLAM-----C

Query:  GKDQSGAF-LSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRI-EFGDKGSPGQSETPFNLGRKHPTYNISITQIGVGGH---------
        GK+QSG    + +A +G+ G G  N S+ S LA  G     FS C      G I   G+  SP    TP    + H  YN+ +  + V G          
Subjt:  GKDQSGAF-LSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRI-EFGDKGSPGQSETPFNLGRKHPTYNISITQIGVGGH---------

Query:  VSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLISYESTHLFCL-----
         +N D   I DSGT+  YL   +Y+   +K  +  + K + V        C+  + N T   +PV+NL      HF     + L  Y   +LF L     
Subjt:  VSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLISYESTHLFCL-----

Query:  -------AIARSDSINII--GQNFMTGYHIVFDREKMVLGWKESNCT
                +   D  ++I  G   ++   +V+D E  V+GW + NC+
Subjt:  -------AIARSDSINII--GQNFMTGYHIVFDREKMVLGWKESNCT

Q8VYV9 Aspartyl protease family protein 17.0e-12145.75Show/hide
Query:  SSPSTFFLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILPFYALPEEGTVDYYAAMVHTDHFVHSRRLV-EDQPPLTFFSGNQTVRINPLGFL
        SS    FL L +  +  +++       G F F  HHR+S  V  +LP   LP   +  YY  M H D  +  RRL  EDQ  +TF  GN+TVR++ LGFL
Subjt:  SSPSTFFLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILPFYALPEEGTVDYYAAMVHTDHFVHSRRLV-EDQPPLTFFSGNQTVRINPLGFL

Query:  YYAEVTVGTPETSYLVALDTGSDLFWLPCDCVNCITGFNTSQG-PVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISL-----------
        +YA VTVGTP   ++VALDTGSDLFWLPCDC NC+       G  ++ NIYSPN SSTS +V C+S+LC+  ++C+SP   CPYQ+  L           
Subjt:  YYAEVTVGTPETSYLVALDTGSDLFWLPCDCVNCITGFNTSQG-PVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISL-----------

Query:  -------SLQSSTNCTPLAM---CGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNLGRKHPT
               S   S+   P  +   CG+ Q+G F   AAPNGLFGLG+E++SVPS+LA  G+ +NSFS+CFG    GRI FGDKGS  Q ETP N+ + HPT
Subjt:  -------SLQSSTNCTPLAM---CGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNLGRKHPT

Query:  YNISITQIGVGGHVSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRY-TVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLIS
        YNI++T+I VGG+  +L+  A+FDSGTSFTYL D  Y+L ++ F+S+  +KRY T DS++PFE CY LSPN+ +F YP +NLTMKGG  + + HP+V+I 
Subjt:  YNISITQIGVGGHVSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRY-TVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLIS

Query:  YESTHLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNINNTSQTIEKPRPTNNSSN
         + T ++CLAI + + I+IIGQNFMTGY +VFDREK++LGWKES+C  Y  E +     S   + +A P  +  +P+A      T+   ++P  +  S+ 
Subjt:  YESTHLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNINNTSQTIEKPRPTNNSSN

Query:  LLTSVILT-FLMSVMKSL
           S+ L+ F  S++  L
Subjt:  LLTSVILT-FLMSVMKSL

Q9LS40 Protein ASPARTIC PROTEASE IN GUARD CELL 12.4e-2026.97Show/hide
Query:  YYAEVTVGTPETSYLVALDTGSDLFWLPCD-CVNCITGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVI-------------
        Y++ + VGTP     + LDTGSD+ W+ C+ C +C    +         +++P +SST K + CS+  CS  E  +  S+ C YQV              
Subjt:  YYAEVTVGTPETSYLVALDTGSDLFWLPCD-CVNCITGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVI-------------

Query:  -SLSLQSSTNCTPLAM-CGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGS----PGQSETPFNLGRKHPT-Y
         +++  +S     +A+ CG D  G F  +A   GL GLG   +S+ + +      + SFS C      G+    D  S     G +  P    +K  T Y
Subjt:  -SLSLQSSTNCTPLAM-CGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGS----PGQSETPFNLGRKHPT-Y

Query:  NISITQIGVGGH-------VSNLDVA----AIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFV
         + ++   VGG        + ++D +     I D GT+ T L    Y+   D F  +    +    S   F+ CY+ S + +T   P +     GG    
Subjt:  NISITQIGVGGH-------VSNLDVA----AIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFV

Query:  INHPIVLISYESTHLFCLAIA-RSDSINIIGQNFMTGYHIVFDREKMVLGWKESNC
        +     LI  + +  FC A A  S S++IIG     G  I +D  K V+G   + C
Subjt:  INHPIVLISYESTHLFCLAIA-RSDSINIIGQNFMTGYHIVFDREKMVLGWKESNC

Q9LX20 Aspartic proteinase-like protein 12.5e-5732.39Show/hide
Query:  FLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSH----AVRHILPFYALPEEGTVDYYAAMVHTDHFVHSRRLVEDQPPLTFFSGNQTVRI-NPLGFLYY
        FL  CV      F++        FS  + HR+S     +++      +LP + +++YY  +  +D       L      L    G++T+   N  G+L+Y
Subjt:  FLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSH----AVRHILPFYALPEEGTVDYYAAMVHTDHFVHSRRLVEDQPPLTFFSGNQTVRI-NPLGFLYY

Query:  AEVTVGTPETSYLVALDTGSDLFWLPCDCVNC---ITGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISLS----------
          + +GTP  S+LVALDTGS+L W+PC+CV C    + + +S    + N Y+P++SSTSK   CS  LC  A  C SP + CPY V  LS          
Subjt:  AEVTVGTPETSYLVALDTGSDLFWLPCDCVNC---ITGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISLS----------

Query:  ----------------LQSSTNCTPLAMCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNL-
                          SS     +  CGK QSG +L   AP+GL GLG   +SVPS L+ AGL+ NSFSLCF     GRI FGD G   Q  TPF   
Subjt:  ----------------LQSSTNCTPLAMCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNL-

Query:  -GRKHPTYNISITQIGVGGH-VSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVIN
           K+  Y + +    +G   +         DSG SFTYL + +Y   A + D  I       +  + +E CYE S        P + L       FVI+
Subjt:  -GRKHPTYNISITQIGVGGH-VSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVIN

Query:  HPI-VLISYESTHLFCLAIARS--DSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNINNTSQTIE
         P+ V    +    FCL I+ S  + I  IGQN+M GY +VFDRE M LGW  S C   +++K    P   +P   ++P+    + Q +   +  S  I 
Subjt:  HPI-VLISYESTHLFCLAIARS--DSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNINNTSQTIE

Query:  KPRPTNNSSNLLTSVILTFLMSVMKSLM
           P+   S+  +S   + +M +  SL+
Subjt:  KPRPTNNSSNLLTSVILTFLMSVMKSLM

Q9S9K4 Aspartic proteinase 395.4e-2025.22Show/hide
Query:  LCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILPFYALPEEGTVDYYAAMVHTDHFVHSRRLVEDQPPLTFFSGNQTVRINPLGFLYYAEVTVGTP
        LC+  ++F  +   + A  +F F   H+++   +++  F +                D   HSR L     PL    G  + R++ +G LY+ ++ +G+P
Subjt:  LCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILPFYALPEEGTVDYYAAMVHTDHFVHSRRLVEDQPPLTFFSGNQTVRINPLGFLYYAEVTVGTP

Query:  ETSYLVALDTGSDLFWLPC-DCVNCITGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSS--PSDTCPYQVI--------------SLSLQSS
           Y V +DTGSD+ W+ C  C  C T  N +      +++  N SSTSK+V C    CS   Q  S  P+  C Y ++               L+L+  
Subjt:  ETSYLVALDTGSDLFWLPC-DCVNCITGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSS--PSDTCPYQVI--------------SLSLQSS

Query:  T---NCTPLAM-----CGKDQSGAF-LSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRI-EFGDKGSPGQSETPFNLGRKHPTYNISI
        T      PL       CG DQSG      +A +G+ G G  N SV S LA  G     FS C    + G I   G   SP    TP    + H  YN+ +
Subjt:  T---NCTPLAM-----CGKDQSGAF-LSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRI-EFGDKGSPGQSETPFNLGRKHPTYNISI

Query:  TQIGVGGHVSNLDV--------AAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVL
          + V G  ++LD+          I DSGT+  Y    +Y    +   +    K + V+       C+  S N     +P ++   +      +     L
Subjt:  TQIGVGGHVSNLDV--------AAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVL

Query:  ISYESTHLFC-------LAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCT
         + E   L+C       L       + ++G   ++   +V+D +  V+GW + NC+
Subjt:  ISYESTHLFC-------LAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCT

Arabidopsis top hitse value%identityAlignment
AT2G17760.1 Eukaryotic aspartyl protease family protein5.0e-12245.75Show/hide
Query:  SSPSTFFLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILPFYALPEEGTVDYYAAMVHTDHFVHSRRLV-EDQPPLTFFSGNQTVRINPLGFL
        SS    FL L +  +  +++       G F F  HHR+S  V  +LP   LP   +  YY  M H D  +  RRL  EDQ  +TF  GN+TVR++ LGFL
Subjt:  SSPSTFFLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILPFYALPEEGTVDYYAAMVHTDHFVHSRRLV-EDQPPLTFFSGNQTVRINPLGFL

Query:  YYAEVTVGTPETSYLVALDTGSDLFWLPCDCVNCITGFNTSQG-PVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISL-----------
        +YA VTVGTP   ++VALDTGSDLFWLPCDC NC+       G  ++ NIYSPN SSTS +V C+S+LC+  ++C+SP   CPYQ+  L           
Subjt:  YYAEVTVGTPETSYLVALDTGSDLFWLPCDCVNCITGFNTSQG-PVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISL-----------

Query:  -------SLQSSTNCTPLAM---CGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNLGRKHPT
               S   S+   P  +   CG+ Q+G F   AAPNGLFGLG+E++SVPS+LA  G+ +NSFS+CFG    GRI FGDKGS  Q ETP N+ + HPT
Subjt:  -------SLQSSTNCTPLAM---CGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNLGRKHPT

Query:  YNISITQIGVGGHVSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRY-TVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLIS
        YNI++T+I VGG+  +L+  A+FDSGTSFTYL D  Y+L ++ F+S+  +KRY T DS++PFE CY LSPN+ +F YP +NLTMKGG  + + HP+V+I 
Subjt:  YNISITQIGVGGHVSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRY-TVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLIS

Query:  YESTHLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNINNTSQTIEKPRPTNNSSN
         + T ++CLAI + + I+IIGQNFMTGY +VFDREK++LGWKES+C  Y  E +     S   + +A P  +  +P+A      T+   ++P  +  S+ 
Subjt:  YESTHLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNINNTSQTIEKPRPTNNSSN

Query:  LLTSVILT-FLMSVMKSL
           S+ L+ F  S++  L
Subjt:  LLTSVILT-FLMSVMKSL

AT3G51330.1 Eukaryotic aspartyl protease family protein3.5e-9942.39Show/hide
Query:  ALGSFSFPIHHRYSHAVRHILPFYAL-PEEGTVDYYAAMVHTDHFVHSRRLV--EDQPPLTFFSGNQTVRINPLGFLYYAEVTVGTPETSYLVALDTGSD
        A G FSF +HH +S  V+  L    L PE+G+++Y+  +   D  +  R L    ++ P+TF  GN+T+ I+ LGFL+YA V+VGTP T +LVALDTGSD
Subjt:  ALGSFSFPIHHRYSHAVRHILPFYAL-PEEGTVDYYAAMVHTDHFVHSRRLV--EDQPPLTFFSGNQTVRINPLGFLYYAEVTVGTPETSYLVALDTGSD

Query:  LFWLPCDC-VNCI-----TGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISLSLQSSTNCT----------------PLAM
        LFWLPC+C   CI      G + S+     N+YSPN SSTS  ++CS   C  + +CSSP+ +CPYQ+  LS  + T  T                P+  
Subjt:  LFWLPCDC-VNCI-----TGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISLSLQSSTNCT----------------PLAM

Query:  -----CGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPA--RMGRIEFGDKGSPGQSETPFNLGRKHPTYNISITQIGVGGHVSN
             CGK+Q+G   SSAA NGL GLG+++ SVPSILA A + +NSFS+CFG     +GRI FGDKG   Q ETP       PTY +S+T++ VGG    
Subjt:  -----CGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPA--RMGRIEFGDKGSPGQSETPFNLGRKHPTYNISITQIGVGGHVSN

Query:  LDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLI-SYESTHLFCLAIARS--
        + + A+FD+GTSFT+L +P Y L    FD  + +KR  +D ++PFE CY+LSPN+TT  +P + +T +GG    + +P+ ++ + +++ ++CL I +S  
Subjt:  LDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLI-SYESTHLFCLAIARS--

Query:  DSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNINNTSQTIEKPRPTNNSSN
          INIIGQNFM+GY IVFDRE+M+LGWK S+C  +EDE   +   +P P    APS +   P  +      + T  +  P N++ N
Subjt:  DSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNINNTSQTIEKPRPTNNSSN

AT3G51340.1 Eukaryotic aspartyl protease family protein2.6e-8638.06Show/hide
Query:  FLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILPFYAL-PEEGTVDYYAAMVHTDHFVHSRRLV--EDQPPLTFFSGNQTVRINPLGFLYYAE
        F+ L +   IF  +     A G FSF +HH +S  V+  L F  L PE G+++Y+  + H D F+  R L    ++ PLT    N T+ +N LGFL+YA 
Subjt:  FLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILPFYAL-PEEGTVDYYAAMVHTDHFVHSRRLV--EDQPPLTFFSGNQTVRINPLGFLYYAE

Query:  VTVGTPETSYLVALDTGSDLFWLPCDC-VNCITGFNTSQ--GPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISLSLQSST-------
        V++GTP T +LVALDTGSDLFWLPC+C   CI     ++    V  N+Y+PN S+TS  ++CS   C  + +CSSP   CPYQ+   S   +T       
Subjt:  VTVGTPETSYLVALDTGSDLFWLPCDC-VNCITGFNTSQ--GPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISLSLQSST-------

Query:  -------------NCTPLAMCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGP--ARMGRIEFGDKGSPGQSETPFNLGRKHPTY
                     N      CG++Q+GAF +  A NG+ GL ++  SVPS+LA A + +NSFS+CFG   + +GRI FGDKG   Q ETP         Y
Subjt:  -------------NCTPLAMCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGP--ARMGRIEFGDKGSPGQSETPFNLGRKHPTY

Query:  NISITQIGVGGHVSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYEL-------------------SPNQTTFTYPVMNLT
         +++T + VGG   ++ + A+FD+G+SFT L +  Y +F   FD ++E+KR  VD D PFE CY+L                   +P +  F + + N +
Subjt:  NISITQIGVGGHVSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYEL-------------------SPNQTTFTYPVMNLT

Query:  MKGGGHFVINHPIVLISYESTHLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNIN
         +           V  S E T ++CL I +S ++NIIGQN M+G+ IVFDRE+M+LGWK+SNC  +EDE   +    P    A  PS +   P A++   
Subjt:  MKGGGHFVINHPIVLISYESTHLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNIN

Query:  NTSQTIEKPRPTNNS
         T  TI+    T NS
Subjt:  NTSQTIEKPRPTNNS

AT3G51350.1 Eukaryotic aspartyl protease family protein4.7e-9640.99Show/hide
Query:  ALGSFSFPIHHRYSHAVRHILPFYAL-PEEGTVDYYAAMVHTDHFVHSRRLV--EDQPPLTFFSGNQTVRINPLGFLYYAEVTVGTPETSYLVALDTGSD
        A G F F +HH +S +V+  L    L PE+G+++Y+  + H D  +  R L    D+ P+TF  GN TV +  LG LYYA V+VGTP +S+LVALDTGSD
Subjt:  ALGSFSFPIHHRYSHAVRHILPFYAL-PEEGTVDYYAAMVHTDHFVHSRRLV--EDQPPLTFFSGNQTVRINPLGFLYYAEVTVGTPETSYLVALDTGSD

Query:  LFWLPCDC-VNCITGFNTSQGP--VNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISLSLQSST----------------NCTPLAM---
        LFWLPC+C   CI        P  V  N+Y+PN S+TS  ++CS   C  +++CSSPS  CPYQ IS S  + T                N TP+     
Subjt:  LFWLPCDC-VNCITGFNTSQGP--VNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISLSLQSST----------------NCTPLAM---

Query:  --CGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGP--ARMGRIEFGDKGSPGQSETPFNLGRKHPTYNISITQIGVGGHVSNLDV
          CG+ Q+G F  + + NG+ GLGI+  SVPS+LA A + +NSFS+CFG     +GRI FGD+G   Q ETPF        Y ++I+ + V G   ++ +
Subjt:  --CGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGP--ARMGRIEFGDKGSPGQSETPFNLGRKHPTYNISITQIGVGGHVSNLDV

Query:  AAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLI-SYESTHLFCLAIARSD--SI
         A FD+G+SFT+L +P Y +    FD ++E++R  VD ++PFE CY+LSPN TT  +P++ +T  GG   ++N+P     + E   ++CL + +S    I
Subjt:  AAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLI-SYESTHLFCLAIARSD--SI

Query:  NIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNINNTSQTIEKPRPTNNSSN
        N+IGQNF+ GY IVFDRE+M+LGWK+S C  +EDE   +   +P P    AP+ ++  P   S     S T     P N++ N
Subjt:  NIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNINNTSQTIEKPRPTNNSSN

AT4G35880.1 Eukaryotic aspartyl protease family protein2.2e-10943.39Show/hide
Query:  FFLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILP----FYALPEEGTVDYYAAMVHTDHFVHSRRL----VEDQPPLTFFSGNQTVRINPLG
        FF T      I   +S  S     F+F +HHR+S  V+        F   P +G+ +Y+ A+V  D  +  RRL     E +  LTF  GN T RI+ LG
Subjt:  FFLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILP----FYALPEEGTVDYYAAMVHTDHFVHSRRL----VEDQPPLTFFSGNQTVRINPLG

Query:  FLYYAEVTVGTPETSYLVALDTGSDLFWLPCDCVNCI-TGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISLSLQSSTNCT
        FL+Y  V +GTP   ++VALDTGSDLFW+PCDC  C  T   T       +IY+P  S+T+K+V C++SLC+   QC     TCPY V  +S Q+ST+  
Subjt:  FLYYAEVTVGTPETSYLVALDTGSDLFWLPCDCVNCI-TGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSLCSHAEQCSSPSDTCPYQVISLSLQSSTNCT

Query:  PLA---------------------MCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNLGRKH
         +                       CG+ QSG+FL  AAPNGLFGLG+E +SVPS+LA  GL+++SFS+CFG   +GRI FGDKGS  Q ETPFNL   H
Subjt:  PLA---------------------MCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNLGRKH

Query:  PTYNISITQIGVGGHVSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLI
        P YNI++T++ VG  + + +  A+FD+GTSFTYL DP+Y+  ++ F S  ++KR++ DS IPFE CY++S +      P ++LTMKG  HF IN PI++I
Subjt:  PTYNISITQIGVGGHVSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLI

Query:  SYESTHLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNINNTSQTIEKPRPTNNSS
        S E   ++CLAI +S  +NIIGQN+MTGY +VFDREK+VL WK+ +C   E+  T     + T A A A +  I     +S ++ T+QTI K   +N+S 
Subjt:  SYESTHLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNINNTSQTIEKPRPTNNSS

Query:  NLLTSVI
        N ++  +
Subjt:  NLLTSVI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCACAGACCATAGTTACGTCAAGCTCCGGAGTGAAGAATGTCTACAGAGAGAGAAGAAGACTGAGACTGACCTTGCTTCATCTTCTTCATCTTCTTCATCTTCTTC
TTCAACCCTTTCAAATTCTGCTTCTAATTCCTCCTCCAAATCCCAAGCTATGTCTTCTCCTTCTACCTTCTTCCTAACCCTCTGCGTTTTCTTTTCCATTTTCACCTTCA
TTTCCCATTCCTCTCATGCTCTCGGATCTTTCTCCTTCCCTATCCACCACCGCTACTCCCACGCCGTCCGTCATATCCTCCCCTTCTATGCCTTACCCGAGGAAGGCACT
GTCGATTACTACGCCGCCATGGTCCATACAGACCATTTTGTTCATTCTCGTCGTCTTGTCGAAGATCAACCTCCTCTCACTTTCTTCTCCGGCAACCAAACCGTTCGAAT
TAACCCTCTTGGATTCCTGTATTATGCTGAGGTTACGGTGGGAACACCGGAGACATCGTACCTAGTGGCGTTGGACACTGGCAGTGATTTGTTCTGGTTACCATGCGACT
GTGTTAATTGTATAACTGGGTTTAATACATCCCAAGGGCCAGTAAACTTTAATATCTACAGCCCAAATAATTCATCAACTAGCAAGGAGGTCCAATGTAGTAGTTCATTG
TGTTCACACGCGGAGCAGTGCTCCTCACCAAGTGACACGTGCCCTTATCAGGTTATTAGTTTATCTCTTCAATCTTCGACAAATTGCACACCCTTAGCAATGTGTGGTAA
GGACCAGAGTGGTGCATTTTTGAGCTCTGCAGCACCAAATGGCCTATTTGGGTTAGGTATCGAGAATGTTTCAGTTCCTAGCATCTTGGCAAATGCAGGACTCATTTCAA
ATTCCTTTTCCTTATGTTTTGGACCTGCTAGAATGGGAAGAATTGAATTTGGAGATAAAGGTAGTCCGGGCCAAAGTGAAACACCATTCAACTTAGGACGAAAACATCCT
ACTTATAACATCAGCATAACTCAGATAGGGGTGGGAGGACACGTTTCCAATCTTGATGTTGCTGCAATTTTTGACTCTGGAACCTCATTTACCTACCTGAACGACCCGGT
CTATTCACTTTTTGCTGACAAATTTGATTCTATGATTGAAGAAAAGCGGTATACAGTGGATTCAGACATCCCTTTTGAAAACTGCTATGAACTGAGCCCAAATCAAACCA
CATTCACTTACCCGGTAATGAATCTGACAATGAAAGGTGGTGGACATTTTGTCATCAATCATCCAATAGTTTTGATATCCTACGAGTCGACGCATCTTTTTTGTCTTGCC
ATTGCTAGAAGTGACAGCATAAATATCATTGGACAAAACTTCATGACGGGTTATCACATAGTCTTTGACCGTGAAAAGATGGTTTTGGGATGGAAGGAATCAAACTGCAC
TGGTTATGAAGATGAAAAAACCAACAATCTTCCCGTCAGTCCGACCCCCGCTCCTGCTGCTGCCCCTAGCACAACAATCATCAACCCACAAGCCAACAGCAACATTAATA
ACACTTCTCAAACAATAGAAAAACCAAGACCTACAAATAATAGTTCAAATCTTCTAACCTCAGTCATTCTCACATTCTTAATGTCTGTGATGAAATCGTTAATGAAAGAT
CTATTTGGTGTTCAAGAATCAGAGATCTCTCAATCTTTCTCGAATCACCTTCCAACTTCCAACCTTCCATCTATGGCGGTTGCTCGCTGCTTCCTCCCTTTTCCTCTTGA
AACTTCGAAGCATCCACTCTCTGCTTCCCTTTTCACTTCTTCTTCTTCTACTGATTATTCCTTCACTGTCGCGTTTCATTCTGATTCTCGTCGGCCCCGAGGTTTTAAGC
TCCCACTCACCACTCTCTGCTGCAAAATGCCCCTTCGCGGTACGTTTCTCTTTCTCTTTCTTGAACTAAATTTTAACAACAGTTGGACACAAACACAACTCTTGAGCAAC
TTGTTAGAAGTAAAAGCCAAGCCACAAGATTCGGAAGCAACATTAGTTCCCGGCTTCTTCACCGAATTCAAACACCTGCTGCTTCCAATAACCGATCGCAATCCTTTTCT
TTCCGAGGGAACGAGACAGGCCAACATGGTTGCTATTGCTACTACTGCTGCTTTGGCAAAGAACAATGGGGCTGACATAACAGTAGTCTTGATTGACGAAAAGCAGAAAG
ATTCGTTTCCGGAGCACGAGAACCAACTCTCGAGCATTCGTTGGCATTTGTCTGAAGGTGGATTCCAAGAGTTTAAATTGTTGGAGCGATTAGGGGAAGGAAGCAAGCCA
ACAGCAATCATTGGGGAGGTGGCTGATGATCTGAACTTAGATTTGGTTGTTCTAAGCATGGAAACCATTCATTCTAAGCATGTGGATGCTAACCTACTGGCTGAGTTCAT
TCCATGCGCTGTTATGCTCTTGCCATTATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCCACAGACCATAGTTACGTCAAGCTCCGGAGTGAAGAATGTCTACAGAGAGAGAAGAAGACTGAGACTGACCTTGCTTCATCTTCTTCATCTTCTTCATCTTCTTC
TTCAACCCTTTCAAATTCTGCTTCTAATTCCTCCTCCAAATCCCAAGCTATGTCTTCTCCTTCTACCTTCTTCCTAACCCTCTGCGTTTTCTTTTCCATTTTCACCTTCA
TTTCCCATTCCTCTCATGCTCTCGGATCTTTCTCCTTCCCTATCCACCACCGCTACTCCCACGCCGTCCGTCATATCCTCCCCTTCTATGCCTTACCCGAGGAAGGCACT
GTCGATTACTACGCCGCCATGGTCCATACAGACCATTTTGTTCATTCTCGTCGTCTTGTCGAAGATCAACCTCCTCTCACTTTCTTCTCCGGCAACCAAACCGTTCGAAT
TAACCCTCTTGGATTCCTGTATTATGCTGAGGTTACGGTGGGAACACCGGAGACATCGTACCTAGTGGCGTTGGACACTGGCAGTGATTTGTTCTGGTTACCATGCGACT
GTGTTAATTGTATAACTGGGTTTAATACATCCCAAGGGCCAGTAAACTTTAATATCTACAGCCCAAATAATTCATCAACTAGCAAGGAGGTCCAATGTAGTAGTTCATTG
TGTTCACACGCGGAGCAGTGCTCCTCACCAAGTGACACGTGCCCTTATCAGGTTATTAGTTTATCTCTTCAATCTTCGACAAATTGCACACCCTTAGCAATGTGTGGTAA
GGACCAGAGTGGTGCATTTTTGAGCTCTGCAGCACCAAATGGCCTATTTGGGTTAGGTATCGAGAATGTTTCAGTTCCTAGCATCTTGGCAAATGCAGGACTCATTTCAA
ATTCCTTTTCCTTATGTTTTGGACCTGCTAGAATGGGAAGAATTGAATTTGGAGATAAAGGTAGTCCGGGCCAAAGTGAAACACCATTCAACTTAGGACGAAAACATCCT
ACTTATAACATCAGCATAACTCAGATAGGGGTGGGAGGACACGTTTCCAATCTTGATGTTGCTGCAATTTTTGACTCTGGAACCTCATTTACCTACCTGAACGACCCGGT
CTATTCACTTTTTGCTGACAAATTTGATTCTATGATTGAAGAAAAGCGGTATACAGTGGATTCAGACATCCCTTTTGAAAACTGCTATGAACTGAGCCCAAATCAAACCA
CATTCACTTACCCGGTAATGAATCTGACAATGAAAGGTGGTGGACATTTTGTCATCAATCATCCAATAGTTTTGATATCCTACGAGTCGACGCATCTTTTTTGTCTTGCC
ATTGCTAGAAGTGACAGCATAAATATCATTGGACAAAACTTCATGACGGGTTATCACATAGTCTTTGACCGTGAAAAGATGGTTTTGGGATGGAAGGAATCAAACTGCAC
TGGTTATGAAGATGAAAAAACCAACAATCTTCCCGTCAGTCCGACCCCCGCTCCTGCTGCTGCCCCTAGCACAACAATCATCAACCCACAAGCCAACAGCAACATTAATA
ACACTTCTCAAACAATAGAAAAACCAAGACCTACAAATAATAGTTCAAATCTTCTAACCTCAGTCATTCTCACATTCTTAATGTCTGTGATGAAATCGTTAATGAAAGAT
CTATTTGGTGTTCAAGAATCAGAGATCTCTCAATCTTTCTCGAATCACCTTCCAACTTCCAACCTTCCATCTATGGCGGTTGCTCGCTGCTTCCTCCCTTTTCCTCTTGA
AACTTCGAAGCATCCACTCTCTGCTTCCCTTTTCACTTCTTCTTCTTCTACTGATTATTCCTTCACTGTCGCGTTTCATTCTGATTCTCGTCGGCCCCGAGGTTTTAAGC
TCCCACTCACCACTCTCTGCTGCAAAATGCCCCTTCGCGGTACGTTTCTCTTTCTCTTTCTTGAACTAAATTTTAACAACAGTTGGACACAAACACAACTCTTGAGCAAC
TTGTTAGAAGTAAAAGCCAAGCCACAAGATTCGGAAGCAACATTAGTTCCCGGCTTCTTCACCGAATTCAAACACCTGCTGCTTCCAATAACCGATCGCAATCCTTTTCT
TTCCGAGGGAACGAGACAGGCCAACATGGTTGCTATTGCTACTACTGCTGCTTTGGCAAAGAACAATGGGGCTGACATAACAGTAGTCTTGATTGACGAAAAGCAGAAAG
ATTCGTTTCCGGAGCACGAGAACCAACTCTCGAGCATTCGTTGGCATTTGTCTGAAGGTGGATTCCAAGAGTTTAAATTGTTGGAGCGATTAGGGGAAGGAAGCAAGCCA
ACAGCAATCATTGGGGAGGTGGCTGATGATCTGAACTTAGATTTGGTTGTTCTAAGCATGGAAACCATTCATTCTAAGCATGTGGATGCTAACCTACTGGCTGAGTTCAT
TCCATGCGCTGTTATGCTCTTGCCATTATGATTTTTGTACACAATATTACAGTTTTATGCAGTCATTATCATATCTATGTTTTCTGTAACAATGTACTGGGTGTTCTTGT
GATATATGTATATATAAAGAGCTTTTAGAGTTTAATGGGACCCTTTTAATATTATCTAGGAAAAATACCCAATTAATCCTCCAAATTTGGGATAGGTTGTATTTAAATCT
TTTAATATTCAAACAACAATTTTACTCTCAAATGTTATTTGACAATTTGTAAGGTGGAGAATCCAACCTCTACCTTTGAGTTTGATAGTA
Protein sequenceShow/hide protein sequence
MSTDHSYVKLRSEECLQREKKTETDLASSSSSSSSSSSTLSNSASNSSSKSQAMSSPSTFFLTLCVFFSIFTFISHSSHALGSFSFPIHHRYSHAVRHILPFYALPEEGT
VDYYAAMVHTDHFVHSRRLVEDQPPLTFFSGNQTVRINPLGFLYYAEVTVGTPETSYLVALDTGSDLFWLPCDCVNCITGFNTSQGPVNFNIYSPNNSSTSKEVQCSSSL
CSHAEQCSSPSDTCPYQVISLSLQSSTNCTPLAMCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGDKGSPGQSETPFNLGRKHP
TYNISITQIGVGGHVSNLDVAAIFDSGTSFTYLNDPVYSLFADKFDSMIEEKRYTVDSDIPFENCYELSPNQTTFTYPVMNLTMKGGGHFVINHPIVLISYESTHLFCLA
IARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCTGYEDEKTNNLPVSPTPAPAAAPSTTIINPQANSNINNTSQTIEKPRPTNNSSNLLTSVILTFLMSVMKSLMKD
LFGVQESEISQSFSNHLPTSNLPSMAVARCFLPFPLETSKHPLSASLFTSSSSTDYSFTVAFHSDSRRPRGFKLPLTTLCCKMPLRGTFLFLFLELNFNNSWTQTQLLSN
LLEVKAKPQDSEATLVPGFFTEFKHLLLPITDRNPFLSEGTRQANMVAIATTAALAKNNGADITVVLIDEKQKDSFPEHENQLSSIRWHLSEGGFQEFKLLERLGEGSKP
TAIIGEVADDLNLDLVVLSMETIHSKHVDANLLAEFIPCAVMLLPL