CuGenDBv2

HG10012859 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10012859
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Cysteine proteinase
Genome location: Chr01:24824950..24826426
RNA-Seq Expression: HG10012859
Synteny: HG10012859
Gene Ontology terms:
    GO:0006508 - proteolysis (biological process)
    GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domains:
    IPR000668 - Peptidase C1A, papain C-terminal
    IPR025661 - Cysteine peptidase, asparagine active site
    IPR038765 - Papain-like cysteine peptidase superfamily
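
The genome location above uses a `chromosome:start..end` pattern. As a minimal sketch, the following Python snippet splits such a string and reports the span length. It assumes the coordinates are 1-based and inclusive, which the record does not state; `Locus` and `parse_locus` are hypothetical helper names, not part of any CuGenDB tooling.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval (assumed 1-based, inclusive on both ends)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count as bases.
        return self.end - self.start + 1


def parse_locus(text: str) -> Locus:
    """Parse a 'chrom:start..end' string into a Locus."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Locus(chrom, start, end)


locus = parse_locus("Chr01:24824950..24826426")
print(locus.chrom, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption the gene spans 1477 bp on Chr01.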


Homology
GenBank top hits: e value, % identity, alignment
KAA0038986.1 senescence-specific cysteine protease SAG39-like [Cucumis melo var. makuwa] (e value: 1.6e-22; identity: 39.64%)
Query:  VDWRTEGAVTSVKLQKKNRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGY-AKIVYKHAIRFGIQYKAD--------------RFGKR
        +DWRT GAVT V+ Q  + C VYAAV A+EGIY+I T +L   S D++I+    H  + GGY A  + ++ I +G   K                 F  +
Subjt:  VDWRTEGAVTSVKLQKKNRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGY-AKIVYKHAIRFGIQYKAD--------------RFGKR

Query:  IKEEPKP-----------------YRGPFGEQMDHEMLAVGYTPNHIILKNSWGTDWGDDGYMKITRKA
        ++ +P                   Y GPFG + DHE++ VGYTP++ ILKNSWG DWG+DGYMK++R A
Subjt:  IKEEPKP-----------------YRGPFGEQMDHEMLAVGYTPNHIILKNSWGTDWGDDGYMKITRKA

KAF2542396.1 hypothetical protein F2Q68_00029910 [Brassica cretica] (e value: 2.6e-20; identity: 35.61%)
Query:  PHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNP-RHGDEGGGYAKIVYKHAIRFGI--------QYKADR-
        P  ++P+ VDWR EGAVT+VK Q   + CW +++V A+EGI KI+TGEL  LS  +++  N   +G +G GY  I +K  I   I         YKA + 
Subjt:  PHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNP-RHGDEGGGYAKIVYKHAIRFGI--------QYKADR-

Query:  ----------------------------FGKRIKEEPKPYRGPFGEQMDHEMLAVGY----TPNHIILKNSWGTDWGDDGYMKITRK---AARIGGEFRE
                                      K +  +P  Y GP G Q+DH ++ VGY      ++ I+KNSWGT WG+ G+ K+ R     A I G    
Subjt:  ----------------------------FGKRIKEEPKPYRGPFGEQMDHEMLAVGY----TPNHIILKNSWGTDWGDDGYMKITRK---AARIGGEFRE

Query:  ASYPI
        ASYPI
Subjt:  ASYPI

KAF3528335.1 hypothetical protein DY000_02038025 [Brassica cretica] (e value: 2.6e-20; identity: 35.61%)
Query:  PHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNP-RHGDEGGGYAKIVYKHAIRFGI--------QYKADR-
        P  ++P+ VDWR EGAVT+VK Q   + CW +++V A+EGI KI+TGEL  LS  +++  N   +G +G GY  I +K  I   I         YKA + 
Subjt:  PHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNP-RHGDEGGGYAKIVYKHAIRFGI--------QYKADR-

Query:  ----------------------------FGKRIKEEPKPYRGPFGEQMDHEMLAVGY----TPNHIILKNSWGTDWGDDGYMKITRK---AARIGGEFRE
                                      K +  +P  Y GP G Q+DH ++ VGY      ++ I+KNSWGT WG+ G+ K+ R     A I G    
Subjt:  ----------------------------FGKRIKEEPKPYRGPFGEQMDHEMLAVGY----TPNHIILKNSWGTDWGDDGYMKITRK---AARIGGEFRE

Query:  ASYPI
        ASYPI
Subjt:  ASYPI

KAG6572278.1 Cysteine protease XCP2, partial [Cucurbita argyrosperma subsp. sororia] (e value: 5.7e-28; identity: 36.95%)
Query:  VPKYVDWRTEGAVTSVKLQ-KKNRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEG-GGYAKIVYKHAIRFGIQYKADRFGKRIKEEPKP--
        +PKYV+WR EGAVTSVK Q     CW+Y  + AIE  YKI  G+L  LS +++I     H + G GG   +V+++A++  I  ++D +GKR+ +  +   
Subjt:  VPKYVDWRTEGAVTSVKLQ-KKNRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEG-GGYAKIVYKHAIRFGIQYKADRFGKRIKEEPKP--

Query:  ------------------------------------------YRGPFGEQMDHEMLAVGYTPNHIILKNSWGTDWGDDGYMKITRKAARIGGEFRE-ASY
                                                  Y GPFG  +DH +LAVGYTP++II+KN WGT WGD GYM ++RKA +  G F   A+Y
Subjt:  ------------------------------------------YRGPFGEQMDHEMLAVGYTPNHIILKNSWGTDWGDDGYMKITRKAARIGGEFRE-ASY

Query:  PII
        P++
Subjt:  PII

KMZ74540.1 Cysteine proteinase [Zostera marina] (e value: 1.5e-20; identity: 38.17%)
Query:  YRNLLGRAFSTV-----HGRAFSTAHPHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQ-QNPRHGDEGGGYAK
        + NL    F  +     H       +   + P  VDWR+EGAVTSVK Q+K   CW + AVGA+EGI KI+T  L  LS  E+I   N  +G +GG   K
Subjt:  YRNLLGRAFSTV-----HGRAFSTAHPHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQ-QNPRHGDEGGGYAK

Query:  IVYKHAIRFGIQYKADRFGKRIKEEPKPYRGPFGEQ-------MDHEMLAVGYTPN----HIILKNSWGTDWGDDGYMKITRKAAR
                 GI    D   K  K+E    +G + E+       +DH  L VGY  +    + ILKN+WGTDWG++GYM+I R + R
Subjt:  IVYKHAIRFGIQYKADRFGKRIKEEPKPYRGPFGEQ-------MDHEMLAVGYTPN----HIILKNSWGTDWGDDGYMKITRKAAR

TrEMBL top hits: e value, % identity, alignment
A0A0D3A5L6 Uncharacterized protein (e value: 9.5e-21; identity: 35.61%)
Query:  PHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNP-RHGDEGGGYAKIVYKHAIRFGI--------QYKADR-
        P  ++P+ VDWR EGAVT+VK Q   + CW +++V A+EGI KI+TGEL  LS  +++  N   +G +G GY  I +K  I   I         YKA + 
Subjt:  PHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNP-RHGDEGGGYAKIVYKHAIRFGI--------QYKADR-

Query:  ----------------------------FGKRIKEEPKPYRGPFGEQMDHEMLAVGY----TPNHIILKNSWGTDWGDDGYMKITRK---AARIGGEFRE
                                      K +  +P  Y GP G Q+DH ++ VGY      ++ I+KNSWGT WG+ G+ K+ R     A I G    
Subjt:  ----------------------------FGKRIKEEPKPYRGPFGEQMDHEMLAVGY----TPNHIILKNSWGTDWGDDGYMKITRK---AARIGGEFRE

Query:  ASYPI
        ASYPI
Subjt:  ASYPI

A0A0K9Q259 Cysteine proteinase (e value: 7.3e-21; identity: 38.17%)
Query:  YRNLLGRAFSTV-----HGRAFSTAHPHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQ-QNPRHGDEGGGYAK
        + NL    F  +     H       +   + P  VDWR+EGAVTSVK Q+K   CW + AVGA+EGI KI+T  L  LS  E+I   N  +G +GG   K
Subjt:  YRNLLGRAFSTV-----HGRAFSTAHPHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQ-QNPRHGDEGGGYAK

Query:  IVYKHAIRFGIQYKADRFGKRIKEEPKPYRGPFGEQ-------MDHEMLAVGYTPN----HIILKNSWGTDWGDDGYMKITRKAAR
                 GI    D   K  K+E    +G + E+       +DH  L VGY  +    + ILKN+WGTDWG++GYM+I R + R
Subjt:  IVYKHAIRFGIQYKADRFGKRIKEEPKPYRGPFGEQ-------MDHEMLAVGYTPN----HIILKNSWGTDWGDDGYMKITRKAAR

A0A1D6HI75 Cysteine protease XCP1 (e value: 8.9e-19; identity: 35.82%)
Query:  SFAGYRNLLGRAFS--TVHGRAFS-TAHPHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPR--HGDEGG-
        S+ G +  L RA +  T    AF   A     +P  VDWR +GAVT VK Q K   CW +++V A+EGI +I+TG+L  LS  E++  +    HG EGG 
Subjt:  SFAGYRNLLGRAFS--TVHGRAFS-TAHPHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPR--HGDEGG-

Query:  ---GYAKIVYKHAIRFGIQYKADRFGKRIKEEPKPYRGPFGEQMDHEMLAVG----YTPNHIILKNSWGTDWGDDGYMKI---TRKAARIGGEFREASYP
            +A ++    I     Y         KE+   + G    ++DH + AVG    Y  N+I +KNSWG +WG+ GY++I   T K   + G +  ASYP
Subjt:  ---GYAKIVYKHAIRFGIQYKADRFGKRIKEEPKPYRGPFGEQMDHEMLAVG----YTPNHIILKNSWGTDWGDDGYMKI---TRKAARIGGEFREASYP

Query:  I
        +
Subjt:  I

A0A5A7TAC2 Senescence-specific cysteine protease SAG39-like (e value: 7.8e-23; identity: 39.64%)
Query:  VDWRTEGAVTSVKLQKKNRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGY-AKIVYKHAIRFGIQYKAD--------------RFGKR
        +DWRT GAVT V+ Q  + C VYAAV A+EGIY+I T +L   S D++I+    H  + GGY A  + ++ I +G   K                 F  +
Subjt:  VDWRTEGAVTSVKLQKKNRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGY-AKIVYKHAIRFGIQYKAD--------------RFGKR

Query:  IKEEPKP-----------------YRGPFGEQMDHEMLAVGYTPNHIILKNSWGTDWGDDGYMKITRKA
        ++ +P                   Y GPFG + DHE++ VGYTP++ ILKNSWG DWG+DGYMK++R A
Subjt:  IKEEPKP-----------------YRGPFGEQMDHEMLAVGYTPNHIILKNSWGTDWGDDGYMKITRKA

M4DB45 Uncharacterized protein (e value: 2.1e-20; identity: 35.61%)
Query:  PHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNP-RHGDEGGGYAKIVYKHAIRFGI--------QYKADR-
        P  ++P+ VDWR EGAVT+VK Q   + CW +++V A+EGI KI+TGEL  LS  +++  N   +G +G GY  I +K  I   I         YKA + 
Subjt:  PHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNP-RHGDEGGGYAKIVYKHAIRFGI--------QYKADR-

Query:  ----------------------------FGKRIKEEPKPYRGPFGEQMDHEMLAVGY----TPNHIILKNSWGTDWGDDGYMKITRK---AARIGGEFRE
                                      K +  +P  Y GP G Q+DH ++ VGY      ++ I+KNSWGT WG+ G+ K+ R     A I G    
Subjt:  ----------------------------FGKRIKEEPKPYRGPFGEQMDHEMLAVGY----TPNHIILKNSWGTDWGDDGYMKITRK---AARIGGEFRE

Query:  ASYPI
        ASYPI
Subjt:  ASYPI

SwissProt top hits: e value, % identity, alignment
P00784 Papain (e value: 1.7e-22; identity: 31.43%)
Query:  VPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGYAKIVYKHAIRFGIQYK-------ADRFGKRIKE
        +P+YVDWR +GAVT VK Q     CW ++AV  IEGI KI TG L   S  E++  + R     GGY     +   ++GI Y+         R+ +  ++
Subjt:  VPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGYAKIVYKHAIRFGIQYK-------ADRFGKRIKE

Query:  EP-----------KPYR---------------------------------GPFGEQMDHEMLAVGYTPNHIILKNSWGTDWGDDGYMKITRKAAR---IG
         P           +PY                                  GP G ++DH + AVGY PN+I++KNSWGT WG++GY++I R       + 
Subjt:  EP-----------KPYR---------------------------------GPFGEQMDHEMLAVGYTPNHIILKNSWGTDWGDDGYMKITRKAAR---IG

Query:  GEFREASYPI
        G +  + YP+
Subjt:  GEFREASYPI

P10056 Caricain (e value: 3.3e-18; identity: 29.58%)
Query:  VPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGYAKIVYKHAIRFGIQ------YKADRFGKRIKEE
        +P+ VDWR +GAVT V+ Q     CW ++AV  +EGI KI TG+L  LS  E++    R     GGY     ++  + GI       YKA +   R K+ 
Subjt:  VPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGYAKIVYKHAIRFGIQ------YKADRFGKRIKEE

Query:  PKP---------------------------------------------YRGPFGEQMDHEMLAVGYTPN----HIILKNSWGTDWGDDGYMKITR---KA
          P                                             + GP G ++DH + AVGY  +    +I++KNSWGT WG+ GY++I R    +
Subjt:  PKP---------------------------------------------YRGPFGEQMDHEMLAVGYTPN----HIILKNSWGTDWGDDGYMKITR---KA

Query:  ARIGGEFREASYP
          + G ++ + YP
Subjt:  ARIGGEFREASYP

P14080 Chymopapain (e value: 2.5e-18; identity: 27.83%)
Query:  PKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGYAKIVYKHAIRFGIQ------YKADRFGKRIKEEP
        P+ +DWR +GAVT VK Q     CW ++ +  +EGI KI+TG L  LS  E++  +       GGY     ++    G+       Y+A ++  R  ++P
Subjt:  PKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGYAKIVYKHAIRFGIQ------YKADRFGKRIKEEP

Query:  KP---------------------------------------------YRGPFGEQMDHEMLAVGY----TPNHIILKNSWGTDWGDDGYMKITRKAARIG
         P                                             + GP G ++DH + AVGY      N+II+KNSWG +WG+ GYM++ R++    
Subjt:  KP---------------------------------------------YRGPFGEQMDHEMLAVGY----TPNHIILKNSWGTDWGDDGYMKITRKAARIG

Query:  GE---FREASYP
        G    ++ + YP
Subjt:  GE---FREASYP

P25251 Cysteine proteinase COT44 (Fragment) (e value: 1.6e-17; identity: 29.78%)
Query:  FSTAHPHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEII-------------------------------QQNPRH
        +S A    +VP  VDWR +GAV ++K Q     CW ++   A+EGI KI+TGEL  LS  E++                               +  P H
Subjt:  FSTAHPHSKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEII-------------------------------QQNPRH

Query:  GDEG--------------GGYAKIVYKH--AIRFGIQYK-----ADRFGKRIKE-EPKPYRGPFGEQMDHEMLAVGYTP----NHIILKNSWGTDWGDDG
        G  G               GY  +  K   A++  + Y+      D  G+  +  +   + G  G  MDH ++AVGY      ++ I++NSWGT WG+DG
Subjt:  GDEG--------------GGYAKIVYKH--AIRFGIQYK-----ADRFGKRIKE-EPKPYRGPFGEQMDHEMLAVGYTP----NHIILKNSWGTDWGDDG

Query:  YMKITRKAARIGGEFR---EASYPI
        Y+++ R  A   G+     EASYP+
Subjt:  YMKITRKAARIGGEFR---EASYPI

P84346 Mexicain (e value: 1.1e-18; identity: 30.46%)
Query:  PKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGYAKIVYKHAIRFGI--------QYKADRFGKRIKE
        P+ +DWR +GAVT VK Q     CW ++ V  IEGI KI+TG+L  LS  E++    R     GGY     ++ +  G+        + K  R   + K+
Subjt:  PKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGYAKIVYKHAIRFGI--------QYKADRFGKRIKE

Query:  EPKP-------------------------------------------YRGPFGEQMDHEMLAVGYTPNHIILKNSWGTDWGDDGYMKITRKAARIGG
         PK                                            Y GP G   DH + AVGY   +++LKNSWG +WG+ GY++I R + R  G
Subjt:  EPKP-------------------------------------------YRGPFGEQMDHEMLAVGYTPNHIILKNSWGTDWGDDGYMKITRKAARIGG

Arabidopsis top hits: e value, % identity, alignment
AT1G47128.1 Granulin repeat cysteine protease family protein (e value: 1.8e-16; identity: 27.19%)
Query:  KVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDE-GGGYAKIVYKHAIRFG-------------------
        ++P+ +DWR +GAV  VK Q     CW ++ +GA+EGI +I+TG+L  LS  E++  +  + +   GG     ++  I+ G                   
Subjt:  KVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDE-GGGYAKIVYKHAIRFG-------------------

Query:  ----------------IQYKADRFGKRIKEEPKP-----------------YRGPFGEQMDHEMLAVGY-TPN---HIILKNSWGTDWGDDGYMKITRKA
                          Y  +   K +  +P                   + G  G Q+DH ++AVGY T N   + I++NSWG  WG+ GY+++ R  
Subjt:  ----------------IQYKADRFGKRIKEEPKP-----------------YRGPFGEQMDHEMLAVGY-TPN---HIILKNSWGTDWGDDGYMKITRKA

Query:  ARIGGEFR---EASYPI
        A   G+     E SYPI
Subjt:  ARIGGEFR---EASYPI

AT3G48340.1 Cysteine proteinases superfamily protein (e value: 1.8e-16; identity: 28.9%)
Query:  SKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDE-GGGYAKIVYKHAIR------------FGIQYKAD
        SK+P  VDWR +GAVT +K Q K   CW ++ V A+EGI KI T +L  LS  E++  + +  +   GG  +I ++   +             GI  K D
Subjt:  SKVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDE-GGGYAKIVYKHAIR------------FGIQYKAD

Query:  -----------------------RFGKRIKEEPKP-----------------YRGPFGEQMDHEMLAVGY----TPNHIILKNSWGTDWGDDGYMKITRK
                                  K +  +P                   + G  G +++H + AVGY       + I++NSWG +WG+ GY+KI R+
Subjt:  -----------------------RFGKRIKEEPKP-----------------YRGPFGEQMDHEMLAVGY----TPNHIILKNSWGTDWGDDGYMKITRK

Query:  AARIGGE---FREASYPI
             G      EASYPI
Subjt:  AARIGGE---FREASYPI

AT4G11310.1 Papain family cysteine protease (e value: 2.6e-18; identity: 31.94%)
Query:  VPKYVDWRTEGAVTSVKLQKKNR-CWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGYAKIVYKHAIRFG-------IQYKADR--FGKRI
        +PK VDWR EGAVT VK Q   R CW ++ VGA+EG+ KI+TGEL  LS  ++I  N  +   GGG  +  Y+  ++ G         YKA       R+
Subjt:  VPKYVDWRTEGAVTSVKLQKKNR-CWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGYAKIVYKHAIRFG-------IQYKADR--FGKRI

Query:  KEEPK--------------------------------------------PYRGPFGEQMDHEMLAVGY-TPN---HIILKNSWGTDWGDDGYMKITRKAA
        KE  K                                             + G  G  ++H ++ VGY T N   + ++KNS G  WG+ GYMK+ R  A
Subjt:  KEEPK--------------------------------------------PYRGPFGEQMDHEMLAVGY-TPN---HIILKNSWGTDWGDDGYMKITRKAA

Query:  R---IGGEFREASYPI
            + G    ASYP+
Subjt:  R---IGGEFREASYPI

AT4G11320.1 Papain family cysteine protease (e value: 2.6e-18; identity: 32.41%)
Query:  VPKYVDWRTEGAVTSVKLQKKNR-CWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGYAKIVYKHAIRFG-------IQYKA--DRFGKRI
        +PK VDWR EGAVT VK Q   R CW ++ VGA+EG+ KI+TGEL  LS  ++I  N  +   GGG  +  Y+  +  G         YKA       R+
Subjt:  VPKYVDWRTEGAVTSVKLQKKNR-CWVYAAVGAIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGYAKIVYKHAIRFG-------IQYKA--DRFGKRI

Query:  KEEPK--------------------------------------------PYRGPFGEQMDHEMLAVGY-TPN---HIILKNSWGTDWGDDGYMKITRKAA
        KE+ K                                             + G  G  ++H ++ VGY T N   + I+KNS G  WG+ GYMK+ R  A
Subjt:  KEEPK--------------------------------------------PYRGPFGEQMDHEMLAVGY-TPN---HIILKNSWGTDWGDDGYMKITRKAA

Query:  R---IGGEFREASYPI
            + G    ASYP+
Subjt:  R---IGGEFREASYPI

AT4G23520.1 Cysteine proteinases superfamily protein (e value: 2.0e-18; identity: 29.33%)
Query:  KVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQN-------------------------------PRHGDEGG---
        ++P+ VDWR EGAV+ +K Q   N CW ++ V A+EG+ KI+TGEL  LS  E++  N                               P  G +G    
Subjt:  KVPKYVDWRTEGAVTSVKLQKK-NRCWVYAAVGAIEGIYKIMTGELPILSVDEIIQQN-------------------------------PRHGDEGG---

Query:  -----------------------GYAKIVYKHAIRFGIQYKADRFGKRIKEEPKPYRGPFGEQMDHEMLAVGY----TPNHIILKNSWGTDWGDDGYMKI
                                  K V    +  G+  K+  F   +      Y GP G  +DH ++ VGY      ++ I++NSWGT WGD GY+KI
Subjt:  -----------------------GYAKIVYKHAIRFGIQYKADRFGKRIKEEPKPYRGPFGEQMDHEMLAVGY----TPNHIILKNSWGTDWGDDGYMKI

Query:  TRK---AARIGGEFREASYPIIDKA
         R       + G    ASYPI + A
Subjt:  TRK---AARIGGEFREASYPIIDKA


Sequences
CDS sequence
ATGATGTTGAGAGCTTTTCTAACTGGTGTTGTGGGCCTTCAGTTCTTTTATATTGGGCCTTCTTTAGCCCATGTTGAAAACCTTGTGTATTTGCAGAGAATCGGCAGCTT
CTCTTCTCAGCCCATTTTTGTTTCCGGGAGAAAGATGTATTCATTCGCTGGCTATCGGAATCTCCTCGGCCGTGCATTTTCAACCGTCCACGGCCGTGCATTTTCAACCG
CCCATCCGCATTCCAAGGTGCCGAAATATGTGGATTGGAGAACCGAAGGTGCTGTCACTTCGGTGAAGCTCCAAAAGAAAAATAGATGCTGGGTTTATGCTGCTGTAGGA
GCAATTGAAGGAATATACAAAATAATGACTGGAGAGCTACCTATACTATCAGTAGATGAAATCATCCAACAAAACCCCCGACATGGTGATGAAGGTGGTGGTTATGCGAA
GATCGTGTACAAACACGCAATACGCTTTGGAATACAATACAAAGCAGATAGATTCGGAAAACGGATAAAAGAAGAGCCGAAACCATATCGGGGACCATTTGGAGAGCAAA
TGGACCATGAAATGCTTGCGGTAGGGTATACCCCAAATCATATAATATTAAAGAATTCATGGGGAACAGATTGGGGGGATGATGGGTACATGAAGATTACCCGAAAAGCA
GCCAGAATTGGAGGAGAATTTAGGGAGGCCAGCTATCCCATCATAGACAAAGCTTATTAA
mRNA sequence
ATGATGTTGAGAGCTTTTCTAACTGGTGTTGTGGGCCTTCAGTTCTTTTATATTGGGCCTTCTTTAGCCCATGTTGAAAACCTTGTGTATTTGCAGAGAATCGGCAGCTT
CTCTTCTCAGCCCATTTTTGTTTCCGGGAGAAAGATGTATTCATTCGCTGGCTATCGGAATCTCCTCGGCCGTGCATTTTCAACCGTCCACGGCCGTGCATTTTCAACCG
CCCATCCGCATTCCAAGGTGCCGAAATATGTGGATTGGAGAACCGAAGGTGCTGTCACTTCGGTGAAGCTCCAAAAGAAAAATAGATGCTGGGTTTATGCTGCTGTAGGA
GCAATTGAAGGAATATACAAAATAATGACTGGAGAGCTACCTATACTATCAGTAGATGAAATCATCCAACAAAACCCCCGACATGGTGATGAAGGTGGTGGTTATGCGAA
GATCGTGTACAAACACGCAATACGCTTTGGAATACAATACAAAGCAGATAGATTCGGAAAACGGATAAAAGAAGAGCCGAAACCATATCGGGGACCATTTGGAGAGCAAA
TGGACCATGAAATGCTTGCGGTAGGGTATACCCCAAATCATATAATATTAAAGAATTCATGGGGAACAGATTGGGGGGATGATGGGTACATGAAGATTACCCGAAAAGCA
GCCAGAATTGGAGGAGAATTTAGGGAGGCCAGCTATCCCATCATAGACAAAGCTTATTAA
Protein sequence
MMLRAFLTGVVGLQFFYIGPSLAHVENLVYLQRIGSFSSQPIFVSGRKMYSFAGYRNLLGRAFSTVHGRAFSTAHPHSKVPKYVDWRTEGAVTSVKLQKKNRCWVYAAVG
AIEGIYKIMTGELPILSVDEIIQQNPRHGDEGGGYAKIVYKHAIRFGIQYKADRFGKRIKEEPKPYRGPFGEQMDHEMLAVGYTPNHIILKNSWGTDWGDDGYMKITRKA
ARIGGEFREASYPIIDKAY
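
The protein sequence is the conceptual translation of the CDS. As a rough check, the sketch below translates a DNA string with the standard genetic code, which is assumed here because the record does not name a translation table. `translate` is a hypothetical helper written for illustration.

```python
from itertools import product

# Standard genetic code (assumed), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 36 and last 15 bases of the CDS listed above.
print(translate("ATGATGTTGAGAGCTTTTCTAACTGGTGTTGTGGGC"))
print(translate("GACAAAGCTTATTAA"))
```

The two fragments translate to `MMLRAFLTGVVG` and `DKAY`, which match the start and end of the protein sequence shown above. Joining all CDS lines and passing them to `translate` would let the full sequence be compared in the same way.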