; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh08G000800 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh08G000800
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionEukaryotic aspartyl protease family protein
Genome locationCmo_Chr08:412769..419746
RNA-Seq ExpressionCmoCh08G000800
SyntenyCmoCh08G000800
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001461 - Aspartic peptidase A1 family
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6592874.1 Aspartic proteinase-like protein 2, partial [Cucurbita argyrosperma subsp. sororia]3.3e-13498.32Show/hide
Query:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF
        CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF
Subjt:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF

Query:  EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSR
        EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSR
Subjt:  EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSR

Query:  DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKV
        DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNC   +
Subjt:  DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKV

KAG7025279.1 Aspartic proteinase-like protein 2 [Cucurbita argyrosperma subsp. argyrosperma]2.2e-13897.19Show/hide
Query:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF
        CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF
Subjt:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF

Query:  EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSR
        EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSR
Subjt:  EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSR

Query:  DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKVIASLFSKLLPH
        DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKVIAS   ++  H
Subjt:  DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKVIASLFSKLLPH

XP_022959530.1 aspartic proteinase-like protein 2 [Cucurbita moschata]3.3e-13498.32Show/hide
Query:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF
        CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF
Subjt:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF

Query:  EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSR
        EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSR
Subjt:  EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSR

Query:  DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKV
        DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNC   +
Subjt:  DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKV

XP_023005003.1 aspartic proteinase-like protein 2 [Cucurbita maxima]8.1e-13397.48Show/hide
Query:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF
        CGARQSGDLGS GEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF
Subjt:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF

Query:  EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSR
        EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQ+IHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSR
Subjt:  EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSR

Query:  DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKV
        DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNC   +
Subjt:  DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKV

XP_023514824.1 aspartic proteinase-like protein 2 [Cucurbita pepo subsp. pepo]8.1e-13397.48Show/hide
Query:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF
        CGARQSGDLGS GEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRV+LNISADVF
Subjt:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF

Query:  EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSR
        EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSR
Subjt:  EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSR

Query:  DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKV
        DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNC   +
Subjt:  DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKV

TrEMBL top hitse value%identityAlignment
A0A0A0KAX9 Peptidase A1 domain-containing protein1.0e-12592.02Show/hide
Query:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF
        CGARQSGDLGS GEEALDGILGFGKSNSSIISQLAS+RKVKKMFAHCLDG NGGGIFAMGHVVQPKV MTPLVPNQPHYNVNMTGVQVG ++LNISADVF
Subjt:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF

Query:  EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSR
        EAGDRKGTIIDSGTTLAYLPELIYEPLV  ILS+QHNLEVQTIHGEYKCFQYS  VDDGFPPV FHFENSLLLKVYPHEYLFQ+E LWCIGWQNSGMQSR
Subjt:  EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSR

Query:  DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKV
        DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNC   +
Subjt:  DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKV

A0A1S4DVW7 aspartic proteinase-like protein 2 isoform X11.2e-12693.28Show/hide
Query:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF
        CGARQSGDLGS GEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDG NGGGIFAMGHVVQPKV MTPLVPNQPHYNVNMTGVQVG VMLNISADVF
Subjt:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF

Query:  EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSR
        EAGDRKGTIIDSGTTLAYLPELIYEPLV  ILS+QHNLEVQTIHGEYKCFQYS  VDDGFPPV FHFENSLLLKVYPHEYLFQ+E LWCIGWQNSGMQSR
Subjt:  EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSR

Query:  DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKV
        DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNC   +
Subjt:  DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKV

A0A6J1DAP8 aspartic proteinase-like protein 21.4e-12794.12Show/hide
Query:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF
        CGARQSGDLGS GEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKV MTPLVPNQPHYNVNMTGVQVGRVMLNISADVF
Subjt:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF

Query:  EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSR
        EAGDRKGTIID GTTLAYLPELIY PLVTMI+SRQ NLEVQTIHGEYKCFQYS SVDDGFPPV FHFENSLLLKVYPHEYLFQ+EGLWC+GWQNSGMQSR
Subjt:  EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSR

Query:  DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKV
        DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNC   +
Subjt:  DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKV

A0A6J1H6J5 aspartic proteinase-like protein 21.6e-13498.32Show/hide
Query:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF
        CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF
Subjt:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF

Query:  EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSR
        EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSR
Subjt:  EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSR

Query:  DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKV
        DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNC   +
Subjt:  DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKV

A0A6J1KXX5 aspartic proteinase-like protein 23.9e-13397.48Show/hide
Query:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF
        CGARQSGDLGS GEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF
Subjt:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF

Query:  EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSR
        EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQ+IHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSR
Subjt:  EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSR

Query:  DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKV
        DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNC   +
Subjt:  DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKV

SwissProt top hitse value%identityAlignment
A2ZC67 Aspartic proteinase Asp14.8e-1124.39Show/hide
Query:  LDGILGFGKSNSSIISQLASSRKV-KKMFAHCLDGINGGGIFAMGHVVQPK--VIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVFEAGDRKGTIIDSG
        ++GILG G+   +++SQL S   + K +  HC+    G G    G    P   V  +P+     HY+     +Q       ISA   E       I DSG
Subjt:  LDGILGFGKSNSSIISQLASSRKV-KKMFAHCLDGINGGGIFAMGHVVQPK--VIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVFEAGDRKGTIIDSG

Query:  TTLAYLPELIYEPLVTMI---LSRQHNL-------------------EVQTIHGEYKCFQ-YSRSVDDGFPPVTFHFENSLLLKVYPHEYL-FQHEGLWC
         T  Y     Y   ++++   LS++                      +++TI    KCF+  S    DG        +    L++ P  YL    EG  C
Subjt:  TTLAYLPELIYEPLVTMI---LSRQHNL-------------------EVQTIHGEYKCFQ-YSRSVDDGFPPVTFHFENSLLLKVYPHEYL-FQHEGLWC

Query:  IGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCE
        +G  +   +        L G + + +++V+YD E   +GW  Y C+
Subjt:  IGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCE

Q3EBM5 Probable aspartic protease At2g356158.1e-1125.39Show/hide
Query:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLD----GINGGGIFAMGHVVQPK-------VIMTPLVPNQP--HYNVNMTGVQ
        CG    G     G     GI+G G  + S+ISQL SS  + K F++CL       NG  +  +G    P        V+ TPLV  +P  +Y + +  + 
Subjt:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLD----GINGGGIFAMGHVVQPK-------VIMTPLVPNQP--HYNVNMTGVQ

Query:  VGRVMLNISADVFEAGD-------RKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLE-VQTIHGEYK-CFQYSRSVDDGFPPVTFHFENSLLLKVYPH
        VG+  +  +   +   D           IIDSGTTL  L    ++   + +       + V    G    CF+ S S + G P +T HF  + +     +
Subjt:  VGRVMLNISADVFEAGD-------RKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLE-VQTIHGEYK-CFQYSRSVDDGFPPVTFHFENSLLLKVYPH

Query:  EYLFQHEGLWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNC
         ++   E + C+    +         V ++G+    + LV YDLE +T+ +   +C
Subjt:  EYLFQHEGLWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNC

Q4V3D2 Aspartic proteinase 362.3e-6645.08Show/hide
Query:  AKCPCSYKLSSEYGIQAVVLGYQQSVEGP--------KRIRRNLR-YPRPDDI---CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMF
        AK PCSY          VV G   + +G         +++  NLR  P   ++   CG  QSG LG   + A+DGI+GFG+SN+SIISQLA+    K++F
Subjt:  AKCPCSYKLSSEYGIQAVVLGYQQSVEGP--------KRIRRNLR-YPRPDDI---CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMF

Query:  AHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIH
        +HCLD +NGGGIFA+G V  P V  TP+VPNQ HYNV + G+ V    +++   +       GTIIDSGTTLAYLP+ +Y  L+  I ++Q  +++  + 
Subjt:  AHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIH

Query:  GEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQ-HEGLWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKV
          + CF ++ + D  FP V  HFE+SL L VYPH+YLF   E ++C GWQ+ GM ++D  +V L GDLVLSNKLV+YDLEN+ IGW ++NC   +
Subjt:  GEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQ-HEGLWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKV

Q9LTW4 Aspartic proteinase NANA, chloroplast3.9e-1325.68Show/hide
Query:  PCSYKLSSEYGIQAVVLGYQQSVEGPKRIRRNLRYPRPDDICGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHC----LDGINGGG
        PCSY      G  A  +  ++++       R  R P     C +  +G       +  DG+LG   S+ S  S   S    K  F++C    L   N   
Subjt:  PCSYKLSSEYGIQAVVLGYQQSVEGPKRIRRNLRYPRPDDICGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHC----LDGINGGG

Query:  IFAMGHVVQPKVIM---TPL----VPNQPHYNVNMTGVQVGRVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYK
            G     K      TPL    +P  P Y +N+ G+ +G  ML+I + V++A    GTI+DSGT+L  L +  Y+ +VT +   ++ +E++ +  E  
Subjt:  IFAMGHVVQPKVIM---TPL----VPNQPHYNVNMTGVQVGRVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYK

Query:  CFQYSRSVDDGF-----PPVTFHFENSLLLKVYPHEYLFQ-HEGLWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNC
          +Y  S   GF     P +TFH +     + +   YL     G+ C+G+ ++G  +       + G+++  N L  +DL   T+ +    C
Subjt:  CFQYSRSVDDGF-----PPVTFHFENSLLLKVYPHEYLFQ-HEGLWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNC

Q9S9K4 Aspartic proteinase 399.4e-6850.21Show/hide
Query:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF
        CG+ QSG LG+ G+ A+DG++GFG+SN+S++SQLA++   K++F+HCLD + GGGIFA+G V  PKV  TP+VPNQ HYNV + G+ V    L++   + 
Subjt:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF

Query:  EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQ-HEGLWCIGWQNSGMQS
          G   GTI+DSGTTLAY P+++Y+ L+  IL+RQ  +++  +   ++CF +S +VD+ FPPV+F FE+S+ L VYPH+YLF   E L+C GWQ  G+ +
Subjt:  EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQ-HEGLWCIGWQNSGMQS

Query:  RDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKV
         +R  V L GDLVLSNKLV+YDL+N+ IGW ++NC   +
Subjt:  RDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKV

Arabidopsis top hitse value%identityAlignment
AT1G05840.1 Eukaryotic aspartyl protease family protein1.6e-10776.47Show/hide
Query:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF
        CGARQSGDL S  EEALDGILGFGK+NSS+ISQLASS +VKK+FAHCLDG NGGGIFA+G VVQPKV MTPLVPNQPHYNVNMT VQVG+  L I AD+F
Subjt:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF

Query:  EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSR
        + GDRKG IIDSGTTLAYLPE+IYEPLV  I S++  L+V  +  +YKCFQYS  VD+GFP VTFHFENS+ L+VYPH+YLF HEG+WCIGWQNS MQSR
Subjt:  EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSR

Query:  DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKV
        DR+N+TL GDLVLSNKLVLYDLENQ IGWTEYNC   +
Subjt:  DRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKV

AT1G65240.1 Eukaryotic aspartyl protease family protein6.7e-6950.21Show/hide
Query:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF
        CG+ QSG LG+ G+ A+DG++GFG+SN+S++SQLA++   K++F+HCLD + GGGIFA+G V  PKV  TP+VPNQ HYNV + G+ V    L++   + 
Subjt:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF

Query:  EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQ-HEGLWCIGWQNSGMQS
          G   GTI+DSGTTLAY P+++Y+ L+  IL+RQ  +++  +   ++CF +S +VD+ FPPV+F FE+S+ L VYPH+YLF   E L+C GWQ  G+ +
Subjt:  EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQ-HEGLWCIGWQNSGMQS

Query:  RDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKV
         +R  V L GDLVLSNKLV+YDL+N+ IGW ++NC   +
Subjt:  RDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKV

AT3G02740.1 Eukaryotic aspartyl protease family protein3.1e-7454.47Show/hide
Query:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF
        CG++QSG LG   + A+DGI+GFG+SNSS ISQLAS  KVK+ FAHCLD  NGGGIFA+G VV PKV  TP++    HY+VN+  ++VG  +L +S++ F
Subjt:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVF

Query:  EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQ-HEGLWCIGWQNSGMQS
        ++GD KG IIDSGTTL YLP+ +Y PL+  IL+    L + T+   + CF Y+  + D FP VTF F+ S+ L VYP EYLFQ  E  WC GWQN G+Q+
Subjt:  EAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQ-HEGLWCIGWQNSGMQS

Query:  RDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNC
        +   ++T+ GD+ LSNKLV+YD+ENQ IGWT +NC
Subjt:  RDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNC

AT5G22850.1 Eukaryotic aspartyl protease family protein1.8e-5041.2Show/hide
Query:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGIN-GGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADV
        C   Q+GDL    + A+DGI GFG+   S+ISQLAS     ++F+HCL G N GGGI  +G +V+P ++ TPLVP+QPHYNVN+  + V    L I+  V
Subjt:  CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCLDGIN-GGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADV

Query:  FEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHE-----GLWCIGWQN
        F   + +GTIID+GTTLAYL E  Y P V  I +         +    +C+  + SV D FPPV+ +F     + + P +YL Q        +WCIG+Q 
Subjt:  FEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQHE-----GLWCIGWQN

Query:  SGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKVIASLFS
                + +T+ GDLVL +K+ +YDL  Q IGW  Y+C   V  S  S
Subjt:  SGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKVIASLFS

AT5G36260.1 Eukaryotic aspartyl protease family protein1.7e-6745.08Show/hide
Query:  AKCPCSYKLSSEYGIQAVVLGYQQSVEGP--------KRIRRNLR-YPRPDDI---CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMF
        AK PCSY          VV G   + +G         +++  NLR  P   ++   CG  QSG LG   + A+DGI+GFG+SN+SIISQLA+    K++F
Subjt:  AKCPCSYKLSSEYGIQAVVLGYQQSVEGP--------KRIRRNLR-YPRPDDI---CGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMF

Query:  AHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIH
        +HCLD +NGGGIFA+G V  P V  TP+VPNQ HYNV + G+ V    +++   +       GTIIDSGTTLAYLP+ +Y  L+  I ++Q  +++  + 
Subjt:  AHCLDGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIH

Query:  GEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQ-HEGLWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKV
          + CF ++ + D  FP V  HFE+SL L VYPH+YLF   E ++C GWQ+ GM ++D  +V L GDLVLSNKLV+YDLEN+ IGW ++NC   +
Subjt:  GEYKCFQYSRSVDDGFPPVTFHFENSLLLKVYPHEYLFQ-HEGLWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTCTTTGCGTAAAGAAAAAAGAACTCGAGAATGGCTGTCCAAGGAGAGGGCAAAATGTCCTTGTTCCTACAAGTTGAGCTCTGAGTATGGTATTCAAGCCGTAGT
ACTAGGCTACCAGCAGTCAGTGGAAGGGCCAAAGAGAATTAGGCGGAATTTGAGATATCCACGGCCAGATGACATATGTGGTGCTAGACAATCTGGGGATTTAGGTTCGC
CTGGTGAAGAAGCACTTGATGGGATACTTGGTTTCGGAAAATCAAATTCATCTATTATTTCACAACTAGCCTCCTCAAGAAAAGTGAAGAAGATGTTTGCTCATTGCCTA
GACGGAATAAATGGGGGTGGTATATTTGCGATGGGACATGTTGTGCAGCCAAAAGTTATCATGACTCCGTTGGTACCAAATCAGCCACATTACAACGTTAATATGACGGG
AGTACAAGTTGGTCGTGTCATGTTAAATATTTCTGCTGATGTATTTGAGGCGGGAGATAGAAAAGGGACGATCATTGATAGTGGCACAACTTTGGCATATCTTCCAGAAT
TGATTTACGAGCCATTAGTGACCATGATACTCTCACGGCAACACAATTTGGAAGTTCAAACCATTCATGGAGAATATAAATGTTTTCAGTACTCAAGAAGTGTCGATGAT
GGATTTCCTCCAGTTACTTTCCATTTCGAGAATTCACTCTTGTTGAAGGTTTATCCTCATGAATATCTATTCCAACATGAGGGCTTGTGGTGTATTGGTTGGCAAAACAG
TGGGATGCAATCTAGGGATAGGAAGAACGTTACCCTCTTTGGAGATTTAGTGCTTTCAAATAAGCTAGTTTTATACGATCTCGAGAACCAAACAATCGGGTGGACCGAAT
ACAACTGTGAGTACAAAGTTATTGCTTCATTGTTTTCGAAGTTGTTGCCGCATGTTCTTCAAGCATCAAAGTGCAAGATGAACAAACCGGAACACGTACAGATTGAATAC
CAAATGGGCTGTGATGTTGCTATTCCTAATCTTGGTGATGCACTGGTCAGCTCATTCCAGATGCCTCAGCTAACACAAAATCCCGACAGCAGGCCCTACAGAATTGAGTT
GGCTATGTCTGCTAAAAGATTCGCAGACCCTCTTCAGCAGTATAGTTTCTGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGTCTTTGCGTAAAGAAAAAAGAACTCGAGAATGGCTGTCCAAGGAGAGGGCAAAATGTCCTTGTTCCTACAAGTTGAGCTCTGAGTATGGTATTCAAGCCGTAGT
ACTAGGCTACCAGCAGTCAGTGGAAGGGCCAAAGAGAATTAGGCGGAATTTGAGATATCCACGGCCAGATGACATATGTGGTGCTAGACAATCTGGGGATTTAGGTTCGC
CTGGTGAAGAAGCACTTGATGGGATACTTGGTTTCGGAAAATCAAATTCATCTATTATTTCACAACTAGCCTCCTCAAGAAAAGTGAAGAAGATGTTTGCTCATTGCCTA
GACGGAATAAATGGGGGTGGTATATTTGCGATGGGACATGTTGTGCAGCCAAAAGTTATCATGACTCCGTTGGTACCAAATCAGCCACATTACAACGTTAATATGACGGG
AGTACAAGTTGGTCGTGTCATGTTAAATATTTCTGCTGATGTATTTGAGGCGGGAGATAGAAAAGGGACGATCATTGATAGTGGCACAACTTTGGCATATCTTCCAGAAT
TGATTTACGAGCCATTAGTGACCATGATACTCTCACGGCAACACAATTTGGAAGTTCAAACCATTCATGGAGAATATAAATGTTTTCAGTACTCAAGAAGTGTCGATGAT
GGATTTCCTCCAGTTACTTTCCATTTCGAGAATTCACTCTTGTTGAAGGTTTATCCTCATGAATATCTATTCCAACATGAGGGCTTGTGGTGTATTGGTTGGCAAAACAG
TGGGATGCAATCTAGGGATAGGAAGAACGTTACCCTCTTTGGAGATTTAGTGCTTTCAAATAAGCTAGTTTTATACGATCTCGAGAACCAAACAATCGGGTGGACCGAAT
ACAACTGTGAGTACAAAGTTATTGCTTCATTGTTTTCGAAGTTGTTGCCGCATGTTCTTCAAGCATCAAAGTGCAAGATGAACAAACCGGAACACGTACAGATTGAATAC
CAAATGGGCTGTGATGTTGCTATTCCTAATCTTGGTGATGCACTGGTCAGCTCATTCCAGATGCCTCAGCTAACACAAAATCCCGACAGCAGGCCCTACAGAATTGAGTT
GGCTATGTCTGCTAAAAGATTCGCAGACCCTCTTCAGCAGTATAGTTTCTGGTAG
Protein sequenceShow/hide protein sequence
MESLRKEKRTREWLSKERAKCPCSYKLSSEYGIQAVVLGYQQSVEGPKRIRRNLRYPRPDDICGARQSGDLGSPGEEALDGILGFGKSNSSIISQLASSRKVKKMFAHCL
DGINGGGIFAMGHVVQPKVIMTPLVPNQPHYNVNMTGVQVGRVMLNISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVTMILSRQHNLEVQTIHGEYKCFQYSRSVDD
GFPPVTFHFENSLLLKVYPHEYLFQHEGLWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCEYKVIASLFSKLLPHVLQASKCKMNKPEHVQIEY
QMGCDVAIPNLGDALVSSFQMPQLTQNPDSRPYRIELAMSAKRFADPLQQYSFW