CuGenDBv2

Sgr029977 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr029977
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: aspartic proteinase CDR1-like
Genome location: tig00153554:1704662..1715209
RNA-Seq Expression: Sgr029977
Synteny: Sgr029977
Gene Ontology terms: NA
InterPro domains: IPR021109 - Aspartic peptidase domain superfamily
                  IPR032861 - Xylanase inhibitor, N-terminal
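
The genome location field above follows a `contig:start..end` pattern. As a minimal sketch, the field can be split into its parts and the span length computed. The function name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that this page does not state.

```python
import re

def parse_location(location):
    """Split 'contig:start..end' into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)

contig, start, end = parse_location("tig00153554:1704662..1715209")
# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_length = end - start + 1
print(contig, start, end, span_length)
```

Under that coordinate assumption, the annotated gene spans 10548 positions on the contig.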


Homology
GenBank top hits    e value    % identity
XP_022140900.1 aspartic proteinase CDR1-like [Momordica charantia]    9.0e-56    45.43
Query:  ITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYGGGLTLGSLA
        ITP+NSQFLVKL+VGT PR+VFA+LDTGSDL WTQCLPCA+CY Q NP+FDPSRS SF EL C +  CHL GSGA CSGGG  C YS GYGGGLT G+LA
Subjt:  ITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYGGGLTLGSLA

Query:  TETIAITSRFGQRV-SFRNVVFGCGHNNSGDW-------------PIRWRTKVLPLPDAIQHRPQNFKQPHN---------RVGFGSS--GPRSHHNPAG
        TET+A++SR G+RV  F+NVVFGCGHNNSG +             P+ + +++ P       +  +   P N         ++G GS   GP        
Subjt:  TETIAITSRFGQRV-SFRNVVFGCGHNNSGDW-------------PIRWRTKVLPLPDAIQHRPQNFKQPHN---------RVGFGSS--GPRSHHNPAG

Query:  SRARPDILLSHTHWNQRRRNLPPVQFVGTGGEGKRDTRFRHSADSSPPGPVWPFGLRSEAADSIDACGRESLLRKG---EEVRGAAGDS---HFDGGVDL
            P             +   P    G   +G          DS  P  + P    S  A  +    R   +  G     VR  A      HFDGGV+L
Subjt:  SRARPDILLSHTHWNQRRRNLPPVQFVGTGGEGKRDTRFRHSADSSPPGPVWPFGLRSEAADSIDACGRESLLRKG---EEVRGAAGDS---HFDGGVDL

Query:  QLSTVQTFIRMQDGSFCFSVQGVSGSGG
         LSTVQTFIR +DGSFCF+V G+SG+GG
Subjt:  QLSTVQTFIRMQDGSFCFSVQGVSGSGG

XP_022929935.1 aspartic proteinase CDR1-like [Cucurbita moschata]    9.4e-45    39.82
Query:  ITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYGGGLTLGSLA
        I PE S+F+VK++VGT P EV A+LDTGSDL W QC PCA CY+Q NP++DPS+SL+F  L C SP CHL GSGA CS G   C Y  GYG G T G LA
Subjt:  ITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYGGGLTLGSLA

Query:  TETIAITSRFGQRVSFRNVVFGCGHNNSGDWPIRWR----------TKVLPLPDAIQHRPQNF-KQPHN---------RVGFGS--SGP--------RSH
        TE +A+TSR G + SF  VVFGCGHNNSG +               + V  +  ++  R  +    P+N          +G GS   GP        R+ 
Subjt:  TETIAITSRFGQRVSFRNVVFGCGHNNSGDWPIRWR----------TKVLPLPDAIQHRPQNF-KQPHN---------RVGFGS--SGP--------RSH

Query:  HNPAGSRARPDILLSHTHWNQRRRNLPPVQFVGTGGEGKRDTRFRHSADSSPPGPVWP---FG---------LRSEAADSIDACGRESLLRKGEEVRGAA
           + S     I +  T        L P    G   +G          D+  P  + P   +G         + S+  D    C +++L   G+ V    
Subjt:  HNPAGSRARPDILLSHTHWNQRRRNLPPVQFVGTGGEGKRDTRFRHSADSSPPGPVWP---FG---------LRSEAADSIDACGRESLLRKGEEVRGAA

Query:  GDSHFDGGVDLQLSTVQTFIRMQDGSFCFSVQGV
           HFDGGVDL+LSTVQTF +M DGSFCF+  GV
Subjt:  GDSHFDGGVDLQLSTVQTFIRMQDGSFCFSVQGV

XP_022942002.1 aspartic proteinase CDR1-like [Cucurbita moschata]    7.9e-44    39.21
Query:  ITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYGGGLTLGSLA
        I PE SQF+VK+++GT P +V A+LDTGSDL W QC PC  CY+Q NP++DPS+S +F  L C SP CHLWGSGA CS G   C Y  GYG G TLG LA
Subjt:  ITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYGGGLTLGSLA

Query:  TETIAITSRFGQRVSFRNVVFGCGHNNSGDWPIRWR----------TKVLPLPDAIQHRPQNF-KQPHN---------RVGFGS--SGP--------RSH
        TE +AITSR G   SF  VVFGCGHNNSG +               + V  +  ++  R  +    P+N          +G GS   GP        R+ 
Subjt:  TETIAITSRFGQRVSFRNVVFGCGHNNSGDWPIRWR----------TKVLPLPDAIQHRPQNF-KQPHN---------RVGFGS--SGP--------RSH

Query:  HNPAGSRARPDILLSHTHWNQRRRNLPPVQFVGTGGEGKRDTRFRHSADSSPPGPVWPFGLRSEAADSIDACGRESLLRKGEEV---RGAAGD----SHF
           + S     I +  T           V +  +G   K +       D+  P  + P  L    A  +    R   L+  ++    +G  G+     HF
Subjt:  HNPAGSRARPDILLSHTHWNQRRRNLPPVQFVGTGGEGKRDTRFRHSADSSPPGPVWPFGLRSEAADSIDACGRESLLRKGEEV---RGAAGD----SHF

Query:  DGGVDLQLSTVQTFIRMQDGSFCFSVQGV
        DG VDL+LST QTF +M DGSFCF+  GV
Subjt:  DGGVDLQLSTVQTFIRMQDGSFCFSVQGV

XP_022987324.1 aspartic proteinase CDR1-like [Cucurbita maxima]    1.4e-43    39.22
Query:  ITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYGGGLTLGSLA
        I PE S+F+VK+++GT P EV A+LDTGSDL W QC PCA CYQQ NP++DPS+S +F  L C SP CHL GSGA CS G   C YS GYG G T G LA
Subjt:  ITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYGGGLTLGSLA

Query:  TETIAITSRFGQRVSFRNVVFGCGHNNSGDWPIRWR----------TKVLPLPDAIQHRPQNF-KQPHN---------RVGFGS--SGP--------RSH
        +E +A+TSR G    F  VVFGCGHNNSG +               + V  +  ++  R  +    P+N          +G GS   GP        R+ 
Subjt:  TETIAITSRFGQRVSFRNVVFGCGHNNSGDWPIRWR----------TKVLPLPDAIQHRPQNF-KQPHN---------RVGFGS--SGP--------RSH

Query:  HNPAGSRARPDILLSHTHWNQRRRNLPPVQFVGTGGEGKRDTRFRHSADSSPPGPVWP---FG---------LRSEAADSIDACGRESLLRKGEEVRGAA
           + S     I +        R+ L P    G   +G          D+  P  + P   +G         + S+  D    C +++L   G+ V    
Subjt:  HNPAGSRARPDILLSHTHWNQRRRNLPPVQFVGTGGEGKRDTRFRHSADSSPPGPVWP---FG---------LRSEAADSIDACGRESLLRKGEEVRGAA

Query:  GDSHFDGGVDLQLSTVQTFIRMQDGSFCFSVQGV
           HFDGGVDL+LSTVQTF +M DGSFCF+  GV
Subjt:  GDSHFDGGVDLQLSTVQTFIRMQDGSFCFSVQGV

XP_023526675.1 aspartic proteinase CDR1-like [Cucurbita pepo subsp. pepo]    6.1e-44    39.51
Query:  ITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYGGGLTLGSLA
        I PE SQF+VK++VGT P EV A+LDTGSDL W QC PCA CY+Q NP++DPS+S +F  L C SP CHLWGSGA CS G   C Y  GYG G T G LA
Subjt:  ITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYGGGLTLGSLA

Query:  TETIAITSRFGQRVSFRNVVFGCGHNNSGDWPIRWR----------TKVLPLPDAIQHRPQNF-KQPHNRVGFGSSGPRSHHNPAGSRAR-PDILLS---
        TE +A+TSR G    F  VVFGCGHNNSG +               + V  +  ++  R  +    P+N     SS   S    +GS  + PD+  +   
Subjt:  TETIAITSRFGQRVSFRNVVFGCGHNNSGDWPIRWR----------TKVLPLPDAIQHRPQNF-KQPHNRVGFGSSGPRSHHNPAGSRAR-PDILLS---

Query:  ----HTHWNQRRRNLP------PVQFVGTGGEGKRDTRFRHSADSSPPGPVWPFGLRSEAADSIDACGRESLLRKGEEV---RGAAGD----SHFDGGVD
             T ++   + +       P   +G   +G          D+  P  + P  L    A  +    R   L+  ++    +G  G+     HFDG VD
Subjt:  ----HTHWNQRRRNLP------PVQFVGTGGEGKRDTRFRHSADSSPPGPVWPFGLRSEAADSIDACGRESLLRKGEEV---RGAAGD----SHFDGGVD

Query:  LQLSTVQTFIRMQDGSFCFSVQGV
        L+LST QTF +M DGSFCF+  GV
Subjt:  LQLSTVQTFIRMQDGSFCFSVQGV

TrEMBL top hits    e value    % identity
A0A6J1CGG6 aspartic proteinase CDR1-like    4.4e-56    45.43
Query:  ITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYGGGLTLGSLA
        ITP+NSQFLVKL+VGT PR+VFA+LDTGSDL WTQCLPCA+CY Q NP+FDPSRS SF EL C +  CHL GSGA CSGGG  C YS GYGGGLT G+LA
Subjt:  ITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYGGGLTLGSLA

Query:  TETIAITSRFGQRV-SFRNVVFGCGHNNSGDW-------------PIRWRTKVLPLPDAIQHRPQNFKQPHN---------RVGFGSS--GPRSHHNPAG
        TET+A++SR G+RV  F+NVVFGCGHNNSG +             P+ + +++ P       +  +   P N         ++G GS   GP        
Subjt:  TETIAITSRFGQRV-SFRNVVFGCGHNNSGDW-------------PIRWRTKVLPLPDAIQHRPQNFKQPHN---------RVGFGSS--GPRSHHNPAG

Query:  SRARPDILLSHTHWNQRRRNLPPVQFVGTGGEGKRDTRFRHSADSSPPGPVWPFGLRSEAADSIDACGRESLLRKG---EEVRGAAGDS---HFDGGVDL
            P             +   P    G   +G          DS  P  + P    S  A  +    R   +  G     VR  A      HFDGGV+L
Subjt:  SRARPDILLSHTHWNQRRRNLPPVQFVGTGGEGKRDTRFRHSADSSPPGPVWPFGLRSEAADSIDACGRESLLRKG---EEVRGAAGDS---HFDGGVDL

Query:  QLSTVQTFIRMQDGSFCFSVQGVSGSGG
         LSTVQTFIR +DGSFCF+V G+SG+GG
Subjt:  QLSTVQTFIRMQDGSFCFSVQGVSGSGG

A0A6J1EVM9 aspartic proteinase CDR1-like    4.5e-45    39.82
Query:  ITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYGGGLTLGSLA
        I PE S+F+VK++VGT P EV A+LDTGSDL W QC PCA CY+Q NP++DPS+SL+F  L C SP CHL GSGA CS G   C Y  GYG G T G LA
Subjt:  ITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYGGGLTLGSLA

Query:  TETIAITSRFGQRVSFRNVVFGCGHNNSGDWPIRWR----------TKVLPLPDAIQHRPQNF-KQPHN---------RVGFGS--SGP--------RSH
        TE +A+TSR G + SF  VVFGCGHNNSG +               + V  +  ++  R  +    P+N          +G GS   GP        R+ 
Subjt:  TETIAITSRFGQRVSFRNVVFGCGHNNSGDWPIRWR----------TKVLPLPDAIQHRPQNF-KQPHN---------RVGFGS--SGP--------RSH

Query:  HNPAGSRARPDILLSHTHWNQRRRNLPPVQFVGTGGEGKRDTRFRHSADSSPPGPVWP---FG---------LRSEAADSIDACGRESLLRKGEEVRGAA
           + S     I +  T        L P    G   +G          D+  P  + P   +G         + S+  D    C +++L   G+ V    
Subjt:  HNPAGSRARPDILLSHTHWNQRRRNLPPVQFVGTGGEGKRDTRFRHSADSSPPGPVWP---FG---------LRSEAADSIDACGRESLLRKGEEVRGAA

Query:  GDSHFDGGVDLQLSTVQTFIRMQDGSFCFSVQGV
           HFDGGVDL+LSTVQTF +M DGSFCF+  GV
Subjt:  GDSHFDGGVDLQLSTVQTFIRMQDGSFCFSVQGV

A0A6J1FP13 aspartic proteinase CDR1-like    3.8e-44    39.21
Query:  ITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYGGGLTLGSLA
        I PE SQF+VK+++GT P +V A+LDTGSDL W QC PC  CY+Q NP++DPS+S +F  L C SP CHLWGSGA CS G   C Y  GYG G TLG LA
Subjt:  ITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYGGGLTLGSLA

Query:  TETIAITSRFGQRVSFRNVVFGCGHNNSGDWPIRWR----------TKVLPLPDAIQHRPQNF-KQPHN---------RVGFGS--SGP--------RSH
        TE +AITSR G   SF  VVFGCGHNNSG +               + V  +  ++  R  +    P+N          +G GS   GP        R+ 
Subjt:  TETIAITSRFGQRVSFRNVVFGCGHNNSGDWPIRWR----------TKVLPLPDAIQHRPQNF-KQPHN---------RVGFGS--SGP--------RSH

Query:  HNPAGSRARPDILLSHTHWNQRRRNLPPVQFVGTGGEGKRDTRFRHSADSSPPGPVWPFGLRSEAADSIDACGRESLLRKGEEV---RGAAGD----SHF
           + S     I +  T           V +  +G   K +       D+  P  + P  L    A  +    R   L+  ++    +G  G+     HF
Subjt:  HNPAGSRARPDILLSHTHWNQRRRNLPPVQFVGTGGEGKRDTRFRHSADSSPPGPVWPFGLRSEAADSIDACGRESLLRKGEEV---RGAAGD----SHF

Query:  DGGVDLQLSTVQTFIRMQDGSFCFSVQGV
        DG VDL+LST QTF +M DGSFCF+  GV
Subjt:  DGGVDLQLSTVQTFIRMQDGSFCFSVQGV

A0A6J1HV99 aspartic proteinase CDR1-like    1.1e-43    39.06
Query:  ITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYGGGLTLGSLA
        I PE SQF++K++VGT P EV A+LDTGSDL WTQCLPCA+CY+Q NP++DPS+S +F  L C  P CHL GSGA CS G   C Y+ GYG GLT G LA
Subjt:  ITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYGGGLTLGSLA

Query:  TETIAITSRFGQRVSFRNVVFGCGHNNSGDW-------------PIRWRTKVLP----LPDAIQHRPQNFKQP-HNRVGFGSSGPRSHHNPAGSRARPDI
        TE +A+TSR G    F+ VVFGCGHNNSG +              I + +++ P       ++   P N      + +  GS        P    A+   
Subjt:  TETIAITSRFGQRVSFRNVVFGCGHNNSGDW-------------PIRWRTKVLP----LPDAIQHRPQNFKQP-HNRVGFGSSGPRSHHNPAGSRARPDI

Query:  LLSHTHWN------QRRRNLPPVQFVGTGGEGKRDTRFRHSADSSPPGPVWP----FGLRSEAADSIDACGRESLLRKGEEVRGAAGDSHFDGGVDLQLS
        +   T+++           L P    G   +G          D+  P  + P      L +E    I +      L   + +       HFDGGVDL LS
Subjt:  LLSHTHWN------QRRRNLPPVQFVGTGGEGKRDTRFRHSADSSPPGPVWP----FGLRSEAADSIDACGRESLLRKGEEVRGAAGDSHFDGGVDLQLS

Query:  TVQTFIRMQDGSFCFSVQGV
        T+QTF +M DGSFCF+  GV
Subjt:  TVQTFIRMQDGSFCFSVQGV

A0A6J1JIJ5 aspartic proteinase CDR1-like    6.6e-44    39.22
Query:  ITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYGGGLTLGSLA
        I PE S+F+VK+++GT P EV A+LDTGSDL W QC PCA CYQQ NP++DPS+S +F  L C SP CHL GSGA CS G   C YS GYG G T G LA
Subjt:  ITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYGGGLTLGSLA

Query:  TETIAITSRFGQRVSFRNVVFGCGHNNSGDWPIRWR----------TKVLPLPDAIQHRPQNF-KQPHN---------RVGFGS--SGP--------RSH
        +E +A+TSR G    F  VVFGCGHNNSG +               + V  +  ++  R  +    P+N          +G GS   GP        R+ 
Subjt:  TETIAITSRFGQRVSFRNVVFGCGHNNSGDWPIRWR----------TKVLPLPDAIQHRPQNF-KQPHN---------RVGFGS--SGP--------RSH

Query:  HNPAGSRARPDILLSHTHWNQRRRNLPPVQFVGTGGEGKRDTRFRHSADSSPPGPVWP---FG---------LRSEAADSIDACGRESLLRKGEEVRGAA
           + S     I +        R+ L P    G   +G          D+  P  + P   +G         + S+  D    C +++L   G+ V    
Subjt:  HNPAGSRARPDILLSHTHWNQRRRNLPPVQFVGTGGEGKRDTRFRHSADSSPPGPVWP---FG---------LRSEAADSIDACGRESLLRKGEEVRGAA

Query:  GDSHFDGGVDLQLSTVQTFIRMQDGSFCFSVQGV
           HFDGGVDL+LSTVQTF +M DGSFCF+  GV
Subjt:  GDSHFDGGVDLQLSTVQTFIRMQDGSFCFSVQGV

SwissProt top hits    e value    % identity
Q3EBM5 Probable aspartic protease At2g35615    6.8e-22    33.51
Query:  PPYSPSLTTLHYFSSMLLRESRRRQWLSQPGTDPPRLSQISLLPPFLRLFSGECADDITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCY
        P Y+P +T     ++  LR   R +  +       +LSQ  L               +   + +F + +++GT P +VFA+ DTGSDL W QC PC  CY
Subjt:  PPYSPSLTTLHYFSSMLLRESRRRQWLSQPGTDPPRLSQISLLPPFLRLFSGECADDITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCY

Query:  QQINPVFDPSRSLSFWELLCWSPPCH-LWGSGAWCSGGGGPCSYSCGYGG-GLTLGSLATETIAITSRFGQRVSFRNVVFGCGHNNSG
        ++  P+FD  +S ++    C S  C  L  +   C      C Y   YG    + G +ATET++I S  G  VSF   VFGCG+NN G
Subjt:  QQINPVFDPSRSLSFWELLCWSPPCH-LWGSGAWCSGGGGPCSYSCGYGG-GLTLGSLATETIAITSRFGQRVSFRNVVFGCGHNNSG

Q6XBF8 Aspartic proteinase CDR1    2.4e-27    27.64
Query:  DITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYG-GGLTLGS
        D+T  + ++L+ +S+GT P  + A+ DTGSDL+WTQC PC  CY Q++P+FDP  S ++ ++ C S  C    + A CS     CSYS  YG    T G+
Subjt:  DITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYG-GGLTLGS

Query:  LATETIAITSRFGQRVSFRNVVFGCGHNNSGDWPIRWR----------TKVLPLPDAIQHR------PQNFKQPH-NRVGFGS----SGPRSHHNPAGSR
        +A +T+ + S   + +  +N++ GCGHNN+G +  +            + +  L D+I  +      P   K+   +++ FG+    SG      P  ++
Subjt:  LATETIAITSRFGQRVSFRNVVFGCGHNNSGDWPIRWR----------TKVLPLPDAIQHR------PQNFKQPH-NRVGFGS----SGPRSHHNPAGSR

Query:  ARPDILLSHTHWNQRRRNLPPVQFVGTGGEGKRDTRFRHSADSSPPGPVWPFG-LRSEAADSIDACGRESLLRKGEEVRGAAGD-------SHFDGGVDL
        A  +     T           +Q+ G+  E         S  +    P   +  L    A SIDA  ++           A GD        HFD G D+
Subjt:  ARPDILLSHTHWNQRRRNLPPVQFVGTGGEGKRDTRFRHSADSSPPGPVWPFG-LRSEAADSIDACGRESLLRKGEEVRGAAGD-------SHFDGGVDL

Query:  QLSTVQTFIRMQDGSFCFSVQG
        +L +   F+++ +   CF+ +G
Subjt:  QLSTVQTFIRMQDGSFCFSVQG

Q766C3 Aspartic proteinase nepenthesin-1    3.4e-21    45.24
Query:  NSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYG-GGLTLGSLATET
        + ++L+ LS+GT  +   A++DTGSDLIWTQC PC  C+ Q  P+F+P  S SF  L C S  C    S   CS     C Y+ GYG G  T GS+ TET
Subjt:  NSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYG-GGLTLGSLATET

Query:  IAITSRFGQRVSFRNVVFGCGHNNSG
        +   S     VS  N+ FGCG NN G
Subjt:  IAITSRFGQRVSFRNVVFGCGHNNSG

Q9LNJ3 Aspartyl protease family protein 2    6.1e-23    35.33
Query:  SPSLTTLHYFSSMLLRESRRRQWLSQPGTDPPRLSQISLLPPFLRLFSGECADDITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQI
        S + T    FSS L R+SRR + ++      P  +      P    FS      ++  + ++  +L VGT  R V+ VLDTGSD++W QC PC  CY Q 
Subjt:  SPSLTTLHYFSSMLLRESRRRQWLSQPGTDPPRLSQISLLPPFLRLFSGECADDITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQI

Query:  NPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYG-GGLTLGSLATETIAITSRFGQRVSFRNVVFGCGHNNSG
        +P+FDP +S ++  + C SP C    S A C+     C Y   YG G  T+G  +TET+       +R   + V  GCGH+N G
Subjt:  NPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYG-GGLTLGSLATETIAITSRFGQRVSFRNVVFGCGHNNSG

Q9LS40 Protein ASPARTIC PROTEASE IN GUARD CELL 1    3.0e-22    42.86
Query:  NSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYG-GGLTLGSLATET
        + ++  ++ VGT  +E++ VLDTGSD+ W QC PCA CYQQ +PVF+P+ S ++  L C +P C L  + A  S     C Y   YG G  T+G LAT+T
Subjt:  NSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYG-GGLTLGSLATET

Query:  IAITSRFGQRVSFRNVVFGCGHNNSG
        +     FG      NV  GCGH+N G
Subjt:  IAITSRFGQRVSFRNVVFGCGHNNSG

Arabidopsis top hits    e value    % identity
AT1G01300.1 Eukaryotic aspartyl protease family protein    4.4e-24    35.33
Query:  SPSLTTLHYFSSMLLRESRRRQWLSQPGTDPPRLSQISLLPPFLRLFSGECADDITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQI
        S + T    FSS L R+SRR + ++      P  +      P    FS      ++  + ++  +L VGT  R V+ VLDTGSD++W QC PC  CY Q 
Subjt:  SPSLTTLHYFSSMLLRESRRRQWLSQPGTDPPRLSQISLLPPFLRLFSGECADDITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQI

Query:  NPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYG-GGLTLGSLATETIAITSRFGQRVSFRNVVFGCGHNNSG
        +P+FDP +S ++  + C SP C    S A C+     C Y   YG G  T+G  +TET+       +R   + V  GCGH+N G
Subjt:  NPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYG-GGLTLGSLATETIAITSRFGQRVSFRNVVFGCGHNNSG

AT1G64830.1 Eukaryotic aspartyl protease family protein    8.0e-26    43.08
Query:  ITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYG-GGLTLGSL
        IT    ++L+ +S+GT P  + A+ DTGSDLIWTQC PC  CYQQ +P+FDP  S ++ ++ C S  C      A CS     CSY+  YG    T G +
Subjt:  ITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYG-GGLTLGSL

Query:  ATETIAITSRFGQRVSFRNVVFGCGHNNSG
        A +T+ + S   + VS RN++ GCGH N+G
Subjt:  ATETIAITSRFGQRVSFRNVVFGCGHNNSG

AT2G28010.1 Eukaryotic aspartyl protease family protein    1.7e-23    43.65
Query:  ENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGY-GGGLTLGSLATE
        +NS +L+KL VGT P E+ A++DTGS++ WTQCLPC  CY+Q  P+FDPS+S +F E  C                 G  C Y   Y     T+G+LATE
Subjt:  ENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGY-GGGLTLGSLATE

Query:  TIAITSRFGQRVSFRNVVFGCGHNNS
        TI + S  G+       + GCGHNNS
Subjt:  TIAITSRFGQRVSFRNVVFGCGHNNS

AT2G28040.1 Eukaryotic aspartyl protease family protein    1.7e-23    43.28
Query:  DITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYGG-GLTLGS
        D   +  ++L+KL +GT P E+ AVLDTGS+ IWTQCLPC  CY Q  P+FDPS+S +F E+               C      C Y   YGG   T G+
Subjt:  DITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYGG-GLTLGS

Query:  LATETIAITSRFGQRVSFRNVVFGCGHNNSGDWP
        L TET+ I S  GQ       + GCG NNSG  P
Subjt:  LATETIAITSRFGQRVSFRNVVFGCGHNNSGDWP

AT5G33340.1 Eukaryotic aspartyl protease family protein    1.7e-28    27.64
Query:  DITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYG-GGLTLGS
        D+T  + ++L+ +S+GT P  + A+ DTGSDL+WTQC PC  CY Q++P+FDP  S ++ ++ C S  C    + A CS     CSYS  YG    T G+
Subjt:  DITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDPSRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYG-GGLTLGS

Query:  LATETIAITSRFGQRVSFRNVVFGCGHNNSGDWPIRWR----------TKVLPLPDAIQHR------PQNFKQPH-NRVGFGS----SGPRSHHNPAGSR
        +A +T+ + S   + +  +N++ GCGHNN+G +  +            + +  L D+I  +      P   K+   +++ FG+    SG      P  ++
Subjt:  LATETIAITSRFGQRVSFRNVVFGCGHNNSGDWPIRWR----------TKVLPLPDAIQHR------PQNFKQPH-NRVGFGS----SGPRSHHNPAGSR

Query:  ARPDILLSHTHWNQRRRNLPPVQFVGTGGEGKRDTRFRHSADSSPPGPVWPFG-LRSEAADSIDACGRESLLRKGEEVRGAAGD-------SHFDGGVDL
        A  +     T           +Q+ G+  E         S  +    P   +  L    A SIDA  ++           A GD        HFD G D+
Subjt:  ARPDILLSHTHWNQRRRNLPPVQFVGTGGEGKRDTRFRHSADSSPPGPVWPFG-LRSEAADSIDACGRESLLRKGEEVRGAAGD-------SHFDGGVDL

Query:  QLSTVQTFIRMQDGSFCFSVQG
        +L +   F+++ +   CF+ +G
Subjt:  QLSTVQTFIRMQDGSFCFSVQG
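
In the alignments above, the unlabeled middle row between each Query and Subjt row follows the usual BLAST pairwise layout: a residue letter marks an identical position, `+` marks a conservative substitution, and a space marks a mismatch or gap. A minimal sketch of counting identities and positives over one block is given here. The function name `midline_stats` is hypothetical, and because the reported % identity is computed over the whole alignment, a single block is not expected to reproduce it.

```python
def midline_stats(midline, length):
    """Count identities and positives in one BLAST-style midline.

    `length` is the number of aligned columns in the block (the Query row
    length), since trailing spaces of the midline may have been trimmed.
    """
    padded = midline.ljust(length)
    identities = sum(ch.isalpha() for ch in padded)
    positives = identities + padded.count("+")
    return identities, positives, 100.0 * identities / length

# Last block of the first GenBank hit (XP_022140900.1), copied from the
# alignment above with the 8-character row-label column removed.
query = "QLSTVQTFIRMQDGSFCFSVQGVSGSGG"
midline = " LSTVQTFIR +DGSFCF+V G+SG+GG"
identities, positives, pct = midline_stats(midline, len(query))
print(identities, positives, round(pct, 2))
```

For this 28-column block the sketch counts 21 identities and 25 positives, which is 75% identity within the block.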


Sequences
CDS sequence
ATGCCTCCATACTCTCCTTCTCTCACTACTTTACATTACTTCTCCTCCATGCTTCTCAGAGAAAGCCGCCGCCGGCAATGGCTGTCTCAGCCTGGAACTGATCCGCCGCG
ACTCTCCCAAATCTCCCTTCTTCCGCCCTTCCTACGACTCTTCTCCGGAGAGTGCGCAGATGACATAACGCCGGAGAACAGCCAGTTTCTGGTGAAGCTCTCCGTCGGAA
CGCGGCCGAGAGAGGTCTTCGCCGTCCTCGACACCGGCAGCGACCTGATCTGGACTCAGTGTCTTCCCTGTGCGAGTTGCTACCAGCAAATCAACCCGGTTTTTGATCCT
TCGAGATCGTTGAGTTTCTGGGAGCTTTTGTGCTGGTCGCCGCCGTGCCACTTGTGGGGTTCCGGGGCGTGGTGCTCGGGCGGCGGCGGCCCCTGTAGCTACAGCTGTGG
GTATGGAGGCGGATTGACGCTGGGAAGTTTGGCCACTGAAACGATCGCCATTACTTCGAGATTTGGACAGAGAGTTTCGTTTCGGAACGTTGTGTTTGGGTGTGGACATA
ATAACAGCGGAGATTGGCCCATCCGCTGGCGGACGAAAGTGCTCCCTCTGCCTGATGCCATTCAACACCGACCCCAGAATTTCAAGCAGCCTCACAATCGGGTCGGGTTT
GGAAGTTCGGGGCCCCGGAGTCATCACAATCCCGCTGGTTCCCGCGCCCGACCCGACATTCTACTCTCTCACACTCACTGGAATCAGCGTCGGAGGAACCTTCCTCCCGT
ACAGTTCGTCGGGACCGGCGGCGAAGGGAAACGTGATACTCGATTCCGGCACTCCGCCGACTCTTCTCCCCCAGGACCGGTATGGCCGTTTGGCCTCCGAAGTGAAGCGG
CGGATTCGATTGACGCCTGTGGGAGGGAGTCTTTGCTACGGAAGGGTGAGGAGGTTCGTGGCGCCGCCGGTGACTCGCACTTCGACGGCGGAGTGGACTTGCAGCTGAGT
ACGGTTCAGACGTTCATTCGGATGCAAGATGGGTCGTTTTGCTTCTCCGTGCAAGGCGTTTCCGGCAGCGGCGGATCATCGGCAGCTTTATGCAGGCGAATTTTTTGCGA
CTGTCGGCGCCGGCGGAGTACTGAGAAGGATGAGAAGGAGAAGAGGGAAGGCGTGACTGCAGGGAGAGGAGGAATGAGGCAGATCAGCTCCATGGAAACCGAAGCTAGAT
CTGCTGAGATGGAAGGATATTCTGCCGCACCGCCGTCACTGTCGATTCTACTGGTCGTCCCTCCAAAACTCCGACTCCTGATCTCTGCTTCAAACCCTTGTCAATGGCGG
ATGCCGAGCTCCTCCCTCTGCTGCAGAGGTCCACGCATTCGGAGAACTGACTACACTCACGGCATGATTTCTAACGATGCAGTAGACTTGTGTTTGAAGATGATGAAGGA
GGGTAGAGTCGTGTATTCCGATATAATTGAAGCAACAAATGAATTTGATGATAGGTACTGCATAGGGGAAGGAGGATCAAGAAAAGTTTACAGAGTGGAAATATATGCTT
GGGGTGAGGATGTAAGAGGTGTTGTTCAGGCATTATCTTATCTATATCATAATCGTAAACTTCTGACTATAGATAGAAAGAACAATGTCTTGTTGAACTTGGAATTTGAA
GCGCATGTTGCAAATTTTGGCATTGCGAGGTTTTTGAAGCCTGACATGTCTCGATGA
mRNA sequence
ATGCCTCCATACTCTCCTTCTCTCACTACTTTACATTACTTCTCCTCCATGCTTCTCAGAGAAAGCCGCCGCCGGCAATGGCTGTCTCAGCCTGGAACTGATCCGCCGCG
ACTCTCCCAAATCTCCCTTCTTCCGCCCTTCCTACGACTCTTCTCCGGAGAGTGCGCAGATGACATAACGCCGGAGAACAGCCAGTTTCTGGTGAAGCTCTCCGTCGGAA
CGCGGCCGAGAGAGGTCTTCGCCGTCCTCGACACCGGCAGCGACCTGATCTGGACTCAGTGTCTTCCCTGTGCGAGTTGCTACCAGCAAATCAACCCGGTTTTTGATCCT
TCGAGATCGTTGAGTTTCTGGGAGCTTTTGTGCTGGTCGCCGCCGTGCCACTTGTGGGGTTCCGGGGCGTGGTGCTCGGGCGGCGGCGGCCCCTGTAGCTACAGCTGTGG
GTATGGAGGCGGATTGACGCTGGGAAGTTTGGCCACTGAAACGATCGCCATTACTTCGAGATTTGGACAGAGAGTTTCGTTTCGGAACGTTGTGTTTGGGTGTGGACATA
ATAACAGCGGAGATTGGCCCATCCGCTGGCGGACGAAAGTGCTCCCTCTGCCTGATGCCATTCAACACCGACCCCAGAATTTCAAGCAGCCTCACAATCGGGTCGGGTTT
GGAAGTTCGGGGCCCCGGAGTCATCACAATCCCGCTGGTTCCCGCGCCCGACCCGACATTCTACTCTCTCACACTCACTGGAATCAGCGTCGGAGGAACCTTCCTCCCGT
ACAGTTCGTCGGGACCGGCGGCGAAGGGAAACGTGATACTCGATTCCGGCACTCCGCCGACTCTTCTCCCCCAGGACCGGTATGGCCGTTTGGCCTCCGAAGTGAAGCGG
CGGATTCGATTGACGCCTGTGGGAGGGAGTCTTTGCTACGGAAGGGTGAGGAGGTTCGTGGCGCCGCCGGTGACTCGCACTTCGACGGCGGAGTGGACTTGCAGCTGAGT
ACGGTTCAGACGTTCATTCGGATGCAAGATGGGTCGTTTTGCTTCTCCGTGCAAGGCGTTTCCGGCAGCGGCGGATCATCGGCAGCTTTATGCAGGCGAATTTTTTGCGA
CTGTCGGCGCCGGCGGAGTACTGAGAAGGATGAGAAGGAGAAGAGGGAAGGCGTGACTGCAGGGAGAGGAGGAATGAGGCAGATCAGCTCCATGGAAACCGAAGCTAGAT
CTGCTGAGATGGAAGGATATTCTGCCGCACCGCCGTCACTGTCGATTCTACTGGTCGTCCCTCCAAAACTCCGACTCCTGATCTCTGCTTCAAACCCTTGTCAATGGCGG
ATGCCGAGCTCCTCCCTCTGCTGCAGAGGTCCACGCATTCGGAGAACTGACTACACTCACGGCATGATTTCTAACGATGCAGTAGACTTGTGTTTGAAGATGATGAAGGA
GGGTAGAGTCGTGTATTCCGATATAATTGAAGCAACAAATGAATTTGATGATAGGTACTGCATAGGGGAAGGAGGATCAAGAAAAGTTTACAGAGTGGAAATATATGCTT
GGGGTGAGGATGTAAGAGGTGTTGTTCAGGCATTATCTTATCTATATCATAATCGTAAACTTCTGACTATAGATAGAAAGAACAATGTCTTGTTGAACTTGGAATTTGAA
GCGCATGTTGCAAATTTTGGCATTGCGAGGTTTTTGAAGCCTGACATGTCTCGATGA
Protein sequence
MPPYSPSLTTLHYFSSMLLRESRRRQWLSQPGTDPPRLSQISLLPPFLRLFSGECADDITPENSQFLVKLSVGTRPREVFAVLDTGSDLIWTQCLPCASCYQQINPVFDP
SRSLSFWELLCWSPPCHLWGSGAWCSGGGGPCSYSCGYGGGLTLGSLATETIAITSRFGQRVSFRNVVFGCGHNNSGDWPIRWRTKVLPLPDAIQHRPQNFKQPHNRVGF
GSSGPRSHHNPAGSRARPDILLSHTHWNQRRRNLPPVQFVGTGGEGKRDTRFRHSADSSPPGPVWPFGLRSEAADSIDACGRESLLRKGEEVRGAAGDSHFDGGVDLQLS
TVQTFIRMQDGSFCFSVQGVSGSGGSSAALCRRIFCDCRRRRSTEKDEKEKREGVTAGRGGMRQISSMETEARSAEMEGYSAAPPSLSILLVVPPKLRLLISASNPCQWR
MPSSSLCCRGPRIRRTDYTHGMISNDAVDLCLKMMKEGRVVYSDIIEATNEFDDRYCIGEGGSRKVYRVEIYAWGEDVRGVVQALSYLYHNRKLLTIDRKNNVLLNLEFE
AHVANFGIARFLKPDMSR
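
The protein sequence above should correspond to the translation of the CDS sequence. A minimal sketch of that check under the standard genetic code follows, using only the first 30 and last 18 nucleotides of the CDS shown above. The function name `translate` is hypothetical, and this is an illustration rather than the pipeline used to produce this page.

```python
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code: codons enumerated in TCAG order map onto AMINO_ACIDS.
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

cds_start = "ATGCCTCCATACTCTCCTTCTCTCACTACT"  # first 30 nt of the CDS
cds_end = "CCTGACATGTCTCGATGA"                # last 18 nt of the CDS
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MPPYSPSLTT and PDMSR, which match the first ten and last five residues of the protein sequence. Passing the full CDS (with line breaks removed) to the same function allows the whole protein to be compared.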