; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10001610 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10001610
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionEukaryotic aspartyl protease family protein
Genome locationChr09:18646041..18648077
RNA-Seq ExpressionHG10001610
SyntenyHG10001610
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001461 - Aspartic peptidase A1 family
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033565.1 aspartic proteinase CDR1 [Cucumis melo var. makuwa]3.1e-19672.44Show/hide
Query:  MKFQDLNHRLKDIHEHDHKRYQSISTSLNRKPIEERAEAKAKAKAEAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQ
        MK QDL+ R+KDIHEHD  R++SIS S+N+K IE+     A+ +AEAEA  + E      A+ +ILPP + TPIG+KMISG+D+GSSEYFVQLKVGTP Q
Subjt:  MKFQDLNHRLKDIHEHDHKRYQSISTSLNRKPIEERAEAKAKAKAEAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQ

Query:  TFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIF
        TFMLIADTGSDLTW+KCRYRRC GNCS   +HKS+NE K RFR+A LAN SS+FKT+ CSST+C  +L++LF++ EC TPTSPC+YDYSY+GGASAKGIF
Subjt:  TFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIF

Query:  AIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGN
        A ETLTV LTNGKEKQL NSIIGCTE VQG +F GADGV+GLGTSSYS TYKAA+NANGGGFSYCLVDHL+D  A SYF+LG+P+PS S++ SS  P   
Subjt:  AIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGN

Query:  MSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKL
        MS+T+L+VGDPYSSFYGV LIGISADG MLNIPPRVWD   G GTI+DSGTSLT+LA PAFD+VME LT +LKQF+ IEI+PF FCFNNSQYTH+M PKL
Subjt:  MSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKL

Query:  GFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSECV
         FHF DGTVF+PP KSYIVSVGEFISCIG VSMPFP+ NIIGNILQQNHLWQFDF   +VGFA SEC+
Subjt:  GFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSECV

XP_004140022.2 aspartic proteinase NANA, chloroplast [Cucumis sativus]7.3e-22271.85Show/hide
Query:  MSPISHFCFFF----LFFFLSVHIAF----GDYD---------------QETVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRYQSISTSL
        MSPIS+FCFFF    LFFFLS   +F    GD D               QE +K DLLHRHHPQV+EK+HGDMK QD++ R+KDIHEHDH R++SIS S+
Subjt:  MSPISHFCFFF----LFFFLSVHIAF----GDYD---------------QETVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRYQSISTSL

Query:  NRKPIEERAEAKAKAKAEAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSS
        N+K +E+     A+ +AEAEA  E     E  A+ +ILPP + TPIG++MISG+D+GSSEYFV+LKVGTP QTFMLIADTGSDLTWMKCRYRRC GNCSS
Subjt:  NRKPIEERAEAKAKAKAEAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSS

Query:  KASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESV
          +HKS+NE K RFR+AFLAN+SSSFKT+ CSST+C  DL+DLF++ EC  PTSPC+YDYSY+GGASAKGIFA ETLTV LTNGKEKQLHNSIIGCTESV
Subjt:  KASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESV

Query:  QGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGV
        QG +F GADGV+GLGTSSYS TYKAA+NANGGGFSYCLVDHL+D  A SYF+LGIP+PS S++ SS      M++T+L+VGDPYSSFYGV LIGISA+G+
Subjt:  QGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGV

Query:  MLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCI
        MLNIP RVWDINSGGGTI+DSGTSLT+LAAPAFDMVMEALTP+LK+F+ +EI+PF+FCFNNSQYTHEM PKL FHF DGTVF+PP KSYIVSVG+FISCI
Subjt:  MLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCI

Query:  GFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSECV
        GFVSMPFPA NIIGNILQQNHLWQFDF   +VGFAPSEC+
Subjt:  GFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSECV

XP_008456273.1 PREDICTED: aspartic proteinase CDR1 [Cucumis melo]1.1e-21270.36Show/hide
Query:  MSPISHFCFFF-LFFFLSVHIAF----GDY-----------DQETVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRYQSISTSLNRKPIEE
        MSPIS+FCFFF L FFLS   +F    GD            +Q+T++ DLLHRHHPQVSEKL+GDMK QDL+ R+KDIHEHD  R++SIS S+N+K IE+
Subjt:  MSPISHFCFFF-LFFFLSVHIAF----GDY-----------DQETVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRYQSISTSLNRKPIEE

Query:  RAEAKAKAKAEAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSR
             A+ +AEAEA  + E      A+ +ILPP + TPIG+KMISG+D+GSSEYFVQLKVGTP QTFMLIADTGSDLTW+KCRYRRC GNCS   +HKS+
Subjt:  RAEAKAKAKAEAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSR

Query:  NEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRG
        NE K RFR+A LAN SS+FKT+ CSST+C  +L++LF++ EC TPTSPC+YDYSY+GGASAKGIFA ETLTV LTNGKEKQL NSIIGCTE VQG +F G
Subjt:  NEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRG

Query:  ADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPR
        ADGV+GLGTSSYS TYKAA+NANGGGFSYCLVDHL+D  A SYF+LG+P+PS S++ SS  P   MS+T+L+VGDPYSSFYGV LIGISADG MLNIPPR
Subjt:  ADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPR

Query:  VWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPF
        VWD   G GTI+DSGTSLT+LA PAFD+VME LT +LKQF+ IEI+PF FCFNNSQYTH+M PKL FHF DGTVF+PP KSYIVSVGEFISCIG VSMPF
Subjt:  VWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPF

Query:  PATNIIGNILQQNHLWQFDFFNGKVGFAPSECV
        P+ NIIGNILQQNHLWQFDF   +VGFA SEC+
Subjt:  PATNIIGNILQQNHLWQFDFFNGKVGFAPSECV

XP_022943789.1 aspartic proteinase NANA, chloroplast-like isoform X2 [Cucurbita moschata]3.6e-16857.69Show/hide
Query:  MSPISHFCFFFLF-FFLSVHIAF-GDYDQE--------TVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRYQSISTSLNRKPIEERAEAKA
        MSPIS    FF F FFLSVH+AF GD  Q+         VKLD++HRHHP V EKL+G+ +      R +DIHEHDH R +SISTS+     + +     
Subjt:  MSPISHFCFFFLF-FFLSVHIAF-GDYDQE--------TVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRYQSISTSLNRKPIEERAEAKA

Query:  KAKAEAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIR
                                LP  S  PI LK+ SG D+G++EYFVQ +VGTPPQ F+LI DTGSDLTW+KCRYRRC+GNC++ A HKSR E K++
Subjt:  KAKAEAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIR

Query:  FRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIG
        F + FLAN+SSSFK I C S  C  DL  LF+I +CQ P++PC+YDYSY GG +A G+FA ET+TV LTNGKEKQLH+++IGCTE       +G DG++G
Subjt:  FRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIG

Query:  LGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINS
        LGT ++SF ++AA + NGGGFSYCL+DHLS H+ATSYFILG P     +   SV P GNM+F  L +G P++S+YGV LIGIS DGV LNIPPRVWDI  
Subjt:  LGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINS

Query:  GGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNII
        GGGTI+DSGTSL+ML APAFD+ MEA+  KLK+F+ I   PF +CFN + Y+HEM PKL FHF  G VF+PP KSYIV V + I C+GF S+PFP TNII
Subjt:  GGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNII

Query:  GNILQQNHLWQFDFFNGKVGFAPSECV
        GNILQQN LWQFDFFN KVGFAPS+C+
Subjt:  GNILQQNHLWQFDFFNGKVGFAPSECV

XP_038901983.1 aspartic proteinase NANA, chloroplast [Benincasa hispida]3.7e-24281.57Show/hide
Query:  MSPISHFCFFFLFFFLSVHIAFGD--YDQETVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRYQSISTSLNRKPIEERAEAKAKAKAEAEA
        MSPISHFC FFLFFFLSV IAFGD  +DQE VKLDLLHRHHPQVSEKLHGD+K +++N R+KDI EHD KRYQ+IS+SLNR  ++E+   +A        
Subjt:  MSPISHFCFFFLFFFLSVHIAFGD--YDQETVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRYQSISTSLNRKPIEERAEAKAKAKAEAEA

Query:  EAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLAN
               AE A ++  LPP S TPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSS  +HK+RNE K+RFRNAFLAN
Subjt:  EAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLAN

Query:  YSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSF
        YSSSFKTI CSS +C  DL+DLFSIGECQTPTSPCLYDYSYSGGASAKG+FAIETLTV LTNGKEKQLHNSIIGCTESVQGRIF GADGVIGLGTSSYSF
Subjt:  YSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSF

Query:  TYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIP--SPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINSGGGTIV
        TYKAA+NANGGGF+YCLVDHLSD TATSYFILG P  S  +++AASSV P+GNMSFT+LF+GDPYSSFYGV L+GISADGVMLNIPPRVWDINSGGGTIV
Subjt:  TYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIP--SPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINSGGGTIV

Query:  DSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQ
        DSGTSLTMLAAPAFDMVMEAL PKLK FE IEI+PF+FCFNNS+YTHEM PKL FHF DGTVFQPP KSYIVSVGE+ISCIGFVSMPFPATNIIGNILQQ
Subjt:  DSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQ

Query:  NHLWQFDFFNGKVGFAPSECV
        NHLW+FDF  G VGFAPSECV
Subjt:  NHLWQFDFFNGKVGFAPSECV

TrEMBL top hitse value%identityAlignment
A0A0A0KG92 Peptidase A1 domain-containing protein3.6e-22271.85Show/hide
Query:  MSPISHFCFFF----LFFFLSVHIAF----GDYD---------------QETVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRYQSISTSL
        MSPIS+FCFFF    LFFFLS   +F    GD D               QE +K DLLHRHHPQV+EK+HGDMK QD++ R+KDIHEHDH R++SIS S+
Subjt:  MSPISHFCFFF----LFFFLSVHIAF----GDYD---------------QETVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRYQSISTSL

Query:  NRKPIEERAEAKAKAKAEAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSS
        N+K +E+     A+ +AEAEA  E     E  A+ +ILPP + TPIG++MISG+D+GSSEYFV+LKVGTP QTFMLIADTGSDLTWMKCRYRRC GNCSS
Subjt:  NRKPIEERAEAKAKAKAEAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSS

Query:  KASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESV
          +HKS+NE K RFR+AFLAN+SSSFKT+ CSST+C  DL+DLF++ EC  PTSPC+YDYSY+GGASAKGIFA ETLTV LTNGKEKQLHNSIIGCTESV
Subjt:  KASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESV

Query:  QGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGV
        QG +F GADGV+GLGTSSYS TYKAA+NANGGGFSYCLVDHL+D  A SYF+LGIP+PS S++ SS      M++T+L+VGDPYSSFYGV LIGISA+G+
Subjt:  QGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGV

Query:  MLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCI
        MLNIP RVWDINSGGGTI+DSGTSLT+LAAPAFDMVMEALTP+LK+F+ +EI+PF+FCFNNSQYTHEM PKL FHF DGTVF+PP KSYIVSVG+FISCI
Subjt:  MLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCI

Query:  GFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSECV
        GFVSMPFPA NIIGNILQQNHLWQFDF   +VGFAPSEC+
Subjt:  GFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSECV

A0A1S3C2F3 aspartic proteinase CDR15.1e-21370.36Show/hide
Query:  MSPISHFCFFF-LFFFLSVHIAF----GDY-----------DQETVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRYQSISTSLNRKPIEE
        MSPIS+FCFFF L FFLS   +F    GD            +Q+T++ DLLHRHHPQVSEKL+GDMK QDL+ R+KDIHEHD  R++SIS S+N+K IE+
Subjt:  MSPISHFCFFF-LFFFLSVHIAF----GDY-----------DQETVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRYQSISTSLNRKPIEE

Query:  RAEAKAKAKAEAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSR
             A+ +AEAEA  + E      A+ +ILPP + TPIG+KMISG+D+GSSEYFVQLKVGTP QTFMLIADTGSDLTW+KCRYRRC GNCS   +HKS+
Subjt:  RAEAKAKAKAEAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSR

Query:  NEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRG
        NE K RFR+A LAN SS+FKT+ CSST+C  +L++LF++ EC TPTSPC+YDYSY+GGASAKGIFA ETLTV LTNGKEKQL NSIIGCTE VQG +F G
Subjt:  NEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRG

Query:  ADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPR
        ADGV+GLGTSSYS TYKAA+NANGGGFSYCLVDHL+D  A SYF+LG+P+PS S++ SS  P   MS+T+L+VGDPYSSFYGV LIGISADG MLNIPPR
Subjt:  ADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPR

Query:  VWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPF
        VWD   G GTI+DSGTSLT+LA PAFD+VME LT +LKQF+ IEI+PF FCFNNSQYTH+M PKL FHF DGTVF+PP KSYIVSVGEFISCIG VSMPF
Subjt:  VWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPF

Query:  PATNIIGNILQQNHLWQFDFFNGKVGFAPSECV
        P+ NIIGNILQQNHLWQFDF   +VGFA SEC+
Subjt:  PATNIIGNILQQNHLWQFDFFNGKVGFAPSECV

A0A5D3B701 Aspartic proteinase CDR11.5e-19672.44Show/hide
Query:  MKFQDLNHRLKDIHEHDHKRYQSISTSLNRKPIEERAEAKAKAKAEAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQ
        MK QDL+ R+KDIHEHD  R++SIS S+N+K IE+     A+ +AEAEA  + E      A+ +ILPP + TPIG+KMISG+D+GSSEYFVQLKVGTP Q
Subjt:  MKFQDLNHRLKDIHEHDHKRYQSISTSLNRKPIEERAEAKAKAKAEAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQ

Query:  TFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIF
        TFMLIADTGSDLTW+KCRYRRC GNCS   +HKS+NE K RFR+A LAN SS+FKT+ CSST+C  +L++LF++ EC TPTSPC+YDYSY+GGASAKGIF
Subjt:  TFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIF

Query:  AIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGN
        A ETLTV LTNGKEKQL NSIIGCTE VQG +F GADGV+GLGTSSYS TYKAA+NANGGGFSYCLVDHL+D  A SYF+LG+P+PS S++ SS  P   
Subjt:  AIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGN

Query:  MSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKL
        MS+T+L+VGDPYSSFYGV LIGISADG MLNIPPRVWD   G GTI+DSGTSLT+LA PAFD+VME LT +LKQF+ IEI+PF FCFNNSQYTH+M PKL
Subjt:  MSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKL

Query:  GFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSECV
         FHF DGTVF+PP KSYIVSVGEFISCIG VSMPFP+ NIIGNILQQNHLWQFDF   +VGFA SEC+
Subjt:  GFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSECV

A0A6J1FVB3 aspartic proteinase NANA, chloroplast-like isoform X21.7e-16857.69Show/hide
Query:  MSPISHFCFFFLF-FFLSVHIAF-GDYDQE--------TVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRYQSISTSLNRKPIEERAEAKA
        MSPIS    FF F FFLSVH+AF GD  Q+         VKLD++HRHHP V EKL+G+ +      R +DIHEHDH R +SISTS+     + +     
Subjt:  MSPISHFCFFFLF-FFLSVHIAF-GDYDQE--------TVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRYQSISTSLNRKPIEERAEAKA

Query:  KAKAEAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIR
                                LP  S  PI LK+ SG D+G++EYFVQ +VGTPPQ F+LI DTGSDLTW+KCRYRRC+GNC++ A HKSR E K++
Subjt:  KAKAEAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIR

Query:  FRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIG
        F + FLAN+SSSFK I C S  C  DL  LF+I +CQ P++PC+YDYSY GG +A G+FA ET+TV LTNGKEKQLH+++IGCTE       +G DG++G
Subjt:  FRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIG

Query:  LGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINS
        LGT ++SF ++AA + NGGGFSYCL+DHLS H+ATSYFILG P     +   SV P GNM+F  L +G P++S+YGV LIGIS DGV LNIPPRVWDI  
Subjt:  LGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINS

Query:  GGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNII
        GGGTI+DSGTSL+ML APAFD+ MEA+  KLK+F+ I   PF +CFN + Y+HEM PKL FHF  G VF+PP KSYIV V + I C+GF S+PFP TNII
Subjt:  GGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNII

Query:  GNILQQNHLWQFDFFNGKVGFAPSECV
        GNILQQN LWQFDFFN KVGFAPS+C+
Subjt:  GNILQQNHLWQFDFFNGKVGFAPSECV

A0A6J1FXD5 aspartic proteinase NANA, chloroplast-like isoform X11.7e-16857.69Show/hide
Query:  MSPISHFCFFFLF-FFLSVHIAF-GDYDQE--------TVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRYQSISTSLNRKPIEERAEAKA
        MSPIS    FF F FFLSVH+AF GD  Q+         VKLD++HRHHP V EKL+G+ +      R +DIHEHDH R +SISTS+     + +     
Subjt:  MSPISHFCFFFLF-FFLSVHIAF-GDYDQE--------TVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRYQSISTSLNRKPIEERAEAKA

Query:  KAKAEAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIR
                                LP  S  PI LK+ SG D+G++EYFVQ +VGTPPQ F+LI DTGSDLTW+KCRYRRC+GNC++ A HKSR E K++
Subjt:  KAKAEAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIR

Query:  FRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIG
        F + FLAN+SSSFK I C S  C  DL  LF+I +CQ P++PC+YDYSY GG +A G+FA ET+TV LTNGKEKQLH+++IGCTE       +G DG++G
Subjt:  FRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIG

Query:  LGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINS
        LGT ++SF ++AA + NGGGFSYCL+DHLS H+ATSYFILG P     +   SV P GNM+F  L +G P++S+YGV LIGIS DGV LNIPPRVWDI  
Subjt:  LGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINS

Query:  GGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNII
        GGGTI+DSGTSL+ML APAFD+ MEA+  KLK+F+ I   PF +CFN + Y+HEM PKL FHF  G VF+PP KSYIV V + I C+GF S+PFP TNII
Subjt:  GGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNII

Query:  GNILQQNHLWQFDFFNGKVGFAPSECV
        GNILQQN LWQFDFFN KVGFAPS+C+
Subjt:  GNILQQNHLWQFDFFNGKVGFAPSECV

SwissProt top hitse value%identityAlignment
Q8S9J6 Aspartyl protease family protein At5g107701.0e-3229.41Show/hide
Query:  GSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTP
        GS  GS  Y V + +GTP     LI DTGSDLTW +C  + C+  C  +                F  + S+S+  + CSS  C    S   + G C   
Subjt:  GSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTP

Query:  TSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFI
         S C+Y   Y   + + G  A E  T  LTN           GC E+ QG +F G  G++GLG    SF  + A  A    FSYCL    S     ++  
Subjt:  TSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFI

Query:  LGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIE-
         GI              S ++ FT +      +SFYG++++ I+  G  L IP  V+   S  G ++DSGT +T L   A+  +  +   K+ ++     
Subjt:  LGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIE-

Query:  IQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKS--YIVSVGEFISCIGFVSMPFPA-TNIIGNILQQNHLWQFDFFNGKVGFAPSEC
        +   + CF+ S +    +PK+ F F  G V +  +K   Y+  + +   C+ F      +   I GN+ QQ     +D   G+VGFAP+ C
Subjt:  IQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKS--YIVSVGEFISCIGFVSMPFPA-TNIIGNILQQNHLWQFDFFNGKVGFAPSEC

Q9LHE3 Protein ASPARTIC PROTEASE IN GUARD CELL 21.1e-3930.83Show/hide
Query:  GLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSI
        G  ++SG D GS EYFV++ VG+PP+   ++ D+GSD+ W++C+          K  +K  +         F    S S+  + C S++C +       I
Subjt:  GLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSI

Query:  GECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHT
              +  C Y+  Y  G+  KG  A+ETLT   T      + N  +GC    +G +F GA G++G+G  S SF  + +    GG F YCLV   +D T
Subjt:  GECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHT

Query:  ATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDI--NSGGGTIVDSGTSLTML---AAPAFDMVMEALT
         +  F                LP G  S+  L       SFY V L G+   GV + +P  V+D+     GG ++D+GT++T L   A  AF    ++ T
Subjt:  ATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDI--NSGGGTIVDSGTSLTML---AAPAFDMVMEALT

Query:  PKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEF-ISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSEC
          L +   + I  F+ C++ S +    VP + F+F +G V   PA+++++ V +    C  F + P    +IIGNI Q+     FD  NG VGF P+ C
Subjt:  PKLKQFEIIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEF-ISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSEC

Q9LNJ3 Aspartyl protease family protein 21.9e-3932.85Show/hide
Query:  SPTPIGL--KMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCR-YRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKK
        +P P G    ++SG   GS EYF +L VGTP +   ++ DTGSD+ W++C   RRC                       F    S ++ TI CSS  C++
Subjt:  SPTPIGL--KMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCR-YRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKK

Query:  DLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCL
            L S G C T    CLY  SY  G+   G F+ ETLT      +  ++    +GC    +G +F GA G++GLG    SF  +     N   FSYCL
Subjt:  DLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCL

Query:  VDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMS------FTRLFVGDPYSSFYGVHLIGISADGVML-NIPPRVWDIN--SGGGTIVDSGTSLTMLA
        VD                  SASS  SSV+  GN +      FT L       +FY V L+GIS  G  +  +   ++ ++    GG I+DSGTS+T L 
Subjt:  VDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMS------FTRLFVGDPYSSFYGVHLIGISADGVML-NIPPRVWDIN--SGGGTIVDSGTSLTMLA

Query:  APAFDMVMEALTPKLKQFE-IIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSV---GEFISCIGFVSMPFPATNIIGNILQQNHLWQF
         PA+  + +A     K  +   +   F+ CF+ S      VP +  HFR G     PA +Y++ V   G+F  C  F        +IIGNI QQ     +
Subjt:  APAFDMVMEALTPKLKQFE-IIEIQPFEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSV---GEFISCIGFVSMPFPATNIIGNILQQNHLWQF

Query:  DFFNGKVGFAPSEC
        D  + +VGFAP  C
Subjt:  DFFNGKVGFAPSEC

Q9LS40 Protein ASPARTIC PROTEASE IN GUARD CELL 11.3e-3227.22Show/hide
Query:  EKLHGDMKFQDLNHRLKDIHEHD------HKRYQSISTSLNRKPIEERAEAKAKAKAEAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSE
        E L   + F   +    ++H  D      HK Y+S++ S   +     A   AK +   E    ++ +               TP+    +SG+  GS E
Subjt:  EKLHGDMKFQDLNHRLKDIHEHD------HKRYQSISTSLNRKPIEERAEAKAKAKAEAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSE

Query:  YFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDY
        YF ++ VGTP +   L+ DTGSD+ W++C       +C  ++               F    SS++K++ CS+  C      L     C+  ++ CLY  
Subjt:  YFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDY

Query:  SYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSA
        SY  G+   G  A +T+T     G   +++N  +GC    +G +F GA G++GLG    S T           FSYCLVD  S               S+
Subjt:  SYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSA

Query:  SSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINS--GGGTIVDSGTSLTMLAAPAFDMVMEA---LTPKLKQFEIIEIQPF
        S   +SV   G  +   L       +FY V L G S  G  + +P  ++D+++   GG I+D GT++T L   A++ + +A   LT  LK+     I  F
Subjt:  SSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINS--GGGTIVDSGTSLTMLAAPAFDMVMEA---LTPKLKQFEIIEIQPF

Query:  EFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEF-ISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSEC
        + C++ S  +   VP + FHF  G     PAK+Y++ V +    C  F      + +IIGN+ QQ     +D     +G + ++C
Subjt:  EFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEF-ISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSEC

Q9LTW4 Aspartic proteinase NANA, chloroplast1.1e-9043.31Show/hide
Query:  RESILPPTSPTPIGLKMI--SGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHC
        R S++     + +G+KM   SG DYG+++YF +++VGTP + F ++ DTGS+LTW+ CRYR                 GK   R  F A+ S SFKT+ C
Subjt:  RESILPPTSPTPIGLKMI--SGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHC

Query:  SSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANG
         +  CK DL +LFS+  C TP++PC YDY Y+ G++A+G+FA ET+TV LTNG+  +L   +IGC+ S  G+ F+GADGV+GL  S +SFT   A +  G
Subjt:  SSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANG

Query:  GGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAP
          FSYCLVDHLS+   ++Y I G    S+ S  ++   +  +  TR+        FY +++IGIS    ML+IP +VWD  SGGGTI+DSGTSLT+LA  
Subjt:  GGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAP

Query:  AFDMVMEALTPKLKQFEIIEIQ--PFEFCFN-NSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFF
        A+  V+  L   L + + ++ +  P E+CF+  S +    +P+L FH + G  F+P  KSY+V     + C+GFVS   PATN+IGNI+QQN+LW+FD  
Subjt:  AFDMVMEALTPKLKQFEIIEIQ--PFEFCFN-NSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFF

Query:  NGKVGFAPSEC
           + FAPS C
Subjt:  NGKVGFAPSEC

Arabidopsis top hitse value%identityAlignment
AT2G42980.1 Eukaryotic aspartyl protease family protein3.0e-4830.66Show/hide
Query:  HRLKDIHEHDHKRYQSISTSLNRKPIEERAEAKAKAKAEAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIAD
        H + D+   D  R +++    N+   ++  + + K  ++                    P  SP  +   + SG   GS EYF+ + VGTPP+ F LI D
Subjt:  HRLKDIHEHDHKRYQSISTSLNRKPIEERAEAKAKAKAEAEAEAEAEAEAEAAARESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIAD

Query:  TGSDLTWMKC-RYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLT
        TGSDL W++C     C          K+                S+SFK I C+   C   +S      +C++    C Y Y Y   ++  G FA+ET T
Subjt:  TGSDLTWMKC-RYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLT

Query:  VNLT----NGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMS
        VNLT       E ++ N + GC    +G +F GA G++GLG    SF+    Q+  G  FSYCLVD  S+   +S  I G            +L   N++
Subjt:  VNLT----NGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMS

Query:  FTRLFVGDPYS--SFYGVHLIGISADGVMLNIPPRVWDINS--GGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQ-FEIIEIQP-FEFCFNNS--QYTH
        FT    G   S  +FY + +  I   G  L+IP   W+I+S   GGTI+DSGT+L+  A PA++++      K+K+ + I    P  + CFN S  +  +
Subjt:  FTRLFVGDPYS--SFYGVHLIGISADGVMLNIPPRVWDINS--GGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQ-FEIIEIQP-FEFCFNNS--QYTH

Query:  EMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSEC
          +P+LG  F DGTV+  PA++  + + E + C+  +  P    +IIGN  QQN    +D    ++GF P++C
Subjt:  EMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSEC

AT3G12700.1 Eukaryotic aspartyl protease family protein7.6e-9243.31Show/hide
Query:  RESILPPTSPTPIGLKMI--SGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHC
        R S++     + +G+KM   SG DYG+++YF +++VGTP + F ++ DTGS+LTW+ CRYR                 GK   R  F A+ S SFKT+ C
Subjt:  RESILPPTSPTPIGLKMI--SGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHC

Query:  SSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANG
         +  CK DL +LFS+  C TP++PC YDY Y+ G++A+G+FA ET+TV LTNG+  +L   +IGC+ S  G+ F+GADGV+GL  S +SFT   A +  G
Subjt:  SSTLCKKDLSDLFSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANG

Query:  GGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAP
          FSYCLVDHLS+   ++Y I G    S+ S  ++   +  +  TR+        FY +++IGIS    ML+IP +VWD  SGGGTI+DSGTSLT+LA  
Subjt:  GGFSYCLVDHLSDHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAP

Query:  AFDMVMEALTPKLKQFEIIEIQ--PFEFCFN-NSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFF
        A+  V+  L   L + + ++ +  P E+CF+  S +    +P+L FH + G  F+P  KSY+V     + C+GFVS   PATN+IGNI+QQN+LW+FD  
Subjt:  AFDMVMEALTPKLKQFEIIEIQ--PFEFCFN-NSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFF

Query:  NGKVGFAPSEC
           + FAPS C
Subjt:  NGKVGFAPSEC

AT3G25700.1 Eukaryotic aspartyl protease family protein6.5e-5935.57Show/hide
Query:  MISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCK-KDLSDLFSIGE
        ++SG+  GS +YFV L++G PPQ+ +LIADTGSDL W+KC   R   NC    SH S           F   +SS+F   HC   +C+     D   I  
Subjt:  MISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCK-KDLSDLFSIGE

Query:  CQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGC-----TESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLS
             S C Y+Y Y+ G+   G+FA ET ++  ++GKE +L +   GC      +SV G  F GA+GV+GLG    SF  +  +   G  FSYCL+D+  
Subjt:  CQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGC-----TESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLS

Query:  DHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDI--NSGGGTIVDSGTSLTMLAAPAFDMVMEALT
            TSY I+G      S           + FT L       +FY V L  +  +G  L I P +W+I  +  GGT+VDSGT+L  LA PA+  V+ A+ 
Subjt:  DHTATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDI--NSGGGTIVDSGTSLTMLAAPAFDMVMEALT

Query:  PKLKQFEIIEIQP-FEFCFNNSQYT--HEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSM-PFPATNIIGNILQQNHLWQFDFFNGKVGFAPS
         ++K      + P F+ C N S  T   +++P+L F F  G VF PP ++Y +   E I C+   S+ P    ++IGN++QQ  L++FD    ++GF+  
Subjt:  PKLKQFEIIEIQP-FEFCFNNSQYT--HEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSM-PFPATNIIGNILQQNHLWQFDFFNGKVGFAPS

Query:  EC
         C
Subjt:  EC

AT3G59080.1 Eukaryotic aspartyl protease family protein1.7e-5134.59Show/hide
Query:  SGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFL-ANYSSSFKTIHCSSTLCKKDLSDLFSIGECQ
        SG   GS EYF+ + VG+PP+ F LI DTGSDL W++C    C  +C  +               AF     S+S+K I C+   C   +S       C+
Subjt:  SGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFL-ANYSSSFKTIHCSSTLCKKDLSDLFSIGECQ

Query:  TPTSPCLYDYSYSGGASAKGIFAIETLTVNL-TNGKEKQLH---NSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHT
        +    C Y Y Y   ++  G FA+ET TVNL TNG   +L+   N + GC    +G +F GA G++GLG    SF+    Q+  G  FSYCLVD  SD  
Subjt:  TPTSPCLYDYSYSGGASAKGIFAIETLTVNL-TNGKEKQLH---NSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHT

Query:  ATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVG--DPYSSFYGVHLIGISADGVMLNIPPRVWDINS--GGGTIVDSGTSLTMLAAPAFDMVMEALTP
         +S  I G            +L   N++FT    G  +   +FY V +  I   G +LNIP   W+I+S   GGTI+DSGT+L+  A PA++ +   +  
Subjt:  ATSYFILGIPSPSASSAASSVLPSGNMSFTRLFVG--DPYSSFYGVHLIGISADGVMLNIPPRVWDINS--GGGTIVDSGTSLTMLAAPAFDMVMEALTP

Query:  KLK-QFEIIEIQP-FEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSEC
        K K ++ +    P  + CFN S   +  +P+LG  F DG V+  P ++  + + E + C+  +  P  A +IIGN  QQN    +D    ++G+AP++C
Subjt:  KLK-QFEIIEIQP-FEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSEC

AT3G59080.2 Eukaryotic aspartyl protease family protein4.4e-4732.66Show/hide
Query:  SGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQT
        SG   GS EYF+ + VG+PP+ F LI DTGSDL W++C                                       + C          D F     Q 
Subjt:  SGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDLFSIGECQT

Query:  PTSPCLYDYSYSGGASAKGIFAIETLTVNL-TNGKEKQLH---NSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTA
            C Y Y Y   ++  G FA+ET TVNL TNG   +L+   N + GC    +G +F GA G++GLG    SF+    Q+  G  FSYCLVD  SD   
Subjt:  PTSPCLYDYSYSGGASAKGIFAIETLTVNL-TNGKEKQLH---NSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTA

Query:  TSYFILGIPSPSASSAASSVLPSGNMSFTRLFVG--DPYSSFYGVHLIGISADGVMLNIPPRVWDINS--GGGTIVDSGTSLTMLAAPAFDMVMEALTPK
        +S  I G            +L   N++FT    G  +   +FY V +  I   G +LNIP   W+I+S   GGTI+DSGT+L+  A PA++ +   +  K
Subjt:  TSYFILGIPSPSASSAASSVLPSGNMSFTRLFVG--DPYSSFYGVHLIGISADGVMLNIPPRVWDINS--GGGTIVDSGTSLTMLAAPAFDMVMEALTPK

Query:  LK-QFEIIEIQP-FEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSEC
         K ++ +    P  + CFN S   +  +P+LG  F DG V+  P ++  + + E + C+  +  P  A +IIGN  QQN    +D    ++G+AP++C
Subjt:  LK-QFEIIEIQP-FEFCFNNSQYTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSEC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGCCTATTTCACATTTTTGTTTCTTCTTCCTTTTCTTCTTCCTCTCTGTTCACATCGCATTCGGCGACTATGATCAAGAAACTGTAAAACTCGATCTACTTCACCG
TCACCATCCACAAGTCTCCGAGAAGCTTCACGGTGATATGAAATTTCAAGATCTAAATCATCGCCTCAAGGATATTCACGAACACGACCACAAACGTTATCAATCGATCT
CCACGTCGTTGAATCGGAAGCCAATTGAGGAGAGGGCTGAGGCTAAGGCTAAGGCTAAGGCGGAGGCTGAGGCGGAGGCGGAGGCTGAGGCTGAGGCTGAGGCTGCAGCG
AGGGAATCGATACTTCCACCGACATCACCAACGCCGATAGGGCTGAAAATGATATCAGGTTCTGATTATGGAAGTAGTGAGTATTTTGTTCAATTGAAAGTCGGAACGCC
GCCGCAGACGTTTATGTTGATCGCCGATACTGGAAGTGATCTAACGTGGATGAAATGTAGATATCGGAGATGTATCGGAAATTGTAGCAGCAAAGCAAGTCATAAGAGCC
GAAACGAAGGAAAAATTAGATTTAGAAATGCATTTTTGGCGAATTATTCATCATCTTTTAAGACGATTCATTGCAGCTCGACGTTGTGTAAGAAGGATCTTTCGGATCTG
TTCTCAATTGGAGAATGCCAAACCCCAACCAGCCCTTGTCTATATGATTACAGCTACTCAGGAGGAGCAAGTGCAAAGGGAATATTCGCAATTGAGACCCTAACCGTAAA
CCTAACAAATGGAAAAGAAAAACAACTTCACAATTCTATAATCGGCTGCACCGAATCAGTCCAAGGCAGGATCTTCCGCGGAGCCGACGGCGTCATTGGCTTAGGCACTA
GCTCCTACTCTTTCACCTACAAAGCCGCCCAAAACGCCAACGGCGGCGGCTTCTCTTACTGCCTTGTCGACCATCTCAGCGACCACACCGCCACCAGCTACTTCATCCTC
GGCATCCCTTCCCCTTCTGCTTCCTCTGCCGCCTCCTCCGTCCTCCCTTCCGGCAACATGTCCTTCACCAGACTCTTCGTCGGCGACCCTTACAGCAGCTTCTATGGCGT
CCATCTCATCGGAATCTCCGCCGACGGCGTCATGCTCAACATTCCTCCTCGCGTTTGGGACATCAATTCTGGCGGCGGAACCATCGTCGACTCCGGAACTAGCCTCACCA
TGCTGGCGGCGCCGGCGTTTGATATGGTCATGGAAGCTCTGACTCCGAAGCTGAAGCAATTCGAGATAATTGAAATCCAACCCTTCGAATTTTGCTTCAATAACAGCCAG
TACACTCATGAAATGGTCCCGAAGCTCGGATTCCATTTCCGCGACGGCACGGTGTTTCAGCCGCCGGCAAAAAGCTACATTGTTTCGGTGGGTGAATTCATTAGCTGTAT
TGGGTTCGTTTCTATGCCTTTCCCGGCCACCAATATCATTGGGAATATTCTTCAGCAGAATCACCTTTGGCAATTTGATTTCTTTAACGGAAAAGTCGGTTTTGCCCCCT
CTGAATGCGTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCGCCTATTTCACATTTTTGTTTCTTCTTCCTTTTCTTCTTCCTCTCTGTTCACATCGCATTCGGCGACTATGATCAAGAAACTGTAAAACTCGATCTACTTCACCG
TCACCATCCACAAGTCTCCGAGAAGCTTCACGGTGATATGAAATTTCAAGATCTAAATCATCGCCTCAAGGATATTCACGAACACGACCACAAACGTTATCAATCGATCT
CCACGTCGTTGAATCGGAAGCCAATTGAGGAGAGGGCTGAGGCTAAGGCTAAGGCTAAGGCGGAGGCTGAGGCGGAGGCGGAGGCTGAGGCTGAGGCTGAGGCTGCAGCG
AGGGAATCGATACTTCCACCGACATCACCAACGCCGATAGGGCTGAAAATGATATCAGGTTCTGATTATGGAAGTAGTGAGTATTTTGTTCAATTGAAAGTCGGAACGCC
GCCGCAGACGTTTATGTTGATCGCCGATACTGGAAGTGATCTAACGTGGATGAAATGTAGATATCGGAGATGTATCGGAAATTGTAGCAGCAAAGCAAGTCATAAGAGCC
GAAACGAAGGAAAAATTAGATTTAGAAATGCATTTTTGGCGAATTATTCATCATCTTTTAAGACGATTCATTGCAGCTCGACGTTGTGTAAGAAGGATCTTTCGGATCTG
TTCTCAATTGGAGAATGCCAAACCCCAACCAGCCCTTGTCTATATGATTACAGCTACTCAGGAGGAGCAAGTGCAAAGGGAATATTCGCAATTGAGACCCTAACCGTAAA
CCTAACAAATGGAAAAGAAAAACAACTTCACAATTCTATAATCGGCTGCACCGAATCAGTCCAAGGCAGGATCTTCCGCGGAGCCGACGGCGTCATTGGCTTAGGCACTA
GCTCCTACTCTTTCACCTACAAAGCCGCCCAAAACGCCAACGGCGGCGGCTTCTCTTACTGCCTTGTCGACCATCTCAGCGACCACACCGCCACCAGCTACTTCATCCTC
GGCATCCCTTCCCCTTCTGCTTCCTCTGCCGCCTCCTCCGTCCTCCCTTCCGGCAACATGTCCTTCACCAGACTCTTCGTCGGCGACCCTTACAGCAGCTTCTATGGCGT
CCATCTCATCGGAATCTCCGCCGACGGCGTCATGCTCAACATTCCTCCTCGCGTTTGGGACATCAATTCTGGCGGCGGAACCATCGTCGACTCCGGAACTAGCCTCACCA
TGCTGGCGGCGCCGGCGTTTGATATGGTCATGGAAGCTCTGACTCCGAAGCTGAAGCAATTCGAGATAATTGAAATCCAACCCTTCGAATTTTGCTTCAATAACAGCCAG
TACACTCATGAAATGGTCCCGAAGCTCGGATTCCATTTCCGCGACGGCACGGTGTTTCAGCCGCCGGCAAAAAGCTACATTGTTTCGGTGGGTGAATTCATTAGCTGTAT
TGGGTTCGTTTCTATGCCTTTCCCGGCCACCAATATCATTGGGAATATTCTTCAGCAGAATCACCTTTGGCAATTTGATTTCTTTAACGGAAAAGTCGGTTTTGCCCCCT
CTGAATGCGTTTAA
Protein sequenceShow/hide protein sequence
MSPISHFCFFFLFFFLSVHIAFGDYDQETVKLDLLHRHHPQVSEKLHGDMKFQDLNHRLKDIHEHDHKRYQSISTSLNRKPIEERAEAKAKAKAEAEAEAEAEAEAEAAA
RESILPPTSPTPIGLKMISGSDYGSSEYFVQLKVGTPPQTFMLIADTGSDLTWMKCRYRRCIGNCSSKASHKSRNEGKIRFRNAFLANYSSSFKTIHCSSTLCKKDLSDL
FSIGECQTPTSPCLYDYSYSGGASAKGIFAIETLTVNLTNGKEKQLHNSIIGCTESVQGRIFRGADGVIGLGTSSYSFTYKAAQNANGGGFSYCLVDHLSDHTATSYFIL
GIPSPSASSAASSVLPSGNMSFTRLFVGDPYSSFYGVHLIGISADGVMLNIPPRVWDINSGGGTIVDSGTSLTMLAAPAFDMVMEALTPKLKQFEIIEIQPFEFCFNNSQ
YTHEMVPKLGFHFRDGTVFQPPAKSYIVSVGEFISCIGFVSMPFPATNIIGNILQQNHLWQFDFFNGKVGFAPSECV