; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10016937 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10016937
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionEukaryotic aspartyl protease family protein
Genome locationChr03:9493812..9495203
RNA-Seq ExpressionHG10016937
SyntenyHG10016937
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001461 - Aspartic peptidase A1 family
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7034471.1 Aspartic proteinase nepenthesin-1, partial [Cucurbita argyrosperma subsp. argyrosperma]1.6e-22383.37Show/hide
Query:  MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSF
        MEFF IPFLLSI LLL  SSSS  +T+TLPLT FPS      PWK I +L+SASL RAQHLK P+TKSN+SIQNV+LFPRSYGAYSISLAFGTPPQ+LS 
Subjt:  MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSF

Query:  IFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGATAGFLLSETL
        +FDTGSS+VWFPCTA Y CSNCSFPNVDAATIPKF+PKLSSSAKIIGCRN KC+WIFGPNLK+ CRSCSP+SR C D+CPGYGIQYGSGATAGFLLSETL
Subjt:  IFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGATAGFLLSETL

Query:  DFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMRLKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYL
        DFP+KRVPDFLVGCSV+SVHQPAGIAGFGRGPESLPSQM LKRFS+CLV R+FDDSPVSSPLVL+S SES ESK+ SLIYAPFR+NPSGSNAAFREYYYL
Subjt:  DFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMRLKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYL

Query:  SLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKLSL
        +LRRI+IG KPVKFPYKYLVP+SAGNGGAIIDSGSTFTFLDKPIFEA++EELEKQLVKYPR KGVE +SGLRPCFDISKE+  EFPEL+LKFKGGA L+L
Subjt:  SLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKLSL

Query:  PPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT
        PP+NYLALV D  VVCLTM+TDV  +GGGGGPAIIFGAFQQQNVLV+YDLA++RIGFRKQRCT
Subjt:  PPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT

XP_022925946.1 probable aspartyl protease At4g16563 [Cucurbita moschata]9.2e-22483.37Show/hide
Query:  MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSF
        MEFF IPFLLSI LLL  SSSS  +T+TLPLT FPS      PWK I +L+SASL RAQHLK P+TKSN+SIQNV+LFPRSYGAYSISLAFGTPPQ+LS 
Subjt:  MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSF

Query:  IFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGATAGFLLSETL
        +FDTGSS+VWFPCTA Y CSNCSFPNVDAATIPKF+PKLSSSAKIIGCRN KC+WIFGPNLK+ CRSCSP+SR C D+CPGYGIQYGSGATAGFLLSETL
Subjt:  IFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGATAGFLLSETL

Query:  DFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMRLKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYL
        DFP+KRVPDFLVGCSV+SVHQPAGIAGFGRGPESLPSQM LKRFS+CLV R+FDDSPVSSPLVL+S SES ESK+ SLIYAPFR+NPSGSNAAFREYYYL
Subjt:  DFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMRLKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYL

Query:  SLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKLSL
        +LRRI+IG KPVKFPYKYLVP+SAGNGGAIIDSGSTFTFLDKPIFEA++EELEKQLVKYPR KGVE +SGLRPCFDISKE+  EFPEL+LKFKGGA L+L
Subjt:  SLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKLSL

Query:  PPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT
        PP+NYLALV D  VVCLTM+TDV  +GGGGGPAIIFGAFQQQNVLV+YDLA++RIGFRKQRCT
Subjt:  PPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT

XP_022979057.1 probable aspartyl protease At4g16563 [Cucurbita maxima]7.5e-22684.45Show/hide
Query:  MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSF
        MEFFPI FLLSI LLL  SSSS   T+TLPLTAFPS  LT  PWK I +L+SASL RAQHLK P+TKSN+SIQNV+LFPRSYGAYSISLAFGTPPQ+LS 
Subjt:  MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSF

Query:  IFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGATAGFLLSETL
        +FDTGSS+VWFPCTA Y CSNCSFPNVDAATIPKF+PKLSSSA+IIGCRN KC+WIFGPNLKS CRSCSP+SR C D+CPGYGIQYGSGATAGFLLSETL
Subjt:  IFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGATAGFLLSETL

Query:  DFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMRLKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYL
        DFP+KRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQM LKRFS+CLV R+FDDSPVSSPLVL+S  ES +SK+ SLIYAPFR+NPSGSNAAFREYYYL
Subjt:  DFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMRLKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYL

Query:  SLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKLSL
        +LRRI+IG KPVKFPYKYLVP+SAGNGGAIIDSGSTFTFLDKPIFEA++EELEKQLVKYPR KGVE +SGLRPCFDISKE+  EFPEL+LKFKGGA L+L
Subjt:  SLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKLSL

Query:  PPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT
        PPANYLALVTD GVVCLTM+TDV  +GGGGGPAIIFGAFQQQNVLV+YDLA++RIGFRKQRCT
Subjt:  PPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT

XP_023543736.1 probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo]3.0e-22784.67Show/hide
Query:  MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSF
        MEFFPIPFLLSI LLL  SSSS  +T+TLPLT FPS   T  PWK I +L+SASL RAQHLK P+ KSN+SIQNV+LFPRSYGAYSISLAFGTPPQ+LS 
Subjt:  MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSF

Query:  IFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGATAGFLLSETL
        +FDTGSS+VWFPCTA Y CSNCSFPNVDAATIPKF+PKLSSSAKIIGCRN KC+WIFGPNLKS CRSCSP+SR C D+CPGYGIQYGSGATAGFLLSETL
Subjt:  IFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGATAGFLLSETL

Query:  DFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMRLKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYL
        DFP+KRVPDFLVGCSV+SVHQPAGIAGFGRGPESLPSQM LKRFS+CLV R+FDDSPVSSPLVL+S SES ESK+ SLIYAPFR+NPSGSNAAFREYYYL
Subjt:  DFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMRLKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYL

Query:  SLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKLSL
        +LRRI+IG KPVKFPYKYLVP+SAGNGGAIIDSGSTFTFLDKPIFEA++EELEKQLVKYPR KGVE +SGLRPCFDISKE+  EFPEL+LKFKGGA L+L
Subjt:  SLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKLSL

Query:  PPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT
        PPANYLALVTD GVVCLTM+TDV  +GGGGGPAIIFGAFQQQNVLV+YDLA+DRIGFRKQRCT
Subjt:  PPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT

XP_038881211.1 probable aspartyl protease At4g16563 [Benincasa hispida]5.7e-23488.55Show/hide
Query:  MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSF
        MEFFPIPFL SIFLLLP SSSS ISTITLPLTAFPS  LTDDP K I+YLLSASLNRAQHLK PQTK   SIQNVSLF RSYGAYSI+LAFGTPPQNLSF
Subjt:  MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSF

Query:  IFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGATAGFLLSETL
        +FDTGSS+VWFPCTA Y CSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNL SRCR+C+PKSRNC  SCPGYGI YGSGATAGFLLSETL
Subjt:  IFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGATAGFLLSETL

Query:  DFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMRLKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYL
        DFPKK VPDFLVGCSV SVHQPAGIAGFGR PESLPSQMRLKRFSYCLVSR FDDSPVSSPLVL+SGSES++SK++S IYAPFR+NPSGSNAAFREYYYL
Subjt:  DFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMRLKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYL

Query:  SLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKLSL
        SLRRI+IGGKPVK PYKYL+PDSAG GGAIIDSGSTFTFLDKPIFEA++ ELEKQLVKYPR K VEVQSGLRPCFDISKE   EFPELVLKFKGGAKLSL
Subjt:  SLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKLSL

Query:  PPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT
        PP NYLALVTD GVVCLTMMTDV  VGGG GPAIIFGAFQQQNVLVEYDLAR+RIGFRKQRCT
Subjt:  PPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT

TrEMBL top hitse value%identityAlignment
A0A0A0KHK2 Peptidase A1 domain-containing protein4.2e-22284.27Show/hide
Query:  MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSF
        MEF PIPFL SIFLLLP SSSS  ST  LPLT FPS S T DP+KTI+ LLSASLNRAQHLK PQ+KSN+SIQNVSLFPRSYGAYS+SLAFGTPPQNLSF
Subjt:  MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSF

Query:  IFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGATAGFLLSETL
        IFDTGSS+VWFPCTA Y CS CSFP VD ATI KFVPKLSSS K++GCRNPKCAWIFGPNLKSRCR+C+ KSR C DSCPGYG+QYGSGATAG LLSETL
Subjt:  IFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGATAGFLLSETL

Query:  DFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMRLKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYL
        D   KRVPDFLVGCSV+SVHQPAGIAGFGRGPESLPSQMRLKRFS+CLVSR FDDSPVSSPLVL+SGSES+ESK+KS IYAPFR+NPS SNAAFREYYYL
Subjt:  DFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMRLKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYL

Query:  SLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFDISK-EKLAEFPELVLKFKGGAKLS
        SLRRI+IGGKPVKFPYKYLVPDS GNGGAIIDSGSTFTFLDKPIFEAI++ELEKQLVKYPR K VE QSGLRPCF+I K E+ AEFP++VLKFKGG KLS
Subjt:  SLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFDISK-EKLAEFPELVLKFKGGAKLS

Query:  LPPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT
        L   NYLA+VTD+GVVCLTMMTD A VGGGGGPAII GAFQQQNVLVEYDLA+ RIGFRKQ+CT
Subjt:  LPPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT

A0A1S3CHV2 aspartic proteinase nepenthesin-2-like9.9e-21681.25Show/hide
Query:  MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSF
        MEF PIPFL SIFLLLP SSS   S+ITLPL  FPS   T DP KTI++LLSASL+RAQHLK PQ+KSN+S +NVSLFPRSYGAY++SLAFGTPPQNLSF
Subjt:  MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSF

Query:  IFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGATAGFLLSETL
        IFDTGSS+VWFPCTA Y C++CSFP+VD ATI KFVPKLSSS KI+GCRNPKCAWIFGPNLKSRCR+C+PKSR C DSCPGYGIQYGSGATAG LLSETL
Subjt:  IFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGATAGFLLSETL

Query:  DFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMRLKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYL
        D   KRVPDFLVGCSV+SVHQPAGIAGFGRGPESLPSQMRLKRFS+CL+ R FDDSPVSSPLVL+SG ES+ESK+KS IYAPF++NPS SN AFREYYYL
Subjt:  DFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMRLKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYL

Query:  SLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFDISK-EKLAEFPELVLKFKGGAKLS
        SLRRI+IGGKPVKFPYKYLVPDS G GGAIIDSGSTFTFLDKPIFEAI+ ELEKQLVKYPR K +E ++GLRPCF+ISK E+ AEFPE+ LKFKGG KLS
Subjt:  SLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFDISK-EKLAEFPELVLKFKGGAKLS

Query:  LPPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT
        LPP NYL +VTD  VVCLTMMT+   VG GGGPAIIFGAFQQQNVLVEYDLA+ RIGFRKQ+CT
Subjt:  LPPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT

A0A5A7SGF9 Aspartic proteinase nepenthesin-2-like9.9e-21681.25Show/hide
Query:  MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSF
        MEF PIPFL SIFLLLP SSS   S+ITLPL  FPS   T DP KTI++LLSASL+RAQHLK PQ+KSN+S +NVSLFPRSYGAY++SLAFGTPPQNLSF
Subjt:  MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSF

Query:  IFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGATAGFLLSETL
        IFDTGSS+VWFPCTA Y C++CSFP+VD ATI KFVPKLSSS KI+GCRNPKCAWIFGPNLKSRCR+C+PKSR C DSCPGYGIQYGSGATAG LLSETL
Subjt:  IFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGATAGFLLSETL

Query:  DFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMRLKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYL
        D   KRVPDFLVGCSV+SVHQPAGIAGFGRGPESLPSQMRLKRFS+CL+ R FDDSPVSSPLVL+SG ES+ESK+KS IYAPF++NPS SN AFREYYYL
Subjt:  DFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMRLKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYL

Query:  SLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFDISK-EKLAEFPELVLKFKGGAKLS
        SLRRI+IGGKPVKFPYKYLVPDS G GGAIIDSGSTFTFLDKPIFEAI+ ELEKQLVKYPR K +E ++GLRPCF+ISK E+ AEFPE+ LKFKGG KLS
Subjt:  SLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFDISK-EKLAEFPELVLKFKGGAKLS

Query:  LPPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT
        LPP NYL +VTD  VVCLTMMT+   VG GGGPAIIFGAFQQQNVLVEYDLA+ RIGFRKQ+CT
Subjt:  LPPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT

A0A6J1EDJ0 probable aspartyl protease At4g165634.4e-22483.37Show/hide
Query:  MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSF
        MEFF IPFLLSI LLL  SSSS  +T+TLPLT FPS      PWK I +L+SASL RAQHLK P+TKSN+SIQNV+LFPRSYGAYSISLAFGTPPQ+LS 
Subjt:  MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSF

Query:  IFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGATAGFLLSETL
        +FDTGSS+VWFPCTA Y CSNCSFPNVDAATIPKF+PKLSSSAKIIGCRN KC+WIFGPNLK+ CRSCSP+SR C D+CPGYGIQYGSGATAGFLLSETL
Subjt:  IFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGATAGFLLSETL

Query:  DFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMRLKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYL
        DFP+KRVPDFLVGCSV+SVHQPAGIAGFGRGPESLPSQM LKRFS+CLV R+FDDSPVSSPLVL+S SES ESK+ SLIYAPFR+NPSGSNAAFREYYYL
Subjt:  DFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMRLKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYL

Query:  SLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKLSL
        +LRRI+IG KPVKFPYKYLVP+SAGNGGAIIDSGSTFTFLDKPIFEA++EELEKQLVKYPR KGVE +SGLRPCFDISKE+  EFPEL+LKFKGGA L+L
Subjt:  SLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKLSL

Query:  PPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT
        PP+NYLALV D  VVCLTM+TDV  +GGGGGPAIIFGAFQQQNVLV+YDLA++RIGFRKQRCT
Subjt:  PPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT

A0A6J1IMR7 probable aspartyl protease At4g165633.6e-22684.45Show/hide
Query:  MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSF
        MEFFPI FLLSI LLL  SSSS   T+TLPLTAFPS  LT  PWK I +L+SASL RAQHLK P+TKSN+SIQNV+LFPRSYGAYSISLAFGTPPQ+LS 
Subjt:  MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSF

Query:  IFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGATAGFLLSETL
        +FDTGSS+VWFPCTA Y CSNCSFPNVDAATIPKF+PKLSSSA+IIGCRN KC+WIFGPNLKS CRSCSP+SR C D+CPGYGIQYGSGATAGFLLSETL
Subjt:  IFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGATAGFLLSETL

Query:  DFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMRLKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYL
        DFP+KRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQM LKRFS+CLV R+FDDSPVSSPLVL+S  ES +SK+ SLIYAPFR+NPSGSNAAFREYYYL
Subjt:  DFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMRLKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYL

Query:  SLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKLSL
        +LRRI+IG KPVKFPYKYLVP+SAGNGGAIIDSGSTFTFLDKPIFEA++EELEKQLVKYPR KGVE +SGLRPCFDISKE+  EFPEL+LKFKGGA L+L
Subjt:  SLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKLSL

Query:  PPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT
        PPANYLALVTD GVVCLTM+TDV  +GGGGGPAIIFGAFQQQNVLV+YDLA++RIGFRKQRCT
Subjt:  PPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT

SwissProt top hitse value%identityAlignment
Q766C2 Aspartic proteinase nepenthesin-29.7e-3531.27Show/hide
Query:  GAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGY
        G Y +++A GTP  + S I DTGS ++W  C     C+ C      +   P F P+ SSS   + C +  C  +               S  C ++   Y
Subjt:  GAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGY

Query:  GIQYGSGATA-GFLLSETLDFPKKRVPDFLVGCSV----LSVHQPAGIAGFGRGPESLPSQMRLKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKS
           YG G+T  G++ +ET  F    VP+   GC            AG+ G G GP SLPSQ+ + +FSYC+ S     S   S L L S +      S S
Subjt:  GIQYGSGATA-GFLLSETLDFPKKRVPDFLVGCSV----LSVHQPAGIAGFGRGPESLPSQMRLKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKS

Query:  LIYAPFRKNPSGSNAAFREYYYLSLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCF-D
                NP+        YYY++L+ I +GG  +  P         G GG IIDSG+T T+L +  + A+++    Q+     D   E  SGL  CF  
Subjt:  LIYAPFRKNPSGSNAAFREYYYLSLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCF-D

Query:  ISKEKLAEFPELVLKFKGGAKLSLPPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAI-IFGAFQQQNVLVEYDLARDRIGFRKQRC
         S     + PE+ ++F GG  L+L   N L +   +GV+CL       A+G      I IFG  QQQ   V YDL    + F   +C
Subjt:  ISKEKLAEFPELVLKFKGGAKLSLPPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAI-IFGAFQQQNVLVEYDLARDRIGFRKQRC

Q766C3 Aspartic proteinase nepenthesin-11.7e-3430.23Show/hide
Query:  GAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGY
        G Y ++L+ GTP Q  S I DTGS ++W  C     C N S         P F P+ SSS   + C +  C  +  P               C ++   Y
Subjt:  GAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGY

Query:  GIQYGSGA-TAGFLLSETLDFPKKRVPDFLVGCSV----LSVHQPAGIAGFGRGPESLPSQMRLKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKS
           YG G+ T G + +ETL F    +P+   GC            AG+ G GRGP SLPSQ+ + +FSYC+       S   S L+L S + S  + S +
Subjt:  GIQYGSGA-TAGFLLSETLDFPKKRVPDFLVGCSV----LSVHQPAGIAGFGRGPESLPSQMRLKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKS

Query:  LIYAPFRKNPSGSNAAFREYYYLSLRRIIIGGKPVKF-PYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFD
               + P+        +YY++L  + +G   +   P  + +  + G GG IIDSG+T T+     ++++ +E   Q +  P   G    SG   CF 
Subjt:  LIYAPFRKNPSGSNAAFREYYYLSLRRIIIGGKPVKF-PYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFD

Query:  I-SKEKLAEFPELVLKFKGGAKLSLPPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRC
          S     + P  V+ F GG  L LP  NY  +   +G++CL       A+G       IFG  QQQN+LV YD     + F   +C
Subjt:  I-SKEKLAEFPELVLKFKGGAKLSLPPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRC

Q940R4 Probable aspartyl protease At4g165631.9e-4630.1Show/hide
Query:  ISTITLPLTAFPSNSLTDDPWKT--IDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCSN
        +S+++ PL    S+SL+     +  +  L S+S   +   ++   K     Q +SL   S   Y ISL+ G+    +S   DTGS +VWFPC   + C  
Subjt:  ISTITLPLTAFPSNSLTDDPWKT--IDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCSN

Query:  CSFPNVDAATIPKFVP-KLSSSAKIIGCRNPKCAWIFG--PNLK----SRCRSCSPKSRNCFDS---CPGYGIQYGSGATAGFLLSETLDFPKKRVPDFL
        C     ++  +P   P  LSSSA  + C +P C+      P+      S C     ++ +C  S   CP +   YG G+    L S++L  P   V +F 
Subjt:  CSFPNVDAATIPKFVP-KLSSSAKIIGCRNPKCAWIFG--PNLK----SRCRSCSPKSRNCFDS---CPGYGIQYGSGATAGFLLSETLDFPKKRVPDFL

Query:  VGCSVLSVHQPAGIAGFGRGPESLPSQMRL------KRFSYCLVSRRFDDSPV--SSPLVL----------------NSGSESNESKSKSLIYAPFRKNP
         GC+  ++ +P G+AGFGRG  SLP+Q+ +        FSYCLVS  FD   V   SPL+L                +   +  + K    ++    +NP
Subjt:  VGCSVLSVHQPAGIAGFGRGPESLPSQMRL------KRFSYCLVSRRFDDSPV--SSPLVL----------------NSGSESNESKSKSLIYAPFRKNP

Query:  SGSNAAFREYYYLSLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVK-YPRDKGVEVQSGLRPCFDISKEKLAEFP
                 +Y +SL+ I IG + +  P      D  G GG ++DSG+TFT L    + ++ EE + ++ + + R   VE  SG+ PC+ ++  +  + P
Subjt:  SGSNAAFREYYYLSLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVK-YPRDKGVEVQSGLRPCFDISKEKLAEFP

Query:  ELVLKFKGG-AKLSLPPANYLALVTDDG--------VVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRC
         LVL F G  + ++LP  NY     D G        + CL +M         GG   I G +QQQ   V YDL   R+GF K++C
Subjt:  ELVLKFKGG-AKLSLPPANYLALVTDDG--------VVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRC

Q9LHE3 Protein ASPARTIC PROTEASE IN GUARD CELL 21.8e-2826.18Show/hide
Query:  LSASLNRAQHLKKPQTKSNSSIQN-----VSLFPRSYGAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKI
        +SA L R      P + S   + +     VS   +  G Y + +  G+PP++   + D+GS +VW  C    LC   S P  D        P  S S   
Subjt:  LSASLNRAQHLKKPQTKSNSSIQN-----VSLFPRSYGAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKI

Query:  IGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGA-TAGFLLSETLDFPKKRVPDFLVGCSVLSVHQ---PAGIAGFGRGPESLPSQMRL
        + C +  C  I              ++  C      Y + YG G+ T G L  ETL F K  V +  +GC   +       AG+ G G G  S   Q+  
Subjt:  IGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGA-TAGFLLSETLDFPKKRVPDFLVGCSVLSVHQ---PAGIAGFGRGPESLPSQMRL

Query:  K---RFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYLSLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFT
        +    F YCLVSR  D    +  LV        E+      + P  +NP   +     +YY+ L+ + +GG  +  P         G+GG ++D+G+  T
Subjt:  K---RFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYLSLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFT

Query:  FLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKLSLPPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGA
         L    + A  +  + Q    PR  GV +      C+D+S       P +   F  G  L+LP  N+L  V D G  C         +        I G 
Subjt:  FLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKLSLPPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGA

Query:  FQQQNVLVEYDLARDRIGFRKQRC
         QQ+ + V +D A   +GF    C
Subjt:  FQQQNVLVEYDLARDRIGFRKQRC

Q9LNJ3 Aspartyl protease family protein 25.7e-3530.54Show/hide
Query:  SSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNR-------------------AQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLS
        S S   S+ITL L    + S      KT D L S+ L R                     H  +P   S+S +  +S   +  G Y   L  GTP + + 
Subjt:  SSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNR-------------------AQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLS

Query:  FIFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGA-TAGFLLSE
         + DTGS +VW  C     C + S P  D        P+ S +   I C +P C  +      +R ++C             Y + YG G+ T G   +E
Subjt:  FIFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGA-TAGFLLSE

Query:  TLDFPKKRVPDFLVGCSVLSVHQ-------PAGIAGFGRGPESLPSQMRLK---RFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPS
        TL F + RV    +GC     H         AG+ G G+G  S P Q   +   +FSYCLV R     P S           N + S+   + P   NP 
Subjt:  TLDFPKKRVPDFLVGCSVLSVHQ-------PAGIAGFGRGPESLPSQMRLK---RFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPS

Query:  GSNAAFREYYYLSLRRIIIGGKPVKFPYKYLVP-DSAGNGGAIIDSGSTFTFLDKPIFEAISEELE---KQLVKYPRDKGVEVQSGLRPCFDISKEKLAE
                +YY+ L  I +GG  V      L   D  GNGG IIDSG++ T L +P + A+ +      K L + P        S    CFD+S     +
Subjt:  GSNAAFREYYYLSLRRIIIGGKPVKFPYKYLVP-DSAGNGGAIIDSGSTFTFLDKPIFEAISEELE---KQLVKYPRDKGVEVQSGLRPCFDISKEKLAE

Query:  FPELVLKFKGGAKLSLPPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRC
         P +VL F+ GA +SLP  NYL  V  +G  C       A  G  GG +II G  QQQ   V YDLA  R+GF    C
Subjt:  FPELVLKFKGGAKLSLPPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRC

Arabidopsis top hitse value%identityAlignment
AT2G42980.1 Eukaryotic aspartyl protease family protein5.1e-3931.08Show/hide
Query:  GAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGY
        G Y + +  GTPP++ S I DTGS + W  C   Y C + +    D        PK S+S K I C +P+C+ I  P+   +C S +        SCP Y
Subjt:  GAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGY

Query:  GIQYGSGA-TAGFLLSETLDF---------PKKRVPDFLVGCSVLS---VHQPAGIAGFGRGPESLPSQMRL---KRFSYCLVSRRFDDSPVSSPLVLNS
           YG  + T G    ET             + +V + + GC   +       +G+ G GRGP S  SQ++      FSYCLV R   ++ VSS L+   
Subjt:  GIQYGSGA-TAGFLLSETLDF---------PKKRVPDFLVGCSVLS---VHQPAGIAGFGRGPESLPSQMRL---KRFSYCLVSRRFDDSPVSSPLVLNS

Query:  GSESNESKSKSLIYAPFRKNPSGSNAAFREYYYLSLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEEL-EKQLVKYPRDKGV
        G + +     +L +  F    +G   +   +YY+ ++ I++GGK +  P +     S G+GG IIDSG+T ++  +P +E I  +  EK    YP  +  
Subjt:  GSESNESKSKSLIYAPFRKNPSGSNAAFREYYYLSLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEEL-EKQLVKYPRDKGV

Query:  EVQSGLRPCFDIS--KEKLAEFPELVLKFKGGAKLSLPPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRC
         V   L PCF++S  +E     PEL + F  G   + P  N    +++D +VCL      A +G       I G +QQQN  + YD  R R+GF   +C
Subjt:  EVQSGLRPCFDIS--KEKLAEFPELVLKFKGGAKLSLPPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRC

AT3G52500.1 Eukaryotic aspartyl protease family protein3.9e-14053.83Show/hide
Query:  SPISTITLPLTAFP-SNSLTDDPWKTIDYLLSASLNRAQHLK-----KPQ-------TKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSFIFDTGSSV
        S +S + LPL+ F  S+    DP+ ++  L  +S+ RA  LK     KP        T +++++    L  +SYG YS+SL+FGTP Q + F+FDTGSS+
Subjt:  SPISTITLPLTAFP-SNSLTDDPWKTIDYLLSASLNRAQHLK-----KPQ-------TKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSFIFDTGSSV

Query:  VWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGATAGFLLSETLDFPKKRVP
        VW PCT+ YLCS C F  +D   IP+F+PK SSS+KIIGC++PKC +++GPN+  +CR C P +RNC   CP Y +QYG G+TAG L++E LDFP   VP
Subjt:  VWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGATAGFLLSETLDFPKKRVP

Query:  DFLVGCSVLSVHQPAGIAGFGRGPESLPSQMRLKRFSYCLVSRRFDDSPVSSPLVLNSGSESNE-SKSKSLIYAPFRKNPSGSNAAFREYYYLSLRRIII
        DF+VGCS++S  QPAGIAGFGRGP SLPSQM LKRFS+CLVSRRFDD+ V++ L L++GS  N  SK+  L Y PFRKNP+ SN AF EYYYL+LRRI +
Subjt:  DFLVGCSVLSVHQPAGIAGFGRGPESLPSQMRLKRFSYCLVSRRFDDSPVSSPLVLNSGSESNE-SKSKSLIYAPFRKNPSGSNAAFREYYYLSLRRIII

Query:  GGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKLSLPPANYLA
        G K VK PYKYL P + G+GG+I+DSGSTFTF+++P+FE ++EE   Q+  Y R+K +E ++GL PCF+IS +     PEL+ +FKGGAKL LP +NY  
Subjt:  GGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKLSLPPANYLA

Query:  LVTDDGVVCLTMMTD-VAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT
         V +   VCLT+++D      GG GPAII G+FQQQN LVEYDL  DR GF K++C+
Subjt:  LVTDDGVVCLTMMTD-VAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRCT

AT3G61820.1 Eukaryotic aspartyl protease family protein4.2e-4131.4Show/hide
Query:  KTIDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAK
        K+I  L + S  R    + P+T    S   +S   +  G Y + L  GTP  N+  + DTGS VVW  C+    C N      DA     F PK S +  
Subjt:  KTIDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAK

Query:  IIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGA-TAGFLLSETLDFPKKRVPDFLVGCSVLSVHQ-------PAGIAGFGRGPESLP
         + C +  C       L       + +S+ C      Y + YG G+ T G   +ETL F   RV    +GC     H         AG+ G GRG  S P
Subjt:  IIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGA-TAGFLLSETLDFPKKRVPDFLVGCSVLSVHQ-------PAGIAGFGRGPESLP

Query:  SQMRLK---RFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYLSLRRIIIGGKPVK-FPYKYLVPDSAGNGGAIID
        SQ + +   +FSYCLV R    S    P  +  G   N +  K+ ++ P   NP         +YYL L  I +GG  V          D+ GNGG IID
Subjt:  SQMRLK---RFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYLSLRRIIIGGKPVK-FPYKYLVPDSAGNGGAIID

Query:  SGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKLSLPPANYLALVTDDGVVCLTMMTDVAAVGGGGGP
        SG++ T L +P + A+ +       K  R     +      CFD+S     + P +V  F GG ++SLP +NYL  V  +G  C        A  G  G 
Subjt:  SGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKLSLPPANYLALVTDDGVVCLTMMTDVAAVGGGGGP

Query:  AIIFGAFQQQNVLVEYDLARDRIGFRKQRC
          I G  QQQ   V YDL   R+GF  + C
Subjt:  AIIFGAFQQQNVLVEYDLARDRIGFRKQRC

AT4G16563.1 Eukaryotic aspartyl protease family protein1.3e-4730.1Show/hide
Query:  ISTITLPLTAFPSNSLTDDPWKT--IDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCSN
        +S+++ PL    S+SL+     +  +  L S+S   +   ++   K     Q +SL   S   Y ISL+ G+    +S   DTGS +VWFPC   + C  
Subjt:  ISTITLPLTAFPSNSLTDDPWKT--IDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSFIFDTGSSVVWFPCTASYLCSN

Query:  CSFPNVDAATIPKFVP-KLSSSAKIIGCRNPKCAWIFG--PNLK----SRCRSCSPKSRNCFDS---CPGYGIQYGSGATAGFLLSETLDFPKKRVPDFL
        C     ++  +P   P  LSSSA  + C +P C+      P+      S C     ++ +C  S   CP +   YG G+    L S++L  P   V +F 
Subjt:  CSFPNVDAATIPKFVP-KLSSSAKIIGCRNPKCAWIFG--PNLK----SRCRSCSPKSRNCFDS---CPGYGIQYGSGATAGFLLSETLDFPKKRVPDFL

Query:  VGCSVLSVHQPAGIAGFGRGPESLPSQMRL------KRFSYCLVSRRFDDSPV--SSPLVL----------------NSGSESNESKSKSLIYAPFRKNP
         GC+  ++ +P G+AGFGRG  SLP+Q+ +        FSYCLVS  FD   V   SPL+L                +   +  + K    ++    +NP
Subjt:  VGCSVLSVHQPAGIAGFGRGPESLPSQMRL------KRFSYCLVSRRFDDSPV--SSPLVL----------------NSGSESNESKSKSLIYAPFRKNP

Query:  SGSNAAFREYYYLSLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVK-YPRDKGVEVQSGLRPCFDISKEKLAEFP
                 +Y +SL+ I IG + +  P      D  G GG ++DSG+TFT L    + ++ EE + ++ + + R   VE  SG+ PC+ ++  +  + P
Subjt:  SGSNAAFREYYYLSLRRIIIGGKPVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVK-YPRDKGVEVQSGLRPCFDISKEKLAEFP

Query:  ELVLKFKGG-AKLSLPPANYLALVTDDG--------VVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRC
         LVL F G  + ++LP  NY     D G        + CL +M         GG   I G +QQQ   V YDL   R+GF K++C
Subjt:  ELVLKFKGG-AKLSLPPANYLALVTDDG--------VVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIGFRKQRC

AT5G45120.1 Eukaryotic aspartyl protease family protein4.2e-4933.99Show/hide
Query:  YSISLAFGTPPQNLSFIFDTGSSVVWFPC-TASYLCSNC-SFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWI------FGPNLKSRCRSCSPKSRNCF
        Y I+L  GTPPQ +    DTGS + W PC   S+ C  C    N D  +   F P  SS++    C +  C  I      F P   + C         C 
Subjt:  YSISLAFGTPPQNLSFIFDTGSSVVWFPC-TASYLCSNC-SFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWI------FGPNLKSRCRSCSPKSRNCF

Query:  DSCPGYGIQYGSGA-TAGFLLSETLDFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMRL--KRFSYCLVSRRFDDSP-VSSPLVLNSGSESNE
          CP +   YG G   +G L  + L    + VP F  GC   +  +P GIAGFGRG  SLPSQ+    K FS+C +  +F ++P +SSPL+L + S  + 
Subjt:  DSCPGYGIQYGSGA-TAGFLLSETLDFPKKRVPDFLVGCSVLSVHQPAGIAGFGRGPESLPSQMRL--KRFSYCLVSRRFDDSP-VSSPLVLNSGSESNE

Query:  SKSKSLIYAPFRKNPSGSNAAFREYYYLSLRRIIIGGK--PVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSG
        + + SL + P    P   N+     YY+ L  I IG    P + P      DS GNGG ++DSG+T+T L +P +  +   L+   + YPR    E ++G
Subjt:  SKSKSLIYAPFRKNPSGSNAAFREYYYLSLRRIIIGGK--PVKFPYKYLVPDSAGNGGAIIDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSG

Query:  LRPCF----------DISKEKLAEFPELVLKFKGGAKLSLPPAN--YLALVTDDG--VVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIG
           C+           +  + +  FP +   F   A L LP  N  Y      DG  V CL          G  GPA +FG+FQQQNV V YDL ++RIG
Subjt:  LRPCF----------DISKEKLAEFPELVLKFKGGAKLSLPPAN--YLALVTDDG--VVCLTMMTDVAAVGGGGGPAIIFGAFQQQNVLVEYDLARDRIG

Query:  FRKQRC
        F+   C
Subjt:  FRKQRC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTTCTTCCCCATTCCCTTTCTCCTTTCCATCTTTCTCCTTCTTCCCCCTTCATCTTCTTCCCCTATCTCCACTATTACACTCCCCCTCACTGCCTTCCCTTCCAA
TTCACTTACAGATGATCCATGGAAAACCATCGATTATCTTCTCTCTGCTTCACTCAACAGAGCTCAACATCTCAAGAAGCCACAAACAAAATCAAACAGTTCCATCCAGA
ATGTCTCTCTGTTCCCTCGTAGCTATGGAGCTTATTCAATCTCACTTGCCTTCGGAACTCCACCGCAGAATTTATCGTTCATTTTCGATACTGGAAGTAGTGTCGTCTGG
TTCCCCTGCACTGCTAGTTATCTTTGTTCTAATTGTTCGTTTCCTAATGTGGATGCTGCAACGATTCCGAAATTTGTTCCCAAATTATCTTCCTCTGCGAAGATTATTGG
TTGTCGAAATCCGAAATGTGCTTGGATTTTTGGCCCTAATTTGAAATCTAGGTGTAGAAGTTGTAGCCCTAAATCTCGAAATTGTTTCGATTCTTGTCCTGGCTATGGAA
TTCAGTATGGCTCTGGTGCAACCGCTGGATTTCTCCTCTCTGAAACGCTTGATTTCCCGAAGAAACGAGTGCCGGATTTTCTCGTCGGTTGTTCCGTCTTGTCCGTTCAT
CAACCAGCCGGCATTGCCGGATTCGGCCGCGGTCCTGAATCGTTGCCCTCGCAAATGCGGCTGAAACGATTCTCCTATTGCCTCGTTTCTCGTCGGTTCGACGACTCACC
CGTGAGTAGTCCTCTAGTACTGAACTCCGGCTCGGAATCTAACGAATCGAAGAGTAAGAGTCTCATTTACGCACCGTTTCGAAAGAATCCATCAGGATCCAACGCCGCAT
TTCGAGAGTACTATTACCTTAGTCTTCGGAGAATCATCATCGGTGGAAAGCCGGTGAAGTTCCCGTACAAGTATCTTGTGCCGGATTCCGCCGGGAACGGCGGCGCGATC
ATTGATTCCGGTTCGACGTTTACGTTTCTGGATAAGCCGATTTTCGAAGCCATATCGGAAGAGTTGGAGAAGCAGCTGGTGAAATATCCTCGAGATAAGGGCGTTGAAGT
GCAGTCCGGTTTAAGGCCGTGCTTCGATATTTCCAAGGAGAAATTGGCGGAGTTTCCGGAACTGGTTTTGAAGTTTAAAGGCGGAGCGAAGCTGAGTTTGCCGCCGGCGA
ATTACTTGGCATTGGTGACGGATGACGGCGTGGTGTGCTTGACGATGATGACGGATGTAGCCGCCGTCGGCGGTGGCGGGGGGCCAGCGATTATATTCGGGGCGTTTCAG
CAGCAGAATGTTTTGGTGGAGTATGATTTGGCGAGGGACAGAATCGGATTTCGGAAGCAGAGATGCACGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTTCTTCCCCATTCCCTTTCTCCTTTCCATCTTTCTCCTTCTTCCCCCTTCATCTTCTTCCCCTATCTCCACTATTACACTCCCCCTCACTGCCTTCCCTTCCAA
TTCACTTACAGATGATCCATGGAAAACCATCGATTATCTTCTCTCTGCTTCACTCAACAGAGCTCAACATCTCAAGAAGCCACAAACAAAATCAAACAGTTCCATCCAGA
ATGTCTCTCTGTTCCCTCGTAGCTATGGAGCTTATTCAATCTCACTTGCCTTCGGAACTCCACCGCAGAATTTATCGTTCATTTTCGATACTGGAAGTAGTGTCGTCTGG
TTCCCCTGCACTGCTAGTTATCTTTGTTCTAATTGTTCGTTTCCTAATGTGGATGCTGCAACGATTCCGAAATTTGTTCCCAAATTATCTTCCTCTGCGAAGATTATTGG
TTGTCGAAATCCGAAATGTGCTTGGATTTTTGGCCCTAATTTGAAATCTAGGTGTAGAAGTTGTAGCCCTAAATCTCGAAATTGTTTCGATTCTTGTCCTGGCTATGGAA
TTCAGTATGGCTCTGGTGCAACCGCTGGATTTCTCCTCTCTGAAACGCTTGATTTCCCGAAGAAACGAGTGCCGGATTTTCTCGTCGGTTGTTCCGTCTTGTCCGTTCAT
CAACCAGCCGGCATTGCCGGATTCGGCCGCGGTCCTGAATCGTTGCCCTCGCAAATGCGGCTGAAACGATTCTCCTATTGCCTCGTTTCTCGTCGGTTCGACGACTCACC
CGTGAGTAGTCCTCTAGTACTGAACTCCGGCTCGGAATCTAACGAATCGAAGAGTAAGAGTCTCATTTACGCACCGTTTCGAAAGAATCCATCAGGATCCAACGCCGCAT
TTCGAGAGTACTATTACCTTAGTCTTCGGAGAATCATCATCGGTGGAAAGCCGGTGAAGTTCCCGTACAAGTATCTTGTGCCGGATTCCGCCGGGAACGGCGGCGCGATC
ATTGATTCCGGTTCGACGTTTACGTTTCTGGATAAGCCGATTTTCGAAGCCATATCGGAAGAGTTGGAGAAGCAGCTGGTGAAATATCCTCGAGATAAGGGCGTTGAAGT
GCAGTCCGGTTTAAGGCCGTGCTTCGATATTTCCAAGGAGAAATTGGCGGAGTTTCCGGAACTGGTTTTGAAGTTTAAAGGCGGAGCGAAGCTGAGTTTGCCGCCGGCGA
ATTACTTGGCATTGGTGACGGATGACGGCGTGGTGTGCTTGACGATGATGACGGATGTAGCCGCCGTCGGCGGTGGCGGGGGGCCAGCGATTATATTCGGGGCGTTTCAG
CAGCAGAATGTTTTGGTGGAGTATGATTTGGCGAGGGACAGAATCGGATTTCGGAAGCAGAGATGCACGTGA
Protein sequenceShow/hide protein sequence
MEFFPIPFLLSIFLLLPPSSSSPISTITLPLTAFPSNSLTDDPWKTIDYLLSASLNRAQHLKKPQTKSNSSIQNVSLFPRSYGAYSISLAFGTPPQNLSFIFDTGSSVVW
FPCTASYLCSNCSFPNVDAATIPKFVPKLSSSAKIIGCRNPKCAWIFGPNLKSRCRSCSPKSRNCFDSCPGYGIQYGSGATAGFLLSETLDFPKKRVPDFLVGCSVLSVH
QPAGIAGFGRGPESLPSQMRLKRFSYCLVSRRFDDSPVSSPLVLNSGSESNESKSKSLIYAPFRKNPSGSNAAFREYYYLSLRRIIIGGKPVKFPYKYLVPDSAGNGGAI
IDSGSTFTFLDKPIFEAISEELEKQLVKYPRDKGVEVQSGLRPCFDISKEKLAEFPELVLKFKGGAKLSLPPANYLALVTDDGVVCLTMMTDVAAVGGGGGPAIIFGAFQ
QQNVLVEYDLARDRIGFRKQRCT