; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr018207 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr018207
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionEukaryotic aspartyl protease family protein
Genome locationtig00153145:187051..188412
RNA-Seq ExpressionSgr018207
SyntenySgr018207
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001461 - Aspartic peptidase A1 family
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7028143.1 Aspartic proteinase nepenthesin-2, partial [Cucurbita argyrosperma subsp. argyrosperma]2.1e-22886.03Show/hide
Query:  MAVSSALCFFYILLFSAVSAI--DSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQIKSPKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLI
        MA    LCF YILL  +VSAI   ++N ITLPLSAFPH SSSDPL+ LNFLASAS +RAHQIK+PKSN S+SKSPLSPHSYGAYSAPLSFGTPPQTLHLI
Subjt:  MAVSSALCFFYILLFSAVSAI--DSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQIKSPKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLI

Query:  FDTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGSTAGLLLSETLD
        FDTGSSLVWFPCTS+YLCS+CSFPK+DP  IPRFVPKLS+SSKL+GCQNPKCAW+FGPDVKSQCR+CNP TENCTQTCPAY VQYGSGSTAGLLLSETLD
Subjt:  FDTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGSTAGLLLSETLD

Query:  FPEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKI
        FP++KIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF+YCLASR+FDD+PHSGEL+LDSGG KTGDL+YTPFR+NPSVSNHAYKEYYYL+IRKI
Subjt:  FPEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKI

Query:  IVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNY
        +VGNQ VKVPYK+L PG DGSGGSIIDSGSTFTF++KPVFEAVA+ FEKQLAN TRATDVES TGLRPCFDISKEKSVEFPELIFQFKGGAK ALPLNNY
Subjt:  IVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNY

Query:  FALVSSSGVACLTIVTQKVAAG---GPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS
        FALVSSSGVACLT+VT K A+G   GPSVILGAFQQQNFYVEYDLVN+RLGFRQQ+CS
Subjt:  FALVSSSGVACLTIVTQKVAAG---GPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS

XP_022940517.1 probable aspartyl protease At4g16563 [Cucurbita moschata]4.6e-22886.03Show/hide
Query:  MAVSSALCFFYILLFSAVSAI--DSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQIKSPKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLI
        MA    LCF YILL  +VSAI   ++N ITLPLSA PH SSSDPL+ LNFLASAS +RAHQIK+PKSN S+SKSPLSPHSYGAYSAPLSFGTPPQTLHLI
Subjt:  MAVSSALCFFYILLFSAVSAI--DSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQIKSPKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLI

Query:  FDTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGSTAGLLLSETLD
        FDTGSSLVWFPCTS+YLCS+CSFPK+DP  IPRFVPKLS+SSKL+GCQNPKCAW+FGPDVKSQCR+CNP TENCTQTCPAY VQYGSGSTAGLLLSETLD
Subjt:  FDTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGSTAGLLLSETLD

Query:  FPEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKI
        FP++KIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF+YCLASR+FDD+PHSGEL+LDSGG KTGDL+YTPFR+NPSVSNHAYKEYYYL+IRKI
Subjt:  FPEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKI

Query:  IVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNY
        +VGNQ VKVPYK+L PG DGSGGSIIDSGSTFTF++KPVFEAVA+ FEKQLAN TRATDVES TGLRPCFDISKEKSVEFPELIFQFKGGAK ALPLNNY
Subjt:  IVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNY

Query:  FALVSSSGVACLTIVTQKVAAG---GPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS
        FALVSSSGVACLT+VT K AAG   GPSVILGAFQQQNFYVEYDLVN+RLGFRQQ+CS
Subjt:  FALVSSSGVACLTIVTQKVAAG---GPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS

XP_022982947.1 probable aspartyl protease At4g16563 [Cucurbita maxima]2.7e-22885.31Show/hide
Query:  MAVSSALCFFYILLFSAVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQIKSPKSNS-SISKSPLSPHSYGAYSAPLSFGTPPQTLHLIF
        MA    LCFFYILL S+VSAI  +N IT+PLS+FPHHSSSDPL+TLNFLASAS +RAHQIK+PKS S S+SKSPLSPHSYGAYS PLSFGTPPQTLHLIF
Subjt:  MAVSSALCFFYILLFSAVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQIKSPKSNS-SISKSPLSPHSYGAYSAPLSFGTPPQTLHLIF

Query:  DTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGSTAGLLLSETLDF
        DTGSSLVW PCTS+YLCSECSFPK+DPAGIPRF+PKLS++SKL+GCQNPKCAWIFGPDVKSQCRSCNP TENCTQTCPAYVVQYGSGSTAGLLLSETLDF
Subjt:  DTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGSTAGLLLSETLDF

Query:  PEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKII
        P+KK  NFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF+YCLASR+FDD+PH+GEL+LDS G KT  LSYTPFR+NPSVSNHAYKEYYYLTIRKI 
Subjt:  PEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKII

Query:  VGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNYF
        VG +AVKVPYK+L PGPDG+GGSIIDSGSTFTF++KPVFEAVAQE EKQLAN TRATDVESLTGLRPCFDISK+KSVEFPEL FQ KGGAK  LPL+NYF
Subjt:  VGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNYF

Query:  ALVSSSGVACLTIVTQKVA--AGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS
        ALVSSSGVACLT+VT K A   GGPS+ILGAFQQQNFYVEYDLVN ++GFRQQTCS
Subjt:  ALVSSSGVACLTIVTQKVA--AGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS

XP_023528159.1 probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo]3.5e-22885.75Show/hide
Query:  MAVSSALCFFYILLFSAVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQIKSPKSNS-SISKSPLSPHSYGAYSAPLSFGTPPQTLHLIF
        MA    LCFFYILL S+VSAI  +N ITLPLS+FPHHSSSDPL+TLNFLASAS +RAHQIK+PKS S S+SKSPLSPHSYGAYS PLSFGTP QTLHLIF
Subjt:  MAVSSALCFFYILLFSAVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQIKSPKSNS-SISKSPLSPHSYGAYSAPLSFGTPPQTLHLIF

Query:  DTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGSTAGLLLSETLDF
        DTGSSLVW PCTS+YLCSECSFPK+DPAGIPRF+PKLS+SSKL+GCQNPKCAWIFGPDVKSQCRSCNP TENCTQTCPAYVVQYGSGSTAGLLLSETLDF
Subjt:  DTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGSTAGLLLSETLDF

Query:  PEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKII
          KKI NFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF+YCLASR+FDD+PH+GEL+LDS G KT  L+YTPFR+NPSVSNHAYKEYYYLTIRKI 
Subjt:  PEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKII

Query:  VGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNYF
        VGN+AVKVPYK+L PGPDG+GGSIIDSGSTFTF++KPVFEAVAQE EKQLAN TRATDVESLTGLRPCFDISK+KSVEFPEL FQ KGGAK ALPL+NYF
Subjt:  VGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNYF

Query:  ALVSSSGVACLTIVTQKVA--AGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS
        ALVSSSGVACLT+VT K A   GGPS+ILGAFQQQNFYVEYDLVN ++GFRQQTCS
Subjt:  ALVSSSGVACLTIVTQKVA--AGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS

XP_038905730.1 probable aspartyl protease At4g16563 [Benincasa hispida]9.9e-23187.28Show/hide
Query:  MAVSSALCFFYILLFSAVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQIKSPKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLIFD
        MA  S+L FFYILLFS+VSAI ++N ITLPL+AFPH SSSDPL+TL FLASAS +RAHQIK+PKSN S+SKSPL PHSYGAYS PLSFGTP QTLHLIFD
Subjt:  MAVSSALCFFYILLFSAVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQIKSPKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLIFD

Query:  TGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGSTAGLLLSETLDFP
        TGSSLVWFPCTSRYLCSECSFPK+DP GIPRFVPKLS+SSKL+GCQNPKCAWIFGP+VKSQCRSCNP TENCTQTCPAYVVQYGSGSTAGLLLSETLDFP
Subjt:  TGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGSTAGLLLSETLDFP

Query:  EKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKIIV
        +KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF+YCLASR+FDD+PHSGEL+LDS G KT  LSYTPFR+NPSVSNHAYKEYYYL IRKI V
Subjt:  EKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKIIV

Query:  GNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNYFA
        GNQAVKVPYKFL PGPDG+GGSIIDSGSTFTF++KPVFEAVAQEFEKQLAN TRATDVESLTGLRPCFDISK+KSVEFPELIFQFKGGAK ALPL+NYFA
Subjt:  GNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNYFA

Query:  LVSSSGVACLTIVTQKVAA---GGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS
        LVSSSGVACLT+VT K  A   GGPSVI GAFQQQNFYVEYDLVN++LGFRQQTC+
Subjt:  LVSSSGVACLTIVTQKVAA---GGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS

TrEMBL top hitse value%identityAlignment
A0A5A7TRK2 Aspartic proteinase nepenthesin-21.8e-22584.87Show/hide
Query:  MAVSSALCFFYILLFSAVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQIKSPKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLIFD
        MA  S L FFYILLFS++SAI +SN ITLPL++ PH SSSDPL+ L FLASAS +RAH+IK+PKSN S+SKSPLSPHSYGAYS PLSFGTP QTLHLIFD
Subjt:  MAVSSALCFFYILLFSAVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQIKSPKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLIFD

Query:  TGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGSTAGLLLSETLDFP
        TGSSLVWFPCTSRYLC+ECSFPK+DP GIPRFVPKLS+SSKL+GCQNPKCAWIFGPDVKSQCRSCNP TENCTQTCPAYVVQYGSGSTAGLLLSETLDFP
Subjt:  TGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGSTAGLLLSETLDFP

Query:  EKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKIIV
         KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF+YCLASR+FDD+ HSG+L+LDS G KT  L+YT FR+NPSVSNHAYKEYYYL IRKIIV
Subjt:  EKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKIIV

Query:  GNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNYFA
        GNQAVKVPYK+L PGPDG+GGSIIDSGSTFTF++KPV + VAQEFEKQLAN TRATDVE+LTGLRPCFD+SKEKSVEFPELIFQFKGGAK ALPLNNYFA
Subjt:  GNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNYFA

Query:  LVSSSGVACLTIVTQKV---AAGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS
        LVSSSGVACLT+VT        GGPSVILGAFQQQNFYVEYDLVN+RLGFR+QTC+
Subjt:  LVSSSGVACLTIVTQKV---AAGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS

A0A6J1F3G5 probable aspartyl protease At4g165634.2e-22785.31Show/hide
Query:  MAVSSALCFFYILLFSAVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQIKSPKSNS-SISKSPLSPHSYGAYSAPLSFGTPPQTLHLIF
        MA    LCFFYILL S+VSAI  +N ITLPLS+FPHHSSSDPL+TLNFLASAS +RAHQIK+PKS S S+SKSPLSPHSYGAYS PLSFGTP QTLHLIF
Subjt:  MAVSSALCFFYILLFSAVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQIKSPKSNS-SISKSPLSPHSYGAYSAPLSFGTPPQTLHLIF

Query:  DTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGSTAGLLLSETLDF
        DTGSSLVW PCTS+YLCSECSFPK+DPA IPRF+PKLS+SSKL+GCQNPKCAWIFGPDVKSQCRSCNP TENCTQTCPAYVVQYGSGSTAGLLLSETLDF
Subjt:  DTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGSTAGLLLSETLDF

Query:  PEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKII
        P KKI NFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF+YCLASR+FDD+PH+GEL+LDS G KT  L+YTPFR+NPSVSNHAYKEYYYLTIRKI 
Subjt:  PEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKII

Query:  VGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNYF
        VGN+AVKVPYK+L PGPDG+GGSIIDSGSTFTF++KPVFEAVAQE EKQLAN TRATDVESLTGLRPCFDISK+KSVEFPEL F  KGGAK A PL+NYF
Subjt:  VGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNYF

Query:  ALVSSSGVACLTIVTQKVA--AGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS
        ALVSSSGVACLT+VT K A   GGPS+ILGAFQQQNFYVEYDLVN ++GFRQQTCS
Subjt:  ALVSSSGVACLTIVTQKVA--AGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS

A0A6J1FJU4 probable aspartyl protease At4g165632.2e-22886.03Show/hide
Query:  MAVSSALCFFYILLFSAVSAI--DSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQIKSPKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLI
        MA    LCF YILL  +VSAI   ++N ITLPLSA PH SSSDPL+ LNFLASAS +RAHQIK+PKSN S+SKSPLSPHSYGAYSAPLSFGTPPQTLHLI
Subjt:  MAVSSALCFFYILLFSAVSAI--DSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQIKSPKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLI

Query:  FDTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGSTAGLLLSETLD
        FDTGSSLVWFPCTS+YLCS+CSFPK+DP  IPRFVPKLS+SSKL+GCQNPKCAW+FGPDVKSQCR+CNP TENCTQTCPAY VQYGSGSTAGLLLSETLD
Subjt:  FDTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGSTAGLLLSETLD

Query:  FPEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKI
        FP++KIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF+YCLASR+FDD+PHSGEL+LDSGG KTGDL+YTPFR+NPSVSNHAYKEYYYL+IRKI
Subjt:  FPEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKI

Query:  IVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNY
        +VGNQ VKVPYK+L PG DGSGGSIIDSGSTFTF++KPVFEAVA+ FEKQLAN TRATDVES TGLRPCFDISKEKSVEFPELIFQFKGGAK ALPLNNY
Subjt:  IVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNY

Query:  FALVSSSGVACLTIVTQKVAAG---GPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS
        FALVSSSGVACLT+VT K AAG   GPSVILGAFQQQNFYVEYDLVN+RLGFRQQ+CS
Subjt:  FALVSSSGVACLTIVTQKVAAG---GPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS

A0A6J1IXY3 probable aspartyl protease At4g165631.3e-22885.31Show/hide
Query:  MAVSSALCFFYILLFSAVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQIKSPKSNS-SISKSPLSPHSYGAYSAPLSFGTPPQTLHLIF
        MA    LCFFYILL S+VSAI  +N IT+PLS+FPHHSSSDPL+TLNFLASAS +RAHQIK+PKS S S+SKSPLSPHSYGAYS PLSFGTPPQTLHLIF
Subjt:  MAVSSALCFFYILLFSAVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQIKSPKSNS-SISKSPLSPHSYGAYSAPLSFGTPPQTLHLIF

Query:  DTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGSTAGLLLSETLDF
        DTGSSLVW PCTS+YLCSECSFPK+DPAGIPRF+PKLS++SKL+GCQNPKCAWIFGPDVKSQCRSCNP TENCTQTCPAYVVQYGSGSTAGLLLSETLDF
Subjt:  DTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGSTAGLLLSETLDF

Query:  PEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKII
        P+KK  NFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF+YCLASR+FDD+PH+GEL+LDS G KT  LSYTPFR+NPSVSNHAYKEYYYLTIRKI 
Subjt:  PEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKII

Query:  VGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNYF
        VG +AVKVPYK+L PGPDG+GGSIIDSGSTFTF++KPVFEAVAQE EKQLAN TRATDVESLTGLRPCFDISK+KSVEFPEL FQ KGGAK  LPL+NYF
Subjt:  VGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNYF

Query:  ALVSSSGVACLTIVTQKVA--AGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS
        ALVSSSGVACLT+VT K A   GGPS+ILGAFQQQNFYVEYDLVN ++GFRQQTCS
Subjt:  ALVSSSGVACLTIVTQKVA--AGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS

A0A6J1KTP0 probable aspartyl protease At4g165638.5e-22886.03Show/hide
Query:  MAVSSALCFFYILLFSAVSAI--DSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQIKSPKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLI
        MA    LCF YILL  +VSAI   ++N ITLPLSAFPHHSSSDPL+ LNFLASAS +RAHQIK+PKSN S+SKS LSPHSYGAYSAPLSFGTPPQTLHLI
Subjt:  MAVSSALCFFYILLFSAVSAI--DSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQIKSPKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLI

Query:  FDTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGSTAGLLLSETLD
        FDTGSSLVWFPCTS+YLCS+CSFPK+DP  IPRFVPKLS+SSKL+GCQNPKC W+FGPDVKSQCR+CN  TENCTQTCPAYVVQYGSGSTAGLLLSETLD
Subjt:  FDTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGSTAGLLLSETLD

Query:  FPEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKI
        FP++KIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKF+YCLASR+FDD+PHSGEL+LDSGGEKTGDL+YTPFR+NPSVSNHAYKEYYYL+IRKI
Subjt:  FPEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKI

Query:  IVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNY
        +VGNQ VKVPYK+L PG DGSGGSIIDSGSTFTF++KPVFEAVA+ FEKQLAN TRATDVES TGLRPCFDISKEKSVEFPELIFQFKGGAK  L LNNY
Subjt:  IVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNY

Query:  FALVSSSGVACLTIVTQKVAAG---GPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS
        FALVSSSGVACLT+VT K AAG   GPSVILGAFQQQNFYVEYDLVN+RLGFRQQTCS
Subjt:  FALVSSSGVACLTIVTQKVAAG---GPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS

SwissProt top hitse value%identityAlignment
Q6F4N5 Aspartyl protease 253.9e-3628.29Show/hide
Query:  MAVSSALCFFYILLFSAVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQIKSPKSNSSISKSPL-SPHSYGAYSAPLSFGTPPQTLHLIF
        MA ++ +    +LL + V+A  +     L +    H SS  PLE++  LA    +R   + S  + + +S +P+ S  +  +Y      G+P Q L L  
Subjt:  MAVSSALCFFYILLFSAVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQIKSPKSNSSISKSPL-SPHSYGAYSAPLSFGTPPQTLHLIF

Query:  DTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGSTAGLLLSETLDF
        DT +   W  C+    C   S           F P  S+S   L C +  C    G    +     +      T    A+   +   S    L S+TL  
Subjt:  DTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGSTAGLLLSETLDF

Query:  PEKKIPNFVVGCSFLSIHQPS------GIAGFGRGSESLPSQMGL---KKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEY
         +  IPN+  GC   S+  P+      G+ G GRG  +L SQ G      FSYCL S R      SG L L +GG +   + YTP  +NP  S+      
Subjt:  PEKKIPNFVVGCSFLSIHQPS------GIAGFGRGSESLPSQMGL---KKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEY

Query:  YYLTIRKIIVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKGGAK
        YY+ +  + VG+  VKVP    A       G+++DSG+  T    PV+ A+ +EF +Q+A  +  T   SL     CF+  +  +   P +     GG  
Subjt:  YYLTIRKIIVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKGGAK

Query:  LALPLNNYFALVSSSGVACLTIVTQKVAAGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS
        LALP+ N     S++ +ACL +            ++   QQQN  V +D+ N R+GF +++C+
Subjt:  LALPLNNYFALVSSSGVACLTIVTQKVAAGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS

Q766C2 Aspartic proteinase nepenthesin-28.6e-3632.43Show/hide
Query:  RAHQIKSPKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYLCSEC-SFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIF
        R   I +   +SS  ++P+     G Y   ++ GTP  +   I DTGS L+W  C     C++C S P       P F P+ S+S   L C++  C    
Subjt:  RAHQIKSPKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYLCSEC-SFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIF

Query:  GPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGSTA-GLLLSETLDFPEKKIPNFVVGC----SFLSIHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRR
          D+ S         E C      Y   YG GST  G + +ET  F    +PN   GC            +G+ G G G  SLPSQ+G+ +FSYC+ S  
Subjt:  GPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGSTA-GLLLSETLDFPEKKIPNFVVGC----SFLSIHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRR

Query:  FDDTPHSGELVLDSGGEKTGDLSYTPFRK--NPSVSNHAYKEYYYLTIRKIIVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEK
           +P +  L   + G   G  S T      NP+        YYY+T++ I VG   + +P        DG+GG IIDSG+T T+L +  + AVAQ F  
Subjt:  FDDTPHSGELVLDSGGEKTGDLSYTPFRK--NPSVSNHAYKEYYYLTIRKIIVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEK

Query:  QLANHTRATDVESLTGLRPCFDISKEKS-VEFPELIFQFKGGAKLALPLNNYFALVS-SSGVACLTIVTQKVAAGGPSVILGAFQQQNFYVEYDLVNDRL
        Q+      T  ES +GL  CF    + S V+ PE+  QF GG    L L     L+S + GV CL + +          I G  QQQ   V YDL N  +
Subjt:  QLANHTRATDVESLTGLRPCFDISKEKS-VEFPELIFQFKGGAKLALPLNNYFALVS-SSGVACLTIVTQKVAAGGPSVILGAFQQQNFYVEYDLVNDRL

Query:  GFRQQTC
         F    C
Subjt:  GFRQQTC

Q766C3 Aspartic proteinase nepenthesin-13.3e-3531.59Show/hide
Query:  GAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAY
        G Y   LS GTP Q    I DTGS L+W  C     C++C          P F P+ S+S   L C +  C  +  P               C+     Y
Subjt:  GAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAY

Query:  VVQYGSGS-TAGLLLSETLDFPEKKIPNFVVGC----SFLSIHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYT
           YG GS T G + +ETL F    IPN   GC            +G+ G GRG  SLPSQ+ + KFSYC+       TP    L+L       G L+ +
Subjt:  VVQYGSGS-TAGLLLSETLDFPEKKIPNFVVGC----SFLSIHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYT

Query:  PFRKNPS---VSNHAYKEYYYLTIRKIIVGNQAVKV-PYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDI
            +P+   + +     +YY+T+  + VG+  + + P  F     +G+GG IIDSG+T T+     +++V QEF  Q+          S +G   CF  
Subjt:  PFRKNPS---VSNHAYKEYYYLTIRKIIVGNQAVKV-PYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDI

Query:  SKEKS-VEFPELIFQFKGGAKLALPLNNYFALVSSSGVACLTIVTQKVAAGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTC
          + S ++ P  +  F GG  L LP  NYF +  S+G+ CL + +   ++ G S I G  QQQN  V YD  N  + F    C
Subjt:  SKEKS-VEFPELIFQFKGGAKLALPLNNYFALVSSSGVACLTIVTQKVAAGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTC

Q940R4 Probable aspartyl protease At4g165631.5e-4829.57Show/hide
Query:  AVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQIKSPKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYLC
        +VS++ +  ++ L  S      SS PL  L   +S S +R  +    +    +S  P+S  S   Y   LS G+    + L  DTGS LVWFPC   + C
Subjt:  AVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQIKSPKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYLC

Query:  SECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQ-CRSCN-----PTTENCTQT---CPAYVVQYGSGSTAGLLLSETLDFPEKKIPNF
          C    + P+        LS+S+  + C +P C+        S  C   N       T +C  +   CP +   YG GS    L S++L  P   + NF
Subjt:  SECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQ-CRSCN-----PTTENCTQT---CPAYVVQYGSGSTAGLLLSETLDFPEKKIPNF

Query:  VVGCSFLSIHQPSGIAGFGRGSESLPSQMGL------KKFSYCLASRRFDD--TPHSGELVL--------------------DSGGEKTGDLSYTPFRKN
          GC+  ++ +P G+AGFGRG  SLP+Q+ +        FSYCL S  FD         L+L                    D   +K  +  +T   +N
Subjt:  VVGCSFLSIHQPSGIAGFGRGSESLPSQMGL------KKFSYCLASRRFDD--TPHSGELVL--------------------DSGGEKTGDLSYTPFRKN

Query:  PSVSNHAYKEYYYLTIRKIIVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLAN-HTRATDVESLTGLRPCFDISKEKSVEF
        P    H Y  +Y ++++ I +G + +  P        +G GG ++DSG+TFT L    + +V +EF+ ++   H RA  VE  +G+ PC+ ++  ++V+ 
Subjt:  PSVSNHAYKEYYYLTIRKIIVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLAN-HTRATDVESLTGLRPCFDISKEKSVEF

Query:  PELIFQFKGG-AKLALPLNNYFALVSSSG--------VACLTIVT---QKVAAGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS
        P L+  F G  + + LP  NYF      G        + CL ++    +    GG   ILG +QQQ F V YDL+N R+GF ++ C+
Subjt:  PELIFQFKGG-AKLALPLNNYFALVSSSG--------VACLTIVT---QKVAAGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS

Q9LNJ3 Aspartyl protease family protein 21.5e-3530.47Show/hide
Query:  FSAVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSR-AHQIKS------------------PKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTL
        F + S  +SS+ ITL L      SS+   +T + L S+ L R + ++KS                  P   SS   S LS  S G Y   L  GTP + +
Subjt:  FSAVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSR-AHQIKS------------------PKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTL

Query:  HLIFDTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGS-TAGLLLS
        +++ DTGS +VW  C     C  C + + DP     F P+ S +   + C +P C        +     CN   + C      Y V YG GS T G   +
Subjt:  HLIFDTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGS-TAGLLLS

Query:  ETLDFPEKKIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQMGLK---KFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYK
        ETL F   ++    +GC   +       +G+ G G+G  S P Q G +   KFSYCL  R     P S   V+      +    +TP   NP +      
Subjt:  ETLDFPEKKIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQMGLK---KFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYK

Query:  EYYYLTIRKIIVGNQAVK-VPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKG
         +YY+ +  I VG   V  V          G+GG IIDSG++ T L +P + A+   F        RA D         CFD+S    V+ P ++  F+ 
Subjt:  EYYYLTIRKIIVGNQAVK-VPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKG

Query:  GAKLALPLNNYFALVSSSGVACLTIVTQKVAAGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS
        GA ++LP  NY   V ++G  C          GG S+I G  QQQ F V YDL + R+GF    C+
Subjt:  GAKLALPLNNYFALVSSSGVACLTIVTQKVAAGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS

Arabidopsis top hitse value%identityAlignment
AT1G25510.1 Eukaryotic aspartyl protease family protein2.8e-4231.03Show/hide
Query:  GAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAY
        G Y   +  G P + ++++ DTGS + W  CT    C++C + + +P     F P  S+S + L C  P+C  +      S+CR+          TC  Y
Subjt:  GAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAY

Query:  VVQYGSGS-TAGLLLSETLDFPEKKIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTP
         V YG GS T G   +ETL      + N  VGC   +       +G+ G G G  +LPSQ+    FSYCL  R  D         +D G   + D    P
Subjt:  VVQYGSGS-TAGLLLSETLDFPEKKIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTP

Query:  FRKNPSVSNHAYKEYYYLTIRKIIVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKS
              + NH    +YYL +  I VG + +++P         GSGG IIDSG+  T L+  ++ ++   F K   +  +A  V        C+++S + +
Subjt:  FRKNPSVSNHAYKEYYYLTIRKIIVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKS

Query:  VEFPELIFQFKGGAKLALPLNNYFALVSSSGVACLTIVTQKVAAGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTC
        VE P + F F GG  LALP  NY   V S G  CL              I+G  QQQ   V +DL N  +GF    C
Subjt:  VEFPELIFQFKGGAKLALPLNNYFALVSSSGVACLTIVTQKVAAGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTC

AT2G42980.1 Eukaryotic aspartyl protease family protein2.2e-4232.66Show/hide
Query:  GAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAY
        G Y   +  GTPP+   LI DTGS L W  C   Y C   +    D        PK S S K + C +P+C+ I  PD   QC S N       Q+CP Y
Subjt:  GAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAY

Query:  VVQYGSGS-TAGLLLSETLDF---------PEKKIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQMGL---KKFSYCLASRRFDDTPHSGELVLDS
           YG  S T G    ET             E K+ N + GC   +       SG+ G GRG  S  SQ+       FSYCL  R   +T  S +L+   
Subjt:  VVQYGSGS-TAGLLLSETLDF---------PEKKIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQMGL---KKFSYCLASRRFDDTPHSGELVLDS

Query:  GGEKTGDLSYTPFRKNPSVS--NHAYKEYYYLTIRKIIVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEF-EKQLANHTRATDVES
         GE    L++T       V+   ++ + +YY+ I+ I+VG +A+ +P +      DG GG+IIDSG+T ++  +P +E +  +F EK   N+    D   
Subjt:  GGEKTGDLSYTPFRKNPSVS--NHAYKEYYYLTIRKIIVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEF-EKQLANHTRATDVES

Query:  LTGLRPCFDIS--KEKSVEFPELIFQFKGGAKLALPLNNYFALVSSSGVACLTIVTQKVAAGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS
        L    PCF++S  +E ++  PEL   F  G     P  N F  +S   V    + T K        I+G +QQQNF++ YD    RLGF    C+
Subjt:  LTGLRPCFDIS--KEKSVEFPELIFQFKGGAKLALPLNNYFALVSSSGVACLTIVTQKVAAGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS

AT3G52500.1 Eukaryotic aspartyl protease family protein1.2e-15256.09Show/hide
Query:  VSSALCFFYILLFSAVSAIDSSNVITLPLSAFPH--HSSSDPLETLNFLASASLSRAHQIK--------------SPKSNSSISKSPLSPHSYGAYSAPL
        ++S++ FF+++  S VSA      + LPLS F H   S  DP  +L  LA +S++RAH++K              +  +++++ KSPLS  SYG YS  L
Subjt:  VSSALCFFYILLFSAVSAIDSSNVITLPLSAFPH--HSSSDPLETLNFLASASLSRAHQIK--------------SPKSNSSISKSPLSPHSYGAYSAPL

Query:  SFGTPPQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSG
        SFGTP QT+  +FDTGSSLVW PCTSRYLCS C F  +DP  IPRF+PK S+SSK++GCQ+PKC +++GP+V  QCR C+P T NCT  CP Y++QYG G
Subjt:  SFGTPPQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSG

Query:  STAGLLLSETLDFPEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRRFDDTPHSGELVLDSG-----GEKTGDLSYTPFRKNPS
        STAG+L++E LDFP+  +P+FVVGCS +S  QP+GIAGFGRG  SLPSQM LK+FS+CL SRRFDDT  + +L LD+G     G KT  L+YTPFRKNP+
Subjt:  STAGLLLSETLDFPEKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFSYCLASRRFDDTPHSGELVLDSG-----GEKTGDLSYTPFRKNPS

Query:  VSNHAYKEYYYLTIRKIIVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKSVEFPEL
        VSN A+ EYYYL +R+I VG + VK+PYK+LAPG +G GGSI+DSGSTFTF+E+PVFE VA+EF  Q++N+TR  D+E  TGL PCF+IS +  V  PEL
Subjt:  VSNHAYKEYYYLTIRKIIVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKSVEFPEL

Query:  IFQFKGGAKLALPLNNYFALVSSSGVACLTIVTQKV----AAGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS
        IF+FKGGAKL LPL+NYF  V ++   CLT+V+ K        GP++ILG+FQQQN+ VEYDL NDR GF ++ CS
Subjt:  IFQFKGGAKLALPLNNYFALVSSSGVACLTIVTQKV----AAGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS

AT4G16563.1 Eukaryotic aspartyl protease family protein1.1e-4929.57Show/hide
Query:  AVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQIKSPKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYLC
        +VS++ +  ++ L  S      SS PL  L   +S S +R  +    +    +S  P+S  S   Y   LS G+    + L  DTGS LVWFPC   + C
Subjt:  AVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQIKSPKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSRYLC

Query:  SECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQ-CRSCN-----PTTENCTQT---CPAYVVQYGSGSTAGLLLSETLDFPEKKIPNF
          C    + P+        LS+S+  + C +P C+        S  C   N       T +C  +   CP +   YG GS    L S++L  P   + NF
Subjt:  SECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQ-CRSCN-----PTTENCTQT---CPAYVVQYGSGSTAGLLLSETLDFPEKKIPNF

Query:  VVGCSFLSIHQPSGIAGFGRGSESLPSQMGL------KKFSYCLASRRFDD--TPHSGELVL--------------------DSGGEKTGDLSYTPFRKN
          GC+  ++ +P G+AGFGRG  SLP+Q+ +        FSYCL S  FD         L+L                    D   +K  +  +T   +N
Subjt:  VVGCSFLSIHQPSGIAGFGRGSESLPSQMGL------KKFSYCLASRRFDD--TPHSGELVL--------------------DSGGEKTGDLSYTPFRKN

Query:  PSVSNHAYKEYYYLTIRKIIVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLAN-HTRATDVESLTGLRPCFDISKEKSVEF
        P    H Y  +Y ++++ I +G + +  P        +G GG ++DSG+TFT L    + +V +EF+ ++   H RA  VE  +G+ PC+ ++  ++V+ 
Subjt:  PSVSNHAYKEYYYLTIRKIIVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLAN-HTRATDVESLTGLRPCFDISKEKSVEF

Query:  PELIFQFKGG-AKLALPLNNYFALVSSSG--------VACLTIVT---QKVAAGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS
        P L+  F G  + + LP  NYF      G        + CL ++    +    GG   ILG +QQQ F V YDL+N R+GF ++ C+
Subjt:  PELIFQFKGG-AKLALPLNNYFALVSSSG--------VACLTIVT---QKVAAGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTCS

AT5G45120.1 Eukaryotic aspartyl protease family protein3.1e-4932.16Show/hide
Query:  PHHSSSDPLETLNFLASASL-----SRAHQIKSPKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTS-RYLCSECSFPKVDPAG
        P  SSS  L      +S SL         +IK P S+  +   PL     G Y   L+ GTPPQ + +  DTGS L W PC +  + C EC   K +   
Subjt:  PHHSSSDPLETLNFLASASL-----SRAHQIKSPKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTS-RYLCSECSFPKVDPAG

Query:  IPR-FVPKLSTSSKLLGCQNPKCAWI------FGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGS-TAGLLLSETLDFPEKKIPNFVVGCSFLSIHQPS
         P  F P  S++S    C +  C  I      F P   + C         C + CP++   YG G   +G+L  + L    + +P F  GC   +  +P 
Subjt:  IPR-FVPKLSTSSKLLGCQNPKCAWI------FGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGS-TAGLLLSETLDFPEKKIPNFVVGCSFLSIHQPS

Query:  GIAGFGRGSESLPSQMGL--KKFSYCLASRRFDDTPH-SGELVLDSGG---EKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKIIVGNQ--AVKVPYKFL
        GIAGFGRG  SLPSQ+G   K FS+C    +F + P+ S  L+L +       T  L +TP    P      Y   YY+ +  I +G      +VP    
Subjt:  GIAGFGRGSESLPSQMGL--KKFSYCLASRRFDDTPH-SGELVLDSGG---EKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKIIVGNQ--AVKVPYKFL

Query:  APGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDI--------SKEKSVE--FPELIFQFKGGAKLALPLNNYFALV
             G+GG ++DSG+T+T L +P +  +    +  +  + RAT+ ES TG   C+ +        S E  V   FP + F F   A L LP  N F  +
Subjt:  APGPDGSGGSIIDSGSTFTFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDI--------SKEKSVE--FPELIFQFKGGAKLALPLNNYFALV

Query:  SS----SGVACLTIVTQKVAAGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTC
        S+    S V CL     +    GP+ + G+FQQQN  V YDL  +R+GF+   C
Subjt:  SS----SGVACLTIVTQKVAAGGPSVILGAFQQQNFYVEYDLVNDRLGFRQQTC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGTTTCTTCCGCTCTCTGTTTCTTCTACATTCTCCTATTCTCTGCTGTTTCCGCCATTGACAGCAGCAACGTCATTACCCTCCCTCTCTCGGCCTTTCCTCACCA
TTCATCTTCAGACCCATTGGAGACTCTGAACTTCCTCGCTTCTGCTTCTCTAAGCAGAGCCCATCAAATCAAGAGCCCGAAATCCAACTCCTCCATCTCCAAATCTCCTC
TCTCCCCCCATAGCTATGGAGCTTACTCTGCTCCACTCAGCTTCGGGACGCCCCCACAGACCCTGCATTTGATCTTCGATACAGGTAGCAGCCTCGTTTGGTTTCCTTGT
ACTTCCCGATATCTCTGCTCCGAGTGTTCTTTCCCCAAGGTAGATCCCGCCGGAATCCCCAGATTTGTTCCGAAATTATCTACCTCTTCGAAGCTTCTCGGTTGCCAGAA
TCCGAAATGTGCCTGGATATTTGGTCCCGACGTGAAATCTCAGTGCCGGAGCTGTAATCCTACGACGGAGAACTGTACCCAAACTTGCCCTGCTTATGTTGTTCAGTATG
GTTCTGGCTCGACGGCCGGGCTTTTGCTCTCGGAGACGCTTGATTTTCCGGAAAAGAAAATACCCAATTTTGTTGTTGGGTGTTCGTTCTTGTCGATCCACCAACCCTCT
GGGATTGCCGGATTCGGCCGAGGATCCGAATCGTTGCCGTCGCAGATGGGTCTCAAGAAATTCTCCTACTGCCTCGCGTCCCGTCGATTCGATGACACGCCGCATTCCGG
CGAGCTTGTTCTAGATTCCGGCGGCGAGAAGACCGGTGACCTCAGCTACACACCGTTCCGGAAGAACCCCTCTGTATCTAACCACGCTTACAAAGAATACTATTACTTAA
CCATACGCAAAATCATCGTCGGCAACCAGGCCGTGAAGGTGCCGTACAAGTTTCTGGCGCCGGGACCCGACGGAAGCGGCGGATCTATAATCGACTCCGGCTCCACCTTC
ACATTTCTGGAAAAACCGGTGTTCGAGGCGGTGGCCCAAGAGTTTGAGAAGCAGTTGGCGAATCATACGAGAGCCACCGACGTAGAATCTCTGACCGGATTACGGCCGTG
CTTCGATATTTCGAAAGAGAAATCGGTGGAGTTCCCGGAGTTGATCTTCCAGTTCAAAGGCGGAGCGAAACTGGCACTGCCGTTGAACAACTATTTCGCTCTGGTCAGTA
GCTCCGGCGTGGCGTGTTTGACGATCGTGACGCAGAAAGTGGCGGCCGGTGGGCCGTCTGTGATATTGGGTGCGTTTCAGCAGCAGAATTTCTATGTAGAGTATGATTTG
GTAAACGATAGACTAGGATTTCGACAACAGACTTGCAGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGGTTTCTTCCGCTCTCTGTTTCTTCTACATTCTCCTATTCTCTGCTGTTTCCGCCATTGACAGCAGCAACGTCATTACCCTCCCTCTCTCGGCCTTTCCTCACCA
TTCATCTTCAGACCCATTGGAGACTCTGAACTTCCTCGCTTCTGCTTCTCTAAGCAGAGCCCATCAAATCAAGAGCCCGAAATCCAACTCCTCCATCTCCAAATCTCCTC
TCTCCCCCCATAGCTATGGAGCTTACTCTGCTCCACTCAGCTTCGGGACGCCCCCACAGACCCTGCATTTGATCTTCGATACAGGTAGCAGCCTCGTTTGGTTTCCTTGT
ACTTCCCGATATCTCTGCTCCGAGTGTTCTTTCCCCAAGGTAGATCCCGCCGGAATCCCCAGATTTGTTCCGAAATTATCTACCTCTTCGAAGCTTCTCGGTTGCCAGAA
TCCGAAATGTGCCTGGATATTTGGTCCCGACGTGAAATCTCAGTGCCGGAGCTGTAATCCTACGACGGAGAACTGTACCCAAACTTGCCCTGCTTATGTTGTTCAGTATG
GTTCTGGCTCGACGGCCGGGCTTTTGCTCTCGGAGACGCTTGATTTTCCGGAAAAGAAAATACCCAATTTTGTTGTTGGGTGTTCGTTCTTGTCGATCCACCAACCCTCT
GGGATTGCCGGATTCGGCCGAGGATCCGAATCGTTGCCGTCGCAGATGGGTCTCAAGAAATTCTCCTACTGCCTCGCGTCCCGTCGATTCGATGACACGCCGCATTCCGG
CGAGCTTGTTCTAGATTCCGGCGGCGAGAAGACCGGTGACCTCAGCTACACACCGTTCCGGAAGAACCCCTCTGTATCTAACCACGCTTACAAAGAATACTATTACTTAA
CCATACGCAAAATCATCGTCGGCAACCAGGCCGTGAAGGTGCCGTACAAGTTTCTGGCGCCGGGACCCGACGGAAGCGGCGGATCTATAATCGACTCCGGCTCCACCTTC
ACATTTCTGGAAAAACCGGTGTTCGAGGCGGTGGCCCAAGAGTTTGAGAAGCAGTTGGCGAATCATACGAGAGCCACCGACGTAGAATCTCTGACCGGATTACGGCCGTG
CTTCGATATTTCGAAAGAGAAATCGGTGGAGTTCCCGGAGTTGATCTTCCAGTTCAAAGGCGGAGCGAAACTGGCACTGCCGTTGAACAACTATTTCGCTCTGGTCAGTA
GCTCCGGCGTGGCGTGTTTGACGATCGTGACGCAGAAAGTGGCGGCCGGTGGGCCGTCTGTGATATTGGGTGCGTTTCAGCAGCAGAATTTCTATGTAGAGTATGATTTG
GTAAACGATAGACTAGGATTTCGACAACAGACTTGCAGTTAG
Protein sequenceShow/hide protein sequence
MAVSSALCFFYILLFSAVSAIDSSNVITLPLSAFPHHSSSDPLETLNFLASASLSRAHQIKSPKSNSSISKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPC
TSRYLCSECSFPKVDPAGIPRFVPKLSTSSKLLGCQNPKCAWIFGPDVKSQCRSCNPTTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPEKKIPNFVVGCSFLSIHQPS
GIAGFGRGSESLPSQMGLKKFSYCLASRRFDDTPHSGELVLDSGGEKTGDLSYTPFRKNPSVSNHAYKEYYYLTIRKIIVGNQAVKVPYKFLAPGPDGSGGSIIDSGSTF
TFLEKPVFEAVAQEFEKQLANHTRATDVESLTGLRPCFDISKEKSVEFPELIFQFKGGAKLALPLNNYFALVSSSGVACLTIVTQKVAAGGPSVILGAFQQQNFYVEYDL
VNDRLGFRQQTCS