CuGenDBv2

Tan0009883 (gene) of Snake gourd v1 genome

Gene ID: Tan0009883
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Eukaryotic aspartyl protease family protein
Genome location: LG08:66393473..66395490
RNA-Seq Expression: Tan0009883
Synteny: Tan0009883
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
IPR001461 - Aspartic peptidase A1 family
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant
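
The genome location field in this record has the shape "sequence ID, colon, start, two dots, end". The sketch below shows one way to split such a string and compute the span length. The function names are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re

# Assumed format: "<sequence id>:<start>..<end>", as in the "Genome location" field.
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a location string into (seqid, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG08:66393473..66395490")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 2018 positions on LG08.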


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6596604.1 Aspartic proteinase nepenthesin-2, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 1.9e-245; identity: 92.39%)
Query:  MAAPPSLCFFYILLFSSVSAIV--ISNPITLPLSAFPHSFSSDPLQTLNFLASASQNRAHQIKTPKSQSNNSVSKSPLSPHSYGAYSAPLSFGTPPQTLH
        MAAPP LCF YILL  SVSAIV   +N ITLPLSA PH  SSDPLQ LNFLASASQNRAHQIKTPKS   NSVSKSPLSPHSYGAYSAPLSFGTPPQTLH
Subjt:  MAAPPSLCFFYILLFSSVSAIV--ISNPITLPLSAFPHSFSSDPLQTLNFLASASQNRAHQIKTPKSQSNNSVSKSPLSPHSYGAYSAPLSFGTPPQTLH

Query:  LIFDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAYVVQYGSGSTAGLLLSET
        LIFDTGSSLVWFPCTSKYLCS+CSFPKIDP  IPRFVPKLSSSSKL+GCQNPKCAW+FGPDVKSQCR+CNPKT+NCTQTCPAY VQYGSGSTAGLLLSET
Subjt:  LIFDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAYVVQYGSGSTAGLLLSET

Query:  LDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGGGKTGDLTYTPFRQNPSVSNHAYKEYYYLSIR
        LDFPD+KIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGG KTGDLTYTPFRQNPSVSNHAYKEYYYLSIR
Subjt:  LDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGGGKTGDLTYTPFRQNPSVSNHAYKEYYYLSIR

Query:  KILVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISKDKSVEFPELIFQFKGGAKWVLPLN
        KILVGNQ VKVPYKYLVPG DGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQL NRTRATDVESATGLRPCFDISK+KSVEFPELIFQFKGGAKW LPLN
Subjt:  KILVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISKDKSVEFPELIFQFKGGAKWVLPLN

Query:  NYFALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS
        NYFALVSSSGVACLTVVTHK A+GGG GPSVILGAFQQQNFYVEYDLVNERLGFRQQ+CS
Subjt:  NYFALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS

KAG7028143.1 Aspartic proteinase nepenthesin-2, partial [Cucurbita argyrosperma subsp. argyrosperma] (e-value: 1.3e-246; identity: 92.83%)
Query:  MAAPPSLCFFYILLFSSVSAIV--ISNPITLPLSAFPHSFSSDPLQTLNFLASASQNRAHQIKTPKSQSNNSVSKSPLSPHSYGAYSAPLSFGTPPQTLH
        MAAPP LCF YILL  SVSAIV   +N ITLPLSAFPH  SSDPLQ LNFLASASQNRAHQIKTPKS   NSVSKSPLSPHSYGAYSAPLSFGTPPQTLH
Subjt:  MAAPPSLCFFYILLFSSVSAIV--ISNPITLPLSAFPHSFSSDPLQTLNFLASASQNRAHQIKTPKSQSNNSVSKSPLSPHSYGAYSAPLSFGTPPQTLH

Query:  LIFDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAYVVQYGSGSTAGLLLSET
        LIFDTGSSLVWFPCTSKYLCS+CSFPKIDP  IPRFVPKLSSSSKL+GCQNPKCAW+FGPDVKSQCR+CNPKT+NCTQTCPAY VQYGSGSTAGLLLSET
Subjt:  LIFDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAYVVQYGSGSTAGLLLSET

Query:  LDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGGGKTGDLTYTPFRQNPSVSNHAYKEYYYLSIR
        LDFPD+KIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGG KTGDLTYTPFRQNPSVSNHAYKEYYYLSIR
Subjt:  LDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGGGKTGDLTYTPFRQNPSVSNHAYKEYYYLSIR

Query:  KILVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISKDKSVEFPELIFQFKGGAKWVLPLN
        KILVGNQ VKVPYKYLVPG DGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISK+KSVEFPELIFQFKGGAKW LPLN
Subjt:  KILVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISKDKSVEFPELIFQFKGGAKWVLPLN

Query:  NYFALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS
        NYFALVSSSGVACLTVVTHK A+GGG GPSVILGAFQQQNFYVEYDLVNERLGFRQQ+CS
Subjt:  NYFALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS

XP_022940517.1 probable aspartyl protease At4g16563 [Cucurbita moschata] (e-value: 2.9e-246; identity: 92.83%)
Query:  MAAPPSLCFFYILLFSSVSAIV--ISNPITLPLSAFPHSFSSDPLQTLNFLASASQNRAHQIKTPKSQSNNSVSKSPLSPHSYGAYSAPLSFGTPPQTLH
        MAAPP LCF YILL  SVSAIV   +N ITLPLSA PH  SSDPLQ LNFLASASQNRAHQIKTPKS   NSVSKSPLSPHSYGAYSAPLSFGTPPQTLH
Subjt:  MAAPPSLCFFYILLFSSVSAIV--ISNPITLPLSAFPHSFSSDPLQTLNFLASASQNRAHQIKTPKSQSNNSVSKSPLSPHSYGAYSAPLSFGTPPQTLH

Query:  LIFDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAYVVQYGSGSTAGLLLSET
        LIFDTGSSLVWFPCTSKYLCS+CSFPKIDP  IPRFVPKLSSSSKL+GCQNPKCAW+FGPDVKSQCR+CNPKT+NCTQTCPAY VQYGSGSTAGLLLSET
Subjt:  LIFDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAYVVQYGSGSTAGLLLSET

Query:  LDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGGGKTGDLTYTPFRQNPSVSNHAYKEYYYLSIR
        LDFPD+KIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGG KTGDLTYTPFRQNPSVSNHAYKEYYYLSIR
Subjt:  LDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGGGKTGDLTYTPFRQNPSVSNHAYKEYYYLSIR

Query:  KILVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISKDKSVEFPELIFQFKGGAKWVLPLN
        KILVGNQ VKVPYKYLVPG DGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISK+KSVEFPELIFQFKGGAKW LPLN
Subjt:  KILVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISKDKSVEFPELIFQFKGGAKWVLPLN

Query:  NYFALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS
        NYFALVSSSGVACLTVVTHK AAGGG GPSVILGAFQQQNFYVEYDLVNERLGFRQQ+CS
Subjt:  NYFALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS

XP_023005632.1 probable aspartyl protease At4g16563 [Cucurbita maxima] (e-value: 3.5e-244; identity: 92.61%)
Query:  MAAPPSLCFFYILLFSSVSAI--VISNPITLPLSAFPHSFSSDPLQTLNFLASASQNRAHQIKTPKSQSNNSVSKSPLSPHSYGAYSAPLSFGTPPQTLH
        MAAPP LCF YILL  SVSAI  V +N ITLPLSAFPH  SSDPLQ LNFLASASQNRAHQIKTPKS   NSVSKS LSPHSYGAYSAPLSFGTPPQTLH
Subjt:  MAAPPSLCFFYILLFSSVSAI--VISNPITLPLSAFPHSFSSDPLQTLNFLASASQNRAHQIKTPKSQSNNSVSKSPLSPHSYGAYSAPLSFGTPPQTLH

Query:  LIFDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAYVVQYGSGSTAGLLLSET
        LIFDTGSSLVWFPCTSKYLCS+CSFPKIDP  IPRFVPKLSSSSKL+GCQNPKC W+FGPDVKSQCR+CN KT+NCTQTCPAYVVQYGSGSTAGLLLSET
Subjt:  LIFDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAYVVQYGSGSTAGLLLSET

Query:  LDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGGGKTGDLTYTPFRQNPSVSNHAYKEYYYLSIR
        LDFPD+KIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGG KTGDLTYTPFRQNPSVSNHAYKEYYYLSIR
Subjt:  LDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGGGKTGDLTYTPFRQNPSVSNHAYKEYYYLSIR

Query:  KILVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISKDKSVEFPELIFQFKGGAKWVLPLN
        KILVGNQ VKVPYKYLVPG DGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISK+KSVEFPELIFQFKGGAKW L LN
Subjt:  KILVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISKDKSVEFPELIFQFKGGAKWVLPLN

Query:  NYFALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS
        NYFALVSSSGVACLTVVTHK AAGGG GPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS
Subjt:  NYFALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS

XP_023540611.1 probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo] (e-value: 4.9e-246; identity: 93.04%)
Query:  MAAPPSLCFFYILLFSSVSAIV--ISNPITLPLSAFPHSFSSDPLQTLNFLASASQNRAHQIKTPKSQSNNSVSKSPLSPHSYGAYSAPLSFGTPPQTLH
        MAAPP LCF YILL  SVSAIV   +N ITLPLSAFPH  SSDPLQ LNFLASASQNRAHQIKTPKS   NSVSKSPLSPHSYGAYSAPLSFGTPPQTLH
Subjt:  MAAPPSLCFFYILLFSSVSAIV--ISNPITLPLSAFPHSFSSDPLQTLNFLASASQNRAHQIKTPKSQSNNSVSKSPLSPHSYGAYSAPLSFGTPPQTLH

Query:  LIFDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAYVVQYGSGSTAGLLLSET
        LIFDTGSSLVWFPCTSKYLCS+CSFPKIDP  IPRFVPKLSSSSKL+GCQNPKCAW+FGPDVKSQCR+CN KT+NCTQTCPAYVVQYGSGSTAGLLLSET
Subjt:  LIFDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAYVVQYGSGSTAGLLLSET

Query:  LDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGGGKTGDLTYTPFRQNPSVSNHAYKEYYYLSIR
        LDFPD+KIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGG KTGDLTYTPFRQNPSVSNHAYKEYYYLSIR
Subjt:  LDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGGGKTGDLTYTPFRQNPSVSNHAYKEYYYLSIR

Query:  KILVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISKDKSVEFPELIFQFKGGAKWVLPLN
        KILVGNQ VKVPYKYLVPG DGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGL PCFDISK+KSVEFPELIFQFKGGAKW LPLN
Subjt:  KILVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISKDKSVEFPELIFQFKGGAKWVLPLN

Query:  NYFALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS
        NYFALVSSSGVACLTVVTHK AAGGG GPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS
Subjt:  NYFALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7TRK2 Aspartic proteinase nepenthesin-2 (e-value: 1.3e-236; identity: 88.43%)
Query:  MAAPPSLCFFYILLFSSVSAIVISNPITLPLSAFPHSFSSDPLQTLNFLASASQNRAHQIKTPKSQSNNSVSKSPLSPHSYGAYSAPLSFGTPPQTLHLI
        MA+P  L FFYILLFSS+SAI  SNPITLPL++ PH  SSDPLQ L FLASAS+NRAH+IKTPKS   NSVSKSPLSPHSYGAYS PLSFGTP QTLHLI
Subjt:  MAAPPSLCFFYILLFSSVSAIVISNPITLPLSAFPHSFSSDPLQTLNFLASASQNRAHQIKTPKSQSNNSVSKSPLSPHSYGAYSAPLSFGTPPQTLHLI

Query:  FDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAYVVQYGSGSTAGLLLSETLD
        FDTGSSLVWFPCTS+YLC+ECSFPKIDP GIPRFVPKLSSSSKL+GCQNPKCAWIFGPDVKSQCRSCNPKT+NCTQTCPAYVVQYGSGSTAGLLLSETLD
Subjt:  FDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAYVVQYGSGSTAGLLLSETLD

Query:  FPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGGGKTGDLTYTPFRQNPSVSNHAYKEYYYLSIRKI
        FP+KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDS HSG+LILDS G KT  LTYT FRQNPSVSNHAYKEYYYL+IRKI
Subjt:  FPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGGGKTGDLTYTPFRQNPSVSNHAYKEYYYLSIRKI

Query:  LVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISKDKSVEFPELIFQFKGGAKWVLPLNNY
        +VGNQAVKVPYKYLVPGPDG+GGSIIDSGSTFTFMDKPV + VA+ FEKQLANRTRATDVE+ TGLRPCFD+SK+KSVEFPELIFQFKGGAKW LPLNNY
Subjt:  LVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISKDKSVEFPELIFQFKGGAKWVLPLNNY

Query:  FALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS
        FALVSSSGVACLTVVTH    GGGGGPSVILGAFQQQNFYVEYDLVNERLGFR+QTC+
Subjt:  FALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS

A0A6J1F3G5 probable aspartyl protease At4g16563 (e-value: 3.7e-239; identity: 89.74%)
Query:  MAAPPSLCFFYILLFSSVSAIVISNPITLPLSAFPHSFSSDPLQTLNFLASASQNRAHQIKTPKSQSNNSVSKSPLSPHSYGAYSAPLSFGTPPQTLHLI
        MA PP LCFFYILL SSVSAI  +NPITLPLS+FPH  SSDPLQTLNFLASASQNRAHQIK PKS+S NSVSKSPLSPHSYGAYS PLSFGTP QTLHLI
Subjt:  MAAPPSLCFFYILLFSSVSAIVISNPITLPLSAFPHSFSSDPLQTLNFLASASQNRAHQIKTPKSQSNNSVSKSPLSPHSYGAYSAPLSFGTPPQTLHLI

Query:  FDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAYVVQYGSGSTAGLLLSETLD
        FDTGSSLVW PCTSKYLCSECSFPKIDPA IPRF+PKLSSSSKL+GCQNPKCAWIFGPDVKSQCRSCNPKT+NCTQTCPAYVVQYGSGSTAGLLLSETLD
Subjt:  FDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAYVVQYGSGSTAGLLLSETLD

Query:  FPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGGGKTGDLTYTPFRQNPSVSNHAYKEYYYLSIRKI
        FP+KKI NFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPH+GELILDS G KT  LTYTPFRQNPSVSNHAYKEYYYL+IRKI
Subjt:  FPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGGGKTGDLTYTPFRQNPSVSNHAYKEYYYLSIRKI

Query:  LVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISKDKSVEFPELIFQFKGGAKWVLPLNNY
         VGN+AVKVPYKYLVPGPDG+GGSIIDSGSTFTFMDKPVFEAVA+  EKQLANRTRATDVES TGLRPCFDISKDKSVEFPEL F  KGGAKW  PL+NY
Subjt:  LVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISKDKSVEFPELIFQFKGGAKWVLPLNNY

Query:  FALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS
        FALVSSSGVACLTVVTHK AA  GGGPS+ILGAFQQQNFYVEYDLVN+++GFRQQTCS
Subjt:  FALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS

A0A6J1FJU4 probable aspartyl protease At4g16563 (e-value: 1.4e-246; identity: 92.83%)
Query:  MAAPPSLCFFYILLFSSVSAIV--ISNPITLPLSAFPHSFSSDPLQTLNFLASASQNRAHQIKTPKSQSNNSVSKSPLSPHSYGAYSAPLSFGTPPQTLH
        MAAPP LCF YILL  SVSAIV   +N ITLPLSA PH  SSDPLQ LNFLASASQNRAHQIKTPKS   NSVSKSPLSPHSYGAYSAPLSFGTPPQTLH
Subjt:  MAAPPSLCFFYILLFSSVSAIV--ISNPITLPLSAFPHSFSSDPLQTLNFLASASQNRAHQIKTPKSQSNNSVSKSPLSPHSYGAYSAPLSFGTPPQTLH

Query:  LIFDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAYVVQYGSGSTAGLLLSET
        LIFDTGSSLVWFPCTSKYLCS+CSFPKIDP  IPRFVPKLSSSSKL+GCQNPKCAW+FGPDVKSQCR+CNPKT+NCTQTCPAY VQYGSGSTAGLLLSET
Subjt:  LIFDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAYVVQYGSGSTAGLLLSET

Query:  LDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGGGKTGDLTYTPFRQNPSVSNHAYKEYYYLSIR
        LDFPD+KIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGG KTGDLTYTPFRQNPSVSNHAYKEYYYLSIR
Subjt:  LDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGGGKTGDLTYTPFRQNPSVSNHAYKEYYYLSIR

Query:  KILVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISKDKSVEFPELIFQFKGGAKWVLPLN
        KILVGNQ VKVPYKYLVPG DGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISK+KSVEFPELIFQFKGGAKW LPLN
Subjt:  KILVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISKDKSVEFPELIFQFKGGAKWVLPLN

Query:  NYFALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS
        NYFALVSSSGVACLTVVTHK AAGGG GPSVILGAFQQQNFYVEYDLVNERLGFRQQ+CS
Subjt:  NYFALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS

A0A6J1IXY3 probable aspartyl protease At4g16563 (e-value: 4.4e-240; identity: 89.52%)
Query:  MAAPPSLCFFYILLFSSVSAIVISNPITLPLSAFPHSFSSDPLQTLNFLASASQNRAHQIKTPKSQSNNSVSKSPLSPHSYGAYSAPLSFGTPPQTLHLI
        MA PP LCFFYILL SSVSAI  +NPIT+PLS+FPH  SSDPLQTLNFLASASQNRAHQIK PKS+S NSVSKSPLSPHSYGAYS PLSFGTPPQTLHLI
Subjt:  MAAPPSLCFFYILLFSSVSAIVISNPITLPLSAFPHSFSSDPLQTLNFLASASQNRAHQIKTPKSQSNNSVSKSPLSPHSYGAYSAPLSFGTPPQTLHLI

Query:  FDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAYVVQYGSGSTAGLLLSETLD
        FDTGSSLVW PCTSKYLCSECSFPKIDPAGIPRF+PKLSS+SKL+GCQNPKCAWIFGPDVKSQCRSCNPKT+NCTQTCPAYVVQYGSGSTAGLLLSETLD
Subjt:  FDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAYVVQYGSGSTAGLLLSETLD

Query:  FPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGGGKTGDLTYTPFRQNPSVSNHAYKEYYYLSIRKI
        FPDKK  NFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPH+GELILDS G KT  L+YTPFRQNPSVSNHAYKEYYYL+IRKI
Subjt:  FPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGGGKTGDLTYTPFRQNPSVSNHAYKEYYYLSIRKI

Query:  LVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISKDKSVEFPELIFQFKGGAKWVLPLNNY
         VG +AVKVPYKYLVPGPDG+GGSIIDSGSTFTFMDKPVFEAVA+  EKQLANRTRATDVES TGLRPCFDISKDKSVEFPEL FQ KGGAKW LPL+NY
Subjt:  LVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISKDKSVEFPELIFQFKGGAKWVLPLNNY

Query:  FALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS
        FALVSSSGVACLTVVTHK  A  GGGPS+ILGAFQQQNFYVEYDLVN+++GFRQQTCS
Subjt:  FALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS

A0A6J1KTP0 probable aspartyl protease At4g16563 (e-value: 1.7e-244; identity: 92.61%)
Query:  MAAPPSLCFFYILLFSSVSAI--VISNPITLPLSAFPHSFSSDPLQTLNFLASASQNRAHQIKTPKSQSNNSVSKSPLSPHSYGAYSAPLSFGTPPQTLH
        MAAPP LCF YILL  SVSAI  V +N ITLPLSAFPH  SSDPLQ LNFLASASQNRAHQIKTPKS   NSVSKS LSPHSYGAYSAPLSFGTPPQTLH
Subjt:  MAAPPSLCFFYILLFSSVSAI--VISNPITLPLSAFPHSFSSDPLQTLNFLASASQNRAHQIKTPKSQSNNSVSKSPLSPHSYGAYSAPLSFGTPPQTLH

Query:  LIFDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAYVVQYGSGSTAGLLLSET
        LIFDTGSSLVWFPCTSKYLCS+CSFPKIDP  IPRFVPKLSSSSKL+GCQNPKC W+FGPDVKSQCR+CN KT+NCTQTCPAYVVQYGSGSTAGLLLSET
Subjt:  LIFDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAYVVQYGSGSTAGLLLSET

Query:  LDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGGGKTGDLTYTPFRQNPSVSNHAYKEYYYLSIR
        LDFPD+KIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGG KTGDLTYTPFRQNPSVSNHAYKEYYYLSIR
Subjt:  LDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGGGKTGDLTYTPFRQNPSVSNHAYKEYYYLSIR

Query:  KILVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISKDKSVEFPELIFQFKGGAKWVLPLN
        KILVGNQ VKVPYKYLVPG DGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISK+KSVEFPELIFQFKGGAKW L LN
Subjt:  KILVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISKDKSVEFPELIFQFKGGAKWVLPLN

Query:  NYFALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS
        NYFALVSSSGVACLTVVTHK AAGGG GPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS
Subjt:  NYFALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS

SwissProt top hits (e-value, % identity, alignment)
Q6F4N5 Aspartyl protease 25 (e-value: 4.9e-31; identity: 26.71%)
Query:  MAAPPSLCFFYILLFSSVSAIVISNPITLPLSAFPHSFSSDPLQTLNFLASASQNRAHQIKTPKSQSNNSVSKSPL-SPHSYGAYSAPLSFGTPPQTLHL
        MAA  ++    +LL ++V+A          LS + +   S P    + +A A  + A  +      +   VS +P+ S  +  +Y      G+P Q L L
Subjt:  MAAPPSLCFFYILLFSSVSAIVISNPITLPLSAFPHSFSSDPLQTLNFLASASQNRAHQIKTPKSQSNNSVSKSPL-SPHSYGAYSAPLSFGTPPQTLHL

Query:  IFDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAYVVQYGSGSTAGLLLSETL
          DT +   W  C+    C  C    +       F P  SSS   + C +  C    G    +     +      T    A+   +   S    L S+TL
Subjt:  IFDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAYVVQYGSGSTAGLLLSETL

Query:  DFPDKKIPNFVVGCSFLSIHQPS------GIAGFGRGSESLPSQMGL---KKFAYCLASRKFDDSPHSGELILDSGGGKTGDLTYTPFRQNPSVSNHAYK
              IPN+  GC   S+  P+      G+ G GRG  +L SQ G      F+YCL S  +     SG L L +GGG+   + YTP  +NP  S+    
Subjt:  DFPDKKIPNFVVGCSFLSIHQPS------GIAGFGRGSESLPSQMGL---KKFAYCLASRKFDDSPHSGELILDSGGGKTGDLTYTPFRQNPSVSNHAYK

Query:  EYYYLSIRKILVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISKDKSVEFPELIFQFKGG
          YY+++  + VG+  VKVP            G+++DSG+  T    PV+ A+ E F +Q+A  +  T   S      CF+  +  +   P +     GG
Subjt:  EYYYLSIRKILVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISKDKSVEFPELIFQFKGG

Query:  AKWVLPLNNYFALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS
            LP+ N     S++ +ACL +     A         ++   QQQN  V +D+ N R+GF +++C+
Subjt:  AKWVLPLNNYFALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS

Q766C2 Aspartic proteinase nepenthesin-2 (e-value: 3.9e-36; identity: 33.42%)
Query:  GAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSKYLCSEC-SFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPA
        G Y   ++ GTP  +   I DTGS L+W  C     C++C S P       P F P+ SSS   + C++  C      D+ S+  +CN   + C      
Subjt:  GAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSKYLCSEC-SFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPA

Query:  YVVQYGSGSTA-GLLLSETLDFPDKKIPNFVVGC----SFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGGGKTGDLTY
        Y   YG GST  G + +ET  F    +PN   GC            +G+ G G G  SLPSQ+G+ +F+YC+ S     SP +  L   + G   G  + 
Subjt:  YVVQYGSGSTA-GLLLSETLDFPDKKIPNFVVGC----SFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGGGKTGDLTY

Query:  TPFRQ--NPSVSNHAYKEYYYLSIRKILVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDIS
        T      NP+        YYY++++ I VG   + +P        DG+GG IIDSG+T T++ +  + AVA+AF  Q+      T  ES++GL  CF   
Subjt:  TPFRQ--NPSVSNHAYKEYYYLSIRKILVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDIS

Query:  KDKS-VEFPELIFQFKGGAKWVLPLNNYFALVS-SSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTC
         D S V+ PE+  QF GG   VL L     L+S + GV CL +    G++   G    I G  QQQ   V YDL N  + F    C
Subjt:  KDKS-VEFPELIFQFKGGAKWVLPLNNYFALVS-SSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTC

Q766C3 Aspartic proteinase nepenthesin-1 (e-value: 4.8e-34; identity: 30.03%)
Query:  GAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAY
        G Y   LS GTP Q    I DTGS L+W  C     C   S P  +P G        SSS   + C +  C  +  P               C+     Y
Subjt:  GAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAY

Query:  VVQYGSGS-TAGLLLSETLDFPDKKIPNFVVGC----SFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGGGKTGDLTYT
           YG GS T G + +ETL F    IPN   GC            +G+ G GRG  SLPSQ+ + KF+YC+       S  S  L+       T     T
Subjt:  VVQYGSGS-TAGLLLSETLDFPDKKIPNFVVGC----SFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGGGKTGDLTYT

Query:  PFRQNPSVSNHAYKEYYYLSIRKILVGNQAVKV-PYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISKD
           Q+  +       +YY+++  + VG+  + + P  + +   +G+GG IIDSG+T T+     +++V + F  Q+          S++G   CF    D
Subjt:  PFRQNPSVSNHAYKEYYYLSIRKILVGNQAVKV-PYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISKD

Query:  KS-VEFPELIFQFKGGAKWVLPLNNYFALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTC
         S ++ P  +  F GG    LP  NYF +  S+G+ CL       A G       I G  QQQN  V YD  N  + F    C
Subjt:  KS-VEFPELIFQFKGGAKWVLPLNNYFALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTC

Q940R4 Probable aspartyl protease At4g16563 (e-value: 2.0e-48; identity: 30.53%)
Query:  ISNPITLPLSAFPHSF-----SSDPLQTLNFLASASQNRAHQIKTPKSQSNNSVSKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSKYL
        +S P+ L LS   HS      SS PL  L   +S S  R  +    + Q   S+   P+S  S   Y   LS G+    + L  DTGS LVWFPC   + 
Subjt:  ISNPITLPLSAFPHSF-----SSDPLQTLNFLASASQNRAHQIKTPKSQSNNSVSKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSKYL

Query:  CSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQ-CRSCN-----PKTDNCTQT---CPAYVVQYGSGSTAGLLLSETLDFPDKKIPN
        C  C    + P+        LSSS+  + C +P C+        S  C   N      +T +C  +   CP +   YG GS    L S++L  P   + N
Subjt:  CSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQ-CRSCN-----PKTDNCTQT---CPAYVVQYGSGSTAGLLLSETLDFPDKKIPN

Query:  FVVGCSFLSIHQPSGIAGFGRGSESLPSQMGL------KKFAYCLASRKFDDS--PHSGELIL--------------------DSGGGKTGDLTYTPFRQ
        F  GC+  ++ +P G+AGFGRG  SLP+Q+ +        F+YCL S  FD         LIL                    D    K  +  +T   +
Subjt:  FVVGCSFLSIHQPSGIAGFGRGSESLPSQMGL------KKFAYCLASRKFDDS--PHSGELIL--------------------DSGGGKTGDLTYTPFRQ

Query:  NPSVSNHAYKEYYYLSIRKILVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLAN-RTRATDVESATGLRPCFDISKDKSVE
        NP    H Y  +Y +S++ I +G + +  P        +G GG ++DSG+TFT +    + +V E F+ ++     RA  VE ++G+ PC+ +  +++V+
Subjt:  NPSVSNHAYKEYYYLSIRKILVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLAN-RTRATDVESATGLRPCFDISKDKSVE

Query:  FPELIFQFKGGAKWV-LPLNNYFALVSSSG--------VACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS
         P L+  F G    V LP  NYF      G        + CL ++     +   GG   ILG +QQQ F V YDL+N R+GF ++ C+
Subjt:  FPELIFQFKGGAKWV-LPLNNYFALVSSSG--------VACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS

Q9LNJ3 Aspartyl protease family protein 2 (e-value: 5.1e-36; identity: 30.94%)
Query:  PKSQSNNSVSKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKS
        P+    +S   S LS  S G Y   L  GTP + ++++ DTGS +VW  C     C  C + + DP     F P+ S +   I C +P C        + 
Subjt:  PKSQSNNSVSKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKS

Query:  QCRSCNPKTDNCTQTCPAYVVQYGSGS-TAGLLLSETLDFPDKKIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQMGLK---KFAYCLASRKFDDS
            CN +   C      Y V YG GS T G   +ETL F   ++    +GC   +       +G+ G G+G  S P Q G +   KF+YCL  R     
Subjt:  QCRSCNPKTDNCTQTCPAYVVQYGSGS-TAGLLLSETLDFPDKKIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQMGLK---KFAYCLASRKFDDS

Query:  PHSGELILDSGGGKTGDLTYTPFRQNPSVSNHAYKEYYYLSIRKILVGNQAVK-VPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANR
        P S   ++      +    +TP   NP +       +YY+ +  I VG   V  V          G+GG IIDSG++ T + +P + A+ +AF       
Subjt:  PHSGELILDSGGGKTGDLTYTPFRQNPSVSNHAYKEYYYLSIRKILVGNQAVK-VPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANR

Query:  TRATDVESATGLRPCFDISKDKSVEFPELIFQFKGGAKWVLPLNNYFALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQ
         RA D         CFD+S    V+ P ++  F+ GA   LP  NY   V ++G  C         AG  GG S+I G  QQQ F V YDL + R+GF  
Subjt:  TRATDVESATGLRPCFDISKDKSVEFPELIFQFKGGAKWVLPLNNYFALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQ

Query:  QTCS
          C+
Subjt:  QTCS

Arabidopsis top hits (e-value, % identity, alignment)
AT1G25510.1 Eukaryotic aspartyl protease family protein (e-value: 2.7e-40; identity: 30.29%)
Query:  GAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAY
        G Y   +  G P + ++++ DTGS + W  CT    C++C + + +P     F P  SSS + + C  P+C  +      S+CR+          TC  Y
Subjt:  GAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAY

Query:  VVQYGSGS-TAGLLLSETLDFPDKKIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGGGKTGDLTYTP
         V YG GS T G   +ETL      + N  VGC   +       +G+ G G G  +LPSQ+    F+YCL  R  D +       +D G   + D    P
Subjt:  VVQYGSGS-TAGLLLSETLDFPDKKIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGGGKTGDLTYTP

Query:  FRQNPSVSNHAYKEYYYLSIRKILVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLR---PCFDISK
              + NH    +YYL +  I VG + +++P         GSGG IIDSG+  T +   ++ ++ ++F K         D+E A G+     C+++S 
Subjt:  FRQNPSVSNHAYKEYYYLSIRKILVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLR---PCFDISK

Query:  DKSVEFPELIFQFKGGAKWVLPLNNYFALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTC
          +VE P + F F GG    LP  NY   V S G  CL       A         I+G  QQQ   V +DL N  +GF    C
Subjt:  DKSVEFPELIFQFKGGAKWVLPLNNYFALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTC

AT2G42980.1 Eukaryotic aspartyl protease family protein (e-value: 2.4e-41; identity: 32.33%)
Query:  GAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAY
        G Y   +  GTPP+   LI DTGS L W  C   Y C   +    D        PK S+S K I C +P+C+ I  PD   QC S N       Q+CP Y
Subjt:  GAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAY

Query:  VVQYGSGS-TAGLLLSETLDF---------PDKKIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQMGL---KKFAYCLASRKFDDSPHSGELILDS
           YG  S T G    ET             + K+ N + GC   +       SG+ G GRG  S  SQ+       F+YCL  R   ++  S +LI   
Subjt:  VVQYGSGS-TAGLLLSETLDF---------PDKKIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQMGL---KKFAYCLASRKFDDSPHSGELILDS

Query:  GGGKTGDL---TYTPFRQNPSVSNHAYKEYYYLSIRKILVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAF-EKQLANRTRATDVE
          G+  DL   T   F    +   ++ + +YY+ I+ ILVG +A+ +P +      DG GG+IIDSG+T ++  +P +E +   F EK   N     D  
Subjt:  GGGKTGDL---TYTPFRQNPSVSNHAYKEYYYLSIRKILVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAF-EKQLANRTRATDVE

Query:  SATGLRPCFDIS--KDKSVEFPELIFQFKGGAKWVLPLNNYFALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS
            L PCF++S  ++ ++  PEL   F  G  W  P  N F  +S   + CL ++      G       I+G +QQQNF++ YD    RLGF    C+
Subjt:  SATGLRPCFDIS--KDKSVEFPELIFQFKGGAKWVLPLNNYFALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS

AT3G52500.1 Eukaryotic aspartyl protease family protein (e-value: 2.4e-150; identity: 56.87%)
Query:  SLCFFYILLFSSVSAIVISNPITLPLSAFPHSFSS--DPLQTLNFLASASQNRAHQIK------------TPKSQSNNSVSKSPLSPHSYGAYSAPLSFG
        S+ FF+++  S VSA      + LPLS F HS  S  DP  +L  LA +S  RAH++K            +  + ++ +V KSPLS  SYG YS  LSFG
Subjt:  SLCFFYILLFSSVSAIVISNPITLPLSAFPHSFSS--DPLQTLNFLASASQNRAHQIK------------TPKSQSNNSVSKSPLSPHSYGAYSAPLSFG

Query:  TPPQTLHLIFDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAYVVQYGSGSTA
        TP QT+  +FDTGSSLVW PCTS+YLCS C F  +DP  IPRF+PK SSSSK+IGCQ+PKC +++GP+V  QCR C+P T NCT  CP Y++QYG GSTA
Subjt:  TPPQTLHLIFDTGSSLVWFPCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAYVVQYGSGSTA

Query:  GLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSG-----GGKTGDLTYTPFRQNPSVSN
        G+L++E LDFPD  +P+FVVGCS +S  QP+GIAGFGRG  SLPSQM LK+F++CL SR+FDD+  + +L LD+G     G KT  LTYTPFR+NP+VSN
Subjt:  GLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSG-----GGKTGDLTYTPFRQNPSVSN

Query:  HAYKEYYYLSIRKILVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISKDKSVEFPELIFQ
         A+ EYYYL++R+I VG + VK+PYKYL PG +G GGSI+DSGSTFTFM++PVFE VAE F  Q++N TR  D+E  TGL PCF+IS    V  PELIF+
Subjt:  HAYKEYYYLSIRKILVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISKDKSVEFPELIFQ

Query:  FKGGAKWVLPLNNYFALVSSSGVACLTVVTHKGA-AGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS
        FKGGAK  LPL+NYF  V ++   CLTVV+ K     GG GP++ILG+FQQQN+ VEYDL N+R GF ++ CS
Subjt:  FKGGAKWVLPLNNYFALVSSSGVACLTVVTHKGA-AGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS

AT4G16563.1 Eukaryotic aspartyl protease family protein (e-value: 1.4e-49; identity: 30.53%)
Query:  ISNPITLPLSAFPHSF-----SSDPLQTLNFLASASQNRAHQIKTPKSQSNNSVSKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSKYL
        +S P+ L LS   HS      SS PL  L   +S S  R  +    + Q   S+   P+S  S   Y   LS G+    + L  DTGS LVWFPC   + 
Subjt:  ISNPITLPLSAFPHSF-----SSDPLQTLNFLASASQNRAHQIKTPKSQSNNSVSKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTSKYL

Query:  CSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQ-CRSCN-----PKTDNCTQT---CPAYVVQYGSGSTAGLLLSETLDFPDKKIPN
        C  C    + P+        LSSS+  + C +P C+        S  C   N      +T +C  +   CP +   YG GS    L S++L  P   + N
Subjt:  CSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQ-CRSCN-----PKTDNCTQT---CPAYVVQYGSGSTAGLLLSETLDFPDKKIPN

Query:  FVVGCSFLSIHQPSGIAGFGRGSESLPSQMGL------KKFAYCLASRKFDDS--PHSGELIL--------------------DSGGGKTGDLTYTPFRQ
        F  GC+  ++ +P G+AGFGRG  SLP+Q+ +        F+YCL S  FD         LIL                    D    K  +  +T   +
Subjt:  FVVGCSFLSIHQPSGIAGFGRGSESLPSQMGL------KKFAYCLASRKFDDS--PHSGELIL--------------------DSGGGKTGDLTYTPFRQ

Query:  NPSVSNHAYKEYYYLSIRKILVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLAN-RTRATDVESATGLRPCFDISKDKSVE
        NP    H Y  +Y +S++ I +G + +  P        +G GG ++DSG+TFT +    + +V E F+ ++     RA  VE ++G+ PC+ +  +++V+
Subjt:  NPSVSNHAYKEYYYLSIRKILVGNQAVKVPYKYLVPGPDGSGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLAN-RTRATDVESATGLRPCFDISKDKSVE

Query:  FPELIFQFKGGAKWV-LPLNNYFALVSSSG--------VACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS
         P L+  F G    V LP  NYF      G        + CL ++     +   GG   ILG +QQQ F V YDL+N R+GF ++ C+
Subjt:  FPELIFQFKGGAKWV-LPLNNYFALVSSSG--------VACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS

AT5G45120.1 Eukaryotic aspartyl protease family protein (e-value: 3.2e-49; identity: 30.8%)
Query:  NFLASASQNRAHQIKTPKSQSNNSVSKSPLSP---------HSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTS-KYLCSECSFPKIDPAGIPR-FV
        +FL       +  + TPKSQ+   + K PLS               Y   L+ GTPPQ + +  DTGS L W PC +  + C EC   K +    P  F 
Subjt:  NFLASASQNRAHQIKTPKSQSNNSVSKSPLSP---------HSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWFPCTS-KYLCSECSFPKIDPAGIPR-FV

Query:  PKLSSSSKLIGCQNPKCAWI------FGPDVKSQCRSCNPKTDNCTQTCPAYVVQYGSGS-TAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFG
        P  SS+S    C +  C  I      F P   + C         C + CP++   YG G   +G+L  + L    + +P F  GC   +  +P GIAGFG
Subjt:  PKLSSSSKLIGCQNPKCAWI------FGPDVKSQCRSCNPKTDNCTQTCPAYVVQYGSGS-TAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFG

Query:  RGSESLPSQMGL--KKFAYCLASRKFDDSPH-SGELILDSGG---GKTGDLTYTPFRQNPSVSNHAYKEYYYLSIRKILVGNQ--AVKVPYKYLVPGPDG
        RG  SLPSQ+G   K F++C    KF ++P+ S  LIL +       T  L +TP    P      Y   YY+ +  I +G      +VP         G
Subjt:  RGSESLPSQMGL--KKFAYCLASRKFDDSPH-SGELILDSGG---GKTGDLTYTPFRQNPSVSNHAYKEYYYLSIRKILVGNQ--AVKVPYKYLVPGPDG

Query:  SGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCF----------DISKDKSVEFPELIFQFKGGAKWVLPL-NNYFALVSSSGV
        +GG ++DSG+T+T + +P +  +    +  +    RAT+ ES TG   C+           +  D  + FP + F F   A  +LP  N+++A+ + S  
Subjt:  SGGSIIDSGSTFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCF----------DISKDKSVEFPELIFQFKGGAKWVLPL-NNYFALVSSSGV

Query:  ACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTC
        + +  +  +    G  GP+ + G+FQQQN  V YDL  ER+GF+   C
Subjt:  ACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTC
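
In the alignments above, the unlabeled line between each Query and Subjt pair is the match line. The sketch below counts identities and positives from one such line. It assumes the usual BLAST-style convention (a letter marks an identical residue, "+" a conservative substitution, a space a mismatch or gap), and the function name is hypothetical. The example strings are the last block of the first GenBank hit (KAG6596604.1).

```python
def midline_stats(query, midline):
    """Count identities and positives for one alignment block.

    Assumed convention: in the match line, a letter marks an identical
    residue, '+' marks a conservative substitution, and a space marks a
    mismatch or gap. The match line is padded to the query length in case
    trailing spaces were lost when the text was copied.
    """
    columns = len(query)
    midline = midline.ljust(columns)[:columns]
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, columns


query = "NYFALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCS"
midline = "NYFALVSSSGVACLTVVTHK A+GGG GPSVILGAFQQQNFYVEYDLVNERLGFRQQ+CS"
identities, positives, columns = midline_stats(query, midline)
print(f"{identities}/{columns} identical, {positives}/{columns} positive")
```

This covers a single block only. The % identity reported for each hit is computed over the whole alignment, so it will generally differ from the figure for any one block.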


Sequences
CDS sequence
ATGGCGGCTCCTCCCTCTCTCTGTTTCTTCTACATTCTCCTCTTCTCCTCTGTTTCCGCCATTGTCATCAGCAACCCCATCACCCTCCCTCTCTCCGCTTTCCCCCACTC
TTTTTCTTCAGATCCACTTCAAACTCTCAATTTCCTCGCCTCTGCTTCCCAGAACAGAGCCCATCAAATCAAAACCCCCAAATCTCAATCCAACAACTCCGTTTCCAAAT
CCCCTCTCTCCCCCCATAGCTATGGAGCTTACTCTGCTCCACTCAGCTTCGGTACTCCACCGCAGACTCTCCATTTGATCTTCGATACAGGTAGCAGCCTCGTTTGGTTT
CCTTGCACTTCCAAATATCTCTGTTCCGAATGTTCTTTCCCCAAAATAGATCCCGCCGGAATCCCCAGATTTGTCCCCAAATTGTCTTCTTCTTCGAAGCTTATCGGCTG
CCAGAATCCCAAATGTGCGTGGATTTTTGGCCCAGATGTGAAGTCTCAGTGCCGGAGTTGTAACCCCAAAACAGACAACTGTACCCAAACTTGCCCTGCTTATGTTGTTC
AGTATGGGTCCGGCTCCACCGCGGGGCTTTTGTTATCGGAGACCCTGGATTTTCCCGATAAGAAAATCCCCAATTTCGTTGTTGGGTGTTCGTTTTTGTCGATCCATCAG
CCCTCTGGAATCGCTGGATTCGGCCGAGGATCCGAATCGCTGCCCTCGCAAATGGGTCTGAAGAAATTTGCTTACTGTCTCGCGTCTCGGAAATTCGACGACTCGCCGCA
TTCCGGCGAGCTGATTCTGGATTCTGGCGGGGGAAAGACCGGCGATCTGACCTACACGCCATTCCGGCAGAACCCTTCTGTTTCTAACCACGCTTATAAGGAATATTATT
ACTTATCTATACGCAAAATCCTCGTCGGAAACCAGGCTGTGAAGGTGCCGTACAAGTATCTGGTGCCGGGACCCGACGGCAGCGGGGGATCAATCATCGATTCCGGCTCG
ACCTTCACGTTTATGGACAAACCGGTGTTCGAGGCGGTGGCGGAAGCGTTCGAGAAGCAGTTGGCGAATCGGACGAGAGCCACCGATGTGGAATCTGCCACCGGATTACG
GCCGTGTTTTGACATTTCAAAGGACAAATCGGTGGAGTTTCCGGAGTTGATTTTCCAGTTTAAAGGCGGAGCGAAATGGGTTCTGCCATTGAATAACTATTTCGCTTTGG
TCAGTAGCTCCGGCGTGGCGTGTTTGACGGTTGTTACGCATAAGGGGGCGGCGGGCGGCGGCGGTGGGCCGTCTGTGATTTTGGGGGCTTTTCAGCAGCAGAATTTCTAT
GTGGAGTATGATTTGGTGAATGAAAGATTAGGATTTCGGCAACAGACTTGCAGTTAG
mRNA sequence
CTCTTCTTCCTCTTCCAATTTATACTCCAATCCTCTCATCAACTCAACTTCCATTCCCTTCATACATTTCTAAATCCAACCCACTCCATAAAAATCAAATCACTATTACA
AGAACCAAGAAGCTCAACCAAGCTTCTTCCACTTCCATGGCGGCTCCTCCCTCTCTCTGTTTCTTCTACATTCTCCTCTTCTCCTCTGTTTCCGCCATTGTCATCAGCAA
CCCCATCACCCTCCCTCTCTCCGCTTTCCCCCACTCTTTTTCTTCAGATCCACTTCAAACTCTCAATTTCCTCGCCTCTGCTTCCCAGAACAGAGCCCATCAAATCAAAA
CCCCCAAATCTCAATCCAACAACTCCGTTTCCAAATCCCCTCTCTCCCCCCATAGCTATGGAGCTTACTCTGCTCCACTCAGCTTCGGTACTCCACCGCAGACTCTCCAT
TTGATCTTCGATACAGGTAGCAGCCTCGTTTGGTTTCCTTGCACTTCCAAATATCTCTGTTCCGAATGTTCTTTCCCCAAAATAGATCCCGCCGGAATCCCCAGATTTGT
CCCCAAATTGTCTTCTTCTTCGAAGCTTATCGGCTGCCAGAATCCCAAATGTGCGTGGATTTTTGGCCCAGATGTGAAGTCTCAGTGCCGGAGTTGTAACCCCAAAACAG
ACAACTGTACCCAAACTTGCCCTGCTTATGTTGTTCAGTATGGGTCCGGCTCCACCGCGGGGCTTTTGTTATCGGAGACCCTGGATTTTCCCGATAAGAAAATCCCCAAT
TTCGTTGTTGGGTGTTCGTTTTTGTCGATCCATCAGCCCTCTGGAATCGCTGGATTCGGCCGAGGATCCGAATCGCTGCCCTCGCAAATGGGTCTGAAGAAATTTGCTTA
CTGTCTCGCGTCTCGGAAATTCGACGACTCGCCGCATTCCGGCGAGCTGATTCTGGATTCTGGCGGGGGAAAGACCGGCGATCTGACCTACACGCCATTCCGGCAGAACC
CTTCTGTTTCTAACCACGCTTATAAGGAATATTATTACTTATCTATACGCAAAATCCTCGTCGGAAACCAGGCTGTGAAGGTGCCGTACAAGTATCTGGTGCCGGGACCC
GACGGCAGCGGGGGATCAATCATCGATTCCGGCTCGACCTTCACGTTTATGGACAAACCGGTGTTCGAGGCGGTGGCGGAAGCGTTCGAGAAGCAGTTGGCGAATCGGAC
GAGAGCCACCGATGTGGAATCTGCCACCGGATTACGGCCGTGTTTTGACATTTCAAAGGACAAATCGGTGGAGTTTCCGGAGTTGATTTTCCAGTTTAAAGGCGGAGCGA
AATGGGTTCTGCCATTGAATAACTATTTCGCTTTGGTCAGTAGCTCCGGCGTGGCGTGTTTGACGGTTGTTACGCATAAGGGGGCGGCGGGCGGCGGCGGTGGGCCGTCT
GTGATTTTGGGGGCTTTTCAGCAGCAGAATTTCTATGTGGAGTATGATTTGGTGAATGAAAGATTAGGATTTCGGCAACAGACTTGCAGTTAGAAACGGTGTCGTTTCGG
TGTATGTACGTTGTTATGTTTTCTAACTGGGTGGTGGCGGCCGGTCAAGTCACCGGAGCTACGTCGGTGGTTTCAATTTGACAGTGGCGGCGTTCAGTCAATTCAAAATC
ATTTTTTTTTTCACTTTTCTTGTAATAATTATTCGATGTTGTTGCAACTGTTGTAACTTGTAATGATATATCTCATATTTCATAATTCATAAGCAAATCTTTTTGCAGAA
TTCCTTAAAATTAAAAACATATATAAAATATTTGCAATCTTACCTTTCATGCTTGGGAAGAGAAATAGCCGTTATTCTCATCCTTCTCCTCTCATTTTCCGTACTTTATT
TTGAATTTTGATTGTCAAATCGGTTTGTTTAGTAACTTATAATTAATATCTAGTTTTTTTTAAGTAGATTAACCATTGAGAGTAACAGAATTCGAACTTATCACCTCTTA
GTAACTATTATAATTAAGTATCTATTTGAACATTGGAG
Protein sequence
MAAPPSLCFFYILLFSSVSAIVISNPITLPLSAFPHSFSSDPLQTLNFLASASQNRAHQIKTPKSQSNNSVSKSPLSPHSYGAYSAPLSFGTPPQTLHLIFDTGSSLVWF
PCTSKYLCSECSFPKIDPAGIPRFVPKLSSSSKLIGCQNPKCAWIFGPDVKSQCRSCNPKTDNCTQTCPAYVVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQ
PSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSGGGKTGDLTYTPFRQNPSVSNHAYKEYYYLSIRKILVGNQAVKVPYKYLVPGPDGSGGSIIDSGS
TFTFMDKPVFEAVAEAFEKQLANRTRATDVESATGLRPCFDISKDKSVEFPELIFQFKGGAKWVLPLNNYFALVSSSGVACLTVVTHKGAAGGGGGPSVILGAFQQQNFY
VEYDLVNERLGFRQQTCS
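
The CDS and protein sequences listed in this record are related by the standard genetic code. As a minimal sketch, the code below translates the first 30 nucleotides of the CDS and can be used to check them against the start of the protein sequence. The `translate` helper is hypothetical and not part of any database tooling. It assumes the standard code, an in-frame input, and unambiguous bases (A, C, G, T only).

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 30 nucleotides of the CDS sequence in this record.
cds_start = "ATGGCGGCTCCTCCCTCTCTCTGTTTCTTC"
print(translate(cds_start))
```

The same function can be applied to the full CDS once its lines are joined into one string. The final codon, TAG, translates to the stop symbol `*`.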