CuGenDBv2

Cla97C10G201480 (gene) of Watermelon (97103) v2.5 genome

Gene ID: Cla97C10G201480
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Eukaryotic aspartyl protease family protein
Genome location: Cla97Chr10:31535864..31538903
RNA-Seq Expression: Cla97C10G201480
Synteny: Cla97C10G201480
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domains:
IPR001461 - Aspartic peptidase A1 family
IPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant
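
The genome location listed for this gene follows a `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention, but not stated on this page), the span of the locus could be computed as follows. The function name `parse_location` is hypothetical and not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("Cla97Chr10:31535864..31538903")
# Assumption: coordinates are 1-based and inclusive, hence the +1.
span = end - start + 1
print(chrom, span)
```

Under that assumption the locus spans 3,040 bp on Cla97Chr10.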


Homology
GenBank top hits (e value, % identity, alignment)
KAG6580736.1 Aspartyl protease 25, partial [Cucurbita argyrosperma subsp. sororia] (e value 9.8e-241, identity 81.66%)
Query:  LTALSLL--CSSS------PFILQTPLSQPSHQHNSLSLHTFPIHNY----IRITLQAGRSQPK-----ASSMASPSPLSFFYILLFSSVSSIANTNPIT
        LT LSLL  CS++      PF L  P+  P      + +   P  ++     RIT +  + + K     ASSMA P  L FFYILL SSVS+IA+TNPIT
Subjt:  LTALSLL--CSSS------PFILQTPLSQPSHQHNSLSLHTFPIHNY----IRITLQAGRSQPK-----ASSMASPSPLSFFYILLFSSVSSIANTNPIT

Query:  LPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTP--KSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPA
        LPL +FPH  SSDPLQTL FLASASQNRAHQIK P  KSNSVSKSPLSPHSYGAYSTPLSFGTP QTLHLIFDTGSSLVW PCTS+YLCSECSFPKIDPA
Subjt:  LPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTP--KSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPA

Query:  GIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGR
         IPRF+PKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFP+KKI NFVVGCSFLSIHQPSGIAGFGR
Subjt:  GIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGR

Query:  GSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVGNQAVKVLYKYLVPGPDGNGGSIIDSG
        GSESLPSQMGLKKFAYCLASRKFDDSPH+GELILDS+G KT GLTYTPFRQNPSVSNHAYKEYYYL+IRKI VGN+AVKV YKYLVPGPDGNGGSIIDSG
Subjt:  GSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVGNQAVKVLYKYLVPGPDGNGGSIIDSG

Query:  STFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFALVSSSGVACLTVVTHKTEAGGGGGPSV
        STFTFMDKPVFEAVAQE EKQLANRTRATDVESLTGLRPCFDISKDKSV+FPEL F  KGGAKWALPLSNYFALVSSSGVACLTVVTHK  A  GGGPS+
Subjt:  STFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFALVSSSGVACLTVVTHKTEAGGGGGPSV

Query:  ILGAFQQQNFYVEYDLVNERLGFRQQTCT
        ILGAFQQQN YVE+DLVN+++GFRQQTC+
Subjt:  ILGAFQQQNFYVEYDLVNERLGFRQQTCT

XP_004136706.1 probable aspartyl protease At4g16563 [Cucumis sativus] (e value 9.1e-247, identity 91.68%)
Query:  MASPSPLSFFYILLFSSVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDT
        MASPSPLSFFY+LLFSS+S+IA++NPITLPL++FPHL S DPLQ LTFLAS+SQ RAHQIKTPKSNSV KSPLSPHSYGAYSTPLSFGTPQQTLHLIFDT
Subjt:  MASPSPLSFFYILLFSSVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDT

Query:  GSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPD
        GSSLVWFPCTSRYLCSECSFPKIDP GIPRFVPKLSSSSKLVGCQNPKC+WIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPD
Subjt:  GSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPD

Query:  KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVG
        KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSG+LILDSTGVK+ GLTYTPFRQNPSVSN+AYKEYYYL+IRKI+VG
Subjt:  KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVG

Query:  NQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFAL
        NQAVKV YK+LVPGPDGNGGSIIDSGSTFTFMDKPV E VA+EFEKQLAN TRATDVE+LTGLRPCFDISK+KSV FPELIFQFKGGAKWALPL+NYFAL
Subjt:  NQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFAL

Query:  VSSSGVACLTVVTHKTE--AGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT
        VSSSGVACLTVVTH+ E   GGGGGPSVILGAFQQQNFYVEYDLVN+RLGFRQQTC+
Subjt:  VSSSGVACLTVVTHKTE--AGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT

XP_008442902.1 PREDICTED: aspartic proteinase nepenthesin-2 [Cucumis melo] (e value 5.2e-250, identity 92.75%)
Query:  MASPSPLSFFYILLFSSVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDT
        MASPSPLSFFYILLFSS+S+I+N+NPITLPL++ PHL SSDPLQ LTFLASAS+NRAH+IKTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDT
Subjt:  MASPSPLSFFYILLFSSVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDT

Query:  GSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPD
        GSSLVWFPCTSRYLC+ECSFPKIDP GIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFP+
Subjt:  GSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPD

Query:  KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVG
        KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDS HSG+LILDS+GVKT GLTYT FRQNPSVSNHAYKEYYYL+IRKI+VG
Subjt:  KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVG

Query:  NQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFAL
        NQAVKV YKYLVPGPDGNGGSIIDSGSTFTFMDKPV + VAQEFEKQLANRTRATDVE+LTGLRPCFD+SK+KSV+FPELIFQFKGGAKWALPL+NYFAL
Subjt:  NQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFAL

Query:  VSSSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT
        VSSSGVACLTVVTH TE GGGGGPSVILGAFQQQNFYVEYDLVNERLGFR+QTCT
Subjt:  VSSSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT

XP_022982947.1 probable aspartyl protease At4g16563 [Cucurbita maxima] (e value 5.7e-241, identity 90.15%)
Query:  MASPSPLSFFYILLFSSVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTPK--SNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIF
        MA P PL FFYILL SSVS+IA+TNPIT+PL +FPH  SSDPLQTL FLASASQNRAHQIK PK  SNSVSKSPLSPHSYGAYSTPLSFGTP QTLHLIF
Subjt:  MASPSPLSFFYILLFSSVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTPK--SNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIF

Query:  DTGSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDF
        DTGSSLVW PCTS+YLCSECSFPKIDPAGIPRF+PKLSS+SKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDF
Subjt:  DTGSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDF

Query:  PDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKIL
        PDKK  NFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPH+GELILDS+G KT GL+YTPFRQNPSVSNHAYKEYYYL+IRKI 
Subjt:  PDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKIL

Query:  VGNQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYF
        VG +AVKV YKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQE EKQLANRTRATDVESLTGLRPCFDISKDKSV+FPEL FQ KGGAKW LPLSNYF
Subjt:  VGNQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYF

Query:  ALVSSSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT
        ALVSSSGVACLTVVTHKT A  GGGPS+ILGAFQQQNFYVEYDLVN+++GFRQQTC+
Subjt:  ALVSSSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT

XP_038905730.1 probable aspartyl protease At4g16563 [Benincasa hispida] (e value 2.0e-257, identity 96.26%)
Query:  MASPSPLSFFYILLFSSVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDT
        MA PS LSFFYILLFSSVS+IANTNPITLPL+AFPHL SSDPLQTLTFLASASQNRAHQIKTPKSNSVSKSPL PHSYGAYSTPLSFGTPQQTLHLIFDT
Subjt:  MASPSPLSFFYILLFSSVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDT

Query:  GSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPD
        GSSLVWFPCTSRYLCSECSFPKIDP GIPRFVPKLSSSSKLVGCQNPKCAWIFGP+VKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPD
Subjt:  GSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPD

Query:  KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVG
        KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSTGVKT GL+YTPFRQNPSVSNHAYKEYYYL+IRKI VG
Subjt:  KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVG

Query:  NQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFAL
        NQAVKV YK+LVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKSV+FPELIFQFKGGAKWALPLSNYFAL
Subjt:  NQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFAL

Query:  VSSSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT
        VSSSGVACLTVVTHKTEAGGGGGPSVI GAFQQQNFYVEYDLVNE+LGFRQQTCT
Subjt:  VSSSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT
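
In BLAST-style alignments like the ones above, the middle line repeats the residue letter where query and subject are identical, shows `+` for a conservative substitution, and leaves a space for other mismatches or gap columns. The sketch below counts identities and positives for a single alignment block (the last block of the Benincasa hispida hit). It is only an illustration: the % identity reported by the database is computed over the whole alignment, not over one block, and the function name `midline_stats` is hypothetical.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment block."""
    # Pad the midline in case trailing spaces were stripped during copying.
    midline = midline.ljust(len(query))
    # Letters on the midline mark identical residues.
    identities = sum(ch.isalpha() for ch in midline)
    # '+' marks a conservative (positive-scoring) substitution.
    positives = identities + midline.count("+")
    return identities, positives, len(query)


query = "VSSSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT"
midline = "VSSSGVACLTVVTHKTEAGGGGGPSVI GAFQQQNFYVEYDLVNE+LGFRQQTCT"

identities, positives, columns = midline_stats(query, midline)
print(f"{identities}/{columns} identical, {positives}/{columns} positive")
```

For this block the counts are 53 identities and 54 positives over 55 columns.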

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LBI9 Peptidase A1 domain-containing protein (e value 4.4e-247, identity 91.68%)
Query:  MASPSPLSFFYILLFSSVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDT
        MASPSPLSFFY+LLFSS+S+IA++NPITLPL++FPHL S DPLQ LTFLAS+SQ RAHQIKTPKSNSV KSPLSPHSYGAYSTPLSFGTPQQTLHLIFDT
Subjt:  MASPSPLSFFYILLFSSVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDT

Query:  GSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPD
        GSSLVWFPCTSRYLCSECSFPKIDP GIPRFVPKLSSSSKLVGCQNPKC+WIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPD
Subjt:  GSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPD

Query:  KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVG
        KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSG+LILDSTGVK+ GLTYTPFRQNPSVSN+AYKEYYYL+IRKI+VG
Subjt:  KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVG

Query:  NQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFAL
        NQAVKV YK+LVPGPDGNGGSIIDSGSTFTFMDKPV E VA+EFEKQLAN TRATDVE+LTGLRPCFDISK+KSV FPELIFQFKGGAKWALPL+NYFAL
Subjt:  NQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFAL

Query:  VSSSGVACLTVVTHKTE--AGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT
        VSSSGVACLTVVTH+ E   GGGGGPSVILGAFQQQNFYVEYDLVN+RLGFRQQTC+
Subjt:  VSSSGVACLTVVTHKTE--AGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT

A0A1S3B6B5 aspartic proteinase nepenthesin-2 (e value 2.5e-250, identity 92.75%)
Query:  MASPSPLSFFYILLFSSVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDT
        MASPSPLSFFYILLFSS+S+I+N+NPITLPL++ PHL SSDPLQ LTFLASAS+NRAH+IKTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDT
Subjt:  MASPSPLSFFYILLFSSVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDT

Query:  GSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPD
        GSSLVWFPCTSRYLC+ECSFPKIDP GIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFP+
Subjt:  GSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPD

Query:  KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVG
        KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDS HSG+LILDS+GVKT GLTYT FRQNPSVSNHAYKEYYYL+IRKI+VG
Subjt:  KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVG

Query:  NQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFAL
        NQAVKV YKYLVPGPDGNGGSIIDSGSTFTFMDKPV + VAQEFEKQLANRTRATDVE+LTGLRPCFD+SK+KSV+FPELIFQFKGGAKWALPL+NYFAL
Subjt:  NQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFAL

Query:  VSSSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT
        VSSSGVACLTVVTH TE GGGGGPSVILGAFQQQNFYVEYDLVNERLGFR+QTCT
Subjt:  VSSSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT

A0A5A7TRK2 Aspartic proteinase nepenthesin-2 (e value 2.5e-250, identity 92.75%)
Query:  MASPSPLSFFYILLFSSVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDT
        MASPSPLSFFYILLFSS+S+I+N+NPITLPL++ PHL SSDPLQ LTFLASAS+NRAH+IKTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDT
Subjt:  MASPSPLSFFYILLFSSVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDT

Query:  GSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPD
        GSSLVWFPCTSRYLC+ECSFPKIDP GIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFP+
Subjt:  GSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPD

Query:  KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVG
        KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDS HSG+LILDS+GVKT GLTYT FRQNPSVSNHAYKEYYYL+IRKI+VG
Subjt:  KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVG

Query:  NQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFAL
        NQAVKV YKYLVPGPDGNGGSIIDSGSTFTFMDKPV + VAQEFEKQLANRTRATDVE+LTGLRPCFD+SK+KSV+FPELIFQFKGGAKWALPL+NYFAL
Subjt:  NQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFAL

Query:  VSSSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT
        VSSSGVACLTVVTH TE GGGGGPSVILGAFQQQNFYVEYDLVNERLGFR+QTCT
Subjt:  VSSSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT

A0A6J1F3G5 probable aspartyl protease At4g16563 (e value 1.4e-240, identity 90.37%)
Query:  MASPSPLSFFYILLFSSVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTP--KSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIF
        MA P PL FFYILL SSVS+IA+TNPITLPL +FPH  SSDPLQTL FLASASQNRAHQIK P  KSNSVSKSPLSPHSYGAYSTPLSFGTP QTLHLIF
Subjt:  MASPSPLSFFYILLFSSVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTP--KSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIF

Query:  DTGSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDF
        DTGSSLVW PCTS+YLCSECSFPKIDPA IPRF+PKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDF
Subjt:  DTGSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDF

Query:  PDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKIL
        P+KKI NFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPH+GELILDS+G KT GLTYTPFRQNPSVSNHAYKEYYYL+IRKI 
Subjt:  PDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKIL

Query:  VGNQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYF
        VGN+AVKV YKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQE EKQLANRTRATDVESLTGLRPCFDISKDKSV+FPEL F  KGGAKWA PLSNYF
Subjt:  VGNQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYF

Query:  ALVSSSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT
        ALVSSSGVACLTVVTHK  A  GGGPS+ILGAFQQQNFYVEYDLVN+++GFRQQTC+
Subjt:  ALVSSSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT

A0A6J1IXY3 probable aspartyl protease At4g16563 (e value 2.8e-241, identity 90.15%)
Query:  MASPSPLSFFYILLFSSVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTPK--SNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIF
        MA P PL FFYILL SSVS+IA+TNPIT+PL +FPH  SSDPLQTL FLASASQNRAHQIK PK  SNSVSKSPLSPHSYGAYSTPLSFGTP QTLHLIF
Subjt:  MASPSPLSFFYILLFSSVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTPK--SNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIF

Query:  DTGSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDF
        DTGSSLVW PCTS+YLCSECSFPKIDPAGIPRF+PKLSS+SKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDF
Subjt:  DTGSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDF

Query:  PDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKIL
        PDKK  NFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPH+GELILDS+G KT GL+YTPFRQNPSVSNHAYKEYYYL+IRKI 
Subjt:  PDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKIL

Query:  VGNQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYF
        VG +AVKV YKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQE EKQLANRTRATDVESLTGLRPCFDISKDKSV+FPEL FQ KGGAKW LPLSNYF
Subjt:  VGNQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYF

Query:  ALVSSSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT
        ALVSSSGVACLTVVTHKT A  GGGPS+ILGAFQQQNFYVEYDLVN+++GFRQQTC+
Subjt:  ALVSSSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT

SwissProt top hits (e value, % identity, alignment)
O04496 Aspartyl protease AED3 (e value 2.3e-30, identity 28.64%)
Query:  HLPSSDPLQTLTFLASASQNRAHQIKTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKL
        H+ SSD    LT+L+S    +      PK  SV  +  +    G Y      GTP Q + ++ DT +  VW PC+    CS CS           F    
Subjt:  HLPSSDPLQTLTFLASASQNRAHQIKTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKL

Query:  SSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGST-AGLLLSETLDFPDKKIPNFVVGC---SFLSIHQPSGIAGFGRGSESL
        SS+   V C   +C    G      C S +P+   C     ++   YG  S+ +  L+ +TL      IPNF  GC   +  +   P G+ G GRG  SL
Subjt:  SSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGST-AGLLLSETLDFPDKKIPNFVVGC---SFLSIHQPSGIAGFGRGSESL

Query:  PSQ---MGLKKFAYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVGNQAVKVLYKYLVPGPDGNGGSIIDSGST
         SQ   +    F+YCL S  F     SG L L   G +   + YTP  +NP          YY+++  + VG+  V V   YL    +   G+IIDSG+ 
Subjt:  PSQ---MGLKKFAYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVGNQAVKVLYKYLVPGPDGNGGSIIDSGST

Query:  FTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFALVSSSGVACLTVVTHKTEAGGGGGPSVIL
         T   +PV+EA+  EF KQ+      +   +L     CF  S D     P++           LP+ N     S+  + CL++   +  A        ++
Subjt:  FTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFALVSSSGVACLTVVTHKTEAGGGGGPSVIL

Query:  GAFQQQNFYVEYDLVNERLGFRQQTC
           QQQN  + +D+ N R+G   + C
Subjt:  GAFQQQNFYVEYDLVNERLGFRQQTC

Q766C2 Aspartic proteinase nepenthesin-2 (e value 1.8e-35, identity 32.64%)
Query:  GAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSEC-SFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPA
        G Y   ++ GTP  +   I DTGS L+W  C     C++C S P       P F P+ SSS   + C++  C      D+ S         E C      
Subjt:  GAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSEC-SFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPA

Query:  YVVQYGSGSTA-GLLLSETLDFPDKKIPNFVVGC----SFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSTGVKTGGLTY
        Y   YG GST  G + +ET  F    +PN   GC            +G+ G G G  SLPSQ+G+ +F+YC+ S     SP +  L   ++GV  G  + 
Subjt:  YVVQYGSGSTA-GLLLSETLDFPDKKIPNFVVGC----SFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSTGVKTGGLTY

Query:  TPFRQ--NPSVSNHAYKEYYYLSIRKILVGNQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDIS
        T      NP+        YYY++++ I VG   + +         DG GG IIDSG+T T++ +  + AVAQ F  Q+      T  ES +GL  CF   
Subjt:  TPFRQ--NPSVSNHAYKEYYYLSIRKILVGNQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDIS

Query:  KDKS-VDFPELIFQFKGGAKWALPLSNYFALVS-SSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTC
         D S V  PE+  QF GG    L L     L+S + GV CL + +  ++ G       I G  QQQ   V YDL N  + F    C
Subjt:  KDKS-VDFPELIFQFKGGAKWALPLSNYFALVS-SSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTC

Q766C3 Aspartic proteinase nepenthesin-1 (e value 3.7e-33, identity 28.67%)
Query:  HLPSSDPLQTLTFLASASQNRAHQIKTPKSNSVSKSPLSPHSY---GAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPAGIPRFV
        H+ S   L     L  A +  + +++  ++     S +    Y   G Y   LS GTP Q    I DTGS L+W  C     C   S P  +P G     
Subjt:  HLPSSDPLQTLTFLASASQNRAHQIKTPKSNSVSKSPLSPHSY---GAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPAGIPRFV

Query:  PKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGS-TAGLLLSETLDFPDKKIPNFVVGC----SFLSIHQPSGIAGFGRG
           SSS   + C +  C  +  P               C+     Y   YG GS T G + +ETL F    IPN   GC            +G+ G GRG
Subjt:  PKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGS-TAGLLLSETLDFPDKKIPNFVVGC----SFLSIHQPSGIAGFGRG

Query:  SESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVGNQAVKV-LYKYLVPGPDGNGGSIIDSG
          SLPSQ+ + KF+YC+       S  S  L+       T G   T   Q+  +       +YY+++  + VG+  + +    + +   +G GG IIDSG
Subjt:  SESLPSQMGLKKFAYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVGNQAVKV-LYKYLVPGPDGNGGSIIDSG

Query:  STFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKS-VDFPELIFQFKGGAKWALPLSNYFALVSSSGVACLTVVTHKTEAGGGGGPS
        +T T+     +++V QEF  Q+          S +G   CF    D S +  P  +  F GG    LP  NYF +  S+G+ CL +       G      
Subjt:  STFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKS-VDFPELIFQFKGGAKWALPLSNYFALVSSSGVACLTVVTHKTEAGGGGGPS

Query:  VILGAFQQQNFYVEYDLVNERLGFRQQTC
         I G  QQQN  V YD  N  + F    C
Subjt:  VILGAFQQQNFYVEYDLVNERLGFRQQTC

Q940R4 Probable aspartyl protease At4g16563 (e value 4.5e-47, identity 29.9%)
Query:  SVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCS
        SVSS++    + L         SS PL  L   +S S  R  +    +       P+S  S   Y   LS G+    + L  DTGS LVWFPC   + C 
Subjt:  SVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCS

Query:  ECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQ-CRSCN-----PKTENCTQT---CPAYVVQYGSGSTAGLLLSETLDFPDKKIPNFV
         C    + P+        LSSS+  V C +P C+        S  C   N      +T +C  +   CP +   YG GS    L S++L  P   + NF 
Subjt:  ECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQ-CRSCN-----PKTENCTQT---CPAYVVQYGSGSTAGLLLSETLDFPDKKIPNFV

Query:  VGCSFLSIHQPSGIAGFGRGSESLPSQMGL------KKFAYCLASRKFDDS--PHSGELIL--------------------DSTGVKTGGLTYTPFRQNP
         GC+  ++ +P G+AGFGRG  SLP+Q+ +        F+YCL S  FD         LIL                    D    K     +T   +NP
Subjt:  VGCSFLSIHQPSGIAGFGRGSESLPSQMGL------KKFAYCLASRKFDDS--PHSGELIL--------------------DSTGVKTGGLTYTPFRQNP

Query:  SVSNHAYKEYYYLSIRKILVGNQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLAN-RTRATDVESLTGLRPCFDISKDKSVDFP
            H Y  +Y +S++ I +G + +           +G GG ++DSG+TFT +    + +V +EF+ ++     RA  VE  +G+ PC+ +  +++V  P
Subjt:  SVSNHAYKEYYYLSIRKILVGNQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLAN-RTRATDVESLTGLRPCFDISKDKSVDFP

Query:  ELIFQFKGG-AKWALPLSNYFALVSSSG--------VACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTC
         L+  F G  +   LP  NYF      G        + CL ++    E+   GG   ILG +QQQ F V YDL+N R+GF ++ C
Subjt:  ELIFQFKGG-AKWALPLSNYFALVSSSG--------VACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTC

Q9LNJ3 Aspartyl protease family protein 2 (e value 5.5e-37, identity 29.12%)
Query:  PKASSMASPSPLSF--------FYILLFSSVSSIANTNPITLPLHAFPHLPSS-DPLQTLTFLASASQNRAHQIKT-------------PKSNSVSKSPL
        P + S+   SP+SF             F S S   +++ ITL L     L S+  P +  +        R   I T             P+    S S +
Subjt:  PKASSMASPSPLSF--------FYILLFSSVSSIANTNPITLPLHAFPHLPSS-DPLQTLTFLASASQNRAHQIKT-------------PKSNSVSKSPL

Query:  SPHSYGA--YSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENC
        S  S G+  Y T L  GTP + ++++ DTGS +VW  C     C  C + + DP     F P+ S +   + C +P C        +     CN + + C
Subjt:  SPHSYGA--YSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENC

Query:  TQTCPAYVVQYGSGS-TAGLLLSETLDFPDKKIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQMGLK---KFAYCLASRKFDDSPHSGELILDSTG
              Y V YG GS T G   +ETL F   ++    +GC   +       +G+ G G+G  S P Q G +   KF+YCL  R     P S   ++    
Subjt:  TQTCPAYVVQYGSGS-TAGLLLSETLDFPDKKIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQMGLK---KFAYCLASRKFDDSPHSGELILDSTG

Query:  VKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVGNQAVK-VLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGL
          +    +TP   NP +       +YY+ +  I VG   V  V          GNGG IIDSG++ T + +P + A+   F        RA D       
Subjt:  VKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVGNQAVK-VLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGL

Query:  RPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFALVSSSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTC
          CFD+S    V  P ++  F+ GA  +LP +NY   V ++G  C         AG  GG S+I G  QQQ F V YDL + R+GF    C
Subjt:  RPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFALVSSSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTC

Arabidopsis top hits (e value, % identity, alignment)
AT2G03200.1 Eukaryotic aspartyl protease family protein (e value 1.4e-40, identity 32.32%)
Query:  SSMASPSPLSFFYILLFS---SVSS-----IANTNPITLPLHAF----PHLPSSDPLQTLTFLASASQNRAHQI------------KTPKSNSVSKSPLS
        +S +S S L  F+++LFS   SVSS     I  T P  LP   F     H+ S   L  +  +        H++              P   +  K+P  
Subjt:  SSMASPSPLSFFYILLFS---SVSS-----IANTNPITLPLHAF----PHLPSSDPLQTLTFLASASQNRAHQI------------KTPKSNSVSKSPLS

Query:  PHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQT
          S G +   LS G P      I DTGS L+W  C     C+EC          P F P+ SSS   VGC +  C  +          +CN   + C   
Subjt:  PHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQT

Query:  CPAYVVQYGS-GSTAGLLLSETLDFPDK-KIPNFVVGCSFLS----IHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDS--TGV-
           Y+  YG   ST GLL +ET  F D+  I     GC   +      Q SG+ G GRG  SL SQ+   KF+YCL S   +DS  S  L + S  +G+ 
Subjt:  CPAYVVQYGS-GSTAGLLLSETLDFPDK-KIPNFVVGCSFLS----IHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILDS--TGV-

Query:  -KTGGLTYTPFRQNPS-VSNHAYKEYYYLSIRKILVGNQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGL
         KTG        +  S + N     +YYL ++ I VG + + V         DG GG IIDSG+T T++++  F+ + +EF  ++   +   D    TGL
Subjt:  -KTGGLTYTPFRQNPS-VSNHAYKEYYYLSIRKILVGNQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGL

Query:  RPCFDI-SKDKSVDFPELIFQFKGGAKWALPLSNYFALVSSSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTC
          CF +    K++  P++IF FK GA   LP  NY    SS+GV CL +       G   G S I G  QQQNF V +DL  E + F    C
Subjt:  RPCFDI-SKDKSVDFPELIFQFKGGAKWALPLSNYFALVSSSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTC

AT2G42980.1 Eukaryotic aspartyl protease family protein (e value 9.4e-40, identity 31.49%)
Query:  GAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY
        G Y   +  GTP +   LI DTGS L W  C   Y C   +    D        PK S+S K + C +P+C+ I  PD   QC S N       Q+CP Y
Subjt:  GAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAY

Query:  VVQYGSGS-TAGLLLSETLDF---------PDKKIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQMGL---KKFAYCLASRKFDDSPHSGELIL--
           YG  S T G    ET             + K+ N + GC   +       SG+ G GRG  S  SQ+       F+YCL  R   ++  S +LI   
Subjt:  VVQYGSGS-TAGLLLSETLDF---------PDKKIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQMGL---KKFAYCLASRKFDDSPHSGELIL--

Query:  DSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVGNQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEF-EKQLANRTRATDVES
        D   +    L +T F        ++ + +YY+ I+ ILVG +A+ +  +      DG+GG+IIDSG+T ++  +P +E +  +F EK   N     D   
Subjt:  DSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVGNQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEF-EKQLANRTRATDVES

Query:  LTGLRPCFDIS--KDKSVDFPELIFQFKGGAKWALPLSNYFALVSSSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTC
        L    PCF++S  ++ ++  PEL   F  G  W  P  N F  +S   + CL ++      G       I+G +QQQNF++ YD    RLGF    C
Subjt:  LTGLRPCFDIS--KDKSVDFPELIFQFKGGAKWALPLSNYFALVSSSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTC

AT3G52500.1 Eukaryotic aspartyl protease family protein (e value 2.0e-151, identity 57.23%)
Query:  FFYILLFSSVSSIANTNPITLPLHAFPHLPSS--DPLQTLTFLASASQNRAHQIK---------------TPKSNSVSKSPLSPHSYGAYSTPLSFGTPQ
        FF+ L+F SV S      + LPL  F H   S  DP  +L  LA +S  RAH++K               T  S +V KSPLS  SYG YS  LSFGTP 
Subjt:  FFYILLFSSVSSIANTNPITLPLHAFPHLPSS--DPLQTLTFLASASQNRAHQIK---------------TPKSNSVSKSPLSPHSYGAYSTPLSFGTPQ

Query:  QTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLL
        QT+  +FDTGSSLVW PCTSRYLCS C F  +DP  IPRF+PK SSSSK++GCQ+PKC +++GP+V  QCR C+P T NCT  CP Y++QYG GSTAG+L
Subjt:  QTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLL

Query:  LSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILD-----STGVKTGGLTYTPFRQNPSVSNHAY
        ++E LDFPD  +P+FVVGCS +S  QP+GIAGFGRG  SLPSQM LK+F++CL SR+FDD+  + +L LD     ++G KT GLTYTPFR+NP+VSN A+
Subjt:  LSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHSGELILD-----STGVKTGGLTYTPFRQNPSVSNHAY

Query:  KEYYYLSIRKILVGNQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKSVDFPELIFQFKG
         EYYYL++R+I VG + VK+ YKYL PG +G+GGSI+DSGSTFTFM++PVFE VA+EF  Q++N TR  D+E  TGL PCF+IS    V  PELIF+FKG
Subjt:  KEYYYLSIRKILVGNQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCFDISKDKSVDFPELIFQFKG

Query:  GAKWALPLSNYFALVSSSGVACLTVVTHKT-EAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT
        GAK  LPLSNYF  V ++   CLTVV+ KT    GG GP++ILG+FQQQN+ VEYDL N+R GF ++ C+
Subjt:  GAKWALPLSNYFALVSSSGVACLTVVTHKT-EAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT

AT4G16563.1 Eukaryotic aspartyl protease family protein (e value 3.2e-48, identity 29.9%)
Query:  SVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCS
        SVSS++    + L         SS PL  L   +S S  R  +    +       P+S  S   Y   LS G+    + L  DTGS LVWFPC   + C 
Subjt:  SVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCS

Query:  ECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQ-CRSCN-----PKTENCTQT---CPAYVVQYGSGSTAGLLLSETLDFPDKKIPNFV
         C    + P+        LSSS+  V C +P C+        S  C   N      +T +C  +   CP +   YG GS    L S++L  P   + NF 
Subjt:  ECSFPKIDPAGIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQ-CRSCN-----PKTENCTQT---CPAYVVQYGSGSTAGLLLSETLDFPDKKIPNFV

Query:  VGCSFLSIHQPSGIAGFGRGSESLPSQMGL------KKFAYCLASRKFDDS--PHSGELIL--------------------DSTGVKTGGLTYTPFRQNP
         GC+  ++ +P G+AGFGRG  SLP+Q+ +        F+YCL S  FD         LIL                    D    K     +T   +NP
Subjt:  VGCSFLSIHQPSGIAGFGRGSESLPSQMGL------KKFAYCLASRKFDDS--PHSGELIL--------------------DSTGVKTGGLTYTPFRQNP

Query:  SVSNHAYKEYYYLSIRKILVGNQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLAN-RTRATDVESLTGLRPCFDISKDKSVDFP
            H Y  +Y +S++ I +G + +           +G GG ++DSG+TFT +    + +V +EF+ ++     RA  VE  +G+ PC+ +  +++V  P
Subjt:  SVSNHAYKEYYYLSIRKILVGNQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLAN-RTRATDVESLTGLRPCFDISKDKSVDFP

Query:  ELIFQFKGG-AKWALPLSNYFALVSSSG--------VACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTC
         L+  F G  +   LP  NYF      G        + CL ++    E+   GG   ILG +QQQ F V YDL+N R+GF ++ C
Subjt:  ELIFQFKGG-AKWALPLSNYFALVSSSG--------VACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTC

AT5G45120.1 Eukaryotic aspartyl protease family protein (e value 2.1e-47, identity 30.02%)
Query:  ILLFSSVSSIAN-TNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTPKSNSVS--KSPLSP---------HSYGAYSTPLSFGTPQQTLHLIFD
        + LF  ++ + N TN      H  P   SS      +FL       +  + TPKS +    K PLS               Y   L+ GTP Q + +  D
Subjt:  ILLFSSVSSIAN-TNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTPKSNSVS--KSPLSP---------HSYGAYSTPLSFGTPQQTLHLIFD

Query:  TGSSLVWFPCTS-RYLCSECSFPKIDPAGIPR-FVPKLSSSSKLVGCQNPKCAWI------FGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGS-TAGL
        TGS L W PC +  + C EC   K +    P  F P  SS+S    C +  C  I      F P   + C         C + CP++   YG G   +G+
Subjt:  TGSSLVWFPCTS-RYLCSECSFPKIDPAGIPR-FVPKLSSSSKLVGCQNPKCAWI------FGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGS-TAGL

Query:  LLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGL--KKFAYCLASRKFDDSPH-SGELILDSTGVK---TGGLTYTPFRQNPSVSNH
        L  + L    + +P F  GC   +  +P GIAGFGRG  SLPSQ+G   K F++C    KF ++P+ S  LIL ++ +    T  L +TP    P     
Subjt:  LLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGL--KKFAYCLASRKFDDSPH-SGELILDSTGVK---TGGLTYTPFRQNPSVSNH

Query:  AYKEYYYLSIRKILVGNQAVKVLYKYLVPGPD--GNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCF----------DISKD
         Y   YY+ +  I +G           +   D  GNGG ++DSG+T+T + +P +  +    +  +    RAT+ ES TG   C+           +  D
Subjt:  AYKEYYYLSIRKILVGNQAVKVLYKYLVPGPD--GNGGSIIDSGSTFTFMDKPVFEAVAQEFEKQLANRTRATDVESLTGLRPCF----------DISKD

Query:  KSVDFPELIFQFKGGAKWALPLSN-YFALVSSSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTC
          + FP + F F   A   LP  N ++A+ + S  + +  +  +    G  GP+ + G+FQQQN  V YDL  ER+GF+   C
Subjt:  KSVDFPELIFQFKGGAKWALPLSN-YFALVSSSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTC


Sequences
CDS sequence
ATGTTTGAAAGACTAGAAGAGCGTCCGACTAAGCTGAGCGAAGAAGACCATACCACGTTCCTCACTCCTCAGTTCCATAGTTCCAACGGCCAAAAAAGAGCCCTTCCCTG
CTTCCAGCTCACCGCTTTGTCCCTTCTTTGCTCTTCCTCCCCATTTATACTTCAAACCCCCCTTTCTCAACCTTCTCATCAACATAACTCTCTTTCCCTTCATACATTTC
CAATCCATAACTATATCAGGATTACATTACAAGCAGGAAGAAGCCAACCCAAAGCTTCTTCCATGGCGTCTCCTTCCCCTCTCTCTTTCTTCTACATTCTCCTCTTCTCC
TCTGTTTCCTCCATTGCCAACACCAACCCAATCACCCTCCCTCTCCACGCCTTCCCCCACCTTCCTTCTTCAGATCCACTCCAAACTCTCACTTTCCTCGCCTCTGCTTC
CCAAAACAGAGCTCATCAAATCAAAACCCCCAAATCCAACTCTGTTTCCAAGTCCCCTCTCTCCCCTCACAGCTATGGAGCTTACTCAACTCCACTCAGCTTTGGTACTC
CACAACAGACTCTGCATTTGATCTTCGATACAGGTAGTAGCCTCGTTTGGTTCCCTTGTACTTCCAGATATCTCTGTTCCGAATGTTCCTTCCCCAAAATAGATCCCGCC
GGAATCCCCAGATTTGTCCCCAAATTGTCTTCCTCTTCCAAGCTTGTCGGTTGCCAGAATCCCAAATGTGCTTGGATTTTTGGCCCCGATGTCAAATCTCAATGCCGGAG
TTGTAACCCCAAAACAGAGAACTGTACCCAAACTTGCCCTGCTTACGTCGTTCAGTACGGTTCCGGCTCCACGGCTGGGCTTTTGCTATCGGAAACGCTTGATTTTCCCG
ATAAGAAAATCCCCAATTTTGTTGTTGGCTGTTCGTTTTTGTCGATCCATCAACCCTCTGGAATCGCCGGATTCGGCCGAGGATCTGAATCGCTCCCCTCGCAAATGGGT
CTCAAGAAATTCGCGTACTGCCTTGCGTCTCGGAAATTCGACGACTCGCCGCATTCTGGTGAGCTGATTCTAGATTCCACCGGCGTGAAGACCGGCGGTCTCACCTACAC
GCCGTTCCGGCAGAACCCCTCTGTTTCTAACCACGCTTATAAAGAATACTATTACTTAAGCATACGCAAAATCCTCGTCGGAAACCAGGCCGTGAAGGTGCTGTACAAGT
ATCTGGTGCCGGGCCCCGACGGCAACGGTGGATCTATCATCGATTCCGGCTCCACCTTCACGTTTATGGACAAACCAGTCTTCGAGGCAGTGGCGCAAGAGTTCGAGAAG
CAGTTGGCGAACCGGACGAGAGCCACCGATGTGGAATCTCTCACCGGATTACGGCCGTGTTTCGACATTTCGAAGGACAAATCGGTGGATTTTCCGGAGCTGATTTTCCA
GTTTAAAGGCGGAGCGAAATGGGCTCTGCCGTTGAGTAACTATTTCGCTTTAGTCAGTAGCTCCGGCGTGGCGTGTTTGACGGTTGTGACGCATAAGACGGAGGCGGGCG
GCGGCGGTGGGCCGTCTGTGATTTTGGGGGCTTTCCAGCAGCAGAATTTCTATGTGGAGTATGATTTGGTGAATGAAAGATTGGGATTTCGGCAACAGACTTGCACTTAG
mRNA sequence
ATGTTTGAAAGACTAGAAGAGCGTCCGACTAAGCTGAGCGAAGAAGACCATACCACGTTCCTCACTCCTCAGTTCCATAGTTCCAACGGCCAAAAAAGAGCCCTTCCCTG
CTTCCAGCTCACCGCTTTGTCCCTTCTTTGCTCTTCCTCCCCATTTATACTTCAAACCCCCCTTTCTCAACCTTCTCATCAACATAACTCTCTTTCCCTTCATACATTTC
CAATCCATAACTATATCAGGATTACATTACAAGCAGGAAGAAGCCAACCCAAAGCTTCTTCCATGGCGTCTCCTTCCCCTCTCTCTTTCTTCTACATTCTCCTCTTCTCC
TCTGTTTCCTCCATTGCCAACACCAACCCAATCACCCTCCCTCTCCACGCCTTCCCCCACCTTCCTTCTTCAGATCCACTCCAAACTCTCACTTTCCTCGCCTCTGCTTC
CCAAAACAGAGCTCATCAAATCAAAACCCCCAAATCCAACTCTGTTTCCAAGTCCCCTCTCTCCCCTCACAGCTATGGAGCTTACTCAACTCCACTCAGCTTTGGTACTC
CACAACAGACTCTGCATTTGATCTTCGATACAGGTAGTAGCCTCGTTTGGTTCCCTTGTACTTCCAGATATCTCTGTTCCGAATGTTCCTTCCCCAAAATAGATCCCGCC
GGAATCCCCAGATTTGTCCCCAAATTGTCTTCCTCTTCCAAGCTTGTCGGTTGCCAGAATCCCAAATGTGCTTGGATTTTTGGCCCCGATGTCAAATCTCAATGCCGGAG
TTGTAACCCCAAAACAGAGAACTGTACCCAAACTTGCCCTGCTTACGTCGTTCAGTACGGTTCCGGCTCCACGGCTGGGCTTTTGCTATCGGAAACGCTTGATTTTCCCG
ATAAGAAAATCCCCAATTTTGTTGTTGGCTGTTCGTTTTTGTCGATCCATCAACCCTCTGGAATCGCCGGATTCGGCCGAGGATCTGAATCGCTCCCCTCGCAAATGGGT
CTCAAGAAATTCGCGTACTGCCTTGCGTCTCGGAAATTCGACGACTCGCCGCATTCTGGTGAGCTGATTCTAGATTCCACCGGCGTGAAGACCGGCGGTCTCACCTACAC
GCCGTTCCGGCAGAACCCCTCTGTTTCTAACCACGCTTATAAAGAATACTATTACTTAAGCATACGCAAAATCCTCGTCGGAAACCAGGCCGTGAAGGTGCTGTACAAGT
ATCTGGTGCCGGGCCCCGACGGCAACGGTGGATCTATCATCGATTCCGGCTCCACCTTCACGTTTATGGACAAACCAGTCTTCGAGGCAGTGGCGCAAGAGTTCGAGAAG
CAGTTGGCGAACCGGACGAGAGCCACCGATGTGGAATCTCTCACCGGATTACGGCCGTGTTTCGACATTTCGAAGGACAAATCGGTGGATTTTCCGGAGCTGATTTTCCA
GTTTAAAGGCGGAGCGAAATGGGCTCTGCCGTTGAGTAACTATTTCGCTTTAGTCAGTAGCTCCGGCGTGGCGTGTTTGACGGTTGTGACGCATAAGACGGAGGCGGGCG
GCGGCGGTGGGCCGTCTGTGATTTTGGGGGCTTTCCAGCAGCAGAATTTCTATGTGGAGTATGATTTGGTGAATGAAAGATTGGGATTTCGGCAACAGACTTGCACTTAG
Protein sequence
MFERLEERPTKLSEEDHTTFLTPQFHSSNGQKRALPCFQLTALSLLCSSSPFILQTPLSQPSHQHNSLSLHTFPIHNYIRITLQAGRSQPKASSMASPSPLSFFYILLFS
SVSSIANTNPITLPLHAFPHLPSSDPLQTLTFLASASQNRAHQIKTPKSNSVSKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPA
GIPRFVPKLSSSSKLVGCQNPKCAWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSETLDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMG
LKKFAYCLASRKFDDSPHSGELILDSTGVKTGGLTYTPFRQNPSVSNHAYKEYYYLSIRKILVGNQAVKVLYKYLVPGPDGNGGSIIDSGSTFTFMDKPVFEAVAQEFEK
QLANRTRATDVESLTGLRPCFDISKDKSVDFPELIFQFKGGAKWALPLSNYFALVSSSGVACLTVVTHKTEAGGGGGPSVILGAFQQQNFYVEYDLVNERLGFRQQTCT
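
The protein sequence above appears to be the conceptual translation of the CDS sequence. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene, the check below translates the first codons of the CDS and compares them with the start of the listed protein. The function name `translate` and the table construction are illustrative and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered by T, C, A, G.
# Assumption: this table is appropriate for the gene shown on this page.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nucleotides of the CDS sequence listed above.
cds_start = "ATGTTTGAAAGACTAGAAGAGCGTCCGACTAAG"
print(translate(cds_start))
```

To check the full record, join the CDS lines into one string, pass it to `translate`, and compare the result with the joined protein lines.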