; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0003727 (gene) of Chayote v1 genome

Gene IDSed0003727
OrganismSechium edule (Chayote v1)
DescriptionEukaryotic aspartyl protease family protein
Genome locationLG04:31781604..31783693
RNA-Seq ExpressionSed0003727
SyntenySed0003727
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004147205.1 probable aspartyl protease At4g16563 [Cucumis sativus]1.1e-23585.5Show/hide
Query:  SPVFAFLCFLLSSSSVFSSQILLLPLSHSLSSNLNN-NNTHNLLKSAADRSSARHRRRRRTHLSLPLSPGGDYTLSFNLGSHAHTISLYMDTGSDFVWFP
        SPVF FL   L SS VFSSQI LLPLSHSLSS++++ NNTHNLLKS A RSSAR  R R  HLSLPLSPGGDYTLSFNLGS +H ISLYMDTGSD VWFP
Subjt:  SPVFAFLCFLLSSSSVFSSQILLLPLSHSLSSNLNN-NNTHNLLKSAADRSSARHRRRRRTHLSLPLSPGGDYTLSFNLGSHAHTISLYMDTGSDFVWFP

Query:  CSPFECILCEGKPQIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCPPLYYAYGDGSLIAQLYRDSLGLPAPAPSPP
        CSPFECILCEGKP+IQ P P+I   KSVSCSA ACSAAHG SLSASHLCAISRCPLESIE+SECSSFSCPP YYAYGDGSL+A+LYRDSL LP PAPSPP
Subjt:  CSPFECILCEGKPQIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCPPLYYAYGDGSLIAQLYRDSLGLPAPAPSPP

Query:  IRVRNFTFGCAHTALGEPVGVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSDGQSEFVYTSLLDNPKHPYFYSVGLAGI
        I VRNFTFGCAHT LGEPVGVAGFGRG LSMPSQLATFSPQLGNRFSYCLVSHSF  DRVRRPSPLILGRY  G++EF+YTSLL+NPKHPYFYSVGLAGI
Subjt:  IRVRNFTFGCAHTALGEPVGVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSDGQSEFVYTSLLDNPKHPYFYSVGLAGI

Query:  SVGTVTIPAPEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYYDDSIQVPRVVLHFVGERSSVVLPRKNY
        SVG + IPAPEFL +VD+GGSGGVVVDSGTTFTMLPAGLYESVVA+FENRTG+VA RA RIEENTGLSPCYYY++S+ VPRVVLHFVGE+S+VVLPRKNY
Subjt:  SVGTVTIPAPEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYYDDSIQVPRVVLHFVGERSSVVLPRKNY

Query:  FYEFFDGGDG-VGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS
        FYEF DGGDG VG+KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWD+LNRS
Subjt:  FYEFFDGGDG-VGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS

XP_008448851.1 PREDICTED: aspartic proteinase nepenthesin-1 [Cucumis melo]9.1e-23584.94Show/hide
Query:  SPVFAFLCFLLSSSSVFSSQILLLPLSHSLSSNLNN-NNTHNLLKSAADRSSARHRRRRRTHLSLPLSPGGDYTLSFNLGSHAHTISLYMDTGSDFVWFP
        SPVF FL   L SS VFSSQI LLPLSHSLSS++++ N+THNLLKS A RSSAR  R R  HLSLPLSPGGDYTLSFNLGS +H ISLYMDTGSD VWFP
Subjt:  SPVFAFLCFLLSSSSVFSSQILLLPLSHSLSSNLNN-NNTHNLLKSAADRSSARHRRRRRTHLSLPLSPGGDYTLSFNLGSHAHTISLYMDTGSDFVWFP

Query:  CSPFECILCEGKPQIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCPPLYYAYGDGSLIAQLYRDSLGLPAPAPSPP
        CSPFECILCEGKP+IQ P P+I   KSVSCSA ACSAAHG SLSASHLCAISRCPLESIE+SECSSFSCPP YYAYGDGSL+A+LYRDSL LP PAPSPP
Subjt:  CSPFECILCEGKPQIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCPPLYYAYGDGSLIAQLYRDSLGLPAPAPSPP

Query:  IRVRNFTFGCAHTALGEPVGVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSDGQSEFVYTSLLDNPKHPYFYSVGLAGI
        I VRNFTFGCAHT LGEPVGVAGFGRG LSMPSQLATFSPQLGNRFSYCLVSHSF  DRVRRPSPLILGRY  G++EF+YTSLL+NPKHPYFYSVGLAGI
Subjt:  IRVRNFTFGCAHTALGEPVGVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSDGQSEFVYTSLLDNPKHPYFYSVGLAGI

Query:  SVGTVTIPAPEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYYDDSIQVPRVVLHFVGERSSVVLPRKNY
        SVG V IPAPEFL++VD+ GSGGVVVDSGTTFTMLP+GLYESVVA+FENRTG+VA RA RIEENTGLSPCYYY++S+ VPRVVLHFVGE+SSVVLPRKNY
Subjt:  SVGTVTIPAPEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYYDDSIQVPRVVLHFVGERSSVVLPRKNY

Query:  FYEFFDGGDG---VGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS
        FYEF DGGDG   VG+KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWD+LNRS
Subjt:  FYEFFDGGDG---VGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS

XP_023007805.1 probable aspartyl protease At4g16563 [Cucurbita maxima]7.7e-23485Show/hide
Query:  MASPVFAFLCFLLSSSSVFSSQILLLPLSHSLSSNLNNNNTHNLLKSAADRSSARHRRRRRT----HLSLPLSPGGDYTLSFNLGSHAHTISLYMDTGSD
        MASPVF FL   L  S VFSSQILLLPLS+SLSS+ + NNTHNLLKS A RSSAR   RRRT    HLSLPLSPGGDYTLSFNLGS +  ISLYMDTGSD
Subjt:  MASPVFAFLCFLLSSSSVFSSQILLLPLSHSLSSNLNNNNTHNLLKSAADRSSARHRRRRRT----HLSLPLSPGGDYTLSFNLGSHAHTISLYMDTGSD

Query:  FVWFPCSPFECILCEGKPQIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCPPLYYAYGDGSLIAQLYRDSLGLPAP
         VWFPCSPFECILCEGKP+IQ P P+I  QKSVSCSA ACSAAHG SLSASHLCAISRCPLESIEVSECSSFSCPP YYAYGDGSLI +LYRDSL LPAP
Subjt:  FVWFPCSPFECILCEGKPQIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCPPLYYAYGDGSLIAQLYRDSLGLPAP

Query:  APSPPIRVRNFTFGCAHTALGEPVGVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSDGQSEFVYTSLLDNPKHPYFYSV
        APSP I VRNFTFGCAH+ALGEP+GVAGFGRG LSMP QLATFSPQLGNRFSYCLVSHSF  DRVRRPSPLILGRY   ++EF+YTS+L+NPKHPYFYSV
Subjt:  APSPPIRVRNFTFGCAHTALGEPVGVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSDGQSEFVYTSLLDNPKHPYFYSV

Query:  GLAGISVGTVTIPAPEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYYDDSIQVPRVVLHFVGERSSVVL
        GLAGISVG+V IPAPEFLK+VD+GGSGGVVVDSGTTFTMLPAGLY SVVAQFENRTGRVA+RAS+IEENTGLSPCYYY+ S++VPRVVLHFVGE+SSV+L
Subjt:  GLAGISVGTVTIPAPEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYYDDSIQVPRVVLHFVGERSSVVL

Query:  PRKNYFYEFFDGGDGVGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS
        PRKNYFYEF DGGDGVG+K KVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEV YDLE NRVGFARRQCSTLWDSLNRS
Subjt:  PRKNYFYEFFDGGDGVGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS

XP_023553227.1 probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo]2.0e-23485.48Show/hide
Query:  MASPVFAFLCFLLSSSSVFSSQILLLPLSHSLSSNLNNNNTHNLLKSAADRSSARHRRRRRT----HLSLPLSPGGDYTLSFNLGSHAHTISLYMDTGSD
        MASPVF FL   L SS VFSSQ+LLLPLS+SLSS+ + NNTHNLLKS A RSSAR   RRRT    HLSLPLSPGGDYTLSFNLGS +  ISLYMDTGSD
Subjt:  MASPVFAFLCFLLSSSSVFSSQILLLPLSHSLSSNLNNNNTHNLLKSAADRSSARHRRRRRT----HLSLPLSPGGDYTLSFNLGSHAHTISLYMDTGSD

Query:  FVWFPCSPFECILCEGKPQIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCPPLYYAYGDGSLIAQLYRDSLGL--P
         VWFPCSPFECILCEGKP+IQ P P+I  +KSVSCSA ACSAAHG SLSASHLCAISRCPLESIEVSECSSFSCPP YYAYGDGSLI +LYRDSL L  P
Subjt:  FVWFPCSPFECILCEGKPQIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCPPLYYAYGDGSLIAQLYRDSLGL--P

Query:  APAPSPPIRVRNFTFGCAHTALGEPVGVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSDGQSEFVYTSLLDNPKHPYFY
        APAPSP I VRNFTFGCAH+ALGEP+GVAGFGRG LSMPSQLATFSPQLGNRFSYCLVSHSF  DRVRRPSPLILGRY   ++EF+YTSLL+NPKHPYFY
Subjt:  APAPSPPIRVRNFTFGCAHTALGEPVGVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSDGQSEFVYTSLLDNPKHPYFY

Query:  SVGLAGISVGTVTIPAPEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYYDDSIQVPRVVLHFVGERSSV
        SVGLAGISVG+V IPAPEFLKRVD+GGSGGVVVDSGTTFTMLPAGLY SVVAQFENRTGRVA+RASRIEENTGLSPCYYY++S++VPRVVLHFVGE+SSV
Subjt:  SVGLAGISVGTVTIPAPEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYYDDSIQVPRVVLHFVGERSSV

Query:  VLPRKNYFYEFFDGGDGVGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS
        VLPRKNYFYEF DGGDGV +KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEV YDLE NRVGFARRQCSTLWDSLNRS
Subjt:  VLPRKNYFYEFFDGGDGVGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS

XP_038905814.1 probable aspartyl protease At4g16563 [Benincasa hispida]2.4e-23584.82Show/hide
Query:  MASPVFAFLCFLLSSSSVFSSQILLLPLSHSLSSNLNN-NNTHNLLKSAADRSSARHRRRRRT----HLSLPLSPGGDYTLSFNLGSHAHTISLYMDTGS
        MAS VF  L   L SS VFSSQ+LLLPLSHSLSS++++ NNTHNLLKS A RSSAR   RRRT    HLSLPLSPGGDYTLSFNLGS +H ISLYMDTGS
Subjt:  MASPVFAFLCFLLSSSSVFSSQILLLPLSHSLSSNLNN-NNTHNLLKSAADRSSARHRRRRRT----HLSLPLSPGGDYTLSFNLGSHAHTISLYMDTGS

Query:  DFVWFPCSPFECILCEGKPQIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCPPLYYAYGDGSLIAQLYRDSLGLPA
        D VWFPCSPFECILCEGKP++Q P P+I   KSVSCSAPACSAAHG SLSASHLCAIS+CPLESIE+SECSSFSCPP YYAYGDGSLIA+LYRDSL LPA
Subjt:  DFVWFPCSPFECILCEGKPQIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCPPLYYAYGDGSLIAQLYRDSLGLPA

Query:  PAPSPPIRVRNFTFGCAHTALGEPVGVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSDGQSEFVYTSLLDNPKHPYFYS
        PAPSP I VRNFTFGCAHTALGEPVGVAGFGRG LSMPSQLATFSPQLGNRFSYCLVSHSF  +RVRRPSPLILGRY  G++EF+YTSLL+NPKHPYFYS
Subjt:  PAPSPPIRVRNFTFGCAHTALGEPVGVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSDGQSEFVYTSLLDNPKHPYFYS

Query:  VGLAGISVGTVTIPAPEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYYDDSIQVPRVVLHFVGERSSVV
        VGL GISVG + IPAPEFLK+VD+GGSGGVVVDSGTTFTMLPAGLY+SVVA FENRTGRVA RA RIEENTGLSPCYYY++S++VPRVVLHFVGE+SSV+
Subjt:  VGLAGISVGTVTIPAPEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYYDDSIQVPRVVLHFVGERSSVV

Query:  LPRKNYFYEFFDGGDGVGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS
        LP+KNYFYEF DGGDGVGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEV YDL KNRVGFARRQCSTLWDSLNRS
Subjt:  LPRKNYFYEFFDGGDGVGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS

TrEMBL top hitse value%identityAlignment
A0A0A0L5I7 Pepsin A5.2e-23685.5Show/hide
Query:  SPVFAFLCFLLSSSSVFSSQILLLPLSHSLSSNLNN-NNTHNLLKSAADRSSARHRRRRRTHLSLPLSPGGDYTLSFNLGSHAHTISLYMDTGSDFVWFP
        SPVF FL   L SS VFSSQI LLPLSHSLSS++++ NNTHNLLKS A RSSAR  R R  HLSLPLSPGGDYTLSFNLGS +H ISLYMDTGSD VWFP
Subjt:  SPVFAFLCFLLSSSSVFSSQILLLPLSHSLSSNLNN-NNTHNLLKSAADRSSARHRRRRRTHLSLPLSPGGDYTLSFNLGSHAHTISLYMDTGSDFVWFP

Query:  CSPFECILCEGKPQIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCPPLYYAYGDGSLIAQLYRDSLGLPAPAPSPP
        CSPFECILCEGKP+IQ P P+I   KSVSCSA ACSAAHG SLSASHLCAISRCPLESIE+SECSSFSCPP YYAYGDGSL+A+LYRDSL LP PAPSPP
Subjt:  CSPFECILCEGKPQIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCPPLYYAYGDGSLIAQLYRDSLGLPAPAPSPP

Query:  IRVRNFTFGCAHTALGEPVGVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSDGQSEFVYTSLLDNPKHPYFYSVGLAGI
        I VRNFTFGCAHT LGEPVGVAGFGRG LSMPSQLATFSPQLGNRFSYCLVSHSF  DRVRRPSPLILGRY  G++EF+YTSLL+NPKHPYFYSVGLAGI
Subjt:  IRVRNFTFGCAHTALGEPVGVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSDGQSEFVYTSLLDNPKHPYFYSVGLAGI

Query:  SVGTVTIPAPEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYYDDSIQVPRVVLHFVGERSSVVLPRKNY
        SVG + IPAPEFL +VD+GGSGGVVVDSGTTFTMLPAGLYESVVA+FENRTG+VA RA RIEENTGLSPCYYY++S+ VPRVVLHFVGE+S+VVLPRKNY
Subjt:  SVGTVTIPAPEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYYDDSIQVPRVVLHFVGERSSVVLPRKNY

Query:  FYEFFDGGDG-VGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS
        FYEF DGGDG VG+KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWD+LNRS
Subjt:  FYEFFDGGDG-VGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS

A0A1S3BK28 aspartic proteinase nepenthesin-14.4e-23584.94Show/hide
Query:  SPVFAFLCFLLSSSSVFSSQILLLPLSHSLSSNLNN-NNTHNLLKSAADRSSARHRRRRRTHLSLPLSPGGDYTLSFNLGSHAHTISLYMDTGSDFVWFP
        SPVF FL   L SS VFSSQI LLPLSHSLSS++++ N+THNLLKS A RSSAR  R R  HLSLPLSPGGDYTLSFNLGS +H ISLYMDTGSD VWFP
Subjt:  SPVFAFLCFLLSSSSVFSSQILLLPLSHSLSSNLNN-NNTHNLLKSAADRSSARHRRRRRTHLSLPLSPGGDYTLSFNLGSHAHTISLYMDTGSDFVWFP

Query:  CSPFECILCEGKPQIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCPPLYYAYGDGSLIAQLYRDSLGLPAPAPSPP
        CSPFECILCEGKP+IQ P P+I   KSVSCSA ACSAAHG SLSASHLCAISRCPLESIE+SECSSFSCPP YYAYGDGSL+A+LYRDSL LP PAPSPP
Subjt:  CSPFECILCEGKPQIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCPPLYYAYGDGSLIAQLYRDSLGLPAPAPSPP

Query:  IRVRNFTFGCAHTALGEPVGVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSDGQSEFVYTSLLDNPKHPYFYSVGLAGI
        I VRNFTFGCAHT LGEPVGVAGFGRG LSMPSQLATFSPQLGNRFSYCLVSHSF  DRVRRPSPLILGRY  G++EF+YTSLL+NPKHPYFYSVGLAGI
Subjt:  IRVRNFTFGCAHTALGEPVGVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSDGQSEFVYTSLLDNPKHPYFYSVGLAGI

Query:  SVGTVTIPAPEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYYDDSIQVPRVVLHFVGERSSVVLPRKNY
        SVG V IPAPEFL++VD+ GSGGVVVDSGTTFTMLP+GLYESVVA+FENRTG+VA RA RIEENTGLSPCYYY++S+ VPRVVLHFVGE+SSVVLPRKNY
Subjt:  SVGTVTIPAPEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYYDDSIQVPRVVLHFVGERSSVVLPRKNY

Query:  FYEFFDGGDG---VGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS
        FYEF DGGDG   VG+KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWD+LNRS
Subjt:  FYEFFDGGDG---VGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS

A0A5D3CP11 Aspartic proteinase nepenthesin-18.3e-23484.79Show/hide
Query:  SPVFAFLCFLLSSSSVFSSQILLLPLSHSLSSNLNN-NNTHNLLKSAADRSSARHRRRRRTHLSLPLSPGGDYTLSFNLGSHAHTISLYMDTGSDFVWFP
        SPVF FL   L SS VFSSQI LLPLSHSLSS++++ N+THNLLKS A RSSAR  R R  HLSLPLSPGGDYTLSFNLGS +H ISLYMDTGSD VWFP
Subjt:  SPVFAFLCFLLSSSSVFSSQILLLPLSHSLSSNLNN-NNTHNLLKSAADRSSARHRRRRRTHLSLPLSPGGDYTLSFNLGSHAHTISLYMDTGSDFVWFP

Query:  CSPFECILCEGKPQIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCPPLYYAYGDGSLIAQLYRDSLGL--PAPAPS
        CSPFECILCEGKP+IQ P P+I   KSVSCSA ACSAAHG SLSASHLCAISRCPLESIE+SECSSFSCPP YYAYGDGSL+A+LYRDSL L  PAPAPS
Subjt:  CSPFECILCEGKPQIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCPPLYYAYGDGSLIAQLYRDSLGL--PAPAPS

Query:  PPIRVRNFTFGCAHTALGEPVGVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSDGQSEFVYTSLLDNPKHPYFYSVGLA
        PPI VRNFTFGCAHT LGEPVGVAGFGRG LSMPSQLATFSPQLGNRFSYCLVSHSF  DRVRRPSPLILGRY  G++EF+YTSLL+NPKHPYFYSVGLA
Subjt:  PPIRVRNFTFGCAHTALGEPVGVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSDGQSEFVYTSLLDNPKHPYFYSVGLA

Query:  GISVGTVTIPAPEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYYDDSIQVPRVVLHFVGERSSVVLPRK
        GISVG V IPAPEFL++VD+ GSGGVVVDSGTTFTMLP+GLYESVVA+FENRTG+VA RA RIEENTGLSPCYYY +S+ VPRVVLHFVGE+SSVVLPRK
Subjt:  GISVGTVTIPAPEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYYDDSIQVPRVVLHFVGERSSVVLPRK

Query:  NYFYEFFDGGDG---VGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS
        NYFYEF DGGDG   VG+KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWD+LNRS
Subjt:  NYFYEFFDGGDG---VGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS

A0A6J1EC44 probable aspartyl protease At4g165634.9e-23485.27Show/hide
Query:  MASPVFAFLCFLLSSSSVFSSQILLLPLSHSLSSNLNNNNTHNLLKSAADRSSARHRRRRRT----HLSLPLSPGGDYTLSFNLGSHAHTISLYMDTGSD
        MASPVF FL   L SS VFSSQ+LLLPLS+SLSS+ + NNTHNLLKS A RSSAR   RRRT    HLSLPLSPGGDYTLSFNLGS +  ISLYMDTGSD
Subjt:  MASPVFAFLCFLLSSSSVFSSQILLLPLSHSLSSNLNNNNTHNLLKSAADRSSARHRRRRRT----HLSLPLSPGGDYTLSFNLGSHAHTISLYMDTGSD

Query:  FVWFPCSPFECILCEGKPQIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCPPLYYAYGDGSLIAQLYRDSLGL--P
         VWFPCSPFECILCEGKP+IQ P P+I  QKSVSCSA ACSAAHG SLSASHLCAISRCPLESIEVSECSSFSCPP YYAYGDGSLI +LYRDSL L  P
Subjt:  FVWFPCSPFECILCEGKPQIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCPPLYYAYGDGSLIAQLYRDSLGL--P

Query:  APAPSPPIRVRNFTFGCAHTALGEPVGVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSDGQSEFVYTSLLDNPKHPYFY
        APAPSP I VRNFTFGCAH+ALGEP+GVAGFGRG LSMPSQLATFSPQLGNRFSYCLVSHSF  DRVRRPSPLILGRY   ++EF+YTS+L+NPKHPYFY
Subjt:  APAPSPPIRVRNFTFGCAHTALGEPVGVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSDGQSEFVYTSLLDNPKHPYFY

Query:  SVGLAGISVGTVTIPAPEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYYDDSIQVPRVVLHFVGERSSV
        SVGLAGISVG+V IPAPEFLKRVD+GGSGGVVVDSGTTFTMLPAGLY SVVAQFENRTGRVA+RASRIEENTGLSPCY Y+ S++VPRVVLHFVGE+SSV
Subjt:  SVGLAGISVGTVTIPAPEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYYDDSIQVPRVVLHFVGERSSV

Query:  VLPRKNYFYEFFDGGDGVGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS
         LPRKNYFYEF DGGDGVG+KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEV YDLE NRVGFARRQCSTLWDSLNRS
Subjt:  VLPRKNYFYEFFDGGDGVGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS

A0A6J1L3Z9 probable aspartyl protease At4g165633.7e-23485Show/hide
Query:  MASPVFAFLCFLLSSSSVFSSQILLLPLSHSLSSNLNNNNTHNLLKSAADRSSARHRRRRRT----HLSLPLSPGGDYTLSFNLGSHAHTISLYMDTGSD
        MASPVF FL   L  S VFSSQILLLPLS+SLSS+ + NNTHNLLKS A RSSAR   RRRT    HLSLPLSPGGDYTLSFNLGS +  ISLYMDTGSD
Subjt:  MASPVFAFLCFLLSSSSVFSSQILLLPLSHSLSSNLNNNNTHNLLKSAADRSSARHRRRRRT----HLSLPLSPGGDYTLSFNLGSHAHTISLYMDTGSD

Query:  FVWFPCSPFECILCEGKPQIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCPPLYYAYGDGSLIAQLYRDSLGLPAP
         VWFPCSPFECILCEGKP+IQ P P+I  QKSVSCSA ACSAAHG SLSASHLCAISRCPLESIEVSECSSFSCPP YYAYGDGSLI +LYRDSL LPAP
Subjt:  FVWFPCSPFECILCEGKPQIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCPPLYYAYGDGSLIAQLYRDSLGLPAP

Query:  APSPPIRVRNFTFGCAHTALGEPVGVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSDGQSEFVYTSLLDNPKHPYFYSV
        APSP I VRNFTFGCAH+ALGEP+GVAGFGRG LSMP QLATFSPQLGNRFSYCLVSHSF  DRVRRPSPLILGRY   ++EF+YTS+L+NPKHPYFYSV
Subjt:  APSPPIRVRNFTFGCAHTALGEPVGVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSDGQSEFVYTSLLDNPKHPYFYSV

Query:  GLAGISVGTVTIPAPEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYYDDSIQVPRVVLHFVGERSSVVL
        GLAGISVG+V IPAPEFLK+VD+GGSGGVVVDSGTTFTMLPAGLY SVVAQFENRTGRVA+RAS+IEENTGLSPCYYY+ S++VPRVVLHFVGE+SSV+L
Subjt:  GLAGISVGTVTIPAPEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYYDDSIQVPRVVLHFVGERSSVVL

Query:  PRKNYFYEFFDGGDGVGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS
        PRKNYFYEF DGGDGVG+K KVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEV YDLE NRVGFARRQCSTLWDSLNRS
Subjt:  PRKNYFYEFFDGGDGVGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS

SwissProt top hitse value%identityAlignment
Q766C2 Aspartic proteinase nepenthesin-22.7e-3228.51Show/hide
Query:  NNNTHNLLKSAADRSSARHRR-----RRRTHLSLPLSPG-GDYTLSFNLGSHAHTISLYMDTGSDFVWFPCSPFECILCEGKPQIQPPFPQIPKQKSVSC
        N   + L+K A  R   R R      +  + +  P+  G G+Y ++  +G+   + S  MDTGSD +W  C P  C  C        P P    Q S S 
Subjt:  NNNTHNLLKSAADRSSARHRR-----RRRTHLSLPLSPG-GDYTLSFNLGSHAHTISLYMDTGSDFVWFPCSPFECILCEGKPQIQPPFPQIPKQKSVSC

Query:  SAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCPPLYYAYGDGSLIAQLYRDSLGLPAPAPSPPIRVRNFTFGCAHT----ALGEPVGVAGFGR
        S   C + +   L           P E+   +EC         Y YGDGS   Q Y  +        S P    N  FGC         G   G+ G G 
Subjt:  SAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCPPLYYAYGDGSLIAQLYRDSLGLPAPAPSPPIRVRNFTFGCAHT----ALGEPVGVAGFGR

Query:  GALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSDGQSE-FVYTSLLDNPKHPYFYSVGLAGISVGTVTIPAPEFLKRVDDGGSGGVV
        G LS+PSQL         +FSYC+ S+  ++     PS L LG  + G  E    T+L+ +  +P +Y + L GI+VG   +  P    ++ D G+GG++
Subjt:  GALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSDGQSE-FVYTSLLDNPKHPYFYSVGLAGISVGTVTIPAPEFLKRVDDGGSGGVV

Query:  VDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYY---DDSIQVPRVVLHF------VGERSSVVLPRKNYFYEFFDGGDGVGKKRK
        +DSGTT T LP   Y +V   F ++           E ++GLS C+       ++QVP + + F      +GE++ ++ P +                  
Subjt:  VDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYY---DDSIQVPRVVLHF------VGERSSVVLPRKNYFYEFFDGGDGVGKKRK

Query:  VGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQC
        V CL +   G  ++L     +  GN QQQ  +V+YDL+   V F   QC
Subjt:  VGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQC

Q766C3 Aspartic proteinase nepenthesin-19.4e-3328.76Show/hide
Query:  NNNTHNLLKSAADRSSARHRRRRRTHLSLP-------LSPGGDYTLSFNLGSHAHTISLYMDTGSDFVWFPCSPFECILCEGKPQIQPPFPQIPKQKSVS
        N     LL+ A +R S R  +R    L+ P        +  G+Y ++ ++G+ A   S  MDTGSD +W  C P      +  P   P       Q S S
Subjt:  NNNTHNLLKSAADRSSARHRRRRRTHLSLP-------LSPGGDYTLSFNLGSHAHTISLYMDTGSDFVWFPCSPFECILCEGKPQIQPPFPQIPKQKSVS

Query:  CSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCPPLYYAYGDGSLIAQLYRDSLGLPAPAPSPPIRVRNFTFGCAHT----ALGEPVGVAGFG
         S   CS         S LC       +++    CS+  C    Y YGDGS      + S+G         + + N TFGC         G   G+ G G
Subjt:  CSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCPPLYYAYGDGSLIAQLYRDSLGLPAPAPSPPIRVRNFTFGCAHT----ALGEPVGVAGFG

Query:  RGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSDG-QSEFVYTSLLDNPKHPYFYSVGLAGISVGTVTIPA-PEFLKRVDDGGSGG
        RG LS+PSQL         +FSYC+     +T     PS L+LG  ++   +    T+L+ + + P FY + L G+SVG+  +P  P       + G+GG
Subjt:  RGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSDG-QSEFVYTSLLDNPKHPYFYSVGLAGISVGTVTIPA-PEFLKRVDDGGSGG

Query:  VVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYY---DDSIQVPRVVLHFVGERSSVVLPRKNYFYEFFDGGDGVGKKRKVGCL
        +++DSGTT T      Y+SV  +F ++        S    ++G   C+       ++Q+P  V+HF G    + LP +NYF         +     + CL
Subjt:  VVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYY---DDSIQVPRVVLHFVGERSSVVLPRKNYFYEFFDGGDGVGKKRKVGCL

Query:  MLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQC
         + +      +        GN QQQ   VVYD   + V FA  QC
Subjt:  MLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQC

Q940R4 Probable aspartyl protease At4g165631.3e-16460.41Show/hide
Query:  FLLSSSSVFSSQILLLPLSHSLSSNLNNNNTHNLLKSAADRSSAR----HRRRRRTHLSLPLSPGGDYTLSFNLGSHAHTISLYMDTGSDFVWFPCSPFE
        F  S SS+  S  LLL LSHSLS++ ++++  +LLKS++ RSSAR    H ++++  LSLP+S G DY +S ++GS +  +SLY+DTGSD VWFPC PF 
Subjt:  FLLSSSSVFSSQILLLPLSHSLSSNLNNNNTHNLLKSAADRSSAR----HRRRRRTHLSLPLSPGGDYTLSFNLGSHAHTISLYMDTGSDFVWFPCSPFE

Query:  CILCEGKP-QIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSEC--SSFSCPPLYYAYGDGSLIAQLYRDSLGLPAPAPSPPIR
        CILCE KP    PP        +VSCS+P+CSAAH +SL +S LCAIS CPL+ IE  +C  SS+ CPP YYAYGDGSL+A+LY DSL LP+      + 
Subjt:  CILCEGKP-QIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSEC--SSFSCPPLYYAYGDGSLIAQLYRDSLGLPAPAPSPPIR

Query:  VRNFTFGCAHTALGEPVGVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSD--------------------GQSEFVYTS
        V NFTFGCAHT L EP+GVAGFGRG LS+P+QLA  SP LGN FSYCLVSHSF++DRVRRPSPLILGR+ D                     ++EFV+T 
Subjt:  VRNFTFGCAHTALGEPVGVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSD--------------------GQSEFVYTS

Query:  LLDNPKHPYFYSVGLAGISVGTVTIPAPEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYYDDSIQVPRV
        +L+NPKHPYFYSV L GIS+G   IPAP  L+R+D  G GGVVVDSGTTFTMLPA  Y SVV +F++R GRV  RA R+E ++G+SPCYY + +++VP +
Subjt:  LLDNPKHPYFYSVGLAGISVGTVTIPAPEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYYDDSIQVPRV

Query:  VLHFVGERSSVVLPRKNYFYEFFDGGDGVGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSL
        VLHF G RSSV LPR+NYFYEF DGGDG  +KRK+GCLMLMNGGDE+EL GG GA LGNYQQQGFEVVYDL   RVGFA+R+C++LWDSL
Subjt:  VLHFVGERSSVVLPRKNYFYEFFDGGDGVGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSL

Q9LNJ3 Aspartyl protease family protein 27.2e-3330.31Show/hide
Query:  SSSSVFSSQILLLPLSHSLSSNLNNNNTHNLLKSAADRSSARHRR----------RRRTHLSLP----------LSPG-GDYTLSFNLGSHAHTISLYMD
        S S   SS  + L L H + +  +N     L  S   R S R +           R  TH   P          LS G G+Y     +G+ A  + + +D
Subjt:  SSSSVFSSQILLLPLSHSLSSNLNNNNTHNLLKSAADRSSARHRR----------RRRTHLSLP----------LSPG-GDYTLSFNLGSHAHTISLYMD

Query:  TGSDFVWFPCSPFECILCEGKPQIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCPPLY-YAYGDGSL-IAQLYRDS
        TGSD VW  C+P      +  P   P       +KS + +   CS+ H                   ++ + C++     LY  +YGDGS  +     ++
Subjt:  TGSDFVWFPCSPFECILCEGKPQIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCPPLY-YAYGDGSL-IAQLYRDS

Query:  LGLPAPAPSPPIRVRNFTFGCAHTALGEPVGVA---GFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSDGQSEFVYTSLLDN
        L           RV+    GC H   G  VG A   G G+G LS P Q      +   +FSYCLV  S ++    +PS ++ G  +  +    +T LL N
Subjt:  LGLPAPAPSPPIRVRNFTFGCAHTALGEPVGVA---GFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSDGQSEFVYTSLLDN

Query:  PKHPYFYSVGLAGISVGTVTIP-APEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYYD--DSIQVPRVV
        PK   FY VGL GISVG   +P     L ++D  G+GGV++DSGT+ T L    Y ++   F  R G  A    R  + +    C+     + ++VP VV
Subjt:  PKHPYFYSVGLAGISVGTVTIP-APEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYYD--DSIQVPRVV

Query:  LHFVGERSSVVLPRKNYFYEFFDGGDGVGKKRKVGCLMLMNGGDEAELAGGPG--ATLGNYQQQGFEVVYDLEKNRVGFARRQCS
        LHF G  + V LP  NY                    +  NG      AG  G  + +GN QQQGF VVYDL  +RVGFA   C+
Subjt:  LHFVGERSSVVLPRKNYFYEFFDGGDGVGKKRKVGCLMLMNGGDEAELAGGPG--ATLGNYQQQGFEVVYDLEKNRVGFARRQCS

Q9LS40 Protein ASPARTIC PROTEASE IN GUARD CELL 19.4e-3330.1Show/hide
Query:  GDYTLSFNLGSHAHTISLYMDTGSDFVWFPCSPFECILCEGKPQIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCP
        G+Y     +G+ A  + L +DTGSD  W  C P      +  P   P        KS++CSAP CS                      +E S C S  C 
Subjt:  GDYTLSFNLGSHAHTISLYMDTGSDFVWFPCSPFECILCEGKPQIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCP

Query:  PLY-YAYGDGSL-IAQLYRDSLGLPAPAPSPPIRVRNFTFGCAHTALG---EPVGVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSP
         LY  +YGDGS  + +L  D++           ++ N   GC H   G      G+ G G G LS+ +Q+   S      FSYCLV          + S 
Subjt:  PLY-YAYGDGSL-IAQLYRDSLGLPAPAPSPPIRVRNFTFGCAHTALG---EPVGVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSP

Query:  LILGRYSDGQSEFVYTSLLDNPKHPYFYSVGLAGISVGTVTIPAPEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENT
        L       G  +     LL N K   FY VGL+G SVG   +  P+ +  VD  GSGGV++D GT  T L    Y S+   F   T  +   +S I   +
Subjt:  LILGRYSDGQSEFVYTSLLDNPKHPYFYSVGLAGISVGTVTIPAPEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENT

Query:  GLSPCYYYD--DSIQVPRVVLHFVGERSSVVLPRKNYFYEFFDGGDGVGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARR
            CY +    +++VP V  HF G + S+ LP KNY     D G          C           +       +GN QQQG  + YDL KN +G +  
Subjt:  GLSPCYYYD--DSIQVPRVVLHFVGERSSVVLPRKNYFYEFFDGGDGVGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARR

Query:  QC
        +C
Subjt:  QC

Arabidopsis top hitse value%identityAlignment
AT1G01300.1 Eukaryotic aspartyl protease family protein5.1e-3430.31Show/hide
Query:  SSSSVFSSQILLLPLSHSLSSNLNNNNTHNLLKSAADRSSARHRR----------RRRTHLSLP----------LSPG-GDYTLSFNLGSHAHTISLYMD
        S S   SS  + L L H + +  +N     L  S   R S R +           R  TH   P          LS G G+Y     +G+ A  + + +D
Subjt:  SSSSVFSSQILLLPLSHSLSSNLNNNNTHNLLKSAADRSSARHRR----------RRRTHLSLP----------LSPG-GDYTLSFNLGSHAHTISLYMD

Query:  TGSDFVWFPCSPFECILCEGKPQIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCPPLY-YAYGDGSL-IAQLYRDS
        TGSD VW  C+P      +  P   P       +KS + +   CS+ H                   ++ + C++     LY  +YGDGS  +     ++
Subjt:  TGSDFVWFPCSPFECILCEGKPQIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCPPLY-YAYGDGSL-IAQLYRDS

Query:  LGLPAPAPSPPIRVRNFTFGCAHTALGEPVGVA---GFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSDGQSEFVYTSLLDN
        L           RV+    GC H   G  VG A   G G+G LS P Q      +   +FSYCLV  S ++    +PS ++ G  +  +    +T LL N
Subjt:  LGLPAPAPSPPIRVRNFTFGCAHTALGEPVGVA---GFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSDGQSEFVYTSLLDN

Query:  PKHPYFYSVGLAGISVGTVTIP-APEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYYD--DSIQVPRVV
        PK   FY VGL GISVG   +P     L ++D  G+GGV++DSGT+ T L    Y ++   F  R G  A    R  + +    C+     + ++VP VV
Subjt:  PKHPYFYSVGLAGISVGTVTIP-APEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYYD--DSIQVPRVV

Query:  LHFVGERSSVVLPRKNYFYEFFDGGDGVGKKRKVGCLMLMNGGDEAELAGGPG--ATLGNYQQQGFEVVYDLEKNRVGFARRQCS
        LHF G  + V LP  NY                    +  NG      AG  G  + +GN QQQGF VVYDL  +RVGFA   C+
Subjt:  LHFVGERSSVVLPRKNYFYEFFDGGDGVGKKRKVGCLMLMNGGDEAELAGGPG--ATLGNYQQQGFEVVYDLEKNRVGFARRQCS

AT1G25510.1 Eukaryotic aspartyl protease family protein2.3e-3430.35Show/hide
Query:  GDYTLSFNLGSHAHTISLYMDTGSDFVWFPCSPFECILCEGKPQIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCP
        G+Y     +G  A  + + +DTGSD  W  C+P      + +P  +P        + +SC  P C+A                     +EVSEC + +C 
Subjt:  GDYTLSFNLGSHAHTISLYMDTGSDFVWFPCSPFECILCEGKPQIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCP

Query:  PLY-YAYGDGS-LIAQLYRDSLGLPAPAPSPPIRVRNFTFGCAHTALGEPVGVA---GFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSP
         LY  +YGDGS  +     ++L + +        V+N   GC H+  G  VG A   G G G L++PSQL T S      FSYCLV    ++      S 
Subjt:  PLY-YAYGDGS-LIAQLYRDSLGLPAPAPSPPIRVRNFTFGCAHTALGEPVGVA---GFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSP

Query:  LILGRYSDGQSEFVYTSLLDNPKHPYFYSVGLAGISVGTVTIPAPEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENT
        +  G  +    + V   LL N +   FY +GL GISVG   +  P+    +D+ GSGG+++DSGT  T L   +Y S+   F   T  +   A     +T
Subjt:  LILGRYSDGQSEFVYTSLLDNPKHPYFYSVGLAGISVGTVTIPAPEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENT

Query:  GLSPCYYYD--DSIQVPRVVLHFVGERSSVVLPRKNYFYEFFDGGDGVGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARR
            CY      +++VP V  HF G +  + LP KNY        D VG      CL                A +GN QQQG  V +DL  + +GF+  
Subjt:  GLSPCYYYD--DSIQVPRVVLHFVGERSSVVLPRKNYFYEFFDGGDGVGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARR

Query:  QC
        +C
Subjt:  QC

AT3G52500.1 Eukaryotic aspartyl protease family protein5.4e-4430.57Show/hide
Query:  MASPVFAFLCFLLSSSSVFSSQILLLPLSHSLSSNLNNNNTHNLLKSAADRSSARHRRRRR-------------------THLSLPLSPG--GDYTLSFN
        MAS +F F  FL+  S V + ++ L P SH   S+ +  + +  L+  A+ S AR  + +                    T +  PLS    G Y++S +
Subjt:  MASPVFAFLCFLLSSSSVFSSQILLLPLSHSLSSNLNNNNTHNLLKSAADRSSARHRRRRR-------------------THLSLPLSPG--GDYTLSFN

Query:  LGSHAHTISLYMDTGSDFVWFPC-SPFECILCEGK---PQIQPPFPQIPKQKS----VSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCP
         G+ + TI    DTGS  VW PC S + C  C+     P + P F  IPK  S    + C +P C   +G ++         +C         C +  CP
Subjt:  LGSHAHTISLYMDTGSDFVWFPC-SPFECILCEGK---PQIQPPFPQIPKQKS----VSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCP

Query:  PLYYAYGDGSLIAQLYRDSLGLPAPAPSPPIRVRNFTFGCAHTALGEPVGVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGR
        P    YG GS    L  + L        P + V +F  GC+  +  +P G+AGFGRG +S+PSQ+         RFS+CLVS  F+   V     L  G 
Subjt:  PLYYAYGDGSLIAQLYRDSLGLPAPAPSPPIRVRNFTFGCAHTALGEPVGVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGR

Query:  YSDGQSE---FVYTSLLDNPK-----HPYFYSVGLAGISVGTVTIPAPEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIE
          +  S+     YT    NP         +Y + L  I VG   +  P         G GG +VDSG+TFT +   ++E V  +F ++     TR   +E
Subjt:  YSDGQSE---FVYTSLLDNPK-----HPYFYSVGLAGISVGTVTIPAPEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIE

Query:  ENTGLSPCYYYD--DSIQVPRVVLHFVGERSSVVLPRKNYFYEFFDGGDGVGKKRKVGCLMLMNGGDEAELAG-GPGATLGNYQQQGFEVVYDLEKNRVG
        + TGL PC+       + VP ++  F G  + + LP  NYF  F    D V       CL +++        G GP   LG++QQQ + V YDLE +R G
Subjt:  ENTGLSPCYYYD--DSIQVPRVVLHFVGERSSVVLPRKNYFYEFFDGGDGVGKKRKVGCLMLMNGGDEAELAG-GPGATLGNYQQQGFEVVYDLEKNRVG

Query:  FARRQCS
        FA+++CS
Subjt:  FARRQCS

AT4G16563.1 Eukaryotic aspartyl protease family protein9.6e-16660.41Show/hide
Query:  FLLSSSSVFSSQILLLPLSHSLSSNLNNNNTHNLLKSAADRSSAR----HRRRRRTHLSLPLSPGGDYTLSFNLGSHAHTISLYMDTGSDFVWFPCSPFE
        F  S SS+  S  LLL LSHSLS++ ++++  +LLKS++ RSSAR    H ++++  LSLP+S G DY +S ++GS +  +SLY+DTGSD VWFPC PF 
Subjt:  FLLSSSSVFSSQILLLPLSHSLSSNLNNNNTHNLLKSAADRSSAR----HRRRRRTHLSLPLSPGGDYTLSFNLGSHAHTISLYMDTGSDFVWFPCSPFE

Query:  CILCEGKP-QIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSEC--SSFSCPPLYYAYGDGSLIAQLYRDSLGLPAPAPSPPIR
        CILCE KP    PP        +VSCS+P+CSAAH +SL +S LCAIS CPL+ IE  +C  SS+ CPP YYAYGDGSL+A+LY DSL LP+      + 
Subjt:  CILCEGKP-QIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSEC--SSFSCPPLYYAYGDGSLIAQLYRDSLGLPAPAPSPPIR

Query:  VRNFTFGCAHTALGEPVGVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSD--------------------GQSEFVYTS
        V NFTFGCAHT L EP+GVAGFGRG LS+P+QLA  SP LGN FSYCLVSHSF++DRVRRPSPLILGR+ D                     ++EFV+T 
Subjt:  VRNFTFGCAHTALGEPVGVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSD--------------------GQSEFVYTS

Query:  LLDNPKHPYFYSVGLAGISVGTVTIPAPEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYYDDSIQVPRV
        +L+NPKHPYFYSV L GIS+G   IPAP  L+R+D  G GGVVVDSGTTFTMLPA  Y SVV +F++R GRV  RA R+E ++G+SPCYY + +++VP +
Subjt:  LLDNPKHPYFYSVGLAGISVGTVTIPAPEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYYDDSIQVPRV

Query:  VLHFVGERSSVVLPRKNYFYEFFDGGDGVGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSL
        VLHF G RSSV LPR+NYFYEF DGGDG  +KRK+GCLMLMNGGDE+EL GG GA LGNYQQQGFEVVYDL   RVGFA+R+C++LWDSL
Subjt:  VLHFVGERSSVVLPRKNYFYEFFDGGDGVGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSL

AT5G45120.1 Eukaryotic aspartyl protease family protein1.7e-5032.78Show/hide
Query:  YTLSFNLGSHAHTISLYMDTGSDFVWFPCS--PFECILCEG--KPQIQPPFPQIPKQKSV----SCSAPACSAAHGASLSASHLCAISRCPLESIEVSEC
        Y ++ N+G+    + +Y+DTGSD  W PC    F+CI C       ++ P    P   S     SC++  C   H +S +    CA++ C +  +  S C
Subjt:  YTLSFNLGSHAHTISLYMDTGSDFVWFPCS--PFECILCEG--KPQIQPPFPQIPKQKSV----SCSAPACSAAHGASLSASHLCAISRCPLESIEVSEC

Query:  SSFSCPPLYYAYGDGSLIAQLYRDSLGLPAPAPSPPIRVRNFTFGCAHTALGEPVGVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPS
            CP   Y YG+G LI+ +    + L A     P     F+FGC  +   EP+G+AGFGRG LS+PSQL      L   FS+C +   F  +     S
Subjt:  SSFSCPPLYYAYGDGSLIAQLYRDSLGLPAPAPSPPIRVRNFTFGCAHTALGEPVGVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPS

Query:  PLILGRYS---DGQSEFVYTSLLDNPKHPYFYSVGLAGISVGTVTIP--APEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRAS
        PLILG  +   +      +T +L+ P +P  Y +GL  I++GT   P   P  L++ D  G+GG++VDSGTT+T LP   Y  ++   ++       RA+
Subjt:  PLILGRYS---DGQSEFVYTSLLDNPKHPYFYSVGLAGISVGTVTIP--APEFLKRVDDGGSGGVVVDSGTTFTMLPAGLYESVVAQFENRTGRVATRAS

Query:  RIEENTGLSPCYYY------------DDSIQVPRVVLHFVGERSSVVLPRKNYFYEFFDGGDGVGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGF
          E  TG   CY              D  +  P +  HF+   ++++LP+ N FY      DG      V CL+  N  D      GP    G++QQQ  
Subjt:  RIEENTGLSPCYYY------------DDSIQVPRVVLHFVGERSSVVLPRKNYFYEFFDGGDGVGKKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGF

Query:  EVVYDLEKNRVGFARRQC
        +VVYDLEK R+GF    C
Subjt:  EVVYDLEKNRVGFARRQC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCCCTGTTTTTGCATTCCTCTGTTTTCTCCTTTCTTCTTCCTCTGTTTTCTCTTCACAAATTCTCCTTCTCCCTCTCTCCCATTCCTTATCATCCAACCTCAA
CAACAACAACACCCACAACCTCCTCAAATCCGCCGCCGACCGCTCCTCCGCCCGCCACCGCCGCCGCCGCCGCACCCACCTCTCCCTCCCCCTGTCCCCCGGCGGCGACT
ACACTCTCTCCTTCAACCTGGGCTCCCACGCTCACACCATTTCCCTCTACATGGACACCGGCAGCGACTTCGTCTGGTTCCCCTGTTCCCCCTTCGAGTGTATTCTCTGC
GAAGGCAAGCCCCAAATTCAACCCCCCTTTCCCCAAATTCCCAAACAAAAATCTGTTTCTTGCAGCGCCCCCGCCTGCTCCGCCGCCCACGGCGCCTCCCTCTCCGCCTC
CCACCTCTGCGCTATTTCCCGATGCCCCCTCGAATCCATTGAAGTTTCTGAGTGCTCTTCCTTTTCTTGTCCGCCTCTGTATTACGCTTACGGCGATGGGAGTTTGATTG
CTCAGCTTTATAGAGACAGCTTGGGTTTGCCGGCGCCGGCGCCCTCACCGCCGATTCGTGTTCGGAATTTCACTTTCGGGTGTGCACATACGGCGCTCGGGGAGCCGGTC
GGGGTTGCTGGGTTCGGCCGGGGGGCTTTGTCGATGCCGAGTCAGTTAGCTACTTTCTCACCTCAATTAGGGAACCGCTTTTCTTATTGCTTGGTTTCTCACTCCTTTAA
TACGGACCGGGTTCGCCGACCGAGTCCGTTGATTCTGGGCCGGTATTCTGATGGCCAGTCGGAGTTTGTTTACACGTCTTTGCTTGATAATCCGAAGCACCCTTATTTTT
ACTCGGTTGGATTGGCCGGAATCTCGGTCGGCACAGTGACGATTCCGGCGCCGGAGTTTTTGAAAAGGGTGGACGACGGCGGCAGCGGTGGTGTTGTGGTGGATTCCGGT
ACTACTTTCACTATGCTCCCGGCGGGTTTGTATGAATCGGTTGTGGCTCAGTTTGAGAATCGGACCGGTCGGGTTGCAACGCGGGCGAGCCGGATTGAAGAAAATACCGG
GTTGAGTCCTTGCTATTATTATGATGACTCAATTCAAGTGCCACGTGTCGTGCTGCATTTCGTCGGGGAAAGATCTAGTGTGGTGCTTCCCAGGAAGAACTACTTTTACG
AGTTTTTTGACGGTGGAGATGGGGTGGGGAAGAAGAGAAAAGTTGGGTGTTTGATGTTGATGAACGGCGGAGATGAGGCTGAGCTGGCAGGTGGGCCCGGGGCCACACTA
GGCAACTATCAACAACAGGGCTTTGAAGTGGTTTACGATTTGGAGAAAAACCGGGTCGGTTTTGCCCGGCGACAGTGTTCAACTCTTTGGGACAGCCTGAACCGAAGTTA
A
mRNA sequenceShow/hide mRNA sequence
ATTTTAACAAATGATCGGTTATTGAAAATGACATTTGAAAAATAAGTTATTATAACAAGAATTATAATAAACTATGTTGGAGAGATTATAATAAATATATATGTATCTAT
AGTTTAGATAAAGAGCACATAAATTTAGGTAACGGGACAAATATAGGCTGTTATTATAATTAACAGATCGATTTACATAAACGCGGCACAAAGACATATTAAAGAGTTTA
TGTGAAAAAGAAAGAAGCTCCCGTTAGTTCTAATGGCCACTAGTCTTGGAACCCATACAAAATCCTACCACTTCTTCAAACCAGAACCTTCTTTCTTCTTCTTCTTCTTT
TCTTCTTCTTCTTCAATTCCCTTCTCTTTATATTCCAAACTTCATCTCTCTCTCTCTCTGTCTATCCTCTTCAATGGCTTCCCCTGTTTTTGCATTCCTCTGTTTTCTCC
TTTCTTCTTCCTCTGTTTTCTCTTCACAAATTCTCCTTCTCCCTCTCTCCCATTCCTTATCATCCAACCTCAACAACAACAACACCCACAACCTCCTCAAATCCGCCGCC
GACCGCTCCTCCGCCCGCCACCGCCGCCGCCGCCGCACCCACCTCTCCCTCCCCCTGTCCCCCGGCGGCGACTACACTCTCTCCTTCAACCTGGGCTCCCACGCTCACAC
CATTTCCCTCTACATGGACACCGGCAGCGACTTCGTCTGGTTCCCCTGTTCCCCCTTCGAGTGTATTCTCTGCGAAGGCAAGCCCCAAATTCAACCCCCCTTTCCCCAAA
TTCCCAAACAAAAATCTGTTTCTTGCAGCGCCCCCGCCTGCTCCGCCGCCCACGGCGCCTCCCTCTCCGCCTCCCACCTCTGCGCTATTTCCCGATGCCCCCTCGAATCC
ATTGAAGTTTCTGAGTGCTCTTCCTTTTCTTGTCCGCCTCTGTATTACGCTTACGGCGATGGGAGTTTGATTGCTCAGCTTTATAGAGACAGCTTGGGTTTGCCGGCGCC
GGCGCCCTCACCGCCGATTCGTGTTCGGAATTTCACTTTCGGGTGTGCACATACGGCGCTCGGGGAGCCGGTCGGGGTTGCTGGGTTCGGCCGGGGGGCTTTGTCGATGC
CGAGTCAGTTAGCTACTTTCTCACCTCAATTAGGGAACCGCTTTTCTTATTGCTTGGTTTCTCACTCCTTTAATACGGACCGGGTTCGCCGACCGAGTCCGTTGATTCTG
GGCCGGTATTCTGATGGCCAGTCGGAGTTTGTTTACACGTCTTTGCTTGATAATCCGAAGCACCCTTATTTTTACTCGGTTGGATTGGCCGGAATCTCGGTCGGCACAGT
GACGATTCCGGCGCCGGAGTTTTTGAAAAGGGTGGACGACGGCGGCAGCGGTGGTGTTGTGGTGGATTCCGGTACTACTTTCACTATGCTCCCGGCGGGTTTGTATGAAT
CGGTTGTGGCTCAGTTTGAGAATCGGACCGGTCGGGTTGCAACGCGGGCGAGCCGGATTGAAGAAAATACCGGGTTGAGTCCTTGCTATTATTATGATGACTCAATTCAA
GTGCCACGTGTCGTGCTGCATTTCGTCGGGGAAAGATCTAGTGTGGTGCTTCCCAGGAAGAACTACTTTTACGAGTTTTTTGACGGTGGAGATGGGGTGGGGAAGAAGAG
AAAAGTTGGGTGTTTGATGTTGATGAACGGCGGAGATGAGGCTGAGCTGGCAGGTGGGCCCGGGGCCACACTAGGCAACTATCAACAACAGGGCTTTGAAGTGGTTTACG
ATTTGGAGAAAAACCGGGTCGGTTTTGCCCGGCGACAGTGTTCAACTCTTTGGGACAGCCTGAACCGAAGTTAATGTGAAATGTGGGACCGGTTCGAGGACTGGAAGGTT
GACTATTTTGTGCTTTGACTTTGTTGTAGTTGCAATAGTCCACGCTTTTTGGAGTAAATGAGGTAATTTGACGTTTGATGTGGGCCTTCTTTTGTAAATTCTTGCTAGCA
CTATACTTCTTTGCTTCATTGTTATTTATTATAGTTGAAATTTGTACGTGAAAAGTGTTATAAATGTTCATAAACACAAGTGGGAAAATATTATTTATTTTAAGGCAGGA
Protein sequenceShow/hide protein sequence
MASPVFAFLCFLLSSSSVFSSQILLLPLSHSLSSNLNNNNTHNLLKSAADRSSARHRRRRRTHLSLPLSPGGDYTLSFNLGSHAHTISLYMDTGSDFVWFPCSPFECILC
EGKPQIQPPFPQIPKQKSVSCSAPACSAAHGASLSASHLCAISRCPLESIEVSECSSFSCPPLYYAYGDGSLIAQLYRDSLGLPAPAPSPPIRVRNFTFGCAHTALGEPV
GVAGFGRGALSMPSQLATFSPQLGNRFSYCLVSHSFNTDRVRRPSPLILGRYSDGQSEFVYTSLLDNPKHPYFYSVGLAGISVGTVTIPAPEFLKRVDDGGSGGVVVDSG
TTFTMLPAGLYESVVAQFENRTGRVATRASRIEENTGLSPCYYYDDSIQVPRVVLHFVGERSSVVLPRKNYFYEFFDGGDGVGKKRKVGCLMLMNGGDEAELAGGPGATL
GNYQQQGFEVVYDLEKNRVGFARRQCSTLWDSLNRS