; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0020505 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0020505
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionEukaryotic aspartyl protease family protein
Genome locationchr06:33842350..33844442
RNA-Seq ExpressionPay0020505
SyntenyPay0020505
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0005576 - extracellular region (cellular component)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily
IPR032799 - Xylanase inhibitor, C-terminal
IPR032861 - Xylanase inhibitor, N-terminal
IPR033121 - Peptidase family A1 domain
IPR034161 - Pepsin-like domain, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK12019.1 aspartic proteinase nepenthesin-1 [Cucumis melo var. makuwa]6.7e-27899.38Show/hide
Query:  MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHRHRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLV
        MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHRHRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLV
Subjt:  MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHRHRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLV

Query:  WFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSLPT--P
        WFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSLPT  P
Subjt:  WFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSLPT--P

Query:  APSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSV
        APSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSV
Subjt:  APSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSV

Query:  GLAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVVL
        GLAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYY+NSVGVPRVVLHFVGEKSSVVL
Subjt:  GLAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVVL

Query:  PRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS
        PRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS
Subjt:  PRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS

XP_004147205.1 probable aspartyl protease At4g16563 [Cucumis sativus]7.7e-27497.92Show/hide
Query:  MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHRHRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLV
        MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFN+THNLLKSTATRSSARFHRHRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLV
Subjt:  MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHRHRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLV

Query:  WFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSLPTPAP
        WFPCSPFECILCEGKPKIQSPLPKI+NNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSLPTPAP
Subjt:  WFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSLPTPAP

Query:  SPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSVGL
        SPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRY+TGETEFIYTSLLENPKHPYFYSVGL
Subjt:  SPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSVGL

Query:  AGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVVLPR
        AGISVGN+RIPAPEFL KVDE GSGGVVVDSGTTFTMLP+GLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKS+VVLPR
Subjt:  AGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVVLPR

Query:  KNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS
        KNYFYEFLDGGDGV  VGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS
Subjt:  KNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS

XP_008448851.1 PREDICTED: aspartic proteinase nepenthesin-1 [Cucumis melo]9.4e-280100Show/hide
Query:  MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHRHRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLV
        MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHRHRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLV
Subjt:  MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHRHRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLV

Query:  WFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSLPTPAP
        WFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSLPTPAP
Subjt:  WFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSLPTPAP

Query:  SPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSVGL
        SPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSVGL
Subjt:  SPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSVGL

Query:  AGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVVLPR
        AGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVVLPR
Subjt:  AGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVVLPR

Query:  KNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS
        KNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS
Subjt:  KNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS

XP_023553227.1 probable aspartyl protease At4g16563 [Cucurbita pepo subsp. pepo]5.5e-24889.46Show/hide
Query:  SPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHR----HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDL
        SPVF+FLLCFLLSSPVFSSQ+ LLPLS+SLSSS SDFN+THNLLKSTA RSSARFH     H  +HLSLPLSPGGDYTLSFNLGSES KISLYMDTGSDL
Subjt:  SPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHR----HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDL

Query:  VWFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSL--PT
        VWFPCSPFECILCEGKPKIQSPLPKI++ KSVSCSAAACSAAHGGSLSASHLCAISRCPLESIE+SECSSFSCPPFYYAYGDGSL+ RLYRDSLSL  P 
Subjt:  VWFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSL--PT

Query:  PAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYS
        PAPSP INVRNFTFGCAH+ LGEP+GVAGFGRG+LSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRY+  ETEFIYTSLLENPKHPYFYS
Subjt:  PAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYS

Query:  VGLAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVV
        VGLAGISVG+VRIPAPEFL++VDE GSGGVVVDSGTTFTMLP+GLY SVVA+FENRTG+VA+RA RIEENTGLSPCYYYENSV VPRVVLHFVGEKSSVV
Subjt:  VGLAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVV

Query:  LPRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS
        LPRKNYFYEFLDGGDG   V RKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEV YDLE NRVGFARRQCSTLWD+LNRS
Subjt:  LPRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS

XP_038905814.1 probable aspartyl protease At4g16563 [Benincasa hispida]8.0e-25591.08Show/hide
Query:  SPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHRHR----HNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDL
        S VF+ LLCFLLSSPVFSSQ+ LLPLSHSLSSSISDFN+THNLLKSTA RSSARFH  R    HNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDL
Subjt:  SPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHRHR----HNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDL

Query:  VWFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSLPTPA
        VWFPCSPFECILCEGKPK+QSPLPKISNNKSVSCSA ACSAAHGGSLSASHLCAIS+CPLESIEISECSSFSCPPFYYAYGDGSL+ARLYRDSLSLP PA
Subjt:  VWFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSLPTPA

Query:  PSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSVG
        PSP INVRNFTFGCAHT LGEPVGVAGFGRG LSMPSQLATFSPQLGNRFSYCLVSHSFAA+RVRRPSPLILGRY+ GETEFIYTSLLENPKHPYFYSVG
Subjt:  PSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSVG

Query:  LAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVVLP
        L GISVGN+ IPAPEFL+KVDE GSGGVVVDSGTTFTMLP+GLY+SVVA FENRTG+VANRARRIEENTGLSPCYYYENSV VPRVVLHFVGEKSSV+LP
Subjt:  LAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVVLP

Query:  RKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS
        +KNYFYEFLDGGDG   VG+KRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEV YDL KNRVGFARRQCSTLWD+LNRS
Subjt:  RKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS

TrEMBL top hitse value%identityAlignment
A0A0A0L5I7 Pepsin A3.7e-27497.92Show/hide
Query:  MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHRHRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLV
        MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFN+THNLLKSTATRSSARFHRHRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLV
Subjt:  MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHRHRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLV

Query:  WFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSLPTPAP
        WFPCSPFECILCEGKPKIQSPLPKI+NNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSLPTPAP
Subjt:  WFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSLPTPAP

Query:  SPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSVGL
        SPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRY+TGETEFIYTSLLENPKHPYFYSVGL
Subjt:  SPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSVGL

Query:  AGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVVLPR
        AGISVGN+RIPAPEFL KVDE GSGGVVVDSGTTFTMLP+GLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKS+VVLPR
Subjt:  AGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVVLPR

Query:  KNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS
        KNYFYEFLDGGDGV  VGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS
Subjt:  KNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS

A0A1S3BK28 aspartic proteinase nepenthesin-14.5e-280100Show/hide
Query:  MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHRHRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLV
        MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHRHRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLV
Subjt:  MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHRHRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLV

Query:  WFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSLPTPAP
        WFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSLPTPAP
Subjt:  WFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSLPTPAP

Query:  SPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSVGL
        SPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSVGL
Subjt:  SPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSVGL

Query:  AGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVVLPR
        AGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVVLPR
Subjt:  AGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVVLPR

Query:  KNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS
        KNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS
Subjt:  KNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS

A0A5D3CP11 Aspartic proteinase nepenthesin-13.3e-27899.38Show/hide
Query:  MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHRHRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLV
        MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHRHRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLV
Subjt:  MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHRHRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLV

Query:  WFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSLPT--P
        WFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSLPT  P
Subjt:  WFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSLPT--P

Query:  APSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSV
        APSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSV
Subjt:  APSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSV

Query:  GLAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVVL
        GLAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYY+NSVGVPRVVLHFVGEKSSVVL
Subjt:  GLAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVVL

Query:  PRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS
        PRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS
Subjt:  PRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS

A0A6J1EC44 probable aspartyl protease At4g165635.1e-24789.05Show/hide
Query:  SPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHR----HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDL
        SPVF+FLLCFL SSPVFSSQ+ LLPLS+SLSSS SDFN+THNLLKSTA RSSARFH     H  +HLSLPLSPGGDYTLSFNLGSES KISLYMDTGSDL
Subjt:  SPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHR----HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDL

Query:  VWFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSL--PT
        VWFPCSPFECILCEGKPKIQSPLPKISN KSVSCSAAACSAAHGGSLSASHLCAISRCPLESIE+SECSSFSCPPFYYAYGDGSL+ RLYRDSLSL  P 
Subjt:  VWFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSL--PT

Query:  PAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYS
        PAPSP INVRNFTFGCAH+ LGEP+GVAGFGRG+LSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRY+  ETEFIYTS+LENPKHPYFYS
Subjt:  PAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYS

Query:  VGLAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVV
        VGLAGISVG+VRIPAPEFL++VDE GSGGVVVDSGTTFTMLP+GLY SVVA+FENRTG+VA+RA RIEENTGLSPCY YE SV VPRVVLHFVGEKSSV 
Subjt:  VGLAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVV

Query:  LPRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS
        LPRKNYFYEFLDGGDG   VGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEV YDLE NRVGFARRQCSTLWD+LNRS
Subjt:  LPRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS

A0A6J1L3Z9 probable aspartyl protease At4g165631.3e-24789.21Show/hide
Query:  SPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHR----HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDL
        SPVF+FLLCFLL SPVFSSQI LLPLS+SLSSS SDFN+THNLLKSTA RSSARFH     H  +HLSLPLSPGGDYTLSFNLGSES KISLYMDTGSDL
Subjt:  SPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHR----HRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDL

Query:  VWFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSLPTPA
        VWFPCSPFECILCEGKPKIQSPLPKISN KSVSCSAAACSAAHGGSLSASHLCAISRCPLESIE+SECSSFSCPPFYYAYGDGSL+ RLYRDSLSLP PA
Subjt:  VWFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSLPTPA

Query:  PSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSVG
        PSP INVRNFTFGCAH+ LGEP+GVAGFGRG+LSMP QLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRY+  ETEFIYTS+LENPKHPYFYSVG
Subjt:  PSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSVG

Query:  LAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVVLP
        LAGISVG+V IPAPEFL+KVDE GSGGVVVDSGTTFTMLP+GLY SVVA+FENRTG+VA+RA +IEENTGLSPCYYYE SV VPRVVLHFVGEKSSV+LP
Subjt:  LAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVVLP

Query:  RKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS
        RKNYFYEFLDGGDG   VGRK KVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEV YDLE NRVGFARRQCSTLWD+LNRS
Subjt:  RKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS

SwissProt top hitse value%identityAlignment
Q766C2 Aspartic proteinase nepenthesin-21.9e-3328.75Show/hide
Query:  GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCP
        G+Y ++  +G+     S  MDTGSDL+W  C P  C  C        P P  +   S S S   C + +   L           P E+   +EC      
Subjt:  GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCP

Query:  PFYYAYGDGSLV-ARLYRDSLSLPTPAPSPPINVRNFTFGCAHTT----LGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSP
         + Y YGDGS     +  ++ +  T       +V N  FGC         G   G+ G G G LS+PSQL         +FSYC+ S+  ++     PS 
Subjt:  PFYYAYGDGSLV-ARLYRDSLSLPTPAPSPPINVRNFTFGCAHTT----LGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSP

Query:  LILGRYHTGETE-FIYTSLLENPKHPYFYSVGLAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEEN
        L LG   +G  E    T+L+ +  +P +Y + L GI+VG   +  P    ++ + G+GG+++DSGTT T LP   Y +V   F ++     N     E +
Subjt:  LILGRYHTGETE-FIYTSLLENPKHPYFYSVGLAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEEN

Query:  TGLSPCYYYE---NSVGVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRV
        +GLS C+      ++V VP + + F G                L+ G+  + +     V CL +   G  ++L     +  GN QQQ  +V+YDL+   V
Subjt:  TGLSPCYYYE---NSVGVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRV

Query:  GFARRQC
         F   QC
Subjt:  GFARRQC

Q766C3 Aspartic proteinase nepenthesin-15.0e-3429.66Show/hide
Query:  GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCP
        G+Y ++ ++G+ +   S  MDTGSDL+W  C P  C  C          P  +   S S S   CS         S LC       +++    CS+  C 
Subjt:  GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCP

Query:  PFYYAYGDGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTT----LGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPL
         + Y YGDGS      + S+   T      +++ N TFGC         G   G+ G GRG LS+PSQL         +FSYC+     +      PS L
Subjt:  PFYYAYGDGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTT----LGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPL

Query:  ILGRYHTGETE-FIYTSLLENPKHPYFYSVGLAGISVGNVRIPA-PEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTG-KVANRARRIEE
        +LG      T     T+L+++ + P FY + L G+SVG+ R+P  P        +G+GG+++DSGTT T   +  Y+SV  EF ++    V N +     
Subjt:  ILGRYHTGETE-FIYTSLLENPKHPYFYSVGLAGISVGNVRIPA-PEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTG-KVANRARRIEE

Query:  NTGLSPCYYY---ENSVGVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNR
        ++G   C+      +++ +P  V+HF G    + LP +NY   F+   +G++         CL + +      +        GN QQQ   VVYD   + 
Subjt:  NTGLSPCYYY---ENSVGVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNR

Query:  VGFARRQC
        V FA  QC
Subjt:  VGFARRQC

Q940R4 Probable aspartyl protease At4g165638.5e-16762.71Show/hide
Query:  LLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHRHRH----NHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSP
        LL LSHSLS+S    +  H LLKS+++RSSARF RH H      LSLP+S G DY +S ++GS S  +SLY+DTGSDLVWFPC PF CILCE KP   SP
Subjt:  LLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHRHRH----NHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSP

Query:  LPKISNN-KSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISEC--SSFSCPPFYYAYGDGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTL
           +S++  +VSCS+ +CSAAH  SL +S LCAIS CPL+ IE  +C  SS+ CPPFYYAYGDGSLVA+LY DSLSLP+      ++V NFTFGCAHTTL
Subjt:  LPKISNN-KSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISEC--SSFSCPPFYYAYGDGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTL

Query:  GEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRY------HTGET--------------EFIYTSLLENPKHPYFYSV
         EP+GVAGFGRG LS+P+QLA  SP LGN FSYCLVSHSF +DRVRRPSPLILGR+        G T              EF++T +LENPKHPYFYSV
Subjt:  GEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRY------HTGET--------------EFIYTSLLENPKHPYFYSV

Query:  GLAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVVL
         L GIS+G   IPAP  LR++D++G GGVVVDSGTTFTMLP+  Y SVV EF++R G+V  RA R+E ++G+SPCYY   +V VP +VLHF G +SSV L
Subjt:  GLAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVVL

Query:  PRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNL
        PR+NYFYEF+DGGDG  E   KRK+GCLMLMNGGDE+EL GG GA LGNYQQQGFEVVYDL   RVGFA+R+C++LWD+L
Subjt:  PRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNL

Q9LNJ3 Aspartyl protease family protein 21.2e-3533.09Show/hide
Query:  LSPG-GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECS
        LS G G+Y     +G+ +  + + +DTGSD+VW  C+P  C  C  +       P     KS + +   CS+ H   L ++  C   R          C 
Subjt:  LSPG-GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECS

Query:  SFSCPPFYYAYGDGSL-VARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVA---GFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVR
              +  +YGDGS  V     ++L+           V+    GC H   G  VG A   G G+G LS P Q      +   +FSYCLV  S ++    
Subjt:  SFSCPPFYYAYGDGSL-VARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVA---GFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVR

Query:  RPSPLILGRYHTGETEFIYTSLLENPKHPYFYSVGLAGISVGNVRIP-APEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARR
        +PS ++ G          +T LL NPK   FY VGL GISVG  R+P     L K+D+ G+GGV++DSGT+ T L    Y ++   F  R G  A   +R
Subjt:  RPSPLILGRYHTGETEFIYTSLLENPKHPYFYSVGLAGISVGNVRIP-APEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARR

Query:  IEENTGLSPCYYYE--NSVGVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEK
          + +    C+     N V VP VVLHF G  + V LP  NY          ++ V    K  C         A   GG  + +GN QQQGF VVYDL  
Subjt:  IEENTGLSPCYYYE--NSVGVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEK

Query:  NRVGFARRQCS
        +RVGFA   C+
Subjt:  NRVGFARRQCS

Q9LS40 Protein ASPARTIC PROTEASE IN GUARD CELL 12.3e-3429.79Show/hide
Query:  RHRHNHLSLPLSPG-----GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAI
        R++   L+ P+  G     G+Y     +G+ + ++ L +DTGSD+ W  C P  C  C  +          S  KS++CSA  CS               
Subjt:  RHRHNHLSLPLSPG-----GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAI

Query:  SRCPLESIEISECSSFSCPPFYYAYGDGSL-VARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLG---EPVGVAGFGRGVLSMPSQLATFSPQLGNRFS
               +E S C S  C  +  +YGDGS  V  L  D+++           + N   GC H   G      G+ G G GVLS+ +Q+   S      FS
Subjt:  SRCPLESIEISECSSFSCPPFYYAYGDGSL-VARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLG---EPVGVAGFGRGVLSMPSQLATFSPQLGNRFS

Query:  YCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSVGLAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEF
        YCLV          + S L       G  +     LL N K   FY VGL+G SVG  ++  P+ +  VD SGSGGV++D GT  T L +  Y S+   F
Subjt:  YCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSVGLAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEF

Query:  ENRTGKVANRARRIEENTGLSPCYYYE--NSVGVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNY
           T    N  +     +    CY +   ++V VP V  HF G K S+ LP KNY     D G             C           +       +GN 
Subjt:  ENRTGKVANRARRIEENTGLSPCYYYE--NSVGVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNY

Query:  QQQGFEVVYDLEKNRVGFARRQC
        QQQG  + YDL KN +G +  +C
Subjt:  QQQGFEVVYDLEKNRVGFARRQC

Arabidopsis top hitse value%identityAlignment
AT1G01300.1 Eukaryotic aspartyl protease family protein8.5e-3733.09Show/hide
Query:  LSPG-GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECS
        LS G G+Y     +G+ +  + + +DTGSD+VW  C+P  C  C  +       P     KS + +   CS+ H   L ++  C   R          C 
Subjt:  LSPG-GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECS

Query:  SFSCPPFYYAYGDGSL-VARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVA---GFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVR
              +  +YGDGS  V     ++L+           V+    GC H   G  VG A   G G+G LS P Q      +   +FSYCLV  S ++    
Subjt:  SFSCPPFYYAYGDGSL-VARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVA---GFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVR

Query:  RPSPLILGRYHTGETEFIYTSLLENPKHPYFYSVGLAGISVGNVRIP-APEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARR
        +PS ++ G          +T LL NPK   FY VGL GISVG  R+P     L K+D+ G+GGV++DSGT+ T L    Y ++   F  R G  A   +R
Subjt:  RPSPLILGRYHTGETEFIYTSLLENPKHPYFYSVGLAGISVGNVRIP-APEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARR

Query:  IEENTGLSPCYYYE--NSVGVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEK
          + +    C+     N V VP VVLHF G  + V LP  NY          ++ V    K  C         A   GG  + +GN QQQGF VVYDL  
Subjt:  IEENTGLSPCYYYE--NSVGVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEK

Query:  NRVGFARRQCS
        +RVGFA   C+
Subjt:  NRVGFARRQCS

AT3G52500.1 Eukaryotic aspartyl protease family protein4.5e-4630.37Show/hide
Query:  SPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHRHRHNH-------------------LSLPLSPG--GDYTLSFNL
        S +F F L FL  S V + ++ L P SHS  S    + S   L +S    S AR H+ +H                     +  PLS    G Y++S + 
Subjt:  SPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHRHRHNH-------------------LSLPLSPG--GDYTLSFNL

Query:  GSESHKISLYMDTGSDLVWFPC-SPFECILCEGKPKIQSPLPKI-----SNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFY
        G+ S  I    DTGS LVW PC S + C  C+      + +P+      S++K + C +  C   +G ++         +C         C +  CPP+ 
Subjt:  GSESHKISLYMDTGSDLVWFPC-SPFECILCEGKPKIQSPLPKI-----SNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFY

Query:  YAYGDGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHT
          YG GS    L  + L        P + V +F  GC+  +  +P G+AGFGRG +S+PSQ+         RFS+CLVS  F    V     L  G  H 
Subjt:  YAYGDGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHT

Query:  GETE---FIYTSLLENPK-----HPYFYSVGLAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENT
          ++     YT   +NP         +Y + L  I VG   +  P        +G GG +VDSG+TFT +   ++E V  EF ++      R + +E+ T
Subjt:  GETE---FIYTSLLENPK-----HPYFYSVGLAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENT

Query:  GLSPCYYY--ENSVGVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAG-GPGATLGNYQQQGFEVVYDLEKNRVG
        GL PC+    +  V VP ++  F G  + + LP  NYF  F+   D V          CL +++        G GP   LG++QQQ + V YDLE +R G
Subjt:  GLSPCYYY--ENSVGVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAG-GPGATLGNYQQQGFEVVYDLEKNRVG

Query:  FARRQCS
        FA+++CS
Subjt:  FARRQCS

AT3G61820.1 Eukaryotic aspartyl protease family protein1.1e-3632.27Show/hide
Query:  LSPG-GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECS
        LS G G+Y +   +G+ +  + + +DTGSD+VW  CSP  C  C  +        K     +V C +  C       L  S  C   R            
Subjt:  LSPG-GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECS

Query:  SFSCPPFYYAYGDGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVA---GFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRR
        S +C  +  +YGDGS          S  T        V +   GC H   G  VG A   G GRG LS PSQ      +   +FSYCLV  + +    + 
Subjt:  SFSCPPFYYAYGDGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVA---GFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRR

Query:  PSPLILGRYHTGETEFIYTSLLENPKHPYFYSVGLAGISVGNVRIP-APEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRI
        PS ++ G     +T  ++T LL NPK   FY + L GISVG  R+P   E   K+D +G+GGV++DSGT+ T L    Y ++   F  R G  A + +R 
Subjt:  PSPLILGRYHTGETEFIYTSLLENPKHPYFYSVGLAGISVGNVRIP-APEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRI

Query:  EENTGLSPCYYYE--NSVGVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKN
           +    C+      +V VP VV HF G    V LP  NY          ++ V  +   G       G    L     + +GN QQQGF V YDL  +
Subjt:  EENTGLSPCYYYE--NSVGVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKN

Query:  RVGFARRQC
        RVGF  R C
Subjt:  RVGFARRQC

AT4G16563.1 Eukaryotic aspartyl protease family protein6.0e-16862.71Show/hide
Query:  LLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHRHRH----NHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSP
        LL LSHSLS+S    +  H LLKS+++RSSARF RH H      LSLP+S G DY +S ++GS S  +SLY+DTGSDLVWFPC PF CILCE KP   SP
Subjt:  LLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHRHRH----NHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSP

Query:  LPKISNN-KSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISEC--SSFSCPPFYYAYGDGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTL
           +S++  +VSCS+ +CSAAH  SL +S LCAIS CPL+ IE  +C  SS+ CPPFYYAYGDGSLVA+LY DSLSLP+      ++V NFTFGCAHTTL
Subjt:  LPKISNN-KSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISEC--SSFSCPPFYYAYGDGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTL

Query:  GEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRY------HTGET--------------EFIYTSLLENPKHPYFYSV
         EP+GVAGFGRG LS+P+QLA  SP LGN FSYCLVSHSF +DRVRRPSPLILGR+        G T              EF++T +LENPKHPYFYSV
Subjt:  GEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRY------HTGET--------------EFIYTSLLENPKHPYFYSV

Query:  GLAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVVL
         L GIS+G   IPAP  LR++D++G GGVVVDSGTTFTMLP+  Y SVV EF++R G+V  RA R+E ++G+SPCYY   +V VP +VLHF G +SSV L
Subjt:  GLAGISVGNVRIPAPEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVVL

Query:  PRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNL
        PR+NYFYEF+DGGDG  E   KRK+GCLMLMNGGDE+EL GG GA LGNYQQQGFEVVYDL   RVGFA+R+C++LWD+L
Subjt:  PRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNL

AT5G45120.1 Eukaryotic aspartyl protease family protein2.5e-5232.54Show/hide
Query:  VFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARF---HRHRHNHLSLPLSP-----------GGDYTLSFNLGSESHKISL
        +F+FLL  LL +    +Q       H   SS     S+ + L  T T+SS             +  PLS               Y ++ N+G+    + +
Subjt:  VFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARF---HRHRHNHLSLPLSP-----------GGDYTLSFNLGSESHKISL

Query:  YMDTGSDLVWFPCS--PFECILCEG-------KPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDG
        Y+DTGSDL W PC    F+CI C          P + SPL   ++ +  SC+++ C   H  S +    CA++ C +  +  S C    CP F Y YG+G
Subjt:  YMDTGSDLVWFPCS--PFECILCEG-------KPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDG

Query:  SLVAR-LYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTG---E
         L++  L RD L   T       +V  F+FGC  +T  EP+G+AGFGRG+LS+PSQL      L   FS+C +   F  +     SPLILG         
Subjt:  SLVAR-LYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTG---E

Query:  TEFIYTSLLENPKHPYFYSVGLAGISVGNVRIP--APEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCY--
            +T +L  P +P  Y +GL  I++G    P   P  LR+ D  G+GG++VDSGTT+T LP   Y  ++   ++       RA   E  TG   CY  
Subjt:  TEFIYTSLLENPKHPYFYSVGLAGISVGNVRIP--APEFLRKVDESGSGGVVVDSGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCY--

Query:  --------YYENSVGV--PRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRV
                  EN V +  P +  HF+   ++++LP+ N FY      DG V       V CL+  N  D      GP    G++QQQ  +VVYDLEK R+
Subjt:  --------YYENSVGV--PRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRV

Query:  GFARRQC
        GF    C
Subjt:  GFARRQC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGTTTCCCCTGTTTTCATCTTCCTCCTCTGTTTTCTCCTCTCCTCCCCTGTTTTCTCCTCACAAATTTTCCTTCTACCTCTCTCCCATTCCTTATCATCCTCAAT
CTCCGATTTCAACAGCACCCACAATCTTCTCAAATCCACCGCCACCCGCTCCTCCGCCCGATTCCACCGCCACCGCCATAACCACCTCTCTCTGCCCCTATCCCCCGGCG
GCGATTACACTCTCTCCTTCAACCTCGGCTCTGAGTCTCACAAAATTTCCCTCTATATGGACACCGGCAGCGACCTCGTTTGGTTCCCCTGTTCCCCGTTTGAGTGTATT
CTTTGTGAAGGTAAACCAAAAATTCAATCCCCTTTGCCCAAAATCTCAAATAACAAATCAGTTTCCTGCAGCGCCGCCGCCTGCTCCGCCGCTCATGGTGGCTCCCTCTC
CGCTTCCCACCTCTGTGCAATTTCTCGATGTCCACTTGAATCCATTGAAATTTCTGAGTGCTCCTCTTTTTCCTGTCCTCCGTTTTATTACGCTTACGGCGATGGGAGTT
TAGTTGCTCGGCTTTATAGAGATAGCCTCAGTTTGCCAACGCCAGCGCCATCTCCGCCGATTAATGTTCGGAATTTTACTTTTGGATGTGCCCACACGACGCTTGGCGAA
CCGGTTGGGGTTGCCGGATTCGGCCGTGGGGTGTTGTCGATGCCCAGTCAACTCGCGACTTTCTCACCTCAACTCGGGAACCGGTTTTCTTATTGTTTGGTTTCTCACTC
GTTTGCGGCGGACCGAGTTCGCCGCCCGAGTCCACTGATTCTCGGGCGGTACCACACCGGGGAGACGGAGTTCATTTACACTTCCTTGCTTGAGAATCCAAAACACCCTT
ATTTTTACTCAGTTGGATTGGCCGGAATATCAGTTGGGAATGTGAGGATTCCAGCGCCGGAGTTCTTGAGAAAAGTTGATGAGAGTGGTAGTGGCGGCGTTGTGGTGGAT
TCCGGCACTACTTTTACTATGCTCCCGTCAGGTTTGTATGAATCGGTGGTGGCCGAGTTCGAAAACCGTACCGGAAAGGTTGCAAACCGGGCGAGACGGATCGAAGAAAA
CACCGGGTTGAGCCCTTGCTATTACTACGAGAACTCAGTTGGCGTGCCACGTGTCGTGCTACATTTTGTTGGGGAGAAATCCAGTGTGGTGCTTCCTAGGAAGAACTATT
TCTATGAGTTTTTGGACGGTGGAGATGGGGTGGTGGAGGTGGGGAGGAAGAGAAAAGTTGGGTGTTTGATGCTAATGAACGGTGGAGATGAGGCTGAGCTGGCAGGTGGG
CCTGGTGCCACGCTTGGGAACTACCAACAACAAGGTTTTGAGGTGGTTTATGATTTGGAAAAGAACCGGGTTGGATTCGCTCGGCGGCAATGCTCCACTCTTTGGGACAA
TTTGAACCGGAGTTAG
mRNA sequenceShow/hide mRNA sequence
AAAGAAGGGTGTAGTTGTAATGGCCACCACTTCTCAAACCCATACAAAAACCCCATCTTCCCCTAACCAGAACCTTCTTCTTCTTCTTCTATTCTCAATCCCCATCTCTT
TAAATTCTCCAATTTCTCTCTCTAATTTCATTTCAAAAACCCCCCAATTTCTCTCTCTAAAATACTCCATTAATGGCGGTTTCCCCTGTTTTCATCTTCCTCCTCTGTTT
TCTCCTCTCCTCCCCTGTTTTCTCCTCACAAATTTTCCTTCTACCTCTCTCCCATTCCTTATCATCCTCAATCTCCGATTTCAACAGCACCCACAATCTTCTCAAATCCA
CCGCCACCCGCTCCTCCGCCCGATTCCACCGCCACCGCCATAACCACCTCTCTCTGCCCCTATCCCCCGGCGGCGATTACACTCTCTCCTTCAACCTCGGCTCTGAGTCT
CACAAAATTTCCCTCTATATGGACACCGGCAGCGACCTCGTTTGGTTCCCCTGTTCCCCGTTTGAGTGTATTCTTTGTGAAGGTAAACCAAAAATTCAATCCCCTTTGCC
CAAAATCTCAAATAACAAATCAGTTTCCTGCAGCGCCGCCGCCTGCTCCGCCGCTCATGGTGGCTCCCTCTCCGCTTCCCACCTCTGTGCAATTTCTCGATGTCCACTTG
AATCCATTGAAATTTCTGAGTGCTCCTCTTTTTCCTGTCCTCCGTTTTATTACGCTTACGGCGATGGGAGTTTAGTTGCTCGGCTTTATAGAGATAGCCTCAGTTTGCCA
ACGCCAGCGCCATCTCCGCCGATTAATGTTCGGAATTTTACTTTTGGATGTGCCCACACGACGCTTGGCGAACCGGTTGGGGTTGCCGGATTCGGCCGTGGGGTGTTGTC
GATGCCCAGTCAACTCGCGACTTTCTCACCTCAACTCGGGAACCGGTTTTCTTATTGTTTGGTTTCTCACTCGTTTGCGGCGGACCGAGTTCGCCGCCCGAGTCCACTGA
TTCTCGGGCGGTACCACACCGGGGAGACGGAGTTCATTTACACTTCCTTGCTTGAGAATCCAAAACACCCTTATTTTTACTCAGTTGGATTGGCCGGAATATCAGTTGGG
AATGTGAGGATTCCAGCGCCGGAGTTCTTGAGAAAAGTTGATGAGAGTGGTAGTGGCGGCGTTGTGGTGGATTCCGGCACTACTTTTACTATGCTCCCGTCAGGTTTGTA
TGAATCGGTGGTGGCCGAGTTCGAAAACCGTACCGGAAAGGTTGCAAACCGGGCGAGACGGATCGAAGAAAACACCGGGTTGAGCCCTTGCTATTACTACGAGAACTCAG
TTGGCGTGCCACGTGTCGTGCTACATTTTGTTGGGGAGAAATCCAGTGTGGTGCTTCCTAGGAAGAACTATTTCTATGAGTTTTTGGACGGTGGAGATGGGGTGGTGGAG
GTGGGGAGGAAGAGAAAAGTTGGGTGTTTGATGCTAATGAACGGTGGAGATGAGGCTGAGCTGGCAGGTGGGCCTGGTGCCACGCTTGGGAACTACCAACAACAAGGTTT
TGAGGTGGTTTATGATTTGGAAAAGAACCGGGTTGGATTCGCTCGGCGGCAATGCTCCACTCTTTGGGACAATTTGAACCGGAGTTAGTGATGAAGAGTGAACCCGGTTG
AGAAAGTGTTCTTTGACTTGTGACTATTGTCAACGGTCAACGCTATGAGGGTAAATAAGGAAATTTCAGGTTTGAAGGTTTTATTTTTTGTTGTAAATTCCTTGGGCACT
TCACTTCTTGCTTTAATTATTATTTTTAGTTGAAATTTGTATATTAAAAGTGTTCAAAAAATTTGTTGCAAAGAAGAAAAAAAAAATGTATAGCTCAATTTTATTGCATG
AAGCAAGGAATGAAGTGGTTGCTTTATCAAGTTAAGAATACAATAGATGTACTAAAGTAAGGATGTTAAAAGAGAAAATCATAAATCAAAATTATTTGTAAAACTTGAAA
ATGGAAATAAAAAGTCCATTGAAATACAGAATAGTCCAAAAACATCCTTCTCTATAATGAAAATTTCAAATTCTTCTCATCATCCTAAATTCTAAAAATATTTATAAATA
ATA
Protein sequenceShow/hide protein sequence
MAVSPVFIFLLCFLLSSPVFSSQIFLLPLSHSLSSSISDFNSTHNLLKSTATRSSARFHRHRHNHLSLPLSPGGDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECI
LCEGKPKIQSPLPKISNNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGE
PVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYHTGETEFIYTSLLENPKHPYFYSVGLAGISVGNVRIPAPEFLRKVDESGSGGVVVD
SGTTFTMLPSGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSSVVLPRKNYFYEFLDGGDGVVEVGRKRKVGCLMLMNGGDEAELAGG
PGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTLWDNLNRS